llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	d8d193d5e2	GlobalISel: Partially implement widenScalar for MERGE_VALUES llvm-svn: 352560	2019-01-29 23:17:35 +00:00
Alina Sbirlea	f9027e554a	Check bool attribute value in getOptionalBoolLoopAttribute. Summary: Check the bool value of the attribute in getOptionalBoolLoopAttribute not just its existence. Eliminates the warning noise generated when vectorization is explicitly disabled. Reviewers: Meinersbur, hfinkel, dmgreen Subscribers: jlebar, sanjoy, llvm-commits Differential Revision: https://reviews.llvm.org/D57260 llvm-svn: 352555	2019-01-29 22:33:20 +00:00
Sam Clegg	2a193e0d12	[WebAssembly] Ensure BasicSymbolRef.getRawDataRefImpl().p is non-null Store a non-zero value to ref.d.a and use ref.d.b to store the symbol index. This means that ref.p is never null, which was confusing llvm-nm. Fixes PR40497 Differential Revision: https://reviews.llvm.org/D57373 llvm-svn: 352551	2019-01-29 22:22:32 +00:00
Shoaib Meenai	ed2ebf82e7	[docs] Prevent O0 optnone for opt input If we just compile with -O0, clang will add optnone attributes everywhere, so opt won't actually be able to perform any passes. Instruct clang to not emit the optnone so opt can do its thing. Differential Revision: https://reviews.llvm.org/D56950 llvm-svn: 352550	2019-01-29 22:17:51 +00:00
Amara Emerson	102c9ed768	[AArch64][GlobalISel] Unmerge into scalars from a vector should use FPR bank. This currently shows up as a selection fallback since the dest regs were given GPR banks but the source was a vector FPR reg. Differential Revision: https://reviews.llvm.org/D57408 llvm-svn: 352545	2019-01-29 21:19:33 +00:00
Paul Robinson	4ca29477d9	[DWARF] Emit reasonable debug info for empty .s files. llvm-svn: 352541	2019-01-29 20:53:51 +00:00
Sanjay Patel	18db56209c	[InstCombine] canonicalize cmp/select form of uadd saturate with constant I'm circling back around to a loose end from D51929. The backend (either CGP or DAG) doesn't recognize this pattern, so we end up with different asm for these IR variants. Regardless of any future changes to canonicalize to saturation/overflow intrinsics, we want to get raw IR variations into the minimal number of raw IR forms. If/when we can canonicalize to intrinsics, that will make that step easier. Pre: C2 == ~C1 %a = add i32 %x, C1 %c = icmp ugt i32 %x, C2 %r = select i1 %c, i32 -1, i32 %a => %a = add i32 %x, C1 %c2 = icmp ult i32 %x, C2 %r = select i1 %c2, i32 %a, i32 -1 https://rise4fun.com/Alive/pkH Differential Revision: https://reviews.llvm.org/D57352 llvm-svn: 352536	2019-01-29 20:02:45 +00:00
Sanjay Patel	a61d586f74	[DAGCombiner] fold extract_subvector of extract_subvector This is the sibling fold for insert-of-insert that was added with D56604. Now that we have x86 shuffle narrowing (D57156), this change shows improvements for lots of AVX512 reduction code (not sure that we would ever expect extract-of-extract otherwise). There's a small regression in some of the partial-permute tests (extracting followed by splat). That is tracked by PR40500: https://bugs.llvm.org/show_bug.cgi?id=40500 Differential Revision: https://reviews.llvm.org/D57336 llvm-svn: 352528	2019-01-29 19:13:39 +00:00
Michael J. Spencer	2a5a0ad1e4	[VFS] Fix warning and use better check. llvm-svn: 352527	2019-01-29 19:07:15 +00:00
Matt Arsenault	18619afe1d	GlobalISel: Fix narrowScalar for load/store with different mem size This was ignoring the memory size, and producing multiple loads/stores if the operand size was different from the memory size. I assume this is the intent of not having an explicit G_ANYEXTLOAD (although I think that would probably be better). llvm-svn: 352523	2019-01-29 18:13:02 +00:00
Sanjay Patel	cd6b240303	[x86] add tests for vector bool math; NFC llvm-svn: 352520	2019-01-29 17:00:47 +00:00
Sanjay Patel	22dd34b0ec	[AArch64] add tests for vector bool math; NFC llvm-svn: 352519	2019-01-29 17:00:07 +00:00
Andrea Di Biagio	815cdbff29	[X86][Btver2] Improved latency/throughput model for scalar int-to-float conversions. Account for bypass delays when computing the latency of scalar int-to-float conversions. On Jaguar we need to account for an extra 6cy latency (see AMD fam16h SOG). This patch also fixes the number of micro-opcodes for the register-memory variants of scalar int-to-float conversions. Differential Revision: https://reviews.llvm.org/D57148 llvm-svn: 352518	2019-01-29 16:47:27 +00:00
Sanjay Patel	2e87df9112	[InstCombine] regenerate test checks; NFC llvm-svn: 352517	2019-01-29 16:44:05 +00:00
Sanjay Patel	f044d1884d	[InstCombine] add tests for ext-of-bool + add/sub; NFC We should choose one of these as canonical: %z = zext i1 %cmp to i32 %r = sub i32 %x, %z => %s = sext i1 %cmp to i32 %r = add i32 %x, %s The test comments assume that the zext form is better, but we can adjust that if we decide to go the other way. llvm-svn: 352515	2019-01-29 16:39:23 +00:00
James Y Knight	5d71fc5d7b	Adjust documentation for git migration. This fixes most references to the paths: llvm.org/svn/ llvm.org/git/ llvm.org/viewvc/ github.com/llvm-mirror/ github.com/llvm-project/ reviews.llvm.org/diffusion/ to instead point to https://github.com/llvm/llvm-project. This is not a trivial substitution, because additionally, all the checkout instructions had to be migrated to instruct users on how to use the monorepo layout, setting LLVM_ENABLE_PROJECTS instead of checking out various projects into various subdirectories. I've attempted to not change any scripts here, only documentation. The scripts will have to be addressed separately. Additionally, I've deleted one document which appeared to be outdated and unneeded: lldb/docs/building-with-debug-llvm.txt Differential Revision: https://reviews.llvm.org/D57330 llvm-svn: 352514	2019-01-29 16:37:27 +00:00
Nirav Dave	1527c0e727	[SelectionDAGBuilder] Remove redundant variable. NFCI. llvm-svn: 352506	2019-01-29 15:14:07 +00:00
Jordan Rupprecht	c892741e74	[llvm-objcopy] Implement --set-section-flags. Summary: --set-section-flags is used to change the section flags (e.g. SHF_ALLOC) for given sections. The flags allowed are the same from the existing --rename-section=.old=.new[,flags] feature. Additionally, make sure that --set-section-flag cannot be used with --rename-section (either the source or destination), since --rename-section accepts flags. This avoids ambiguity for something like "--rename-section=.foo=.bar,alloc --set-section-flag=.bar,code". Reviewers: jhenderson, jakehehrlich, alexshap, espindola Reviewed By: jhenderson, jakehehrlich Subscribers: llvm-commits, emaste, arichardson Differential Revision: https://reviews.llvm.org/D57198 llvm-svn: 352505	2019-01-29 15:05:38 +00:00
Ayonam Ray	a1f6973ade	Reversing the checkin for version 352484 as tests are failing. llvm-svn: 352504	2019-01-29 15:00:50 +00:00
Nico Weber	a963821052	gn build: Merge r352444, r352431, r352430 llvm-svn: 352502	2019-01-29 14:39:54 +00:00
Neil Henning	0799352026	[AMDGPU] Fix a weird WWM intrinsic issue. I found a really strange WWM issue through a very convoluted shader that essentially boils down to a bug in SIInstrInfo where canReadVGPR did not correctly identify that WWM is like a copy and can have a VGPR as its source. Differential Revision: https://reviews.llvm.org/D56002 llvm-svn: 352500	2019-01-29 14:28:17 +00:00
Hans Wennborg	81675c8f3b	Revert r351833 and r352250. They were breaking the Windows build when using MSBuild, see the discussion on D56781. r351833: "Use response file when generating LLVM-C.dll" > Use response file when generating LLVM-C.dll > > As discovered in D56774 the command line gets too long, so use a response file to give the script the libs. This change has been tested and is confirmed working for me. > > Committed on behalf of Jakob Bornecrantz > > Differential Revision: https://reviews.llvm.org/D56781 r352250: "Build LLVM-C.dll by default on windows and enable in release package" > Build LLVM-C.dll by default on windows and enable in release package > > With the fixes to the building of LLVM-C.dll in D56781 this should now > be safe to land. This will greatly simplify dealing with LLVM for people > that just want to use the C API on windows. This is a follow up from > D35077. > > Patch by Jakob Bornecrantz! > > Differential revision: https://reviews.llvm.org/D56774 llvm-svn: 352492	2019-01-29 13:43:22 +00:00
Ayonam Ray	4272af9b3e	[CodeGen] Omit range checks from jump tables when lowering switches with unreachable default During the lowering of a switch that would result in the generation of a jump table, a range check is performed before indexing into the jump table, for the switch value being outside the jump table range and a conditional branch is inserted to jump to the default block. In case the default block is unreachable, this conditional jump can be omitted. This patch implements omitting this conditional branch for unreachable defaults. Review ID: D52002 Reviewers: Hans Wennborg, Eli Freidman, Roman Lebedev llvm-svn: 352484	2019-01-29 12:01:32 +00:00
Simon Pilgrim	4293ad8ab2	[X86] Add PR40483 test case llvm-svn: 352480	2019-01-29 10:58:42 +00:00
Dan Gohman	4684f824d4	[WebAssembly] Re-enable main-function signature rewriting Re-enable the code to rewrite main-function signatures into "int main(int argc, char *argv[])", but limited to only handling the case of "int main(void)", so that it doesn't silently strip an argument in the "int main(int argc, char *argv[], char *envp[])" case. This allows main to be called by C startup code, since WebAssembly requires caller and callee signatures to match, so it can't rely on passing main a different number of arguments than it expects. Differential Revision: https://reviews.llvm.org/D57323 llvm-svn: 352479	2019-01-29 10:53:42 +00:00
James Henderson	6f39f6ace7	[llvm-symbolizer][doc] Tweak wording of --adjust-vma switch description The address isn't dynamically relocated. The object is. llvm-svn: 352477	2019-01-29 10:43:48 +00:00
Simon Pilgrim	06a342b2d6	[X86] Fix linux32 pic tests to use correct relocation model (PR39684) Differential Revision: https://reviews.llvm.org/D57301 llvm-svn: 352476	2019-01-29 10:41:48 +00:00
David Green	54b0115547	[ARM] Use sub for negative offset load/store in thumb1 This attempts to optimise negative values used in load/store operands a little. We currently try to select them as rr, materialising the negative constant using a MOV/MVN pair. This instead selects ri with an immediate of 0, forcing the add node to become a simpler sub. Differential Revision: https://reviews.llvm.org/D57121 llvm-svn: 352475	2019-01-29 10:40:31 +00:00
Simon Pilgrim	0b7fce6d72	[X86] Regenerate abi-isel.ll test Adds note requested in D57301 and fixes some missing GOTPCREL addressmath checks llvm-svn: 352474	2019-01-29 10:39:02 +00:00
David Green	5c33c5da1a	[ARM] Add extra testcases for D57121. NFC llvm-svn: 352472	2019-01-29 10:25:56 +00:00
Jeremy Morse	ba467024f4	Remove 'XFAIL: powerpc64' from a debuginfo test This test started XPASSing with r352467, and the change in behaviour performed by that patch does appear to fix the cause of the original XFAIL (missing FrameIndex DBG_VALUE), which I've replicated locally with -mtriple=powerpc64--. I'll write this up in PR21881 which documents the XFAIL, and seek confirmation I haven't overlooked something here. llvm-svn: 352471	2019-01-29 10:23:43 +00:00
Bjorn Pettersson	d014d576a9	[IPCP] Don't crash due to arg count/type mismatch between caller/callee Summary: This patch avoids an assert in IPConstantPropagation when there is an argument count/type mismatch between the caller and the callee. While this is actually UB on C-level (clang emits a warning), the IR verifier seems to accept it. I'm not sure what other frontends/languages might think about this, so simply bailing out to avoid hitting an assert (in CallSiteBase<>::getArgOperand or Value::doRAUW) seems like a simple solution. The problem is exposed by the fact that AbstractCallSites will look through a bitcast at the callee position of a call/invoke. Reviewers: jdoerfert, reames, efriedma Reviewed By: jdoerfert, efriedma Subscribers: eli.friedman, efriedma, llvm-commits Differential Revision: https://reviews.llvm.org/D57052 llvm-svn: 352469	2019-01-29 10:19:44 +00:00
Jeremy Morse	66ac86b58d	[DebugInfo][DAG] Process FrameIndex dbg.values unconditionally A FrameIndex should be valid throughout a block regardless of what instructions get selected in that block -- therefore we shouldn't harness dbg.values that refer to FrameIndexes to an SDNode. There are numerous codegen reasons why an SDNode never appears or doesn't become a location that a DBG_VALUE can refer to. None of them actually affect the variable location. Therefore, before any other tests to encode dbg_values in a SelectionDAG, identify FrameIndex operands and encode them unattached to any SDNode. Differential Revision: https://reviews.llvm.org/D57328 llvm-svn: 352467	2019-01-29 09:40:05 +00:00
Max Kazantsev	23e642248d	[NFC] Use ArrayRef instead of SmallVectorImpl where possible llvm-svn: 352466	2019-01-29 09:39:15 +00:00
Martin Storsjo	f5884d255e	[COFF, ARM64] Don't put jump table into a separate COFF section for EK_LabelDifference32 Windows ARM64 has PIC relocation model and uses jump table kind EK_LabelDifference32. This produces jump table entry as ".word LBB123 - LJTI1_2" which represents the distance between the block and jump table. A new relocation type (IMAGE_REL_ARM64_REL32) is needed to do the fixup correctly if they are in different COFF section. This change saves the jump table to the same COFF section as the associated code. An ideal fix could be utilizing IMAGE_REL_ARM64_REL32 relocation type. Patch by Tom Tan! Differential Revision: https://reviews.llvm.org/D57277 llvm-svn: 352465	2019-01-29 09:36:48 +00:00
Jonas Paulsson	5ed4d4638f	[CodeGenPrepare] Handle all debug calls in dupRetToEnableTailCallOpts() This patch makes sure that a debug value that is after the bitcast in dupRetToEnableTailCallOpts() is also skipped. The reduced test case is from SPEC-2006 on SystemZ. Review: Vedant Kumar, Wolfgang Pieb https://reviews.llvm.org/D57050 llvm-svn: 352462	2019-01-29 09:03:35 +00:00
Jeremy Morse	27631cc670	Fix an incorrectly configured test. This should have had a target triple in it, my mistake. llvm-svn: 352460	2019-01-29 08:41:44 +00:00
Mikael Holmen	b792627ce9	Fix compiler warning when using clang 3.6.0 Without the fix we get the following (with -Werror): ../lib/Target/X86/X86ISelLowering.cpp:14181:58: error: suggest braces around initialization of subobject [-Werror,-Wmissing-braces] SmallVector<std::array<int, 2>, 2> LaneSrcs(NumLanes, {-1, -1}); ^~~~~~ { } 1 error generated. llvm-svn: 352455	2019-01-29 06:51:28 +00:00
Philip Reames	3cfd351efc	Correct contents for r352453 I had a local change I hadn't realized when submitting that auto-update. As such, the auto-update was wrong. This should fix it, and with that, it's clearly time to stop submitting changes and go to bed. llvm-svn: 352454	2019-01-29 06:40:02 +00:00
Philip Reames	2ddf96db50	[Tests] Regen to remove future test diffs This file appears to have been manually edited at some point after being auto-updated. A future change adjusts this file slightly, and all of the updates make the diff super confusing. llvm-svn: 352453	2019-01-29 06:34:46 +00:00
Philip Reames	3846b9b443	[Test] Add tests for gather/masked.load demanded elements, and convert the whole file to auto generated checks. llvm-svn: 352452	2019-01-29 05:58:32 +00:00
Max Kazantsev	468ad52213	[SCEV] Take correct loop in AddRec simplification. PR40420 The code of AddRec simplification is using the wrong loop when it creates a new AddRecExpr. It should be using AddRecLoop which we have saved and against which all gate checks are made, and not calling AddRec->getLoop() over and over again because AddRec may change and become an AddRecurrency from outer loop during the transform iterations. Considering this change trivial, committing for postcommit review. llvm-svn: 352451	2019-01-29 05:37:59 +00:00
Max Kazantsev	d4de606ddb	[NFC] Merge failing test from PR40420 llvm-svn: 352450	2019-01-29 05:12:40 +00:00
Teresa Johnson	87cc05055a	Try to make new test more resilient to different orderings New test added in r352441 getting a bot failure which I believe is due to different ordering in the dumping which isn't being handled well. Try to make test more resilient to ordering differences. llvm-svn: 352446	2019-01-29 02:04:01 +00:00
Sam Clegg	b54927cc48	[WebAssembly] Handle more types of uses in WebAssemblyAddMissingPrototypes Previously we were only handling bitcast operations, however prototypeless functions can also appear in other places such as comparisons and as function params. Switch to using replaceAllUsesWith() to replace the prototype-less function uses. This new approach results in some redundant bitcasting but is much simpler and handles all cases. Differential Revision: https://reviews.llvm.org/D56938 llvm-svn: 352445	2019-01-29 00:30:46 +00:00
Reid Kleckner	85e72c3d56	[PPC] Include tablegenerated PPCGenCallingConv.inc once Move the CC analysis implementation to its own .cpp file instead of duplicating it and artificially using functions in PPCISelLowering.cpp and PPCFastISel.cpp. Follow-up to the same change done for X86, ARM, and AArch64. llvm-svn: 352444	2019-01-29 00:30:35 +00:00
Thomas Lively	33f87b8aef	[WebAssembly] Expand BUILD_PAIR nodes Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish Differential Revision: https://reviews.llvm.org/D57276 llvm-svn: 352442	2019-01-28 23:44:31 +00:00
Teresa Johnson	2f616e479b	[ThinLTO] Add option to dump per-module summary dot graph Summary: I found that there currently isn't a way to invoke exportToDot from the command line for a per-module summary index, and therefore no testing of that case. Add an internal option and use it to test dumping of per module summary indexes. In particular, I am looking at fixing the limitation that causes the aliasee GUID in the per-module summary to be 0, and want to be able to test that change. Reviewers: evgeny777 Subscribers: mehdi_amini, inglorion, eraman, steven_wu, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D57206 llvm-svn: 352441	2019-01-28 23:43:26 +00:00
Philip Reames	6c5341bc5a	Demanded elements support for vector GEPs GEPs can produce either scalar or vector results. If we're extracting only a subset of the vector lanes, simplifying the operands is helpful in eliminating redundant computation, and (eventually) allowing further optimizations Differential Revision: https://reviews.llvm.org/D57177 llvm-svn: 352440	2019-01-28 23:24:49 +00:00
Eli Friedman	f0e676819f	[docs] Fix a couple spelling errors. llvm-svn: 352439	2019-01-28 23:03:41 +00:00
Teresa Johnson	5b2f6a1bc2	[ThinLTO] Refine reachability check to fix compile time increase Summary: A recent fix to the ThinLTO whole program dead code elimination (D56117) increased the thin link time on a large MSAN'ed binary by 2x. It's likely that the time increased elsewhere, but was more noticeable here since it was already large and ended up timing out. That change made it so we would repeatedly scan all copies of linkonce symbols for liveness every time they were encountered during the graph traversal. This was needed since we only mark one copy of an aliasee as live when we encounter a live alias. This patch fixes the issue in a more efficient manner by simply proactively visiting the aliasee (thus marking all copies live) when we encounter a live alias. Two notes: One, this requires a hash table lookup (finding the aliasee summary in the index based on aliasee GUID). However, the impact of this seems to be small compared to the original pre-D56117 thin link time. It could be addressed if we keep the aliasee ValueInfo in the alias summary instead of the aliasee GUID, which I am exploring in a separate patch. Second, we only populate the aliasee GUID field when reading summaries from bitcode (whether we are reading individual summaries and merging on the fly to form the compiled index, or reading in a serialized combined index). Thankfully, that's currently the only way we can get to this code as we don't yet support reading summaries from LLVM assembly directly into a tool that performs the thin link (they must be converted to bitcode first). I added a FIXME, however I have the fix under test already. The easiest fix is to simply populate this field always, which isn't hard, but more likely the change I am exploring to store the ValueInfo instead as described above will subsume this. I don't want to hold up the regression fix for this though. Reviewers: trentxintong Subscribers: mehdi_amini, inglorion, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D57203 llvm-svn: 352438	2019-01-28 22:27:05 +00:00
Sanjay Patel	a36a293a56	[CGP] auto-generate complete checks for add overflow tests; NFC llvm-svn: 352437	2019-01-28 22:07:37 +00:00
Craig Topper	390ac61b93	Recommit r352255 "[SelectionDAG][X86] Don't use SEXTLOAD for promoting masked loads in the type legalizer" This did not cause the buildbot failure it was previously reverted for. Original commit message: I'm not sure why we were using SEXTLOAD. EXTLOAD seems more appropriate since we don't care about the upper bits. This patch changes this and then modifies the X86 post legalization combine to emit an extending shuffle instead of a sign_extend_vector_inreg. Could maybe use an any_extend_vector_inre On AVX512 targets I think we might be able to use a masked vpmovzx and not have to expand this at all. llvm-svn: 352433	2019-01-28 21:38:47 +00:00
Yonghong Song	61bc1d7ed5	[RuntimeDyld] load all sections with ProcessAllSections This patch tried to address the following use case. . bcc (https://github.com/iovisor/bcc) utilizes llvm JIT to compile for BTF target. . with -g, .BTF and .BTF.ext sections (BPF debug info) will be generated by LLVM. . .BTF does not have relocations and .BTF.ext has some relocations. . With ProcessAllSections, .BTF.ext is loaded by JIT dynamic linker and is available to application. But .BTF is not loaded. The bcc application needs both .BTF.ext and .BTF for debugging purpose, and .BTF is not loaded. This patch addressed this issue by iterating over all sections and loading any missing sections, after symbol/relocation processing in loadObjectImpl(). Signed-off-by: Yonghong Song <yhs@fb.com> Differential Revision: https://reviews.llvm.org/D55943 llvm-svn: 352432	2019-01-28 21:35:23 +00:00
Reid Kleckner	27fd307b83	[ARM] Deduplicate table generated CC analysis code Create ARMCallingConv.cpp and emit code for calling convention analysis from there. llvm-svn: 352431	2019-01-28 21:28:43 +00:00
Reid Kleckner	96c581d7d0	[AArch64] Include AArch64GenCallingConv.inc once Summary: Avoids duplicating generated static helpers for calling convention analysis. This also means you can modify AArch64CallingConv.td without recompiling the AArch64ISelLowering.cpp monolith, so it provides faster incremental rebuilds. Saves 12K in llc.exe, but adds a new object file, which is large. Reviewers: efriedma, t.p.northover Subscribers: mgorny, javed.absar, kristof.beyls, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D56948 llvm-svn: 352430	2019-01-28 21:28:40 +00:00
Jessica Paquette	2d73ecd0a3	[GlobalISel][AArch64] Add legalization for G_FLOG This adds support for legalizing G_FLOG into a RTLib call. It adds a legalizer test, and updates the existing floating point tests. https://reviews.llvm.org/D57347 llvm-svn: 352429	2019-01-28 21:27:23 +00:00
Sanjay Patel	8965411619	[InstCombine] add another saturating uadd test (no undefs); NFC I forgot that our undef matching hasn't been completed in the previous commit. llvm-svn: 352424	2019-01-28 20:37:18 +00:00
Sanjay Patel	dc543300a9	[InstCombine] add tests for saturating uadd with constant; NFC llvm-svn: 352423	2019-01-28 20:32:48 +00:00
Matt Arsenault	cdd191d9db	AMDGPU: Add DS append/consume intrinsics Since these pass the pointer in m0 unlike other DS instructions, these need to worry about whether the address is uniform or not. This assumes the address is dynamically uniform, and just uses readfirstlane to get a copy into an SGPR. I don't know if these have the same 16-bit add for the addressing mode offset problem on SI or not, but I've just assumed they do. Also includes some misc. changes to avoid test differences between the LDS and GDS versions. llvm-svn: 352422	2019-01-28 20:14:49 +00:00
Nico Weber	285becfa4c	gn build: Add get.py script to download prebuilt gn, make gn.py run downloaded gn if gn is not on PATH Prebuilts are available for x86_64 Linux, macOS, Windows. The script always pulls the latest GN version. Differential Revision: https://reviews.llvm.org/D57256 llvm-svn: 352420	2019-01-28 19:54:41 +00:00
Nico Weber	3d4f49fa78	gn build: Make cmake sync script work on Windows if git is a bat file Differential Revision: https://reviews.llvm.org/D57338 llvm-svn: 352419	2019-01-28 19:53:52 +00:00
Jessica Paquette	c49428a97d	[GlobalISel][AArch64] Add instruction selection support for @llvm.log10 This adds instruction selection support for @llvm.log10 in AArch64. It teaches GISel to lower it to a library call, updates the relevant tests, and adds a legalizer test for log10. https://reviews.llvm.org/D57341 llvm-svn: 352418	2019-01-28 19:53:14 +00:00
Alina Sbirlea	8e1d65771a	[AliasSetTracker] Cleanup more comments. [NFCI] llvm-svn: 352416	2019-01-28 19:38:03 +00:00
Nico Weber	b4980cd84f	gn build: Fix `lld-link: unknown flag: -fuse-ld=lld` warnings on Windows Fixes a minor regression from r351248. While here, also make it possible to opt out of lld by saying use_lld=false when clang_base_path is set. (use_lld still defaults to true if clang_base_path is set.) llvm-svn: 352415	2019-01-28 19:32:52 +00:00
Scott Linder	b5d6292822	[MC] Do not consider .ifdef/.ifndef as a use This is allowed by GAS and seems correct. Differential Revision: https://reviews.llvm.org/D55439 llvm-svn: 352414	2019-01-28 19:32:08 +00:00
Francis Visoiu Mistrih	556ea7d2e0	[AArch64] Add 'apple-latest' CPU alias The 'apple-latest' alias is supposed to provide a CPU that contains the latest Apple processor model supported by LLVM. This is supposed to be used by tools like lldb to provide a target that supports most of the CPU features. For now, this is mapped to Cyclone. Differential Revision: https://reviews.llvm.org/D56384 llvm-svn: 352412	2019-01-28 19:27:33 +00:00
Petr Hosek	12f4b86808	Revert "[CMake] Use __libc_start_main rather than fopen when checking for C library" This reverts commit r352341: it broke the build on macOS which doesn't seem to provide __libc_start_main in its C library. llvm-svn: 352411	2019-01-28 19:26:41 +00:00
Jessica Paquette	2e35dc5185	[GlobalISel] Add ISel support for @llvm.lifetime.start and @llvm.lifetime.end This adds ISel support for lifetime markers in opt levels above O0. It also updates the arm64-irtranslator test, and updates some AArch64 tests that use them for added coverage. It also adds a testcase taken from the X86 codegen tests which verified a bug caused by lifetime markers + stack colouring in the past. This is intended to make sure that GISel doesn't re-introduce the bug. (This is basically a straight copy from what SelectionDAG does in SelectionDAGBuilder.cpp) https://reviews.llvm.org/D57187 llvm-svn: 352410	2019-01-28 19:22:29 +00:00
Nikita Popov	8e1a464e6a	[CodeGen][X86] Expand UADDSAT to NOT+UMIN+ADD Followup to D56636, this time handling the UADDSAT case by expanding uadd.sat(a, b) to umin(a, ~b) + b. Differential Revision: https://reviews.llvm.org/D56869 llvm-svn: 352409	2019-01-28 19:19:09 +00:00
Vedant Kumar	1c3694a4d4	[CodeExtractor] Add support for the `swifterror` attribute When passing a `swifterror` argument or alloca as an input to an extraction region, mark the input parameter `swifterror`. llvm-svn: 352408	2019-01-28 19:13:37 +00:00
Alina Sbirlea	d8c829bc22	[AliasSetTracker] Cleanup comments. [NFCI] llvm-svn: 352406	2019-01-28 19:01:32 +00:00
Jessica Paquette	7db82d7257	[GlobalISel][AArch64] Add instruction selection support for G_FCOS and G_FSIN This contains all of the legalizer changes from D57197 necessary to select G_FCOS and G_FSIN. It also updates several existing IR tests in test/CodeGen/AArch64 that verify that we correctly lower the G_FCOS and G_FSIN instructions. https://reviews.llvm.org/D57197 3/3 llvm-svn: 352402	2019-01-28 18:34:18 +00:00
Jessica Paquette	296f19b3d9	[GlobalISel][AArch64] Add IRTranslator support for G_FCOS and G_FSIN This adds IRTranslator support for the G_FCOS and G_FSIN generic instructions. https://reviews.llvm.org/D57197 2/3 llvm-svn: 352401	2019-01-28 18:34:17 +00:00
Jessica Paquette	9f6afad913	[GlobalISel] Add G_FSIN and G_FCOS generic instructions This introduces generic instructions for floating point sin and cos, G_FCOS and G_FSIN. It updates the tests, etc. https://reviews.llvm.org/D57197 1/3 llvm-svn: 352400	2019-01-28 18:34:16 +00:00
Alina Sbirlea	3d1d95ca55	[AliasSetTracker] Update signature to aliasesPointer [NFCI]. llvm-svn: 352399	2019-01-28 18:30:05 +00:00
Michael Berg	685d5f675e	[NFC] TLI query with default(on) behavior wrt DAG combines for fmin/fmax target control llvm-svn: 352396	2019-01-28 18:03:08 +00:00
Alina Sbirlea	932108703a	[SimpleLoopUnswitch] Early check exit for trivial unswitch with MemorySSA. Summary: If MemorySSA is available, we can skip checking all instructions if block has any Defs. (volatile loads are also Defs). We still need to check all instructions for "canThrow", even if no Defs are found. Reviewers: chandlerc Subscribers: sanjoy, jlebar, Prazek, george.burgess.iv, llvm-commits Differential Revision: https://reviews.llvm.org/D57129 llvm-svn: 352393	2019-01-28 17:48:45 +00:00
Simon Pilgrim	2c17512456	[X86][AVX] Remove lowerShuffleByMerging128BitLanes 2-lane restriction First step towards adding support for 64-bit unary "sublane" handling (a bit like lowerShuffleAsRepeatedMaskAndLanePermute). This allows us to add lowerV64I8Shuffle handling. llvm-svn: 352389	2019-01-28 17:02:35 +00:00
Simon Pilgrim	f4268176fa	[LangRef] Mention vector support for bitreverse/bswap intrinsics (PR38012) Differential Revision: https://reviews.llvm.org/D57309 llvm-svn: 352386	2019-01-28 16:56:38 +00:00
George Rimar	4463ebe4a7	[llvm-objdump] - Restore a piece of code removed by mistake in r352366. Seems when committed the r352366 ("[llvm-objdump] - Print LMAs when dumping section headers.") I resolved merge conflict incorrectly and removed this piece by mistake. Bots did not catch this yet, seems they are slow today, but the `X86/adjust-vma.test` test case fails locally for me without that. llvm-svn: 352383	2019-01-28 16:36:12 +00:00
Sanjay Patel	94cca60b82	[x86] allow more shuffle splitting to avoid vpermps (PR40434) This is tricky to make optimal: sometimes we're better off using a single wider op, but other times it makes more sense to combine narrow ops to achieve the same result. This solves the case from: https://bugs.llvm.org/show_bug.cgi?id=40434 There's potentially a similar change for vectors with 64-bit elements, but it needs adjustments similar to rL352333 to avoid creating infinite loops. llvm-svn: 352380	2019-01-28 15:51:34 +00:00
George Rimar	7d6fd6d73d	[llvm-objdump] - Update test after r352366. NFC. Change the column name. llvm-svn: 352379	2019-01-28 15:49:41 +00:00
Ranjeet Singh	0022ab4d80	VERSION_GREATER_EQUAL not supported in llvm cmake. Patch https://reviews.llvm.org/D56329 caused build failures for me when building on Windows because of the use of cmake operator 'VERSION_GREATER_EQUAL' which isn't supported in older versions of cmake. The llvm website states that minimum required version of cmake for building llvm is 3.4.3 https://llvm.org/docs/CMake.html Differential Revision: https://reviews.llvm.org/D57326 llvm-svn: 352378	2019-01-28 15:48:07 +00:00
Arnaud A. de Grandmaison	51eb87cadd	Remove no longer needed Arm specific LICENSE.TXT file. As the codebase is now under the Apache 2.0 license with LLVM Exceptions, and all Arm's contributions, past or future, are under that new license, this Arm specific LICENSE.TXT is no longer needed, thus removing it. llvm-svn: 352376	2019-01-28 15:38:01 +00:00
Michal Gorny	d4b194cf95	[cmake] Fix get_llvm_lit_path() to respect LLVM_EXTERNAL_LIT always Refactor the get_llvm_lit_path() logic to respect LLVM_EXTERNAL_LIT, and require the fallback to be defined explicitly as LLVM_DEFAULT_EXTERNAL_LIT. This fixes building libcxx standalone after r346888. The old logic was using LLVM_EXTERNAL_LIT both as user-defined cache variable and an optional pre-definition of default value from caller (e.g. libcxx). It included a hack to make this work by assigning the value back and forth but it was fragile and stopped working in libcxx. The new logic is simpler and more transparent. Default value is provided in a separate variable, and used only when user-specified variable is empty (i.e. not overridden). Differential Revision: https://reviews.llvm.org/D57282 llvm-svn: 352374	2019-01-28 15:16:03 +00:00
George Rimar	3168496822	[obj2yaml] - Dump the sh_entsize section field. I ran into the fact that obj2yaml does not dump the sh_entsize field. A problem arose when I tried to dump ELF versioning sections. This is close to what D50235 did, but D50235 made the change for yaml2obj, and now I had to do the same for obj2yaml. Differential revision: https://reviews.llvm.org/D57229 llvm-svn: 352373	2019-01-28 15:05:10 +00:00
Jordan Rupprecht	b2702d6a45	[llvm-objcopy] Fix crash when writing empty binary output Summary: When using llvm-objcopy -O binary and the resulting file will be empty (e.g. removing the only section that would be written, or using --only-keep with a section that doesn't exist/isn't SHF_ALLOC), we crash because FileOutputBuffer expects Size > 0. Add a regression test, and change Buffer to open/truncate the output file in this case. Reviewers: alexshap, jhenderson, jakehehrlich, espindola Reviewed By: alexshap, jhenderson Subscribers: jfb, llvm-commits, emaste, arichardson Differential Revision: https://reviews.llvm.org/D56806 llvm-svn: 352371	2019-01-28 15:02:40 +00:00
Aleksandar Beserminji	6c5dfcb89e	[mips] Support for +abs2008 attribute Instruction abs.[ds] does not generate a correct result when working with NaNs on revisions prior to mips32r6 and mips64r6. To generate a sequence that always produces a correct result, while also giving the user more control over how their code is compiled, the +abs2008 attribute is added, so the user can choose legacy or 2008 mode. By default, legacy mode is used on revisions prior to R6. Mips32r6 and mips64r6 use abs2008 mode by default. Differential Revision: https://reviews.llvm.org/D35983 llvm-svn: 352370	2019-01-28 14:59:30 +00:00
George Rimar	87fa2e66e7	[llvm-objdump] - Print LMAs when dumping section headers. When --section-headers is used, GNU objdump prints both LMA and VMA for sections. llvm-objdump does not do that, which makes its output slightly inconsistent. This patch teaches llvm-objdump to print LMA/VMA for ELF file formats. The behavior for other formats remains unchanged. Differential revision: https://reviews.llvm.org/D57146 llvm-svn: 352366	2019-01-28 14:11:35 +00:00
Tim Corringham	824ca3f3dd	[AMDGPU] Add intrinsics for 16 bit interpolation Summary: Added the intrinsics llvm.amdgcn.interp.p1.f16() and llvm.amdgcn.interp.p2.f16() and related LIT test. The p1 intrinsic generates code appropriate for both 16 and 32 bank LDS. Reviewers: #amdgpu, dstuttard, arsenm, tpr Reviewed By: #amdgpu, arsenm Subscribers: jvesely, mgorny, arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D46754 llvm-svn: 352357	2019-01-28 13:48:59 +00:00
James Y Knight	575c0855c0	[opaque pointer types] Remove GraphTraits specialization for Type. The only caller has been deleted in r352076, and I'd like to minimize the amount of code walking Type hierarchies generically, to make it easier to identify code depending on pointee types. llvm-svn: 352353	2019-01-28 13:25:57 +00:00
Petar Avramovic	7cecadb9af	[MIPS GlobalISel] Select sub Lower G_USUBO and G_USUBE. Add narrowScalar for G_SUB. Legalize and select G_SUB for MIPS 32. Differential Revision: https://reviews.llvm.org/D53416 llvm-svn: 352351	2019-01-28 12:10:17 +00:00
Jeremy Morse	8ebffb4b82	[DebugInfo][DAG] Avoid re-ordering of DBG_VALUEs This patch improves the placement of DBG_VALUEs by SelectionDAG, which as documented in PR40427 can go very wrong. At the core of this is ProcessSourceNode, which assumes the last instruction in a BB is the start of the last processed IR instruction, which isn't always true. Instead, use a helper function to call InstrEmitter::EmitNode, that records before-and-after iterators and determines the first of any new instruction created during emission. This is passed to ProcessSourceNode, which can then make more enlightened decisions about ordering for DBG_VALUE placement. Differential revision: https://reviews.llvm.org/D57163 llvm-svn: 352350	2019-01-28 12:08:31 +00:00
George Rimar	740974d984	[llvm-objdump] - Fix comment. NFC. This was mentioned by James Henderson in review for https://reviews.llvm.org/D57051. llvm-svn: 352348	2019-01-28 10:48:54 +00:00
George Rimar	4c3b297621	[llvm-objdump] - Implement the --adjust-vma option. GNU objdump's help says: "--adjust-vma: Add OFFSET to all displayed section addresses" In real life what it does is a bit more complicated (and IMO not always reasonable). For example, GNU objdump prints not only VMA, but also LMA for sections. And with --adjust-vma it adjusts LMA, but only when a section has relocations. llvm-objdump does not seem to support printing LMAs yet, but GNU's logic does not make sense to me here anyway. This patch tries to adjust VMA. I tried to implement a reasonable approach. I am not adjusting sections that are not allocatable since, for example, adjusting the VAs of debug sections and rel[a] sections would not make sense. This behavior seems to be GNU compatible. Differential revision: https://reviews.llvm.org/D57051 llvm-svn: 352347	2019-01-28 10:44:01 +00:00
Diana Picus	574e0c5e32	[ARM GlobalISel] Support integer division for Thumb2 Support G_SDIV, G_UDIV, G_SREM and G_UREM. The only significant difference between arm and thumb mode is that we need to check a different subtarget feature. llvm-svn: 352346	2019-01-28 10:37:30 +00:00
Craig Topper	453150bc18	[X86] Add new variadic avx512 compress/expand intrinsics that use vXi1 types for the mask argument. Remove and autoupgrade the old intrinsics llvm-svn: 352343	2019-01-28 07:03:03 +00:00
Craig Topper	b23d5ccafc	[X86] Add vbmi2 compressstore and expandload tests that aren't fast-isel tests. These got removed when we autoupgraded to target independent intrinsics, but we didn't have coverage anywhere else. The avx512f/avx512vl versions do have coverage. Also move some tests back from the upgrade file that aren't really upgraded. llvm-svn: 352342	2019-01-28 05:42:39 +00:00
Petr Hosek	b667153cf6	[CMake] Use __libc_start_main rather than fopen when checking for C library The check_library_exists CMake uses a custom symbol definition. This is a problem when checking for C library symbols because Clang recognizes many of them as builtins, and returns the -Wbuiltin-requires-header (or -Wincompatible-library-redeclaration) error. When building with -Werror, which is the default, this causes the check_library_exists check to fail, making the build think that the C library isn't available. To avoid this issue, we should use a symbol that isn't recognized by Clang and wouldn't cause the same issue. __libc_start_main seems like a reasonable choice that fits the bill. Differential Revision: https://reviews.llvm.org/D57142 llvm-svn: 352341	2019-01-28 04:12:54 +00:00
Amara Emerson	fd31bf95c1	[AArch64][GlobalISel] Teach RBS about G_FNEG default mapping. llvm-svn: 352340	2019-01-28 03:21:14 +00:00
Amara Emerson	0bfa2faccc	[AArch64][GlobalISel] Add some missing vector support for FP arithmetic ops. Moved the fneg lowering legalization test from AArch64 to X86, as we want to specify that it's already legal. llvm-svn: 352338	2019-01-28 02:28:22 +00:00
Amara Emerson	92ffb305cc	[AArch64][GlobalISel] Add some vector support for fp <-> int conversions. Some unrelated, but benign, test changes as well due to the test update script. llvm-svn: 352337	2019-01-28 02:27:59 +00:00
Matt Arsenault	cfca2a7adf	GlobalISel: Don't reduce elements for atomic load/store This is invalid for the same reason as in the narrowScalar handling for load. llvm-svn: 352334	2019-01-27 22:36:24 +00:00
Sanjay Patel	ebe6b43aec	[x86] add restriction for lowering to vpermps This transform was added with rL351346, and we had an escape for shufps, but we also want one for unpckps vs. vpermps because vpermps doesn't take an immediate shuffle index operand. llvm-svn: 352333	2019-01-27 21:53:33 +00:00
Matt Arsenault	816c9b3e25	GlobalISel: Factor fewerElementVectors into separate functions llvm-svn: 352332	2019-01-27 21:53:09 +00:00
Sanjay Patel	9ceaf2932a	[x86] add tests for extract/extract/unpack; NFC llvm-svn: 352331	2019-01-27 21:34:51 +00:00
Simon Pilgrim	670a6971f8	[X86][SSE] Add UNDEF handling to combineSelect ISD::USUBSAT matching (PR40083) llvm-svn: 352330	2019-01-27 21:01:23 +00:00
Simon Pilgrim	e5cf884018	[X86][SSE] Add UNDEF test case for combineSelect ISD::USUBSAT matching (PR40083) llvm-svn: 352329	2019-01-27 20:52:34 +00:00
Simon Pilgrim	f10b6623cc	[X86][SSE] Permit UNDEFs in combineAddToSUBUS matching (PR40083) llvm-svn: 352328	2019-01-27 20:36:37 +00:00
Sanjay Patel	6c865deedd	[x86] add more tests for lowerShuffleWithUndefHalf; NFC Some other transform is creating the opposite form and causing an infinite loop if we try to split some of these. llvm-svn: 352327	2019-01-27 20:17:02 +00:00
Simon Pilgrim	976b093ecb	[X86][SSE] Add PSUBUS undef element test case (PR40083) llvm-svn: 352326	2019-01-27 20:09:30 +00:00
Martin Storsjo	e5eb6fb950	[COFF] Add new relocation types. Differential Revision: https://reviews.llvm.org/D57291 llvm-svn: 352324	2019-01-27 19:53:36 +00:00
Alexandre Ganea	1dc4e01cbf	Fix some warnings on MSVC Differential Revision: https://reviews.llvm.org/D56329 llvm-svn: 352322	2019-01-27 18:41:40 +00:00
Simon Pilgrim	c9d32e20d5	[X86] Add test cases for PR36721 (unnecessary andl for %cl when shifting) llvm-svn: 352321	2019-01-27 18:31:33 +00:00
Sanjay Patel	5f1fdaa192	[x86] refactor logic in lowerShuffleWithUndefHalf Although this is longer code, this is no-functional-change-intended. The goal is to untangle the conditions under which we bail out, so that's easier to adjust. llvm-svn: 352320	2019-01-27 18:12:03 +00:00
Matt Arsenault	fdfb7d78f1	GlobalISel: Verify load/store has a pointer input I expected this to be automatically verified, but it seems nothing uses the fact that the type index was declared as a "ptype" llvm-svn: 352319	2019-01-27 15:57:23 +00:00
Roman Lebedev	d35424a2b3	[X86][NFC] Replace "<%s" with "< %s" in run-lines. While I have no intention of actually committing a regeneration of the check lines in these test files with update_llc_test_checks, the lack of that whitespace breaks that util, which is mildly inconvenient. llvm-svn: 352318	2019-01-27 15:36:35 +00:00
Roman Lebedev	661577466e	[NFC][MCA][X86][BdVer2] Cherry-pick int-to-ivec forwarding tests from BtVer2 llvm-svn: 352317	2019-01-27 14:35:54 +00:00
Simon Pilgrim	f6d7cfef39	[X86] Add CGP tests for PR40486 llvm-svn: 352316	2019-01-27 14:04:45 +00:00
Simon Pilgrim	adca820927	[TTI] Add generic SADDSAT/SSUBSAT costs Add generic costs calculation for SADDSAT/SSUBSAT intrinsics, this uses generic costs for sadd_with_overflow/ssub_with_overflow, an extra sign comparison + a selects based on the sign/overflow. This completes PR40316 Differential Revision: https://reviews.llvm.org/D57239 llvm-svn: 352315	2019-01-27 13:51:59 +00:00
Simon Pilgrim	c09a4db3b7	[X86] Regenerate reverse branch test to explicitly show branching and condition codes. llvm-svn: 352314	2019-01-27 12:39:38 +00:00
Simon Pilgrim	7b980ad368	[X86] Regenerate test to explicitly show branching and condition codes. llvm-svn: 352313	2019-01-27 12:38:09 +00:00
Amara Emerson	711bbdc894	Re-apply "r351584: "GlobalISel: Verify g_zextload and g_sextload"" I reverted it originally due to a bot failing. The underlying bug has been fixed as of r352311. llvm-svn: 352312	2019-01-27 11:34:41 +00:00
Amara Emerson	bf43004ff1	[AArch64][GlobalISel] Fix the G_EXTLOAD combiner creating non-extending illegal instructions. This fixes loads like 's1 = load %p (load 1 from %p)' being combined with an extend into an illegal 's8 = g_extload %p (load 1 from %p)' which doesn't do any extension, by avoiding touching those < s8 size loads. This bug was uncovered by a verifier update r351584, which I reverted to keep the bots green. llvm-svn: 352311	2019-01-27 10:56:20 +00:00
Thomas Preud'homme	5cb1193075	Revert "Add support for prefix-only CLI options" This reverts commit r351038. llvm-svn: 352310	2019-01-27 09:02:46 +00:00
Thomas Preud'homme	447abc57c5	Revert "Detect incorrect FileCheck variable CLI definition" This reverts commit r351039. llvm-svn: 352309	2019-01-27 09:02:19 +00:00
Thomas Preud'homme	e97834e28f	Revert "Fix defines.txt" This reverts commit r351042. llvm-svn: 352308	2019-01-27 09:02:05 +00:00
Gabor Buella	a0f743b77a	[X86] Add some missing blsr patterns The add+and sequence followed by a branch can happen e.g. when looping over the set bits of an integer: ``` while (x != 0) { func(x & ~x); x &= x - 1; } ``` Reviewed By: ctopper Differential Revision: https://reviews.llvm.org/D57296 llvm-svn: 352306	2019-01-27 06:15:39 +00:00
Gabor Buella	23b04798ad	[NFC][X86] Add a few more blsr test cases llvm-svn: 352305	2019-01-27 06:05:40 +00:00
Craig Topper	e65d4c5525	[X86] Add a pattern for (i64 (and (anyext def32:), 0x00000000FFFFFFFF)) to produce SUBREG_TO_REG def32 here means the producing instruction zeroed bits 63:32. We already do this for zext, but it looks like we can get an and+anyext sometimes. Spotted in the diffs from D33587. llvm-svn: 352303	2019-01-27 03:37:05 +00:00
Matt Arsenault	590c67507a	GlobalISel: Fix typo in assert messages llvm-svn: 352301	2019-01-27 00:53:54 +00:00
Matt Arsenault	211e89d4dd	GlobalISel: Implement narrowScalar for mul llvm-svn: 352300	2019-01-27 00:52:51 +00:00
Matt Arsenault	2e5f900849	GlobalISel: fewerElementsVector for intrinsic_trunc/intrinsic_round llvm-svn: 352298	2019-01-27 00:12:21 +00:00
Matt Arsenault	ded2f82662	AMDGPU/GlobalISel: Use scalarize instead of clampMaxNumElements llvm-svn: 352297	2019-01-26 23:54:53 +00:00
Amara Emerson	203760ab9c	[GlobalISel][IRTranslator] Fix crash on translation of fneg. When the fneg IR instruction was added the code to do translation wasn't tested, and tried to get an invalid operand. llvm-svn: 352296	2019-01-26 23:47:09 +00:00
Matt Arsenault	26a6c74fbe	AMDGPU/GlobalISel: Legalize more bit ops llvm-svn: 352295	2019-01-26 23:47:07 +00:00
Matt Arsenault	4d47594fc5	AMDGPU/GlobalISel: Widen small uaddo/usubo llvm-svn: 352294	2019-01-26 23:44:51 +00:00
Johannes Doerfert	00102c7d95	[ValueTracking] Look through casts when determining non-nullness Bitcast and certain Ptr2Int/Int2Ptr instructions will not alter the value of their operand and can therefore be looked through when we determine non-nullness. Differential Revision: https://reviews.llvm.org/D54956 llvm-svn: 352293	2019-01-26 23:40:35 +00:00
Simon Pilgrim	a914fa4dd8	[X86] combineAddOrSubToADCOrSBB/combineCarryThroughADD - use oneuse for entire SDNode Fix issue noted in D57281 that only tested the one use for the SDValue (the result flag), not the entire SUB. I've added the getNode() to make it clearer what is intended than just the -> redirection. llvm-svn: 352291	2019-01-26 21:29:16 +00:00
Simon Pilgrim	37a8e65a60	[X86] combineCarryThroughADD - add support for X86::COND_A commutations (PR24545) As discussed on PR24545, we should try to commute X86::COND_A 'icmp ugt' cases to X86::COND_B 'icmp ult' to more optimally bind the carry flag output to a SBB instruction. Differential Revision: https://reviews.llvm.org/D57281 llvm-svn: 352289	2019-01-26 20:23:04 +00:00
Simon Pilgrim	b7a15acd38	[X86] Fold X86ISD::SBB(ISD::SUB(X,Y),0) -> X86ISD::SBB(X,Y) (PR25858) We often generate X86ISD::SBB(X, 0) for carry flag arithmetic. I had tried to create test cases for the ADC equivalent (which often uses the same pattern) but haven't managed to find anything yet. Differential Revision: https://reviews.llvm.org/D57169 llvm-svn: 352288	2019-01-26 20:13:44 +00:00
Amaury Sechet	be03018384	Generate test results for combine-fcopysign.ll using update_llc_test_checks.py . NFC llvm-svn: 352285	2019-01-26 18:13:53 +00:00
Simon Pilgrim	6162fba57c	[X86][SSE] Generalized unsigned compares to support nonsplat constant vectors (PR39859) llvm-svn: 352283	2019-01-26 16:40:03 +00:00
Simon Pilgrim	7d6c58e843	[X86] Add nonsplat increment/decrement constant vector with min/max test (PR39859) llvm-svn: 352281	2019-01-26 16:27:48 +00:00
Sanjay Patel	a03c63b77f	[x86] add helper for creating a half-width shuffle; NFC This reduces a bit of duplication between the combining and lowering places that use it, but the primary motivation is to make it easier to rearrange the lowering logic and solve PR40434: https://bugs.llvm.org/show_bug.cgi?id=40434 llvm-svn: 352280	2019-01-26 16:20:22 +00:00
Simon Pilgrim	0199838883	[X86] Add test case from PR34292 llvm-svn: 352274	2019-01-26 13:56:53 +00:00
Simon Pilgrim	c9d33907ef	[llvm-mca][X86] Add some missing DQI tests Match more of the coverage of test\CodeGen\X86\avx512-schedule.ll as discussed on D57244 llvm-svn: 352273	2019-01-26 13:00:46 +00:00
Simon Pilgrim	3cdf3f681d	[X86] Add 'less_than_ideal' followup test case from PR24545 llvm-svn: 352272	2019-01-26 12:51:52 +00:00
Craig Topper	21cdcd7b2b	[X86] Autoupgrade some of the intrinsics used by stack folding tests that have been previously removed. llvm-svn: 352271	2019-01-26 06:27:04 +00:00
Craig Topper	3b5e01b386	[X86] Remove and autoupgrade vpconflict intrinsics that take a mask and passthru argument. We have unmasked versions as of r352172 llvm-svn: 352270	2019-01-26 06:27:01 +00:00
Craig Topper	58e6b37e62	Revert r352255 "[SelectionDAG][X86] Don't use SEXTLOAD for promoting masked loads in the type legalizer" This might be breaking an lldb windows buildbot. llvm-svn: 352268	2019-01-26 02:44:58 +00:00
Craig Topper	6c9c7d0796	[X86] Remove GCCBuiltins from 512-bit cvt(u)qqtops, cvt(u)qqtopd, and cvt(u)dqtops intrinsics. Add new variadic uitofp/sitofp with rounding mode intrinsics. Summary: See clang patch D56998 for a full description. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D56999 llvm-svn: 352266	2019-01-26 02:41:54 +00:00
Matt Arsenault	cdc201fcde	GlobalISel: Fix address space limit in LLT The IR enforced limit for the address space is 24-bits, but LLT was only using 23-bits. Additionally, the argument to the constructor was truncating to 16-bits. A similar problem still exists for the number of vector elements. The IR enforces no limit, so if you try to use a vector with > 65535 elements the IRTranslator asserts in the LLT constructor. llvm-svn: 352264	2019-01-26 01:42:13 +00:00
Thomas Lively	2b8b2978e4	[WebAssembly][NFC] Group SIMD-related ISel configuration Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish Differential Revision: https://reviews.llvm.org/D57263 llvm-svn: 352262	2019-01-26 01:25:37 +00:00
Nemanja Ivanovic	7d007ddedf	[PowerPC] Update Vector Costs for P9 For the power9 CPU, vector operations consume a pair of execution units rather than one execution unit like a scalar operation. Update the target transform cost functions to reflect the higher cost of vector operations when targeting Power9. Patch by RolandF. Differential revision: https://reviews.llvm.org/D55461 llvm-svn: 352261	2019-01-26 01:18:48 +00:00
Craig Topper	7a8e74775c	[X86] Add DAG combine to merge vzext_movl with the various fp<->int conversion operations that only write the lower 64-bits of an xmm register and zero the rest. Summary: We have isel patterns for this, but we're missing some load patterns and all broadcast patterns. A DAG combine seems like a better fit for this. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D56971 llvm-svn: 352260	2019-01-26 01:17:09 +00:00
Vedant Kumar	8ca0875617	[llvm-nm] Print out N_COLD_FUNC as "cold func" Per post-commit feedback from Mike, have llvm-nm print out this symbol attribute as "[cold func]". llvm-svn: 352258	2019-01-26 00:33:15 +00:00
Artem Belevich	dfad526943	[NVPTX] Some nvvm.read.ptx.sreg intrinsics should have IntrInaccessibleMemOnly attribute. These intrinsics may return different values every time they are called and should not be CSE'd. IntrInaccessibleMemOnly appears to be the right attribute to model this behavior. Differential Revision: https://reviews.llvm.org/D57259 llvm-svn: 352256	2019-01-26 00:28:32 +00:00
Craig Topper	b1d3457c03	[SelectionDAG][X86] Don't use SEXTLOAD for promoting masked loads in the type legalizer Summary: I'm not sure why we were using SEXTLOAD. EXTLOAD seems more appropriate since we don't care about the upper bits. This patch changes this and then modifies the X86 post legalization combine to emit an extending shuffle instead of a sign_extend_vector_inreg. Could maybe use an any_extend_vector_inreg, but I just did what we already do in LowerLoad. I think we can actually get rid of this code entirely if we switch to -x86-experimental-vector-widening-legalization. On AVX512 targets I think we might be able to use a masked vpmovzx and not have to expand this at all. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D57186 llvm-svn: 352255	2019-01-26 00:26:37 +00:00
Hans Wennborg	4c85e72ad3	Build LLVM-C.dll by default on windows and enable in release package With the fixes to the building of LLVM-C.dll in D56781 this should now be safe to land. This will greatly simplify dealing with LLVM for people that just want to use the C API on windows. This is a follow up from D35077. Patch by Jakob Bornecrantz! Differential revision: https://reviews.llvm.org/D56774 llvm-svn: 352250	2019-01-25 22:45:17 +00:00
Alexey Lapshin	31f47b8194	[NFC] Test commit : fix typo. llvm-svn: 352248	2019-01-25 21:59:53 +00:00
Alex Bradbury	0092df0669	[RISCV] Add target DAG combine for bitcast fabs/fneg on RV32FD DAGCombiner::visitBITCAST will perform: fold (bitconvert (fneg x)) -> (xor (bitconvert x), signbit) fold (bitconvert (fabs x)) -> (and (bitconvert x), (not signbit)) As shown in double-bitmanip-dagcombines.ll, this can be advantageous. But RV32FD doesn't use bitcast directly (as i64 isn't a legal type), and instead uses RISCVISD::SplitF64. This patch adds an equivalent DAG combine for SplitF64. llvm-svn: 352247	2019-01-25 21:55:48 +00:00
Mircea Trofin	519f42d914	[llvm] Opt-in flag for X86DiscriminateMemOps Summary: Currently, if an instruction with a memory operand has no debug information, X86DiscriminateMemOps will generate one based on the first line of the enclosing function, or the last seen debug info. This may cause confusion in certain debugging scenarios. The long term approach would be to use the line number '0' in such cases, however, that brings in challenges: the base discriminator value range is limited (4096 values). For the short term, adding an opt-in flag for this feature. See bug 40319 (https://bugs.llvm.org/show_bug.cgi?id=40319) Reviewers: dblaikie, jmorse, gbedwell Reviewed By: dblaikie Subscribers: aprantl, eraman, hiraditya Differential Revision: https://reviews.llvm.org/D57257 llvm-svn: 352246	2019-01-25 21:49:54 +00:00
Jessica Paquette	1f9bc2854f	[GlobalISel][AArch64][NFC] Fix incorrect comment in selectUnmergeValues s/scalar/vector/ llvm-svn: 352243	2019-01-25 21:28:27 +00:00
Alina Sbirlea	a34bcbf335	Revert rL352238. llvm-svn: 352241	2019-01-25 21:12:08 +00:00
Alex Bradbury	d760910d3d	[RISCV] Add another potential combine to {double,float}-bitmanip-dagcombines.ll (fcopysign a, (fneg b)) will be expanded to bitwise operations by DAGTypeLegalizer::SoftenFloatRes_FCOPYSIGN if the floating point type isn't legal. Arguably it might be worth doing a combine even if it is legal. llvm-svn: 352240	2019-01-25 21:06:47 +00:00
Alina Sbirlea	890a8e575f	[WarnMissedTransforms] Set default to 1. Summary: Set default value for retrieved attributes to 1, since the check is against 1. Eliminates the warning noise generated when the attributes are not present. Reviewers: sanjoy Subscribers: jlebar, llvm-commits Differential Revision: https://reviews.llvm.org/D57253 llvm-svn: 352238	2019-01-25 20:51:55 +00:00
Ana Pazos	05a6064385	Reapply: [RISCV] Set isAsCheapAsAMove for ADDI, ORI, XORI, LUI This reapplies commit r352010 with RISC-V test fixes. llvm-svn: 352237	2019-01-25 20:22:49 +00:00
Guozhi Wei	81f3fd4bf8	[MBP] Don't move bottom block before header if it can't reduce taken branches If bottom of block BB has only one successor OldTop, in most cases it is profitable to move it before OldTop, except the following case: -->OldTop<- \| . \| \| . \| \| . \| ---Pred \| \| \| BB----- Moving BB before OldTop can't reduce the number of taken branches. This patch detects this case and prevents the move. Differential Revision: https://reviews.llvm.org/D57067 llvm-svn: 352236	2019-01-25 19:45:13 +00:00
Craig Topper	4cf28bad5b	[X86] Combine masked store and truncate into masked truncating stores. We also need to combine to masked truncating with saturation stores, but I'm leaving that for a future patch. This does regress some tests that used truncate with saturation followed by a masked store. Those now use a truncating store and use min/max to saturate. Differential Revision: https://reviews.llvm.org/D57218 llvm-svn: 352230	2019-01-25 18:37:36 +00:00
Vedant Kumar	db3f9774ee	[HotColdSplit] Introduce a cost model to control splitting behavior The main goal of the model is to avoid increasing function size, as that would eradicate any memory locality benefits from splitting. This happens when: - There are too many inputs or outputs to the cold region. Argument materialization and reloads of outputs have a cost. - The cold region has too many distinct exit blocks, causing a large switch to be formed in the caller. - The code size cost of the split code is less than the cost of a set-up call. A secondary goal is to prevent excessive overall binary size growth. With the cost model in place, I experimented to find a splitting threshold that works well in practice. To make warm & cold code easily separable for analysis purposes, I moved split functions to a "cold" section. I experimented with thresholds between [0, 4] and set the default to the threshold which minimized geomean __text size. Experiment data from building LNT+externals for X86 (N = 639 programs, all sizes in bytes): \| Configuration \| __text geom size \| __cold geom size \| TEXT geom size \| \| -Os \| 1736.3 \| 0, n=0 \| 10961.6 \| \| -Os, thresh=0 \| 1740.53 \| 124.482, n=134 \| 11014 \| \| -Os, thresh=1 \| 1734.79 \| 57.8781, n=90 \| 10978.6 \| \| -Os, thresh=2 \| 1733.85 \| 65.6604, n=61 \| 10977.6 \| \| -Os, thresh=3 \| 1733.85 \| 65.3071, n=61 \| 10977.6 \| \| -Os, thresh=4 \| 1735.08 \| 67.5156, n=54 \| 10965.7 \| \| -Oz \| 1554.4 \| 0, n=0 \| 10153 \| \| -Oz, thresh=2 \| 1552.2 \| 65.633, n=61 \| 10176 \| \| -O3 \| 2563.37 \| 0, n=0 \| 13105.4 \| \| -O3, thresh=2 \| 2559.49 \| 71.1072, n=61 \| 13162.4 \| Picking thresh=2 reduces the geomean __text section size by 0.14% at -Os, -Oz, and -O3 and causes ~0.2% growth in the TEXT segment. Note that TEXT size is page-aligned, whereas section sizes are byte-aligned. 
Experiment data from building LNT+externals for ARM64 (N = 558 programs, all sizes in bytes): \| Configuration \| __text geom size \| __cold geom size \| TEXT geom size \| \| -Os \| 1763.96 \| 0, n=0 \| 42934.9 \| \| -Os, thresh=2 \| 1760.9 \| 76.6755, n=61 \| 42934.9 \| Picking thresh=2 reduces the geomean __text section size by 0.17% at -Os and causes no growth in the TEXT segment. Measurements were done with D57082 (r352080) applied. Differential Revision: https://reviews.llvm.org/D57125 llvm-svn: 352228	2019-01-25 18:30:37 +00:00
Vedant Kumar	13ef84fced	[MC] Teach the MachO object writer about N_FUNC_COLD N_FUNC_COLD is a new MachO symbol attribute. It's a hint to the linker to order a symbol towards the end of its section, to improve locality. Example: ``` void a1() {} __attribute__((cold)) void a2() {} void a3() {} int main() { a1(); a2(); a3(); return 0; } ``` A linker that supports N_FUNC_COLD will order _a2 to the end of the text section. From `nm -njU` output, we see: ``` _a1 _a3 _main _a2 ``` Differential Revision: https://reviews.llvm.org/D57190 llvm-svn: 352227	2019-01-25 18:30:22 +00:00
Florian Hahn	fd7ee47940	[opt-viewer] Add javascript to expand/hide full message for multiline remarks. This patch adds support for displaying remarks with multiple lines. For such remarks, it creates a hidden div containing the message's lines except the first one in a <pre> tag. It also prepends a link (with '+' as text) to the regular remark line. This link can be used to show/hide the div containing the full remark. In combination with D57159, this allows for better displaying of multiline remarks in the html pages generated by opt-viewer. The Javascript is very simple and should be supported by any recent major browser. Reviewers: hfinkel, anemet, thegameg, serge-sans-paille Reviewed By: anemet Differential Revision: https://reviews.llvm.org/D57167 llvm-svn: 352223	2019-01-25 17:48:31 +00:00
Sanjay Patel	0020f8bb23	[x86] simplify logic in lowerShuffleWithUndefHalf(); NFCI This seems unnecessarily complicated because we gave names to opposite polarity bools and have code comments that don't really line up with the logic. Step 1: remove UndefUpper and assert that it is the opposite of UndefLower after the initial early exit. llvm-svn: 352217	2019-01-25 17:00:41 +00:00
Florian Hahn	ca95ee5e11	[DiagnosticInfo] Add support for preserving newlines in remark arguments. This patch adds a new type StringBlockVal which can be used to emit a YAML block scalar, which preserves newlines in a multiline string. It also updates MappingTraits<DiagnosticInfoOptimizationBase::Argument> to use it for argument values with more than a single newline. This is helpful for remarks that want to display more in-depth information in a more structured way. Reviewers: thegameg, anemet Reviewed By: anemet Subscribers: hfinkel, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D57159 llvm-svn: 352216	2019-01-25 16:59:06 +00:00
Tom Weaver	4db70d9695	[TEST][COMMIT] - fix comment typo in AsmPrinter/DwarfDebug.cpp - NFC llvm-svn: 352214	2019-01-25 16:29:35 +00:00
Javed Absar	2ee81933d0	[TblGen][NFC] Fix documentation formatting llvm-svn: 352212	2019-01-25 16:17:57 +00:00
Alex Bradbury	c67515d542	[RISCV][NFC] s/f32/f64 in double-arith.ll The intrinsic names erroneously used the .f32 variant. As the return and argument types were still double the intrinsics calls worked properly. llvm-svn: 352211	2019-01-25 16:04:04 +00:00
Simon Pilgrim	f56298f4b9	[X86] Simplify X86ISD::ADD/SUB if we don't use the result flag Simplify to the generic ISD::ADD/SUB if we don't make use of the result flag. This mainly helps with ADDCARRY/SUBBORROW intrinsics which get expanded to X86ISD::ADD/SUB but could be simplified further. Noticed in some of the test cases in PR31754 Differential Revision: https://reviews.llvm.org/D57234 llvm-svn: 352210	2019-01-25 15:58:28 +00:00
Sanjay Patel	21aa6ddc14	[x86] narrow a shuffle that doesn't use or set any high elements This isn't the final fix for our reduction/horizontal codegen, but it takes care of a lot of the problems. After we narrow the shuffle, existing combines for insert/extract and binops kick in, and we end up with cheaper 128-bit ops. The avg and mul reduction tests show an existing shuffle lowering hole for AVX2/AVX512. I think in its most minimal form this is: https://bugs.llvm.org/show_bug.cgi?id=40434 ...but we might need multiple fixes to get it right. Differential Revision: https://reviews.llvm.org/D57156 llvm-svn: 352209	2019-01-25 15:37:42 +00:00
Clement Courbet	b120127001	Revert r351954 "Add a value_type to ArrayRef." This breaks arm self-hosted buildbots. llvm-svn: 352206	2019-01-25 15:25:52 +00:00
Sam McCall	1e7491ea9c	[JSON] Work around excess-precision issue when comparing T_Integer numbers. Reviewers: bkramer Subscribers: kristina, llvm-commits Differential Revision: https://reviews.llvm.org/D57237 llvm-svn: 352204	2019-01-25 15:05:33 +00:00
Nico Weber	e4ed82d674	gn build: Merge r352149 llvm-svn: 352202	2019-01-25 14:53:30 +00:00
Nico Weber	0c828ccc67	gn build: Revert r352200, commit message was wrong llvm-svn: 352201	2019-01-25 14:52:50 +00:00
Nico Weber	74bb231b90	gn build: Merge r352148 llvm-svn: 352200	2019-01-25 14:50:14 +00:00
Alex Bradbury	38c4ec31cb	[RISCV] Add tests to demonstrate bitcasted fneg/fabs dagcombines This target-independent code won't trigger for cases such as RV32FD where custom SelectionDAG nodes are generated. These new tests demonstrate such cases. Additionally, float-arith.ll was updated so that fneg.s, fsgnjn.s, and fabs.s selection patterns are actually exercised. llvm-svn: 352199	2019-01-25 14:33:08 +00:00
Simon Pilgrim	d6e1e3569c	Fix line endings and trim trailing whitespace. NFCI. llvm-svn: 352198	2019-01-25 14:29:57 +00:00
Haojian Wu	7852b7106a	gitignore: ignore clangd index files. Reviewers: kadircet Subscribers: ilya-biryukov, ioeric, MaskRay, jkorous, arphaman, llvm-commits Differential Revision: https://reviews.llvm.org/D57227 llvm-svn: 352197	2019-01-25 14:05:18 +00:00
Simon Pilgrim	d41ccddda9	[X86] Add addcarry/subborrow combine tests Show failure to simplify cases with zero op/flags llvm-svn: 352196	2019-01-25 12:26:27 +00:00
James Henderson	759d5e6783	[llvm-symbolizer] Add switch to adjust addresses by fixed offset If a stack trace or similar has a list of addresses from an executable or DSO loaded at a variable address (e.g. due to ASLR), the addresses will not directly correspond to the addresses stored in the object file. If a user wishes to use llvm-symbolizer, they have to subtract the load address from every address. This is somewhat inconvenient, especially as the output of --print-address will result in the adjusted address being listed, rather than the address coming from the stack trace, making it harder to map results between the two. This change adds a new switch to llvm-symbolizer --adjust-vma which takes an offset, which is then used to automatically do this calculation. The printed address remains the input address (allowing for easy mapping), whilst the specified offset is applied to the addresses when performing the lookup. The switch is conceptually similar to llvm-objdump's new switch of the same name (see D57051), which in turn mirrors a GNU switch. There is no equivalent switch in addr2line. Reviewed by: grimar Differential Revision: https://reviews.llvm.org/D57151 llvm-svn: 352195	2019-01-25 11:49:21 +00:00
Max Kazantsev	7822d25de3	[NFC] One more crashing test on LoopSimplifyCFG llvm-svn: 352194	2019-01-25 11:47:16 +00:00
Simon Pilgrim	dea6174b0b	Fix gcc -Wparentheses warning. NFCI. llvm-svn: 352193	2019-01-25 11:38:40 +00:00
Simon Pilgrim	cdf58092e4	Fix gcc -Wparentheses warning. NFCI. llvm-svn: 352191	2019-01-25 11:34:58 +00:00
Max Kazantsev	e5116e9b4a	[NFC] Add failing test on LCSSA forming llvm-svn: 352190	2019-01-25 11:32:21 +00:00
Diana Picus	8976ad12a9	[ARM GlobalISel] Support shifts for Thumb2 Same as ARM. On this occasion we split some of the instruction select tests for more complicated instructions into their own files, so we can reuse them for ARM and Thumb mode. Likewise for the legalizer tests. llvm-svn: 352188	2019-01-25 10:48:42 +00:00
Diana Picus	23628c7b05	[ARM GlobalISel] Remove rebase artifact from r351882. NFC r351882 introduced some superfluous calls to mark G_INTTOPTR and G_PTRTOINT as legal (looks like a rebase mishap). Remove them. llvm-svn: 352187	2019-01-25 10:48:35 +00:00
Javed Absar	a3e3d85286	[TblGen] Extend !if semantics through new feature !cond This patch extends TableGen language with !cond operator. Instead of embedding !if inside !if which can get cumbersome, one can now use !cond. Below is an example to convert an integer 'x' into a string: !cond(!lt(x,0) : "Negative", !eq(x,0) : "Zero", !eq(x,1) : "One", 1 : "MoreThanOne") Reviewed By: hfinkel, simon_tatham, greened Differential Revision: https://reviews.llvm.org/D55758 llvm-svn: 352185	2019-01-25 10:25:25 +00:00
Douglas Yung	914e838e63	[llvm-objcopy] Add support for -g as an alias for --strip-debug This change adds an option -g to llvm-objcopy which is an alias for the existing option --strip-debug. This fixes PR40003. Reviewed by: alexshap Differential Revision: https://reviews.llvm.org/D57217 llvm-svn: 352182	2019-01-25 09:57:20 +00:00
Simon Pilgrim	d36f7730cd	[llvm-mca][X86] Add missing shuffle tests Match the coverage of test\CodeGen\X86\avx512-shuffle-schedule.ll so we can get rid of -print-schedule (and fix PR37160) without losing schedule tests llvm-svn: 352179	2019-01-25 09:17:30 +00:00
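
Note on a3e3d85286 ([TblGen] !cond): the operator evaluates to the value of the first clause whose condition holds, which is what lets it replace nested !if. The following is a minimal Python sketch of that first-match behaviour, not TableGen itself. The helper names `cond` and `describe` are hypothetical and chosen only for illustration, and unlike TableGen this sketch evaluates every condition eagerly before picking one.

```python
def cond(*clauses):
    """Return the value paired with the first true condition.

    Each clause is a (condition, value) tuple, mirroring the
    `condition : value` pairs in the commit message's !cond example.
    """
    for condition, value in clauses:
        if condition:
            return value
    # Assumption for this sketch: having no true condition is an error.
    raise ValueError("no condition evaluated to true")


def describe(x):
    """Python analogue of the integer-to-string example from the commit."""
    return cond(
        (x < 0, "Negative"),
        (x == 0, "Zero"),
        (x == 1, "One"),
        (True, "MoreThanOne"),  # catch-all, like the trailing `1 : ...` clause
    )
```

Calling `describe` with -3, 0, 1 and 7 exercises each clause in turn; the last clause acts as the default because its condition is always true.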

174552 Commits