llvm-project

Commit Graph

Author	SHA1	Message	Date
Hal Finkel	2d3d6da44b	Fix a typo in AddAliasScopeMetadata llvm-svn: 216741	2014-08-29 16:33:41 +00:00
Matt Arsenault	85cbc7e371	Make fabs safe to speculatively execute llvm-svn: 216736	2014-08-29 16:01:17 +00:00
Matt Arsenault	8675db15da	R600/SI: Use mad for fsub + fmul We can use a negate source modifier to match this for fsub. llvm-svn: 216735	2014-08-29 16:01:14 +00:00
Tim Northover	3c0915e858	AArch64: only try to get operand of a known node. A bug in r216725 meant we tried to discover the type of a SETCC before confirming the node actually was a SETCC. llvm-svn: 216734	2014-08-29 15:34:58 +00:00
Frederic Riss	aaae87e5ab	Remove unnecessary regex in test pattern per dblaikie suggestion. llvm-svn: 216733	2014-08-29 15:32:15 +00:00
Sanjay Patel	a065eb44aa	typo llvm-svn: 216732	2014-08-29 15:32:09 +00:00
Jingyue Wu	cb83a155c1	[NVPTX] Make the alignment an explicit argument to ldu/ldg Summary: Instead of specifying the alignment as metadata which may be destroyed by transformation passes, make the alignment the second argument to ldu/ldg intrinsic calls. Test Plan: ldu-ldg.ll ldu-i8.ll ldu-reg-plus-offset.ll Reviewers: eliben, meheff, jholewinski Reviewed By: meheff, jholewinski Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D5093 llvm-svn: 216731	2014-08-29 15:30:20 +00:00
Tilmann Scheller	f28e7e7d34	[ARM] Make Thumb-2 code size optimization test more strict. llvm-svn: 216729	2014-08-29 15:13:35 +00:00
Tilmann Scheller	e8e577833a	[ARM] Add a first test for the Thumb-2 code size optimization pass. While working on a Thumb-2 code size optimization I just realized that we don't have any regression tests for it. So here's a first test case, I plan to increase the coverage over time. llvm-svn: 216728	2014-08-29 15:04:40 +00:00
Tim Northover	c1c05aeb5d	AArch64: skip select/setcc combine in complex case. In an llvm-stress generated test, we were trying to create a v0iN type and asserting when that failed. This case could probably be handled by the function, but not without added complexity and the situation it arises in is sufficiently odd that there's probably no benefit anyway. Should fix PR20775. llvm-svn: 216725	2014-08-29 13:05:18 +00:00
Arnaud A. de Grandmaison	6afbf2aa5e	[AArch64] FPLoadBalancing: move ownership of the chain to its current accumulator register and forget about the previously used accumulator. Coming up with a simple testcase is not easy, as this highly depends on what the register allocator is doing: this issue showed up while working with the PBQP allocator, which produced a different allocation scheme. A testcase would need to come up with chain starting in D[0-7], then moving to D[8-15], followed by a call to a function whose regmask clobbers the starting accumulator in D[0-7], then another use of the chain. Fixed some formatting, added some invariant checks while there. llvm-svn: 216721	2014-08-29 09:54:11 +00:00
Frederic Riss	5732301afb	Use DwarfDebug::attachLowHighPC for the compilation unit DIE. llvm-svn: 216719	2014-08-29 09:00:26 +00:00
Robert Khasanov	a651a62340	[SKX] Enable lowering of integer CMP operations. Added new types to Legalizer. Fixed getSetCCResultType function Added lowering tests. Reviewed by Elena Demikhovsky. llvm-svn: 216717	2014-08-29 08:46:04 +00:00
Job Noorman	9b31bd6bb0	Do not assume the value passed to memset is an i32. The code in SelectionDAG::getMemset for some reason assumes the value passed to memset is an i32. This breaks the generated code for targets that only have registers smaller than 32 bits because the value might get split into multiple registers by the calling convention. See the test for the MSP430 target included in the patch for an example. This patch ensures that nothing is assumed about the type of the value. Instead, the type is taken from the selected overload of the llvm.memset intrinsic. llvm-svn: 216716	2014-08-29 08:23:53 +00:00
Craig Topper	7e24c0a89c	Add conversion constructor to convert ArrayRef<T> to ArrayRef<const T>. Reviewed with Chandler and David Blaikie. llvm-svn: 216709	2014-08-29 06:01:43 +00:00
Jiangning Liu	08f4cda2ec	[AArch64] Fix some failures exposed by value type v4f16 and v8f16. 1) Add some missing bitcast patterns for v8f16. 2) Add type promotion for operand of ld/st operations. llvm-svn: 216706	2014-08-29 01:31:42 +00:00
Chris Bieneman	b1cd51e33c	Cleaning up static initializers in Signals.inc Reviewed by: Chandlerc llvm-svn: 216704	2014-08-29 01:05:16 +00:00
Chris Bieneman	5e7f44c25e	Cleaning up static initializers in TimeValue. Code reviewed by Chandlerc llvm-svn: 216703	2014-08-29 01:05:12 +00:00
Alexey Samsonov	4ee2675dfe	Introduce -DLLVM_USE_SANITIZER=Undefined CMake option to build UBSan-ified version of LLVM/Clang. I've fixed most of the simple bugs and currently "check-llvm" test suite has 26 failures, and "check-clang" suite has 5 failures. llvm-svn: 216701	2014-08-29 00:50:36 +00:00
Juergen Ributzka	77bc09f5ab	[FastISel][AArch64] Don't fold instructions that are not in the same basic block. This fix checks first if the instruction to be folded (e.g. sign-/zero-extend, or shift) is in the same machine basic block as the instruction we are folding into. Not doing so can result in incorrect code, because the value might not be live-out of the basic block, where the value is defined. This fixes rdar://problem/18169495. llvm-svn: 216700	2014-08-29 00:19:21 +00:00
David Majnemer	400e725bde	Revert two GEP-related InstCombine commits This reverts commit r216523 and r216598; people have reported regressions. llvm-svn: 216698	2014-08-29 00:06:43 +00:00
Reid Kleckner	febb279c9c	Don't promote byval pointer arguments when padding matters Don't promote byval pointer arguments when when their size in bits is not equal to their alloc size in bits. This can happen for x86_fp80, where the size in bits is 80 but the alloca size in bits in 128. Promoting these types can break passing unions of x86_fp80s and other types. Patch by Thomas Jablin! Reviewed By: rnk Differential Revision: http://reviews.llvm.org/D5057 llvm-svn: 216693	2014-08-28 22:42:00 +00:00
Jim Grosbach	ec2b0d0b11	AArch64: More correctly constrain target vector extend lowering. The AArch64 target lowering for [zs]ext of vectors is set up to handle input simple types and expects the generic SDag path to do something reasonable with anything that's not a simple type. The code, however, was only checking that the result type was a simple type and assuming that implied that the source type would also be a simple type. That's not a valid assumption, as operations like "zext <1 x i1> %0 to <1 x i32>" demonstrate. The fix is to simply explicitly validate the source type as well as the result type. PR20791 llvm-svn: 216689	2014-08-28 22:08:28 +00:00
Sanjay Patel	ccd267683b	Move FNEG next to FABS and make them more similar, so it's easier that they can be refactored. NFC. llvm-svn: 216688	2014-08-28 21:51:37 +00:00
Rafael Espindola	b43d51de95	On MachO, don't put non-private constants in mergeable sections. On MachO, putting a symbol that doesn't start with a 'L' or 'l' in one of the __TEXT,__literal* sections prevents the linker from merging the context of the section. Since private GVs are the ones the get mangled to start with 'L' or 'l', we now only put those on the __TEXT,__literal* sections. llvm-svn: 216682	2014-08-28 20:13:31 +00:00
Frederic Riss	9e32475f18	Constify MCSymbol* parameters to DwarfDebug::attachLowHighPC. llvm-svn: 216681	2014-08-28 19:09:29 +00:00
Sanjay Patel	81ecbb0737	Fix a logic bug in x86 vector codegen: sext (zext (x) ) != sext (x) (PR20472). Remove a block of code from LowerSIGN_EXTEND_INREG() that was added with: http://llvm.org/viewvc/llvm-project?view=revision&revision=177421 And caused: http://llvm.org/bugs/show_bug.cgi?id=20472 (more analysis here) http://llvm.org/bugs/show_bug.cgi?id=18054 The testcases confirm that we (1) don't remove a zext op that is necessary and (2) generate a pmovz instead of punpck if SSE4.1 is available. Although pmovz is 1 byte longer, it allows folding of the load, and so saves 3 bytes overall. Differential Revision: http://reviews.llvm.org/D4909 llvm-svn: 216679	2014-08-28 18:59:22 +00:00
Owen Anderson	3eb910b404	Do not introduce new shuffle patterns after operation legalization if SHUFFLE_VECTOR was marked custom. The target independent DAG combine has no way to know if the shuffles it is introducing are ones that the target could support or not. llvm-svn: 216678	2014-08-28 17:49:58 +00:00
Sanjay Patel	50cbfc5138	Janitorial services: "Don’t duplicate function or class name at the beginning of the comment." llvm-svn: 216674	2014-08-28 16:29:51 +00:00
Sanjay Patel	e28d57d9d8	Remove local TLI vars that are just duplicates of the class var. No functional change. llvm-svn: 216673	2014-08-28 16:01:50 +00:00
Sanjay Patel	78614bf0e8	Use local vars to improve readability. No functional change. Completes what was started in r216611 and r216623. Used const refs instead of pointers; not sure if one is preferable to the other. llvm-svn: 216672	2014-08-28 15:53:16 +00:00
Sid Manning	67a8936a84	Minor spelling correction. Reviewers: adasgupt, jverma, sidneym Differential Revision: http://reviews.llvm.org/D5025 llvm-svn: 216667	2014-08-28 14:16:32 +00:00
Aaron Ballman	a4aa0d7cc0	Silence a -Wsign-compare warning. NFC. llvm-svn: 216666	2014-08-28 13:23:26 +00:00
Arnaud A. de Grandmaison	efd4363172	[PBQP] Only output debug information when requested llvm-svn: 216660	2014-08-28 10:15:47 +00:00
David Majnemer	074052b623	InstCombine: Remove redundant combines InstSimplify already handles icmp (X+Y), X (and things like it) appropriately. The first thing that InstCombine does is run InstSimplify on the instruction. llvm-svn: 216659	2014-08-28 10:08:37 +00:00
Erik Eckstein	8354cfaf95	Fix: SLPVectorizer tried to move an instruction which was replaced by a vector instruction. For a detailed description of the problem see the comment in the test file. The problematic moveBefore() calls are not required anymore because the new scheduling algorithm ensures a correct ordering anyway. llvm-svn: 216656	2014-08-28 07:04:02 +00:00
David Xu	ee978203e6	Generate CMN when comparing a short int with minus llvm-svn: 216651	2014-08-28 04:59:53 +00:00
Justin Hibbits	3476db4220	Test commit. Fix whitespace from a previous patch of mine. llvm-svn: 216650	2014-08-28 04:40:55 +00:00
Lang Hames	c5cafbb074	[MCJIT] Fix format specifiers for debug output in RuntimeDyld. More work on http://llvm.org/PR20640 llvm-svn: 216648	2014-08-28 04:25:17 +00:00
David Majnemer	9ab5ff1c5b	MC: Don't crash when the COFF section limit is reached I've decided not to commit a test, it takes 2.5 seconds to run on my an incredibly strong machine. llvm-svn: 216647	2014-08-28 04:02:50 +00:00
Chandler Carruth	c01ce6bc01	[x86] Fix whitespace and formatting around this function with clang-format, no functionality changed. llvm-svn: 216646	2014-08-28 04:00:24 +00:00
Chandler Carruth	cb07a4adf3	[x86] Hoist conditions from every single if in this routine to a single early exit. And factor the subsequent cast<> from all but one block into a single variable. No functionality changed. llvm-svn: 216645	2014-08-28 03:57:13 +00:00
Chandler Carruth	974aa336b1	[x86] Inline an SSE4 helper function for INSERT_VECTOR_ELT lowering, no functionality changed. Separating this into two functions wasn't helping. There was a decent amount of boilerplate duplicated, and some subsequent refactorings here will pull even more common code out. llvm-svn: 216644	2014-08-28 03:52:45 +00:00
Chandler Carruth	5260c0894c	[x86] Clean up some tests to use FileCheck and combine two into a single file. Changing code that is covered by these tests is just too hard to debug currently, and now it will be clear the nature of the changes. llvm-svn: 216643	2014-08-28 03:41:28 +00:00
David Majnemer	76d06bc613	InstSimplify: Move a transform from InstCombine to InstSimplify Several combines involving icmp (shl C2, %X) C1 can be simplified without introducing any new instructions. Move them to InstSimplify; while we are at it, make them more powerful. llvm-svn: 216642	2014-08-28 03:34:28 +00:00
Juergen Ributzka	31328168bb	[FastISel] Undo phi node updates when falling-back to SelectionDAG. The included test case would fail, because the MI PHI node would have two operands from the same predecessor. This problem occurs when a switch instruction couldn't be selected. This happens always, because there is no default switch support for FastISel to begin with. The problem was that FastISel would first add the operand to the PHI nodes and then fall-back to SelectionDAG, which would then in turn add the same operands to the PHI nodes again. This fix removes these duplicate PHI node operands by reseting the PHINodesToUpdate to its original state before FastISel tried to select the instruction. This fixes <rdar://problem/18155224>. llvm-svn: 216640	2014-08-28 02:06:55 +00:00
Juergen Ributzka	4f1a54a41a	[FastISel] Currently instructions are folded very aggressively for AArch64 into the memory operation, which can lead to the use of killed operands: %vreg1<def> = ADDXri %vreg0<kill>, 2 %vreg2<def> = LDRBBui %vreg0, 2 ... = ... %vreg1 ... This usually happens when the result is also used by another non-memory instruction in the same basic block, or any instruction in another basic block. This fix teaches hasTrivialKill to not only check the LLVM IR that the value has a single use, but also to check if the register that represents that value has already been used. This can happen when the instruction with the use was folded into another instruction (in this particular case a load instruction). This fixes rdar://problem/18142857. llvm-svn: 216634	2014-08-28 00:09:46 +00:00
Juergen Ributzka	843f14f411	Revert "[FastISel][AArch64] Don't fold instructions too aggressively into the memory operation." Quentin pointed out that this is not the correct approach and there is a better and easier solution. llvm-svn: 216632	2014-08-27 23:09:40 +00:00
Alexey Samsonov	a8d2f819ad	Fix unaligned reads/writes in X86JIT and RuntimeDyldELF. Summary: Introduce support::ulittleX_t::ref type to Support/Endian.h and use it in x86 JIT to enforce correct endianness and fix unaligned accesses. Test Plan: regression test suite Reviewers: lhames Subscribers: ributzka, llvm-commits Differential Revision: http://reviews.llvm.org/D5011 llvm-svn: 216631	2014-08-27 23:06:08 +00:00
Juergen Ributzka	ad8beabe38	[FastISel][AArch64] Don't fold instructions too aggressively into the memory operation. Currently instructions are folded very aggressively into the memory operation, which can lead to the use of killed operands: %vreg1<def> = ADDXri %vreg0<kill>, 2 %vreg2<def> = LDRBBui %vreg0, 2 ... = ... %vreg1 ... This usually happens when the result is also used by another non-memory instruction in the same basic block, or any instruction in another basic block. If the computed address is used by only memory operations in the same basic block, then it is safe to fold them. This is because all memory operations will fold the address computation and the original computation will never be emitted. This fixes rdar://problem/18142857. llvm-svn: 216629	2014-08-27 22:52:33 +00:00
Renato Golin	d352137553	Avoid zero length memset error Adding a check on buffer lenght to avoid a __warn_memset_zero_len warning on GCC 4.8.2. llvm-svn: 216624	2014-08-27 21:58:56 +00:00
Sanjay Patel	159f127f63	Use local variable in visitFADD. No functional change. llvm-svn: 216623	2014-08-27 21:42:42 +00:00
Juergen Ributzka	56b4b33190	[FastISel][AArch64] Fix a comment in my previous commit (r216617). llvm-svn: 216622	2014-08-27 21:40:50 +00:00
Juergen Ributzka	3c1b286152	[FastISel][AArch64] Fix simplify address when the address comes from a shift. When the address comes directly from a shift instruction then the address computation cannot be folded into the memory instruction, because the zero register is not available as a base register. Simplify addess needs to emit the shift instruction and use the result as base register. llvm-svn: 216621	2014-08-27 21:38:33 +00:00
Rafael Espindola	cdb871d734	Fix a double free in llvm::getBitcodeTargetTriple. Unfortunately this is only used by ld64, so no testcase, but should fix the darwin LTO bootstrap. llvm-svn: 216618	2014-08-27 21:11:13 +00:00
Juergen Ributzka	100a9b7fda	[FastISel][AArch64] Use the zero register for stores. Use the zero register directly when possible to avoid an unnecessary register copy and a wasted register at -O0. This also uses integer stores to store a positive floating-point zero. This saves us from materializing the positive zero in a register and then storing it. llvm-svn: 216617	2014-08-27 21:04:52 +00:00
Sanjay Patel	ae402a35a0	Group unsafe-math optimizations for fsub into one block. No functional change. llvm-svn: 216616	2014-08-27 20:57:52 +00:00
Juergen Ributzka	833bc681e3	[FastISel] Fix a potential bug in FastEmitInst_ri FastEmitInst_ri was constraining the first operand without checking if it is a virtual register. Use constrainOperandRegClass as all the other FastEmitInst_* functions. llvm-svn: 216613	2014-08-27 20:47:33 +00:00
Sanjay Patel	a828f2ba46	Use local variable to improve readability. No functional change intended. llvm-svn: 216611	2014-08-27 20:40:31 +00:00
Sanjay Patel	1d23bac843	typo in comment llvm-svn: 216609	2014-08-27 20:27:05 +00:00
Rafael Espindola	eeec8e63c0	Don't create a MemoryBuffer just to get the MemoryBufferRef. NFC. llvm-svn: 216608	2014-08-27 20:25:55 +00:00
David Blaikie	dfbe3d6b17	Convert a few more cases of direct intialization of unique_ptrs from MemoryBuffer::getMemBuffer to move initialization now that it returns by unique_ptr instead of raw pointer. Cleanup/improvements following r216583. llvm-svn: 216605	2014-08-27 20:14:18 +00:00
Reid Kleckner	7b7a599ac5	X86 MC: Handle instructions like fxsave that match multiple operand sizes Instructions like 'fxsave' and control flow instructions like 'jne' match any operand size. The loop I added to the Intel syntax matcher assumed that using a different size would give a different instruction. Now it handles the case where we get the same instruction for different memory operand sizes. This also allows us to remove the hack we had for unsized absolute memory operands, because we can successfully match things like 'jnz' without reporting ambiguity. Removing this hack uncovered test case involving 'fadd' that was ambiguous. The memory operand could have been single or double precision. llvm-svn: 216604	2014-08-27 20:10:38 +00:00
David Majnemer	22ccfc4484	InstCombine: Combine gep X, (Y-X) to Y We try to perform this transform in InstSimplify but we aren't always able to. Sometimes, we need to insert a bitcast if X and Y don't have the same time. llvm-svn: 216598	2014-08-27 20:08:37 +00:00
David Majnemer	11ca2971e8	InstSimplify: Don't simplify gep X, (Y-X) to Y if types differ It's incorrect to perform this simplification if the types differ. A bitcast would need to be inserted for this to work. This fixes PR20771. llvm-svn: 216597	2014-08-27 20:08:34 +00:00
Nico Weber	48c82400ed	Reland r216439 215441, majnemer has a real fix for PR20771. llvm-svn: 216586	2014-08-27 20:06:19 +00:00
Rafael Espindola	3560ff2c1f	Return a std::unique_ptr when creating a new MemoryBuffer. llvm-svn: 216583	2014-08-27 20:03:13 +00:00
Nico Weber	7b343e3cc6	Revert r216439 (and r216441, else the former doesn't revert cleanly). It caused PR 20771. I'll land a test on the clang side. llvm-svn: 216582	2014-08-27 20:00:13 +00:00
Rafael Espindola	9eef18c58c	Remove unused argument. llvm-svn: 216580	2014-08-27 19:49:03 +00:00
Alexey Samsonov	a253bf9678	Use BitVector instead of int in R600 SIISelLowering. int may not have enough bits in it, which was detected by UBSan bootstrap (it reported left shift by a too large constant). llvm-svn: 216579	2014-08-27 19:36:53 +00:00
Rafael Espindola	68669e3a7b	yaml::Stream doesn't need to take ownership of the buffer. In fact, most users were already using the StringRef version. llvm-svn: 216575	2014-08-27 19:03:22 +00:00
Zachary Turner	671355e099	Fix some semantic usability issues with DynamicLibrary. This patch allows invalid DynamicLibrary instances to be constructed, and fixes the const-correctness of the isValid() method. No functional change. llvm-svn: 216571	2014-08-27 18:13:25 +00:00
David Majnemer	d6d1671c1e	InstSimplify: Compute comparison ranges for left shift instructions 'shl nuw CI, x' produces [CI, CI << CLZ(CI)] 'shl nsw CI, x' produces [CI << CLO(CI)-1, CI] if CI is negative 'shl nsw CI, x' produces [CI, CI << CLZ(CI)-1] if CI is non-negative llvm-svn: 216570	2014-08-27 18:03:46 +00:00
Zachary Turner	74a46c24f3	Revert "Limit the symbol search in DynamicLibrary to the module that was opened." This reverts commit r216563, which breaks lli's dynamic symbol resolution. llvm-svn: 216569	2014-08-27 17:51:43 +00:00
Lang Hames	0717c3de02	[MCJIT] Replace a C-style cast in RuntimeDyldImpl.h. llvm-svn: 216568	2014-08-27 17:48:07 +00:00
Lang Hames	dc77feb57d	[MCJIT] More endianness fixes for RuntimeDyldMachO. http://llvm.org/PR20640 llvm-svn: 216567	2014-08-27 17:41:06 +00:00
Zachary Turner	0611d01419	Limit the symbol search in DynamicLibrary to the module that was opened. Differential Revision: http://reviews.llvm.org/D5030 Reviewed By: Reid Kleckner, Rafael Espindola llvm-svn: 216563	2014-08-27 17:06:22 +00:00
Oliver Stannard	89d1542840	Teach the AArch64 backend about v4f16 and v8f16 This teaches the AArch64 backend to deal with the operations required to deal with the operations on v4f16 and v8f16 which are exposed by NEON intrinsics, plus the add, sub, mul and div operations. llvm-svn: 216555	2014-08-27 16:16:04 +00:00
Michael Zolotukhin	5dc466b863	[SLP] Re-enable vectorization of GEP expressions (re-apply r210342 with a fix). llvm-svn: 216549	2014-08-27 15:01:18 +00:00
Evgeniy Stepanov	5050553ab8	Clang-format over X86AsmInstrumentation.* with LLVM style. r216536 mistakenly used -style=Google instead of LLVM. llvm-svn: 216543	2014-08-27 13:11:55 +00:00
Benjamin Kramer	870d951bda	Add an explicit cast to pacify implicit boolean conversion warnings. llvm-svn: 216539	2014-08-27 11:47:52 +00:00
Chandler Carruth	a5a8a9adc8	[x86] Fix a regression introduced with r213897 for 32-bit targets where we stopped efficiently lowering sextload using the SSE41 instructions for that operation. This is a consequence of a bad predicate I used thinking of the memory access needs. The code actually handles the cases where the predicate doesn't apply, and handles them much better. =] Simple fix and a test case added. Fixes PR20767. llvm-svn: 216538	2014-08-27 11:39:47 +00:00
Chandler Carruth	74ec9e19ee	[SDAG] Re-instate r215611 with a fix to a pesky X86 DAG combine. This combine is essentially combining target-specific nodes back into target independent nodes that it "knows" will be combined yet again by a target independent DAG combine into a different set of target-independent nodes that are legal (not custom though!) and thus "ok". This seems... deeply flawed. The crux of the problem is that we don't combine un-legalized shuffles that are introduced by legalizing other operations, and thus we don't see a very profitable combine opportunity. So the backend just forces the input to that combine to re-appear. However, for this to work, the conditions detected to re-form the unlegalized nodes must be exactly right. Previously, failing this would have caused poor code (if you're lucky) or a crasher when we failed to select instructions. After r215611 we would fall back into the legalizer. In some cases, this just "fixed" the crasher by produces bad code. But in the test case added it caused the legalizer and the dag combiner to iterate forever. The fix is to make the alignment checking in the x86 side of things match the alignment checking in the generic DAG combine exactly. This isn't really a satisfying or principled fix, but it at least make the code work as intended. It also highlights that it would be nice to detect the availability of under aligned loads for a given type rather than bailing on this optimization. I've left a FIXME to document this. Original commit message for r215611 which covers the rest of the chang: [SDAG] Fix a case where we would iteratively legalize a node during combining by replacing it with something else but not re-process the node afterward to remove it. In a truly remarkable stroke of bad luck, this would (in the test case attached) end up getting some other node combined into it without ever getting re-processed. By adding it back on to the worklist, in addition to deleting the dead nodes more quickly we also ensure that if it stops being dead for any reason it makes it back through the legalizer. Without this, the test case will end up failing during instruction selection due to an and node with a type we don't have an instruction pattern for. It took many million runs of the shuffle fuzz tester to find this. llvm-svn: 216537	2014-08-27 11:22:16 +00:00
Evgeniy Stepanov	4d04f66627	Clang-format over X86AsmInstrumentation.*. llvm-svn: 216536	2014-08-27 11:10:54 +00:00
Robert Khasanov	29e3b96734	[SKX] Added new versions of cmp instructions in avx512_icmp_cc multiclass, added VL multiclass. Added encoding tests llvm-svn: 216532	2014-08-27 09:34:37 +00:00
Elena Demikhovsky	ff620edd3c	AVX-512: Added intrinsic for VMOVSS store form with mask. llvm-svn: 216530	2014-08-27 07:38:43 +00:00
Craig Topper	e1d1294853	Simplify creation of a bunch of ArrayRefs by using None, makeArrayRef or just letting them be implicitly created. llvm-svn: 216525	2014-08-27 05:25:25 +00:00
Craig Topper	3af9722529	Fix some cases were ArrayRefs were being passed by reference. Also remove 'const' from some other ArrayRef uses since its implicitly const already. llvm-svn: 216524	2014-08-27 05:25:00 +00:00
David Majnemer	54e97d5dc0	InstCombine: Optimize GEP's involving ptrtoint better We supported transforming: (gep i8* X, -(ptrtoint Y)) to: (inttoptr (sub (ptrtoint X), (ptrtoint Y))) However, this only fired if 'X' had type i8*. Generalize this to support various types of different sizes. This results in much better CodeGen, especially for pointers to packed structs. llvm-svn: 216523	2014-08-27 05:16:04 +00:00
David Blaikie	c13bc97e58	Remove type unit skeletons. GDB no longer needs them & this saves a heap of space. llvm-svn: 216521	2014-08-27 05:04:14 +00:00
Juergen Ributzka	fb506a417d	[FastISel][AArch64] Fix address simplification. When a shift with extension or an add with shift and extension cannot be folded into the memory operation, then the address calculation has to be materialized separately. While doing so the code forgot to consider a possible sign-/zero- extension. This fix folds now also the sign-/zero-extension into the add or shift instruction which is used to materialize the address. This fixes rdar://problem/18141718. llvm-svn: 216511	2014-08-27 00:58:30 +00:00
Juergen Ributzka	99dd30f338	[FastISel][AArch64] Fold Sign-/Zero-Extend into the shift immediate instruction. llvm-svn: 216510	2014-08-27 00:58:26 +00:00
David Blaikie	b3833ef0c1	Fix a couple of debug info test cases to match the metadata schema change in r216239 Found these while testing something else. llvm-svn: 216505	2014-08-27 00:04:16 +00:00
Rafael Espindola	e2c1d77fb4	Pass a std::unique_ptr<MemoryBuffer>& to getLazyBitcodeModule. By taking a reference we can do the ownership transfer in one place instead of expecting every caller to do it. llvm-svn: 216492	2014-08-26 22:00:09 +00:00
Rafael Espindola	d96d553d76	Pass a MemoryBufferRef when we can avoid taking ownership. The attached patch simplifies a few interfaces that don't need to take ownership of a buffer. For example, both parseAssembly and parseBitcodeFile will parse the entire buffer before returning. There is no need to take ownership. Using a MemoryBufferRef makes it obvious in the type signature that there is no ownership transfer. llvm-svn: 216488	2014-08-26 21:49:01 +00:00
Rafael Espindola	7271c19420	Give ExecutionEngine of top level buffers. Long term the idea if for the engine to not own the buffers, but for now this is consistent with the rest of the API. llvm-svn: 216484	2014-08-26 21:04:04 +00:00
Reid Kleckner	f6fb780890	MC: Split the x86 asm matcher implementations by dialect The existing matcher has lots of AT&T assembly dialect assumptions baked into it. In particular, the hack for resolving the size of a memory operand by appending the four most common suffixes doesn't work at all. The Intel assembly dialect mnemonic table has ambiguous entries, so we need to try matching multiple times with different operand sizes, since that's the only way to choose different instruction variants. This makes us more compatible with gas's implementation of Intel assembly syntax. MSVC assumes you want byte-sized operations for the instructions that we reject as ambiguous. Reviewed By: grosbach Differential Revision: http://reviews.llvm.org/D4747 llvm-svn: 216481	2014-08-26 20:32:34 +00:00
Joerg Sonnenberger	cb5674b9c2	Revert r210342 and r210343, add test case for the crasher. PR 20642. llvm-svn: 216475	2014-08-26 19:06:41 +00:00
Joerg Sonnenberger	2981591f7f	Convert MC command line option for fatal assembler warnings into a proper flag. llvm-svn: 216471	2014-08-26 18:39:50 +00:00
Rafael Espindola	5c4f4a6c33	Invert the condition to have a single return. Thanks to David Blaikie for the suggestion. llvm-svn: 216468	2014-08-26 18:03:35 +00:00
Rafael Espindola	d233b06afc	Return a std::unique_ptr from the IRReader.h functions. NFC. llvm-svn: 216466	2014-08-26 17:29:46 +00:00
Rafael Espindola	28b351a56d	Return a std::unique_ptr from parseInputFile and propagate. NFC. The memory management in BugPoint is fairly convoluted, so this just unwraps one layer by changing the return type of functions that always return owned Modules. llvm-svn: 216464	2014-08-26 17:19:03 +00:00
Rafael Espindola	2ce3882eaf	Simplify LTOModule::makeLTOModule a bit. NFC. Just call parseBitcodeFile instead of getLazyBitcodeModule followed by materializeAllPermanently. llvm-svn: 216461	2014-08-26 15:09:32 +00:00
Rafael Espindola	016a6d5192	Merge TempDir and system_temp_directory. We had two functions for finding the temp or cache directory. Each had a different set of smarts about OS specific APIs. With this patch system_temp_directory becomes the only way to do it. llvm-svn: 216460	2014-08-26 14:47:52 +00:00
Benjamin Kramer	01e1789d38	Silence unused function warning in Release builds. llvm-svn: 216458	2014-08-26 14:22:05 +00:00
James Molloy	36b8a88188	Change the return value of "getEnd()" from a MachineInstr* to a MachineBasicBlock::iterator. It seems on Darwin the illegal round-trip ::iterator -> MachineInstr* -> ::iterator breaks execution horribly when the iterator is not a real MachineInstr, like ::end(). llvm-svn: 216455	2014-08-26 13:41:31 +00:00
Yi Kong	ebaa150e23	ARM: Add patterns for dbg llvm-svn: 216451	2014-08-26 12:47:26 +00:00
Dinesh Dwivedi	4919bbe29d	This patch enables SimplifyUsingDistributiveLaws() to handle following pattens. (X >> Z) & (Y >> Z) -> (X&Y) >> Z for all shifts. (X >> Z) \| (Y >> Z) -> (X\|Y) >> Z for all shifts. (X >> Z) ^ (Y >> Z) -> (X^Y) >> Z for all shifts. These patterns were previously handled separately in visitAnd()/visitOr()/visitXor(). Differential Revision: http://reviews.llvm.org/D4951 llvm-svn: 216443	2014-08-26 08:53:32 +00:00
Bill Wendling	24c6f5763a	Use 'xz' compression instead of 'gz'. llvm-svn: 216442	2014-08-26 08:11:22 +00:00
David Majnemer	788d0ab8c8	InstSimplify: Fold gep X, (sub 0, ptrtoint(X)) to null Save InstCombine some work if we can perform this fold during InstSimplify. llvm-svn: 216441	2014-08-26 07:08:03 +00:00
David Majnemer	bc4981323f	InstSimplify: Simplify trivial pointer expressions like b + (e - b) consider: long long f(long long b, long long e) { return b + (e - b); } we would lower this to something like: define i64 @f(i64* %b, i64* %e) { %1 = ptrtoint i64* %e to i64 %2 = ptrtoint i64* %b to i64 %3 = sub i64 %1, %2 %4 = ashr exact i64 %3, 3 %5 = getelementptr inbounds i64* %b, i64 %4 ret i64* %5 } This should fold away to just 'e'. N.B. This adds m_SpecificInt as a convenient way to match against a particular 64-bit integer when using LLVM's match interface. llvm-svn: 216439	2014-08-26 05:55:16 +00:00
Dylan Noblesmith	4af4d2c111	AArch64: use std::fill instead of memset Followup based on review. llvm-svn: 216436	2014-08-26 03:33:26 +00:00
Dylan Noblesmith	b06f77b608	Revert "AArch64: use std::vector for temp array" This reverts commit r216365. llvm-svn: 216433	2014-08-26 02:03:43 +00:00
Dylan Noblesmith	43f49cad78	Analysis: cleanup Address review comments. llvm-svn: 216432	2014-08-26 02:03:40 +00:00
Dylan Noblesmith	4ffafefdaa	Revert "Analysis: unique_ptr-ify DependenceAnalysis::collectCoeffInfo" This reverts commit r216358. llvm-svn: 216431	2014-08-26 02:03:38 +00:00
Dylan Noblesmith	c9e2a2709e	Revert "NVPTX: remove another raw delete call" This reverts commit r216364. llvm-svn: 216430	2014-08-26 02:03:35 +00:00
Dylan Noblesmith	4e69e29a72	Revert "Support/APFloat: unique_ptr-ify temp arrays" This reverts commit rr216359. llvm-svn: 216429	2014-08-26 02:03:33 +00:00
Dylan Noblesmith	42836d95e0	Revert "Support/Path: remove raw delete" This reverts commit r216360. llvm-svn: 216428	2014-08-26 02:03:30 +00:00
Dylan Noblesmith	4b535d1930	ExecutionEngine: address review comments llvm-svn: 216427	2014-08-26 02:03:28 +00:00
Dylan Noblesmith	17f05a3fc6	CodeGen/LiveVariables: use vector::assign() Address review comments. llvm-svn: 216426	2014-08-26 02:03:25 +00:00
Reid Kleckner	3715461b48	musttail: Don't eliminate varargs packs if there is a forwarding call Also clean up and beef up this grep test for the feature. llvm-svn: 216425	2014-08-26 00:59:51 +00:00
Sanjay Patel	4e31cdabd1	fix typos in comments llvm-svn: 216424	2014-08-26 00:59:15 +00:00
Reid Kleckner	8349864dbd	Declare that musttail calls in variadic functions forward the ellipsis Summary: There is no functionality change here except in the way we assemble and dump musttail calls in variadic functions. There's really no need to separate out the bits for musttail and "is forwarding varargs" on call instructions. A musttail call by definition has to forward the ellipsis or it would fail verification. Reviewers: chandlerc, nlewycky Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4892 llvm-svn: 216423	2014-08-26 00:33:28 +00:00
Reid Kleckner	d83c63b704	Fix Path unittests on Windows after raw_fd_ostream changes llvm-svn: 216422	2014-08-26 00:24:23 +00:00
Reid Kleckner	e6e88f99b3	ArgPromotion: Don't touch variadic functions Adding, removing, or changing non-pack parameters can change the ABI classification of pack parameters. Clang and other frontends encode the classification in the IR of the call site, but the callee side determines it dynamically based on the number of registers consumed so far. Changing the prototype affects the number of registers consumed would break such code. Dead argument elimination performs a similar task and already has a similar check to avoid this problem. Patch by Thomas Jablin! llvm-svn: 216421	2014-08-25 23:58:48 +00:00
Lang Hames	40e200eb69	[MCJIT][SystemZ] Use a simpler expression for indirect relocation offsets. The expressions 'Reloc.Addend - Addend' and 'Reloc.Offset' should always be equal in this context. The latter is prefered - we want to remove the RelocationValueRef::Addend field in the future. llvm-svn: 216418	2014-08-25 23:33:48 +00:00
Rafael Espindola	42036ae034	Fix bug in llvm::sys::argumentsFitWithinSystemLimits(). This patch fixes a subtle bug in the UNIX implementation of llvm::sys::argumentsFitWithinSystemLimits() regarding the misuse of a static variable. This bug causes our cached number that stores the system command line maximum length to be halved after each call to the function. With a sufficient number of calls to this function, it will eventually report any given command line string to be over system limits. Patch by Rafael Auler. llvm-svn: 216415	2014-08-25 22:53:21 +00:00
Lang Hames	f4b3b67f57	[MCJIT] Dump section memory both before and after relocations are applied. Also switch section memory dump format from 8 to 16 columns. llvm-svn: 216413	2014-08-25 22:19:14 +00:00
Rafael Espindola	f7c3a1d256	Refactor argument serialization logic when executing process. NFC. This patch refactors the argument serialization logic used in the Execute function, used to launch new Windows processes. There is a critical step that joins char** arguments into a single string, building the command line used to launch the new process, and the readability of this code is improved if this part is refactored in its own helper function. Patch by Rafael Auler! llvm-svn: 216411	2014-08-25 22:15:06 +00:00
Juergen Ributzka	1912e24898	[FastISel][AArch64] Refactor float zero materialization. NFCI. llvm-svn: 216403	2014-08-25 19:58:05 +00:00
Lang Hames	86b08f02c0	[MCJIT] Make RuntimeDyld dump section contents in -debug mode. llvm-svn: 216400	2014-08-25 18:37:38 +00:00
Rafael Espindola	3fd1e9933f	Modernize raw_fd_ostream's constructor a bit. Take a StringRef instead of a "const char *". Take a "std::error_code &" instead of a "std::string &" for error. A create static method would be even better, but this patch is already a bit too big. llvm-svn: 216393	2014-08-25 18:16:47 +00:00
Chandler Carruth	70f81a98ca	[x86] Fix a bug in r216319 where I was missing a 'break'. This actually was caught by existing tests but those tests were disabled with an XFAIL because of PR20736. While working on fixing that, I noticed the test failure, and tracked it down to this. We even have a really nice Clang warning that would have caught this but it isn't enabled in LLVM! =[ I may look at enabling it. llvm-svn: 216391	2014-08-25 18:06:11 +00:00
Bruno Cardoso Lopes	e2a1fa35df	Remove dangling initializers in GlobalDCE GlobalDCE deletes global vars and updates their initializers to nullptr while leaving underlying constants to be cleaned up later by its uses. The clean up may never happen, fix this by forcing it every time it's safe to destroy constants. Final patch by Rafael Espindola http://reviews.llvm.org/D4931 <rdar://problem/17523868> llvm-svn: 216390	2014-08-25 17:51:14 +00:00
Bruno Cardoso Lopes	356c4ac88b	Rise from the dead and update personal info llvm-svn: 216389	2014-08-25 17:51:04 +00:00
Chad Rosier	e62f365458	[AArch32] Add patterns for VCVT{A,N,P,M}. Patterns for lowering libm calls to VCVT{A,N,P,M} are also included. Phabricator Revision: http://reviews.llvm.org/D5033 llvm-svn: 216388	2014-08-25 16:56:33 +00:00
Robert Khasanov	2ea081d4d1	[SKX] avx512_icmp_packed multiclass extension Extended avx512_icmp_packed multiclass by masking versions. Added avx512_icmp_packed_rmb multiclass for embedded broadcast versions. Added corresponding _vl multiclasses. Added encoding tests for CPCMP{EQ\|GT}* instructions. Add more fields for X86VectorVTInfo. Added AVX512VLVectorVTInfo that include X86VectorVTInfo for 512/256/128-bit versions Differential Revision: http://reviews.llvm.org/D5024 llvm-svn: 216383	2014-08-25 14:49:34 +00:00
Stepan Dyatkovskiy	c90308bf83	MergeFunctions, tiny refactoring: cmpAPFloat has been renamed to cmpAPFloats (multiple form). llvm-svn: 216376	2014-08-25 08:22:46 +00:00
Stepan Dyatkovskiy	7f895c1184	MergeFunctions, tiny refactoring: cmpAPInt has been renamed to cmpAPInts (multiple form). llvm-svn: 216375	2014-08-25 08:19:50 +00:00
Stepan Dyatkovskiy	0b765dee6e	MergeFunctions, tiny refactoring: cmpType has been renamed to cmpTypes (multiple form). llvm-svn: 216374	2014-08-25 08:16:39 +00:00
Stepan Dyatkovskiy	016daddc52	MergeFunctions, tiny refactoring: cmpGEP has been renamed to cmpGEPs (multiple form). llvm-svn: 216373	2014-08-25 08:12:45 +00:00
Karthik Bhat	7f33ff7dea	Allow vectorization of division by uniform power of 2. This patch adds support to recognize division by uniform power of 2 and modifies the cost table to vectorize division by uniform power of 2 whenever possible. Updates Cost model for Loop and SLP Vectorizer.The cost table is currently only updated for X86 backend. Thanks to Hal, Andrea, Sanjay for the review. (http://reviews.llvm.org/D4971) llvm-svn: 216371	2014-08-25 04:56:54 +00:00
Dylan Noblesmith	6e69927d03	CodeGen/LiveVariables: hoist out code in nested loops This makes runOnMachineFunction vastly more readable. llvm-svn: 216368	2014-08-25 01:59:49 +00:00
Dylan Noblesmith	46a922c191	CodeGen/LiveVariables: switch to std::vector No functionality change. llvm-svn: 216367	2014-08-25 01:59:42 +00:00
Dylan Noblesmith	b899464f5b	AArch64: unique_ptr-ify map structures llvm-svn: 216366	2014-08-25 01:59:38 +00:00
Dylan Noblesmith	6076debd98	AArch64: use std::vector for temp array llvm-svn: 216365	2014-08-25 01:59:36 +00:00
Dylan Noblesmith	130589f804	NVPTX: remove another raw delete call llvm-svn: 216364	2014-08-25 01:59:32 +00:00
Dylan Noblesmith	802b6ce8de	NVPTX: remove raw delete call Also make members that are never accessed outside the class private. llvm-svn: 216363	2014-08-25 01:59:29 +00:00
Dylan Noblesmith	c4a9942a68	ExecutionEngine: unique_ptr-ify NFC. llvm-svn: 216362	2014-08-25 00:58:18 +00:00
Dylan Noblesmith	2b9b93e6f1	EE/JIT: unique_ptr-ify llvm-svn: 216361	2014-08-25 00:58:15 +00:00

1 2 3 4 5 ...

107271 Commits