llvm-project

Commit Graph

Author	SHA1	Message	Date
Chad Rosier	6db9ff64a8	[AArch64] Prefer Bcc to CBZ/CBNZ/TBZ/TBNZ when NZCV flags can be set for "free". This patch contains a pass that transforms CBZ/CBNZ/TBZ/TBNZ instructions into a conditional branch (Bcc), when the NZCV flags can be set for "free". This is preferred on targets that have more flexibility when scheduling Bcc instructions as compared to CBZ/CBNZ/TBZ/TBNZ (assuming all other variables are equal). This can reduce register pressure and is also the default behavior for GCC. A few examples: add w8, w0, w1 -> cmn w0, w1 ; CMN is an alias of ADDS. cbz w8, .LBB_2 -> b.eq .LBB0_2 ; single def/use of w8 removed. add w8, w0, w1 -> adds w8, w0, w1 ; w8 has multiple uses. cbz w8, .LBB1_2 -> b.eq .LBB1_2 sub w8, w0, w1 -> subs w8, w0, w1 ; w8 has multiple uses. tbz w8, #31, .LBB6_2 -> b.ge .LBB6_2 In looking at all current sub-target machine descriptions, this transformation appears to be either positive or neutral. Differential Revision: https://reviews.llvm.org/D34220. llvm-svn: 306144	2017-06-23 19:20:12 +00:00
whitequark	00ede4dcc1	[X86] Fix SP adjustment in stack probes emitted on 32-bit Windows. Commit r306010 adjusted the condition as follows: - if (Is64Bit) { + if (!STI.isTargetWin32()) { The intent was to preserve the behavior on all Windows platforms but extend the behavior on 64-bit Windows platforms to every other one. (Before r306010, emitStackProbeCall only ever executed when emitting code for Windows triples.) Unfortunately, if (Is64Bit && STI.isOSWindows()) is not the same as if (!STI.isTargetWin32()) because of the way isTargetWin32() is defined: bool isTargetWin32() const { return !In64BitMode && (isTargetCygMing() \|\| isTargetKnownWindowsMSVC()); } In practice this broke the JIT tests on 32-bit Windows, which did not satisfy the new condition: LLVM :: ExecutionEngine/MCJIT/2003-01-15-AlignmentTest.ll LLVM :: ExecutionEngine/MCJIT/2003-08-15-AllocaAssertion.ll LLVM :: ExecutionEngine/MCJIT/2003-08-23-RegisterAllocatePhysReg.ll LLVM :: ExecutionEngine/MCJIT/test-loadstore.ll LLVM :: ExecutionEngine/OrcMCJIT/2003-01-15-AlignmentTest.ll LLVM :: ExecutionEngine/OrcMCJIT/2003-08-15-AllocaAssertion.ll LLVM :: ExecutionEngine/OrcMCJIT/2003-08-23-RegisterAllocatePhysReg.ll LLVM :: ExecutionEngine/OrcMCJIT/test-loadstore.ll because %esp was not updated correctly. The failures are only visible on a MSVC 2017 Debug build, for which we do not have bots. llvm-svn: 306142	2017-06-23 18:58:10 +00:00
Zachary Turner	0b36c3ebd0	[llvm-pdbutil] Add a function for formatting MSF data. The goal here is to make it possible to display absolute file offsets when dumping byets from an MSF. The problem is that when dumping bytes from an MSF, often the bytes will cross a block boundary and encounter a discontinuity. We can't use the normal formatBinary() function for this because this would just treat the sequence as entirely ascending, and not account out-of-order blocks. This patch adds a formatMsfData() function to our printer, and then uses this function to improve the output of the -stream-data command line option for dumping bytes from a particular stream. Test coverage is also expanded to make sure to include all possible scenarios of offsets, sizes, and crossing block boundaries. llvm-svn: 306141	2017-06-23 18:52:13 +00:00
Krzysztof Parzyszek	c0a102f505	[Hexagon] Remove call to printAndVerify from HexagonPassConfig It causes an extra pass of the machine verifier to be added to the pass manager, and causes test/CodeGen/Generic/llc-start-stop.ll to fail. llvm-svn: 306140	2017-06-23 18:47:55 +00:00
Sanjay Patel	3de6bad65f	[x86] fix value types for SBB transform (PR33560) I'm not sure yet why this wouldn't fail in the simple case, but clearly I used the wrong value type with: https://reviews.llvm.org/rL306040 ...and the bug manifests with: https://bugs.llvm.org/show_bug.cgi?id=33560 llvm-svn: 306139	2017-06-23 18:42:15 +00:00
Simon Pilgrim	19cee0d56c	[X86][AVX] Regenerate i256 bitcasted store test Check on slow/fast unaligned memory targets llvm-svn: 306138	2017-06-23 18:34:56 +00:00
Simon Pilgrim	c26105e3c8	Fix Wdocumentation warning. llvm-svn: 306133	2017-06-23 18:03:04 +00:00
Simon Pilgrim	dfa436079f	Regenerate extract-store.ll tests llvm-svn: 306131	2017-06-23 17:19:44 +00:00
Peter Collingbourne	15ab1720c3	Fix a misleading indentation warning. llvm-svn: 306130	2017-06-23 17:17:47 +00:00
Peter Collingbourne	30aaa2f3f6	Make the size specification for cache_size_bytes case insensitive. llvm-svn: 306129	2017-06-23 17:13:51 +00:00
Peter Collingbourne	8d29223386	Add a ThinLTO cache policy for controlling the maximum cache size in bytes. This is useful when an upper limit on the cache size needs to be controlled independently of the amount of the amount of free space. One use case is a machine with a large number of cache directories (e.g. a buildbot slave hosting a large number of independent build jobs). By imposing an upper size limit on each cache directory, users can more easily estimate the server's capacity. Differential Revision: https://reviews.llvm.org/D34547 llvm-svn: 306126	2017-06-23 17:05:03 +00:00
Krzysztof Parzyszek	bb2fcd1921	[Hexagon] Handle decreasing of stack alignment in frame lowering llvm-svn: 306124	2017-06-23 16:53:59 +00:00
Zachary Turner	a1dcb85dde	Add a BinarySubstreamRef, and a method to read one. This is essentially just a BinaryStreamRef packaged with an offset and the logic for reading one is no different than the logic for reading a BinaryStreamRef, except that we save the current offset. llvm-svn: 306122	2017-06-23 16:38:40 +00:00
Simon Pilgrim	6e85e92b6c	Remove trailing whitespace. NFCI. llvm-svn: 306121	2017-06-23 16:35:32 +00:00
Tim Northover	4b4eec7009	GlobalISel: remove G_SEQUENCE instruction. It was trying to do too many things. The basic lumping together of values for legalization purposes is now handled by G_MERGE_VALUES. More complex things involving gaps and odd sizes are handled by G_INSERT sequences. llvm-svn: 306120	2017-06-23 16:15:55 +00:00
Tim Northover	b57bf2ac79	GlobalISel: convert buildSequence to use non-deprecated instructions. G_SEQUENCE is going away soon so as a first step the MachineIRBuilder needs to be taught how to emulate it with alternatives. We use G_MERGE_VALUES where possible, and a sequence of G_INSERTs if not. llvm-svn: 306119	2017-06-23 16:15:37 +00:00
Jun Bum Lim	506cfb7ab7	[InlineCost] Do not take INT_MAX when Cost is negative Summary: visitSwitchInst should not take INT_MAX when Cost is negative. Instead of INT_MAX , we also use a valid upperbound cost when overflow occurs in Cost. Reviewers: hans, echristo, dmgreen Reviewed By: dmgreen Subscribers: mcrosier, javed.absar, llvm-commits, eraman Differential Revision: https://reviews.llvm.org/D34436 llvm-svn: 306118	2017-06-23 16:12:37 +00:00
Ulrich Weigand	eaf0051ba3	[SystemZ] Remove unnecessary serialization before volatile loads This reverts the use of TargetLowering::prepareVolatileOrAtomicLoad introduced by r196905. Nothing in the semantics of the "volatile" keyword or the definition of the z/Architecture actually requires that volatile loads are preceded by a serialization operation, and no other compiler on the platform actually implements this. Since we've now seen a use case where this additional serialization causes noticable performance degradation, this patch removes it. The patch still leaves in the serialization before atomic loads, which is now implemented directly in lowerATOMIC_LOAD. (This also seems overkill, but that can be addressed separately.) llvm-svn: 306117	2017-06-23 15:56:14 +00:00
Sanjay Patel	021f32fd0f	[x86] auto-generate complete checks; NFC llvm-svn: 306114	2017-06-23 15:29:49 +00:00
Sanjay Patel	02469b63c2	[x86] auto-generate complete checks; NFC llvm-svn: 306113	2017-06-23 15:22:27 +00:00
Tom Stellard	af552dc352	AMDGPU/GlobalISel: Mark 32-bit G_AND as legal Reviewers: arsenm Reviewed By: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, igorb, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D34349 llvm-svn: 306112	2017-06-23 15:17:17 +00:00
Sanjay Patel	563e5afa0e	[x86] remove overridden target settings in test; NFC r306109 was supposed to make this change, but I committed the wrong version. llvm-svn: 306110	2017-06-23 15:06:30 +00:00
Sanjay Patel	8e06df4303	[x86] rename test file and auto-generate complete checks; NFC The command-line params override the target setting in the file itself, so delete that. Also, remove the cpu and arch because those don't matter and neither does the OS specification in the triple. llvm-svn: 306109	2017-06-23 14:58:21 +00:00
Simon Pilgrim	859b48d2d3	[X86][AVX] Extended vector average tests Added AVX1 tests and merged AVX1/AVX2/AVX512 checks where possible llvm-svn: 306107	2017-06-23 14:38:00 +00:00
Jonas Paulsson	82f15a7168	[SystemZ] Fix trap issue and enable expensive checks. The isBarrier/isTerminator flags have been removed from the SystemZ trap instructions, so that tests do not fail with EXPENSIVE_CHECKS. This was just an issue at -O0 and did not affect code output on benchmarks. (Like Eli pointed out: "targets are split over whether they consider their "trap" a terminator; x86, AArch64, and NVPTX don't, but ARM, MIPS, PPC, and SystemZ do. We should probably try to be consistent here.". This is still the case, although SystemZ has switched sides). SystemZ now returns true in isMachineVerifierClean() :-) These Generic tests have been modified so that they can be run with or without EXPENSIVE_CHECKS: CodeGen/Generic/llc-start-stop.ll and CodeGen/Generic/print-machineinstrs.ll Review: Ulrich Weigand, Simon Pilgrim, Eli Friedman https://bugs.llvm.org/show_bug.cgi?id=33047 https://reviews.llvm.org/D34143 llvm-svn: 306106	2017-06-23 14:30:46 +00:00
Anna Thomas	91eed9ac1a	[RuntimeLoopUnrolling] Rename exit block and move assert earlier. NFC The single exit block allowed in runtime unrolling is guaranteed to be the Latch's successor, so rename it as LatchExitBlock. llvm-svn: 306105	2017-06-23 14:28:01 +00:00
Simon Pilgrim	dbd20ffee1	[X86][SSE] Dropped -mcpu from vector average tests Use triple and attribute only for consistency llvm-svn: 306104	2017-06-23 14:16:50 +00:00
Ekaterina Vaartis	4c5192f375	[docs] As of binutils 2.21.51.0.2, ld.bfd supports plugins too, represent this in docs PR#32760 llvm-svn: 306102	2017-06-23 13:54:10 +00:00
Simon Pilgrim	171ba4a699	Fix double->float truncation warning on MSVC llvm-svn: 306101	2017-06-23 13:53:55 +00:00
Anna Thomas	d67165c93c	[InstCombine] Recognize and simplify three way comparison idioms Summary: Many languages have a three way comparison idiom where comparing two values produces not a boolean, but a tri-state value. Typical values (e.g. as used in the lcmp/fcmp bytecodes from Java) are -1 for less than, 0 for equality, and +1 for greater than. We actually do a great job already of converting three way comparisons into binary comparisons when the result produced has one a single use. Unfortunately, such values can have more than one use, and in that case, our existing optimizations break down. The patch adds a peephole which converts a three-way compare + test idiom into a binary comparison on the original inputs. It focused on replacing the test on the result of the three way compare and does nothing about removing the three way compare itself. That's left to other optimizations (which do actually kick in commonly.) We currently recognize one idiom on signed integer compare. In the future, we plan to recognize and simplify other comparison idioms on other signed/unsigned datatypes such as floats, vectors etc. This is a resurrection of Philip Reames' original patch: https://reviews.llvm.org/D19452 Reviewers: majnemer, apilipenko, reames, sanjoy, mkazantsev Reviewed by: mkazantsev Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34278 llvm-svn: 306100	2017-06-23 13:41:45 +00:00
Petar Jovanovic	78811c2c07	Revert r306095: [mips] Fix reg positions in the aui/daui instructions ELF/mips-plt-r6.s in lld-test is failing. Reverting the change. Original commit message: [mips] Fix register positions in the aui/daui instructions Swapped the position of the rt and rs register in the aut/daui instructions for mips32r6 and mips64r6. With this change, the format of the generated instructions complies with specifications and GCC. Patch by Milos Stojanovic. llvm-svn: 306099	2017-06-23 13:33:46 +00:00
Pavel Labath	b0871add86	Fix build breakage caused by r306096 It seems some targets don't have std::strtof and friends. Hopefully, dropping the std:: will be fine, as that's what the compiler recommends. llvm-svn: 306098	2017-06-23 13:13:06 +00:00
Simon Pilgrim	dbf8f5ace7	[X86][SSE] Dropped -mcpu from scalar math tests Use triple and attribute only for consistency llvm-svn: 306097	2017-06-23 13:07:20 +00:00
Pavel Labath	ec000f42fa	[ADT] Add llvm::to_float Summary: The function matches the interface of llvm::to_integer, but as we are calling out to a C library function, I let it take a Twine argument, so we can avoid a string copy at least in some cases. I add a test and replace a couple of existing uses of strtod with this function. Reviewers: zturner Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34518 llvm-svn: 306096	2017-06-23 12:55:02 +00:00
Petar Jovanovic	d5f7711ebb	[mips] Fix register positions in the aui/daui instructions Swapped the position of the rt and rs register in the aut/daui instructions for mips32r6 and mips64r6. With this change, the format of the generated instructions complies with specifications and GCC. Patch by Milos Stojanovic. Differential Revision: https://reviews.llvm.org/D33988 llvm-svn: 306095	2017-06-23 12:47:18 +00:00
Simon Pilgrim	5d3d716815	[X86][SSE] Dropped -mcpu from insertps tests Use triple and attribute only for consistency llvm-svn: 306092	2017-06-23 11:00:49 +00:00
Stefan Maksimovic	b794c0a5ca	[mips][msa] Splat.d endianness check Before this change, it was always the first element of a vector that got splatted since the lower 6 bits of vshf.d $wd were always zero for little endian. Additionally, masking has been performed for vshf via which splat.d is created. Vshf has a property where if its first operand's elements have either bit 6 or 7 set, destination element is set to zero. Initially masked with 63 to avoid this property, which would result in generation of and.v + vshf.d in all cases. Masking with one results in generating a single splati.d instruction when possible. Differential Revision: https://reviews.llvm.org/D32216 llvm-svn: 306090	2017-06-23 09:09:31 +00:00
Craig Topper	2c20c42cb6	[JumpThreading] Teach jump threading how to analyze (and (cmp A, C1), (cmp A, C2)) after InstCombine has turned it into (cmp (add A, C3), C4) Currently JumpThreading can use LazyValueInfo to analyze an 'and' or 'or' of compare if the compare is fed by a livein of a basic block. This can be used to to prove the condition can't be met for some predecessor and the jump from that predecessor can be moved to the false path of the condition. But if the compare is something that InstCombine turns into an add and a single compare, it can't be analyzed because the livein is now an input to the add and not the compare. This patch adds a new method to LVI to get a ConstantRange on an edge. Then we teach jump threading to detect the add livein feeding a compare and to get the ConstantRange and propagate it. Differential Revision: https://reviews.llvm.org/D33262 llvm-svn: 306085	2017-06-23 05:41:35 +00:00
Craig Topper	7927996140	[JumpThreading] Use some temporary variables to reduce the number of times we call the same methods. NFC A future patch will add even more uses of these variables. llvm-svn: 306084	2017-06-23 05:41:32 +00:00
Rafael Espindola	58173b9720	COFF: Produce an error on invalid pcrel relocs. X86_64 COFF only has support for 32 bit pcrel relocations. Produce an error on all others. Note that gnu as has extended the relocation values to support this. It is not clear if we should support the gnu extension. llvm-svn: 306082	2017-06-23 04:07:44 +00:00
Chandler Carruth	4ab0f4910a	[LoopSimplify] Factor the logic to form dedicated exits into a utility. I want to use the same logic as LoopSimplify to form dedicated exits in another pass (SimpleLoopUnswitch) so I wanted to factor it out here. I also noticed that there is a pretty significantly more efficient way to implement this than the way the code in LoopSimplify worked. We don't need to actually retain the set of unique exit blocks, we can just rewrite them as we find them and use only a set to deduplicate. This did require changing one part of LoopSimplify to not re-use the unique set of exits, but it only used it to check that there was a single unique exit. That part of the code is about to walk the exiting blocks anyways, so it seemed better to rewrite it to use those exiting blocks to compute this property on-demand. I also had to ditch a statistic, but it doesn't seem terribly valuable. Differential Revision: https://reviews.llvm.org/D34049 llvm-svn: 306081	2017-06-23 04:03:04 +00:00
Rafael Espindola	13811b0605	Make the test a bit more strict. NFC. llvm-svn: 306080	2017-06-23 03:48:01 +00:00
Rafael Espindola	34e94a8783	COFF: handle "undef - ." expressions. This is another thing that the ELF implementation can do but is missing from COFF. llvm-svn: 306078	2017-06-23 02:15:56 +00:00
Craig Topper	b60f866a8b	[LVI] Teach LVI to reason about ORs of icmps similar to how it reasons about ANDs of icmps Summary: LVI can reason about an AND of icmps on the true dest of a branch. I believe we can do similar for the false dest of ORs. This allows us to get the same answer for the demorganed versions of some of the AND test cases as you can see. Reviewers: anna, reames Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34431 llvm-svn: 306076	2017-06-23 01:08:16 +00:00
Farhana Aleen	9bd593e0d7	Fixed a (product) build error that was due to an unused variable Details: There was a use but it was in the assert which was not exercised during product build. Reviewers: Andrew Kaylor Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32658 llvm-svn: 306073	2017-06-22 23:56:31 +00:00
Sanjay Patel	359ae44fb4	[x86] add/sub (X==0) --> sbb(cmp X, 1) This is very similar to the transform in: https://reviews.llvm.org/rL306040 ...but in this case, we use cmp X, 1 to set the carry bit as needed. Again, we can show that all of these are logically equivalent (although InstCombine currently canonicalizes to a form not seen here), and if we believe IACA, then this is the smallest/fastest code. Eg, with SNB: \| Num Of \| Ports pressure in cycles \| \| \| Uops \| 0 - DV \| 1 \| 2 - D \| 3 - D \| 4 \| 5 \| \| --------------------------------------------------------------------- \| 1 \| 1.0 \| \| \| \| \| \| \| cmp edi, 0x1 \| 2 \| \| 1.0 \| \| \| \| 1.0 \| CP \| sbb eax, eax The larger motivation is to clean up all select-of-constants combining/lowering because we're missing some common cases. llvm-svn: 306072	2017-06-22 23:47:15 +00:00
Andrew Kaylor	d49711996f	Restrict the definition of loop preheader to avoid EH blocks Differential Revision: https://reviews.llvm.org/D34487 llvm-svn: 306070	2017-06-22 23:27:16 +00:00
whitequark	08b20356c3	Define behavior of "stack-probe-size" attribute when inlining. Also document the attribute, since "probe-stack" already is. Reviewed By: majnemer Differential Revision: https://reviews.llvm.org/D34528 llvm-svn: 306069	2017-06-22 23:22:36 +00:00
Farhana Aleen	4b652a5335	Supported lowerInterleavedStore() in X86InterleavedAccess. Reviewers: RKSimon, DavidKreitzer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32658 llvm-svn: 306068	2017-06-22 22:59:04 +00:00
Eric Christopher	5a7c2f1700	Remove the LoadCombine pass. It was never enabled and is unsupported. Based on discussions with the author on mailing lists. llvm-svn: 306067	2017-06-22 22:58:12 +00:00

1 2 3 4 5 ...

150613 Commits