llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	2c20c42cb6	[JumpThreading] Teach jump threading how to analyze (and (cmp A, C1), (cmp A, C2)) after InstCombine has turned it into (cmp (add A, C3), C4) Currently JumpThreading can use LazyValueInfo to analyze an 'and' or 'or' of compare if the compare is fed by a livein of a basic block. This can be used to prove the condition can't be met for some predecessor and the jump from that predecessor can be moved to the false path of the condition. But if the compare is something that InstCombine turns into an add and a single compare, it can't be analyzed because the livein is now an input to the add and not the compare. This patch adds a new method to LVI to get a ConstantRange on an edge. Then we teach jump threading to detect the add livein feeding a compare and to get the ConstantRange and propagate it. Differential Revision: https://reviews.llvm.org/D33262 llvm-svn: 306085	2017-06-23 05:41:35 +00:00
Craig Topper	7927996140	[JumpThreading] Use some temporary variables to reduce the number of times we call the same methods. NFC A future patch will add even more uses of these variables. llvm-svn: 306084	2017-06-23 05:41:32 +00:00
Rafael Espindola	58173b9720	COFF: Produce an error on invalid pcrel relocs. X86_64 COFF only has support for 32 bit pcrel relocations. Produce an error on all others. Note that gnu as has extended the relocation values to support this. It is not clear if we should support the gnu extension. llvm-svn: 306082	2017-06-23 04:07:44 +00:00
Chandler Carruth	4ab0f4910a	[LoopSimplify] Factor the logic to form dedicated exits into a utility. I want to use the same logic as LoopSimplify to form dedicated exits in another pass (SimpleLoopUnswitch) so I wanted to factor it out here. I also noticed that there is a pretty significantly more efficient way to implement this than the way the code in LoopSimplify worked. We don't need to actually retain the set of unique exit blocks, we can just rewrite them as we find them and use only a set to deduplicate. This did require changing one part of LoopSimplify to not re-use the unique set of exits, but it only used it to check that there was a single unique exit. That part of the code is about to walk the exiting blocks anyways, so it seemed better to rewrite it to use those exiting blocks to compute this property on-demand. I also had to ditch a statistic, but it doesn't seem terribly valuable. Differential Revision: https://reviews.llvm.org/D34049 llvm-svn: 306081	2017-06-23 04:03:04 +00:00
Rafael Espindola	34e94a8783	COFF: handle "undef - ." expressions. This is another thing that the ELF implementation can do but is missing from COFF. llvm-svn: 306078	2017-06-23 02:15:56 +00:00
Craig Topper	b60f866a8b	[LVI] Teach LVI to reason about ORs of icmps similar to how it reasons about ANDs of icmps Summary: LVI can reason about an AND of icmps on the true dest of a branch. I believe we can do similar for the false dest of ORs. This allows us to get the same answer for the demorganed versions of some of the AND test cases as you can see. Reviewers: anna, reames Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34431 llvm-svn: 306076	2017-06-23 01:08:16 +00:00
Farhana Aleen	9bd593e0d7	Fixed a (product) build error that was due to an unused variable Details: There was a use but it was in the assert which was not exercised during product build. Reviewers: Andrew Kaylor Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32658 llvm-svn: 306073	2017-06-22 23:56:31 +00:00
Sanjay Patel	359ae44fb4	[x86] add/sub (X==0) --> sbb(cmp X, 1) This is very similar to the transform in: https://reviews.llvm.org/rL306040 ...but in this case, we use cmp X, 1 to set the carry bit as needed. Again, we can show that all of these are logically equivalent (although InstCombine currently canonicalizes to a form not seen here), and if we believe IACA, then this is the smallest/fastest code. Eg, with SNB: \| Num Of \| Ports pressure in cycles \| \| \| Uops \| 0 - DV \| 1 \| 2 - D \| 3 - D \| 4 \| 5 \| \| --------------------------------------------------------------------- \| 1 \| 1.0 \| \| \| \| \| \| \| cmp edi, 0x1 \| 2 \| \| 1.0 \| \| \| \| 1.0 \| CP \| sbb eax, eax The larger motivation is to clean up all select-of-constants combining/lowering because we're missing some common cases. llvm-svn: 306072	2017-06-22 23:47:15 +00:00
Andrew Kaylor	d49711996f	Restrict the definition of loop preheader to avoid EH blocks Differential Revision: https://reviews.llvm.org/D34487 llvm-svn: 306070	2017-06-22 23:27:16 +00:00
whitequark	08b20356c3	Define behavior of "stack-probe-size" attribute when inlining. Also document the attribute, since "probe-stack" already is. Reviewed By: majnemer Differential Revision: https://reviews.llvm.org/D34528 llvm-svn: 306069	2017-06-22 23:22:36 +00:00
Farhana Aleen	4b652a5335	Supported lowerInterleavedStore() in X86InterleavedAccess. Reviewers: RKSimon, DavidKreitzer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32658 llvm-svn: 306068	2017-06-22 22:59:04 +00:00
Eric Christopher	5a7c2f1700	Remove the LoadCombine pass. It was never enabled and is unsupported. Based on discussions with the author on mailing lists. llvm-svn: 306067	2017-06-22 22:58:12 +00:00
Rafael Espindola	d2edd137df	Change creation of relative relocations on COFF. For whatever reason, when processing .globl foo foo: .data bar: .long foo-bar llvm-mc creates a relocation with the section: 0x0 IMAGE_REL_I386_REL32 .text This is different than when the relocation is relative from the beginning. For example, a file with call foo produces 0x0 IMAGE_REL_I386_REL32 foo I would like to refactor the logic for converting "foo - ." into a relative relocation so that it is shared with ELF. This is the first step and just changes the coff implementation to match what ELF (and COFF in the case of calls) does. llvm-svn: 306063	2017-06-22 21:57:04 +00:00
Jacob Gravelle	a31ec61c46	[WebAssembly] WebAssemblyFastISel getelementptr variable index support Summary: Previously -fast-isel getelementptr would constant-fold non-constant i8 load/stores. Reviewers: sunfish Subscribers: jfb, dschuff, sbc100, llvm-commits Differential Revision: https://reviews.llvm.org/D34044 llvm-svn: 306060	2017-06-22 21:26:08 +00:00
Krzysztof Parzyszek	9b7c1d2dcf	[Hexagon] Properly update kill flags in HexagonNewValueJump The feeder instruction will be moved to right before the compare, so the updating code should not be looking for kills past the compare. llvm-svn: 306059	2017-06-22 21:11:44 +00:00
Lang Hames	266202236f	[ORC] Switch the object layer API from addObjectSet to addObject (singular), and move the ObjectCache from the IRCompileLayer to SimpleCompiler. This is the first in a series of patches aimed at cleaning up and improving the robustness and performance of the ORC APIs. llvm-svn: 306058	2017-06-22 21:06:54 +00:00
Reid Kleckner	40a47a8702	[MC] Allow assembling .secidx and .secrel32 for undefined symbols There's nothing incorrect about emitting such relocations against symbols defined in other objects. The code in EmitCOFFSec* was missing the visitUsedExpr part of MCStreamer::EmitValueImpl, so these symbols were not being registered with the object file assembler. This will be used to make reduced test cases for LLD. llvm-svn: 306057	2017-06-22 21:02:14 +00:00
Krzysztof Parzyszek	1a0da8d5a3	[Hexagon] Use LivePhysRegs to fix up kills in HexagonGenMux Remove the previous, manual shuffling of the kill flags. llvm-svn: 306054	2017-06-22 20:43:02 +00:00
Rafael Espindola	656669bae4	Simplify WinCOFFObjectWriter::recordRelocation. It looks like when this code was written, recordRelocation could be called with A-B where A and B are in the same section. The expression evaluation logic these days makes sure those are folded, so some of this code was dead. llvm-svn: 306053	2017-06-22 20:27:33 +00:00
Anna Thomas	72c90c87f8	[LoopDeletion] Update exits correctly when multiple duplicate edges from an exiting block Summary: Currently, we incorrectly update exit blocks of loops when there are multiple edges from a single exiting block to the exit block. This can happen when we have switches as the terminator of the exiting blocks. The fix here is to correctly update the phi nodes in the exit block, and remove all incoming values except for one which is from the preheader. Note: Currently, this error can manifest only while deleting non-executed loops. However, it is possible to trigger this error in invariant loops, once we enhance the logic around the exit conditions for the loop check. Reviewers: chandlerc, dberlin, sanjoy, efriedma Reviewed by: efriedma Subscribers: mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D34516 llvm-svn: 306048	2017-06-22 20:20:56 +00:00
Craig Topper	792fc92be2	[AVX-512] Remove and autoupgrade the masked integer compare intrinsics Summary: These intrinsics aren't used by clang and haven't been for a while. There's some really terrible codegen in the 32-bit target for avx512bw due to i64 not being legal. But as I said these intrinsics aren't used by clang even before this patch so this codegen reflects our clang behavior today. Reviewers: spatel, RKSimon, zvi, igorb Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34389 llvm-svn: 306047	2017-06-22 20:11:01 +00:00
Ekaterina Vaartis	c4a6322153	[MC] Fix const qualifier warning llvm-svn: 306045	2017-06-22 19:08:30 +00:00
Craig Topper	d3711ee93e	[BasicAA] Add type check and Value equality check around code added in r305481. This matches the checks done at the beginning of isKnownNonEqual that this code is partially emulating. Without this we can get assertion failures due to the bit widths of the KnownBits not matching. llvm-svn: 306044	2017-06-22 19:04:14 +00:00
Adrian McCarthy	4aedc81b8c	Fix build break by using llvm::make_unique instead of std::make_unique. llvm-svn: 306043	2017-06-22 18:57:51 +00:00
Adrian McCarthy	31bcb6f680	Add IDs and clone methods to NativeRawSymbol All NativeRawSymbols will have a unique symbol ID (retrievable via getSymIndexId). For now, these are initialized to 0, but soon the NativeSession will be responsible for creating the raw symbols, and it will assign unique IDs. The symbol cache in the NativeSession will also require the ability to clone raw symbols, so I've provided implementations for that as well. llvm-svn: 306042	2017-06-22 18:43:18 +00:00
Adrian McCarthy	6a4b080a5f	Make IPDBSession::getGlobalScope a non-const method There doesn't seem to be a compelling reason why this method should be const other than it was possible with the DIA implementation. The native session is going to act as a symbol factory and cache. This could be achieved with mutable (and the existing const_cast), but it seems cleaner to accept that this method affects the state of the session. This change eliminates an existing const_cast. llvm-svn: 306041	2017-06-22 18:42:23 +00:00
Sanjay Patel	41a34e4111	[x86] add/sub (X==0) --> sbb(neg X) Our handling of select-of-constants is lumpy in IR (https://reviews.llvm.org/D24480), lumpy in DAGCombiner, and lumpy in X86ISelLowering. That's why we only had the 'sbb' codegen in 1 out of the 4 tests. This is a step towards smoothing that out. First, show that all of these IR forms are equivalent: http://rise4fun.com/Alive/mx Second, show that the 'sbb' version is faster/smaller. IACA output for SandyBridge (later Intel and AMD chips are similar based on Agner's tables): This is the "obvious" x86 codegen (what gcc appears to produce currently): \| Num Of \| Ports pressure in cycles \| \| \| Uops \| 0 - DV \| 1 \| 2 - D \| 3 - D \| 4 \| 5 \| \| --------------------------------------------------------------------- \| 1* \| \| \| \| \| \| \| \| xor eax, eax \| 1 \| 1.0 \| \| \| \| \| \| CP \| test edi, edi \| 1 \| \| \| \| \| \| 1.0 \| CP \| setnz al \| 1 \| \| 1.0 \| \| \| \| \| CP \| neg eax This is the adc version: \| 1* \| \| \| \| \| \| \| \| xor eax, eax \| 1 \| 1.0 \| \| \| \| \| \| CP \| cmp edi, 0x1 \| 2 \| \| 1.0 \| \| \| \| 1.0 \| CP \| adc eax, 0xffffffff And this is sbb: \| 1 \| 1.0 \| \| \| \| \| \| \| neg edi \| 2 \| \| 1.0 \| \| \| \| 1.0 \| CP \| sbb eax, eax If IACA is trustworthy, then sbb became a single uop in Broadwell, so this will be clearly better than the alternatives going forward. llvm-svn: 306040	2017-06-22 18:11:19 +00:00
Sam Clegg	58ad080ef0	MC: Fix dumping of MCFragment values Without this cast the "char" overload of operator<< is chosen and the value is output as an ASCII character rather than an integer. Differential Revision: https://reviews.llvm.org/D34486 llvm-svn: 306039	2017-06-22 17:57:01 +00:00
Kevin Enderby	abf10f2d2e	Updated llvm-objdump symbolic disassembly with x86_64 Mach-O MH_KEXT_BUNDLE file types so it symbolically disassembles operands using the external relocation entries. rdar://31521343 llvm-svn: 306037	2017-06-22 17:41:22 +00:00
Rafael Espindola	8a261c2565	Add a common error checking for some invalid expressions. This refactors a bit of duplicated code and fixes an assertion failure on ELF. llvm-svn: 306035	2017-06-22 17:25:35 +00:00
David Stuttard	f677966e2e	[AMDGPU] Add intrinsics for tbuffer load and store - build error fix Variable was unused in non-debug build (used in assert) causing compile time warning and eventual build failure llvm-svn: 306034	2017-06-22 17:15:49 +00:00
David Stuttard	70e8bc1bf3	[AMDGPU] Add intrinsics for tbuffer load and store Intrinsic already existed for llvm.SI.tbuffer.store Needed tbuffer.load and also re-implementing the intrinsic as llvm.amdgcn.tbuffer.* Added CodeGen tests for the 2 new variants added. Left the original llvm.SI.tbuffer.store implementation to avoid issues with existing code Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye, tpr Differential Revision: https://reviews.llvm.org/D30687 llvm-svn: 306031	2017-06-22 16:29:22 +00:00
Craig Topper	dffbbcb3fd	[InstCombine] Teach foldSelectICmpAndOr to recognize (select (icmp slt (trunc (X)), 0), Y, (or Y, C2)) Summary: InstCombine likes to turn (icmp eq (and X, C1), 0) into (icmp slt (trunc (X)), 0) sometimes. This breaks foldSelectICmpAndOr's ability to recognize (select (icmp eq (and X, C1), 0), Y, (or Y, C2))->(or (shl (and X, C1), C3), y). This patch tries to recover this. I had to flip around some of the early out checks so that I could create a new And instruction during the compare processing without it possibly never getting used. Reviewers: spatel, majnemer, davide Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34184 llvm-svn: 306029	2017-06-22 16:23:30 +00:00
Teresa Johnson	a690e3cea2	[ThinLTO] Remove unnecessary include of Linker.h (NFC) The ModuleLinker is no longer used by ThinLTO, so this is not needed. Patch by Benoit Belley <Benoit.Belley@autodesk.com> llvm-svn: 306028	2017-06-22 16:18:48 +00:00
Craig Topper	0de5e6a729	[InstCombine] Add one use checks to or/and->xnor folding If the components of the and/or had multiple uses, this transform created an additional instruction. This patch makes sure we remove one of the components. Differential Revision: https://reviews.llvm.org/D34498 llvm-svn: 306027	2017-06-22 16:12:02 +00:00
Krzysztof Parzyszek	f63ad39e7d	[Hexagon] Handle a global operand to A2_addi when creating duplexes llvm-svn: 306012	2017-06-22 15:53:31 +00:00
Sanjay Patel	d1e811979c	[InstCombine] reverse bitcast + bitwise-logic canonicalization (PR33138) There are 2 parts to this patch made simultaneously to avoid a regression. We're reversing the canonicalization that moves bitwise vector ops before bitcasts. We're moving bitwise vector ops after bitcasts instead. That's the 1st and 3rd hunks of the patch. The motivation is that there's only one fold that currently depends on the existing canonicalization (see next), but there are many folds that would automatically benefit from the new canonicalization. PR33138 ( https://bugs.llvm.org/show_bug.cgi?id=33138 ) shows why/how we have these patterns in IR. There's an or(and,andn) pattern that requires an adjustment in order to continue matching to 'select' because the bitcast changes position. This match is unfortunately complicated because it requires 4 logic ops with optional bitcast and sext ops. Test diffs: 1. The bitcast.ll and bitcast-bigendian.ll changes show the most basic difference - bitcast comes before logic. 2. There are also tests with no diffs in bitcast.ll that verify that we're still doing folds that were enabled by the previous canonicalization. 3. icmp-xor-signbit.ll shows the payoff. We don't need to adjust existing icmp patterns to look through bitcasts. 4. logical-select.ll contains several tests for the or(and,andn) --> select fold to verify that we are still handling those cases. The lone diff shows the movement of the bitcast from the new canonicalization rule. Differential Revision: https://reviews.llvm.org/D33517 llvm-svn: 306011	2017-06-22 15:46:54 +00:00
whitequark	cebe8241ca	[X86] Add support for "probe-stack" attribute This commit adds prologue code emission for stack probe function calls. Reviewed By: majnemer Differential Revision: https://reviews.llvm.org/D34387 llvm-svn: 306010	2017-06-22 15:42:53 +00:00
Florian Hahn	5991b5be74	[ARM] Create relocations for beq.w branches to ARM function syms. Summary: The ARM ELF ABI requires the linker to do interworking for wide conditional branches from Thumb code to ARM code. That was pointed out by @peter.smith in the comments for D33436. Reviewers: rafael, peter.smith, echristo Reviewed By: peter.smith Subscribers: aemerson, javed.absar, kristof.beyls, llvm-commits, peter.smith Differential Revision: https://reviews.llvm.org/D34447 llvm-svn: 306009	2017-06-22 15:32:41 +00:00
Sanjay Patel	e800df8eac	[InstCombine] add peekThroughBitcast() helper; NFC This is an NFC portion of D33517. We have similar helpers in the backend. llvm-svn: 306008	2017-06-22 15:28:01 +00:00
Petar Jovanovic	636851b845	[mips] Allow $AT to be used as a register name This patch allows $AT to be used as a register name in assembly files. Currently only $at is recognized as a valid register name. Patch by Stanislav Ocovaj. Differential Revision: https://reviews.llvm.org/D34348 llvm-svn: 306007	2017-06-22 15:24:16 +00:00
Nirav Dave	f2c349ccec	[DAG] Add Target Store Merge pass ordering function Allow targets to specify if they should merge stores before or after legalization. llvm-svn: 306006	2017-06-22 15:07:49 +00:00
Pavel Labath	efd57a8aec	Revert "[Support] Add RetryAfterSignal helper function" and subsequent fix The fix in r306003 uncovered a pretty fundamental problem that libc++ implementation of std::result_of does not handle the prototype of open(2) correctly (presumably because it contains ...). This makes the whole function unusable in its current form, so I am also reverting the original commit (r305892), which introduced the function, at least until I figure out a way to solve the libc++ issue. llvm-svn: 306005	2017-06-22 14:18:55 +00:00
Krzysztof Parzyszek	69ffba4595	[Hexagon] Recognize potential offset overflow for store-imm to stack Reserve an extra scavenging stack slot if the offset field in store-immediate instructions may overflow. llvm-svn: 306004	2017-06-22 14:11:23 +00:00
Sagar Thakur	15126308c8	Revert [mips] Adds support for R_MIPS_26, HIGHER, HIGHEST relocations in RuntimeDyld Reverting due to build bot failures llvm-svn: 306000	2017-06-22 12:48:04 +00:00
Sam Kolton	ca5a30ed74	[AMDGPU] SDWA: remove support for VOP2 instructions that have only 64-bit encoding Summary: Although these instructions are listed in VOP2, they are treated as VOP3 in specs. They should not support SDWA. There are no real instructions for them, but there are pseudo instructions. Reviewers: arsenm, vpykhtin, cfang Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D34403 llvm-svn: 305999	2017-06-22 12:42:14 +00:00
Kristof Beyls	9665249fd8	Don't conditionalize Neon instructions, even in IT blocks. This has been deprecated since ARMARM v7-AR, release C.b, published back in 2012. This also removes test/CodeGen/Thumb2/ifcvt-neon.ll that originally was introduced to check that conditionalization of Neon instructions did happen when generating Thumb2. However, the test had evolved and was no longer testing that. Rather than trying to adapt that test, this commit introduces test/CodeGen/Thumb2/ifcvt-neon-deprecated.mir, since we can now use the MIR framework to write nicer/more maintainable tests. llvm-svn: 305998	2017-06-22 12:11:38 +00:00
Sagar Thakur	f8858d0979	[mips] Adds support for R_MIPS_26, HIGHER, HIGHEST relocations in RuntimeDyld After the N64 static relocation model support was added to llvm it is required to add its support in RuntimeDyld also because lldb uses ExecutionEngine for evaluating expressions. Reviewed by sdardis Differential: D31649 llvm-svn: 305997	2017-06-22 11:49:19 +00:00
Simon Dardis	1c73fcc131	[mips] Implement the ".rdata" MIPS assembly directive. Rather than creating a separate ".rdata" section distinct from the customary ".rodata" in ELF, ".rdata" switches to the ".rodata" section. This patch relands r305949 and r305950 with the correct commit message and addresses nit raised during review. Patch By: John Baldwin! Differential Revision: https://reviews.llvm.org/D34452 llvm-svn: 305995	2017-06-22 10:41:51 +00:00
John Brawn	ed78aaf093	[ARM] Add .w aliases of MOV with shifted operand These appear to have been simply missing. Differential Revision: https://reviews.llvm.org/D34461 llvm-svn: 305993	2017-06-22 10:30:53 +00:00
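
The JumpThreading commit (2c20c42cb6) refers to a pair of compares on the same value, (and (cmp A, C1), (cmp A, C2)), being rewritten into a single (cmp (add A, C3), C4). The following is a minimal sketch of the arithmetic behind that rewrite for one common case: an unsigned range check lo <= A < hi on 32-bit values, assuming lo <= hi. It is not LLVM code; the function names and the 32-bit width are assumptions chosen for illustration only.

```python
# Illustration only: models 32-bit unsigned arithmetic with a mask.
# Function names are hypothetical and do not correspond to any LLVM API.
MASK32 = 0xFFFFFFFF


def two_compares(a: int, lo: int, hi: int) -> bool:
    """Original form: (a >=u lo) and (a <u hi)."""
    return lo <= a < hi


def add_and_compare(a: int, lo: int, hi: int) -> bool:
    """Folded form: ((a + (-lo)) mod 2^32) <u (hi - lo).

    If a < lo, the subtraction wraps around to a value of at least
    2^32 - lo, which is larger than hi - lo, so the compare fails.
    """
    return ((a - lo) & MASK32) < ((hi - lo) & MASK32)


def forms_agree(lo: int, hi: int, samples) -> bool:
    """Check that both forms give the same answer on the given samples."""
    return all(two_compares(a, lo, hi) == add_and_compare(a, lo, hi)
               for a in samples)
```

In the folded form the block live-in A feeds the add rather than the compare, which is the situation the commit message says could not previously be analyzed. Knowing a range for A on an incoming edge still lets both forms be decided, since the range of A - lo follows directly from the range of A.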


103930 Commits