llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	d729ed8d44	[InstCombine] add commuted variants for power-of-2 checks; NFC llvm-svn: 363945	2019-06-20 16:27:23 +00:00
Matt Arsenault	b7f87c0ecf	AMDGPU: Treat undef as an inline immediate This should only matter in vectors with an undef component, since a full undef vector would have been folded out. llvm-svn: 363941	2019-06-20 16:01:09 +00:00
Matt Arsenault	fcce531752	AMDGPU: Make test functions hidden Reduces amount of code in the function from eliminating the GOT load. llvm-svn: 363940	2019-06-20 15:38:30 +00:00
Sanjay Patel	345473c791	[InstCombine] add tests for checking power-of-2; NFC llvm-svn: 363938	2019-06-20 15:25:18 +00:00
Cameron McInally	4452c3b490	[NFC][SLP] Pre-commit unary FNeg test to X86/phi3.ll llvm-svn: 363937	2019-06-20 15:17:17 +00:00
Simon Tatham	232db11020	[ARM] Add a batch of MVE integer instructions. This includes integer arithmetic of various kinds (add/sub/multiply, saturating and not), and the immediate forms of VMOV and VMVN that load an immediate into all lanes of a vector. Reviewers: dmgreen, samparker, SjoerdMeijer, t.p.northover Subscribers: javed.absar, kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62674 llvm-svn: 363936	2019-06-20 15:16:56 +00:00
Stanislav Mekhanoshin	0846c125f9	[AMDGPU] gfx1010 core wave32 changes Differential Revision: https://reviews.llvm.org/D63204 llvm-svn: 363934	2019-06-20 15:08:34 +00:00
Stanislav Mekhanoshin	082ffa9960	Virtualize TargetInstrInfo::getRegClass() AMDGPU target needs to override getRegClass() used during instruction selection. We now may have either 32 or 64 bit conditional registers used in the same instructions. For that purpose special SReg_1 register class is created which is dynamically resolved to either SReg_64 or SGPR_32 depending on the subtarget attributes. Differential Revision: https://reviews.llvm.org/D63205 llvm-svn: 363931	2019-06-20 14:59:28 +00:00
George Rimar	30ea0c4d74	[yaml2obj] - Convert `ELFState<ELFT>::addSymbols` method to `toELFSymbols` helper. NFCI. ELFState<ELFT>::addSymbols method looks a bit strange. User code have to create the destination symbols vector outside, add a null symbol and then pass it to addSymbols when it seems the more natural logic is to isolate all work with symbols inside some function, build the list right there and return it. Differential revision: https://reviews.llvm.org/D63493 llvm-svn: 363930	2019-06-20 14:44:48 +00:00
Simon Pilgrim	1d8093249f	[DAGCombiner] Support (shl (zext (srl x, C)), C) -> (zext (shl (srl x, C), C)) non-uniform folds. Use matchBinaryPredicate instead of isConstOrConstSplat to let us handle non-uniform shift cases. llvm-svn: 363929	2019-06-20 14:42:27 +00:00
Simon Pilgrim	72186a2494	[SLP][X86] Add lookahead reordering tests from D60897 llvm-svn: 363925	2019-06-20 12:52:58 +00:00
Simon Pilgrim	98a0ac5c0f	[DAGCombine] Add TODOs for some combines that should support non-uniform vectors We tend to only test for scalar/scalar consts when really we could support non-uniform vectors using ISD::matchUnaryPredicate/matchBinaryPredicate etc. llvm-svn: 363924	2019-06-20 12:48:49 +00:00
Simon Pilgrim	a4d705e0ef	[X86] LowerAVXExtend - handle ANY_EXTEND_VECTOR_INREG lowering as well. llvm-svn: 363922	2019-06-20 11:31:54 +00:00
Simon Pilgrim	a487628270	[DAGCombine] Reduce scope of ShAmtVal variable. NFCI. Fixes cppcheck warning. Use the more capable getAPIntVal() instead of getZExtValue() as well since I'm here. llvm-svn: 363921	2019-06-20 10:56:37 +00:00
Fangrui Song	7064a437f8	[llvm-nm] Generalize ELF symbol types 'N' and 'n' Reviewed By: grimar, jhenderson Differential Revision: https://reviews.llvm.org/D63588 llvm-svn: 363918	2019-06-20 10:15:11 +00:00
Serge Guelton	dad50a7458	[NFC] Update documentation for AtomicCmpXchgInst Fix bz#42325 llvm-svn: 363914	2019-06-20 09:37:52 +00:00
Sjoerd Meijer	e79b3cdc39	TargetParserTest.ARMExtensionFeatures run out of memory on 32-bit (PR42316) Nothing of these tests made much sense. Loops were iterating too much, and I also don't think it was actually testing anything. I think we simply want to check that AEK_SOME_EXT returns "+some_ext". I've given the AArch64 tests the same treatment as they very similarly didn't made any sense either. This fixes PR42316. Differential Revision: https://reviews.llvm.org/D63569 llvm-svn: 363913	2019-06-20 09:33:11 +00:00
Petar Avramovic	153bd24eda	[MIPS GlobalISel] Select integer to floating point conversions Select G_SITOFP and G_UITOFP for MIPS32. Differential Revision: https://reviews.llvm.org/D63542 llvm-svn: 363912	2019-06-20 09:05:02 +00:00
Petar Avramovic	4b4dae1c76	[MIPS GlobalISel] Select floating point to integer conversions Select G_FPTOSI and G_FPTOUI for MIPS32. Differential Revision: https://reviews.llvm.org/D63541 llvm-svn: 363911	2019-06-20 08:52:53 +00:00
Craig Topper	3ba20e943e	[X86] Add test cases showing missed opportunities to use the C flag from the BLSI instruction to avoid a TEST instruction llvm-svn: 363909	2019-06-20 06:45:01 +00:00
Craig Topper	b4ea64570c	[X86] Remove memory instructions form isUseDefConvertible. The caller of this is looking for comparisons of the input to these instructions with 0. But the memory instructions input is an addess not a value input in a register. llvm-svn: 363907	2019-06-20 04:58:40 +00:00
Craig Topper	451f7feb64	[X86] Add v64i8/v32i16 to several places in X86CallingConv.td where they seemed obviously missing. llvm-svn: 363906	2019-06-20 04:29:00 +00:00
Matt Arsenault	c67c484f36	AMDGPU: Don't clobber VCC in MUBUF addr64 emulation Introducing VCC defs during SIFixSGPRCopies is generally problematic. Avoid it by starting with the VOP3 form with the general condition register. This is the easiest to fix instance, but doesn't solve any specific problems I'm looking at. llvm-svn: 363904	2019-06-20 00:51:28 +00:00
Eli Friedman	d88e28d13e	[llvm-objdump] Switch between ARM/Thumb based on mapping symbols. The ARMDisassembler changes allow changing between ARM and Thumb mode based on the MCSubtargetInfo, rather than the Target, which simplifies the other changes a bit. I'm not really happy with adding more target-specific logic to tools/llvm-objdump/, but there isn't any easy way around it: the logic in question specifically applies to disassembling an object file, and that code simply isn't located in lib/Target, at least at the moment. Differential Revision: https://reviews.llvm.org/D60927 llvm-svn: 363903	2019-06-20 00:29:40 +00:00
Matt Arsenault	e4c2e9b016	AMDGPU: Consolidate some getGeneration checks This is incomplete, and ideally these would all be removed, but it's better to localize them to the subtarget first with comments about what they're for. llvm-svn: 363902	2019-06-19 23:54:58 +00:00
Thomas Preud'homme	a2ef1ba32f	[FileCheck] Stop qualifying expressions as numeric Summary: Stop referring to "numeric expression", using simply the term "expression" instead. Likewise for numeric operation since operations are only used in numeric expressions. Reviewers: jhenderson, jdenny, probinson, arichardson Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63500 llvm-svn: 363901	2019-06-19 23:47:24 +00:00
Thomas Preud'homme	baae41ff76	FileCheck: Return parse error w/ Error & Expected Summary: Make use of Error and Expected to bubble up diagnostics and force checking of errors in the callers. Reviewers: jhenderson, jdenny, probinson, arichardson Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63125 llvm-svn: 363900	2019-06-19 23:47:10 +00:00
Matt Arsenault	e24b34e9c9	AMDGPU: Undo sub x, c canonicalization for v2i16 Should avoid regression from D62341 llvm-svn: 363899	2019-06-19 23:37:43 +00:00
Matt Arsenault	532be255a5	AMDGPU: Add baseline test for vector sub x, c canonicalization This will catch regressions from D62341, and show improvements from a future patch to fix them. llvm-svn: 363888	2019-06-19 22:37:08 +00:00
Simon Pilgrim	046d49a8dc	[DAGCombine] Use ConstantSDNode::getAPIntValue() instead of getZExtValue(). Use getAPIntValue() in a few more places. Most of the time getZExtValue() is fine, but occasionally there's fuzzed code or someone decides to create i65536 or something..... llvm-svn: 363887	2019-06-19 22:14:24 +00:00
Simon Atanasyan	f61c43c636	[mips] Mark the `lwupc` instruction as MIPS64 R6 only The "The MIPS64 Instruction Set Reference Manual" [1] states that the `lwupc` is MIPS64 Release 6 only. It should not be supported for 32-bit CPUs. [1] https://s3-eu-west-1.amazonaws.com/downloads-mips/documents/MD00087-2B-MIPS64BIS-AFP-6.06.pdf llvm-svn: 363886	2019-06-19 22:08:06 +00:00
Simon Atanasyan	0121432602	[mips] Add (GPR\|PTR)_64 predicates to PseudoReturn64 and PseudoIndirectHazardBranch64 This patch is one of a series of patches. The goal is to make P5600 scheduler model complete and turn on the `CompleteModel` flag. llvm-svn: 363885	2019-06-19 22:07:46 +00:00
Philip Reames	046f855758	[Util] Add a helper script for converting -print-before-all output into a file based equivelent Simple little utility which takes a opt logfile generated with "opt -print-before-all -print-module-scope -o /dev/null <args> 2&>1", and splits into a series of individual "chunk-X.ll" files. The intended purpose is to help automate one step in failure reduction. The imagined workflow is: New crasher bug reported against clang or other frontend Frontend run with -emit-llvm equivalent and manually confirmed that opt -O2 <emit.ll> crashes Run this splitter script Manually map pass name to invocation command (next on the to automate list) Run bugpoint on last chunk file + manual command I chose to dump every chunk rather than only the last since miscompile debugging frequently requires either manual step by step reduction, or cross feeding IR into different compiler versions. Not an immediate target, but there may be applications. Differential Revision: https://reviews.llvm.org/D63461 llvm-svn: 363884	2019-06-19 22:05:47 +00:00
Philip Reames	eda1ba65ca	LFTR for multiple exit loops Teach IndVarSimply's LinearFunctionTestReplace transform to handle multiple exit loops. LFTR does two key things 1) it rewrites (all) exit tests in terms of a common IV potentially eliminating one in the process and 2) it moves any offset/indexing/f(i) style logic out of the loop. This turns out to actually be pretty easy to implement. SCEV already has all the information we need to know what the backedge taken count is for each individual exit. (We use that when computing the BE taken count for the loop as a whole.) We basically just need to iterate through the exiting blocks and apply the existing logic with the exit specific BE taken count. (The previously landed NFC makes this super obvious.) I chose to go ahead and apply this to all loop exits instead of only latch exits as originally proposed. After reviewing other passes, the only case I could find where LFTR form was harmful was LoopPredication. I've fixed the latch case, and guards aren't LFTRed anyways. We'll have some more work to do on the way towards widenable_conditions, but that's easily deferred. I do want to note that I added one bit after the review. When running tests, I saw a new failure (no idea why didn't see previously) which pointed out LFTR can rewrite a constant condition back to a loop varying one. This was theoretically possible with a single exit, but the zero case covered it in practice. With multiple exits, we saw this happening in practice for the eliminate-comparison.ll test case because we'd compute a ExitCount for one of the exits which was guaranteed to never actually be reached. Since LFTR ran after simplifyAndExtend, we'd immediately turn around and undo the simplication work we'd just done. The solution seemed obvious, so I didn't bother with another round of review. Differential Revision: https://reviews.llvm.org/D62625 llvm-svn: 363883	2019-06-19 21:58:25 +00:00
Philip Reames	80eb1ce7a0	[Tests] Autogen a test so that future changes are understandable llvm-svn: 363882	2019-06-19 21:39:07 +00:00
Alina Sbirlea	109d2ea153	[MemorySSA] Cleanup trivial phis. Summary: This is unfortunately needed for correctness, if we are to extend the tolerance of the update API to the way simple loop unswitch is doing cloning. In simple loop unswitch (as opposed to loop unswitch), not all blocks are cloned. This can create unreachable cloned blocks (no predecessor), which are later cleaned up. In MemorySSA, the APIs for supporting these kind of updates (clone + update exit blocks), make certain assumption on the integrity of the CFG. When cloning, if something was not cloned, it's values in MemorySSA default to LiveOnEntry. When updating exit blocks, it is safe to assume that we can first insert phis in the blocks merging two clones, then add additional phis in the IDF of the blocks that received phis. This no longer holds true if one of the clones being merged comes from an unreachable block. We'd conservatively need to add all phis before filling in their incoming definitions. In practice this restriction can be relaxed if we clean up trivial phis after the first round of insertion. Reviewers: george.burgess.iv Subscribers: jlebar, Prazek, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63354 llvm-svn: 363880	2019-06-19 21:33:09 +00:00
Alina Sbirlea	238b8e62b6	[MemorySSA] Use GraphDiff info when computing IDF. Summary: When computing IDF for insert updates, ensure we use the snapshot CFG offered by GraphDiff. Caught by D63389. Reviewers: kuhar, george.burgess.iv Subscribers: jlebar, Prazek, llvm-commits, Szelethus Tags: #llvm Differential Revision: https://reviews.llvm.org/D63443 llvm-svn: 363879	2019-06-19 21:17:31 +00:00
Philip Reames	ce53e2226c	[LFTR] Stylistic cleanup as suggested in last review comment of D62939 [NFC] (Resumbit of r363292 which was reverted along w/an earlier patch) llvm-svn: 363877	2019-06-19 20:45:57 +00:00
Matt Arsenault	4d000d2488	AMDGPU: Fix folding immediate into readfirstlane through reg_sequence The def instruction for the vreg may not match, because it may be folding through a reg_sequence. The assert was overly conservative and not necessary. It's not actually important if DefMI really defined the register, because the fold that will be done cares about the def of the value that will be folded. For some reason copies aren't making it through the reg_sequence, although they should. llvm-svn: 363876	2019-06-19 20:44:15 +00:00
Philip Reames	f8104f01e6	[LFTR] Rename variable to minimize confusion [NFC] (Recommit of r363293 which was reverted when a dependent patch was.) As pointed out by Nikita in D62625, BackedgeTakenCount is generally used to refer to the backedge taken count of the loop. A conditional backedge taken count - one which only applies if a particular exit is taken - is called a ExitCount in SCEV code, so be consistent here. llvm-svn: 363875	2019-06-19 20:41:28 +00:00
Peter Collingbourne	2742eeb78e	hwasan: Shrink outlined checks by 1 instruction. Turns out that we can save an instruction by folding the right shift into the compare. Differential Revision: https://reviews.llvm.org/D63568 llvm-svn: 363874	2019-06-19 20:40:03 +00:00
Matt Arsenault	4d55d024be	Reapply "AMDGPU: Add ds_gws_init / ds_gws_barrier intrinsics" This reapplies r363678, using the correct chain for the CopyToReg for v0. glueCopyToM0 counterintuitively changes the operands of the original node. llvm-svn: 363870	2019-06-19 19:55:27 +00:00
Yuanfang Chen	40a156b791	[llvm-readobj] Match GNU output for DT_RPATH and DT_RUNPATH when dumping dynamic symbol table. Reviewers: jhenderson, grimar, MaskRay, rupprecht, espindola Subscribers: emaste, nemanjai, arichardson, kbarton, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63347 llvm-svn: 363868	2019-06-19 19:31:07 +00:00
Philip Reames	caf9655d82	[SCEV] Revise a method description to match actual behavior [NFC] Reword the ScalarEvolution::getExitCount comment in the same terminology as used by getBackedgeTakenCount since they're equivelent for single exit loops. Also, strengthen the comment to indicate exiting on the exact iteration specified is guaranteed. Several transforms implicitly rely on this; and the actual implementation checks for it (via dominating latch checks). So, spell out the guarantee in the comment. llvm-svn: 363867	2019-06-19 19:23:19 +00:00
Peter Collingbourne	9f42845187	gn build: Merge r363757. llvm-svn: 363865	2019-06-19 19:11:23 +00:00
Peter Collingbourne	7a84c5adcd	gn build: Merge r363848. llvm-svn: 363864	2019-06-19 19:11:14 +00:00
Peter Collingbourne	cf2406cd60	gn build: Merge r363846. llvm-svn: 363863	2019-06-19 19:11:05 +00:00
Peter Collingbourne	e99d33885e	gn build: Merge r363794. llvm-svn: 363862	2019-06-19 19:10:56 +00:00
Peter Collingbourne	dd3df8ed34	gn build: Merge r363680. llvm-svn: 363861	2019-06-19 19:10:47 +00:00
Peter Collingbourne	f91d8ba0be	gn build: Merge r363712. llvm-svn: 363860	2019-06-19 19:10:38 +00:00

1 2 3 4 5 ...

180582 Commits