llvm-project

Commit Graph

Author	SHA1	Message	Date
Francis Visoiu Mistrih	57fcd3454a	[MIR] Add support for debug metadata for fixed stack objects Debug var, expr and loc were only supported for non-fixed stack objects. This patch adds the following fields to the "fixedStack:" entries, and renames the ones from "stack:" to: * debug-info-variable * debug-info-expression * debug-info-location Differential Revision: https://reviews.llvm.org/D46032 llvm-svn: 330859	2018-04-25 18:58:06 +00:00
Nico Weber	a48924c706	IWYU llvm-config.h for LLVM_VERSION_STRING llvm-svn: 330856	2018-04-25 18:34:00 +00:00
Sam Clegg	9067b46e1b	[WebAssembly] Add Module name to WasmSymbol Imports in a wasm module can have a custom module name. This change adds the module name to the WasmSymbol structure so that the linker can preserve this module name. This is needed to fix: https://bugs.llvm.org/show_bug.cgi?id=37168 Differential Revision: https://reviews.llvm.org/D45797 llvm-svn: 330854	2018-04-25 18:24:08 +00:00
Nico Weber	4264a38916	Rename sancov.cc to sancov.cpp LLVM uses cpp as its C++ file ending. https://reviews.llvm.org/D46068 llvm-svn: 330853	2018-04-25 18:06:23 +00:00
Simon Pilgrim	0ae4bba911	[CostModel][X86] Add div/rem tests for non-uniform constant divisors llvm-svn: 330852	2018-04-25 18:03:31 +00:00
Craig Topper	300e20d61c	[X86] Form MUL_IMM for multiplies with 3/5/9 to encourage LEA formation over load folding. Previously we only formed MUL_IMM when we split a constant. This blocked load folding on those cases. We should also form MUL_IMM for 3/5/9 to favor LEA over load folding. Differential Revision: https://reviews.llvm.org/D46040 llvm-svn: 330850	2018-04-25 17:35:03 +00:00
Reid Kleckner	1ca666886f	Revert r330755 "[lit] Report line number for failed RUN command" It is causing many tests to fail on Windows buildbots: http://lab.llvm.org:8011/builders/clang-x64-ninja-win7/builds/10211 llvm-svn: 330848	2018-04-25 17:30:00 +00:00
Alex Bradbury	cd8688a4c2	[RISCV] Allow call pseudoinstruction to be used to call a function name that coincides with a register name Previously `call zero`, `call f0` etc would fail. This leads to compilation failures if building programs that define functions with those names and using -save-temps. llvm-svn: 330846	2018-04-25 17:25:29 +00:00
Nico Weber	39d61944df	Don't list a source file twice. llvm-svn: 330845	2018-04-25 17:24:41 +00:00
Taewook Oh	923c216da5	[ICP] Do not attempt type matching for variable length arguments. Summary: When performing indirect call promotion, the current implementation inspects "all" parameters of the callsite and attempts to match them with the formal argument types of the callee function. However, it is not possible to find the type for variable length arguments, and the compiler crashes when it attempts to match the type for a variable length argument. It seems that the bug was introduced with D40658. Prior to that, the type matching was performed only for the parameters whose ID is less than callee->getFunctionNumParams(). The attached test case will crash without the patch. Reviewers: mssimpso, davidxl, davide Reviewed By: mssimpso Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46026 llvm-svn: 330844	2018-04-25 17:19:21 +00:00
Nico Weber	79c6ec484e	Rename Attributes.gen, Intrinsics.gen to Attributes.inc, Intrinsics.inc Virtually all other tablegen outputs are called .inc, not .gen, so rename these two too for consistency. No behavior change. https://reviews.llvm.org/D46058 llvm-svn: 330843	2018-04-25 17:07:46 +00:00
Sanjay Patel	807ddee1bf	[InstCombine] clean up foldSelectICmpAnd(); NFC As discussed in D45862, we want to delete parts of this code because it can create more instructions than it removes. But we also want to preserve some folds that are winners, so tidy up what's here to make splitting the good from bad a bit easier. llvm-svn: 330841	2018-04-25 16:34:01 +00:00
Sanjay Patel	0387ceb67a	[InstCombine] add tests for select to logic folds; NFC As discussed in D45862, we want these folds sometimes because they're good improvements. But as we can see here, the current logic doesn't check uses and doesn't produce optimal code in all cases. llvm-svn: 330837	2018-04-25 15:59:23 +00:00
Simon Pilgrim	58e03a09db	[CostModel][X86] Recursive call for cost of imul for packed v16i16 constant shift left. Don't just assume cost = 1. llvm-svn: 330834	2018-04-25 15:22:03 +00:00
Amara Emerson	1f5d994119	[AArch64][GlobalISel] Implement selection for the llvm.trap intrinsic. rdar://38674040 llvm-svn: 330831	2018-04-25 14:43:59 +00:00
Paul Walker	f6e44a0bdf	Fix typo in static_assert for size of LoadSDNodeBitfields. Reviewers: fhahn, jlebar, delena, RKSimon Reviewed By: fhahn, jlebar Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45769 llvm-svn: 330830	2018-04-25 14:42:44 +00:00
Filipe Cabecinhas	def742ca52	[llvm-mca] Make ViewOptions static. NFCI llvm-svn: 330829	2018-04-25 14:39:16 +00:00
Shiva Chen	d58bd8dc4a	[RISCV] Expand function call to "call" pseudoinstruction To do this: 1. Change GlobalAddress SDNode to TargetGlobalAddress to prevent the legalizer from splitting the symbol. 2. Change ExternalSymbol SDNode to TargetExternalSymbol to prevent the legalizer from splitting the symbol. 3. Let PseudoCALL match direct calls with target operand TargetGlobalAddress and TargetExternalSymbol. Differential Revision: https://reviews.llvm.org/D44885 llvm-svn: 330827	2018-04-25 14:19:12 +00:00
Shiva Chen	98f9389f65	[RISCV] Support "call" pseudoinstruction in the MC layer To do this: 1. Add PseudoCALLIndirect to match indirect function call. 2. Add PseudoCALL to support parsing and printing pseudo `call` in assembly 3. Expand PseudoCALL to the following form with R_RISCV_CALL relocation type while encoding: auipc ra, func jalr ra, ra, 0 If we expand PseudoCALL before emitting assembly, we will see an auipc and jalr pair when compiling with -S. It's hard for the assembly parser to parse this pair, identify that its semantics are a function call, and then insert the R_RISCV_CALL relocation type. We could insert R_RISCV_PCREL_HI20 and R_RISCV_PCREL_LO12_I relocation types instead of R_RISCV_CALL, but due to the RISCV relocation design, an auipc and jalr pair can only be relaxed to jal with R_RISCV_CALL + R_RISCV_RELAX relocation types. We expand PseudoCALL as late as encoding (RISCVMCCodeEmitter) instead of before emitting assembly (RISCVAsmPrinter) because we want to preserve the call pseudoinstruction in assembly code. It's more readable, and the assembly parser can identify the call assembly and insert the R_RISCV_CALL relocation type. Differential Revision: https://reviews.llvm.org/D45859 llvm-svn: 330826	2018-04-25 14:18:55 +00:00
Simon Dardis	0f2f5976d0	[mips] Teach the delay slot filler to transform 'jal' for microMIPS ISel is currently picking 'JAL' over 'JAL_MM' for calling a function when targeting microMIPS. A later patch will correct this behaviour. This patch extends the mechanism for transforming instructions into their short delay to recognise 'JAL_MM' for transforming into 'JALS_MM'. llvm-svn: 330825	2018-04-25 14:12:57 +00:00
Simon Pilgrim	7574ffd7bc	[llvm-mca][X86] Updated fma3 tests after rL330820 llvm-svn: 330822	2018-04-25 13:19:04 +00:00
Simon Pilgrim	dbd1ae7ddd	[X86] Split WriteFMA into XMM, Scalar and YMM/ZMM scheduler classes This removes all the FMA InstRW overrides. If we ever get PR36924, then we can remove many of these declarations from models. llvm-svn: 330820	2018-04-25 13:07:58 +00:00
Roman Lebedev	cfa9e58ccf	[X86][AArch64][NFC] Finish adding 'bad' tests for masked merge unfolding with constants. I initially committed basic tests in rL330771, but then quickly discovered that there are a few more interesting patterns. llvm-svn: 330819	2018-04-25 12:48:23 +00:00
Alexander Timofeev	b934728cd2	[AMDGPU] Revert b0efc4fd6 (https://reviews.llvm.org/D40556 ) llvm-svn: 330818	2018-04-25 12:32:46 +00:00
Gabor Buella	f4bd34b026	Avoid a warning on pointer casting, NFC Reviewers: philip.pfaffe Reviewed By: philip.pfaffe Differential Revision: https://reviews.llvm.org/D46012 llvm-svn: 330817	2018-04-25 12:15:34 +00:00
Andrea Di Biagio	534e1dab79	[llvm-mca] Add a new option category for views. With this patch, options to add/tweak views are all grouped together in the -help output. The new "View Options" category looks like this: ``` View Options: -dispatch-stats - Print dispatch statistics -instruction-info - Print the instruction info view -instruction-tables - Print instruction tables -register-file-stats - Print register file statistics -resource-pressure - Print the resource pressure view -retire-stats - Print retire control unit statistics -scheduler-stats - Print scheduler statistics -timeline - Print the timeline view -timeline-max-cycles=<uint> - Maximum number of cycles in the timeline view. Defaults to 80 cycles -timeline-max-iterations=<uint> - Maximum number of iterations to print in timeline view ``` llvm-svn: 330816	2018-04-25 11:33:14 +00:00
Greg Bedwell	03e85e126c	[UpdateTestChecks] Change update_mca_test_checks.py file mode to match the other scripts llvm-svn: 330815	2018-04-25 11:20:42 +00:00
Simon Pilgrim	6a82e96ed9	[X86][SKX] Setup WriteFAdd and remove unnecessary InstRW scheduler overrides. llvm-svn: 330813	2018-04-25 10:51:19 +00:00
Simon Pilgrim	98e21c5ade	[X86][SNB] Remove unnecessary WriteFBlendLd InstRW scheduler overrides. llvm-svn: 330812	2018-04-25 10:50:39 +00:00
Andrea Di Biagio	641cca3ddf	[llvm-mca] run clang-format on a bunch of files. NFC llvm-svn: 330811	2018-04-25 10:27:30 +00:00
Simon Dardis	eac9301cdb	[mips] Fix the definition of sync, synci Also, fix the disassembly of synci for microMIPS. Reviewers: abeserminji, smaksimovic, atanasyan Differential Revision: https://reviews.llvm.org/D45870 llvm-svn: 330810	2018-04-25 10:19:22 +00:00
Andrea Di Biagio	93c49d5e58	[llvm-mca] Default to the native host cpu if flag -mcpu is not specified. llvm-svn: 330809	2018-04-25 10:18:25 +00:00
Andrea Di Biagio	db66efcb6a	[llvm-mca] Remove method Instruction::isZeroLatency(). NFCI llvm-svn: 330807	2018-04-25 09:38:58 +00:00
Florian Hahn	1da30c659d	[LoopInterchange] Use getExitBlock()/getExitingBlock instead of manual impl. This also means we have to check if the latch is the exiting block now, as `transform` expects the latches to be the exiting blocks too. https://bugs.llvm.org/show_bug.cgi?id=36586 Reviewers: efriedma, davide, karthikthecool Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D45279 llvm-svn: 330806	2018-04-25 09:35:54 +00:00
Sander de Smalen	eb896b148b	[AArch64][SVE] Asm: Add AsmOperand classes for SVE gather/scatter addressing modes. This patch adds parsing support for 'vector + shift/extend' and corresponding asm operand classes, needed for implementing SVE's gather/scatter addressing modes. The added combinations of vector (ZPR) and Shift/Extend are: Unscaled: ZPR64ExtLSL8: signed 64-bit offsets (z0.d) ZPR32ExtUXTW8: unsigned 32-bit offsets (z0.s, uxtw) ZPR32ExtSXTW8: signed 32-bit offsets (z0.s, sxtw) Unpacked and unscaled: ZPR64ExtUXTW8: unsigned 32-bit offsets (z0.d, uxtw) ZPR64ExtSXTW8: signed 32-bit offsets (z0.d, sxtw) Unpacked and scaled: ZPR64ExtUXTW<scale>: unsigned 32-bit offsets (z0.d, uxtw #<shift>) ZPR64ExtSXTW<scale>: signed 32-bit offsets (z0.d, sxtw #<shift>) Scaled: ZPR32ExtUXTW<scale>: unsigned 32-bit offsets (z0.s, uxtw #<shift>) ZPR32ExtSXTW<scale>: signed 32-bit offsets (z0.s, sxtw #<shift>) ZPR64ExtLSL<scale>: unsigned 64-bit offsets (z0.d, lsl #<shift>) ZPR64ExtLSL<scale>: signed 64-bit offsets (z0.d, lsl #<shift>) Patch [1/3] in series to add support for SVE's gather load instructions that use scalar+vector addressing modes: - Patch [1/3]: https://reviews.llvm.org/D45951 - Patch [2/3]: https://reviews.llvm.org/D46023 - Patch [3/3]: https://reviews.llvm.org/D45958 Reviewers: fhahn, rengolin, samparker, SjoerdMeijer, t.p.northover, echristo, evandro, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D45951 llvm-svn: 330805	2018-04-25 09:26:47 +00:00
Bjorn Pettersson	bec2a7c4eb	[DebugInfo] Invalidate debug info in ReassociatePass::RewriteExprTree Summary: When Reassociate is rewriting an expression tree it may reuse old binary expression nodes for new expressions. Whenever an expression node is reused, but with a non-trivial change in the result, we need to invalidate any debug info that is associated with the node. If for example rewriting x = mul a, b y = mul c, x into x = mul c, b y = mul a, x we still get the same result for 'y', but 'x' is a new expression. All debug info referring to 'x' must be invalidated (marked as optimized out) since we no longer calculate the expected value. As a side-effect this patch avoids (at least some) problems where reassociate could end up creating IR with debug-use before def. Earlier the dbg.value nodes were left untouched in the IR, while the reused binary nodes were sunk to just before the root node of the rewritten expression tree. See PR27273 for more info about such problems. Reviewers: dblaikie, aprantl, dexonsmith Reviewed By: aprantl Subscribers: JDevlieghere, llvm-commits Tags: #debug-info Differential Revision: https://reviews.llvm.org/D45975 llvm-svn: 330804	2018-04-25 09:23:56 +00:00
Craig Topper	773ead2908	[TableGen] Fix bad indentation in tablegen output file. llvm-svn: 330801	2018-04-25 06:24:51 +00:00
David Bolvansky	3ea50f9fef	Merging r46043: llvm-svn: 330799	2018-04-25 04:33:36 +00:00
Craig Topper	bba52806b1	[X86] Auto-generate complete checks. NFC llvm-svn: 330797	2018-04-25 03:40:45 +00:00
Geoff Berry	2af5f3c1e5	[DivRemPairs] Fix non-determinism in use list order. Summary: Use a MapVector instead of a DenseMap for RemMap since it is iterated over and the order of iteration can affect the order that new instructions are created. This can in turn affect the use list order of div/rem input values if multiple new instructions are created that share any input values. Reviewers: spatel Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D45858 llvm-svn: 330792	2018-04-25 02:17:56 +00:00
Chandler Carruth	69e68f8468	[PM/LoopUnswitch] Begin teaching SimpleLoopUnswitch to use the new update API for dominators rather than doing manual, hacky updates. This is just the first step, but in some ways the most important as it moves the non-trivial unswitching to update the domtree rather than fully recalculating it each time. Subsequent patches should remove the custom update logic used by the trivial unswitch and replace it with uses of the update API. This also fixes a number of bugs I was seeing when testing non-trivial unswitch due to it querying the quasi-correct dominator tree. Now the tree is 100% correct and safe to query. That said, there are still more bugs I can see with non-trivial unswitch just running over the test suite, so more bugfix patches are needed as well. Thanks to both Sanjoy and Fedor for reviews and testing! Differential Revision: https://reviews.llvm.org/D45943 llvm-svn: 330787	2018-04-25 00:18:07 +00:00
Jessica Paquette	4f56428de1	[MachineOutliner] Check for explicit uses of LR/W30 in MI operands Before, the outliner would grab ADRPs that used LR/W30. This patch fixes that by checking for explicit uses of those registers before the special-casing for ADRPs. This also adds a test that ensures that those sorts of ADRPs won't be outlined. llvm-svn: 330783	2018-04-24 22:38:15 +00:00
Craig Topper	f3cefad255	[DAGCombiner][X86] When promoting loads don't use ZEXTLOAD even if it's legal We were previously preferring ZEXTLOAD over EXTLOAD if it is legal. This triggers during X86's promotion of i16->i32. Not sure about other targets. Using ZEXTLOAD can prevent folding it to SEXTLOAD later if we were to promote a sign extended operand like we would need for SRA. However, X86 doesn't currently promote i16 SRA. I was looking into doing that which is how I found this issue. This is also blocking our ability to fold 4 byte aligned EXTLOADs with "loadi32". This is what caused most of the test changes here. Differential Revision: https://reviews.llvm.org/D45585#inline-402825 llvm-svn: 330781	2018-04-24 22:35:27 +00:00
Reid Kleckner	891c8fb753	Fix path separator checks on Windows llvm-svn: 330779	2018-04-24 22:03:07 +00:00
Warren Ristow	b960d2cb40	[X86] Account for partial stack slot spills (PR30821) Previously, _any_ store or load instruction was considered to be operating on a spill if it had a frameindex as an operand, and thus was fair game for optimisations such as "StackSlotColoring". This usually works, except on architectures where spills can be partially restored, for example on X86 where a spilt vector can have a single component loaded (zeroing the rest of the target register). This can be mis-interpreted and the zero extension unsoundly eliminated, see pr30821. To avoid this, this commit optionally provides the caller to isLoadFromStackSlot and isStoreToStackSlot with the number of bytes spilt/loaded by the given instruction. Optimisations can then determine that a full spill followed by a partial load (or vice versa), for example, cannot necessarily be commuted. Patch by Jeremy Morse! Differential Revision: https://reviews.llvm.org/D44782 llvm-svn: 330778	2018-04-24 22:01:50 +00:00
Alexander Shaposhnikov	b3bfd3b028	[llvm-objcopy] Adjust the help message Capitalize the first letter, make the text a bit more consistent. NFC. Differential revision: https://reviews.llvm.org/D46025 llvm-svn: 330777	2018-04-24 21:44:13 +00:00
Reid Kleckner	5176039795	Bring back APInt self-move assignment check for MSVC only Summary: It was removed about a year ago in r300477. Bring it back, along with its unittest, when the MSVC STL is in use. The MSVC STL performs self-assignment in std::shuffle. These days, llvm::sort calls std::shuffle when expensive checks are enabled to help find non-determinism bugs. Reviewers: craig.topper, chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46028 llvm-svn: 330776	2018-04-24 21:41:50 +00:00
Tom Stellard	a2be8f4c35	AMDGPU: Remove deprecated llvm.AMDGPU.kilp intrinsic Summary: This is no longer used by mesa since its 18.0.0 release. Reviewers: nhaehnle Reviewed By: nhaehnle Subscribers: arsenm, kzhuravl, wdng, yaxunl, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D45988 llvm-svn: 330775	2018-04-24 21:37:57 +00:00
Tom Stellard	257882ff72	AMDGPU/GlobalISel: Fall-back to SelectionDAG for non-void functions Reviewers: arsenm, nhaehnle Reviewed By: nhaehnle Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45843 llvm-svn: 330774	2018-04-24 21:29:36 +00:00
Mandeep Singh Grang	434d4c0b88	[docs] Add a note on non-deterministic sorting order of equal elements Reviewers: RKSimon, t.p.northover, dexonsmith Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45831 llvm-svn: 330773	2018-04-24 21:25:57 +00:00
Roman Lebedev	54df09e8fe	[X86][AArch64][NFC] Add tests for masked merge unfolding with %y = const The fold was added in D45733. This appears to be a regression. llvm-svn: 330771	2018-04-24 21:23:22 +00:00
Daniel Neilson	3c148720fa	[CaptureTracking] Fixup const correctness of DomTree arg (NFC) Summary: The PointerMayBeCapturedBefore function's DomTree arg should be const instead of non-const. There are no non-const uses of it in the function. llvm-svn: 330769	2018-04-24 21:12:45 +00:00
Sanjay Patel	54795bb16b	[InstCombine] move tests for select with bit-test of condition; NFC These are all but 1 of the select-of-constant tests that appear to be transformed within foldSelectICmpAnd() and the block above it predicated by decomposeBitTestICmp(). As discussed in D45862 (and can be seen in several tests here), we probably want to stop doing those transforms because they can increase the instruction count without benefitting other passes or codegen. The 1 test not included here is a urem test where the bit hackery allows us to remove a urem. To preserve killing that urem, we should do some stronger known-bits analysis or pattern matching of 'urem x, (select-of-pow2-constants)'. llvm-svn: 330768	2018-04-24 21:06:06 +00:00
Tom Stellard	c7709e1c29	AMDGPU/GlobalISel: Add support for amdgpu_ps calling convention Reviewers: arsenm Reviewed By: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45837 llvm-svn: 330767	2018-04-24 20:51:28 +00:00
Chandler Carruth	7e1c3345a0	[wasm] Fix uninitialized memory introduced in r330749. Found with MSan. This was causing all the WASM MC tests to fail about 10% of the time. llvm-svn: 330764	2018-04-24 20:30:56 +00:00
Rafael Espindola	b71251ca71	[bugpoint] Fix crash when testing for miscompilation. Method BugDriver::performFinalCleanups(...) would delete Module object it worked on, which was also deleted by its caller (e.g. TestCodeGenerator(...)). Changed the code to avoid double delete and make Module ownership slightly clearer. Patch by Andrzej Janik. llvm-svn: 330763	2018-04-24 20:15:27 +00:00
Sam McCall	bdc0fe55d1	[Support] fix countLeadingZeros for types shorter than int llvm-svn: 330762	2018-04-24 20:08:05 +00:00
Shoaib Meenai	f2650df77d	[cmake] Fix libc++ detection -stdlib=libc++ is added to both the compilation and the link flags, but the logic for adding it was only checking if it was supported during compilation and not linking. This could lead to false positives, for example when using clang with libstdc++ (where the compiler would support -stdlib=libc++ but then linking would fail because of libc++ actually being unavailable). llvm-svn: 330761	2018-04-24 19:47:39 +00:00
Simon Pilgrim	c4d25a2922	[X86][SKX] Setup WriteFMul and remove unnecessary InstRW scheduler overrides. llvm-svn: 330760	2018-04-24 19:22:01 +00:00
Vedant Kumar	4ce143088c	[test] Update llc checks for CodeGen/X86/avg.ll The output of update_llc_test_checks.py on this test file has changed, so the test file should be updated to minimize source changes in future patches. The test updates for this file appear to be limited to relaxations of the form: -; SSE2-NEXT: movq %rdi, -{{[0-9]+}}(%rsp) # 8-byte Spill +; SSE2-NEXT: movq %rdi, {{[-0-9]+}}(%r{{[sb]}}p) # 8-byte Spill This was suggested in https://reviews.llvm.org/D45995. llvm-svn: 330758	2018-04-24 19:20:18 +00:00
Andrea Di Biagio	ba625f0a86	[llvm-mca] Remove unused flag -verbose. NFC I forgot to remove it at r329794. llvm-svn: 330757	2018-04-24 19:14:56 +00:00
Simon Pilgrim	27bc83e228	[X86] Split off PHMINPOSUW to their own schedule class This also fixes Jaguar's schedule which was treating it as the WriteVecIMul default. llvm-svn: 330756	2018-04-24 18:49:25 +00:00
Joel E. Denny	8a475307bd	[lit] Report line number for failed RUN command When debugging test failures with -vv (or -v in the case of the internal shell), this makes it easier to locate the RUN line that failed. For example, clang's test/Driver/linux-ld.c has 892 total RUN lines, and clang's test/Driver/arm-cortex-cpus.c has 424 RUN lines after concatenation for line continuations. When reading the generated shell script, this also makes it easier to locate the RUN line that produced each command. To support reporting RUN line numbers in the case of the internal shell, this patch extends the internal shell to support the null command, ":", except pipelines are not supported. Reviewed By: asmith, delcypher Differential Revision: https://reviews.llvm.org/D44598 llvm-svn: 330755	2018-04-24 18:43:25 +00:00
Stanislav Mekhanoshin	a4bfb3c446	[AMDGPU] Truncate packed inline constant If a packed inline constant is sign extended it must be truncated after the shift. For example, a constant (0xH0000, 0xHBC00) will be represented as 0xFFFFFFFFBC000000 in the IR because the immediate is sign extended to 64 bits. After the value is shifted right by 16 to use it in a low part with op_sel_hi, it becomes 0xFFFFFFFFBC00 and no longer qualifies as an inline constant. Fixed the error and added verification code. Without the fix, and with the verification in place, the bug causes pk_max_f16_literal.ll to fail. Differential Revision: https://reviews.llvm.org/D45987 llvm-svn: 330752	2018-04-24 18:17:55 +00:00
Simon Pilgrim	81cb67ad82	[XOP] v4i32 IFMA 'VPMACS' instructions should use the WritePMULLD schedule class llvm-svn: 330751	2018-04-24 18:13:57 +00:00
Sam Clegg	6f08c84ae5	[WebAssembly] Use section index in relocation section header Rather than referring to sections by their code, use the absolute index of the target section within the module. See https://github.com/WebAssembly/tool-conventions/issues/52 Differential Revision: https://reviews.llvm.org/D45980 llvm-svn: 330749	2018-04-24 18:11:36 +00:00
Florian Hahn	97ae30b8d6	[LoopInterchange] Add REQUIRES: asserts to test. llvm-svn: 330748	2018-04-24 18:10:52 +00:00
Simon Pilgrim	cf0199a289	[AVX512] VPERMQ/VPERMPD/VPERMIL single op shuffles are not variable shuffles These variants all take an immediate shuffle mask value and should be scheduled as such. llvm-svn: 330747	2018-04-24 17:59:54 +00:00
Nico Weber	ebc7c74f2f	Let TableGen write output only if it changed, instead of doing so in cmake. Removes one subprocess and one temp file from the build for each tablegen invocation. No intended behavior change. https://reviews.llvm.org/D45899 llvm-svn: 330742	2018-04-24 17:29:05 +00:00
Simon Dardis	d2ac0faf3b	Reland "[mips] Guard traps for microMIPS correctly" This is part of fixing the instruction predicates for MIPS. Reviewers: atanasyan, abeserminji Differential Revision: https://reviews.llvm.org/D44212 This patch relands r327409, hopefully without the problematic part of the tests that cause FileCheck to assert on the windows expensive checks bot. llvm-svn: 330741	2018-04-24 17:11:37 +00:00
Diego Caballero	60f2776b2f	[LV][VPlan] Detect outer loops for explicit vectorization. Patch #2 from VPlan Outer Loop Vectorization Patch Series #1 (RFC: http://lists.llvm.org/pipermail/llvm-dev/2017-December/119523.html). This patch introduces the basic infrastructure to detect, legality check and process outer loops annotated with hints for explicit vectorization. All these changes are protected under the feature flag -enable-vplan-native-path. This should make this patch NFC for the existing inner loop vectorizer. Reviewers: hfinkel, mkuper, rengolin, fhahn, aemerson, mssimpso. Differential Revision: https://reviews.llvm.org/D42447 llvm-svn: 330739	2018-04-24 17:04:17 +00:00
Florian Hahn	ceee788947	[LoopInterchange] Make isProfitableForVectorization slightly more conservative. After D43236, we started interchanging loops with empty dependence matrices. In isProfitableForVectorization, we try to determine if interchanging makes the loop dependences more friendly to the vectorizer. If there are no dependences, we should not interchange, based on that heuristic. Reviewers: efriedma, mcrosier, karthikthecool, blitz.opensource Reviewed By: mcrosier Differential Revision: https://reviews.llvm.org/D45208 llvm-svn: 330738	2018-04-24 16:55:32 +00:00
Simon Pilgrim	f0945aa0e0	[X86][F16C] Add WriteCvtF2FSt scheduling class Fixes the classification of VCVTPS2PHmr/VCVTPS2PHYmr which were tagged as WriteCvtF2FLd_WriteRMW (PR36887) llvm-svn: 330737	2018-04-24 16:43:07 +00:00
Fangrui Song	11b1e8898a	[ADT] Remove ilist_default_traits llvm-svn: 330736	2018-04-24 16:32:55 +00:00
Simon Pilgrim	828ef9e013	[X86][BtVer2] Fix VCVTPS2PHmr/VCVTPS2PHYmr latencies These are stores, not loads, so don't need to account for load latency. llvm-svn: 330735	2018-04-24 16:26:51 +00:00
Simon Pilgrim	f35b8ac196	[X86][IVB] Add F16C resource tests. Note this is IvyBridge (which shares the model) NOT SandyBridge. llvm-svn: 330734	2018-04-24 16:22:59 +00:00
Andrea Di Biagio	0626864fa4	[llvm-mca] Default the output asm dialect used by the instruction printer to the input asm dialect. The instruction printer used by llvm-mca to generate the performance report now defaults the output assembly format to the format used for the input assembly file. On x86, the asm format can be either AT&T or Intel, depending on the presence/absence of directive `.intel_syntax`. Users can still specify a different assembly dialect with the command line flag -output-asm-variant=<uint>. llvm-svn: 330733	2018-04-24 16:19:08 +00:00
Simon Atanasyan	9df3be3ccb	[mips] Show an error if register number is out of range Current code does not check that a register number is in the 0-31 range. Sometimes the parser checks that later for some kinds of instructions, but that leads to unclear / incorrect error messages like that: % cat test.s .text lb $4, 8($32) % llvm-mc test.s -triple=mips64-unknown-linux test.s:2:10: error: expected memory with 16-bit signed offset lb $4, 8($32) ^ Sometimes the parser just crashes: % cat test.s .text lw $4, 8($32) % llvm-mc test.s -triple=mips64-unknown-linux This patch resolves the problem by checking that register number after '$' sign is in the 0-31 range. If the number is out of the range the parser shows the `invalid register number` error, but treats invalid register number as a normal one to continue parsing and catch other possible errors. Differential Revision: https://reviews.llvm.org/D45919 llvm-svn: 330732	2018-04-24 16:14:00 +00:00
Sanjay Patel	510af48e5d	[InstCombine] regenerate checks; NFC The first step in fixing problems raised in D45862 is to make the problems visible. Now we can more easily see/update cases where selects have been turned into multiple instructions with no apparent improvement in analysis or benefits for other passes (vectorization). llvm-svn: 330731	2018-04-24 16:08:03 +00:00
Mark Searles	70901b9047	[AMDGPU][Waitcnt] NFC. Cleanup some code/naming consistency: - s/SWaitcnt/Waitcnt s/WaitCnt/Waitcnt llvm-svn: 330730	2018-04-24 15:59:59 +00:00
Sanjay Patel	f03ec65517	[InstCombine] regenerate checks; NFC The current version of the script uses regex for params. This could mask a bug (param values got wrongly swapped), but it seems unlikely in practice, so let's just update the whole file to reduce diffs when there is a meaningful change here. llvm-svn: 330729	2018-04-24 15:42:30 +00:00
Dan Liew	97a60c21d4	[lit] Remove spurious `-` in invocation of lit in `shtest-xunit-output.py` test. Although there is no `-` file, Jeremy Morse has reported to me that it causes problems in their setup because lit tries to find it and ends up loading an out-of-tree lit configuration file. llvm-svn: 330728	2018-04-24 15:42:00 +00:00
Nico Weber	8c77bf9ec6	Remove LLVM_INSTALL_CCTOOLS_SYMLINKS It used to symlink dsymutil to llvm-dsymutil, but after r327790 llvm's dsymutil binary is now called dsymutil without prefix. r327792 then reversed the direction of the symlink if LLVM_INSTALL_CCTOOLS_SYMLINKS was set, but that looks like a buildfix and not like something anyone should need. https://reviews.llvm.org/D45966 llvm-svn: 330727	2018-04-24 15:41:02 +00:00
David Blaikie	ba47dd16c5	Fix some layering in AggressiveInstCombine (avoiding inclusion of Scalar.h) llvm-svn: 330726	2018-04-24 15:40:07 +00:00
Benjamin Kramer	f85f5da3b2	[LoadStoreVectorize] Ignore interleaved invariant loads. The memory location an invariant load is using can never be clobbered by any store, so it's safe to move the load ahead of the store. Differential Revision: https://reviews.llvm.org/D46011 llvm-svn: 330725	2018-04-24 15:28:47 +00:00
Andrea Di Biagio	27c4b09626	[llvm-mca] Refactor the Scheduler interface in preparation for PR36663. Zero latency instructions are now scheduled the same way as other instructions. Before this patch, there was a specialized code path for those instructions. All scheduler events are now generated from method `scheduleInstruction()` and from method `cycleEvent()`. This will make it easier to implement an "execution stage", and let that stage publish all the scheduler events. No functional change intended. llvm-svn: 330723	2018-04-24 14:53:16 +00:00
Simon Pilgrim	16299273d0	[X86] Remove unnecessary FMA reg-mem InstRW scheduler overrides. llvm-svn: 330720	2018-04-24 14:47:11 +00:00
Ulrich Weigand	497c70fff1	[SystemZ] Use preferred 16-byte function alignment While not necessary for correctness, it is preferable for performance reasons on all architectures we currently support to align functions to 16-byte boundaries by default. llvm-svn: 330718	2018-04-24 14:03:21 +00:00
Simon Pilgrim	d6ceda8451	Fix Wdocumentation warnings. NFCI. llvm-svn: 330716	2018-04-24 13:38:26 +00:00
Simon Pilgrim	23d29250ae	[X86] Fix missing cfi from sitofp checks llvm-svn: 330715	2018-04-24 13:24:56 +00:00
Simon Pilgrim	f7d2a93d5f	[X86] Add vector element insertion/extraction scheduler classes Split off pinsr/pextr and extractps instructions. (Mostly) fixes PR36887. Note: It might be worth adding a WriteFInsertLd class as well in the future. Differential Revision: https://reviews.llvm.org/D45929 llvm-svn: 330714	2018-04-24 13:21:41 +00:00
Simon Pilgrim	2922c102b3	[MC] Remove orphan MCSchedModel::computeReciprocalThroughput declaration. NFCI. llvm-svn: 330713	2018-04-24 13:01:03 +00:00
Alexander Ivchenko	5717fbaf4c	[X86] Replace action Promote with Expand for operation ISD::SINT_TO_FP Summary: If attribute "use-soft-float"="true" is set then X86ISelLowering.cpp sets 'Promote' action for ISD::SINT_TO_FP operation on type i32. But 'Promote' action is not proper in this case since lib function __floatsidf is available for casting from signed int to float type. Thus Expand action is more suitable here. The Expand action should be set for ISD::UINT_TO_FP for soft float as well. If function attribute "use-soft-float"="true" is set then infinite looping can happen in DAG combining, function visitSINT_TO_FP() replaces SINT_TO_FP node with UINT_TO_FP node and function combineUIntToFP() replace vice versa in cycle. The fix prevents it. Patch by vrybalov Differential Revision: https://reviews.llvm.org/D45572 llvm-svn: 330711	2018-04-24 12:57:51 +00:00
Francis Visoiu Mistrih	8ed0f741ae	[CodeGen] Print user-friendly debug locations as MI comments If available, print the file, line and column of the DebugLoc attached to the MachineInstr: MOV16mr $rbp, 1, $noreg, -112, $noreg, killed renamable $ax, debug-location !56 :: (store 2 into %ir.._value12); stepping.swift:10:17 renamable $edx = MOVZX32rm16 $rbp, 1, $noreg, -112, $noreg, debug-location !62 :: (dereferenceable load 2 from %ir.._value13); stepping.swift:10:17 Differential Revision: https://reviews.llvm.org/D45992 llvm-svn: 330709	2018-04-24 11:00:46 +00:00
Chandler Carruth	43acdb35bc	[PM/LoopUnswitch] Fix a bug in the loop block set formation of the new loop unswitch. This code incorrectly added the header to the loop block set early. As a consequence we would incorrectly conclude that a nested loop body had already been visited when the header of the outer loop was the preheader of the nested loop. In retrospect, adding the header eagerly doesn't really make sense. It seems nicer to let the cycle be formed naturally. This will catch crazy bugs in the CFG reconstruction where we can't correctly form the cycle earlier rather than later, and makes the rest of the logic just fall out. I've also added various asserts that make these issues much easier to debug. llvm-svn: 330707	2018-04-24 10:33:08 +00:00
Petar Jovanovic	e2bfcd6394	Correct dwarf unwind information in function epilogue This patch aims to provide correct dwarf unwind information in function epilogue for X86. It consists of two parts. The first part inserts CFI instructions that set appropriate cfa offset and cfa register in emitEpilogue() in X86FrameLowering. This part is X86 specific. The second part is platform independent and ensures that: * CFI instructions do not affect code generation (they are not counted as instructions when tail duplicating or tail merging) * Unwind information remains correct when a function is modified by different passes. This is done in a late pass by analyzing information about cfa offset and cfa register in BBs and inserting additional CFI directives where necessary. Added CFIInstrInserter pass: * analyzes each basic block to determine cfa offset and register are valid at its entry and exit * verifies that outgoing cfa offset and register of predecessor blocks match incoming values of their successors * inserts additional CFI directives at basic block beginning to correct the rule for calculating CFA Having CFI instructions in function epilogue can cause incorrect CFA calculation rule for some basic blocks. This can happen if, due to basic block reordering, or the existence of multiple epilogue blocks, some of the blocks have wrong cfa offset and register values set by the epilogue block above them. CFIInstrInserter is currently run only on X86, but can be used by any target that implements support for adding CFI instructions in epilogue. Patch by Violeta Vukobrat. Differential Revision: https://reviews.llvm.org/D42848 llvm-svn: 330706	2018-04-24 10:32:08 +00:00
Simon Dardis	fce722e6f8	[mips] Correct the patterns for bswap Guard the MIPS64 variant correctly for i64, mark the MIPS32 version as not in microMIPS and provide the microMIPS version. Additionally, remove a related stale XFAIL'd test as bswap has its own test case providing coverage. Reviewers: smaksimovic, abeserminji, atanasyan Differential Revision: https://reviews.llvm.org/D45816 llvm-svn: 330705	2018-04-24 10:19:29 +00:00
Andrea Di Biagio	ef507cb47c	[llvm-mca][CommandGuide] Fix typo in example. llvm-svn: 330703	2018-04-24 10:09:32 +00:00
Andrei Elovikov	822602a75e	[CodeGen] Do not allow opt-bisect-limit to skip ScalarizeMaskedMemIntrin. Summary: The pass is supposed to scalarize such intrinsics if the target does not support them natively, so if the scalarization does not happen instruction selection crashes due to inability to lower these intrinsics. Reviewers: andrew.w.kaylor, craig.topper Reviewed By: andrew.w.kaylor Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45947 llvm-svn: 330700	2018-04-24 09:24:29 +00:00
Max Kazantsev	c54e67d6b9	[NFC] Remove recently added SE verification because it may be false-positive llvm-svn: 330699	2018-04-24 09:11:01 +00:00
Florian Hahn	f3561ffa87	[LoopInfo] Verify BBMap tracks innermost loops for BBs. By checking that none of the child loops contain a BB we make sure BBMap contains the innermost loop defining BB. This invariant was violated in LoopInterchange and got caught by this assertion. Reviewers: chandlerc, mzolotukhin, sanjoy, mehdi_amini, efriedma Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D45971 llvm-svn: 330698	2018-04-24 09:10:05 +00:00
Sander de Smalen	eb1053f9d3	[AArch64][SVE] Asm: Support for contiguous, first-faulting LDFF1 (scalar+scalar) load instructions. Reviewers: fhahn, rengolin, samparker, SjoerdMeijer, t.p.northover, echristo, evandro, javed.absar Reviewed By: rengolin Subscribers: tschuett, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45946 llvm-svn: 330697	2018-04-24 08:59:08 +00:00
Roman Lebedev	68e0daedcf	Link to AggressiveInstCombine in a few places. Unbreaks build for me. /usr/local/bin/ld.lld: error: undefined symbol: llvm::createAggressiveInstCombinerPass() >>> referenced by cc1_main.cpp >>> tools/clang/tools/driver/CMakeFiles/clang.dir/cc1_main.cpp.o:(_GLOBAL__sub_I_cc1_main.cpp) And so on The bot coverage is clearly missing. llvm-svn: 330693	2018-04-24 08:40:37 +00:00
Pavel Labath	60b7132a7c	[Support/Path] Add more tests and improve failure messages of existing ones Summary: I am preparing a patch to the path function. While working on it, I noticed that some of the areas are lacking test coverage (e.g. filename and parent_path functions), so I add more tests to guard against regressions there. I have also found the failure messages hard to understand, so I rewrote some existing test to give more actionable messages when they fail: - for tests which run over multiple inputs, I use SCOPED_TRACE, to show which of the inputs caused the actual failure. - for comparisons of vectors, I use gmock's container matchers, which will print out the full container contents (and the elements that differ) when they fail to match. Reviewers: zturner, espindola Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45941 llvm-svn: 330691	2018-04-24 08:29:20 +00:00
Xin Tong	adb5bfe75b	[LVI] Fix typo. NFC llvm-svn: 330688	2018-04-24 07:38:07 +00:00
Alexander Shaposhnikov	3326e785b2	[llvm-objcopy] Adjust the code for the old versions of msvc Follow-up for r330685. llvm-svn: 330686	2018-04-24 06:23:22 +00:00
Alexander Shaposhnikov	d6884795d0	Recommit "[llvm-objcopy] Switch over to using TableGen for parsing arguments" Add explicit dependency on ObjcopyTableGen and rerun the tests on Windows. I will double-check the build bots and revert this commit if necessary. llvm-svn: 330685	2018-04-24 05:43:32 +00:00
Max Kazantsev	7790d6cbff	[NFC] Use FileCheck in test llvm-svn: 330684	2018-04-24 04:42:37 +00:00
Max Kazantsev	30dee7874d	[NFC] Use forgetTopmostLoop instead of logic duplication llvm-svn: 330683	2018-04-24 04:33:04 +00:00
Craig Topper	19b85103a3	[X86] Add a BSWAP16 instruction using the 32-bit encoding plus a 0x66 prefix. This encoding is recognized by the CPU, but the behavior is undefined. This makes the disassembler handle it correctly so we don't print bswapl with a 16-bit register. llvm-svn: 330682	2018-04-24 04:28:02 +00:00
Chandler Carruth	0ace148ca6	[PM/LoopUnswitch] Remove another over-aggressive assert. This code path can very clearly be called in a context where we have baselined all the cloned blocks to a particular loop and are trying to handle nested subloops. There is no harm in this, so just relax the assert. I've added a test case that will make sure we actually exercise this code path. llvm-svn: 330680	2018-04-24 03:27:00 +00:00
Eric Christopher	b9733d0f7c	Remove unused function HexagonEarlyIfConversion::replacePhiEdges. NFC. llvm-svn: 330678	2018-04-24 02:10:59 +00:00
Max Kazantsev	5a0a40b8cb	[NFC] Add clarification comment llvm-svn: 330677	2018-04-24 02:08:05 +00:00
Eric Christopher	24004d65a5	Reflow formatting after previous NFC commit. llvm-svn: 330676	2018-04-24 01:57:03 +00:00
Eric Christopher	29ff50454c	Change if-conditionals to else-if as they should all be mutually exclusive. No functional change intended. llvm-svn: 330675	2018-04-24 01:57:02 +00:00
Nico Weber	1c2c321b47	Mostly revert r330672. The test is apparently needed e.g. for check-cfi on Windows where we get 'C:/b/slave/sanitizer-windows/build/./bin/clang.exe': command not found without it. Try to fix the problem that was fixed by r330672 by also checking for isabs() instead. llvm-svn: 330673	2018-04-24 01:24:42 +00:00
Nico Weber	a425db5a1e	Remove code that's almost always dead, and harmful if not. lit's util.which() would check if the passed-in path existed directly, and if so return it as-is. This is never the case when running llvm's, clang's, or lld's tests normally. But when running `./llvm-lit path/to/clang/test` with a cwd of llvm-build/bin, this if would detect that clang exists at path 'clang' and return 'clang' as the discovered clang binary -- and then lit would use the " clang " -> "* Do not use 'clang' in tests, use '%clang'. *" substitution to replace that with a broken test. By removing this early return, lit ends up with the usual absolute path and everything works even in this uncommon case. llvm-svn: 330672	2018-04-24 01:05:04 +00:00
David Blaikie	ce84e44a4b	Fix build breaks in examples due to moving stuff from Scalar.h to InstCombine.h llvm-svn: 330670	2018-04-24 00:58:57 +00:00
David Blaikie	a27771b62f	InstCombine: Fix layering by not including Scalar.h in InstCombine (notionally Scalar.h is part of libLLVMScalarOpts, so it shouldn't be included by InstCombine which doesn't/shouldn't need to depend on ScalarOpts) llvm-svn: 330669	2018-04-24 00:48:59 +00:00
Craig Topper	1bcb258ba3	[AggressiveInstCombine] Add aggressive inst combiner to the LLVM C API. I just tried to copy what was done for regular InstCombine. Hopefully I didn't miss anything. llvm-svn: 330668	2018-04-24 00:39:29 +00:00
George Burgess IV	8e807bf3fa	Reland r301880(!): "[InstSimplify] Handle selects of GEPs with 0 offset" I was reminded today that this patch got reverted in r301885. I can no longer reproduce the failure that caused the revert locally (...almost one year later), and the patch applied pretty cleanly, so I guess we'll see if the bots still get angry about it. The original breakage was InstSimplify complaining (in "assertion failed" form) about getting passed some crazy IR when running `ninja check-sanitizer`. I'm unable to find traces of what, exactly, said crazy IR was. I suppose we'll find out pretty soon if that's still the case. :) Original commit: Author: gbiv Date: Mon May 1 18:12:08 2017 New Revision: 301880 URL: http://llvm.org/viewvc/llvm-project?rev=301880&view=rev Log: [InstSimplify] Handle selects of GEPs with 0 offset In particular (since it wouldn't fit nicely in the summary): (select (icmp eq V 0) P (getelementptr P V)) -> (getelementptr P V) Differential Revision: https://reviews.llvm.org/D31435 llvm-svn: 330667	2018-04-24 00:25:01 +00:00
Alex Shlyapnikov	909fb12f0c	[HWASan] Use dynamic shadow memory on Android only (LLVM) There're issues with IFUNC support on other platforms. Differential Revision: https://reviews.llvm.org/D45840 llvm-svn: 330665	2018-04-24 00:16:54 +00:00
Craig Topper	ab7578460b	[AggressiveInstCombine] Add createAggressiveInstCombinerPass to LinkAllPasses.h. llvm-svn: 330664	2018-04-24 00:11:04 +00:00
Craig Topper	d4eb2073b7	[AggressiveInstCombine] Add library initializer routine for AggressiveInstCombine library. Use it in bugpoint and llvm-opt-fuzzer to match regular InstCombine. This should make aggressive instcombine usable with these tools. llvm-svn: 330663	2018-04-24 00:05:21 +00:00
Simon Pilgrim	e5e4bf02d6	[X86] Remove unnecessary vector memory folded InstRW overrides. We have test coverage for these with resources-sse/avx llvm-svn: 330662	2018-04-23 22:45:04 +00:00
Robert Widmann	21fc15d553	[LLVM-C] DIBuilder Bindings For Variable Expressions Summary: Add DIBuilder bindings for (global) variable expressions, variable value expressions, and debug value intrinsic insertion. Reviewers: harlanhaskins, deadalnix, whitequark Reviewed By: whitequark Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45979 llvm-svn: 330661	2018-04-23 22:31:49 +00:00
Roman Tereshin	3c6ea7e28c	[GlobalISel][Legalizer] Look thro copies while combining G_UNMERGE's As we're becoming stricter w/ respect to not allowing vregs having LLTs and regclasses assigned both mid-globalisel pipeline, the number of extra copies grows, some of which separate G_UNMERGE's from their corresponding G_MERGE's, becoming a performance concern. It's worth mentioning that we're already looking through copies while combining legalization artifacts for every kind of artifact but G_UNMERGE. Reviewed By: aditya_nandakumar Reviewers: ab, t.p.northover, volkan, javed.absar Subscribers: rovka, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45644 llvm-svn: 330660	2018-04-23 22:28:36 +00:00
Simon Pilgrim	eb6090941c	[X86] Remove unnecessary BMI2 InstRW overrides. We have test coverage for these with resources-bmi2.s llvm-svn: 330659	2018-04-23 22:19:55 +00:00
Florian Hahn	7441818560	[LoopInterchange] Do not change LI for BBs in child loops. If a loop with child loops becomes our new inner loop after interchanging, we only need to update LoopInfo for the blocks defined in the old outer loop. BBs in child loops will stay there. Reviewers: efriedma, karthikthecool, mcrosier Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D45970 llvm-svn: 330653	2018-04-23 21:38:19 +00:00
Simon Pilgrim	ed09ebb48d	[X86] Remove unnecessary WriteLEA InstRW overrides. llvm-svn: 330648	2018-04-23 21:04:23 +00:00
Roman Lebedev	95c6eaf530	[DAGCombiner] Unfold scalar masked merge if profitable Summary: This is [[ https://bugs.llvm.org/show_bug.cgi?id=37104 \| PR37104 ]]. [[ https://bugs.llvm.org/show_bug.cgi?id=6773 \| PR6773 ]] will introduce an IR canonicalization that is likely bad for the end assembly. Previously, `andl`+`andn`/`andps`+`andnps` / `bic`/`bsl` would be generated. (see `@out`) Now, they would no longer be generated (see `@in`). So we need to make sure that they are still generated. If the mask is constant, we do nothing. InstCombine should have unfolded it. Else, i use `hasAndNot()` TLI hook. For now, only handle scalars. https://rise4fun.com/Alive/bO6 ---- I really don't like the code i wrote in `DAGCombiner::unfoldMaskedMerge()`. It is super fragile. Is there something like IR Pattern Matchers for this? Reviewers: spatel, craig.topper, RKSimon, javed.absar Reviewed By: spatel Subscribers: andreadb, courbet, kristof.beyls, javed.absar, rengolin, nemanjai, llvm-commits Differential Revision: https://reviews.llvm.org/D45733 llvm-svn: 330646	2018-04-23 20:38:49 +00:00
Roman Lebedev	bf18cc56d3	[X86][AArch64][NFC] Add tests for masked merge unfolding Summary: This is [[ https://bugs.llvm.org/show_bug.cgi?id=37104 \| PR37104 ]]. [[ https://bugs.llvm.org/show_bug.cgi?id=6773 \| PR6773 ]] will introduce an IR canonicalization that is likely bad for the end assembly. Previously, `andl`+`andn`/`andps`+`andnps` / `bic`/`bsl` would be generated. (see `@out`) Now, they would no longer be generated (see `@in`). I'm guessing `llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp` should be able to unfold this. Reviewers: spatel, craig.topper, RKSimon, javed.absar Reviewed By: spatel Subscribers: nemanjai, rengolin, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45563 llvm-svn: 330645	2018-04-23 20:38:42 +00:00
Sanjay Patel	fa8f5ad9f3	[AggressiveInstCombine] add tests for PR37098; NFC I'm not sure if this is where we should try to fold these patterns inspired by: https://bugs.llvm.org/show_bug.cgi?id=37098 ...if this isn't the right place, we can move the tests. llvm-svn: 330642	2018-04-23 20:20:32 +00:00
Xin Tong	8edff27923	[CallSiteSplit] Make sure we remove nonnull if the parameter turns out to be a constant. Summary: We do not need the nonnull attribute if we know an argument is going to be constant. Reviewers: junbuml, davide, fhahn Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45608 llvm-svn: 330641	2018-04-23 20:09:08 +00:00
Gabor Buella	1a2ce572bf	[X86] Revert r330638 - accidental commit llvm-svn: 330640	2018-04-23 20:05:51 +00:00
George Burgess IV	851e3a1792	Fix a broken typedef; NFCI Richard Smith noted that `typedef typename iplist::iplist_impl_type iplist_impl_type` is incorrect, per http://eel.is/c++draft/basic.scope#class-2 It seems that neither clang nor gcc get too angry about this, but a newer version of msvc does. Thanks to jcmac on IRC for pointing this out! llvm-svn: 330639	2018-04-23 20:03:00 +00:00
Gabor Buella	213a7cda1f	[X86] movdiri and movdir64b instructions Reviewers: craig.topper llvm-svn: 330638	2018-04-23 20:00:59 +00:00
Bjorn Pettersson	8e484dc531	[MemCpyOpt] Skip optimizing basic blocks not reachable from entry Summary: Skip basic blocks not reachable from the entry node in MemCpyOptPass::iterateOnFunction. Code that is unreachable may have properties that do not exist for reachable code (an instruction in a basic block can for example be dominated by a later instruction in the same basic block, for example if there is a single block loop). MemCpyOptPass::processStore is only safe to use for reachable basic blocks, since it may iterate past the basic block beginning when used for unreachable blocks. By simply skipping to optimize unreachable basic blocks we can avoid asserts such as "Assertion `!NodePtr->isKnownSentinel()' failed." in MemCpyOptPass::processStore. The problem was detected by fuzz tests. Reviewers: eli.friedman, dneilson, efriedma Reviewed By: efriedma Subscribers: efriedma, llvm-commits Differential Revision: https://reviews.llvm.org/D45889 llvm-svn: 330635	2018-04-23 19:55:04 +00:00
Sam Clegg	2322a93709	[WebAssembly] MC: Refactor section creation code Remove the use of default argument in favor of a separate startCustomSection method. Differential Revision: https://reviews.llvm.org/D45794 llvm-svn: 330632	2018-04-23 19:16:19 +00:00
Quentin Colombet	e6c12abd15	[CODE_OWNERS] Update my email address. NFC llvm-svn: 330631	2018-04-23 19:09:49 +00:00
Peter Collingbourne	5ab4a4793e	Reland r329956, "AArch64: Introduce a DAG combine for folding offsets into addresses.", with a fix for the bot failure. This reland includes a check to prevent the DAG combiner from folding an offset that is smaller than the existing one. This can cause oscillations between two possible DAGs, which was the cause of the hang and later assertion failure observed on the lnt-ctmark-aarch64-O3-flto bot. http://green.lab.llvm.org/green/job/lnt-ctmark-aarch64-O3-flto/2024/ Original commit message: > This is a code size win in code that takes offseted addresses > frequently, such as C++ constructors that typically need to compute > an offseted address of a vtable. This reduces the size of Chromium > for Android's .text section by 108KB. Differential Revision: https://reviews.llvm.org/D45199 llvm-svn: 330630	2018-04-23 19:09:34 +00:00
Daniel Neilson	cc45e923c5	[DSE] Teach the pass that atomic memory intrinsics are stores. Summary: This change teaches DSE that the atomic memory intrinsics are stores that can be eliminated, and can allow other stores to be eliminated. This change specifically does not teach DSE that these intrinsics can be partially eliminated (i.e. length reduced, and dest/src changed); that will be handled in another change. Reviewers: mkazantsev, skatkov, apilipenko, efriedma, rsmith Reviewed By: efriedma Subscribers: dmgreen, llvm-commits Differential Revision: https://reviews.llvm.org/D45535 llvm-svn: 330629	2018-04-23 19:06:49 +00:00
Matthew Simpson	3fd67df3f8	[AArch64] Add cost model test case for transpose This patch adds a cost model test case for vector shuffles having transpose masks. The given costs are inaccurate and will be updated in a follow-on patch. llvm-svn: 330625	2018-04-23 18:21:29 +00:00
Alex Shlyapnikov	a2b4f9b4d4	[HWASan] Switch back to fixed shadow mapping for x86-64 For now switch back to fixed shadow mapping for x86-64 due to the issues with IFUNC linking on older binutils. More details will be added to https://bugs.chromium.org/p/chromium/issues/detail?id=835864 Differential Revision: https://reviews.llvm.org/D45840 llvm-svn: 330623	2018-04-23 18:14:39 +00:00
Craig Topper	ff5ddf75e3	[X86] Add disassembler test cases for bswap. This demonstrates a bug where the encoding for a 16-bit bswap prints a 16-bit register and a 32-bit mnemonic. Intel docs say 16-bit bswap is undefined. We should either claim it as an invalid encoding or we should print a 16-bit mnemonic. objdump does print the encoding as bswap with a 16-bit register. But it doesn't seem to ever print a suffix. llvm-svn: 330621	2018-04-23 17:47:33 +00:00
Vedant Kumar	f17720633b	[SelectionDAG] Dump debug locs in SDNodes This helps debug issues where selection-dag assigns the wrong location to an instruction. Differential Revision: https://reviews.llvm.org/D45913 llvm-svn: 330618	2018-04-23 17:18:24 +00:00
Simon Pilgrim	7f31119fb0	[MC] Remove MachineInstr reference in MC layer (PR37160) Only add support for getSchedInfoStr(const MachineInstr &MI) at the TargetSubtargetInfo level. Really, the getSchedInfoStr calls need to be removed entirely, we should just return a latency/rthroughput through the subtarget and keep a string creation helper function somewhere else. llvm-svn: 330615	2018-04-23 16:59:06 +00:00
Reid Kleckner	e160d51b42	Fix -Wtautological-compare warning with npos on Windows llvm-svn: 330614	2018-04-23 16:47:27 +00:00
Matt Arsenault	b21f9592be	AMDGPU: Move a flawed assert when spilling SGPRs It's possible to validly spill the frame offset register in a call sequence to a VGPR. There are definitely issues with SGPR spilling to memory, so move the assert later. llvm-svn: 330612	2018-04-23 16:13:30 +00:00
Simon Pilgrim	8cd01aaa0f	[X86] Replace x87 instregex with instrs if they only match one instruction llvm-svn: 330611	2018-04-23 16:10:50 +00:00

163427 Commits