llvm-project

Commit Graph

Author	SHA1	Message	Date
Chandler Carruth	9281503e8f	[PM/LoopUnswitch] Fix how the cloned loops are handled when updating analyses. Summary: I noticed this issue because we didn't put the primary cloned loop into the `NonChildClonedLoops` vector and so never iterated on it. Once I fixed that, it made it clear why I had to do a really complicated and unnecessary dance when updating the loops to remain in canonical form -- I was unwittingly working around the fact that the primary cloned loop wasn't in the expected list of cloned loops. Doh! Now that we include it in this vector, we don't need to return it and we can consolidate the update logic as we correctly have a single place where it can be handled. I've just added a test for the iteration order aspect as every time I changed the update logic partially or incorrectly here, an existing test failed and caught it so that seems well covered (which is also evidenced by the extensive working around of this missing update). Reviewers: asbirlea, sanjoy Subscribers: mcrosier, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D47647 llvm-svn: 333811	2018-06-02 01:29:01 +00:00
Vedant Kumar	4a2798c934	Remove the test from r333801 In r333801 I added a test for a dump method that, for reasons I don't understand, fails on an msvc bot: http://lab.llvm.org:8011/builders/clang-x86-windows-msvc2015/builds/12306/ I'll remove the test for now to unblock the bot and try to look into why there's a discrepancy on this platform later. llvm-svn: 333807	2018-06-02 00:05:17 +00:00
Roman Tereshin	cf88ffaaf9	[DebugInfo] Refactoring DIType::setFlags to DIType::cloneWithFlags, NFC and using the latter in DIBuilder::createArtificialType and DIBuilder::createObjectPointerType methods as well as introducing mirroring DISubprogram::cloneWithFlags and DIBuilder::createArtificialSubprogram methods. The primary goal here is to add createArtificialSubprogram to support a pass downstream while keeping the method consistent with the existing ones and making sure we don't encourage changing already created DI-nodes. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D47615 llvm-svn: 333806	2018-06-01 23:15:09 +00:00
Chris Bieneman	4b3701a7a7	Revert "Re-land: [MachO] Fixing ub in MachO BinaryFormat" This reverts commit r333803. Still breaking on big endian. Will sort this out later. llvm-svn: 333805	2018-06-01 23:09:37 +00:00
Craig Topper	3828ce7eab	[X86] Do something sensible when an expand load intrinsic is passed a 0 mask. Previously we just returned undef, but really we should be returning the pass thru input. We also need to make sure we preserve the chain output that the original intrinsic node had to maintain connectivity in the DAG. So we should just return the incoming chain as the output chain. llvm-svn: 333804	2018-06-01 22:59:07 +00:00
Chris Bieneman	44e272d440	Re-land: [MachO] Fixing ub in MachO BinaryFormat This re-lands r333797 with a fix for big endian systems. Original commit message: This isn't encountered anywhere inside LLVM, so I wrote a test case to expose the issue and verify that it is fixed. The basic problem is that the macho_load_command union contains all load command structs. Load command structs in 32-bit macho files can be 32-bit aligned instead of 64-bit aligned. There are some strange circumstances in which this can be exposed in a 64-bit macho if the load commands are invalid or if a 32-bit aligned load command is used. In the past we've worked around this type of problem with changes like r264232. llvm-svn: 333803	2018-06-01 22:52:59 +00:00
Vedant Kumar	7224c08141	Add a debug dump for DbgValueHistoryMap This makes it easier to inspect the results of DbgValueHistoryCalculator. Differential Revision: https://reviews.llvm.org/D47663 llvm-svn: 333801	2018-06-01 22:33:15 +00:00
Craig Topper	aa747412b1	[X86] Add isel patterns to use vexpand with zero masking when the passthru value is a zero vector. llvm-svn: 333800	2018-06-01 22:28:28 +00:00
Chris Bieneman	52b2cc5dab	Revert "[MachO] Fixing ub in MachO BinaryFormat" This reverts commit r333797. This patch is failing on BigEndian bots. I will fix and re-land: http://lab.llvm.org:8011/builders/clang-ppc64be-linux/builds/19505/ llvm-svn: 333799	2018-06-01 22:28:23 +00:00
Zachary Turner	b44d7a0da1	Move some function declarations out of WindowsSupport.h The idea behind WindowsSupport.h is that it's in the source directory so that windows.h'isms don't leak out into the larger LLVM project. To that end, any symbol that references a symbol from windows.h must be in this private header, and not in a public header. However, we had some useful utility functions in WindowsSupport.h which have no dependency on the Windows API, but still only make sense on Windows. Those functions should be usable outside of Support since there is no risk of causing a windows.h leak. Although this introduces some preprocessor logic in some header files, it's not too egregious and it's better than the alternative of duplicating a ton of code. Differential Revision: https://reviews.llvm.org/D47662 llvm-svn: 333798	2018-06-01 22:23:46 +00:00
Chris Bieneman	c8a3c86c77	[MachO] Fixing ub in MachO BinaryFormat This isn't encountered anywhere inside LLVM, so I wrote a test case to expose the issue and verify that it is fixed. The basic problem is that the macho_load_command union contains all load command structs. Load command structs in 32-bit macho files can be 32-bit aligned instead of 64-bit aligned. There are some strange circumstances in which this can be exposed in a 64-bit macho if the load commands are invalid or if a 32-bit aligned load command is used. In the past we've worked around this type of problem with changes like r264232. llvm-svn: 333797	2018-06-01 22:07:36 +00:00
Craig Topper	c45479c08e	[X86] Expand the testing of expand and compress intrinsics The avx512f intrinsic tests were in the avx512vl file. We were also missing some combinations of masking. This does show that we fail to use the zero masking form of expand loads when the passthru is zero. I'll try to get that fixed shortly. llvm-svn: 333795	2018-06-01 21:59:24 +00:00
Craig Topper	d7e11ee342	[X86] Add fast-isel tests for avx512vbmi2 instructions. llvm-svn: 333794	2018-06-01 21:59:22 +00:00
Karl-Johan Karlsson	6d52e5c3e4	[ConstantFold] Disallow folding vector geps into bitcasts Summary: Getelementptr returns a vector of pointers, instead of a single address, when one or more of its arguments is a vector. In such case it is not possible to simplify the expression by inserting a bitcast of operand(0) into the destination type, as it will create a bitcast between different sizes. Reviewers: majnemer, mkuper, mssimpso, spatel Reviewed By: spatel Subscribers: lebedev.ri, llvm-commits Differential Revision: https://reviews.llvm.org/D46379 llvm-svn: 333783	2018-06-01 19:34:35 +00:00
Sanjay Patel	66f7e19f6a	[InstCombine] fix vector shuffle transform to replace undef elements (PR37648) This bug: https://bugs.llvm.org/show_bug.cgi?id=37648 ...was created with the enhancement to this transform with rL332479. The urem test shows the disaster potential: any undef divisor lane makes the whole op undef. The test diffs show that vector demanded elements turns some of the potential, but not all, unused binop operands back into undef already. llvm-svn: 333782	2018-06-01 19:23:18 +00:00
Sanjay Patel	3883fcd0c0	[InstCombine] add tests for broken shuffle transform (PR37648) llvm-svn: 333779	2018-06-01 18:52:38 +00:00
Simon Atanasyan	e80c3ce9cc	[mips] Support 64-bit offsets for lb/sb/ld/sd/lld ... instructions The `MipsAsmParser::loadImmediate` can load immediates of various sizes into a register. Idea of this change is to use `loadImmediate` in the `MipsAsmParser::expandMemInst` method to load offset into a register and then call required load/store instruction. The patch removes separate `expandLoadInst` and `expandStoreInst` methods and does everything in the `expandMemInst` method to escape code duplication. Differential Revision: https://reviews.llvm.org/D47316 llvm-svn: 333774	2018-06-01 16:37:53 +00:00
Simon Atanasyan	3a44bcf95a	[mips] Extend list of relocations supported by the `.reloc` directive Supporting GOT and TLS related relocations by the `.reloc` directive is useful for purpose of testing various tools like a linker, for example. llvm-svn: 333773	2018-06-01 16:37:42 +00:00
Paul Semel	46201fb7bc	[llvm-objcopy] Fix null symbol handling This fixes the bug where strip-all option was leading to a malformed outputted ELF file. Differential Revision: https://reviews.llvm.org/D47414 llvm-svn: 333772	2018-06-01 16:19:46 +00:00
Krzysztof Parzyszek	bc68385dad	[Hexagon] Avoid UB when shifting unsigned integer left by 32 llvm-svn: 333771	2018-06-01 15:39:10 +00:00
Sanjay Patel	2896c773eb	[LangRef] fix typo; NFC llvm-svn: 333770	2018-06-01 15:21:14 +00:00
Vlad Tsyrklevich	6867ab7c90	[ThinLTOBitcodeWriter] Emit summaries for regular LTO modules Summary: Emit summaries for bitcode modules that are only destined for the regular LTO portion of the build so they can participate in summary-based dead stripping. This change reduces the size of a nacl_helper build with cfi-icall enabled by 7%, removing the majority of the overhead due to enabling cfi-icall. The cfi-icall size increase was caused by compiling in lots of unused code and cfi-icall generating jumptable references to unused symbols that could no longer be removed by -Wl,-gc-sections. Increasing the visibility of summary-based dead stripping prevented jumptable entries being created for unused symbols from the regular LTO portion of the build. Reviewers: pcc Reviewed By: pcc Subscribers: dschuff, mehdi_amini, inglorion, eraman, llvm-commits, kcc Differential Revision: https://reviews.llvm.org/D47594 llvm-svn: 333768	2018-06-01 15:20:47 +00:00
Karl-Johan Karlsson	b60b920a8c	[ConstantFold] Add lit testcase for bitcast problem. NFC llvm-svn: 333767	2018-06-01 15:08:14 +00:00
Nirav Dave	fc9a700f94	[DAG] Avoid checking for consecutive stores in store merge. NFCI. llvm-svn: 333766	2018-06-01 15:05:55 +00:00
Nirav Dave	39ece11ae5	[DAG] Simplify Expression. NFC. llvm-svn: 333765	2018-06-01 15:05:30 +00:00
Nirav Dave	0fc27acaa2	[DAG] Remove untriggerable check. NFCI. Candidate check precludes this check. llvm-svn: 333764	2018-06-01 15:05:05 +00:00
Nirav Dave	a74921a696	[DAG] Prune store merge legal store check to stop invalid size. NFCI. Do not consider store sizes larger than the maximum legal store size. llvm-svn: 333763	2018-06-01 15:04:40 +00:00
Krzysztof Parzyszek	aec2c0c9b6	[Hexagon] Select HVX code for vector CTPOP, CTLZ, and CTTZ llvm-svn: 333760	2018-06-01 14:52:58 +00:00
Clement Courbet	6eb680a40d	[llvm-exegesis] Fix off-by-one in llvm-exegesis documentation. llvm-svn: 333759	2018-06-01 14:49:06 +00:00
Sanjay Patel	284fe7a8ec	[InstCombine] add baseline test for bug with div+select transform (D47576) llvm-svn: 333756	2018-06-01 14:39:05 +00:00
Andrea Di Biagio	bdc670611b	[llvm-mca] Move the logic that computes the block throughput into Support.h. NFC This will allow us to share the logic that computes the block throughput with other views. llvm-svn: 333755	2018-06-01 14:35:21 +00:00
Hiroshi Inoue	9796b47df1	[NFC] Zero initialize local variables This patch makes local variables zero initialized to avoid broken values in debug output. llvm-svn: 333754	2018-06-01 14:23:15 +00:00
Clement Courbet	df79e79e22	[llvm-exegesis] Analysis: Display idealized sched class port pressure. Summary: Screenshot in phabricator diff. Reviewers: gchatelet Subscribers: mgorny, tschuett, mgrang, llvm-commits Differential Revision: https://reviews.llvm.org/D47329 llvm-svn: 333753	2018-06-01 14:18:02 +00:00
Krzysztof Parzyszek	0b6187c1a9	[SelectionDAG] Expand UADDO/USUBO into ADD/SUBCARRY if legal for target Additionally, implement handling of ADD/SUBCARRY on Hexagon, utilizing the UADDO/USUBO expansion. Differential Revision: https://reviews.llvm.org/D47559 llvm-svn: 333751	2018-06-01 14:00:32 +00:00
Alexander Ivchenko	b34afcec5d	[x86] NFC. Reautogenerate test/CodeGen/X86/vector-half-conversions.ll llvm-svn: 333750	2018-06-01 13:51:53 +00:00
Simon Pilgrim	ee7694442d	[Utils][X86] Help update_llc_test_checks.py to recognise retl/retq to reduce CHECK duplication (PR35003) This patch replaces the --x86_extra_scrub command line argument to automatically support a second level of regex-scrubbing if it improves the matching of nearly-identical code patterns. The argument '--extra_scrub' is there now to force extra matching if required. This is mostly useful to help us share 32-bit/64-bit x86 vector tests which only differ by retl/retq instructions, but any scrubber can now technically support this, meaning test checks don't have to be needlessly obfuscated. I've updated some of the existing checks that had been manually run with --x86_extra_scrub, to demonstrate the extra "ret{{[l|q]}}" scrub now only happens when useful, and re-run the sse42-intrinsics file to show extra matches - most sse/avx intrinsics files should be able to now share 32/64 checks. Tested with the opt/analysis scripts as well which share common code - AFAICT the other update scripts use their own versions. Differential Revision: https://reviews.llvm.org/D47485 llvm-svn: 333749	2018-06-01 13:37:01 +00:00
Amaury Sechet	8467411dad	Set ADDE/ADDC/SUBE/SUBC to expand by default Summary: They've been deprecated in favor of UADDO/ADDCARRY or USUBO/SUBCARRY for a while. Targets that use these opcodes are changed in order to ensure their behavior doesn't change. Reviewers: efriedma, craig.topper, dblaikie, bkramer Subscribers: jholewinski, arsenm, jyknight, sdardis, nemanjai, nhaehnle, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, jordy.potman.lists, apazos, sabuasal, niosHD, jrtc27, zzheng, edward-jones, mgrang, atanasyan, llvm-commits Differential Revision: https://reviews.llvm.org/D47422 llvm-svn: 333748	2018-06-01 13:21:33 +00:00
Amara Emerson	5a3bb68e12	[AArch64][GlobalISel] Zero-extend s1 values when returning. Before we were relying on the any extend of the s1 to s32, but for AAPCS we need to zero-extend it to at least s8. Fixes PR36719 Differential Revision: https://reviews.llvm.org/D47425 llvm-svn: 333747	2018-06-01 13:20:32 +00:00
Florian Hahn	8a17f1f43e	Revert r333740: [IPSCCP] Use PredicateInfo to propagate facts from cmp. This is breaking the clang-with-thin-lto-ubuntu bot. llvm-svn: 333745	2018-06-01 12:58:43 +00:00
Sander de Smalen	f95ea047e5	[AArch64][SVE] Asm: Support for FDUP_ZI (copy fp immediate) instruction. Unpredicated copy of floating-point immediate value into SVE vector, along with MOV-aliases. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D47482 llvm-svn: 333744	2018-06-01 12:54:46 +00:00
Simon Dardis	351aa594f6	[mips] Guard more aliases correctly. Also, duplicate an alias for microMIPS. llvm-svn: 333741	2018-06-01 10:57:13 +00:00
Florian Hahn	f4df554f32	Recommit r333268: [IPSCCP] Use PredicateInfo to propagate facts from cmp instructions. This patch updates IPSCCP to use PredicateInfo to propagate facts to true branches predicated by EQ and to false branches predicated by NE. As a follow up, we should be able to extend it to also propagate additional facts about nonnull. Reviewers: davide, mssimpso, dberlin, efriedma Reviewed By: davide, dberlin Differential Revision: https://reviews.llvm.org/D45330 llvm-svn: 333740	2018-06-01 10:48:54 +00:00
Simon Dardis	54217598b6	[mips] Guard 'nop' properly and add mips16's nop instruction Reviewers: smaksimovic, atanasyan, abeserminji Differential Revision: https://reviews.llvm.org/D47583 llvm-svn: 333739	2018-06-01 10:46:00 +00:00
Pavel Labath	d6ca063907	DWARFAcceleratorTable: Add an iterator-based api for accessing names in the index Summary: Back when we were introducing the DWARF v5 name index, there was a short discussion whether we shouldn't have a nicer api for iterating over the index. At that time, I did not find it necessary since the iteration over names was done only from within the index itself (and I figured the internal implementation can deal with a slightly rough interface). However, now I ran into a use for this kind of API in LLDB (for finding all names matching a regular expression), so it looked like a nice opportunity to introduce one. To make the API more useful, I've made the NameTableEntry class a bit smarter: it now stores the string section reference (so it can return its name) and its position in the name index (mainly useful for dumping/logging). I also convert the internal users to use the new API, which also gives test coverage for the added code. Reviewers: JDevlieghere, aprantl, dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47590 llvm-svn: 333738	2018-06-01 10:33:11 +00:00
Simon Dardis	ee67dcb837	[mips] Select the correct instruction for computing frameindexes Reviewers: smaksimovic, atanasyan, abeserminji Differential Revision: https://reviews.llvm.org/D47582 llvm-svn: 333736	2018-06-01 10:07:10 +00:00
Gabor Buella	27c96d3d20	NFC Avoid a warning in WasmEHPrepare.cpp ``` ../lib/CodeGen/WasmEHPrepare.cpp:166:30: warning: extra ‘;’ [-Wpedantic] false, false); ^ ``` llvm-svn: 333732	2018-06-01 07:47:46 +00:00
Sander de Smalen	97ca6b9e09	[AArch64][SVE] Asm: Support for DUPM (masked immediate) instruction. Unpredicated copy of repeating immediate pattern to SVE vector, along with MOV-aliases. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D47328 llvm-svn: 333731	2018-06-01 07:25:46 +00:00
Matt Arsenault	72a9f52c87	AMDGPU: Switch some half using-tests to use amdhsa The default clover ABI weirdly promotes half to float, which should probably be fixed. llvm-svn: 333730	2018-06-01 07:06:03 +00:00
Craig Topper	c3cf55b935	[X86][Disassembler] Make it an error to set EVEX.R' to 0 when modrm.reg encodes a GPR. This is different than the behavior of EVEX.X extending modrm.rm to 5 bits. llvm-svn: 333728	2018-06-01 06:11:29 +00:00
Craig Topper	0838c4d6bc	[X86][Disassembler] Ignore EVEX.X extension of modrm.rm to 5-bits when modrm.rm encodes a k-register. llvm-svn: 333727	2018-06-01 05:36:08 +00:00
Daniel Cederman	d72b9fd141	Implemented sane default for llvm-objdump's relocation Value format Summary: "Unknown" for platforms that were not manually added into the switch did not make sense at all. Now it prints Target + addend for all elf-machines that were not explicitly mentioned. Addresses PR21059 and PR25124. Original author: fedor.sergeev Reviewers: jyknight, espindola, fedor.sergeev Reviewed By: jyknight Subscribers: eraman, dcederman, jfb, dschuff, aheejin, llvm-commits Differential Revision: https://reviews.llvm.org/D36464 llvm-svn: 333726	2018-06-01 05:31:58 +00:00
Craig Topper	74a61b02e0	[X86][Disassembler] Clamp index to 4-bits when decoding GPR registers. A 5-bit value can occur when EVEX.X is 0 due to it being used to extend modrm.rm to encode XMM16-31. But if modrm.rm instead encodes a GPR, the Intel documentation says EVEX.X should be ignored so just mask it to 4 bits once we know it's a GPR. llvm-svn: 333725	2018-06-01 05:12:44 +00:00
Craig Topper	1a00b0ac27	[X86] Add a test case showing a bad disassembling of an EVEX instruction with EVEX.X=0 and a GPR encoded in modrm.rm. EVEX.X is used to extend modrm.rm when the instruction encodes an XMM/YMM/ZMM register. But we aren't properly ignoring it when it encodes a GPR and we end up printing whatever registers exist in X86 register enum after the GPRs. llvm-svn: 333724	2018-06-01 05:12:43 +00:00
Craig Topper	5b1dd01e57	[X86][Disassembler] Make sure EVEX.X is not used to extend base registers of memory operations. This was an accidental side effect of EVEX.X being used to encode XMM16-XMM31 using modrm.rm with modrm.mod==0x3. I think there are still more bugs related to this. llvm-svn: 333722	2018-06-01 04:29:34 +00:00
Craig Topper	c6b2c2bb70	[X86][Disassembler] Use a local variable instead of using a field in the instruction object. NFC llvm-svn: 333721	2018-06-01 04:29:30 +00:00
Tom Stellard	e43778895c	AMDGPU/R600: Move intrinsics to IntrinsicsAMDGPU.td Reviewers: arsenm, nhaehnle, jvesely Reviewed By: arsenm Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D47487 llvm-svn: 333720	2018-06-01 02:19:46 +00:00
Craig Topper	dc5ba1e495	[X86] Make sure the check for VEX.vvvv being all ones on instructions that don't use it doesn't ignore a bit in 32-bit mode. llvm-svn: 333717	2018-06-01 01:23:52 +00:00
Craig Topper	0179c6d0e5	[X86][Disassembler] Suppress reading of EVEX.V' and EVEX.R' in 32-bit mode. llvm-svn: 333714	2018-06-01 00:10:36 +00:00
Craig Topper	b9c2e8cc01	[X86] Add test cases showing the disassembler producing an xmm16-xmm31 register in 32-bit mode. We aren't properly suppressing the reading of VEX.R' and VEX.V' in 32-bit mode. llvm-svn: 333713	2018-06-01 00:10:32 +00:00
Heejin Ahn	d69acf3b4c	Change ambiguous uses of term 'funclet' to 'EH scopes'. NFC. Summary: `getEHScopeMembership()` function is used not only for funclet-based EHs; they apply to all EH schemes that use the scoped IR (catchpad/cleanuppad/...). D47005 (rL333045) changed some of the uses of the term 'funclet' to 'EH scopes' in case they apply to all scoped EH, and this fixes more of them. For `FuncletLayout` pass, I left it as is because the pass is only used for funclet-based EH. Reviewers: majnemer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47611 llvm-svn: 333711	2018-06-01 00:03:21 +00:00
Dan Gohman	91ab25bbe3	[WebAssembly] Update to the new names for the memory intrinsics. The WebAssembly committee has decided on the names `memory.size` and `memory.grow` for the memory intrinsics, so update the LLVM intrinsics to follow those names, keeping both sets of old names in place for compatibility. llvm-svn: 333708	2018-05-31 22:35:25 +00:00
Sanjay Patel	affe450db7	[LoopVectorize, x86] add tests to show missing SVML transforms; NFC llvm-svn: 333707	2018-05-31 22:31:02 +00:00
Dan Gohman	b17de645ea	[WebAssembly] Fix the signatures for the __mulo* libcalls. The __mulo* libcalls have an extra i32* to return the overflow value. Fixes PR37401. llvm-svn: 333706	2018-05-31 22:27:24 +00:00
Heejin Ahn	5ef4d5f9c1	[WebAssembly] Support instruction selection for catching exceptions Summary: This lowers exception catching-related instructions: 1. Lowers `wasm.catch` intrinsic to `catch` instruction 2. Removes `catchpad` and `cleanuppad` instructions; they are not necessary after isel phase. (`MachineBasicBlock::isEHFuncletEntry()` or `MachineBasicBlock::isEHPad()` can be used instead.) 3. Lowers `catchret` and `cleanupret` instructions to pseudo `catchret` and `cleanupret` instructions in isel, which will be replaced with other instructions in `WebAssemblyExceptionPrepare` pass. 4. Adds `WebAssemblyExceptionPrepare` pass, which is for running various transformations for EH. Currently this pass only replaces `catchret` and `cleanupret` instructions into appropriate wasm instructions to make this patch successfully run until the end. Currently this does not handle lowering of intrinsics related to LSDA info generation (`wasm.landingpad.index` and `wasm.lsda`), because they cannot be tested without implementing `EHStreamer`'s wasm-specific handlers. They are marked as TODO, which is needed to make isel pass. Also this does not generate `try` and `end_try` markers yet, which will be handled in later patches. This patch is based on the first wasm EH proposal. (https://github.com/WebAssembly/exception-handling/blob/master/proposals/Exceptions.md) Reviewers: dschuff, majnemer Subscribers: jfb, sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D44090 llvm-svn: 333705	2018-05-31 22:25:54 +00:00
Craig Topper	9a6c0bdcbd	[LoopIdiomRecognize] Only convert loops to ctlz if we can prove that the input is non-negative. Summary: Loop idiom recognize tries to convert loops like ``` int foo(int x) { int cnt = 0; while (x) { x >>= 1; ++cnt; } return cnt; } ``` into calls to ctlz, but if x is initially negative this loop should be infinite. It happens that the cases that motivated this change have an absolute value of x before the loop. So this patch restricts the transform to cases where we know x is positive. Note: We are relying on the absolute value of INT_MIN to be undefined so we can assume that the result is always positive. Fixes PR37479 Reviewers: spatel, hfinkel, efriedma, javed.absar Reviewed By: efriedma Subscribers: dmgreen, llvm-commits Differential Revision: https://reviews.llvm.org/D47348 llvm-svn: 333702	2018-05-31 22:16:55 +00:00
Heejin Ahn	99d60e0dab	[WebAssembly] Add Wasm exception handling prepare pass Summary: This adds a pass that transforms a program to be prepared for Wasm exception handling. This is using Windows EH instructions and based on the previous Wasm EH proposal. (https://github.com/WebAssembly/exception-handling/blob/master/proposals/Exceptions.md) Reviewers: dschuff, majnemer Subscribers: jfb, mgorny, sbc100, jgravelle-google, JDevlieghere, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D43746 llvm-svn: 333696	2018-05-31 22:02:34 +00:00
Sanjay Patel	2ae1ab30e3	[LoopVectorize, x86] regenerate checks; NFC I removed the 'fast' flag from the calls because that's not required. llvm-svn: 333695	2018-05-31 21:30:36 +00:00
Alexander Shaposhnikov	ecc84834b7	[llvm-strip] Add -o option to llvm-strip This diff implements the option -o for specifying a file to write the output to. Test plan: make check-all Differential revision: https://reviews.llvm.org/D47505 llvm-svn: 333693	2018-05-31 20:42:13 +00:00
Andrea Di Biagio	4037011404	[llvm-mca] Fixed a problem caused by an invalid use of a processor resource mask in the Scheduler. The lambda functions used by method ResourceManager::mustIssueImmediately() was incorrectly truncating masks of buffered processor resources to 32-bit quantities. The invalid mask values were then used to access a map of processor resource descriptors. Fixes PR37643. llvm-svn: 333692	2018-05-31 20:27:46 +00:00
Stanislav Mekhanoshin	739174c4be	[AMDGPU] Construct memory clauses before RA Memory clauses are formed into bundles in presence of xnack. Their source operands are marked as early-clobber. This allows to allocate distinct source and destination registers within a clause and prevent breaking the clause with s_nop in the hazard recognizer. Clauses are undone before post-RA scheduler to allow some rescheduling, which will not break the clause since artificial edges are created in the dag to keep memory operations together. Yet this allows a better ILP in some cases. Differential Revision: https://reviews.llvm.org/D47511 llvm-svn: 333691	2018-05-31 20:13:51 +00:00
Sanjay Patel	26368cd5d9	[InstCombine] narrow select to match condition operands' size This is the planned enhancement to D47163 / rL333611. We want to match cmp/select sizes because that will be recognized as min/max more easily and lead to better codegen (especially for vector types). As mentioned in D47163, this improves some of the tests that would also be folded by D46380, so we may want to adjust that patch to match the new patterns where the extend op occurs after the select. llvm-svn: 333689	2018-05-31 19:55:27 +00:00
Stanislav Mekhanoshin	7137f609f4	[AMDGPU] Fixed incorrect -mcpu=gfx800 in xnor.ll test. NFC. llvm-svn: 333687	2018-05-31 19:39:54 +00:00
Aditya Nandakumar	2980b01995	[GISel]: Pattern matchers for GFSUB, GFNEG https://reviews.llvm.org/D47547 Add matching templates for G_FSUB, and G_FNEG. Reviewed by: aemerson. llvm-svn: 333685	2018-05-31 19:30:01 +00:00
Lang Hames	6fe6616c47	[ORC] Add a getRequestedSymbols method to MaterializationResponsibility. This method returns the set of symbols in the target VSO that have queries waiting on them. This can be used to make decisions about which symbols to delegate to another MaterializationUnit (typically this will involve delegating all symbols that have not been requested to another MaterializationUnit so that materialization of those symbols can be deferred until they are requested). llvm-svn: 333684	2018-05-31 19:29:03 +00:00
Lang Hames	d3a76f5bbc	[ORC] Rename IRMaterializationUnit's Discardable member to SymbolToDefinition, and make it protected rather than private. The new name reflects the actual information in the map, and this information can be useful to derived classes (for example, to quickly look up the IR definition of a requested symbol). llvm-svn: 333683	2018-05-31 19:29:01 +00:00
Sanjay Patel	dfbe6b49f0	[InstCombine] regenerate checks; NFC llvm-svn: 333682	2018-05-31 19:25:02 +00:00
Peter Collingbourne	3aa30e8062	IRGen: Write .dwo files when -split-dwarf-file is used together with -fthinlto-index. Differential Revision: https://reviews.llvm.org/D47597 llvm-svn: 333677	2018-05-31 18:25:59 +00:00
Sriraman Tallam	d10c4e07f5	Relax GOTPCREL relocations for tail jmp instructions. Differential Revision: https://reviews.llvm.org/D47563 llvm-svn: 333676	2018-05-31 18:12:33 +00:00
Craig Topper	c9a4c6208b	[JumpThreading] Fix some strange formatting of code inside LLVM_DEBUG. NFC I don't know if clang-format got confused here or what. llvm-svn: 333675	2018-05-31 18:08:11 +00:00
Artem Dergachev	3260b00d48	[ADT] Annotate immutable list/set/map update methods with LLVM_NODISCARD. Because immutable data structures are, well, immutable, methods like "append", "add", "set" create a copy of the list (set, map) instead of mutating the existing map. If the updated object is discarded, it clearly indicates a bug. Such bugs are introduced frequently, hence the warn_unused_result annotation. Differential Revision: https://reviews.llvm.org/D47496 llvm-svn: 333672	2018-05-31 17:32:29 +00:00
Jonas Devlieghere	745918ff87	[ADT] Make escaping fn conform to coding guidelines As noted by Adrian on llvm-commits, PrintHTMLEscaped and PrintEscaped in StringExtras did not conform to the LLVM coding guidelines. This commit rectifies that. llvm-svn: 333669	2018-05-31 17:01:42 +00:00
David Bolvansky	5430b73755	[SimplifyLibcalls] [NFC] Cleanup, improvements Summary: * Use "find('%')" instead of loop to find '%' char (we already use find('%') in optimizePrintFString..) * Convert getParent() chains to getModule()/getFunction() Reviewers: lebedev.ri, spatel Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47397 llvm-svn: 333668	2018-05-31 16:39:27 +00:00
Francis Visoiu Mistrih	90aba024c5	[MC] Fallback on DWARF when generating compact unwind on AArch64 Instead of asserting when using the def_cfa directive with a register different from fp, fallback on DWARF. Easily triggered with: .cfi_def_cfa x1, 32; rdar://40249694 Differential Revision: https://reviews.llvm.org/D47593 llvm-svn: 333667	2018-05-31 16:33:26 +00:00
Roman Tereshin	f34d7ecc15	[GlobalISel][Mips] LegalizerInfo verifier: Adding LegalizerInfo::verify(...) call for Mips Reviewers: aemerson, qcolombet Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D46339 llvm-svn: 333665	2018-05-31 16:16:49 +00:00
Roman Tereshin	76c29c68dc	[GlobalISel][AMDGPU] LegalizerInfo verifier: Adding LegalizerInfo::verify(...) call for AMDGPU Reviewers: aemerson, qcolombet Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D46339 llvm-svn: 333664	2018-05-31 16:16:48 +00:00
Roman Tereshin	667c7581ed	[GlobalISel][ARM] LegalizerInfo verifier: Adding LegalizerInfo::verify(...) call and fixing bugs exposed Reviewers: aemerson, qcolombet Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D46339 llvm-svn: 333663	2018-05-31 16:16:48 +00:00
Roman Tereshin	cc1a16fdf9	[GlobalISel][X86] LegalizerInfo verifier: Adding LegalizerInfo::verify(...) call and fixing bugs exposed Reviewers: aemerson, qcolombet Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D46339 llvm-svn: 333662	2018-05-31 16:16:47 +00:00
Simon Pilgrim	ff0623cd29	[X86][SSE] Recognise splat rotations and expand back to shift ops. Noticed while fixing PR37426, for splat rotations (rotation by a uniform value) it's better to just expand back to shift ops than performing as a general non-uniform rotation. llvm-svn: 333661	2018-05-31 15:47:17 +00:00
Simon Pilgrim	c34395d889	[X86][AVX] Add peekThroughEXTRACT_SUBVECTORs helper (NFCI) We often need this for AVX1 128-bit integer ops as they may have been split from a 256-bit source. llvm-svn: 333660	2018-05-31 15:15:49 +00:00
Aditya Kumar	7ef72ded57	make GlobalValueSummary::getOriginalName() a const function Differential Revision: https://reviews.llvm.org/D46962 Reviewers: craig.topper llvm-svn: 333659	2018-05-31 15:15:33 +00:00
David Green	2911b3a07a	[DA] Fix direction vectors for weakZeroSrcSIV Both weakZeroSrcSIV and weakZeroDstSIV are currently giving the same direction vectors. Fix weakZeroSrcSIVtest by flipping the directions it gives. Differential Revision: https://reviews.llvm.org/D46678 llvm-svn: 333658	2018-05-31 14:55:29 +00:00
Clement Courbet	2e41c5a79c	[X86] Introduce WriteFLDC for x87 constant loads. Summary: {FLDL2E, FLDL2T, FLDLG2, FLDLN2, FLDPI} were using WriteMicrocoded. - I've measured the values for Broadwell, Haswell, SandyBridge, Skylake. - For ZnVer1 and Atom, values were transferred from InstRWs. - For SLM and BtVer2, I've guessed some values :( Reviewers: RKSimon, craig.topper, andreadb Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D47585 llvm-svn: 333656	2018-05-31 14:22:01 +00:00
Nico Weber	ca5a16f131	Use -Wextra spelling instead of -W No difference in behavior, but a bit easier to search for. https://reviews.llvm.org/D47490 llvm-svn: 333651	2018-05-31 13:41:04 +00:00
Andrea Di Biagio	be8616f5f2	[MCSchedule] Add the ability to compute the latency and throughput information for MCInst. This patch extends the MCSchedModel API with new methods that can be used to obtain the latency and reciprocal throughput information for an MCInst. Scheduling models have recently gained the ability to resolve variant scheduling classes associated with MCInst objects. Before, models were only able to resolve a variant scheduling class from a MachineInstr object. This patch is mainly required by D47374 to avoid regressing a pair of x86 specific -print-schedule tests for btver2. Patch D47374 introduces a new variant class to teach the btver scheduling model (x86 target) how to correctly compute the latency profile for some zero-idioms using the new scheduling predicates. The new methods added by this patch would be mainly used by llc when flag -print-schedule is specified. In particular, tests that contain inline assembly require that code is parsed at code emission stage into a sequence of MCInst. That forces the print-schedule functionality to query the latency/rthroughput information for MCInst instructions too. If we don't expose this new API, then we lose "-print-schedule" test coverage as soon as variant scheduling classes are added to the x86 models. The tablegen SubtargetEmitter changes teach how to query latency profile information using an object that derives from TargetSubtargetInfo. Note that this should really have been part of r333286. To avoid code duplication, the logic that "resolves" variant scheduling classes for MCInst has been moved to a common place in MC. That logic is used by the "resolveVariantSchedClass" methods redefined in override by the tablegen'd GenSubtargetInfo classes. Differential Revision: https://reviews.llvm.org/D47536 llvm-svn: 333650	2018-05-31 13:30:42 +00:00
Benjamin Kramer	0deb9a9a1f	Extend the GlobalObject metadata interface - Make eraseMetadata return whether it changed something - Wire getMetadata for a single MDNode efficiently into the attachment map - Add hasMetadata, which is less weird than checking getMetadata == nullptr on a multimap. Use it to simplify code. llvm-svn: 333649	2018-05-31 13:29:58 +00:00
Simon Dardis	d9a453832d	[mips] Guard all short instructions correctly. Reviewers: smaksimovic, atanasyan, abeserminji Differential Revision: https://reviews.llvm.org/D47533 llvm-svn: 333645	2018-05-31 12:47:01 +00:00
Alexandros Lamprineas	61f0ba1fcc	[InstCombine, ARM] Convert vld1 to llvm load Convert a vector load intrinsic into an llvm load instruction. This is beneficial when the underlying object being addressed comes from a constant, since we get constant-folding for free. Differential Revision: https://reviews.llvm.org/D46273 llvm-svn: 333643	2018-05-31 12:19:18 +00:00
Clement Courbet	b78ab5097d	[X86] Extract latency of fldz/fld1 in separate classes. Summary: - I've measured the values for Broadwell, Haswell, SandyBridge, Skylake. - For ZnVer1 and Atom, values were transferred from `InstRW`s. - For SLM and BtVer2, values are from Agner. This is split off from https://reviews.llvm.org/D47377 Reviewers: RKSimon, andreadb Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D47523 llvm-svn: 333642	2018-05-31 11:41:27 +00:00
Simon Pilgrim	346886bc0d	[X86][SSE] Add support for detecting SUB(SPLAT_BV, SPLAT) cases for shift-rotate patterns. This improves splat rotations (rotation by a uniform value), to avoid having to use the generic non-uniform shift code (extension to PR37426). llvm-svn: 333641	2018-05-31 11:25:16 +00:00
Pavel Labath	59870af66f	DWARFAcceleratorTable: fix equal_range iterators Summary: Both (Apple and DWARF5) implementations of the iterators had bugs which resulted in crashes if one attempted to iterate through the accelerator tables all the way. For the Apple tables, the issue was that we did not clear the DataOffset field when we reached the end, which made our iterator compare unequal to the "end" iterator. For the Dwarf5 tables, the problem was that we incremented the CurrentIndex pointer and then used the incremented (possibly invalid) pointer to check whether we have reached the end of the index list. The reason these bugs went undetected is because their only user (dwarfdump) only ever searched for the first match. Besides allowing us to test this fix, changing llvm-dwarfdump --find to display all matches seems like a good improvement (it makes the behavior consistent with the --name option), so I change llvm-dwarfdump to do that. The existing tests would be sufficient to test this fix with the new llvm-dwarfdump behavior, but I add a special test that demonstrates that the tool indeed displays multiple results. The find.test test needed to be tweaked a bit as the tool now does not print the ".debug_info contents" header (also consistent with how --name works). Reviewers: JDevlieghere, aprantl, dblaikie Subscribers: mgrang, llvm-commits Differential Revision: https://reviews.llvm.org/D47543 llvm-svn: 333635	2018-05-31 08:47:00 +00:00
Luke Geeson	2e09995d42	[AArch64] Reverted rL333427 fixing Clang UnitTest Failure llvm-svn: 333634	2018-05-31 08:27:53 +00:00
Max Kazantsev	0bad5be430	[NFC] Factor out a method for further extension llvm-svn: 333633	2018-05-31 08:08:34 +00:00
Roman Lebedev	71d4afb90a	[llvm-exegesis][NFCI] Counter::Counter(): more useful msg on event open error Summary: I'm slowly looking into a new X86 scheduler model, for AMD Bulldozer CPU, model 2 (bdver2, Piledriver). And naturally, i have hit that assert :) I happened to know what it meant, and how to fix it, but that is not too common knowledge. Reviewers: courbet, RKSimon Reviewed By: courbet Subscribers: tschuett, llvm-commits, craig.topper Differential Revision: https://reviews.llvm.org/D47572 llvm-svn: 333632	2018-05-31 07:08:26 +00:00
Roman Lebedev	c0ecd06428	Revert rL333106 / D46814: [InstCombine] Fold unfolded masked merge pattern with variable mask! In post-commit review, Eric Christopher notes that many new MSan warnings are being observed with this patch. The probable reason is: if 'y' is undef here and we could evaluate it twice and get different results. We can't increase the number of uses of a value. llvm-svn: 333631	2018-05-31 06:00:36 +00:00
Joel E. Denny	44ee15f34f	[lit] Fix windows cmd.exe test config for r333620 llvm-svn: 333630	2018-05-31 05:48:33 +00:00
Stanislav Mekhanoshin	d4b500cb08	[AMDGPU] Track occupancy in MFI Keep track of achieved occupancy in SIMachineFunctionInfo. At the moment we have a lot of duplicated or even missed code to query and maintain occupancy info. Record it in the MFI and query in a single call. Interfaces: - getOccupancy() - returns current recorded achieved occupancy. - getMinAllowedOccupancy() - returns lesser of the achieved occupancy and the lowest occupancy we are ready to tolerate. For example if a kernel is memory bound we are ready to tolerate 4 waves. - limitOccupancy() - record occupancy level if we have to lower it. - increaseOccupancy() - record occupancy if scheduler managed to increase the occupancy. MFI takes care of integrating different checks affecting occupancy, including LDS use and waves-per-eu attribute. Note that scheduler starts with not yet known register pressure, so has to record either limit or increase in occupancy after it is done. Later passes can just query a resulting value. New interface is used in the active scheduler and NFC wrt its work. Changes are also made to experimental schedulers to use it and record an occupancy after they are done. Before the change waves-per-eu was ignored by experimental schedulers and tolerance window for memory bound kernels was not used. Differential Revision: https://reviews.llvm.org/D47509 llvm-svn: 333629	2018-05-31 05:36:04 +00:00
Jan Vesely	f5016b79a6	AMDGPU/R600: Make sure functions are cacheline aligned v2: use "ensureAlignment" make functions cache line aligned Fixes GPU hangs since r333219: "AMDGPU: Split R600 AsmPrinter code into its own class" Differential Revision: https://reviews.llvm.org/D47516 llvm-svn: 333622	2018-05-31 04:08:08 +00:00
Joel E. Denny	fc01dd281d	[lit] Terminate ": RUN at line N" with ";" not "&&" This fixes projects/compiler-rt/test/fuzzer/sigusr.test, which was broken by r333614. The trouble was that "&&" changes the command for which "$!" gives the pid. llvm-svn: 333620	2018-05-31 03:40:37 +00:00
Roman Tereshin	5952576de5	[GlobalISel][Legalizer] LegalizerInfo verifier: Making LegalizerInfo::verify(...) errors fatal Reviewers: aemerson, qcolombet Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D46339 llvm-svn: 333619	2018-05-31 01:56:07 +00:00
Roman Tereshin	5a65eb75c7	[GlobalISel][AArch64] LegalizerInfo verifier: Fixing bugs exposed by LegalizerInfo::verify(...) Reviewers: aemerson, qcolombet Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D46339 llvm-svn: 333618	2018-05-31 01:56:05 +00:00
Joel E. Denny	31b373963f	[lit] Report line number for failed RUN command (Relands r333584, reverted in 333592.) When debugging test failures with -vv (or -v in the case of the internal shell), this makes it easier to locate the RUN line that failed. For example, clang's test/Driver/linux-ld.c has 892 total RUN lines, and clang's test/Driver/arm-cortex-cpus.c has 424 RUN lines after concatenation for line continuations. When reading the generated shell script, this also makes it easier to locate the RUN line that produced each command. To support reporting RUN line numbers in the case of the internal shell, this patch extends the internal shell to support the null command, ":", except pipelines are not supported. To support reporting RUN line numbers in the case of windows cmd.exe as the external shell, this patch extends -vv to set "echo on" instead of "echo off" in bat files. (Support for windows cmd.exe as a lit external shell will likely be dropped later, but I found out too late.) Reviewed By: delcypher, asmith, stella.stamenova, jmorse, lebedev.ri, rnk Differential Revision: https://reviews.llvm.org/D44598 llvm-svn: 333614	2018-05-31 00:55:32 +00:00
Sanjay Patel	e5bc441791	[InstCombine] don't change the size of a select if it would mismatch its condition operands' sizes Don't always: cast (select (cmp x, y), z, C) --> select (cmp x, y), (cast z), C' This is something that came up as far back as D26556, and I lost track of it. I suspect that this transform is part of the underlying problem that is inspiring some of the recent proposals that seek to match larger patterns that include a cast op. Even if that's not true, this transform causes problems for codegen (particularly with vector types). A transform to actively match the size of cmp and select operand sizes should follow. This patch just removes the harmful canonicalization in the other direction. Differential Revision: https://reviews.llvm.org/D47163 llvm-svn: 333611	2018-05-31 00:16:58 +00:00
Sanjay Patel	ceb595b04e	[InstCombine] don't negate constant expression with fsub (PR37605) X + (-C) would be transformed back into X - C, so infinite loop: https://bugs.llvm.org/show_bug.cgi?id=37605 llvm-svn: 333610	2018-05-30 23:55:12 +00:00
Vedant Kumar	e3c1fb8b12	[llvm-cov] Use the new PrintHTMLEscaped utility This removes some duplicate logic to escape characters in HTML output. llvm-svn: 333608	2018-05-30 23:35:14 +00:00
Tom Stellard	c7624317d7	AMDGPU: Split AMDGPUTTI into GCNTTI and R600TTI Reviewers: arsenm, nhaehnle Reviewed By: arsenm Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D47359 llvm-svn: 333605	2018-05-30 22:55:35 +00:00
Vlad Tsyrklevich	178fdb1a3b	[LowerTypeTests] Discard extern_weak linkage for definitions Summary: Fix PR37625. It's possible for an extern_weak declaration to be emitted to the merged module when a definition exists in the ThinLTO portion of the build; discard the linkage on the declaration in that case. (otherwise we copy the linkage to the alias to the jumptable and fail) Reviewers: pcc Reviewed By: pcc Subscribers: mehdi_amini, llvm-commits, kcc Differential Revision: https://reviews.llvm.org/D47494 llvm-svn: 333604	2018-05-30 22:39:52 +00:00
George Burgess IV	485762ccba	[NewGVN] Fix set comparison; reflow comment Looks like we intended to compare this->Members with Other->Members here, but ended up comparing this->Members with this->Members. Oops. :) Since CongruenceClass::Members is a SmallPtrSet anyway, we can probably skip building std::sets if we're willing to write a bit more code. This appears to be no functional change (for sufficiently lax values of "no"): this equality check was only being called inside of an assert. So, worst case, we'll catch more bugs in the form of assertion failures. Thanks to d0k for noting this! llvm-svn: 333601	2018-05-30 22:24:08 +00:00
Roman Tereshin	8f1753e994	[GlobalISel][AArch64] LegalizerInfo verifier: Adding LegalizerInfo::verify(...) call w/o fixing bugs This is to make it clear what kind of bugs the LegalizerInfo::verifier is able to catch and test its output Reviewers: aemerson, qcolombet Reviewed By: aemerson Differential Revision: https://reviews.llvm.org/D46338 llvm-svn: 333597	2018-05-30 22:10:04 +00:00
Joel E. Denny	71792c741e	Revert r333584: [lit] Report line number for failed RUN command It breaks test-suite. llvm-svn: 333592	2018-05-30 21:07:27 +00:00
Florian Hahn	75e87c3f2a	[TableGen] Avoid leaking TreePatternNodes by using shared_ptr. By using std::shared_ptr for TreePatternNode, we can avoid leaking them. Reviewers: craig.topper, dsanders, stoklund, tstellar, zturner Reviewed By: dsanders Differential Revision: https://reviews.llvm.org/D47463 llvm-svn: 333591	2018-05-30 21:00:18 +00:00
Jonas Devlieghere	50603518a0	[ADT] Add unit test for PrintHTMLEscaped Add unit tests for PrintHTMLEscaped which was added in r333565. llvm-svn: 333590	2018-05-30 20:47:18 +00:00
Daniel Neilson	936d50aeea	[IRBuilder] Add APIs for creating calls to atomic memmove and memset intrinsics. (NFC) Summary: Creating the IRBuilder methods: CreateElementUnorderedAtomicMemSet CreateElementUnorderedAtomicMemMove These mirror the methods that create calls to the regular (non-atomic) memmove and memset intrinsics. llvm-svn: 333588	2018-05-30 20:02:56 +00:00
Simon Pilgrim	159bd7444e	Fix Wdocumentation warning. NFCI. llvm-svn: 333586	2018-05-30 19:50:26 +00:00
Joel E. Denny	b6423479a1	[lit] Report line number for failed RUN command (Relands r330755 (reverted in r330848) with fix for PR37239.) When debugging test failures with -vv (or -v in the case of the internal shell), this makes it easier to locate the RUN line that failed. For example, clang's test/Driver/linux-ld.c has 892 total RUN lines, and clang's test/Driver/arm-cortex-cpus.c has 424 RUN lines after concatenation for line continuations. When reading the generated shell script, this also makes it easier to locate the RUN line that produced each command. To support reporting RUN line numbers in the case of the internal shell, this patch extends the internal shell to support the null command, ":", except pipelines are not supported. To support reporting RUN line numbers in the case of windows cmd.exe as the external shell, this patch extends -vv to set "echo on" instead of "echo off" in bat files. (Support for windows cmd.exe as a lit external shell will likely be dropped later, but I found out too late.) Reviewed By: delcypher, asmith, stella.stamenova, jmorse, lebedev.ri, rnk Differential Revision: https://reviews.llvm.org/D44598 llvm-svn: 333584	2018-05-30 19:42:27 +00:00
Benjamin Kramer	c8bd5449e0	[CalledValuePropagation] Just use a sorted vector instead of a set. The set properties are never used, so a vector is enough. No functionality change intended. While there add some std::moves to SparseSolver. llvm-svn: 333582	2018-05-30 19:31:11 +00:00
Peter Collingbourne	1651ac13be	llvm-objcopy: Set sh_link to 0 on unrecognized symtab-linked sections. Per discussion on the generic-abi mailing list: https://groups.google.com/forum/#!topic/generic-abi/MPr8TVtnVn4 An object file manipulation tool must either write out a symbol table with the same number of entries as the original symbol table and in the same order, or if this is impossible, refuse to operate on the object file if it has unrecognized sections that are linked to the symtab section. However, existing tools (namely GNU strip, GNU objcopy and ld.{bfd,gold,lld} -r) do not comply with this at present: they change symbol table indexes and set sh_link to 0 on the unrecognized symtab-linked sections. We intend to use the latter as a (temporary) signal that a tool has operated on a proposed new symtab-linked section and invalidated the symbol table indexes. However, llvm-objcopy currently keeps sh_link pointing to the new symtab section. This patch changes llvm-objcopy to set sh_link to 0 to match the behaviour of the other tools. Differential Revision: https://reviews.llvm.org/D47404 llvm-svn: 333581	2018-05-30 19:30:39 +00:00
Simon Pilgrim	5e9f459c62	[X86][SSE] Pulled out splat detection helper from LowerScalarVariableShift (NFCI) Created the IsSplatValue helper from the splat detection code in LowerScalarVariableShift as a first NFC step towards improving support for splat rotations, which is an extension of PR37426. llvm-svn: 333580	2018-05-30 19:16:59 +00:00
Galina Kistanova	df917811ca	Reverted r333424 as it broke multiple build bots and left unfixed for a long time llvm-svn: 333578	2018-05-30 18:51:08 +00:00
Roman Tereshin	5404136d06	[GlobalISel][Legalizer] LegalizerInfo verifier: check rules cover type indices This commit adds a simple verifier that tracks type indices being touched by legalization rules' builders. Every target will now have an opportunity to call LegalizerInfo::verify(...) at the end of its derived LegalizerInfo's constructor and check there are no obvious mistakes like checking only first type for an opcode that has more than one type index and therefore implicitly declaring any type for the second (and higher) type index legal. The check is only run in assert builds and should have very minor performance impact in assert builds and none in release builds. This commit does not add LegalizerInfo::verify(...) calls to target-specific legalizers, look for separate commits for that. This commit also doesn't make the verification errors fatal, only produces an error message, look for a later commit that does. Reviewers: aemerson, qcolombet Reviewed By: aemerson Differential Revision: https://reviews.llvm.org/D46338 llvm-svn: 333576	2018-05-30 18:45:32 +00:00
Craig Topper	2fbbffa4bf	[X86] Update the fast-isel tests for _mm_rcp_ss, _mm_rsqrt_ss, and _mm_sqrt_ss to match clang codegen after r333572. llvm-svn: 333573	2018-05-30 18:30:44 +00:00
Jonas Devlieghere	f4ce54a123	[dsymutil] Escape HTML special characters in plist. When printing string in the Plist, we weren't escaping the characters which lead to invalid XML. This patch adds the escape logic to StringExtras. rdar://39785334 llvm-svn: 333565	2018-05-30 17:47:11 +00:00
Roman Tereshin	4e4cc6f508	[GlobalISel][Legalizer] NFC mostly reducing LegalizeRuleSet's methods' inter-dependencies Making LegalizeRuleSet's implementation a little more dumb and straightforward to make it easier to read and change, in particular in order to add the initial version of LegalizerInfo verifier Reviewers: aemerson, qcolombet Reviewed By: aemerson Differential Revision: https://reviews.llvm.org/D46338 llvm-svn: 333562	2018-05-30 16:54:01 +00:00
Simon Pilgrim	3173f73554	[X86][AVX512BW] Fixed check prefix copy+paste typo in avx512bw-intrinsics.ll Prefix was for AVX512F instead of AVX512BW llvm-svn: 333560	2018-05-30 16:29:06 +00:00
Mark Searles	ed54ff1d51	[AMDGPU][Waitcnt] Fix build error: unused variable 'SWaitInst' https://reviews.llvm.org/rL333556 caused a buildbot failure. See http://lab.llvm.org:8011/builders/lld-x86_64-darwin13/builds/21876/steps/build_Lld/logs/stdio /Users/buildslave/as-bldslv9/lld-x86_64-darwin13/llvm.src/lib/Target/AMDGPU/SIInsertWaitcnts.cpp:2007:10: error: unused variable 'SWaitInst' [-Werror,-Wunused-variable] auto SWaitInst = BuildMI(EntryBB, EntryBB.getFirstNonPHI(), The unused variable was for debugging purposes; removing that piece of code to fix the build. llvm-svn: 333559	2018-05-30 16:27:57 +00:00
Matt Arsenault	7b4826e6ce	AMDGPU: Use better alignment for kernarg lowering This was just emitting loads with the ABI alignment for the raw type. The true alignment is often better, especially when an illegal vector type was scalarized. The better alignment allows using a scalar load more often. llvm-svn: 333558	2018-05-30 16:17:51 +00:00
Karl-Johan Karlsson	ebaaa2ddae	[ValueTracking] Fix endless recursion in isKnownNonZero() Summary: The isKnownNonZero() function has checks that abort the recursion when it reaches the specified max depth. However one of the recursive calls was placed before the max depth check was done, resulting in an endless recursion that eventually triggered a segmentation fault. Fixed the problem by moving the max depth check above the first recursive call. Reviewers: Prazek, nlopes, spatel, craig.topper, hfinkel Reviewed By: hfinkel Subscribers: hfinkel, bjope, llvm-commits Differential Revision: https://reviews.llvm.org/D47531 llvm-svn: 333557	2018-05-30 15:56:46 +00:00
Mark Searles	1054541490	[AMDGPU][Waitcnt] Fix handling of loops with many bottom blocks In terms of waitcnt insertion/if necessary, the waitcnt pass forces convergence for a loop. Previously, that kicked if greater than 2 passes over a loop, which doesn't account for loop with many bottom blocks. So, increase the threshold to (n+1), where n is the number of bottom blocks. This gives the pass an opportunity to consider the contribution of each bottom block, to the overall loop, before the forced convergence potentially kicks in. Differential Revision: https://reviews.llvm.org/D47488 llvm-svn: 333556	2018-05-30 15:47:45 +00:00
Gabor Buella	890e363e11	[X86] Lowering FMA intrinsics to native IR (LLVM part) Support for Clang lowering of fused intrinsics. This patch: 1. Removes bindings to clang fma intrinsics. 2. Introduces new LLVM unmasked intrinsics with rounding mode: int_x86_avx512_vfmadd_pd_512 int_x86_avx512_vfmadd_ps_512 int_x86_avx512_vfmaddsub_pd_512 int_x86_avx512_vfmaddsub_ps_512 supported with a new intrinsic type (INTR_TYPE_3OP_RM). 3. Introduces new x86 fmaddsub/fmsubadd folding. 4. Introduces new tests for code emitted by sequences introduced in Clang part. Patch by tkrupa Reviewers: craig.topper, sroland, spatel, RKSimon Reviewed By: craig.topper, RKSimon Differential Revision: https://reviews.llvm.org/D47443 llvm-svn: 333554	2018-05-30 15:25:16 +00:00
Daniel Neilson	6b23fb764e	[AliasSet] Teach the alias set how to handle atomic memcpy/memmove/memset Summary: The atomic variants of the memcpy/memmove/memset intrinsics can be treated the same way as the regular forms, with respect to aliasing. Update the AliasSetTracker to treat the atomic forms the same way as the regular forms. llvm-svn: 333551	2018-05-30 14:43:39 +00:00
Alexandros Lamprineas	52457d33b2	[InstCombine, ARM, AArch64] Convert table lookup to shuffle vector Turning a table lookup intrinsic into a shuffle vector instruction can be beneficial. If the mask used for the lookup is the constant vector {7,6,5,4,3,2,1,0}, then the back-end generates byte reverse instructions instead. Differential Revision: https://reviews.llvm.org/D46133 llvm-svn: 333550	2018-05-30 14:38:50 +00:00
Simon Pilgrim	8df8b129ce	[X86][AVX512] Replace -cpu=knl with -mattr=+avx512f for avx512-intrinsics tests It was noticed on D47377 that these tests were being unnecessarily affected by scheduler changes. This adds vzeroupper at the end of some tests as we lose the 'FeatureFastPartialYMMorZMMWrite' feature from KNL, since Skylake+ don't support this it's probably better. llvm-svn: 333549	2018-05-30 14:36:41 +00:00
Simon Pilgrim	173e225f1c	[X86][SSE] Remove unnecessary -cpu from sttni tests It was noticed on D47377 that these tests (for PR37246) were being unnecessarily affected by scheduler changes. llvm-svn: 333546	2018-05-30 14:11:57 +00:00
Simon Pilgrim	61b859dca7	[X86][SSE] Replace -cpu with equivalent -mattr for vec_cast tests It was noticed on D47377 that these tests were being unnecessarily affected by scheduler changes. llvm-svn: 333545	2018-05-30 14:01:21 +00:00
Amaury Sechet	f47d9f30b0	[ARM] Remove code handling ADDC/ADDE/SUBC/SUBE Summary: This code is now dead as the ARM backend uses ADDCARRY/SUBCARRY/SETCCCARRY . Reviewers: rogfer01, efriedma, rengolin, javed.absar Subscribers: kristof.beyls, chrib, llvm-commits Differential Revision: https://reviews.llvm.org/D47413 llvm-svn: 333544	2018-05-30 13:45:43 +00:00
Krzysztof Parzyszek	8987174627	[Hexagon] Use vector align-left when shift amount fits in 3 bits This saves an instruction because for align-right the shift amount would need to be put in a register first. llvm-svn: 333543	2018-05-30 13:45:34 +00:00
Simon Dardis	f990bf1cd2	[mips] Correct the definition of CTC2/CFC2 llvm-svn: 333542	2018-05-30 13:21:13 +00:00
Simon Dardis	a3aa926c09	[mips] Correct the predicates of microMIPS compact branch instructions llvm-svn: 333541	2018-05-30 13:16:17 +00:00
Simon Dardis	f909058ad4	[mips] Sink PredicateControl further down the class hierarchy. Previously PredicateControl in some cases was a member of <X>Inst classes for some X (DSP, EVA) or was in a more irregular place in the hierarchy for any given instruction. This patch moves PredicateControl down to the root so that it is consistently available. Then correct the base class of microMIPS instructions as using EncodingPredicates instead of the general Predicates field of Instruction. Reviewers: smaksimovic, abeserminji, atanasyan Differential Revision: https://reviews.llvm.org/D47526 llvm-svn: 333536	2018-05-30 12:40:53 +00:00
Simon Dardis	39710e3555	[mips] Correct the predicates of arithmetic and logic instructions. As part of this effort, duplicate and correct the predicates of some aliases. Also disable code generation of some short form instructions for FastISel, as it would otherwise reject them. Reviewers: atanasyan, abeserminji, smaksimovic Differential Revision: https://reviews.llvm.org/D47075 llvm-svn: 333530	2018-05-30 11:33:35 +00:00
Ilya Biryukov	5413510e32	[YAML] Quote multiline string scalars Summary: Otherwise, the YAML parser breaks when trying to read them back in 'key: multiline_string_value' cases. This patch fixes a problem when serializing structs which contain multi-line strings. E.g., if we try to serialize the following struct ``` { "key1": "first line\nsecond line", "key2": "another string" }` ``` Before this patch, we got the YAML output that failed to parse: ``` key1: first line second line key2: another string ``` After the patch, we get: ``` key1: 'first line second line' key2: another string ``` Reviewers: sammccall Reviewed By: sammccall Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47468 llvm-svn: 333527	2018-05-30 10:40:11 +00:00
Tim Northover	d8949f5002	AArch64: print correct annotation for ADRP addresses. The immediate on an ADRP MCInst needs to be multiplied by 0x1000 to obtain the actual PC-offset that will be calculated. llvm-svn: 333525	2018-05-30 09:54:59 +00:00
Sander de Smalen	bdf09fe7a2	[AArch64][AsmParser] Fix segfault on illegal fpimm. Floating point immediate combining a negative sign and a hexadecimal number, e.g. #-0x0 caused the compiler to crash. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: javed.absar Differential Revision: https://reviews.llvm.org/D47483 llvm-svn: 333524	2018-05-30 09:54:19 +00:00
Daniel Cederman	248dae81dc	[Sparc] Treat %fxx registers with value type Other as single precision They get type Other when used in the clobber list in inline assembly. This fixes tests fp128.ll and float.ll that failed after r333512. llvm-svn: 333523	2018-05-30 09:52:18 +00:00
Hans Wennborg	42e671d73d	Set underlying type for enum with GNU_PROPERTY_X86_FEATURE_1_AND constant The constant was causing a -Wc++11-narrowing error when compiled with clang-cl (see PR30776). llvm-svn: 333520	2018-05-30 09:04:57 +00:00
Serge Pavlov	c4b6d0ebab	Revert commit 333506 It looks like this commit is responsible for the fail: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-autoconf/builds/24382. llvm-svn: 333518	2018-05-30 09:01:12 +00:00
Daniel Cederman	60e6ce4155	[Sparc] Select correct register class for FP register constraints Summary: The fX version of floating-point registers only supports single precision. We need to map the name to dX for doubles and qX for long doubles if we want getRegForInlineAsmConstraint() to be able to pick the correct register class. Reviewers: jyknight, venkatra Reviewed By: jyknight Subscribers: eraman, fedor.sergeev, jrtc27, llvm-commits Differential Revision: https://reviews.llvm.org/D47258 llvm-svn: 333512	2018-05-30 06:07:55 +00:00
Craig Topper	cc0741e59f	[X86] Add unmasked AVX512VNNI intrinsics. Use a select in IR instead. A future patch will remove the old masked intrinsics. llvm-svn: 333508	2018-05-30 05:25:59 +00:00
Serge Pavlov	5096d06c10	Use uniform mechanism for OOM errors handling This is a recommit of r333390, which was reverted in r333395, because it caused cyclic dependency when building shared library `LLVMDemangle.so`. In this commit `ItaniumDemangler.cpp` was not changed. The original commit message is below. In r325551 many calls of malloc/calloc/realloc were replaced with calls of their safe counterparts defined in the namespace llvm. These functions generate crash if memory cannot be allocated, such behavior facilitates handling of out of memory errors on Windows. If the result of alloc function were checked for success, the function was not replaced with the safe variant. In these cases the calling function made the error handling, like: T *NewElts = static_cast<T*>(malloc(NewCapacity*sizeof(T))); if (NewElts == nullptr) report_bad_alloc_error("Allocation of SmallVector element failed."); Actually knowledge about the function where OOM occurred is useless. Moreover having a single entry point for OOM handling is convenient for investigation of memory problems. This change removes custom OOM errors handling and replaces them with calls to functions `llvm::safe_*alloc`. Declarations of `safe_*alloc` are moved to a separate include file, to avoid cyclic dependency in SmallVector.h Differential Revision: https://reviews.llvm.org/D47440 llvm-svn: 333506	2018-05-30 05:13:19 +00:00
Hiroshi Inoue	3872c6c633	[PowerPC] fix broken JIT-compiled code with tail call optimization The relocation for branch instructions in the dynamic loader of ExecutionEngine assumes branch instructions with R_PPC64_REL24 relocation type are only bl. However, with the tail call optimization, b instructions can also be used to jump into another function. This patch makes the relocation keep the bits in the branch instruction other than the jump offset, so that the relocation does not rewrite a b instruction into bl. Differential Revision: https://reviews.llvm.org/D47456 llvm-svn: 333502	2018-05-30 04:48:29 +00:00
Sam Clegg	a81fb84811	MC: Remove redundant substr() call Differential Revision: https://reviews.llvm.org/D47047 llvm-svn: 333496	2018-05-30 03:37:26 +00:00
Sam Clegg	e1076e5a77	Fix use of `echo` command in test script On win32 we use lit's executeBuiltinEcho to implement the echo command and this version only currently supports flags that are separate. llvm-svn: 333495	2018-05-30 03:26:28 +00:00
Sam Clegg	105bdc2557	[WebAssembly] MC: Add compile-twice test and fix corresponding bug Differential Revision: https://reviews.llvm.org/D47398 llvm-svn: 333494	2018-05-30 02:57:20 +00:00
Chandler Carruth	71fd27043e	[PM/LoopUnswitch] When using the new SimpleLoopUnswitch pass, schedule loop-cleanup passes at the beginning of the loop pass pipeline, and re-enqueue loops after even trivial unswitching. This will allow us to much more consistently avoid simplifying code while doing trivial unswitching. I've also added a test case that specifically shows effective iteration using this technique. I've unconditionally updated the new PM as that is always using the SimpleLoopUnswitch pass, and I've made the pipeline changes for the old PM conditional on using this new unswitch pass. I added a bunch of comments to the loop pass pipeline in the old PM to make it more clear what is going on when reviewing. Hopefully this will unblock doing partial unswitching instead of just full unswitching. Differential Revision: https://reviews.llvm.org/D47408 llvm-svn: 333493	2018-05-30 02:46:45 +00:00
Lang Hames	e9bdfc16d7	[ORC] Fix an ambiguous make_unique call. llvm-svn: 333492	2018-05-30 02:40:40 +00:00
Lang Hames	bd0cb787d0	[ORC] Update JITCompileCallbackManager to support multi-threaded code. Previously JITCompileCallbackManager only supported single threaded code. This patch embeds a VSO (see include/llvm/ExecutionEngine/Orc/Core.h) in the callback manager. The VSO ensures that the compile callback is only executed once and that the resulting address cached for use by subsequent re-entries. llvm-svn: 333490	2018-05-30 01:57:45 +00:00
Shiva Chen	c3d0e89284	[RISCV] Support resolving fixup_riscv_call and add to MCFixupKindInfo table Resolve fixup_riscv_call in the assembler when linker relaxation is disabled and the function and call site are within the same compile unit. Also add a static_assert after the Infos array declaration to avoid missing any new fixup in MCFixupKindInfo in the future. Differential Revision: https://reviews.llvm.org/D47126 llvm-svn: 333487	2018-05-30 01:16:36 +00:00
Diego Caballero	b94b21d441	[VPlan] Replace LLVM_ATTRIBUTE_USED with ifndef NDEBUG Minor replacement. LLVM_ATTRIBUTE_USED was introduced to silence a warning but using #ifndef NDEBUG makes more sense in this case. Reviewers: dblaikie, fhahn, hsaito Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D47498 llvm-svn: 333476	2018-05-29 23:10:44 +00:00
Craig Topper	5989db0fb4	[X86] Remove some of the extractelts from the new MOVSS+FMA patterns. We only need the extractelt that corresponds to the register we're trying to insert back into. We can't guarantee the others haven't been optimized out depending on how those operands were produced. So instead just look for an FR32/FR64 input and emit a COPY_TO_REGCLASS to VR128 in the output pattern. This matches what we do for ADD/SUB/MUL/DIV. llvm-svn: 333473	2018-05-29 22:52:09 +00:00
Craig Topper	dbd371e931	[X86] Use VR128X instead of VR128 in EVEX instruction patterns. llvm-svn: 333464	2018-05-29 20:46:27 +00:00
Craig Topper	aba57bfebd	[X86] Rename the operands in the recently introduced MOVSS+FMA patterns so that the operand names in the output pattern are always in 1, 2, 3 order since those are the operand names in the instruction. The order should be controlled in the input pattern. llvm-svn: 333463	2018-05-29 20:46:26 +00:00
Sam Clegg	f4f3750949	Fix build error introduced in rL333459 The DEBUG macro was renamed LLVM_DEBUG. llvm-svn: 333462	2018-05-29 20:16:47 +00:00
Chandler Carruth	4cbcbb0761	[LoopInstSimplify] Re-implement the core logic of loop-instsimplify to be both simpler and substantially more efficient. Rather than use a hand-rolled iteration technique that isn't quite the same as RPO, use the pre-built RPO loop body traversal utility. Once visiting the loop body in RPO, we can assert that we visit defs before uses reliably. When this is the case, the only need to iterate is when simplifying a def that is used by a PHI node along a back-edge. With this patch, the first pass over the loop body is just a complete simplification of every instruction across the loop body. When we encounter a use of a simplified instruction that stems from a PHI node in the loop body that has already been visited (due to some cyclic CFG, potentially the loop itself, or a nested loop, or unstructured control flow), we recall that specific PHI node for the second iteration. Nothing else needs to be preserved from iteration to iteration. On the second and later iterations, only instructions known to have simplified inputs are considered, each time starting from a set of PHIs that had simplified inputs along the backedges. Dead instructions are collected along the way, but deleted in a batch at the end of each iteration making the iterations themselves substantially simpler. This uses a new batch API for recursively deleting dead instructions. This also changes the routine to visit subloops. Because simplification is fundamentally transitive, we may need to visit the entire loop body, including subloops, to handle knock-on simplification. I've added a basic test file that helps demonstrate that all of these changes work. It includes both straight-forward loops with simplifications as well as interesting PHI-structures, CFG-structures, and a nested loop case. Differential Revision: https://reviews.llvm.org/D47407 llvm-svn: 333461	2018-05-29 20:15:38 +00:00
Craig Topper	5439b3d1e5	[X86] Fix a potential crash that can occur after r333419. The code could issue a truncate from a smaller type to a larger type. We need to extend in that case instead. llvm-svn: 333460	2018-05-29 20:04:10 +00:00
Sam Clegg	b7c6239408	[WebAssembly] Add more error checking to object file parsing This should address some of the assert failures the fuzzer has been finding such as: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=6719 Differential Revision: https://reviews.llvm.org/D47086 llvm-svn: 333459	2018-05-29 19:58:59 +00:00
Matt Arsenault	4b3829d8cf	AMDGPU: Fix broken check lines llvm-svn: 333458	2018-05-29 19:35:53 +00:00
Matt Arsenault	2e4d338d16	AMDGPU: Fix typo in option description llvm-svn: 333457	2018-05-29 19:35:46 +00:00
Matt Arsenault	1ea0402e82	AMDGPU: Round up kernel argument allocation size AFAIK the driver's allocation will actually have to round this up anyway. It is useful to track the rounded up size, so that the end of the kernel segment is known to be dereferenceable and a wider s_load_dword can be used for a short argument at the end of the segment. llvm-svn: 333456	2018-05-29 19:35:00 +00:00
Sameer AbuAsal	97684419e8	[RISCV] Add peepholes for Global Address lowering patterns Summary: Base and offset are always separated when a GlobalAddress node is lowered (rL332641) as an optimization to reduce instruction count. However, this optimization is not profitable if the Global Address ends up being used in only one instruction. This patch adds peephole optimizations that merge an offset of an address calculation into the LUI %hi and ADD %lo of the lowering sequence. The peephole handles three patterns: 1) ADDI (ADDI (LUI %hi(global)) %lo(global)), offset ---> ADDI (LUI %hi(global + offset)) %lo(global + offset). This generates: lui a0, hi (global + offset) add a0, a0, lo (global + offset) Instead of lui a0, hi (global) addi a0, hi (global) addi a0, offset This pattern is for cases when the offset is small enough to fit in the immediate field of ADDI (less than 12 bits). 2) ADD ((ADDI (LUI %hi(global)) %lo(global)), (LUI hi_offset)) ---> offset = hi_offset << 12 ADDI (LUI %hi(global + offset)) %lo(global + offset) Which generates the ASM: lui a0, hi(global + offset) addi a0, lo(global + offset) Instead of: lui a0, hi(global) addi a0, lo(global) lui a1, (offset) add a0, a0, a1 This pattern is for cases when the offset doesn't fit in an immediate field of ADDI but the lower 12 bits are all zeros. 3) ADD ((ADDI (LUI %hi(global)) %lo(global)), (ADDI lo_offset, (LUI hi_offset))) ---> offset = global + offhi20<<12 + offlo12 ADDI (LUI %hi(global + offset)) %lo(global + offset) Which generates the ASM: lui a1, %hi(global + offset) addi a1, %lo(global + offset) Instead of: lui a0, hi(global) addi a0, lo(global) lui a1, (offhi20) addi a1, (offlo12) add a0, a0, a1 This pattern is for cases when the offset doesn't fit in an immediate field of ADDI and both the lower 12 bits and high 20 bits are non zero. 
Reviewers: asb Reviewed By: asb Subscribers: rbar, johnrusso, simoncook, jordy.potman.lists, apazos, niosHD, kito-cheng, shiva0217, zzheng, edward-jones, mgrang llvm-svn: 333455	2018-05-29 19:34:54 +00:00
Daniel Neilson	3a6c50f4e0	[BasicAA] Teach the analysis about atomic memcpy Summary: A simple change to derive mod/ref info from the atomic memcpy intrinsic in the same way as from the regular memcpy intrinsic. llvm-svn: 333454	2018-05-29 19:23:50 +00:00
Douglas Yung	99feb567bf	Update CodeView register names in a test that was missed in r333421. llvm-svn: 333453	2018-05-29 19:21:22 +00:00
Konstantin Zhuravlyov	2ca6b1f2ba	AMDGPU: Always set COMPUTE_PGM_RSRC2.ENABLE_TRAP_HANDLER to zero for AMDHSA as it is set by CP Differential Revision: https://reviews.llvm.org/D47392 llvm-svn: 333451	2018-05-29 19:09:13 +00:00
Florian Hahn	33b6f9acc4	[TableGen] Use explicit constructor for InstMemo This should fix a few buildbot failures with old GCC versions. llvm-svn: 333448	2018-05-29 18:34:42 +00:00
Eli Friedman	63fead0f43	[ARM] Enable SETCCCARRY lowering for Thumb1. We've had Thumb1 support for ARMISD::SUBE for a while now, so this just works. Reduces codesize a bit for 64-bit integer comparisons. Differential Revision: https://reviews.llvm.org/D47387 llvm-svn: 333445	2018-05-29 18:17:16 +00:00
Matt Arsenault	64c6ab445e	IRBuilder: Add overload for intrinsics without args llvm-svn: 333443	2018-05-29 18:06:50 +00:00
Matt Arsenault	ceafc55e5a	AMDGPU: Pass function directly instead of MachineFunction These functions just query the underlying IR function, so pass it directly. llvm-svn: 333442	2018-05-29 17:42:50 +00:00
Matt Arsenault	2fb9ccf770	AMDGPU: Add nuw to add off of kernarg ptr llvm-svn: 333441	2018-05-29 17:42:38 +00:00
Matt Arsenault	ab2b79cb97	DAG: Remove redundant version of getRegisterTypeForCallingConv There seems to be no real reason to have these separate copies. The existing implementations just copy each other for x86. For Mips there is a subtle difference, which is just a bug since it changes based on the context where which one was called. Dropping this version, all tests pass. If I try to merge them to match the removed version, a test fails. llvm-svn: 333440	2018-05-29 17:42:26 +00:00
Tom Stellard	57b9342c80	AMDGPU: Split R600 MCInst lowering into its own class Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D47307 llvm-svn: 333439	2018-05-29 17:41:59 +00:00
Florian Hahn	7d3f9a88b9	[TableGen] Fix leaking of PhysRegInputs. Instead of dynamically allocating the vector for PhysRegs, we can allocate it on the stack and move it into InstructionMemo. Reviewers: mcrosier, craig.topper, RKSimon, dsanders Reviewed By: dsanders Differential Revision: https://reviews.llvm.org/D47461 llvm-svn: 333438	2018-05-29 17:40:03 +00:00
Nicolai Haehnle	e7ae0f48f4	TableGen: add some more helpful error messages Summary: Change-Id: I6f3dacf675a4126134577616e259696bebdade3a Reviewers: tra, simon_tatham, craig.topper, MartinO, arsenm Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D47429 Change-Id: I614de12a4c154c6d53c090f2f3e53ad2d09942c5 llvm-svn: 333436	2018-05-29 17:12:20 +00:00
Florian Hahn	6c21b3b595	[TableGen] Fix leaking synthesized registers. By keeping track of unique_ptrs to the synthesized definitions in CodeGenRegBank we avoid leaking them. Reviewers: dsanders, kparzysz, stoklund Reviewed By: dsanders Differential Revision: https://reviews.llvm.org/D47462 llvm-svn: 333434	2018-05-29 16:55:06 +00:00
Cameron McInally	b1bb60aec9	[StrictFP] Make getStrictFPOpcodeAction(...) more accessible NFCI. This function will be reused in upcoming patches. Differential Revision: https://reviews.llvm.org/D47380 llvm-svn: 333433	2018-05-29 16:49:32 +00:00
Simon Pilgrim	db9dbac501	[X86][SSE] Regenerate sdiv combine tests llvm-svn: 333431	2018-05-29 16:36:27 +00:00
Simon Pilgrim	77149a801f	[X86][AVX] Regenerate vzeroall/vzeroupper cleanup tests llvm-svn: 333430	2018-05-29 16:35:38 +00:00
Evandro Menezes	f8425340e4	[AArch64] Fix PR32384: bump up the number of stores per memset and memcpy As suggested in https://bugs.llvm.org/show_bug.cgi?id=32384#c1, this change makes the inlining of `memset()` and `memcpy()` more aggressive when compiling for speed. The tuning remains the same when optimizing for size. Patch by: Sebastian Pop <s.pop@samsung.com> Evandro Menezes <e.menezes@samsung.com> Differential revision: https://reviews.llvm.org/D45098 llvm-svn: 333429	2018-05-29 15:58:50 +00:00
Simon Atanasyan	69301c9eb9	[mips] Process numeric register name in the .set assignment directive Currently the LLVM assembler cannot process the following code and generates an error. GNU tools support the .set assignment directive with a numeric register name. ``` .set r4, 4 test.s:1:11: error: invalid token in expression .set r4, $4 ^ ``` This patch teaches the assembler to handle such directives correctly. Unfortunately a numeric register name cannot be represented as an expression. That's why we have to maintain a separate `StringMap` in the `MipsAsmParser` to keep the mapping between alias names and register numbers. Differential revision: https://reviews.llvm.org/D47464 llvm-svn: 333428	2018-05-29 15:58:06 +00:00
Amara Emerson	d5a9e7bbc9	Revert "[AArch64] added FP16 vcvth intrinsic support" This reverts commit r333410 due to bot failures. llvm-svn: 333427	2018-05-29 15:34:22 +00:00
Alexander Ivchenko	6572425462	[llvm-readobj] Support GNU_PROPERTY_X86_FEATURE_1_AND notes in .note.gnu.property This patch allows parsing GNU_PROPERTY_X86_FEATURE_1_AND notes in .note.gnu.property sections. These notes indicate that the object file is built to support Intel CET. patch by mike.dvoretsky Differential Revision: https://reviews.llvm.org/D47473 llvm-svn: 333424	2018-05-29 14:49:51 +00:00
Sander de Smalen	8704b03c4d	[AArch64][SVE] Asm: Support for predicated LSL/LSR (vectors) Reviewers: rengolin, huntergr, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D47365 llvm-svn: 333422	2018-05-29 14:40:24 +00:00
Jonas Devlieghere	43dce3edbe	[CodeView] Add prefix to CodeView registers. Adds CVReg to CodeView register names to prevent a duplicate symbol with CR3 defined in termios.h, as suggested by Zachary on the mailing list. http://lists.llvm.org/pipermail/llvm-dev/2018-May/123372.html Differential revision: https://reviews.llvm.org/D47478 rdar://39863705 llvm-svn: 333421	2018-05-29 14:35:34 +00:00
Alexander Ivchenko	96062eaa8e	[X86] Scalar mask and scalar move optimizations 1. Introduction of mask scalar TableGen patterns. 2. Introduction of new scalar move TableGen patterns and refactoring of existing ones. 3. Folding of pattern created by introducing scalar masking in Clang header files. Patch by tkrupa Differential Revision: https://reviews.llvm.org/D47012 llvm-svn: 333419	2018-05-29 14:27:11 +00:00
Than McIntosh	48bf43df8a	StackColoring: better handling of statically unreachable code Summary: Avoid assert/crash during liveness calculation in situations where the incoming machine function has statically unreachable BBs. Second attempt at submitting; this version of the change includes a revised testcase. Fixes PR37130. Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47372 llvm-svn: 333416	2018-05-29 13:52:24 +00:00
Lei Huang	716103f1cd	[PowerPC] Fix the incorrect iterator inside peephole Instruction selection can insert nodes into the underlying list after the root node, so iterating will miss them. We should NOT assume that the root node is the last element in the DAG nodelist. Patch by: steven.zhang (Qing Shan Zhang) Differential Revision: https://reviews.llvm.org/D47437 llvm-svn: 333415	2018-05-29 13:38:56 +00:00
Sander de Smalen	26b9b2a8c3	[AArch64][SVE] Asm: Support for AND, ORR, EOR and BIC instructions. This patch addresses the following variants: - bitmask immediate, e.g. 'and z0.d, z0.d, #0x6'. - unpredicated data vectors, e.g. 'and z0.d, z1.d, z2.d'. - predicated data vectors, e.g. 'and z0.d, p0/m, z0.d, z1.d'. And also several aliases, such as: - ORN, alias of ORR. - EON, alias of EOR. - BIC, alias of AND (immediate variant) - MOV, alias of ORR (if unpredicated and source register operands are the same) Reviewers: rengolin, huntergr, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D47363 llvm-svn: 333414	2018-05-29 13:08:43 +00:00
Luke Geeson	16092ab3c5	[AArch64] added FP16 vcvth intrinsic support Summary: Change-Id: I0df845749c7689dfc99150ba7c19c7d0dadbd705 Reviewers: javed.absar, SjoerdMeijer Reviewed By: SjoerdMeijer Subscribers: llvm-commits, SjoerdMeijer Differential Revision: https://reviews.llvm.org/D46311 llvm-svn: 333410	2018-05-29 11:40:33 +00:00
Simon Atanasyan	a1d69f9e53	[mips] Emit R_MICROMIPS_GPREL16/R_MICROMIPS_SUB/R_MICROMIPS_LO16 / HI16 relocations Emit R_MICROMIPS_GPREL16/R_MICROMIPS_SUB/R_MICROMIPS_LO16 and R_MICROMIPS_GPREL16/R_MICROMIPS_SUB/R_MICROMIPS_HI16 chains of relocations for %lo(%neg(%gp_rel())) and %hi(%neg(%gp_rel())) expressions in case of microMIPS. Differential Revision: http://reviews.llvm.org/D47220 llvm-svn: 333409	2018-05-29 11:33:54 +00:00
Sander de Smalen	98686c6b15	[AArch64][SVE] Asm: Support for ADD (immediate) instructions. This patch adds addsub_imm8_opt_lsl_(i8|i16|i32|i64) operands that are unsigned values in the range 0 to 255. For element widths of 16 bits or higher it may also be a signed multiple of 256 in the range 0 to 65280. Note: This also does some refactoring to reuse convenience function getShiftedVal<shift>(), and now allows AArch64 scalar 'ADD #-4096' to be accepted to be mapped to SUB #4096. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D47310 llvm-svn: 333408	2018-05-29 10:39:49 +00:00
Simon Atanasyan	6be87bce29	[mips] Emit R_MICROMIPS_HIGHER / R_MICROMIPS_HIGHEST relocations Emit R_MICROMIPS_HIGHER / R_MICROMIPS_HIGHEST relocations for %higher() and %highest() expressions in case of microMIPS. These relocations do exactly the same things as R_MIPS_HIGHER / R_MIPS_HIGHEST, but for consistency it's better to write microMIPS variants. Differential Revision: http://reviews.llvm.org/D47219 llvm-svn: 333407	2018-05-29 10:27:44 +00:00
Luke Geeson	cc09d78297	Test Commit Access - Removed Whitespace llvm-svn: 333406	2018-05-29 10:12:27 +00:00
Simon Dardis	0fad58cbaf	[mips] Correct the predicates for a number of instructions. Previously, their listed predicates were overridden at the scope level. Reviewers: atanasyan, abeserminji, smaksimovic Differential Revision: https://reviews.llvm.org/D46947 llvm-svn: 333405	2018-05-29 09:56:19 +00:00
Simon Atanasyan	b2d61fa3d8	[mips] Cleanup the code to reduce diff with the upcoming patches. NFC llvm-svn: 333404	2018-05-29 09:51:33 +00:00
Simon Atanasyan	d408ec4cfa	[mips] Escape else-after-return. NFC llvm-svn: 333403	2018-05-29 09:51:28 +00:00
Simon Atanasyan	3535cb1130	[mips] Stop parsing a .set assignment if the first argument is not an identifier Before this fix the following code triggers two error messages. The second one is at least useless: test.s:1:9: error: expected identifier after .set .set 123, $a0 ^ test-set.s:1:9: error: unexpected token, expected comma .set 123, $a0 ^ llvm-svn: 333402	2018-05-29 09:51:22 +00:00
Tim Renouf	fa213f797b	[AMDGPU] Fixed build warning Summary: V2: Use cast instead of extra if. Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D47426 Change-Id: I6ac31da0306f79706960284a7ebd7b9c6237a83a llvm-svn: 333397	2018-05-29 08:15:37 +00:00
Serge Pavlov	1a095524f2	Reverted commits 333390, 333391 and 333394 Build of shared library LLVMDemangle.so fails due to dependency problem. llvm-svn: 333395	2018-05-29 07:05:41 +00:00
Serge Pavlov	335fa1eb04	Added library LLVMSupport to dependencies of LLVMDemangle After r333390 build of LLVMDemangle.so fails due to unresolved reference `llvm::report_bad_alloc_error`. llvm-svn: 333394	2018-05-29 06:48:57 +00:00
Craig Topper	a34f8731c7	[X86] Disable a DAG combine to allow packed AVX512DQ instructions to be consistently used for i64->float/double conversions. Summary: We already get this right if the i64 didn't come from a load. Reviewers: RKSimon Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47439 llvm-svn: 333393	2018-05-29 06:22:45 +00:00
Clement Courbet	07c9ec6f2e	[X86][Sched] Add InstRW for CLC on Intel after SNB. Summary: After SNB, Intel CPUs can rename CF independently of other EFLAGS, so the renamer can zero it for free. Note that STC still consumes resources. To reproduce: `$ llvm-exegesis -mode=uops -opcode-name=CLC` On SNB: ``` --- key: opcode_name: CLC mode: uops config: '' cpu_name: sandybridge llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 10000 measurements: - { key: '3', value: 0.0014, debug_string: SBPort0 } - { key: '4', value: 0.0013, debug_string: SBPort1 } - { key: '5', value: 0.0003, debug_string: SBPort4 } - { key: '6', value: 0.0029, debug_string: SBPort5 } - { key: '10', value: 0.0003, debug_string: SBPort23 } error: '' info: 'instruction is serial, repeating a random one. Snippet: CLC ' ... ``` On HSW: ``` --- key: opcode_name: CLC mode: uops config: '' cpu_name: haswell llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 10000 measurements: - { key: '3', value: 0.001, debug_string: HWPort0 } - { key: '4', value: 0.0009, debug_string: HWPort1 } - { key: '5', value: 0.0004, debug_string: HWPort2 } - { key: '6', value: 0.0006, debug_string: HWPort3 } - { key: '7', value: 0.0002, debug_string: HWPort4 } - { key: '8', value: 0.0012, debug_string: HWPort5 } - { key: '9', value: 0.0022, debug_string: HWPort6 } - { key: '10', value: 0.0001, debug_string: HWPort7 } error: '' info: 'instruction is serial, repeating a random one. Snippet: CLC ' ... ``` Reviewers: craig.topper, RKSimon Subscribers: gchatelet, llvm-commits Differential Revision: https://reviews.llvm.org/D47362 llvm-svn: 333392	2018-05-29 06:19:39 +00:00
Serge Pavlov	edc8d889b9	Added system header cstdlib to MemAlloc.h Some buildbots fail because they cannot find `std::malloc` and other allocation functions. llvm-svn: 333391	2018-05-29 06:03:53 +00:00
Serge Pavlov	0e31285fe8	Use uniform mechanism for OOM errors handling In r325551 many calls of malloc/calloc/realloc were replaced with calls of their safe counterparts defined in the namespace llvm. These functions generate a crash if memory cannot be allocated; such behavior facilitates handling of out-of-memory errors on Windows. If the result of an alloc function was checked for success, the function was not replaced with the safe variant. In these cases the calling function did the error handling, like: T *NewElts = static_cast<T*>(malloc(NewCapacity*sizeof(T))); if (NewElts == nullptr) report_bad_alloc_error("Allocation of SmallVector element failed."); Actually knowledge about the function where OOM occurred is useless. Moreover having a single entry point for OOM handling is convenient for investigation of memory problems. This change removes custom OOM errors handling and replaces them with calls to functions `llvm::safe_alloc`. Declarations of `safe_alloc` are moved to a separate include file, to avoid cyclic dependency in SmallVector.h Differential Revision: https://reviews.llvm.org/D47440 llvm-svn: 333390	2018-05-29 05:39:08 +00:00
Fangrui Song	74d6a7400c	[LangRef] Fix TBAA example llvm-svn: 333389	2018-05-29 05:38:05 +00:00
Craig Topper	21aeddc3dc	[X86] Remove masked vpermi2var/vpermt2var intrinsics and autoupgrade. We have unmasked intrinsics now and wrap them with a select. This is a net reduction of 36 intrinsics from before the unmasked intrinsics were added. llvm-svn: 333388	2018-05-29 05:22:05 +00:00
Craig Topper	2adc7d956c	[X86] Add unmasked vpermi2var intrinsics so we can use explicit select instructions for masking in clang. This will allow us to remove the 3 different flavors of masked intrinsics. I'm leaving the actual intrinsic removal for another patch. llvm-svn: 333386	2018-05-29 03:26:30 +00:00
Craig Topper	dcfcfdb0d1	[X86] Converge X86ISD::VPERMV3 and X86ISD::VPERMIV3 to a single opcode. These do the same thing with the first and second sources swapped. They previously came from separate intrinsics that specified different masking behavior. But we can cover that with isel patterns and a single node. This is a step towards reducing the number of intrinsics needed. A bunch of tests change because we are now biased to choosing VPERMT over VPERMI when there is nothing to signal that commuting is beneficial. llvm-svn: 333383	2018-05-28 19:33:11 +00:00
Craig Topper	6b545182fb	[X86] Fix typo in comment. NFC llvm-svn: 333382	2018-05-28 19:33:06 +00:00
Farhana Aleen	eacb1020aa	[AMDGPU] Re-enabled 128bit wide-vector generation for local addr space by default. Summary: Bug reported here https://bugs.freedesktop.org/show_bug.cgi?id=105464 found to be resolved by some other fixes. Author: FarhanaAleen llvm-svn: 333380	2018-05-28 18:15:11 +00:00
Fangrui Song	afa95ee03d	[LLVM-C] [OCaml] Remove LLVMAddBBVectorizePass Summary: It was fully replaced back in 2014, and the implementation was removed 11 months ago by r306797. Reviewers: hfinkel, chandlerc, whitequark, deadalnix Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47436 llvm-svn: 333378	2018-05-28 16:58:10 +00:00
Lei Huang	651be44913	[Power9]Legalize and emit code for HW/Byte vector extract and convert to QP Implement patterns to extract HWord and Byte vector elements and convert to quad-precision. Differential Revision: https://reviews.llvm.org/D46774 llvm-svn: 333377	2018-05-28 16:43:29 +00:00
Zaara Syeda	6f3df02fdc	[PowerPC] Set isAsmParserOnly=1 for X-form TLS loads/stores The X-form TLS load/store instructions added for optimizing the initial-exec sequence in https://reviews.llvm.org/rL327635 fail to assemble. llvm-mc fails with the error: invalid operand for instruction. This patch adds these instructions into a block with isAsmParserOnly, similar to how ADD8TLS_ is currently handled. Differential Revision: https://reviews.llvm.org/D47382 llvm-svn: 333374	2018-05-28 15:27:58 +00:00
Daniel Cederman	2e7fe0edaf	[Sparc] Add .uahalf and .uaword directives Summary: Adding these makes it easier to assemble the output from GCC which generates a lot of .uahalf and .uaword directives. GAS treats .uahalf and .half the same unless the --enforce-aligned-data flag is used. I could not find a similar flag for LLVM so it seems that .half does not have any alignment requirement and is treated the same as .uahalf should be. If that would change later on then the tests in sparc-directives.s would fail due to bad alignment. Reviewers: jyknight, asb Reviewed By: jyknight Subscribers: fedor.sergeev, jrtc27, llvm-commits Differential Revision: https://reviews.llvm.org/D47319 llvm-svn: 333372	2018-05-28 12:42:55 +00:00
Craig Topper	26bc84860a	[X86] Stop forcing X86VPermi2X node index operand to match destination type to make masking pattern matching easier. Add extra patterns with bitcasts instead. This basically reverts r280696 in favor of using extra patterns as mentioned as an alternative in that commit message. For now I've only added the cases we have test cases for, but it should be easy to add more in the future. This will help to convert VPERMI2PS/VPERMT2PS intrinsics to use a single ISD node opcode. And hopefully allow some intrinsics to be removed. llvm-svn: 333365	2018-05-28 05:37:25 +00:00
Andrea Di Biagio	df8e919957	[Tablegen] Avoid generating empty switch statements. NFC This fixes an MSVC warning (warning C4065: switch statement contains 'default' but no 'case' labels) introduced with revision 333293. llvm-svn: 333363	2018-05-27 19:08:12 +00:00
Tim Renouf	364edcd2e5	[AMDGPU] Fixed WWM bug in block otherwise entirely in WQM Summary: For a block with WQM on entry and exit and containing no exact mode code, but containing some WWM code, the WQM pass forgot to process the block at all and so did not insert code to enter and leave WWM. This commit fixes that. Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D47027 Change-Id: I044792eead1293bed4203fb26ce75f47878afeb6 llvm-svn: 333362	2018-05-27 17:26:11 +00:00
Simon Pilgrim	79dae5ba2a	[X86] Don't hardcode scheduler class Also fixes BEXTRI instruction to use WriteBEXTR class, which was missed when the class was added. llvm-svn: 333360	2018-05-27 14:54:18 +00:00
David Green	aee7ad0cde	Revert 333358 as it's failing on some builders. I'm guessing the tests rely on the ARM backend being built. llvm-svn: 333359	2018-05-27 12:54:33 +00:00
David Green	3034281b43	[UnrollAndJam] Add a new Unroll and Jam pass This is a simple implementation of the unroll-and-jam classical loop optimisation. The basic idea is that we take an outer loop of the form: for i.. ForeBlocks(i) for j.. SubLoopBlocks(i, j) AftBlocks(i) Instead of doing normal inner or outer unrolling, we unroll as follows: for i... i+=2 ForeBlocks(i) ForeBlocks(i+1) for j.. SubLoopBlocks(i, j) SubLoopBlocks(i+1, j) AftBlocks(i) AftBlocks(i+1) Remainder So we have unrolled the outer loop, then jammed the two inner loops into one. This can lead to a simpler inner loop if memory accesses can be shared between the now-jammed loops. To do this we have to prove that this is all safe, both for the memory accesses (using dependence analysis) and that ForeBlocks(i+1) can move before AftBlocks(i) and SubLoopBlocks(i, j). Differential Revision: https://reviews.llvm.org/D41953 llvm-svn: 333358	2018-05-27 12:11:21 +00:00
Eric Christopher	958a1f8d87	Remove boolean argument from isSuitableForBSS. The argument was used as an additional negative condition and can be expressed in the if conditional without needing to pass it down. Update bss commentary around main use. llvm-svn: 333357	2018-05-27 11:39:34 +00:00
Eric Christopher	ed169ec424	Cleanups for getKindForGlobal: - Clarify block comment - Make Function/GlobalVariable split more explicit. - Move locals closer to uses. llvm-svn: 333356	2018-05-27 11:23:20 +00:00
Eric Christopher	66c5bbc53e	Tidy some language in the xray documentation. llvm-svn: 333354	2018-05-27 09:19:03 +00:00
Jonas Devlieghere	cb547cbb5c	[dwarfdump] Make -c and -p work together When requesting to dump both the parent chain and children, we used to print the DIE more than once because we propagated the dump options to the parent without clearing the respective flags. This commit fixes this oversight and adds a test. rdar://39415292 Differential revision: https://reviews.llvm.org/D47263 llvm-svn: 333350	2018-05-26 19:39:56 +00:00
Craig Topper	51eddb8749	[X86] Remove masking from avx512ifma intrinsics. Use a select instead. This allows us to avoid having mask and maskz variant. Reducing from 12 intrinsics to 6. llvm-svn: 333346	2018-05-26 18:55:19 +00:00
Amaury Sechet	0efdcdfbfc	Fix comment describing setcccarry. NFC llvm-svn: 333344	2018-05-26 14:40:42 +00:00
Amaury Sechet	c9edc0cfe2	Add test case for D46505. NFC llvm-svn: 333341	2018-05-26 12:28:23 +00:00
Paul Semel	cf51c80bf1	[llvm-objcopy] Add --keep-file-symbols option This option prevents file symbols from being removed when other symbols are removed. Differential Revision: https://reviews.llvm.org/D46830 llvm-svn: 333339	2018-05-26 08:10:37 +00:00
Teresa Johnson	fb89e7a943	[ThinLTO] Fix a few more test match issues Fix a few more bot failures due to r333335: - don't match path other than file name, since the delimiter is different for Windows - The summary IDs in thinlto-function-summary-refgraph.ll may vary and therefore can't be matched exactly, because the ordering depends on the iteration order of the index map which is keyed by GUID. The GUID for private values will depend on the path. llvm-svn: 333338	2018-05-26 03:50:29 +00:00
Teresa Johnson	365ed5948e	[ThinLTO] Fix another bot failure due to test mismatch Don't try to match the exact GUID for private symbols, as the hashed name includes the file path. llvm-svn: 333337	2018-05-26 03:20:06 +00:00
Teresa Johnson	724e7a19de	[ThinLTO] Fix bot failures from r333335 Change value in vector from StringRef to std::string to avoid errors when trying to initialize from a std::string. llvm-svn: 333336	2018-05-26 02:53:52 +00:00
Teresa Johnson	08d5b4ef0d	[ThinLTO] Print module summary index to assembly Summary: Implements AsmWriter support for printing the module summary index to assembly with the format discussed in the RFC "LLVM Assembly format for ThinLTO Summary". Implements just enough of the parsing support to recognize and ignore the summary entries. As agreed in the RFC thread, this will be the behavior when assembling the IR. A follow on change will implement parsing/assembling of the summary entries for use by tools that currently build the summary index from bitcode. Reviewers: dexonsmith, pcc Subscribers: inglorion, eraman, steven_wu, dblaikie, llvm-commits Differential Revision: https://reviews.llvm.org/D46699 llvm-svn: 333335	2018-05-26 02:34:13 +00:00
Fangrui Song	ffebfe10c1	[llvm-symbolizer] Simplify llvm-svn: 333334	2018-05-26 02:29:14 +00:00
George Burgess IV	45f263dd90	[MemorySSA] Reflow comments + clean up control flow; NFC Style guide says `else`s after returns are iffy, and I agree. I also don't know what broke the comments here and in CFLAA, but shrug. llvm-svn: 333332	2018-05-26 02:28:55 +00:00
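
The loop shape described in commit 3034281b43 ([UnrollAndJam]) can be illustrated with a small hand-written example. This is only a sketch of the source-level transformation, not the pass's implementation: the function names `matVecPlain` and `matVecUnrollAndJam`, the `Matrix` alias, and the matrix-vector workload are all assumptions chosen for illustration. The comments map each statement to the ForeBlocks / SubLoopBlocks / AftBlocks regions named in the commit message, with an unroll factor of 2.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper type for this sketch; not part of LLVM.
using Matrix = std::vector<std::vector<int>>;

// Original nest: one Fore / SubLoop / Aft sequence per outer iteration.
std::vector<int> matVecPlain(const Matrix &A, const std::vector<int> &B) {
  std::vector<int> Out(A.size());
  for (std::size_t I = 0; I < A.size(); ++I) {
    assert(A[I].size() == B.size() && "row width must match vector length");
    int Acc = 0;                                  // ForeBlocks(i)
    for (std::size_t J = 0; J < B.size(); ++J)
      Acc += A[I][J] * B[J];                      // SubLoopBlocks(i, j)
    Out[I] = Acc;                                 // AftBlocks(i)
  }
  return Out;
}

// Outer loop unrolled by 2, with the two inner loops jammed into one.
std::vector<int> matVecUnrollAndJam(const Matrix &A,
                                    const std::vector<int> &B) {
  std::vector<int> Out(A.size());
  std::size_t I = 0;
  for (; I + 1 < A.size(); I += 2) {
    assert(A[I].size() == B.size() && A[I + 1].size() == B.size());
    int Acc0 = 0;                                 // ForeBlocks(i)
    int Acc1 = 0;                                 // ForeBlocks(i+1)
    for (std::size_t J = 0; J < B.size(); ++J) {
      int Bj = B[J];                              // load shared by both copies
      Acc0 += A[I][J] * Bj;                       // SubLoopBlocks(i, j)
      Acc1 += A[I + 1][J] * Bj;                   // SubLoopBlocks(i+1, j)
    }
    Out[I] = Acc0;                                // AftBlocks(i)
    Out[I + 1] = Acc1;                            // AftBlocks(i+1)
  }
  // Remainder: at most one leftover outer iteration when the trip count is odd.
  for (; I < A.size(); ++I) {
    assert(A[I].size() == B.size());
    int Acc = 0;
    for (std::size_t J = 0; J < B.size(); ++J)
      Acc += A[I][J] * B[J];
    Out[I] = Acc;
  }
  return Out;
}
```

In this example the jammed inner loop reads `B[J]` once for two rows, which is the kind of shared memory access the commit message mentions. The reordering is safe here because the two outer iterations touch disjoint accumulators and output elements; the pass has to establish the equivalent property through dependence analysis.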

165054 Commits