llvm-project

Author	SHA1	Message	Date
Ivan A. Kosarev	60a991ed1a	[NEON] Support VLD1xN intrinsics in AArch32 mode (LLVM part) We currently support them only in AArch64. The NEON Reference, however, says they are 'ARMv7, ARMv8' intrinsics. Differential Revision: https://reviews.llvm.org/D47120 llvm-svn: 333825	2018-06-02 16:40:03 +00:00
Ivan A. Kosarev	73c5337a64	Revert r333819 "[NEON] Support VLD1xN intrinsics in AArch32 mode (Clang part)" The LLVM part was committed instead of the Clang part. Differential Revision: https://reviews.llvm.org/D47121 llvm-svn: 333824	2018-06-02 16:38:38 +00:00
Michael J. Spencer	ae6eeaea92	[MC] Add assembler support for .cg_profile. Object File Representation At codegen time this is emitted into the ELF file as a pair of symbol indices and a weight. In assembly it looks like: .cg_profile a, b, 32 .cg_profile freq, a, 11 .cg_profile freq, b, 20 When writing an ELF file these are put into a SHT_LLVM_CALL_GRAPH_PROFILE (0x6fff4c02) section as (uint32_t, uint32_t, uint64_t) tuples of (from symbol index, to symbol index, weight). Differential Revision: https://reviews.llvm.org/D44965 llvm-svn: 333823	2018-06-02 16:33:01 +00:00
Craig Topper	93d8fbd8f2	[X86] Add tied source operand to AVX5124FMAPS and AVX5124VNNIW instructions. This doesn't affect the assembly or disassembly, but is more accurate. llvm-svn: 333822	2018-06-02 16:30:39 +00:00
Craig Topper	27234f1d8f	[X86] Fix warning message for AVX5124FMAPS and AVX5124VNNIW instructions in the assembly parser. The caret was positioned on the wrong operand. It's too hard to get right so just put the caret at the beginning of the instruction. llvm-svn: 333821	2018-06-02 16:30:36 +00:00
Sanjay Patel	bbc6d60677	[InstCombine] call simplify before trying vector folds As noted in the review thread for rL333782, we could have made a bug harder to hit if we were simplifying instructions before trying other folds. The shuffle transform in question isn't ever a simplification; it's just a canonicalization. So I've renamed that to make that clearer. This is NFCI at this point, but I've regenerated the test file to show the cosmetic value naming difference of using instcombine's RAUW vs. the builder. Possible follow-ups: 1. Move reassociation folds after simplifies too. 2. Refactor common code; we shouldn't have so much repetition. llvm-svn: 333820	2018-06-02 16:27:44 +00:00
Ivan A. Kosarev	51f19b9ee1	[NEON] Support VLD1xN intrinsics in AArch32 mode (Clang part) We currently support them only in AArch64. The NEON Reference, however, says they are 'ARMv7, ARMv8' intrinsics. Differential Revision: https://reviews.llvm.org/D47121 llvm-svn: 333819	2018-06-02 16:26:42 +00:00
Fangrui Song	8ca769d204	[Support] Remove unused raw_ostream::handle whose anchor role was superseded by anchor() llvm-svn: 333817	2018-06-02 06:00:35 +00:00
Craig Topper	1534929623	[X86] Add encoding information for the AVX5124FMAPS and AVX5124VNNIW instructions so they can be assembled and disassembled. These instructions are unusual in that they operate on 4 consecutive registers so supporting them in codegen will be more difficult than normal. Includes an assembler check to warn if the source register is not the first register of a 4 register group. llvm-svn: 333812	2018-06-02 02:15:10 +00:00
Chandler Carruth	9281503e8f	[PM/LoopUnswitch] Fix how the cloned loops are handled when updating analyses. Summary: I noticed this issue because we didn't put the primary cloned loop into the `NonChildClonedLoops` vector and so never iterated on it. Once I fixed that, it made it clear why I had to do a really complicated and unnecessary dance when updating the loops to remain in canonical form -- I was unwittingly working around the fact that the primary cloned loop wasn't in the expected list of cloned loops. Doh! Now that we include it in this vector, we don't need to return it and we can consolidate the update logic as we correctly have a single place where it can be handled. I've just added a test for the iteration order aspect as every time I changed the update logic partially or incorrectly here, an existing test failed and caught it so that seems well covered (which is also evidenced by the extensive working around of this missing update). Reviewers: asbirlea, sanjoy Subscribers: mcrosier, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D47647 llvm-svn: 333811	2018-06-02 01:29:01 +00:00
Roman Tereshin	cf88ffaaf9	[DebugInfo] Refactoring DIType::setFlags to DIType::cloneWithFlags, NFC and using the latter in DIBuilder::createArtificialType and DIBuilder::createObjectPointerType methods as well as introducing mirroring DISubprogram::cloneWithFlags and DIBuilder::createArtificialSubprogram methods. The primary goal here is to add createArtificialSubprogram to support a pass downstream while keeping the method consistent with the existing ones and making sure we don't encourage changing already created DI-nodes. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D47615 llvm-svn: 333806	2018-06-01 23:15:09 +00:00
Craig Topper	3828ce7eab	[X86] Do something sensible when an expand load intrinsic is passed a 0 mask. Previously we just returned undef, but really we should be returning the pass thru input. We also need to make sure we preserve the chain output that the original intrinsic node had to maintain connectivity in the DAG. So we should just return the incoming chain as the output chain. llvm-svn: 333804	2018-06-01 22:59:07 +00:00
Vedant Kumar	7224c08141	Add a debug dump for DbgValueHistoryMap This makes it easier to inspect the results of DbgValueHistoryCalculator. Differential Revision: https://reviews.llvm.org/D47663 llvm-svn: 333801	2018-06-01 22:33:15 +00:00
Craig Topper	aa747412b1	[X86] Add isel patterns to use vexpand with zero masking when the passthru value is a zero vector. llvm-svn: 333800	2018-06-01 22:28:28 +00:00
Zachary Turner	b44d7a0da1	Move some function declarations out of WindowsSupport.h The idea behind WindowsSupport.h is that it's in the source directory so that windows.h'isms don't leak out into the larger LLVM project. To that end, any symbol that references a symbol from windows.h must be in this private header, and not in a public header. However, we had some useful utility functions in WindowsSupport.h which have no dependency on the Windows API, but still only make sense on Windows. Those functions should be usable outside of Support since there is no risk of causing a windows.h leak. Although this introduces some preprocessor logic in some header files, it's not too egregious and it's better than the alternative of duplicating a ton of code. Differential Revision: https://reviews.llvm.org/D47662 llvm-svn: 333798	2018-06-01 22:23:46 +00:00
Karl-Johan Karlsson	6d52e5c3e4	[ConstantFold] Disallow folding vector geps into bitcasts Summary: Getelementptr returns a vector of pointers, instead of a single address, when one or more of its arguments is a vector. In such case it is not possible to simplify the expression by inserting a bitcast of operand(0) into the destination type, as it will create a bitcast between different sizes. Reviewers: majnemer, mkuper, mssimpso, spatel Reviewed By: spatel Subscribers: lebedev.ri, llvm-commits Differential Revision: https://reviews.llvm.org/D46379 llvm-svn: 333783	2018-06-01 19:34:35 +00:00
Sanjay Patel	66f7e19f6a	[InstCombine] fix vector shuffle transform to replace undef elements (PR37648) This bug: https://bugs.llvm.org/show_bug.cgi?id=37648 ...was created with the enhancement to this transform with rL332479. The urem test shows the disaster potential: any undef divisor lane makes the whole op undef. The test diffs show that vector demanded elements turns some of the potential, but not all, unused binop operands back into undef already. llvm-svn: 333782	2018-06-01 19:23:18 +00:00
Simon Atanasyan	e80c3ce9cc	[mips] Support 64-bit offsets for lb/sb/ld/sd/lld ... instructions The `MipsAsmParser::loadImmediate` can load immediates of various sizes into a register. The idea of this change is to use `loadImmediate` in the `MipsAsmParser::expandMemInst` method to load the offset into a register and then call the required load/store instruction. The patch removes the separate `expandLoadInst` and `expandStoreInst` methods and does everything in the `expandMemInst` method to avoid code duplication. Differential Revision: https://reviews.llvm.org/D47316 llvm-svn: 333774	2018-06-01 16:37:53 +00:00
Simon Atanasyan	3a44bcf95a	[mips] Extend list of relocations supported by the `.reloc` directive Supporting GOT and TLS related relocations in the `.reloc` directive is useful for testing various tools, such as a linker. llvm-svn: 333773	2018-06-01 16:37:42 +00:00
Krzysztof Parzyszek	bc68385dad	[Hexagon] Avoid UB when shifting unsigned integer left by 32 llvm-svn: 333771	2018-06-01 15:39:10 +00:00
Vlad Tsyrklevich	6867ab7c90	[ThinLTOBitcodeWriter] Emit summaries for regular LTO modules Summary: Emit summaries for bitcode modules that are only destined for the regular LTO portion of the build so they can participate in summary-based dead stripping. This change reduces the size of a nacl_helper build with cfi-icall enabled by 7%, removing the majority of the overhead due to enabling cfi-icall. The cfi-icall size increase was caused by compiling in lots of unused code and cfi-icall generating jumptable references to unused symbols that could no longer be removed by -Wl,-gc-sections. Increasing the visibility of summary-based dead stripping prevented jumptable entries being created for unused symbols from the regular LTO portion of the build. Reviewers: pcc Reviewed By: pcc Subscribers: dschuff, mehdi_amini, inglorion, eraman, llvm-commits, kcc Differential Revision: https://reviews.llvm.org/D47594 llvm-svn: 333768	2018-06-01 15:20:47 +00:00
Nirav Dave	fc9a700f94	[DAG] Avoid checking for consecutive stores in store merge. NFCI. llvm-svn: 333766	2018-06-01 15:05:55 +00:00
Nirav Dave	39ece11ae5	[DAG] Simplify Expression. NFC. llvm-svn: 333765	2018-06-01 15:05:30 +00:00
Nirav Dave	0fc27acaa2	[DAG] Remove untriggerable check. NFCI. Candidate check precludes this check. llvm-svn: 333764	2018-06-01 15:05:05 +00:00
Nirav Dave	a74921a696	[DAG] Prune store merge legal store check to stop invalid size. NFCI. Do not consider store sizes larger than the maximum legal store size. llvm-svn: 333763	2018-06-01 15:04:40 +00:00
Krzysztof Parzyszek	aec2c0c9b6	[Hexagon] Select HVX code for vector CTPOP, CTLZ, and CTTZ llvm-svn: 333760	2018-06-01 14:52:58 +00:00
Hiroshi Inoue	9796b47df1	[NFC] Zero initialize local variables This patch makes local variables zero initialized to avoid broken values in debug output. llvm-svn: 333754	2018-06-01 14:23:15 +00:00
Krzysztof Parzyszek	0b6187c1a9	[SelectionDAG] Expand UADDO/USUBO into ADD/SUBCARRY if legal for target Additionally, implement handling of ADD/SUBCARRY on Hexagon, utilizing the UADDO/USUBO expansion. Differential Revision: https://reviews.llvm.org/D47559 llvm-svn: 333751	2018-06-01 14:00:32 +00:00
Amaury Sechet	8467411dad	Set ADDE/ADDC/SUBE/SUBC to expand by default Summary: They've been deprecated in favor of UADDO/ADDCARRY or USUBO/SUBCARRY for a while. Targets that use these opcodes are changed in order to ensure their behavior doesn't change. Reviewers: efriedma, craig.topper, dblaikie, bkramer Subscribers: jholewinski, arsenm, jyknight, sdardis, nemanjai, nhaehnle, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, jordy.potman.lists, apazos, sabuasal, niosHD, jrtc27, zzheng, edward-jones, mgrang, atanasyan, llvm-commits Differential Revision: https://reviews.llvm.org/D47422 llvm-svn: 333748	2018-06-01 13:21:33 +00:00
Amara Emerson	5a3bb68e12	[AArch64][GlobalISel] Zero-extend s1 values when returning. Before we were relying on the any extend of the s1 to s32, but for AAPCS we need to zero-extend it to at least s8. Fixes PR36719 Differential Revision: https://reviews.llvm.org/D47425 llvm-svn: 333747	2018-06-01 13:20:32 +00:00
Florian Hahn	8a17f1f43e	Revert r333740: [IPSCCP] Use PredicateInfo to propagate facts from cmp. This is breaking the clang-with-thin-lto-ubuntu bot. llvm-svn: 333745	2018-06-01 12:58:43 +00:00
Sander de Smalen	f95ea047e5	[AArch64][SVE] Asm: Support for FDUP_ZI (copy fp immediate) instruction. Unpredicated copy of floating-point immediate value into SVE vector, along with MOV-aliases. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D47482 llvm-svn: 333744	2018-06-01 12:54:46 +00:00
Simon Dardis	351aa594f6	[mips] Guard more aliases correctly. Also, duplicate an alias for microMIPS. llvm-svn: 333741	2018-06-01 10:57:13 +00:00
Florian Hahn	f4df554f32	Recommit r333268: [IPSCCP] Use PredicateInfo to propagate facts from cmp instructions. This patch updates IPSCCP to use PredicateInfo to propagate facts to true branches predicated by EQ and to false branches predicated by NE. As a follow up, we should be able to extend it to also propagate additional facts about nonnull. Reviewers: davide, mssimpso, dberlin, efriedma Reviewed By: davide, dberlin Differential Revision: https://reviews.llvm.org/D45330 llvm-svn: 333740	2018-06-01 10:48:54 +00:00
Simon Dardis	54217598b6	[mips] Guard 'nop' properly and add mips16's nop instruction Reviewers: smaksimovic, atanasyan, abeserminji Differential Revision: https://reviews.llvm.org/D47583 llvm-svn: 333739	2018-06-01 10:46:00 +00:00
Pavel Labath	d6ca063907	DWARFAcceleratorTable: Add an iterator-based api for accessing names in the index Summary: Back when we were introducing the DWARF v5 name index, there was a short discussion whether we shouldn't have a nicer api for iterating over the index. At that time, I did not find it necessary since the iteration over names was done only from within the index itself (and I figured the internal implementation can deal with a slightly rough interface). However, now I ran into a use for this kind of API in LLDB (for finding all names matching a regular expression), so it looked like a nice opportunity to introduce one. To make the API more useful, I've made the NameTableEntry class a bit smarter: it now stores the string section reference (so it can return its name) and its position in the name index (mainly useful for dumping/logging). I also convert the internal users to use the new API, which also gives test coverage for the added code. Reviewers: JDevlieghere, aprantl, dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47590 llvm-svn: 333738	2018-06-01 10:33:11 +00:00
Simon Dardis	ee67dcb837	[mips] Select the correct instruction for computing frameindexes Reviewers: smaksimovic, atanasyan, abeserminji Differential Revision: https://reviews.llvm.org/D47582 llvm-svn: 333736	2018-06-01 10:07:10 +00:00
Gabor Buella	27c96d3d20	NFC Avoid a warning in WasmEHPrepare.cpp ``` ../lib/CodeGen/WasmEHPrepare.cpp:166:30: warning: extra ‘;’ [-Wpedantic] false, false); ^ ``` llvm-svn: 333732	2018-06-01 07:47:46 +00:00
Sander de Smalen	97ca6b9e09	[AArch64][SVE] Asm: Support for DUPM (masked immediate) instruction. Unpredicated copy of repeating immediate pattern to SVE vector, along with MOV-aliases. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D47328 llvm-svn: 333731	2018-06-01 07:25:46 +00:00
Craig Topper	c3cf55b935	[X86][Disassembler] Make it an error to set EVEX.R' to 0 when modrm.reg encodes a GPR. This is different than the behavior of EVEX.X extending modrm.rm to 5 bits. llvm-svn: 333728	2018-06-01 06:11:29 +00:00
Craig Topper	0838c4d6bc	[X86][Disassembler] Ignore EVEX.X extension of modrm.rm to 5-bits when modrm.rm encodes a k-register. llvm-svn: 333727	2018-06-01 05:36:08 +00:00
Craig Topper	74a61b02e0	[X86][Disassembler] Clamp index to 4-bits when decoding GPR registers. A 5-bit value can occur when EVEX.X is 0 due to it being used to extend modrm.rm to encode XMM16-31. But if modrm.rm instead encodes a GPR, the Intel documentation says EVEX.X should be ignored so just mask it to 4 bits once we know it's a GPR. llvm-svn: 333725	2018-06-01 05:12:44 +00:00
Craig Topper	5b1dd01e57	[X86][Disassembler] Make sure EVEX.X is not used to extend base registers of memory operations. This was an accidental side effect of EVEX.X being used to encode XMM16-XMM31 using modrm.rm with modrm.mod==0x3. I think there are still more bugs related to this. llvm-svn: 333722	2018-06-01 04:29:34 +00:00
Craig Topper	c6b2c2bb70	[X86][Disassembler] Use a local variable instead of using a field in the instruction object. NFC llvm-svn: 333721	2018-06-01 04:29:30 +00:00
Tom Stellard	e43778895c	AMDGPU/R600: Move intrinsics to IntrinsicsAMDGPU.td Reviewers: arsenm, nhaehnle, jvesely Reviewed By: arsenm Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D47487 llvm-svn: 333720	2018-06-01 02:19:46 +00:00
Craig Topper	dc5ba1e495	[X86] Make sure the check for VEX.vvvv being all ones on instructions that don't use it doesn't ignore a bit in 32-bit mode. llvm-svn: 333717	2018-06-01 01:23:52 +00:00
Craig Topper	0179c6d0e5	[X86][Disassembler] Suppress reading of EVEX.V' and EVEX.R' in 32-bit mode. llvm-svn: 333714	2018-06-01 00:10:36 +00:00
Heejin Ahn	d69acf3b4c	Change ambiguous uses of term 'funclet' to 'EH scopes'. NFC. Summary: `getEHScopeMembership()` function is used not only for funclet-based EHs; they apply to all EH schemes that use the scoped IR (catchpad/cleanuppad/...). D47005 (rL333045) changed some of the uses of the term 'funclet' to 'EH scopes' in case they apply to all scoped EH, and this fixes more of them. For `FuncletLayout` pass, I left it as is because the pass is only used for funclet-based EH. Reviewers: majnemer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47611 llvm-svn: 333711	2018-06-01 00:03:21 +00:00
Dan Gohman	91ab25bbe3	[WebAssembly] Update to the new names for the memory intrinsics. The WebAssembly committee has decided on the names `memory.size` and `memory.grow` for the memory intrinsics, so update the LLVM intrinsics to follow those names, keeping both sets of old names in place for compatibility. llvm-svn: 333708	2018-05-31 22:35:25 +00:00
Dan Gohman	b17de645ea	[WebAssembly] Fix the signatures for the __mulo* libcalls. The __mulo* libcalls have an extra i32* to return the overflow value. Fixes PR37401. llvm-svn: 333706	2018-05-31 22:27:24 +00:00
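
The record layout described in the `.cg_profile` commit above (ae6eeaea92) can be illustrated with a small sketch. This is not LLVM's implementation: the helper names `pack_cg_profile` and `unpack_cg_profile` are hypothetical, the symbol indices in the usage note are made up, and little-endian byte order is an assumption (a real ELF file uses the byte order of its target). Only the (uint32_t, uint32_t, uint64_t) tuple shape comes from the commit message.

```python
import struct

# One record: (from symbol index, to symbol index, weight) as
# (uint32_t, uint32_t, uint64_t). "<" is an ASSUMPTION: little-endian,
# no padding. A real ELF file follows the target's byte order.
RECORD = struct.Struct("<IIQ")


def pack_cg_profile(edges):
    """Serialize (from_index, to_index, weight) tuples into section bytes."""
    return b"".join(RECORD.pack(src, dst, weight) for src, dst, weight in edges)


def unpack_cg_profile(data):
    """Parse section bytes back into a list of (from, to, weight) tuples."""
    if len(data) % RECORD.size:
        raise ValueError("section size is not a multiple of the record size")
    return [
        RECORD.unpack_from(data, offset)
        for offset in range(0, len(data), RECORD.size)
    ]
```

For example, if the symbols `a`, `b`, and `freq` from the commit message happened to have symbol table indices 1, 2, and 3 (hypothetical values), the three directives would correspond to `pack_cg_profile([(1, 2, 32), (3, 1, 11), (3, 2, 20)])`, which yields three 16-byte records.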

113775 Commits