llvm-project

Commit Graph

Author	SHA1	Message	Date
Johannes Doerfert	791c9f1145	[Attributor] Fix TODO to avoid recomputation of results The helpers AAReturnedFromReturnedValues and AACallSiteReturnedFromReturned are useful not only to avoid code duplication but also to avoid recomputation of results. If we have N call sites we should not recompute the function return information N times but once. These are mostly straightforward usages with some minor improvements on the helpers and addition of a new one (IRPosition::getAssociatedType) that knows about function return types.	2020-01-29 19:24:34 -06:00
Nico Weber	442d8e7a91	[gn build] add a FIXME about using /Gw on win	2020-01-29 19:12:08 -05:00
Gabor Horvath	31ae0165c3	[LTO] Add optimization remarks for removed functions This only works with regular LTO for now. Differential Revision: https://reviews.llvm.org/D73597	2020-01-29 15:53:51 -08:00
Craig Topper	35625464c6	[X86] Fix the cost model for v16i16->v16i32 zero_extend/sign_extend with AVX2 We seem to be inheriting the cost from sse4.1. But if we have 256-bit registers we should be able to do this with just one extract to split the 16i16 and two v8i16->v8i32 operations so our cost should be 3 not 4. Differential Revision: https://reviews.llvm.org/D73646	2020-01-29 15:52:10 -08:00
Matt Arsenault	c5fffa4da3	GlobalISel: Add observer argument to legalizeIntrinsic This is passed to legalizeCustom, but not intrinsic. Also remove the MRI argument, since you can get that from the MachineIRBuilder. I'm not sure why MachineIRBuilder has a private observer member, and this is passed separately.	2020-01-29 18:33:45 -05:00
Matt Arsenault	7f3280ecdd	AMDGPU/GlobalISel: Select permlane16/permlanex16	2020-01-29 17:55:31 -05:00
Yuanfang Chen	43d9f2d1e8	[opt viewer] Python compat - decode/encode string Summary: Use io.open instead of codecs.open according to here https://stackoverflow.com/questions/10971033/backporting-python-3-openencoding-utf-8-to-python-2 Add `u` prefix to string literal to make them utf-8 in python2. Reviewers: anemet, serge-sans-paille Reviewed by: serge-sans-paille Differential Revision: https://reviews.llvm.org/D73011	2020-01-29 14:49:24 -08:00
Jonas Devlieghere	d88a5c3987	[SmallString] Remove StringRef indirection for std::string conversion. There's no need to go through StringRef to convert a SmallString to a std::string, the conversion operator can create a std::string directly. Differential revision: https://reviews.llvm.org/D73640	2020-01-29 13:49:56 -08:00
Cameron McInally	4f2e2acc4b	[NFC][AArch64][SVE] Rename Destructive enumerator from DestructiveInstType Rename Destructive enumerator in preparation for a larger set of patches to support prefixing destructive oeprations with MOVPRFX. Differential Revision: https://reviews.llvm.org/D73212	2020-01-29 15:42:26 -06:00
Shoaib Meenai	0423ddfb81	[build] Fix LLVM_ENABLE_RUNTIMES override condition I forgot to add parentheses in `fa44d72b9e`, though I prefer the expanded form anyway.	2020-01-29 13:41:31 -08:00
Amara Emerson	c12f046eb9	[GlobalISel] Add new combine to convert scalar G_MUL to G_SHL. For pow2 constants we should use G_SHL for pattern matching (and perf) purposes later. Vector support not yet implemented. Differential Revision: https://reviews.llvm.org/D73659	2020-01-29 13:39:00 -08:00
LLVM GN Syncbot	e8e6e13176	[gn build] Port `5ea83eef4d`	2020-01-29 21:19:26 +00:00
Derek Schuff	5ea83eef4d	Revert "[llvm-objcopy] Initial support for wasm in llvm-objcopy" This reverts commit `a928d127a5`. It seems to cause issues with big-endian architectures.	2020-01-29 13:12:56 -08:00
Jessica Paquette	050cd443ca	[AArch64][GlobalISel] Fix TBNZ/TBZ opcode selection When the bit is <= 32, we have to use the W register variant for TB(N)Z. This is because of the way the instruction is encoded. Differential Revision: https://reviews.llvm.org/D73660	2020-01-29 13:11:18 -08:00
LLVM GN Syncbot	363289b542	[gn build] Port `24962ced81`	2020-01-29 21:06:15 +00:00
Hiroshi Yamauchi	24962ced81	[Loads] Handle simple cases with same base pointer with constant offsets in FindAvailableLoadedValue when AA is null. Summary: This will help with devirtualization (store forwarding with vtable pointers in the presence of other stores into members in the constructor.) During inlining, we don't have AA. Reviewers: davidxl Subscribers: mgorny, Prazek, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71307	2020-01-29 13:05:46 -08:00
Cameron McInally	00c2249910	[NFCI][AArch64][SVE] Set default DestructiveInstType in AArch64Inst class Some housekeeping for the DestructiveInstType enum before a larger set of patches to support prefixing destructive oeprations with MOVPRFX. Differential Revision: https://reviews.llvm.org/D73141	2020-01-29 15:00:19 -06:00
Victor Huang	1492b70a03	[PowerPC][Future] Add prefixed loads and stores for future CPU A previous patch should have added pld and pstd and any support code in the backend that is required for prefixed load and store type operations. This patch adds a number of additional prefixed load and store type instructions for the future CPU. Differential Revision: https://reviews.llvm.org/D72577	2020-01-29 14:45:56 -06:00
Sanjay Patel	89195638bf	[InstCombine] add splat binop tests; NFC	2020-01-29 15:38:03 -05:00
Matt Arsenault	d3cea95475	AMDGPU/GlobalISel: Fix tests in release build Irritatingly the failure output is different in release vs. debug because of the legality check is removed without asserts, so a register ends up constrained only in release builds.	2020-01-29 12:27:16 -08:00
Sterling Augustine	c64b56617d	Print discriminators when printing .debug_line in GNU style. Summary: gnu addr2line prints DWARF line table discriminators like so: <file>:<line> (discriminator <Number>) This matches that behavior. Document how and when --output-style=GNU prints discriminators Add test for new GNU-style discriminator printing. Reviewers: rupprecht, labath, jhenderson Subscribers: aprantl, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73318	2020-01-29 12:22:12 -08:00
Shoaib Meenai	fa44d72b9e	[build] Fix runtimes build after `2e745ba6b0` I missed the NOT in the condition; this part is actually responsible for passing LLVM_ENABLE_RUNTIMES to the per-target runtime configures, which in turn makes them actually build. I'll put up a more general solution for review, but restore this in the meantime to fix the runtimes build.	2020-01-29 12:16:40 -08:00
Nikita Popov	e086e23024	[InstCombine] Support non-splat vectors in icmp eq + add/sub fold For the icmp eq (add X, C1), C2 => icmp eq X, C2-C1 icmp eq (sub C1, X), C2 => icmp eq X, C1-C2 folds, this allows C1 to be non-splat and contain undefs. C2 is still splat, due to the structure of the code. This is to address the remaining part of the regression in D73411, where demanded element analysis replaces some elements with undef. Differential Revision: https://reviews.llvm.org/D73647	2020-01-29 20:56:58 +01:00
Nikita Popov	5171587a5f	[InstCombine] Add undef/non-splat tests for add/sub + icmp eq; NFC	2020-01-29 20:56:58 +01:00
Amara Emerson	0da937bb5c	[GlobalISel][IRTranslator] Follow convention and put constant offset of getelementptr arithmetic on RHS. We were needlessly putting known constant values on the LHS of a G_MUL, which is suboptimal. Differential Revision: https://reviews.llvm.org/D73650	2020-01-29 11:37:19 -08:00
Nico Weber	b998d481da	attempt to fix symbolize-paths.s everywhere after cd68f4	2020-01-29 14:26:50 -05:00
Nico Weber	cd68f4beaa	attempt to fix symbolize-paths.s on windows	2020-01-29 14:23:00 -05:00
Huihui Zhang	8f6761aa41	Revert "[AArch64] Fix data race on RegisterBank initialization." Buildbot failure, revert first while looking at the issue. This reverts commit `a5a4a47d69`.	2020-01-29 11:17:19 -08:00
Huihui Zhang	af620fc36a	Revert "[AMDGPU] Fix data race on RegisterBank initialization." There looks to be buildbot failure related. This reverts commit `8bb6c8a22a`.	2020-01-29 11:16:27 -08:00
Huihui Zhang	2ec954579a	Revert "[ARM] Fix data race on RegisterBank initialization." There looks to be buildbot failure related. This reverts commit `91618d940e`.	2020-01-29 11:15:27 -08:00
Fangrui Song	8903e61b66	[AsmPrinter][ELF] Define local aliases (.Lfoo$local) for GlobalObjects For `MC_GlobalAddress` operands referencing certain GlobalObjects, we can lower them to STB_LOCAL aliases to avoid costs brought by assembler/linker's conservative decisions about symbol interposition: * An assembler conservatively assumes a global default visibility symbol interposable (ELF semantics). So relocations in object files are needed even if the code generator assumed the definition exact and non-interposable. * The relocations can cause the creation of PLT entries on some targets for -shared links. A linker conservatively assumes a global default visibility symbol interposable (if not otherwise constrained by -Bsymbolic/--dynamic-list/VER_NDX_LOCAL/etc). "certain" refers to GlobalObjects in the intersection of `hasExactDefinition() and !isInterposable()`: `external`, `appending`, `internal`, `private`. Local linkages (`internal` and `private`) cannot be interposed. `appending` is for very few objects LLVM interpret specially. So the set just includes `external`. This patch emits STB_LOCAL aliases (.Lfoo$local) for such GlobalObjects, so that targets can lower MC_GlobalAddress operands to STB_LOCAL aliases if applicable. We may extend the scope and include GlobalAlias in the future. LLVM's existing -fno-semantic-interposition behaviors give us license to do such optimizations: * Various optimizations (ipconstprop, inliner, sccp, sroa, etc) treat normal ExternalLinkage GlobalObjects as non-interposable. * Before D72197, MC resolved a PC-relative VK_None fixup to a non-local symbol at assembly time (no outstanding relocation), if the target is defined in the same section. Put it simply, even if IR optimizations failed to optimize and allowed interposition for the function call in `void foo() {} void bar() { foo(); }`, the assembler would disallow it. This patch sets up AsmPrinter infrastructure to make -fno-semantic-interposition more so. With and without the patch, the object file output should be identical: `.Lfoo$local` does not take a symbol table entry. Reviewed By: sfertile Differential Revision: https://reviews.llvm.org/D73228	2020-01-29 10:58:43 -08:00
Sterling Augustine	0758ac4e0c	Handle non-absolute include dirs properly for both dwarf4 and dwarf5. Summary: Add test case for the same. This test case will also serve as a starting point for later symbolizer tests. Reviewers: dblaikie, jdoerfert Subscribers: hiraditya, llvm-commits, jhenderson Tags: #llvm Differential Revision: https://reviews.llvm.org/D73583	2020-01-29 10:51:51 -08:00
Simon Pilgrim	f7245ef897	[DAGCombiner] ISD::SHL/SRA/SRL - use general SelectionDAG::FoldConstantArithmetic This handles all the constant splat / opaque testing for us.	2020-01-29 18:49:42 +00:00
Huihui Zhang	d2e2fc450e	[ConstantFold][SVE] Fix constant folding for scalable vector binary operations. Summary: Scalable vector should not be evaluated element by element. Add support to handle scalable vector UndefValue. Reviewers: sdesmalen, huntergr, spatel, lebedev.ri, apazos, efriedma, willlovett Reviewed By: efriedma Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71445	2020-01-29 10:49:08 -08:00
Austin Kerbow	2605adb69c	[AMDGPU][GlobalISel] Select 8-byte LDS Ops with 4-byte alignment Reviewers: arsenm Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, rovka, dstuttard, tpr, t-tye, hiraditya, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73585	2020-01-29 10:42:12 -08:00
Adrian Prantl	18dbe1b279	Run clang-format on DwarfExpression (NFC)	2020-01-29 10:23:12 -08:00
Adrian Prantl	816ee8a423	DwarfExpression: Factor out getOrCreateBaseType() (NFC)	2020-01-29 10:23:12 -08:00
Jonas Devlieghere	d7049213d0	[SmallString] Add explicit conversion to std::string With the conversion between StringRef and std::string now being explicit, converting SmallStrings becomes more tedious. This patch adds an explicit operator so you can write std::string(Str) instead of Str.str().str(). Differential revision: https://reviews.llvm.org/D73640	2020-01-29 10:17:10 -08:00
Huihui Zhang	91618d940e	[ARM] Fix data race on RegisterBank initialization. Summary: The initialization of RegisterBank needs to be done only once. The logic of AlreadyInit has data race, use llvm::call_once instead. This is continuing work of D73587. Reviewers: arsenm, rovka, dsanders, t.p.northover, efriedma, apazos Reviewed By: arsenm Subscribers: wdng, kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73605	2020-01-29 10:15:37 -08:00
Huihui Zhang	8bb6c8a22a	[AMDGPU] Fix data race on RegisterBank initialization. Summary: The initialization of RegisterBank needs to be done only once. The logic of AlreadyInit has data race, use llvm::call_once instead. This is continuing work of D73587. Reviewers: arsenm, tstellar, ronlieb, efriedma, apazos, nhaehnle Reviewed By: nhaehnle Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73604	2020-01-29 10:14:40 -08:00
Huihui Zhang	a5a4a47d69	[AArch64] Fix data race on RegisterBank initialization. Summary: The initialization of RegisterBank needs to be done only once. The logic of AlreadyInit has a data race, use llvm::call_once instead. This issue was identified through thread sanitizer. Reviewers: efriedma, apazos, qcolombet, dsanders Reviewed By: efriedma Subscribers: arsenm, kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73587	2020-01-29 10:12:52 -08:00
Adrian Prantl	aa6ec19c5f	Add dwarfdump support for DW_OP_regval_type. Differential Revision: https://reviews.llvm.org/D73598	2020-01-29 10:02:23 -08:00
Simon Pilgrim	25b8e96388	[DAGCombiner] ISD::MUL - use general SelectionDAG::FoldConstantArithmetic This handles all the constant splat / opaque testing for us.	2020-01-29 17:26:22 +00:00
Nikita Popov	6a74641e72	[InstCombine] Regenerate test checks; NFC	2020-01-29 18:22:07 +01:00
Craig Topper	90c31b0f42	[X86] Custom lower ISD::FROUND with SSE4.1 to avoid a libcall. ISD::FROUND is defined to round to nearest with ties rounding away from 0. This mode isn't supported in hardware on X86. But as long as we aren't compiling with trapping math, we can emulate this with floor(X + copysign(nextafter(0.5, 0.0), X)). We have to use nextafter to avoid some corner cases that adding 0.5 would have. For example, if X is nextafter(0.5, 0.0) it should round to 0.0, but adding 0.5 would need one extra bit of mantissa than can be stored so it rounds to 1.0. Adding nextafter(0.5, 0.0) instead will just increase the exponent by 1 and leave the mantissa as all 1s. This would be nextafter(1.0, 0.0) which will floor to 0.0. Techically this requires -fno-trapping-math which isn't our default. But if we care about exceptions we should be using constrained intrinsics. Constrained intrinsics would use STRICT_FROUND which won't go through this code. Fixes PR42195. Differential Revision: https://reviews.llvm.org/D73607	2020-01-29 09:10:02 -08:00
Francesco Petrogalli	4bc07c332a	[llvm][docs] LangRef for IR attribute `vector-function-abi-variant`. Reviewers: jdoerfert, andwar, simoll, rengolin, hfinkel, xtian Reviewed By: jdoerfert Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72798	2020-01-29 17:03:05 +00:00
Jay Foad	d07a789579	[AMDGPU] Cluster FLAT instructions with both vaddr and saddr Reviewers: rampitec, arsenm Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73634	2020-01-29 17:01:35 +00:00
Simon Pilgrim	4b04e11735	[DAGCombiner] Sub/SUBSAT - use general SelectionDAG::FoldConstantArithmetic This handles all the constant splat / opaque testing for us.	2020-01-29 16:57:13 +00:00
Simon Pilgrim	48bd6a0986	[DAGCombiner] visitIMINMAX - use general SelectionDAG::FoldConstantArithmetic This handles all the constant splat / opaque testing for us instead of the ConstantSDNode variant where we have to do it ourselves.	2020-01-29 16:57:13 +00:00
Craig Topper	e5edd641fd	[X86] Use a shorter sequence to implement FLT_ROUNDS This code needs to map from the FPCW 2-bit encoding for rounding mode to the 2-bit encoding defined for FLT_ROUNDS. The previous implementation did some clever swapping of bits and adding 1 modulo 4 to do the mapping. This patch instead uses an 8-bit immediate as a lookup table of four 2-bit values. Then we use the 2-bit FPCW encoding to index the lookup table by using a right shift and an AND. This requires extracting the 2-bit value from FPCW and multipying it by 2 to make it usable as a shift amount. But still results in less code. Differential Revision: https://reviews.llvm.org/D73599	2020-01-29 08:56:33 -08:00

1 2 3 4 5 ...

190961 Commits