llvm-project

Commit Graph

Author	SHA1	Message	Date
Fraser Cormack	1e56e8717f	[RISCV] Pre-commit shuffle test cases for D99270	2021-03-25 10:41:40 +00:00
Fraser Cormack	321a71a772	[RISCV] Optimize BUILD_VECTOR sequences that reveal hidden splats This patch adds further optimization techniques to RVV BUILD_VECTOR lowering. It teaches the compiler to find splats of larger vector element types "hidden" in smaller ones. For example, a v4i8 build_vector (0x1, 0x2, 0x1, 0x2) could be splat as v2i16 0x0201. This is generally more optimal than the dominant-element BUILD_VECTORs and so takes priority. This optimization is currently limited to all-constant-or-undef BUILD_VECTORs as those were found to be the most common. There's no reason this couldn't be extended to other BUILD_VECTORs, but the additional bit-manipulation instructions may require more sophisticated heuristics. There are some cases where the materialization of the larger constant takes more scalar instructions than it does to build the vector with vector instructions. We could add heuristics to try and catch this. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D99195	2021-03-25 10:35:31 +00:00
Simon Pilgrim	36e3c6c841	[X86][AVX] Truncate vectors with PACKSS/PACKUS on AVX2 targets Until AVX512 we don't have any vector truncation instructions, and always lower using shuffles instead. combineVectorTruncation performs this earlier than lowering as it makes it easier to use any sign/zero-extended bits in the truncated bits with PACKSS/PACKUS to perform the shuffle. We currently don't attempt to use combineVectorTruncation on AVX2 targets as in the past 256-bit PACKSS/PACKUS tended to cause 128-bit lane shuffle regressions - but these should now be all resolved with combineHorizOpWithShuffle and in all cases we now reduce the amount of cross-lane shuffling and variable shuffle mask usage. Differential Revision: https://reviews.llvm.org/D96609	2021-03-25 10:34:34 +00:00
Simon Pilgrim	9fde88c3e2	[X86][AVX] splitIntVSETCC - handle separate (canonicalized) SETCC operands LowerVSETCC calls splitIntVSETCC after canonicalizing certain patterns, in particular (X & CPow2 != 0) -> (X & CPow2 == CPow2). Unfortunately if we're splitting for AVX1/non-AVX512BW cases, we lose these canonicalizations as we call the split with the original SetCC node, and when the split nodes are later lowered in LowerVSETCC the patterns are lost behind extract_subvector etc. But if we pass the canonicalized operands for splitting we retain the optimizations. Differential Revision: https://reviews.llvm.org/D99256	2021-03-25 10:18:44 +00:00
Krasimir Georgiev	d9abcdd9f4	[clang-format] Fix ObjC method indent after `f7f9f94b` Commit `f7f9f94b2e` changed the indent of ObjC method arguments from +4 to +2, if the method occurs after a block statement. I believe this was unintentional and there was insufficient ObjC test coverage to catch this. Example: `clang-format -style=google test.mm` before: ``` void aaaaaaaaaaaaaaaaaaaaa(int c) { if (c) { f(); } [dddddddddddddddddddddddddddddddddddddddddddddddddddddddd eeeeeeeeeeeeeeeeeeeeeeeeeeeee:^(fffffffffffffff gggggggg) { f(SSSSS, c); }]; } ``` after: ``` void aaaaaaaaaaaaaaaaaaaaa(int c) { if (c) { f(); } [dddddddddddddddddddddddddddddddddddddddddddddddddddddddd eeeeeeeeeeeeeeeeeeeeeeeeeeeee:^(fffffffffffffff gggggggg) { f(SSSSS, c); }]; } ``` Differential Revision: https://reviews.llvm.org/D99063	2021-03-25 10:52:08 +01:00
Raphael Isemann	d90b1230ea	[lldb] Fix TestVSCode.test_progress_events on Linux due to vdso This currently fails when we get the module for `[vdso]` which doesn't have any parsing event associated with it as it's just created from memory.	2021-03-25 10:48:58 +01:00
Kiran Chandramohan	502f27e66f	Trivial change to fix builds Pass the context while creating the Patternslist.	2021-03-25 09:31:54 +00:00
Vladislav Vinogradov	70b6f16e07	[mlir] Support MemRefType with multiple AffineMaps in getStridesAndOffset Compose multiple AffineMaps into single map before strides extraction. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D99166	2021-03-25 12:18:49 +03:00
Jean Perier	ffa455d4d4	[mlir] Translate global initializers after creating all LLVM IR globals In case an operation in a global initializer region refers to another global variable defined afterwards in the module of itself, translation to LLVM IR was currently crashing because it could not find the LLVM IR global when going through the initializer block. To solve this problem, split global conversion to LLVM IR into two passes. A first pass that creates LLVM IR global variables, and a second one that converts the initializer, if any, and adds it to the llvm global. Differential Revision: https://reviews.llvm.org/D99246	2021-03-25 09:53:58 +01:00
Petr Hosek	74ed5124ba	Revert "[libcxxabi] Use cxx-headers target to consume libcxx headers" This reverts commit `72728e1280` which broke libcxxabi tests under the runtimes build.	2021-03-25 01:50:11 -07:00
Martin Storsjö	b8b23aa80e	[libcxx] [test] Quote env variables that are set with a shell "export" in ssh.py This safeguards against cases if some of the env vars contain chars that are problematic for shells, e.g. if called with --env "X=Y;Z". (In cases of cross testing for windows, the PATH variable can end up specified with semicolon separators - even if specifying a PATH when cross testing in such differing environments might not make sense or do anything - but this makes ssh.py not break on such a variable.) Differential Revision: https://reviews.llvm.org/D99242	2021-03-25 09:46:44 +02:00
Martin Storsjö	a88556733a	[LLD] Fix probing a MSYS based 'tar' in a Windows Container Don't run the 'tar' tool in a cleared environment with only the LANG variable set, just set LANG on top of the existing environment. If the 'tar' tool is an MSYS based tool, running it in a Windows Container hangs if all environment variables are cleared - in particular, the USERPROFILE variable needs to be kept intact. This is the same issue fixed as was fixed in other places in `9de63b2e05`, but contrary to running the actual tests, running with an as-cleared-as-possible environment here is less important. Differential Revision: https://reviews.llvm.org/D99304	2021-03-25 09:45:27 +02:00
Craig Topper	32f6a15dfd	[RISCV] Add more tests that can be improved by D99042.	2021-03-25 00:02:42 -07:00
Yolanda Chen	4f9c61ef72	[lld] add context-sensitive PGO options for COFF. Add lld CSPGO (Contex-Sensitive PGO) options for COFF target. Reference the ELF options from https://reviews.llvm.org/D56675 Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D98763	2021-03-24 23:40:09 -07:00
Christopher Di Bella	3f143a10cc	[libcxx] updates regular_invocable test to actually test regular_invocable The test wasn't previously testing this concept, but its base. Differential Revision: https://reviews.llvm.org/D99306	2021-03-25 06:33:48 +00:00
Chuanqi Xu	20b4f484d1	[Driver] Add -fno-split-stack Summary: Add -fno-split-stack and rename CC1 option from `-split-stacks` to `-fsplit-stack`. Test Plan: check-all Differential Revision: https://reviews.llvm.org/D99245	2021-03-25 14:18:28 +08:00
Amara Emerson	0d2c4db637	[GlobalISel] Fix crash in RBS with a non-generic IMPLICIT_DEF. This may occur when swifterror codegen in the translator generates these, but we shouldn't try to handle them since they should have regclasses anyway. rdar://75784009 Differential Revision: https://reviews.llvm.org/D99287	2021-03-24 23:08:51 -07:00
Serge Pavlov	ddb0bcbdff	Add missing cases in RISCVMCExpr::getVariantKindName Differential Revision: https://reviews.llvm.org/D98929	2021-03-25 12:57:05 +07:00
Craig Topper	c8cf8bc7ec	[RISCV] Add some 32-bit ctlz and cttz idiom tests to rv64zbb.ll. NFC This implements various idioms using ctlz/cttz like Log2, Log2_Ceil, findFirstSetBit, etc. Some of these demonstrate that we fail to use clzw because the idiom breaks the isel patterns we use. The isel pattern we use is (add (cttz (and X, 0xffffffff)), -32). Some of the idioms cause the constant on the add to be different.	2021-03-24 21:52:48 -07:00
Mehdi Amini	973ddb7d6e	Define a `NoTerminator` traits that allows operations with a single block region to not provide a terminator In particular for Graph Regions, the terminator needs is just a historical artifact of the generalization of MLIR from CFG region. Operations like Module don't need a terminator, and before Module migrated to be an operation with region there wasn't any needed. To validate the feature, the ModuleOp is migrated to use this trait and the ModuleTerminator operation is deleted. This patch is likely to break clients, if you're in this case: - you may iterate on a ModuleOp with `getBody()->without_terminator()`, the solution is simple: just remove the ->without_terminator! - you created a builder with `Builder::atBlockTerminator(module_body)`, just use `Builder::atBlockEnd(module_body)` instead. - you were handling ModuleTerminator: it isn't needed anymore. - for generic code, a `Block::mayNotHaveTerminator()` may be used. Differential Revision: https://reviews.llvm.org/D98468	2021-03-25 03:59:03 +00:00
Craig Topper	0f99c6c56e	[RISCV] Remove duplicate DebugLoc variables from cases in ReplaceNodeResults. NFC We already created a DebugLoc at the top of the function. We can just use that one.	2021-03-24 20:23:03 -07:00
Fred Riss	c4fee95746	[lldb/ObjC] Make the NonPointerIsaCache initialization lazy The objc_debug_isa_class_mask magic value that the objc runtime vends is now initialized using a static initializer instead of a constant value. The runtime plugin itself will be initialized before the value is computed and as a result, the cache will get the wrong value. Making the creation of the NonPointerIsaCache fully lazy fixes this.	2021-03-24 20:08:06 -07:00
Jonas Devlieghere	ff29fdfe4e	[lldb] Format AppleObjCRuntimeV2 (NFC)	2021-03-24 20:08:06 -07:00
Fangrui Song	927050af53	[Polly] Fix -Wunused-function in -DLLVM_ENABLE_ASSERTIONS=off builds	2021-03-24 19:56:43 -07:00
Jianzhou Zhao	af9f461298	[dfsan] test flush on only x86	2021-03-25 02:45:43 +00:00
Fangrui Song	cdd993fab3	[Driver] Use -dynamic-linker /lib/ld-musl-i386.so.1 for i?86-linux-musl Noticed by Khem Raj	2021-03-24 19:44:53 -07:00
Eric Schweitz	97d8972c9c	[flang][fir] Add the pre-code gen rewrite pass and codegen ops. Before the conversion to LLVM-IR dialect and ultimately LLVM IR, FIR is partially rewritten into a codegen form. This patch adds that pass, the fircg dialect, and the small set of Ops in the fircg (sub) dialect. Fircg is not part of the FIR dialect and should never be used outside of the (closed) conversion to LLVM IR. Authors: Eric Schweitz, Jean Perier, Rajan Walia, et.al. Differential Revision: https://reviews.llvm.org/D98063	2021-03-24 19:27:10 -07:00
Nathan Chancellor	ef58ae86ba	[RISCV] Fix mcount name GCC's name for this symbol is _mcount, which the Linux kernel expects in a few different place: $ echo 'int main(void) { return 0; }' \| riscv32-linux-gcc -c -pg -o tmp.o -x c - $ llvm-objdump -dr tmp.o \| grep mcount 0000000c: R_RISCV_CALL _mcount $ echo 'int main(void) { return 0; }' \| riscv64-linux-gcc -c -pg -o tmp.o -x c - $ llvm-objdump -dr tmp.o \| grep mcount 000000000000000c: R_RISCV_CALL _mcount $ echo 'int main(void) { return 0; }' \| clang -c -pg -o tmp.o --target=riscv32-linux-gnu -x c - $ llvm-objdump -dr tmp.o \| grep mcount 0000000a: R_RISCV_CALL_PLT mcount $ echo 'int main(void) { return 0; }' \| clang -c -pg -o tmp.o --target=riscv64-linux-gnu -x c - $ llvm-objdump -dr tmp.o \| grep mcount 000000000000000a: R_RISCV_CALL_PLT mcount Set MCountName to "_mcount" in RISCVTargetInfo then prevent it from getting overridden in certain OSTargetInfo constructors. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D98881 Signed-off-by: Nathan Chancellor <nathan@kernel.org>	2021-03-24 18:11:37 -07:00
Zequan Wu	dd388ba3e0	[llvm-cov] Check path emptyness in path-equivalence after removing dots.	2021-03-24 17:54:38 -07:00
Philip Reames	9a82f42d12	Plumb TLI through isSafeToExecuteUnconditionally [NFC] Split from D95815 to reduce patch size. Isn't (yet) used for anything, only the client side is wired up.	2021-03-24 17:52:04 -07:00
Giorgis Georgakoudis	7ad55a3df5	[Utils][NFC] Fix regex substitution for update test checks Relates to: https://reviews.llvm.org/D97107	2021-03-24 17:47:33 -07:00
Rob Suderman	f5ba3eea67	[mlir][tosa] Add tosa.bitwise_not lowering to constant and xor Lowering of bitwise_not to linalg dialect using a xor operation with a constant of all-bits-one. Differential Revision: https://reviews.llvm.org/D99221	2021-03-24 17:27:27 -07:00
Jianzhou Zhao	f9a135b652	[dfsan] Test dfsan_flush with origins This is a part of https://reviews.llvm.org/D95835. Reviewed By: morehouse Differential Revision: https://reviews.llvm.org/D99295	2021-03-25 00:12:53 +00:00
Philip Reames	4054b8322f	[deref] Implement initial set of inference rules for deref-at-point This implements a subset of the initial set of inference rules proposed in the llvm-dev thread "RFC: Decomposing deref(N) into deref(N) + nofree". The nolias one got moved to a separate review as there was some concerns raised which require further discussion. Differential Revision: https://reviews.llvm.org/D99135	2021-03-24 16:20:41 -07:00
Matt Morehouse	c8ef98e5de	Revert "[HWASan] Use page aliasing on x86_64." This reverts commit `63f73c3eb9` due to breakage on aarch64 without TBI.	2021-03-24 16:18:29 -07:00
Wenlei He	6869e6c1e7	[InlineCost] Make cost-benefit decision explicit With cost-benefit analysis for inlining, we bypass the cost-threshold by returning inline result from call analyzer early. However the cost and threshold are still available from call analyzer, and when cost is actually higher than threshold, we incorrect set the reason. The change makes the decision from cost-benefit analysis explicit. It's mostly NFC, except that it allows the priority-based sample loader inliner used by CSSPGO to use cost-benefit heuristic. Differential Revision: https://reviews.llvm.org/D99302	2021-03-24 16:10:58 -07:00
Yuanfang Chen	217f0f735a	[Clang][Sema] Implement GCC -Wcast-function-type ``` Warn when a function pointer is cast to an incompatible function pointer. In a cast involving function types with a variable argument list only the types of initial arguments that are provided are considered. Any parameter of pointer-type matches any other pointer-type. Any benign differences in integral types are ignored, like int vs. long on ILP32 targets. Likewise type qualifiers are ignored. The function type void (*) (void) is special and matches everything, which can be used to suppress this warning. In a cast involving pointer to member types this warning warns whenever the type cast is changing the pointer to member type. This warning is enabled by -Wextra. ``` Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D97831	2021-03-24 16:04:18 -07:00
Kazu Hirata	ef69aa961d	[InlineCost] Enable the cost benefit analysis on FDO This patch enables the cost-benefit-analysis-based inliner by default if we have instrumentation profile. - SPEC CPU 2017 shows a 0.4% improvement. - An internal large benchmark shows a 0.9% reduction in the cycle count along with 14.6% reduction in the number of call instructions executed. Differential Revision: https://reviews.llvm.org/D98213	2021-03-24 15:36:49 -07:00
jasonliu	09a84d3047	[libc++] Match declaration for non-member function std::swap(std::packaged_task) with what standard specify Standard specifies: ``` template<class R, class... ArgTypes> void swap(packaged_task<R(ArgTypes...)>& x, packaged_task<R(ArgTypes...)>& y) noexcept; ``` Differential Revision: https://reviews.llvm.org/D99102	2021-03-24 22:33:32 +00:00
Fangrui Song	35dd6470de	[Driver] Bring back "Clean up Debian multiarch /usr/include/<triplet> madness" This reverts commit `aae84b8e39`. The chromium goma folks want to use a Debian sysroot without lib/x86_64-linux-gnu to perform `clang -c` but no link action. The previous commit has removed D.getVFS().exists check to make such usage work.	2021-03-24 15:25:37 -07:00
Fangrui Song	bfbfd83f14	[Driver] Linux.cpp: delete unneeded D.getVFS().exists checks Not only can this save unneeded filesystem stats, it can make `clang --sysroot=/path/to/debian-sysroot -c a.cc` work (get `-internal-isystem $sysroot/usr/include/x86_64-linux-gnu`) even without `lib/x86_64-linux-gnu/`. This should make thakis happy.	2021-03-24 15:25:36 -07:00
Lei Zhang	19435d3863	[mlir][linalg] Fold fill -> tensor_reshape chain For such op chains, we can create new linalg.fill ops with the result type of the linalg.tensor_reshape op. Differential Revision: https://reviews.llvm.org/D99116	2021-03-24 18:17:58 -04:00
Lei Zhang	c241e1c2f5	[mlir][linalg] Support dropping unit dimensions for init tensors init tensor operands also has indexing map and generally follow the same constraints we expect for non-init-tensor operands. Differential Revision: https://reviews.llvm.org/D99115	2021-03-24 18:17:58 -04:00
Lei Zhang	7f28d27cb6	[mlir][linalg] Allow controlling folding unit dim reshapes This commit exposes an option to the pattern FoldWithProducerReshapeOpByExpansion to allow folding unit dim reshapes. This gives callers more fine-grained controls. Differential Revision: https://reviews.llvm.org/D99114	2021-03-24 18:17:57 -04:00
Lei Zhang	f66120a357	[mlir][affine] Add canonicalization to merge affine min/max ops This identifies a pattern where the producer affine min/max op is bound to a dimension/symbol that is used as a standalone expression in the consumer affine op's map. In that case the producer affine min/max op can be merged into its consumer. For example, a pattern like the following: ``` %0 = affine.min affine_map<()[s0] -> (s0 + 16, s0 * 8)> ()[%sym1] %1 = affine.min affine_map<(d0)[s0] -> (s0 + 4, d0)> (%0)[%sym2] ``` Can be turned into: ``` %1 = affine.min affine_map< ()[s0, s1] -> (s0 + 4, s1 + 16, s1 * 8)> ()[%sym2, %sym1] ``` Differential Revision: https://reviews.llvm.org/D99016	2021-03-24 18:17:57 -04:00
Lei Zhang	23fd26608c	[mlir][affine] Deduplicate affine min/max op expressions If there are multiple identical expressions in an affine min/max op's map, we can just keep one. Differential Revision: https://reviews.llvm.org/D99015	2021-03-24 18:17:57 -04:00
Lei Zhang	e58597ee1c	[mlir][linalg] Fuse producers with non-permutation indexing maps Until now Linalg fusion only allow fusing producers whose operands are all permutation indexing maps. It's easier to deduce the subtensor/subview but it is an unnecessary constraint, as in tiling we have more advanced logic to deduce the subranges even when the operand is not of permutation indexing maps, e.g., the input operand for convolution ops. This patch uses the logic on tiling side to deduce subranges for fusion. This enables fusing convolution with its consumer ops when possible. Along the way, we are now generating proper affine.min ops to guard against size boundaries, if we cannot be certain they won't be out of bounds. Differential Revision: https://reviews.llvm.org/D99014	2021-03-24 18:17:57 -04:00
Lei Zhang	ddf93abf49	[mlir][linalg] NFC: Move makeTiledShapes into Utils.{h\|cpp} This is a preparation step to reuse makeTiledShapes in tensor fusion. Along the way, did some lightweight cleanups. Differential Revision: https://reviews.llvm.org/D99013	2021-03-24 18:17:57 -04:00
jasonliu	158026301b	[libc++][AIX] Initial patch to unblock the libc++ build on AIX This path would unblock the build of libc++ library on AIX: 1. Add _AIX guard for _LIBCPP_HAS_THREAD_API_PTHREAD 2. Use uselocale to actually take the locale setting into account. 3. extract_mtime and extract_atime mod needed for AIX. As stat structure on AIX uses internal structure st_timespec to store time for binary compatibility reason. So we need to convert it back to timespec here. 4. Do not build cxa_thread_atexit.cpp for libcxxabi on AIX. Differential Revision: https://reviews.llvm.org/D97558	2021-03-24 22:13:20 +00:00
Sanjay Patel	adf42dff42	[ValueTracking] peek through min/max to find isKnownToBeAPowerOfTwo This is similar to the select logic just ahead of the new code. Min/max choose exactly one value from the inputs, so if both of those are a power-of-2, then the result must be a power-of-2. This might help with D98152, but we likely still need other pieces of the puzzle to avoid regressions. The change in PatternMatch.h is needed to build with clang. It's possible there is a better way to deal with the 'const' incompatibities. Differential Revision: https://reviews.llvm.org/D99276	2021-03-24 17:54:38 -04:00

1 2 3 4 5 ...

383783 Commits All Branches Search

383783 Commits

All Branches