llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	6cb7dddaf4	[X86][AVX] Insert zeros byte elements into 256/512-bit vectors using shuffle/and Avoid extracting/inserting subvectors which makes it more difficult for shuffle combining to merge them together.	2021-03-12 15:16:36 +00:00
Simon Pilgrim	33dcdd414c	[X86] Provide lighter weight getTargetShuffleMask wrapper. NFCI. Most callers to getTargetShuffleMask don't use the IsUnary flag.	2021-03-12 15:16:35 +00:00
Nico Weber	08a5277a64	Revert "[IndirectCallPromotion] Don't strip ".__uniq." suffix when it strips" This reverts commit `90dfbeef59`. Causes PR49554. Also see comments on https://reviews.llvm.org/D98389	2021-03-12 10:03:58 -05:00
Simonas Kazlauskas	a2eca31da2	Test cases for rem-seteq fold with illegal types This also briefly tests a larger set of architectures than the more exhaustive functionality tests for AArch64 and x86. As requested in D88785 Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D98339	2021-03-12 16:28:04 +02:00
Matt Arsenault	6b76d82853	GlobalISel: Fix marking byval arguments as immutable byval arguments need to be assumed writable. Only implicitly stack passed arguments which aren't addressable in the IR can be assumed immutable. Mips is still broken since for some reason its doing its own thing with the ValueHandlers (and x86 doesn't actually handle byval arguments now, although some of the code is there).	2021-03-12 09:01:53 -05:00
Matt Arsenault	34471c3060	GlobalISel: Partially fix handling of byval arguments This was essentially ignoring byval and treating them as a pointer argument which needed to be loaded from. This should copy the frame index value to the virtual register, not insert a load from the frame index into the pointer value. For AMDGPU, this was producing a load from the byval pointer argument, to a pointer used for the byval arguments. I do not understand how AArch64 managed to work before since it appears to be similarly broken. We could also change the ValueHandler API to avoid the extra copy from the frame index, since currently it returns a new register. I believe there is still an issue with outgoing byval arguments. These should have a copy inserted in case the callee decided to overwrite the memory.	2021-03-12 09:01:53 -05:00
Matt Arsenault	d44a3dad99	AArch64/GlobalISel: Don't use common prefix in test Unlike update_llc_test_checks, update_mir_test_checks isn't actually smart enough to common functions with identical output.	2021-03-12 09:01:52 -05:00
Matt Arsenault	3231d2b581	AMDGPU/GlobalISel: Cleanup call lowering sequence Now that handleAssignments is handling all of the argument splitting, we don't have to move the insert point around.	2021-03-12 09:01:52 -05:00
serge-sans-paille	bc4a5bdce4	[NFC] Use StringRef instead of const char* for AsmPrinter This avoids calling strlen to repeatedly compute some string size.	2021-03-12 14:39:25 +01:00
Florian Hahn	8904a82fa7	[LV] Fix name in CHECK pattern after `fb3ca7076`	2021-03-12 13:31:48 +00:00
Florian Hahn	fb3ca70761	[LV] Account IV recipes being uniform in VPTransformState::get(). This patch fixes a crash when trying to get a scalar value using VPTransformState::get() for uniform induction values or truncated induction values. IVs and truncated IVs can be uniform and the updated code accounts for that, fixing the crash. This should fix https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=31981	2021-03-12 13:29:06 +00:00
Christian Sigg	1ef544d4a9	[mlir] Remove mlir-cuda-runner Change CUDA integration tests to use mlir-opt + mlir-cpu-runner instead. Depends On D98203 Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D98396	2021-03-12 14:06:43 +01:00
Sanjay Patel	bd197ed0a5	[SimplifyCFG] avoid sinking insts within an infinite-loop The test is reduced from a C source example in: https://llvm.org/PR49541 It's possible that the test could be reduced further or the predicate generalized further, but it seems to require a few ingredients (including the "late" SimplifyCFG options on the RUN line) to fall into the infinite-loop trap.	2021-03-12 08:04:57 -05:00
Stefan Gränitz	ff52121bba	[Orc] Fix race condition in DebugObjectManagerPlugin During finalization the debug object is registered with the target. Materialization must wait for this process to finish. Otherwise we might start running code before the debugger finished processing the corresponding debug info. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D98417	2021-03-12 14:04:09 +01:00
Valeriy Savchenko	6dc1523508	[analyzer][solver] Prevent infeasible states (PR49490) This patch fixes the situation when our knowledge of disequalities can help us figuring out that some assumption is infeasible, but the solver still produces a state with inconsistent constraints. Additionally, this patch adds a couple of assertions to catch this type of problems easier. Differential Revision: https://reviews.llvm.org/D98341	2021-03-12 15:56:48 +03:00
Hans Wennborg	f50aef745c	Revert "[InstrProfiling] Don't generate __llvm_profile_runtime_user" This broke the check-profile tests on Mac, see comment on the code review. > This is no longer needed, we can add __llvm_profile_runtime directly > to llvm.compiler.used or llvm.used to achieve the same effect. > > Differential Revision: https://reviews.llvm.org/D98325 This reverts commit `c7712087cb`. Also reverting the dependent follow-up commit: Revert "[InstrProfiling] Generate runtime hook for ELF platforms" > When using -fprofile-list to selectively apply instrumentation only > to certain files or functions, we may end up with a binary that doesn't > have any counters in the case where no files were selected. However, > because on Linux and Fuchsia, we pass -u__llvm_profile_runtime, the > runtime would still be pulled in and incur some non-trivial overhead, > especially in the case when the continuous or runtime counter relocation > mode is being used. A better way would be to pull in the profile runtime > only when needed by declaring the __llvm_profile_runtime symbol in the > translation unit only when needed. > > This approach was already used prior to `9a041a7522`, but we changed it > to always generate the __llvm_profile_runtime due to a TAPI limitation. > Since TAPI is only used on Mach-O platforms, we could use the early > emission of __llvm_profile_runtime there, and on other platforms we > could change back to the earlier approach where the symbol is generated > later only when needed. We can stop passing -u__llvm_profile_runtime to > the linker on Linux and Fuchsia since the generated undefined symbol in > each translation unit that needed it serves the same purpose. > > Differential Revision: https://reviews.llvm.org/D98061 This reverts commit `87fd09b25f`.	2021-03-12 13:53:46 +01:00
Aaron Ballman	e448310059	Add support for digit separators in C2x. WG14 adopted N2626 at the meetings this week. This commit adds support for using ' as a digit separator in a numeric literal which is compatible with the C++ feature.	2021-03-12 07:21:03 -05:00
Simon Pilgrim	f6524b4ada	[PPC] Fix UBSAN warning about out of range shift. NFCI.	2021-03-12 12:03:48 +00:00
Alex Richardson	e2cd2f7d08	[builtins] Fix value of ARM_INEXACT The existing value of 0x1000 sets the IXE bit (Inexact floating-point exception trap enable), but we really want to be setting IXC, bit 4: Inexact cumulative floating-point exception bit. This bit is set to 1 to indicate that the Inexact floating-point exception has occurred since 0 was last written to this bit. Reviewed By: kongyi, peter.smith Differential Revision: https://reviews.llvm.org/D98353	2021-03-12 11:15:24 +00:00
Simon Pilgrim	400952980f	[PPC] Fix static analyzer / UBSAN warnings about out of range shifts. NFCI.	2021-03-12 10:34:35 +00:00
Serguei Katkov	cfe8f8e0f0	Revert "Mark gc.relocate and gc.result as readnone" As readnone function they become movable and LICM can hoist them out of a loop. As a result in LCSSA form phi node of type token is created. No one is ready that GCRelocate first operand is phi node but expects to be token. GVN test were also updated, it seems it does not do what is expected. Test for LICM is also added. This reverts commit `f352463ade`.	2021-03-12 16:59:17 +07:00
David Spickett	44e36fc2b1	[libcxx] Move Linaro 32 bit armv bots to buildkite Instead of setting mcpu like the previous bots, set the target triple. Each config builds either Arm only or Thumb only code. This gives us some coverage of thumb specific issues. The new agents on Linaro's side are running on v8 hardware so will report arch "armv8l" just like the v8 bots. (and buildkite can choose any of them for v7/v8 jobs) Reviewed By: #libc, curdeius, Mordante Differential Revision: https://reviews.llvm.org/D98019	2021-03-12 09:47:07 +00:00
Florian Hahn	194861fa1b	[Matrix] Add missing newline to appease sphinx.	2021-03-12 09:33:36 +00:00
Fraser Cormack	641f5700f9	[RISCV] Optimize INSERT_VECTOR_ELT sequences This patch optimizes the codegen for INSERT_VECTOR_ELT in various ways. Primarily, it removes the use of vslidedown during lowering, and the vector element is inserted entirely using vslideup with a custom VL and slide index. Additionally, lowering of i64-element vectors on RV32 has been optimized in several ways. When the 64-bit value to insert is the same as the sign-extension of the lower 32-bits, the codegen can follow the regular path. When this is not possible, a new sequence of two i32 vslide1up instructions is used to get the vector element into a vector. This sequence was suggested by @craig.topper. From there, the value is slid into the final position for more consistent lowering across RV32 and RV64. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D98250	2021-03-12 09:13:38 +00:00
Fraser Cormack	4d2d5855c7	[RISCV] Fix up stale VECREDUCE comments. NFC. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D98399	2021-03-12 08:49:46 +00:00
Anton Zabaznov	840643bbe1	[OpenCL] Refactor diagnostic for OpenCL extension/feature There is no need to check for enabled pragma for core or optional core features, thus this check is removed Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D97058	2021-03-12 11:43:53 +03:00
Alex Zinenko	be5b844a35	[mlir] fix memory leak on failure path in parser Forward references to blocks lead to `Block`s being allocated in the parser, but they are not necessarily included into a region if parsing fails, leading to a leak. Clean them up in parser destructor. Reviewed By: rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D98403	2021-03-12 09:24:08 +01:00
Bjorn Pettersson	529c8e8dc6	[InstSimplify] Simplify smul.fix and smul.fix.sat Add simplification of smul.fix and smul.fix.sat according to X * 0 -> 0 X * undef -> 0 X * (1 << scale) -> X This includes the commuted patterns and splatted vectors. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D98299	2021-03-12 09:09:58 +01:00
Bjorn Pettersson	3638bdfbda	[ConstantFold] Handle undef/poison when constant folding smul_fix/smul_fix_sat Do constant folding according to posion * C -> poison C * poison -> poison undef * C -> 0 C * undef -> 0 for smul_fix and smul_fix_sat intrinsics (for any scale). Reviewed By: nikic, aqjune, nagisa Differential Revision: https://reviews.llvm.org/D98410	2021-03-12 09:09:58 +01:00
Marius Brehler	849f8183fb	[mlir] Fix ConstantOp verifier This restricts the attributes to integers for constants of type IndexType. So far an attribute like StringAttr as in %c1 = constant "" : index is valid. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D98216	2021-03-12 08:49:25 +01:00
Christopher Di Bella	154395536e	[libcxx] adds concept std::semiregular Implements parts of: - P0898R3 Standard Library Concepts - P1754 Rename concepts to standard_case for C++20, while we still can Depends on D97443 Reviewed By: Quuxplusone, EricWF, #libc Differential Revision: https://reviews.llvm.org/D97911	2021-03-11 22:40:07 -08:00
Johannes Doerfert	49ed3032ff	Revert "[OpenMP] Do not propagate match extensions to nested contexts" Two tests failed for some reason, need to investigate: https://lab.llvm.org/buildbot/#/builders/109/builds/10399 This reverts commit `ad9e98b8ef`.	2021-03-11 23:48:36 -06:00
Johannes Doerfert	0fe0d114e4	Revert "[OpenMP] Introduce the `disable_selector_propagation` variant selector trait" Need to revert `ad9e98b8ef` which this commit depends on. This reverts commit f771ef7b5f0ed260d00931cd50e6fe462edbacaf.	2021-03-11 23:48:35 -06:00
Johannes Doerfert	ff256c1376	[Attributor] Derive `willreturn` based on `mustprogress` Since D86233 we have `mustprogress` which, in combination with `readonly`, implies `willreturn`. The idea is that every side-effect has to be modeled as a "write". Consequently, `readonly` means there is no side-effect, and `mustprogress` guarantees that we cannot "loop" forever without side-effect. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D94125	2021-03-11 23:31:44 -06:00
Johannes Doerfert	9c2074dccb	[Attributor][NFC] Update tests after D94741 The update_test_checks script can now check for global symbols and is able to handle them properly when they differ across prefixes, e.g., attribute #0 might be different in different runs. This patch simply updates all the Attributor tests with the new script. Reviewed By: sstefan1 Differential Revision: https://reviews.llvm.org/D97906	2021-03-11 23:31:39 -06:00
Johannes Doerfert	5449fbb5d4	[OpenMP][NFC] Use `AsyncInfo` as the variable name for a `__tgt_async_info` Reviewed By: grokos, tianshilei1992 Differential Revision: https://reviews.llvm.org/D96444	2021-03-11 23:31:34 -06:00
Johannes Doerfert	66ba494b49	[OpenMP][DeviceRTL] Extract shuffle idiom and port it to declare variant The shuffle idiom is differently implemented in our supported targets. To reduce the "target_impl" file we now move the shuffle idiom in it's own self-contained header that provides the implementation for AMDGPU and NVPTX. A fallback can be added later on. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D95752	2021-03-11 23:31:30 -06:00
Johannes Doerfert	b2642456ab	[OpenMP] Introduce the `disable_selector_propagation` variant selector trait Nested `omp [begin\|end] declare variant` inherit the selectors from surrounding `omp (begin\|end) declare variant` constructs. To stop such propagation the user can add the `disable_selector_propagation` to the `extension` set in the `implementation` selector. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D95765	2021-03-11 23:31:25 -06:00
Johannes Doerfert	ad9e98b8ef	[OpenMP] Do not propagate match extensions to nested contexts If we have nested declare variant context, it doesn't make sense to inherit the match extension from the parent. Instead, just skip it. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D95764	2021-03-11 23:31:21 -06:00
Johannes Doerfert	cd1bd6e587	[Utils] Check for more global information in update_test_checks This allows to check for various globals (metadata/attributes/...) and also resolves problems with globals (metadata/attributes/...) being reused across different prefixes. Reviewed By: sstefan1 Differential Revision: https://reviews.llvm.org/D94741	2021-03-11 23:31:16 -06:00
Johannes Doerfert	d22fbccfe2	[FIX] Allow non-constant assume operand bundle operands. Fixes PR49545 Reviewed By: zequanwu, fhahn, lebedev.ri Differential Revision: https://reviews.llvm.org/D98444	2021-03-11 23:31:09 -06:00
Sriraman Tallam	cdb42a4cc4	Disable unique linkage suffixes ifor global vars until demanglers can be fixed. D96109 added support for unique internal linkage names for both internal linkage functions and global variables. There was a lot of discussion on how to get the demangling right for functions but I completely missed the point that demanglers do not support suffixes for global vars. For example: $ c++filt _ZL3foo foo $ c++filt _ZL3foo.uniq.123 _ZL3foo.uniq.123 The demangling for functions works as expected. I am not sure of the impact of this. I don't understand how debuggers and other tools depend on the correctness of global variable demangling so I am pre-emptively disabling it until we can get the demangling support added. Importantly, uniquefying global variables is not needed right now as we do not do profile attribution to global vars based on sampling. It was added for completeness and so this feature is not exactly missed. Differential Revision: https://reviews.llvm.org/D98392	2021-03-11 20:59:30 -08:00
Esme-Yi	fafbdfe63b	[Debug-Info] Add names for the debug line prologue. Summary: This is a minor patch to add names for the debug line prologue, as a follow-up of D95998. Reviewed By: dblaikie, ikudrin, shchenz Differential Revision: https://reviews.llvm.org/D98383	2021-03-12 04:45:08 +00:00
Siva Chandra Reddy	b937908c37	[libc][NFC] Move the template implementation of integer_abs to __support. This eliminates cross-header dependency from stdlib to string.	2021-03-11 20:20:18 -08:00
Craig Topper	1d26bbcf9b	[RISCV] Return false from isShuffleMaskLegal except for splats. We don't support any other shuffles currently. This changes the bswap/bitreverse tests that check for this in their expansion code. Previously we expanded a byte swapping shuffle through memory. Now we're scalarizing and doing bit operations on scalars to swap bytes. In the future we can probably use vrgather.vx to do a byte swap shuffle.	2021-03-11 20:02:49 -08:00
Christopher Di Bella	8ef69c66d5	[libcxx] adds concept std::copyable Implements parts of: - P0898R3 Standard Library Concepts - P1754 Rename concepts to standard_case for C++20, while we still can Depends on D97359 Reviewed By: EricWF, #libc, Quuxplusone, zoecarver Differential Revision: https://reviews.llvm.org/D97443	2021-03-11 19:30:55 -08:00
Christopher Di Bella	dc9f385722	[libcxx] adds concept std::movable Implements parts of: - P0898R3 Standard Library Concepts - P1754 Rename concepts to standard_case for C++20, while we still can Depends on D97162 Reviewed By: EricWF, #libc, Quuxplusone Differential Revision: https://reviews.llvm.org/D97359	2021-03-11 19:27:43 -08:00
Yonghong Song	379d908848	BPF: provide better error message for unsupported atomic operations Currently, BPF backend does not support all variants of atomic_load_{add,and,or,xor}, atomic_swap and atomic_cmp_swap For example, it only supports 32bit (with alu32 mode) and 64bit operations for atomic_load_{and,or,xor}, atomic_swap and atomic_cmp_swap. Due to historical reason, atomic_load_add is always supported with 32bit and 64bit. If user used an unsupported atomic operation, currently, codegen selectiondag cannot find bpf support and will issue a fatal error. This is not user friendly as user may mistakenly think this is a compiler bug. This patch added Custom rule for unsupported atomic operations and will emit better error message during ReplaceNodeResults() callback. The following is an example output. $ cat t.c short sync(short *p) { return __sync_val_compare_and_swap (p, 2, 3); } $ clang -target bpf -O2 -g -c t.c t.c:2:11: error: Unsupported atomic operations, please use 64 bit version return __sync_val_compare_and_swap (p, 2, 3); ^ fatal error: error in backend: Cannot select: t19: i64,ch = AtomicCmpSwap<(load store seq_cst seq_cst 2 on %ir.p)> t0, t2, Constant:i64<2>, Constant:i64<3>, t.c:2:11 t2: i64,ch = CopyFromReg t0, Register:i64 %0 t1: i64 = Register %0 t11: i64 = Constant<2> t10: i64 = Constant<3> In function: sync PLEASE submit a bug report ... Fatal error will still happen since we did not really do proper lowering for these unsupported atomic operations. But we do get a much better error message. Differential Revision: https://reviews.llvm.org/D98471	2021-03-11 19:19:00 -08:00
Vy Nguyen	c578508b5b	Revert "[compiler-rt][asan] Make wild-pointer crash error more useful" This reverts commit `f65e1aee40`.	2021-03-11 22:13:40 -05:00
Jonas Paulsson	5908c7ca41	[libFuzzer] Add attribute noinline on Fuzzer::ExecuteCallback(). The inlining of this function needs to be disabled as it is part of the inpsected stack traces. It's string representation will look different depending on if it was inlined or not which will cause it's string comparison to fail. When it was inlined in only one of the two execution stacks, minimize_two_crashes.test failed on SystemZ. For details see https://bugs.llvm.org/show_bug.cgi?id=49152. Reviewers: Ulrich Weigand, Matt Morehouse, Arthur Eubanks Differential Revision: https://reviews.llvm.org/D97975	2021-03-11 21:05:22 -06:00

... 2 3 4 5 6 ...

382698 Commits All Branches Search

382698 Commits

All Branches