llvm-project

Commit Graph

Author	SHA1	Message	Date
Daniel Kiss	c5a4900e1a	[AArch64] Add BTI to CFI jumptables. With branch protection the jump to the jump table entries requires a landing pad. Reviewed By: eugenis, tamas.petz Differential Revision: https://reviews.llvm.org/D81251	2020-09-29 13:50:23 +02:00
David Stenberg	e6f332ef1e	[IndVarSimplify] Fix Modified status for removal of overflow intrinsics When removing an overflow intrinsic the Changed status in SimplifyIndvar was not set, leading to the IndVarSimplify pass returning an incorrect status. This was caught using the check introduced by D80916. As pointed out in the code review, a similar bug may exist for eliminateTrunc(). Reviewed By: reames Differential Revision: https://reviews.llvm.org/D85971	2020-09-29 13:20:59 +02:00
Vitaly Buka	4aa6abe4ef	[msan] Fix llvm.abs.v intrinsic The last argument of the intrinsic is a boolean flag to control INT_MIN handling and does not affect msan metadata.	2020-09-29 03:52:27 -07:00
Vitaly Buka	1fd9a146d3	[msan] Add test for vector abs intrinsic	2020-09-29 03:52:27 -07:00
sstefan1	cb9cfa0d2f	[OpenMPOpt][Fix] Only initialize ICV initial values once. Reviewers: jdoerfert, ggeorgakoudis Differential Revision: https://reviews.llvm.org/D88441	2020-09-29 12:22:58 +02:00
Simon Pilgrim	324df2661b	[InstCombine] Add trunc(lshr(sext(x),c)) non-uniform vector tests	2020-09-29 10:56:15 +01:00
Florian Hahn	60b852092c	[LoopDeletion] Forget loop before setting values to undef After D71539, we need to forget the loop before setting the incoming values of phi nodes in exit blocks, because we are looking through those phi nodes now and the SCEV expression could depend on the loop phi. If we update the phi nodes before forgetting the loop, we miss those users during invalidation. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D88167	2020-09-29 10:38:44 +01:00
Max Kazantsev	9100bd772d	[SCEV][NFC] Introduce isBasicBlockEntryGuardedByCond Currently, we have `isLoopEntryGuardedByCond` method in SCEV, which checks that some fact is true if we enter the loop. In fact, this is just a particular case of more general concept `isBasicBlockEntryGuardedByCond` applied to given loop's header. In fact, the logic if this code is largely independent on the given loop and only cares code above it. This patch makes this generalization. Now we can query it for any block, and `isBasicBlockEntryGuardedByCond` is just a particular case. Differential Revision: https://reviews.llvm.org/D87828 Reviewed By: fhahn	2020-09-29 15:53:45 +07:00
Tres Popp	eb9f7c28e5	Revert "OpaquePtr: Add type to sret attribute" This reverts commit `55c4ff91bd`. Issues were introduced as discussed in https://reviews.llvm.org/D88241 where this change made previous bugs in the linker and BitCodeWriter visible.	2020-09-29 10:31:04 +02:00
Serguei Katkov	297ec61130	[IsKnownNonZero] Handle the case with non-constant phi nodes Handle the case when all inputs of phi are proven to be non zero. Constants are checked in beginning of this method before check for depth of recursion, so it is a partial case of non-constant phi. Recursion depth is already handled by the function. Reviewers: aqjune, nikic, efriedma Reviewed By: nikic Subscribers: dantrushin, hiraditya, jdoerfert, llvm-commits Differential Revision: https://reviews.llvm.org/D88276	2020-09-29 15:22:10 +07:00
Florian Hahn	b76df593eb	Revert "Recommit "[SCCP] Do not replace deref'able ptr with un-deref'able one."" Looks like there is still another remaining issue: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-bootstrap-msan/builds/22273/steps/build%20libcxx%2Fmsan/logs/stdio This reverts commit `86a20d9e34`.	2020-09-29 09:18:19 +01:00
Florian Hahn	86a20d9e34	Recommit "[SCCP] Do not replace deref'able ptr with un-deref'able one." This version includes an small fix allowing function pointers to be unconditionally replaced for now. This reverts commit `4c5e4aa89b`.	2020-09-29 09:10:27 +01:00
Sam Parker	4c19b89b25	[NFC][ARM] Comments and lambdas Add some comments in LowOverheadLoops and make some lambda variables explicit arguments instead of capturing.	2020-09-29 08:41:53 +01:00
Ellis Hoag	98ef7e29b0	This reduces code duplication between CGObjCMac.cpp and Mangle.cpp for generating the mangled name of an Objective-C method. This has no intended functionality change. https://reviews.llvm.org/D88329	2020-09-29 02:26:51 -04:00
Dmitry Antipov	bc868da0e7	[Driver] Filter out <libdir>/gcc and <libdir>/gcc-cross if they do not exists Differential Revision: https://reviews.llvm.org/D87901	2020-09-29 09:18:50 +03:00
Craig Topper	82da0cabb9	[X86] Add computeKnownBits support for PEXT. The number of zeros in the mask provides a lower bound on the number of leading zeros in the result.	2020-09-28 22:54:07 -07:00
Craig Topper	a4b1fdec91	[X86] Add known bits test for PEXT. NFC	2020-09-28 22:54:07 -07:00
Johannes Doerfert	4fc69ab002	Revert "[OpenMP][FIX] Verify compatible types for declare variant calls" This reverts commit `c942095790`. One of the tests broke, revert to investigate.	2020-09-29 00:37:11 -05:00
Arthur Eubanks	da036b4514	[Docs][NewPM] Add note about required passes Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D88342	2020-09-28 21:45:14 -07:00
Max Kazantsev	e862e78b63	[NFC] Use assert instead of checking the guaranteed condition From preconditions it is known that either A dominates B or B dominates A. If A does not dominate B, we do not really need to check it. Assert should be enough. Should save some compile time.	2020-09-29 11:38:45 +07:00
Max Kazantsev	d266fd960e	[IndVars] Remove exiting conditions that are trivially true/false When removing exiting loop conditions, we only consider checks for which we know the exact exit count. We could also eliminate checks for which the condition is always true/false. Differential Revision: https://reviews.llvm.org/D87344 Reviewed By: lebedev.ri, reames	2020-09-29 11:35:32 +07:00
Johannes Doerfert	c942095790	[OpenMP][FIX] Verify compatible types for declare variant calls Especially for templates we need to check at some point if the base function matches the specialization we might call instead. Before this lead to the replacement of `std::sqrt(int(2))` calls with one that converts the argument to a `std::complex<int>`, clearly not the desired behavior. Reported as PR47655 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D88384	2020-09-28 23:26:21 -05:00
Kiran Kumar T P	f3ead88e9c	[MLIR][OpenMP] Removed the ambiguity in flush op assembly syntax Summary: ======== Bugzilla Ticket No: Bug 46884 [https://bugs.llvm.org/show_bug.cgi?id=46884] Flush op assembly syntax was ambiguous: Consider the below test case: flush operation is not having any arguments. But the next statement token i.e "%2" is read as the argument for flush operation and then translator issues an error. ************************************************************* $ cat -n flush.mlir 1 llvm.func @_QQmain(%arg0: !llvm.i32) { 2 %0 = llvm.mlir.constant(1 : i64) : !llvm.i64 3 %1 = llvm.alloca %0 x !llvm.i32 {in_type = i32, name = "a"} : (!llvm.i64) -> !llvm.ptr<i32> 4 omp.flush 5 %2 = llvm.load %1 : !llvm.ptr<i32> 6 llvm.return 7 } $ mlir-translate -mlir-to-llvmir flush.mlir flush.mlir:5:6: error: expected ':' %2 = llvm.load %1 : !llvm.ptr<i32> ^ ************************************************************* Solution: ========= Introduced begin ( `(` ) and end token ( `)` ) to determince the begin and end of variadic arguments. The patch includes code changes and testcase modifications. Reviewed By: Valentin Clement, Mehdi AMINI Differential Revision: https://reviews.llvm.org/D88376	2020-09-29 09:41:46 +05:30
Yonghong Song	ca1ce397ac	BPF: explicitly specify bpfel triple for certain tests Commit `54d9f743c8` ("BPF: move AbstractMemberAccess and PreserveDIType passes to EP_EarlyAsPossible") changed most of CORE tests with opt run followed by llc and opt requires the target triple specified in the IR. There are few tests where little endian and big endian will report different result and for little endian versions of tests, "target triple = "bpf"" will produce wrong results if the test executed in a big endian machine, e.g. PowerPC big endian machine, since target "bpf" represents host endian and will resolve to "bpfeb". The builtbot reported such failures when build-and-run on a PowerPC big endian machine. To fix the issue, using "target triple = "bpfel"" instead.	2020-09-28 20:25:25 -07:00
Yaxun (Sam) Liu	5a3023a91c	[HIP] Return non-zero value for invalid target ID This is part of https://reviews.llvm.org/D60620	2020-09-28 23:07:39 -04:00
Yaxun (Sam) Liu	187658b8a6	Recommit "[HIP] Change default --gpu-max-threads-per-block value to 1024" Recommit `04abbb3a78`	2020-09-28 22:43:17 -04:00
Amara Emerson	b9f2b3bc43	[AArch64][GlobalISel] Scalarize <2 x s64> G_MUL since we don't have native support for it. Differential Revision: https://reviews.llvm.org/D88437	2020-09-28 19:29:45 -07:00
Yaxun (Sam) Liu	10eb3bf2d4	Skip -fPIE for AMDGPU and HIP toolchain AMDGPU toolchain does not support -fPIE, therefore skip it if specified by driver. Differential Revision: https://reviews.llvm.org/D88425	2020-09-28 22:03:18 -04:00
Valentin Clement	bbb5dc4923	[mlir][openacc] Add acc.data operation verifier Add a basic verifier for the data operation following the restriction from the standard. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D88334	2020-09-28 21:22:32 -04:00
Nathan Ridge	cc6d1f8029	[clangd] When finding refs for a renaming alias, do not return refs to underlying decls Fixes https://github.com/clangd/clangd/issues/515 Differential Revision: https://reviews.llvm.org/D87225	2020-09-28 21:18:31 -04:00
Mehdi Amini	9f9f89d44b	Remove dependency from LLVM Dialect on the OpenMP dialect The OmpDialect is in practice optional during translation to LLVM IR: the code is tolerant to have a "nullptr" when not present / needed. The dependency still exists on the export to LLVMIR. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D88351	2020-09-29 01:12:01 +00:00
LLVM GN Syncbot	727c4223d7	[gn build] Port `54d9f743c8`	2020-09-29 00:24:06 +00:00
Richard Smith	c375635d05	Ensure that we don't compute linkage for an anonymous class too early if it has a member whose name is the same as a builtin. Fixes a regression from the introduction of BuiltinAttr.	2020-09-28 17:22:40 -07:00
Jan Korous	6fd8c69049	[clang] Update warning-wall.c test Follow-up to 1e86d637eb4f: [clang] Selectively ena/disa-ble format-insufficient-args warning	2020-09-28 17:19:51 -07:00
Ruiling Song	73805329ba	[RegisterCoalescer] Pass Undefs to extendToIndices() When extending the subranges, the reaching-def may be an undefs. When extending such kind of subrange, it will try to search for the reaching def first. If the reaching def is an undef and we did not provide 'Undefs', The findReachingDefs() will fail with message: "Use of $noreg does not have a corresponding definition on every path: LLVM ERROR: Use not jointly dominated by defs." So we computeSubRangeUndefs() and pass the result to extendToIndices(). Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D87744	2020-09-29 08:14:24 +08:00
Zahira Ammarguellat	efd04721c9	BuildVectorType with a dependent (array) type is crashing the compiler - Fix for PR-47542 Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D88150	2020-09-28 17:10:32 -07:00
Yonghong Song	54d9f743c8	BPF: move AbstractMemberAccess and PreserveDIType passes to EP_EarlyAsPossible Move abstractMemberAccess and PreserveDIType passes as early as possible, right after clang code generation. Currently, compiler may transform the above code p1 = llvm.bpf.builtin.preserve.struct.access(base, 0, 0); p2 = llvm.bpf.builtin.preserve.struct.access(p1, 1, 2); a = llvm.bpf.builtin.preserve_field_info(p2, EXIST); if (a) { p1 = llvm.bpf.builtin.preserve.struct.access(base, 0, 0); p2 = llvm.bpf.builtin.preserve.struct.access(p1, 1, 2); bpf_probe_read(buf, buf_size, p2); } to p1 = llvm.bpf.builtin.preserve.struct.access(base, 0, 0); p2 = llvm.bpf.builtin.preserve.struct.access(p1, 1, 2); a = llvm.bpf.builtin.preserve_field_info(p2, EXIST); if (a) { bpf_probe_read(buf, buf_size, p2); } and eventually assembly code looks like reloc_exist = 1; reloc_member_offset = 10; //calculate member offset from base p2 = base + reloc_member_offset; if (reloc_exist) { bpf_probe_read(bpf, buf_size, p2); } if during libbpf relocation resolution, reloc_exist is actually resolved to 0 (not exist), reloc_member_offset relocation cannot be resolved and will be patched with illegal instruction. This will cause verifier failure. This patch attempts to address this issue by do chaining analysis and replace chains with special globals right after clang code gen. This will remove the cse possibility described in the above. The IR typically looks like %6 = load @llvm.sk_buff:0:50$0:0:0:2:0 %7 = bitcast %struct.sk_buff* %2 to i8* %8 = getelementptr i8, i8* %7, %6 for a particular address computation relocation. But this transformation has another consequence, code sinking may happen like below: PHI = <possibly different @preserve__access_globals> %7 = bitcast %struct.sk_buff %2 to i8* %8 = getelementptr i8, i8* %7, %6 For such cases, we will not able to generate relocations since multiple relocations are merged into one. This patch introduced a passthrough builtin to prevent such optimization. Looks like inline assembly has more impact for optimizaiton, e.g., inlining. Using passthrough has less impact on optimizations. A new IR pass is introduced at the beginning of target-dependent IR optimization, which does: - report fatal error if any reloc global in PHI nodes - remove all bpf passthrough builtin functions Changes for existing CORE tests: - for clang tests, add "-Xclang -disable-llvm-passes" flags to avoid builtin->reloc_global transformation so the test is still able to check correctness for clang generated IR. - for llvm CodeGen/BPF tests, add "opt -O2 <ir_file> \| llvm-dis" command before "llc" command since "opt" is needed to call newly-placed builtin->reloc_global transformation. Add target triple in the IR file since "opt" requires it. - Since target triple is added in IR file, if a test may produce different results for different endianness, two tests will be created, one for bpfeb and another for bpfel, e.g., some tests for relocation of lshift/rshift of bitfields. - field-reloc-bitfield-1.ll has different relocations compared to old codes. This is because for the structure in the test, new code returns struct layout alignment 4 while old code is 8. Align 8 is more precise and permits double load. With align 4, the new mechanism uses 4-byte load, so generating different relocations. - test intrinsic-transforms.ll is removed. This is used to test cse on intrinsics so we do not lose metadata. Now metadata is attached to global and not instruction, it won't get lost with cse. Differential Revision: https://reviews.llvm.org/D87153	2020-09-28 16:56:22 -07:00
David Tenty	ee80615b5c	[clang][driver][AIX] Set compiler-rt as default rtlib Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D88182	2020-09-28 19:45:43 -04:00
ogiroux	665dc4012b	Attempt to clear some msan errors in the libcxx atomic tests.	2020-09-28 16:34:41 -07:00
Diego Caballero	93936da904	[mlir][Affine][VectorOps] Fix super vectorizer utility (D85869) Adding missing code that should have been part of "D85869: Utility to vectorize loop nest using strategy." Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D88346	2020-09-28 16:24:11 -07:00
Kostya Kortchinsky	f668a84b58	[scudo][standalone] Remove unused atomic_compare_exchange_weak `atomic_compare_exchange_weak` is unused in Scudo, and its associated test is actually wrong since the weak variant is allowed to fail spuriously (thanks Roland). This lead to flakes such as: ``` [ RUN ] ScudoAtomicTest.AtomicCompareExchangeTest ../../zircon/third_party/scudo/src/tests/atomic_test.cpp:98: Failure: Expected atomic_compare_exchange_weak(reinterpret_cast<T >(&V), &OldVal, NewVal, memory_order_relaxed) is true. Expected: true Which is: 01 Actual : atomic_compare_exchange_weak(reinterpret_cast<T >(&V), &OldVal, NewVal, memory_order_relaxed) Which is: 00 ../../zircon/third_party/scudo/src/tests/atomic_test.cpp💯 Failure: Expected atomic_compare_exchange_weak( reinterpret_cast<T >(&V), &OldVal, NewVal, memory_order_relaxed) is false. Expected: false Which is: 00 Actual : atomic_compare_exchange_weak( reinterpret_cast<T >(&V), &OldVal, NewVal, memory_order_relaxed) Which is: 01 ../../zircon/third_party/scudo/src/tests/atomic_test.cpp:101: Failure: Expected OldVal == NewVal. Expected: NewVal Which is: 24 Actual : OldVal Which is: 42 [ FAILED ] ScudoAtomicTest.AtomicCompareExchangeTest (0 ms) [----------] 2 tests from ScudoAtomicTest (1 ms total) ``` So I am removing this, if someone ever needs the weak variant, feel free to add it back with a test that is not as terrible. This test was initially ported from sanitizer_common, but their weak version calls the strong version, so it works for them. Differential Revision: https://reviews.llvm.org/D88443	2020-09-28 16:25:14 -07:00
Jan Korous	1e86d637eb	[clang] Selectively ena/disa-ble format-insufficient-args warning Differential Revision: https://reviews.llvm.org/D87176	2020-09-28 16:24:50 -07:00
Mehdi Amini	e72d792c14	Guard `find_library(tensorflow_c_api ...)` by checking for TENSORFLOW_C_LIB_PATH to be set by the user Also have CMake fails if the user provides a TENSORFLOW_C_LIB_PATH but we can't find TensorFlow at this path. At the moment the CMake script tries to figure if TensorFlow is available on the system and enables support for it. This is in general not desirable to customize build features this way and instead it is preferable to let the user opt-in explicitly into the features they want to enable. This is in line with other optional external dependencies like Z3. There are a few reasons to this but amongst others: - reproducibility: making features "magically" enabled based on whether we find a package on the system or not makes it harder to handle bug reports from users. - user control: they can't have TensorFlow on the system and build LLVM without TensorFlow right now. They also would suddenly distribute LLVM with a different set of features unknowingly just because their build machine environment would change subtly. Right now this is motivated by a user reporting build failures on their system: .../mesa-git/llvm-git/src/llvm-project/llvm/lib/Analysis/TFUtils.cpp:23:10: fatal error: tensorflow/c/c_api.h: No such file or directory 23 \| #include "tensorflow/c/c_api.h" \| ^~~~~~ It looks like we detected TensorFlow at configure time but couldn't set all the paths correctly. Differential Revision: https://reviews.llvm.org/D88371	2020-09-28 22:15:55 +00:00
Philip Reames	e46d74b589	[CVP] Allow two transforms in one invocation For a call site which had both constant deopt operands and nonnull arguments, we were missing the opportunity to recognize the later by bailing early. This is somewhat of a speculative fix. Months ago, I'd had a private report of performance and compile time regressions from the deopt operand folding. I never received a test case. However, the only possibility I see was that after that change CVP missed the nonnull fold, and we end up with a pass ordering/missed simplification issue. So, since it's a real issue, fix it and hope.	2020-09-28 15:11:42 -07:00
Fangrui Song	bd08a87cfe	[EHStreamer] Simplify sharedTypeIDs with std::mismatch (Note that EMStreamer.cpp is largely under tested. The only test checking the prefix sharing is CodeGen/WebAssembly/eh-lsda.ll)	2020-09-28 15:05:59 -07:00
Sean Silva	a975be0e00	[mlir][shape] Make conversion passes more consistent. - use select-ops to make the lowering simpler - change style of FileCheck variables names to be consistent - change some variable names in the code to be more explicit Differential Revision: https://reviews.llvm.org/D88258	2020-09-28 14:55:42 -07:00
Petr Hosek	2d657d1bd7	[libcxx] Don't pass -s to libtool This flag is the default in libtool on Darwin, and it's not supported by llvm-libtool-darwin causing a build failure. Differential Revision: https://reviews.llvm.org/D88449	2020-09-28 14:50:09 -07:00
Louis Dionne	d092c91288	[libc++] Fix constexpr dynamic allocation on GCC 10 We're technically not allowed by the Standard to call ::operator new in constexpr functions like __libcpp_allocate. Clang doesn't seem to complain about it, but GCC does.	2020-09-28 17:44:31 -04:00
Craig Topper	e53196b1e8	[X86] Add support for calling SimplifyDemandedBits on the input of PDEP with a constant mask. We can do several optimizations for PDEP using computeKnownBits and SimplifyDemandedBits -If the MSBs of the output aren't demanded, those MSBs of the mask input aren't demanded either. We need to keep the most significant demanded bit of the mask and any mask bits before it. -The number of possible ones in the mask determines how many bits of the lsbs of the other operand are demanded. Any bits of the mask we don't demand by the previous rule should not be counted. -The result will have zeros in any position that the mask is zero. -Since non-mask input bits can only be output in the original position or a higher bit position, the result will have at least as many trailing zeroes as the non-mask input. Differential Revision: https://reviews.llvm.org/D87883	2020-09-28 14:21:30 -07:00
Craig Topper	e5ef523ee4	[X86] Add tests for D87883. NFC	2020-09-28 14:21:29 -07:00

1 2 3 4 5 ...

367553 Commits All Branches Search

367553 Commits

All Branches