llvm-project

Commit Graph

Author	SHA1	Message	Date
Muhammad Omair Javaid	483db1c706	[LLDB] Remove xfail decorator TestInferiorAssert.py AArch64/Linux TestInferiorAssert.py test_inferior_asserting_disassemble passes after upgrading LLDB AArch64/Linux buildbot to Ubuntu Focal.	2021-10-11 14:41:30 +05:00
Qiu Chaofan	d11ec6f67e	[Clang] Enable IC/IF mode for __ibm128 As for 128-bit floating points on PowerPC, compiler should have three machine modes: - IFmode, always IBM extended double - KFmode, always IEEE 754R 128-bit floating point - TFmode, matches the semantics for long double This commit adds support for IF mode with its complex variant, IC mode. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D109950	2021-10-11 17:38:04 +08:00
Andrew Savonichev	7ae8f392a1	[AArch64] Emit AssertZExt for i1 arguments AAPCS requires i1 argument to be zero-extended to 8-bits by the caller. Emit a new AArch64ISD::ASSERT_ZEXT_BOOL hint (or AssertZExt for GlobalISel) to enable some optimization opportunities. In particular, when the argument is forwarded to the callee, we can avoid zero-extension and use it as-is. Differential Revision: https://reviews.llvm.org/D107160	2021-10-11 11:55:11 +03:00
Clement Courbet	342d7b654c	[BasicAA][NFC] Improve comment.	2021-10-11 10:42:59 +02:00
David Sherwood	26b7d9d622	[LoopVectorize] Permit vectorisation of more select(cmp(), X, Y) reduction patterns This patch adds further support for vectorisation of loops that involve selecting an integer value based on a previous comparison. Consider the following C++ loop: int r = a; for (int i = 0; i < n; i++) { if (src[i] > 3) { r = b; } src[i] += 2; } We should be able to vectorise this loop because all we are doing is selecting between two states - 'a' and 'b' - both of which are loop invariant. This just involves building a vector of values that contain either 'a' or 'b', where the final reduced value will be 'b' if any lane contains 'b'. The IR generated by clang typically looks like this: %phi = phi i32 [ %a, %entry ], [ %phi.update, %for.body ] ... %pred = icmp ugt i32 %val, i32 3 %phi.update = select i1 %pred, i32 %b, i32 %phi We already detect min/max patterns, which also involve a select + cmp. However, with the min/max patterns we are selecting loaded values (and hence loop variant) in the loop. In addition we only support certain cmp predicates. This patch adds a new pattern matching function (isSelectCmpPattern) and new RecurKind enums - SelectICmp & SelectFCmp. We only support selecting values that are integer and loop invariant, however we can support any kind of compare - integer or float. Tests have been added here: Transforms/LoopVectorize/AArch64/sve-select-cmp.ll Transforms/LoopVectorize/select-cmp-predicated.ll Transforms/LoopVectorize/select-cmp.ll Differential Revision: https://reviews.llvm.org/D108136	2021-10-11 09:41:38 +01:00
David Spickett	cd1bd95d87	[libcxx][pretty printers] Disable u16string tests Due to reported failures in a local build. FAIL: Something is wrong in the test framework. Converting character sets: Invalid argument. (was enabled in https://reviews.llvm.org/D111138)	2021-10-11 09:30:17 +01:00
Valentin Clement	b0eef1eef0	[fir] Add the abstract result conversion pass Add pass that convert abstract result to function argument. This pass is needed before the conversion to LLVM IR. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: schweitz Differential Revision: https://reviews.llvm.org/D111146 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>	2021-10-11 10:10:41 +02:00
Clement Courbet	83ded5d323	re-land "[AA] Teach BasicAA to recognize basic GEP range information." Now that PR52104 is fixed.	2021-10-11 10:04:22 +02:00
Muhammad Omair Javaid	c63cb0c80e	[LLDB] Skip TestScriptedProcess on Arm/AArch64 Linux This is failing on Arm and AArch64 Linux buildbots since the time it was comitted. https://lab.llvm.org/buildbot/#/builders/96/builds/12628 Differential Revision: https://reviews.llvm.org/D107585	2021-10-11 12:58:21 +05:00
Clement Courbet	6aaf1e7ea9	[LoopIdiom] Fix store size SCEV type. We were using the type of the loop back edge count to represent the store size. This failed for small loop counts (e.g. in the added test, the loop count was an i2). Use the index type instead. Fixes PR52104. Differential Revision: https://reviews.llvm.org/D111401	2021-10-11 09:39:06 +02:00
Andrew Browne	50a08e2c6d	[DFSan] Fix flakey release_shadow_space.c accounting for Origin chains. Test sometimes fails on buildbot (after two non-Origins executions): /usr/bin/ld: warning: Cannot export local symbol 'dfsan_flush' RSS at start: 4620, after mmap: 107020, after mmap+set label: 209424, after fixed map: 4624, after another mmap+set label: 209424, after munmap: 4624 /usr/bin/ld: warning: Cannot export local symbol 'dfsan_flush' RSS at start: 4620, after mmap: 107020, after mmap+set label: 209424, after fixed map: 4624, after another mmap+set label: 209424, after munmap: 4624 /usr/bin/ld: warning: Cannot export local symbol 'dfsan_flush' RSS at start: 4620, after mmap: 107020, after mmap+set label: 317992, after fixed map: 10792, after another mmap+set label: 317992, after munmap: 10792 release_shadow_space.c.tmp: /b/sanitizer-x86_64-linux/build/llvm-project/compiler-rt/test/dfsan/release_shadow_space.c:91: int main(int, char **): Assertion `after_fixed_mmap <= before + delta' failed. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D111522	2021-10-11 00:35:12 -07:00
Vitaly Buka	9ccb6024a0	[NFC][sanitizer] Add a few consts	2021-10-10 22:59:43 -07:00
Vitaly Buka	982bfec8f0	[NFC][sanitizer] Clang-format sanitizer_flat_map.h	2021-10-10 22:23:49 -07:00
Vitaly Buka	eff6b369bf	[NFC][sanitizer] Add constexpr to FlatMap::size	2021-10-10 22:23:48 -07:00
Vitaly Buka	76b7784bcd	[NFC][sanitizer] Rename ByteMap to Map	2021-10-10 22:23:48 -07:00
Vitaly Buka	74277e254c	[NFC] Allow to include sanitizer_allocator_bytemap.h	2021-10-10 22:23:48 -07:00
Uday Bondhugula	b2217b36fe	[MLIR] Fix affine loop unroll corner case for full unroll Fix affine loop unroll for zero trip count loops. Add missing check. Differential Revision: https://reviews.llvm.org/D111375	2021-10-11 10:22:24 +05:30
Lang Hames	c59ebe4c4c	[ORC] Add TaskDispatcher::shutdown calls to TaskDispatchTest.cpp unit tests. These calls were left out of `4d7cea3d2e`. In the InPlaceDispatcher test case the operation is a no-op, but it's good form to include it. In the DynamicThreadPoolTaskDispatcher test the shutdown call is required to ensure that we don't exit the test (and tear down the dispatcher) before the thread running the dispatch has completed.	2021-10-10 21:09:29 -07:00
Lang Hames	4d7cea3d2e	[ORC] Add optional RunPolicy to ExecutorProcessControl::callWrapperAsync. The callWrapperAsync and callSPSWrapperAsync methods take a handler object that is run on the return value of the call when it is ready. The new RunPolicy parameters allow clients to control how these handlers are run. If no policy is specified then the handler will be packaged as a GenericNamedTask and dispatched using the ExecutorProcessControl's TaskDispatch member. Callers can use the ExecutorProcessControl::RunInPlace policy to cause the handler to be run directly instead, which may be preferrable for simple handlers, or they can write their own policy object (e.g. to dispatch as some other kind of Task, rather than GenericNamedTask).	2021-10-10 20:41:59 -07:00
Lang Hames	2e6c92c540	[examples] Fix LLJITWithRemoteDebugging example after `f341161689`.	2021-10-10 20:25:44 -07:00
Esme-Yi	a00ff71668	[XCOFF] Improve error message context. Summary: This patch improves the error message context of the XCOFF interfaces by providing more details. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D110320	2021-10-11 02:52:20 +00:00
Qiu Chaofan	2fc0d439a4	[Clang] [PowerPC] Fix header include typo in smmintrin.h The SSE4 header (smmintrin.h) should include SSSE3 (tmmintrin.h) instead of SSE2 (emmintrin.h). Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D111482	2021-10-11 10:44:08 +08:00
Lang Hames	771e69484a	[ORC] Add dependence on pthreads library to ORC. `f341161689` introduced a dependence (for builds with LLVM_ENABLE_THREADS) on pthreads. This commit updates the CMakeLists.txt file to include a LINK_LIBS entry for pthreads.	2021-10-10 19:34:34 -07:00
LLVM GN Syncbot	816e9d81e2	[gn build] Port `f341161689`	2021-10-11 02:15:38 +00:00
LLVM GN Syncbot	98c9b3362f	[gn build] Port `3df094d31e`	2021-10-11 02:15:37 +00:00
Lang Hames	1b410e0777	[ORC] Add missing headers. These were accidentally left out of `f341161689`.	2021-10-10 19:11:46 -07:00
Arthur O'Dwyer	3df094d31e	[libc++] [P1614] Implement std::compare_three_way. Differential Revision: https://reviews.llvm.org/D110735	2021-10-10 21:57:10 -04:00
Lang Hames	f341161689	[ORC] Add TaskDispatch API and thread it through ExecutorProcessControl. ExecutorProcessControl objects will now have a TaskDispatcher member which should be used to dispatch work (in particular, handling incoming packets in the implementation of remote EPC implementations like SimpleRemoteEPC). The GenericNamedTask template can be used to wrap function objects that are callable as 'void()' (along with an optional name to describe the task). The makeGenericNamedTask functions can be used to create GenericNamedTask instances without having to name the function object type. In a future patch ExecutionSession will be updated to use the ExecutorProcessControl's dispatcher, instead of its DispatchTaskFunction.	2021-10-10 18:39:55 -07:00
Arthur Eubanks	77bc3ba365	[NFC][llvm-reduce] Cleanup types Use Module& wherever possible. Since every reduction immediately turns Chunks into an Oracle, directly pass Oracle instead. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D111122	2021-10-10 18:07:28 -07:00
Amara Emerson	f1e9ecea44	[AArch64][GlobalISel] Legalize G_VECREDUCE_XOR. Treated same as other bitwise reductions.	2021-10-10 17:01:21 -07:00
Wenlei He	9978e0e475	[llvm-profdata] Allow overlap/similarity comparison to use custom hot threshold cutoff Allow overlap/similarity comparison to use custom hot threshold cutoff, instead of using hard coded 990000 as hot cutoff. Differential Revision: https://reviews.llvm.org/D111385	2021-10-10 13:30:18 -07:00
Wenlei He	da4e5fc861	[llvm-profgen] Deduplicate PID when processing perf input When parsing mmap to retrieve PID, deduplicate them before passing PID list to perf script. Perf script would error out when there's duplicated PID in the input, however raw perf data may main duplicated PID for large binary where more than one mmap is needed to load executable segment. Differential Revision: https://reviews.llvm.org/D111384	2021-10-10 13:30:17 -07:00
Sylvestre Ledru	b07ea8a967	clang release notes: improve the wording	2021-10-10 22:26:11 +02:00
Lang Hames	da7f993a8d	[ORC] Reorder callWrapperAsync and callSPSWrapperAsync parameters. The callee address is now the first parameter and the 'SendResult' function the second. This change improves consistentency with the non-async functions where the callee is the first address and the return value the second.	2021-10-10 13:10:43 -07:00
Lang Hames	a42d5c34d0	Revert "Add missing include after dfd74db9" This reverts commit `dd384d2814`. `dfd74db9` was reverted in `8fe3d9df0e`, so this is no longer needed.	2021-10-10 13:01:08 -07:00
Dawid Jurczak	9e65929a8e	[DSE] Re-enable calloc transformation with extra care (PR25892) Transformation from malloc+memset to calloc is always correct and in many situations it brings significant observable benefits in terms of execution speed and memory consumption [1][2]. Unfortunately there are cases when producing calloc cause performance drops [3]. As discussed here: https://reviews.llvm.org/D103009 it's possible to differentiate between those 2 scenarios. If optimizer is able to prove that after malloc call it's _very_ likely to reach memset branch then after calloc emission we shouldn't observe any performance hits. Therefore finding "null pointer check" pattern before memset basic block sounds like good justification for performing transformation. Also that method was already suggested by GCC folks [4]. Main reason for change is that for now to be safe we check for post dominance relation which is way too conservative approach making transformation "almost" disabled in practice. This patch tends to enable transformation again but with extra care. [1] https://stackoverflow.com/questions/2688466/why-mallocmemset-is-slower-than-calloc [2] https://vorpus.org/blog/why-does-calloc-exist/ [3] http://smalldatum.blogspot.com/2017/11/a-new-optimization-in-gcc-5x-and-mysql.html [4] https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83022 Differential Revision: https://reviews.llvm.org/D110021	2021-10-10 21:47:14 +02:00
Sylvestre Ledru	9c8f950a04	clang release notes: document the -Wbool-operation improvement Reviewed By: xbolva00 Differential Revision: https://reviews.llvm.org/D111215	2021-10-10 21:28:40 +02:00
Nico Weber	62abc1842b	clang: Add range-based CFG::try_blocks() ..and use it. No behavior change.	2021-10-10 15:15:37 -04:00
Nico Weber	23d5fe6235	clang: Convert two loops to for-each And rewrap a line at 80 columns while here. No behavior change.	2021-10-10 14:55:46 -04:00
Joe Loser	65d62e52a7	[libc++][test] Replace a TEST_NOEXCEPT_FALSE with noexcept(false). NFC. Replace `TEST_NOEXCEPT_FALSE` directly with `noexcept(false)` in optional hash test which is only run in C++17 or later. `TEST_NOEXCEPT_FALSE` is only useful in C++03 context where `noexcept` isn't supported by clang. `TEST_NOEXCEPT_FALSE` now only has one remaining use in `hash_unique_ptr.pass.cpp`.	2021-10-10 14:46:35 -04:00
Joe Loser	e53c9251fa	[libc++] Remove empty namespace std in type_traits. NFCI. There is an empty `namespace std` in `type_traits` which was originally used when `std::byte` was added in `c97d8aa866`. At some point, the bitwise operators on `std::byte` got relocated but this empty namespace was left around. Remove it. Reviewed By: Quuxplusone, Mordante, #libc Differential Revision: https://reviews.llvm.org/D111512	2021-10-10 14:35:05 -04:00
Jean Perier	6eb7634f30	[fir] Add character conversion pass Upstream the character conversion pass. Translates entities of one CHARACTER KIND to another. By default the translation is to naively zero-extend or truncate a code point to fit the destination size. This patch is part of the upstreaming effort from fir-dev branch. Co-authored-by: Eric Schweitz <eschweitz@nvidia.com> Co-authored-by: Valentin Clement <clementval@gmail.com> Reviewed By: schweitz Differential Revision: https://reviews.llvm.org/D111405	2021-10-10 20:20:09 +02:00
Joe Loser	67964fc4b2	[libc++][NFC] Replace tab with whitespace in comment There is a stray tab character in a comment block. Replace the tab character with a space for consistency with other comments.	2021-10-10 12:53:35 -04:00
Kazu Hirata	0e9373a6a6	[Basic] Use llvm::is_contained (NFC)	2021-10-10 08:52:14 -07:00
Sanjay Patel	05281d95f2	[InstCombine] move fold for "(X-Y) == 0"; NFC This consolidates related folds that all have a similar use restriction that may not be necessary.	2021-10-10 11:26:03 -04:00
Sanjay Patel	cbd8041b0b	[InstCombine] add tests for (X - Y) == 0; NFC	2021-10-10 11:13:46 -04:00
Sanjay Patel	da210f5d34	[InstCombine] canonicalize "(C2 - Y) > C" as (Y + ~C2) < ~C The test diffs show that we have better analysis/folds for 'add' (although we should at least have the simplifications independently, so we don't have the one-use restriction). This is related to solving regressions that would appear in transforms related to D111410, and that is part of a series of enhancements that may eventually helpi solve PR34047. https://alive2.llvm.org/ce/z/3tB9KG define i1 @src(i8 %x, i8 %C, i8 %C2) { %sub = sub nuw i8 %C2, %x %r = icmp slt i8 %sub, %C ret i1 %r } define i1 @tgt(i8 %x, i8 %C, i8 %C2) { %Cnot = xor i8 %C, -1 %C2not = xor i8 %C2, -1 %add = add nuw i8 %x, %C2not %r = icmp sgt i8 %add, %Cnot ret i1 %r }	2021-10-10 11:06:49 -04:00
Sanjay Patel	c00cab878a	[InstCombine] add test for or-of-icmps; NFC	2021-10-10 11:06:49 -04:00
Chen Zheng	4ead32d1cf	[PowerPC] update test case using the scripts; nfc	2021-10-10 14:39:20 +00:00
Mark de Wever	dcbfceffde	[libc++][nfc] Remove a duplicated include.	2021-10-10 14:21:01 +02:00

... 5 6 7 8 9 ...

401735 Commits All Branches Search

401735 Commits

All Branches