llvm-project

Commit Graph

Author	SHA1	Message	Date
Pirama Arumuga Nainar	8262e94a6d	[ARM] Fix PR 47980: Use constrainRegClass during foldImmediate opt. Previously we used setRegClass to rgpr, which may expand the register domain if the result was already in a constrained class (tcgpr in the above PR). Differential Revision: https://reviews.llvm.org/D91192	2020-11-10 13:38:11 -08:00
Michael Kruse	e408935bb5	[Polly][ScopBuilder] Use only modeled instructions to compute statement granularity. ScopBuilder distributes independent instructions between statements. Only modeled (e.g. not synthesizable) instructions are represented. To compute independence, non-modeled instructions were used in some parts of determining instruction independence, which could lead to the re-introduction of non-modeled instructions. In particular, required invariant loads could be added to the instruction list, which then led to redundant MemoryAccesses for such a load. This fixes llvm.org/PR48059.	2020-11-10 15:30:16 -06:00
Xun Li	19f0770923	[Coroutine][Sema] Cleanup temporaries as early as possible The original bug was discovered in T75057860. Clang front-end emits an AST that looks like this for a co_await expression: |- ExprWithCleanups |- -CoawaitExpr |- -MaterializeTemporaryExpr ... Awaiter ... |- -CXXMemberCallExpr ... .await_ready ... |- -CallExpr ... __builtin_coro_resume ... |- -CXXMemberCallExpr ... .await_resume ... ExprWithCleanups is responsible for cleaning up (including calling dtors for) the temporaries generated in the wrapping expression. In the above structure, for the __builtin_coro_resume part (which corresponds to the code for the suspend case in the co_await with symmetric transfer), the pseudocode looks like this: __builtin_coro_resume( awaiter.await_suspend( from_address( __builtin_coro_frame())).address()); One of the temporaries that's generated as part of this code is the coroutine handle returned from the awaiter.await_suspend() call. The call returns a handle which is a prvalue (since it's a returned value on the fly). In order to call the address() method on it, it needs to be converted into an xvalue. Hence a materialized temp is created to hold it. This temp will need to be cleaned up eventually. Now, since all cleanups happen at the end of the entire co_await expression, which is after the <coro.suspend> suspension point, the compiler will think that such a temp needs to live across suspensions, and needs to be put on the coroutine frame, even though it's only used temporarily just to call the address() method. Such a phenomenon not only unnecessarily increases the frame size, but can lead to ASAN failures, if the coroutine was already destroyed as part of the await_suspend() call. This is because if the coroutine was already destroyed, the frame no longer exists, and one cannot store anything into it. But if the temporary object is considered to need to live on the frame, it will be stored into the frame after await_suspend() returns.
A fix attempt was done in https://reviews.llvm.org/D87470. Unfortunately it is incorrect. The reason is that cleanups in Clang work more linearly than nested. There is one current state indicating whether it needs cleanup, and an ExprWithCleanups resets that state. This means that an ExprWithCleanups must be capable of cleaning up all temporaries created in the wrapping expression, otherwise there will be dangling temporaries cleaned up at the wrong place. I eventually found a workaround (https://reviews.llvm.org/D89066) that doesn't break any existing tests while fixing the issue. But it targets the final co_await only. If we ever have a co_await that's not on the final awaiter and the frame gets destroyed after suspend, we are in trouble. Hence we need a proper fix. This patch is the proper fix. It does the following things to fully resolve the issue: 1. The AST has to be generated in the order according to their nesting relationship. We should not generate AST out of order because then the code generator would incorrectly track the state of temporaries and when a cleanup is needed. So the code in buildCoawaitCalls is reorganized so that we will be generating the AST for each coawait member call in order along with their child AST. 2. The await_ready() call is wrapped with an ExprWithCleanups so that temporaries in it get cleaned up as early as possible to avoid living across suspension. 3. The await_suspend() call is wrapped with an ExprWithCleanups if it's not a symmetric transfer. In the case of a symmetric transfer, in order to maintain the musttail call contract, the ExprWithCleanups is wrapped before the resume call. 4. In the end, we mark again that it needs a cleanup, so that the entire CoawaitExpr will be wrapped with an ExprWithCleanups which will clean up the Awaiter object associated with the await expression. Differential Revision: https://reviews.llvm.org/D90990	2020-11-10 13:27:42 -08:00
AndreyChurbanov	33da6bd7f5	[OpenMP] Fixes for shared memory cleanup when aborts occur Patch by Erdner, Todd <todd.erdner@intel.com> Differential Revision: https://reviews.llvm.org/D90974	2020-11-11 00:16:23 +03:00
Stanislav Mekhanoshin	544ef42e40	[AMDGPU] Set default op_sel_hi on accvgpr read/write These are opsel opcodes with op_sel actually being ignored. As such, op_sel_hi needs to be set to default 1 even though these bits are ignored. This is a compatibility change. Differential Revision: https://reviews.llvm.org/D91202	2020-11-10 13:07:29 -08:00
Richard Smith	438a27f2e5	Move code to determine the type of an LValueBase out of ExprConstant and into a member function on LValueBase. NFC.	2020-11-10 13:03:57 -08:00
Bruno Cardoso Lopes	dc14542a71	[Coroutines] Add missing llvm.dbg.declare's to cover for more allocas Tracking local variables across suspend points is still somewhat incomplete. Consider this coroutine snippet: ``` resumable foo() { int x[10] = {}; int a = 3; co_await std::experimental::suspend_always(); a++; x[0] = 1; a += 2; x[1] = 2; a += 3; x[2] = 3; } ``` Can't manage to print `a` or `x` if they turn out to be allocas during CoroSplit (which happens if you build this code with `-O0` prior to this commit): ``` * thread #1, queue = 'com.apple.main-thread', stop reason = step over frame #0: 0x0000000100003729 main-noprint`foo() at main-noprint.cpp:43:5 40 co_await std::experimental::suspend_always(); 41 a++; 42 x[0] = 1; -> 43 a += 2; 44 x[1] = 2; 45 a += 3; 46 x[2] = 3; (lldb) p x error: <user expression 21>:1:1: use of undeclared identifier 'x' x ^ ``` The generated IR contains a `llvm.dbg.declare` for `x` in its initialization basic block. After CoroSplit, the `llvm.dbg.declare` might not dominate all of `x`'s uses and we lose debugging quality. Add `llvm.dbg.value`s to all relevant basic blocks such that if later transformations break the dominance the reliable debug info is already in place. For instance, this BB: ``` await.ready: ... %arrayidx = getelementptr inbounds [10 x i32], [10 x i32]* %x.reload.addr, i64 0, i64 0, !dbg !760 ... %arrayidx19 = getelementptr inbounds [10 x i32], [10 x i32]* %x.reload.addr, i64 0, i64 1, !dbg !763 ... %arrayidx21 = getelementptr inbounds [10 x i32], [10 x i32]* %x.reload.addr, i64 0, i64 2, !dbg !766 ``` becomes: ``` await.ready: ... call void @llvm.dbg.value(metadata [10 x i32]* %x.reload.addr, metadata !751, metadata !DIExpression()), !dbg !753 ... %arrayidx = getelementptr inbounds [10 x i32], [10 x i32]* %x.reload.addr, i64 0, i64 0, !dbg !760 ... %arrayidx19 = getelementptr inbounds [10 x i32], [10 x i32]* %x.reload.addr, i64 0, i64 1, !dbg !763 ... 
%arrayidx21 = getelementptr inbounds [10 x i32], [10 x i32]* %x.reload.addr, i64 0, i64 2, !dbg !766 ``` Differential Revision: https://reviews.llvm.org/D90772	2020-11-10 12:36:07 -08:00
Sjoerd Meijer	2ef47910d5	[LoopFlatten] Run it earlier, just before IndVarSimplify This is a prep step for widening induction variables in LoopFlatten if this is possible (D90640), to avoid having to perform certain overflow checks. Since IndVarSimplify may already widen induction variables, we want to run LoopFlatten just before IndVarSimplify. This is a minor reshuffle as both passes were already close after each other. Differential Revision: https://reviews.llvm.org/D90402	2020-11-10 20:22:41 +00:00
Marius Brehler	07f1047f41	[mlir] Refactor finding python This drops the use of deprecated CMake modules to find python. Differential Revision: https://reviews.llvm.org/D91197	2020-11-10 21:21:40 +01:00
Jez Ng	21f831134c	[lld-macho] Add very basic support for LTO Just enough to consume some bitcode files and link them. There's more to be done around the symbol resolution API and the LTO config, but I don't yet understand what all the various LTO settings do... Reviewed By: #lld-macho, compnerd, smeenai, MaskRay Differential Revision: https://reviews.llvm.org/D90663	2020-11-10 12:19:28 -08:00
Jez Ng	6cf244327b	[lld-macho][easy] Fix segment max protection We should have maxprot == initprot for all non-i386 architectures, which is what ld64 does. Reviewed By: #lld-macho, compnerd Differential Revision: https://reviews.llvm.org/D89420	2020-11-10 12:19:28 -08:00
Jez Ng	b86908171e	[lld-macho] Implement LC_UUID Apple devtools use this to locate the dSYM files for a given binary. The UUID is computed based on an MD5 hash of the binary's contents. In order to hash the contents, we must first write them, but LC_UUID itself must be part of the written contents in order for all the offsets to be calculated correctly. We resolve this circular paradox by first writing an LC_UUID with an all-zero UUID, then updating the UUID with its real value later. I'm not sure there's a good way to test that the value of the UUID is "as expected", so I've just checked that it's present. Reviewed By: #lld-macho, compnerd, smeenai Differential Revision: https://reviews.llvm.org/D89418	2020-11-10 12:19:28 -08:00
Jez Ng	2e8e1bdb89	[lld-macho] Support linking against stub dylibs Stub dylibs differ from "real" dylibs in that they lack any content in their sections. What they do have are export tries and symbol tables, which means we can still link against them. I am unclear how to properly create these stub dylibs; XCode 11.3's `lipo` is able to create stub dylibs, but those lack LC_ID_DYLIB load commands and are considered invalid by most tooling. Newer versions of `lipo` aren't able to create stub dylibs at all. However, recent SDKs in XCode still come with valid stub dylibs, so it still seems worthwhile to support them. The YAML in this diff's test was generated by taking a non-stub dylib and editing the appropriate fields. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D89012	2020-11-10 12:19:27 -08:00
Yang Fan	703038b35a	[Sema] Fix volatile check when testing if a return object can be implicitly moved In C++11 standard, to become implicitly movable, the expression in return statement should be a non-volatile automatic object. CWG1579 changed the rule to require that the expression only needs to be an automatic object. C++14 standard and C++17 standard kept this rule unchanged. C++20 standard changed the rule back to require the expression be a non-volatile automatic object. This should be a typo in standards, and VD should be non-volatile. Differential Revision: https://reviews.llvm.org/D88295	2020-11-10 15:11:07 -05:00
Mehdi Amini	6cb1c0cae0	Add Python binding to run a PassManager on a MLIR Module Reviewed By: ftynse, stellaraccident Differential Revision: https://reviews.llvm.org/D90823	2020-11-10 20:06:23 +00:00
Sjoerd Meijer	706ead0e87	[LoopFlatten] Make it a FunctionPass This converts LoopFlatten from a LoopPass to a FunctionPass so that we don't run into problems of a loop pass deleting a (inner)loop. Differential Revision: https://reviews.llvm.org/D90940	2020-11-10 20:03:31 +00:00
Mehdi Amini	dc43f78565	Add basic Python bindings for the PassManager and bind libTransforms This only exposes the ability to round-trip a textual pipeline at the moment. To exercise it, we also bind the libTransforms in a new Python extension. This does not include any interesting bindings, but it includes all the mechanism to add separate native extensions and load them dynamically. As such passes in libTransforms are only registered after `import mlir.transforms`. To support this global registration, the TableGen backend is also extended to bind to the C API the group registration for passes. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D90819	2020-11-10 19:55:21 +00:00
Renato Golin	3073cbd2d4	[docs] link new support policy from developer policy Adding new paragraphs under "Introducing New Components" section to check the different levels of support we have, to help introduction of smaller set of changes without overwhelming new collaborators and potentially losing the contribution. Differential Revision: D91013	2020-11-10 19:40:57 +00:00
Florian Hahn	a8e50f1c6e	[VPlan] Use VPValue def for VPWidenSelectRecipe. This patch turns VPWidenSelectRecipe into a VPValue and uses it during VPlan construction and codegeneration instead of the plain IR reference where possible. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D84682	2020-11-10 19:39:37 +00:00
Sam Clegg	504cb2730c	[lld][WebAssembly] Convert TLS tests to asm format Fix a corresponding bug in WasmAsmParser around parsing .tdata sections. Differential Revision: https://reviews.llvm.org/D91113	2020-11-10 11:38:53 -08:00
Benjamin Kramer	92c61a045f	[ARM] Silence unused variable warning in Release builds. NFC.	2020-11-10 20:35:28 +01:00
Craig Topper	70b481e8db	[RISCV] Add missing copyright header to RISCVBaseInfo.cpp. NFC	2020-11-10 11:33:08 -08:00
Stephen Kelly	e73296d3b9	Add utility for testing if we're matching nodes AsIs Differential Revision: https://reviews.llvm.org/D91144	2020-11-10 19:28:11 +00:00
Haojian Wu	7d85f732b1	Fix the DeclContextLookupResult::iterator non-copyable. The value_type is a const pointer, which makes the iterator non-copyable. Before the patch, the normal usage like below was illegal: ``` auto It = lookupresult.begin(); ... It = lookupresult.end(); // the copy is not allowed. ``` Differential Revision: https://reviews.llvm.org/D91158	2020-11-10 20:22:45 +01:00
Michał Górny	f21e704d4a	[lldb] [Process/NetBSD] Copy the recent improvements from FreeBSD Copy the recent improvements from the FreeBSDRemote plugin, notably: - moving event reporting setup into SetupTrace() helper - adding more debug info into SIGTRAP handling - handling user-generated (and unknown) SIGTRAP events - adding missing error handling to the generic signal handler - fixing attaching to processes - switching watchpoint helpers to use llvm::Error - minor style and formatting changes This fixes a number of tests, mostly related to fixed attaching. Differential Revision: https://reviews.llvm.org/D91167	2020-11-10 20:20:44 +01:00
Alexandre Rames	58c586e701	Allow searching for prebuilt implicit modules. This reverts commit `c67656b994`, and addresses the build issue.	2020-11-10 10:14:13 -08:00
David Tenty	ae032e2714	[CMake][ExecutionEngine] add HAVE_(DE)REGISTER_FRAME as config.h macros The macro HAVE_EHTABLE_SUPPORT is used by parts of ExecutionEngine to tell whether __register_frame/__deregister_frame are available to register the FDE for generated (JIT) code. It's currently set by a slowly growing set of macro tests in the respective headers, which is updated now and then when it fails to link on some platform or another due to the symbols being missing (see for example https://bugs.llvm.org/show_bug.cgi?id=5715). This change converts the macro into two HAVE_(DE)REGISTER_FRAME config.h macros (like most of the other HAVE_* macros) and sets them based on whether CMake can actually find a definition for these symbols to link to at configuration time. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D87114	2020-11-10 13:09:44 -05:00
David Green	08d1c2d470	[ARM] Introduce t2DoLoopStartTP This introduces a new pseudo instruction, almost identical to a t2DoLoopStart but taking 2 parameters - the original loop iteration count needed for a low overhead loop, plus the VCTP element count needed for a DLSTP instruction setting up a tail predicated loop. The idea is that the instruction holds both values and the backend ARMLowOverheadLoops pass can pick between the two, depending on whether it creates a tail predicated loop or falls back to a low overhead loop. To do that there needs to be something that converts a t2DoLoopStart to a t2DoLoopStartTP, for which this patch repurposes the MVEVPTOptimisationsPass as a "tail predication and vpt optimisation" pass. The extra operand for the t2DoLoopStartTP is chosen based on the operands of VCTP's in the loop, and the instruction is moved as late in the block as possible to attempt to increase the likelihood of making tail predicated loops. Differential Revision: https://reviews.llvm.org/D90591	2020-11-10 18:08:12 +00:00
Louis Dionne	02af11094f	[libc++] NFC: Add helper methods to simplify __shared_ptr_emplace The previous implementation was really difficult to follow, especially with the get() method sharing the same name as std::unique_ptr::get().	2020-11-10 12:49:19 -05:00
Raphael Isemann	7211604220	[lldb][NFC] Add lldb-server to the shell tests disallow list This prevents writing a test that references lldb-server (instead of %lldb-server). Addresses review feedback from D91155.	2020-11-10 18:48:28 +01:00
Jonas Paulsson	89a1042b6a	Make inferLibFuncAttributes() add SExt attribute on second arg to ldexp. This was missing as discovered by the SystemZ multistage bot: http://lab.llvm.org:8011/#/builders/8, where wrong code resulted when this extension was not performed. Thanks for review by Ulrich Weigand and Roman Lebedev. Differential Revision: https://reviews.llvm.org/D90760	2020-11-10 18:32:15 +01:00
Jay Foad	bb8d1437a6	[AMDGPU] Simplify multiclass EXP_m. NFC.	2020-11-10 17:28:36 +00:00
David Green	dbe1bf63aa	[ARM] Cleanup for ARMLowOverheadLoops. NFC	2020-11-10 17:28:07 +00:00
sameeran joshi	2f7a41b2a7	[Flang][OpenMP] Fix 'Internal: no symbol found' for OpenMP aligned and linear clause. The initial approach was to go with changing parser nodes from `std::list<parser::Name>` to `OmpObjectList`, but that might have led to illegal programs. Resolving the symbols inside `OmpAttributeVisitor`. Fix a couple of `XFAIL` tests. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D90538	2020-11-10 22:47:13 +05:30
Simon Pilgrim	46a734621d	[ValueTracking] computeKnownBitsFromShiftOperator - always return with Known2 containing the shifted value source. NFCI. As detailed on D90479, in most circumstances we will always call computeKnownBits for Op0, so always perform this by pulling out the duplicate calls.	2020-11-10 17:03:17 +00:00
Simon Pilgrim	929a127932	[ValueTracking] computeKnownBitsFromShiftOperator - consistently use Known2 for the shifted value. NFCI. Minor cleanup as part of getting D90479 moving again.	2020-11-10 17:03:17 +00:00
David Green	c7e275388e	[ARM] Don't aggressively unroll vector remainder loops We already do not unroll loops with vector instructions under MVE, but that does not include the remainder loops that the vectorizer produces. These remainder loops will be rarely executed and are not worth unrolling, as the trip count is likely to be low if they get executed at all. Luckily they get llvm.loop.isvectorized to make recognizing them simpler. We have wanted to do this for a while but hit issues with low overhead loops being reverted due to difficult register allocation. With recent changes that seems to be less of an issue now. Differential Revision: https://reviews.llvm.org/D90055	2020-11-10 17:01:31 +00:00
Kazu Hirata	85cd7ffade	[BranchProbabilityInfo] Use a range-based for loop (NFC)	2020-11-10 09:00:18 -08:00
sameeran joshi	7282d9e170	[Flang][Docs] Fix warnings when building docs. The following warning was seen with recommonmark(0.5.0) and sphinx(1.8.5). `parser.py:75: UserWarning: Container node skipped: type=document warn("Container node skipped: type={0}".format(mdnode.t))` The warnings are due to an issue in older versions of recommonmark (a Python package). A better solution is to use the latest version of recommonmark(>=0.6.0) to avoid this issue in the first place. This patch fixes the warnings for older versions. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D91117	2020-11-10 22:24:49 +05:30
Jay Foad	c981fa169a	[AMDGPU] Remove unused check prefixes	2020-11-10 16:52:32 +00:00
David Green	7f34b9ddf8	[Sphinx] Fix langref formatting. NFC	2020-11-10 16:47:43 +00:00
David Green	73a6cd4b6b	[ARM] Add a RegAllocHint for hinting t2DoLoopStart towards LR This hints the operand of a t2DoLoopStart towards using LR, which can help make it more likely to become t2DLS lr, lr. This makes it easier to move if needed (as the input is the same as the output), or potentially remove entirely. The hint is added after others (from COPY's etc) which still take precedence. It needed to find a place to add the hint, which currently uses the post isel custom inserter. Differential Revision: https://reviews.llvm.org/D89883	2020-11-10 16:28:57 +00:00
Paul Robinson	def26af4ea	Revert "The arm64 triple requires AArch64 not ARM target" This reverts commit `e7256825d5`. apparently it's not that simple. https://lab.llvm.org:8011/#/builders/109/2412	2020-11-10 08:27:05 -08:00
Jonas Devlieghere	8da14fb76c	[lldb] Propagate llvm::Error to report_fatal_error Instead of having a custom error message, propagate the llvm::Error from SystemInitializerCommon. I didn't realize we had this overload until Pavel mentioned it in D90987 today.	2020-11-10 08:19:47 -08:00
Paul Robinson	e7256825d5	The arm64 triple requires AArch64 not ARM target Failure seen if you configure ARM target but not AArch64, as here: http://lab.llvm.org:8011/#/builders/59/builds/271	2020-11-10 08:19:02 -08:00
David Green	b2ac9681a7	[ARM] Alter t2DoLoopStart to define lr This changes the definition of t2DoLoopStart from t2DoLoopStart rGPR to GPRlr = t2DoLoopStart rGPR This will hopefully mean that low overhead loops are more tied together, and we can more reliably generate loops without reverting or being at the whims of the register allocator. This is a fairly simple change in itself, but leads to a number of other required alterations. - The hardware loop pass, if UsePhi is set, now generates loops of the form: %start = llvm.start.loop.iterations(%N) loop: %p = phi [%start], [%dec] %dec = llvm.loop.decrement.reg(%p, 1) %c = icmp ne %dec, 0 br %c, loop, exit - For this a new llvm.start.loop.iterations intrinsic was added, identical to llvm.set.loop.iterations but produces a value as seen above, gluing the loop together more through def-use chains. - This new intrinsic conceptually produces the same output as input, which is taught to SCEV so that the checks in MVETailPredication are not affected. - Some minor changes are needed to the ARMLowOverheadLoop pass, but it has been left mostly as before. We should now more reliably be able to tell that the t2DoLoopStart is correct without having to prove it, but t2WhileLoopStart and tail-predicated loops will remain the same. - And all the tests have been updated. There are a lot of them! This patch on its own might cause more trouble than it helps, with more tail-predicated loops being reverted, but some additional patches can hopefully improve upon that to get to something that is better overall. Differential Revision: https://reviews.llvm.org/D89881	2020-11-10 15:57:58 +00:00
Ayshe Kuran	55ec2ba4bc	Fix PR47973: Addressing integer division edge case with INT_MIN Adjustment to integer division in int_div_impl.inc to avoid undefined behaviour that can occur as a result of having INT_MIN as one of the parameters. Reviewed By: sepavloff Differential Revision: https://reviews.llvm.org/D90218	2020-11-10 15:57:06 +00:00
Alexey Bataev	dcde6f17fd	Revert "[libomptarget] Add support for target update non-contiguous" This reverts commit `6847bcec1a`. It breaks the build of libomptarget.	2020-11-10 07:49:00 -08:00
Simon Pilgrim	05954c2b69	[X86] Remove unused check-prefixes from vector rotate tests	2020-11-10 15:45:38 +00:00
Simon Pilgrim	75adc8bb4b	[X86] Remove unused check-prefixes from vector trunc tests	2020-11-10 15:45:38 +00:00
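
Commit 8262e94a6d above contrasts setting a register class with constraining one. The toy model below illustrates that difference only; it is not LLVM's API. Register classes are modeled as plain sets of register names, and the names `RegClass`, `setClass`, `constrainClass`, and the register lists used with them are hypothetical.

```cpp
#include <algorithm>
#include <iterator>
#include <set>
#include <string>

// Toy model (an assumption of this sketch): a register class is simply the
// set of physical registers it allows.
using RegClass = std::set<std::string>;

// Models "set": the wanted class replaces the current one unconditionally.
// If the current class was narrower, registers it excluded become legal again.
RegClass setClass(const RegClass & /*current*/, const RegClass &wanted) {
  return wanted;
}

// Models "constrain": keep only registers allowed by both classes, so the
// result is never wider than the current class.
RegClass constrainClass(const RegClass &current, const RegClass &wanted) {
  RegClass out;
  std::set_intersection(current.begin(), current.end(), wanted.begin(),
                        wanted.end(), std::inserter(out, out.begin()));
  return out;
}
```

With a narrow current class and a wider wanted class, `setClass` returns the wider set while `constrainClass` returns the narrow one, which is the behavior the commit message asks for.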
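
Commit b86908171e above resolves the LC_UUID circular dependency by writing an all-zero placeholder first and patching the real value in afterwards. The sketch below shows that two-pass idea on a flat byte buffer. It is not lld's code: the layout, the names `toyDigest` and `writeImage`, and the FNV-1a-style stand-in hash (the commit uses MD5) are all assumptions made for illustration.

```cpp
#include <algorithm>
#include <array>
#include <cstddef>
#include <cstdint>
#include <vector>

constexpr std::size_t kUuidSize = 16;

// Stand-in 128-bit digest: two FNV-1a-style passes, the second with an
// arbitrary seed. This is NOT MD5; it only gives the sketch something to patch in.
std::array<uint8_t, kUuidSize> toyDigest(const std::vector<uint8_t> &data) {
  uint64_t h[2] = {0xcbf29ce484222325ULL, 0x84222325cbf29ce4ULL};
  for (int i = 0; i < 2; ++i)
    for (uint8_t b : data) {
      h[i] ^= b;
      h[i] *= 0x100000001b3ULL;
    }
  std::array<uint8_t, kUuidSize> out{};
  for (int i = 0; i < 2; ++i)
    for (int j = 0; j < 8; ++j)
      out[i * 8 + j] = static_cast<uint8_t>(h[i] >> (8 * j));
  return out;
}

// Lays out header, a zeroed UUID slot, and payload, then overwrites the slot
// with a digest of the fully laid-out image. Reports the slot offset.
std::vector<uint8_t> writeImage(const std::vector<uint8_t> &header,
                                const std::vector<uint8_t> &payload,
                                std::size_t &uuidOffset) {
  std::vector<uint8_t> image(header);
  uuidOffset = image.size();
  image.insert(image.end(), kUuidSize, 0); // pass 1: placeholder fixes all offsets
  image.insert(image.end(), payload.begin(), payload.end());
  const auto uuid = toyDigest(image);      // hash covers the zeroed slot
  std::copy(uuid.begin(), uuid.end(),      // pass 2: patch the slot in place
            image.begin() + static_cast<std::ptrdiff_t>(uuidOffset));
  return image;
}
```

Because the digest is computed over the image with the slot zeroed, a checker can reproduce it by zeroing the slot again and rehashing.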
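
Commit 703038b35a above concerns when the operand of a return statement is implicitly moved. As a small illustration of implicit move itself (it does not exercise the volatile case the commit fixes), the sketch below returns a by-value parameter, which is not eligible for copy elision but is still treated as an rvalue first. The names `Tracker`, `passThrough`, and the counters are hypothetical.

```cpp
// Global counters so the example also builds before C++17 inline variables.
int g_copies = 0;
int g_moves = 0;

struct Tracker {
  Tracker() = default;
  Tracker(const Tracker &) { ++g_copies; }
  Tracker(Tracker &&) noexcept { ++g_moves; }
};

// `t` is a non-volatile automatic object (a parameter). Returning it selects
// the move constructor even though `t` is written as an lvalue.
Tracker passThrough(Tracker t) { return t; }
```

Calling `passThrough(a)` with an lvalue `a` copies once into the parameter and then moves out of it on return; no second copy is made.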
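
Commit 7d85f732b1 above reports that an iterator became non-copyable because of a const pointer. One general way this happens in C++ is a const data member, which deletes the implicit copy assignment operator while leaving copy construction intact. Whether that is the exact mechanism inside DeclContextLookupResult is not stated in the commit message; the types below are hypothetical stand-ins.

```cpp
#include <type_traits>

// A const data member: copy construction still works, assignment does not.
struct ConstMemberIter {
  int *const Ptr;
};

// The same shape without the const member is freely assignable.
struct PlainIter {
  int *Ptr;
};

static_assert(std::is_copy_constructible<ConstMemberIter>::value,
              "initializing a copy is still fine");
static_assert(!std::is_copy_assignable<ConstMemberIter>::value,
              "'It = other;' is rejected for the const-member type");
static_assert(std::is_copy_assignable<PlainIter>::value,
              "'It = other;' compiles once the const member is gone");
```

This matches the usage quoted in the commit: `auto It = begin();` compiles, while a later `It = end();` does not.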
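
Commit 55ec2ba4bc above avoids undefined behaviour when INT_MIN is an operand of a software integer division. A common way to do that is to take magnitudes in unsigned arithmetic, where negation wraps instead of overflowing. The sketch below shows that technique under stated assumptions; it is not the code in int_div_impl.inc, and the names `magnitude` and `divViaUnsigned` are hypothetical.

```cpp
#include <cstdint>

// |v| computed in unsigned arithmetic. For INT32_MIN this yields 2147483648,
// which signed negation could not represent.
uint32_t magnitude(int32_t v) {
  return v < 0 ? 0u - static_cast<uint32_t>(v) : static_cast<uint32_t>(v);
}

// Truncating signed division built on unsigned division.
// Preconditions (assumed by this sketch): b != 0, and not
// (a == INT32_MIN && b == -1), whose true quotient does not fit in int32_t.
int32_t divViaUnsigned(int32_t a, int32_t b) {
  const uint32_t q = magnitude(a) / magnitude(b);
  const bool negative = (a < 0) != (b < 0);
  // The conversion back to int32_t is modular on two's-complement targets
  // (guaranteed since C++20, implementation-defined before that).
  return static_cast<int32_t>(negative ? 0u - q : q);
}
```

For example, `divViaUnsigned(INT32_MIN, 2)` never negates a signed value, so the INT_MIN operand is handled without overflow.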

371714 Commits