llvm-project

Commit Graph

Author	SHA1	Message	Date
Chuanqi Xu	405bf90235	[NFC] [Pipelines] Hoist CoroCleanup as Module Pass This is similar to previous patch https://reviews.llvm.org/D123925. It could also reduce the time we call declaresCoroCleanupIntrinsics. And it is helpful for further changes. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D124362	2022-05-05 15:15:09 +08:00
Chuanqi Xu	7d40f562e7	[Pipelines] Hoist CoroCleanup to avoid blocking optimizations CoroCleanup is designed to lowering all the remaining coroutine intrinsics. It is required to run after CoroSplit only. However, the position of CoroCleanup now is far too late. The downside here is that the unlowered coroutine instrincs might blocking other optimizations too. So it should be a pure win to hoist the position of CoroCleanup. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D124360	2022-05-05 15:13:27 +08:00
Simon Pilgrim	e04ca7c4f1	[Coroutines] Regenerate coro-retcon-resume-values.ll	2022-05-01 13:21:55 +01:00
Chuanqi Xu	7eaa84eac3	[NFC] Code cleanups for coroutine after we remvoed legacy passes	2022-04-21 15:32:46 +08:00
Chuanqi Xu	483efc9ad0	[Pipelines] Remove Legacy Passes in Coroutines The legacy passes are deprecated now and would be removed in near future. This patch tries to remove legacy passes in coroutines. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D123918	2022-04-21 10:59:11 +08:00
Chuanqi Xu	f9bee35689	[Pipelines] Hoist CoroEarly as a module pass This change could reduce the time we call `declaresCoroEarlyIntrinsics`. And it is helpful for future changes. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D123925	2022-04-19 11:04:24 +08:00
Nikita Popov	792f80e166	[CoroSplit] Use freeze instead of bitcast for dummy instructions Not all types that can appear in arguments can be bitcasts -- in particular, bitcasts do not support struct types.	2022-04-01 13:07:25 +02:00
Nikita Popov	68d27587e4	[CoroSplit] Handle argument being the frame pointer (PR54523) If the frame pointer is an argument of the original pointer (which happens with opaque pointers), then we currently first replace the argument with undef, which will prevent later replacement of the old frame pointer with the new one. Fix this by replacing arguments with some dummy instructions first, and then replacing those with undef later. This gives us a chance to replace the frame pointer before it becomes undef. Fixes https://github.com/llvm/llvm-project/issues/54523. Differential Revision: https://reviews.llvm.org/D122375	2022-04-01 12:37:29 +02:00
Arthur Eubanks	9bd66b312c	[PassManager][Coroutine] Run passes under -O0 conditionally and run GlobalDCE CoroSplit lowers various coroutine intrinsics. It's a CGSCC pass and CGSCC passes don't run on unreachable functions. Normally GlobalDCE will come along and delete unreachable functions, but we don't run GlobalDCE under -O0, so an unreachable function with coroutine intrinsics may never have CoroSplit run on it. This patch adds GlobalDCE when coroutines intrinsics are present. It also now runs all coroutine passes conditional when coroutine intrinsics are present. This should also solve the -O0 regression reported in D105877 due to LazyCallGraph construction. Fixes https://github.com/llvm/llvm-project/issues/54117 Reviewed By: ChuanqiXu Differential Revision: https://reviews.llvm.org/D122275	2022-03-23 11:03:26 -07:00
Nikita Popov	ce6ca00a92	[CoroSplit] Avoid self-replacement With opaque pointers, the bitcast might be a no-op, and this can end up trying to replace a value with itself, which is illegal.	2022-03-14 13:53:31 +01:00
Michael Gottesman	0b647fc529	[debug-info] Debug salvage llvm.dbg.addr in original function that point into the coroutine frame when splitting coros. We are already doing this in the split functions while we clone. This just handles the original function. I also updated the coroutine split test to validate that we are always referring to the msg in the context object instead of in a shadow copy. rdar://83957028 Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D121324	2022-03-09 14:02:09 -08:00
Nikita Popov	1bd33691cb	[CoroElide] Remove fallback for frame layout determination Only determine the frame layout based on dereferenceable and align attributes, and remove the type-based fallback, which is incompatible with opaque pointers. The dereferenceable attribute is required, while the align attribute uses default alignment of 1 (commonly, align 1 attributes do not get placed, relying on default alignment). The CoroSplit pass producing the resume function adds the necessary attributes in `7daed35911/llvm/lib/Transforms/Coroutines/CoroSplit.cpp (L840)`, and their presence is checked in coro-debug.ll at least. Differential Revision: https://reviews.llvm.org/D120988	2022-03-07 11:23:02 +01:00
Nikita Popov	9bca4ea364	[Coroutines] Allow FramePtr to be an Argument With opaque pointers, after splitRetconCoroutine() the FramePtr may be an Argument rather than an Instruction. With typed pointers, this currently doesn't happen because the FramePtr would be a bitcast instruction. Fix this by making FramePtr a Value and adding a helper for the "after FramePtr" insertion point, which would be the start of the function in the Argument case. Differential Revision: https://reviews.llvm.org/D120994	2022-03-07 10:58:56 +01:00
Nikita Popov	a266af7211	[InstCombine] Canonicalize SPF to min/max intrinsics Now that integer min/max intrinsics have good support in both InstCombine and other passes, start canonicalizing SPF min/max to intrinsic min/max. Once this sticks, we can stop matching SPF min/max in various places, and can remove hacks we have for preventing infinite loops and breaking of SPF canonicalization. Differential Revision: https://reviews.llvm.org/D98152	2022-02-24 09:01:20 +01:00
Michael Gottesman	13681ad654	[move-function] Make test more generally by removing unneeded line. Otherwise this is can be sensitive in the face of changes in register names. I also gardened the test case a little to make it look a little nicer. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D120276	2022-02-21 14:40:23 -08:00
Roman Lebedev	371fcb720e	[SimplifyCFG][PhaseOrdering] Defer lowering switch into an integer range comparison and branch until after at least the IPSCCP That transformation is lossy, as discussed in https://github.com/llvm/llvm-project/issues/53853 and https://github.com/rust-lang/rust/issues/85133#issuecomment-904185574 This is an alternative to D119839, which would add a limited IPSCCP into SimplifyCFG. Unlike lowering switch to lookup, we still want this transformation to happen relatively early, but after giving a chance for the things like CVP to do their thing. It seems like deferring it just until the IPSCCP is enough for the tests at hand, but perhaps we need to be more aggressive and disable it until CVP. Fixes https://github.com/llvm/llvm-project/issues/53853 Refs. https://github.com/rust-lang/rust/issues/85133 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D119854	2022-02-17 12:13:55 +03:00
Michael Gottesman	19279ffc77	[debug-info] If one sees a spill with a dbg.addr use, salvageDebugInfo upon it and don't hoist it. This ensures that if we have a dbg.addr in a coroutine funclet that is on one of our function arguments, that the dbg.addr is not mapped to undef and also that later it isn't hoisted to the front of the basic block. Instead it remains at its original cloned location. rdar://83957028 Differential Revision: https://reviews.llvm.org/D119576	2022-02-11 15:15:13 -08:00
Dávid Bolvanský	90f185c964	Revert "[AlwaysInliner] Enable call site inlining to make flatten attribute working again (#53360 )" This reverts commit `ceec438368`. Clang tests fail.	2022-01-25 23:13:46 +01:00
Dávid Bolvanský	ceec438368	[AlwaysInliner] Enable call site inlining to make flatten attribute working again (#53360 ) Problem: Migration to new PM broke flatten attribute. This is one use case why LLVM should support inlining call-site with alwaysinline. The flatten attribute is nowdays broken, so we should either land patch like this one or remove everything related to flatten attribute from Clang. Second use case is something like "per call site inlining intrinsics" to control inlining even more; mentioned in https://lists.llvm.org/pipermail/cfe-dev/2018-September/059232.html Fixes https://github.com/llvm/llvm-project/issues/53360 Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D117965	2022-01-25 22:55:30 +01:00
Chuanqi Xu	b574048239	[NFC] [Coroutines] Rename tests in coro-align This is required by ychen. See https://reviews.llvm.org/D117542	2022-01-24 11:04:13 +08:00
Chuanqi Xu	c8ecf12bc3	[Coroutines] Offering llvm.coro.align intrinsic It is a known problem that we can't align the switch-based coroutine frame if the alignment exceeds std::max_align_t (which is 16 usually). We could solve the problem on the middle-end by dynamically transforming or in the frontend by emitting aligned allocation function. If we need to solve it in the frontend, the middle end need to offer an intrinsic to tell the alignment at least. This patch tries to offer such an intrinsic called llvm.coro.align. Reviewed By: https://reviews.llvm.org/D117542 Differential revision: https://reviews.llvm.org/D117542	2022-01-19 09:52:45 +08:00
Chuanqi Xu	22225cc5e6	[Coroutines] Handle lifetime markers, bitcast and unused instruciton for symmetric transfer This fixes bug49888. The root cause for this is that simplifyTerminatorLeadingToRet didn't handle lifetime markers well. Another issue also noted in D116327 is that we deleted some inlined optimization pass in CoroSplit so that simplifyTerminatorLeadingToRet need to remove dead instructions by hand. This patch fixes bug49888 by skipping lifetime markers and bitcast instruction and removing dead instructions by hand in simplifyTerminatorLeadingToRet. Reviewed By: junparser Differential Revision: https://reviews.llvm.org/D116330	2022-01-12 15:58:38 +08:00
Chuanqi Xu	403772ff1c	[Coroutines] Enhance symmetric transfer for constant CmpInst This fixes bug52896. Simply, some symmetric transfer optimization chances get invalided due to we delete some inlined optimization passes in `822b92a`. This would cause stack-overflow in some situations which should be avoided by the design of coroutine. This patch tries to fix this by transforming the constant CmpInst instruction which was done in the deleted passes. Reviewed By: rjmccall, junparser Differential Revision: https://reviews.llvm.org/D116327	2022-01-12 10:14:37 +08:00
Nick Desaulniers	79ebc3b0dd	[llvm][test] rewrite callbr to use i rather than X constraint NFC In D115311, we're looking to modify clang to emit i constraints rather than X constraints for callbr's indirect destinations. Prior to doing so, update all of the existing tests in llvm/ to match. Reviewed By: void, jyknight Differential Revision: https://reviews.llvm.org/D115410	2022-01-11 11:31:08 -08:00
Chuanqi Xu	c75cedc237	[Coroutines] Set presplit attribute in Clang and mlir This fixes bug49264. Simply, coroutine shouldn't be inlined before CoroSplit. And the marker for pre-splited coroutine is created in CoroEarly pass, which ran after AlwaysInliner Pass in O0 pipeline. So that the AlwaysInliner couldn't detect it shouldn't inline a coroutine. So here is the error. This patch set the presplit attribute in clang and mlir. So the inliner would always detect the attribute before splitting. Reviewed By: rjmccall, ezhulenev Differential Revision: https://reviews.llvm.org/D115790	2022-01-05 10:25:02 +08:00
Chuanqi Xu	ea6a3f9f96	[NFC] [Coroutines] Fix incorrect use of coroutine intrinsics The inlined llvm.coro.id should contain the function it refers to. The modifed test would caused the compiler crash under O2. See issue52912 for example.	2022-01-04 11:13:48 +08:00
Chuanqi Xu	7c9fb58cac	[NFC] [Coroutines] Add tests for coro-split-musttail Add two tests to address the problems during marking coro.resume calls as musttail. The two problems are bitcast instruction and unused instruciton respectively.	2021-12-28 16:20:06 +08:00
Chuanqi Xu	1ef3f83ef2	[NFC] [Coroutines] Add tests to address the problem for converting to musttail call Add two tests to address the problem for missing oppotunities to convert calls to musttail call.	2021-12-27 20:27:31 +08:00
Chuanqi Xu	ca4d2c368d	Revert "[NFC] [Coroutines] Add a test for icmp use of coro.suspend to prevent musttail call converting" This reverts commit `21aa4d5d5e`. The test added is not proper. It would be passed all the time since it is in the ramp function.	2021-12-27 19:05:22 +08:00
Chuanqi Xu	21aa4d5d5e	[NFC] [Coroutines] Add a test for icmp use of coro.suspend to prevent musttail call converting Add a test to show the false negative optimization oppotunity to not convert a resume call to musttail call. It should could be.	2021-12-27 17:28:13 +08:00
Chuanqi Xu	320e4efe99	[C++20] [Coroutines] Mark coroutine done if unhandled_exception throws According to [dcl.fct.def.coroutine]/p14: > If the evaluation of the expression promise.unhandled_exception() > exits via an exception, the coroutine is considered suspended at the > final suspend point. But this is not implemented in clang before. This patch would implement this feature by marking the coroutine as done at the place of coro.end(frame, /InUnwindPath=/true ). Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D115219	2021-12-09 14:58:06 +08:00
Arnold Schwaighofer	7c0e066869	[coro async] Don't use lifetime.start based alloca localization for ABI.Async/ABI.Retcon Infinite loops can lead to an IR representation where the lifetime.end intrinsice is missing. The code to do lifetime based optimization then fails to see that an address escapes (is life) accross a supspend. Eventually, we could detect such situations and disable it under more narrow circumstances. For now, do the correct thing. rdar://83635953 Differential Revision: https://reviews.llvm.org/D110949	2021-12-06 11:50:51 -08:00
Arnold Schwaighofer	64ba9dd943	[coro async] Disable lifetime.start sinking for ABI::Async and ABI::Retcon It does not handle loops correctly i.e it moves the lifetime.start intrinsic into a loop rendering the stack object as not alive for part of the loop. ``` entry: %obj = alloca i8 lifetime.start(%obj) br loop loop: coro.suspend() escape(%obj) cond_br %cond, label %exit, label loop br loop exit: lifetime.end(%obj ``` After sinking: ``` entry: %obj = alloca i8 br loop loop: coro.suspend() lifetime.start(%obj) escape(%obj) cond_br %cond, label %exit, label loop br loop exit: lifetime.end(%obj ``` rdar://83411917 Differential Revision: https://reviews.llvm.org/D110953	2021-12-06 10:59:43 -08:00
Chuanqi Xu	2ae5011827	[Coroutines] Handle CallBrInst in SalvageDebugInfo Reviewed by: StephenTozer Differential Revision: https://reviews.llvm.org/D115139	2021-12-06 22:55:05 +08:00
Philip Reames	7b54de5fef	[funcattrs] Fix a bug in recently introduced writeonly argument inference This fixes a bug in `740057d`. There's two ways to describe the issue: * One caller hasn't yet proven nocapture on the argument. Given that, the inference routine is responsible for bailing out on a potential capture. * Even if we know the argument is nocapture, the access inference needs to traverse the exact set of users the capture tracking would (or exit conservatively). Even if capture tracking can prove a store is non-capturing (e.g. to a local alloc which doesn't escape), we still need to track the copy of the pointer to see if it's later reloaded and accessed again. Note that all the test changes except the newly added ones appear to be false negatives. That is, cases where we could prove writeonly, but the current code isn't strong enough. That's why I didn't spot this originally.	2021-12-03 08:57:15 -08:00
Chuanqi Xu	84980761a7	[Coroutines] Handle InvokeInst in SalvageDebugInfo Since coroutine would be splitted into pieces, compiler would move the dbg.declare intrinsic after the Storage is created to make sure the corresponding dbg instruction is still available aftet splitted. However, it would be problematic if the storage instruction is an InvokeInst, which is a terminator. We couldn't move instruction after an InvokeInst. This patch tries to move the dbg.declare intrinsic in the normal destination of the InvokeInst. It should make sense due to the Storage should be invalid in exception path.	2021-12-03 13:46:54 +08:00
Philip Reames	740057d185	[funcattrs] Infer writeonly argument attribute This change extends the current logic for inferring readonly and readnone argument attributes to also infer writeonly. This change is deliberately minimal; there's a couple of areas for follow up. * I left out all call handling and thus any benefit from the SCC walk. When examining the test changes, I realized the existing code is imprecise, and am going to fix that in it's own revision before adding in the writeonly handling. (Mostly because updating the tests is hard when I, the human, can't figure out whether the result is correct.) * I left out handling for storing a value (as opposed to storing to a pointer). This should benefit readonly/readnone as well, and applies to a bunch of other instructions. Seemed worth having as a separate review. Differential Revision: https://reviews.llvm.org/D114963	2021-12-02 13:04:09 -08:00
Arnold Schwaighofer	7d11c5dac2	Coro: Remove coro_end and coro_suspend_retcon in private unprocessed functions We might emit functions that are private and never called. The coro split pass only processes functions that might be called. Remove intrinsics that we can't generate code for. rdar://84619859 Differential Revision: https://reviews.llvm.org/D114021	2021-11-18 07:48:24 -08:00
Tim Northover	3d39612b3d	Coroutines: don't infer function attrs before lowering Coroutines have weird semantics that don't quite match normal LLVM functions, so trying to infer even simple attributes based on thier contents can go wrong.	2021-11-04 10:24:28 +00:00
Chuanqi Xu	ddbf196194	[Coroutines] Ignore partial lifetime markers refer of an alloca When I playing with Coroutines, I found that it is possible to generate following IR: ``` %struct = alloca ... %sub.element = getelementptr %struct, i64 0, i64 index ; index is not %zero lifetime.marker.start(%sub.element) % use of %sub.element lifetime.marker.end(%sub.element) store %struct to xxx ; %struct is escaping! <suspend points> ``` Then the AllocaUseVisitor would collect the lifetime marker for sub.element and treat it as the lifetime markers of the alloca! So it judges that the alloca could be put on the stack instead of the frame by judging the lifetime markers only. The root cause for the bug is that AllocaUseVisitor collects wrong lifetime markers. This patch fixes this. Reviewed By: lxfind Differential Revision: https://reviews.llvm.org/D112216	2021-10-22 09:49:50 +08:00
Arnold Schwaighofer	2df2b27d94	[cora async] Cleanup undefined llvm.coro.async.resume In situations where the coroutine function is not split we can just replace the async.resume by null. rdar://82591919 Differential Revision: https://reviews.llvm.org/D110191	2021-09-30 13:26:53 -07:00
Arthur Eubanks	096d9814aa	[opt] Remove some legacy PM flags Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D109664	2021-09-13 15:50:03 -07:00
Adrian Prantl	12de296d84	Tighten heuristic for coroutine debug info workaround. The OutermostLoad condition is supposed to strip the outermost DW_OP_deref operation because dbg.declares are implicitly indirect. This patch makes sure the heuristic is only applied to dbg.declare intrinsics and only if the outermost instruction is a load. This was found while qualifying the latest Swift compiler rebranch. rdar://82037764	2021-09-01 11:15:36 -07:00
Adrian Prantl	8ae5e0b154	Add missing nullptr check Unfortunatley the IR Verifier doesn't reject debug intrinsics that have nullptr as arguments, so coro::salvageDebugInfo for now also needs to deal with them. rdar://81979541	2021-08-17 13:59:52 -07:00
Adrian Prantl	a353edb8d6	Simplify coro::salvageDebugInfo() (NFC-ish) This patch removes the hand-rolled implementation of salvageDebugInfo for cast and GEPs and replaces it with a call into llvm::salvageDebugInfoImpl(). A side-effect of this is that additional redundant convert operations are introduced, but those don't have any negative effect on the resulting DWARF expression. rdar://80227769 Differential Revision: https://reviews.llvm.org/D107384	2021-08-10 15:21:18 -07:00
Arnold Schwaighofer	b987c283ae	[coro] Correct CurrentBlock tracking bug recently introduced We use the CurrentBlock to determine whether we have already processed a block. Don't reuse this variable for setting where we should insert the rematerialization. The rematerialization block is different to the current block when we rematerialize for coro suspend block users. Differential Revision: https://reviews.llvm.org/D107573	2021-08-09 10:41:41 -07:00
Chuanqi Xu	0237dbfdd3	[Coroutine] Record the elided coroutines Reviewed By: lxfind Differential Revision: https://reviews.llvm.org/D105606	2021-07-27 13:14:09 +08:00
Bjorn Pettersson	472462c472	[NewPM] Consistently use 'simplifycfg' rather than 'simplify-cfg' There was an alias between 'simplifycfg' and 'simplify-cfg' in the PassRegistry. That was the original reason for this patch, which effectively removes the alias. This patch also replaces all occurrances of 'simplify-cfg' by 'simplifycfg'. Reason for choosing that form for the name is that it matches the DEBUG_TYPE for the pass, and the legacy PM name and also how it is spelled out in other passes such as 'loop-simplifycfg', and in other options such as 'simplifycfg-merge-cond-stores'. I for some reason the name should be changed to 'simplify-cfg' in the future, then I think such a renaming should be more widely done and not only impacting the PassRegistry. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D105627	2021-07-09 09:47:03 +02:00
Arnold Schwaighofer	2937f8d148	[coro async] Cap the alignment of spilled values (vs. allocas) at the max frame alignment Before this patch we would normally use the ABI alignment which can be to high for the context alginment. For spilled values we don't need ABI alignment, since the frame entry's address is not escaped. rdar://79664965 Differential Revision: https://reviews.llvm.org/D105288	2021-07-07 08:06:25 -07:00
Arnold Schwaighofer	846a530e7d	Fix coro lowering of single predecessor phis Code assumes that uses of single predecessor phis are not live accross suspend points. Cleanup any single predecessor phis preceeding the code making this assumption. rdar://76020301 Differential Revision: https://reviews.llvm.org/D105488	2021-07-06 10:22:25 -07:00

1 2 3 4 5

228 Commits