llvm-project

Commit Graph

Author	SHA1	Message	Date
River Riddle	ace01605e0	[mlir] Split out a new ControlFlow dialect from Standard This dialect is intended to model lower level/branch based control-flow constructs. The initial set of operations are: AssertOp, BranchOp, CondBranchOp, SwitchOp; all split out from the current standard dialect. See https://discourse.llvm.org/t/standard-dialect-the-final-chapter/6061 Differential Revision: https://reviews.llvm.org/D118966	2022-02-06 14:51:16 -08:00
Chuanqi Xu	dbbe010908	[MLIR] [AsyncToLLVM] Use llvm.coro.align intrinsic Use llvm.coro.align to align coroutine frame properly. Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D117978	2022-01-25 19:28:25 +08:00
Benjamin Kramer	964dc368e7	[AsyncToLLVM] aligned_alloc requires the size to be a multiple of aignment, so round up Fixes a crash with debug malloc.	2022-01-17 21:48:00 +01:00
Benjamin Kramer	5acd6e0522	[AsyncToLLVM] Align frames to 64 bytes Coroutine lowering always takes the natural alignment when spilling to the frame (issue #53148) so using AVX2 or AVX512 in a coroutine doesn't work. Always overalign to 64 bytes to avoid this issue until we have a better solution. Differential Revision: https://reviews.llvm.org/D117501	2022-01-17 18:51:42 +01:00
William S. Moses	30d87d4a5d	[MLIR][LLVM] Permit integer types in switch other than i32 LLVM switchop currently only permits i32. Both LLVM IR and MLIR Standard switch permit other integer types leading to an illegal state when lowering an i8 switch from MLIR standard Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D113955	2021-11-16 12:00:37 -05:00
Mehdi Amini	1585b13024	Revert "[MLIR][LLVM] Permit integer types in switch other than i32" This reverts commit `94992670fc`. Build is broken with: tools/mlir/include/mlir/Dialect/LLVMIR/LLVMOps.cpp.inc:23996:3: error: no matching function for call to 'printSwitchOpCases' printSwitchOpCases(_odsPrinter, *this, getValue().getType(), getCaseValuesAttr(), getCaseDestinations(), getCaseOperands(), getCaseOperands().getTypes()); ^~~~~~~~~~~~~~~~~~	2021-11-16 05:59:12 +00:00
William S. Moses	94992670fc	[MLIR][LLVM] Permit integer types in switch other than i32 LLVM switchop currently only permits i32. Both LLVM IR and MLIR Standard switch permit other integer types leading to an illegal state when lowering an i8 switch from MLIR standard Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D113955	2021-11-16 00:46:25 -05:00
Mogball	a54f4eae0e	[MLIR] Replace std ops with arith dialect ops Precursor: https://reviews.llvm.org/D110200 Removed redundant ops from the standard dialect that were moved to the `arith` or `math` dialects. Renamed all instances of operations in the codebase and in tests. Reviewed By: rriddle, jpienaar Differential Revision: https://reviews.llvm.org/D110797	2021-10-13 03:07:03 +00:00
Eugene Zhulenev	92db09cde0	[mlir] AsyncRuntime: use int64_t for ref counting operations Workaround for SystemZ ABI problem: https://bugs.llvm.org/show_bug.cgi?id=51898 Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D110550	2021-09-27 07:55:01 -07:00
Eugene Zhulenev	fd52b4357a	[mlir] Async: check awaited operand error state after sync await Previously only await inside the async function (coroutine after lowering to async runtime) would check the error state Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D109229	2021-09-04 05:00:17 -07:00
Eugene Zhulenev	b537c5b414	[mlir] Async: clone constants into async.execute functions and parallel compute functions Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D107007	2021-08-02 12:17:41 -07:00
River Riddle	f8479d9de5	[mlir] Set the namespace of the BuiltinDialect to 'builtin' Historically the builtin dialect has had an empty namespace. This has unfortunately created a very awkward situation, where many utilities either have to special case the empty namespace, or just don't work at all right now. This revision adds a namespace to the builtin dialect, and starts to cleanup some of the utilities to no longer handle empty namespaces. For now, the assembly form of builtin operations does not require the `builtin.` prefix. (This should likely be re-evaluated though) Differential Revision: https://reviews.llvm.org/D105149	2021-07-28 21:00:10 +00:00
Alex Zinenko	881dc34f73	[mlir] replace llvm.mlir.cast with unrealized_conversion_cast The dialect-specific cast between builtin (ex-standard) types and LLVM dialect types was introduced long time before built-in support for unrealized_conversion_cast. It has a similar purpose, but is restricted to compatible builtin and LLVM dialect types, which may hamper progressive lowering and composition with types from other dialects. Replace llvm.mlir.cast with unrealized_conversion_cast, and drop the operation that became unnecessary. Also make unrealized_conversion_cast legal by default in LLVMConversionTarget as the majority of convesions using it are partial conversions that actually want the casts to persist in the IR. The standard-to-llvm conversion, which is still expected to run last, cleans up the remaining casts standard-to-llvm conversion, which is still expected to run last, cleans up the remaining casts Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D105880	2021-07-16 15:14:09 +02:00
Eugene Zhulenev	d43b23608a	[mlir:Async] Add the size parameter to the async.group Specify the `!async.group` size (the number of tokens that will be added to it) at construction time. `async.await_all` operation can potentially race with `async.execute` operations that keep updating the group, for this reason it is required to know upfront how many tokens will be added to the group. Reviewed By: ftynse, herhut Differential Revision: https://reviews.llvm.org/D104780	2021-06-25 10:26:50 -07:00
Eugene Zhulenev	39957aa424	[mlir] Add error state and error propagation to async runtime values Depends On D103102 Not yet implemented: 1. Error handling after synchronous await 2. Error handling for async groups Will be addressed in the followup PRs Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D103109	2021-05-27 09:28:47 -07:00
Julian Gross	e2310704d8	[MLIR] Create memref dialect and move dialect-specific ops from std. Create the memref dialect and move dialect-specific ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp AssumeAlignmentOp -> MemRef_AssumeAlignmentOp DeallocOp -> MemRef_DeallocOp DimOp -> MemRef_DimOp MemRefCastOp -> MemRef_CastOp MemRefReinterpretCastOp -> MemRef_ReinterpretCastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp LoadOp -> MemRef_LoadOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp SubViewOp -> MemRef_SubViewOp TransposeOp -> MemRef_TransposeOp TensorLoadOp -> MemRef_TensorLoadOp TensorStoreOp -> MemRef_TensorStoreOp TensorToMemRefOp -> MemRef_BufferCastOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D98041	2021-03-15 11:14:09 +01:00
Alexander Belyaev	a89035d750	Revert "[MLIR] Create memref dialect and move several dialect-specific ops from std." This commit introduced a cyclic dependency: Memref dialect depends on Standard because it used ConstantIndexOp. Std depends on the MemRef dialect in its EDSC/Intrinsics.h Working on a fix. This reverts commit `8aa6c3765b`.	2021-02-18 12:49:52 +01:00
Julian Gross	8aa6c3765b	[MLIR] Create memref dialect and move several dialect-specific ops from std. Create the memref dialect and move several dialect-specific ops without dependencies to other ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp DeallocOp -> MemRef_DeallocOp MemRefCastOp -> MemRef_CastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp TransposeOp -> MemRef_TransposeOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D96425	2021-02-18 11:29:39 +01:00
Eugene Zhulenev	25f80e16d1	[mlir] Async: add a separate pass to lower from async to async.coro and async.runtime Depends On D95000 Move async.execute outlining and async -> async.runtime lowering into the separate Async transformation pass Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D95311	2021-01-26 03:33:20 -08:00
Eugene Zhulenev	d37b5393e8	[mlir:Async] Use LLVM coro operations in async.coro lowering Instead of using llvm.call operations to call LLVM coro intrinsics use Coro operations from the LLVM dialect. (This was reviewed as a part of https://reviews.llvm.org/D94923 but was lost in arc land from local branch) Differential Revision: https://reviews.llvm.org/D95405	2021-01-25 16:42:11 -08:00
Eugene Zhulenev	9c53b8e52e	[mlir:Async] Add intermediate async.coro and async.runtime operations to simplify Async to LLVM lowering [NFC] No new functionality, mostly a cleanup and one more abstraction level between Async and LLVM IR. Instead of lowering from Async to LLVM coroutines and Async Runtime API in one shot, do it progressively via async.coro and async.runtime operations. 1. Lower from async to async.runtime/coro (e.g. async.execute to function with coro setup and runtime calls) 2. Lower from async.runtime/coro to LLVM intrinsics and runtime API calls Intermediate coro/runtime operations will allow to run transformations on a higher level IR and do not try to match IR based on the LLVM::CallOp properties. Although async.coro is very close to LLVM coroutines, it is not exactly the same API, instead it is optimized for usability in async lowering, and misses a lot of details that are present in @llvm.coro intrinsic. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D94923	2021-01-25 14:04:33 -08:00
Alex Zinenko	dd5165a920	[mlir] replace LLVM dialect float types with built-ins Continue the convergence between LLVM dialect and built-in types by replacing the bfloat, half, float and double LLVM dialect types with their built-in counterparts. At the API level, this is a direct replacement. At the syntax level, we change the keywords to `bf16`, `f16`, `f32` and `f64`, respectively, to be compatible with the built-in type syntax. The old keywords can still be parsed but produce a deprecation warning and will be eventually removed. Depends On D94178 Reviewed By: mehdi_amini, silvas, antiagainst Differential Revision: https://reviews.llvm.org/D94179	2021-01-08 17:38:12 +01:00
Eugene Zhulenev	61422c8b66	[mlir] Async: add support for lowering async value operands to LLVM Depends On D93592 Add support for `async.execute` async value unwrapping operands: ``` %token = async.execute(%async_value as %unwrapped : !async.value<!my.type>) { ... async.yield } ``` Reviewed By: csigg Differential Revision: https://reviews.llvm.org/D93598	2020-12-25 02:25:20 -08:00
Eugene Zhulenev	621ad468d9	[mlir] Async: lowering async.value to LLVM 1. Add new methods to Async runtime API to support yielding async values 2. Add lowering from `async.yield` with value payload to the new runtime API calls `async.value` lowering requires that payload type is convertible to LLVM and supported by `llvm.mlir.cast` (DialectCast) operation. Reviewed By: csigg Differential Revision: https://reviews.llvm.org/D93592	2020-12-25 02:23:48 -08:00
Eugene Zhulenev	a86a9b5ef7	[mlir] Automatic reference counting for Async values + runtime support for ref counted objects Depends On D89963 Automatic reference counting algorithm outline: 1. `ReturnLike` operations forward the reference counted values without modifying the reference count. 2. Use liveness analysis to find blocks in the CFG where the lifetime of reference counted values ends, and insert `drop_ref` operations after the last use of the value. 3. Insert `add_ref` before the `async.execute` operation capturing the value, and pairing `drop_ref` before the async body region terminator, to release the captured reference counted value when execution completes. 4. If the reference counted value is passed only to some of the block successors, insert `drop_ref` operations in the beginning of the blocks that do not have reference coutned value uses. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D90716	2020-11-20 03:08:44 -08:00
Eugene Zhulenev	c30ab6c2a3	[mlir] Transform scf.parallel to scf.for + async.execute Depends On D89958 1. Adds `async.group`/`async.awaitall` to group together multiple async tokens/values 2. Rewrite scf.parallel operation into multiple concurrent async.execute operations over non overlapping subranges of the original loop. Example: ``` scf.for (%i, %j) = (%lbi, %lbj) to (%ubi, %ubj) step (%si, %sj) { "do_some_compute"(%i, %j): () -> () } ``` Converted to: ``` %c0 = constant 0 : index %c1 = constant 1 : index // Compute blocks sizes for each induction variable. %num_blocks_i = ... : index %num_blocks_j = ... : index %block_size_i = ... : index %block_size_j = ... : index // Create an async group to track async execute ops. %group = async.create_group scf.for %bi = %c0 to %num_blocks_i step %c1 { %block_start_i = ... : index %block_end_i = ... : index scf.for %bj = %c0 t0 %num_blocks_j step %c1 { %block_start_j = ... : index %block_end_j = ... : index // Execute the body of original parallel operation for the current // block. %token = async.execute { scf.for %i = %block_start_i to %block_end_i step %si { scf.for %j = %block_start_j to %block_end_j step %sj { "do_some_compute"(%i, %j): () -> () } } } // Add produced async token to the group. async.add_to_group %token, %group } } // Await completion of all async.execute operations. async.await_all %group ``` In this example outer loop launches inner block level loops as separate async execute operations which will be executed concurrently. At the end it waits for the completiom of all async execute operations. Reviewed By: ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D89963	2020-11-13 04:02:56 -08:00
Rahul Joshi	8b5a3e4632	[MLIR] Change FuncOp assembly syntax to print visibility inline instead of in attrib dict. - Change syntax for FuncOp to be `func <visibility>? @name` instead of printing the visibility in the attribute dictionary. - Since printFunctionLikeOp() and parseFunctionLikeOp() are also used by other operations, make the "inline visibility" an opt-in feature. - Updated unit test to use and check the new syntax. Differential Revision: https://reviews.llvm.org/D90859	2020-11-09 11:08:08 -08:00
Eugene Zhulenev	f507aa17b7	[mlir] Implement lowering to LLVM of async.execute ops with token dependencies Add support for lowering `async.execute` operations with token dependencies Example: ``` %dep = ... : !async.token %token = async.execute[%dep] { ... } ``` Token dependencies lowered to `async.await` operations inside the outline coroutine body. Reviewed By: herhut, mehdi_amini, ftynse Differential Revision: https://reviews.llvm.org/D89958	2020-10-30 05:59:03 -07:00
Lei Zhang	36ce915ac5	Revert "Revert "[mlir] Convert from Async dialect to LLVM coroutines"" This reverts commit `4986d5eaff` with proper patches to CMakeLists.txt: - Add MLIRAsync as a dependency to MLIRAsyncToLLVM - Add Coroutines as a dependency to MLIRExecutionEngine	2020-10-22 15:23:11 -04:00
Mehdi Amini	4986d5eaff	Revert "[mlir] Convert from Async dialect to LLVM coroutines" This reverts commit `a8b0ae3bdd` and commit `f8fcff5a9d`. The build with SHARED_LIBRARY=ON is broken.	2020-10-22 19:12:19 +00:00
Eugene Zhulenev	f8fcff5a9d	[mlir] Convert from Async dialect to LLVM coroutines Lower from Async dialect to LLVM by converting async regions attached to `async.execute` operations into LLVM coroutines (https://llvm.org/docs/Coroutines.html): 1. Outline all async regions to functions 2. Add LLVM coro intrinsics to mark coroutine begin/end 3. Use MLIR conversion framework to convert all remaining async types and ops to LLVM + Async runtime function calls All `async.await` operations inside async regions converted to coroutine suspension points. Await operation outside of a coroutine converted to the blocking wait operations. Implement simple runtime to support concurrent execution of coroutines. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D89292	2020-10-22 06:30:46 -07:00

31 Commits