llvm-project

Commit Graph

Author	SHA1	Message	Date
Amy Zhuang	5ce368cfe2	[mlir] Vectorize induction variables 1. Add support to vectorize induction variables of loops that are not mapped to any vector dimension in SuperVectorize pass. 2. Fix a bug in getForInductionVarOwner. Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D111370	2021-10-09 12:40:24 -07:00
Reid Kleckner	89b57061f7	Move TargetRegistry.(h|cpp) from Support to MC This moves the registry higher in the LLVM library dependency stack. Every client of the target registry needs to link against MC anyway to actually use the target, so we might as well move this out of Support. This allows us to ensure that Support doesn't have includes from MC/*. Differential Revision: https://reviews.llvm.org/D111454	2021-10-08 14:51:48 -07:00
Matthias Springer	f8453ea75f	[mlir][linalg][bufferize] Rewrite "write into non-writable memory" detection The purpose of this revision is to make "write into non-writable memory" conflict detection easier to understand. The main idea is that there is a conflict in the case of inplace bufferization if: 1. Someone writes to (an alias of) opOperand, opResult or the to-be-bufferized op writes itself. 2. And, opOperand or opResult aliases a non-writable buffer. Differential Revision: https://reviews.llvm.org/D111379	2021-10-08 21:27:49 +09:00
Lei Zhang	4cd7ff6728	[mlir][linalg] Constant fold linalg.generic that are transposes This commit adds a pattern to perform constant folding on linalg generic ops which are essentially transposes. We see real cases where model importers may generate such patterns. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D110597	2021-10-08 08:09:13 -04:00
Eugene Zhulenev	e2a37bb540	[mlir] Add alignment option to constant tensor bufferization pass Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D111364	2021-10-08 03:17:20 -07:00
Tobias Gysi	8ed2e8e04f	[mlir][linalg] Retire Linalg ConvOp. The convolution op is one of the remaining hard coded Linalg operations that have no region attached. It became obsolete due to the OpDSL convolution operations. Removing it allows us to delete specialized code and tests that are not needed for the OpDSL counterparts that rely on the standard code paths. Tests needed only because of the specialized implementations are removed. Tiling and fusion tests are replaced by variants using linalg.conv_2d. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D111233	2021-10-08 06:56:37 +00:00
Tobias Gysi	23800b05be	[mlir][linalg] Add loop interchange to CodegenStrategy. Add a loop interchange pass and integrate it with CodegenStrategy. This patch depends on https://reviews.llvm.org/D110728 and https://reviews.llvm.org/D110746. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110748	2021-10-08 06:39:22 +00:00
Tobias Gysi	1ebd197bc5	[mlir][linalg] Add generalization to CodegenStrategy. Add a generalization pass and integrate it with CodegenStrategy. This patch depends on https://reviews.llvm.org/D110728. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110746	2021-10-08 06:31:19 +00:00
Matthias Springer	7dfd3bb034	[mlir][linalg][bufferize][NFC] API change of aliasesNonWritableBuffer The function now takes a Value instead of an OpOperand. Differential Revision: https://reviews.llvm.org/D111378	2021-10-08 14:47:29 +09:00
Matthias Springer	89b2f29d62	[mlir][linalg][bufferize] Fix/add missing case to getAliasingOpOperand Differential Revision: https://reviews.llvm.org/D111377	2021-10-08 14:38:47 +09:00
Matthias Springer	a046154057	[mlir][linalg][bufferize] Add bufferRelation to op interface Currently supported are: BufferRelation::None, BufferRelation::Equivalent. Differential Revision: https://reviews.llvm.org/D111376	2021-10-08 14:28:24 +09:00
MaheshRavishankar	4281946390	[mlir][Tensor] Add ReifyRankedShapedTypeOpInterface to tensor.extract_slice. Differential Revision: https://reviews.llvm.org/D111263	2021-10-07 17:10:35 -07:00
Amy Zhuang	5d001f58f2	[mlir] Fix a bug in Affine LICM. Currently Affine LICM checks iterOperands and does not hoist out any instruction containing iterOperands. We should check iterArgs instead. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D111090	2021-10-07 15:46:43 -07:00
Kiran Chandramohan	3b01cf9286	[mlir][openmp] Add an interface for Outlineable OpenMP ops Add an interface for outlineable OpenMP operations. This patch was initially done in fir-dev and is now needed for the upstreaming. Reviewed By: schweitz Differential Revision: https://reviews.llvm.org/D111310	2021-10-07 20:53:48 +02:00
Matthias Springer	56bf688a09	[mlir][linalg][bufferize][NFC] Simplify getAliasingOpResult() The signature of this function was confusing. Check for hasKnownBufferizationAliasingBehavior separately when needed. Differential Revision: https://reviews.llvm.org/D110916	2021-10-07 22:41:21 +09:00
Lei Zhang	3964c1db91	[mlir][vector] Split populateVectorContractLoweringPatterns It was bundling quite a lot of patterns that convert high-D vector ops into low-D elementary ops. It might not be good for all of the patterns to happen for a particular downstream user. For example, `ShapeCastOpRewritePattern` rewrites `vector.shape_cast` into data movement extract/insert ops. Instead, split the entry point into multiple ones so users can pull in patterns on demand. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D111225	2021-10-07 09:39:26 -04:00
Matthias Springer	1d24b8c603	[mlir][linalg][bufferize][NFC] Change bufferizableInPlaceAnalysis signature Move getInplaceableOpResult() call into bufferizableInPlaceAnalysis. Note: The only goal of this change is to make the signature of bufferizableInPlaceAnalysis smaller. (Fewer arguments.) Differential Revision: https://reviews.llvm.org/D110915	2021-10-07 22:35:40 +09:00
Matthias Springer	6b1f653c94	[mlir][linalg][bufferize] tensor.cast may require a copy Differential Revision: https://reviews.llvm.org/D110806	2021-10-07 22:24:05 +09:00
Eugene Zhulenev	8276ac13e9	[mlir] Add alignment attribute to memref.global Revived https://reviews.llvm.org/D102435 Add alignment attribute to `memref.global` and propagate it to llvm global in memref->llvm lowering Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D111309	2021-10-07 06:21:57 -07:00
Adrian Kuegel	2bb208ddfd	[mlir] Don't allow dynamic extent tensor types for ConstShapeOp. ConstShapeOp has a constant shape, so its type can always be static. We still allow it to have ShapeType though. Differential Revision: https://reviews.llvm.org/D111139	2021-10-07 10:56:16 +02:00
Tobias Gysi	3fe7fe4424	[mlir][linalg] Add unsigned min/max/cast function to OpDSL. Update OpDSL to support unsigned integers by adding unsigned min/max/cast signatures. Add tests in OpDSL and on the C++ side to verify the proper signed and unsigned operations are emitted. The patch addresses an issue brought up in https://reviews.llvm.org/D111170. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D111230	2021-10-07 06:27:20 +00:00
Uday Bondhugula	1e39d32c5a	[MLIR] Add OrOp folding rule for constant one operand Add folding rule for std.or op when an operand has all bits set. or(x, <all bits set>) -> <all bits set> Differential Revision: https://reviews.llvm.org/D111206	2021-10-07 08:05:39 +05:30
Mogball	8f0c673d20	[MLIR] fix arith dialect build failure Missing function defs cause errors on some build configs.	2021-10-06 23:39:10 +00:00
Mogball	8c08f21b60	[MLIR] Split arith dialect from the std dialect Create the Arithmetic dialect that contains basic integer and floating point arithmetic operations. Ops that did not meet this criterion were moved to the Math dialect. First of two atomic patches to remove integer and floating point operations from the standard dialect. Ops will be removed from the standard dialect in a subsequent patch. Reviewed By: ftynse, silvas Differential Revision: https://reviews.llvm.org/D110200	2021-10-06 19:25:51 +00:00
Alexandre Rames	fd9613324d	[MLIR] Rename Shape dialect's `join` to `meet`. For the type lattice, we (now) use the "less specialized or equal" partial order, leading to the bottom representing the empty set, and the top representing any type. This naming is more in line with the generally used conventions, where the top of the lattice is the full set, and the bottom of the lattice is the empty set. A typical example is the powerset of a finite set: generally, meet would be the intersection, and join would be the union. ``` top: {a,b,c} / | \ {a,b} {a,c} {b,c} | X X | {a} {b} {c} \ | / bottom: { } ``` This is in line with the examined lattice representations in LLVM: * lattice for `BitTracker::BitValue` in `Hexagon/BitTracker.h` * lattice for constant propagation in `HexagonConstPropagation.cpp` * lattice in `VarLocBasedImpl.cpp` * lattice for address space inference code in `InferAddressSpaces.cpp` Reviewed By: silvas, jpienaar Differential Revision: https://reviews.llvm.org/D110766	2021-10-06 09:41:33 -07:00
Nicolas Vasilache	26b3e92981	[mlir][Linalg] Don't return early from inPlaceAnalysis Instead just emit a warning that analysis failed and the result will be treated conservatively. Differential Revision: https://reviews.llvm.org/D111217	2021-10-06 10:03:25 +00:00
Tobias Gysi	a744c7e962	[mlir][linalg] Update OpDSL to use the newly introduced min and max ops. Implement min and max using the newly introduced std operations instead of relying on compare and select. Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D111170	2021-10-06 06:45:53 +00:00
Diego Caballero	eaf2588a51	[mlir][Linalg] Add support for min/max reduction vectorization in linalg.generic This patch extends Linalg core vectorization with support for min/max reductions in linalg.generic ops. It enables the reduction detection for min/max combiner ops. It also renames MIN/MAX combining kinds to MINS/MAXS to make the sign explicit for floating point and signed integer types. MINU/MAXU should be introduced in the future for unsigned integer types. Reviewed By: pifon2a, ThomasRaoux Differential Revision: https://reviews.llvm.org/D110854	2021-10-05 22:47:20 +00:00
Geoffrey Martin-Noble	b983783d2e	[MLIR][linalg] Preserve location during elementwise fusion This otherwise loses a lot of debugging info and results in a painful debugging experience. Reviewed By: mravishankar, stellaraccident Differential Revision: https://reviews.llvm.org/D111107	2021-10-05 09:43:53 -07:00
Aart Bik	16b8f4ddae	[mlir][sparse] add a "release" operation to sparse tensor dialect We have several ways to materialize sparse tensors (new and convert) but no explicit operation to release the underlying sparse storage scheme at runtime (other than making an explicit delSparseTensor() library call). To simplify memory management, a sparse_tensor.release operation has been introduced that lowers to the runtime library call while keeping tensors, opaque pointers, and memrefs transparent in the initial IR. Note: there is obviously some tension between the concept of immutable tensors and memory management methods. This tension is addressed by simply stating that after the "release" call, no further memref related operations are allowed on the tensor value. We expect the design to evolve over time, however, and arrive at a more satisfactory view of tensors and buffers eventually. Bug: http://llvm.org/pr52046 Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D111099	2021-10-05 09:35:59 -07:00
Lei Zhang	83e074a0c6	[mlir] Add a 'cppNamespace' field to availability This allows us to generate interfaces in a namespace, following other TableGen'erated code. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D108311	2021-10-05 09:38:09 -04:00
Tobias Gysi	e826db6240	[mlir][linalg] Move generalization pattern to Transforms (NFC). Move the generalization pattern to the other Linalg transforms to make it available to the codegen strategy. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110728	2021-10-05 12:49:42 +00:00
Nicolas Vasilache	af9dce18bf	[mlir][Linalg] Allow operand-less scf::ExecuteRegionOp to encapsulate scf::YieldOp These are considered noops. Bufferization will still fail on scf.execute_region ops that yield values. This is used to make comprehensive bufferization interoperate better with external clients. Differential Revision: https://reviews.llvm.org/D111130	2021-10-05 11:34:53 +00:00
Nicolas Vasilache	8096759519	[mlir][Linalg] NFC - Add support to specify that a tensor value is known to bufferize to writeable memory This change allows better interop with external clients of comprehensive bufferization functions but is otherwise NFC for the MLIR pass itself. Differential Revision: https://reviews.llvm.org/D111121	2021-10-05 08:37:34 +00:00
Alex Zinenko	01d696e563	[mlir] rename the "packing" flag of linalg.pad_tensor to "nofold" The discussion in https://reviews.llvm.org/D110425 demonstrated that "packing" may be a confusing term to define the behavior of this op in presence of the attribute. Instead, indicate the intended effect of preventing the folder from being applied. Reviewed By: nicolasvasilache, silvas Differential Revision: https://reviews.llvm.org/D111046	2021-10-04 21:28:11 +02:00
Weiwei Li	1e4cfe5e4f	[mlir][SPIRVToLLVM] Propagate location attribute from spv.GlobalVariable to llvm.mlir.global This patch is mainly to propagate location attribute from spv.GlobalVariable to llvm.mlir.global. It also contains three small changes. 1. Remove the restriction on UniformConstant in SPIRVToLLVM.cpp; 2. Remove the errorCheck on relaxedPrecision when deserializing SPIR-V in Deserializer.cpp; 3. In SPIRVOps.cpp, let ConstantOp take signedInteger too. Co-authored: Alan Liu <alanliu.yf@gmail.com> and Xinyi Liu <xyliuhelen@gmail.com> Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D110207	2021-10-05 00:09:09 +08:00
Mehdi Amini	2da3facd86	Fix memory leak in MLIR SPIRV ModuleCombiner	2021-10-02 23:55:25 +00:00
wren romano	af7ac1d95b	[mlir][sparse] Sharing calls to adaptor.getOperands()[0] This is preliminary work towards D110790. Depends On D110883. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D110884	2021-10-01 14:20:31 -07:00
wren romano	14fffda979	[mlir][sparse] Factoring out allocaIndices() This is preliminary work towards D110790. Depends On D110882. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D110883	2021-10-01 14:18:56 -07:00
wren romano	ca01034714	[mlir][sparse] Factoring out getZero() and avoiding unnecessary Type params This is preliminary work towards D110790 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D110882	2021-10-01 14:17:53 -07:00
Lei Zhang	a3f425946d	[mlir][linalg] Include InitTensorOp in tiling canonicalization Tiling can create dim ops and those dim ops can take `InitTensorOp` as input. Including it in the tiling canonicalization patterns allows us to fold those dim ops away. Also sorted the existing ops along the way. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D110876	2021-10-01 14:13:19 -04:00
Tobias Gysi	bf28849745	[mlir][linalg] Retire PoolingMaxOp/PoolingMinOp/PoolingSumOp. The pooling ops are among the last remaining hard coded Linalg operations that have no region attached. They got obsolete due to the OpDSL pooling operations. Removing them allows us to delete specialized code and tests that are not needed for the OpDSL counterparts that rely on the standard code paths. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110909	2021-10-01 13:51:56 +00:00
Uday Bondhugula	08b63db8bb	[MLIR][GPU] Add GPU launch op support for dynamic shared memory Add support for dynamic shared memory for GPU launch ops: add an optional operand to gpu.launch and gpu.launch_func ops to specify the amount of "dynamic" shared memory to use. Update lowerings to connect this operand to the GPU runtime. Differential Revision: https://reviews.llvm.org/D110800	2021-10-01 16:46:07 +05:30
Alexander Belyaev	693c61b2e0	[mlir] Enable loop peeling for "reduction" dimensions of tiled_loop. Differential Revision: https://reviews.llvm.org/D110919	2021-10-01 13:07:57 +02:00
Nicolas Vasilache	b016bd1230	[mlir][Linalg] Refactor comprehensive bufferize for external uses - NFC This revision exposes some minimal functionality to allow comprehensive bufferization to interop with external projects. Differential Revision: https://reviews.llvm.org/D110875	2021-09-30 20:21:08 +00:00
wren romano	218954865e	[mlir][sparse] Correcting a few typos Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D110773	2021-09-30 11:42:46 -07:00
Lei Zhang	cb2e651800	[mlir][linalg] Fix incorrect bound calculation for tiling conv For convolution, the input window dimension's access affine map is of the form `(d0 * s0 + d1)`, where `d0`/`d1` is the output/filter window dimension, and `s0` is the stride. When tiling, https://reviews.llvm.org/D109267 changed the way dimensions are acquired. Instead of directly querying using `.dim` ops on the original convolution op, we now get it by applying the access affine map to the loop upper bounds. This is fine for dimensions having single-dimension affine maps, like matmul, but not for convolution input. It will cause incorrect computation and out-of-bounds accesses. A concrete example: say we have 1x225x225x3 (NHWC) input, 3x3x3x32 (HWCF) filter, and 1x112x112x32 (NHWC) output with stride 2; (112 * 2 + 3) would be 227, which is different from the correct input window dimension size 225. Instead, we should first calculate the max indices for each loop, apply the affine map to them, and then add one to get the dimension size. Note this makes no difference for matmul-like ops given they will have `d0 - 1 + 1` effectively. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110849	2021-09-30 13:50:57 -04:00
Stella Laurenzo	267bb194f3	[mlir] Remove old "tc" linalg ods generator. * This could have been removed some time ago as it only had one op left in it, which is redundant with the new approach. * `matmul_i8_i8_i32` (the remaining op) can be trivially replaced by `matmul`, which natively supports mixed precision. Differential Revision: https://reviews.llvm.org/D110792	2021-09-30 16:30:06 +00:00
Chris Lattner	fb093c8314	[ODS/AsmParser] Don't pass MLIRContext with DialectAsmParser. The former is redundant because the latter carries it as part of its builder. Add a getContext() helper method to DialectAsmParser to make this more convenient, and stop passing the context around explicitly. This simplifies ODS generated parser hooks for attrs and types. This resolves PR51985. Recommit `4b32f8bac4` after fixing a dependency. Differential Revision: https://reviews.llvm.org/D110796	2021-09-30 05:10:28 +00:00
Mehdi Amini	3310e0020c	Revert "[ODS/AsmParser] Don't pass MLIRContext with DialectAsmParser." This reverts commit `4b32f8bac4`. Seems like the build is broken with -DBUILD_SHARED_LIBS=ON	2021-09-30 05:01:17 +00:00
Chris Lattner	4b32f8bac4	[ODS/AsmParser] Don't pass MLIRContext with DialectAsmParser. The former is redundant because the latter carries it as part of its builder. Add a getContext() helper method to DialectAsmParser to make this more convenient, and stop passing the context around explicitly. This simplifies ODS generated parser hooks for attrs and types. This resolves PR51985. Differential Revision: https://reviews.llvm.org/D110796	2021-09-29 21:36:05 -07:00
Matthias Springer	27451a05ed	[mlir][vector] Fold transfer ops and tensor.extract/insert_slice. * Fold vector.transfer_read and tensor.extract_slice. * Fold vector.transfer_write and tensor.insert_slice. Differential Revision: https://reviews.llvm.org/D110627	2021-09-30 09:28:00 +09:00
Rob Suderman	826d3eaae7	[mlir][tosa] Ranked check for transpose was wrong. Should have verified the perm length and input rank were the same before inferring shape. Caused a crash with invalid IR. Differential Revision: https://reviews.llvm.org/D110674	2021-09-29 15:14:42 -07:00
Aart Bik	7f1cb43d60	[mlir][sparse] simplify negi code generation with subi The lack of negi details leaked from merger class into codegen part. Also, special case for vector code was not needed, the type can be used directly! Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D110677	2021-09-29 10:00:06 -07:00
Marcel Koester	09cd4a71ed	Introduced AllocationOpInterface to create deallocation operations on-the-fly that are compatible with the allocation operation implementing this interface. Added interface implementations for AllocOp and CloneOp defined in the MemRef dialect. Adapted the BufferDeallocation pass to be compatible with the interface introduced in this CL. Differential Revision: https://reviews.llvm.org/D109350	2021-09-29 15:54:21 +02:00
Nicolas Vasilache	92ea624a13	[mlir][Linalg] Rewrite CodegenStrategy to populate a pass pipeline. This revision retires a good portion of the complexity of the codegen strategy and puts the logic behind pass logic. Differential revision: https://reviews.llvm.org/D110678	2021-09-29 13:35:45 +00:00
bakhtiyar	bdde959533	Remove unnecessary async group creates and awaits. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D110605	2021-09-28 14:52:08 -07:00
bakhtiyar	55dfab39a2	Rename target block size to min task size for clarity. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D110604	2021-09-28 14:51:55 -07:00
Amy Zhuang	7ab14b8886	[mlir] Unroll-and-jam loops with iter_args. Unroll-and-jam currently doesn't work when the loop being unroll-and-jammed or any of its inner loops has iter_args. This patch modifies the unroll-and-jam utility to support loops with iter_args. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D110085	2021-09-28 14:13:27 -07:00
thomasraoux	b12e4c17e0	[mlir] Fix bug in FoldSubview with rank reducing subview Fix how we calculate the new permutation map of the transfer ops. Differential Revision: https://reviews.llvm.org/D110638	2021-09-28 13:18:29 -07:00
Alexander Belyaev	9fb57c8c1d	[mlir] Add min/max operations to Standard. [RFC: Add min/max ops](https://llvm.discourse.group/t/rfc-add-min-max-operations/4353) I was following the naming style for Arith dialect in https://reviews.llvm.org/D110200, i.e. similar to DivSIOp and DivUIOp I defined MaxSIOp, MaxUIOp. When Arith PR is landed, I will migrate these ops as well. Differential Revision: https://reviews.llvm.org/D110540	2021-09-28 09:40:22 +02:00
Tobias Gysi	d20d0e145d	[mlir][linalg] Finer-grained padding control. Adapt the signature of the PaddingValueComputationFunction callback to either return the padding value or failure to signal padding is not desired. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110572	2021-09-27 19:21:37 +00:00
Aart Bik	ec97a205c3	[mlir][sparse] preserve zero-initialization for materializing buffers This revision makes sure that when the output buffer materializes locally (in contrast with the passing in of output tensors either in-place or not in-place), the zero initialization assumption is preserved. This also adds a bit more documentation on our sparse kernel assumption (viz. TACO assumptions). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D110442	2021-09-27 11:22:05 -07:00
Bixia Zheng	fbd5821c6f	Implement the conversion from sparse constant to sparse tensors. The sparse constant provides a constant tensor in coordinate format. We first split the sparse constant into a constant tensor for indices and a constant tensor for values. We then generate a loop to fill a sparse tensor in coordinate format using the tensors for the indices and the values. Finally, we convert the sparse tensor in coordinate format to the destination sparse tensor format. Add tests. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D110373	2021-09-27 09:47:29 -07:00
Eugene Zhulenev	92db09cde0	[mlir] AsyncRuntime: use int64_t for ref counting operations Workaround for SystemZ ABI problem: https://bugs.llvm.org/show_bug.cgi?id=51898 Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D110550	2021-09-27 07:55:01 -07:00
Tobias Gysi	e158b5634a	[mlir][linalg] Make fusion on tensor rewriter friendly (NFC). Let the calling pass or pattern replace the uses of the original root operation. Internally, the tileAndFuse still replaces uses and updates operands but only of newly created operations. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110169	2021-09-27 11:28:25 +00:00
Nicolas Vasilache	1b49a72de9	[mlir] Factor out constraint set creation from hoist padding. This revision adds a `FlatAffineValueConstraints(ValueRange ivs, ValueRange lbs, ValueRange ubs)` method and uses it in hoist padding. Differential Revision: https://reviews.llvm.org/D110427	2021-09-27 10:11:35 +00:00
Nicolas Vasilache	b74493ecea	[mlir][Linalg] Refactor padding hoisting - NFC This revision extracts padding hoisting into a new file and cleans it up in anticipation of future improvements and extensions. Differential Revision: https://reviews.llvm.org/D110414	2021-09-27 09:50:31 +00:00
Matthias Springer	ffdf0a370d	[mlir][vector] Fix bug in vector-transfer-full-partial-split When splitting with linalg.copy, cannot write into the destination alloc directly. Instead, write into a subview of the alloc. Differential Revision: https://reviews.llvm.org/D110512	2021-09-27 18:12:17 +09:00
Lei Zhang	b45476c94c	[mlir][tosa] Do not fold transpose with quantized types For such cases, the type of the constant DenseElementsAttr is different from the transpose op return type. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D110446	2021-09-24 16:57:55 -04:00
Diego Caballero	2a876a711d	[mlir] Create a generic reduction detection utility This patch introduces a generic reduction detection utility that works across different dialects. It is mostly a generalization of the reduction detection algorithm in Affine. The reduction detection logic in Affine, Linalg and SCFToOpenMP has been replaced with this new generic utility. The utility takes some basic components of the potential reduction and returns: 1) the reduced value, and 2) a list with the combiner operations. The logic to match reductions involving multiple combiner operations is disabled until we can properly test it. Reviewed By: ftynse, bondhugula, nicolasvasilache, pifon2a Differential Revision: https://reviews.llvm.org/D110303	2021-09-24 20:45:59 +00:00
River Riddle	aca9bea199	[mlir:MemRef] Move DmaStartOp/DmaWaitOp to ODS These are among the last operations still defined explicitly in C++. I've tried to keep this commit as NFC as possible, but these ops definitely need a non-NFC cleanup at some point. Differential Revision: https://reviews.llvm.org/D110440	2021-09-24 19:35:28 +00:00
Lei Zhang	e325ebb9c7	[mlir][tosa] Add some transpose folders * If the input is a constant splat value, we just need to reshape it. * If the input is a general constant with one user, we can also constant fold it, without bloating the IR. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D110439	2021-09-24 15:25:14 -04:00
River Riddle	ef976337f5	[mlir:OpConversion] Remove the remaining usages of the deprecated matchAndRewrite methods This commit updates the remaining usages of the ArrayRef<Value> based matchAndRewrite/rewrite methods in favor of the new OpAdaptor overload. Differential Revision: https://reviews.llvm.org/D110360	2021-09-24 17:51:41 +00:00
River Riddle	b54c724be0	[mlir:OpConversionPattern] Add overloads for taking an Adaptor instead of ArrayRef This has been a TODO for a long time, and it brings about many advantages (namely nice accessors, and less fragile code). The existing overloads that accept ArrayRef are now treated as deprecated and will be removed in a followup (after a small grace period). Most of the upstream MLIR usages have been fixed by this commit, the rest will be handled in a followup. Differential Revision: https://reviews.llvm.org/D110293	2021-09-24 17:51:41 +00:00
Alex Zinenko	5988a3b7a0	[mlir] Linalg: ensure tile-and-pad always creates padding as requested Initially, the padding transformation and the related operation were only used to guarantee static shapes of subtensors in tiled operations. The transformation would not insert the padding operation if the shapes were already static, and the overall code generation would actively remove such "noop" pads. However, this transformation can be also used to pack data into smaller tensors and marshall them into faster memory, regardless of the size mismatches. In context of expert-driven transformation, we should assume that, if padding is requested, a potentially padded tensor must be always created. Update the transformation accordingly. To do this, introduce an optional `packing` attribute to the `pad_tensor` op that serves as an indication that the padding is an intentional choice (as opposed to side effect of type normalization) and should be left alone by cleanups. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110425	2021-09-24 18:40:13 +02:00
Alex Zinenko	3f89e339bb	[mlir] add pad_tensor(tensor.cast) -> pad_tensor canonicalizer This canonicalization pattern complements the tensor.cast(pad_tensor) one in propagating constant type information when possible. It contributes to the feasibility of pad hoisting. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110343	2021-09-24 12:03:47 +02:00
Matthias Springer	f3f25ffc04	[mlir][linalg] Fix result type in FoldSourceTensorCast * Do not discard static result type information that cannot be inferred from lower/upper padding. * Add optional argument to `PadTensorOp::inferResultType` for specifying known result dimensions. Differential Revision: https://reviews.llvm.org/D110380	2021-09-24 16:47:18 +09:00
Matthias Springer	2190f8a8b1	[mlir][linalg] Support tile+peel with TiledLoopOp Only scf.for was supported until now. Differential Revision: https://reviews.llvm.org/D110220	2021-09-24 10:23:31 +09:00
Matthias Springer	8dc16ba8d2	[mlir][linalg] Merge all tiling passes into a single one. Passes such as `linalg-tile-to-tiled-loop` are merged into `linalg-tile`. Differential Revision: https://reviews.llvm.org/D110214	2021-09-24 10:16:46 +09:00
wren romano	221856f5cd	[mlir][sparse] Moved a conditional from the RT library to the generated MLIR. When generating code to add an element to SparseTensorCOO (e.g., when doing dense=>sparse conversion), we used to check for nonzero values on the runtime side, whereas now we generate MLIR code to do that check. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D110121	2021-09-23 12:44:17 -07:00
Aart Bik	a924fcc7c3	[mlir][sparse] add sparse kernels test to sparse compiler test suite This test makes sure kernels map to efficient sparse code, i.e. all compressed for-loops, no co-iterating while loops. In addition, this revision removes the special constant folding inside the sparse compiler in favor of Mahesh' new generic linalg folding. Thanks! NOTE: relies on Mahesh fix, which needs to be rebased first Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D110001	2021-09-22 14:56:39 -07:00
MaheshRavishankar	a40a08ed98	[mlir][Linalg] Teach constant -> generic op fusion to handle scalar constants. The current folder of constant -> generic op only handles splat constants. The same logic holds for scalar constants. Teach the pattern to handle such cases. Differential Revision: https://reviews.llvm.org/D109982	2021-09-22 13:41:47 -07:00
Aart Bik	5da21338bc	[mlir][sparse] generalize reduction support in sparse compiler Now not just SUM, but also PRODUCT, AND, OR, XOR. The reductions MIN and MAX are still to be done (also depends on recognizing these operations in cmp-select constructs). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D110203	2021-09-22 12:36:46 -07:00
Tobias Gysi	e828655313	[mlir][linalg] Fix interchange initialization in fusion on tensors. If no interchange vector is given initialize it with the identity permutation from 0 to number of loops. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D110249	2021-09-22 17:45:54 +00:00
Aart Bik	128a9e1cb4	[mlir][sparse] cleanup ABI issues in C interface with memrefs This change adds automatic wrapper functions with emit_c_interface to all methods in the sparse support library that deal with MEMREFs. The wrappers will take care of passing MEMREFs by value internally and by pointer externally, thereby avoiding ABI issues across platforms. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D110219	2021-09-21 21:58:12 -07:00
Tobias Gysi	8b5236def5	[mlir][linalg] Simplify slice dim computation for fusion on tensors (NFC). Compute the tiled producer slice dimensions directly starting from the consumer not using the producer at all. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110147	2021-09-21 15:09:46 +00:00
Tobias Gysi	9072f1b5f8	[mlir][linalg] Add isPermutation helper (NFC). Add a helper method to check if an index vector contains a permutation of its indices. Additionally, refactor applyPermutationToVector to take int64_t. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110135	2021-09-21 15:07:39 +00:00
Nicolas Vasilache	101d017a64	[mlir][Linalg] Revisit heuristic ordering of tensor.insert_slice in comprehensive bufferize. It was previously assumed that tensor.insert_slice should be bufferized first in a greedy fashion to avoid out-of-place bufferization of the large tensor. This heuristic does not hold upon further inspection. This CL removes the special handling of such ops and adds a test that exhibits better behavior and appears in real use cases. The only test adversely affected is an artificial test which results in a returned memref: this pattern is not allowed by comprehensive bufferization in real scenarios anyway and the offending test is deleted. Differential Revision: https://reviews.llvm.org/D110072	2021-09-21 14:22:45 +00:00
Nicolas Vasilache	0d2c54e851	[mlir][Linalg] Revisit RAW dependence interference in comprehensive bufferize. Previously, comprehensive bufferize would consider all aliasing reads and writes to the result buffer and matching operand. This resulted in spurious dependences being considered and resulted in too many unnecessary copies. Instead, this revision revisits the gathering of read and write alias sets. This results in fewer alloc and copies. An exhaustive test case is added that considers all possible permutations of `matmul(extract_slice(fill), extract_slice(fill), ...)`.	2021-09-21 14:22:22 +00:00
Tobias Gysi	c8eed8f9a7	[mlir][linalg] Assert tile loop nest invariants in fusion. Assert the tile loop nest invariants are satisfied instead of failing silently. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110137	2021-09-21 14:20:57 +00:00
Uday Bondhugula	5c77ed0330	[MLIR] NFC. gpu.launch op argument const folder cleanup NFC updates to gpu.launch op argument const folder. Differential Revision: https://reviews.llvm.org/D110136	2021-09-21 14:30:03 +05:30
Morten Borup Petersen	032cb1650f	[MLIR][SCF] Add for-to-while loop transformation pass This pass transforms SCF.ForOp operations to SCF.WhileOp. The For loop condition is placed in the 'before' region of the while operation, and induction variable incrementation + the loop body in the 'after' region. The loop carried values of the while op are the induction variable (IV) of the for-loop + any iter_args specified for the for-loop. Any 'yield' ops in the for-loop are rewritten to additionally yield the (incremented) induction variable. This transformation is useful for passes where we want to consider structured control flow solely on the basis of a loop body and the computation of a loop condition. As an example, when doing high-level synthesis in CIRCT, the incrementation of an IV in a for-loop is "just another part" of a circuit datapath, and what we really care about is the distinction between our datapath and our control logic (the condition variable). Differential Revision: https://reviews.llvm.org/D108454	2021-09-21 09:09:54 +01:00
Chris Lattner	58abc8c34b	[OpAsmParser] Add a parseCommaSeparatedList helper and beef up Delimiter. Lots of custom ops have hand-rolled comma-delimited parsing loops, as does the MLIR parser itself. Provides a standard interface for doing this that is less error prone and less boilerplate. While here, extend Delimiter to support <> and {} delimited sequences as well (I have a use for <> in CIRCT specifically). Differential Revision: https://reviews.llvm.org/D110122	2021-09-20 20:59:11 -07:00
River Riddle	0cb5d7fc7f	[mlir] Add value_begin/value_end methods to DenseElementsAttr Currently DenseElementsAttr only exposes the ability to get the full range of values for a given type T, but there are many situations where we just want the beginning/end iterator. This revision adds proper value_begin/value_end methods for all of the supported T types, and also cleans up a bit of the interface. Differential Revision: https://reviews.llvm.org/D104173	2021-09-21 01:57:43 +00:00
natashaknk	38ff7e11c0	[mlir][tosa] Add several binary elementwise to the list of broadcastable ops. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D110096	2021-09-20 16:07:35 -07:00
MaheshRavishankar	4cf9bf6c9f	[mlir][MemRef] Compute unused dimensions of a rank-reducing subviews using strides as well. For `memref.subview` operations, when there are more than one unit-dimensions, the strides need to be used to figure out which of the unit-dims are actually dropped. Differential Revision: https://reviews.llvm.org/D109418	2021-09-20 11:05:30 -07:00
MaheshRavishankar	0b33890f45	[mlir][Linalg] Add ConvolutionOpInterface. Add an interface that allows grouping together all convolution and pooling ops within Linalg named ops. The interface currently checks that - the indexing map used for input/image access is valid - the filter and output are accessed using projected permutations - all loops are characterizable as one iterating over - batch dimension, - output image dimensions, - filter convolved dimensions, - output channel dimensions, - input channel dimensions, - depth multiplier (for depthwise convolutions) Differential Revision: https://reviews.llvm.org/D109793	2021-09-20 10:41:10 -07:00
Mehdi Amini	5edd79fc97	Revert "[MLIR][SCF] Add for-to-while loop transformation pass" This reverts commit `644b55d57e`. The added test is failing the bots.	2021-09-20 17:21:59 +00:00
Tobias Gysi	7be28d82b4	[mlir][linalg] Add IndexOp support to fusion on tensors. This revision depends on https://reviews.llvm.org/D109761 and https://reviews.llvm.org/D109766. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109774	2021-09-20 15:59:35 +00:00
Morten Borup Petersen	644b55d57e	[MLIR][SCF] Add for-to-while loop transformation pass This pass transforms SCF.ForOp operations to SCF.WhileOp. The For loop condition is placed in the 'before' region of the while operation, and induction variable incrementation + the loop body in the 'after' region. The loop carried values of the while op are the induction variable (IV) of the for-loop + any iter_args specified for the for-loop. Any 'yield' ops in the for-loop are rewritten to additionally yield the (incremented) induction variable. This transformation is useful for passes where we want to consider structured control flow solely on the basis of a loop body and the computation of a loop condition. As an example, when doing high-level synthesis in CIRCT, the incrementation of an IV in a for-loop is "just another part" of a circuit datapath, and what we really care about is the distinction between our datapath and our control logic (the condition variable). Differential Revision: https://reviews.llvm.org/D108454	2021-09-20 16:57:50 +01:00
Tobias Gysi	09100c75b5	[mlir][linalg] Fix typo (NFC).	2021-09-20 15:46:16 +00:00
Tobias Gysi	6db928b8f3	[mlir][linalg] Fusion on tensors. Add a new version of fusion on tensors that supports the following scenarios: - support input and output operand fusion - fuse a producer result passed in via tile loop iteration arguments (update the tile loop iteration arguments) - supports only linalg operations on tensors - supports only scf::for - cannot add an output to the tile loop nest The LinalgTileAndFuseOnTensors pass tiles the root operation and fuses its producers. Reviewed By: nicolasvasilache, mravishankar Differential Revision: https://reviews.llvm.org/D109766	2021-09-20 14:45:34 +00:00
Vladislav Vinogradov	798e4bfbed	[mlir] Fix integration tests failures introduced in D108505	2021-09-20 11:48:24 +03:00
KareemErgawy-TomTom	bdcf4b9b96	[MLIR][Linalg] Make detensoring cost-model more flexible. So far, the CF cost-model for detensoring was limited to discovering pure CF structures. This means, if while discovering the CF component, the cost-model found any op that is not detensorable, it gives up on detensoring altogether. This patch makes it a bit more flexible by cleaning-up the detensorable component from non-detensorable ops without giving up entirely. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D109965	2021-09-20 10:21:31 +02:00
Vladislav Vinogradov	ec03bbe8a7	[mlir] Fix bug in partial dialect conversion The discussion on forum: https://llvm.discourse.group/t/bug-in-partial-dialect-conversion/4115 The `applyPartialConversion` didn't handle the operations that were marked as illegal inside dynamic legality callback. Instead of reporting error, if such operation was not converted to legal set, the method just added it to `unconvertedSet` in the same way as unknown operations. This patch fixes that and handles dynamically illegal operations as well. The patch includes 2 fixes for existing passes: * `tensor-bufferize` - explicitly mark `std.return` as legal. * `convert-parallel-loops-to-gpu` - ugly fix with marking visited operations to avoid recursive legality checks. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D108505	2021-09-20 10:39:10 +03:00
Uday Bondhugula	57eda9becc	[MLIR][GPU] Add constant propagator for gpu.launch op Add a constant propagator for gpu.launch op in cases where the grid/thread IDs can be trivially determined to take a single constant value of zero. Differential Revision: https://reviews.llvm.org/D109994	2021-09-18 12:02:46 +05:30
Aart Bik	46e77b5d10	[mlir][sparse] add a sparse quantized_matmul example to integration test Note that this revision adds a very tiny bit of constant folding in the sparse compiler lattice construction. Although I am generally trying to avoid such canonicalizations (and rely on other passes to fix this instead), the benefits of avoiding a very expensive disjunction lattice construction justify having this special code (at least for now). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109939	2021-09-17 13:04:44 -07:00
thomasraoux	08f0cb7719	[mlir] Prevent crash in DropUnitDim pattern due to tensor with encoding Differential Revision: https://reviews.llvm.org/D109984	2021-09-17 12:03:16 -07:00
thomasraoux	36aac53b36	[mlir][linalg] Extend drop unit dim pattern to all cases of reduction Even with all parallel loops reading the output value is still allowed so we don't have to handle reduction loops differently. Differential Revision: https://reviews.llvm.org/D109851	2021-09-17 10:09:57 -07:00
thomasraoux	416679615d	[mlir] Linalg hoisting should ignore uses outside the loop Differential Revision: https://reviews.llvm.org/D109859	2021-09-17 10:06:57 -07:00
thomasraoux	a123e3c48b	[mlir] Fix potential crash in hoistRedundantVectorTransfers Differential Revision: https://reviews.llvm.org/D107856	2021-09-17 10:05:20 -07:00
Tobias Gysi	90b7817e03	[mlir][linalg] Add helper to update IndexOps after tiling (NFC). Add the addTileLoopIvsToIndexOpResults method to shift the IndexOp results after tiling. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109761	2021-09-17 15:17:33 +00:00
MaheshRavishankar	04a66f8d2b	Fixing vector add pattern that incorrectly returns success. The pattern is returning success even if it does no work leading to pattern application running up to the max iteration count and failing. Reviewed By: nicolasvasilache, mravishankar Differential Revision: https://reviews.llvm.org/D109791	2021-09-16 14:48:09 -07:00
Rob Suderman	8662a2f208	[mlir][tosa] Relax ranked constraint on quantization builder TosaOp definition had an artificial constraint that the input/output types needed to be ranked to invoke the quantization builder. This is incorrect as an unranked tensor could still be quantized. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D109863	2021-09-16 11:43:47 -07:00
Nicolas Vasilache	ee2e414dde	[mlir][Linalg] Cleanup doc and improve logging and readability in ComprehensiveBufferize.cpp - NFC	2021-09-16 16:41:47 +00:00
Aart Bik	b1d44e5902	[mlir][sparse] add affine subscripts to sparse compilation pass This enables the sparsification of more kernels, such as convolutions where there is a x(i+j) subscript. It also enables more tensor invariants such as x(1) or other affine subscripts such as x(i+1). Currently, we reject sparsity altogether for such tensors. Despite this restriction, however, we can already handle a lot more kernels with compound subscripts for dense access (viz. convolution with dense input and sparse filter). Some unit tests and an integration test demonstrate new capability. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109783	2021-09-15 20:28:04 -07:00
Rob Suderman	1ac2d195ec	[mlir][linalg] Add canonicalizers for depthwise conv There are two main versions of depthwise conv depending whether the multiplier is 1 or not. In cases where m == 1 we should use the version without the multiplier channel as it can perform greater optimization. Add lowering for the quantized/float versions to have a multiplier of one. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D108959	2021-09-15 14:09:15 -07:00
Simon Camphausen	1b79efdc72	[mlir] Fix printing of EmitC attrs/types with escape characters Attributes and types were not escaped when printing. Reviewed By: jpienaar, marbre Differential Revision: https://reviews.llvm.org/D109143	2021-09-15 18:15:38 +00:00
Nicolas Vasilache	96ec0ff2b7	[mlir][Linalg] Revisit insertion points in comprehensive bufferization. This revision fixes a corner case that could appear due to incorrect insertion point behavior in comprehensive bufferization. Differential Revision: https://reviews.llvm.org/D109830	2021-09-15 18:11:38 +00:00
Uday Bondhugula	f68939d3d9	[MLIR] Tighten type constraint on memref.global op def Tighten the def of memref.global op to use the right kind of TypeAttr (of MemRefType). Differential Revision: https://reviews.llvm.org/D109822	2021-09-15 22:41:03 +05:30
Nicolas Vasilache	6fe77b1051	[mlir][Linalg] Fail comprehensive bufferization if a memref is returned. Differential Revision: https://reviews.llvm.org/D109824	2021-09-15 15:11:17 +00:00
Nicolas Vasilache	e3889b3059	[mlir][Linalg] Replace DenseSet by UnionFind in ComprehensiveBufferize - NFC AliasInfo can now use union-find for a much more efficient implementation. This brings no functional changes but large performance gains on more complex examples. Differential Revision: https://reviews.llvm.org/D109819	2021-09-15 10:35:54 +00:00
Matthias Springer	934e2f695e	[mlir][linalg] ComprehensiveBufferize: Do not copy InitTensorOp results E.g.: ``` %2 = memref.alloc() {alignment = 128 : i64} : memref<256x256xf32> %3 = memref.alloc() {alignment = 128 : i64} : memref<256x256xf32> // ... (%3 is not written to) linalg.copy(%3, %2) : memref<256x256xf32>, memref<256x256xf32> vector.transfer_write %11, %2[%c0, %c0] {in_bounds = [true, true]} : vector<256x256xf32>, memref<256x256xf32> ``` Avoid copies of %3 if %3 came directly from an InitTensorOp. Differential Revision: https://reviews.llvm.org/D109742	2021-09-15 17:28:04 +09:00
Matthias Springer	9adc0114bf	[mlir][linalg] PadTensorOp vectorization: Avoid redundant FillOps Do not generate FillOps when these would be entirely overwritten. Differential Revision: https://reviews.llvm.org/D109741	2021-09-15 09:28:37 +09:00
Tobias Gysi	6091873651	[mlir][linalg] Reuse getValueOrCreateConstantIndexOp method (NFC). Use getValueOrCreateConstantIndexOp introduced by https://reviews.llvm.org/D109601 in multiple places in LinalgOps.cpp. Reviewed By: nicolasvasilache, springerm Differential Revision: https://reviews.llvm.org/D109756	2021-09-14 15:32:29 +00:00
Tobias Gysi	44a889778c	[mlir][linalg] Fold ExtractSliceOps during tiling. Add the makeComposedExtractSliceOp method that creates an ExtractSliceOp and folds chains of ExtractSliceOps by computing the sum of their offsets and by multiplying their strides. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109601	2021-09-14 11:43:52 +00:00
Matthias Springer	62883459cd	[mlir][linalg] makeTiledShape: No affine.min if tile size == 1 This improves codegen (more static type information) with `scalarize-dynamic-dims`. Differential Revision: https://reviews.llvm.org/D109415	2021-09-14 10:48:20 +09:00
Matthias Springer	fb1def9c66	[mlir][linalg] New tiling option: Scalarize dynamic dims This tiling option scalarizes all dynamic dimensions, i.e., it tiles all dynamic dimensions by 1. This option is useful for linalg ops with partly dynamic tensor dimensions. E.g., such ops can appear in the partial iteration after loop peeling. After scalarizing dynamic dims, those ops can be vectorized. Differential Revision: https://reviews.llvm.org/D109268	2021-09-14 10:40:50 +09:00
Matthias Springer	8faf35c0a5	[mlir][linalg] Add scf.for loop peeling to codegen strategy Only scf.for loops are supported at the moment. linalg.tiled_loop support will be added in a subsequent commit. Only static tensor sizes are supported. Loops for dynamic tensor sizes can be peeled, but the generated code is not optimal due to a missing canonicalization pattern. Differential Revision: https://reviews.llvm.org/D109043	2021-09-14 10:35:01 +09:00
Nicolas Vasilache	181d18ef53	[mlir][Linalg] Insert static buffers as high as possible during ComprehensiveBufferization. This revision allows hoisting static alloc/dealloc pairs as high as possible during ComprehensiveBufferization. This also aligns such allocated buffers to 128B by default. This change exhibited some issues wrt insertion points and a missing copy that are also fixed in this revision; tests are updated accordingly. Differential Revision: https://reviews.llvm.org/D109684	2021-09-13 15:59:03 +00:00
Matthias Springer	7c9b6a3355	[mlir][linalg] ComprehensiveBufferize: Do not copy InitTensorOps Do not copy InitTensorOps or casts thereof. Differential Revision: https://reviews.llvm.org/D109656	2021-09-13 22:31:54 +09:00
Nicolas Vasilache	b01d223faf	[mlir][Linalg] Use reify for padded op shape derivation. Previously, we would insert a DimOp and rely on later canonicalizations. Unfortunately, reifyShape kind of rewrites are not canonicalizations anymore. This introduces undesirable pass dependencies. Instead, immediately reify the result shape and avoid the DimOp altogether. This is akin to a local folding, which avoids introducing more reliance on `-resolve-shaped-type-result-dims` (similar to compositions of `affine.apply` by construction to avoid chains of size > 1). It does not completely get rid of the reliance on the pass as the process is merely local: calling the pass may still be necessary for global effects. Indeed, one of the tests still requires the pass. Differential Revision: https://reviews.llvm.org/D109571	2021-09-13 11:54:59 +00:00
Rob Suderman	b0532286fe	[mlir][tosa] Add shape inference for tosa.while Tosa.while shape inference requires repeatedly running shape inference across the body of the loop until the types become static as we do not know the number of iterations required by the loop body. Once the least specific arguments are known they are propagated to both regions. To determine the final end type, the least restrictive types are determined from all yields. Differential Revision: https://reviews.llvm.org/D108801	2021-09-10 13:11:53 -07:00
Stephan Herhut	5e6c170b3f	[mlir][linalg] Fix bufferize pattern to allow unknown operations in body of generic The original version of the bufferization pattern for linalg.generic would manually clone operations within the region to the bufferized clone of the operation. This triggers legality requirements on those operations in the conversion infra. Instead, this now uses the rewriter to inline the region instead, avoiding those legality requirements. Differential Revision: https://reviews.llvm.org/D109581	2021-09-10 13:37:42 +02:00
Matthias Springer	0f3544d185	[mlir][scf] Loop peeling: Use scf.for for partial iteration Generate an scf.for instead of an scf.if for the partial iteration. This is for consistency reasons: The peeling of linalg.tiled_loop also uses another loop for the partial iteration. Note: Canonicalization patterns may rewrite partial iterations to scf.if afterwards. Differential Revision: https://reviews.llvm.org/D109568	2021-09-10 19:07:09 +09:00
Tobias Gysi	16488dc300	[mlir][linalg] Pass all operands to tile to the tile loop region builder (NFC). Extend the signature of the tile loop nest region builder to take all operand values to use and not just the scf::For iterArgs. This change allows us to pass in all block arguments of TiledLoop and use them directly instead of replacing them after the loop generation. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D109569	2021-09-10 08:35:11 +00:00
Nicolas Vasilache	5f1a1af4bf	[mlir][Linalg] Properly order extract_slice traversal in comprehensive bufferization This revision fixes the traversal order of extract_slice during the inplace analysis. It was previously thought that such ops could be analyzed at the very end. This is unfortunately not true as the AliasInfo for dependents of these ops need to be updated. This change allows the aliases introduced by the bufferization of extract_slice to be properly propagated. Differential Revision: https://reviews.llvm.org/D109519	2021-09-10 07:10:06 +00:00
Aart Bik	066d786ce0	[mlir][sparse] add folding to sparse_tensor.convert folds conversion between identical types (with tests) Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D109545	2021-09-09 15:45:19 -07:00
Alexander Slepko	89837a0e1b	Adding min(f/s/u) and max(f/s/u) cases for vector reduction This PR adds missing AtomicRMWKind::min/max cases which we would like to use for min/max reduction loop vectorizations. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D104881	2021-09-09 12:00:43 -07:00
Chris Lattner	735f46715d	[APInt] Normalize naming on keep constructors / predicate methods. This renames the primary methods for creating a zero value to `getZero` instead of `getNullValue` and renames predicates like `isAllOnesValue` to simply `isAllOnes`. This achieves two things: 1) This starts standardizing predicates across the LLVM codebase, following (in this case) ConstantInt. The word "Value" doesn't convey anything of merit, and is missing in some of the other things. 2) Calling an integer "null" doesn't make any sense. The original sin here is mine and I've regretted it for years. This moves us to calling it "zero" instead, which is correct! APInt is widely used and I don't think anyone is keen to take massive source breakage on anything so core, at least not all in one go. As such, this doesn't actually delete any entrypoints, it "soft deprecates" them with a comment. Included in this patch are changes to a bunch of the codebase, but there are more. We should normalize SelectionDAG and other APIs as well, which would make the API change more mechanical. Differential Revision: https://reviews.llvm.org/D109483	2021-09-09 09:50:24 -07:00
Aart Bik	e2d3db42e5	[mlir][sparse] add casts to operations to lattice and exp builders Further enhance the set of operations that can be handled by the sparse compiler Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109413	2021-09-09 08:49:50 -07:00
Uday Bondhugula	524eafa5b2	[MLIR] Avoid double space print on llvm global op Fix extra space print for llvm global op when the 'unnamed_addr' attribute was empty. This led to two spaces being printed in the custom form between non-whitespace chars. A round trip would add an extra space to a typical spaced form. NFC. Differential Revision: https://reviews.llvm.org/D109502	2021-09-09 19:52:38 +05:30
Matthias Springer	c7d569b8f7	[mlir][scf] Fold dim(scf.for) to dim(iter_arg) Fold dim ops of scf.for results to dim ops of the respective iter args if the loop is shape preserving. Differential Revision: https://reviews.llvm.org/D109430	2021-09-09 13:47:13 +09:00
Matthias Springer	e2c8fcb9d0	[mlir][linalg] Fold dim(linalg.tiled_loop) to dim(output_arg) Fold dim ops of linalg.tiled_loop results to dim ops of the respective iter args if the loop is shape preserving. Differential Revision: https://reviews.llvm.org/D109431	2021-09-09 13:37:28 +09:00
Matthias Springer	f7137da174	[mlir][linalg] Fix dim(iter_arg) canonicalization Run a small analysis to see if the runtime type of the iter_arg is changing. Fold only if the runtime type stays the same. (Same as `DimOfIterArgFolder` in SCF.) Differential Revision: https://reviews.llvm.org/D109299	2021-09-09 12:13:05 +09:00
Matthias Springer	c95a7246a3	[mlir][linalg] Tiling: Use loop ub in extract_slice size computation if possible When tiling a LinalgOp, extract_slice/insert_slice pairs are inserted. To avoid going out-of-bounds when the tile size does not divide the shape size evenly (at the boundary), AffineMin ops are inserted. Some ops have assumptions regarding the dimensions of inputs/outputs. E.g., in a `A * B` matmul, `dim(A, 1) == dim(B, 0)`. However, loop bounds use either `dim(A, 1)` or `dim(B, 0)`. With this change, AffineMin ops are expressed in terms of loop bounds instead of tensor sizes. (Both have the same runtime value.) This simplifies canonicalizations. Differential Revision: https://reviews.llvm.org/D109267	2021-09-09 11:06:22 +09:00
Matthias Springer	c57c4f888c	[mlir][linalg] linalg.tiled_loop peeling Differential Revision: https://reviews.llvm.org/D108270	2021-09-07 09:50:08 +09:00
Alexander Belyaev	58c188507f	[mlir][linalg] Fix `FoldInitTensorWithDimOp` if dim(init_tensor) is static. It looks like it was a typo. Instead of `maybeConstantIndex`, `initTensorOp.getStaticSize(maybeConstantIndex)` should be used to access the dim size of the tensor. There is a test for that in `canonicalize.mlir`, but it was working correctly because `ReplaceStaticShapeDims` was canonicalizing DimOp before `FoldInitTensorWithDimOp`. So, to make the patterns more "orthogonal", this case is disabled. Differential Revision: https://reviews.llvm.org/D109247	2021-09-06 10:47:26 +02:00
Eugene Zhulenev	fd52b4357a	[mlir] Async: check awaited operand error state after sync await Previously only await inside the async function (coroutine after lowering to async runtime) would check the error state Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D109229	2021-09-04 05:00:17 -07:00
Loren Maggiore	361458b1ce	[mlir] create gpu memset op Create a gpu memset op and corresponding CUDA and ROCm wrappers. Reviewed By: herhut, lorenrose1013 Differential Revision: https://reviews.llvm.org/D107548	2021-09-04 08:13:04 +02:00
Mehdi Amini	78accf9f35	Make LLVM Linkage a first class attribute instead of using an integer attribute This makes the IR more readable, in particular when this will be used on the builtin func outside of the LLVM dialect. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D109209	2021-09-03 21:21:46 +00:00
Alexander Belyaev	5ee5bbd0ff	[mlir][linalg] Extend tiled_loop to SCF conversion to generate scf.parallel. Differential Revision: https://reviews.llvm.org/D109230	2021-09-03 18:05:54 +02:00
Aart Bik	b6d1a31c1b	[mlir][sparse] refine heuristic for iteration graph topsort The sparse index order must always be satisfied, but this may give a choice in topsorts for several cases. We broke ties in favor of any dense index order, since this gives good locality. However, breaking ties in favor of pushing unrelated indices into sparse iteration spaces gives better asymptotic complexity. This revision improves the heuristic. Note that in the long run, we are really interested in using ML for ML to find the best loop ordering as a replacement for such heuristics. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109100	2021-09-03 08:37:15 -07:00
Jean Perier	49af2a6275	[mlir][flang] Do not prevent integer types from being parsed as MLIR keywords DialectAsmParser::parseKeyword is rejecting `'i' digit+` while it is a valid identifier according to mlir/docs/LangRef.md. Integer types actually used to be TOK_KEYWORD a while back before the change: `6af866c58d`. This patch modifies `isCurrentTokenAKeyword` to return true for tokens that match integer types too. The motivation for this change is the parsing of `!fir.type<{` `component-name: component-type,`+ `}>` type in FIR that represents Fortran derived types. The component-names are parsed as keywords, and can very well be i32 or any ixxx (which are valid Fortran derived type component names). The Quant dialect type parser had to be modified since it relied on `iw` not being parsed as keywords. Differential Revision: https://reviews.llvm.org/D108913	2021-09-03 08:20:49 +02:00
Matthias Springer	4fa6c2734c	[mlir][scf] Allow runtime type of iter_args to change The limitation on iter_args introduced with D108806 is too restricting. Changes of the runtime type should be allowed. Extends the dim op canonicalization with a simple analysis to determine when it is safe to canonicalize. Differential Revision: https://reviews.llvm.org/D109125	2021-09-03 10:03:05 +09:00
Kiran Chandramohan	711aa35759	[MLIR][OpenMP] Add support for declaring critical construct names Add an operation omp.critical.declare to declare names/symbols of critical sections. Named omp.critical operations should use symbols declared by omp.critical.declare. Having a declare operation ensures that the names of critical sections are global and unique. In the lowering flow to LLVM IR, the OpenMP IRBuilder creates unique names for critical sections. Reviewed By: ftynse, jeanPerier Differential Revision: https://reviews.llvm.org/D108713	2021-09-02 14:31:19 +00:00
Alexander Belyaev	f68de11c10	[mlir][linalg] Expose function to create op on buffers during bufferization. Differential Revision: https://reviews.llvm.org/D109140	2021-09-02 11:09:05 +02:00
Weiwei Li	a79d7c2c85	[mlir][SPIRV] Add Image Operands for Image Instructions This patch is to add Image Operands in SPIR-V Dialect and also let ImageDrefGather to use Image Operands. Image Operands are used in many image instructions. "Image Operands encodes what operands follow, as per Image Operands". And usually, they are optional to image instructions. The format of image operands looks like: %0 = spv.ImageXXXX %1, ... %3 : f32 ["Bias\|Lod"](%4, %5 : f32, f32) -> ... This patch doesn’t implement all operands (see Section 3.14 in SPIR-V Spec) but provides a skeleton of it. There is TODO in verifyImageOperands function. Co-authored: Alan Liu <alanliu.yf@gmail.com> Reviewed by: antiagainst Differential Revision: https://reviews.llvm.org/D108501	2021-09-02 04:14:17 +08:00
MaheshRavishankar	b686fdbf92	[mlir][Linalg] Drop output tensor from `linalg.pad_tensor` op. The output tensor was added for tiling purposes. With use of `TilingInterface` for tiling pad operations, there is no need for an explicit operand for the shape of result of `linalg.pad_tensor` op. The interface allows the tiling pattern to query the value that can be used for the "init" needed for tiling dynamically. Differential Revision: https://reviews.llvm.org/D108613	2021-08-31 11:12:24 -07:00
Mehdi Amini	c41b16c26b	Change ASM Op printer to print the operation name in the framework instead of leaving it up to each individual operation This aligns the printer with the parser contract: the operation isn't part of the user-controllable part of the syntax. Differential Revision: https://reviews.llvm.org/D108804	2021-08-31 17:52:40 +00:00
Tres Popp	44485fcd97	[mlir] Prevent assertion failure in DropUnitDims Don't assert fail on strided memrefs when dropping unit dims. Instead just leave them unchanged. Differential Revision: https://reviews.llvm.org/D108205	2021-08-31 12:15:13 +02:00
marina kolpakova a.k.a. geexie	0080d2aa55	[mlir][gpu] folds memref.dim of gpu.alloc implements canonicalization which folds memref.dim(gpu.alloc(%size), %idx) -> %size Differential Revision: https://reviews.llvm.org/D108892	2021-08-31 12:33:10 +03:00
MaheshRavishankar	2dfb66833f	Fix unused variable in release build. Differential Revision: https://reviews.llvm.org/D108963	2021-08-30 19:34:52 -07:00
MaheshRavishankar	ba72cfe734	[mlir] Add an interface to allow operations to specify how they can be tiled. An interface to allow for tiling of operations is introduced. The tiling of the linalg.pad_tensor operation is modified to use this interface. Differential Revision: https://reviews.llvm.org/D108611	2021-08-30 16:31:18 -07:00
Chris Lattner	faf1c22408	[Builder] Eliminate the StringRef/StringAttr forms of getSymbolRefAttr. The StringAttr version doesn't need a context, so we can just use the existing `SymbolRefAttr::get` form. The StringRef version isn't preferred so we want to encourage people to use StringAttr. There is an additional form of getSymbolRefAttr that takes a (SymbolTrait implementing) operation. This should also be moved, but I'll do that as a separate patch. Differential Revision: https://reviews.llvm.org/D108922	2021-08-30 16:05:36 -07:00
Chris Lattner	41d4aa7de6	[SymbolRefAttr] Revise SymbolRefAttr to hold a StringAttr. SymbolRefAttr is fundamentally a base string plus a sequence of nested references. Instead of storing the string data as a copied StringRef, store it as an already-uniqued StringAttr. This makes a lot of things simpler and more efficient because: 1) references to the symbol are already stored as StringAttr's: there is no need to copy the string data into MLIRContext multiple times. 2) This allows pointer comparisons instead of string comparisons (or redundant uniquing) within SymbolTable.cpp. 3) This allows SymbolTable to hold a DenseMap instead of a StringMap (which again copies the string data and slows lookup). This is a moderately invasive patch, so I kept a lot of compatibility APIs around. It would be nice to explore changing getName() to return a StringAttr for example (right now you have to use getNameAttr()), and eliminate things like the StringRef version of getSymbol. Differential Revision: https://reviews.llvm.org/D108899	2021-08-29 21:54:47 -07:00
Matthias Springer	d18ffd61d4	[mlir][SCF] Canonicalize dim(x) where x is an iter_arg * Add `DimOfIterArgFolder`. * Move existing cross-dialect canonicalization patterns to `LoopCanonicalization.cpp`. * Rename `SCFAffineOpCanonicalization` pass to `SCFForLoopCanonicalization`. * Expand documentation of scf.for: The type of loop-carried variables may not change with iterations. (Not even the dynamic type.) Differential Revision: https://reviews.llvm.org/D108806	2021-08-30 01:39:56 +00:00
Matthias Springer	eedc997b7d	[mlir][Analysis] Add batched version of FlatAffineConstraints::addId * Add batched version of all `addId` variants, so that multiple IDs can be added at a time. * Rename `addId` and variants to `insertId` and `appendId`. Most external users call `appendId`. Splitting `addId` into two functions also makes it possible to provide batched version for both. (Otherwise, the overloads are ambiguous when calling `addId`.) Differential Revision: https://reviews.llvm.org/D108532	2021-08-30 00:56:44 +00:00
Lei Zhang	a5621e26db	[mlir][spirv] Use type dyn_cast when scanning spv.GlobalVariable This avoids crashes when there are spv.GlobalVariable without pointer type.	2021-08-29 12:01:19 -04:00
Aart Bik	0a7b8cc5dd	[mlir][sparse] fully implement sparse tensor to sparse tensor conversions with rigorous integration test Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108721	2021-08-27 15:08:18 -07:00
Butygin	1e35a7690d	[mlir][spirv] Initial support for 64 bit index type and builtins Differential Revision: https://reviews.llvm.org/D108516	2021-08-27 01:38:53 +03:00
River Riddle	9658b061dd	[mlir] Update DialectAsmParser::parseString to use std::string instead of StringRef This allows for parsing strings that have escape sequences, which require constructing a string (as they can't be represented by looking at the Token contents directly). Differential Revision: https://reviews.llvm.org/D108589	2021-08-25 09:27:35 +00:00
Matthias Springer	a9cff97f94	[mlir][SCF] Generalize AffineMinSCFCanonicalization to min/max ops * Add support for affine.max ops to SCF loop peeling pattern. * Add support for affine.max ops to `AffineMinSCFCanonicalizationPattern`. * Rename `AffineMinSCFCanonicalizationPattern` to `AffineOpSCFCanonicalizationPattern`. * Rename `AffineMinSCFCanonicalization` pass to `SCFAffineOpCanonicalization`. Differential Revision: https://reviews.llvm.org/D108009	2021-08-25 10:40:34 +09:00
Matthias Springer	2de2dbef2a	[mlir][linalg] Replace AffineMinSCFCanonicalizationPattern with SCF reimplementation Use the new canonicalization pattern in the SCF dialect. Differential Revision: https://reviews.llvm.org/D107732	2021-08-25 08:52:56 +09:00
Matthias Springer	98aa694d0d	[mlir][scf] Add general affine.min canonicalization pattern This canonicalization simplifies affine.min operations inside "for loop"-like operations (e.g., scf.for and scf.parallel) based on two invariants: * iv >= lb * iv < lb + step * ((ub - lb - 1) floorDiv step) + 1 This commit adds a new pass `canonicalize-scf-affine-min` (instead of being a canonicalization pattern) to avoid dependencies between the Affine dialect and the SCF dialect. Differential Revision: https://reviews.llvm.org/D107731	2021-08-25 07:32:30 +09:00
Tyler Augustine	d25e91d7f6	Support alias.scope and noalias metadata Introduces new Ops to represent 1. alias.scope metadata in LLVM, and 2. domains for these scopes. These correspond to the metadata described in https://llvm.org/docs/LangRef.html#noalias-and-alias-scope-metadata. Lists of scopes are modeled the same way as access groups - as an ArrayAttr on the Op (added in https://reviews.llvm.org/D97944). Lowering 'noalias' attributes on function parameters is already supported. However, lowering `noalias` metadata on individual Ops is not, which is added in this change. LLVM uses the same keyword for these, but this change introduces a separate attribute name 'noalias_scopes' to represent this distinct concept. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D107870	2021-08-24 20:42:59 +02:00
Aart Bik	fda176892e	[mlir][sparse] use new permutation utility to avoid codedup Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D108636	2021-08-24 08:48:17 -07:00
Matthias Springer	ebf35370ff	[mlir][tensor] Insert explicit tensor.cast ops for insert_slice src If additional static type information can be deduced from a insert_slice's size operands, insert an explicit cast of the op's source operand. This enables other canonicalization patterns that are matching for tensor_cast ops such as `ForOpTensorCastFolder` in SCF. Differential Revision: https://reviews.llvm.org/D108617	2021-08-24 19:45:04 +09:00
Matthias Springer	0c36082963	[mlir][SCF] Use symbols in loop peeling rewrite Use symbols in the affine map instead of dims. Dims should not be divided. Differential Revision: https://reviews.llvm.org/D108431	2021-08-24 19:39:19 +09:00
MaheshRavishankar	b546f4347b	[mlir][Linalg] Allow controlling fusion of linalg.generic -> linalg.tensor_expand_shape. Differential Revision: https://reviews.llvm.org/D108565	2021-08-23 16:28:10 -07:00
Aart Bik	236a90802d	[mlir][sparse] replace support lib conversion with actual MLIR codegen Rationale: Passing in a pointer to the memref data in order to implement the dense to sparse conversion was a bit too low-level. This revision improves upon that approach with a cleaner solution of generating a loop nest in MLIR code itself that prepares the COO object before passing it to our "swiss army knife" setup. This is much more intuitive and now also allows for dynamic shapes. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108491	2021-08-23 14:26:05 -07:00
River Riddle	4e103a12d9	[mlir] Add support for VariadicOfVariadic operands This revision adds native ODS support for VariadicOfVariadic operand groups. An example of this is the SwitchOp, which has a variadic number of nested operand ranges for each of the case statements, where the number of case statements is variadic. Builtin ODS support allows for generating proper accessors for the nested operand ranges, builder support, and declarative format support. VariadicOfVariadic operands are supported by providing a segment attribute to use to store the operand groups, mapping similarly to the AttrSizedOperand trait (but with a user defined attribute name). `build` methods for VariadicOfVariadic operand expect inputs of the form `ArrayRef<ValueRange>`. Accessors for the variadic ranges return a new `OperandRangeRange` type, which represents a contiguous range of `OperandRange`. In the declarative assembly format, VariadicOfVariadic operands and types are by default formatted as a comma delimited list of value lists: `(<value>, <value>), (), (<value>)`. Differential Revision: https://reviews.llvm.org/D107774	2021-08-23 20:32:31 +00:00
MaheshRavishankar	4aeeb91a92	[mlir][Linalg] Allow all build methods of Structured ops to specify additional attributes. Differential Revision: https://reviews.llvm.org/D108338	2021-08-23 13:06:34 -07:00
Matthias Springer	bc194a5bb5	[mlir][SCF] Do not peel loops inside partial iterations Do not apply loop peeling to loops that are contained in the partial iteration of an already peeled loop. This is to avoid code explosion when dealing with large loop nests. Can be controlled with a new pass option `skip-partial`. Differential Revision: https://reviews.llvm.org/D108542	2021-08-23 21:35:46 +09:00
Rob Suderman	871c812483	[mlir][linalg] Finish refactor of TC ops to YAML Multiple operations were still defined as TC ops that had equivalent versions as YAML operations. Reducing to a single compilation path guarantees that frontends can lower to their equivalent operations without missing the optimized fastpath. Some operations are maintained purely for testing purposes (mainly conv{1,2,3}D as they are included as sole tests in the vectorization transforms). Differential Revision: https://reviews.llvm.org/D108169	2021-08-20 12:35:04 -07:00
Vladislav Vinogradov	9775c0c9f0	[mlir] Fix ControlFlowInterfaces implementation for Async dialect * Add `RegionBranchTerminatorOpInterface` to `YieldOp`. * Implement `getSuccessorEntryOperands` in `ExecuteOp`. * Fix `getSuccessorRegions` implementation in `ExecuteOp`. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D108373	2021-08-20 12:14:45 +03:00
Morten Borup Petersen	6c1436a9b0	[MLIR][SCF] Parenthesize multiple return types in scf.execute_region asm op Previously, ExecuteRegionOps with multiple return values would fail a round-trip test due to missing parenthesis around the types. Differential Revision: https://reviews.llvm.org/D108402	2021-08-19 21:31:51 +01:00
MaheshRavishankar	16ffb283c5	Revert "[mlir][Linalg] Allow all build methods of Structured ops to specify additional attributes." This reverts commit `95ddc8341a`. Differential Revision: https://reviews.llvm.org/D108396	2021-08-19 11:53:41 -07:00
MaheshRavishankar	95ddc8341a	[mlir][Linalg] Allow all build methods of Structured ops to specify additional attributes. Differential Revision: https://reviews.llvm.org/D108338	2021-08-19 11:14:35 -07:00
Matthias Springer	76a1861816	[mlir][SparseTensor] Split scf.for loop into masked/unmasked parts Apply the "for loop peeling" pattern from SCF dialect transforms. This pattern splits scf.for loops into full and partial iterations. In the full iteration, all masked loads/stores are canonicalized to unmasked loads/stores. Differential Revision: https://reviews.llvm.org/D107733	2021-08-19 21:53:11 +09:00
Matthias Springer	8e8b70aa84	[mlir][scf] Simplify affine.min ops after loop peeling Simplify affine.min ops, enabling various other canonicalizations inside the peeled loop body. affine.min ops such as: ``` map = affine_map<(d0)[s0, s1] -> (s0, -d0 + s1)> %r = affine.min #affine.min #map(%iv)[%step, %ub] ``` are rewritten into (in the case of the peeled loop): ``` %r = %step ``` To determine how an affine.min op should be rewritten and to prove its correctness, FlatAffineConstraints is utilized. Differential Revision: https://reviews.llvm.org/D107222	2021-08-19 17:24:53 +09:00
Matthias Springer	08dbed8a57	[mlir][linalg] Canonicalize dim ops of tiled_loop block args E.g.: ``` %y = ... : tensor<...> linalg.tiled_loop ... ins(%x = %y : tensor<...>) { tensor.dim %x, %c0 : tensor<...> } ``` is rewritten to: ``` %y = ... : tensor<...> linalg.tiled_loop ... ins(%x = %y : tensor<...>) { tensor.dim %y, %c0 : tensor<...> } ``` Differential Revision: https://reviews.llvm.org/D108272	2021-08-19 11:24:33 +09:00
Matthias Springer	9329438244	[mlir][linalg] Remove ConstraintsSet class The same functionality can be implemented with FlatAffineValueConstraints. Differential Revision: https://reviews.llvm.org/D108179	2021-08-19 10:57:35 +09:00
Aart Bik	d37d72eaf8	[mlir][sparse] use shared util for DimOp generation This shares more code with existing utilities. Also, to be consistent, we moved dimension permutation on the DimOp to the tensor lowering phase. This way, both pre-existing DimOps on sparse tensors (not likely but possible) as well as compiler generated DimOps are handled consistently. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108309	2021-08-18 17:12:32 -07:00
Diego Caballero	b7cac864b2	[mlir] Fix typo in SuperVectorizer NFC. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D108334	2021-08-18 22:55:12 +00:00
Chia-hung Duan	41e5dbe0fa	Enables inferring return types for Shape op if possible Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D102565	2021-08-18 21:36:55 +00:00
Butygin	ddc3d51d58	[mlir][spirv] Add (InBounds)PtrAccessChain ops Differential Revision: https://reviews.llvm.org/D108070	2021-08-18 17:59:21 +03:00
Lei Zhang	4c15ad2321	[mlir][linalg] Don't drop existing attributes when creating ops Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D108219	2021-08-17 15:44:56 -04:00
Tobias Gysi	583a754248	[mlir][linalg] Remove duplicate methods (NFC). Remove duplicate methods used to check iterator types. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D108102	2021-08-17 09:06:17 +00:00

2778 Commits