llvm-project

Commit Graph

Author	SHA1	Message	Date
Eugene Zhulenev	94e645f9cc	[mlir] Async: Add numWorkerThreads argument to createAsyncParallelForPass Add an option to pass the number of worker threads to select the number of async regions for parallel for transformation. ``` std::unique_ptr<OperationPass<FuncOp>> createAsyncParallelForPass(int numWorkerThreads); ``` Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D92835	2020-12-08 10:30:14 -08:00
Benjamin Kramer	5844bc540c	[mlir][Shape] Canonicalize assume_all with one input and tensor_cast of constant_shape This allows simplifying some more complicated shape expressions Differential Revision: https://reviews.llvm.org/D92843	2020-12-08 17:07:24 +01:00
ergawy	6c69d3d68e	[MLIR][SPIRV] Add initial support for OpSpecConstantOp. This commit adds initial support for SPIR-V OpSpecConstantOp instruction. The following is introdcued: - A new `spv.specConstantOperation` operation consisting of a single region and of 2 operations within that regions (more details in the docs of the op itself). - A new `spv.yield` instruction that acts a terminator for `spv.specConstantOperation`. For now, the generic form of the new op is supported (i.e. no custom parsing or printing). This will be done in a follow up patch. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D92232	2020-12-08 09:07:52 -05:00
Alex Zinenko	80766ecc65	[mlir] Add an option to control the number of loops in affine parallelizer Add a pass option to control the number of nested parallel loops produced by the parallelization passes. This is useful to build end-to-end passes targeting systems that don't need multiple parallel dimensions (e.g., CPUs typically need only one). Reviewed By: wsmoses, chelini Differential Revision: https://reviews.llvm.org/D92765	2020-12-08 10:44:37 +01:00
Alex Zinenko	2fe30a3534	[mlir] properly support min/max in affine parallelization The existing implementation of the affine parallelization silently copies over the lower and upper bound maps from affine.for to affine.parallel. However, the semantics of these maps differ between these two ops: in affine.for, a max(min) of results is taken for the lower(upper) bound; in affine.parallel, multiple induction variables can be defined an each result corresponds to one induction variable. Thus the existing implementation could generate invalid IR or IR that passes the verifier but has different semantics than the original code. Fix the parallelization utility to emit dedicated min/max operations before the affine.parallel in such cases. Disallow parallelization if min/max would have been in an operation without the AffineScope trait, e.g., in another loop, since the result of these operations is not considered a valid affine dimension identifier and may not be properly handled by the affine analyses. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D92763	2020-12-08 10:43:35 +01:00
Aart Bik	74cd9e587d	[mlir][sparse] hoist loop invariant tensor loads in sparse compiler After bufferization, the backend has much more trouble hoisting loop invariant loads from the loops generated by the sparse compiler. Therefore, this is done during sparse code generation. Note that we don't bother hoisting derived invariant expressions on SSA values, since the backend does that very well. Still TBD: scalarize reductions to avoid load-add-store cycles Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D92534	2020-12-07 11:59:48 -08:00
Navdeep Kumar	dc930e5f2f	[MLIR][Affine] Add affine.for normalization support Add support to normalize affine.for ops i.e., convert the lower bound to zero and loop step to one. The Upper bound is set to the trip count of the loop. The exact value of loopIV is calculated just inside the body of affine.for. Currently loops with lower bounds having single result are supported. No such restriction exists on upper bounds. Differential Revision: https://reviews.llvm.org/D92233	2020-12-07 22:04:07 +05:30
Rahul Joshi	fe7fdcac87	[MLIR] Fix parseFunctionLikeOp() to fail parsing empty regions - Change parseOptionalRegion to return an OptionalParseResult. - Change parseFunctionLikeOp() to fail parsing if the function body was parsed but was empty. - See https://llvm.discourse.group/t/funcop-parsing-bug/2164 Differential Revision: https://reviews.llvm.org/D91886	2020-12-04 09:09:59 -08:00
Nicolas Vasilache	2c66b6ec09	[mlir][Linalg] NFC - Expose tiling canonicalization patterns through a populate method	2020-12-04 14:57:29 +00:00
Nicolas Vasilache	a1cd559ce5	[mlir][Linalg] Properly use distribution options. Let tiling to scf.for actually use the distribution method. For now only Cyclic is supported. Differential Revision: https://reviews.llvm.org/D92653	2020-12-04 14:00:54 +00:00
Hanhan Wang	f5f1a5c244	[mlir][Linalg] Handle fusion on tensors for projected permutation. In the past, the reshape op can be folded only if the indexing map is permutation in consumer's usage. We can relax to condition to be projected permutation. This patch still limits the fusion for scalar cases. Scalar case is a corner case, because we need to decide where to put extra dims. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D92466	2020-12-03 23:11:29 -08:00
River Riddle	09f7a55fad	[mlir][Types][NFC] Move all of the builtin Type classes to BuiltinTypes.h This is part of a larger refactoring the better congregates the builtin structures under the BuiltinDialect. This also removes the problematic "standard" naming that clashes with the "standard" dialect, which is not defined within IR/. A temporary forward is placed in StandardTypes.h to allow time for downstream users to replaced references. Differential Revision: https://reviews.llvm.org/D92435	2020-12-03 18:02:10 -08:00
River Riddle	e66c2e259f	[mlir][NFC] Remove Function.h and Module.h in favor of BuiltinOps.h The definitions of ModuleOp and FuncOp are now within BuiltinOps.h, making the individual files obsolete. Differential Revision: https://reviews.llvm.org/D92622	2020-12-03 18:02:10 -08:00
Aart Bik	c95acf052b	[mlir][vector][avx512] move avx512 lowering pass into general vector lowering A separate AVX512 lowering pass does not compose well with the regular vector lowering pass. As such, it is at risk of code duplication and lowering inconsistencies. This change removes the separate AVX512 lowering pass and makes it an "option" in the regular vector lowering pass (viz. vector dialect "augmented" with AVX512 dialect). Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D92614	2020-12-03 17:23:46 -08:00
Thomas Raoux	c503dc1b8a	[mlir][linalg] Add vectorization for element-wise linalg ops Add support for vectorization for linalg.generic representing element-wise ops. Those are converted to transfer_read + vector ops + transfer_write. Also re-organize the vectorization tests to be together. Implementation derived from the work of @burmako, @agrue and @fedelebron. Differential Revision: https://reviews.llvm.org/D92540	2020-12-03 15:31:13 -08:00
Max Kudryavtsev	636db7f87c	[MLIR] Fix vector::TransferWriteOp builder losing permutation map Supervectorizer pass uses this builder and loses the permutation map. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D92145	2020-12-03 09:53:08 -08:00
Christian Sigg	48f7ca1879	Fix forward for rGd9adde5ae216: adding missing dependency. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D92552	2020-12-03 10:16:57 +01:00
Christian Sigg	d9adde5ae2	[mlir][gpu] Move gpu.wait ops from async.execute regions to its dependencies. This can prevent unnecessary host synchronization. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D90346	2020-12-03 08:52:28 +01:00
Christian Sigg	c4a0405902	Add `Operation* OpState::operator->()` to provide more convenient access to members of Operation. Given that OpState already implicit converts to Operator*, this seems reasonable. The alternative would be to add more functions to OpState which forward to Operation. Reviewed By: rriddle, ftynse Differential Revision: https://reviews.llvm.org/D92266	2020-12-02 15:46:20 +01:00
Sean Silva	774f1d3ffd	[mlir] Small cleanups to func-bufferize/finalizing-bufferize - Address TODO in scf-bufferize: the argument materialization issue is now fixed and the code is now in Transforms/Bufferize.cpp - Tighten up finalizing-bufferize to avoid creating invalid IR when operand types potentially change - Tidy up the testing of func-bufferize, and move appropriate tests to a new finalizing-bufferize.mlir - The new stricter checking in finalizing-bufferize revealed that we needed a DimOp conversion pattern (found when integrating into npcomp). Previously, the converion infrastructure was blindly changing the operand type during finalization, which happened to work due to DimOp's tensor/memref polymorphism, but is generally not encouraged (the new pattern is the way to tell the conversion infrastructure that it is legal to change that type).	2020-11-30 17:04:14 -08:00
Jacques Pienaar	e534cee26a	[mlir] Add a shape function library op Op with mapping from ops to corresponding shape functions for those op in the library and mechanism to associate shape functions to functions. The mapping of operand to shape function is kept separate from the shape functions themselves as the operation is associated to the shape function and not vice versa, and one could have a common library of shape functions that can be used in different contexts. Use fully qualified names and require a name for shape fn lib ops for now and an explicit print/parse (based around the generated one & GPU module op ones). This commit reverts `d9da4c3e73`. Fixes missing headers (don't know how that was working locally). Differential Revision: https://reviews.llvm.org/D91672	2020-11-29 11:15:30 -08:00
Mehdi Amini	d9da4c3e73	Revert "[mlir] Add a shape function library op" This reverts commit `6dd9596b19`. Build is broken.	2020-11-29 05:28:42 +00:00
Jacques Pienaar	6dd9596b19	[mlir] Add a shape function library op Op with mapping from ops to corresponding shape functions for those op in the library and mechanism to associate shape functions to functions. The mapping of operand to shape function is kept separate from the shape functions themselves as the operation is associated to the shape function and not vice versa, and one could have a common library of shape functions that can be used in different contexts. Use fully qualified names and require a name for shape fn lib ops for now and an explicit print/parse (based around the generated one & GPU module op ones). Differential Revision: https://reviews.llvm.org/D91672	2020-11-28 15:53:59 -08:00
Frederik Gossen	6484567f14	[MLIR][SCF] Find all innermost loops for parallel loop tiling Overcome the assumption that parallel loops are only nested in other parallel loops. Differential Revision: https://reviews.llvm.org/D92188	2020-11-27 10:08:56 +01:00
Stephan Herhut	4dd5f79f07	[mlir][bufferize] Add argument materialization for bufferization This enables partial bufferization that includes function signatures. To test this, this change also makes the func-bufferize partial and adds a dedicated finalizing-bufferize pass. Differential Revision: https://reviews.llvm.org/D92032	2020-11-26 13:43:44 +01:00
Aart Bik	d5f0d0c0c4	[mlir][sparse] add ability to select pointer/index storage type This change gives sparse compiler clients more control over selecting individual types for the pointers and indices in the sparse storage schemes. Narrower width obviously results in smaller memory footprints, but the range should always suffice for the maximum number of entries or index value. Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D92126	2020-11-25 17:32:44 -08:00
Sean Silva	5488a6b0ff	[NFC] Fix pattern name. It still had the old name from before ElementwiseMappable was added.	2020-11-25 16:10:34 -08:00
Frank Laub	9ffba19e86	[MLIR][Affine] Add custom builders for AffineVectorLoadOp/AffineVectorStoreOp Adding missing custom builders for AffineVectorLoadOp & AffineVectorStoreOp. In practice, it is difficult to correctly construct these ops without these builders (because the AffineMap is not included at construction time). Differential Revision: https://reviews.llvm.org/D86380	2020-11-25 20:22:56 +00:00
Aart Bik	5c4e397e6c	[mlir][sparse] add parallelization strategies to sparse compiler This CL adds the ability to request different parallelization strategies for the generate code. Every "parallel" loop is a candidate, and converted to a parallel op if it is an actual for-loop (not a while) and the strategy allows dense/sparse outer/inner parallelization. This will connect directly with the work of @ezhulenev on parallel loops. Still TBD: vectorization strategy Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D91978	2020-11-24 17:17:13 -08:00
Aart Bik	b228e2bd92	[mlir][sparse] generalize invariant expression handling in sparse compiler Generalizes invariant handling to anything defined outside the Linalg op (parameters and SSA computations). Fixes bug that was using parameter number as tensor number. Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D91985	2020-11-24 13:41:14 -08:00
Alex Zinenko	119545f433	[mlir] Add conversion from SCF parallel loops to OpenMP Introduce a conversion pass from SCF parallel loops to OpenMP dialect constructs - parallel region and workshare loop. Loops with reductions are not supported because the OpenMP dialect cannot model them yet. The conversion currently targets only one level of parallelism, i.e. only one top-level `omp.parallel` operation is produced even if there are nested `scf.parallel` operations that could be mapped to `omp.wsloop`. Nested parallelism support is left for future work. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D91982	2020-11-24 21:12:56 +01:00
Nicolas Vasilache	c247081025	[mlir] NFC - Refactor and expose a helper printOffsetSizesAndStrides helper function. Print part of an op of the form: ``` <optional-offset-prefix>`[` offset-list `]` <optional-size-prefix>`[` size-list `]` <optional-stride-prefix>[` stride-list `]` ``` Also address some leftover nits. Differential revision: https://reviews.llvm.org/D92031	2020-11-24 20:00:59 +00:00
Nicolas Vasilache	b6c71c13a3	[mlir] NFC - Refactor and expose a parsing helper for OffsetSizeAndStrideInterface Parse trailing part of an op of the form: ``` <optional-offset-prefix>`[` offset-list `]` <optional-size-prefix>`[` size-list `]` <optional-stride-prefix>[` stride-list `]` ``` Each entry in the offset, size and stride list either resolves to an integer constant or an operand of index type. Constants are added to the `result` as named integer array attributes with name `OffsetSizeAndStrideOpInterface::getStaticOffsetsAttrName()` (resp. `getStaticSizesAttrName()`, `getStaticStridesAttrName()`). Append the number of offset, size and stride operands to `segmentSizes` before adding it to `result` as the named attribute: `OpTrait::AttrSizedOperandSegments<void>::getOperandSegmentSizeAttr()`. Offset, size and stride operands resolution occurs after `preResolutionFn` to give a chance to leading operands to resolve first, after parsing the types. ``` ParseResult parseOffsetsSizesAndStrides( OpAsmParser &parser, OperationState &result, ArrayRef<int> segmentSizes, llvm::function_ref<ParseResult(OpAsmParser &, OperationState &)> preResolutionFn = nullptr, llvm::function_ref<ParseResult(OpAsmParser &)> parseOptionalOffsetPrefix = nullptr, llvm::function_ref<ParseResult(OpAsmParser &)> parseOptionalSizePrefix = nullptr, llvm::function_ref<ParseResult(OpAsmParser &)> parseOptionalStridePrefix = nullptr); ``` Differential revision: https://reviews.llvm.org/D92030	2020-11-24 19:45:16 +00:00
Tei Jeong	760063267c	Fix CalibratedQuantizedType's print function to match parser Reviewed By: liufengdb Differential Revision: https://reviews.llvm.org/D92034	2020-11-24 09:35:35 -08:00
Stella Laurenzo	db9713cd77	[mlir] Add Tosa dialect const folder for tosa.const. * Was missed in the initial submission and is required for a ConstantLike op. * Also adds a materializeConstant hook to preserve it. * Tightens up the argument constraint on tosa.const to match what is actually legal. Differential Revision: https://reviews.llvm.org/D92040	2020-11-24 17:33:00 +00:00
Nicolas Vasilache	a8de412f51	[mlir] NFC - Expose an OffsetSizeAndStrideOpInterface This revision will make it easier to create new ops base on the strided memref abstraction outside of the std dialect. OffsetSizeAndStrideOpInterface is an interface for ops that allow specifying mixed dynamic and static offsets, sizes and strides variadic operands. Ops that implement this interface need to expose the following methods: 1. `getArrayAttrRanks` to specify the length of static integer attributes. 2. `offsets`, `sizes` and `strides` variadic operands. 3. `static_offsets`, resp. `static_sizes` and `static_strides` integer array attributes. The invariants of this interface are: 1. `static_offsets`, `static_sizes` and `static_strides` have length exactly `getArrayAttrRanks()`[0] (resp. [1], [2]). 2. `offsets`, `sizes` and `strides` have each length at most `getArrayAttrRanks()`[0] (resp. [1], [2]). 3. if an entry of `static_offsets` (resp. `static_sizes`, `static_strides`) is equal to a special sentinel value, namely `ShapedType::kDynamicStrideOrOffset` (resp. `ShapedType::kDynamicSize`, `ShapedType::kDynamicStrideOrOffset`), then the corresponding entry is a dynamic offset (resp. size, stride). 4. a variadic `offset` (resp. `sizes`, `strides`) operand must be present for each dynamic offset (resp. size, stride). This interface is useful to factor out common behavior and provide support for carrying or injecting static behavior through the use of the static attributes. Differential Revision: https://reviews.llvm.org/D92011	2020-11-24 14:42:47 +00:00
Alexander Belyaev	fd92c5dbee	[mlir][linalg] Add bufferization pattern for `linalg.indexed_generic`. Differential Revision: https://reviews.llvm.org/D92014	2020-11-24 11:14:21 +01:00
Nicolas Vasilache	5073e7edb6	[mlir] Add mising dependency	2020-11-23 20:36:50 +00:00
MaheshRavishankar	11ea2e2448	[mlir][Linalg] NFC: Expose some utility functions used for promotion. Exposing some utility functions from Linalg to allow for promotion of fused views outside of the core tile+fuse logic. This is an alternative to patch D91322 which adds the promotion logic to the tileAndFuse method. Downside with that approach is that it is not easily customizable based on needs. Differential Revision: https://reviews.llvm.org/D91503	2020-11-23 10:35:42 -08:00
MaheshRavishankar	e65a5e5b00	[mlir][Linalg] Fuse sequence of Linalg operation (on buffers) Enhance the tile+fuse logic to allow fusing a sequence of operations. Make sure the value used to obtain tile shape is a SubViewOp/SubTensorOp. Current logic used to get the bounds of loop depends on the use of `getOrCreateRange` method on `SubViewOp` and `SubTensorOp`. Make sure that the value/dim used to compute the range is from such ops. This fix is a reasonable WAR, but a btter fix would be to make `getOrCreateRange` method be a method of `ViewInterface`. Differential Revision: https://reviews.llvm.org/D90991	2020-11-23 10:30:51 -08:00
Alex Zinenko	31a233d463	[mlir] canonicalize away zero-iteration SCF for loops An SCF 'for' loop does not iterate if its lower bound is equal to its upper bound. Remove loops where both bounds are the same SSA value as such bounds are guaranteed to be equal. Similarly, remove 'parallel' loops where at least one pair of respective lower/upper bounds is specified by the same SSA value. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D91880	2020-11-23 15:04:31 +01:00
Nicolas Vasilache	9ac0b314a4	[mlir][Linalg] Drop symbol_source abstraction which does not pay for itself. Differential Revision: https://reviews.llvm.org/D91956	2020-11-23 12:43:02 +00:00
Nicolas Vasilache	01c4418544	[mlir][Linalg] NFC - Factor out Linalg functionality for shape and loop bounds computation This revision refactors code used in various Linalg transformations and makes it a first class citizen to the LinalgStructureOpInterface. This is in preparation to allowing more advanced Linalg behavior but is otherwise NFC. Differential revision: https://reviews.llvm.org/D91863	2020-11-23 10:17:18 +00:00
Aart Bik	af42550523	[mlir][sparse] refine optimization, add few more test cases Adds tests for full sum reduction (tensors summed up into scalars) and the well-known sampled-dense-dense-matrix-product. Refines the optimizations rules slightly to handle the summation better. Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D91818	2020-11-20 17:01:59 -08:00
Thomas Raoux	369c51a74b	[mlir][vector] Add transfer_op LoadToStore forwarding and deadStore optimizations Add transformation to be able to forward transfer_write into transfer_read operation and to be able to remove dead transfer_write when a transfer_write is overwritten before being read. Differential Revision: https://reviews.llvm.org/D91321	2020-11-20 11:59:01 -08:00
Alex Zinenko	18d0f7d5c3	[mlir] add canonicalization patterns for trivial SCF 'for' and 'if' Add canoncalization patterns to remove zero-iteration 'for' loops, replace single-iteration 'for' loops with their bodies; remove known-false conditionals with no 'else' branch and replace conditionals with known value by the respective region. Although similar transformations are performed at the CFG level, not all flows reach that level, e.g., the GPU flow may want to remove single-iteration loops before deciding on loop mapping to thread dimensions. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D91865	2020-11-20 19:04:39 +01:00
Stella Stamenova	370d0bac90	[mlir] Expose parseDimAndSymbolList from affineops.h This was removed from ops.h, but it is used by onnx-mlir Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D91830	2020-11-20 09:26:58 -08:00
Stephan Herhut	a89e55ca57	[mlir][std] Canonicalize a dim(memref_reshape) into a load from the shape operand This canonicalization helps propagate shape information through the program. Differential Revision: https://reviews.llvm.org/D91854	2020-11-20 14:03:02 +01:00
Stephan Herhut	6af81ea1d6	[mlir][std] Fold load(tensor_to_memref) into extract_element This canonicalization is useful to resolve loads into scalar values when doing partial bufferization. Differential Revision: https://reviews.llvm.org/D91855	2020-11-20 13:42:11 +01:00
Stephan Herhut	cb778c3423	[mlir][std] Fold comparisons when the operands are equal For equal operands, comparisons can be decided statically. Differential Revision: https://reviews.llvm.org/D91856	2020-11-20 13:26:41 +01:00

1 2 3 4 5 ...

1516 Commits