llvm-project

Commit Graph

Author	SHA1	Message	Date
Valentin Clement	cfcdebaf32	[mlir][openacc] Conversion of data operands in acc.parallel to LLVM IR dialect Convert data operands from the acc.parallel operation using the same conversion pattern than D102170. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D103337	2021-06-07 11:22:20 -04:00
KareemErgawy	2def12ebc6	[MLIR][SPIRV] Use getAsmResultName(...) hook for AddressOfOp. Implements better naming for results of spv.mlir.addressof ops by making it inherit from OpAsmOpInterface and implementing the associated getAsmResultName(...) hook. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D103594	2021-06-07 13:58:26 +02:00
Matthias Springer	6e7bbdd6e7	[mlir] Add offset/stride helper functions to OffsetSizeAndStrideOpInterface * Add hasUnitStride and hasZeroOffset to OffsetSizeAndStrideOpInterface. These functions are useful for various patterns. E.g., some vectorization patterns apply only for tensor ops with zero offsets and/or unit stride. * Add getConstantIntValue and isEqualConstantInt helper functions, which are useful for implementing the two above functions, as well as various patterns. Differential Revision: https://reviews.llvm.org/D103763	2021-06-07 20:11:41 +09:00
Tobias Gysi	caf26612dd	[mlir][linalg] Cleanup LinalgOp usage in comprehensive bufferization. Replace the uses of deprecated Structured Op Interface methods in ComprehensiveBufferize.cpp. This patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103520	2021-06-07 09:08:13 +00:00
Aart Bik	86e9bc1a34	[mlir][sparse] add option for 32-bit indices in scatter/gather Controlled by a compiler option, if 32-bit indices can be handled with zero/sign-extention alike (viz. no worries on non-negative indices), scatter/gather operations can use the more efficient 32-bit SIMD version. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D103632	2021-06-04 16:57:12 -07:00
Ahmed Taei	bba8d8c186	Revert "Add memref.dim canonicalization patterns to TilingCanonicalizationPatterns" This reverts commit `a52959401d`. Differential Revision: https://reviews.llvm.org/D103724	2021-06-04 15:41:43 -07:00
Ahmed Taei	a52959401d	Add memref.dim canonicalization patterns to TilingCanonicalizationPatterns Otherwise tiled and padded linalg op will be alive (after distribution). Differential Revision: https://reviews.llvm.org/D103715	2021-06-04 13:40:36 -07:00
Matthias Springer	e789efc92a	[mlir][linalg] Refactor PadTensorOpVectorizationPattern (NFC) * Rename PadTensorOpVectorizationPattern to GenericPadTensorOpVectorizationPattern. * Make GenericPadTensorOpVectorizationPattern a private pattern, to be instantiated via populatePadTensorOpVectorizationPatterns. * Factor out parts of PadTensorOpVectorizationPattern into helper functions. This commit prepares PadTensorOpVectorizationPattern for a series of subsequent commits that add more specialized PadTensorOp vectorization patterns. Differential Revision: https://reviews.llvm.org/D103681	2021-06-04 23:45:08 +09:00
Valentin Clement	fcb1547229	[mlir][openacc] Conversion of data operands in acc.data to LLVM IR dialect Convert data operands from the acc.data operation using the same conversion pattern than D102170. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D103332	2021-06-04 10:26:22 -04:00
Tobias Gysi	67b1c37d9f	[mlir][linalg] Cleanup left over uses of deprecated LinalgOp methods. Replace all remaining uses of deprecated Structured Op Interface methods. This patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103673	2021-06-04 08:48:02 +00:00
Alexander Belyaev	89df483d30	[mlir] Fix warnings.	2021-06-03 17:09:09 +02:00
Tobias Gysi	f44e90b93a	[mlir][linalg] Cleanup LinalgOp usage in scalar inlining. Replace the uses of deprecated Structured Op Interface methods in InlineScalarOperands.cpp. This patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103518	2021-06-03 14:45:14 +00:00
Tobias Gysi	8fb6c31cbb	[mlir][linalg] Cleanup LinalgOp usage in op declarations. Replace the uses of deprecated Structured Op Interface methods in LinalgOps.cpp. This patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103506	2021-06-03 14:04:44 +00:00
Tobias Gysi	6b265f949f	[mlir][linalg] Cleanup LinalgOp usage in loop lowering. Replace the uses of deprecated Structured Op Interface methods in Loops.cpp. This patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103453	2021-06-03 13:29:52 +00:00
Nicolas Agostini	0804a88e48	[mlir][linalg] Transform PadTensorOp into InitOp, FillOp, GenericOp Introduces a test pass that rewrites PadTensorOps with static shapes as a sequence of: ``` linalg.init_tensor // to create output linalg.fill // to initialize with padding value linalg.generic // to copy the original contents to the padded tensor ``` The pass can be triggered with: - `--test-linalg-transform-patterns="test-transform-pad-tensor"` Differential Revision: https://reviews.llvm.org/D102804	2021-06-03 22:09:09 +09:00
Tobias Gysi	c698505257	[mlir][linalg] Cleanup LinalgOp usage in drop unit dims. Replace the uses of deprecated Structured Op Interface methods in DropUnitDims.cpp. This patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103448	2021-06-03 12:27:05 +00:00
Tobias Gysi	7c234ae549	[mlir][linalg] Cleanup LinalgOp usage in bufferize, detensorize, and interchange. Replace the uses of deprecated Structured Op Interface methods in Bufferize.cpp, Detensorize.cpp, and Interchange.cpp. The patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103530	2021-06-03 12:07:29 +00:00
Tobias Gysi	9f815cb578	[mlir][linalg] Cleanup LinalgOp usage in test passes. Replace the uses of deprecated Structured Op Interface methods in TestLinalgElementwiseFusion.cpp, TestLinalgFusionTransforms.cpp, and Transforms.cpp. The patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103528	2021-06-03 12:07:29 +00:00
Tobias Gysi	e70d2c8e6f	[mlir][linalg] Cleanup LinalgOp usage in promotion. Replace the uses of deprecated Structured Op Interface methods in Promotion.cpp. This patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103450	2021-06-03 11:01:02 +00:00
Tobias Gysi	ad10d965c8	[mlir][linalg] Cleanup LinalgOp usage in generalization. Replace the uses of deprecated Structured Op Interface methods in Generalization.cpp. This patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103531	2021-06-03 09:45:02 +00:00
Alexander Belyaev	485c21be8a	[mlir] Split linalg reshape ops into expand/collapse. Differential Revision: https://reviews.llvm.org/D103548	2021-06-03 11:40:22 +02:00
Mehdi Amini	8c948b18e9	Fix -Wsign-compare warning (NFC)	2021-06-02 17:28:57 +00:00
Tobias Gysi	f84b908f89	[mlir][linalg] Cleanup LinalgOp usage in fusion on tensors (NFC). Replace the uses of deprecated Structured Op Interface methods in FusionOnTensors.cpp. This patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103471	2021-06-02 12:20:45 +00:00
Tobias Gysi	2f2b5b7d28	[mlir][linalg] Cleanup LinalgOp usage in sparse compiler (NFC). Replace the uses of deprecated Structured Op Interface methods in Sparsification.cpp. This patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103436	2021-06-02 06:21:56 +00:00
Tobias Gysi	07576cc4dc	[mlir][linalg] Fix signed/unsigned comparison warnings (NFC). Fix signedness warnings in Utils.cpp and LinalgInterfaces.cpp.	2021-06-01 10:56:43 +00:00
Tobias Gysi	94643fda13	[mlir][linalg] Cleanup LinalgOp usage in dependence analysis (NFC). Replace the uses of deprecated Structured Op Interface methods in DependenceAnalysis.cpp and DependenceAnalysis.h. This patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103411	2021-06-01 08:44:15 +00:00
Tobias Gysi	7594f5028a	[mlir][linalg] Cleanup LinalgOp usage in fusion (NFC). Replace the uses of deprecated Structured Op Interface methods in Fusion.cpp. This patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103437	2021-06-01 08:21:30 +00:00
Tobias Gysi	c2e5226a85	[mlir][linalg] Cleanup LinalgOp usage in tiling (NFC). Replace the uses of deprecated Structured Op Interface methods in Tiling.cpp and Utils.cpp. This patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103438	2021-06-01 08:17:38 +00:00
Tobias Gysi	912ebf60b1	[mlir][linalg] Cleanup LinalgOp usage in vectorization (NFC). Replace the uses of deprecated Structured Op Interface methods in Vectorization.cpp. This patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103410	2021-06-01 08:08:40 +00:00
Tobias Gysi	f4f7bc1737	[mlir][linalg] Cleanup LinalgOp usage in verification (NFC). Replace the uses of deprecated Structured Op Interface methods in LinalgInterfaces.cpp. This patch is based on https://reviews.llvm.org/D103394. Differential Revision: https://reviews.llvm.org/D103404	2021-05-31 14:25:45 +00:00
Tobias Gysi	0a52d9006c	[mlir][linalg] Update Structured Op Interface (NFC). Adding methods to access operand properties via OpOperands and mark outdated methods as deprecated. Differential Revision: https://reviews.llvm.org/D103394	2021-05-31 13:20:48 +00:00
Frederik Gossen	1288adaa73	[MLIR][Shape] Remove duplicate operands of `shape.assuming_all` op Differential Revision: https://reviews.llvm.org/D103403	2021-05-31 14:37:55 +02:00
Uday Bondhugula	18c2106e28	[MLIR] Fix warnings in AffineOps.cpp Fix warnings in AffineOps.cpp. Differential Revision: https://reviews.llvm.org/D103374	2021-05-31 17:58:02 +05:30
Matthias Springer	2bc8ffa8af	[mlir] Support permutation maps in vector transfer op folder Fold away in_bounds attribute even if the transfer op has a non-identity permutation map. Differential Revision: https://reviews.llvm.org/D103133	2021-05-31 17:22:46 +09:00
KareemErgawy	e493abcf55	[MLIR][SPIRV] Use getAsmResultName(...) hook for ConstantOp. Implements better naming for results of `spv.Constant` ops by making it inherit from OpAsmOpInterface and implementing the associated getAsmResultName(...) hook. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D103152	2021-05-28 09:28:02 +02:00
Eugene Zhulenev	8f23fac4da	[mlir:Async] Convert assertions to async errors only inside async functions Differential Revision: https://reviews.llvm.org/D103278	2021-05-27 12:49:00 -07:00
Eugene Zhulenev	9136b7d075	[mlir] AsyncRefCounting: check that LivenessBlockInfo is not nullptr Differential Revision: https://reviews.llvm.org/D103270	2021-05-27 10:54:21 -07:00
Eugene Zhulenev	d8c84d2a4e	[mlir] Async: Add error propagation support to async groups Depends On D103109 If any of the tokens/values added to the `!async.group` switches to the error state, than the group itself switches to the error state. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D103203	2021-05-27 09:35:11 -07:00
Eugene Zhulenev	39957aa424	[mlir] Add error state and error propagation to async runtime values Depends On D103102 Not yet implemented: 1. Error handling after synchronous await 2. Error handling for async groups Will be addressed in the followup PRs Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D103109	2021-05-27 09:28:47 -07:00
Eugene Zhulenev	c412979cde	[mlir] Async reference counting for block successors with divergent reference counted liveness Support reference counted values implicitly passed (live) only to some of the successors. Example: if branched to ^bb2 token will leak, unless `drop_ref` operation is properly created ``` ^entry: %token = async.runtime.create : !async.token cond_br %cond, ^bb1, ^bb2 ^bb1: async.runtime.await %token async.runtime.drop_ref %token br ^bb2 ^bb2: return ``` Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D103102	2021-05-27 09:21:59 -07:00
thomasraoux	b44007bec2	[mlir][gpu] Relax restriction on MMA store op to allow chain of mma ops. In order to allow large matmul operations using the MMA ops we need to chain operations this is not possible unless "DOp" and "COp" type have matching layout so remove the "DOp" layout and force accumulator and result type to match. Added a test for the case where the MMA value is accumulated. Differential Revision: https://reviews.llvm.org/D103023	2021-05-27 09:13:51 -07:00
Nicolas Vasilache	ce4f99e7f2	[mlir][Linalg] Add comprehensive bufferization support for subtensor (5/n) This revision refactors and simplifies the pattern detection logic: thanks to SSA value properties, we can actually look at all the uses of a given value and avoid having to pattern-match specific chains of operations. A bufferization pattern for subtensor is added and specific inplaceability analysis is implemented for the simple case of subtensor. More advanced use cases will follow. Differential revision: https://reviews.llvm.org/D102512	2021-05-27 12:48:08 +00:00
Alexander Belyaev	281ee42911	[mlir] Add a pass to distribute linalg::TiledLoopOp. Differential Revision: https://reviews.llvm.org/D103194	2021-05-27 08:45:20 +02:00
Frank Laub	b5c3f17e70	[MLIR] Add support for empty IVs to affine.parallel Allow support for specifying empty IVs in an `affine.parallel`. For example: ``` affine.parallel () = () to () { affine.yield } ``` Reviewed By: bondhugula, jbruestle Differential Revision: https://reviews.llvm.org/D102895	2021-05-26 23:45:11 +00:00
Alexander Belyaev	74a89cba8c	[mlir] Add `distributionTypes` to LinalgTilingOptions. Differential Revision: https://reviews.llvm.org/D103161	2021-05-26 17:51:38 +02:00
Adrian Kuegel	dee46d0829	[mlir] Fold complex.create(complex.re(op), complex.im(op)) Differential Revision: https://reviews.llvm.org/D103148	2021-05-26 14:02:53 +02:00
Adrian Kuegel	cb65419b1a	[mlir] Simplify folding code (NFC)	2021-05-26 11:00:07 +02:00
Adrian Kuegel	b99f892b02	[mlir] Fold complex.re(complex.create) and complex.im(complex.create) This extends the folding we already have. A test needs to be adjusted. Differential Revision: https://reviews.llvm.org/D103141	2021-05-26 10:53:05 +02:00
Alexander Belyaev	2ea6e13bf8	[mlir] Add an optional distributionTypes attribute to TiledLoopOp. Differential Revision: https://reviews.llvm.org/D103104	2021-05-25 20:04:41 +02:00
Vinayaka Bandishti	eff269fc9f	[MLIR][Affine][LICM] Mark users of `iter_args` variant Prevent users of `iter_args` of an affine for loop from being hoisted out of it. Otherwise, LICM leads to a violation of the SSA dominance (as demonstrated in the added test case). Fixes: https://bugs.llvm.org/show_bug.cgi?id=50103 Reviewed By: bondhugula, ayzhuang Differential Revision: https://reviews.llvm.org/D102984	2021-05-25 15:56:52 +05:30
Tres Popp	9ccdc2e23b	[mlir] Fold memref.dim of OffsetSizeAndStrideOpInterface outputs This previously handled memref::SubviewOp, but this can be extended to all ops implementing the interface. Differential Revision: https://reviews.llvm.org/D103076	2021-05-25 12:16:10 +02:00
Uday Bondhugula	9c21ddb70a	[MLIR] Make MLIR cmake variable names consistent Fix inconsistent MLIR CMake variable names. Consistently name them as MLIR_ENABLE_<feature>. Eg: MLIR_CUDA_RUNNER_ENABLED -> MLIR_ENABLE_CUDA_RUNNER MLIR follows (or has mostly followed) the convention of naming cmake enabling variables in the from MLIR_ENABLE_... etc. Using a convention here is easy and also important for convenience. A counter pattern was started with variables named MLIR_..._ENABLED. This led to a sequence of related counter patterns: MLIR_CUDA_RUNNER_ENABLED, MLIR_ROCM_RUNNER_ENABLED, etc.. From a naming standpoint, the imperative form is more meaningful. Additional discussion at: https://llvm.discourse.group/t/mlir-cmake-enable-variable-naming-convention/3520 Switch all inconsistent ones to the ENABLE form. Keep the couple of old mappings needed until buildbot config is migrated. Differential Revision: https://reviews.llvm.org/D102976	2021-05-24 08:43:10 +05:30
Philipp Krones	c2f819af73	[MC] Refactor MCObjectFileInfo initialization and allow targets to create MCObjectFileInfo This makes it possible for targets to define their own MCObjectFileInfo. This MCObjectFileInfo is then used to determine things like section alignment. This is a follow up to D101462 and prepares for the RISCV backend defining the text section alignment depending on the enabled extensions. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101921	2021-05-23 14:15:23 -07:00
Butygin	4184018253	[mlir][SCF] Canonicalize nested ParallelOp's Differential Revision: https://reviews.llvm.org/D102799	2021-05-22 14:00:00 +03:00
Aart Bik	c194b49c9c	[mlir][sparse] add full dimension ordering support This revision completes the "dimension ordering" feature of sparse tensor types that enables the programmer to define a preferred order on dimension access (other than the default left-to-right order). This enables e.g. selection of column-major over row-major storage for sparse matrices, but generalized to any rank, as in: dimOrdering = affine_map<(i,j,k,l,m,n,o,p) -> (p,o,j,k,i,l,m,n)> Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102856	2021-05-21 12:35:13 -07:00
Alexander Belyaev	335fa18028	[mlir] NFC: Expose tiled_loop->scf pattern. Differential Revision: https://reviews.llvm.org/D102921	2021-05-21 18:19:00 +02:00
Alexander Belyaev	9ecc8178d7	[mlir] Add support for fusion into TiledLoopOp. Differential Revision: https://reviews.llvm.org/D102722	2021-05-21 18:13:45 +02:00
Stephan Herhut	90e55dfcf4	[mlir][memref] Improve canonicalization of memref.clone The previous implementation did not handle casting behavior properly and did not consider aliases. Differential Revision: https://reviews.llvm.org/D102785	2021-05-21 16:34:50 +02:00
Stephan Herhut	884a6291f0	[mlir][linalg] Add scalar operands inlining pattern This pattern inlines operands to a linalg.generic operation that use a constant index and hence are loop-invariant scalars. This reduces the number of linalg.generic operands and unlocks some canonicalizations that rely on seeing an explicit tensor.extract. Differential Revision: https://reviews.llvm.org/D102682	2021-05-21 15:23:28 +02:00
Nicolas Vasilache	8eb18a0f3e	[mlir][Standard] NFC - Drop remaining EDSC usage Drop the remaining EDSC subdirectories and update all uses. Differential Revision: https://reviews.llvm.org/D102911	2021-05-21 10:40:39 +00:00
Nicolas Vasilache	e84a9b9bb3	[mlir][Affine] NFC - Drop Affine EDSC usage Drop the Affine dialect EDSC subdirectory and update all uses. Differential Revision: https://reviews.llvm.org/D102878	2021-05-20 21:45:45 +00:00
Nicolas Vasilache	e3cf7c88c4	[mlir][MemRef] NFC - Drop MemRef EDSC usage Drop the MemRef dialect EDSC subdirectory and update all uses. Differential Revision: https://reviews.llvm.org/D102868	2021-05-20 20:13:58 +00:00
Nicolas Vasilache	4519ca3d2e	[mlir][Linalg] NFC - Drop Linalg EDSC usage Drop the Linalg dialect EDSC subdirectory and update all uses. Differential Revision: https://reviews.llvm.org/D102848	2021-05-20 15:33:56 +00:00
Adrian Kuegel	a28fe17d73	[mlir] Add EqualOp and NotEqualOp to complex dialect.	2021-05-20 13:25:07 +02:00
Nicolas Vasilache	ef33c6e3ce	[mlir][Linalg] Drop spurious usage of OperationFolder Instead, use createOrFold builders which result in more static information available. Differential Revision: https://reviews.llvm.org/D102832	2021-05-20 09:17:58 +00:00
Aart Bik	bf9ef3efaa	[mlir][sparse] skip sparsification for unannotated (or unhandled) cases Skip the sparsification pass for Linalg ops without annotated tensors (or cases that are not properly handled yet). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102787	2021-05-19 13:49:28 -07:00
Nicolas Vasilache	84a880e1e2	[mlir][SCF] NFC - Drop SCF EDSC usage Drop the SCF dialect EDSC subdirectory and update all uses. Differential Revision: https://reviews.llvm.org/D102780	2021-05-19 15:52:14 +00:00
Tobias Gysi	9a2769db80	[mir][Python][linalg] Support OpDSL extensions in C++. The patch extends the yaml code generation to support the following new OpDSL constructs: - captures - constants - iteration index accesses - predefined types These changes have been introduced by revision https://reviews.llvm.org/D101364. Differential Revision: https://reviews.llvm.org/D102075	2021-05-19 13:36:56 +00:00
Nicolas Vasilache	6825bfe23e	[mlir][Vector] NFC - Drop vector EDSC usage Drop the vector dialect EDSC subdirectory and update all uses.	2021-05-19 12:44:38 +00:00
Matthias Springer	fb7ec1f187	[mlir] Use VectorTransferPermutationMapLoweringPatterns in VectorToSCF VectorTransferPermutationMapLoweringPatterns can be enabled via a pass option. These additional patterns lower permutation maps to minor identity maps with broadcasting, if possible, allowing for more efficient vector load/stores. The option is deactivated by default. Differential Revision: https://reviews.llvm.org/D102593	2021-05-19 14:46:19 +09:00
MaheshRavishankar	e2b365948b	[mlir][Linalg] Break unnecessary dependency through unused `outs` tensor. LinalgOps that are all parallel do not use the value of `outs` tensor. The semantics is that the `outs` tensor is fully overwritten. Using anything other than `init_tensor` can add false dependencies between operations, when the use is just for the shape of the tensor. Adding a canonicalization to always use `init_tensor` in such cases, breaks this dependence. Differential Revision: https://reviews.llvm.org/D102561	2021-05-18 22:31:42 -07:00
Wenyi Zhao	851d02f61e	Enhance InferShapedTypeOpInterface to make it accessible during dialect conversion Original interfaces are not safe to be called during dialect conversion. This is because some ops (e.g. `dynamic_reshape(input, target_shape)`) depend on the values of their operands to calculate the output shape. However the operands may be out of reach during dialect conversion (e.g. converting from tensor world to buffer world). This patch provides a new kind of interface which accpets user-provided operands to solve this problem. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D102317	2021-05-19 02:51:14 +00:00
Adrian Kuegel	fa765a0944	[mlir] Add folder for complex.ReOp and complex.ImOp. Now that complex constants are supported, we can also fold. Differential Revision: https://reviews.llvm.org/D102616	2021-05-18 11:27:23 +02:00
Jacques Pienaar	24bf554b10	Add type function for ConstShape op. - Enables inferring return type for ConstShape, takes into account valid return types; - The compatible return type function could be reused, leaving that for next use refactoring; Differential Revision: https://reviews.llvm.org/D102182	2021-05-17 11:47:19 -07:00
Aart Bik	5879da496c	[mlir][sparse] replace experimental flag with inplace attribute The experimental flag for "inplace" bufferization in the sparse compiler can be replaced with the new inplace attribute. This gives a uniform way of expressing the more efficient way of bufferization. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102538	2021-05-17 11:43:44 -07:00
Matthias Springer	2c9688d201	[mlir] Improve TransferOp verifier: broadcasts are in_bounds Broadcast dimensions of vector transfer ops are always in-bounds. This is consistent with the fact that the starting position of a transfer is always in-bounds. Differential Revision: https://reviews.llvm.org/D102566	2021-05-17 22:35:44 +09:00
Adrian Kuegel	967f07f547	Revert "[mlir] Add folder for complex.ReOp and complex.ImOp." This reverts commit `6b49834d65`. Some tests fail.	2021-05-17 13:49:42 +02:00
Adrian Kuegel	6b49834d65	[mlir] Add folder for complex.ReOp and complex.ImOp. Now that complex constants are supported, we can also fold. Differential Revision: https://reviews.llvm.org/D102609	2021-05-17 13:35:51 +02:00
Julian Gross	1fbb484ea4	[WIP][mlir] Resolve memref dependency in canonicalize pass. Splitting the memref dialect lead to an introduction of several dependencies to avoid compilation issues. The canonicalize pass also depends on the memref dialect, but it shouldn't. This patch resolves the dependencies and the unintuitive includes are removed. However, the dependency moves to the constructor of the std dialect. Differential Revision: https://reviews.llvm.org/D102060	2021-05-17 11:33:38 +02:00
Tobias Gysi	7c16f93c44	[mlir][linalg] Remove template parameter from loop lowering. Replace the templated linalgLowerOpToLoops method by three specialized methods linalgOpToLoops, LinalgOpToParallelLoops, and linalgOpToAffineLoops. Differential Revision: https://reviews.llvm.org/D102324	2021-05-17 09:31:53 +00:00
Adrian Kuegel	5ef21506b9	Add support for complex constants to MLIR core. BEGIN_PUBLIC Add support for complex constants to MLIR core. END_PUBLIC Differential Revision: https://reviews.llvm.org/D101908	2021-05-17 09:12:39 +02:00
Matthias Springer	7ddeffee55	[mlir] Lower permutation maps on TransferWriteOps Add TransferWritePermutationLowering, which replaces permutation maps of TransferWriteOps with vector.transpose. Differential Revision: https://reviews.llvm.org/D102548	2021-05-17 15:30:46 +09:00
Matthias Springer	6774e5a995	[mlir] Fix in_bounds attr handling in TransferReadPermutationLowering The in_bounds attribute should also be transposed. Differential Revision: https://reviews.llvm.org/D102572	2021-05-17 15:28:16 +09:00
Aart Bik	56fd4c1cf8	[mlir][sparse] prepare runtime support lib for multiple dim level types We are moving from just dense/compressed to more general dim level types, so we need more than just an "i1" array for annotations. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102520	2021-05-14 19:12:07 -07:00
Nicolas Vasilache	dd65f420cd	[mlir][Linalg] NFC - More gracefully degrade lookup into failure during comprehensive bufferization (4/n) Differential revsion: https://reviews.llvm.org/D102420	2021-05-14 22:12:23 +00:00
Nicolas Vasilache	6f90955f69	[mlir][Linalg] Add support for subtensor_insert comprehensive bufferization (3/n) Differential revision: https://reviews.llvm.org/D102417	2021-05-14 21:51:00 +00:00
Ian Bearman	0816b96a10	Allow same memory space for SRC and DST of dma_start operations This change allows the SRC and DST of dma_start operations to be located in the same memory space. This applies to both the Affine dialect and Memref dialect versions of these Ops. The documention has been updated to reflect this by explicitly stating overlapping memory locations are not supported (undefined behavior). Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D102274	2021-05-14 10:40:15 -07:00
Rahul Joshi	23a84e1c60	[MLIR] Fix build failures due to unused variables in non-debug builds. Differential Revision: https://reviews.llvm.org/D102458	2021-05-13 18:42:48 -07:00
Nicolas Vasilache	bebf5d56bf	[mlir][Linalg] Add support for vector.transfer ops to comprehensive bufferization (2/n). Differential revision: https://reviews.llvm.org/D102395	2021-05-13 22:26:28 +00:00
Nicolas Vasilache	1e01a8919f	[mlir][Linalg] Add ComprehensiveBufferize for functions(step 1/n) This is the first step towards upstreaming comprehensive bufferization following the discourse post: https://llvm.discourse.group/t/rfc-linalg-on-tensors-update-and-comprehensive-bufferization-rfc/3373/6. This first commit introduces a basic pass for bufferizing within function boundaries, assuming that the inplaceable function boundaries have been marked as such. Differential revision: https://reviews.llvm.org/D101693	2021-05-13 22:24:40 +00:00
Sean Silva	12874e93a1	[mlir][NFC] Add helper for common pattern of replaceAllUsesExcept This covers the extremely common case of replacing all uses of a Value with a new op that is itself a user of the original Value. This should also be a little bit more efficient than the `SmallPtrSet<Operation *, 1>{op}` idiom that was being used before. Differential Revision: https://reviews.llvm.org/D102373	2021-05-13 12:42:10 -07:00
Weiwei Li	cd0eeb52ad	[mlir][spirv] Define spv.ImageQuerySize operation Support OpImageQuerySize in spirv dialect co-authored-by: Alan Liu <alanliu.yf@gmail.com> Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D102029	2021-05-13 13:17:08 -04:00
Tobias Gysi	cf194da1bb	[mlir][linalg] Remove IndexedGenericOp support from FusionOnTensors... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102163	2021-05-13 14:57:16 +00:00
Tobias Gysi	f358c37209	[mlir][linalg] Remove IndexedGenericOp support from DropUnitDims... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102235	2021-05-13 14:18:59 +00:00
Matthias Springer	60da33c2d4	[mlir] Support masks in TransferOpReduceRank and TransferReadPermutationLowering These two patterns allow for more efficient codegen in VectorToSCF. Differential Revision: https://reviews.llvm.org/D102222	2021-05-13 15:08:08 +09:00
Matthias Springer	864adf399e	[mlir] Allow empty position in vector.insert and vector.extract Such ops are no-ops and are folded to their respective `source`/`vector` operand. Differential Revision: https://reviews.llvm.org/D101879	2021-05-13 12:54:18 +09:00
Matthias Springer	c52cbe63e4	[mlir] Fix masked vector transfer ops with broadcasts Broadcast dimensions of a vector transfer op have no corresponding dimension in the mask vector. E.g., a 2-D TransferReadOp, where one dimension is a broadcast, can have a 1-D `mask` attribute. This commit also adds a few additional transfer op integration tests for various combinations of broadcasts, masking, dim transposes, etc. Differential Revision: https://reviews.llvm.org/D101745	2021-05-13 12:46:03 +09:00
Matthias Springer	6555e53ab0	Revert "[mlir] Fix masked vector transfer ops with broadcasts" This reverts commit `c9087788f7`. Accidentally pushed old version of the commit.	2021-05-13 11:55:00 +09:00
Matthias Springer	c9087788f7	[mlir] Fix masked vector transfer ops with broadcasts Broadcast dimensions of a vector transfer op have no corresponding dimension in the mask vector. E.g., a 2-D TransferReadOp, where one dimension is a broadcast, can have a 1-D `mask` attribute. This commit also adds a few additional transfer op integration tests for various combinations of broadcasts, masking, dim transposes, etc. Differential Revision: https://reviews.llvm.org/D101745	2021-05-13 11:37:36 +09:00
Rob Suderman	7b57517507	[mlir][linalg] Fixed issue generating reassociation map with Rank-0 types Rank-0 case causes a graph during linalg reshape operation. Differential Revision: https://reviews.llvm.org/D102282	2021-05-12 11:00:51 -07:00
Inho Seo	5480ea6c84	Update static bound checker for Linalg to cover decreasing cases The current static checker for linalg does not work on the decreasing index cases well. So, this is to Update the current static bound checker for linalg to cover decreasing index cases. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D102302	2021-05-12 10:29:19 -07:00
Aart Bik	ca5d0a7310	[mlir][sparse] keep runtime support library signature consistent Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102285	2021-05-12 09:59:46 -07:00
Valentin Clement	6110b667b0	[mlir][openacc] Conversion of data operand to LLVM IR dialect Add a conversion pass to convert higher-level type before translation. This conversion extract meangingful information and pack it into a struct that the translation (D101504) will be able to understand. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D102170	2021-05-12 11:34:15 -04:00
Tobias Gysi	06bb9cf30d	[mlir][linalg] Remove IndexedGenericOp support from LinalgInterchangePattern... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102245	2021-05-12 13:01:37 +00:00
Tobias Gysi	c6b96ae06f	[mlir][linalg] Remove IndexedGenericOp support from LinalgBufferize... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102308	2021-05-12 12:15:05 +00:00
Dumitru Potop	9a0ea5994b	[mlir] Support alignment in LLVM dialect GlobalOp First step in adding alignment as an attribute to MLIR global definitions. Alignment can be specified for global objects in LLVM IR. It can also be specified as a named attribute in the LLVMIR dialect of MLIR. However, this attribute has no standing and is discarded during translation from MLIR to LLVM IR. This patch does two things: First, it adds the attribute to the syntax of the llvm.mlir.global operation, and by doing this it also adds accessors and verifications. The syntax is "align=XX" (with XX being an integer), placed right after the value of the operation. Second, it allows transforming this operation to and from LLVM IR. It is checked whether the value is an integer power of 2. Reviewed By: ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D101492	2021-05-12 09:07:20 +02:00
Benjamin Kramer	b20e150c9b	[mlir] Use static shape knowledge when lowering memref.reshape This is actually necessary for correctness, as memref.reinterpret_cast doesn't verify if the output shape doesn't match the static sizes. Differential Revision: https://reviews.llvm.org/D102232	2021-05-11 18:21:09 +02:00
Uday Bondhugula	1c777ab459	[MLIR] Switch llvm.noalias to a unit attribute Switch llvm.noalias attribute from a boolean attribute to a unit attribute. Differential Revision: https://reviews.llvm.org/D102225	2021-05-11 15:41:09 +05:30
Tres Popp	88a48999d2	Support VectorTransfer splitting on writes also. VectorTransfer split previously only split read xfer ops. This adds the same logic to write ops. The resulting code involves 2 conditionals for write ops while read ops only needed 1, but the created ops are built upon the same patterns, so pattern matching/expectations are all consistent other than in regards to the if/else ops. Differential Revision: https://reviews.llvm.org/D102157	2021-05-11 10:33:27 +02:00
Tobias Gysi	7bc6df2528	[mlir][linalg] Remove IndexedGenericOp support from LinalgToLoops... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102187	2021-05-11 06:53:47 +00:00
Tobias Gysi	6676e09b22	[mlir][linalg] Remove IndexedGenericOp support from Fusion... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102174	2021-05-11 06:49:25 +00:00
Tobias Gysi	d69bccf1ed	[mlir][linalg] Remove IndexedGenericOp support from Tiling... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102176	2021-05-11 05:53:58 +00:00
Aart Bik	bf812ea484	[mlir][linalg] remove the -now- obsolete sparse support in linalg All glue and clutter in the linalg ops has been replaced by proper sparse tensor type encoding. This code is no longer needed. Thanks to ntv@ for giving us a temporary home in linalg. So long, and thanks for all the fish. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102098	2021-05-10 16:49:33 -07:00
Benjamin Kramer	7b52aeadfa	[mlir][Tensor] Add folding for tensor.from_elements This trivially folds into a constant when all operands are constant. Differential Revision: https://reviews.llvm.org/D102199	2021-05-11 00:42:45 +02:00
Aart Bik	96a23911f6	[mlir][sparse] complete migration to sparse tensor type A very elaborate, but also very fun revision because all puzzle pieces are finally "falling in place". 1. replaces lingalg annotations + flags with proper sparse tensor types 2. add rigorous verification on sparse tensor type and sparse primitives 3. removes glue and clutter on opaque pointers in favor of sparse tensor types 4. migrates all tests to use sparse tensor types NOTE: next CL will remove all obsoleted sparse code in Linalg Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102095	2021-05-10 12:55:22 -07:00
Lei Zhang	7e71823f1d	[mlir][linalg] Restrict distribution to parallel dims According to the API contract, LinalgLoopDistributionOptions expects to work on parallel iterators. When getting processor information, only loop ranges for parallel dimensions should be fed in. But right now after generating scf.for loop nests, we feed in all loops, including the ones materialized for reduction iterators. This can cause unexpected distribution of reduction dimensions. This commit fixes it. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D102079	2021-05-10 15:23:00 -04:00
Julian Gross	fc253e69f9	Fixed bug in buffer deallocation pass using unranked memref types. In the buffer deallocation pass, unranked memref types are not properly supported. After investigating this issue, it turns out that the Clone and Dealloc operation does not support unranked memref types in the current implementation. This patch adds the missing feature and enables the transformation of any memref type. This patch solves this bug: https://bugs.llvm.org/show_bug.cgi?id=48385 Differential Revision: https://reviews.llvm.org/D101760	2021-05-10 10:50:29 +02:00
Frederik Gossen	a81e45b8bc	[MLIR][Shape] Concretize broadcast result type if possible As a canonicalization, infer the resulting shape rank if possible. Differential Revision: https://reviews.llvm.org/D102068	2021-05-10 10:24:08 +02:00
River Riddle	53b946aa63	[mlir] Refactor the representation of function-like argument/result attributes. The current design uses a unique entry for each argument/result attribute, with the name of the entry being something like "arg0". This provides for a somewhat sparse design, but ends up being much more expensive (from a runtime perspective) in-practice. The design requires building a string every time we lookup the dictionary for a specific arg/result, and also requires N attribute lookups when collecting all of the arg/result attribute dictionaries. This revision restructures the design to instead have an ArrayAttr that contains all of the attribute dictionaries for arguments and another for results. This design reduces the number of attribute name lookups to 1, and allows for O(1) lookup for individual element dictionaries. The major downside is that we can end up with larger memory usage, as the ArrayAttr contains an entry for each element even if that element has no attributes. If the memory usage becomes too problematic, we can experiment with a more sparse structure that still provides a lot of the wins in this revision. This dropped the compilation time of a somewhat large TensorFlow model from ~650 seconds to ~400 seconds. Differential Revision: https://reviews.llvm.org/D102035	2021-05-07 19:32:31 -07:00
thomasraoux	6aaf06f929	[mlir][vector] Fix warning Previous change caused another warning in some build configuration: "default label in switch which covers all enumeration values"	2021-05-07 17:12:47 -07:00
thomasraoux	b90b66bcbe	[mlir] Missed clang-format	2021-05-07 13:57:34 -07:00
thomasraoux	d0453a8933	[mlir][vector] Extend pattern to trim lead unit dimension to Splat Op Differential Revision: https://reviews.llvm.org/D102091	2021-05-07 13:54:41 -07:00
Alexander Belyaev	3444996b4c	[mlir] Add a pattern to bufferize std.index_cast. Differential Revision: https://reviews.llvm.org/D102088	2021-05-07 21:32:02 +02:00
Alexander Belyaev	a3f22d020b	[mlir] Add a pattern to bufferize linalg.tensor_reshape. Differential Revision: https://reviews.llvm.org/D102089	2021-05-07 21:31:17 +02:00
thomasraoux	a970e69d6b	[mlir][vector] add pattern to cast away leading unit dim for elementwise op Differential Revision: https://reviews.llvm.org/D102034	2021-05-07 07:54:09 -07:00
Tobias Gysi	f31531a30b	[mlir][linalg] Remove redundant indexOp builder. Remove the builder signature taking a signed dimension identifier. Reviewed By: ergawy Differential Revision: https://reviews.llvm.org/D102055	2021-05-07 14:22:12 +00:00
Tobias Gysi	26e916334e	[mlir][linalg] Add IndexedGenericOp to GenericOp canonicalization. Replace all `linalg.indexed_generic` ops by `linalg.generic` ops that access the iteration indices using the `linalg.index` op. Differential Revision: https://reviews.llvm.org/D101612	2021-05-07 06:00:16 +00:00
MaheshRavishankar	05a89312d8	[mlir][Linalg] Allow folding to rank-zero tensor when using rank-reducing subtensors. The pattern to convert subtensor ops to their rank-reduced versions (by dropping unit-dims in the result) can also convert to a zero-rank tensor. Handle that case. This also fixes a OOB access bug in the existing pattern for such cases. Differential Revision: https://reviews.llvm.org/D101949	2021-05-06 19:03:55 -07:00
Lei Zhang	41bc54cc56	[mlir][spirv] NFC: Replace OwningSPIRVModuleRef with OwningOpRef Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D102009	2021-05-06 17:17:44 -04:00
thomasraoux	71eb32d97e	[mlir][vector] Fix typo	2021-05-06 10:12:31 -07:00
thomasraoux	52525cb20f	[mlir][linalg][NFC] Make reshape folding control more fine grain This expose a lambda control instead of just a boolean to control unit dimension folding. This however gives more control to user to pick a good heuristic. Folding reshapes helps fusion opportunities but may generate sub-optimal generic ops. Differential Revision: https://reviews.llvm.org/D101917	2021-05-06 10:11:39 -07:00
thomasraoux	933551eaeb	[mlir][NFC] Fix warning in VectorTransforms.cpp	2021-05-06 08:11:42 -07:00
thomasraoux	0b303da6f8	[mlir][vector] add pattern to cast away lead unit dimension for broadcast op Differential Revision: https://reviews.llvm.org/D101955	2021-05-06 08:02:17 -07:00
Christian Sigg	a0d019fc89	[mlir] Add support for ops with regions in 'gpu-async-region' rewriter. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D101757	2021-05-06 13:21:28 +02:00
Navdeep Kumar	875eb523c1	[MLIR][GPU][NVVM] Add warp synchronous matrix-multiply accumulate ops Add warp synchronous matrix-multiply accumulate ops in GPU and NVVM dialect. Add following three ops to GPU dialect :- 1.) subgroup_mma_load_matrix 2.) subgroup_mma_store_matrix 3.) subgroup_mma_compute Add following three ops to NVVM dialect :- 1.) wmma.m16n16k16.load.[a,b,c].[f16,f32].row.stride 2.) wmma.m16n16k16.store.d.[f16,f32].row.stride 3.) wmma.m16n16k16.mma.row.row.[f16,f32].[f16,f32] Reviewed By: bondhugula, ftynse, ThomasRaoux Differential Revision: https://reviews.llvm.org/D95330	2021-05-06 12:06:25 +05:30
MaheshRavishankar	b6060b7673	[mlir][Linalg] Fix element type of results when folding reshapes. Fixing a minor bug which lead to element type of the output being modified when folding reshapes with generic op. Differential Revision: https://reviews.llvm.org/D101942	2021-05-05 15:40:41 -07:00
Emilio Cota	0edc4bc84a	[mlir] Add polynomial approximation for math::ExpM1 This approximation matches the one in Eigen. ``` name old cpu/op new cpu/op delta BM_mlir_Expm1_f32/10 90.9ns ± 4% 52.2ns ± 4% -42.60% (p=0.000 n=74+87) BM_mlir_Expm1_f32/100 837ns ± 3% 231ns ± 4% -72.43% (p=0.000 n=79+69) BM_mlir_Expm1_f32/1k 8.43µs ± 3% 1.58µs ± 5% -81.30% (p=0.000 n=77+83) BM_mlir_Expm1_f32/10k 83.8µs ± 3% 15.4µs ± 5% -81.65% (p=0.000 n=83+69) BM_eigen_s_Expm1_f32/10 68.8ns ±17% 72.5ns ±14% +5.40% (p=0.000 n=118+115) BM_eigen_s_Expm1_f32/100 694ns ±11% 717ns ± 2% +3.34% (p=0.000 n=120+75) BM_eigen_s_Expm1_f32/1k 7.69µs ± 2% 7.97µs ±11% +3.56% (p=0.000 n=95+117) BM_eigen_s_Expm1_f32/10k 88.0µs ± 1% 89.3µs ± 6% +1.45% (p=0.000 n=74+106) BM_eigen_v_Expm1_f32/10 44.3ns ± 6% 45.0ns ± 8% +1.45% (p=0.018 n=81+111) BM_eigen_v_Expm1_f32/100 351ns ± 1% 360ns ± 9% +2.58% (p=0.000 n=73+99) BM_eigen_v_Expm1_f32/1k 3.31µs ± 1% 3.42µs ± 9% +3.37% (p=0.000 n=71+100) BM_eigen_v_Expm1_f32/10k 33.7µs ± 8% 34.1µs ± 9% +1.04% (p=0.007 n=99+98) ``` Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D101852	2021-05-05 14:31:34 -07:00
Philipp Krones	632ebc4ab4	[MC] Untangle MCContext and MCObjectFileInfo This untangles the MCContext and the MCObjectFileInfo. There is a circular dependency between MCContext and MCObjectFileInfo. Currently this dependency also exists during construction: You can't contruct a MOFI without a MCContext without constructing the MCContext with a dummy version of that MOFI first. This removes this dependency during construction. In a perfect world, MCObjectFileInfo wouldn't depend on MCContext at all, but only be stored in the MCContext, like other MC information. This is future work. This also shifts/adds more information to the MCContext making it more available to the different targets. Namely: - TargetTriple - ObjectFileType - SubtargetInfo Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101462	2021-05-05 10:03:02 -07:00
Javier Setoain	95861216ac	[mlir][ArmSVE] Add masked arithmetic operations These instructions map to SVE-specific instrinsics that accept a predicate operand to support control flow in vector code. Differential Revision: https://reviews.llvm.org/D100982	2021-05-05 17:41:58 +01:00
Sergei Grechanik	d80b04ab00	[mlir][Affine][Vector] Support vectorizing reduction loops This patch adds support for vectorizing loops with 'iter_args' implementing known reductions along the vector dimension. Comparing to the non-vector-dimension case, two additional things are done during vectorization of such loops: - The resulting vector returned from the loop is reduced to a scalar using `vector.reduce`. - In some cases a mask is applied to the vector yielded at the end of the loop to prevent garbage values from being written to the accumulator. Vectorization of reduction loops is disabled by default. To enable it, a map from loops to array of reduction descriptors should be explicitly passed to `vectorizeAffineLoops`, or `vectorize-reductions=true` should be passed to the SuperVectorize pass. Current limitations: - Loops with a non-unit step size are not supported. - n-D vectorization with n > 1 is not supported. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D100694	2021-05-05 09:03:59 -07:00
Tobias Gysi	4a6ee23d83	[mlir][linalg] Fix bug in the fusion on tensors index op handling. The old index op handling let the new index operations point back to the producer block. As a result, after fusion some index operations in the fused block had back references to the old producer block resulting in illegal IR. The patch now relies on a block and value mapping to avoid such back references. Differential Revision: https://reviews.llvm.org/D101887	2021-05-05 14:46:08 +00:00
Alexander Belyaev	2865d114f9	[mlir] Use ReassociationIndices instead of affine maps in linalg.reshape. Differential Revision: https://reviews.llvm.org/D101861	2021-05-05 12:59:57 +02:00
Javier Setoain	001d601ac4	[mlir][ArmSVE] Add basic arithmetic operations While we figure out how to best add Standard support for scalable vectors, these instructions provide a workaround for basic arithmetic between scalable vectors. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D100837	2021-05-05 09:50:18 +02:00
William S. Moses	f4a2dbfe29	[MLIR][SCF] Combine adjacent scf.if with same condition Differential Revision: https://reviews.llvm.org/D101798	2021-05-05 00:39:58 -04:00
Aart Bik	a2c9d4bb04	[mlir][sparse] Introduce proper sparsification passes This revision migrates more code from Linalg into the new permanent home of SparseTensor. It replaces the test passes with proper compiler passes. NOTE: the actual removal of the last glue and clutter in Linalg will follow Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D101811	2021-05-04 17:10:09 -07:00
William S. Moses	cb395b84b0	[MLIR] Add not icmp canonicalization documentation See: https://reviews.llvm.org/D101710	2021-05-04 11:44:25 -04:00
William S. Moses	8e211bf1c8	[MLIR][SCF] Assume uses of condition in the body of scf.while is true Differential Revision: https://reviews.llvm.org/D101801	2021-05-04 11:39:07 -04:00
William S. Moses	93297e4bac	[MLIR] Replace a not of a comparison with appropriate comparison Differential Revision: https://reviews.llvm.org/D101710	2021-05-04 11:23:29 -04:00
Tobias Gysi	05d2297b86	[mlir][linalg] Always lower index operations during loop lowering. Ensure the index operations are lowered on all linalg loop lowering paths. Differential Revision: https://reviews.llvm.org/D101827	2021-05-04 14:30:59 +00:00
Matthias Springer	aa58281979	[mlir] Fix bug in TransferOpReduceRank when all dims are broadcasts TransferReadOps that are a scalar read + broadcast are handled by TransferReadToVectorLoadLowering. Differential Revision: https://reviews.llvm.org/D101808	2021-05-04 11:21:44 +09:00
Eugene Zhulenev	9b67096fe9	[mlir] Linalg: add vector transfer lowering patterns to the contraction lowering This fixes a performance regression in vec-mat vectorization Reviewed By: asaadaldien Differential Revision: https://reviews.llvm.org/D101795	2021-05-03 16:21:51 -07:00
Emilio Cota	1c0374e770	[mlir] Add polynomial approximation for math::Log1p This approximation matches the one in Eigen. ``` name old cpu/op new cpu/op delta BM_mlir_Log1p_f32/10 83.2ns ± 7% 34.8ns ± 5% -58.19% (p=0.000 n=84+71) BM_mlir_Log1p_f32/100 664ns ± 4% 129ns ± 4% -80.57% (p=0.000 n=82+82) BM_mlir_Log1p_f32/1k 6.75µs ± 4% 0.81µs ± 3% -88.07% (p=0.000 n=88+79) BM_mlir_Log1p_f32/10k 76.5µs ± 3% 7.8µs ± 4% -89.84% (p=0.000 n=80+80) BM_eigen_s_Log1p_f32/10 70.1ns ±14% 72.6ns ±14% +3.49% (p=0.000 n=116+112) BM_eigen_s_Log1p_f32/100 706ns ± 9% 717ns ± 3% +1.60% (p=0.018 n=117+80) BM_eigen_s_Log1p_f32/1k 8.26µs ± 1% 8.26µs ± 1% ~ (p=0.567 n=84+86) BM_eigen_s_Log1p_f32/10k 92.1µs ± 5% 92.6µs ± 6% +0.60% (p=0.047 n=115+115) BM_eigen_v_Log1p_f32/10 31.8ns ±24% 34.9ns ±17% +9.72% (p=0.000 n=98+96) BM_eigen_v_Log1p_f32/100 169ns ±10% 177ns ± 5% +4.66% (p=0.000 n=119+81) BM_eigen_v_Log1p_f32/1k 1.42µs ± 4% 1.46µs ± 8% +2.70% (p=0.000 n=93+113) BM_eigen_v_Log1p_f32/10k 14.4µs ± 5% 14.9µs ± 8% +3.61% (p=0.000 n=115+110) ``` Reviewed By: ezhulenev, ftynse Differential Revision: https://reviews.llvm.org/D101765	2021-05-03 15:11:37 -07:00
MaheshRavishankar	a6e09391bb	[mlir][Linalg] Add a utility method to get reassociations maps for reshape. Given the source and destination shapes, if they are static, or if the expanded/collapsed dimensions are unit-extent, it is possible to compute the reassociation maps that can be used to reshape one type into another. Add a utility method to return the reassociation maps when possible. This utility function can be used to fuse a sequence of reshape ops, given the type of the source of the producer and the final result type. This pattern supercedes a more constrained folding pattern added to DropUnitDims pass. Differential Revision: https://reviews.llvm.org/D101343	2021-05-03 14:40:15 -07:00
MaheshRavishankar	fd15e2b825	[mlir][Linalg] Use rank-reduced versions of subtensor and subtensor insert when possible. Convert subtensor and subtensor_insert operations to use their rank-reduced versions to drop unit dimensions. Differential Revision: https://reviews.llvm.org/D101495	2021-05-03 12:51:24 -07:00
thomasraoux	9621c1ef56	[mlir][linalg] Fix vectorization bug in vector transfer indexing map calculation The current implementation had a bug as it was relying on the target vector dimension sizes to calculate where to insert broadcast. If several dimensions have the same size we may insert the broadcast on the wrong dimension. The correct broadcast cannot be inferred from the type of the source and destination vector. Instead when we want to extend transfer ops we calculate an "inverse" map to the projected permutation and insert broadcast in place of the projected dimensions. Differential Revision: https://reviews.llvm.org/D101738	2021-05-03 12:16:38 -07:00
Frederik Gossen	456efbc0f1	[MLIR][Linalg] Avoid forward declaration in `Loops.cpp` Differential Revision: https://reviews.llvm.org/D101771	2021-05-03 21:06:50 +02:00
Frederik Gossen	ec339163a7	[MLIR][Linalg] Lower `linalg.tiled_loop` in a separate pass Add dedicated pass `convert-linalg-tiled-loops-to-scf` to lower `linalg.tiled_loop`s. Differential Revision: https://reviews.llvm.org/D101768	2021-05-03 21:02:02 +02:00
thomasraoux	f44c76d6e9	[mlir][vector] Extend vector transfer unrolling to support permutations and broadcast Differential Revision: https://reviews.llvm.org/D101637	2021-05-03 10:47:02 -07:00
thomasraoux	7417541fd8	[mlir][vector] Add canonicalization for extract/insert -> shapecast Differential Revision: https://reviews.llvm.org/D101643	2021-05-03 10:41:15 -07:00
thomasraoux	be8e2801a4	[mlir][vector][NFC] split TransposeOp lowerning out of contractLowering Move TransposeOp lowering in its own populate function as in some cases it is better to keep it during ContractOp lowering to better canonicalize it rather than emiting scalar insert/extract. Differential Revision: https://reviews.llvm.org/D101647	2021-05-03 10:23:45 -07:00
Frederik Gossen	d2a291a5f8	[MLIR][Linalg] Lower `linalg.tiled_loop` to `scf` loops Differential Revision: https://reviews.llvm.org/D101747	2021-05-03 18:47:12 +02:00
William S. Moses	039bdcc0a8	[MLIR] Canonicalize sub/add of a constant and another sub/add of a constant Differential Revision: https://reviews.llvm.org/D101705	2021-05-03 11:49:23 -04:00
William S. Moses	78720296f3	[MLIR] Canonicalization of Integer Cast Operations 1) Canonicalize IndexCast(SExt(x)) => IndexCast(x) 2) Provide constant folds of sign_extend and truncate Differential Revision: https://reviews.llvm.org/D101714	2021-05-02 11:22:18 -04:00
eopXD	0c1ff26bd3	[mlir] [affine] add canonicalization for affine.vector_load, vector_store Added canonicalization for vector_load and vector_store. An existing pattern SimplifyAffineOp can be reused to compose maps that supplies result into them. Added AffineVectorStoreOp and AffineVectorLoadOp into static_assert of SimplifyAffineOp to allow operation to use it. This fixes the bug filed: https://bugs.llvm.org/show_bug.cgi?id=50058 Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D101691	2021-05-02 09:06:46 +05:30
Aart Bik	0a29219931	[mlir][sparse] sparse tensor type encoding migration (new home, new builders) (1) migrates the encoding from TensorDialect into the new SparseTensorDialect (2) replaces dictionary-based storage and builders with struct-like data Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D101669	2021-04-30 19:30:38 -07:00
Ahmed Taei	499e89fc91	Add patterns to lower vector.multi_reduction into a sequence of vector.reduction Three patterns are added to convert into vector.multi_reduction into a sequence of vector.reduction as the following: - Transpose the inputs so inner most dimensions are always reduction. - Reduce rank of vector.multi_reduction into 2d with inner most reduction dim (get the 2d canical form) - 2D canonical form is converted into a sequence of vector.reduction. There are two things we might worth in a follow up diff: - An scf.for (maybe optionally) around vector.reduction instead of unrolling it. - Breakdown the vector.reduction into a sequence of vector.reduction (e.g tree-based reduction) instead of relying on how downstream dialects handle it. Note: this will requires passing target-vector-length Differential Revision: https://reviews.llvm.org/D101570	2021-04-30 10:52:21 -07:00
Aart Bik	319072f4e3	[mlir][sparse] migrate sparse operations into new sparse tensor dialect This is the very first step toward removing the glue and clutter from linalg and replace it with proper sparse tensor types. This revision migrates the LinalgSparseOps into SparseTensorOps of a sparse tensor dialect. This also provides a new home for sparse tensor related transformation. NOTE: the actual replacement with sparse tensor types (and removal of linalg glue/clutter) will follow but I am trying to keep the amount of changes per revision manageable. Differential Revision: https://reviews.llvm.org/D101573	2021-04-29 15:52:35 -07:00
Mehdi Amini	086e0f05bf	Revert "[mlir][sparse] migrate sparse operations into new sparse tensor dialect" This reverts commit `a6d92a9711`. The build with -DBUILD_SHARED_LIBS=ON is broken.	2021-04-29 20:59:41 +00:00
Aart Bik	a6d92a9711	[mlir][sparse] migrate sparse operations into new sparse tensor dialect This is the very first step toward removing the glue and clutter from linalg and replace it with proper sparse tensor types. This revision migrates the LinalgSparseOps into SparseTensorOps of a sparse tensor dialect. This also provides a new home for sparse tensor related transformation. NOTE: the actual replacement with sparse tensor types (and removal of linalg glue/clutter) will follow but I am trying to keep the amount of changes per revision manageable. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D101488	2021-04-29 12:09:10 -07:00
Alex Zinenko	6841e6afba	[mlir] support max/min lower/upper bounds in affine.parallel This enables to express more complex parallel loops in the affine framework, for example, in cases of tiling by sizes not dividing loop trip counts perfectly or inner wavefront parallelism, among others. One can't use affine.max/min and supply values to the nested loop bounds since the results of such affine.max/min operations aren't valid symbols. Making them valid symbols isn't an option since they would introduce selection trees into memref subscript arithmetic as an unintended and undesired consequence. Also add support for converting such loops to SCF. Drop some API that isn't used in the core repo from AffineParallelOp since its semantics becomes ambiguous in presence of max/min bounds. Loop normalization is currently unavailable for such loops. Depends On D101171 Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D101172	2021-04-29 13:16:25 +02:00
Alex Zinenko	545fa37834	[mlir] Affine: parallelize affine loops with reductions Introduce a basic support for parallelizing affine loops with reductions expressed using iteration arguments. Affine parallelism detector now has a flag to assume such reductions are parallel. The transformation handles a subset of parallel reductions that are can be expressed using affine.parallel: integer/float addition and multiplication. This requires to detect the reduction operation since affine.parallel only supports a fixed set of reduction operators. Reviewed By: chelini, kumasento, bondhugula Differential Revision: https://reviews.llvm.org/D101171	2021-04-29 13:16:24 +02:00
Lorenzo Chelini	de94b1855c	[mlir] Fix top-level comments (NFC)	2021-04-29 13:06:40 +02:00
Tres Popp	b863af5a5e	[mlir] Add LinalgTransforms dependency on Complex	2021-04-29 12:20:44 +02:00
Tres Popp	42e5f42215	[mlir] Support complex numbers in Linalg promotion FillOp allows complex ops, and filling a properly sized buffer with a default zero complex number is implemented. Differential Revision: https://reviews.llvm.org/D99939	2021-04-29 11:58:57 +02:00
Nicolas Vasilache	b6113db955	[mlir][Linalg] Generalize linalg vectorization This revision adds support for vectorizing more general linalg operations with projected permutation maps. This is achieved by eagerly broadcasting the intermediate vector to the common size of the iteration domain of the linalg op. This allows a much more natural expression of generalized vectorization but may introduce additional computations until all the proper canonicalizations are implemented. This generalization modifies the vector.transfer_read/write permutation logic and exposes the fact that the logic employed in vector.contract was too ad-hoc. As a consequence, changes occur in the permutation / transposition logic for contraction. In turn this prompts supporting more cases in the lowering of contract to matrix intrinsics, which is required to make the corresponding tests pass. Differential revision: https://reviews.llvm.org/D101165	2021-04-29 07:44:01 +00:00
MaheshRavishankar	41849a9195	[mlir][Linalg] Avoid changing the rank of the result in canonicalizations of subtensor. Canonicalizations for subtensor operations defaulted to use the rank-reduced version of the operation, but the cast inserted to get back the original type would be illegal if the rank was actually reduced. Instead make the canonicalization not reduce the rank of the operation. Differential Revision: https://reviews.llvm.org/D101258	2021-04-28 11:33:26 -07:00
Alexander Belyaev	fa0d044c44	[mlir] Fix canonicalization of tiled_loop if not all opresults fold. The current canonicalization did not remap operation results correctly and attempted to erase tiledLoop, which is incorrect if not all tensor results are folded.	2021-04-28 19:57:48 +02:00
Frederik Gossen	511ffe17ed	Revert "[MLIR][Shape] Concretize broadcast result type if possible" This reverts commit `dca5361035`.	2021-04-28 17:16:02 +02:00
Alexander Belyaev	9a66d33452	[mlir] Fix the postsubmit comments in https://reviews.llvm.org/D101445	2021-04-28 14:58:02 +02:00
Alexander Belyaev	29dbac0ae2	[mlir] Add folding for tensor inputs and memref.cast in linalg.tiled_loop. Tensor inputs, if not used in the body of TiledLoopOp, can be removed. memref::CastOp can be folded into TiledLoopOp as well. Differential Revision: https://reviews.llvm.org/D101445	2021-04-28 14:36:07 +02:00
Frederik Gossen	dca5361035	[MLIR][Shape] Concretize broadcast result type if possible As a canonicalization, infer the resulting shape rank if possible. Differential Revision: https://reviews.llvm.org/D101377	2021-04-28 11:58:32 +02:00
Frederik Gossen	cb393f4c99	[MLIR][Shape] Canonicalize casted extent tensor operands Both, `shape.broadcast` and `shape.cstr_broadcastable` accept dynamic and static extent tensors. If their operands are casted, we can use the original value instead. Differential Revision: https://reviews.llvm.org/D101376	2021-04-28 11:51:58 +02:00
Frederik Gossen	3e037f8f0e	[MLIR][Shape] Derive more concrete type for `shape.shape_of` Also create all extent tensor constants with const_shape op. Differential Revision: https://reviews.llvm.org/D99197	2021-04-28 10:50:53 +02:00
Ahmed Taei	7fe2063446	Handle the case of tile and pad a subset of the dimensions This is useful in cases such as tile-distribute-and-pad where not all dims are tiled Differential Revision: https://reviews.llvm.org/D101319	2021-04-27 17:41:22 -07:00
Frederik Gossen	f8d7bd996f	[MLIR][Shape] Remove empty extent tensor operands Empty extent tensor operands were only removed when they were defined as a constant. Additionally, we can remove them if they are known to be empty by their type `tensor<0xindex>`. Differential Revision: https://reviews.llvm.org/D101351	2021-04-27 14:51:43 +02:00
Frederik Gossen	2b9b999d4d	[MLIR][Shape] Replace single operand broadcasts with appropriate cast Differential Revision: https://reviews.llvm.org/D101350	2021-04-27 14:48:56 +02:00
Alexander Belyaev	4b13b7581d	[mlir] Add a pass to tile Linalg ops using `linalg.tiled_loop`. Differential Revision: https://reviews.llvm.org/D101084	2021-04-27 12:33:28 +02:00
Frederik Gossen	b003ebd603	[MLIR][Linalg] Generalize splat constant folding Splat constant folding was limited to `std.constant` operations. Instead, use the constant matcher and apply splat constant folding to any constant-like operation that holds a splat attribute. Differential Revision: https://reviews.llvm.org/D101301	2021-04-27 09:08:34 +02:00
Aart Bik	23c9e8bc25	[mlir][tensors] Introduce attribute interface/attribute for tensor encoding The new "encoding" field in tensor types so far had no meaning. This revision introduces: 1. an encoding attribute interface in IR: for verification between tensors and encodings in general 2. an attribute in Tensor dialect; #tensor.sparse<dict> + concrete sparse tensors API Active discussion: https://llvm.discourse.group/t/rfc-introduce-a-sparse-tensor-type-to-core-mlir/2944/ Reviewed By: silvas, penpornk, bixia Differential Revision: https://reviews.llvm.org/D101008	2021-04-26 18:31:54 -07:00
William S. Moses	ca27260701	[MLIR] Add SCF.if Condition Canonicalizations Add two canoncalizations for scf.if. 1) A canonicalization that allows users of a condition within an if to assume the condition is true if in the true region, etc. 2) A canonicalization that removes yielded statements that are equivalent to the condition or its negation Differential Revision: https://reviews.llvm.org/D101012	2021-04-26 20:13:08 -04:00
Frederik Gossen	88b8b88035	[MLIR] Remove empty shape operands from `cstr_broadcastable` ops Differential Revision: https://reviews.llvm.org/D101170	2021-04-26 18:34:18 +02:00
Frederik Gossen	858d4885dc	[MLIR][Shape] Ensure to preserve op type of `shape.broadcast` Ensure to preserve the correct type during when folding and canonicalization. `shape.broadcast` of of a single operand can only be folded away if the argument type is correct. Differential Revision: https://reviews.llvm.org/D101158	2021-04-26 17:55:39 +02:00
Butygin	f22d381385	[mlir] Canonicalize AllocOp's with only store and dealloc uses Differential Revision: https://reviews.llvm.org/D100268	2021-04-24 09:51:00 +03:00
Alexander Belyaev	5291a7a3c7	[mlir] Add block arguments for input/output operands of 'linalg.tiled_loop`. Differential Revision: https://reviews.llvm.org/D101186	2021-04-23 20:55:20 +02:00
Alexander Belyaev	0724911d2a	[mlir] Add `tensor.reshape`. This operation a counterpart of `memref.reshape`. RFC [Reshape Ops Restructuring](https://llvm.discourse.group/t/rfc-reshape-ops-restructuring/3310) Differential Revision: https://reviews.llvm.org/D100971	2021-04-22 14:53:23 +02:00
Frederik Gossen	f0c51cb2d4	[MLIR][Shape] Add canonicalizations for `shape.broadcast` Eliminate empty shapes from the operands, partially fold all constant shape operands, and fix normal folding. Differential Revision: https://reviews.llvm.org/D100634	2021-04-22 14:11:23 +02:00
Tobias Gysi	0e777e4ad7	[mlir][linalg] remove interchange option on linalg to loop lowering. The interchange option attached to the linalg to loop lowering affects only the loops and does not update the memory accesses generated in to body of the operation. Instead of performing the interchange during the loop lowering use the interchange pattern. Differential Revision: https://reviews.llvm.org/D100758	2021-04-22 08:55:17 +00:00
thomasraoux	d40a19c3a8	[mlir][linalg] Add pattern to push reshape after elementwise operation This help expose more fusion opportunities. Differential Revision: https://reviews.llvm.org/D100685	2021-04-21 21:22:39 -07:00
Eugene Zhulenev	3f1e827abd	[mlir] Linalg : do not forward memrefs to outputs when do bufferization Example: ``` %0 = linalg.init_tensor : tensor<...> %1 = linalg.generic ... outs(%0: tensor<...>) %2 = linalg.generic ... outs(%0: tensor<...>) ``` Memref allocated as a result of `init_tensor` bufferization can be incorrectly overwritten by the second linalg.generic operation Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D100921	2021-04-21 16:39:06 -07:00
Ahmed Taei	10d7924581	Fix FoldReshapeOpWithUnitExtent generating illegal reshape This will prevent fusion that spains all dims and generates (d0, d1, ...) -> () reshape that isn't legal Differential Revision: https://reviews.llvm.org/D100805	2021-04-21 11:30:45 -07:00
Nico Weber	297a5b7cbc	[mlir] hopefully final round of iwyu fixes after `ba7a92c01e`	2021-04-21 11:03:06 -04:00
Nico Weber	56f987fafe	[mlir] yet more iwyu fixes after `ba7a92c01e`	2021-04-21 10:54:44 -04:00
thomasraoux	ded18708f9	[mlir][NFC] Refactor linalg substituteMin and AffineMinSCF canonizalizations Break up the dependency between SCF ops and substituteMin helper and make a more generic version of AffineMinSCFCanonicalization. This reduce dependencies between linalg and SCF and will allow the logic to be used with other kind of ops. (Like ID ops). Differential Revision: https://reviews.llvm.org/D100321	2021-04-21 07:19:36 -07:00
Butygin	85740ee108	[mlir] Assume terminators in nested regions are always legal in FuncBufferizePass Previously, any terminator without ReturnLike and BranchOpInterface traits (e.g. scf.condition) were causing pass to fail. Differential Revision: https://reviews.llvm.org/D100832	2021-04-21 11:55:11 +03:00
Tobias Gysi	5a451e486f	[mlir][linalg] adapt named op generalization to work with captures. Instead of always running the region builder check if the generalized op has a region attached. If yes inline the existing region instead of calling the region builder. This change circumvents a problem with named operations that have a region builder taking captures and the generalization pass not knowing about this captures. Differential Revision: https://reviews.llvm.org/D100880	2021-04-21 06:37:53 +00:00
Amy Zhuang	9194071626	[mlir] Support hoisting whole affine for loops in LICM Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D100512	2021-04-20 18:07:06 -07:00
Matthias Springer	dd5324467d	[mlir] Disallow broadcast dimensions on TransferWriteOp. The current implementation allows for TransferWriteOps with broadcasts that do not make sense. E.g., a broadcast could write a vector into a single (scalar) memory location, which is effectively the same as writing only the last element of the vector. Differential Revision: https://reviews.llvm.org/D100842	2021-04-21 07:43:45 +09:00
Tobias Gysi	b9715156ff	[mlir][linalg] lower index operations during linalg to vector lowering. The patch extends the vectorization pass to lower linalg index operations to vector code. It allocates constant 1d vectors that enumerate the indexes along the iteration dimensions and broadcasts/transposes these 1d vectors to the iteration space. Differential Revision: https://reviews.llvm.org/D100373	2021-04-20 11:55:44 +00:00
KareemErgawy-TomTom	0b05207e45	[MLIR][LinAlg] Detensoring CF cost-model: look forward. This patch extends the control-flow cost-model for detensoring by implementing a forward-looking pass on block arguments that should be detensored. This makes sure that if a (to-be-detensored) block argument "escapes" its block through the terminator, then the successor arguments are also detensored. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D100457	2021-04-20 09:01:43 +02:00
Tobias Gysi	39a604e3df	[mlir][linalg] update fusion on tensors to support linalg index operations. The patch replaces the index operations in the body of fused producers and linearizes the indices after expansion. Differential Revision: https://reviews.llvm.org/D100479	2021-04-20 06:13:04 +00:00
Tobias Gysi	d0774f7f0a	[mlir][linalg] update drop unit dims to support linalg index operations. Update the dimensions of the index operations to account for dropped dimensions and replace the index operations of dropped dimensions by zero. Differential Revision: https://reviews.llvm.org/D100395	2021-04-20 04:54:00 +00:00
clementval	c46a88625d	[mlir][llvm] Add UnnamedAddr attribute to GlobalOp This patch add the UnnamedAddr attribute for the GlobalOp in the LLVM dialect. The attribute is also handled to and from LLVM IR. This is meant to be used in a follow up patch to lower OpenACC/OpenMP ops to call to kmp and tgt runtime calls (D100678). Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D100677	2021-04-19 21:45:14 -04:00
Tobias Gysi	495e1d7e8a	[mlir][linalg] adding pass to run the interchange pattern. Instead of interchanging loops during the loop lowering this pass performs the interchange by permuting the indexing maps. It also updates the iterator types and the index accesses in the body of the operation. Differential Revision: https://reviews.llvm.org/D100627	2021-04-19 12:19:15 +00:00
Nicolas Vasilache	843f1fc825	[mlir][scf] Add scf.for + tensor.cast canonicalization pattern Fold scf.for iter_arg/result pairs that go through incoming/ougoing a tensor.cast op pair so as to pull the tensor.cast inside the scf.for: ``` %0 = tensor.cast %t0 : tensor<32x1024xf32> to tensor<?x?xf32> %1 = scf.for %i = %c0 to %c1024 step %c32 iter_args(%iter_t0 = %0) -> (tensor<?x?xf32>) { %2 = call @do(%iter_t0) : (tensor<?x?xf32>) -> tensor<?x?xf32> scf.yield %2 : tensor<?x?xf32> } %2 = tensor.cast %1 : tensor<?x?xf32> to tensor<32x1024xf32> use_of(%2) ``` folds into: ``` %0 = scf.for %arg2 = %c0 to %c1024 step %c32 iter_args(%arg3 = %arg0) -> (tensor<32x1024xf32>) { %2 = tensor.cast %arg3 : tensor<32x1024xf32> to tensor<?x?xf32> %3 = call @do(%2) : (tensor<?x?xf32>) -> tensor<?x?xf32> %4 = tensor.cast %3 : tensor<?x?xf32> to tensor<32x1024xf32> scf.yield %4 : tensor<32x1024xf32> } use_of(%0) ``` Differential Revision: https://reviews.llvm.org/D100661	2021-04-16 16:50:21 +00:00
thomasraoux	3fc0fbefc8	[mlir][vector] Move transferOp on tensor opt to folder/canonicalization Move the existing optimization for transfer op on tensor to folder and canonicalization. This handles the write after write case and read after write and also add write after read case. Differential Revision: https://reviews.llvm.org/D100597	2021-04-16 08:13:10 -07:00
Javier Setoain	b739bada9d	[mlir][ArmSVE] Cleanup dialect registration ArmSVE dialect is behind the recent changes in how the Vector dialect interacts with backend vector dialects and the MLIR -> LLVM IR translation module. This patch cleans up ArmSVE initialization within Vector and removes the need for an LLVMArmSVE dialect. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D100171	2021-04-16 15:56:51 +02:00
Frederik Gossen	3a5a610e27	[MLIR][Shape] Expose `getShapeVec` and add support for extent tensors Differential Revision: https://reviews.llvm.org/D100636	2021-04-16 13:59:20 +02:00
Nicolas Vasilache	8cf650c554	[mlir][linalg] Add support for WAW fusion on tensors. Differential Revision: https://reviews.llvm.org/D100603	2021-04-16 08:22:09 +00:00
Ahmed Taei	0e2f9b61fd	Fix tile-and-pad when padding doesn't span all dimension Without this tile-and-pad will never terminate if pad-fails. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97720	2021-04-15 20:17:40 -07:00
River Riddle	4efb7754e0	[mlir][NFC] Add a using directive for llvm::SetVector Differential Revision: https://reviews.llvm.org/D100436	2021-04-15 16:09:34 -07:00
Aart Bik	916f3e16bd	[mlir][vector][avx] add AVX dot product to X86Vector dialect with lowering In the long run, we want to unify the dot product codegen solutions between all target architectures, but this intrinsic enables experimenting with AVX specific implementations in the meantime. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D100593	2021-04-15 15:01:39 -07:00
Alexander Belyaev	cf761904a2	[mlir] Add verification for `linalg.tiled_loop` op. Differential Revision: https://reviews.llvm.org/D100555	2021-04-15 20:50:36 +02:00
Alexander Belyaev	67f60bcc75	[mlir] Expose `updateBoundsForCyclicDistribution` in Linalg/Utils.h. Differential Revision: https://reviews.llvm.org/D100580	2021-04-15 20:47:37 +02:00
Aart Bik	92b0a9d7d4	[mlir][sparse] remove restriction on vectorization of index type Rationale: Now that vector<?xindex> is allowed, the restriction on vectorization of index types in the sparse compiler can be removed. Also needs generalization of scatter/gather index types. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D100522	2021-04-15 10:27:04 -07:00
Tobias Gysi	ce82843f72	[mlir][linalg] update fusion to support linalg index operations. The patch updates the linalg fusion pass to add the tile offsets to the indices. Differential Revision: https://reviews.llvm.org/D100456	2021-04-14 15:32:42 +00:00
Hanhan Wang	7c4de2e9b9	[mlir][StandardToSPIRV] Add support for lowering memref<?xi1> to SPIR-V Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D100452	2021-04-14 07:22:49 -07:00
Tres Popp	d80178f7c1	[mlir] Change verification order to prevent null dereference Differential Revision: https://reviews.llvm.org/D100390	2021-04-14 09:33:17 +02:00
Sumesh Udayakumaran	f56791ae2e	[mlir] Prevent operations with users from being hoisted This patch collects operations that have users in a for loop and uses them when loop invariant operations are detected and hoisted. Reviewed By: bondhugula, vinayaka-polymage Differential Revision: https://reviews.llvm.org/D99761	2021-04-13 15:29:17 -07:00
Lei Zhang	5b15fe9334	[mlir][spirv] Only attach struct offset for required storage classes Per the SPIR-V spec "2.16.2. Validation Rules for Shader Capabilities": Composite objects in the StorageBuffer, PhysicalStorageBuffer, Uniform, and PushConstant Storage Classes must be explicitly laid out. For other cases we don't need to attach the struct offsets. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D100386	2021-04-13 15:30:30 -04:00
Eugene Zhulenev	8a316b00d6	[mlir] Convert async dialect passes from function passes to op agnostic passes Differential Revision: https://reviews.llvm.org/D100401	2021-04-13 11:46:00 -07:00
Emilio Cota	0b63e3222b	[mlir] X86Vector: Add AVX Rsqrt Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D99818	2021-04-13 08:43:48 -07:00
Tobias Gysi	8ea5d190ec	[mlir][linalg] update tiling to support linalg index operations. The patch updates the tiling pass to add the tile offsets to the indices returned by the linalg operations. Differential Revision: https://reviews.llvm.org/D100379	2021-04-13 14:36:01 +00:00
Butygin	eb31540066	[mlir] Canonicalize single-iteration ParallelOp Differential Revision: https://reviews.llvm.org/D100248	2021-04-13 13:42:19 +03:00
Tobias Gysi	ef30179eff	[mlir][linalg] lower index operations during linalg to loop lowering. The patch extends the linalg to loop lowering pass to replace all linalg index operations by the induction variables of the generated loop nests. Differential Revision: https://reviews.llvm.org/D100364	2021-04-13 09:04:09 +00:00
KareemErgawy-TomTom	aa6eb2af10	[MLIR][LinAlg] Implement detensoring cost-modelling. This patch introduces the neccessary infrastructure changes to implement cost-modelling for detensoring. In particular, it introduces the following changes: - An extension to the dialect conversion framework to selectively convert sub-set of non-entry BB arguments. - An extension to branch conversion pattern to selectively convert sub-set of a branche's operands. - An interface for detensoring cost-modelling. - 2 simple implementations of 2 different cost models. This sets the stage to explose cost-modelling for detessoring in an easier way. We still need to come up with better cost models. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D99945	2021-04-13 09:07:18 +02:00
Eugene Zhulenev	a6628e596e	[mlir] Async: add automatic reference counting at async.runtime operations level Depends On D95311 Previous automatic-ref-counting pass worked with high level async operations (e.g. async.execute), however async values reference counting is a runtime implementation detail. New pass mostly relies on the save liveness analysis to place drop_ref operations, and does better verification of CFG with different liveIn sets in block successors. This is almost NFC change. No new reference counting ideas, just a cleanup of the previous version. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D95390	2021-04-12 18:54:55 -07:00
Geoffrey Martin-Noble	ae33eef505	[MLIR] Add a switch operation to the standard dialect This is similar to the definition of llvm.switch, providing unstructured branch-based control flow. It differs from the LLVM operation in that it accepts any signless integer (not only an i32), takes no branch weights (the same as the Branch and CondBranch ops), and has a slightly different syntax for the default case that includes it in the list of cases with an explicit `default` keyword. Also included are several canonicalizers. See https://llvm.discourse.group/t/rfc-add-std-switch-and-scf-switch/3090 Reviewed By: rriddle, bondhugula Differential Revision: https://reviews.llvm.org/D99925	2021-04-12 18:46:02 -07:00
Lei Zhang	23b8264b52	[mlir][spirv] Fix runtime array stride when emulating bitwidth The stride should be calculated with the converted array element type, not the original input type. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D100337	2021-04-12 17:13:33 -04:00
Lei Zhang	0deeaaca39	[mlir] Move memref.subview patterns to MemRef/Transforms/ These patterns have been used as a prerequisite step for lowering to SPIR-V. But they don't involve SPIR-V dialect ops; they are pure memref/vector op transformations. Given now we have a dedicated MemRef dialect, moving them to Memref/Transforms/, which is a more suitable place to host them, to allow used by others. This commit just moves code around and renames patterns/passes accordingly. CMakeLists.txt for existing MemRef libraries are also improved along the way. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D100326	2021-04-12 16:38:22 -04:00
Lei Zhang	fd91f81c85	[mlir][spirv] Put debug-only variable in LLVM_DEBUG This avoids paying the cost when building in release. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D100325	2021-04-12 15:14:43 -04:00
eopXD	9cc417cbca	[mlir][affine] Fix unfolded bounding maps for affine.for Loop bounds of affine.for didn't perform foldings like affine.load, affine.store. Bound maps shall be more composed, leaving most affine.apply become dead. This resolves the bug listed on https://bugs.llvm.org/show_bug.cgi?id=45203 Differential Revision: https://reviews.llvm.org/D99323	2021-04-13 00:12:43 +05:30
Emilio Cota	8508a63b88	[mlir] Rename AVX512 dialect to X86Vector We will soon be adding non-AVX512 operations to MLIR, such as AVX's rsqrt. In https://reviews.llvm.org/D99818 several possibilities were discussed, namely to (1) add non-AVX512 ops to the AVX512 dialect, (2) add more dialects (e.g. AVX dialect for AVX rsqrt), and (3) expand the scope of the AVX512 to include these SIMD x86 ops, thereby renaming the dialect to something more accurate such as X86Vector. Consensus was reached on option (3), which this patch implements. Reviewed By: aartbik, ftynse, nicolasvasilache Differential Revision: https://reviews.llvm.org/D100119	2021-04-12 19:20:04 +02:00
MaheshRavishankar	b0fc712b14	[mlir][Linalg] Disable const -> linalg.generic when fused op is illegal. Fusing a constant with a linalg.generic operation can result in the fused operation being illegal since the loop bound computation fails. Avoid such fusions. Differential Revision: https://reviews.llvm.org/D100272	2021-04-12 10:15:54 -07:00
Tobias Gysi	93f9922d65	[mlir][linalg] adding operation to access the iteration index of enclosing linalg ops. The `linalg.index` operation provides access to the iteration indexes of immediately enclosing linalg operations. It takes a dimension `dim` attribute and returns the iteration index in the given dimension. Having `linalg.index` allows us to unify `linalg.generic` and `linalg.indexed_generic` and also enables index access in named operations. Differential Revision: https://reviews.llvm.org/D100292	2021-04-12 13:37:17 +00:00
Frederik Gossen	e413b86a2c	[MLIR][Shape] Combine `cstr_eq` only if they share shape operands Differential Revision: https://reviews.llvm.org/D100198	2021-04-09 16:54:54 +02:00
Frederik Gossen	74d33052dd	[MLIR][Shape] Add convenience builder for `shape.assuming_all` Differential Revision: https://reviews.llvm.org/D100105	2021-04-09 12:17:34 +02:00
Frederik Gossen	79d12ded53	[MLIR][Shape] Canonicalize `assuming_all` when all operands are `cstr_eq` ops Differential Revision: https://reviews.llvm.org/D100104	2021-04-09 11:49:29 +02:00
Frederik Gossen	538254e8e0	[MLIR] Do not yield values from an assuming op that are never used Differential Revision: https://reviews.llvm.org/D100042	2021-04-09 11:06:41 +02:00
MaheshRavishankar	f4eb681dc3	[mlir][Linalg] Drop unit-trip loops of reductions only if other reduction loops exists. Recent change enable dropping unit-trip loops of "reduction" iterator type as well. This is fine as long as there is one other "reduction" iterator in the operation. Without this the initialized value (value of `out`) is not read which leads to a correctness issue. Also fix a bug in the `fill` -> `tensor_reshape` folding. The `out` operand of the `fill` needs to be reshaped to get the `out` operand of the generated `fill` operation. Differential Revision: https://reviews.llvm.org/D100145	2021-04-08 22:31:29 -07:00
Weiwei Li	12ffc26067	[mlir][spirv] Define spv.ImageDrefGather operation This patch doesn't support the optional operands of ImageDrefGather. The support of optional operands will be implemented later. co-authered-by: Alan Liu <alanliu.yf@gmail.com> Differential Revision: https://reviews.llvm.org/D100128	2021-04-08 20:15:54 -04:00
Hanhan Wang	c361435845	[mlir][StandardToSPIRV] Handle i1 case for lowering memref.load/store op This patch unconditionally converts i1 types to i8 types on memrefs. If the extensions or capabilities are not met, they will be converted to i32. Hence the logic in IntLoadPattern and IntStorePattern are also updated. Also added the implementation of SPIRVTypeConverter::getOptions(). Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D99724	2021-04-08 12:15:25 -07:00
Lei Zhang	5299843c31	[mlir][spirv] Add control for non-32-bit scalar type emulation Non-32-bit scalar types require special hardware support that may not exist on all GPUs. This is reflected in SPIR-V as that non-32-bit scalar types require special capabilities or extensions. Previously when there is a non-32-bit type and no native support, we unconditionally emulate it with 32-bit ones. This isn't good given that it can have implications over ABI and data layout consistency. This commit introduces an option to control whether to use 32-bit types to emulate. Differential Revision: https://reviews.llvm.org/D100059	2021-04-08 08:19:47 -04:00
Lei Zhang	004f29c0bb	[mlir][spirv] Timely fail type conversion Per the TypeConverter API contract, returning `llvm:None` means other conversion rules should be tried. But we only have one rule per input type. So there is no need to try others and we can just directly fail, which should return `nullptr`. This avoids unnecessary checks. Differential Revision: https://reviews.llvm.org/D100058	2021-04-08 08:19:46 -04:00
Tobias Gysi	b614ada0e8	[mlir] add support for index type in vectors. The patch enables the use of index type in vectors. It is a prerequisite to support vectorization for indexed Linalg operations. This refactoring became possible due to the newly introduced data layout infrastructure. The data layout of a module defines the bitwidth of the index type needed to verify bitcasts and similar vector operations. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D99948	2021-04-08 08:17:13 +00:00
Haruki Imai	39ee9fd8c1	[mlir] Fixed alignment attribute of alloc constant folding. When allocLikeOp is updated in alloc constant folding, alighnment attribute was ignored. This patch fixes it. Signed-off-by: Haruki Imai <imaihal@jp.ibm.com> Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D99882	2021-04-07 19:28:49 +00:00
Aart Bik	3acf49829c	[mlir][sparse] support integral types i32,i16,i8 for numerical values Some sparse matrices operate on integral values (in contrast with the common f32 and f64 values). This CL expands the compiler and runtime support to deal with several common type combinations. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D99999	2021-04-07 10:01:37 -07:00
Matthias Springer	65a3f28939	[mlir] Add "mask" operand to vector.transfer_read/write. Also factors out out-of-bounds mask generation from vector.transfer_read/write into a new MaterializeTransferMask pattern. Differential Revision: https://reviews.llvm.org/D100001	2021-04-07 21:33:13 +09:00
Jacques Pienaar	8b109bc2ea	[mlir,shape] Add max/min folder for simple case When both arguments are the same for these ops, propagate this argument.	2021-04-06 20:22:42 -07:00
Nicolas Vasilache	518e6f341d	[mlir][Linalg] Fix fusion on tensors operands / bbArg mismatch Linalg fusion on tensors has mismatching assumptions on the operand side than on the region bbArg side. Relax the behavior on the operand/indexing map side so that we better support output operands that may also be read from. Differential revision: https://reviews.llvm.org/D99499	2021-04-06 15:39:40 +00:00
MaheshRavishankar	944a2fe763	[mlir][Linalg] Add callbacks to fusion of elementwise operations to control fusion. Right now Elementwise operations fusion in Linalg fuses everything it can. This can run up against resource limits of the target hardware without some checks. This patch adds a callback function that clients can use to implement a cost function. When two elementwise operations are deemed structurally fusable, the callback can be used to control if the fusion applies. Differential Revision: https://reviews.llvm.org/D99820	2021-04-05 16:08:47 -07:00
MaheshRavishankar	ea069aebcc	[mlir][Linalg] NFC: Move populatePatterns* method into linalg namespace. The moved `populate` methods are only relevant to Linalg operations. So they are better of in `linalg` namespace. Also rename `populateLinalgTensorOpsFusionPatterns` to `populateElementwiseOpsFusionPatterns`. This makes the scope of these patterns explicit and disambiguates it with fusion on tensors using tile + fuse. Differential Revision: https://reviews.llvm.org/D99819	2021-04-05 11:16:02 -07:00
Lei Zhang	6dd07fa513	[mlir][spirv] Add utilities for push constant value This commit add utility functions for creating push constant storage variable and loading values from it. Along the way, performs some clean up: * Deleted `setABIAttrs`, which is just a 4-liner function with one user. * Moved `SPIRVConverstionTarget` into `mlir` namespace, to be consistent with `SPIRVTypeConverter` and `LLVMConversionTarget`. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D99725	2021-04-02 07:51:07 -04:00
Aart Bik	a0c5b7e3b5	[mlir][sparse] support for very narrow index and pointer types Rationale: Small indices and values, when allowed by the required range of the input tensors, can reduce the memory footprint of sparse tensors even more. Note, however, that we must be careful zero extending the values (since sparse tensors never use negatives for indexing), but LLVM treats the index type as signed in most memory operations (like the scatter and gather). This CL dots all the i's in this regard. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D99777	2021-04-01 18:21:27 -07:00
Aden Grue	3ba1b1cd20	Add a pattern to combine composed subview ops Differential Revision: https://reviews.llvm.org/D99229	2021-04-01 10:56:57 -07:00
Matthias Springer	95f8135043	[mlir] Change vector.transfer_read/write "masked" attribute to "in_bounds". This is in preparation for adding a new "mask" operand. The existing "masked" attribute was used to specify dimensions that may be out-of-bounds. Such transfers can be lowered to masked load/stores. The new "in_bounds" attribute is used to specify dimensions that are guaranteed to be within bounds. (Semantics is inverted.) Differential Revision: https://reviews.llvm.org/D99639	2021-03-31 18:04:22 +09:00
Nicolas Vasilache	43b9fa3ce0	[mlir][Linalg][Python] Create the body of builtin named Linalg ops This revision adds support to properly add the body of registered builtin named linalg ops. At this time, indexing_map and iterator_type support is still missing so the op is not executable yet. Differential Revision: https://reviews.llvm.org/D99578	2021-03-31 07:58:32 +00:00
Alexander Belyaev	465b9a4a33	Revert "Revert "[mlir] Introduce CloneOp and adapt test cases in BufferDeallocation."" This reverts commit `883912abe6`.	2021-03-31 09:49:09 +02:00
Inho Seo	f584633454	Added static verification for Linalg Ops. This verification is to check if the indices for static shaped operands on linalgOps access out of bound memory or not. For dynamic shaped operands, we would be able to check it on runtime stage. Found several invalid Linalg ops testcases, and fixed them. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D98390	2021-03-30 07:10:17 -07:00
MaheshRavishankar	c4d5b95617	Fix broken build for commit `9b0517035f` Differential Revision: https://reviews.llvm.org/D99533	2021-03-29 12:48:45 -07:00
MaheshRavishankar	9b0517035f	[mlir] Enhance InferShapedTypeOpInterface and move LinalgOps to use them. A new `InterfaceMethod` is added to `InferShapedTypeOpInterface` that allows an operation to return the `Value`s for each dim of its results. It is intended for the case where the `Value` returned for each dim is computed using the operands and operation attributes. This interface method is for cases where the result dim of an operation can be computed independently, and it avoids the need to aggregate all dims of a result into a single shape value. This also implies that this is not suitable for cases where the result type is unranked (for which the existing interface methods is to be used). Also added is a canonicalization pattern that uses this interface and resolves the shapes of the output in terms of the shapes of the inputs. Moving Linalg ops to use this interface, so that many canonicalization patterns implemented for individual linalg ops to achieve the same result can be removed in favor of the added canonicalization pattern. Differential Revision: https://reviews.llvm.org/D97887	2021-03-29 11:39:48 -07:00
MaheshRavishankar	f0a2fe7f79	[mlir][Linalg] Rewrite SubTensors that take a slice out of a unit-extend dimension. Subtensor operations that are taking a slice out of a tensor that is unit-extent along a dimension can be rewritten to drop that dimension. Differential Revision: https://reviews.llvm.org/D99226	2021-03-29 09:19:36 -07:00
MaheshRavishankar	7d8b478ce1	[mlir][Linalg] Drop spurious error message Drop usage of `emitRemark` and use `notifyMatchFailure` instead to avoid unnecessary spew during compilation. Differential Revision: https://reviews.llvm.org/D99485	2021-03-29 09:17:25 -07:00
thomasraoux	5288c25c70	[mlir][vector] Add lowering of Transfer_read with broadcast and permutation map Convert transfer_read ops with permutation maps into simpler transfer_read with minority map + vector.braodcast and vector.transpose. And transfer_read with leading dimensions broacast into transfer_read of lower rank. Differential Revision: https://reviews.llvm.org/D99019	2021-03-29 08:38:43 -07:00
Frederik Gossen	630afc61a8	[MLIR][Shape] Canonicalize casted dynamic extent tensor Differential Revision: https://reviews.llvm.org/D99161	2021-03-29 13:59:19 +02:00
Alexander Belyaev	883912abe6	Revert "[mlir] Introduce CloneOp and adapt test cases in BufferDeallocation." This reverts commit `06b03800f3`. Until some kind of support for region args is added.	2021-03-29 12:47:59 +02:00
Julian Gross	06b03800f3	[mlir] Introduce CloneOp and adapt test cases in BufferDeallocation. Add a new clone operation to the memref dialect. This operation implicitly copies data from a source buffer to a new buffer. In contrast to the linalg.copy operation, this operation does not accept a target buffer as an argument. Instead, this operation performs a conceptual allocation which does not need to be performed manually. Furthermore, this operation resolves the dependency from the linalg-dialect in the BufferDeallocation pass. In addition, we also extended the canonicalization patterns to fold clone operations. The copy removal pass has been removed. Differential Revision: https://reviews.llvm.org/D99172	2021-03-29 10:19:10 +02:00
KareemErgawy-TomTom	c52a5f2aa7	MLIR][STD] Fold trunci (sexti). This patch folds the following pattern: ``` %arg0 = ... %0 = sexti %arg0 : i1 to i8 %1 = trunci %0 : i8 to i1 ``` into just `%arg0`. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D99464	2021-03-29 08:34:08 +02:00
KareemErgawy-TomTom	e5f2898bc7	[MLIR][STD] Fold trunci (zexti). This patch folds the following pattern: ``` %arg0 = ... %0 = zexti %arg0 : i1 to i8 %1 = trunci %0 : i8 to i1 ``` into just `%arg0`. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D99453	2021-03-27 19:40:10 +01:00
Jacques Pienaar	7ce07c6494	[mlir] Remove unneeded ShapeFunctionLibraryTerminatorOp Now that NoTerminator is possible this op can be removed/it was only needed structurally before. NFC.	2021-03-26 16:03:51 -07:00
Alexander Belyaev	7f2236cf58	[mlir][linalg] Add output tensor args folding for linalg.tiled_loop. Folds away TiledLoopOp output tensors when the following conditions are met: * result of `linalg.tiled_loop` has no uses * output tensor is the argument of `linalg.yield` Example: ``` %0 = linalg.tiled_loop ... outs (%out, %out_buf:tensor<...>, memref<...>) { ... linalg.yield %out : tensor ... } ``` Becomes ``` linalg.tiled_loop ... outs (%out_buf:memref<...>) { ... linalg.yield } ``` Differential Revision: https://reviews.llvm.org/D99333	2021-03-25 18:11:05 +01:00
Mehdi Amini	973ddb7d6e	Define a `NoTerminator` traits that allows operations with a single block region to not provide a terminator In particular for Graph Regions, the terminator needs is just a historical artifact of the generalization of MLIR from CFG region. Operations like Module don't need a terminator, and before Module migrated to be an operation with region there wasn't any needed. To validate the feature, the ModuleOp is migrated to use this trait and the ModuleTerminator operation is deleted. This patch is likely to break clients, if you're in this case: - you may iterate on a ModuleOp with `getBody()->without_terminator()`, the solution is simple: just remove the ->without_terminator! - you created a builder with `Builder::atBlockTerminator(module_body)`, just use `Builder::atBlockEnd(module_body)` instead. - you were handling ModuleTerminator: it isn't needed anymore. - for generic code, a `Block::mayNotHaveTerminator()` may be used. Differential Revision: https://reviews.llvm.org/D98468	2021-03-25 03:59:03 +00:00
Lei Zhang	19435d3863	[mlir][linalg] Fold fill -> tensor_reshape chain For such op chains, we can create new linalg.fill ops with the result type of the linalg.tensor_reshape op. Differential Revision: https://reviews.llvm.org/D99116	2021-03-24 18:17:58 -04:00
Lei Zhang	c241e1c2f5	[mlir][linalg] Support dropping unit dimensions for init tensors init tensor operands also has indexing map and generally follow the same constraints we expect for non-init-tensor operands. Differential Revision: https://reviews.llvm.org/D99115	2021-03-24 18:17:58 -04:00
Lei Zhang	7f28d27cb6	[mlir][linalg] Allow controlling folding unit dim reshapes This commit exposes an option to the pattern FoldWithProducerReshapeOpByExpansion to allow folding unit dim reshapes. This gives callers more fine-grained controls. Differential Revision: https://reviews.llvm.org/D99114	2021-03-24 18:17:57 -04:00
Lei Zhang	f66120a357	[mlir][affine] Add canonicalization to merge affine min/max ops This identifies a pattern where the producer affine min/max op is bound to a dimension/symbol that is used as a standalone expression in the consumer affine op's map. In that case the producer affine min/max op can be merged into its consumer. For example, a pattern like the following: ``` %0 = affine.min affine_map<()[s0] -> (s0 + 16, s0 * 8)> ()[%sym1] %1 = affine.min affine_map<(d0)[s0] -> (s0 + 4, d0)> (%0)[%sym2] ``` Can be turned into: ``` %1 = affine.min affine_map< ()[s0, s1] -> (s0 + 4, s1 + 16, s1 * 8)> ()[%sym2, %sym1] ``` Differential Revision: https://reviews.llvm.org/D99016	2021-03-24 18:17:57 -04:00
Lei Zhang	23fd26608c	[mlir][affine] Deduplicate affine min/max op expressions If there are multiple identical expressions in an affine min/max op's map, we can just keep one. Differential Revision: https://reviews.llvm.org/D99015	2021-03-24 18:17:57 -04:00
Lei Zhang	e58597ee1c	[mlir][linalg] Fuse producers with non-permutation indexing maps Until now Linalg fusion only allow fusing producers whose operands are all permutation indexing maps. It's easier to deduce the subtensor/subview but it is an unnecessary constraint, as in tiling we have more advanced logic to deduce the subranges even when the operand is not of permutation indexing maps, e.g., the input operand for convolution ops. This patch uses the logic on tiling side to deduce subranges for fusion. This enables fusing convolution with its consumer ops when possible. Along the way, we are now generating proper affine.min ops to guard against size boundaries, if we cannot be certain they won't be out of bounds. Differential Revision: https://reviews.llvm.org/D99014	2021-03-24 18:17:57 -04:00
Lei Zhang	ddf93abf49	[mlir][linalg] NFC: Move makeTiledShapes into Utils.{h\|cpp} This is a preparation step to reuse makeTiledShapes in tensor fusion. Along the way, did some lightweight cleanups. Differential Revision: https://reviews.llvm.org/D99013	2021-03-24 18:17:57 -04:00
Tobias Gysi	880822255e	[mlir][linalg] Do not call region builder during vectorization. All linalg operations having a region builder shall call it during op creation. Calling it during vectorization is obsolete. Differential Revision: https://reviews.llvm.org/D99168	2021-03-24 14:55:11 +00:00
Alex Zinenko	b3386a734e	[mlir] introduce data layout entry for index type Index type is an integer type of target-specific bitwidth present in many MLIR operations (loops, memory accesses). Converting values of this type to fixed-size integers has always been problematic. Introduce a data layout entry to specify the bitwidth of `index` in a given layout scope, defaulting to 64 bits, which is a commonly used assumption, e.g., in constants. Port builtin-to-LLVM type conversion to use this data layout entry when converting `index` type and untie it from pointer size. This is particularly relevant for GPU targets. Keep a possibility to forcibly override the index type in lowerings. Depends On D98525 Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D98937	2021-03-24 15:13:42 +01:00
Alex Zinenko	1916b0e098	[mlir] support data layout specs on ModuleOp ModuleOp is a natural place to provide scoped data layout information. However, it is undesirable for ModuleOp to implement the entirety of DataLayoutOpInterface because that would require either pushing the interface inside the IR library instead of a separate library, or putting the default implementation of the interface as inline functions in headers leading to binary bloat. Instead, ModuleOp accepts an arbitrary data layout spec attribute and has a dedicated hook to extract it, and DataLayout is modified to know about ModuleOp particularities. Reviewed By: herhut, nicolasvasilache Differential Revision: https://reviews.llvm.org/D98500	2021-03-24 15:13:38 +01:00
Nicolas Vasilache	7716e5535c	[mlir] Fixes to hoist padding Fix the BlockAndValueMapping update that was missing entries for scf.for op's blockIterArgs. Skip cloning subtensors of the padded tensor as the logic for these is separate. Add a filter to drop side-effecting ops. Tests are beefed up to verify the IR is sound in all hoisting configurations for 2-level 3-D tiled matmul. Differential Revision: https://reviews.llvm.org/D99255	2021-03-24 11:51:28 +00:00
Vladislav Vinogradov	18a2f479bf	[mlir][NFC] Replace `getMemorySpaceAsInt` with `getMemorySpace` where possible Use new `MemRefType::getMemorySpace` method with generic Attribute in cases, where there is no specific logic around the memory space. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D99154	2021-03-24 13:23:59 +03:00
River Riddle	76f3c2f3f3	[mlir][Pattern] Add better support for using interfaces/traits to match root operations in rewrite patterns To match an interface or trait, users currently have to use the `MatchAny` tag. This tag can be quite problematic for compile time for things like the canonicalizer, as the `MatchAny` patterns may get applied to every operation. This revision adds better support by bucketing interface/trait patterns based on which registered operations have them registered. This means that moving forward we will only attempt to match these patterns to operations that have this interface registered. Two simplify defining patterns that match traits and interfaces, two new utility classes have been added: OpTraitRewritePattern and OpInterfaceRewritePattern. Differential Revision: https://reviews.llvm.org/D98986	2021-03-23 14:05:33 -07:00
Chris Lattner	782c534117	[ODS] Implement a new 'hasCanonicalizeMethod' bit for cann patterns. This provides a simplified way to implement 'matchAndRewrite' style canonicalization patterns for ops that don't need the full power of RewritePatterns. Using this style, you can implement a static method with a signature like: ``` LogicalResult AssertOp::canonicalize(AssertOp op, PatternRewriter &rewriter) { return success(); } ``` instead of dealing with defining RewritePattern subclasses. This also adopts this for a few canonicalization patterns in the std dialect to show how it works. Differential Revision: https://reviews.llvm.org/D99143	2021-03-23 13:45:45 -07:00
Alex Zinenko	20c68d9441	[mlir] silence -Wunused-variable in release mode in Linalg transforms	2021-03-23 18:59:12 +01:00
Nicolas Vasilache	2240568579	[MLIR][Linalg] Hoist padding across multiple levels of tiling This revision introduces proper backward slice computation during the hoisting of PadTensorOp. This allows hoisting padding even across multiple levels of tiling. Such hoisting requires the proper handling of loop bounds that may depend on enclosing loop variables. Differential revision: https://reviews.llvm.org/D98965	2021-03-23 17:47:32 +00:00
Frederik Gossen	94ef248d7b	Revert "[MLIR] Canonicalize `shape.assuming` op to yield only inner values" This reverts commit `5f8acd4fd2`.	2021-03-23 16:05:55 +01:00
Frederik Gossen	5f8acd4fd2	[MLIR] Canonicalize `shape.assuming` op to yield only inner values Differential Revision: https://reviews.llvm.org/D99156	2021-03-23 12:34:50 +01:00
Frederik Gossen	f368b3a029	[MLIR][Shape] Canonicalize duplicate operands in `shape.cstr_broadcastable` Differential Revision: https://reviews.llvm.org/D99159	2021-03-23 12:23:22 +01:00
Frederik Gossen	d78374b2d3	[MLIR] Add callback builder for `shape.assuming` op Differential Revision: https://reviews.llvm.org/D99153	2021-03-23 11:46:01 +01:00
Chris Lattner	79d7f618af	Rename FrozenRewritePatternList -> FrozenRewritePatternSet; NFC. This nicely aligns the naming with RewritePatternSet. This type isn't as widely used, but we keep a using declaration in to help with downstream consumption of this change. Differential Revision: https://reviews.llvm.org/D99131	2021-03-22 17:40:45 -07:00
Chris Lattner	dc4e913be9	[PatternMatch] Big mechanical rename OwningRewritePatternList -> RewritePatternSet and insert -> add. NFC This doesn't change APIs, this just cleans up the many in-tree uses of these names to use the new preferred names. We'll keep the old names around for a couple weeks to help transitions. Differential Revision: https://reviews.llvm.org/D99127	2021-03-22 17:20:50 -07:00
Chris Lattner	549e190236	[PatternRewriter] Rename OwningRewritePatternList -> RewritePatternSet and insert -> add This maintains the old name to have minimal source impact on downstream codes, and does not do the huge mechanical patch. I expect the huge mechanical patch to land sometime this week, but we can keep around the old names for a couple weeks to reduce impact on downstream projects. Differential Revision: https://reviews.llvm.org/D99119	2021-03-22 16:33:18 -07:00
Chris Lattner	6874726610	[PatternMatching] Add convenience insert method to OwningRewritePatternList. NFC. This allows adding a C function pointer as a matchAndRewrite style pattern, which is a very common case. This adopts it in ExpandTanh to show how it reduces a level of nesting. We could allow C++ lambdas here, but that doesn't work as well with type inference in the common case. Instead of: patterns.insert(convertTanhOp); you need to specify: patterns.insert<math::TanhOp>(convertTanhOp); which is boilerplate'y. Capturing state like this is very uncommon, so we choose to require clients to define their own structs and use the non-convenience method when they need to do so. Differential Revision: https://reviews.llvm.org/D99039	2021-03-22 11:18:21 -07:00
Nicolas Vasilache	bcd6424f9b	[mlir][Linalg] Fix linalg on tensor fusion - Drop unnecessary occurrences of rewriter.eraseOp: dead linalg ops on tensors should be cleaned up by DCE. - reimplement the part of Linalg on fusion that constructs the body and block arguments: the previous implementation had too much magic. Instead this spells out all cases explicitly and asserts / introduces TODOs for incorrect cases. As a consequence, we can use the default traversal order for this pattern. Differential Revision: https://reviews.llvm.org/D99070	2021-03-22 13:29:40 +00:00
Adrian Kuegel	c691b9686b	[mlir] Add an option to still use bottom-up traversal GreedyPatternRewriteDriver was changed from bottom-up traversal to top-down traversal. Not all passes work yet with that change for traversal order. To give some time for fixing, add an option to allow to switch back to bottom-up traversal. Use this option in FusionOfTensorOpsPass which fails otherwise. Differential Revision: https://reviews.llvm.org/D99059	2021-03-22 09:49:44 +01:00
Chris Lattner	ffde3acb1b	[ShapeDialect] Silence a build warning, NFC mlir/lib/Dialect/Shape/IR/Shape.cpp:573:26: warning: loop variable 'shape' is always a copy because the range of type '::mlir::Operation::operand_range' (aka 'mlir::OperandRange') does not return a reference [-Wrange-loop-analysis] for (const auto &shape : shapes()) { ^	2021-03-21 10:10:38 -07:00
Chris Lattner	3a506b31a3	Change OwningRewritePatternList to carry an MLIRContext with it. This updates the codebase to pass the context when creating an instance of OwningRewritePatternList, and starts removing extraneous MLIRContext parameters. There are many many more to be removed. Differential Revision: https://reviews.llvm.org/D99028	2021-03-21 10:06:31 -07:00
Butygin	7219b31d40	[mlir] Additional folding for SelectOp * Fold SelectOp when both true and false args are same SSA value * Fold some cmp + select patterns Differential Revision: https://reviews.llvm.org/D98576	2021-03-20 13:40:42 +03:00
Butygin	5657f93e78	[mlir] Canonicalize IfOp with trivial `then` and `else` bodies to list of SelectOp's * Do we need a threshold on maximum number of Yeild arguments processed (maximum number of SelectOp's to be generated)? * Had to modify some old IfOp tests to not get optimized by this pattern Differential Revision: https://reviews.llvm.org/D98592	2021-03-20 12:18:49 +03:00
Mehdi Amini	cdb6eb7e83	Update syntax for amx.tile_muli to use two Unit attr to mark the zext case This makes the annotation tied to the operand and the use of a keyword more explicit/readable on what it means. Differential Revision: https://reviews.llvm.org/D99001	2021-03-20 04:12:24 +00:00
Benjamin Kramer	6327a7cfd7	[mlir][Linalg] Make LLVM_DEBUG region bigger to avoid warnings in Release builds Transforms.cpp:586:16: error: unused variable 'v' [-Werror,-Wunused-variable] for (Value v : operands) ^	2021-03-19 20:56:59 +01:00
Nicolas Vasilache	5b2d8503d1	[mlir][Linalg] NFC - Expose helper function `substituteMin`.	2021-03-19 16:26:52 +00:00
Alexander Belyaev	628f5c9da2	[mlir] Add a roundtrip test for 'linalg.tiled_loop' on buffers. https://llvm.discourse.group/t/rfc-add-linalg-tileop/2833 Differential Revision: https://reviews.llvm.org/D98900	2021-03-19 09:38:20 +01:00
Christian Sigg	a825fb2c07	[mlir] Remove mlir-rocm-runner This change combines for ROCm what was done for CUDA in D97463, D98203, D98360, and D98396. I did not try to compile SerializeToHsaco.cpp or test mlir/test/Integration/GPU/ROCM because I don't have an AMD card. I fixed the things that had obvious bit-rot though. Reviewed By: whchung Differential Revision: https://reviews.llvm.org/D98447	2021-03-19 00:24:10 -07:00
Lei Zhang	fcc1ce0093	Revert "Revert "[mlir] Add linalg.fill bufferization conversion"" This reverts commit `c69550c132` with proper fix applied.	2021-03-18 17:21:58 -04:00
Mehdi Amini	c69550c132	Revert "[mlir] Add linalg.fill bufferization conversion" This reverts commit `32a744ab20`. CI is broken: test/Dialect/Linalg/bufferize.mlir:274:12: error: CHECK: expected string not found in input // CHECK: %[[MEMREF:.*]] = tensor_to_memref %[[IN]] : memref<?xf32> ^	2021-03-18 21:18:07 +00:00
Eugene Zhulenev	32a744ab20	[mlir] Add linalg.fill bufferization conversion `BufferizeAnyLinalgOp` fails because `FillOp` is not a `LinalgGenericOp` and it fails while reading operand sizes attribute. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D98671	2021-03-18 13:41:16 -07:00
thomasraoux	16947650d5	[mlir][linalg] Extend linalg vectorization to support non-identity input maps This propagates the affine map to transfer_read op in case it is not a minor identity map. Differential Revision: https://reviews.llvm.org/D98523	2021-03-18 12:32:35 -07:00
lorenzo chelini	4c782a24d9	[mlir] Fix typo in SCF.cpp (NFC)	2021-03-18 19:15:33 +01:00
Alexander Belyaev	283799157e	[mlir][linalg] Add support for memref inputs/outputs for `linalg.tiled_loop`. Also use `ArrayAttr` to pass iterator pass to the TiledLoopOp builder. Differential Revision: https://reviews.llvm.org/D98871	2021-03-18 16:11:03 +01:00
David Truby	de155f4af2	[MLIR][OpenMP] Pretty printer and parser for omp.wsloop Co-authored-by: Kiran Chandramohan <kiran.chandramohan@arm.com> Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D92327	2021-03-18 13:37:01 +00:00
Frederik Gossen	1ce70c15ed	[MLIR] Canonicalize broadcast operations on single shapes This covers cases that are not folded away because the extent tensor type becomes more concrete in the process. Differential Revision: https://reviews.llvm.org/D98782	2021-03-18 08:59:50 +01:00
Vladislav Vinogradov	fee9054232	[mlir][ODS] Support specialized Attribute class for Enums Add a feature to `EnumAttr` definition to generate specialized Attribute class for the particular enumeration. This class will inherit `StringAttr` or `IntegerAttr` and will override `classof` and `getValue` methods. With this class the enumeration predicate can be checked with simple RTTI calls (`isa`, `dyn_cast`) and it will return the typed enumeration directly instead of raw string/integer. Based on the following discussion: https://llvm.discourse.group/t/rfc-add-enum-attribute-decorator-class/2252 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97836	2021-03-17 16:44:24 +03:00
lorenzo chelini	0a74a7161b	[mlir] scf::ForOp: Drop iter arguments (and corresponding result) with no use 'ForOpIterArgsFolder' can now remove iterator arguments (and corresponding results) with no use. Example: ``` %cst = constant 32 : i32 %0:2 = scf.for %arg1 = %lb to %ub step %step iter_args(%arg2 = %arg0, %arg3 = %cst) -> (i32, i32) { %1 = addu %arg2, %cst : i32 scf.yield %1, %1 : i32, i32 } use(%0#0) ``` %arg3 is not used in the block, and its corresponding result `%0#1` has no use, thus remove the iter argument. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D98711	2021-03-17 12:06:17 +00:00
River Riddle	3a833a0e0e	[mlir][PDL] Add support for variadic operands and results in the PDL Interpreter This revision extends the PDL Interpreter dialect to add support for variadic operands and results, with ranges of these values represented via the recently added !pdl.range type. To support this extension, three new operations have been added that closely match the single variant: * pdl_interp.check_types : Compare a range of types with a known range. * pdl_interp.create_types : Create a constant range of types. * pdl_interp.get_operands : Get a range of operands from an operation. * pdl_interp.get_results : Get a range of results from an operation. * pdl_interp.switch_types : Switch on a range of types. This revision handles adding support in the interpreter dialect and the conversion from PDL to PDLInterp. Support for variadic operands and results in the bytecode will be added in a followup revision. Differential Revision: https://reviews.llvm.org/D95722	2021-03-16 13:20:19 -07:00
River Riddle	1eb6994d6a	[mlir][PDL] Add support for variadic operands and results in PDL This revision extends the PDL dialect to add support for variadic operands and results, with ranges of these values represented via the recently added !pdl.range type. To support this extension, three new operations have been added that closely match the single variant: * pdl.operands : Define a range of input operands. * pdl.results : Extract a result group from an operation. * pdl.types : Define a handle to a range of types. Support for these in the pdl interpreter dialect and byte code will be added in followup revisions. Differential Revision: https://reviews.llvm.org/D95721	2021-03-16 13:20:18 -07:00
River Riddle	02c4c0d5b2	[mlir][pdl] Remove CreateNativeOp in favor of a more general ApplyNativeRewriteOp. This has a numerous amount of benefits, given the overly clunky nature of CreateNativeOp: * Users can now call into arbitrary rewrite functions from inside of PDL, allowing for more natural interleaving of PDL/C++ and enabling for more of the pattern to be in PDL. * Removes the need for an additional set of C++ functions/registry/etc. The new ApplyNativeRewriteOp will use the same PDLRewriteFunction as the existing RewriteOp. This reduces the API surface area exposed to users. This revision also introduces a new PDLResultList class. This class is used to provide results of native rewrite functions back to PDL. We introduce a new class instead of using a SmallVector to simplify the work necessary for variadics, given that ranges will require some changes to the structure of PDLValue. Differential Revision: https://reviews.llvm.org/D95720	2021-03-16 13:20:18 -07:00
River Riddle	242762c9a3	[mlir][pdl] Restructure how results are represented. Up until now, results have been represented as additional results to a pdl.operation. This is fairly clunky, as it mismatches the representation of the rest of the IR constructs(e.g. pdl.operand) and also isn't a viable representation for operations returned by pdl.create_native. This representation also creates much more difficult problems when factoring in support for variadic result groups, optional results, etc. To resolve some of these problems, and simplify adding support for variable length results, this revision extracts the representation for results out of pdl.operation in the form of a new `pdl.result` operation. This operation returns the result of an operation at a given index, e.g.: ``` %root = pdl.operation ... %result = pdl.result 0 of %root ``` Differential Revision: https://reviews.llvm.org/D95719	2021-03-16 13:20:18 -07:00
Nicolas Vasilache	b661788b77	[mlir] NFC - Expose GlobalCreator so it can be reused.	2021-03-16 12:29:04 +00:00
Adrian Kuegel	2995e161b0	[mlir]: Add canonicalization for dim of 1D alloc of size rank. Differential Revision: https://reviews.llvm.org/D97542	2021-03-16 10:38:57 +01:00
Lorenzo Chelini	fd7eee64c5	scf::ForOp: Fold away iterator arguments with no use and for which the corresponding input is yielded Enhance 'ForOpIterArgsFolder' to remove unused iteration arguments in a scf::ForOp. If the block argument corresponding to the given iterator has no use and the yielded value equals the input, we fold it away. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D98503	2021-03-16 07:01:25 +00:00
Aart Bik	6ad7b97e20	[mlir][amx] Add Intel AMX dialect (architectural-specific vector dialect) The Intel Advanced Matrix Extensions (AMX) provides a tile matrix multiply unit (TMUL), a tile control register (TILECFG), and eight tile registers TMM0 through TMM7 (TILEDATA). This new MLIR dialect provides a bridge between MLIR concepts like vectors and memrefs and the lower level LLVM IR details of AMX. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D98470	2021-03-15 17:59:05 -07:00
Alex Zinenko	0fb4a201c0	[mlir] fix shared-lib build fallout of `e2310704d8` The patch in question broke the build with shared libraries due to missing dependencies, one of which would have been circular between MLIRStandard and MLIRMemRef if added. Fix this by moving more code around and swapping the dependency direction. MLIRMemRef now depends on MLIRStandard, but MLIRStandard does _not_ depend on MLIRMemRef. Arguably, this is the right direction anyway since numerous libraries depend on MLIRStandard and don't necessarily need to depend on MLIRMemref. Other otable changes include: - some EDSC code is moved inline to MemRef/EDSC/Intrinsics.h because it creates MemRef dialect operations; - a utility function related to shape moved to BuiltinTypes.h/cpp because it only realtes to shaped types and not any particular dialect (standard dialect is erroneously believed to contain MemRefType); - a Python test for the standard dialect is disabled completely because the ops it tests moved to the new MemRef dialect, but it is not exposed to Python bindings, and the change for that is non-trivial.	2021-03-15 13:41:38 +01:00
Julian Gross	e2310704d8	[MLIR] Create memref dialect and move dialect-specific ops from std. Create the memref dialect and move dialect-specific ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp AssumeAlignmentOp -> MemRef_AssumeAlignmentOp DeallocOp -> MemRef_DeallocOp DimOp -> MemRef_DimOp MemRefCastOp -> MemRef_CastOp MemRefReinterpretCastOp -> MemRef_ReinterpretCastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp LoadOp -> MemRef_LoadOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp SubViewOp -> MemRef_SubViewOp TransposeOp -> MemRef_TransposeOp TensorLoadOp -> MemRef_TensorLoadOp TensorStoreOp -> MemRef_TensorStoreOp TensorToMemRefOp -> MemRef_BufferCastOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D98041	2021-03-15 11:14:09 +01:00
Frederik Gossen	b55f424ffc	[MLIR] Add canonicalization for `shape.broadcast` Remove redundant operands and fold if only one left. Differential Revision: https://reviews.llvm.org/D98402	2021-03-15 10:11:28 +01:00
Aart Bik	e7ee4eaaf7	[mlir][sparse] disable nonunit stride dense vectorization This is a temporary work-around to get our all-annotations-all-flags stress testing effort run clean. In the long run, we want to provide efficient implementations of strided loads and stores though Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D98563	2021-03-12 16:49:32 -08:00
Eugene Zhulenev	39b2cd4009	[mlir] Annotate functions used only in debug mode with LLVM_ATTRIBUTE_UNUSED Functions used only in `assert` cause warnings in release mode Reviewed By: mehdi_amini, dcaballe, ftynse Differential Revision: https://reviews.llvm.org/D98476	2021-03-12 11:25:46 -08:00
Marius Brehler	849f8183fb	[mlir] Fix ConstantOp verifier This restricts the attributes to integers for constants of type IndexType. So far an attribute like StringAttr as in %c1 = constant "" : index is valid. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D98216	2021-03-12 08:49:25 +01:00
Sergei Grechanik	fd2b08969b	[mlir][Vector] Lowering of transfer_read/write to vector.load/store This patch introduces progressive lowering patterns for rewriting vector.transfer_read/write to vector.load/store and vector.broadcast in certain supported cases. Reviewed By: dcaballe, nicolasvasilache Differential Revision: https://reviews.llvm.org/D97822	2021-03-11 18:17:51 -08:00
Mehdi Amini	e1364f1068	Replace use of OperationState with builder::create in GPU Kernel Outlining (NFC) OperationState is a low level API that is rarely indicated, the builder API convenient wrapper is preferred when possible.	2021-03-12 00:14:02 +00:00
Diego Caballero	0fd0fb5329	Reland: [mlir][Affine][Vector] Add initial support for 'iter_args' to Affine vectorizer. This patch adds support for vectorizing loops with 'iter_args' when those loops are not a vector dimension. This allows vectorizing outer loops with an inner 'iter_args' loop (e.g., reductions). Vectorizing scenarios where 'iter_args' loops are vector dimensions would require more work (e.g., analysis, generating horizontal reduction, etc.) not included in this patch. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97892	2021-03-12 01:08:28 +02:00
Diego Caballero	96891f0418	Reland: [mlir][Vector][Affine] Improve affine vectorizer algorithm This patch replaces the root-terminal vectorization approach implemented in the Affine vectorizer with a topological order approach that vectorizes all the operations within the target loop nest. These are the most important changes introduced by the new algorithm: * Removed tracking of root and terminal ops. Existing vectorization functionality is preserved and extended so that loop nests without root-terminal chains can be vectorized. * Vectorizing a loop nest now only requires a single topological traversal. * A new vector loop nest is incrementally built along the vectorization process. The original scalar loop is kept intact. No cloning guard is needed to recover the scalar loop if vectorization fails. This approach also simplifies the challenging task of replacing a loop operation amid the vectorization process without invalidating the analysis information that depends on the original loop. * Vectorization of specific operations has been implemented as independent, preparing them to be moved to a potential vectorization interface. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97442	2021-03-12 00:19:50 +02:00
River Riddle	31bb8efd69	[mlir][StorageUniquer] Properly call the destructor on non-trivially destructible storage instances This allows for storage instances to store data that isn't uniqued in the context, or contain otherwise non-trivial logic, in the rare situations that they occur. Storage instances with trivial destructors will still have their destructor skipped. A consequence of this is that the storage instance definition must be visible from the place that registers the type. Differential Revision: https://reviews.llvm.org/D98311	2021-03-11 11:35:32 -08:00
Diego Caballero	ed193bce9d	[mlir][Vector][Affine] Fix heap-use-after-free in vectorizer This patch fixes a heap-use-after-free introduced by the recent changes in the vectorizer: https://reviews.llvm.org/rG95db7b4aeaad590f37720898e339a6d54313422f The problem is due to the way candidate loops are visited. All candidate loops are pattern-matched beforehand using the 'NestedMatch' utility. These matches may intersect with each other so it may happen that we try to vectorize a loop that was previously vectorized. The new vectorization algorithm replaces the original loops that are vectorized with new loops and, therefore, any reference to the original loops in the pre-computed matches becomes invalid. This patch fixes the problem by classifying the candidate matches into buckets before vectorization. Each bucket contains all the matches that intersect. The vectorizer uses these buckets to make sure that we only vectorize one match from each bucket, at most. Differential Revision: https://reviews.llvm.org/D98382	2021-03-11 20:44:07 +02:00
Alex Zinenko	3ba14fa0ce	[mlir] Introduce data layout modeling subsystem Data layout information allows to answer questions about the size and alignment properties of a type. It enables, among others, the generation of various linear memory addressing schemes for containers of abstract types and deeper reasoning about vectors. This introduces the subsystem for modeling data layouts in MLIR. The data layout subsystem is designed to scale to MLIR's open type and operation system. At the top level, it consists of attribute interfaces that can be implemented by concrete data layout specifications; type interfaces that should be implemented by types subject to data layout; operation interfaces that must be implemented by operations that can serve as data layout scopes (e.g., modules); and dialect interfaces for data layout properties unrelated to specific types. Built-in types are handled specially to decrease the overall query cost. A concrete default implementation of these interfaces is provided in the new Target dialect. Defaults for built-in types that match the current behavior are also provided. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97067	2021-03-11 16:54:47 +01:00
Arpith C. Jacob	b4a516cc43	[mlir] Add LLVM loop codegen options to control software pipelining Support specifying the II and disabling pipelining. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98420	2021-03-11 16:46:44 +01:00
Frederik Gossen	b975e3b5aa	[MLIR] Add canoncalization for `shape.is_broadcastable` Canonicalize `is_broadcastable` to constant true if fewer than 2 unique shape operands. Eliminate redundant operands, otherwise. Differential Revision: https://reviews.llvm.org/D98361	2021-03-11 10:10:34 +01:00
Christian Sigg	2224221fb3	[mlir] Add NVVM to CUBIN conversion to mlir-opt If MLIR_CUDA_RUNNER_ENABLED, register a 'gpu-to-cubin' conversion pass to mlir-opt. The next step is to switch CUDA integration tests from mlir-cuda-runner to mlir-opt + mlir-cpu-runner and remove mlir-cuda-runner. Depends On D98279 Reviewed By: herhut, rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D98203	2021-03-11 10:07:11 +01:00

... 5 6 7 8 9 ...

2456 Commits