llvm-project

Commit Graph

Author	SHA1	Message	Date
Thomas Raoux	15389cdc5b	[mlir][spirv] Add remaining cooperative matrix instructions Adds support for cooperative matrix support for arithmetic and cast instructions. It also adds cooperative matrix store, muladd and matrixlength instructions which are part of the extension. Differential Revision: https://reviews.llvm.org/D80181	2020-05-21 11:55:33 -07:00
jerryyin	9c53ac08de	[mlir][rocdl] Exposing buffer load/store intrinsic Summary: * Updated ROCDLOps tablegen * Added parsing and printing function for new intrinsic * Added unit tests Reviewers: ftynse Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80233	2020-05-21 14:14:35 +00:00
MaheshRavishankar	071358e082	[mlir][Linalg] Add producer-consumer fusion when producer is a ConstantOp and Consumer is a GenericOp. Differential Revision: https://reviews.llvm.org/D79838	2020-05-20 09:16:19 -07:00
Nicolas Vasilache	004a3d4f56	[mlir][Linalg] Refactor linalg tiling Summary: This revision refactors the Linalg tiling pass to be written as pattern applications and retires the use of the folder in Linalg tiling. In the early days, tiling was written as a pass that would create (partially) folded and canonicalized operations on the fly for better composability. As this evolves towards composition of patterns, the pass-specific folder is counter-productive and is retired. The tiling options struct evolves to take a tile size creation function which allows materializing tile sizes on the fly (in particular constant tile sizes). This plays better with folding and DCE. With the folder going away in Tiling, the check on whether subviews are the same in linalg fusion needs to be more robust. This revision also implements such a check. In the current form, there are still some canonicalizations missing due to AffineMin/Max ops fed by scf::ForOp. These will be improved at a later time. Differential Revision: https://reviews.llvm.org/D80267	2020-05-20 09:39:56 -04:00
Tres Popp	02035580d3	[mlir] Add custom assembly formats to shape.witness ops. The assembly formats are essentially the generic forms without quotations and type information. Differential Revision: https://reviews.llvm.org/D80180	2020-05-20 13:25:33 +02:00
Tres Popp	fb6986ef69	[mlir] Custom printing/parsing for Shape::AssumingOp Summary: Additionally, this adds traits and builder methods to AssumingYieldOp and names the input witness to the AssumingOp. Differential Revision: https://reviews.llvm.org/D80187	2020-05-20 10:39:26 +02:00
Thomas Raoux	b359bbaa8b	[mlir][spirv] First step to support spirv cooperative matrix extension. Add a new type to SPIRV dialect for cooperative matrix and add new op for cooperative matrix load. This is missing most instructions to support cooperative matrix extension but this is a stop-gap patch to avoid creating big review. Differential Revision: https://reviews.llvm.org/D80043	2020-05-19 19:29:41 -07:00
Sean Silva	21b0eff773	[mlir][shape] Add `shape.from_extents`. Summary: This is a basic op needed for creating shapes from SSA values representing the extents. Differential Revision: https://reviews.llvm.org/D79833	2020-05-19 14:26:08 -07:00
George	e984b7f2a2	Added a TanOp to SPIR-V dialect GLSL ops Implemented tangent op from SPIR-V's GLSL extended instruction set. Added a round-trip and serialization/deserialization tests for the op. Differential Revision: https://reviews.llvm.org/D80152	2020-05-19 09:15:29 -04:00
Kiran Kumar T P	fa8fc9ffcc	[MLIR, OpenMP] Support for flush operation, and translating the same to LLVM IR Summary: This patch adds support for flush operation in OpenMP dialect and translation of this construct to LLVM IR. The OpenMP IRBuilder is used for this translation. The patch includes code changes and testcase modifications. Reviewed By: ftynse, kiranchandramohan Differential Revision: https://reviews.llvm.org/D79937	2020-05-19 17:01:25 +05:30
Pierre Oechsel	d1866f8947	[MLIR] [Linalg] Add option to use the partial view after promotion. For now the promoted buffer is indexed using the `full view`. The full view might be slightly bigger than the partial view (which is accounting for boundaries). Unfortunately this does not compose easily with other transformations when multiple buffers with shapes related to each other are involved. Take `linalg.matmul A B C` (with A of size MxK, B of size KxN and C of size MxN) and suppose we are: - Tiling over M by 100 - Promoting A only This is producing a `linalg.matmul promoted_A B subview_C` where `promoted_A` is a promoted buffer of `A` of size (100xK) and `subview_C` is a subview of size mxK where m could be smaller than 100 due to boundaries thus leading to a possible incorrect behavior. We propose to: - Add a new parameter to the tiling promotion allowing to enable the use of the full tile buffer. - By default all promoted buffers will be indexed by the partial view. Note that this could be considered as a breaking change in comparison to the way the tiling promotion was working. Differential Revision: https://reviews.llvm.org/D79927	2020-05-18 18:28:18 +02:00
Nicolas Vasilache	1870e787af	[mlir][Vector] Add an optional "masked" boolean array attribute to vector transfer operations Summary: Vector transfer ops semantic is extended to allow specifying a per-dimension `masked` attribute. When the attribute is false on a particular dimension, lowering to LLVM emits unmasked load and store operations. Differential Revision: https://reviews.llvm.org/D80098	2020-05-18 11:52:08 -04:00
Nicolas Vasilache	36cdc17f8c	[mlir][Vector] Make minor identity permutation map optional in transfer op printing and parsing Summary: This revision makes the use of vector transfer operatons more idiomatic by allowing to omit and inferring the permutation_map. Differential Revision: https://reviews.llvm.org/D80092	2020-05-18 11:41:27 -04:00
Denis Khalikov	0dc91bfd11	[mlir][spirv] Handle debuginfo for control flow ops. Summary: Handle debuginfo for control flow operations: spv.Selection, spv.Loop, spv.BranchOp, spv.BranchConditional. Differential Revision: https://reviews.llvm.org/D79931	2020-05-16 15:36:14 +03:00
aartbik	b1c688dbae	[mlir] [VectorOps] Implement vector.create_mask lowering to LLVM IR Summary: First, compact implementation of lowering to LLVM IR. A bit more challenging than the constant mask due to the dynamic indices, of course. I like to hear if there are more efficient ways of doing this in LLVM, but this for now at least gives us a functional reference implementation. Reviewers: nicolasvasilache, ftynse, bkramer, reidtatge, andydavis1, mehdi_amini Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79954	2020-05-15 11:02:30 -07:00
Tres Popp	a26883e5aa	[MLIR] Add shape.witness type and ops Summary: These represent shape based preconditions on execution of code. Differential Revision: https://reviews.llvm.org/D79717	2020-05-15 14:33:54 +02:00
Nicolas Vasilache	f1b972041a	[mlir][Linalg] Start a LinalgToStandard pass and move conversion to library calls. This revision starts decoupling the include the kitchen sink behavior of Linalg to LLVM lowering by inserting a -convert-linalg-to-std pass. The lowering of linalg ops to function calls was previously lowering to memref descriptors by having both linalg -> std and std -> LLVM patterns in the same rewrite. When separating this step, a new issue occurred: the layout is automatically type-erased by this process. This revision therefore introduces memref casts to perform these type erasures explicitly. To connect everything end-to-end, the LLVM lowering of MemRefCastOp is relaxed because it is artificially more restricted than the op semantics. The op semantics already guarantee that source and target MemRefTypes are cast-compatible. An invalid lowering test now becomes valid and is removed. Differential Revision: https://reviews.llvm.org/D79468	2020-05-15 00:24:03 -04:00
Diego Caballero	bc5565f9ea	[mlir][Affine] Introduce affine.vector_load and affine.vector_store This patch adds `affine.vector_load` and `affine.vector_store` ops to the Affine dialect and lowers them to `vector.transfer_read` and `vector.transfer_write`, respectively, in the Vector dialect. Reviewed By: bondhugula, nicolasvasilache Differential Revision: https://reviews.llvm.org/D79658	2020-05-14 13:17:58 -07:00
Alex Zinenko	60f443bb3b	[mlir] Change dialect namespace loop->scf All ops of the SCF dialect now use the `scf.` prefix instead of `loop.`. This is a part of dialect renaming. Differential Revision: https://reviews.llvm.org/D79844	2020-05-13 19:20:21 +02:00
Nicolas Vasilache	e0b99a5de4	[mlir] Add SubViewOp::getOrCreateRanges and fix folding pattern The existing implementation of SubViewOp::getRanges relies on all offsets/sizes/strides to be dynamic values and does not work in combination with canonicalization. This revision adds a SubViewOp::getOrCreateRanges to create the missing constants in the canonicalized case. This allows reactivating the fused pass with staged pattern applications. However another issue surfaces that the SubViewOp verifier is now too strict to allow folding. The existing folding pattern is turned into a canonicalization pattern which rewrites memref_cast + subview into subview + memref_cast. The transform-patterns-matmul-to-vector can then be reactivated. Differential Revision: https://reviews.llvm.org/D79759	2020-05-13 10:11:30 -04:00
MaheshRavishankar	49e6c19100	[mlir][StandardToLLVM] Add SinOp to LLVM dialect and lowering of std.sin to this op. Differential Revision: https://reviews.llvm.org/D79505	2020-05-12 23:15:25 -07:00
MaheshRavishankar	5440d0a12d	[mlir][Linalg] Add folders and canonicalizers for linalg.reshape/linalg.tensor_reshape operations. Differential Revision: https://reviews.llvm.org/D79765	2020-05-12 23:03:26 -07:00
MaheshRavishankar	d2a9569850	[mlir][Linalg] Allow reshapes to collapse to a zero-rank tensor. This is only valid if the source tensors (result tensor) is static shaped with all unit-extents when the reshape is collapsing (expanding) dimensions. Differential Revision: https://reviews.llvm.org/D79764	2020-05-12 23:03:25 -07:00
aartbik	fb2c4d50f1	[mlir] [VectorOps] Implement vector.constant_mask lowering to LLVM IR Summary: Makes this operation runnable on CPU by generating MLIR instructions that are eventually folded into an LLVM IR constant for the mask. Reviewers: nicolasvasilache, ftynse, reidtatge, bkramer, andydavis1 Reviewed By: nicolasvasilache, ftynse, andydavis1 Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79815	2020-05-12 19:44:23 -07:00
Nicolas Vasilache	63c0e72b2f	[mlir] Revisit std.subview handling of static information. The main objective of this revision is to change the way static information is represented, propagated and canonicalized in the SubViewOp. In the current implementation the issue is that canonicalization may strictly lose information because static offsets are combined in irrecoverable ways into the result type, in order to fit the strided memref representation. The core semantics of the op do not change but the parser and printer do: the op always requires `rank` offsets, sizes and strides. These quantities can now be either SSA values or static integer attributes. The result type is automatically deduced from the static information and more powerful canonicalizations (as powerful as the representation with sentinel `?` values allows). Previously static information was inferred on a best-effort basis from looking at the source and destination type. Relevant tests are rewritten to use the idiomatic `offset: x, strides : [...]`-form. Bugs are corrected along the way that were not trivially visible in flattened strided memref form. Lowering to LLVM is updated, simplified and now supports all cases. A mixed static-dynamic mode test that wouldn't previously lower is added. It is an open question, and a longer discussion, whether a better result type representation would be a nicer alternative. For now, the subview op carries the required semantic. Differential Revision: https://reviews.llvm.org/D79662	2020-05-12 20:04:44 -04:00
Sean Silva	452e2fc409	Revert of Revert of [mlir][shape] Tidy up shape.shape_of Summary: - Mark it NoSideEffect - Add custom parser/printer This reverts the temporary revert in https://reviews.llvm.org/rG84a9c725742d26df04808a3c7349dbd98684c6cb That was a false alarm. A downstream test actually needed to be updated.	2020-05-12 15:51:24 -07:00
Sam McCall	691e826995	Revert "[mlir] Revisit std.subview handling of static information." This reverts commit `80d133b24f`. Per Stephan Herhut: The canonicalizer pattern that was added creates forms of the subview op that cannot be lowered. This is shown by failing Tensorflow XLA tests such as: tensorflow/compiler/xla/service/mlir_gpu/tests:abs.hlo.test Will provide more details offline, they rely on logs from private CI.	2020-05-12 15:18:50 +02:00
Eric Christopher	84a9c72574	Temporarily Revert "[mlir][shape] Tidy up shape.shape_of" as it's breaking a few tests. This reverts commit `b604544886`. Followed up offline with a testcase.	2020-05-11 23:05:18 -07:00
Nicolas Vasilache	80d133b24f	[mlir] Revisit std.subview handling of static information. Summary: The main objective of this revision is to change the way static information is represented, propagated and canonicalized in the SubViewOp. In the current implementation the issue is that canonicalization may strictly lose information because static offsets are combined in irrecoverable ways into the result type, in order to fit the strided memref representation. The core semantics of the op do not change but the parser and printer do: the op always requires `rank` offsets, sizes and strides. These quantities can now be either SSA values or static integer attributes. The result type is automatically deduced from the static information and more powerful canonicalizations (as powerful as the representation with sentinel `?` values allows). Previously static information was inferred on a best-effort basis from looking at the source and destination type. Relevant tests are rewritten to use the idiomatic `offset: x, strides : [...]`-form. Bugs are corrected along the way that were not trivially visible in flattened strided memref form. It is an open question, and a longer discussion, whether a better result type representation would be a nicer alternative. For now, the subview op carries the required semantic. Reviewers: ftynse, mravishankar, antiagainst, rriddle!, andydavis1, timshen, asaadaldien, stellaraccident Reviewed By: mravishankar Subscribers: aartbik, bondhugula, mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, bader, grosul1, frgossen, Kayjukh, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79662	2020-05-11 17:44:24 -04:00
Reid Tatge	334a4159ec	[mlir][Vector] NFC - Rename vector.strided_slice into vector.extract_strided_slice Differential Revision: https://reviews.llvm.org/D79734	2020-05-11 14:21:10 -07:00
Nicolas Vasilache	d12d05a731	[mlir][Linalg] Introduce a helper function for staged pattern application Summary: This revision introduces a helper function to allow applying rewrite patterns, interleaved with more global transformations, in a staged fashion: 1. the first stage consists of an OwningRewritePatternList. The RewritePattern in this list are applied once and in order. 2. the second stage consists of a single OwningRewritePattern that is applied greedily until convergence. 3. the third stage consists of applying a lambda, generally used for non-local transformation effects. This allows creating custom fused transformations where patterns can be ordered and applied at a finer granularity than a sequence of traditional compiler passes. A test that exercises these behaviors is added. Differential Revision: https://reviews.llvm.org/D79518	2020-05-11 16:46:30 -04:00
Sean Silva	b604544886	[mlir][shape] Tidy up shape.shape_of Summary: - Mark it NoSideEffect - Add custom parser/printer Differential Revision: https://reviews.llvm.org/D79684	2020-05-11 12:55:31 -07:00
Nicolas Vasilache	6ed61a26c2	[mlir] Simplify and better document std.view semantics This [discussion](https://llvm.discourse.group/t/viewop-isnt-expressive-enough/991/2) raised some concerns with ViewOp. In particular, the handling of offsets is incorrect and does not match the op description. Note that with an elemental type change, offsets cannot be part of the type in general because sizeof(srcType) != sizeof(dstType). Howerver, offset is a poorly chosen term for this purpose and is renamed to byte_shift. Additionally, for all intended purposes, trying to support non-identity layouts for this op does not bring expressive power but rather increases code complexity. This revision simplifies the existing semantics and implementation. This simplification effort is voluntarily restrictive and acts as a stepping stone towards supporting richer semantics: treat the non-common cases as YAGNI for now and reevaluate based on concrete use cases once a round of simplification occurred. Differential revision: https://reviews.llvm.org/D79541	2020-05-11 12:29:23 -04:00
rtayl	58cb88733f	[mlir][rocdl] Add xdlops intrinsics to rocdl dialect Summary: This adds xdlops (mfma) to the rocdl dialect and also tests the translation to llvm ir. Reviewers: ftynse Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, llvm-commits Tags: #llvm #mlir Differential Revision: https://reviews.llvm.org/D79642	2020-05-11 10:08:58 -04:00
Alex Zinenko	c25b20c0f6	[mlir] NFC: Rename LoopOps dialect to SCF (Structured Control Flow) This dialect contains various structured control flow operaitons, not only loops, reflect this in the name. Drop the Ops suffix for consistency with other dialects. Note that this only moves the files and changes the C++ namespace from 'loop' to 'scf'. The visible IR prefix remains the same and will be updated separately. The conversions will also be updated separately. Differential Revision: https://reviews.llvm.org/D79578	2020-05-11 15:04:27 +02:00
Denis Khalikov	b5973d20b3	[mlir][spirv] Handle debuginfo for variables. Summary: Handle debuginfo for spv.Variable and spv.globalVariable during (de)serialization. Differential Revision: https://reviews.llvm.org/D79566	2020-05-08 18:10:18 +03:00
Alexander Belyaev	a6b2877f4c	[MLIR] Make ParallelLoopFusion pass scan through all nested regions. Differential Revision: https://reviews.llvm.org/D79558	2020-05-07 13:47:30 +02:00
Alex Zinenko	54c927b988	[mlir] Add a test exercising partial constant folding of affine min/max This functionality was introduced in `a87db48e6f` but only only tested indirectly though Linalg tests. Add direct tests.	2020-05-07 12:42:03 +02:00
Alex Zinenko	a87db48e6f	[mlir] Support partial folding of affine.min/max Originally, these operations were folded only if all expressions in their affine maps could be folded to a constant expression that can be then subject to numeric min/max computation. This introduces a more advanced version that partially folds the affine map by lifting individual constant expression in it even if some of the expressions remain variable. The folding can update the operation in place to use a simpler map. Note that this is not as powerful as canonicalization, in particular this does not remove dimensions or symbols that became useless. This allows for better composition of Linalg tiling and promotion transformation, where the latter can handle some canonical forms of affine.min that the folding can now produce. Differential Revision: https://reviews.llvm.org/D79502	2020-05-07 12:30:04 +02:00
Uday Bondhugula	57d361bd2f	[MLIR][NFC] Rename op trait PolyhedralScope -> AffineScope Rename op trait PolyhedralScope -> AffineScope for consistency. Differential Revision: https://reviews.llvm.org/D79503	2020-05-07 00:19:56 +05:30
Andy Davis	93d1108801	[MLIR][LoopOps] Adds the loop unroll transformation for loop::ForOp. Summary: Adds the loop unroll transformation for loop::ForOp. Adds support for promoting the body of single-iteration loop::ForOps into its containing block. Adds check tests for loop::ForOps with dynamic and static lower/upper bounds and step. Care was taken to share code (where possible) with the AffineForOp unroll transformation to ease maintenance and potential future transition to a LoopLike construct on which loop transformations for different loop types can implemented. Reviewers: ftynse, nicolasvasilache Reviewed By: ftynse Subscribers: bondhugula, mgorny, zzheng, mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, grosul1, frgossen, Kayjukh, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79184	2020-05-05 10:42:36 -07:00
Ehsan Toosi	6ccaf73887	[MLIR][LINALG] Convert Linalg on Tensors to Buffers This is a basic pass to convert Linalg.GenericOp which works on tensors to use buffers instead. Differential Revision: https://reviews.llvm.org/D78996	2020-05-05 15:48:07 +02:00
Alexander Belyaev	b79751e83d	[MLIR] Add conversion from AtomicRMWOp -> GenericAtomicRMWOp. Adding this pattern reduces code duplication. There is no need to have a custom implementation for lowering to llvm.cmpxchg. Differential Revision: https://reviews.llvm.org/D78753	2020-05-05 10:32:13 +02:00
River Riddle	469c02d058	[mlir] Add support for merging identical blocks during canonicalization This revision adds support for merging identical blocks, or those with the same operations that branch to the same successors. Operands that mismatch between the different blocks are replaced with new block arguments added to the merged block. Differential Revision: https://reviews.llvm.org/D79134	2020-05-04 19:56:46 -07:00
Nicolas Vasilache	307cfdf533	[mlir][Linalg] Mostly NFC - Refactor Linalg patterns and transformations. Linalg transformations are currently exposed as DRRs. Unfortunately RewriterGen does not play well with the line of work on named linalg ops which require variadic operands and results. Additionally, DRR is arguably not the right abstraction to expose compositions of such patterns that don't rely on SSA use-def semantics. This revision abandons DRRs and exposes manually written C++ patterns. Refactorings and cleanups are performed to uniformize APIs. This refactoring will allow replacing the currently manually specified Linalg named ops. A collateral victim of this refactoring is the `tileAndFuse` DRR, and the one associated test, which will be revived at a later time. Lastly, the following 2 tests do not add value and are altered: - a dot_perm tile + interchange test does not test anything new and is removed - a dot tile + lower to loops does not need 2-D tiling and is trimmed.	2020-05-04 11:17:37 -04:00
Wen-Heng (Jack) Chung	bc23c1d85e	[mlir][rocdl] add rocdl.barier op. - Add rocdl.barrier op. - Lower gpu.barier to rocdl.barrier in -convert-gpu-to-rocdl. Differential Revision: https://reviews.llvm.org/D79126	2020-05-04 10:35:01 +02:00
River Riddle	cb9ae0025c	[mlir] Add a new context flag for disabling/enabling multi-threading This is useful for several reasons: * In some situations the user can guarantee that thread-safety isn't necessary and don't want to pay the cost of synchronization, e.g., when parsing a very large module. * For things like logging threading is not desirable as the output is not guaranteed to be in stable order. This flag also subsumes the pass manager flag for multi-threading. Differential Revision: https://reviews.llvm.org/D79266	2020-05-02 12:32:25 -07:00
Denis Khalikov	29b955f97c	[mlir][spirv] Handle debug information during (de)serialization. Summary: This is an initial version, currently supports OpString and OpLine for autogenerated operations during (de)serialization. Differential Revision: https://reviews.llvm.org/D79091	2020-05-01 14:11:54 +03:00
Nicolas Vasilache	0d61dcf606	[mlir][EDSC] Make use of InsertGuard Summary: This revision cleans up a layer of complexity in ScopedContext and uses InsertGuard instead of previously manual bookkeeping. The method `getBuilder` is renamed to `getBuilderRef` and spurious copies of OpBuilder are tracked. This results in some canonicalizations not happening anymore in the Linalg matmul to vector test. This test is retired because relying on DRRs for this has been shaky at best. The solution will be better support to write fused passes in C++ with more idiomatic pattern composition and application. Differential Revision: https://reviews.llvm.org/D79208	2020-04-30 18:04:31 -04:00
Nicolas Vasilache	3bdd7fcc34	[mlir][Linalg] Add support to lower named ops to loops. This revision adds support to allow named ops to lower to loops. Linalg.batch_matmul successfully lowers to loops and to LLVM. In the process, this test also activates linalg to affine loops. However padded convolutions to not lower to affine.load atm so this revision overrides the type of underlying load / store operation. Differential Revision: https://reviews.llvm.org/D79135	2020-04-30 13:45:17 -04:00
Lei Zhang	87e07b4c64	[mlir] Use memory effect to detecting allocation This commit marks AllocLikeOp as MemAlloc in StandardOps. Also in Linalg dependency analysis use memory effect to detect allocation. This allows the dependency analysis to be more general and recognize other allocation-like operations. Differential Revision: https://reviews.llvm.org/D78705	2020-04-30 09:20:53 -04:00
Uday Bondhugula	480345381a	[MLIR] Introduce op trait PolyhedralScope (revised) (A previous version of this, `dd2c639c3c`, was reverted.) Introduce op trait PolyhedralScope for ops to define a new scope for polyhedral optimization / affine dialect purposes, thus generalizing such scopes beyond FuncOp. Ops to which this trait is attached will define a new scope for the consideration of SSA values as valid symbols for the purposes of polyhedral analysis and optimization. Update methods that check for dim/symbol validity to work based on this trait. Differential Revision: https://reviews.llvm.org/D79060	2020-04-29 16:08:23 +05:30
Tres Popp	f66c87637a	[MLIR] Give AffineStoreOp and AffineLoadOp Memory SideEffects. Summary: This change results in tests also being changed to prevent dead affine.load operations from being folded away during rewrites. Also move AffineStoreOp and AffineLoadOp to an ODS file. Differential Revision: https://reviews.llvm.org/D78930	2020-04-28 15:45:25 +02:00
Dmitri Gribenko	ef06016d73	Revert "[MLIR] Introduce op trait PolyhedralScope" This reverts commit `dd2c639c3c`. It broke a few things -- the explanation will be posted to the review thread.	2020-04-28 14:50:57 +02:00
Uday Bondhugula	dd2c639c3c	[MLIR] Introduce op trait PolyhedralScope Introduce op trait `PolyhedralScope` for ops to define a new scope for polyhedral optimization / affine dialect purposes, thus generalizing such scopes beyond FuncOp. Ops to which this trait is attached will define a new scope for the consideration of SSA values as valid symbols for the purposes of polyhedral analysis and optimization. Update methods that check for dim/symbol validity to work based on this trait. Differential Revision: https://reviews.llvm.org/D78863	2020-04-28 09:55:31 +05:30
Tres Popp	2d2d696137	[MLIR] Propagate input side effect information Summary: Previously operations like std.load created methods for obtaining their effects but did not inherit from the SideEffect interfaces when their parameters were decorated with the information. The resulting situation was that passes had no information on the SideEffects of std.load/store and had to treat them more cautiously. This adds the inheritance information when creating the methods. As a side effect, many tests are modified, as they were using std.load for testing and this oepration would be folded away as part of pattern rewriting. Tests are modified to use store or to reutn the result of the std.load. Reviewers: mravishankar, antiagainst, nicolasvasilache, herhut, aartbik, ftynse! Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, csigg, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, bader, grosul1, frgossen, Kayjukh, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78802	2020-04-27 11:35:52 +02:00
Sean Silva	5fff169daa	[shape] More constant folding - shape split_at - shape.broadcast - shape.concat - shape.to_extent_tensor Differential Revision: https://reviews.llvm.org/D78821	2020-04-24 16:10:19 -07:00
Sean Silva	d1ad267a56	[shape] Basic constant folding. - Implement a first constant fold for shape.shape_of (more ops coming in subsequent patches) - Implement the right builder interfaces for ShapeType and other types - Splits shape.constant into shape.const_size and shape.const_shape which plays better with dyn_cast and building vs one polymorphic op. Also, fix the RUN line in ops.mlir to properly verify round-tripping.	2020-04-24 15:49:35 -07:00
Frederik Gossen	7e4b139a04	[MLIR] Ensure `gpu.func` must be inside a `gpu.module`. Ensure that `gpu.func` is only used within the dedicated `gpu.module`. Implement the constraint to the GPU dialect and adopt test cases. Differential Revision: https://reviews.llvm.org/D78541	2020-04-24 07:17:48 +00:00
MaheshRavishankar	542668d1e2	[mlir][Linalg] Add support for fusing linalg.tensor_reshape with linalg.generic operations. Differential Revision: https://reviews.llvm.org/D78464	2020-04-23 13:41:47 -07:00
River Riddle	7f85adb54d	[mlir][Standard] Allow select to use an i1 for vector and tensor values It currently requires that the condition match the shape of the selected value, but this is only really useful for things like masks. This revision allows for the use of i1 to mean that all of the vector/tensor is selected. This also matches the behavior of LLVM select. A benefit of this change is that transformations that want to generate selects, like those on the CFG, don't have to special case vector/tensor. Previously the only way to generate a select from an i1 was to use a splat, but that doesn't support dynamically shaped/unranked tensors. Differential Revision: https://reviews.llvm.org/D78690	2020-04-23 04:50:09 -07:00
River Riddle	2fafe7ff59	[mlir][Standard] Add support for canonicalizing branches to passthrough blocks This revision adds support for canonicalizing the following: ``` br ^bb1 ^bb1 br ^bbN(...) br ^bbN(...) ``` Differential Revision: https://reviews.llvm.org/D78683	2020-04-23 04:42:02 -07:00
River Riddle	af331bc52d	[mlir][Standard] Add a canonicalization to simplify cond_br when the successors are identical This revision adds support for canonicalizing the following: ``` cond_br %cond, ^bb1(A, ..., N), ^bb1(A, ..., N) br ^bb1(A, ..., N) ``` If the operands to the successor are different and the cond_br is the only predecessor, we emit selects for the branch operands. ``` cond_br %cond, ^bb1(A), ^bb1(B) %select = select %cond, A, B br ^bb1(%select) ``` Differential Revision: https://reviews.llvm.org/D78682	2020-04-23 04:42:02 -07:00
River Riddle	2f4b303d68	[mlir][Standard] Add canonicalization for collapsing pass through cond_br successors. This revision adds support for the following canonicalization: ``` cond_br %cond, ^bb1, ^bb2 ^bb1 br ^bbN(...) ^bb2 br ^bbK(...) cond_br %cond, ^bbN(...), ^bbK(...) ``` Differential Revision: https://reviews.llvm.org/D78681	2020-04-23 04:42:01 -07:00
Frederik Gossen	0372db05bb	[MLIR] Use nested symbol to identify kernel in `LaunchFuncOp`. Summary: Use a nested symbol to identify the kernel to be invoked by a `LaunchFuncOp` in the GPU dialect. This replaces the two attributes that were used to identify the kernel module and the kernel within seperately. Differential Revision: https://reviews.llvm.org/D78551	2020-04-22 07:44:29 +00:00
Frederik Gossen	648fc95083	[MLIR] Use `kernel` as a short hand for `gpu.kernel` attribute. Summary: Use the shortcu `kernel` for the `gpu.kernel` attribute of `gpu.func`. The parser supports this and test cases are easier to read. Differential Revision: https://reviews.llvm.org/D78542	2020-04-22 07:38:30 +00:00
Frederik Gossen	2813802746	[MLIR] Fix test case for kernel attribute. Summary: Fix a broken test case in the `invalid.mlir` lit test case. `expect` was missing its `e`. Differential Revision: https://reviews.llvm.org/D78540	2020-04-22 07:27:39 +00:00
Pierre Oechsel	128d72751f	[mlir] [linalg] Specify alignment during promotion. The buffer allocated by a promotion can be subject to other transformations afterward. For example it could be vectorized, in which case it is needed to ensure that this buffer is memory-aligned. Differential Revision: https://reviews.llvm.org/D78556	2020-04-21 18:13:40 +02:00
Nicolas Vasilache	538ac26f25	[mlir][Linalg] Create a named batch_matmul op and pipe it through. This revision is the first in a set of improvements that aim at allowing more generalized named Linalg op generation from a mathematical specification. This revision allows creating a new op and checks that the parser, printer and verifier are hooked up properly. This opened up a few design points that will be addressed in the future: 1. A named linalg op has a static region builder instead of an explicitly parsed region. This is not currently compatible with assemblyFormat so a custom parser / printer are needed. 2. The convention for structured ops and tensor return values needs to evolve to allow tensor-land and buffer land specifications to agree 3. ReferenceIndexingMaps and referenceIterators will need to become static to allow building attributes at parse time. 4. Error messages will be improved once we have 3. and we pretty print in custom form. Differential Revision: https://reviews.llvm.org/D78327	2020-04-21 12:09:46 -04:00
Pierre Oechsel	49202476e6	[mlir] [linalg] Fix transform-patterns test. Unfortunately FileCheck ignores directives with whitespace between the directive and the colon (`CHECK :` for example), thus most of the directives of this test were ignored. Differential Revision: https://reviews.llvm.org/D78548	2020-04-21 12:53:45 +02:00
Pierre Oechsel	12dcb89dad	[mlir] [linalg] Only promote selected buffers. The promotion transformation is promoting all input and output buffers of the transformed op. The user might want to only promote some of these buffers. Differential Revision: https://reviews.llvm.org/D78498	2020-04-21 11:50:08 +02:00
Uday Bondhugula	2d231c703d	[MLIR] NFC clean up loop tiling test cases Summary: Improve the checks and drop redundant ones. Rename the outlined maps better. Reviewers: andydavis1, nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, grosul1, frgossen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78514	2020-04-21 01:13:46 +05:30
Uday Bondhugula	3dff8c9109	[MLIR] Fix affine loop tiling utility upper bound bug Fix intra-tile upper bound setting in a scenario where the tile size was larger than the trip count. Differential Revision: https://reviews.llvm.org/D78505	2020-04-21 00:54:01 +05:30
Lei Zhang	f83d502feb	[mlir][spirv] NFC: remove unnecessary `%N =` in test CHECKs	2020-04-20 11:52:29 -04:00
aartbik	186709c6e0	[mlir] [VectorOps] Progressive lowering of vector.broadcast Summary: Rather than having a full, recursive, lowering of vector.broadcast to LLVM IR, it is much more elegant to have a progressive lowering of each vector.broadcast into a lower dimensional vector.broadcast, until only elementary vector operations remain. This results in more elegant, step-wise code, that is easier to understand. Also makes some optimizations in the generated code. Reviewers: nicolasvasilache, mehdi_amini, andydavis1, grosul1 Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, grosul1, frgossen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78071	2020-04-16 21:02:27 -07:00
Nicolas Vasilache	f54312277c	[mlir][Linalg] Drop function attribute from generic ops. The function attribute in generic ops is not paying for itself. A region is the more standardized way of specifying a custom computation. If needed this region can call a function directly. This is deemed more natural than managing a dedicated function attribute. This also simplifies named ops generation by trimming unnecessary complexity. Differential Revision: https://reviews.llvm.org/D78266	2020-04-16 09:47:08 -04:00
Alexander Belyaev	be9c3bdc44	[MLIR] Fix fusion of linalg.indexed_generic producer into tiled (Indexed)GenericOp. Differential Revision: https://reviews.llvm.org/D78209	2020-04-16 10:45:17 +02:00
Jeremy Bruestle	9f3ab92ec8	[MLIR] Improve support for 0-dimensional Affine Maps. Summary: Modified AffineMap::get to remove support for the overload which allowed an ArrayRef of AffineExpr but no context (and gathered the context from a presumed first entry, resulting in bugs when there were 0 results). Instead, we support only a ArrayRef and a context, and a version which takes a single AffineExpr. Additionally, removed some now needless case logic which previously special cased which call to AffineMap::get to use. Reviewers: flaub, bondhugula, rriddle!, nicolasvasilache, ftynse, ulysseB, mravishankar, antiagainst, aartbik Subscribers: mehdi_amini, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, bader, grosul1, frgossen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78226	2020-04-15 14:15:02 -07:00
Uday Bondhugula	af5e83f569	[MLIR] Introduce utility to hoist affine if/else conditions This revision introduces a utility to unswitch affine.for/parallel loops by hoisting affine.if operations past surrounding affine.for/parallel. The hoisting works for both perfect/imperfect nests and in the presence of else blocks. The hoisting is currently to as outermost a level as possible. Uses a test pass to test the utility. Add convenience method Operation::getParentWithTrait<Trait>. Depends on D77487. Differential Revision: https://reviews.llvm.org/D77870	2020-04-16 00:32:34 +05:30
Alex Zinenko	129cf84e69	[mlir] LLVM dialect: support globals without linkage keyword, assuming 'external' Similarly to actual LLVM IR, and to `llvm.mlir.func`, allow the custom syntax of `llvm.mlir.global` to omit the linkage keyword. If omitted, the linkage is assumed to be external. This makes the modeling of globals in the LLVM dialect more consistent, both within the dialect and with LLVM IR. Differential Revision: https://reviews.llvm.org/D78096	2020-04-15 10:58:32 +02:00
Uday Bondhugula	04b5274ede	[MLIR] Introduce applyOpPatternsAndFold for op local rewrites Introduce mlir::applyOpPatternsAndFold which applies patterns as well as any folding only on a specified op (in contrast to applyPatternsAndFoldGreedily which applies patterns only on the regions of an op isolated from above). The caller is made aware of the op being folded away or erased. Depends on D77485. Differential Revision: https://reviews.llvm.org/D77487	2020-04-15 14:10:01 +05:30
MaheshRavishankar	37b520763f	[mlir][Linalg] Handle null affine map returns from inversePermutation. The inversePermutation method returns a null map on failure. Update uses of this method within Linalg to handle this. In LinalgToLoops the null return value was used to emit scalar code. Modify that to return failure, and emit scalar implementation when affine map is "empty", i.e. 1 dims, 0 symbols and no result exprs. Differential Revision: https://reviews.llvm.org/D77964	2020-04-14 14:41:20 -07:00
MaheshRavishankar	3b2f26ab05	[mlir][Linalg] NFC : Fix check for scalar case handling in LinalgToLoops The invertPermutation method does not return a nullptr anymore, but rather returns an empty map for the scalar case. Update the check in LinalgToLoops to reflect this. Also add test case for generating scalar code.	2020-04-13 13:23:01 -07:00
MaheshRavishankar	03391df90e	[mlir][Linalg] Add loop.parallel lowering for all Linalg Ops. The outer parallel loops of a linalg operation is lowered to loop.parallel, with the other loops lowered to loop.for. This gets the lowering to loop.parallel on par with the loop.for lowering. In future the reduction loop could also be lowered to loop.parallel. Also add a utility function that returns the loops that are created. Differential Revision: https://reviews.llvm.org/D77678	2020-04-13 13:19:12 -07:00
Denis Khalikov	ec99d6e62f	[mlir][spirv] Add a `spirv::InterfaceVarABIAttr`. Summary: Add a proper dialect-specific attribute for interface variable ABI. Differential Revision: https://reviews.llvm.org/D77941	2020-04-13 22:47:47 +03:00
Lei Zhang	a9cb529a84	[mlir][spirv] NFC: use Optional to replace SPV_Optional Differential Revision: https://reviews.llvm.org/D78046	2020-04-13 15:44:06 -04:00
Uday Bondhugula	500e03881b	[MLIR] NFC clean up simplify-affine-structures test case NFC clean up for simplify-affine-structures test cases. Rename sets better; avoid suffix numbers; move outlined definitions close to use. This is in preparation for other functionality updates. Differential Revision: https://reviews.llvm.org/D78017	2020-04-13 23:59:21 +05:30
Lei Zhang	a290c3af9d	[mlir][spirv] Improve stride support in array types This commit added stride support in runtime array types. It also adjusted the assembly form for the stride from `[N]` to `stride=N`. This makes the IR more readable, especially for the cases where one mix array types and struct types. Differential Revision: https://reviews.llvm.org/D78034	2020-04-13 14:08:17 -04:00
Kiran Kumar T P	7ecee63e71	[MLIR] Support for taskwait and taskyield operations, and translating the same to LLVM IR This patch adds support for taskwait and taskyield operations in OpenMP dialect and translation of the these constructs to LLVM IR. The OpenMP IRBuilder is used for this translation. The patch includes code changes and a testcase modifications. Differential Revision: https://reviews.llvm.org/D77634	2020-04-10 07:42:34 +00:00
Nicolas Vasilache	2d32ee0d7a	[mlir][Vector] Update lowering of vector ops to llvm intrinsics to use row-major. Summary: LLVM matrix intrinsics recently introduced an option to support row-major mode. This matches the MLIR vector model, this revision switches to row-major. A corner case related to degenerate sizes was also fixed upstream. This revision removes the guard against this corner case. A bug was uncovered on the output vector construction which this revision also fixes. Lastly, this has been tested on a small size and benchmarked independently: no visible performance regression is observed. In the future, when matrix intrinsics support per op attribute, we can more aggressively translate to that and avoid inserting MLIR-level transposes. This has been tested independently to work on small matrices. Differential Revision: https://reviews.llvm.org/D77761	2020-04-09 16:37:28 -04:00
Nicolas Vasilache	6fb6a4d7f9	[mlir][Linalg] Add a test for a fused Linalg pass based on DRR to go from matmul to vectors This revision builds a simple "fused pass" consisting of 2 levels of tiling, memory promotion and vectorization using linalg transformations written as composable pattern rewrites.	2020-04-08 16:54:40 -04:00
River Riddle	400ad6f95d	[mlir] Eliminate the remaining usages of cl::opt instead of PassOption. Summary: Pass options are a better choice for various reasons and avoid the need for static constructors. Differential Revision: https://reviews.llvm.org/D77707	2020-04-08 13:05:08 -07:00
Uday Bondhugula	ae8ebeca51	[MLIR] Fix test case typos in unroll.mlir Fix check line typos in test/Dialect/Affine/unroll.mlir. Differential Revision: https://reviews.llvm.org/D77738	2020-04-08 22:21:07 +05:30
Andy Davis	7006daa548	[MLIR][Vector] Update ShapeCastOp folder to use producer-consumer value forwarding. Summary: Update ShapeCastOp folder to use producer-consumer value forwarding. Support is added for tracking sub-vectors through trivial shape cast operations, where the sub-vector shape is preserved across shape cast operations and only leading ones are added or removed. Support is preserved for cancelling shape cast operations. One unit test is added and two are updated. Reviewers: aartbik, nicolasvasilache Reviewed By: aartbik, nicolasvasilache Subscribers: frgossen, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, grosul1, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77253	2020-04-08 08:55:37 -07:00
Nicolas Vasilache	3cb1f35df2	[mlir][Linalg] Use subview instead of linalg.slice in Promotion.cpp This revision removes the reliance of Promotion on `linalg.slice` which is meant for the rank-reducing case. Differential Revision: https://reviews.llvm.org/D77676	2020-04-07 23:52:31 -04:00
Stella Laurenzo	f5deb0878d	Remove FxpMathOps dialect and Quantizer tool. Summary: * Removal of FxpMathOps was discussed on the mailing list. * Will send a courtesy note about also removing the Quantizer (which had some dependencies on FxpMathOps). * These were only ever used for experimental purposes and we know how to get them back from history as needed. * There is a new proposal for more generalized quantization tooling, so moving these older experiments out of the way helps clean things up. Subscribers: mgorny, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, grosul1, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77479	2020-04-07 13:22:39 -07:00
Lei Zhang	47b234944d	[mlir][spirv] Fix wrong Phi parent block for back-to-back loops If we have two back-to-back loops with block arguments, the OpPhi instructions generated for the second loop's block arguments should have use the merge block of the first SPIR-V loop structure as their incoming parent block. Differential Revision: https://reviews.llvm.org/D77543	2020-04-07 12:54:54 -04:00
Uday Bondhugula	70da33bf30	[MLIR] fix/update affine data copy utility for max/min bounds Fix point-wise copy generation to work with bounds that have max/min. Change structure of copy loop nest to use absolute loop indices and subtracting base from the indexes of the fast buffers. Update supporting utilities: Fix FlatAffineConstraints::getLowerAndUpperBound to look at equalities as well and for a missing division. Update unionBoundingBox to not discard common constraints (leads to a tighter system). Update MemRefRegion::getConstantBoundingSizeAndShape to add memref dimension constraints. Run removeTrivialRedundancy at the end of MemRefRegion::compute. Run single iteration loop promotion and load/store canonicalization after affine data copy (in its test pass as well). Differential Revision: https://reviews.llvm.org/D77320	2020-04-07 13:55:42 +05:30
Nicolas Vasilache	8f229989d5	[mlir][Linalg] Add a linalg.tensor_reshape to operate on tensors Summary: This revision adds a tensor_reshape operation that operates on tensors. In the tensor world the constraints are less stringent and we can allow more arbitrary dynamic reshapes, as long as they are contractions. The expansion of a dynamic dimension into multiple dynamic dimensions is under-specified and is punted on for now. Differential Revision: https://reviews.llvm.org/D77360	2020-04-06 11:19:17 -04:00
River Riddle	0359b86d8b	[mlir][ODS] Add support for variadic regions. Summary: This revision adds support for marking the last region as variadic in the ODS region list with the VariadicRegion directive. Differential Revision: https://reviews.llvm.org/D77455	2020-04-05 01:03:38 -07:00
Kazuaki Ishizaki	5aacce3db2	[mlir] NFC: Fix trivial typo Differential Revision: https://reviews.llvm.org/D77473	2020-04-05 11:30:30 +09:00
Alex Grosul	3a5192098c	[mlir][VectorOps] Implement canonicalization for TransposeOp. Two back-to-back transpose operations are combined into a single transpose, which uses a combination of their permutation vectors. Differential Revision: https://reviews.llvm.org/D77331	2020-04-02 18:36:40 -07:00
Uday Bondhugula	7c771631c6	[MLIR][NFC] drop unnecessary matches in affine dma generate test case Drop unnecessary matches in affine DMA generate test case. Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, grosul1, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77243	2020-04-02 03:02:07 +05:30
Uday Bondhugula	5e8093134a	[MLIR] Add method to drop duplicate result exprs from AffineMap Add a method that given an affine map returns another with just its unique results. Use this to drop redundant bounds in max/min for affine.for. Update affine.for's canonicalization pattern and createCanonicalizedForOp to use this. Differential Revision: https://reviews.llvm.org/D77237	2020-04-02 03:00:19 +05:30
Uday Bondhugula	572890f1d3	[MLIR][NFC] clean up affine data copy test case Capture maps to test better; drop unnecessary matches Differential Revision: https://reviews.llvm.org/D77196	2020-04-01 22:37:49 +05:30
Hanhan Wang	6dd696ae4f	[mlir][Linalg] Extend fusion to support WAW atm on buffers. Summary: The RAW fusion happens only if the produecer block dominates the consumer block. The WAW pattern also works with the precondition. I.e., if a producer can dominate the consumer, they can fairly fuse together. Since they are all tilable, we can think the pattern like this way: Input: ``` linalg_op1 view tile_loop subview_2 linalg_op2 subview_2 ``` Tile the first Linalg op as same as the second Linalg. ``` tile_loop subview_1 linalg_op1 subview_1 tile_loop subview_2 liangl_op2 subview_2 ``` Since the first Linalg op is tilable in the same way and the computation are independently, it's fair to fuse it with the second Linalg op. ``` tile_loop subview_1 linalg_op1 subview_1 linalg_op2 subview_2 ``` In short, this patch includes: - Handling both RAW and WAW pattern. - Adding a interface method to get input and output buffers. - Exposing a method to get a StringRef of a dependency type. - Fixing existing WAW tests and add one more use case: initialize the buffer before conv op. Differential Revision: https://reviews.llvm.org/D76897	2020-03-31 21:33:50 -07:00
Hanhan Wang	69ddee1d2a	[mlir][Linalg] Introduce linalg.pooling_min/max/sum op. Summary: Performs an N-D pooling operation similarly to the description in the TF documentation: https://www.tensorflow.org/api_docs/python/tf/nn/pool Different from the description, this operation doesn't perform on batch and channel. It only takes tensors of rank `N`. ``` output[x[0], ..., x[N-1]] = REDUCE_{z[0], ..., z[N-1]} input[ x[0] * strides[0] - pad_before[0] + dilation_rate[0]z[0], ... x[N-1]strides[N-1] - pad_before[N-1] + dilation_rate[N-1]*z[N-1] ], ``` The required optional arguments are: - strides: an i64 array specifying the stride (i.e. step) for window loops. - dilations: an i64 array specifying the filter upsampling/input downsampling rate - padding: an i64 array of pairs (low, high) specifying the number of elements to pad along a dimension. If strides or dilations attributes are missing then the default value is one for each of the input dimensions. Similarly, padding values are zero for both low and high in each of the dimensions, if not specified. Differential Revision: https://reviews.llvm.org/D76414	2020-03-31 21:21:54 -07:00
Alex Grosul	855e738be2	[VectorOps] Implement a simple folder for identity vector.transpose operations. Differential Revision: https://reviews.llvm.org/D77088	2020-03-31 17:03:10 -07:00
MaheshRavishankar	da7b6fe942	[mlir][Linalg] Allow tiling of batch dimension for convolution ops with padding. Existing tiling implementation of Linalg would still work for tiling the batch dimensions of the convolution op. Differential Revision: https://reviews.llvm.org/D76637	2020-03-31 09:22:38 -07:00
Andy Davis	31a346cc35	[MLIR][Vector] Add support for TupleGetOp folding through InsertSlicesOp and ExtractSlicesOp. Summary: Add support for TupleGetOp folding through InsertSlicesOp and ExtractSlicesOp. Vector-to-vector transformations for unrolling and lowering to hardware vectors can generate chains of structured vector operations (InsertSlicesOp, ExtractSlicesOp and ShapeCastOp) between the producer of a hardware vector value and its consumer. Because InsertSlicesOp, ExtractSlicesOp and ShapeCastOp are structured, we can track the location (tuple index and vector offsets) of the consumer vector value through the chain of structured operations to the producer, enabling a much more powerful producer-consumer fowarding of values through structured ops and tuple, which in turn enables a more powerful TupleGetOp folding transformation. Reviewers: nicolasvasilache, aartbik Reviewed By: aartbik Subscribers: grosul1, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76889	2020-03-31 08:39:17 -07:00
Ahmed Taei	221fa96cd4	Fix linalg.generic access of hoisted constants Summary: Otherwise the added @generic_const_int will fail Reviewers: nicolasvasilache, rriddle, mravishankar Subscribers: mehdi_amini, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, grosul1, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77109	2020-03-30 21:15:41 -07:00
Mehdi Amini	111c932855	Fix test: add `-allow-unregistered-dialect` to Affine/loop-permute.mlir test (missed during rebase)	2020-03-30 20:14:47 +00:00
Mehdi Amini	bab5bcf8fd	Add a flag on the context to protect against creation of operations in unregistered dialects Differential Revision: https://reviews.llvm.org/D76903	2020-03-30 19:37:31 +00:00
Uday Bondhugula	f273e5c507	[MLIR] Fix permuteLoops utility Rewrite mlir::permuteLoops (affine loop permutation utility) to fix incorrect approach. Avoiding using sinkLoops entirely - use single move approach. Add test pass. This fixes https://bugs.llvm.org/show_bug.cgi?id=45328 Depends on D77003. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D77004	2020-03-30 23:38:23 +05:30
Uday Bondhugula	43a95a543f	[MLIR] Introduce full/partial tile separation using if/else This patch introduces a utility to separate full tiles from partial tiles when tiling affine loop nests where trip counts are unknown or where tile sizes don't divide trip counts. A conditional guard is generated to separate out the full tile (with constant trip count loops) into the then block of an 'affine.if' and the partial tile to the else block. The separation allows the 'then' block (which has constant trip count loops) to be optimized better subsequently: for eg. for unroll-and-jam, register tiling, vectorization without leading to cleanup code, or to offload to accelerators. Among techniques from the literature, the if/else based separation leads to the most compact cleanup code for multi-dimensional cases (because a single version is used to model all partial tiles). INPUT affine.for %i0 = 0 to %M { affine.for %i1 = 0 to %N { "foo"() : () -> () } } OUTPUT AFTER TILING W/O SEPARATION map0 = affine_map<(d0) -> (d0)> map1 = affine_map<(d0)[s0] -> (d0 + 32, s0)> affine.for %arg2 = 0 to %M step 32 { affine.for %arg3 = 0 to %N step 32 { affine.for %arg4 = #map0(%arg2) to min #map1(%arg2)[%M] { affine.for %arg5 = #map0(%arg3) to min #map1(%arg3)[%N] { "foo"() : () -> () } } } } OUTPUT AFTER TILING WITH SEPARATION map0 = affine_map<(d0) -> (d0)> map1 = affine_map<(d0) -> (d0 + 32)> map2 = affine_map<(d0)[s0] -> (d0 + 32, s0)> #set0 = affine_set<(d0, d1)[s0, s1] : (-d0 + s0 - 32 >= 0, -d1 + s1 - 32 >= 0)> affine.for %arg2 = 0 to %M step 32 { affine.for %arg3 = 0 to %N step 32 { affine.if #set0(%arg2, %arg3)[%M, %N] { // Full tile. affine.for %arg4 = #map0(%arg2) to #map1(%arg2) { affine.for %arg5 = #map0(%arg3) to #map1(%arg3) { "foo"() : () -> () } } } else { // Partial tile. affine.for %arg4 = #map0(%arg2) to min #map2(%arg2)[%M] { affine.for %arg5 = #map0(%arg3) to min #map2(%arg3)[%N] { "foo"() : () -> () } } } } } The separation is tested via a cmd line flag on the loop tiling pass. The utility itself allows one to pass in any band of contiguously nested loops, and can be used by other transforms/utilities. The current implementation works for hyperrectangular loop nests. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76700	2020-03-28 06:58:35 +05:30
Uday Bondhugula	92744f6247	[MLIR] Add flat affine constraints method to round trip integer set - add method to get back an integer set from flat affine constraints; this allows a round trip - use this to complete the simplification of integer sets in -simplify-affine-structures - update FlatAffineConstraints::removeTrivialRedundancy to also do GCD tightening and normalize by GCD (while still keeping it linear time). Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>	2020-03-26 12:07:13 +05:30
Frej Drejhammar	d8981ce5b9	[mlir][Parser] Fix attribute parser errors for ui64 Summary: The attribute parser fails to correctly parse unsigned 64 bit attributes as the check `isNegative ? (int64_t)-val.getValue() >= 0 : (int64_t)val.getValue() < 0` will falsely detect an overflow for unsigned values larger than 2^63-1. This patch reworks the overflow logic to instead of doing arithmetic on int64_t use APInt::isSignBitSet() and knowledge of the attribute type. Test-cases which verify the de-facto behavior of the parser and triggered the previous faulty handing of unsigned 64 bit attrbutes are also added. Differential Revision: https://reviews.llvm.org/D76493	2020-03-25 11:57:16 -07:00
aartbik	8d46bfa808	[mlir] [VectorOps] A "reference" lowering of vector.transpose to LLVM IR Summary: Makes the vector.tranpose runnable on CPU. Reviewers: nicolasvasilache, andydavis1, rriddle Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76644	2020-03-23 19:01:38 -07:00
Uday Bondhugula	b873761496	[MLIR][NFC] Move some of the affine transforms / tests to dialect dirs Move some of the affine transforms and their test cases to their respective dialect directory. This patch does not complete the move, but takes care of a good part. Renames: prefix 'affine' to affine loop tiling cl options, vectorize -> super-vectorize Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76565	2020-03-23 08:25:07 +05:30
River Riddle	e9482ed194	[mlir] Move several static cl::opts to be pass options instead. This removes the reliance on global options, and also simplifies the pass registration. Differential Revision: https://reviews.llvm.org/D76552	2020-03-22 03:16:21 -07:00
aartbik	479ee11061	[mlir] [VectorOps] Introduce vector.transpose Summary: Introduced in order to introduce specialized lowering passes that implement transposition operations efficiently. Reviewers: nicolasvasilache, andydavis1 Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76460	2020-03-20 15:47:44 -07:00
Rob Suderman	e708471395	[mlir][NFC] Cleanup AffineOps directory structure Summary: Change AffineOps Dialect structure to better group both IR and Tranforms. This included extracting transforms directly related to AffineOps. Also move AffineOps to Affine. Differential Revision: https://reviews.llvm.org/D76161	2020-03-20 14:23:43 -07:00
Ahmed Taei	08a9147349	[mlir][LLVMIR] Fix fusion for rank-0 tensors Summary: This diff fixes fusion craching for ops with rank-0 tensors Reviewers: mravishankar, nicolasvasilache, rriddle! Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76479	2020-03-20 13:17:19 -07:00
Nicolas Vasilache	462db62053	[mlir][AVX512] Start a primitive AVX512 dialect The Vector Dialect [document](https://mlir.llvm.org/docs/Dialects/Vector/) discusses the vector abstractions that MLIR supports and the various tradeoffs involved. One of the layer that is missing in OSS atm is the Hardware Vector Ops (HWV) level. This revision proposes an AVX512-specific to add a new Dialect/Targets/AVX512 Dialect that would directly target AVX512-specific intrinsics. Atm, we rely too much on LLVM’s peephole optimizer to do a good job from small insertelement/extractelement/shufflevector. In the future, when possible, generic abstractions such as VP intrinsics should be preferred. The revision will allow trading off HW-specific vs generic abstractions in MLIR. Differential Revision: https://reviews.llvm.org/D75987	2020-03-20 14:11:57 -04:00
Valentin Clement	d4d62fcab6	[MLIR] Add test for multiple gpu.all_reduce in the same kernel when lowering to NVVM Summary: This patch add tests when lowering multiple `gpu.all_reduce` operations in the same kernel. This was previously failing. Differential Revision: https://reviews.llvm.org/D75930	2020-03-19 16:36:38 +01:00
Shraiysh Vaishay	ff77397fcf	[mlir] Added llvm.resume and personality functions in LLVM IR Dialect `llvm.resume` is similar to `llvm.return` except that has to be exactly one operand and that should be derived from a `llvm.landingpad` instruction. Any function having `llvm.landingpad` instruction must have a personality attribute. Example: LLVM IR ``` define dso_local i32 @main() personality i32 (...)* @__gxx_personality_v0 { invoke void @foo(i32 42) to label %3 unwind label %1 1: ; preds = %0 %2 = landingpad i8* catch i8** @_ZTIi catch i8* bitcast (i8** @_ZTIi to i8) resume i8 %2 3: ; preds = %0 ret i32 1 } ``` MLIR - LLVM IR Dialect ``` llvm.func @main() -> !llvm.i32 attributes {personality = @__gxx_personality_v0} { %0 = llvm.mlir.constant(1 : i32) : !llvm.i32 %1 = llvm.mlir.addressof @_ZTIi : !llvm<"i8"> %2 = llvm.bitcast %1 : !llvm<"i8"> to !llvm<"i8"> %3 = llvm.mlir.addressof @_ZTIi : !llvm<"i8"> %4 = llvm.mlir.constant(42 : i32) : !llvm.i32 llvm.invoke @foo(%4) to ^bb2 unwind ^bb1 : (!llvm.i32) -> () ^bb1: // pred: ^bb0 %5 = llvm.landingpad (catch %3 : !llvm<"i8">) (catch %2 : !llvm<"i8">) : !llvm<"i8"> llvm.resume %5 : !llvm<"i8"> ^bb2: // pred: ^bb0 llvm.return %0 : !llvm.i32 } ``` Differential Revision: https://reviews.llvm.org/D71888	2020-03-19 13:14:25 +01:00
Alex Zinenko	bc18624b40	[mlir] vector.type_cast: disallow memrefs with layout in verifier Summary: These are not supported by any of the code using `type_cast`. In the general case, such casting would require memrefs to handle a non-contiguous vector representation or misaligned vectors (e.g., if the offset of the source memref is not divisible by vector size, since offset in the target memref is expressed in the number of elements). Differential Revision: https://reviews.llvm.org/D76349	2020-03-19 10:15:41 +01:00
Lei Zhang	58df5e6d9a	[mlir][spirv] Plumbing target environment into type converter This commit unifies target environment queries into a new wrapper class spirv::TargetEnv and shares across various places needing the functionality. We still create multiple instances of TargetEnv though given the parent components (type converters, passes, conversion targets) have different lifetimes. In the meantime, LowerABIAttributesPass is updated to take into consideration the target environment, which requires updates to tests to provide that. Differential Revision: https://reviews.llvm.org/D76242	2020-03-18 20:11:05 -04:00
Lei Zhang	3b35f9d8b5	[mlir][spirv] Use memref memory space for storage class Previously in SPIRVTypeConverter, we always convert memref types to StorageBuffer regardless of their memory spaces. This commit fixes that to let the conversion to look into memory space properly. For this purpose, a mapping between SPIR-V storage class and memref memory space is introduced. The mapping is arbitary decided at the moment and the hope is that we can leverage string memory space later to be more clear. Now spv.interface_var_abi cannot contain storage class unless it's attached to a scalar value, where we need the storage class as side channel information. Verifications and tests are properly adjusted. Differential Revision: https://reviews.llvm.org/D76241	2020-03-18 20:11:04 -04:00
Nicolas Vasilache	2fae7878d5	[mlir][Vector] Mostly-NFC - Restructure options for lowering to LLVM Matrix Intrinsics Summary: This revision restructures the calling of vector transforms to make it more flexible to ask for lowering through LLVM matrix intrinsics. This also makes sure we bail out in degenerate cases (i.e. 1) in which LLVM complains about not being able to scalarize. Differential Revision: https://reviews.llvm.org/D76266	2020-03-17 22:58:02 -04:00
Rob Suderman	4d60f47b08	[mlir][NFC] Renamed VectorOps to Vector Summary: Renamed VectorOps to Vector to avoid the redundant Ops suffix. Differential Revision: https://reviews.llvm.org/D76317	2020-03-17 15:28:08 -07:00
Rob Suderman	363dd3f394	[mlir][NFC] Rename QuantOps to Quant Summary: Renamed QuantOps to Quant to avoid the Ops suffix. All dialects will contain ops, so the Ops suffix is redundant. Differential Revision: https://reviews.llvm.org/D76318	2020-03-17 15:16:47 -07:00
Jacques Pienaar	9a65d683e0	[mlir] Add target for Shape dialect Summary: Add targets and basic printing/parsing of types in Shape dialect. Differential Revision: https://reviews.llvm.org/D76321	2020-03-17 14:54:25 -07:00
Sagar Jain	76cf14035b	[MLIR] Added llvm.fence This patch adds llvm.fence. I tried not to change the syntax much. syntax: LLVM IR `fence [syncscope("<target-scope>")] <ordering>` MLIR LLVM Dialect `llvm.fence [syncscope("<target-scope>")] <ordering>` example: LLVM IR: `fence syncscope("agent") seq_cst` MLIR: `llvm.fence syncscope("agent") seq_cst` Differential Revision: https://reviews.llvm.org/D75645	2020-03-17 17:53:37 +01:00
Alex Zinenko	e119980f3f	[mlir] LLVM dialect: move ensureDistinctSuccessors out of std->LLVM conversion MLIR supports terminators that have the same successor block with different block operands, which cannot be expressed in the LLVM's phi-notation as the block identifier is used to tell apart the predecessors. This limitation can be worked around by branching to a new block instead, with this new block unconditionally branching to the original successor and forwarding the argument. Until now, this transformation was performed during the conversion from the Standard to the LLVM dialect. This does not scale well to multiple dialects targeting the LLVM dialect as all of them would have to be aware of this limitation and perform the preparatory transformation. Instead, do it as a separate pass and run it immediately before the translation. Differential Revision: https://reviews.llvm.org/D75619	2020-03-17 15:22:14 +01:00
Uday Bondhugula	332f0b3cd4	Affine expr simplification for add of const multiple of same expression - Detect "c_1 * expr + c_2 * expr" as (c_1 + c_2) * expr - subsumes things like 'expr - expr' and "expr * -1 + expr" as 0. - change AffineConstantExpr ctor to allow default null init Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76233	2020-03-17 08:22:17 +05:30
Feng Liu	166f83f436	[QuantOps] Add the quant region definition Summary: This regional op in the QuantOps dialect will be used to wrap high-precision ops into atomic units for quantization. All the values used by the internal ops are captured explicitly by the op inputs. The quantization parameters of the inputs and outputs are stored in the attributes. Subscribers: jfb, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75972	2020-03-16 15:44:43 -07:00
Hanhan Wang	92f7e8133a	[mlir][Linalg] Implement padding for linalg.conv and lowering to loops. Summary: To enable this, two changes are needed: 1) Add an optional attribute `padding` to linalg.conv. 2) Compute if the indices accessing is out of bound in the loops. If so, use the padding value `0`. Otherwise, use the value derived from load. In the patch, the padding only works for lowering without other transformations, e.g., tiling, fusion, etc. Differential Revision: https://reviews.llvm.org/D75722	2020-03-13 14:35:58 -07:00
Nicolas Vasilache	bbf3ef8541	[mlir][Vector]Lower vector.contract to llvm.intr.matrix_multiply Summary: This revision adds lowering of vector.contract to llvm.intr.matrix_multiply. Note that there is currently a mismatch between the MLIR vector dialect which expects row-major layout and the LLVM matrix intrinsics which expect column major layout. As a consequence, we currently only match a vector.contract with indexing maps that express column-major matrix multiplication. Other cases would require additional transposes and it is better to wait for LLVM intrinsics to provide a per-operation attribute that would specify which layout is expected. A separate integration test, not submitted to MLIR core, has independently verified that correct execution occurs on a 2x2x2 matrix multiplication. Differential Revision: https://reviews.llvm.org/D76014	2020-03-13 16:33:23 -04:00
aartbik	a213ece30b	[mlir] [VectorOps,LinAlg] Remove direct LLVM lowering for vector.broadcast Summary: The direct lowering of vector.broadcast into LLVM has been replaced by progressive lowering into elementary vector ops. This also required a small refactoring of a llvm.mlir test that used a direct vector.broadcast operator (just to define a matmul). Reviewers: nicolasvasilache, andydavis1, rriddle Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76143	2020-03-13 11:42:51 -07:00
Lei Zhang	e5c85a5a4f	[mlir][spirv] Support querying type extension/capability requirements Previously we only consider the version/capability/extension requirements on ops themselves. Some types in SPIR-V also require special extensions or capabilities to be used. For example, non-32-bit integers/floats will require different capabilities and/or extensions depending on where they are used because it may mean special hardware abilities. This commit adds query methods to SPIR-V type class hierarchy to support querying extensions and capabilities. We don't go through ODS for auto-generating such information given that we don't have them in SPIR-V machine readable grammar and there are just a few types. Differential Revision: https://reviews.llvm.org/D75875	2020-03-12 19:37:45 -04:00
Lei Zhang	3148f10b17	[mlir][spirv] Use spv.vce in spv.module and wire up (de)serialization This commits changes the definition of spv.module to use the #spv.vce attribute for specifying (version, capabilities, extensions) triple so that we can have better API and custom assembly form. Since now we have proper modelling of the triple, (de)serialization is wired up to use them. With the new UpdateVCEPass, we don't need to manually specify the required extensions and capabilities anymore when creating a spv.module. One just need to call UpdateVCEPass before serialization to get the needed version/extensions/capabilities. Differential Revision: https://reviews.llvm.org/D75872	2020-03-12 19:37:45 -04:00
Lei Zhang	9414db1090	[mlir][spirv] Add a pass to deduce version/extension/capability Creates an operation pass that deduces and attaches the minimal version/ capabilities/extensions requirements for spv.module ops. For each spv.module op, this pass requires a `spv.target_env` attribute on it or an enclosing module-like op to drive the deduction. The reason is that an op can be enabled by multiple extensions/capabilities. So we need to know which one to pick. `spv.target_env` gives the hard limit as for what the target environment can support; this pass deduces what are actually needed for a specific spv.module op. Differential Revision: https://reviews.llvm.org/D75870	2020-03-12 19:37:45 -04:00
Lei Zhang	e115a40f50	[mlir][spirv] Use separate attribute for (version, capabilities, extensions) We also need the (version, capabilities, extensions) triple on the spv.module op. Thus far we have been using separate 'extensions' and 'capabilities' attributes there and 'version' is missing. Creating a separate attribute for the trip allows us to reuse the assembly form and verification. Differential Revision: https://reviews.llvm.org/D75868	2020-03-12 19:37:45 -04:00
River Riddle	907403f342	[mlir] Add a new `ConstantLike` trait to better identify operations that represent a "constant". The current mechanism for identifying is a bit hacky and extremely adhoc, i.e. we explicit check 1-result, 0-operand, no side-effect, and always foldable and then assume that this is a constant. Adding a trait adds structure to this, and makes checking for a constant much more efficient as we can guarantee that all of these things have already been verified. Differential Revision: https://reviews.llvm.org/D76020	2020-03-12 14:26:15 -07:00
River Riddle	7c211cf3af	[mlir][NFC] Move the definition of AffineApplyOp to ODS This has been a long standing cleanup TODO. Differential Revision: https://reviews.llvm.org/D76019	2020-03-12 14:26:15 -07:00
aartbik	078776a679	[mlir] [VectorOps] Progressively lower vector.outerproduct to LLVM Summary: This replaces the direct lowering of vector.outerproduct to LLVM with progressive lowering into elementary vectors ops to avoid having the similar lowering logic at several places. NOTE1: with the new progressive rule, the lowered llvm is slightly more elaborate than with the direct lowering, but the generated assembly is just as optimized; still if we want to stay closer to the original, we should add a "broadcast on extract" to shuffle rewrite (rather than special cases all the lowering steps) NOTE2: the original outerproduct lowering code should now be removed but some linalg test work directly on vector and contain some dead code, so this requires another CL Reviewers: nicolasvasilache, andydavis1 Reviewed By: nicolasvasilache, andydavis1 Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75956	2020-03-12 13:45:42 -07:00
Valentin Clement	c7380995f8	[MLIR] Add `and`, `or`, `xor`, `min`, `max` too gpu.all_reduce and the nvvm lowering Summary: This patch add some builtin operation for the gpu.all_reduce ops. - for Integer only: `and`, `or`, `xor` - for Float and Integer: `min`, `max` This is useful for higher level dialect like OpenACC or OpenMP that can lower to the GPU dialect. Differential Revision: https://reviews.llvm.org/D75766	2020-03-11 14:07:04 +01:00
Stephan Herhut	f6790a1c63	Revert "[MLIR] Add `and`, `or`, `xor`, `min`, `max` too gpu.all_reduce and the nvvm lowering" Attribution to original author got lost.	2020-03-11 14:07:04 +01:00
Stephan Herhut	2eff566b07	[MLIR] Add `and`, `or`, `xor`, `min`, `max` too gpu.all_reduce and the nvvm lowering Summary: This patch add some builtin operation for the gpu.all_reduce ops. - for Integer only: `and`, `or`, `xor` - for Float and Integer: `min`, `max` This is useful for higher level dialect like OpenACC or OpenMP that can lower to the GPU dialect. Differential Revision: https://reviews.llvm.org/D75766	2020-03-10 21:09:06 +01:00
Nicolas Vasilache	47ec8702cb	[mlir][Linalg] Revisit 0-D abstraction This revision takes advantage of the empty AffineMap to specify the 0-D edge case. This allows removing a bunch of annoying corner cases that ended up impacting users of Linalg. Differential Revision: https://reviews.llvm.org/D75831	2020-03-10 15:14:09 -04:00
Nicolas Vasilache	90322403c2	[mlir][Vector] Allow lowering of vector.shape_cast 2D <-> 1D Summary: This will support the progressive lowering of: ``` vector.contract -> downcast + vector.matrix_multiply + upcast -> llvm.intr.matrix ``` Differential Revision: https://reviews.llvm.org/D75776	2020-03-09 13:14:39 -04:00
Alexander Belyaev	3147342ae7	[MLIR] Change custom printer/parser for loop.parallel and loop.reduce.	2020-03-09 15:11:48 +01:00
aartbik	0d924700a6	[mlir] [VectorOps] Merge VectorReduction/VectorReductionV2 into one Op Summary: Paying off some technical debt in VectorOps, where I introduced a special op for a fused accumulator into reduction to avoid some issues around printing and parsing an optional accumulator. This CL merges the two into one op again and does things the right way (still would be nice to have "assemblyFormat" for optional operands though....). Reviewers: nicolasvasilache, andydavis1, ftynse, rriddle Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75699	2020-03-05 13:07:31 -08:00
River Riddle	cb1777127c	[mlir] Remove successor operands from the Operation class Summary: This revision removes all of the functionality related to successor operands on the core Operation class. This greatly simplifies a lot of handling of operands, as well as successors. For example, DialectConversion no longer needs a special "matchAndRewrite" for branching terminator operations.(Note, the existing method was also broken for operations with variadic successors!!) This also enables terminator operations to define their own relationships with successor arguments, instead of the hardcoded "pass-through" behavior that exists today. Differential Revision: https://reviews.llvm.org/D75318	2020-03-05 12:53:02 -08:00
River Riddle	01f7431b5b	[mlir][DeclarativeParser] Add support for formatting operations with AttrSizedOperandSegments. This attribute details the segment sizes for operand groups within the operation. This revision add support for automatically populating this attribute in the declarative parser. Differential Revision: https://reviews.llvm.org/D75315	2020-03-05 12:51:28 -08:00
River Riddle	c0fd5e657e	[mlir] Add traits for verifying the number of successors and providing relevant accessors. This allows for simplifying OpDefGen, as well providing specializing accessors for the different successor counts. This mirrors the existing traits for operands and results. Differential Revision: https://reviews.llvm.org/D75313	2020-03-05 12:49:59 -08:00
MaheshRavishankar	3f44495dfd	[mlir][GPU] Expose the functionality to create a GPUFuncOp from a LaunchOp The current setup of the GPU dialect is to model both the host and device side codegen. For cases (like IREE) the host side modeling might not directly fit its use case, but device-side codegen is still valuable. First step in accessing just the device-side functionality of the GPU dialect is to allow just creating a gpu.func operation from a gpu.launch operation. In addition this change also "inlines" operations into the gpu.func op at time of creation instead of this being a later step. Differential Revision: https://reviews.llvm.org/D75287	2020-03-05 11:03:51 -08:00
MaheshRavishankar	755c050200	[mlir][Linalg] Fix load/store operations generated while lower loops when output has zero rank. While lowering to loops, no indices should be used in the load/store operation if the buffer is zero-rank. Differential Revision: https://reviews.llvm.org/D75391	2020-03-04 17:04:30 -08:00
Lei Zhang	9600b55ac8	[mlir][spirv] Support integer signedness This commit updates SPIR-V dialect to support integer signedness by relaxing various checks for signless to just normal integers. The hack for spv.Bitcast can now be removed. Differential Revision: https://reviews.llvm.org/D75611	2020-03-04 15:14:11 -05:00
Lei Zhang	5b2cc6c3d0	[mlir][ods] Improve integer signedness modelling A previous commit added support for integer signedness in C++ IntegerType. This change introduces ODS definitions for integer types and integer (element) attributes w.r.t. signedness. This commit also updates various existing definitions' descriptions to mention signless where suitable to make it more clear. Positive and non-negative integer attributes are removed to avoid the explosion of subclasses. Instead, one should use more atmoic constraints together with Confined to model that. For example, `Confined<..., [IntPositive]>`. Differential Revision: https://reviews.llvm.org/D75610	2020-03-04 15:05:42 -05:00
Alexander Belyaev	0145a26c65	[MLIR] Add explicit initial values for loop.parallel op. Differential Revision: https://reviews.llvm.org/D75206	2020-03-03 15:36:10 +01:00
Sagar Jain	d85821dfa6	[MLIR] Added llvm.freeze This patch adds llvm.freeze & processes undef constants from LLVM IR. Syntax: LLVM IR `<result> = freeze ty <val>` MLIR LLVM Dialect: `llvm.freeze val attr-dict : type` Example: LLVM IR: `%3 = freeze i32 5` MLIR: `%6 = llvm.freeze %5 : !llvm.i32` Differential Revision: https://reviews.llvm.org/D75329	2020-03-02 10:24:01 +01:00
Benjamin Kramer	5abf128d64	Add a pass that specializes parallel loops for easier unrolling and vectorization This matches loops with a affine.min upper bound, limiting the trip count to a constant, and rewrites them into two loops, one with constant upper bound and one with variable upper bound. The assumption is that the constant upper bound loop will be unrolled and vectorized, which is preferable if this is the hot path. Differential Revision: https://reviews.llvm.org/D75240	2020-02-28 19:47:23 +01:00
Alex Zinenko	54e5600e4d	[mlir] fix wrong symbol order in AffineApplyNormalizer Summary: AffineApplyNormalizer provides common logic for folding affine maps that appear in affine.apply into other affine operations that use the result of said affine.apply. In the process, affine maps of both operations are composed. During the composition `A.compose(B)` the symbols from the map A are placed before those of the map B in a single concatenated symbol list. However, AffineApplyNormalizer was ordering the operands of the operation being normalized by iteratively appending the symbols into a single list accoridng to the operand order, regardless of whether these operands are symbols of the current operation or of the map that is being folded into it. This could lead to wrong order of symbols and, when the symbols were bound to constant values, to visibly incorrect folding of constants into affine maps as reported in PR45031. Make sure symbols operands to the current operation are always placed before symbols coming from the folded maps. Update the test that was exercising the incorrect folder behavior. For some reason, the order of symbol operands was swapped in the test input compared to the previous operations, making it easy to assume the correct maps were produced whereas they were swapping the symbols back due to the problem described above. Closes https://bugs.llvm.org/show_bug.cgi?id=45031 Differential Revision: https://reviews.llvm.org/D75247	2020-02-27 15:15:29 +01:00
Lei Zhang	5bc6ff6455	[mlir][spirv] Add some folders for spv.LogicalAnd/spv.LogicalOr This commit handles folding spv.LogicalAnd/spv.LogicalOr when one of the operands is constant true/false. Differential Revision: https://reviews.llvm.org/D75195	2020-02-26 15:13:37 -05:00
Alex Zinenko	305320b005	[mlir] NFC: move AffineOps tests from test/ to test/Dialect AffineOps dialect lives under lib/Dialect/AffineOps and so should its tests.	2020-02-25 14:20:40 +01:00
Stephan Herhut	7a7eacc797	[MLIR][GPU] Implement a simple greedy loop mapper. Summary: The mapper assigns annotations to loop.parallel operations that are compatible with the loop to gpu mapping pass. The outermost loop uses the grid dimensions, followed by block dimensions. All remaining loops are mapped to sequential loops. Differential Revision: https://reviews.llvm.org/D74963	2020-02-25 11:42:42 +01:00
Lei Zhang	8358ddbe5d	[mlir][spirv] NFC: Move test passes to test/lib Previously C++ test passes for SPIR-V were put under test/Dialect/SPIRV. Move them to test/lib/Dialect/SPIRV to create a better structure. Also fixed one of the test pass to use new PassRegistration mechanism. Differential Revision: https://reviews.llvm.org/D75066	2020-02-24 14:17:02 -05:00
Benjamin Kramer	bc1947a6f5	Add a basic tiling pass for parallel loops This exploits the fact that the iterations of parallel loops are independent so tiling becomes just an index transformation. This pass only tiles the innermost loop of a loop nest. The ultimate goal is to allow vectorization of the tiled loops, but I don't think we're there yet with the current rewriting, as the tiled loops don't have a constant trip count. Differential Revision: https://reviews.llvm.org/D74954	2020-02-24 11:44:40 +01:00
River Riddle	9eb436feaa	[mlir][DeclarativeParser] Add support for formatting the successors of an operation. This revision add support for formatting successor variables in a similar way to operands, attributes, etc. Differential Revision: https://reviews.llvm.org/D74789	2020-02-21 15:15:32 -08:00
River Riddle	b1de971ba8	[mlir][ODS] Add support for specifying the successors of an operation. This revision add support in ODS for specifying the successors of an operation. Successors are specified via the `successors` list: ``` let successors = (successor AnySuccessor:$target, AnySuccessor:$otherTarget); ``` Differential Revision: https://reviews.llvm.org/D74783	2020-02-21 15:15:32 -08:00
Hanhan Wang	29ad9d6b26	[mlir][spirv] Add lowering for load/store zero-rank memref from std to SPIR-V. Differential Revision: https://reviews.llvm.org/D74874	2020-02-21 14:41:12 -05:00
Nagy Mostafa	bc7b26c333	[MLIR] Allow Loop dialect IfOp and ForOp to define values This patch implements the RFCs proposed here: https://llvm.discourse.group/t/rfc-modify-ifop-in-loop-dialect-to-yield-values/463 https://llvm.discourse.group/t/rfc-adding-operands-and-results-to-loop-for/459/19. It introduces the following changes: - All Loop Ops region, except for ReduceOp, terminate with a YieldOp. - YieldOp can have variadice operands that is used to return values out of IfOp and ForOp regions. - Change IfOp and ForOp syntax and representation to define values. - Add unit-tests and update .td documentation. - YieldOp is a terminator to loop.for/if/parallel - YieldOp custom parser and printer Lowering is not supported at the moment, and will be in a follow-up PR. Thanks. Reviewed By: bondhugula, nicolasvasilache, rriddle Differential Revision: https://reviews.llvm.org/D74174	2020-02-21 10:05:32 -08:00
aartbik	ee260c1a0f	[mlir] [VectorOps] Multi-dim reductions for lowering vector.contract Summary: This implements the last step for lowering vector.contract progressively to LLVM IR (except for masks). Multi-dimensional reductions that remain after expanding all parallel dimensions are lowered into into simpler vector.contract operations until a trivial 1-dim reduction remains. Reviewers: nicolasvasilache, andydavis1 Reviewed By: andydavis1 Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74880	2020-02-20 14:16:50 -08:00
Hanhan Wang	28e0449ec6	[mlir][Linalg] Allow specifiying zero-rank shaped type operands to linalg.indexed_generic ops. Patch D74638 allows linalg.generic ops to use zero-rank shaped type operands, this also can be applied to linalg.indexed_generic ops.	2020-02-19 19:24:27 -05:00
aartbik	0ba9ee9f0e	[mlir] [VectorOps] Framework for progressive lowering of vector.contract Summary: Lowers all free/batch dimensions in a vector.contract progressively into simpler vector.contract operations until a direct vector.reduction operation is reached. Then lowers 1-D reductions into vector.reduce. Still TBD: multi-dimensional contractions that remain after removing all the parallel dims Reviewers: nicolasvasilache, andydavis1, rriddle Reviewed By: andydavis1 Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74797	2020-02-19 11:36:11 -08:00
River Riddle	6b6c96695c	[mlir][ODS] Add a new trait `TypesMatchWith` Summary: This trait takes three arguments: lhs, rhs, transformer. It verifies that the type of 'rhs' matches the type of 'lhs' when the given 'transformer' is applied to 'lhs'. This allows for adding constraints like: "the type of 'a' must match the element type of 'b'". A followup revision will add support in the declarative parser for using these equality constraints to port more c++ parsers to the declarative form. Differential Revision: https://reviews.llvm.org/D74647	2020-02-19 10:18:58 -08:00
Alexander Belyaev	284279ac23	[MLIR] Add naive fusion of parallel loops.	2020-02-19 14:51:09 +01:00
Tim Shen	b762bbd4c8	[MLIR] change NVVM.mma.sync to the most useful variant. Summary: the .row.col variant turns out to be the popular one, contrary to what I thought as .row.row. Since .row.col is so prevailing (as I inspect cuDNN's behavior), I'm going to remove the .row.row support here, which makes the patch a little bit easier. Reviewers: ftynse Subscribers: jholewinski, bixia, sanjoy.google, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74655	2020-02-18 17:57:04 -08:00
MaheshRavishankar	a8355b5c0f	[mlir][Linalg] Allow specifiying zero-rank shaped type operands to linalg.generic ops. Fixing a bug where using a zero-rank shaped type operand to linalg.generic ops hit an unrelated assert. This also meant that lowering the operation to loops was not supported. Adding roundtrip tests and lowering to loops test for zero-rank shaped type operand with fixes to make the test pass. Differential Revision: https://reviews.llvm.org/D74638	2020-02-18 13:23:28 -08:00
Pierre Oechsel	0acd7e02f2	[mlir] Linalg: Extend promotion to non f32 buffers. Summary: Linalg's promotion pass was only supporting f32 buffers due to how the zero value was build for the `fill` operation. Moreover, `promoteSubViewOperands` was returning a vector with one entry per float subview while omitting integer subviews. For a program with only integer subviews the return vector would be of size 0. However, `promoteSubViewsOperands` would try to access a non zero number of entries of this vector, resulting in a sefgault. Reviewers: nicolasvasilache, ftynse Reviewed By: ftynse Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74532	2020-02-17 15:56:49 +01:00
River Riddle	5756bc4382	[mlir][DeclarativeParser] Add support for formatting enum attributes in the string form. Summary: This revision adds support to the declarative parser for formatting enum attributes in the symbolized form. It uses this new functionality to port several of the SPIRV parsers over to the declarative form. Differential Revision: https://reviews.llvm.org/D74525	2020-02-13 17:11:48 -08:00
aartbik	b21c799952	[mlir] [VectorOps] Initial framework for progressively lowering vector.contract Summary: This sets the basic framework for lowering vector.contract progressively into simpler vector.contract operations until a direct vector.reduction operation is reached. More details will be filled out progressively as well. Reviewers: nicolasvasilache Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74520	2020-02-13 15:07:57 -08:00
Alex Zinenko	5ae9c4c868	[mlir] Linalg fusion: ignore indexed_generic producers They are currently not supported and we should not attempt fusing them.	2020-02-12 15:13:21 +01:00
Lei Zhang	d3e7816d85	[mlir][spirv] Introduce spv.func Thus far we have been using builtin func op to model SPIR-V functions. It was because builtin func op used to have special treatment in various parts of the core codebase (e.g., pass pipelines, etc.) and it's easy to bootstrap the development of the SPIR-V dialect. But nowadays with general op concepts and region support we don't have such limitations and it's time to tighten the SPIR-V dialect for completeness. This commits introduces a spv.func op to properly model SPIR-V functions. Compared to builtin func op, it can provide the following benefits: * We can control the full op so we can integrate SPIR-V information bits (e.g., function control) in a more integrated way and define our own assembly form and enforcing better verification. * We can have a better dialect and library boundary. At the current moment only functions are modelled with an external op. With this change, all ops modelling SPIR-V concpets will be spv.* ops and registered to the SPIR-V dialect. * We don't need to special-case func op anymore when creating ConversionTarget declaring SPIR-V dialect as legal. This is quite important given we'll see more and more conversions in the future. In the process, bumps a few FuncOp methods to the FunctionLike trait. Differential Revision: https://reviews.llvm.org/D74226	2020-02-12 07:46:43 -05:00
Mehdi Amini	c64770506b	Remove static registration for dialects, and the "alwayslink" hack for passes In the previous state, we were relying on forcing the linker to include all libraries in the final binary and the global initializer to self-register every piece of the system. This change help moving away from this model, and allow users to compose pieces more freely. The current change is only "fixing" the dialect registration and avoiding relying on "whole link" for the passes. The translation is still relying on the global registry, and some refactoring is needed to make this all more convenient. Differential Revision: https://reviews.llvm.org/D74461	2020-02-12 09:13:02 +00:00
Andy Davis	813bfffec3	[mlir][VectorOps] Adds canonicalization rewrite patterns for vector ShapeCastOp. Summary: Adds two rewrite patterns for the vector ShapeCastOp. ) ShapeCastOp decomposer: decomposes ShapeCastOp on tuple-of-vectors to multiple ShapeCastOps each on vector types. ) ShapeCastOp folder: folds canceling shape cast ops (e.g. shape_cast A -> B followed by shape_cast B -> A) away. Reviewers: nicolasvasilache, aartbik Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74327	2020-02-11 13:11:45 -08:00
aartbik	e83b7b99da	[mlir] [VectorOps] Implement vector.reduce operation Summary: This new operation operates on 1-D vectors and forms the bridge between vector.contract and llvm intrinsics for vector reductions. Reviewers: nicolasvasilache, andydavis1, ftynse Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74370	2020-02-11 11:31:59 -08:00
Hanhan Wang	4687822b9e	[mlir][Linalg] Add a roundtrip test for indexed_generic op with tensors. Summary: After D72555 has been landed, `linalg.indexed_generic` also accepts ranked tensor as input and output. Add a test for it. Differential Revision: https://reviews.llvm.org/D74267	2020-02-10 15:51:59 -05:00
Alex Zinenko	5a1778057f	[mlir] use unpacked memref descriptors at function boundaries The existing (default) calling convention for memrefs in standard-to-LLVM conversion was motivated by interfacing with LLVM IR produced from C sources. In particular, it passes a pointer to the memref descriptor structure when calling the function. Therefore, the descriptor is allocated on stack before the call. This convention leads to several problems. PR44644 indicates a problem with stack exhaustion when calling functions with memref-typed arguments in a loop. Allocating outside of the loop may lead to concurrent access problems in case the loop is parallel. When targeting GPUs, the contents of the stack-allocated memory for the descriptor (passed by pointer) needs to be explicitly copied to the device. Using an aggregate type makes it impossible to attach pointer-specific argument attributes pertaining to alignment and aliasing in the LLVM dialect. Change the default calling convention for memrefs in standard-to-LLVM conversion to transform a memref into a list of arguments, each of primitive type, that are comprised in the memref descriptor. This avoids stack allocation for ranked memrefs (and thus stack exhaustion and potential concurrent access problems) and simplifies the device function invocation on GPUs. Provide an option in the standard-to-LLVM conversion to generate auxiliary wrapper function with the same interface as the previous calling convention, compatible with LLVM IR porduced from C sources. These auxiliary functions pack the individual values into a descriptor structure or unpack it. They also handle descriptor stack allocation if necessary, serving as an allocation scope: the memory reserved by `alloca` will be freed on exiting the auxiliary function. The effect of this change on MLIR-generated only LLVM IR is minimal. When interfacing MLIR-generated LLVM IR with C-generated LLVM IR, the integration only needs to require auxiliary functions and change the function name to call the wrapper function instead of the original function. This also opens the door to forwarding aliasing and alignment information from memrefs to LLVM IR pointers in the standrd-to-LLVM conversion.	2020-02-10 15:03:43 +01:00
River Riddle	2f94ce0dcf	[mlir][DeclarativeParser] Move several missed parsers over to the declarative form. Differential Revision: https://reviews.llvm.org/D74283	2020-02-08 15:47:55 -08:00
natashaknk	9c1c825b72	[mlir][spirv] Adding sin op in the GLSL extension Differential Revision: https://reviews.llvm.org/D74151	2020-02-07 16:36:12 -05:00
Nicolas Vasilache	681f929f59	[mlir][VectorOps] Introduce a `vector.fma` op that works on n-D vectors and lowers to `llvm.intrin.fmuladd` Summary: The `vector.fma` operation is portable enough across targets that we do not want to keep it wrapped under `vector.outerproduct` and `llvm.intrin.fmuladd`. This revision lifts the op into the vector dialect and implements the lowering to LLVM by using two patterns: 1. a pattern that lowers from n-D to (n-1)-D by unrolling when n > 2 2. a pattern that converts from 1-D to the proper LLVM representation Reviewers: ftynse, stellaraccident, aartbik, dcaballe, jsetoain, tetuante Reviewed By: aartbik Subscribers: fhahn, dcaballe, merge_guards_bot, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74075	2020-02-07 15:44:53 -05:00
Nicolas Vasilache	499ad45877	[mlir][VectorOps] Expose and use llvm.intrin.fma* Summary: This revision exposes the portable `llvm.fma` intrinsic in LLVMOps and uses it in lieu of `llvm.fmuladd` when lowering the `vector.outerproduct` op to LLVM. This guarantees proper `fma` instructions will be emitted if the target ISA supports it. `llvm.fmuladd` does not have this guarantee in its semantics, despite evidence that the proper x86 instructions are emitted. For more details, see https://llvm.org/docs/LangRef.html#llvm-fmuladd-intrinsic. Reviewers: ftynse, aartbik, dcaballe, fhahn Reviewed By: aartbik Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74219	2020-02-07 15:38:40 -05:00
MaheshRavishankar	d06dd29e09	[mlir][Linalg] Implement fusion of linalg.generic operation on tensors. The initial implementation of the fusion operation exposes a method to fuse a consumer with its producer, when - both the producer and consumer operate on tensors - the producer has only a single result value - the producer has only "parallel" iterator types A new interface method hasTensorSemantics is added to verify that an operation has all operands and results of type RankedTensorType. Differential Revision: https://reviews.llvm.org/D74172	2020-02-07 10:36:53 -08:00
aartbik	6e2309d7fa	[mlir] [VectorOps] generalized vector.contract semantics Summary: Previously, vector.contract did not allow an empty set of free or batch dimensions (K = 0) which defines a basic reduction into a scalar (like a dot product). This CL relaxes that restriction. Also adds constraints on element type of operands and results. With tests. Reviewers: nicolasvasilache, andydavis1, rriddle Reviewed By: andydavis1 Subscribers: merge_guards_bot, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74014	2020-02-05 17:20:32 -08:00
Andy Davis	f9efce1dd5	[mlir][VectorOps] Support vector transfer_read/write unrolling for memrefs with vector element type. Summary: [mlir][VectorOps] Support vector transfer_read/write unrolling for memrefs with vector element type. When unrolling vector transfer read/write on memrefs with vector element type, the indices used to index the memref argument must be updated to reflect the unrolled operation. However, in the case of memrefs with vector element type, we need to be careful to only update the relevant memref indices. For example, a vector transfer read with the following source/result types, memref<6x2x1xvector<2x4xf32>>, vector<2x1x2x4xf32>, should only update memref indices 1 and 2 during unrolling. Reviewers: nicolasvasilache, aartbik Reviewed By: nicolasvasilache, aartbik Subscribers: lebedev.ri, Joonsoo, merge_guards_bot, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72965	2020-02-05 16:21:58 -08:00
Andy Davis	3ce8095c29	[mlir][VectorOps] Add ShapeCastOp to the vector ops dialect. Summary: Add ShapeCastOp to the vector ops dialect. The shape_cast operation casts between an n-D source vector shape and a k-D result vector shape (the element type remains the same). Reviewers: nicolasvasilache, aartbik Reviewed By: nicolasvasilache Subscribers: Joonsoo, merge_guards_bot, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73635	2020-02-05 15:45:12 -08:00
Lei Zhang	13b197c7d1	[mlir][spirv] Add dialect-specific attribute for target environment We were using normal dictionary attribute for target environment specification. It becomes cumbersome with more and more fields. This commit changes the modelling to a dialect-specific attribute, where we can have control over its storage and assembly form. Differential Revision: https://reviews.llvm.org/D73959	2020-02-04 21:33:13 -05:00
Stephen Neuendorffer	d7cbef2714	[MLIR] Fixes for shared library dependencies. Summary: This patch is a step towards enabling BUILD_SHARED_LIBS=on, which builds most libraries as DLLs instead of statically linked libraries. The main effect of this is that incremental build times are greatly reduced, since usually only one library need be relinked in response to isolated code changes. The bulk of this patch is fixing incorrect usage of cmake, where library dependencies are listed under add_dependencies rather than under target_link_libraries or under the LINK_LIBS tag. Correct usage should be like this: add_dependencies(MLIRfoo MLIRfooIncGen) target_link_libraries(MLIRfoo MLIRlib1 MLIRlib2) A separate issue is that in cmake, dependencies between static libraries are automatically included in dependencies. In the above example, if MLIBlib1 depends on MLIRlib2, then it is sufficient to have only MLIRlib1 in the target_link_libraries. When compiling with shared libraries, it is necessary to have both MLIRlib1 and MLIRlib2 specified if MLIRfoo uses symbols from both. Reviewers: mravishankar, antiagainst, nicolasvasilache, vchuravy, inouehrs, mehdi_amini, jdoerfert Reviewed By: nicolasvasilache, mehdi_amini Subscribers: Joonsoo, merge_guards_bot, jholewinski, mgorny, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, csigg, arpith-jacob, mgester, lucyrfox, herhut, aartbik, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73653	2020-02-04 08:56:37 -08:00
Alexander Belyaev	baecae838d	[Linalg] Add tiling of Linalg to parallel loops. Differential Revision: https://reviews.llvm.org/D73955	2020-02-04 14:51:19 +01:00
Lei Zhang	399887c9e4	[mlir][spirv] Add resource limits into target environment This commit adds two resource limits, max_compute_workgroup_size and max_compute_workgroup_invocations as resource limits to the target environment. They are not used at the current moment, but they will affect the SPIR-V CodeGen. Adding for now to have a proper target environment modelling. Differential Revision: https://reviews.llvm.org/D73905	2020-02-04 08:35:19 -05:00
River Riddle	7ef37a5f99	[mlir] Initial support for type constraints in the declarative assembly format Summary: This revision add support for accepting a few type constraints, e.g. AllTypesMatch, when inferring types for operands and results. This is used to remove the c++ parsers for several additional operations. Differential Revision: https://reviews.llvm.org/D73735	2020-02-03 21:55:09 -08:00
Alexander Belyaev	eda6b2e2b3	[MLIR][Linalg] Allow fusion of more than 2 linalg ops. LinalgDependenceGraph was not updated after successful producer-consumer fusion for linalg ops. In this patch it is fixed by reconstructing LinalgDependenceGraph on every iteration. This is very ineffective and should be improved by updating LDGraph only when it is necessary.	2020-02-03 21:00:23 +01:00
Alexander Belyaev	3dcc1fc61b	[MLIR][Linalg] Lower linalg.generic to ploops. Differential Revision: https://reviews.llvm.org/D73684	2020-02-03 11:52:23 +01:00
Stephan Herhut	283b5e733d	[MLIR] Make gpu.launch implicitly capture uses of values defined above. Summary: In the original design, gpu.launch required explicit capture of uses and passing them as operands to the gpu.launch operation. This was motivated by infrastructure restrictions rather than design. This change lifts the requirement and removes the concept of kernel arguments from gpu.launch. Instead, the kernel outlining transformation now does the explicit capturing. This is a breaking change for users of gpu.launch. Differential Revision: https://reviews.llvm.org/D73769	2020-02-03 10:08:48 +01:00
Nicolas Vasilache	dc1d43cfa0	[mlir][Linalg] NFC - Cleanup and split input file for roundtrip.mlir	2020-01-31 22:01:56 -05:00
Alex Zinenko	9dfcddfaae	[mlir] Linalg tiling: generate code avoding out-of-bounds accesses Summary: After the `subview` operation was migrated from Linalg to Standard, it changed semantics and does not guarantee the absence of out-of-bounds accesses through the created view anymore. Compute the size of the subview to make sure it always fits within the view (subviews in last iterations of the loops may be smaller than those in other iterations). Differential Revision: https://reviews.llvm.org/D73614	2020-01-31 19:43:47 +01:00
River Riddle	389b126210	[mlir][NFC] Update several SPIRV operations to use declarative parsers. Differential Revision: https://reviews.llvm.org/D73504	2020-01-30 11:43:41 -08:00
River Riddle	528adb2e48	[mlir][NFC] Use declarative format for several operations in LLVM and Linalg dialects Differential Revision: https://reviews.llvm.org/D73503	2020-01-30 11:43:41 -08:00
River Riddle	82170d5619	[mlir] Update various operations to declaratively specify their assembly format. Summary: This revision switches over many operations to use the declarative methods for defining the assembly specification. This updates operations in the NVVM, ROCDL, Standard, and VectorOps dialects. Differential Revision: https://reviews.llvm.org/D73407	2020-01-30 11:43:40 -08:00
Denis Khalikov	4801522432	[mlir][spirv] Add GroupNonUniform min and max operations. Add GroupNonUniform atihtmetic operations: FMax, FMin, SMax, SMin, UMax, UMin. Differential Revision: https://reviews.llvm.org/D73563	2020-01-30 10:25:15 -05:00
Alexander Belyaev	9109cccb4f	[Linalg] Format Linalg/fusion.mlir. Differential Revision: https://reviews.llvm.org/D73689	2020-01-30 14:17:52 +01:00
Shraiysh Vaishay	d242aa245c	[MLIR] Added llvm.invoke and llvm.landingpad Summary: I have tried to implement `llvm.invoke` and `llvm.landingpad`. # `llvm.invoke` is similar to `llvm.call` with two successors added, the first one is the normal label and the second one is unwind label. # `llvm.launchpad` takes a variable number of args with either `catch` or `filter` associated with them. Catch clauses are not array types and filter clauses are array types. This is same as the criteria used by LLVM (`4f82af81a0/llvm/include/llvm/IR/Instructions.h (L2866)`) Examples: LLVM IR ``` define i32 @caller(i32 %a) personality i8* bitcast (i32 (...)* @__gxx_personality_v0 to i8) { invoke i32 @foo(i32 2) to label %success unwind label %fail success: ret i32 2 fail: landingpad {i8, i32} catch i8 @_ZTIi catch i8 null catch i8* bitcast (i8** @_ZTIi to i8) filter [1 x i8] [ i8 1 ] ret i32 3 } ``` MLIR LLVM Dialect ``` llvm.func @caller(%arg0: !llvm.i32) -> !llvm.i32 { %0 = llvm.mlir.constant(3 : i32) : !llvm.i32 %1 = llvm.mlir.constant("\01") : !llvm<"[1 x i8]"> %2 = llvm.mlir.addressof @_ZTIi : !llvm<"i8"> %3 = llvm.bitcast %2 : !llvm<"i8"> to !llvm<"i8"> %4 = llvm.mlir.null : !llvm<"i8"> %5 = llvm.mlir.addressof @_ZTIi : !llvm<"i8"> %6 = llvm.mlir.constant(2 : i32) : !llvm.i32 %7 = llvm.invoke @foo(%6) to ^bb1 unwind ^bb2 : (!llvm.i32) -> !llvm.i32 ^bb1: // pred: ^bb0 llvm.return %6 : !llvm.i32 ^bb2: // pred: ^bb0 %8 = llvm.landingpad (catch %5 : !llvm<"i8">) (catch %4 : !llvm<"i8">) (catch %3 : !llvm<"i8">) (filter %1 : !llvm<"[1 x i8]">) : !llvm<"{ i8, i32 }"> llvm.return %0 : !llvm.i32 } ``` Signed-off-by: Shraiysh Vaishay <cs17btech11050@iith.ac.in> Differential Revision: https://reviews.llvm.org/D72006	2020-01-30 12:55:28 +01:00
Stephan Herhut	2692751895	Add 'gpu.terminator' operation. Summary: The 'gpu.terminator' operation is used as the terminator for the regions of gpu.launch. This is to disambugaute them from the return operation on 'gpu.func' functions. This is a breaking change and users of the gpu dialect will need to adapt their code when producting 'gpu.launch' operations. Reviewers: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, csigg, arpith-jacob, mgester, lucyrfox, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73620	2020-01-30 12:41:41 +01:00
Nicolas Vasilache	ea1e3369f7	[mlir][Linalg] Introduce folding patterns to remove certain MemRefCastOp Summary: Canonicalization and folding patterns in StandardOps may interfere with the needs of Linalg. This revision introduces specific foldings for dynamic memrefs that can be proven to be static. Very concretely: Determines whether it is possible to fold it away in the parent Linalg op: ```mlir %1 = memref_cast %0 : memref<8x16xf32> to memref<?x?xf32> %2 = linalg.slice %1 ... : memref<?x?xf32> ... // or %1 = memref_cast %0 : memref<8x16xf32, affine_map<(i, j)->(16 * i + j)>> to memref<?x?xf32> linalg.generic(%1 ...) : memref<?x?xf32> ... ``` into ```mlir %2 = linalg.slice %0 ... : memref<8x16xf32> ... // or linalg.generic(%0 ... : memref<8x16xf32, affine_map<(i, j)->(16 * i + j)>> ``` Reviewers: ftynse, aartbik, jsetoain, tetuante, asaadaldien Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73565	2020-01-29 09:52:51 -05:00
David Truby	63c8972562	[MLIR] Add OpenMP dialect with barrier operation Summary: Barrier is a simple operation that takes no arguments and returns nothing, but implies a side effect (synchronization of all threads) Reviewers: jdoerfert Subscribers: mgorny, guansong, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72400	2020-01-29 11:34:58 +00:00
Ahmed Taei	16e82d855a	[mlir] Add primitive transform pattern to rewrite linalg.fill into vector.broadcast form. Summary: This diff adds a transformation patter to rewrite linalg.fill as broadcasting a scaler into a vector. It uses the same preconditioning as matmul (memory is contiguous). Reviewers: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73391	2020-01-28 11:21:56 -08:00
Denis Khalikov	731b140a52	[mlir][spirv] Add GroupNonUniform arithmetic operations. Add GroupNonUniform arithmetic operations: FAdd, FMul, IMul. Unify parser, printer, verifier for GroupNonUniform arithmetic operations. Differential Revision: https://reviews.llvm.org/D73491	2020-01-28 10:21:56 -05:00
Lei Zhang	60d541e1b9	[mlir][spirv] Relax verification to allow flexible placement Thus far certain SPIR-V ops have been required to be in spv.module. While this provides strong verification to catch unexpected errors, it's quite rigid and makes progressive lowering difficult. Sometimes we would like to partially lower ops from other dialects, which may involve creating ops like global variables that should be placed in other module-like ops. So this commit relaxes the requirement of such SPIR-V ops' scope to module-like ops. Similarly for function- like ops. Differential Revision: https://reviews.llvm.org/D73415	2020-01-26 10:39:45 -05:00
Lei Zhang	ae21e37eb4	[mlir][spirv] Add spv.GroupNonUniformElect and spv.GroupNonUniformIAdd Differential Revision: https://reviews.llvm.org/D73349	2020-01-26 10:20:40 -05:00
aartbik	303fddeeab	[mlir] [VectorOps] Rewriting of vector.extract/insert_slices to other vector ops Summary: Rewrites the extract/insert_slices operation in terms of strided_slice/insert_strided_slice ops with intermediate tuple uses (that should get optimimized away with typical usage). This is done in a separate "pass" to enable testing this particular rewriting in isolation. Reviewers: nicolasvasilache, andydavis1, ftynse Reviewed By: nicolasvasilache Subscribers: merge_guards_bot, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73295	2020-01-24 16:24:45 -08:00
aartbik	ed8222b2ca	[mlir] [VectorOps] Implement vector tuple get folding Summary: Rewrites get-i tup<a1,...,an> into ai Reviewers: nicolasvasilache, rriddle, andydavis1 Reviewed By: nicolasvasilache, rriddle, andydavis1 Subscribers: merge_guards_bot, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73213	2020-01-23 14:15:27 -08:00
Tres Popp	9a52ea5cf9	Create a gpu.module operation for the GPU Dialect. Summary: This is based on the use of code constantly checking for an attribute on a model and instead represents the distinct operaion with a different op. Instead, this op can be used to provide better filtering. Reverts "Revert "[mlir] Create a gpu.module operation for the GPU Dialect."" This reverts commit ac446302ca4145cdc89f377c0c364c29ee303be5 after fixing internal Google issues. This additionally updates ROCDL lowering to use the new gpu.module. Reviewers: herhut, mravishankar, antiagainst, nicolasvasilache Subscribers: jholewinski, mgorny, mehdi_amini, jpienaar, burmako, shauheen, csigg, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, llvm-commits, mravishankar, rriddle, antiagainst, bkramer Tags: #llvm Differential Revision: https://reviews.llvm.org/D72921	2020-01-21 14:05:03 +01:00
Frank Laub	fffea2842d	[MLIR] LLVM Dialect: add llvm.cmpxchg and improve llvm.atomicrmw custom parser Summary: Add a `llvm.cmpxchg` op as a counterpart to LLVM IR's `cmpxchg` instruction. Note that the `weak`, `volatile`, and `syncscope` attributes are not yet supported. This will be useful for upcoming parallel versions of affine.for and generally for reduction-like semantics (especially for reductions that can't make use of `atomicrmw`, e.g. `fmax`). Reviewers: ftynse, nicolasvasilache Reviewed By: ftynse Subscribers: merge_guards_bot, jfb, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72995	2020-01-21 01:09:42 -08:00
Mehdi Amini	fdb9cc7dc5	Fix printer for llvm.addressof symbol name that need escaping Differential Revision: https://reviews.llvm.org/D73065	2020-01-20 22:09:18 +00:00
Jacques Pienaar	b70e4efb75	[mlir] Generalize broadcastable trait operands Summary: Generalize broadcastable trait to variadic operands. Update the documentation that still talked about element type as part of broadcastable trait (that bug was already fixed). Also rename Broadcastable to ResultBroadcastableShape to be more explicit that the trait affects the result shape (it is possible for op to allow broadcastable operands but not have result shape that is broadcast compatible with operands). Doing some intermediate work to have getBroadcastedType take an optional elementType as input and use that if specified, instead of the common element type of type1 and type2 in this function. Differential Revision: https://reviews.llvm.org/D72559	2020-01-20 13:02:14 -08:00
Christian Sigg	8b2eb7c494	[mlir] Add in-dialect lowering of gpu.all_reduce. Reviewers: ftynse, nicolasvasilache, herhut Reviewed By: ftynse, herhut Subscribers: liufengdb, aartbik, herhut, merge_guards_bot, mgorny, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72129	2020-01-20 13:43:43 +01:00
Kazuaki Ishizaki	fc817b09e2	[mlir] NFC: Fix trivial typos in comments Differential Revision: https://reviews.llvm.org/D73012	2020-01-20 03:17:03 +00:00
Frank Laub	ee2de95507	[MLIR] LLVM dialect: modernize and cleanups Summary: Modernize some of the existing custom parsing code in the LLVM dialect. While this reduces some boilerplate code, it also reduces the precision of the diagnostic error messges. Reviewers: ftynse, nicolasvasilache, rriddle Reviewed By: rriddle Subscribers: merge_guards_bot, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72967	2020-01-17 17:11:50 -08:00
Nicolas Vasilache	64c4dcb5ee	[mlir][Linalg] Extend linalg vectorization to MatmulOp Summary: This is a simple extension to allow vectorization to work not only on GenericLinalgOp but more generally across named ops too. For now, this still only vectorizes matmul-like ops but is a step towards more generic vectorization of Linalg ops. Reviewers: ftynse Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72942	2020-01-17 17:09:47 -05:00
Frank Laub	60a0c612df	[MLIR] LLVM dialect: Add llvm.atomicrmw Summary: This op is the counterpart to LLVM's atomicrmw instruction. Note that volatile and syncscope attributes are not yet supported. This will be useful for upcoming parallel versions of `affine.for` and generally for reduction-like semantics. Differential Revision: https://reviews.llvm.org/D72741	2020-01-17 21:17:14 +01:00
Eric Schweitz	37e2560d3d	[Flang][mlir] add a band-aid to support the creation of mutually recursive types when lowering to LLVM IR Summary: This is a temporary implementation to support Flang. The LLVM-IR parser will need to be extended in some way to support recursive types. The exact approach here is still a work-in-progress. Unfortunately, this won't pass roundtrip testing yet. Adding a comment to the test file as a reminder. Differential Revision: https://reviews.llvm.org/D72542	2020-01-17 21:17:06 +01:00
Lei Zhang	267483ac70	[mlir][spirv] Support implied extensions and capabilities In SPIR-V, when a new version is introduced, it is possible some existing extensions will be incorporated into it so that it becomes implicitly declared if targeting the new version. This affects conversion target specification because we need to take this into account when allowing what extensions to use. For a capability, it may also implies some other capabilities, for example, the `Shader` capability implies `Matrix` the capability. This should also be taken into consideration when preparing the conversion target: when we specify an capability is allowed, all its recursively implied capabilities are also allowed. This commit adds utility functions to query implied extensions for a given version and implied capabilities for a given capability and updated SPIRVConversionTarget to use them. This commit also fixes a bug in availability spec. When a symbol (op or enum case) can be enabled by an extension, we should drop it's minimal version requirement. Being enabled by an extension naturally means the symbol can be used by any SPIR-V version as long as the extension is supported. The grammar still encodes the 'version' field for such cases, but it should be interpreted as a different way: rather than meaning a minimal version requirement, it says the symbol becomes core at that specific version. Differential Revision: https://reviews.llvm.org/D72765	2020-01-17 08:01:57 -05:00
Lei Zhang	961174f878	[mlir][spirv] Fix SPV_MM_Vulkan extension reqirements SPV_MM_Vulkan can be enabled by the SPV_KHR_vulkan_memory_model extension. Differential Revision: https://reviews.llvm.org/D72764	2020-01-16 21:32:01 -05:00
Benjamin Kramer	0133cc60e4	Revert "[mlir] Create a gpu.module operation for the GPU Dialect." This reverts commit `4624a1e8ac`. Causing problems downstream.	2020-01-15 17:52:17 +01:00
Lei Zhang	47c6ab2b97	[mlir][spirv] Properly support SPIR-V conversion target This commit defines a new SPIR-V dialect attribute for specifying a SPIR-V target environment. It is a dictionary attribute containing the SPIR-V version, supported extension list, and allowed capability list. A SPIRVConversionTarget subclass is created to take in the target environment and sets proper dynmaically legal ops by querying the op availability interface of SPIR-V ops to make sure they are available in the specified target environment. All existing conversions targeting SPIR-V is changed to use this SPIRVConversionTarget. It probes whether the input IR has a `spv.target_env` attribute, otherwise, it uses the default target environment: SPIR-V 1.0 with Shader capability and no extra extensions. Differential Revision: https://reviews.llvm.org/D72256	2020-01-14 19:18:42 -05:00
Nicolas Vasilache	f52d71736b	[mlir][Linalg] Update the semantics, verifier and test for Linalg with tensors. Summary: This diff fixes issues with the semantics of linalg.generic on tensors that appeared when converting directly from HLO to linalg.generic. The changes are self-contained within MLIR and can be captured and tested independently of XLA. The linalg.generic and indexed_generic are updated to: To allow progressive lowering from the value world (a.k.a tensor values) to the buffer world (a.k.a memref values), a linalg.generic op accepts mixing input and output ranked tensor values with input and output memrefs. ``` %1 = linalg.generic #trait_attribute %A, %B {other-attributes} : tensor<?x?xf32>, memref<?x?xf32, stride_specification> -> (tensor<?x?xf32>) ``` In this case, the number of outputs (args_out) must match the sum of (1) the number of output buffer operands and (2) the number of tensor return values. The semantics is that the linalg.indexed_generic op produces (i.e. allocates and fills) its return values. Tensor values must be legalized by a buffer allocation pass before most transformations can be applied. Such legalization moves tensor return values into output buffer operands and updates the region argument accordingly. Transformations that create control-flow around linalg.indexed_generic operations are not expected to mix with tensors because SSA values do not escape naturally. Still, transformations and rewrites that take advantage of tensor SSA values are expected to be useful and will be added in the near future. Subscribers: bmahjour, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72555	2020-01-14 17:25:28 -05:00
Tres Popp	4624a1e8ac	[mlir] Create a gpu.module operation for the GPU Dialect. Summary: This is based on the use of code constantly checking for an attribute on a model and instead represents the distinct operaion with a different op. Instead, this op can be used to provide better filtering. Reviewers: herhut, mravishankar, antiagainst, rriddle Reviewed By: herhut, antiagainst, rriddle Subscribers: liufengdb, aartbik, jholewinski, mgorny, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, csigg, arpith-jacob, mgester, lucyrfox, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72336	2020-01-14 12:05:47 +01:00
Adrian Kuegel	018b042593	[mlir] Add loop.parallel, loop.reduce and loop.reduce.return operations. Summary: These operations can be used to specify a loop nest with a body that can contain reductions. The iteration space can be iterated in any order. RFC: https://groups.google.com/a/tensorflow.org/d/topic/mlir/pwtSgiKFPis/discussion Differential Revision: https://reviews.llvm.org/D72394	2020-01-14 11:35:41 +01:00
River Riddle	4268e4f4b8	[mlir] Change the syntax of AffineMapAttr and IntegerSetAttr to avoid conflicts with function types. Summary: The current syntax for AffineMapAttr and IntegerSetAttr conflict with function types, making it currently impossible to round-trip function types(and e.g. FuncOp) in the IR. This revision changes the syntax for the attributes by wrapping them in a keyword. AffineMapAttr is wrapped with `affine_map<>` and IntegerSetAttr is wrapped with `affine_set<>`. Reviewed By: nicolasvasilache, ftynse Differential Revision: https://reviews.llvm.org/D72429	2020-01-13 13:24:39 -08:00
Alex Zinenko	08778d8c4f	[mlir][GPU] introduce utilities for promotion to workgroup memory Introduce a set of function that promote a memref argument of a `gpu.func` to workgroup memory using memory attribution. The promotion boils down to additional loops performing the copy from the original argument to the attributed memory in the beginning of the function, and back at the end of the function using all available threads. The loop bounds are specified so as to adapt to any size of the workgroup. These utilities are intended to compose with other existing utilities (loop coalescing and tiling) in cases where the distribution of work across threads is uneven, e.g. copying a 2D memref with only the threads along the "x" dimension. Similarly, specialization of the kernel to specific launch sizes should be implemented as a separate pass combining constant propagation and canonicalization. Introduce a simple attribute-driven pass to test the promotion transformation since we don't have a heuristic at the moment. Differential revision: https://reviews.llvm.org/D71904	2020-01-09 10:06:00 +01:00
Nicolas Vasilache	766ce87e9b	[mlir][Linalg] Lower linalg.reshape to LLVM for the static case Summary: This diff adds lowering of the linalg.reshape op to LLVM. A new descriptor is created with fields initialized as follows: 1. allocatedPTr, alignedPtr and offset are copied from the source descriptor 2. sizes are copied from the static destination shape 3. strides are copied from the static strides collected with `getStridesAndOffset` Only the static case in which the target view conforms to strided memref semantics is supported. Other cases are left for future work and will be added on a per-need basis. Reviewers: ftynse, mravishankar Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72316	2020-01-08 13:07:41 -05:00
Denis Khalikov	dd495e8a87	[mlir][spirv] Add lowering for std cmp ops. Differential Revision: https://reviews.llvm.org/D72296	2020-01-07 21:51:51 -05:00
Denis Khalikov	9883b14cd1	[mlir][spirv] Add lowering for standard bit ops Differential Revision: https://reviews.llvm.org/D72205	2020-01-07 21:45:54 -05:00
Nicolas Vasilache	e3750cafdb	[mlir][Linalg] Add a linalg.reshape op Summary: This diff adds a new operation to linalg to allow reshaping of an existing view into a new view in the same buffer at the same offset. More specifically: The `linalg.reshape` op produces a new view whose sizes are a reassociation of the original `view`. Depending on whether or not the reassociated MemRefType is contiguous, the resulting memref may require explicit alloc and copies. A reassociation is defined as a continous grouping of dimensions and is represented with a affine map array attribute. In the future, non-continous groupings may be allowed (i.e. permutations, reindexings etc). For now, it is assumed that either: 1. a reassociation produces and consumes contiguous MemRefType or, 2. the reshape op will be folded into its consumers (by changing the shape of the computations). All other cases are undefined behavior and a reshape op may not lower to LLVM if it cannot be proven statically that it does not require alloc+copy. A reshape may either collapse or expand dimensions, depending on the relationship between source and target memref ranks. The verification rule is that the reassociation maps are applied to the memref with the larger rank to obtain the memref with the smaller rank. In the case of a dimension expansion, the reassociation maps can be interpreted as inverse maps. Examples: ```mlir // Dimension collapse (i, j) -> i' and k -> k' %1 = linalg.reshape %0 [(i, j, k) -> (i, j), (i, j, k) -> (k)] : memref<?x?x?xf32, stride_spec> into memref<?x?xf32, stride_spec_2> ``` ```mlir // Dimension expansion i -> (i', j') and (k) -> (k') %1 = linalg.reshape %0 [(i, j, k) -> (i, j), (i, j, k) -> (k)] : memref<?x?xf32, stride_spec> into memref<?x?x?xf32, stride_spec_2> ``` The relevant invalid and roundtripping tests are added. Reviewers: AlexEichenberger, ftynse, rriddle, asaadaldien, yangjunpro Subscribers: kiszk, merge_guards_bot, mehdi_amini, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72168	2020-01-06 22:21:19 -05:00
Ahmed Taei	14ee51581a	[mlir][linalg] Lower linalg to affine loops Reviewers: nicolasvasilache Reviewed By: nicolasvasilache Subscribers: mgester, lucyrfox, merge_guards_bot, AlexEichenberger, mravishankar, ftynse, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72094	2020-01-03 13:21:10 -05:00
Lei Zhang	b3d2867769	[mlir][spirv] Fix shader ABI attribute prefix and add verification This commit fixes shader ABI attributes to use `spv.` as the prefix so that they match the dialect's namespace. This enables us to add verification hooks in the SPIR-V dialect to verify them. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D72062	2020-01-03 07:44:27 -05:00
Lei Zhang	98856b22cd	[mlir][spirv] Update SPIR-V enums and ops with availability spec This commit updates gen_spirv_dialect.py to query the grammar and generate availability spec for various enum attribute definitions and all defined ops. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D72095	2020-01-02 14:09:02 -05:00

... 3 4 5 6 7 ...

683 Commits