llvm-project

Commit Graph

Author	SHA1	Message	Date
Thomas Raoux	0712eac766	[mlir][spirv] Enable composite instructions for cooperative matrix type. Enable inset/extract/construct composite ops as well as access chain for cooperative matrix. ConstantComposite requires more change and will be done in a separate patch. Also fix the getNumElements function for coopMatrix per feedback from Jeff Bolz. The number of element is implementation dependent so it cannot be known at compile time. Differential Revision: https://reviews.llvm.org/D80321	2020-05-21 12:19:55 -07:00
Thomas Raoux	15389cdc5b	[mlir][spirv] Add remaining cooperative matrix instructions Adds support for cooperative matrix support for arithmetic and cast instructions. It also adds cooperative matrix store, muladd and matrixlength instructions which are part of the extension. Differential Revision: https://reviews.llvm.org/D80181	2020-05-21 11:55:33 -07:00
jerryyin	9c53ac08de	[mlir][rocdl] Exposing buffer load/store intrinsic Summary: * Updated ROCDLOps tablegen * Added parsing and printing function for new intrinsic * Added unit tests Reviewers: ftynse Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80233	2020-05-21 14:14:35 +00:00
MaheshRavishankar	071358e082	[mlir][Linalg] Add producer-consumer fusion when producer is a ConstantOp and Consumer is a GenericOp. Differential Revision: https://reviews.llvm.org/D79838	2020-05-20 09:16:19 -07:00
Nicolas Vasilache	004a3d4f56	[mlir][Linalg] Refactor linalg tiling Summary: This revision refactors the Linalg tiling pass to be written as pattern applications and retires the use of the folder in Linalg tiling. In the early days, tiling was written as a pass that would create (partially) folded and canonicalized operations on the fly for better composability. As this evolves towards composition of patterns, the pass-specific folder is counter-productive and is retired. The tiling options struct evolves to take a tile size creation function which allows materializing tile sizes on the fly (in particular constant tile sizes). This plays better with folding and DCE. With the folder going away in Tiling, the check on whether subviews are the same in linalg fusion needs to be more robust. This revision also implements such a check. In the current form, there are still some canonicalizations missing due to AffineMin/Max ops fed by scf::ForOp. These will be improved at a later time. Differential Revision: https://reviews.llvm.org/D80267	2020-05-20 09:39:56 -04:00
Tres Popp	02035580d3	[mlir] Add custom assembly formats to shape.witness ops. The assembly formats are essentially the generic forms without quotations and type information. Differential Revision: https://reviews.llvm.org/D80180	2020-05-20 13:25:33 +02:00
Tres Popp	fb6986ef69	[mlir] Custom printing/parsing for Shape::AssumingOp Summary: Additionally, this adds traits and builder methods to AssumingYieldOp and names the input witness to the AssumingOp. Differential Revision: https://reviews.llvm.org/D80187	2020-05-20 10:39:26 +02:00
Thomas Raoux	b359bbaa8b	[mlir][spirv] First step to support spirv cooperative matrix extension. Add a new type to SPIRV dialect for cooperative matrix and add new op for cooperative matrix load. This is missing most instructions to support cooperative matrix extension but this is a stop-gap patch to avoid creating big review. Differential Revision: https://reviews.llvm.org/D80043	2020-05-19 19:29:41 -07:00
Sean Silva	21b0eff773	[mlir][shape] Add `shape.from_extents`. Summary: This is a basic op needed for creating shapes from SSA values representing the extents. Differential Revision: https://reviews.llvm.org/D79833	2020-05-19 14:26:08 -07:00
George	e984b7f2a2	Added a TanOp to SPIR-V dialect GLSL ops Implemented tangent op from SPIR-V's GLSL extended instruction set. Added a round-trip and serialization/deserialization tests for the op. Differential Revision: https://reviews.llvm.org/D80152	2020-05-19 09:15:29 -04:00
Kiran Kumar T P	fa8fc9ffcc	[MLIR, OpenMP] Support for flush operation, and translating the same to LLVM IR Summary: This patch adds support for flush operation in OpenMP dialect and translation of this construct to LLVM IR. The OpenMP IRBuilder is used for this translation. The patch includes code changes and testcase modifications. Reviewed By: ftynse, kiranchandramohan Differential Revision: https://reviews.llvm.org/D79937	2020-05-19 17:01:25 +05:30
Pierre Oechsel	d1866f8947	[MLIR] [Linalg] Add option to use the partial view after promotion. For now the promoted buffer is indexed using the `full view`. The full view might be slightly bigger than the partial view (which is accounting for boundaries). Unfortunately this does not compose easily with other transformations when multiple buffers with shapes related to each other are involved. Take `linalg.matmul A B C` (with A of size MxK, B of size KxN and C of size MxN) and suppose we are: - Tiling over M by 100 - Promoting A only This is producing a `linalg.matmul promoted_A B subview_C` where `promoted_A` is a promoted buffer of `A` of size (100xK) and `subview_C` is a subview of size mxK where m could be smaller than 100 due to boundaries thus leading to a possible incorrect behavior. We propose to: - Add a new parameter to the tiling promotion allowing to enable the use of the full tile buffer. - By default all promoted buffers will be indexed by the partial view. Note that this could be considered as a breaking change in comparison to the way the tiling promotion was working. Differential Revision: https://reviews.llvm.org/D79927	2020-05-18 18:28:18 +02:00
Nicolas Vasilache	1870e787af	[mlir][Vector] Add an optional "masked" boolean array attribute to vector transfer operations Summary: Vector transfer ops semantic is extended to allow specifying a per-dimension `masked` attribute. When the attribute is false on a particular dimension, lowering to LLVM emits unmasked load and store operations. Differential Revision: https://reviews.llvm.org/D80098	2020-05-18 11:52:08 -04:00
Nicolas Vasilache	36cdc17f8c	[mlir][Vector] Make minor identity permutation map optional in transfer op printing and parsing Summary: This revision makes the use of vector transfer operatons more idiomatic by allowing to omit and inferring the permutation_map. Differential Revision: https://reviews.llvm.org/D80092	2020-05-18 11:41:27 -04:00
Denis Khalikov	0dc91bfd11	[mlir][spirv] Handle debuginfo for control flow ops. Summary: Handle debuginfo for control flow operations: spv.Selection, spv.Loop, spv.BranchOp, spv.BranchConditional. Differential Revision: https://reviews.llvm.org/D79931	2020-05-16 15:36:14 +03:00
aartbik	b1c688dbae	[mlir] [VectorOps] Implement vector.create_mask lowering to LLVM IR Summary: First, compact implementation of lowering to LLVM IR. A bit more challenging than the constant mask due to the dynamic indices, of course. I like to hear if there are more efficient ways of doing this in LLVM, but this for now at least gives us a functional reference implementation. Reviewers: nicolasvasilache, ftynse, bkramer, reidtatge, andydavis1, mehdi_amini Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79954	2020-05-15 11:02:30 -07:00
Tres Popp	a26883e5aa	[MLIR] Add shape.witness type and ops Summary: These represent shape based preconditions on execution of code. Differential Revision: https://reviews.llvm.org/D79717	2020-05-15 14:33:54 +02:00
Nicolas Vasilache	f1b972041a	[mlir][Linalg] Start a LinalgToStandard pass and move conversion to library calls. This revision starts decoupling the include the kitchen sink behavior of Linalg to LLVM lowering by inserting a -convert-linalg-to-std pass. The lowering of linalg ops to function calls was previously lowering to memref descriptors by having both linalg -> std and std -> LLVM patterns in the same rewrite. When separating this step, a new issue occurred: the layout is automatically type-erased by this process. This revision therefore introduces memref casts to perform these type erasures explicitly. To connect everything end-to-end, the LLVM lowering of MemRefCastOp is relaxed because it is artificially more restricted than the op semantics. The op semantics already guarantee that source and target MemRefTypes are cast-compatible. An invalid lowering test now becomes valid and is removed. Differential Revision: https://reviews.llvm.org/D79468	2020-05-15 00:24:03 -04:00
Diego Caballero	bc5565f9ea	[mlir][Affine] Introduce affine.vector_load and affine.vector_store This patch adds `affine.vector_load` and `affine.vector_store` ops to the Affine dialect and lowers them to `vector.transfer_read` and `vector.transfer_write`, respectively, in the Vector dialect. Reviewed By: bondhugula, nicolasvasilache Differential Revision: https://reviews.llvm.org/D79658	2020-05-14 13:17:58 -07:00
Alex Zinenko	60f443bb3b	[mlir] Change dialect namespace loop->scf All ops of the SCF dialect now use the `scf.` prefix instead of `loop.`. This is a part of dialect renaming. Differential Revision: https://reviews.llvm.org/D79844	2020-05-13 19:20:21 +02:00
Nicolas Vasilache	e0b99a5de4	[mlir] Add SubViewOp::getOrCreateRanges and fix folding pattern The existing implementation of SubViewOp::getRanges relies on all offsets/sizes/strides to be dynamic values and does not work in combination with canonicalization. This revision adds a SubViewOp::getOrCreateRanges to create the missing constants in the canonicalized case. This allows reactivating the fused pass with staged pattern applications. However another issue surfaces that the SubViewOp verifier is now too strict to allow folding. The existing folding pattern is turned into a canonicalization pattern which rewrites memref_cast + subview into subview + memref_cast. The transform-patterns-matmul-to-vector can then be reactivated. Differential Revision: https://reviews.llvm.org/D79759	2020-05-13 10:11:30 -04:00
MaheshRavishankar	49e6c19100	[mlir][StandardToLLVM] Add SinOp to LLVM dialect and lowering of std.sin to this op. Differential Revision: https://reviews.llvm.org/D79505	2020-05-12 23:15:25 -07:00
MaheshRavishankar	5440d0a12d	[mlir][Linalg] Add folders and canonicalizers for linalg.reshape/linalg.tensor_reshape operations. Differential Revision: https://reviews.llvm.org/D79765	2020-05-12 23:03:26 -07:00
MaheshRavishankar	d2a9569850	[mlir][Linalg] Allow reshapes to collapse to a zero-rank tensor. This is only valid if the source tensors (result tensor) is static shaped with all unit-extents when the reshape is collapsing (expanding) dimensions. Differential Revision: https://reviews.llvm.org/D79764	2020-05-12 23:03:25 -07:00
aartbik	fb2c4d50f1	[mlir] [VectorOps] Implement vector.constant_mask lowering to LLVM IR Summary: Makes this operation runnable on CPU by generating MLIR instructions that are eventually folded into an LLVM IR constant for the mask. Reviewers: nicolasvasilache, ftynse, reidtatge, bkramer, andydavis1 Reviewed By: nicolasvasilache, ftynse, andydavis1 Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79815	2020-05-12 19:44:23 -07:00
Nicolas Vasilache	63c0e72b2f	[mlir] Revisit std.subview handling of static information. The main objective of this revision is to change the way static information is represented, propagated and canonicalized in the SubViewOp. In the current implementation the issue is that canonicalization may strictly lose information because static offsets are combined in irrecoverable ways into the result type, in order to fit the strided memref representation. The core semantics of the op do not change but the parser and printer do: the op always requires `rank` offsets, sizes and strides. These quantities can now be either SSA values or static integer attributes. The result type is automatically deduced from the static information and more powerful canonicalizations (as powerful as the representation with sentinel `?` values allows). Previously static information was inferred on a best-effort basis from looking at the source and destination type. Relevant tests are rewritten to use the idiomatic `offset: x, strides : [...]`-form. Bugs are corrected along the way that were not trivially visible in flattened strided memref form. Lowering to LLVM is updated, simplified and now supports all cases. A mixed static-dynamic mode test that wouldn't previously lower is added. It is an open question, and a longer discussion, whether a better result type representation would be a nicer alternative. For now, the subview op carries the required semantic. Differential Revision: https://reviews.llvm.org/D79662	2020-05-12 20:04:44 -04:00
Sean Silva	452e2fc409	Revert of Revert of [mlir][shape] Tidy up shape.shape_of Summary: - Mark it NoSideEffect - Add custom parser/printer This reverts the temporary revert in https://reviews.llvm.org/rG84a9c725742d26df04808a3c7349dbd98684c6cb That was a false alarm. A downstream test actually needed to be updated.	2020-05-12 15:51:24 -07:00
Sam McCall	691e826995	Revert "[mlir] Revisit std.subview handling of static information." This reverts commit `80d133b24f`. Per Stephan Herhut: The canonicalizer pattern that was added creates forms of the subview op that cannot be lowered. This is shown by failing Tensorflow XLA tests such as: tensorflow/compiler/xla/service/mlir_gpu/tests:abs.hlo.test Will provide more details offline, they rely on logs from private CI.	2020-05-12 15:18:50 +02:00
Eric Christopher	84a9c72574	Temporarily Revert "[mlir][shape] Tidy up shape.shape_of" as it's breaking a few tests. This reverts commit `b604544886`. Followed up offline with a testcase.	2020-05-11 23:05:18 -07:00
Nicolas Vasilache	80d133b24f	[mlir] Revisit std.subview handling of static information. Summary: The main objective of this revision is to change the way static information is represented, propagated and canonicalized in the SubViewOp. In the current implementation the issue is that canonicalization may strictly lose information because static offsets are combined in irrecoverable ways into the result type, in order to fit the strided memref representation. The core semantics of the op do not change but the parser and printer do: the op always requires `rank` offsets, sizes and strides. These quantities can now be either SSA values or static integer attributes. The result type is automatically deduced from the static information and more powerful canonicalizations (as powerful as the representation with sentinel `?` values allows). Previously static information was inferred on a best-effort basis from looking at the source and destination type. Relevant tests are rewritten to use the idiomatic `offset: x, strides : [...]`-form. Bugs are corrected along the way that were not trivially visible in flattened strided memref form. It is an open question, and a longer discussion, whether a better result type representation would be a nicer alternative. For now, the subview op carries the required semantic. Reviewers: ftynse, mravishankar, antiagainst, rriddle!, andydavis1, timshen, asaadaldien, stellaraccident Reviewed By: mravishankar Subscribers: aartbik, bondhugula, mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, bader, grosul1, frgossen, Kayjukh, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79662	2020-05-11 17:44:24 -04:00
Reid Tatge	334a4159ec	[mlir][Vector] NFC - Rename vector.strided_slice into vector.extract_strided_slice Differential Revision: https://reviews.llvm.org/D79734	2020-05-11 14:21:10 -07:00
Nicolas Vasilache	d12d05a731	[mlir][Linalg] Introduce a helper function for staged pattern application Summary: This revision introduces a helper function to allow applying rewrite patterns, interleaved with more global transformations, in a staged fashion: 1. the first stage consists of an OwningRewritePatternList. The RewritePattern in this list are applied once and in order. 2. the second stage consists of a single OwningRewritePattern that is applied greedily until convergence. 3. the third stage consists of applying a lambda, generally used for non-local transformation effects. This allows creating custom fused transformations where patterns can be ordered and applied at a finer granularity than a sequence of traditional compiler passes. A test that exercises these behaviors is added. Differential Revision: https://reviews.llvm.org/D79518	2020-05-11 16:46:30 -04:00
Sean Silva	b604544886	[mlir][shape] Tidy up shape.shape_of Summary: - Mark it NoSideEffect - Add custom parser/printer Differential Revision: https://reviews.llvm.org/D79684	2020-05-11 12:55:31 -07:00
Nicolas Vasilache	6ed61a26c2	[mlir] Simplify and better document std.view semantics This [discussion](https://llvm.discourse.group/t/viewop-isnt-expressive-enough/991/2) raised some concerns with ViewOp. In particular, the handling of offsets is incorrect and does not match the op description. Note that with an elemental type change, offsets cannot be part of the type in general because sizeof(srcType) != sizeof(dstType). Howerver, offset is a poorly chosen term for this purpose and is renamed to byte_shift. Additionally, for all intended purposes, trying to support non-identity layouts for this op does not bring expressive power but rather increases code complexity. This revision simplifies the existing semantics and implementation. This simplification effort is voluntarily restrictive and acts as a stepping stone towards supporting richer semantics: treat the non-common cases as YAGNI for now and reevaluate based on concrete use cases once a round of simplification occurred. Differential revision: https://reviews.llvm.org/D79541	2020-05-11 12:29:23 -04:00
rtayl	58cb88733f	[mlir][rocdl] Add xdlops intrinsics to rocdl dialect Summary: This adds xdlops (mfma) to the rocdl dialect and also tests the translation to llvm ir. Reviewers: ftynse Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, llvm-commits Tags: #llvm #mlir Differential Revision: https://reviews.llvm.org/D79642	2020-05-11 10:08:58 -04:00
Alex Zinenko	c25b20c0f6	[mlir] NFC: Rename LoopOps dialect to SCF (Structured Control Flow) This dialect contains various structured control flow operaitons, not only loops, reflect this in the name. Drop the Ops suffix for consistency with other dialects. Note that this only moves the files and changes the C++ namespace from 'loop' to 'scf'. The visible IR prefix remains the same and will be updated separately. The conversions will also be updated separately. Differential Revision: https://reviews.llvm.org/D79578	2020-05-11 15:04:27 +02:00
Denis Khalikov	b5973d20b3	[mlir][spirv] Handle debuginfo for variables. Summary: Handle debuginfo for spv.Variable and spv.globalVariable during (de)serialization. Differential Revision: https://reviews.llvm.org/D79566	2020-05-08 18:10:18 +03:00
Alexander Belyaev	a6b2877f4c	[MLIR] Make ParallelLoopFusion pass scan through all nested regions. Differential Revision: https://reviews.llvm.org/D79558	2020-05-07 13:47:30 +02:00
Alex Zinenko	54c927b988	[mlir] Add a test exercising partial constant folding of affine min/max This functionality was introduced in `a87db48e6f` but only only tested indirectly though Linalg tests. Add direct tests.	2020-05-07 12:42:03 +02:00
Alex Zinenko	a87db48e6f	[mlir] Support partial folding of affine.min/max Originally, these operations were folded only if all expressions in their affine maps could be folded to a constant expression that can be then subject to numeric min/max computation. This introduces a more advanced version that partially folds the affine map by lifting individual constant expression in it even if some of the expressions remain variable. The folding can update the operation in place to use a simpler map. Note that this is not as powerful as canonicalization, in particular this does not remove dimensions or symbols that became useless. This allows for better composition of Linalg tiling and promotion transformation, where the latter can handle some canonical forms of affine.min that the folding can now produce. Differential Revision: https://reviews.llvm.org/D79502	2020-05-07 12:30:04 +02:00
Uday Bondhugula	57d361bd2f	[MLIR][NFC] Rename op trait PolyhedralScope -> AffineScope Rename op trait PolyhedralScope -> AffineScope for consistency. Differential Revision: https://reviews.llvm.org/D79503	2020-05-07 00:19:56 +05:30
Andy Davis	93d1108801	[MLIR][LoopOps] Adds the loop unroll transformation for loop::ForOp. Summary: Adds the loop unroll transformation for loop::ForOp. Adds support for promoting the body of single-iteration loop::ForOps into its containing block. Adds check tests for loop::ForOps with dynamic and static lower/upper bounds and step. Care was taken to share code (where possible) with the AffineForOp unroll transformation to ease maintenance and potential future transition to a LoopLike construct on which loop transformations for different loop types can implemented. Reviewers: ftynse, nicolasvasilache Reviewed By: ftynse Subscribers: bondhugula, mgorny, zzheng, mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, grosul1, frgossen, Kayjukh, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79184	2020-05-05 10:42:36 -07:00
Ehsan Toosi	6ccaf73887	[MLIR][LINALG] Convert Linalg on Tensors to Buffers This is a basic pass to convert Linalg.GenericOp which works on tensors to use buffers instead. Differential Revision: https://reviews.llvm.org/D78996	2020-05-05 15:48:07 +02:00
Alexander Belyaev	b79751e83d	[MLIR] Add conversion from AtomicRMWOp -> GenericAtomicRMWOp. Adding this pattern reduces code duplication. There is no need to have a custom implementation for lowering to llvm.cmpxchg. Differential Revision: https://reviews.llvm.org/D78753	2020-05-05 10:32:13 +02:00
River Riddle	469c02d058	[mlir] Add support for merging identical blocks during canonicalization This revision adds support for merging identical blocks, or those with the same operations that branch to the same successors. Operands that mismatch between the different blocks are replaced with new block arguments added to the merged block. Differential Revision: https://reviews.llvm.org/D79134	2020-05-04 19:56:46 -07:00
Nicolas Vasilache	307cfdf533	[mlir][Linalg] Mostly NFC - Refactor Linalg patterns and transformations. Linalg transformations are currently exposed as DRRs. Unfortunately RewriterGen does not play well with the line of work on named linalg ops which require variadic operands and results. Additionally, DRR is arguably not the right abstraction to expose compositions of such patterns that don't rely on SSA use-def semantics. This revision abandons DRRs and exposes manually written C++ patterns. Refactorings and cleanups are performed to uniformize APIs. This refactoring will allow replacing the currently manually specified Linalg named ops. A collateral victim of this refactoring is the `tileAndFuse` DRR, and the one associated test, which will be revived at a later time. Lastly, the following 2 tests do not add value and are altered: - a dot_perm tile + interchange test does not test anything new and is removed - a dot tile + lower to loops does not need 2-D tiling and is trimmed.	2020-05-04 11:17:37 -04:00
Wen-Heng (Jack) Chung	bc23c1d85e	[mlir][rocdl] add rocdl.barier op. - Add rocdl.barrier op. - Lower gpu.barier to rocdl.barrier in -convert-gpu-to-rocdl. Differential Revision: https://reviews.llvm.org/D79126	2020-05-04 10:35:01 +02:00
River Riddle	cb9ae0025c	[mlir] Add a new context flag for disabling/enabling multi-threading This is useful for several reasons: * In some situations the user can guarantee that thread-safety isn't necessary and don't want to pay the cost of synchronization, e.g., when parsing a very large module. * For things like logging threading is not desirable as the output is not guaranteed to be in stable order. This flag also subsumes the pass manager flag for multi-threading. Differential Revision: https://reviews.llvm.org/D79266	2020-05-02 12:32:25 -07:00
Denis Khalikov	29b955f97c	[mlir][spirv] Handle debug information during (de)serialization. Summary: This is an initial version, currently supports OpString and OpLine for autogenerated operations during (de)serialization. Differential Revision: https://reviews.llvm.org/D79091	2020-05-01 14:11:54 +03:00
Nicolas Vasilache	0d61dcf606	[mlir][EDSC] Make use of InsertGuard Summary: This revision cleans up a layer of complexity in ScopedContext and uses InsertGuard instead of previously manual bookkeeping. The method `getBuilder` is renamed to `getBuilderRef` and spurious copies of OpBuilder are tracked. This results in some canonicalizations not happening anymore in the Linalg matmul to vector test. This test is retired because relying on DRRs for this has been shaky at best. The solution will be better support to write fused passes in C++ with more idiomatic pattern composition and application. Differential Revision: https://reviews.llvm.org/D79208	2020-04-30 18:04:31 -04:00
Nicolas Vasilache	3bdd7fcc34	[mlir][Linalg] Add support to lower named ops to loops. This revision adds support to allow named ops to lower to loops. Linalg.batch_matmul successfully lowers to loops and to LLVM. In the process, this test also activates linalg to affine loops. However padded convolutions to not lower to affine.load atm so this revision overrides the type of underlying load / store operation. Differential Revision: https://reviews.llvm.org/D79135	2020-04-30 13:45:17 -04:00
Lei Zhang	87e07b4c64	[mlir] Use memory effect to detecting allocation This commit marks AllocLikeOp as MemAlloc in StandardOps. Also in Linalg dependency analysis use memory effect to detect allocation. This allows the dependency analysis to be more general and recognize other allocation-like operations. Differential Revision: https://reviews.llvm.org/D78705	2020-04-30 09:20:53 -04:00
Uday Bondhugula	480345381a	[MLIR] Introduce op trait PolyhedralScope (revised) (A previous version of this, `dd2c639c3c`, was reverted.) Introduce op trait PolyhedralScope for ops to define a new scope for polyhedral optimization / affine dialect purposes, thus generalizing such scopes beyond FuncOp. Ops to which this trait is attached will define a new scope for the consideration of SSA values as valid symbols for the purposes of polyhedral analysis and optimization. Update methods that check for dim/symbol validity to work based on this trait. Differential Revision: https://reviews.llvm.org/D79060	2020-04-29 16:08:23 +05:30
Tres Popp	f66c87637a	[MLIR] Give AffineStoreOp and AffineLoadOp Memory SideEffects. Summary: This change results in tests also being changed to prevent dead affine.load operations from being folded away during rewrites. Also move AffineStoreOp and AffineLoadOp to an ODS file. Differential Revision: https://reviews.llvm.org/D78930	2020-04-28 15:45:25 +02:00
Dmitri Gribenko	ef06016d73	Revert "[MLIR] Introduce op trait PolyhedralScope" This reverts commit `dd2c639c3c`. It broke a few things -- the explanation will be posted to the review thread.	2020-04-28 14:50:57 +02:00
Uday Bondhugula	dd2c639c3c	[MLIR] Introduce op trait PolyhedralScope Introduce op trait `PolyhedralScope` for ops to define a new scope for polyhedral optimization / affine dialect purposes, thus generalizing such scopes beyond FuncOp. Ops to which this trait is attached will define a new scope for the consideration of SSA values as valid symbols for the purposes of polyhedral analysis and optimization. Update methods that check for dim/symbol validity to work based on this trait. Differential Revision: https://reviews.llvm.org/D78863	2020-04-28 09:55:31 +05:30
Tres Popp	2d2d696137	[MLIR] Propagate input side effect information Summary: Previously operations like std.load created methods for obtaining their effects but did not inherit from the SideEffect interfaces when their parameters were decorated with the information. The resulting situation was that passes had no information on the SideEffects of std.load/store and had to treat them more cautiously. This adds the inheritance information when creating the methods. As a side effect, many tests are modified, as they were using std.load for testing and this oepration would be folded away as part of pattern rewriting. Tests are modified to use store or to reutn the result of the std.load. Reviewers: mravishankar, antiagainst, nicolasvasilache, herhut, aartbik, ftynse! Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, csigg, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, bader, grosul1, frgossen, Kayjukh, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78802	2020-04-27 11:35:52 +02:00
Sean Silva	5fff169daa	[shape] More constant folding - shape split_at - shape.broadcast - shape.concat - shape.to_extent_tensor Differential Revision: https://reviews.llvm.org/D78821	2020-04-24 16:10:19 -07:00
Sean Silva	d1ad267a56	[shape] Basic constant folding. - Implement a first constant fold for shape.shape_of (more ops coming in subsequent patches) - Implement the right builder interfaces for ShapeType and other types - Splits shape.constant into shape.const_size and shape.const_shape which plays better with dyn_cast and building vs one polymorphic op. Also, fix the RUN line in ops.mlir to properly verify round-tripping.	2020-04-24 15:49:35 -07:00
Frederik Gossen	7e4b139a04	[MLIR] Ensure `gpu.func` must be inside a `gpu.module`. Ensure that `gpu.func` is only used within the dedicated `gpu.module`. Implement the constraint to the GPU dialect and adopt test cases. Differential Revision: https://reviews.llvm.org/D78541	2020-04-24 07:17:48 +00:00
MaheshRavishankar	542668d1e2	[mlir][Linalg] Add support for fusing linalg.tensor_reshape with linalg.generic operations. Differential Revision: https://reviews.llvm.org/D78464	2020-04-23 13:41:47 -07:00
River Riddle	7f85adb54d	[mlir][Standard] Allow select to use an i1 for vector and tensor values It currently requires that the condition match the shape of the selected value, but this is only really useful for things like masks. This revision allows for the use of i1 to mean that all of the vector/tensor is selected. This also matches the behavior of LLVM select. A benefit of this change is that transformations that want to generate selects, like those on the CFG, don't have to special case vector/tensor. Previously the only way to generate a select from an i1 was to use a splat, but that doesn't support dynamically shaped/unranked tensors. Differential Revision: https://reviews.llvm.org/D78690	2020-04-23 04:50:09 -07:00
River Riddle	2fafe7ff59	[mlir][Standard] Add support for canonicalizing branches to passthrough blocks This revision adds support for canonicalizing the following: ``` br ^bb1 ^bb1 br ^bbN(...) br ^bbN(...) ``` Differential Revision: https://reviews.llvm.org/D78683	2020-04-23 04:42:02 -07:00
River Riddle	af331bc52d	[mlir][Standard] Add a canonicalization to simplify cond_br when the successors are identical This revision adds support for canonicalizing the following: ``` cond_br %cond, ^bb1(A, ..., N), ^bb1(A, ..., N) br ^bb1(A, ..., N) ``` If the operands to the successor are different and the cond_br is the only predecessor, we emit selects for the branch operands. ``` cond_br %cond, ^bb1(A), ^bb1(B) %select = select %cond, A, B br ^bb1(%select) ``` Differential Revision: https://reviews.llvm.org/D78682	2020-04-23 04:42:02 -07:00
River Riddle	2f4b303d68	[mlir][Standard] Add canonicalization for collapsing pass through cond_br successors. This revision adds support for the following canonicalization: ``` cond_br %cond, ^bb1, ^bb2 ^bb1 br ^bbN(...) ^bb2 br ^bbK(...) cond_br %cond, ^bbN(...), ^bbK(...) ``` Differential Revision: https://reviews.llvm.org/D78681	2020-04-23 04:42:01 -07:00
Frederik Gossen	0372db05bb	[MLIR] Use nested symbol to identify kernel in `LaunchFuncOp`. Summary: Use a nested symbol to identify the kernel to be invoked by a `LaunchFuncOp` in the GPU dialect. This replaces the two attributes that were used to identify the kernel module and the kernel within seperately. Differential Revision: https://reviews.llvm.org/D78551	2020-04-22 07:44:29 +00:00
Frederik Gossen	648fc95083	[MLIR] Use `kernel` as a short hand for `gpu.kernel` attribute. Summary: Use the shortcu `kernel` for the `gpu.kernel` attribute of `gpu.func`. The parser supports this and test cases are easier to read. Differential Revision: https://reviews.llvm.org/D78542	2020-04-22 07:38:30 +00:00
Frederik Gossen	2813802746	[MLIR] Fix test case for kernel attribute. Summary: Fix a broken test case in the `invalid.mlir` lit test case. `expect` was missing its `e`. Differential Revision: https://reviews.llvm.org/D78540	2020-04-22 07:27:39 +00:00
Pierre Oechsel	128d72751f	[mlir] [linalg] Specify alignment during promotion. The buffer allocated by a promotion can be subject to other transformations afterward. For example it could be vectorized, in which case it is needed to ensure that this buffer is memory-aligned. Differential Revision: https://reviews.llvm.org/D78556	2020-04-21 18:13:40 +02:00
Nicolas Vasilache	538ac26f25	[mlir][Linalg] Create a named batch_matmul op and pipe it through. This revision is the first in a set of improvements that aim at allowing more generalized named Linalg op generation from a mathematical specification. This revision allows creating a new op and checks that the parser, printer and verifier are hooked up properly. This opened up a few design points that will be addressed in the future: 1. A named linalg op has a static region builder instead of an explicitly parsed region. This is not currently compatible with assemblyFormat so a custom parser / printer are needed. 2. The convention for structured ops and tensor return values needs to evolve to allow tensor-land and buffer land specifications to agree 3. ReferenceIndexingMaps and referenceIterators will need to become static to allow building attributes at parse time. 4. Error messages will be improved once we have 3. and we pretty print in custom form. Differential Revision: https://reviews.llvm.org/D78327	2020-04-21 12:09:46 -04:00
Pierre Oechsel	49202476e6	[mlir] [linalg] Fix transform-patterns test. Unfortunately FileCheck ignores directives with whitespace between the directive and the colon (`CHECK :` for example), thus most of the directives of this test were ignored. Differential Revision: https://reviews.llvm.org/D78548	2020-04-21 12:53:45 +02:00
Pierre Oechsel	12dcb89dad	[mlir] [linalg] Only promote selected buffers. The promotion transformation is promoting all input and output buffers of the transformed op. The user might want to only promote some of these buffers. Differential Revision: https://reviews.llvm.org/D78498	2020-04-21 11:50:08 +02:00
Uday Bondhugula	2d231c703d	[MLIR] NFC clean up loop tiling test cases Summary: Improve the checks and drop redundant ones. Rename the outlined maps better. Reviewers: andydavis1, nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, grosul1, frgossen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78514	2020-04-21 01:13:46 +05:30
Uday Bondhugula	3dff8c9109	[MLIR] Fix affine loop tiling utility upper bound bug Fix intra-tile upper bound setting in a scenario where the tile size was larger than the trip count. Differential Revision: https://reviews.llvm.org/D78505	2020-04-21 00:54:01 +05:30
Lei Zhang	f83d502feb	[mlir][spirv] NFC: remove unnecessary `%N =` in test CHECKs	2020-04-20 11:52:29 -04:00
aartbik	186709c6e0	[mlir] [VectorOps] Progressive lowering of vector.broadcast Summary: Rather than having a full, recursive, lowering of vector.broadcast to LLVM IR, it is much more elegant to have a progressive lowering of each vector.broadcast into a lower dimensional vector.broadcast, until only elementary vector operations remain. This results in more elegant, step-wise code, that is easier to understand. Also makes some optimizations in the generated code. Reviewers: nicolasvasilache, mehdi_amini, andydavis1, grosul1 Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, grosul1, frgossen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78071	2020-04-16 21:02:27 -07:00
Nicolas Vasilache	f54312277c	[mlir][Linalg] Drop function attribute from generic ops. The function attribute in generic ops is not paying for itself. A region is the more standardized way of specifying a custom computation. If needed this region can call a function directly. This is deemed more natural than managing a dedicated function attribute. This also simplifies named ops generation by trimming unnecessary complexity. Differential Revision: https://reviews.llvm.org/D78266	2020-04-16 09:47:08 -04:00
Alexander Belyaev	be9c3bdc44	[MLIR] Fix fusion of linalg.indexed_generic producer into tiled (Indexed)GenericOp. Differential Revision: https://reviews.llvm.org/D78209	2020-04-16 10:45:17 +02:00
Jeremy Bruestle	9f3ab92ec8	[MLIR] Improve support for 0-dimensional Affine Maps. Summary: Modified AffineMap::get to remove support for the overload which allowed an ArrayRef of AffineExpr but no context (and gathered the context from a presumed first entry, resulting in bugs when there were 0 results). Instead, we support only a ArrayRef and a context, and a version which takes a single AffineExpr. Additionally, removed some now needless case logic which previously special cased which call to AffineMap::get to use. Reviewers: flaub, bondhugula, rriddle!, nicolasvasilache, ftynse, ulysseB, mravishankar, antiagainst, aartbik Subscribers: mehdi_amini, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, bader, grosul1, frgossen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78226	2020-04-15 14:15:02 -07:00
Uday Bondhugula	af5e83f569	[MLIR] Introduce utility to hoist affine if/else conditions This revision introduces a utility to unswitch affine.for/parallel loops by hoisting affine.if operations past surrounding affine.for/parallel. The hoisting works for both perfect/imperfect nests and in the presence of else blocks. The hoisting is currently to as outermost a level as possible. Uses a test pass to test the utility. Add convenience method Operation::getParentWithTrait<Trait>. Depends on D77487. Differential Revision: https://reviews.llvm.org/D77870	2020-04-16 00:32:34 +05:30
Alex Zinenko	129cf84e69	[mlir] LLVM dialect: support globals without linkage keyword, assuming 'external' Similarly to actual LLVM IR, and to `llvm.mlir.func`, allow the custom syntax of `llvm.mlir.global` to omit the linkage keyword. If omitted, the linkage is assumed to be external. This makes the modeling of globals in the LLVM dialect more consistent, both within the dialect and with LLVM IR. Differential Revision: https://reviews.llvm.org/D78096	2020-04-15 10:58:32 +02:00
Uday Bondhugula	04b5274ede	[MLIR] Introduce applyOpPatternsAndFold for op local rewrites Introduce mlir::applyOpPatternsAndFold which applies patterns as well as any folding only on a specified op (in contrast to applyPatternsAndFoldGreedily which applies patterns only on the regions of an op isolated from above). The caller is made aware of the op being folded away or erased. Depends on D77485. Differential Revision: https://reviews.llvm.org/D77487	2020-04-15 14:10:01 +05:30
MaheshRavishankar	37b520763f	[mlir][Linalg] Handle null affine map returns from inversePermutation. The inversePermutation method returns a null map on failure. Update uses of this method within Linalg to handle this. In LinalgToLoops the null return value was used to emit scalar code. Modify that to return failure, and emit scalar implementation when affine map is "empty", i.e. 1 dims, 0 symbols and no result exprs. Differential Revision: https://reviews.llvm.org/D77964	2020-04-14 14:41:20 -07:00
MaheshRavishankar	3b2f26ab05	[mlir][Linalg] NFC : Fix check for scalar case handling in LinalgToLoops The invertPermutation method does not return a nullptr anymore, but rather returns an empty map for the scalar case. Update the check in LinalgToLoops to reflect this. Also add test case for generating scalar code.	2020-04-13 13:23:01 -07:00
MaheshRavishankar	03391df90e	[mlir][Linalg] Add loop.parallel lowering for all Linalg Ops. The outer parallel loops of a linalg operation is lowered to loop.parallel, with the other loops lowered to loop.for. This gets the lowering to loop.parallel on par with the loop.for lowering. In future the reduction loop could also be lowered to loop.parallel. Also add a utility function that returns the loops that are created. Differential Revision: https://reviews.llvm.org/D77678	2020-04-13 13:19:12 -07:00
Denis Khalikov	ec99d6e62f	[mlir][spirv] Add a `spirv::InterfaceVarABIAttr`. Summary: Add a proper dialect-specific attribute for interface variable ABI. Differential Revision: https://reviews.llvm.org/D77941	2020-04-13 22:47:47 +03:00
Lei Zhang	a9cb529a84	[mlir][spirv] NFC: use Optional to replace SPV_Optional Differential Revision: https://reviews.llvm.org/D78046	2020-04-13 15:44:06 -04:00
Uday Bondhugula	500e03881b	[MLIR] NFC clean up simplify-affine-structures test case NFC clean up for simplify-affine-structures test cases. Rename sets better; avoid suffix numbers; move outlined definitions close to use. This is in preparation for other functionality updates. Differential Revision: https://reviews.llvm.org/D78017	2020-04-13 23:59:21 +05:30
Lei Zhang	a290c3af9d	[mlir][spirv] Improve stride support in array types This commit added stride support in runtime array types. It also adjusted the assembly form for the stride from `[N]` to `stride=N`. This makes the IR more readable, especially for the cases where one mix array types and struct types. Differential Revision: https://reviews.llvm.org/D78034	2020-04-13 14:08:17 -04:00
Kiran Kumar T P	7ecee63e71	[MLIR] Support for taskwait and taskyield operations, and translating the same to LLVM IR This patch adds support for taskwait and taskyield operations in OpenMP dialect and translation of the these constructs to LLVM IR. The OpenMP IRBuilder is used for this translation. The patch includes code changes and a testcase modifications. Differential Revision: https://reviews.llvm.org/D77634	2020-04-10 07:42:34 +00:00
Nicolas Vasilache	2d32ee0d7a	[mlir][Vector] Update lowering of vector ops to llvm intrinsics to use row-major. Summary: LLVM matrix intrinsics recently introduced an option to support row-major mode. This matches the MLIR vector model, this revision switches to row-major. A corner case related to degenerate sizes was also fixed upstream. This revision removes the guard against this corner case. A bug was uncovered on the output vector construction which this revision also fixes. Lastly, this has been tested on a small size and benchmarked independently: no visible performance regression is observed. In the future, when matrix intrinsics support per op attribute, we can more aggressively translate to that and avoid inserting MLIR-level transposes. This has been tested independently to work on small matrices. Differential Revision: https://reviews.llvm.org/D77761	2020-04-09 16:37:28 -04:00
Nicolas Vasilache	6fb6a4d7f9	[mlir][Linalg] Add a test for a fused Linalg pass based on DRR to go from matmul to vectors This revision builds a simple "fused pass" consisting of 2 levels of tiling, memory promotion and vectorization using linalg transformations written as composable pattern rewrites.	2020-04-08 16:54:40 -04:00
River Riddle	400ad6f95d	[mlir] Eliminate the remaining usages of cl::opt instead of PassOption. Summary: Pass options are a better choice for various reasons and avoid the need for static constructors. Differential Revision: https://reviews.llvm.org/D77707	2020-04-08 13:05:08 -07:00
Uday Bondhugula	ae8ebeca51	[MLIR] Fix test case typos in unroll.mlir Fix check line typos in test/Dialect/Affine/unroll.mlir. Differential Revision: https://reviews.llvm.org/D77738	2020-04-08 22:21:07 +05:30
Andy Davis	7006daa548	[MLIR][Vector] Update ShapeCastOp folder to use producer-consumer value forwarding. Summary: Update ShapeCastOp folder to use producer-consumer value forwarding. Support is added for tracking sub-vectors through trivial shape cast operations, where the sub-vector shape is preserved across shape cast operations and only leading ones are added or removed. Support is preserved for cancelling shape cast operations. One unit test is added and two are updated. Reviewers: aartbik, nicolasvasilache Reviewed By: aartbik, nicolasvasilache Subscribers: frgossen, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, grosul1, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77253	2020-04-08 08:55:37 -07:00
Nicolas Vasilache	3cb1f35df2	[mlir][Linalg] Use subview instead of linalg.slice in Promotion.cpp This revision removes the reliance of Promotion on `linalg.slice` which is meant for the rank-reducing case. Differential Revision: https://reviews.llvm.org/D77676	2020-04-07 23:52:31 -04:00
Stella Laurenzo	f5deb0878d	Remove FxpMathOps dialect and Quantizer tool. Summary: * Removal of FxpMathOps was discussed on the mailing list. * Will send a courtesy note about also removing the Quantizer (which had some dependencies on FxpMathOps). * These were only ever used for experimental purposes and we know how to get them back from history as needed. * There is a new proposal for more generalized quantization tooling, so moving these older experiments out of the way helps clean things up. Subscribers: mgorny, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, grosul1, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77479	2020-04-07 13:22:39 -07:00
Lei Zhang	47b234944d	[mlir][spirv] Fix wrong Phi parent block for back-to-back loops If we have two back-to-back loops with block arguments, the OpPhi instructions generated for the second loop's block arguments should have use the merge block of the first SPIR-V loop structure as their incoming parent block. Differential Revision: https://reviews.llvm.org/D77543	2020-04-07 12:54:54 -04:00
Uday Bondhugula	70da33bf30	[MLIR] fix/update affine data copy utility for max/min bounds Fix point-wise copy generation to work with bounds that have max/min. Change structure of copy loop nest to use absolute loop indices and subtracting base from the indexes of the fast buffers. Update supporting utilities: Fix FlatAffineConstraints::getLowerAndUpperBound to look at equalities as well and for a missing division. Update unionBoundingBox to not discard common constraints (leads to a tighter system). Update MemRefRegion::getConstantBoundingSizeAndShape to add memref dimension constraints. Run removeTrivialRedundancy at the end of MemRefRegion::compute. Run single iteration loop promotion and load/store canonicalization after affine data copy (in its test pass as well). Differential Revision: https://reviews.llvm.org/D77320	2020-04-07 13:55:42 +05:30
Nicolas Vasilache	8f229989d5	[mlir][Linalg] Add a linalg.tensor_reshape to operate on tensors Summary: This revision adds a tensor_reshape operation that operates on tensors. In the tensor world the constraints are less stringent and we can allow more arbitrary dynamic reshapes, as long as they are contractions. The expansion of a dynamic dimension into multiple dynamic dimensions is under-specified and is punted on for now. Differential Revision: https://reviews.llvm.org/D77360	2020-04-06 11:19:17 -04:00
River Riddle	0359b86d8b	[mlir][ODS] Add support for variadic regions. Summary: This revision adds support for marking the last region as variadic in the ODS region list with the VariadicRegion directive. Differential Revision: https://reviews.llvm.org/D77455	2020-04-05 01:03:38 -07:00
Kazuaki Ishizaki	5aacce3db2	[mlir] NFC: Fix trivial typo Differential Revision: https://reviews.llvm.org/D77473	2020-04-05 11:30:30 +09:00
Alex Grosul	3a5192098c	[mlir][VectorOps] Implement canonicalization for TransposeOp. Two back-to-back transpose operations are combined into a single transpose, which uses a combination of their permutation vectors. Differential Revision: https://reviews.llvm.org/D77331	2020-04-02 18:36:40 -07:00
Uday Bondhugula	7c771631c6	[MLIR][NFC] drop unnecessary matches in affine dma generate test case Drop unnecessary matches in affine DMA generate test case. Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, grosul1, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77243	2020-04-02 03:02:07 +05:30
Uday Bondhugula	5e8093134a	[MLIR] Add method to drop duplicate result exprs from AffineMap Add a method that given an affine map returns another with just its unique results. Use this to drop redundant bounds in max/min for affine.for. Update affine.for's canonicalization pattern and createCanonicalizedForOp to use this. Differential Revision: https://reviews.llvm.org/D77237	2020-04-02 03:00:19 +05:30
Uday Bondhugula	572890f1d3	[MLIR][NFC] clean up affine data copy test case Capture maps to test better; drop unnecessary matches Differential Revision: https://reviews.llvm.org/D77196	2020-04-01 22:37:49 +05:30
Hanhan Wang	6dd696ae4f	[mlir][Linalg] Extend fusion to support WAW atm on buffers. Summary: The RAW fusion happens only if the produecer block dominates the consumer block. The WAW pattern also works with the precondition. I.e., if a producer can dominate the consumer, they can fairly fuse together. Since they are all tilable, we can think the pattern like this way: Input: ``` linalg_op1 view tile_loop subview_2 linalg_op2 subview_2 ``` Tile the first Linalg op as same as the second Linalg. ``` tile_loop subview_1 linalg_op1 subview_1 tile_loop subview_2 liangl_op2 subview_2 ``` Since the first Linalg op is tilable in the same way and the computation are independently, it's fair to fuse it with the second Linalg op. ``` tile_loop subview_1 linalg_op1 subview_1 linalg_op2 subview_2 ``` In short, this patch includes: - Handling both RAW and WAW pattern. - Adding a interface method to get input and output buffers. - Exposing a method to get a StringRef of a dependency type. - Fixing existing WAW tests and add one more use case: initialize the buffer before conv op. Differential Revision: https://reviews.llvm.org/D76897	2020-03-31 21:33:50 -07:00
Hanhan Wang	69ddee1d2a	[mlir][Linalg] Introduce linalg.pooling_min/max/sum op. Summary: Performs an N-D pooling operation similarly to the description in the TF documentation: https://www.tensorflow.org/api_docs/python/tf/nn/pool Different from the description, this operation doesn't perform on batch and channel. It only takes tensors of rank `N`. ``` output[x[0], ..., x[N-1]] = REDUCE_{z[0], ..., z[N-1]} input[ x[0] * strides[0] - pad_before[0] + dilation_rate[0]z[0], ... x[N-1]strides[N-1] - pad_before[N-1] + dilation_rate[N-1]*z[N-1] ], ``` The required optional arguments are: - strides: an i64 array specifying the stride (i.e. step) for window loops. - dilations: an i64 array specifying the filter upsampling/input downsampling rate - padding: an i64 array of pairs (low, high) specifying the number of elements to pad along a dimension. If strides or dilations attributes are missing then the default value is one for each of the input dimensions. Similarly, padding values are zero for both low and high in each of the dimensions, if not specified. Differential Revision: https://reviews.llvm.org/D76414	2020-03-31 21:21:54 -07:00
Alex Grosul	855e738be2	[VectorOps] Implement a simple folder for identity vector.transpose operations. Differential Revision: https://reviews.llvm.org/D77088	2020-03-31 17:03:10 -07:00
MaheshRavishankar	da7b6fe942	[mlir][Linalg] Allow tiling of batch dimension for convolution ops with padding. Existing tiling implementation of Linalg would still work for tiling the batch dimensions of the convolution op. Differential Revision: https://reviews.llvm.org/D76637	2020-03-31 09:22:38 -07:00
Andy Davis	31a346cc35	[MLIR][Vector] Add support for TupleGetOp folding through InsertSlicesOp and ExtractSlicesOp. Summary: Add support for TupleGetOp folding through InsertSlicesOp and ExtractSlicesOp. Vector-to-vector transformations for unrolling and lowering to hardware vectors can generate chains of structured vector operations (InsertSlicesOp, ExtractSlicesOp and ShapeCastOp) between the producer of a hardware vector value and its consumer. Because InsertSlicesOp, ExtractSlicesOp and ShapeCastOp are structured, we can track the location (tuple index and vector offsets) of the consumer vector value through the chain of structured operations to the producer, enabling a much more powerful producer-consumer fowarding of values through structured ops and tuple, which in turn enables a more powerful TupleGetOp folding transformation. Reviewers: nicolasvasilache, aartbik Reviewed By: aartbik Subscribers: grosul1, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76889	2020-03-31 08:39:17 -07:00
Ahmed Taei	221fa96cd4	Fix linalg.generic access of hoisted constants Summary: Otherwise the added @generic_const_int will fail Reviewers: nicolasvasilache, rriddle, mravishankar Subscribers: mehdi_amini, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, grosul1, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77109	2020-03-30 21:15:41 -07:00
Mehdi Amini	111c932855	Fix test: add `-allow-unregistered-dialect` to Affine/loop-permute.mlir test (missed during rebase)	2020-03-30 20:14:47 +00:00
Mehdi Amini	bab5bcf8fd	Add a flag on the context to protect against creation of operations in unregistered dialects Differential Revision: https://reviews.llvm.org/D76903	2020-03-30 19:37:31 +00:00
Uday Bondhugula	f273e5c507	[MLIR] Fix permuteLoops utility Rewrite mlir::permuteLoops (affine loop permutation utility) to fix incorrect approach. Avoiding using sinkLoops entirely - use single move approach. Add test pass. This fixes https://bugs.llvm.org/show_bug.cgi?id=45328 Depends on D77003. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D77004	2020-03-30 23:38:23 +05:30
Uday Bondhugula	43a95a543f	[MLIR] Introduce full/partial tile separation using if/else This patch introduces a utility to separate full tiles from partial tiles when tiling affine loop nests where trip counts are unknown or where tile sizes don't divide trip counts. A conditional guard is generated to separate out the full tile (with constant trip count loops) into the then block of an 'affine.if' and the partial tile to the else block. The separation allows the 'then' block (which has constant trip count loops) to be optimized better subsequently: for eg. for unroll-and-jam, register tiling, vectorization without leading to cleanup code, or to offload to accelerators. Among techniques from the literature, the if/else based separation leads to the most compact cleanup code for multi-dimensional cases (because a single version is used to model all partial tiles). INPUT affine.for %i0 = 0 to %M { affine.for %i1 = 0 to %N { "foo"() : () -> () } } OUTPUT AFTER TILING W/O SEPARATION map0 = affine_map<(d0) -> (d0)> map1 = affine_map<(d0)[s0] -> (d0 + 32, s0)> affine.for %arg2 = 0 to %M step 32 { affine.for %arg3 = 0 to %N step 32 { affine.for %arg4 = #map0(%arg2) to min #map1(%arg2)[%M] { affine.for %arg5 = #map0(%arg3) to min #map1(%arg3)[%N] { "foo"() : () -> () } } } } OUTPUT AFTER TILING WITH SEPARATION map0 = affine_map<(d0) -> (d0)> map1 = affine_map<(d0) -> (d0 + 32)> map2 = affine_map<(d0)[s0] -> (d0 + 32, s0)> #set0 = affine_set<(d0, d1)[s0, s1] : (-d0 + s0 - 32 >= 0, -d1 + s1 - 32 >= 0)> affine.for %arg2 = 0 to %M step 32 { affine.for %arg3 = 0 to %N step 32 { affine.if #set0(%arg2, %arg3)[%M, %N] { // Full tile. affine.for %arg4 = #map0(%arg2) to #map1(%arg2) { affine.for %arg5 = #map0(%arg3) to #map1(%arg3) { "foo"() : () -> () } } } else { // Partial tile. affine.for %arg4 = #map0(%arg2) to min #map2(%arg2)[%M] { affine.for %arg5 = #map0(%arg3) to min #map2(%arg3)[%N] { "foo"() : () -> () } } } } } The separation is tested via a cmd line flag on the loop tiling pass. The utility itself allows one to pass in any band of contiguously nested loops, and can be used by other transforms/utilities. The current implementation works for hyperrectangular loop nests. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76700	2020-03-28 06:58:35 +05:30
Uday Bondhugula	92744f6247	[MLIR] Add flat affine constraints method to round trip integer set - add method to get back an integer set from flat affine constraints; this allows a round trip - use this to complete the simplification of integer sets in -simplify-affine-structures - update FlatAffineConstraints::removeTrivialRedundancy to also do GCD tightening and normalize by GCD (while still keeping it linear time). Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>	2020-03-26 12:07:13 +05:30
Frej Drejhammar	d8981ce5b9	[mlir][Parser] Fix attribute parser errors for ui64 Summary: The attribute parser fails to correctly parse unsigned 64 bit attributes as the check `isNegative ? (int64_t)-val.getValue() >= 0 : (int64_t)val.getValue() < 0` will falsely detect an overflow for unsigned values larger than 2^63-1. This patch reworks the overflow logic to instead of doing arithmetic on int64_t use APInt::isSignBitSet() and knowledge of the attribute type. Test-cases which verify the de-facto behavior of the parser and triggered the previous faulty handing of unsigned 64 bit attrbutes are also added. Differential Revision: https://reviews.llvm.org/D76493	2020-03-25 11:57:16 -07:00
aartbik	8d46bfa808	[mlir] [VectorOps] A "reference" lowering of vector.transpose to LLVM IR Summary: Makes the vector.tranpose runnable on CPU. Reviewers: nicolasvasilache, andydavis1, rriddle Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76644	2020-03-23 19:01:38 -07:00
Uday Bondhugula	b873761496	[MLIR][NFC] Move some of the affine transforms / tests to dialect dirs Move some of the affine transforms and their test cases to their respective dialect directory. This patch does not complete the move, but takes care of a good part. Renames: prefix 'affine' to affine loop tiling cl options, vectorize -> super-vectorize Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76565	2020-03-23 08:25:07 +05:30
River Riddle	e9482ed194	[mlir] Move several static cl::opts to be pass options instead. This removes the reliance on global options, and also simplifies the pass registration. Differential Revision: https://reviews.llvm.org/D76552	2020-03-22 03:16:21 -07:00
aartbik	479ee11061	[mlir] [VectorOps] Introduce vector.transpose Summary: Introduced in order to introduce specialized lowering passes that implement transposition operations efficiently. Reviewers: nicolasvasilache, andydavis1 Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76460	2020-03-20 15:47:44 -07:00
Rob Suderman	e708471395	[mlir][NFC] Cleanup AffineOps directory structure Summary: Change AffineOps Dialect structure to better group both IR and Tranforms. This included extracting transforms directly related to AffineOps. Also move AffineOps to Affine. Differential Revision: https://reviews.llvm.org/D76161	2020-03-20 14:23:43 -07:00
Ahmed Taei	08a9147349	[mlir][LLVMIR] Fix fusion for rank-0 tensors Summary: This diff fixes fusion craching for ops with rank-0 tensors Reviewers: mravishankar, nicolasvasilache, rriddle! Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76479	2020-03-20 13:17:19 -07:00
Nicolas Vasilache	462db62053	[mlir][AVX512] Start a primitive AVX512 dialect The Vector Dialect [document](https://mlir.llvm.org/docs/Dialects/Vector/) discusses the vector abstractions that MLIR supports and the various tradeoffs involved. One of the layer that is missing in OSS atm is the Hardware Vector Ops (HWV) level. This revision proposes an AVX512-specific to add a new Dialect/Targets/AVX512 Dialect that would directly target AVX512-specific intrinsics. Atm, we rely too much on LLVM’s peephole optimizer to do a good job from small insertelement/extractelement/shufflevector. In the future, when possible, generic abstractions such as VP intrinsics should be preferred. The revision will allow trading off HW-specific vs generic abstractions in MLIR. Differential Revision: https://reviews.llvm.org/D75987	2020-03-20 14:11:57 -04:00
Valentin Clement	d4d62fcab6	[MLIR] Add test for multiple gpu.all_reduce in the same kernel when lowering to NVVM Summary: This patch add tests when lowering multiple `gpu.all_reduce` operations in the same kernel. This was previously failing. Differential Revision: https://reviews.llvm.org/D75930	2020-03-19 16:36:38 +01:00
Shraiysh Vaishay	ff77397fcf	[mlir] Added llvm.resume and personality functions in LLVM IR Dialect `llvm.resume` is similar to `llvm.return` except that has to be exactly one operand and that should be derived from a `llvm.landingpad` instruction. Any function having `llvm.landingpad` instruction must have a personality attribute. Example: LLVM IR ``` define dso_local i32 @main() personality i32 (...)* @__gxx_personality_v0 { invoke void @foo(i32 42) to label %3 unwind label %1 1: ; preds = %0 %2 = landingpad i8* catch i8** @_ZTIi catch i8* bitcast (i8** @_ZTIi to i8) resume i8 %2 3: ; preds = %0 ret i32 1 } ``` MLIR - LLVM IR Dialect ``` llvm.func @main() -> !llvm.i32 attributes {personality = @__gxx_personality_v0} { %0 = llvm.mlir.constant(1 : i32) : !llvm.i32 %1 = llvm.mlir.addressof @_ZTIi : !llvm<"i8"> %2 = llvm.bitcast %1 : !llvm<"i8"> to !llvm<"i8"> %3 = llvm.mlir.addressof @_ZTIi : !llvm<"i8"> %4 = llvm.mlir.constant(42 : i32) : !llvm.i32 llvm.invoke @foo(%4) to ^bb2 unwind ^bb1 : (!llvm.i32) -> () ^bb1: // pred: ^bb0 %5 = llvm.landingpad (catch %3 : !llvm<"i8">) (catch %2 : !llvm<"i8">) : !llvm<"i8"> llvm.resume %5 : !llvm<"i8"> ^bb2: // pred: ^bb0 llvm.return %0 : !llvm.i32 } ``` Differential Revision: https://reviews.llvm.org/D71888	2020-03-19 13:14:25 +01:00
Alex Zinenko	bc18624b40	[mlir] vector.type_cast: disallow memrefs with layout in verifier Summary: These are not supported by any of the code using `type_cast`. In the general case, such casting would require memrefs to handle a non-contiguous vector representation or misaligned vectors (e.g., if the offset of the source memref is not divisible by vector size, since offset in the target memref is expressed in the number of elements). Differential Revision: https://reviews.llvm.org/D76349	2020-03-19 10:15:41 +01:00
Lei Zhang	58df5e6d9a	[mlir][spirv] Plumbing target environment into type converter This commit unifies target environment queries into a new wrapper class spirv::TargetEnv and shares across various places needing the functionality. We still create multiple instances of TargetEnv though given the parent components (type converters, passes, conversion targets) have different lifetimes. In the meantime, LowerABIAttributesPass is updated to take into consideration the target environment, which requires updates to tests to provide that. Differential Revision: https://reviews.llvm.org/D76242	2020-03-18 20:11:05 -04:00
Lei Zhang	3b35f9d8b5	[mlir][spirv] Use memref memory space for storage class Previously in SPIRVTypeConverter, we always convert memref types to StorageBuffer regardless of their memory spaces. This commit fixes that to let the conversion to look into memory space properly. For this purpose, a mapping between SPIR-V storage class and memref memory space is introduced. The mapping is arbitary decided at the moment and the hope is that we can leverage string memory space later to be more clear. Now spv.interface_var_abi cannot contain storage class unless it's attached to a scalar value, where we need the storage class as side channel information. Verifications and tests are properly adjusted. Differential Revision: https://reviews.llvm.org/D76241	2020-03-18 20:11:04 -04:00
Nicolas Vasilache	2fae7878d5	[mlir][Vector] Mostly-NFC - Restructure options for lowering to LLVM Matrix Intrinsics Summary: This revision restructures the calling of vector transforms to make it more flexible to ask for lowering through LLVM matrix intrinsics. This also makes sure we bail out in degenerate cases (i.e. 1) in which LLVM complains about not being able to scalarize. Differential Revision: https://reviews.llvm.org/D76266	2020-03-17 22:58:02 -04:00
Rob Suderman	4d60f47b08	[mlir][NFC] Renamed VectorOps to Vector Summary: Renamed VectorOps to Vector to avoid the redundant Ops suffix. Differential Revision: https://reviews.llvm.org/D76317	2020-03-17 15:28:08 -07:00
Rob Suderman	363dd3f394	[mlir][NFC] Rename QuantOps to Quant Summary: Renamed QuantOps to Quant to avoid the Ops suffix. All dialects will contain ops, so the Ops suffix is redundant. Differential Revision: https://reviews.llvm.org/D76318	2020-03-17 15:16:47 -07:00
Jacques Pienaar	9a65d683e0	[mlir] Add target for Shape dialect Summary: Add targets and basic printing/parsing of types in Shape dialect. Differential Revision: https://reviews.llvm.org/D76321	2020-03-17 14:54:25 -07:00
Sagar Jain	76cf14035b	[MLIR] Added llvm.fence This patch adds llvm.fence. I tried not to change the syntax much. syntax: LLVM IR `fence [syncscope("<target-scope>")] <ordering>` MLIR LLVM Dialect `llvm.fence [syncscope("<target-scope>")] <ordering>` example: LLVM IR: `fence syncscope("agent") seq_cst` MLIR: `llvm.fence syncscope("agent") seq_cst` Differential Revision: https://reviews.llvm.org/D75645	2020-03-17 17:53:37 +01:00
Alex Zinenko	e119980f3f	[mlir] LLVM dialect: move ensureDistinctSuccessors out of std->LLVM conversion MLIR supports terminators that have the same successor block with different block operands, which cannot be expressed in the LLVM's phi-notation as the block identifier is used to tell apart the predecessors. This limitation can be worked around by branching to a new block instead, with this new block unconditionally branching to the original successor and forwarding the argument. Until now, this transformation was performed during the conversion from the Standard to the LLVM dialect. This does not scale well to multiple dialects targeting the LLVM dialect as all of them would have to be aware of this limitation and perform the preparatory transformation. Instead, do it as a separate pass and run it immediately before the translation. Differential Revision: https://reviews.llvm.org/D75619	2020-03-17 15:22:14 +01:00
Uday Bondhugula	332f0b3cd4	Affine expr simplification for add of const multiple of same expression - Detect "c_1 * expr + c_2 * expr" as (c_1 + c_2) * expr - subsumes things like 'expr - expr' and "expr * -1 + expr" as 0. - change AffineConstantExpr ctor to allow default null init Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76233	2020-03-17 08:22:17 +05:30
Feng Liu	166f83f436	[QuantOps] Add the quant region definition Summary: This regional op in the QuantOps dialect will be used to wrap high-precision ops into atomic units for quantization. All the values used by the internal ops are captured explicitly by the op inputs. The quantization parameters of the inputs and outputs are stored in the attributes. Subscribers: jfb, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75972	2020-03-16 15:44:43 -07:00
Hanhan Wang	92f7e8133a	[mlir][Linalg] Implement padding for linalg.conv and lowering to loops. Summary: To enable this, two changes are needed: 1) Add an optional attribute `padding` to linalg.conv. 2) Compute if the indices accessing is out of bound in the loops. If so, use the padding value `0`. Otherwise, use the value derived from load. In the patch, the padding only works for lowering without other transformations, e.g., tiling, fusion, etc. Differential Revision: https://reviews.llvm.org/D75722	2020-03-13 14:35:58 -07:00
Nicolas Vasilache	bbf3ef8541	[mlir][Vector]Lower vector.contract to llvm.intr.matrix_multiply Summary: This revision adds lowering of vector.contract to llvm.intr.matrix_multiply. Note that there is currently a mismatch between the MLIR vector dialect which expects row-major layout and the LLVM matrix intrinsics which expect column major layout. As a consequence, we currently only match a vector.contract with indexing maps that express column-major matrix multiplication. Other cases would require additional transposes and it is better to wait for LLVM intrinsics to provide a per-operation attribute that would specify which layout is expected. A separate integration test, not submitted to MLIR core, has independently verified that correct execution occurs on a 2x2x2 matrix multiplication. Differential Revision: https://reviews.llvm.org/D76014	2020-03-13 16:33:23 -04:00
aartbik	a213ece30b	[mlir] [VectorOps,LinAlg] Remove direct LLVM lowering for vector.broadcast Summary: The direct lowering of vector.broadcast into LLVM has been replaced by progressive lowering into elementary vector ops. This also required a small refactoring of a llvm.mlir test that used a direct vector.broadcast operator (just to define a matmul). Reviewers: nicolasvasilache, andydavis1, rriddle Reviewed By: nicolasvasilache Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76143	2020-03-13 11:42:51 -07:00
Lei Zhang	e5c85a5a4f	[mlir][spirv] Support querying type extension/capability requirements Previously we only consider the version/capability/extension requirements on ops themselves. Some types in SPIR-V also require special extensions or capabilities to be used. For example, non-32-bit integers/floats will require different capabilities and/or extensions depending on where they are used because it may mean special hardware abilities. This commit adds query methods to SPIR-V type class hierarchy to support querying extensions and capabilities. We don't go through ODS for auto-generating such information given that we don't have them in SPIR-V machine readable grammar and there are just a few types. Differential Revision: https://reviews.llvm.org/D75875	2020-03-12 19:37:45 -04:00
Lei Zhang	3148f10b17	[mlir][spirv] Use spv.vce in spv.module and wire up (de)serialization This commits changes the definition of spv.module to use the #spv.vce attribute for specifying (version, capabilities, extensions) triple so that we can have better API and custom assembly form. Since now we have proper modelling of the triple, (de)serialization is wired up to use them. With the new UpdateVCEPass, we don't need to manually specify the required extensions and capabilities anymore when creating a spv.module. One just need to call UpdateVCEPass before serialization to get the needed version/extensions/capabilities. Differential Revision: https://reviews.llvm.org/D75872	2020-03-12 19:37:45 -04:00
Lei Zhang	9414db1090	[mlir][spirv] Add a pass to deduce version/extension/capability Creates an operation pass that deduces and attaches the minimal version/ capabilities/extensions requirements for spv.module ops. For each spv.module op, this pass requires a `spv.target_env` attribute on it or an enclosing module-like op to drive the deduction. The reason is that an op can be enabled by multiple extensions/capabilities. So we need to know which one to pick. `spv.target_env` gives the hard limit as for what the target environment can support; this pass deduces what are actually needed for a specific spv.module op. Differential Revision: https://reviews.llvm.org/D75870	2020-03-12 19:37:45 -04:00
Lei Zhang	e115a40f50	[mlir][spirv] Use separate attribute for (version, capabilities, extensions) We also need the (version, capabilities, extensions) triple on the spv.module op. Thus far we have been using separate 'extensions' and 'capabilities' attributes there and 'version' is missing. Creating a separate attribute for the trip allows us to reuse the assembly form and verification. Differential Revision: https://reviews.llvm.org/D75868	2020-03-12 19:37:45 -04:00
River Riddle	907403f342	[mlir] Add a new `ConstantLike` trait to better identify operations that represent a "constant". The current mechanism for identifying is a bit hacky and extremely adhoc, i.e. we explicit check 1-result, 0-operand, no side-effect, and always foldable and then assume that this is a constant. Adding a trait adds structure to this, and makes checking for a constant much more efficient as we can guarantee that all of these things have already been verified. Differential Revision: https://reviews.llvm.org/D76020	2020-03-12 14:26:15 -07:00
River Riddle	7c211cf3af	[mlir][NFC] Move the definition of AffineApplyOp to ODS This has been a long standing cleanup TODO. Differential Revision: https://reviews.llvm.org/D76019	2020-03-12 14:26:15 -07:00
aartbik	078776a679	[mlir] [VectorOps] Progressively lower vector.outerproduct to LLVM Summary: This replaces the direct lowering of vector.outerproduct to LLVM with progressive lowering into elementary vectors ops to avoid having the similar lowering logic at several places. NOTE1: with the new progressive rule, the lowered llvm is slightly more elaborate than with the direct lowering, but the generated assembly is just as optimized; still if we want to stay closer to the original, we should add a "broadcast on extract" to shuffle rewrite (rather than special cases all the lowering steps) NOTE2: the original outerproduct lowering code should now be removed but some linalg test work directly on vector and contain some dead code, so this requires another CL Reviewers: nicolasvasilache, andydavis1 Reviewed By: nicolasvasilache, andydavis1 Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75956	2020-03-12 13:45:42 -07:00
Valentin Clement	c7380995f8	[MLIR] Add `and`, `or`, `xor`, `min`, `max` too gpu.all_reduce and the nvvm lowering Summary: This patch add some builtin operation for the gpu.all_reduce ops. - for Integer only: `and`, `or`, `xor` - for Float and Integer: `min`, `max` This is useful for higher level dialect like OpenACC or OpenMP that can lower to the GPU dialect. Differential Revision: https://reviews.llvm.org/D75766	2020-03-11 14:07:04 +01:00
Stephan Herhut	f6790a1c63	Revert "[MLIR] Add `and`, `or`, `xor`, `min`, `max` too gpu.all_reduce and the nvvm lowering" Attribution to original author got lost.	2020-03-11 14:07:04 +01:00

1 2 3 4 5 ...

584 Commits