llvm-project

Commit Graph

Author	SHA1	Message	Date
Alexander Belyaev	68b0aaad56	Revert "Revert "[mlir] Reuse the code between `getMixed*s()` funcs in ViewLikeInterface.cpp."" This reverts commit `e78d7637fb`. Differential Revision: https://reviews.llvm.org/D130706	2022-07-31 21:46:18 +02:00
Alexander Belyaev	e78d7637fb	Revert "[mlir] Reuse the code between `getMixed*s()` funcs in ViewLikeInterface.cpp." This reverts commit `e8c2877565`.	2022-07-31 21:25:20 +02:00
Alexander Belyaev	e8c2877565	[mlir] Reuse the code between `getMixed*s()` funcs in ViewLikeInterface.cpp. Differential Revision: https://reviews.llvm.org/D130706	2022-07-31 21:09:30 +02:00
srishti-cb	b508c5649f	[MLIR] Add a utility to sort the operands of commutative ops Added a commutativity utility pattern and a function to populate it. The pattern sorts the operands of an op in ascending order of the "key" associated with each operand iff the op is commutative. This sorting is stable. The function is intended to be used inside passes to simplify the matching of commutative operations. After the application of the above-mentioned pattern, since the commutative operands now have a deterministic order in which they occur in an op, the matching of large DAGs becomes much simpler, i.e., requires much less number of checks to be written by a user in her/his pattern matching function. The "key" associated with an operand is the list of the "AncestorKeys" associated with the ancestors of this operand, in a breadth-first order. The operand of any op is produced by a set of ops and block arguments. Each of these ops and block arguments is called an "ancestor" of this operand. Now, the "AncestorKey" associated with: 1. A block argument is `{type: BLOCK_ARGUMENT, opName: ""}`. 2. A non-constant-like op, for example, `arith.addi`, is `{type: NON_CONSTANT_OP, opName: "arith.addi"}`. 3. A constant-like op, for example, `arith.constant`, is `{type: CONSTANT_OP, opName: "arith.constant"}`. So, if an operand, say `A`, was produced as follows: ``` `<block argument>` `<block argument>` \ / \ / `arith.subi` `arith.constant` \ / `arith.addi` \| returns `A` ``` Then, the block arguments and operations present in the backward slice of `A`, in the breadth-first order are: `arith.addi`, `arith.subi`, `arith.constant`, `<block argument>`, and `<block argument>`. Thus, the "key" associated with operand `A` is: ``` { {type: NON_CONSTANT_OP, opName: "arith.addi"}, {type: NON_CONSTANT_OP, opName: "arith.subi"}, {type: CONSTANT_OP, opName: "arith.constant"}, {type: BLOCK_ARGUMENT, opName: ""}, {type: BLOCK_ARGUMENT, opName: ""} } ``` Now, if "keyA" is the key associated with operand `A` and "keyB" is the key associated with operand `B`, then: "keyA" < "keyB" iff: 1. In the first unequal pair of corresponding AncestorKeys, the AncestorKey in operand `A` is smaller, or, 2. Both the AncestorKeys in every pair are the same and the size of operand `A`'s "key" is smaller. AncestorKeys of type `BLOCK_ARGUMENT` are considered the smallest, those of type `CONSTANT_OP`, the largest, and `NON_CONSTANT_OP` types come in between. Within the types `NON_CONSTANT_OP` and `CONSTANT_OP`, the smaller ones are the ones with smaller op names (lexicographically). --- Some examples of such a sorting: Assume that the sorting is being applied to `foo.commutative`, which is a commutative op. Example 1: > %1 = foo.const 0 > %2 = foo.mul <block argument>, <block argument> > %3 = foo.commutative %1, %2 Here, 1. The key associated with %1 is: ``` { {CONSTANT_OP, "foo.const"} } ``` 2. The key associated with %2 is: ``` { {NON_CONSTANT_OP, "foo.mul"}, {BLOCK_ARGUMENT, ""}, {BLOCK_ARGUMENT, ""} } ``` The key of %2 < the key of %1 Thus, the sorted `foo.commutative` is: > %3 = foo.commutative %2, %1 Example 2: > %1 = foo.const 0 > %2 = foo.mul <block argument>, <block argument> > %3 = foo.mul %2, %1 > %4 = foo.add %2, %1 > %5 = foo.commutative %1, %2, %3, %4 Here, 1. The key associated with %1 is: ``` { {CONSTANT_OP, "foo.const"} } ``` 2. The key associated with %2 is: ``` { {NON_CONSTANT_OP, "foo.mul"}, {BLOCK_ARGUMENT, ""} } ``` 3. The key associated with %3 is: ``` { {NON_CONSTANT_OP, "foo.mul"}, {NON_CONSTANT_OP, "foo.mul"}, {CONSTANT_OP, "foo.const"}, {BLOCK_ARGUMENT, ""}, {BLOCK_ARGUMENT, ""} } ``` 4. The key associated with %4 is: ``` { {NON_CONSTANT_OP, "foo.add"}, {NON_CONSTANT_OP, "foo.mul"}, {CONSTANT_OP, "foo.const"}, {BLOCK_ARGUMENT, ""}, {BLOCK_ARGUMENT, ""} } ``` Thus, the sorted `foo.commutative` is: > %5 = foo.commutative %4, %3, %2, %1 Signed-off-by: Srishti Srivastava <srishti.srivastava@polymagelabs.com> Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D124750	2022-07-30 19:25:18 -04:00
Kazu Hirata	16eaeaded5	Use is_contained (NFC)	2022-07-30 10:35:54 -07:00
Kazu Hirata	5bc0e7b73c	Convert for_each to range-based for loops (NFC)	2022-07-30 10:35:52 -07:00
Kazu Hirata	e5a1ccbf25	Use value instead of getValue (NFC)	2022-07-29 21:18:41 -07:00
Uday Bondhugula	995e9d84f8	[MLIR] Fix getCommonBlock utility in affine analysis Fix the hardcoded check for `FuncOp` in `getCommonBlock` utility: the check should have been for an op that starts an affine scope. The incorrect block returned in turn causes dependence analysis to function incorrectly. This change allows affine store-load forwarding to work correctly inside any ops that start an affine scope. Reviewed By: ftynse, dcaballe Differential Revision: https://reviews.llvm.org/D130749	2022-07-30 07:14:54 +05:30
Anlun Xu	c01e622120	[mlir][sparse]Add more test cases for sparse reshape Add more test cases for sparse reshape Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D130603	2022-07-29 16:42:48 -07:00
Fangrui Song	7430894a65	Replace Optional::hasValue with has_value or operator bool. NFC	2022-07-29 10:57:25 -07:00
Markus Böck	37da2a141c	[mlir][LLVM] Rework the API of GEPOp The implementation and API of GEP Op has gotten a bit convoluted over the time. Issues with it are: * Misleading naming: `indices` actually only contains the dynamic indices, not all of them. To get the amount of indices you need to query the size of `structIndices` * Very difficult to iterate over all indices properly: One had to iterate over `structIndices`, check whether it contains the magic constant `kDynamicIndex`, if it does, access the next value in `index` etc. * Inconvenient to build: One either has create lots of constant ops for every index or have an odd split of passing both a `ValueRange` as well as a `ArrayRef<int32_t>` filled with `kDynamicIndex` at the correct places. * Implementation doing verification in the build method and more. This patch attempts to address all these issues via convenience classes and reworking the way GEP Op works: * Adds `GEPArg` class which is a sum type of a `int32_t` and `Value` and is used to have a single convenient easy to use `ArrayRef<GEPArg>` in the builders instead of the previous `ValueRange` + `ArrayRef<int32_t>` builders. * Adds `GEPIndicesAdapter` which is a class used for easy random access and iteration over the indices of a GEP. It is generic and flexible enough to also instead return eg. a corresponding `Attribute` for an operand inside of `fold`. * Rename `structIndices` to `rawConstantIndices` and `indices` to `dynamicIndices`: `rawConstantIndices` signifies one shouldn't access it directly as it is encoded, and `dynamicIndices` is more accurate and also frees up the `indices` name. * Add `getIndices` returning a `GEPIndicesAdapter` to easily iterate over the GEP Ops indices. * Move the verification/asserts out of the build method and into the `verify` method emitting op error messages. * Add convenient builder methods making use of `GEPArg`. * Add canonicalizer turning dynamic indices with constant values into constant indices to have a canonical representation. The only breaking change is for any users building GEPOps that have so far used the old `ValueRange` + `ArrayRef<int32_t>` builder as well as those using the generic syntax. Another follow up patch then goes through upstream and makes use of the new `ArrayRef<GEPArg>` builder to remove a lot of code building constants for GEP indices. Differential Revision: https://reviews.llvm.org/D130730	2022-07-29 18:22:54 +02:00
lewuathe	730cb82226	[mlir][complex] Canonicalize complex.add zero Adding complex value with 0 for real and imaginary part can be ignored. NOTE: This type of canonicalization can be written in an easy and tidy format using `complex.number` after constant op supports custom attribute. Differential Revision: https://reviews.llvm.org/D130748	2022-07-29 18:03:45 +02:00
Alex Zinenko	26821f75ed	[mlir][NFC] accept plain OpBuidler in folded construction helpers A group of functions in the Affine dialect provides a mechanism for buliding folded-by-construction operations. These functions used to accept a `RewriterBase` reference because they may need to erase the operations that were folded and notify the rewriter when called from rewrite patterns. Adopt a different approach: postpone the builder notification of the op creation until we are certain that the op will not be folded away. This removes the need to notify the rewriter about op deletion following op construction in case of successful folding, and removes a bunch of one-off `IRRewriter` instances in transform code that may mess up insertion points. Reviewed By: springerm, mravishankar Differential Revision: https://reviews.llvm.org/D130616	2022-07-29 16:01:56 +00:00
Adrian Kuegel	7d76da539f	[mlir][Complex] Add a convenience getValue() method. This method returns the value as std::complex<APFloat> Differential Revision: https://reviews.llvm.org/D130770	2022-07-29 15:35:39 +02:00
Adrian Kuegel	6e951b3ec9	[mlir][Complex] Add convenience builder for complex.number attribute. Differential Revision: https://reviews.llvm.org/D130756	2022-07-29 14:13:44 +02:00
Fangrui Song	0595edd80f	[mlir][toy] Replace Optional::getValue with value. NFC	2022-07-29 01:10:51 -07:00
lewuathe	bcd538ab92	[mlir][complex] Canonicalize consecutive complex.conj We can canonicalize consecutive complex.conj just by removing all conjugate operations. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D130684	2022-07-29 09:41:09 +09:00
Jacques Pienaar	af3ed4a2a7	[mlir] Introduce DefaultValuedOptionalAttr Currently DefaultValuedAttr is confusingly actually default valued & optional but that was an artifact of development and longstanding TODO to address. Add new attribute that matches this behavior for cases where that is actually the desired behavior before addressing TODO (e.g., this is an incremental step to fixing DefaultValuedAttr). Differential Revision: https://reviews.llvm.org/D130679	2022-07-28 14:43:13 -07:00
Markus Böck	2ec510f113	[mlir] Add Type::isa_and_nonnull	2022-07-28 22:41:46 +02:00
Adrian Kuegel	23c3eb7cdf	[mlir][Complex] Change complex.number attribute type to ComplexType. It is more useful to use ComplexType as type of the attribute than to use the element type as attribute type. This means when using this attribute in complex::ConstantOp, we just need to check whether the types match. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D130703	2022-07-28 21:25:12 +02:00
River Riddle	00a52c7565	[mlir:SubElementsInterface] Add support for "skipping" when replacing attributes/types This is used to fix a bug in SymbolTable::replaceAllSymbolUses where we replace symbols that we shouldn't. Differential Revision: https://reviews.llvm.org/D130693	2022-07-28 10:52:12 -07:00
Mahesh Ravishankar	9fe27bca71	[mlir][Linalg] Allow decompose to handle ops when value of `outs` operand is used in payload. Current implementation of decomposition of Linalg operations wouldnt work if the `outs` operand values were used within the body of the operation. Relax this restriction. This potentially sets the stage for decomposing ops with reduction iterator types (but is not done here since it requires more study). Differential Revision: https://reviews.llvm.org/D130527	2022-07-28 16:42:54 +00:00
Mahesh Ravishankar	6f03a10e4f	[mlir][TilingInterface] Add a method to generate scalar implementation of the op. While The tiling interface provides a mechanism for operations to be tiled into tiled version of the op (or another op at the same level of abstraction), the `generateScalarImplementation` method added here is the "exit point" after all transformations have been done. Ops that implement this method are expected to generate IR that are directly lowerable to backend dialects like LLVM or SPIR-V dialects. Differential Revision: https://reviews.llvm.org/D130612	2022-07-28 16:37:15 +00:00
Prabhdeep Singh Soni	f5efa1892e	[Flang][MLIR][OpenMP] Add support for simdlen clause This supports lowering from parse-tree to MLIR and translation from MLIR to LLVM IR using OMPIRBuilder for OpenMP simdlen clause in SIMD construct. Reviewed By: shraiysh, peixin, arnamoy10 Differential Revision: https://reviews.llvm.org/D130195	2022-07-28 23:49:17 +08:00
Adrian Kuegel	ba110cf97a	[mlir] Add getters for DenseArrayAttr. This change adds convenience getters to builders. Differential Revision: https://reviews.llvm.org/D130696	2022-07-28 13:26:27 +02:00
Alexander Belyaev	824954a8c9	[mlir] Small stylistic changes to Complex_NumberAttr Differential Revision: https://reviews.llvm.org/D130632	2022-07-28 10:59:52 +02:00
Matthias Springer	c1e6caac70	[mlir][transform] Support results on ForeachOp Handles can be yielded from the ForeachOp. Differential Revision: https://reviews.llvm.org/D130640	2022-07-28 10:39:54 +02:00
Gaurav Shukla	7d6ef5caef	[mlir][tensor] Fold `tensor.cast` into `tensor.collapse_shape` op This commit folds a `tensor.cast` op into a `tensor.collapse_shape` op when following two conditions meet: 1. the `tensor.collapse_shape` op consumes result of the `tensor.cast` op. 2. `tensor.cast` op casts to a more dynamic version of the source tensor. This is added as a canonicalization pattern in `tensor.collapse_shape` op. Signed-Off-By: Gaurav Shukla <gaurav@nod-labs.com> Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D130650	2022-07-28 13:11:43 +05:30
Stella Laurenzo	7356404ace	[mlir] Delete most of the ops from the quant dialect. * https://discourse.llvm.org/t/rfc-removing-the-quant-dialect/3643/8 * Removes most ops. Leaves casts given final comment (can remove more in a followup). * There are a few uses in Tosa keeping some of the utilities alive. In a followup, I will probably elect to just move simplified versions of them into Tosa itself vs having this quasi-library dependency. Differential Revision: https://reviews.llvm.org/D120204	2022-07-27 17:50:42 -07:00
Lei Zhang	067daa56a9	[mlir][spirv] Unify resources of different vector sizes This commit extends UnifyAliasedResourcePass to handle the case where aliased resources have different vector sizes. (It still requires all scalar types to be of the same bitwidth.) This is effectively reusing the code for handling different-bitwidth scalar types. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D130671	2022-07-27 19:22:50 -04:00
Lei Zhang	7668e58210	[mlir][spirv] Fix spv.CompositeConstruct assembly and validation This commit fixes spv.CompositeConstruct to assembly to list operand types to enable vector construction out of smaller vectors. Validation is also fixed to properly check the cases for vector construction. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D130669	2022-07-27 19:17:23 -04:00
bixia1	66088afbc8	[mlir][sparse] Add arith-expand pass to the sparse-compiler pipeline. Modify an existing test to test the situation. Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D130658	2022-07-27 14:42:21 -07:00
Shraiysh Vaishay	57ef633259	[mlir][OpenMP] Add omp.atomic.update canonicalization This patch adds canonicalization conditions for omp.atomic.update thus eliminating it when it becomes just a write or a no-op due to other changes during canonicalization. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D126531	2022-07-28 00:19:18 +05:30
Peiming Liu	bf59cd320e	[mlir][sparse] fix error when sparse kernel is nested in a scf structrual operator. Sparse compiler failed on the provided test (when the sparse kernel is nested in a scf structrual operator). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D130609	2022-07-27 16:12:23 +00:00
lewuathe	5caa941f68	[mlir][complex] Custom attribute comlex.number. Add custom attribute for complex dialect. Although this commit does not have significant impact on the conversion framework, it will lead us to construct complex numbers in a readable and tidy manner. Related discussion: https://reviews.llvm.org/D127476 Reviewed By: pifon2a, akuegel Differential Revision: https://reviews.llvm.org/D130149	2022-07-27 15:15:55 +02:00
Markus Böck	7a67e41be2	[mlir][LLVM] Fix incorrect GEP fold with struct constants The fold in it's current state only checks whether the amount of dynamic indices is 1. This does however not check for the presence of any struct indices, leading to an incorrect fold. This patch fixes that issue by checking that struct indices are 1, which in addition to the pre-existing check that dynamic indices are 1, guarantees that the single index is a dynamic one. Differential Revision: https://reviews.llvm.org/D129374	2022-07-27 14:42:35 +02:00
Alex Zinenko	e99fae8997	[mlir] more aggressive folding in tiling/fusion transformations Combine the recently added utilities for folded-by-construction affine operations with the attribute-based Range to enable more folding. This decreases the amount of emitted code but has little effect on test precisely because the tests are not checking for the spurious constants. The difference in the shape of affine maps comes from the internals of affine folding. Depends on D129633 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D130167	2022-07-27 08:52:18 +00:00
Alex Zinenko	70e99f387a	[mlir] Make ViewLikeInterface Range work with attributes While most of methods in ViewLikeInterface accept an `OpFoldResult` for the offset/size/stride that may be static, represented as `Attribute`, or dynamic, represented as `Value`, the `Range` abstraction only accepted `Values`. This can often lead to known-constant offset/size/strides being materialized into constant operations and hinder further constant propagation without explicitly running the constant folding pass. This often leads to a more complicated than necessary addressing code being emitted. Switch `Range` to use `OpFoldResult`. Code that uses `Range` currently keeps materializing the constants to minimize the effect of this change on the IR. Further commits will make use of this. Reviewed By: nicolasvasilache, mravishankar Differential Revision: https://reviews.llvm.org/D129633	2022-07-27 08:52:13 +00:00
Alex Zinenko	08a1b07e7c	[mlir] Partially port splitting transform to TilingInterface The structured op splitting transformation is conceptually similar to tiling in the sense that it decomposes the iteration space of the original op into several parts. Therefore, it is possible to implement it using the TilingInterface to operate on iteration spaces and their parts. However, the implementation also requires to pass updated input operands, which is not supported by the interface, so the implementation currently remains Linalg-specific. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D129564	2022-07-27 08:52:08 +00:00
Adrian Kuegel	a0154218c3	[mlir] Apply ClangTidyPerformance findings (NFC)	2022-07-27 10:05:22 +02:00
River Riddle	01eedbc7c1	[mlir] Refactor SubElementInterface replace support The current support was essentially the amount necessary to support replacing SymbolRefAttrs, but suffers from various deficiencies (both ergonomic and functional): * Replace crashes if unsupported This makes it really hard to use safely, given that you don't know if you are going to crash or not when using it. * Types aren't supported This seems like a simple missed addition when the attribute replacement support was originally added. * The ergonomics are weird It currently uses an index based replacement, which makes the implementations quite clunky. This commit refactors support to be a bit more ergonomic, and also adds support for types in the process. This was also a great oppurtunity to greatly simplify how replacement is done in the symbol table. Fixes #56355 Differential Revision: https://reviews.llvm.org/D130589	2022-07-26 14:51:22 -07:00
Matthias Springer	bffec215ab	[mlir][transform] Add ForeachOp to transform dialect This op "unbatches" an op handle and executes the loop body for each payload op. Differential Revision: https://reviews.llvm.org/D130257	2022-07-26 18:07:44 +02:00
Benjamin Kramer	48a1a993fb	[mlir] Fall back to posix_memalign for aligned_alloc on MacOS aligned_alloc was added in MacOS 10.15, some users want to support older versions. The runtime functions makes this easy, so just put in a call to posix_memalign, which provides the same functionality.	2022-07-26 11:29:37 +02:00
Sebastian Neubauer	ae222dae56	[CMake] Fix add_subdirectory llvm builds Fixes a regression from D117973, that used CMAKE_BINARY_DIR instead of LLVM_BINARY_DIR in some places. Differential Revision: https://reviews.llvm.org/D130555	2022-07-26 11:24:23 +02:00
Kazu Hirata	ae002f8bca	Use isa instead of dyn_cast (NFC)	2022-07-25 23:00:58 -07:00
Kazu Hirata	2789c4f51c	[mlir] Use value_or (NFC)	2022-07-25 23:00:56 -07:00
jacquesguan	78015047b2	[mlir][Math] Add constant folder for Exp2Op. This patch adds constant folder for Exp2Op which only supports single and double precision floating-point. Differential Revision: https://reviews.llvm.org/D130472	2022-07-26 08:48:48 +08:00
Jacques Pienaar	13bc82b587	[mlir] SCCP add missing pessimistic setting When this was updated in D127139 the update in-place case was no longer marked as pessimistic. Add back in. Differential Revision: https://reviews.llvm.org/D130453	2022-07-25 17:41:17 -07:00
River Riddle	c35a4f5804	[mlir][Parser] Fix memory leak when failing to parse a forward declared block This commit fixes a failure edge case where we accidentally drop forward declared blocks in the error case. This allows for running the invalid.mlir test in asan mode now. Fixes #51387 Differential Revision: https://reviews.llvm.org/D130132	2022-07-25 17:29:49 -07:00
River Riddle	c60b897d22	[mlir] Refactor the Parser library in preparation for an MLIR binary format The current Parser library is solely focused on providing API for the textual MLIR format, but MLIR will soon also provide a binary format. This commit renames the current Parser library to AsmParser to better correspond to what the library is actually intended for. A new Parser library is added which will act as a unified parser interface between both text and binary formats. Most parser clients are unaffected, given that the unified interface is essentially the same as the current interface. Only clients that rely on utilizing the AsmParserState, or those that want to parse Attributes/Types need to be updated to point to the AsmParser library. Differential Revision: https://reviews.llvm.org/D129605	2022-07-25 16:33:01 -07:00
lipracer	de7ff8d66f	[mlir][NFC] Remove TypeRange's constructors that cause ambiguity ArrayRef<Value> can implicit convert to ValueRange，when we call TypeRange(SmallVector<Value>) is ambiguity. TypeRange(ValueRange values) TypeRange(ArrayRef<Value> values) Reviewed By: rriddle, Mogball Differential Revision: https://reviews.llvm.org/D129475	2022-07-25 15:38:05 -06:00
lipracer	10a4f03335	Split class ValueRange to a new file When we apply parent patch : https://reviews.llvm.org/D129475 The prompt I get with the clang compiler is: ValueRange is imcomplete type,ValueRange is a forward declaration in the file TypeRange.h, and the file OperationSupport.h already includes the file TypeRange.h.The class TypeRange and the class ValueRange depend on each other. Reviewed By: rriddle, Mogball Differential Revision: https://reviews.llvm.org/D130332	2022-07-25 15:33:21 -06:00
John Ericson	ac0d1d5c7b	[cmake] Support custom package install paths Firstly, we we make an additional GNUInstallDirs-style variable. With NixOS, for example, this is crucial as we want those to go in `${dev}/lib/cmake` not `${out}/lib/cmake` as that would a cmake subdir of the "regular" libdir, which is installed even when no one needs to do any development. Secondly, we make *Config.cmake robust to absolute package install paths. We for NixOS will in fact be passing them absolute paths to make the `${dev}` vs `${out}` distinction mentioned above, and the GNUInstallDirs-style variables are suposed to support absolute paths in general so it's good practice besides the NixOS use-case. Thirdly, we make `${project}_INSTALL_PACKAGE_DIR` CACHE PATHs like other install dirs are. Reviewed By: sebastian-ne Differential Revision: https://reviews.llvm.org/D117973	2022-07-25 21:02:53 +00:00
Kiran Chandramohan	7bb1151ba2	[Flang][OpenMP] Initial support for integer reduction in worksharing-loop Lower the Flang parse-tree containing OpenMP reductions to the OpenMP dialect. The OpenMP dialect models reductions with, 1) A reduction declaration operation that specifies how to initialize, combine, and atomically combine private reduction variables. 2) The OpenMP operation (like wsloop) that supports reductions has an array of reduction accumulator variables (operands) and an array attribute of the same size that points to the reduction declaration to be used for the reduction accumulation. 3) The OpenMP reduction operation that takes a value and an accumulator. This operation replaces the original reduction operation in the source. (1) is implemented by the `createReductionDecl` in OpenMP.cpp, (2) is implemented while creating the OpenMP operation, (3) is implemented by the `genOpenMPReduction` function in OpenMP.cpp, and called from Bridge.cpp. The implementation of (3) is not very robust. NOTE 1: The patch currently supports only reductions for integer type addition. NOTE 2: Only supports reduction in the worksharing loop. NOTE 3: Does not generate atomic combination region. NOTE 4: Other options for creating the reduction operation include a) having the reduction operation as a construct containing an assignment and then handling it appropriately in the Bridge. b) we can modify `genAssignment` or `genFIR(AssignmentStmt)` in the Bridge to handle OpenMP reduction but so far we have tried not to mix OpenMP and non-OpenMP code and this will break that. I will try (b) in a separate patch. NOTE 5: OpenMP dialect gained support for reduction with the patches: D105358, D107343. See https://discourse.llvm.org/t/rfc-openmp-reduction-support/3367 for more details. Reviewed By: awarzynski Differential Revision: https://reviews.llvm.org/D130077 Co-authored-by: Peixin-Qiao <qiaopeixin@huawei.com>	2022-07-25 18:47:07 +00:00
Christopher Bate	894641e974	Revert "[mlir][Arithmetic] Add `arith.delinearize_index` operation" This reverts commit `535b507ba5`.	2022-07-25 11:52:05 -06:00
Mike Urbach	e0af5032f1	[mlir] Update Python CMake version requirement. The minimum required version is now 3.19 due to the usage of some more recent features. Update the version check and error message accordingly. Also remove some logic that behaved differently before 3.18, since we can assume we are now on version 3.19+. Reviewed By: stella.stamenova Differential Revision: https://reviews.llvm.org/D130171	2022-07-25 11:29:33 -06:00
Benoit Jacob	f4ac950957	Generalize the vector transfer flattening patterns (dyn shapes). Differential Revision: https://reviews.llvm.org/D130284	2022-07-25 15:59:08 +00:00
Matthias Springer	a299539ade	[mlir][linalg] Expand test case for tile-and-fuse with transform dialect Reverse the order of the payload ops. fuse_into_containing_op should still work. Differential Revision: https://reviews.llvm.org/D130355	2022-07-25 16:14:35 +02:00
Alex Zinenko	9c1d133c3a	[mlir] move _mlir_alloc and friends to CRunnerUtils These functions don't depend on the C++ runtime and therefore belong to CRunnerUtils. Clean up the macros on the way as `_MSC_VER` indicates the compiler, not the platform, which is indicated by `_WIN32` and will be present when, e.g., compiling with minGW. Reviewed By: rdzhabarov Differential Revision: https://reviews.llvm.org/D130025	2022-07-25 15:52:53 +02:00
Michele Scuttari	a8601f11fb	[MLIR] Generic 'malloc', 'aligned_alloc' and 'free' functions When converted to the LLVM dialect, the memref.alloc and memref.free operations were generating calls to hardcoded 'malloc' and 'free' functions. This didn't leave any freedom to users to provide their custom implementation. Those operations now convert into calls to '_mlir_alloc' and '_mlir_free' functions, which have also been implemented into the runtime support library as wrappers to 'malloc' and 'free'. The same has been done for the 'aligned_alloc' function. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D128791	2022-07-25 15:52:51 +02:00
Matthias Springer	1defec8730	[mlir][tensor][bufferize][NFC] Remove duplicate code InsertSliceOp and ParallelInsertSliceOp are very similar and can share some of the bufferization analysis code. Differential Revision: https://reviews.llvm.org/D130465	2022-07-25 12:34:16 +02:00
Matthias Springer	664ffa46bb	[mlir][tensor][bufferize] Fix deallocation of GenerateOp/FromElementsOp Both ops allocate a buffer. There were cases in which the buffer was not deallocated. Differential Revision: https://reviews.llvm.org/D130469	2022-07-25 12:25:06 +02:00
Alex Zinenko	333ee218ce	[mlir] Transform dialect: separate dependent and generated dialects In the Transform dialect extensions, provide the separate mechanism to declare dependent dialects (the dialects the transform IR depends on) and the generated dialects (the dialects the payload IR may be transformed into). This allows the Transform dialect clients that are only constructing the transform IR to avoid loading the dialects relevant for the payload IR along with the Transform dialect itself, thus decreasing the build/link time. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D130289	2022-07-25 09:59:53 +00:00
Matthias Springer	5f5f71e737	[mlir][tensor][bufferize] Load dependent dialects Load dialects that will be generated by the extension. (Except for BufferizationDialect and MemrefDialect which are loaded already.) Differential Revision: https://reviews.llvm.org/D130463	2022-07-25 11:36:10 +02:00
Rajas Vanjape	d146b51162	[mlir][sparse][nfc] Fix memory leak in sparse sampled matmul integration test https://reviews.llvm.org/D130023 added a memory leak in sparse_sampled_matmul.mlir This diff fixes the memory leak. Testing: Ran integration tests after building with -DLLVM_USE_SANITIZER=Address flag. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D130428	2022-07-25 08:55:12 +00:00
Adrian Kuegel	07628a94e8	[mlir] Apply ClangTidyPerformance findings (NFC)	2022-07-25 10:50:04 +02:00
Sebastian Neubauer	efe1527e28	[CMake] Copy folder without permissions Copying the folder keeps the original permissions by default. This creates problems when the source folder is read-only, e.g. in a packaging environment. Then, the copied folder in the build directory is read-only as well. Later on, other files are copied into that directory (in the build tree), failing when the directory is read-only. Fix that problem by copying the folder without keeping the original permissions. Follow-up to D130254. Differential Revision: https://reviews.llvm.org/D130338	2022-07-25 10:47:04 +02:00
jacquesguan	9e241c70f7	[mlir][Math] Add constant folder for ExpOp. This patch adds constant folder for ExpOp which only supports single and double precision floating-point. Differential Revision: https://reviews.llvm.org/D130318	2022-07-25 16:26:48 +08:00
Marius Hillenbrand	1ef32e7828	[mlir][Arithmetic] Fix printing larger integer attributes in arith.const For arith.constant operations of integer type, the operation generates result names that include the value of the constant (i.e., the IntegerAttr that defines the constant's value). That code currently assumes integer widths of 64 bits or less and hits an assert with wider constants or would create truncated and potentially ambiguous names when built with assertions disabled. To enable printing arith.constant ops for arbitrarily wide integer types, change to use the IntegerAttr's function getValue() when generating result names. Also, add a regression test. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D129930	2022-07-25 08:37:51 +05:30
Kazu Hirata	9e88cbcc40	Use any_of (NFC)	2022-07-24 14:48:11 -07:00
Kazu Hirata	87d627b623	Remove redundant string initialization (NFC) Identified with readability-redundant-string-init.	2022-07-24 12:27:11 -07:00
Kazu Hirata	c730f9a164	Convert for_each to range-based for loops (NFC)	2022-07-23 12:17:27 -07:00
Dmitri Gribenko	aba43035bd	Use llvm::sort instead of std::sort where possible llvm::sort is beneficial even when we use the iterator-based overload, since it can optionally shuffle the elements (to detect non-determinism). However llvm::sort is not usable everywhere, for example, in compiler-rt. Reviewed By: nhaehnle Differential Revision: https://reviews.llvm.org/D130406	2022-07-23 15:19:05 +02:00
Kazu Hirata	380a1b204c	Use callables directly in any_of, count_if, etc (NFC)	2022-07-23 00:28:31 -07:00
Kazu Hirata	33c18fbc19	[mlir] Use value instead of getValue (NFC)	2022-07-22 23:33:46 -07:00
Kazu Hirata	6fa6901bf0	Use has_value instead of hasValue (NFC)	2022-07-22 23:04:38 -07:00
rkayaith	65aedd338c	[mlir][python] Fix issue in diagnostic note initialization Previously the elements of the notes tuple would be invalid objects when accessed from a diagnostic handler, resulting in a segfault when used. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D129943	2022-07-22 16:56:14 -04:00
Christopher Bate	535b507ba5	[mlir][Arithmetic] Add `arith.delinearize_index` operation This change adds a new DelinearizeIndexOp to the `arith` dialect. The operation accepts an `index` type as well as a basis (array of index values) representing how the index should be decomposed into a multi-index. The decomposition obeys a canonical semantic that treats the final basis element as "fastest varying" and the first basis element as "slowest varying". A naive lowering of the operation using a sequence of `arith.divui` and `arith.remui` operations is also given. Differential Revision: https://reviews.llvm.org/D129697	2022-07-22 11:20:41 -06:00
Thomas Raoux	c53ee73b48	[mlir][vector] NFC change to improve doc of vector distribution op Improve doc based on post commit review from https://reviews.llvm.org/D123703 Add more details on the op semantic, explicitly mention what part are parallel and what parts are serial. Differential Revision: https://reviews.llvm.org/D125227	2022-07-22 17:18:31 +00:00
Thomas Raoux	9f6ba4be26	[mlir][vector] Extend transfer_write to read propagation Folding of transfer_write into transfer_read is already supported but this requires the read and write to have the same permuation map. After linalg vectorization it is common to have different ppermuation map for write followed by read even though the cases could be propagated. This canonicalization handle cases where the permuation maps are different but the data read and written match and replace the transfer ops with broadcast and permuation Differential Revision: https://reviews.llvm.org/D130135	2022-07-22 17:11:06 +00:00
Jacques Pienaar	13448db06a	[mlir][tosa] Flip accessors used to prefixed form (NFC) Follow up from dialect flip, just flipping accessors. Both forms still generated.	2022-07-22 09:56:08 -07:00
Jeff Niu	edfc4bb9b9	[mlir][ods] Remove warning in `AttrOrTypeDef` This warning was added because using attribute or type assembly formats with `skipDefaultBuilders` set could cause compilation errors, since the required builder prototype may not necessarily be generated and would need to be checked by hand. This patch removes the warning because a warning that the generated C++ "might" not compile is not particularly useful. Attempting to address the TODO (i.e. detect whether a builder of the correct prototype is provided) would be fragile since it would not be possible to account for implicit conversions, etc. In general, ODS should not be emitting warnings in cases like these. Reviewed By: rriddle, wrengr Differential Revision: https://reviews.llvm.org/D130102	2022-07-22 08:29:23 -07:00
Jacques Pienaar	1b7feac2a6	[mlir][tosa] Split canonicalization and folders out of TosaOps. Scope ops file to ops. Used canonicalization as grouping for canonicalization patterns and folders (also considered OpTransforms but that felt too generic and the former two are used together). Reviewed By: silvas, rsuderman Differential Revision: https://reviews.llvm.org/D130297	2022-07-22 07:20:25 -07:00
Kiran Chandramohan	06dbcf7b2b	[MLIR][OpenMP] Add a constraint to the Threadprivate Op Add a constraint to ensure that the operand and result of the threadprivate operation are the same. Reviewed By: peixin Differential Revision: https://reviews.llvm.org/D128609	2022-07-22 13:12:24 +00:00
Kiran Chandramohan	4ee9f3d59e	[MLIR,OpenMP] : Add Conversion pattern for Critical Op The Conversion pattern enables conversion of Critical Op with block arguments. Fixes https://github.com/llvm/llvm-project/issues/56629 Reviewed By: shraiysh Differential Revision: https://reviews.llvm.org/D130343	2022-07-22 12:57:48 +00:00
Matthias Springer	0eb0dfb20b	[mlir][linalg] Add tile-and-fuse with transform dialect example Differential Revision: https://reviews.llvm.org/D130346	2022-07-22 13:55:18 +02:00
Matthias Springer	32c6e0815a	[mlir][linalg] Add attribute matcher to structured.match transform op This is useful for building small test cases and will be utilized in a subsequent commit that adds a fusion example. Differential Revision: https://reviews.llvm.org/D130344	2022-07-22 13:55:12 +02:00
Matthias Springer	bc882ed21f	[mlir][linalg][transform] Add fuse_into_containing op This op fuses a given payload op into a given container op. Inside the container, all uses of the producer are replaced (fused) with the newly inserted op. If the producer is tileable and accessed via a tensor.extract_slice, the new op computes only the requested slice ("tile and fuse"). Otherwise, the entire tensor value is computed inside the container ("clone and fuse"). Differential Revision: https://reviews.llvm.org/D130244	2022-07-22 13:55:04 +02:00
Ivan Butygin	917e4519bc	[mlir][arith] cmpi: move constant to the right side Convert arith.cmpi to the canonical form with constants on the right side to simplify further optimizations and open more opportunities for CSE. Differential Revision: https://reviews.llvm.org/D129929	2022-07-22 12:39:17 +02:00
Ivan Butygin	f46744bd2a	[mlir][linalg] Fix FoldTensorCastConsumerOp invalid folding CastOp can be in conditionally reachable region, in which case this folding will be invalid. Only conservatively fold ops in same block for now. Fixes https://github.com/llvm/llvm-project/issues/56557 Differential Revision: https://reviews.llvm.org/D130314	2022-07-22 11:39:12 +02:00
Kazu Hirata	70257fab68	Use any_of (NFC)	2022-07-22 01:05:17 -07:00
Uday Bondhugula	fd156f5d4a	[MLIR] Add affine.if canonicalization to compose in affine.apply ops Add affine.if canonicalization to compose affine.apply ops into its set and operands. This eliminates affine.apply ops feeding into affine.if ops. Differential Revision: https://reviews.llvm.org/D130242	2022-07-22 09:56:24 +05:30
Jacques Pienaar	72686d68c1	[mlir] Explicitly mark dialect prefixing Missed previously and needed to flip the default. Most of these just flipped to _Raw to retain existing state/keep this small except for TOSA dialect which got flipped to _Both as no further change was needed..	2022-07-21 12:58:19 -07:00
John Ericson	07b749800c	[cmake] Don't export `LLVM_TOOLS_INSTALL_DIR` anymore First of all, `LLVM_TOOLS_INSTALL_DIR` put there breaks our NixOS builds, because `LLVM_TOOLS_INSTALL_DIR` defined the same as `CMAKE_INSTALL_BINDIR` becomes an absolute path, and then when downstream projects try to install there too this breaks because our builds always install to fresh directories for isolation's sake. Second of all, note that `LLVM_TOOLS_INSTALL_DIR` stands out against the other specially crafted `LLVM_CONFIG_*` variables substituted in `llvm/cmake/modules/LLVMConfig.cmake.in`. @beanz added it in `d0e1c2a550` to fix a dangling reference in `AddLLVM`, but I am suspicious of how this variable doesn't follow the pattern. Those other ones are carefully made to be build-time vs install-time variables depending on which `LLVMConfig.cmake` is being generated, are carefully made relative as appropriate, etc. etc. For my NixOS use-case they are also fine because they are never used as downstream install variables, only for reading not writing. To avoid the problems I face, and restore symmetry, I deleted the exported and arranged to have many `${project}_TOOLS_INSTALL_DIR`s. `AddLLVM` now instead expects each project to define its own, and they do so based on `CMAKE_INSTALL_BINDIR`. `LLVMConfig` still exports `LLVM_TOOLS_BINARY_DIR` which is the location for the tools defined in the usual way, matching the other remaining exported variables. For the `AddLLVM` changes, I tried to copy the existing pattern of internal vs non-internal or for LLVM vs for downstream function/macro names, but it would good to confirm I did that correctly. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D117977	2022-07-21 19:04:00 +00:00
Jacques Pienaar	a1ec0d8bdc	[mlir] Flip dialects to _Prefixed At least two weeks passed since flipped to _Both. Made some additional NFC changes in .td files that were not converted earlier.	2022-07-21 12:03:07 -07:00
George Petterson	a935a0bf50	Adding a new variant of DepthwiseConv2D This is the same as the existing multiplier-1 variant of DepthwiseConv2D, but in PyTorch dim order. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D128575	2022-07-21 14:36:57 -04:00
Javed Absar	56094296f5	Fix type in documentation Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D130274	2022-07-21 18:42:31 +01:00
Jakub Kuderski	52b630da02	[mlir][spirv] Rename spv.GLSL ops to spv.GL. NFC. This is to improve consistency within the SPIR-V dialect and make these ops a bit shorter. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D130280	2022-07-21 13:03:30 -04:00
Christopher Bate	297ba167de	[mlir][linalg] Add tile_size option to `structured.tile_to_foreach_thread_op` This change modifies `structured.tile_to_foreach_thread_op` so that it accepts either `tile_sizes` or `num_threads` parameters. If `tile_sizes` are specified, then the number of threads required is derived the tile sizes rather than the other way around. In both cases, more aggressive folding of loop parameters is enabled during the transformation, allowing for the potential elimination of `affine.min` and `affine.max` operations in the static shape case when calculating the final adjusted tile size. Differential Revision: https://reviews.llvm.org/D130139	2022-07-21 10:32:01 -06:00
Alex Zinenko	ee168fb90e	[mlir][python] Fix issues with block argument slices The type extraction helper function for block argument and op result list objects was ignoring the slice entirely. So was the slice addition. Both are caused by a misleading naming convention to implement slices via CRTP. Make the convention more explicit and hide the helper functions so users have harder time calling them directly. Closes #56540. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D130271	2022-07-21 14:41:12 +00:00
Nicolas Vasilache	1f77f01c65	[mlir][Linalg] Add a Transform dialect NavigationOp op to match a list of ops or an interface. This operation is a NavigationOp that simplifies the writing of transform IR. Since there is no way of refering to an interface by name, the current implementation uses an EnumAttr and depends on the interfaces it supports. In the future, it would be worthwhile to remove this dependence and generalize. Differential Revision: https://reviews.llvm.org/D130267	2022-07-21 07:11:42 -07:00
Ivan Butygin	d4217e6cc8	[mlir][memref] Missing type conversion in memref.reshape llvm lowering Shape can be memref of index type, so memref::LoadOp result need to be converted into llvm type. Differential Revision: https://reviews.llvm.org/D129965	2022-07-21 11:15:35 +02:00
lorenzo chelini	2ed7c3fd84	[MLIR][SCF] Enable better bufferization for `TileConsumerAndFuseProducersUsingSCFForOp` Replace iterators of the outermost loop with region arguments of the innermost one. The changes avoid later `bufferization` passes to insert allocation within the body of the innermost loop. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D130083	2022-07-21 10:14:26 +02:00
lorenzo chelini	7f1c03171d	Revert "[RFC][MLIR][SCF] Enable better bufferization for `TileConsumerAndFuseProducersUsingSCFForOp`" This reverts commit `9e65850305`.	2022-07-21 09:40:30 +02:00
lorenzo chelini	9e65850305	[RFC][MLIR][SCF] Enable better bufferization for `TileConsumerAndFuseProducersUsingSCFForOp` Replace iterators of the outermost loop with region arguments of the innermost one. The changes avoid later `bufferization` passes to insert allocation within the body of the innermost loop. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D130083	2022-07-21 08:56:50 +02:00
Mahesh Ravishankar	485190df95	[mlir][Linalg] Deprecate `tileAndFuseLinalgOps` method and associated patterns. The `tileAndFuseLinalgOps` is a legacy approach for tiling + fusion of Linalg operations. Since it was also intended to work on operations with buffer operands, this method had fairly complex logic to make sure tile and fuse was correct even with side-effecting linalg ops. While complex, it still wasnt robust enough. This patch deprecates this method and thereby deprecating the tiling + fusion method for ops with buffer semantics. Note that the core transformation to do fusion of a producer with a tiled consumer still exists. The deprecation here only removes methods that auto-magically tried to tile and fuse correctly in presence of side-effects. The `tileAndFuseLinalgOps` also works with operations with tensor semantics. There are at least two other ways the same functionality exists. 1) The `tileConsumerAndFuseProducers` method. This does a similar transformation, but using a slightly different logic to automatically figure out the legal tile + fuse code. Note that this is also to be deprecated soon. 2) The prefered way uses the `TilingInterface` for tile + fuse, and relies on the caller to set the tiling options correctly to ensure that the generated code is correct. As proof that (2) is equivalent to the functionality provided by `tileAndFuseLinalgOps`, relevant tests have been moved to use the interface, where the test driver sets the tile sizes appropriately to generate the expected code. Differential Revision: https://reviews.llvm.org/D129901	2022-07-21 05:05:06 +00:00
jacquesguan	9c22853ec4	[mlir][Math] Add constant folder for LogOp. This patch adds constant folder for LogOp which only supports single and double precision floating-point. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D130148	2022-07-21 10:58:32 +08:00
River Riddle	ed344c8877	[mlir:LSP] Add a quickfix code action for inserting expected-* diagnostic checks This allows for automatically inserting expected checks for parser and verifier diagnostics, which simplifies the workflow when building new dialect constructs or extending existing ones. Differential Revision: https://reviews.llvm.org/D130152	2022-07-20 15:43:59 -07:00
Tanyo Kwok	5b0d6bf210	[MLIR] Add function to create Float16 array attribute This patch adds a new function mlirDenseElementsAttrFloat16Get(), which accepts the shaped type, the number of Float16 values, and a pointer to an array of Float16 values, each of which is a uint16_t value. This commit is repeating https://reviews.llvm.org/D123981 + #761 but for Float16 Differential Revision: https://reviews.llvm.org/D130069	2022-07-20 21:58:15 +00:00
Jakub Kuderski	3930cc6847	[mlir][spirv] Rename spv.ocl to spv.cl. NFC. This is to improve the consistency within the SPIR-V dialect and to make op names a bit shorter. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D130194	2022-07-20 17:46:45 -04:00
River Riddle	b64a286332	[mlir:LSP] Add a doc blurb for code completion This also gets used for the vscode extension documentation.	2022-07-20 12:02:13 -07:00
River Riddle	362c4e68c9	[mlir:LSP] Remove several more annoying completion commit characters These characters are annoying in that they accept completions that weren't intended, causing user frustration.	2022-07-20 12:02:13 -07:00
Kazu Hirata	360c1111e3	Use llvm::is_contained (NFC)	2022-07-20 09:09:19 -07:00
Ingo Müller	000e426449	[mlir][bufferize][doc] Fix documentation of return type of `to_memref`. This is my very first contact with this dialect, so I am not very confident with this commit, but it seems like the op returns a memref, not a tensor, so that's what comment about the result type should say. [mlir][bufferization][doc] Improve typesetting of inline code. Fix Typo. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D130159	2022-07-20 13:21:49 +00:00
River Riddle	ee7ccbeaa7	[mlir:LSP][NFC] Make the LSPServer class private There is no benefit to making it public, and the code is much cleaner and easier to follow when inlined. This also matches the pattern within the PDLL lsp server.	2022-07-20 01:43:30 -07:00
Anush Elangovan	f9676d2d22	[mlir] Fix macOS tests Fix shared library names on macOS for execution_engine.py test. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D130143	2022-07-20 10:19:05 +02:00
jacquesguan	c3d856bf58	[mlir][Math] Add constant folder for Log1pOp. This patch adds constant folder for Log1pOp which only supports single and double precision floating-point. Differential Revision: https://reviews.llvm.org/D129979	2022-07-20 14:59:35 +08:00
Mahesh Ravishankar	b8a1f00d41	[mlir][TilingInterface] Add support for interchange to tiling patterns that use the `TilingInterface`. Differential Revision: https://reviews.llvm.org/D129956	2022-07-20 05:24:17 +00:00
Jacques Pienaar	c35807f271	[mlir] Address compiler warning	2022-07-19 20:03:03 -07:00
River Riddle	7d815ab9b4	[mlir][NFC] Split out various tests from IR/invalid.mlir This file contains a huge number of tests that should really be in different dialect/files. It is monolothic because of the legacy surrounding the old standard dialect, affine operations, etc. Splitting this up makes the tests much more maintainable given that they are now group with other similar tests.	2022-07-19 16:34:35 -07:00
Slava Zakharin	04644a9e55	[mlir] Fixed ordering of pass statistics. The change makes sure the plain C string statistics names are properly ordered. Differential Revision: https://reviews.llvm.org/D130122	2022-07-19 15:52:02 -07:00
Jacques Pienaar	d2c0572b2e	[mlir] Flip LinAlg dialect to _Both This one required more changes than ideal due to overlapping generated name with different return types. Changed getIndexingMaps to getIndexingMapsArray to move it out of the way/highlight that it returns (more expensively) a SmallVector and uses the prefixed name for the Attribute. Differential Revision: https://reviews.llvm.org/D129919	2022-07-19 14:42:58 -07:00
Benoit Jacob	f0c3fd185e	Don't combine if there would remain no true reduction dim. Differential Revision: https://reviews.llvm.org/D130109	2022-07-19 19:58:53 +00:00
Rajas Vanjape	9451440f82	[mlir][docs] Fix pass manager document The code example for pass manager incorrectly uses nestedFunctionPM instead of nestedAnyPm for adding CSE and Canonicalize Passes. This diff fixes it by changing it to nestedAnyPm. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D130110	2022-07-19 19:52:57 +00:00
Jeff Niu	2d05418b17	Revert "[mlir][ods] (NFC) Remove warning in `AttrOrTypeDef`" This reverts commit `e45ef5ebf4`.	2022-07-19 10:25:24 -07:00
Jeff Niu	e45ef5ebf4	[mlir][ods] (NFC) Remove warning in `AttrOrTypeDef` This warning was added because using attribute or type assembly formats with `skipDefaultBuilders` set could cause compilation errors, since the required builder prototype may not necessarily be generated and would need to be checked by hand. This patch removes the warning because a warning that the generated C++ "might" not compile is not particularly useful. Attempting to address the TODO (i.e. detect whether a builder of the correct prototype is provided) would be fragile since it would not be possible to account for implicit conversions, etc. In general, ODS should not be emitting warnings in cases like these.	2022-07-19 09:17:19 -07:00
bhatuzdaname	7f76471ee8	[mlir][tblgen] Add support for extraClassDefinition in AttrDef For AttrDef declarations, place specified code in extraClassDefinition into the generated *.cpp.inc file. Reviewed By: Mogball, rriddle Differential Revision: https://reviews.llvm.org/D129574	2022-07-19 09:13:32 -07:00
lipracer	71da3914c1	[mlir][NFC] Use proper c++ namespaces in .td files td files: mlir::ArrayRef => llvm::ArrayRef mlir::Optional=>llvm::Optional mlir::SmallVector => llvm::SmallVector Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D128537	2022-07-19 08:51:21 -07:00
Nicolas Vasilache	18b92c66fe	[mlir][Linalg] Add a TileToForeachThread transform. This revision adds a new transformation to tile a TilingInterface `op` to a tiled `scf.foreach_thread`, applying tiling by `num_threads`. If non-empty, the `threadDimMapping` is added as an attribute to the resulting `scf.foreach_thread`. 0-tile sizes (i.e. tile by the full size of the data) are used to encode that a dimension is not tiled. Differential Revision: https://reviews.llvm.org/D129577	2022-07-19 04:56:11 -07:00
Markus Böck	61394636f0	[mlir] Ignore effects on allocated results when checking whether the op is trivially dead. In the current state, this is only special cased for Allocation effects, but any effects on results allocated by the operation may be ignored when checking whether the op may be removed, as none of them are possible to be observed if the result is unused. A use case for this is for IRs for languages which always initialize on allocation. To correctly model such operations, a Write as well as an Allocation effect should be placed on the result. This would prevent the Op from being deleted if unused however. This patch fixes that issue. Differential Revision: https://reviews.llvm.org/D129854	2022-07-19 10:58:25 +02:00
Matthias Springer	106d695287	[mlir][sparse][NFC] Update remaining test cases No more to_memref, memref.alloc or memref.dealloc when possible. Differential Revision: https://reviews.llvm.org/D130023	2022-07-19 09:21:10 +02:00
Matthias Springer	27a431f5e9	[mlir][bufferization][NFC] Move sparse_tensor.release to bufferization dialect This op used to belong to the sparse dialect, but there are use cases for dense bufferization as well. (E.g., when a tensor alloc is returned from a function and should be deallocated at the call site.) This change moves the op to the bufferization dialect, which now has an `alloc_tensor` and a `dealloc_tensor` op. Differential Revision: https://reviews.llvm.org/D129985	2022-07-19 09:18:19 +02:00
Jacques Pienaar	c8598fa22f	[mlir] Add refineReturnTypes to InferTypeOpInterface refineReturnType method shares the same parameters as inferReturnTypes but gets passed in the return types of the op if known that can be used during refinement passes or for more op specific error reporting. Currently the error reporting on failure is generic and doesn't allow for specializing the returned result based on failure, with this change what would previously have been a separate trait with specialized verification can just be handled as part of inferrence rather than duplicated. refineReturnTypes behaves like inferReturnTypes if no result types are fed in, while the current verification is recast as the default implementation for refineReturnTypes with it calling inferReturnTypes (and so the default type verification now goes through refine and allows for more op specific inference mismatch errors). Differential Revision: https://reviews.llvm.org/D129955	2022-07-18 22:18:52 -07:00
Mehdi Amini	d04c2b2fd9	Revert "[MLIR] Generic 'malloc', 'aligned_alloc' and 'free' functions" This reverts commit `3e21fb616d`. A lot of integration tests are failing on the bot.	2022-07-18 18:07:36 +00:00
Arjun P	477c2c6f4a	[MLIR][Presburger] fix warning under g++ (NFC)	2022-07-18 18:02:20 +01:00
Arjun P	ffb8b7b2a0	[MLIR][Presburger] Provide functions to convert between arrays of MPInt and int64_t Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D129509	2022-07-18 17:34:50 +01:00
Arjun P	ca6a5afbb2	[MLIR][Presburger] SlowMPInt: fix bug in ceilDiv, floorDiv where widths weren't harmonized This also adds tests for abs, ceilDiv, floorDiv, mod, gcd and lcm. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D129816	2022-07-18 17:34:01 +01:00
Aart Bik	28ebb0b61d	[mlir][sparse] migrate sparse rewriting to sparse transformations pass The rules in the linalg file were very specific to sparse tensors so will find a better home under sparse tensor dialect than linalg dialect. Also moved some rewriting from sparsification into this new "pre-rewriting" file. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D129910	2022-07-18 09:29:22 -07:00
Alexander Batashev	e59cdcd070	[mlir][spirv] Allow unnamed entry point functions SPIR-V specification does not require a function to have a name if it is an entry point. Adjust deserializer to allow those kinds of SPIR-V binaries. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D120181	2022-07-18 12:03:38 -04:00
Michele Scuttari	3e21fb616d	[MLIR] Generic 'malloc', 'aligned_alloc' and 'free' functions When converted to the LLVM dialect, the memref.alloc and memref.free operations were generating calls to hardcoded 'malloc' and 'free' functions. This didn't leave any freedom to users to provide their custom implementation. Those operations now convert into calls to '_mlir_alloc' and '_mlir_free' functions, which have also been implemented into the runtime support library as wrappers to 'malloc' and 'free'. The same has been done for the 'aligned_alloc' function. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D128791	2022-07-18 17:58:58 +02:00
jacquesguan	a5cae20bdb	[mlir][Math] Add constant folder for Log10Op. This patch adds constant folder for Log10Op which only support single and double precision floating-point. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D129740	2022-07-18 10:19:25 +08:00
Kazu Hirata	10bcfeebfa	[mlir] Remove unused using (NFC) Identified with misc-unused-using-decls.	2022-07-17 18:08:48 -07:00
Kazu Hirata	8dfdb80f72	Ensure newlines at the end of files (NFC)	2022-07-17 15:37:45 -07:00
Stella Laurenzo	beebffa9ab	[NFC] Remove obsolete all_passes_registration from integration tests. After https://reviews.llvm.org/D128593 this is not needed (and not available). Was missed in original landing because integration tests do not run on pre-merge.	2022-07-16 18:18:43 -07:00
Stella Laurenzo	5e83a5b475	[mlir] Overhaul C/Python registration APIs to properly scope registration/loading activities. Since the very first commits, the Python and C MLIR APIs have had mis-placed registration/load functionality for dialects, extensions, etc. This was done pragmatically in order to get bootstrapped and then just grew in. Downstreams largely bypass and do their own thing by providing various APIs to register things they need. Meanwhile, the C++ APIs have stabilized around this and it would make sense to follow suit. The thing we have observed in canonical usage by downstreams is that each downstream tends to have native entry points that configure its installation to its preferences with one-stop APIs. This patch leans in to this approach with `RegisterEverything.h` and `mlir._mlir_libs._mlirRegisterEverything` being the one-stop entry points for the "upstream packages". The `_mlir_libs.__init__.py` now allows customization of the environment and Context by adding "initialization modules" to the `_mlir_libs` package. If present, `_mlirRegisterEverything` is treated as such a module. Others can be added by downstreams by adding a `_site_initialize_{i}.py` module, where '{i}' is a number starting with zero. The number will be incremented and corresponding module loaded until one is not found. Initialization modules can: * Perform load time customization to the global environment (i.e. registering passes, hooks, etc). * Define a `register_dialects(registry: DialectRegistry)` function that can extend the `DialectRegistry` that will be used to bootstrap the `Context`. * Define a `context_init_hook(context: Context)` function that will be added to a list of callbacks which will be invoked after dialect registration during `Context` initialization. Note that the `MLIRPythonExtension.RegisterEverything` is not included by default when building a downstream (its corresponding behavior was prior). For downstreams which need the default MLIR initialization to take place, they must add this back in to their Python CMake build just like they add their own components (i.e. to `add_mlir_python_common_capi_library` and `add_mlir_python_modules`). It is perfectly valid to not do this, in which case, only the things explicitly depended on and initialized by downstreams will be built/packaged. If the downstream has not been set up for this, it is recommended to simply add this back for the time being and pay the build time/package size cost. CMake changes: * `MLIRCAPIRegistration` -> `MLIRCAPIRegisterEverything` (renamed to signify what it does and force an evaluation: a number of places were incidentally linking this very expensive target) * `MLIRPythonSoure.Passes` removed (without replacement: just drop) * `MLIRPythonExtension.AllPassesRegistration` removed (without replacement: just drop) * `MLIRPythonExtension.Conversions` removed (without replacement: just drop) * `MLIRPythonExtension.Transforms` removed (without replacement: just drop) Header changes: * `mlir-c/Registration.h` is deleted. Dialect registration functionality is now in `IR.h`. Registration of upstream features are in `mlir-c/RegisterEverything.h`. When updating MLIR and a couple of downstreams, I found that proper usage was commingled so required making a choice vs just blind S&R. Python APIs removed: * mlir.transforms and mlir.conversions (previously only had an __init__.py which indirectly triggered `mlirRegisterTransformsPasses()` and `mlirRegisterConversionPasses()` respectively). Downstream impact: Remove these imports if present (they now happen as part of default initialization). * mlir._mlir_libs._all_passes_registration, mlir._mlir_libs._mlirTransforms, mlir._mlir_libs._mlirConversions. Downstream impact: None expected (these were internally used). C-APIs changed: * mlirRegisterAllDialects(MlirContext) now takes an MlirDialectRegistry instead. It also used to trigger loading of all dialects, which was already marked with a TODO to remove -- it no longer does, and for direct use, dialects must be explicitly loaded. Downstream impact: Direct C-API users must ensure that needed dialects are loaded or call `mlirContextLoadAllAvailableDialects(MlirContext)` to emulate the prior behavior. Also see the `ir.c` test case (e.g. ` mlirContextGetOrLoadDialect(ctx, mlirStringRefCreateFromCString("func"));`). * mlirDialectHandle* APIs were moved from Registration.h (which now is restricted to just global/upstream registration) to IR.h, arguably where it should have been. Downstream impact: include correct header (likely already doing so). C-APIs added: * mlirContextLoadAllAvailableDialects(MlirContext): Corresponds to C++ API with the same purpose. Python APIs added: * mlir.ir.DialectRegistry: Mapping for an MlirDialectRegistry. * mlir.ir.Context.append_dialect_registry(MlirDialectRegistry) * mlir.ir.Context.load_all_available_dialects() * mlir._mlir_libs._mlirAllRegistration: New native extension that exposes a `register_dialects(MlirDialectRegistry)` entry point and performs all upstream pass/conversion/transforms registration on init. In this first step, we eagerly load this as part of the __init__.py and use it to monkey patch the Context to emulate prior behavior. * Type caster and capsule support for MlirDialectRegistry This should make it possible to build downstream Python dialects that only depend on a subset of MLIR. See: https://github.com/llvm/llvm-project/issues/56037 Here is an example PR, minimally adapting IREE to these changes: https://github.com/iree-org/iree/pull/9638/files In this situation, IREE is opting to not link everything, since it is already configuring the Context to its liking. For projects that would just like to not think about it and pull in everything, add `MLIRPythonExtension.RegisterEverything` to the list of Python sources getting built, and the old behavior will continue. Reviewed By: mehdi_amini, ftynse Differential Revision: https://reviews.llvm.org/D128593	2022-07-16 17:27:50 -07:00
Markus Böck	3dccac09a6	[mlir][NFC] Fully qualify typenames in SymbolInterfaces.td	2022-07-16 14:39:00 +02:00
Markus Böck	884d310d87	[mlir][LLVM] Add OpBuilder for opaque pointer GEPs without struct indices An OpBuilder already exists for GEPs that does not have any struct indices for existing typed pointers, but no such builder exists for GEPs utilizing opaque pointers that has an explicit `basePtrType`. Differential Revision: https://reviews.llvm.org/D129376	2022-07-16 10:55:49 +02:00
Matthias Springer	afb1def876	[mli][sparse] Remove manual dense dealloc from test case Differential Revision: https://reviews.llvm.org/D129834	2022-07-16 09:28:27 +02:00
Uday Bondhugula	9819cbda0c	[MLIR] Clean up checks for alloc-like ops in analysis Clean up checks for alloc-like ops in analysis. Use the analysis utility to properly check for the desired kind of effects. The previous locality utility worked for all practical purposes but wasn't sound and was locally duplicate code. Instead, use mlir::hasSingleEffect. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D129439	2022-07-16 12:54:47 +05:30
Uday Bondhugula	9f7d80145a	[MLIR] NFC. Clean up logic of hasSingleEffect Clean up conditional logic of hasSingleEffect. NFC. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D129438	2022-07-16 12:39:32 +05:30
Kazu Hirata	5605a1eedd	Use drop_begin (NFC)	2022-07-15 23:58:11 -07:00
Stella Laurenzo	1d6a90418e	Example: ``` // -----// IR Dump Before LowerLinalgMicrokernels (iree-vmvx-lower-linalg-microkernels) //----- // ``` I've been meaning to suggest this for a long time, and I think the only reason we don't have it is because we didn't used to have the `getArgument()` handy when printing these comments. When debugging or putting a pipeline together based on such dumps, I often find myself grepping for the argument name of the pass (which is often related but not universally).	2022-07-15 23:27:14 -07:00
Kazu Hirata	246bf08db3	Use drop_begin (NFC)	2022-07-15 23:24:59 -07:00
bzcheeseman	f382dfc06b	[MLIR][Liveness] Add `currentlyLiveValues`, a way to get a set of values that are live as of a given operation. This change allows the user of LivenessBlockInfo to specify an op within the block and get a set of all values that are live as of that op. Semantically it relies on having a dominance-based region that has ordered operations. For DFG regions, computing liveness statically this way doesn't really make sense, it likely needs to be done at runtime. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D129447	2022-07-15 22:10:53 -07:00
Kazu Hirata	1a5d007659	Use has_value/value instead of hasValue/getValue (NFC)	2022-07-15 21:48:17 -07:00
Kazu Hirata	3b0dce5b8b	Use value_or (NFC)	2022-07-15 19:46:29 -07:00
Ashay Rane	179588ea39	[mlir] remove unnecessary statements in test A previous commit (`f2b94bd`) added some unnecessary statements that dereferenced operations only to get the operations back. This patch removes the unnecessary statements. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D129913	2022-07-15 18:57:07 -07:00
Jeff Niu	7fe2294e47	[mlir][ods] Allow specifying return types of builders This patch allows custom attribute and type builders to return something other than the C++ type of the attribute or type. This is useful for attributes or types that may perform extra work during construction (e.g. canonicalization) that could result in a different kind of attribute or type being returned. Reviewed By: rriddle, lattner Differential Revision: https://reviews.llvm.org/D129792	2022-07-15 18:00:35 -07:00
Mahesh Ravishankar	3139cc766c	[mlir][Linalg] Add a pattern to decompose `linalg.generic` ops. This patch adds a pattern to decompose a `linalg.generic` operations that - has only parallel iterator types - has more than 2 statements (including the yield) into multiple `linalg.generic` operation such that each operation has a single statement and a yield. The pattern added here just splits the matching `linalg.generic` into two `linalg.generic`s, one containing the first statement, and the other containing the remaining. The same pattern can be applied repeatedly on the second op to ultimately fully decompose the generic op. Differential Revision: https://reviews.llvm.org/D129704	2022-07-15 23:01:18 +00:00
Ashay Rane	f2b94bd7ea	[mlir] check whether region and block visitors are interrupted The visitor functions for `Region` and `Block` types did not always check the value returned by recursive calls. This caused the top-level visitor invocation to return `WalkResult::advance()` even if one or more recursive invocations returned `WalkResult::interrupt()`. This patch fixes the problem by check if any recursive call is interrupted, and if so, return `WalkResult::interrupt()`. Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D129718	2022-07-15 14:50:42 -07:00
Jim Kitchen	2b8a4d9ce1	[mlir][sparse] Introduce new reduce op A new sparse_tensor operation allows for custom reduction code to be injected during linalg.generic lowering for sparse tensors. An identity value is provided to indicate the starting value of the reduction. A single block region is required to contain the custom reduce computation. Reviewed by: aartbik Differential Revision: https://reviews.llvm.org/D128004	2022-07-15 15:30:41 -05:00
Mahesh Ravishankar	331afccad2	[mlir][Linalg] Extend Generic op dedup canonicalization to drop redundant result values.	2022-07-15 19:47:46 +00:00
Mahesh Ravishankar	b62060a8e3	[mlir][Linalg] NFC: Refactor canonicalization for deduping generic op operands. This is a NFC change to make it easier to update this canonicalization for more use cases. The refactoring makes things easier to understand/adapt. Differential Revision: https://reviews.llvm.org/D129829	2022-07-15 19:47:45 +00:00
Rajas Vanjape	1976ad70c5	[mlir][sparse] Add 3-dimensional sparse tensor multiplication integration test This diff adds an integration test which does element wise multiplication for two sparse 3-d tensors of size 3x3x5 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D129638	2022-07-15 12:12:51 -07:00
Lei Zhang	56796ae1a8	[mlir][linalg] Fix tensor tiling together with interchange In `linalg::tileConsumerAndFuseProducers`, there are two levels of tiling and fusion; we partition the tile sizes and only use one half for each of them. The partition is using the first non-parallel dimension after interchange as the boundary. However, concrete tiling happens together with loop interchange, so we still need to provide the partial tile sizes before the interchange. Otherwise, there will be inconsistency, which is what this patch is to fix. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D129804	2022-07-15 13:54:47 -04:00
Laszlo Kindrat	ac14d5a1db	[mlir] Enable perfect forwarding in RewritePatternSet::add This patch modifies the implementation of `RewritePatternSet::add` to perfectly forward its arguments to pattern constructors. Without this, code like the following compiles but, due to the limited lifetime of the temporary TypeConverter, can produce unexpected behavior: ``` RewritePatternSet patterns(context); patterns.add<SomeOpConversion, OtherOpConversion>(TypeConverter(), context); if (failed(applyPartialConversion(getOperation(), target, std::move(patterns)))) return signalPassFailure(); ``` The patch also changes the linalg fusion pattern implementation to correctly fill the test pattern set given the new behavior. Author: Laszlo Kindrat <laszlokindrat@gmail.com> Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D129601	2022-07-15 19:44:18 +02:00
lorenzo chelini	12929c241a	Revert "[mlir] Enable perfect forwarding in RewritePatternSet::add" Did not preserve author information. This reverts commit `b0afda78f0`.	2022-07-15 19:16:09 +02:00
lorenzo chelini	b0afda78f0	[mlir] Enable perfect forwarding in RewritePatternSet::add This patch modifies the implementation of `RewritePatternSet::add` to perfectly forward its arguments to pattern constructors. Without this, code like the following compiles but, due to the limited lifetime of the temporary TypeConverter, can produce unexpected behavior: ``` RewritePatternSet patterns(context); patterns.add<SomeOpConversion, OtherOpConversion>(TypeConverter(), context); if (failed(applyPartialConversion(getOperation(), target, std::move(patterns)))) return signalPassFailure(); ``` The patch also changes the linalg fusion pattern implementation to correctly fill the test pattern set given the new behavior. Author: Laszlo Kindrat <laszlokindrat@gmail.com> Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D129601	2022-07-15 19:08:23 +02:00
Arjun P	821fe9efa4	[MLIR][Presburger] reintroduce int64_t versions of floorDiv, ceilDiv in mlir::presburger namespace This is useful because MPInt.h defines identically-named functions that operate on MPInts, which would otherwie become the only candidates of overload resolution when calling e.g. ceilDiv from the mlir::presburger namespace (iff MPInt.h is included). So to access the 64-bit overloads, an explict call to mlir::ceilDiv would be required. This patch adds `using` declarations allowing overload resolution to transparently call the right function. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D129820	2022-07-15 17:29:31 +01:00
Arjun P	9390b8d34c	[MLIR][Presburger] MPInt: add missing fastpath in ceilDiv This is not a bug in functionality, just a missed optimization. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D129815	2022-07-15 17:26:37 +01:00
Slava Zakharin	451e5e2b28	[mlir][math] Added math::tan operation. Differential Revision: https://reviews.llvm.org/D129539	2022-07-15 09:17:06 -07:00
Ingo Müller	b9f5b02fd0	[mlir][mbr] Improve diagnostics on error with `raise from`. This commit extends the `raise` statements on errors in user-provided code with `from e` clauses that attach the original exception to the one being raised. This allows to debug the root cause of the error more easily. Reviewed By: SaurabhJha Differential Revision: https://reviews.llvm.org/D129762	2022-07-15 15:15:58 +00:00
David Kreitzer	c720b6fddd	Clarify the behavior of the llvm.vector.insert/extract intrinsics when the index is out of range. Both intrinsics return a poison value. Consequently, mark the intrinsics speculatable. Differential Revision: https://reviews.llvm.org/D129656	2022-07-15 07:56:44 -07:00
Arjun P	86d73c11cf	[MLIR][Presburger] SlowMPInt: gcd: assert that operands are non-negative	2022-07-15 15:45:53 +01:00
Arjun P	ec5f18e38a	[MLIR][Presburger] MPInt: gcd: assert that operands are non-negative	2022-07-15 14:13:41 +01:00
Fangrui Song	3c849d0aef	Modernize Optional::{getValueOr,hasValue}	2022-07-15 01:20:39 -07:00
Ingo Müller	5da5483ffb	[mlir][benchmark] Fix import in sparse benchmark. The benchmark currently fails to run because it cannot find the `func` symbol when using a `FuncOp`. I suppose that the breakage was introduced by the extraction of the func dialect from the builtin dialect that wasn't reflected in the benchmark yet. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D129738	2022-07-15 07:15:51 +00:00
Arjun P	d6fbe1394f	[MLIR][Presburger] MPInt: use /// for top-level comment, not // (NFC)	2022-07-15 00:30:02 +01:00
Manish Gupta	713d3de5fb	[mlir][NVGPU] Verifier for nvgpu.ldmatrix * Adds verifiers for `nvgpu.ldmatrix` op * Adds tests to `mlir/test/Dialect/NVGPU/invalid.mlir` Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D129669	2022-07-14 22:46:38 +00:00
Aart Bik	f30822d29c	[mlir][sparse][bufferization] fix a few memory leaks Fixed some new memory leaks after migration to new bufferization. One is expected, the other may need some more careful analysis. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D129805	2022-07-14 15:05:12 -07:00
Aart Bik	67564841cb	[mlir][sparse][bufferization] initialize reduction variable After recent bufferization improvement, this test started failing due to missed zero initialization. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D129800	2022-07-14 13:58:23 -07:00
Krzysztof Drewniak	bc61cc9a2d	[mlir][AMDGPU] Add lds_barrier op The lds_barrier op allows workgroups to wait at a barrier for operations to/from their local data store (LDS) to complete without incurring the performance penalties of a full memory fence. Reviewed By: nirvedhmeshram Differential Revision: https://reviews.llvm.org/D129522	2022-07-14 20:45:26 +00:00
Jeff Niu	b7f93c2809	[mlir] (NFC) run clang-format on all files	2022-07-14 13:32:13 -07:00
Thomas Raoux	f48ce52c4c	[mlir][vector] Pattern to clean up vector.extract during distribution This prevents blocking propagation when converting between scalar and vector<1> Differential Revision: https://reviews.llvm.org/D129782	2022-07-14 17:07:32 +00:00
Thomas Raoux	ffa7384f10	[mlir][vector] Support distribution of vector.reduce with accumulator Right now the pattern was ignoring the optional accumulator. Differential Revision: https://reviews.llvm.org/D129719	2022-07-14 14:28:38 +00:00
Matthias Springer	74902cc96f	[mlir][linalg][NFC] Cleanup: Drop linalg.inplaceable attribute bufferization.writable is used in most cases instead. All remaining test cases are updated. Some code that is no longer needed is deleted. Differential Revision: https://reviews.llvm.org/D129739	2022-07-14 15:50:03 +02:00
Nicolas Vasilache	5a0011360c	[mlir][Linalg] Retire LinalgPromotion pattern This revision removes the LinalgPromotion pattern and adds a `transform.structured.promotion` op. Since the LinalgPromotion transform allows the injection of arbitrary C++ via lambdas, the current transform op does not handle it. It is left for future work to decide what the right transform op control is for those cases. Note the underlying implementation remains unchanged and the mechanism is still controllable by lambdas from the API. During this refactoring it was also determined that the `dynamicBuffers` option does not actually connect to a change of behavior in the algorithm. This also exhibits that the related test is wrong (and dangerous). Both the option and the test are therefore removed. Lastly, a test that connects patterns using the filter-based mechanism is removed: all the independent pieces are already tested separately. Context: https://discourse.llvm.org/t/psa-retire-linalg-filter-based-patterns/63785 Differential Revision: https://reviews.llvm.org/D129649	2022-07-14 05:29:27 -07:00
Ingo Müller	d4a7ca81fe	[mlir][doc] Fix usage of PatternApplicator. The constructor of PatternApplicator doesn't have a constructor that accepts only a `RewritePatternSet` as currently used in the example code in PatternRewriter.md. Instead, one has to turn it into a `FrozenRewritePatternSet`. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D125236	2022-07-14 09:08:26 +00:00
Matthias Springer	c66303c287	[mlir][sparse] Switch to One-Shot Bufferize This change removes the partial bufferization passes from the sparse compilation pipeline and replaces them with One-Shot Bufferize. One-Shot Analysis (and TensorCopyInsertion) is used to resolve all out-of-place bufferizations, dense and sparse. Dense ops are then bufferized with BufferizableOpInterface. Sparse ops are still bufferized in the Sparsification pass. Details: * Dense allocations are automatically deallocated, unless they are yielded from a block. (In that case the alloc would leak.) All test cases are modified accordingly. E.g., some funcs now have an "out" tensor argument that is returned from the function. (That way, the allocation happens at the call site.) * Sparse allocations are not automatically deallocated. They must be "released" manually. (No change, this will be addressed in a future change.) * Sparse tensor copies are not supported yet. (Future change) * Sparsification no longer has to consider inplacability. If necessary, allocations and/or copies are inserted during TensorCopyInsertion. All tensors are inplaceable by the time Sparsification is running. Instead of marking a tensor as "not inplaceable", it can be marked as "not writable", which will trigger an allocation and/or copy during TensorCopyInsertion. Differential Revision: https://reviews.llvm.org/D129356	2022-07-14 09:52:48 +02:00
Kazu Hirata	c27d815249	[mlir] Use value instead of getValue (NFC)	2022-07-14 00:19:59 -07:00
Manish Gupta	f7d42d5149	[mlir][NVGPU] Verifiers for nvgpu.mma.sync Op - Adds verification for `nvgpu.mma.sync` op - Adds tests to `mlir/test/Dialect/NVGPU/invalid.mlir` - `nvgpu.mma.sync` verifier caught a bug and triggered a failure in m16n8k4_tf32_f32 variant in `mlir/test/Conversion/NVGPUToNVVM/nvgpu-to-nvvm.mlir` - The output shape of vector holding thread-level accumulators was inconsistent and fixed in this change Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D129400	2022-07-13 18:57:07 +00:00
Benoit Jacob	6870a50f43	lowerParallel is also called on unit-size, one-sided reduction dims See: https://gist.github.com/bjacob/d8be8ec7e70ed0be4b3a5794ced2a7e8 Differential Revision: https://reviews.llvm.org/D129096	2022-07-13 16:21:12 +00:00
Jacques Pienaar	69b6454fff	[mlir] Plumb through default attribute populate for extensible dialect.	2022-07-13 09:05:04 -07:00
Nicolas Vasilache	cd6e02eebc	[mlir][Linalg] Retire TestLinalgCodegenStrategy pass. This pass tests patterns that are already tested elsewhere by applying them in a semi-targeted fashion using anchor function and op names. From now on, targeted tests should use the transform dialect interpreter. Differential Revision: https://reviews.llvm.org/D129627	2022-07-13 04:20:42 -07:00
Kazu Hirata	491d27013d	[mlir] Use has_value instead of hasValue (NFC)	2022-07-13 00:57:02 -07:00
Adrian Kuegel	aabfaf901b	[mlir] Allow empty lists for DenseArrayAttr. Differential Revision: https://reviews.llvm.org/D129552	2022-07-13 09:16:09 +02:00
Anlun Xu	033b9f21b0	[mlir][sparse]Replace redundant indices checks in sparse_tensor.conversion Replace some redundant indices checks with the correct checks Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D129101	2022-07-12 21:04:31 -07:00
Thomas Raoux	5f8cefebd9	[mlir][vector] Fix crash in vector.reduction canonicalization since vector.reduce support accumulator in all the cases remove the assert assuming old definition. Differential Revision: https://reviews.llvm.org/D129602	2022-07-12 23:15:30 +00:00
Thomas Raoux	8fe076ffe0	[mlir][VectorToLLVM] Fix bug in lowering of vector.reduce fmax/fmin The lowering of fmax/fmin reduce was ignoring the optional accumulator. Differential Revision: https://reviews.llvm.org/D129597	2022-07-12 22:03:39 +00:00
Jacques Pienaar	0db084d4c7	[mlir] Switch create to use NamedAttrList&& Avoids needing the two parallel functions as NamedAttrList already takes care of caching DictionaryAttr and implicitly can convert from either. Differential Revision: https://reviews.llvm.org/D129527	2022-07-12 13:24:09 -07:00

... 2 3 4 5 6 ...

12231 Commits