llvm-project

Commit Graph

Author	SHA1	Message	Date
Matthias Springer	fb7ec1f187	[mlir] Use VectorTransferPermutationMapLoweringPatterns in VectorToSCF VectorTransferPermutationMapLoweringPatterns can be enabled via a pass option. These additional patterns lower permutation maps to minor identity maps with broadcasting, if possible, allowing for more efficient vector load/stores. The option is deactivated by default. Differential Revision: https://reviews.llvm.org/D102593	2021-05-19 14:46:19 +09:00
MaheshRavishankar	e2b365948b	[mlir][Linalg] Break unnecessary dependency through unused `outs` tensor. LinalgOps that are all parallel do not use the value of `outs` tensor. The semantics is that the `outs` tensor is fully overwritten. Using anything other than `init_tensor` can add false dependencies between operations, when the use is just for the shape of the tensor. Adding a canonicalization to always use `init_tensor` in such cases, breaks this dependence. Differential Revision: https://reviews.llvm.org/D102561	2021-05-18 22:31:42 -07:00
Wenyi Zhao	851d02f61e	Enhance InferShapedTypeOpInterface to make it accessible during dialect conversion Original interfaces are not safe to be called during dialect conversion. This is because some ops (e.g. `dynamic_reshape(input, target_shape)`) depend on the values of their operands to calculate the output shape. However the operands may be out of reach during dialect conversion (e.g. converting from tensor world to buffer world). This patch provides a new kind of interface which accpets user-provided operands to solve this problem. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D102317	2021-05-19 02:51:14 +00:00
Richard Smith	80d981eda6	Revert "[IR] Add a Location to BlockArgument." and follow-on commit "[mlir] Speed up Lexer::getEncodedSourceLocation" This reverts commit `3043be9d2d` and commit `861d69a525`. This change resulted in printing textual MLIR that can't be parsed; see review thread https://reviews.llvm.org/D102567 for details.	2021-05-18 19:26:00 -07:00
River Riddle	2257e4a70e	[mlir] Allow derived rewrite patterns to define a non-virtual `initialize` hook This is a hook that allows for providing custom initialization of the pattern, e.g. if it has bounded recursion, setting the debug name, etc., without needing to define a custom constructor. A non-virtual hook was chosen to avoid polluting the vtable with code that we really just want to be inlined when constructing the pattern. The alternative to this would be to just define a constructor for each pattern, this unfortunately creates a lot of otherwise unnecessary boiler plate for a lot of patterns and a hook provides a much simpler/cleaner interface for the very common case. Differential Revision: https://reviews.llvm.org/D102440	2021-05-18 14:40:32 -07:00
River Riddle	f9ea3ebef2	[mlir-lsp-server] Add support for recording text document versions The version is used by LSP clients to ignore stale diagnostics, and can be used in a followup to help verify incremental changes. Differential Revision: https://reviews.llvm.org/D102644	2021-05-18 12:57:52 -07:00
Chris Lattner	3043be9d2d	[IR] Add a Location to BlockArgument. This adds the ability to specify a location when creating BlockArguments. Notably Value::getLoc() will return this correctly, which makes diagnostics more precise (e.g. the example in test-legalize-type-conversion.mlir). This is currently optional to avoid breaking any existing code - if absent, the BlockArgument defaults to using the location of its enclosing operation (preserving existing behavior). The bulk of this change is plumbing location tracking through the parser and printer to make sure it can round trip (in -mlir-print-debuginfo mode). This is complete for generic operations, but requires manual adoption for custom ops. I added support for function-like ops to round trip their argument locations - they print correctly, but when parsing the locations are dropped on the floor. I intend to fix this, but it will require more invasive plumbing through "function_like_impl" stuff so I think it best to split it out to its own patch. Differential Revision: https://reviews.llvm.org/D102567	2021-05-18 10:18:04 -07:00
Vinayaka Bandishti	a3917d3670	[MLIR][Affine] Privatize certain escaping memrefs During affine loop fusion, create private memrefs for escaping memrefs too under the conditions that: -- the source is not removed after fusion, and -- the destination does not write to the memref. This creates more fusion opportunities as illustrated in the test case. Reviewed By: bondhugula, ayzhuang Differential Revision: https://reviews.llvm.org/D102604	2021-05-18 22:23:02 +05:30
Nicolas Vasilache	f8dbd61074	[mlir][Linalg] Drop spuriously long matmul_column_major benchmark	2021-05-18 10:07:19 +00:00
Adrian Kuegel	fa765a0944	[mlir] Add folder for complex.ReOp and complex.ImOp. Now that complex constants are supported, we can also fold. Differential Revision: https://reviews.llvm.org/D102616	2021-05-18 11:27:23 +02:00
Jacques Pienaar	24bf554b10	Add type function for ConstShape op. - Enables inferring return type for ConstShape, takes into account valid return types; - The compatible return type function could be reused, leaving that for next use refactoring; Differential Revision: https://reviews.llvm.org/D102182	2021-05-17 11:47:19 -07:00
Aart Bik	5879da496c	[mlir][sparse] replace experimental flag with inplace attribute The experimental flag for "inplace" bufferization in the sparse compiler can be replaced with the new inplace attribute. This gives a uniform way of expressing the more efficient way of bufferization. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102538	2021-05-17 11:43:44 -07:00
Chris Lattner	648f34a284	Merge with mainline. Differential Revision: https://reviews.llvm.org/D102636	2021-05-17 11:15:10 -07:00
Rob Suderman	08068ddba7	[mlir][tosa] Fix tosa.avg_pool2d lowering to normalize correctly Initial version of pooling assumed normalization was accross all elements equally. TOSA actually requires the noramalization is perform by how many elements were summed (edges are not artifically dimmer). Updated the lowering to reflect this change with corresponding tests. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D102540	2021-05-17 10:00:43 -07:00
Valentin Clement	ab5ff154ab	[mlir][openacc] Translate ExitDataop to LLVM IR Translate ExitDataOp with delete and copyout operands to runtime call. This is done in a similar way as D101504. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D102381	2021-05-17 11:11:59 -04:00
Matthias Springer	2c9688d201	[mlir] Improve TransferOp verifier: broadcasts are in_bounds Broadcast dimensions of vector transfer ops are always in-bounds. This is consistent with the fact that the starting position of a transfer is always in-bounds. Differential Revision: https://reviews.llvm.org/D102566	2021-05-17 22:35:44 +09:00
Stephan Herhut	db81e88f25	[mlir][memref] Mark memref.buffer_cast as NoSideEffect This brings it in line with the bultin unrealized_conversion_cast, which memref.buffer_cast is a specialized version of. Differential Revision: https://reviews.llvm.org/D102608	2021-05-17 14:20:00 +02:00
Adrian Kuegel	967f07f547	Revert "[mlir] Add folder for complex.ReOp and complex.ImOp." This reverts commit `6b49834d65`. Some tests fail.	2021-05-17 13:49:42 +02:00
Adrian Kuegel	6b49834d65	[mlir] Add folder for complex.ReOp and complex.ImOp. Now that complex constants are supported, we can also fold. Differential Revision: https://reviews.llvm.org/D102609	2021-05-17 13:35:51 +02:00
Adam Paszke	d89602ed62	Add `mlirModuleFromOperation` to C API At the moment `MlirModule`s can be converted to `MlirOperation`s, but not the other way around (at least not without going around the C API). This makes it impossible to e.g. run passes over a `ModuleOp` created through `mlirOperationCreate`. Reviewed By: nicolasvasilache, mehdi_amini Differential Revision: https://reviews.llvm.org/D102497	2021-05-17 10:14:16 +00:00
Adrian Kuegel	5ef21506b9	Add support for complex constants to MLIR core. BEGIN_PUBLIC Add support for complex constants to MLIR core. END_PUBLIC Differential Revision: https://reviews.llvm.org/D101908	2021-05-17 09:12:39 +02:00
Matthias Springer	7ddeffee55	[mlir] Lower permutation maps on TransferWriteOps Add TransferWritePermutationLowering, which replaces permutation maps of TransferWriteOps with vector.transpose. Differential Revision: https://reviews.llvm.org/D102548	2021-05-17 15:30:46 +09:00
Matthias Springer	6774e5a995	[mlir] Fix in_bounds attr handling in TransferReadPermutationLowering The in_bounds attribute should also be transposed. Differential Revision: https://reviews.llvm.org/D102572	2021-05-17 15:28:16 +09:00
Uday Bondhugula	185ce8cdfc	[MLIR][PYTHON] Provide opt level for ExecutionEngine Python binding Provide an option to specify optimization level when creating an ExecutionEngine via the MLIR JIT Python binding. Not only is the specified optimization level used for code generation, but all LLVM optimization passes at the optimization level are also run prior to machine code generation (akin to the mlir-cpu-runner tool). Default opt level continues to remain at level two (-O2). Contributions in part from Prashant Kumar <prashantk@polymagelabs.com> as well. Differential Revision: https://reviews.llvm.org/D102551	2021-05-16 13:58:49 +05:30
Aart Bik	56fd4c1cf8	[mlir][sparse] prepare runtime support lib for multiple dim level types We are moving from just dense/compressed to more general dim level types, so we need more than just an "i1" array for annotations. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102520	2021-05-14 19:12:07 -07:00
Nicolas Vasilache	6f90955f69	[mlir][Linalg] Add support for subtensor_insert comprehensive bufferization (3/n) Differential revision: https://reviews.llvm.org/D102417	2021-05-14 21:51:00 +00:00
River Riddle	dfacb8c8d4	[mlir] Add missing dependence to TestDialect from TestTransforms This was accidentally dropped in D102456	2021-05-14 11:00:31 -07:00
Ian Bearman	0816b96a10	Allow same memory space for SRC and DST of dma_start operations This change allows the SRC and DST of dma_start operations to be located in the same memory space. This applies to both the Affine dialect and Memref dialect versions of these Ops. The documention has been updated to reflect this by explicitly stating overlapping memory locations are not supported (undefined behavior). Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D102274	2021-05-14 10:40:15 -07:00
River Riddle	3fef2d26a3	[mlir][NFC] Move passes in test/lib/Transforms/ to a directory that mirrors what they test test/lib/Transforms/ has bitrot and become somewhat of a dumping grounds for testing pretty much any part of the project. This revision cleans this up, and moves the files within to a directory that reflects what is actually being tested. Differential Revision: https://reviews.llvm.org/D102456	2021-05-14 10:28:11 -07:00
Benoit Jacob	e0a88db545	Fix some typos. Fix some typos Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D102503	2021-05-14 21:34:09 +05:30
Nicolas Vasilache	bebf5d56bf	[mlir][Linalg] Add support for vector.transfer ops to comprehensive bufferization (2/n). Differential revision: https://reviews.llvm.org/D102395	2021-05-13 22:26:28 +00:00
Nicolas Vasilache	1e01a8919f	[mlir][Linalg] Add ComprehensiveBufferize for functions(step 1/n) This is the first step towards upstreaming comprehensive bufferization following the discourse post: https://llvm.discourse.group/t/rfc-linalg-on-tensors-update-and-comprehensive-bufferization-rfc/3373/6. This first commit introduces a basic pass for bufferizing within function boundaries, assuming that the inplaceable function boundaries have been marked as such. Differential revision: https://reviews.llvm.org/D101693	2021-05-13 22:24:40 +00:00
Rob Suderman	f97d970a49	[mlir][tosa] Add lowering to tosa.abs for integer cases Integer case requires decomposing to simple LLVM operatons. Differential Revision: https://reviews.llvm.org/D101809	2021-05-13 13:55:17 -07:00
natashaknk	0831793ed9	[mlir][tosa] Add tosa.div integer lowering to linalg.generic. Lowering div elementwise op to the linalg dialect. Since tosa only supports integer division, that is the only version that is currently implemented. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D102430	2021-05-13 13:16:00 -07:00
Weiwei Li	cd0eeb52ad	[mlir][spirv] Define spv.ImageQuerySize operation Support OpImageQuerySize in spirv dialect co-authored-by: Alan Liu <alanliu.yf@gmail.com> Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D102029	2021-05-13 13:17:08 -04:00
Jacques Pienaar	3f2891db6d	[mlir] Add python test for shape dialect Add basic test for shape.const_shape op as start. Differential Revision: https://reviews.llvm.org/D102341	2021-05-13 09:13:47 -07:00
Tobias Gysi	cf194da1bb	[mlir][linalg] Remove IndexedGenericOp support from FusionOnTensors... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102163	2021-05-13 14:57:16 +00:00
Matthias Springer	0f24163870	[mlir] Replace vector-to-scf with progressive-vector-to-scf Depends On D102388 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D102101	2021-05-13 23:27:31 +09:00
Tobias Gysi	f358c37209	[mlir][linalg] Remove IndexedGenericOp support from DropUnitDims... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102235	2021-05-13 14:18:59 +00:00
Matthias Springer	d020dd2b21	[mlir] Migrate vector-to-loops.mlir to ProgressiveVectorToSCF Create a copy of vector-to-loops.mlir and adapt the test for ProgressiveVectorToSCF. Fix a small bug in getExtractOp() triggered by this test. Differential Revision: https://reviews.llvm.org/D102388	2021-05-13 22:48:20 +09:00
Matthias Springer	bf068e1077	[mlir] Do not use pass labels in unrolled ProgressiveVectorToSCF Do not rely on pass labels to detect if the pattern was already applied in the past (which allows for more some extra optimizations to avoid extra InsertOps and ExtractOps). Instead, check if these optimizations can be applied on-the-fly. This also fixes a bug, where vector.insert and vector.extract ops sometimes disappeared in the middle of the pass because they get folded away, but the next application of the pattern expected them to be there. Differential Revision: https://reviews.llvm.org/D102206	2021-05-13 22:01:08 +09:00
Matthias Springer	60da33c2d4	[mlir] Support masks in TransferOpReduceRank and TransferReadPermutationLowering These two patterns allow for more efficient codegen in VectorToSCF. Differential Revision: https://reviews.llvm.org/D102222	2021-05-13 15:08:08 +09:00
Rob Suderman	3f8aafd790	[mlir][tosa] Fix tosa.cast semantics to perform rounding/clipping Rounding to integers requires rounding (for floating points) and clipping to the min/max values of the destination range. Added this behavior and updated tests appropriately. Reviewed By: sjarus, silvas Differential Revision: https://reviews.llvm.org/D102375	2021-05-12 21:53:53 -07:00
Matthias Springer	9b77be5583	[mlir] Unrolled progressive-vector-to-scf. Instead of an SCF for loop, these pattern generate fully unrolled loops with no temporary buffer allocations. Differential Revision: https://reviews.llvm.org/D101981	2021-05-13 13:08:48 +09:00
Matthias Springer	864adf399e	[mlir] Allow empty position in vector.insert and vector.extract Such ops are no-ops and are folded to their respective `source`/`vector` operand. Differential Revision: https://reviews.llvm.org/D101879	2021-05-13 12:54:18 +09:00
Matthias Springer	c52cbe63e4	[mlir] Fix masked vector transfer ops with broadcasts Broadcast dimensions of a vector transfer op have no corresponding dimension in the mask vector. E.g., a 2-D TransferReadOp, where one dimension is a broadcast, can have a 1-D `mask` attribute. This commit also adds a few additional transfer op integration tests for various combinations of broadcasts, masking, dim transposes, etc. Differential Revision: https://reviews.llvm.org/D101745	2021-05-13 12:46:03 +09:00
Matthias Springer	6555e53ab0	Revert "[mlir] Fix masked vector transfer ops with broadcasts" This reverts commit `c9087788f7`. Accidentally pushed old version of the commit.	2021-05-13 11:55:00 +09:00
Matthias Springer	c9087788f7	[mlir] Fix masked vector transfer ops with broadcasts Broadcast dimensions of a vector transfer op have no corresponding dimension in the mask vector. E.g., a 2-D TransferReadOp, where one dimension is a broadcast, can have a 1-D `mask` attribute. This commit also adds a few additional transfer op integration tests for various combinations of broadcasts, masking, dim transposes, etc. Differential Revision: https://reviews.llvm.org/D101745	2021-05-13 11:37:36 +09:00
Aart Bik	58d12332a4	[mlir][sparse][capi][python] add sparse tensor passes First set of "boilerplate" to get sparse tensor passes available through CAPI and Python. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D102362	2021-05-12 16:40:50 -07:00
River Riddle	b3911cdfc8	[mlir-lsp-server] Add support for sending diagnostics to the client This allows for diagnostics emitted during parsing/verification to be surfaced to the user by the language client, as opposed to just being emitted to the logs like they are now. Differential Revision: https://reviews.llvm.org/D102293	2021-05-12 13:02:25 -07:00
Suraj Sudhir	4b01435230	[mlir][tosa] Remove tosa.identityn operator Removes the identityn operator from TOSA MLIR definition. Removes TosaToLinAlg mappings Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D102329	2021-05-12 12:46:22 -07:00
Rob Suderman	7b57517507	[mlir][linalg] Fixed issue generating reassociation map with Rank-0 types Rank-0 case causes a graph during linalg reshape operation. Differential Revision: https://reviews.llvm.org/D102282	2021-05-12 11:00:51 -07:00
Valentin Clement	113b807017	[mlir][openacc] Add OpenACC translation to LLVM IR (enter_data op create/copyin) This patch begins to translate acc.enter_data operation to call to tgt runtime call. It currently only translate create/copyin operands of memref type. This acts as a basis to add support for FIR types in the Flang/OpenACC support. It follows more or less a similar path than clang with `omp target enter data map` directives. This patch is taking a different approach than D100678 and perform a translation to LLVM IR and make use of the OpenMPIRBuilder instead of doing a conversion to the LLVMIR dialect. OpenACC support in Flang will rely on the current OpenMP runtime where 1:1 lowering can be applied. Some extension will be added where features are not available yet. Big part of this code will be shared for other standalone data operations in the OpenACC dialect such as acc.exit_data and acc.update. It is likely that parts of the lowering can also be shared later with the ops for standalone data directives in the OpenMP dialect when they are introduced. This is an initial translation and it probably needs more work. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D101504	2021-05-12 13:41:14 -04:00
Inho Seo	5480ea6c84	Update static bound checker for Linalg to cover decreasing cases The current static checker for linalg does not work on the decreasing index cases well. So, this is to Update the current static bound checker for linalg to cover decreasing index cases. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D102302	2021-05-12 10:29:19 -07:00
Aart Bik	ca5d0a7310	[mlir][sparse] keep runtime support library signature consistent Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102285	2021-05-12 09:59:46 -07:00
Fabian Schuiki	33f908c428	[MLIR] Factor pass timing out into a dedicated timing manager This factors out the pass timing code into a separate `TimingManager` that can be plugged into the `PassManager` from the outside. Users are able to provide their own implementation of this manager, and use it to time additional code paths outside of the pass manager. Also allows for multiple `PassManager`s to run and contribute to a single timing report. More specifically, moves most of the existing infrastructure in `Pass/PassTiming.cpp` into a new `Support/Timing.cpp` file and adds a public interface in `Support/Timing.h`. The `PassTiming` instrumentation becomes a wrapper around the new timing infrastructure which adapts the instrumentation callbacks to the new timers. Reviewed By: rriddle, lattner Differential Revision: https://reviews.llvm.org/D100647	2021-05-12 18:14:51 +02:00
Valentin Clement	6110b667b0	[mlir][openacc] Conversion of data operand to LLVM IR dialect Add a conversion pass to convert higher-level type before translation. This conversion extract meangingful information and pack it into a struct that the translation (D101504) will be able to understand. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D102170	2021-05-12 11:34:15 -04:00
Tobias Gysi	06bb9cf30d	[mlir][linalg] Remove IndexedGenericOp support from LinalgInterchangePattern... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102245	2021-05-12 13:01:37 +00:00
Tobias Gysi	c6b96ae06f	[mlir][linalg] Remove IndexedGenericOp support from LinalgBufferize... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102308	2021-05-12 12:15:05 +00:00
Tobias Gysi	0fb364a97e	[mlir][linalg] Remove IndexedGenericOp support from LinalgToStandard... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102236	2021-05-12 11:56:07 +00:00
Dumitru Potop	9a0ea5994b	[mlir] Support alignment in LLVM dialect GlobalOp First step in adding alignment as an attribute to MLIR global definitions. Alignment can be specified for global objects in LLVM IR. It can also be specified as a named attribute in the LLVMIR dialect of MLIR. However, this attribute has no standing and is discarded during translation from MLIR to LLVM IR. This patch does two things: First, it adds the attribute to the syntax of the llvm.mlir.global operation, and by doing this it also adds accessors and verifications. The syntax is "align=XX" (with XX being an integer), placed right after the value of the operation. Second, it allows transforming this operation to and from LLVM IR. It is checked whether the value is an integer power of 2. Reviewed By: ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D101492	2021-05-12 09:07:20 +02:00
Rob Suderman	764ad3b3fa	[mlir][tosa] Tosa elementwise broadcasting had some minor bugs Updated tests to include broadcast of left and right. Includes bypass if in-type and out-type match shape (no broadcasting). Differential Revision: https://reviews.llvm.org/D102276	2021-05-11 13:58:06 -07:00
Sean Silva	49755871ad	[mlir][ODS]: Add per-op cppNamespace. This is useful for dialects that have logical subparts. Differential Revision: https://reviews.llvm.org/D102200	2021-05-11 10:48:05 -07:00
Benjamin Kramer	b20e150c9b	[mlir] Use static shape knowledge when lowering memref.reshape This is actually necessary for correctness, as memref.reinterpret_cast doesn't verify if the output shape doesn't match the static sizes. Differential Revision: https://reviews.llvm.org/D102232	2021-05-11 18:21:09 +02:00
Uday Bondhugula	1c777ab459	[MLIR] Switch llvm.noalias to a unit attribute Switch llvm.noalias attribute from a boolean attribute to a unit attribute. Differential Revision: https://reviews.llvm.org/D102225	2021-05-11 15:41:09 +05:30
Tres Popp	88a48999d2	Support VectorTransfer splitting on writes also. VectorTransfer split previously only split read xfer ops. This adds the same logic to write ops. The resulting code involves 2 conditionals for write ops while read ops only needed 1, but the created ops are built upon the same patterns, so pattern matching/expectations are all consistent other than in regards to the if/else ops. Differential Revision: https://reviews.llvm.org/D102157	2021-05-11 10:33:27 +02:00
Tobias Gysi	7bc6df2528	[mlir][linalg] Remove IndexedGenericOp support from LinalgToLoops... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102187	2021-05-11 06:53:47 +00:00
Tobias Gysi	6676e09b22	[mlir][linalg] Remove IndexedGenericOp support from Fusion... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102174	2021-05-11 06:49:25 +00:00
Tobias Gysi	d69bccf1ed	[mlir][linalg] Remove IndexedGenericOp support from Tiling... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102176	2021-05-11 05:53:58 +00:00
Benjamin Kramer	7b52aeadfa	[mlir][Tensor] Add folding for tensor.from_elements This trivially folds into a constant when all operands are constant. Differential Revision: https://reviews.llvm.org/D102199	2021-05-11 00:42:45 +02:00
Stella Laurenzo	295087644a	[mlir] Fix windows build bot break due to use of `alloca` in a test. Differential Revision: https://reviews.llvm.org/D102189	2021-05-10 20:39:16 +00:00
Stella Laurenzo	a2c8aebd8f	[mlir][Python] Finish adding RankedTensorType support for encoding. Differential Revision: https://reviews.llvm.org/D102184	2021-05-10 20:39:16 +00:00
Aart Bik	96a23911f6	[mlir][sparse] complete migration to sparse tensor type A very elaborate, but also very fun revision because all puzzle pieces are finally "falling in place". 1. replaces lingalg annotations + flags with proper sparse tensor types 2. add rigorous verification on sparse tensor type and sparse primitives 3. removes glue and clutter on opaque pointers in favor of sparse tensor types 4. migrates all tests to use sparse tensor types NOTE: next CL will remove all obsoleted sparse code in Linalg Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102095	2021-05-10 12:55:22 -07:00
Lei Zhang	7e71823f1d	[mlir][linalg] Restrict distribution to parallel dims According to the API contract, LinalgLoopDistributionOptions expects to work on parallel iterators. When getting processor information, only loop ranges for parallel dimensions should be fed in. But right now after generating scf.for loop nests, we feed in all loops, including the ones materialized for reduction iterators. This can cause unexpected distribution of reduction dimensions. This commit fixes it. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D102079	2021-05-10 15:23:00 -04:00
Stella Laurenzo	f38633d1bb	[mlir][Python] Re-export cext sparse_tensor module to the public namespace. * This was left out of the previous commit accidentally. Differential Revision: https://reviews.llvm.org/D102183	2021-05-10 18:08:29 +00:00
Stella Laurenzo	f13893f66a	[mlir][Python] Upstream the PybindAdaptors.h helpers and use it to implement sparse_tensor.encoding. * The PybindAdaptors.h file has been evolving across different sub-projects (npcomp, circt) and has been successfully used for out of tree python API interop/extensions and defining custom types. * Since sparse_tensor.encoding is the first in-tree custom attribute we are supporting, it seemed like the right time to upstream this header and use it to define the attribute in a way that we can support for both in-tree and out-of-tree use (prior, I had not wanted to upstream dead code which was not used in-tree). * Adapted the circt version of `mlir_type_subclass`, also providing an `mlir_attribute_subclass`. As we get a bit of mileage on this, I would like to transition the builtin types/attributes to this mechanism and delete the old in-tree only `PyConcreteType` and `PyConcreteAttribute` template helpers (which cannot work reliably out of tree as they depend on internals). * Added support for defaulting the MlirContext if none is passed so that we can support the same idioms as in-tree versions. There is quite a bit going on here and I can split it up if needed, but would prefer to keep the first use and the header together so sending out in one patch. Differential Revision: https://reviews.llvm.org/D102144	2021-05-10 17:15:43 +00:00
Stella Laurenzo	bcfa7baec8	[mlir][CAPI] Add CAPI bindings for the sparse_tensor dialect. * Adds dialect registration, hand coded 'encoding' attribute and test. * An MLIR CAPI tablegen backend for attributes does not exist, and this is a relatively complicated case. I opted to hand code it in a canonical way for now, which will provide a reasonable blueprint for building out the tablegen version in the future. * Also added a (local) CMake function for declaring new CAPI tests, since it was getting repetitive/buggy. Differential Revision: https://reviews.llvm.org/D102141	2021-05-10 16:54:56 +00:00
Mats Petersson	7280f4b279	[OpenMP][MLIR]Add support for guided, auto and runtime scheduling When using parallel loop construct, the OpenMP specification allows for guided, auto and runtime as scheduling variants (as well as static and dynamic which are already supported). This adds the translation from MLIR to LLVM-IR for these scheduling variants. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101435	2021-05-10 09:18:52 +00:00
Julian Gross	fc253e69f9	Fixed bug in buffer deallocation pass using unranked memref types. In the buffer deallocation pass, unranked memref types are not properly supported. After investigating this issue, it turns out that the Clone and Dealloc operation does not support unranked memref types in the current implementation. This patch adds the missing feature and enables the transformation of any memref type. This patch solves this bug: https://bugs.llvm.org/show_bug.cgi?id=48385 Differential Revision: https://reviews.llvm.org/D101760	2021-05-10 10:50:29 +02:00
Frederik Gossen	a81e45b8bc	[MLIR][Shape] Concretize broadcast result type if possible As a canonicalization, infer the resulting shape rank if possible. Differential Revision: https://reviews.llvm.org/D102068	2021-05-10 10:24:08 +02:00
Alex Zinenko	72d013dd73	[mlir] OpenMP-to-LLVM: properly set outer alloca insertion point Previously, the OpenMP to LLVM IR conversion was setting the alloca insertion point to the same position as the main compuation when converting OpenMP `parallel` operations. This is problematic if, for example, the `parallel` operation is placed inside a loop and would keep allocating on stack on each iteration leading to stack overflow. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D101307	2021-05-10 10:04:52 +02:00
Chia-hung Duan	34b5482b33	Support NativeCodeCall binding in rewrite pattern. We are able to bind the result from native function while rewriting pattern. In matching pattern, if we want to get some values back, we can do that by passing parameter as return value placeholder. Besides, add the semantic of '$_self' in NativeCodeCall while matching, it'll be the operation that defines certain operand. Differential Revision: https://reviews.llvm.org/D100746	2021-05-10 09:29:27 +08:00
River Riddle	53b946aa63	[mlir] Refactor the representation of function-like argument/result attributes. The current design uses a unique entry for each argument/result attribute, with the name of the entry being something like "arg0". This provides for a somewhat sparse design, but ends up being much more expensive (from a runtime perspective) in-practice. The design requires building a string every time we lookup the dictionary for a specific arg/result, and also requires N attribute lookups when collecting all of the arg/result attribute dictionaries. This revision restructures the design to instead have an ArrayAttr that contains all of the attribute dictionaries for arguments and another for results. This design reduces the number of attribute name lookups to 1, and allows for O(1) lookup for individual element dictionaries. The major downside is that we can end up with larger memory usage, as the ArrayAttr contains an entry for each element even if that element has no attributes. If the memory usage becomes too problematic, we can experiment with a more sparse structure that still provides a lot of the wins in this revision. This dropped the compilation time of a somewhat large TensorFlow model from ~650 seconds to ~400 seconds. Differential Revision: https://reviews.llvm.org/D102035	2021-05-07 19:32:31 -07:00
River Riddle	5c84195b8c	[mlir] Add hover support to mlir-lsp-server This provides information when the user hovers over a part of the source .mlir file. This revision adds the following hover behavior: * Operation: - Shows the generic form. * Operation Result: - Shows the parent operation name, result number(s), and type(s). * Block: - Shows the parent operation name, block number, predecessors, and successors. * Block Argument: - Shows the parent operation name, parent block, argument number, and type. Differential Revision: https://reviews.llvm.org/D101113	2021-05-07 18:09:01 -07:00
thomasraoux	d0453a8933	[mlir][vector] Extend pattern to trim lead unit dimension to Splat Op Differential Revision: https://reviews.llvm.org/D102091	2021-05-07 13:54:41 -07:00
Alexander Belyaev	3444996b4c	[mlir] Add a pattern to bufferize std.index_cast. Differential Revision: https://reviews.llvm.org/D102088	2021-05-07 21:32:02 +02:00
Alexander Belyaev	a3f22d020b	[mlir] Add a pattern to bufferize linalg.tensor_reshape. Differential Revision: https://reviews.llvm.org/D102089	2021-05-07 21:31:17 +02:00
thomasraoux	a970e69d6b	[mlir][vector] add pattern to cast away leading unit dim for elementwise op Differential Revision: https://reviews.llvm.org/D102034	2021-05-07 07:54:09 -07:00
thomasraoux	565ee6afc7	[mlir][spirv] add support lowering of extract_slice to scalar type Differential Revision: https://reviews.llvm.org/D102041	2021-05-07 07:52:02 -07:00
KareemErgawy-TomTom	e4dee7e730	[MLIR][SPIRV] Properly (de-)serialize BranchConditionalOp. Implements proper (de-)serialization logic for BranchConditionalOp when such ops have true/false target operands. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D101602	2021-05-07 09:00:50 +02:00
Tobias Gysi	26e916334e	[mlir][linalg] Add IndexedGenericOp to GenericOp canonicalization. Replace all `linalg.indexed_generic` ops by `linalg.generic` ops that access the iteration indices using the `linalg.index` op. Differential Revision: https://reviews.llvm.org/D101612	2021-05-07 06:00:16 +00:00
MaheshRavishankar	05a89312d8	[mlir][Linalg] Allow folding to rank-zero tensor when using rank-reducing subtensors. The pattern to convert subtensor ops to their rank-reduced versions (by dropping unit-dims in the result) can also convert to a zero-rank tensor. Handle that case. This also fixes a OOB access bug in the existing pattern for such cases. Differential Revision: https://reviews.llvm.org/D101949	2021-05-06 19:03:55 -07:00
Rob Suderman	d3e987c389	[mlir][tosa] Added div op, variadic concat. Removed placeholder. Spec v0.22 alignment. Nearly complete alignment to spec v0.22 - Adds Div op - Concat inputs now variadic - Removes Placeholder op Note: TF side PR https://github.com/tensorflow/tensorflow/pull/48921 deletes Concat legalizations to avoid breaking TensorFlow CI. This must be merged only after the TF PR has merged. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D101958	2021-05-06 15:55:58 -07:00
Amy Zhuang	5dc1ed3f62	[mlir] Update dstNode after DenseMap insertion in loop fusion pass. Reviewed By: vinayaka-polymage Differential Revision: https://reviews.llvm.org/D101794	2021-05-06 15:23:59 -07:00
Lei Zhang	41bc54cc56	[mlir][spirv] NFC: Replace OwningSPIRVModuleRef with OwningOpRef Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D102009	2021-05-06 17:17:44 -04:00
Denys Shabalin	1f109f9d9c	Fix array attribute in bindings for linalg.init_tensor Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D101998	2021-05-06 18:25:59 +02:00
thomasraoux	0b303da6f8	[mlir][vector] add pattern to cast away lead unit dimension for broadcast op Differential Revision: https://reviews.llvm.org/D101955	2021-05-06 08:02:17 -07:00
Navdeep Kumar	875eb523c1	[MLIR][GPU][NVVM] Add warp synchronous matrix-multiply accumulate ops Add warp synchronous matrix-multiply accumulate ops in GPU and NVVM dialect. Add following three ops to GPU dialect :- 1.) subgroup_mma_load_matrix 2.) subgroup_mma_store_matrix 3.) subgroup_mma_compute Add following three ops to NVVM dialect :- 1.) wmma.m16n16k16.load.[a,b,c].[f16,f32].row.stride 2.) wmma.m16n16k16.store.d.[f16,f32].row.stride 3.) wmma.m16n16k16.mma.row.row.[f16,f32].[f16,f32] Reviewed By: bondhugula, ftynse, ThomasRaoux Differential Revision: https://reviews.llvm.org/D95330	2021-05-06 12:06:25 +05:30
Emilio Cota	3c952ab25f	[mlir] Check generated IR of math_polynomial_approx.mlir Instead of just checking that we emit something. Differential Revision: https://reviews.llvm.org/D101940	2021-05-05 16:42:48 -07:00
MaheshRavishankar	4b2d7ef3ea	[mlir][Linalg] Fix test to use new reshape op form. Differential Revision: https://reviews.llvm.org/D101956	2021-05-05 16:06:58 -07:00
MaheshRavishankar	b6060b7673	[mlir][Linalg] Fix element type of results when folding reshapes. Fixing a minor bug which lead to element type of the output being modified when folding reshapes with generic op. Differential Revision: https://reviews.llvm.org/D101942	2021-05-05 15:40:41 -07:00
Emilio Cota	0edc4bc84a	[mlir] Add polynomial approximation for math::ExpM1 This approximation matches the one in Eigen. ``` name old cpu/op new cpu/op delta BM_mlir_Expm1_f32/10 90.9ns ± 4% 52.2ns ± 4% -42.60% (p=0.000 n=74+87) BM_mlir_Expm1_f32/100 837ns ± 3% 231ns ± 4% -72.43% (p=0.000 n=79+69) BM_mlir_Expm1_f32/1k 8.43µs ± 3% 1.58µs ± 5% -81.30% (p=0.000 n=77+83) BM_mlir_Expm1_f32/10k 83.8µs ± 3% 15.4µs ± 5% -81.65% (p=0.000 n=83+69) BM_eigen_s_Expm1_f32/10 68.8ns ±17% 72.5ns ±14% +5.40% (p=0.000 n=118+115) BM_eigen_s_Expm1_f32/100 694ns ±11% 717ns ± 2% +3.34% (p=0.000 n=120+75) BM_eigen_s_Expm1_f32/1k 7.69µs ± 2% 7.97µs ±11% +3.56% (p=0.000 n=95+117) BM_eigen_s_Expm1_f32/10k 88.0µs ± 1% 89.3µs ± 6% +1.45% (p=0.000 n=74+106) BM_eigen_v_Expm1_f32/10 44.3ns ± 6% 45.0ns ± 8% +1.45% (p=0.018 n=81+111) BM_eigen_v_Expm1_f32/100 351ns ± 1% 360ns ± 9% +2.58% (p=0.000 n=73+99) BM_eigen_v_Expm1_f32/1k 3.31µs ± 1% 3.42µs ± 9% +3.37% (p=0.000 n=71+100) BM_eigen_v_Expm1_f32/10k 33.7µs ± 8% 34.1µs ± 9% +1.04% (p=0.007 n=99+98) ``` Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D101852	2021-05-05 14:31:34 -07:00
Rob Suderman	7abb56c78b	[mlir][tosa] Add tosa.depthwise lowering to existing linalg.depthwise_conv Implements support for undialated depthwise convolution using the existing depthwise convolution operation. Once convolutions migrate to yaml defined versions we can rewrite for cleaner implementation. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D101579	2021-05-05 13:30:05 -07:00
Javier Setoain	95861216ac	[mlir][ArmSVE] Add masked arithmetic operations These instructions map to SVE-specific instrinsics that accept a predicate operand to support control flow in vector code. Differential Revision: https://reviews.llvm.org/D100982	2021-05-05 17:41:58 +01:00
Sergei Grechanik	d80b04ab00	[mlir][Affine][Vector] Support vectorizing reduction loops This patch adds support for vectorizing loops with 'iter_args' implementing known reductions along the vector dimension. Comparing to the non-vector-dimension case, two additional things are done during vectorization of such loops: - The resulting vector returned from the loop is reduced to a scalar using `vector.reduce`. - In some cases a mask is applied to the vector yielded at the end of the loop to prevent garbage values from being written to the accumulator. Vectorization of reduction loops is disabled by default. To enable it, a map from loops to array of reduction descriptors should be explicitly passed to `vectorizeAffineLoops`, or `vectorize-reductions=true` should be passed to the SuperVectorize pass. Current limitations: - Loops with a non-unit step size are not supported. - n-D vectorization with n > 1 is not supported. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D100694	2021-05-05 09:03:59 -07:00
Tobias Gysi	4a6ee23d83	[mlir][linalg] Fix bug in the fusion on tensors index op handling. The old index op handling let the new index operations point back to the producer block. As a result, after fusion some index operations in the fused block had back references to the old producer block resulting in illegal IR. The patch now relies on a block and value mapping to avoid such back references. Differential Revision: https://reviews.llvm.org/D101887	2021-05-05 14:46:08 +00:00
Alexander Belyaev	2865d114f9	[mlir] Use ReassociationIndices instead of affine maps in linalg.reshape. Differential Revision: https://reviews.llvm.org/D101861	2021-05-05 12:59:57 +02:00
Javier Setoain	001d601ac4	[mlir][ArmSVE] Add basic arithmetic operations While we figure out how to best add Standard support for scalable vectors, these instructions provide a workaround for basic arithmetic between scalable vectors. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D100837	2021-05-05 09:50:18 +02:00
William S. Moses	f4a2dbfe29	[MLIR][SCF] Combine adjacent scf.if with same condition Differential Revision: https://reviews.llvm.org/D101798	2021-05-05 00:39:58 -04:00
Aart Bik	a2c9d4bb04	[mlir][sparse] Introduce proper sparsification passes This revision migrates more code from Linalg into the new permanent home of SparseTensor. It replaces the test passes with proper compiler passes. NOTE: the actual removal of the last glue and clutter in Linalg will follow Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D101811	2021-05-04 17:10:09 -07:00
River Riddle	c1c1df6347	[mlir] Fix region successor bug in forward dataflow analysis We weren't properly visiting region successors when the terminator wasn't return like, which could create incorrect results in the analysis. This revision ensures that we properly visit region successors, to avoid optimistically assuming a value is constant when it isn't. Differential Revision: https://reviews.llvm.org/D101783	2021-05-04 14:50:37 -07:00
Rob Suderman	1f7adf8cb1	[mlir][tosa] Fix tosa.concat by inserting linalg.fill after linalg.init All linalg.init operations must be fed into a linalg operation before subtensor. The inserted linalg.fill guarantees it executes correctly. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D101848	2021-05-04 14:26:28 -07:00
William S. Moses	8e211bf1c8	[MLIR][SCF] Assume uses of condition in the body of scf.while is true Differential Revision: https://reviews.llvm.org/D101801	2021-05-04 11:39:07 -04:00
William S. Moses	93297e4bac	[MLIR] Replace a not of a comparison with appropriate comparison Differential Revision: https://reviews.llvm.org/D101710	2021-05-04 11:23:29 -04:00
Adrian Kuegel	93537fabce	[mlir] Add lowering from math.expm1 to LLVM. Differential Revision: https://reviews.llvm.org/D96776	2021-05-04 14:22:10 +02:00
natashaknk	07ce5c99d7	[mlir][tosa] Add lowerings for tosa.equal and tosa.arithmetic_right_shift Lowerings equal and arithmetic_right_shift for elementwise ops to linalg dialect using linalg.generic Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D101804	2021-05-03 18:26:49 -07:00
Emilio Cota	1c0374e770	[mlir] Add polynomial approximation for math::Log1p This approximation matches the one in Eigen. ``` name old cpu/op new cpu/op delta BM_mlir_Log1p_f32/10 83.2ns ± 7% 34.8ns ± 5% -58.19% (p=0.000 n=84+71) BM_mlir_Log1p_f32/100 664ns ± 4% 129ns ± 4% -80.57% (p=0.000 n=82+82) BM_mlir_Log1p_f32/1k 6.75µs ± 4% 0.81µs ± 3% -88.07% (p=0.000 n=88+79) BM_mlir_Log1p_f32/10k 76.5µs ± 3% 7.8µs ± 4% -89.84% (p=0.000 n=80+80) BM_eigen_s_Log1p_f32/10 70.1ns ±14% 72.6ns ±14% +3.49% (p=0.000 n=116+112) BM_eigen_s_Log1p_f32/100 706ns ± 9% 717ns ± 3% +1.60% (p=0.018 n=117+80) BM_eigen_s_Log1p_f32/1k 8.26µs ± 1% 8.26µs ± 1% ~ (p=0.567 n=84+86) BM_eigen_s_Log1p_f32/10k 92.1µs ± 5% 92.6µs ± 6% +0.60% (p=0.047 n=115+115) BM_eigen_v_Log1p_f32/10 31.8ns ±24% 34.9ns ±17% +9.72% (p=0.000 n=98+96) BM_eigen_v_Log1p_f32/100 169ns ±10% 177ns ± 5% +4.66% (p=0.000 n=119+81) BM_eigen_v_Log1p_f32/1k 1.42µs ± 4% 1.46µs ± 8% +2.70% (p=0.000 n=93+113) BM_eigen_v_Log1p_f32/10k 14.4µs ± 5% 14.9µs ± 8% +3.61% (p=0.000 n=115+110) ``` Reviewed By: ezhulenev, ftynse Differential Revision: https://reviews.llvm.org/D101765	2021-05-03 15:11:37 -07:00
MaheshRavishankar	a6e09391bb	[mlir][Linalg] Add a utility method to get reassociations maps for reshape. Given the source and destination shapes, if they are static, or if the expanded/collapsed dimensions are unit-extent, it is possible to compute the reassociation maps that can be used to reshape one type into another. Add a utility method to return the reassociation maps when possible. This utility function can be used to fuse a sequence of reshape ops, given the type of the source of the producer and the final result type. This pattern supercedes a more constrained folding pattern added to DropUnitDims pass. Differential Revision: https://reviews.llvm.org/D101343	2021-05-03 14:40:15 -07:00
Aart Bik	90d18e106b	[mlir][sparse] fixed typo: sparse -> sparse_tensor Test passes either way, but this is full name of dialect Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D101774	2021-05-03 14:19:09 -07:00
MaheshRavishankar	fd15e2b825	[mlir][Linalg] Use rank-reduced versions of subtensor and subtensor insert when possible. Convert subtensor and subtensor_insert operations to use their rank-reduced versions to drop unit dimensions. Differential Revision: https://reviews.llvm.org/D101495	2021-05-03 12:51:24 -07:00
thomasraoux	9621c1ef56	[mlir][linalg] Fix vectorization bug in vector transfer indexing map calculation The current implementation had a bug as it was relying on the target vector dimension sizes to calculate where to insert broadcast. If several dimensions have the same size we may insert the broadcast on the wrong dimension. The correct broadcast cannot be inferred from the type of the source and destination vector. Instead when we want to extend transfer ops we calculate an "inverse" map to the projected permutation and insert broadcast in place of the projected dimensions. Differential Revision: https://reviews.llvm.org/D101738	2021-05-03 12:16:38 -07:00
Frederik Gossen	ec339163a7	[MLIR][Linalg] Lower `linalg.tiled_loop` in a separate pass Add dedicated pass `convert-linalg-tiled-loops-to-scf` to lower `linalg.tiled_loop`s. Differential Revision: https://reviews.llvm.org/D101768	2021-05-03 21:02:02 +02:00
Stella Laurenzo	9f3f6d7bd8	Move MLIR python sources to mlir/python. * NFC but has some fixes for CMake glitches discovered along the way (things not cleaning properly, co-mingled depends). * Includes previously unsubmitted fix in D98681 and a TODO to fix it more appropriately in a smaller followup. Differential Revision: https://reviews.llvm.org/D101493	2021-05-03 18:36:48 +00:00
thomasraoux	d51275cbc0	[mlir][spirv] Add support to convert std.splat op Differential Revision: https://reviews.llvm.org/D101511	2021-05-03 10:57:40 -07:00
thomasraoux	f44c76d6e9	[mlir][vector] Extend vector transfer unrolling to support permutations and broadcast Differential Revision: https://reviews.llvm.org/D101637	2021-05-03 10:47:02 -07:00
thomasraoux	7417541fd8	[mlir][vector] Add canonicalization for extract/insert -> shapecast Differential Revision: https://reviews.llvm.org/D101643	2021-05-03 10:41:15 -07:00
Benjamin Kramer	96a7900eb0	[mlir] Fix multidimensional lowering from std.select to llvm.select The converter assumed that all operands have the same type, that's not true for select. Differential Revision: https://reviews.llvm.org/D101767	2021-05-03 19:30:49 +02:00
thomasraoux	be8e2801a4	[mlir][vector][NFC] split TransposeOp lowerning out of contractLowering Move TransposeOp lowering in its own populate function as in some cases it is better to keep it during ContractOp lowering to better canonicalize it rather than emiting scalar insert/extract. Differential Revision: https://reviews.llvm.org/D101647	2021-05-03 10:23:45 -07:00
Uday Bondhugula	92153575e6	[MLIR] Fix TestAffineDataCopy for test cases with no load ops Add missing check in -test-affine-data-copy without which a test case that has no affine.loads at all would crash this test pass. Fix two clang-tidy warnings in the file while at this. (Not adding a test case given the triviality.) Differential Revision: https://reviews.llvm.org/D101719	2021-05-03 22:42:52 +05:30
Stella Laurenzo	b57d6fe42e	[mlir][Python] Add casting constructor to Type and Attribute. * This makes them consistent with custom types/attributes, whose constructors will do a type checked conversion. Of course, the base classes can represent everything so never error. * More importantly, this makes it possible to subclass Type and Attribute out of tree in sensible ways. Differential Revision: https://reviews.llvm.org/D101734	2021-05-03 10:12:03 -07:00
Frederik Gossen	d2a291a5f8	[MLIR][Linalg] Lower `linalg.tiled_loop` to `scf` loops Differential Revision: https://reviews.llvm.org/D101747	2021-05-03 18:47:12 +02:00
William S. Moses	039bdcc0a8	[MLIR] Canonicalize sub/add of a constant and another sub/add of a constant Differential Revision: https://reviews.llvm.org/D101705	2021-05-03 11:49:23 -04:00
Benjamin Kramer	cdeb4a8a64	[mlir] Allow lowering cmpi/cmpf with multidimensional vectors to LLVM Differential Revision: https://reviews.llvm.org/D101535	2021-05-03 11:30:21 +02:00
William S. Moses	78720296f3	[MLIR] Canonicalization of Integer Cast Operations 1) Canonicalize IndexCast(SExt(x)) => IndexCast(x) 2) Provide constant folds of sign_extend and truncate Differential Revision: https://reviews.llvm.org/D101714	2021-05-02 11:22:18 -04:00
William S. Moses	a2b5314cbc	[MLIR] Handle llvm.icmp of pointers Differential Revision: https://reviews.llvm.org/D101712	2021-05-02 01:17:50 -04:00
eopXD	0c1ff26bd3	[mlir] [affine] add canonicalization for affine.vector_load, vector_store Added canonicalization for vector_load and vector_store. An existing pattern SimplifyAffineOp can be reused to compose maps that supplies result into them. Added AffineVectorStoreOp and AffineVectorLoadOp into static_assert of SimplifyAffineOp to allow operation to use it. This fixes the bug filed: https://bugs.llvm.org/show_bug.cgi?id=50058 Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D101691	2021-05-02 09:06:46 +05:30
Aart Bik	0a29219931	[mlir][sparse] sparse tensor type encoding migration (new home, new builders) (1) migrates the encoding from TensorDialect into the new SparseTensorDialect (2) replaces dictionary-based storage and builders with struct-like data Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D101669	2021-04-30 19:30:38 -07:00
Ahmed Taei	499e89fc91	Add patterns to lower vector.multi_reduction into a sequence of vector.reduction Three patterns are added to convert into vector.multi_reduction into a sequence of vector.reduction as the following: - Transpose the inputs so inner most dimensions are always reduction. - Reduce rank of vector.multi_reduction into 2d with inner most reduction dim (get the 2d canical form) - 2D canonical form is converted into a sequence of vector.reduction. There are two things we might worth in a follow up diff: - An scf.for (maybe optionally) around vector.reduction instead of unrolling it. - Breakdown the vector.reduction into a sequence of vector.reduction (e.g tree-based reduction) instead of relying on how downstream dialects handle it. Note: this will requires passing target-vector-length Differential Revision: https://reviews.llvm.org/D101570	2021-04-30 10:52:21 -07:00
Aart Bik	319072f4e3	[mlir][sparse] migrate sparse operations into new sparse tensor dialect This is the very first step toward removing the glue and clutter from linalg and replace it with proper sparse tensor types. This revision migrates the LinalgSparseOps into SparseTensorOps of a sparse tensor dialect. This also provides a new home for sparse tensor related transformation. NOTE: the actual replacement with sparse tensor types (and removal of linalg glue/clutter) will follow but I am trying to keep the amount of changes per revision manageable. Differential Revision: https://reviews.llvm.org/D101573	2021-04-29 15:52:35 -07:00
Rob Suderman	be01b091af	[mlir][tosa] Remove constant-0 dim expr values from TOSA lowerings Constant-0 dim expr values should be avoided for linalg as it can prevent fusion. This includes adding support for rank-0 reshapes. Differential Revision: https://reviews.llvm.org/D101418	2021-04-29 15:06:03 -07:00
Mehdi Amini	086e0f05bf	Revert "[mlir][sparse] migrate sparse operations into new sparse tensor dialect" This reverts commit `a6d92a9711`. The build with -DBUILD_SHARED_LIBS=ON is broken.	2021-04-29 20:59:41 +00:00
Benjamin Kramer	b389c80963	[mlir] Fix lowering of multi-dimensional vector log1p to LLVM This was using the untransformed operand, leading to invalid IR. Differential Revision: https://reviews.llvm.org/D101531	2021-04-29 21:53:52 +02:00
Aart Bik	a6d92a9711	[mlir][sparse] migrate sparse operations into new sparse tensor dialect This is the very first step toward removing the glue and clutter from linalg and replace it with proper sparse tensor types. This revision migrates the LinalgSparseOps into SparseTensorOps of a sparse tensor dialect. This also provides a new home for sparse tensor related transformation. NOTE: the actual replacement with sparse tensor types (and removal of linalg glue/clutter) will follow but I am trying to keep the amount of changes per revision manageable. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D101488	2021-04-29 12:09:10 -07:00
Alex Zinenko	6841e6afba	[mlir] support max/min lower/upper bounds in affine.parallel This enables to express more complex parallel loops in the affine framework, for example, in cases of tiling by sizes not dividing loop trip counts perfectly or inner wavefront parallelism, among others. One can't use affine.max/min and supply values to the nested loop bounds since the results of such affine.max/min operations aren't valid symbols. Making them valid symbols isn't an option since they would introduce selection trees into memref subscript arithmetic as an unintended and undesired consequence. Also add support for converting such loops to SCF. Drop some API that isn't used in the core repo from AffineParallelOp since its semantics becomes ambiguous in presence of max/min bounds. Loop normalization is currently unavailable for such loops. Depends On D101171 Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D101172	2021-04-29 13:16:25 +02:00
Alex Zinenko	545fa37834	[mlir] Affine: parallelize affine loops with reductions Introduce a basic support for parallelizing affine loops with reductions expressed using iteration arguments. Affine parallelism detector now has a flag to assume such reductions are parallel. The transformation handles a subset of parallel reductions that are can be expressed using affine.parallel: integer/float addition and multiplication. This requires to detect the reduction operation since affine.parallel only supports a fixed set of reduction operators. Reviewed By: chelini, kumasento, bondhugula Differential Revision: https://reviews.llvm.org/D101171	2021-04-29 13:16:24 +02:00
Tres Popp	42e5f42215	[mlir] Support complex numbers in Linalg promotion FillOp allows complex ops, and filling a properly sized buffer with a default zero complex number is implemented. Differential Revision: https://reviews.llvm.org/D99939	2021-04-29 11:58:57 +02:00
Frederik Gossen	eb56fa97de	[MLIR][Shape] Fix `shape.broadcast` to standard lowering Differential Revision: https://reviews.llvm.org/D101456	2021-04-29 10:09:15 +02:00
Nicolas Vasilache	b6113db955	[mlir][Linalg] Generalize linalg vectorization This revision adds support for vectorizing more general linalg operations with projected permutation maps. This is achieved by eagerly broadcasting the intermediate vector to the common size of the iteration domain of the linalg op. This allows a much more natural expression of generalized vectorization but may introduce additional computations until all the proper canonicalizations are implemented. This generalization modifies the vector.transfer_read/write permutation logic and exposes the fact that the logic employed in vector.contract was too ad-hoc. As a consequence, changes occur in the permutation / transposition logic for contraction. In turn this prompts supporting more cases in the lowering of contract to matrix intrinsics, which is required to make the corresponding tests pass. Differential revision: https://reviews.llvm.org/D101165	2021-04-29 07:44:01 +00:00
Tobias Gysi	c2be2cda8d	[mlir][Python][Linalg] Adding const, capture, and index support to the OpDSL. The patch extends the OpDSL with support for: - Constant values - Capture scalar parameters - Access the iteration indices using the index operation - Provide predefined floating point and integer types. Up to now the patch only supports emitting the new nodes. The C++/yaml path is not fully implemented. The fill_rng_2d operation defined in emit_structured_generic.py makes use of the new DSL constructs. Differential Revision: https://reviews.llvm.org/D101364	2021-04-29 07:24:47 +00:00
Mike Urbach	49745f87e6	[mlir][python] Add `destroy` method to PyOperation. This adds a method to directly invoke `mlirOperationDestroy` on the MlirOperation wrapped by a PyOperation. Reviewed By: stellaraccident, mehdi_amini Differential Revision: https://reviews.llvm.org/D101422	2021-04-28 19:30:05 -06:00

1 2 3 4 5 ...

4089 Commits