llvm-project

Commit Graph

Author	SHA1	Message	Date
Aart Bik	06e2a0684e	[mlir][sparse] sampled matrix multiplication fusion test This integration tests runs a fused and non-fused version of sampled matrix multiplication. Both should eventually have the same performance! NOTE: relies on pending tensor.init fix! Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D110444	2021-09-27 11:50:49 -07:00
Aart Bik	ec97a205c3	[mlir][sparse] preserve zero-initialization for materializing buffers This revision makes sure that when the output buffer materializes locally (in contrast with the passing in of output tensors either in-place or not in-place), the zero initialization assumption is preserved. This also adds a bit more documentation on our sparse kernel assumption (viz. TACO assumptions). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D110442	2021-09-27 11:22:05 -07:00
Sumesh Udayakumaran	b2af2aeea6	[mlir] Mode for explicitly controlling the fusion kind New mode option that allows for either running the default fusion kind that happens today or doing either of producer-consumer or sibling fusion. This will also be helpful to minimize the compile-time of the fusion tests. Reviewed By: bondhugula, dcaballe Differential Revision: https://reviews.llvm.org/D110102	2021-09-27 20:37:42 +03:00
William S. Moses	6dd5b1e33e	[MLIR][LLVM] Add error if using incorrect attribute type for specifying LLVM linkage Address post-commit review in https://reviews.llvm.org/D108524 to add appropriate diagnostics. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D110566	2021-09-27 13:24:05 -04:00
Bixia Zheng	fbd5821c6f	Implement the conversion from sparse constant to sparse tensors. The sparse constant provides a constant tensor in coordinate format. We first split the sparse constant into a constant tensor for indices and a constant tensor for values. We then generate a loop to fill a sparse tensor in coordinate format using the tensors for the indices and the values. Finally, we convert the sparse tensor in coordinate format to the destination sparse tensor format. Add tests. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D110373	2021-09-27 09:47:29 -07:00
Eugene Zhulenev	92db09cde0	[mlir] AsyncRuntime: use int64_t for ref counting operations Workaround for SystemZ ABI problem: https://bugs.llvm.org/show_bug.cgi?id=51898 Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D110550	2021-09-27 07:55:01 -07:00
Nicolas Vasilache	b74493ecea	[mlir][Linalg] Refactor padding hoisting - NFC This revision extracts padding hoisting in a new file and cleans it up in prevision of future improvements and extensions. Differential Revision: https://reviews.llvm.org/D110414	2021-09-27 09:50:31 +00:00
Matthias Springer	ffdf0a370d	[mlir][vector] Fix bug in vector-transfer-full-partial-split When splitting with linalg.copy, cannot write into the destination alloc directly. Instead, write into a subview of the alloc. Differential Revision: https://reviews.llvm.org/D110512	2021-09-27 18:12:17 +09:00
Mehdi Amini	b3891f28a3	Fix ClangTidyLegacy warning: "'virtual' is redundant since the function is already declared 'final' " (NFC)	2021-09-26 22:02:23 +00:00
River Riddle	ef764eeeb9	[mlir:ElementsAttr] Avoid crash on empty contiguous ranges We currently, incorrectly, assume that a range always has at least one element when building a contiguous range. This commit adds a proper empty check to avoid crashing. Differential Revision: https://reviews.llvm.org/D110457	2021-09-24 23:48:51 +00:00
Lei Zhang	b45476c94c	[mlir][tosa] Do not fold transpose with quantized types For such cases, the type of the constant DenseElementsAttr is different from the transpose op return type. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D110446	2021-09-24 16:57:55 -04:00
Diego Caballero	2a876a711d	[mlir] Create a generic reduction detection utility This patch introduces a generic reduction detection utility that works across different dialecs. It is mostly a generalization of the reduction detection algorithm in Affine. The reduction detection logic in Affine, Linalg and SCFToOpenMP have been replaced with this new generic utility. The utility takes some basic components of the potential reduction and returns: 1) the reduced value, and 2) a list with the combiner operations. The logic to match reductions involving multiple combiner operations disabled until we can properly test it. Reviewed By: ftynse, bondhugula, nicolasvasilache, pifon2a Differential Revision: https://reviews.llvm.org/D110303	2021-09-24 20:45:59 +00:00
River Riddle	aca9bea199	[mlir:MemRef] Move DmaStartOp/DmaWaitOp to ODS These are among the last operations still defined explicitly in C++. I've tried to keep this commit as NFC as possible, but these ops definitely need a non-NFC cleanup at some point. Differential Revision: https://reviews.llvm.org/D110440	2021-09-24 19:35:28 +00:00
Lei Zhang	e325ebb9c7	[mlir][tosa] Add some transpose folders * If the input is a constant splat value, we just need to reshape it. * If the input is a general constant with one user, we can also constant fold it, without bloating the IR. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D110439	2021-09-24 15:25:14 -04:00
River Riddle	ef976337f5	[mlir:OpConversion] Remove the remaing usages of the deprecated matchAndRewrite methods This commits updates the remaining usages of the ArrayRef<Value> based matchAndRewrite/rewrite methods in favor of the new OpAdaptor overload. Differential Revision: https://reviews.llvm.org/D110360	2021-09-24 17:51:41 +00:00
Alex Zinenko	5988a3b7a0	[mlir] Linalg: ensure tile-and-pad always creates padding as requested Initially, the padding transformation and the related operation were only used to guarantee static shapes of subtensors in tiled operations. The transformation would not insert the padding operation if the shapes were already static, and the overall code generation would actively remove such "noop" pads. However, this transformation can be also used to pack data into smaller tensors and marshall them into faster memory, regardless of the size mismatches. In context of expert-driven transformation, we should assume that, if padding is requested, a potentially padded tensor must be always created. Update the transformation accordingly. To do this, introduce an optional `packing` attribute to the `pad_tensor` op that serves as an indication that the padding is an intentional choice (as opposed to side effect of type normalization) and should be left alone by cleanups. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110425	2021-09-24 18:40:13 +02:00
Alex Zinenko	3f89e339bb	[mlir] add pad_tensor(tensor.cast) -> pad_tensor canonicalizer This canonicalization pattern complements the tensor.cast(pad_tensor) one in propagating constant type information when possible. It contributes to the feasibility of pad hoisting. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110343	2021-09-24 12:03:47 +02:00
Matthias Springer	f3f25ffc04	[mlir][linalg] Fix result type in FoldSourceTensorCast * Do not discard static result type information that cannot be inferred from lower/upper padding. * Add optional argument to `PadTensorOp::inferResultType` for specifying known result dimensions. Differential Revision: https://reviews.llvm.org/D110380	2021-09-24 16:47:18 +09:00
Mehdi Amini	83f3c615dd	Add missing storageType to AttrDef to ODS This is only noticeable when using an attribute across dialects I think. Previously the namespace would be ommited, but it wouldn't matter as long as the generated code stays within a single namespace. Differential Revision: https://reviews.llvm.org/D110367	2021-09-24 01:30:29 +00:00
Matthias Springer	2190f8a8b1	[mlir][linalg] Support tile+peel with TiledLoopOp Only scf.for was supported until now. Differential Revision: https://reviews.llvm.org/D110220	2021-09-24 10:23:31 +09:00
Matthias Springer	8dc16ba8d2	[mlir][linalg] Merge all tiling passes into a single one. Passes such as `linalg-tile-to-tiled-loop` are merged into `linalg-tile`. Differential Revision: https://reviews.llvm.org/D110214	2021-09-24 10:16:46 +09:00
John Demme	47cc166bc0	[MLIR] [Python] Make Attribute and Type hashable Enables putting types and attributes in sets and in dicts as keys. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D110301	2021-09-22 19:59:03 -07:00
Aart Bik	a924fcc7c3	[mlir][sparse] add sparse kernels test to sparse compiler test suite This test makes sure kernels map to efficient sparse code, i.e. all compressed for-loops, no co-iterating while loops. In addition, this revision removes the special constant folding inside the sparse compiler in favor of Mahesh' new generic linalg folding. Thanks! NOTE: relies on Mahesh fix, which needs to be rebased first Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D110001	2021-09-22 14:56:39 -07:00
Tyler Augustine	cd36bab4ca	Fix bug for Ops with default valued attributes and successors/variadic regions. When both a DefaultValuedAttr and a successor or variadic region was specified, this would generate invalid C++ declaration. There would be the parameter with a default value, followed by the successors/regions, which don't have a default, which is invalid. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D110205	2021-09-22 21:22:31 +00:00
MaheshRavishankar	a40a08ed98	[mlir][Linalg] Teach constant -> generic op fusion to handle scalar constants. The current folder of constant -> generic op only handles splat constants. The same logic holds for scalar constants. Teach the pattern to handle such cases. Differential Revision: https://reviews.llvm.org/D109982	2021-09-22 13:41:47 -07:00
River Riddle	6e60bb6883	[mlir:DataFlowAnalysis] Reprocess the arguments of already executable edges This fixes a bug where we discover new information about the arguments of an already executable edge, but don't visit the arguments. We only visit the arguments, and not the block itself, so this commit shouldn't really affect performance at all. Fixes PR#51871 Differential Revision: https://reviews.llvm.org/D110197	2021-09-22 20:14:55 +00:00
Aart Bik	5da21338bc	[mlir][sparse] generalize reduction support in sparse compiler Now not just SUM, but also PRODUCT, AND, OR, XOR. The reductions MIN and MAX are still to be done (also depends on recognizing these operations in cmp-select constructs). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D110203	2021-09-22 12:36:46 -07:00
Alex Zinenko	bdaf038266	[mlir] Always create a list of alias scopes when emitting LLVM IR Previously, the translation to LLVM IR would emit IR that directly uses a scope metadata node in case only one scope was in use in alias.scopes or noalias metadata. It should always be a list of scopes. The verifier change in `8700f2bd36` enforced this and broke the test. Fix the translation to always create a list of scopes using a new metadata node, update and reenable the respective test. Fixes PR51919. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D110140	2021-09-22 00:00:46 +02:00
Tobias Gysi	8b5236def5	[mlir][linalg] Simplify slice dim computation for fusion on tensors (NFC). Compute the tiled producer slice dimensions directly starting from the consumer not using the producer at all. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110147	2021-09-21 15:09:46 +00:00
Nicolas Vasilache	101d017a64	[mlir][Linalg] Revisit heuristic ordering of tensor.insert_slice in comprehensive bufferize. It was previously assumed that tensor.insert_slice should be bufferized first in a greedy fashion to avoid out-of-place bufferization of the large tensor. This heuristic does not hold upon further inspection. This CL removes the special handling of such ops and adds a test that exhibits better behavior and appears in real use cases. The only test adversely affected is an artificial test which results in a returned memref: this pattern is not allowed by comprehensive bufferization in real scenarios anyway and the offending test is deleted. Differential Revision: https://reviews.llvm.org/D110072	2021-09-21 14:22:45 +00:00
Nicolas Vasilache	0d2c54e851	[mlir][Linalg] Revisit RAW dependence interference in comprehensive bufferize. Previously, comprehensive bufferize would consider all aliasing reads and writes to the result buffer and matching operand. This resulted in spurious dependences being considered and resulted in too many unnecessary copies. Instead, this revision revisits the gathering of read and write alias sets. This results in fewer alloc and copies. An exhaustive test cases is added that considers all possible permutations of `matmul(extract_slice(fill), extract_slice(fill), ...)`.	2021-09-21 14:22:22 +00:00
Morten Borup Petersen	032cb1650f	[MLIR][SCF] Add for-to-while loop transformation pass This pass transforms SCF.ForOp operations to SCF.WhileOp. The For loop condition is placed in the 'before' region of the while operation, and indctuion variable incrementation + the loop body in the 'after' region. The loop carried values of the while op are the induction variable (IV) of the for-loop + any iter_args specified for the for-loop. Any 'yield' ops in the for-loop are rewritten to additionally yield the (incremented) induction variable. This transformation is useful for passes where we want to consider structured control flow solely on the basis of a loop body and the computation of a loop condition. As an example, when doing high-level synthesis in CIRCT, the incrementation of an IV in a for-loop is "just another part" of a circuit datapath, and what we really care about is the distinction between our datapath and our control logic (the condition variable). Differential Revision: https://reviews.llvm.org/D108454	2021-09-21 09:09:54 +01:00
Chris Lattner	58abc8c34b	[OpAsmParser] Add a parseCommaSeparatedList helper and beef up Delimeter. Lots of custom ops have hand-rolled comma-delimited parsing loops, as does the MLIR parser itself. Provides a standard interface for doing this that is less error prone and less boilerplate. While here, extend Delimiter to support <> and {} delimited sequences as well (I have a use for <> in CIRCT specifically). Differential Revision: https://reviews.llvm.org/D110122	2021-09-20 20:59:11 -07:00
River Riddle	d80d3a358f	[mlir] Refactor ElementsAttr into an AttrInterface This revision refactors ElementsAttr into an Attribute Interface. This enables a common interface with which to interact with element attributes, without needing to modify the builtin dialect. It also removes a majority (if not all?) of the need for the current OpaqueElementsAttr, which was originally intended as a way to opaquely represent data that was not representable by the other builtin constructs. The new ElementsAttr interface not only allows for users to natively represent their data in the way that best suits them, it also allows for efficient opaque access and iteration of the underlying data. Attributes using the ElementsAttr interface can directly expose support for interacting with the held elements using any C++ data type they claim to support. For example, DenseIntOrFpElementsAttr supports iteration using various native C++ integer/float data types, as well as APInt/APFloat, and more. ElementsAttr instances that refer to DenseIntOrFpElementsAttr can use all of these data types for iteration: ```c++ DenseIntOrFpElementsAttr intElementsAttr = ...; ElementsAttr attr = intElementsAttr; for (uint64_t value : attr.getValues<uint64_t>()) ...; for (APInt value : attr.getValues<APInt>()) ...; for (IntegerAttr value : attr.getValues<IntegerAttr>()) ...; ``` ElementsAttr also supports failable range/iterator access, allowing for selective code paths depending on data type support: ```c++ ElementsAttr attr = ...; if (auto range = attr.tryGetValues<uint64_t>()) { for (uint64_t value : *range) ...; } ``` Differential Revision: https://reviews.llvm.org/D109190	2021-09-21 01:57:43 +00:00
River Riddle	4f21152af1	[mlir] Tighten verification of SparseElementsAttr SparseElementsAttr currently does not perform any verfication on construction, with the only verification existing within the parser. This revision moves the parser verification to SparseElementsAttr, and also adds additional verification for when a sparse index is not valid. Differential Revision: https://reviews.llvm.org/D109189	2021-09-21 01:57:42 +00:00
Chia-hung Duan	bb2506061b	[mlir-tblgen] Add DagNode StaticMatcher. Some patterns may share the common DAG structures. Generate a static function to do the match logic to reduce the binary size. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D105797	2021-09-20 23:37:42 +00:00
MaheshRavishankar	4cf9bf6c9f	[mlir][MemRef] Compute unused dimensions of a rank-reducing subviews using strides as well. For `memref.subview` operations, when there are more than one unit-dimensions, the strides need to be used to figure out which of the unit-dims are actually dropped. Differential Revision: https://reviews.llvm.org/D109418	2021-09-20 11:05:30 -07:00
MaheshRavishankar	0b33890f45	[mlir][Linalg] Add ConvolutionOpInterface. Add an interface that allows grouping together all covolution and pooling ops within Linalg named ops. The interface currently - the indexing map used for input/image access is valid - the filter and output are accessed using projected permutations - that all loops are charecterizable as one iterating over - batch dimension, - output image dimensions, - filter convolved dimensions, - output channel dimensions, - input channel dimensions, - depth multiplier (for depthwise convolutions) Differential Revision: https://reviews.llvm.org/D109793	2021-09-20 10:41:10 -07:00
Mehdi Amini	5edd79fc97	Revert "[MLIR][SCF] Add for-to-while loop transformation pass" This reverts commit `644b55d57e`. The added test is failing the bots.	2021-09-20 17:21:59 +00:00
Mehdi Amini	f18f1ab4fd	Temporarily XFAIL MLIR test that fails the LLVM verifier after `8700f2bd3`	2021-09-20 17:20:11 +00:00
Tobias Gysi	7be28d82b4	[mlir][linalg] Add IndexOp support to fusion on tensors. This revision depends on https://reviews.llvm.org/D109761 and https://reviews.llvm.org/D109766. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109774	2021-09-20 15:59:35 +00:00
Morten Borup Petersen	644b55d57e	[MLIR][SCF] Add for-to-while loop transformation pass This pass transforms SCF.ForOp operations to SCF.WhileOp. The For loop condition is placed in the 'before' region of the while operation, and indctuion variable incrementation + the loop body in the 'after' region. The loop carried values of the while op are the induction variable (IV) of the for-loop + any iter_args specified for the for-loop. Any 'yield' ops in the for-loop are rewritten to additionally yield the (incremented) induction variable. This transformation is useful for passes where we want to consider structured control flow solely on the basis of a loop body and the computation of a loop condition. As an example, when doing high-level synthesis in CIRCT, the incrementation of an IV in a for-loop is "just another part" of a circuit datapath, and what we really care about is the distinction between our datapath and our control logic (the condition variable). Differential Revision: https://reviews.llvm.org/D108454	2021-09-20 16:57:50 +01:00
Tobias Gysi	6db928b8f3	[mlir][linalg] Fusion on tensors. Add a new version of fusion on tensors that supports the following scenarios: - support input and output operand fusion - fuse a producer result passed in via tile loop iteration arguments (update the tile loop iteration arguments) - supports only linalg operations on tensors - supports only scf::for - cannot add an output to the tile loop nest The LinalgTileAndFuseOnTensors pass tiles the root operation and fuses its producers. Reviewed By: nicolasvasilache, mravishankar Differential Revision: https://reviews.llvm.org/D109766	2021-09-20 14:45:34 +00:00
Valentin Clement	d6929aaa67	[mlir][openacc] Make use of the second counter extension in DataOp translation Make use of runtime extension for the second reference counter used in structured data region. This extension is implemented in D106510 and D106509. Differential Revision: https://reviews.llvm.org/D106517	2021-09-20 13:43:50 +02:00
KareemErgawy-TomTom	bdcf4b9b96	[MLIR][Linalg] Make detensoring cost-model more flexible. So far, the CF cost-model for detensoring was limited to discovering pure CF structures. This means, if while discovering the CF component, the cost-model found any op that is not detensorable, it gives up on detensoring altogether. This patch makes it a bit more flexible by cleaning-up the detensorable component from non-detensorable ops without giving up entirely. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D109965	2021-09-20 10:21:31 +02:00
Vladislav Vinogradov	ec03bbe8a7	[mlir] Fix bug in partial dialect conversion The discussion on forum: https://llvm.discourse.group/t/bug-in-partial-dialect-conversion/4115 The `applyPartialConversion` didn't handle the operations, that were marked as illegal inside dynamic legality callback. Instead of reporting error, if such operation was not converted to legal set, the method just added it to `unconvertedSet` in the same way as unknown operations. This patch fixes that and handle dynamically illegal operations as well. The patch includes 2 fixes for existing passes: * `tensor-bufferize` - explicitly mark `std.return` as legal. * `convert-parallel-loops-to-gpu` - ugly fix with marking visited operations to avoid recursive legality checks. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D108505	2021-09-20 10:39:10 +03:00
Jacques Pienaar	0a1e569d37	[mlir-c] Add getting fused loc For creating a fused loc using array of locations and metadata. Differential Revision: https://reviews.llvm.org/D110022	2021-09-18 06:57:51 -07:00
Uday Bondhugula	57eda9becc	[MLIR][GPU] Add constant propagator for gpu.launch op Add a constant propagator for gpu.launch op in cases where the grid/thread IDs can be trivially determined to take a single constant value of zero. Differential Revision: https://reviews.llvm.org/D109994	2021-09-18 12:02:46 +05:30
Aart Bik	46e77b5d10	[mlir][sparse] add a sparse quantized_matmul example to integration test Note that this revision adds a very tiny bit of constant folding in the sparse compiler lattice construction. Although I am generally trying to avoid such canonicalizations (and rely on other passes to fix this instead), the benefits of avoiding a very expensive disjunction lattice construction justify having this special code (at least for now). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109939	2021-09-17 13:04:44 -07:00
Aart Bik	d4e16171e8	[mlir][sparse] add dce test for all sparse tensor ops Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D109992	2021-09-17 13:03:42 -07:00

1 2 3 4 5 ...

4582 Commits