llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	a004da0d77	[Canonicalize] Switch the default setting to "top down". This provides a sizable compile time improvement by seeding the worklist in an order that leads to less iterations of the worklist. This patch only changes the behavior of the Canonicalize pass itself, it does not affect other passes that use the GreedyPatternRewrite driver Differential Revision: https://reviews.llvm.org/D103053	2021-05-25 13:42:11 -07:00
Tres Popp	6054bfa813	[mlir] Support buffer hoisting on allocas This adds support for hoisting allocas in both BufferHoisting and BufferLoopHoisting. Differential Revision: https://reviews.llvm.org/D102681	2021-05-25 14:50:01 +02:00
Haruki Imai	000a05fd1a	[mlir] Normalize dynamic memrefs with a map of tiled-layout. Steps for normalizing dynamic memrefs for tiled layout map 1. Check if original map is tiled layout. Only tiled layout is supported. 2. Create normalized memrefType. Dimensions that include dynamic dimensions in the map output will be dynamic dimensions. 3. Create new maps to calculate each dimension size of new memref. In tiled layout, the dimension size can be calculated by replacing "floordiv <tile size>" with "ceildiv <tile size>" and "mod <tile size>" with "<tile size>". 4. Create AffineApplyOp to apply the new maps. The output of AffineApplyOp is dynamicSizes for new AllocOp. 5. Add the new dynamic sizes in new AllocOp. This patch also set MemRefsNormalizable trant in CastOp and DimOp since they used with dynamic memrefs. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D97655	2021-05-24 08:39:36 +05:30
Chris Lattner	81467f500f	[IR] Add a Location to BlockArgument This adds the ability to specify a location when creating BlockArguments. Notably Value::getLoc() will return this correctly, which makes diagnostics more precise (e.g. the example in test-legalize-type-conversion.mlir). This is currently optional to avoid breaking any existing code - if absent, the BlockArgument defaults to using the location of its enclosing operation (preserving existing behavior). The bulk of this change is plumbing location tracking through the parser and printer to make sure it can round trip (in -mlir-print-debuginfo mode). This is complete for generic operations, but requires manual adoption for custom ops. I added support for function-like ops to round trip their argument locations - they print correctly, but when parsing the locations are dropped on the floor. I intend to fix this, but it will require more invasive plumbing through "function_like_impl" stuff so I think it best to split it out to its own patch. This is a reapply of the patch here: https://reviews.llvm.org/D102567 with an additional change: we now never defer block argument locations, guaranteeing that we can round trip correctly. This isn't required in all cases, but allows us to hill climb here and works around unrelated bugs like https://bugs.llvm.org/show_bug.cgi?id=50451 Differential Revision: https://reviews.llvm.org/D102991	2021-05-23 14:10:00 -07:00
Richard Smith	80d981eda6	Revert "[IR] Add a Location to BlockArgument." and follow-on commit "[mlir] Speed up Lexer::getEncodedSourceLocation" This reverts commit `3043be9d2d` and commit `861d69a525`. This change resulted in printing textual MLIR that can't be parsed; see review thread https://reviews.llvm.org/D102567 for details.	2021-05-18 19:26:00 -07:00
Chris Lattner	3043be9d2d	[IR] Add a Location to BlockArgument. This adds the ability to specify a location when creating BlockArguments. Notably Value::getLoc() will return this correctly, which makes diagnostics more precise (e.g. the example in test-legalize-type-conversion.mlir). This is currently optional to avoid breaking any existing code - if absent, the BlockArgument defaults to using the location of its enclosing operation (preserving existing behavior). The bulk of this change is plumbing location tracking through the parser and printer to make sure it can round trip (in -mlir-print-debuginfo mode). This is complete for generic operations, but requires manual adoption for custom ops. I added support for function-like ops to round trip their argument locations - they print correctly, but when parsing the locations are dropped on the floor. I intend to fix this, but it will require more invasive plumbing through "function_like_impl" stuff so I think it best to split it out to its own patch. Differential Revision: https://reviews.llvm.org/D102567	2021-05-18 10:18:04 -07:00
Vinayaka Bandishti	a3917d3670	[MLIR][Affine] Privatize certain escaping memrefs During affine loop fusion, create private memrefs for escaping memrefs too under the conditions that: -- the source is not removed after fusion, and -- the destination does not write to the memref. This creates more fusion opportunities as illustrated in the test case. Reviewed By: bondhugula, ayzhuang Differential Revision: https://reviews.llvm.org/D102604	2021-05-18 22:23:02 +05:30
Chris Lattner	648f34a284	Merge with mainline. Differential Revision: https://reviews.llvm.org/D102636	2021-05-17 11:15:10 -07:00
Julian Gross	fc253e69f9	Fixed bug in buffer deallocation pass using unranked memref types. In the buffer deallocation pass, unranked memref types are not properly supported. After investigating this issue, it turns out that the Clone and Dealloc operation does not support unranked memref types in the current implementation. This patch adds the missing feature and enables the transformation of any memref type. This patch solves this bug: https://bugs.llvm.org/show_bug.cgi?id=48385 Differential Revision: https://reviews.llvm.org/D101760	2021-05-10 10:50:29 +02:00
Amy Zhuang	5dc1ed3f62	[mlir] Update dstNode after DenseMap insertion in loop fusion pass. Reviewed By: vinayaka-polymage Differential Revision: https://reviews.llvm.org/D101794	2021-05-06 15:23:59 -07:00
River Riddle	c1c1df6347	[mlir] Fix region successor bug in forward dataflow analysis We weren't properly visiting region successors when the terminator wasn't return like, which could create incorrect results in the analysis. This revision ensures that we properly visit region successors, to avoid optimistically assuming a value is constant when it isn't. Differential Revision: https://reviews.llvm.org/D101783	2021-05-04 14:50:37 -07:00
William S. Moses	ca27260701	[MLIR] Add SCF.if Condition Canonicalizations Add two canoncalizations for scf.if. 1) A canonicalization that allows users of a condition within an if to assume the condition is true if in the true region, etc. 2) A canonicalization that removes yielded statements that are equivalent to the condition or its negation Differential Revision: https://reviews.llvm.org/D101012	2021-04-26 20:13:08 -04:00
Butygin	f22d381385	[mlir] Canonicalize AllocOp's with only store and dealloc uses Differential Revision: https://reviews.llvm.org/D100268	2021-04-24 09:51:00 +03:00
Haruki Imai	39ee9fd8c1	[mlir] Fixed alignment attribute of alloc constant folding. When allocLikeOp is updated in alloc constant folding, alighnment attribute was ignored. This patch fixes it. Signed-off-by: Haruki Imai <imaihal@jp.ibm.com> Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D99882	2021-04-07 19:28:49 +00:00
Thomas Preud'homme	17f4f23eea	[MLIR, test] Fix use of undef FileCheck var MLIR test Transforms/canonicalize.mlir tries to check for the absence of a sequence of instructions with several CHECK-NOT with one of those directives using a variable defined in another. However CHECK-NOT are checked independently so that is using a variable defined in a pattern that should not occur in the input. This commit removes the dependency between those CHECK-NOT by replacing occurences of variables by the regex that were used to define them. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D99958	2021-04-06 16:58:09 +01:00
Aden Grue	3ba1b1cd20	Add a pattern to combine composed subview ops Differential Revision: https://reviews.llvm.org/D99229	2021-04-01 10:56:57 -07:00
Vinayaka Bandishti	dc537158d5	[MLIR][Affine] Add utility to check if the slice is valid Fixes a bug in affine fusion pipeline where an incorrect slice is computed. After the slice computation is done, original domain of the the source is compared with the new domain that will result if the fusion succeeds. If the new domain must be a subset of the original domain for the slice to be valid. If the slice computed is incorrect, fusion based on such a slice is avoided. Relevant test cases are added/edited. Fixes https://bugs.llvm.org/show_bug.cgi?id=49203 Differential Revision: https://reviews.llvm.org/D98239	2021-04-01 14:52:22 +05:30
Andrew Young	9c61c76b12	[mlir][cse] do not replace operands in previously simplified operations If an operation has been inserted as a key in to the known values hashtable, then it can not be modified in a way which changes its hash. This change avoids modifying the operands of any previously recorded operation, which prevents their hash from changing. In an SSACFG region, it is impossible to visit an operation before visiting its operands, so this is not a problem. This situation can only happen in regions without strict dominance, such as graph regions. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D99486	2021-03-31 12:20:34 -07:00
Alexander Belyaev	465b9a4a33	Revert "Revert "[mlir] Introduce CloneOp and adapt test cases in BufferDeallocation."" This reverts commit `883912abe6`.	2021-03-31 09:49:09 +02:00
Mehdi Amini	a360a9786f	Fix deletion of operations through the rewriter in a pattern matching a consumer operation This allows for the conversion to match `A(B()) -> C()` with a pattern matching `A` and marking `B` for deletion. Also add better assertions when an operation is erased while still having uses. Differential Revision: https://reviews.llvm.org/D99442	2021-03-30 22:02:14 +00:00
MaheshRavishankar	9b0517035f	[mlir] Enhance InferShapedTypeOpInterface and move LinalgOps to use them. A new `InterfaceMethod` is added to `InferShapedTypeOpInterface` that allows an operation to return the `Value`s for each dim of its results. It is intended for the case where the `Value` returned for each dim is computed using the operands and operation attributes. This interface method is for cases where the result dim of an operation can be computed independently, and it avoids the need to aggregate all dims of a result into a single shape value. This also implies that this is not suitable for cases where the result type is unranked (for which the existing interface methods is to be used). Also added is a canonicalization pattern that uses this interface and resolves the shapes of the output in terms of the shapes of the inputs. Moving Linalg ops to use this interface, so that many canonicalization patterns implemented for individual linalg ops to achieve the same result can be removed in favor of the added canonicalization pattern. Differential Revision: https://reviews.llvm.org/D97887	2021-03-29 11:39:48 -07:00
Alexander Belyaev	883912abe6	Revert "[mlir] Introduce CloneOp and adapt test cases in BufferDeallocation." This reverts commit `06b03800f3`. Until some kind of support for region args is added.	2021-03-29 12:47:59 +02:00
Julian Gross	06b03800f3	[mlir] Introduce CloneOp and adapt test cases in BufferDeallocation. Add a new clone operation to the memref dialect. This operation implicitly copies data from a source buffer to a new buffer. In contrast to the linalg.copy operation, this operation does not accept a target buffer as an argument. Instead, this operation performs a conceptual allocation which does not need to be performed manually. Furthermore, this operation resolves the dependency from the linalg-dialect in the BufferDeallocation pass. In addition, we also extended the canonicalization patterns to fold clone operations. The copy removal pass has been removed. Differential Revision: https://reviews.llvm.org/D99172	2021-03-29 10:19:10 +02:00
KareemErgawy-TomTom	c52a5f2aa7	MLIR][STD] Fold trunci (sexti). This patch folds the following pattern: ``` %arg0 = ... %0 = sexti %arg0 : i1 to i8 %1 = trunci %0 : i8 to i1 ``` into just `%arg0`. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D99464	2021-03-29 08:34:08 +02:00
KareemErgawy-TomTom	e5f2898bc7	[MLIR][STD] Fold trunci (zexti). This patch folds the following pattern: ``` %arg0 = ... %0 = zexti %arg0 : i1 to i8 %1 = trunci %0 : i8 to i1 ``` into just `%arg0`. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D99453	2021-03-27 19:40:10 +01:00
Uday Bondhugula	0b20413ef6	Revert "[Canonicalizer] Process regions top-down instead of bottom up & reuse existing constants." This reverts commit `361b7d125b` by Chris Lattner <clattner@nondot.org> dated Fri Mar 19 21:22:15 2021 -0700. The change to the greedy rewriter driver picking a different order was made without adequate analysis of the trade-offs and experimentation. A change like this has far reaching consequences on transformation pipelines, and a major impact upstream and downstream. For eg., one can’t be sure that it doesn’t slow down a large number of cases by small amounts or create other issues. More discussion here: https://llvm.discourse.group/t/speeding-up-canonicalize/3015/25 Reverting this so that improvements to the traversal order can be made on a clean slate, in bigger steps, and higher bar. Differential Revision: https://reviews.llvm.org/D99329	2021-03-25 22:17:26 +05:30
Mehdi Amini	973ddb7d6e	Define a `NoTerminator` traits that allows operations with a single block region to not provide a terminator In particular for Graph Regions, the terminator needs is just a historical artifact of the generalization of MLIR from CFG region. Operations like Module don't need a terminator, and before Module migrated to be an operation with region there wasn't any needed. To validate the feature, the ModuleOp is migrated to use this trait and the ModuleTerminator operation is deleted. This patch is likely to break clients, if you're in this case: - you may iterate on a ModuleOp with `getBody()->without_terminator()`, the solution is simple: just remove the ->without_terminator! - you created a builder with `Builder::atBlockTerminator(module_body)`, just use `Builder::atBlockEnd(module_body)` instead. - you were handling ModuleTerminator: it isn't needed anymore. - for generic code, a `Block::mayNotHaveTerminator()` may be used. Differential Revision: https://reviews.llvm.org/D98468	2021-03-25 03:59:03 +00:00
Chris Lattner	361b7d125b	[Canonicalizer] Process regions top-down instead of bottom up & reuse existing constants. This reapplies `b5d9a3c` / https://reviews.llvm.org/D98609 with a one line fix in processExistingConstants to skip() when erasing a constant we've already seen. Original commit message: 1) Change the canonicalizer to walk the function in top-down order instead of bottom-up order. This composes well with the "top down" nature of constant folding and simplification, reducing iterations and re-evaluation of ops in simple cases. 2) Explicitly enter existing constants into the OperationFolder table before canonicalizing. Previously we would "constant fold" them and rematerialize them, wastefully recreating a bunch fo constants, which lead to pointless memory traffic. Both changes together provide a 33% speedup for canonicalize on some mid-size CIRCT examples. One artifact of this change is that the constants generated in normal pattern application get inserted at the top of the function as the patterns are applied. Because of this, we get "inverted" constants more often, which is an aethetic change to the IR but does permute some testcases. Differential Revision: https://reviews.llvm.org/D99006	2021-03-20 16:30:15 -07:00
Chris Lattner	b2f232b830	[testsuite] Make testsuite more stable vs canonicalization change. NFC. Differential Revision: https://reviews.llvm.org/D98998	2021-03-19 18:11:12 -07:00
Andrew Young	f178c13fa8	[mlir] Support use-def cycles in graph regions during regionDCE When deleting operations in DCE, the algorithm uses a post-order walk of the IR to ensure that value uses were erased before value defs. Graph regions do not have the same structural invariants as SSA CFG, and this post order walk could delete value defs before uses. This problem is guaranteed to occur when there is a cycle in the use-def graph. This change stops DCE from visiting the operations and blocks in any meaningful order. Instead, we rely on explicitly dropping all uses of a value before deleting it. Reviewed By: mehdi_amini, rriddle Differential Revision: https://reviews.llvm.org/D98919	2021-03-18 23:06:45 -07:00
Julian Gross	e2310704d8	[MLIR] Create memref dialect and move dialect-specific ops from std. Create the memref dialect and move dialect-specific ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp AssumeAlignmentOp -> MemRef_AssumeAlignmentOp DeallocOp -> MemRef_DeallocOp DimOp -> MemRef_DimOp MemRefCastOp -> MemRef_CastOp MemRefReinterpretCastOp -> MemRef_ReinterpretCastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp LoadOp -> MemRef_LoadOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp SubViewOp -> MemRef_SubViewOp TransposeOp -> MemRef_TransposeOp TensorLoadOp -> MemRef_TensorLoadOp TensorStoreOp -> MemRef_TensorStoreOp TensorToMemRefOp -> MemRef_BufferCastOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D98041	2021-03-15 11:14:09 +01:00
Alex Zinenko	40d8e4d3f9	Revert "[Canonicalizer] Process regions top-down instead of bottom up & reuse existing constants." This reverts commit `b5d9a3c923`. The commit introduced a memory error in canonicalization/operation walking that is exposed when compiled with ASAN. It leads to crashes in some "release" configurations.	2021-03-15 10:27:55 +01:00
Chris Lattner	b5d9a3c923	[Canonicalizer] Process regions top-down instead of bottom up & reuse existing constants. Two changes: 1) Change the canonicalizer to walk the function in top-down order instead of bottom-up order. This composes well with the "top down" nature of constant folding and simplification, reducing iterations and re-evaluation of ops in simple cases. 2) Explicitly enter existing constants into the OperationFolder table before canonicalizing. Previously we would "constant fold" them and rematerialize them, wastefully recreating a bunch fo constants, which lead to pointless memory traffic. Both changes together provide a 33% speedup for canonicalize on some mid-size CIRCT examples. One artifact of this change is that the constants generated in normal pattern application get inserted at the top of the function as the patterns are applied. Because of this, we get "inverted" constants more often, which is an aethetic change to the IR but does permute some testcases. Differential Revision: https://reviews.llvm.org/D98609	2021-03-14 18:21:42 -07:00
Julian Gross	2aef202981	[mlir] Fix invalid hoisting of dependent allocs in buffer hoisting pass. Buffer hoisting moves allocs upwards although it has dependency within its nested region. This patch fixes this issue. https://bugs.llvm.org/show_bug.cgi?id=49142 Differential Revision: https://reviews.llvm.org/D98248	2021-03-11 11:46:16 +01:00
Lei Zhang	50000abe3c	[mlir] Use affine.apply when distributing to processors This makes it easy to compose the distribution computation with other affine computations. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D98171	2021-03-09 08:37:20 -05:00
Jacques Pienaar	dd2f50a4d0	[mlir] Improve test coverage for print-op-graph	2021-02-27 10:18:38 -08:00
Vinayaka Bandishti	ce0f10a1d1	[MLIR][affine] Certain Call Ops to prevent fusion Fixes a bug in affine fusion pipeline where an incorrect fusion is performed despite a Call Op that potentially modifies memrefs under consideration exists between source and target. Fixes part of https://bugs.llvm.org/show_bug.cgi?id=49220 Reviewed By: bondhugula, dcaballe Differential Revision: https://reviews.llvm.org/D97252	2021-02-26 15:27:41 +05:30
Tung D. Le	203d5eeec5	[MLIR][affine-loop-fusion] Handle defining ops between the source and dest loops This patch handles defining ops between the source and dest loop nests, and prevents loop nests with `iter_args` from being fused. If there is any SSA value in the dest loop nest whose defining op has dependence from the source loop nest, we cannot fuse the loop nests. If there is a `affine.for` with `iter_args`, prevent it from being fused. Reviewed By: dcaballe, bondhugula Differential Revision: https://reviews.llvm.org/D97030	2021-02-25 18:12:34 +02:00
Adam Straw	af8adea155	make Affine parallel and yield ops MemRefsNormalizable Affine parallel ops may contain and yield results from MemRefsNormalizable ops in the loop body. Thus, both affine.parallel and affine.yield should have the MemRefsNormalizable trait. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D96821	2021-02-23 10:16:47 -08:00
Vinayaka Bandishti	15332982c3	[MLIR][affine] Prevent fusion when ops with memory effect free are present between producer and consumer This commit fixes a bug in affine fusion pipeline where an incorrect fusion is performed despite a dealloc op is present between a producer and a consumer. This is done by creating a node for dealloc op in the MDG. Reviewed By: bondhugula, dcaballe Differential Revision: https://reviews.llvm.org/D97032	2021-02-22 23:21:02 +05:30
Jacques Pienaar	02d7b260c6	[mlir] Register the print-op-graph pass using ODS Move over to ODS & use pass options.	2021-02-20 15:42:02 -08:00
Nicolas Vasilache	b3c227a25a	[mlir] Better support for rank-reducing subview / subtensor type inference. Differential Revision: https://reviews.llvm.org/D96995	2021-02-19 08:30:50 +00:00
Alexander Belyaev	a89035d750	Revert "[MLIR] Create memref dialect and move several dialect-specific ops from std." This commit introduced a cyclic dependency: Memref dialect depends on Standard because it used ConstantIndexOp. Std depends on the MemRef dialect in its EDSC/Intrinsics.h Working on a fix. This reverts commit `8aa6c3765b`.	2021-02-18 12:49:52 +01:00
Julian Gross	8aa6c3765b	[MLIR] Create memref dialect and move several dialect-specific ops from std. Create the memref dialect and move several dialect-specific ops without dependencies to other ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp DeallocOp -> MemRef_DeallocOp MemRefCastOp -> MemRef_CastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp TransposeOp -> MemRef_TransposeOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D96425	2021-02-18 11:29:39 +01:00
Mehdi Amini	aa4e466caa	[mlir][Linalg] Improve region support in Linalg ops This revision takes advantage of the newly extended `ref` directive in assembly format to allow better region handling for LinalgOps. Specifically, FillOp and CopyOp now build their regions explicitly which allows retiring older behavior that relied on specific op knowledge in both lowering to loops and vectorization. This reverts commit `3f22547fd1` and reland `973e133b76` with a workaround for a gcc bug that does not accept lambda default parameters: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=59949 Differential Revision: https://reviews.llvm.org/D96598	2021-02-12 19:11:24 +00:00
Mehdi Amini	3f22547fd1	Revert "[mlir][Linalg] Improve region support in Linalg ops." This reverts commit `973e133b76`. It triggers an issue in gcc5 that require investigation, the build is broken with: /tmp/ccdpj3B9.s: Assembler messages: /tmp/ccdpj3B9.s:5821: Error: symbol `_ZNSt17_Function_handlerIFvjjEUljjE2_E9_M_invokeERKSt9_Any_dataOjS6_' is already defined /tmp/ccdpj3B9.s:5860: Error: symbol `_ZNSt14_Function_base13_Base_managerIUljjE2_E10_M_managerERSt9_Any_dataRKS3_St18_Manager_operation' is already defined	2021-02-12 18:15:51 +00:00
Nicolas Vasilache	973e133b76	[mlir][Linalg] Improve region support in Linalg ops. This revision takes advantage of the newly extended `ref` directive in assembly format to allow better region handling for LinalgOps. Specifically, FillOp and CopyOp now build their regions explicitly which allows retiring older behavior that relied on specific op knowledge in both lowering to loops and vectorization. Differential Revision: https://reviews.llvm.org/D96598	2021-02-12 14:51:03 +00:00
Stephan Herhut	4348d8ab7f	[mlir][math] Split off the math dialect. This does not split transformations, yet. Those will be done as future clean ups. Differential Revision: https://reviews.llvm.org/D96272	2021-02-12 10:55:12 +01:00
Tung D. Le	05c6c648ec	[MLIR] [affine-loop-fusion] Fix a bug about non-result ops in affine-loop-fusion This patch fixes the following bug when calling --affine-loop-fusion Input program: ```mlir func @should_not_fuse_since_top_level_non_affine_non_result_users( %in0 : memref<32xf32>, %in1 : memref<32xf32>) { %c0 = constant 0 : index %cst_0 = constant 0.000000e+00 : f32 affine.for %d = 0 to 32 { %lhs = affine.load %in0[%d] : memref<32xf32> %rhs = affine.load %in1[%d] : memref<32xf32> %add = addf %lhs, %rhs : f32 affine.store %add, %in0[%d] : memref<32xf32> } store %cst_0, %in0[%c0] : memref<32xf32> affine.for %d = 0 to 32 { %lhs = affine.load %in0[%d] : memref<32xf32> %rhs = affine.load %in1[%d] : memref<32xf32> %add = addf %lhs, %rhs: f32 affine.store %add, %in0[%d] : memref<32xf32> } return } ``` call --affine-loop-fusion, we got an incorrect output: ```mlir func @should_not_fuse_since_top_level_non_affine_non_result_users(%arg0: memref<32xf32>, %arg1: memref<32xf32>) { %c0 = constant 0 : index %cst = constant 0.000000e+00 : f32 store %cst, %arg0[%c0] : memref<32xf32> affine.for %arg2 = 0 to 32 { %0 = affine.load %arg0[%arg2] : memref<32xf32> %1 = affine.load %arg1[%arg2] : memref<32xf32> %2 = addf %0, %1 : f32 affine.store %2, %arg0[%arg2] : memref<32xf32> %3 = affine.load %arg0[%arg2] : memref<32xf32> %4 = affine.load %arg1[%arg2] : memref<32xf32> %5 = addf %3, %4 : f32 affine.store %5, %arg0[%arg2] : memref<32xf32> } return } ``` This happened because when analyzing the source and destination nodes, affine loop fusion ignored non-result ops sandwitched between them. In other words, the MemRefDependencyGraph in the affine loop fusion ignored these non-result ops. This patch solves the issue by adding these non-result ops to the MemRefDependencyGraph. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D95668	2021-02-06 13:30:16 +05:30
Alex Zinenko	5b91060dcc	[mlir] Apply source materialization in case of transitive conversion In dialect conversion infrastructure, source materialization applies as part of the finalization procedure to results of the newly produced operations that replace previously existing values with values having a different type. However, such operations may be created to replace operations created in other patterns. At this point, it is possible that the results of the _original_ operation are still in use and have mismatching types, but the results of the _intermediate_ operation that performed the type change are not in use leading to the absence of source materialization. For example, %0 = dialect.produce : !dialect.A dialect.use %0 : !dialect.A can be replaced with %0 = dialect.other : !dialect.A %1 = dialect.produce : !dialect.A // replaced, scheduled for removal dialect.use %1 : !dialect.A and then with %0 = dialect.final : !dialect.B %1 = dialect.other : !dialect.A // replaced, scheduled for removal %2 = dialect.produce : !dialect.A // replaced, scheduled for removal dialect.use %2 : !dialect.A in the same rewriting, but only the %1->%0 replacement is currently considered. Change the logic in dialect conversion to look up all values that were replaced by the given value and performing source materialization if any of those values is still in use with mismatching types. This is performed by computing the inverse value replacement mapping. This arguably expensive manipulation is performed only if there were some type-changing replacements. An alternative could be to consider all replaced operations and not only those that resulted in type changes, but it would harm pattern-level composability: the pattern that performed the non-type-changing replacement would have to be made aware of the type converter in order to call the materialization hook. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D95626	2021-02-04 11:15:11 +01:00
Mehdi Amini	a1d5bdf819	Make the folder more robust against op fold() methods that generate a type mismatch We could extend this with an interface to allow dialect to perform a type conversion, but that would make the folder creating operation which isn't the case at the moment, and isn't necessarily always desirable. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D95991	2021-02-04 01:58:56 +00:00
Alex Zinenko	0409eb2874	[mlir] Keep track of region signature conversions as argument replacements In dialect conversion, signature conversions essentially perform block argument replacement and are added to the general value remapping. However, the replaced values were not tracked, so if a signature conversion was rolled back, the construction of operand lists for the following patterns could have obtained block arguments from the mapping and give them to the pattern leading to use-after-free. Keep track of signature conversions similarly to normal block argument replacement, and erase such replacements from the general mapping when the conversion is rolled back. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D95688	2021-02-02 10:38:31 +01:00
Alexander Belyaev	8d7cbcf582	[mlir] Preserve lexicographic order after loop collapsing. Currently, for a scf.parallel (i,j,k) after the loop collapsing to 1D is done, the IVs would be traversed as for an scf.parallel(k,j,i). Differential Revision: https://reviews.llvm.org/D95693	2021-01-29 21:32:36 +01:00
Diego Caballero	c8fc5c0385	[mlir][Affine] Add support for multi-store producer fusion This patch adds support for producer-consumer fusion scenarios with multiple producer stores to the AffineLoopFusion pass. The patch introduces some changes to the producer-consumer algorithm, including: * For a given consumer loop, producer-consumer fusion iterates over its producer candidates until a fixed point is reached. * Producer candidates are gathered beforehand for each iteration of the consumer loop and visited in reverse program order (not strictly guaranteed) to maximize the number of loops fused per iteration. In general, these changes were needed to simplify the multi-store producer support and remove some of the workarounds that were introduced in the past to support more fusion cases under the single-store producer limitation. This patch also preserves the existing functionality of AffineLoopFusion with one minor change in behavior. Producer-consumer fusion didn't fuse scenarios with escaping memrefs and multiple outgoing edges (from a single store). Multi-store producer scenarios will usually (always?) have multiple outgoing edges so we couldn't fuse any with escaping memrefs, which would greatly limit the applicability of this new feature. Therefore, the patch enables fusion for these scenarios. Please, see modified tests for specific details. Reviewed By: andydavis1, bondhugula Differential Revision: https://reviews.llvm.org/D92876	2021-01-25 20:31:17 +02:00
Diego Caballero	735a07f047	Revert "[mlir][Affine] Add support for multi-store producer fusion" This reverts commit `7dd198852b`. ASAN issue.	2021-01-21 00:37:23 +02:00
Diego Caballero	7dd198852b	[mlir][Affine] Add support for multi-store producer fusion This patch adds support for producer-consumer fusion scenarios with multiple producer stores to the AffineLoopFusion pass. The patch introduces some changes to the producer-consumer algorithm, including: * For a given consumer loop, producer-consumer fusion iterates over its producer candidates until a fixed point is reached. * Producer candidates are gathered beforehand for each iteration of the consumer loop and visited in reverse program order (not strictly guaranteed) to maximize the number of loops fused per iteration. In general, these changes were needed to simplify the multi-store producer support and remove some of the workarounds that were introduced in the past to support more fusion cases under the single-store producer limitation. This patch also preserves the existing functionality of AffineLoopFusion with one minor change in behavior. Producer-consumer fusion didn't fuse scenarios with escaping memrefs and multiple outgoing edges (from a single store). Multi-store producer scenarios will usually (always?) have multiple outgoing edges so we couldn't fuse any with escaping memrefs, which would greatly limit the applicability of this new feature. Therefore, the patch enables fusion for these scenarios. Please, see modified tests for specific details. Reviewed By: andydavis1, bondhugula Differential Revision: https://reviews.llvm.org/D92876	2021-01-20 19:03:07 +02:00
Julian Gross	43f34f5834	Added check if there are regions that do not implement the RegionBranchOpInterface. Add a check if regions do not implement the RegionBranchOpInterface. This is not allowed in the current deallocation steps. Furthermore, we handle edge-cases, where a single region is attached and the parent operation has no results. This fixes: https://bugs.llvm.org/show_bug.cgi?id=48575 Differential Revision: https://reviews.llvm.org/D94586	2021-01-20 12:15:28 +01:00
Sean Silva	be7352c00d	[mlir][splitting std] move 2 more ops to `tensor` - DynamicTensorFromElementsOp - TensorFromElements Differential Revision: https://reviews.llvm.org/D94994	2021-01-19 13:49:25 -08:00
Andrew Young	a55a0a3056	[mlir] Remove over specified memory effects The standard and gpu dialect both have `alloc` operations which use the memory effect `MemAlloc`. In both cases, it is specified on both the operation itself and on the result. This results in two memory effects being created for these operations. When `MemAlloc` is defined on an operation, it represents some background effect which the compiler cannot reason about, and inhibits the ability of the compiler to remove dead `std.alloc` operations. This change removes the uneeded `MemAlloc` effect from these operations and leaves the effect on the result, which allows dead allocs to be erased. There is the same problem, but to a lesser extent, with MemFree, MemRead and MemWrite. Over-specifying these traits is not currently inhibiting any optimization. Differential Revision: https://reviews.llvm.org/D94662	2021-01-14 14:49:41 -08:00
River Riddle	c8fb6ee341	[mlir][PatternRewriter] Add a new hook to selectively replace uses of an operation This revision adds a new `replaceOpWithIf` hook that replaces uses of an operation that satisfy a given functor. If all uses are replaced, the operation gets erased in a similar manner to `replaceOp`. DialectConversion support will be added in a followup as this requires adjusting how replacements are tracked there. Differential Revision: https://reviews.llvm.org/D94632	2021-01-14 11:58:21 -08:00
River Riddle	93592b726c	[mlir][OpFormatGen] Format enum attribute cases as keywords when possible In the overwhelmingly common case, enum attribute case strings represent valid identifiers in MLIR syntax. This revision updates the format generator to format as a keyword in these cases, removing the need to wrap values in a string. The parser still retains the ability to parse the string form, but the printer will use the keyword form when applicable. Differential Revision: https://reviews.llvm.org/D94575	2021-01-14 11:35:49 -08:00
Alex Zinenko	2230bf99c7	[mlir] replace LLVMIntegerType with built-in integer type The LLVM dialect type system has been closed until now, i.e. did not support types from other dialects inside containers. While this has had obvious benefits of deriving from a common base class, it has led to some simple types being almost identical with the built-in types, namely integer and floating point types. This in turn has led to a lot of larger-scale complexity: simple types must still be converted, numerous operations that correspond to LLVM IR intrinsics are replicated to produce versions operating on either LLVM dialect or built-in types leading to quasi-duplicate dialects, lowering to the LLVM dialect is essentially required to be one-shot because of type conversion, etc. In this light, it is reasonable to trade off some local complexity in the internal implementation of LLVM dialect types for removing larger-scale system complexity. Previous commits to the LLVM dialect type system have adapted the API to support types from other dialects. Replace LLVMIntegerType with the built-in IntegerType plus additional checks that such types are signless (these are isolated in a utility function that replaced `isa<LLVMType>` and in the parser). Temporarily keep the possibility to parse `!llvm.i32` as a synonym for `i32`, but add a deprecation notice. Reviewed By: mehdi_amini, silvas, antiagainst Differential Revision: https://reviews.llvm.org/D94178	2021-01-07 19:48:31 +01:00
Kazuaki Ishizaki	2b638ed5a1	[mlir] NFC: fix trivial typos fix typos under docs, test, and tools directories Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D94158	2021-01-07 02:36:02 +09:00
Sean Silva	129d6e554e	[mlir] Move `std.tensor_cast` -> `tensor.cast`. This is almost entirely mechanical. Differential Revision: https://reviews.llvm.org/D93357	2020-12-17 16:06:56 -08:00
River Riddle	d7eba20052	[mlir][Inliner] Refactor the inliner to use nested pass pipelines instead of just canonicalization Now that passes have support for running nested pipelines, the inliner can now allow for users to provide proper nested pipelines to use for optimization during inlining. This revision also changes the behavior of optimization during inlining to optimize before attempting to inline, which should lead to a more accurate cost model and prevents the need for users to schedule additional duplicate cleanup passes before/after the inliner that would already be run during inlining. Differential Revision: https://reviews.llvm.org/D91211	2020-12-14 18:09:47 -08:00
Sean Silva	444822d77a	Revert "Revert "[mlir] Start splitting the `tensor` dialect out of `std`."" This reverts commit `0d48d265db`. This reapplies the following commit, with a fix for CAPI/ir.c: [mlir] Start splitting the `tensor` dialect out of `std`. This starts by moving `std.extract_element` to `tensor.extract` (this mirrors the naming of `vector.extract`). Curiously, `std.extract_element` supposedly works on vectors as well, and this patch removes that functionality. I would tend to do that in separate patch, but I couldn't find any downstream users relying on this, and the fact that we have `vector.extract` made it seem safe enough to lump in here. This also sets up the `tensor` dialect as a dependency of the `std` dialect, as some ops that currently live in `std` depend on `tensor.extract` via their canonicalization patterns. Part of RFC: https://llvm.discourse.group/t/rfc-split-the-tensor-dialect-from-std/2347/2 Differential Revision: https://reviews.llvm.org/D92991	2020-12-11 14:30:50 -08:00
Sean Silva	0d48d265db	Revert "[mlir] Start splitting the `tensor` dialect out of `std`." This reverts commit `cab8dda90f`. I mistakenly thought that CAPI/ir.c failure was unrelated to this change. Need to debug it.	2020-12-11 14:15:41 -08:00
Sean Silva	cab8dda90f	[mlir] Start splitting the `tensor` dialect out of `std`. This starts by moving `std.extract_element` to `tensor.extract` (this mirrors the naming of `vector.extract`). Curiously, `std.extract_element` supposedly works on vectors as well, and this patch removes that functionality. I would tend to do that in separate patch, but I couldn't find any downstream users relying on this, and the fact that we have `vector.extract` made it seem safe enough to lump in here. This also sets up the `tensor` dialect as a dependency of the `std` dialect, as some ops that currently live in `std` depend on `tensor.extract` via their canonicalization patterns. Part of RFC: https://llvm.discourse.group/t/rfc-split-the-tensor-dialect-from-std/2347/2 Differential Revision: https://reviews.llvm.org/D92991	2020-12-11 13:50:55 -08:00
River Riddle	c24f88b4db	[mlir][SCCP] Don't visit private callables unless they are used when tracking interprocedural arguments/results This fixes a subtle bug where SCCP could incorrectly optimize a private callable while waiting for its arguments to be resolved. Fixes PR#48457 Differential Revision: https://reviews.llvm.org/D92976	2020-12-10 12:53:27 -08:00
Haruki Imai	b2391d5f0d	[MLIR] Normalize the results of normalizable operations Memrefs with affine_map in the results of normalizable operation were not normalized by `--normalize-memrefs` option. This patch normalizes them. Differential Revision: https://reviews.llvm.org/D88719	2020-12-03 19:34:07 +05:30
Julian Gross	8aeca73702	[MLIR] Added support for dynamic shaped allocas to promote-buffers-to-stack pass. Extended promote buffers to stack pass to support dynamically shaped allocas. The conversion is limited by the rank of the underlying tensor. An option is added to the pass to adjust the given rank. Differential Revision: https://reviews.llvm.org/D91969	2020-12-03 11:47:49 +01:00
Sean Silva	774f1d3ffd	[mlir] Small cleanups to func-bufferize/finalizing-bufferize - Address TODO in scf-bufferize: the argument materialization issue is now fixed and the code is now in Transforms/Bufferize.cpp - Tighten up finalizing-bufferize to avoid creating invalid IR when operand types potentially change - Tidy up the testing of func-bufferize, and move appropriate tests to a new finalizing-bufferize.mlir - The new stricter checking in finalizing-bufferize revealed that we needed a DimOp conversion pattern (found when integrating into npcomp). Previously, the converion infrastructure was blindly changing the operand type during finalization, which happened to work due to DimOp's tensor/memref polymorphism, but is generally not encouraged (the new pattern is the way to tell the conversion infrastructure that it is legal to change that type).	2020-11-30 17:04:14 -08:00
Stephan Herhut	20c926e079	[mlir][DialectConversion] Do not prematurely drop unused cast operations The rewrite logic has an optimization to drop a cast operation after rewriting block arguments if the cast operation has no users. This is unsafe as there might be a pending rewrite that replaced the cast operation itself and hence would trigger a second free. Instead, do not remove the casts and leave it up to a later canonicalization to do so. Differential Revision: https://reviews.llvm.org/D92184	2020-11-26 17:39:14 +01:00
William S. Moses	f5c5fd1c50	[MLIR] Correct block merge bug Block merging in MLIR will incorrectly merge blocks with operations whose values are used outside of that block. This change forbids this behavior and provides a test where it is illegal to perform such a merge. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D91745	2020-11-20 19:12:59 +01:00
Tres Popp	b0750e2df6	Fix rollback of first block erasure in a region. Differential Revision: https://reviews.llvm.org/D91788	2020-11-19 21:24:10 +01:00
Stephan Herhut	c4472f8b4c	[mlir][std] Canonicalize extract_element(tensor_cast). Canonicalize extract_element(tensor_cast(v)) to just extract_element(v). Differential Revision: https://reviews.llvm.org/D91621	2020-11-17 14:41:39 +01:00
Rahul Joshi	b7382ed3fe	[MLIR] Extend Symbol verification to reject public symbol declarations. - Extend the Symbol interface with `isDeclaration` to identify operations that declare a symbol as opposed to define it. - Extend verification to disallow public declarations as per the discussion in https://llvm.discourse.group/t/rfc-symbol-definition-declaration-x-visibility-checks/2140 - Adopt the new interface for `FuncOp` and fix test and code to not have/create public function declarations. Differential Revision: https://reviews.llvm.org/D91456	2020-11-16 16:05:32 -08:00
Sean Silva	7c62c6313b	[mlir] Add DecomposeCallGraphTypes pass. This replaces the old type decomposition logic that was previously mixed into bufferization, and makes it easily accessible. This also deletes TestFinalizingBufferize, because after we remove the type decomposition, it doesn't do anything that is not already provided by func-bufferize. Differential Revision: https://reviews.llvm.org/D90899	2020-11-16 12:25:35 -08:00
Stephan Herhut	4a771108ac	[mlir][bufferize] Fix buffer promotion to stack for index types The index type does not have a bitsize and hence the size of corresponding allocations cannot be computed. Instead, the promotion pass now has an explicit option to specify the size of index. Differential Revision: https://reviews.llvm.org/D91360	2020-11-13 09:23:36 +01:00
Tres Popp	cc5b4a8603	[mlir] Rework DialectConversion inlineRegionBefore The previous logic for inlining a region A with N blocks into region B would produce incorrect results on rollback for N greater than 1. This rollback logic would leave blocks 1..N in region B and only move block 0 to region A. The new inlining action recording stores the block move actions from N-1 to 0. Now on roll back, block 0 is moved to region A and then 1..N is appended to the list of blocks in region A. Differential Revision: https://reviews.llvm.org/D91185	2020-11-11 10:42:33 +01:00
River Riddle	892605b449	[mlir][Asm] Add support for using an alias for trailing operation locations Locations often get very long and clutter up operations when printed inline with them. This revision adds support for using aliases with trailing operation locations, and makes printing with aliases the default behavior. Aliases in the trailing location take the form `loc(<alias>)`, such as `loc(#loc0)`. As with all aliases, using `mlir-print-local-scope` can be used to disable them and get the inline behavior. Differential Revision: https://reviews.llvm.org/D90652	2020-11-09 21:54:47 -08:00
River Riddle	ebcc022507	[mlir][AsmPrinter] Refactor printing to only print aliases for attributes/types that will exist in the output. This revision refactors the way that attributes/types are considered when generating aliases. Instead of considering all of the attributes/types of every operation, we perform a "fake" print step that prints the operations using a dummy printer to collect the attributes and types that would actually be printed during the real process. This removes a lot of attributes/types from consideration that generally won't end up in the final output, e.g. affine map attributes in an `affine.apply`/`affine.for`. This resolves a long standing TODO w.r.t aliases, and helps to have a much cleaner textual output format. As a datapoint to the latter, as part of this change several tests were identified as testing for the presence of attributes aliases that weren't actually referenced by the custom form of any operation. To ensure that this wouldn't cause a large degradation in compile time due to the second full print, I benchmarked this change on a very large module with a lot of operations(The file is ~673M/~4.7 million lines long). This file before this change take ~6.9 seconds to print in the custom form, and ~7 seconds after this change. In the custom assembly case, this added an average of a little over ~100 miliseconds to the compile time. This increase was due to the way that argument attributes on functions are structured and how they get printed; i.e. with a better representation the negative impact here can be greatly decreased. When printing in the generic form, this revision had no observable impact on the compile time. This benchmarking leads me to believe that the impact of this change on compile time w.r.t printing is closely related to `print` methods that perform a lot of additional/complex processing outside of the OpAsmPrinter. Differential Revision: https://reviews.llvm.org/D90512	2020-11-09 21:54:47 -08:00
Rahul Joshi	8b5a3e4632	[MLIR] Change FuncOp assembly syntax to print visibility inline instead of in attrib dict. - Change syntax for FuncOp to be `func <visibility>? @name` instead of printing the visibility in the attribute dictionary. - Since printFunctionLikeOp() and parseFunctionLikeOp() are also used by other operations, make the "inline visibility" an opt-in feature. - Updated unit test to use and check the new syntax. Differential Revision: https://reviews.llvm.org/D90859	2020-11-09 11:08:08 -08:00
Alex Zinenko	0c782c214b	[mlir] Add folding of memref_cast inside another memref_cast There exists a generic folding facility that folds the operand of a memref_cast into users of memref_cast that support this. However, it was not used for the memref_cast itself. Fix it to enable elimination of memref_cast chains such as %1 = memref_cast %0 : A to B %2 = memref_cast %1 : B to A that is achieved by combining the folding with the existing "A to A" cast elimination. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D90910	2020-11-06 10:42:40 +01:00
Sean Silva	f7bc568266	[mlir] Remove AppendToArgumentsList functionality from BufferizeTypeConverter. This functionality is superceded by BufferResultsToOutParams pass (see https://reviews.llvm.org/D90071) for users the require buffers to be out-params. That pass should be run immediately after all tensors are gone from the program (before buffer optimizations and deallocation insertion), such as immediately after a "finalizing" bufferize pass. The -test-finalizing-bufferize pass now defaults to what used to be the `allowMemrefFunctionResults=true` flag. and the finalizing-bufferize-allowed-memref-results.mlir file is moved to test/Transforms/finalizing-bufferize.mlir. Differential Revision: https://reviews.llvm.org/D90778	2020-11-05 11:20:09 -08:00
Nicolas Vasilache	ecca7852d9	[mlir][Linalg] Side effects interface for Linalg ops The LinalgDependenceGraph and alias analysis provide the necessary analysis for the Linalg fusion on buffers case. However this is not enough for linalg on tensors which require proper memory effects to play nicely with DCE and other transformations. This revision adds side effects to Linalg ops that were previously missing and has 2 consequences: 1. one example in the copy removal pass now fails since the linalg.generic op has side effects and the pass does not perform alias analysis / distinguish between reads and writes. 2. a few examples in fusion-tensor.mlir need to return the resulting tensor otherwise DCE automatically kicks in as part of greedy pattern application. Differential Revision: https://reviews.llvm.org/D90762	2020-11-05 09:00:28 +00:00
Alexandre Eichenberger	0795715616	[mlir][std] Add SignedCeilDivIOp and SignedFloorDivIOp with std to std lowering triggered by -std-expand-divs option. The new operations support positive/negative nominator/denominator numbers. Differential Revision: https://reviews.llvm.org/D89726 Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>	2020-11-04 14:16:23 -05:00
Alexander Bosch	5452fa6a59	[MLIR] Added test operations to replace linalg dependency for BufferizeTests. Summary: Added test operations to replace the LinalgDialect dependency in tests which use the buffer-deallocation, buffer-hoisting, buffer-loop-hoisting, promote-buffers-to-stack, buffer-placement-preparation-allowed-memref-resutls and buffer-placement-preparation pass. Adapted the corresponding tests cases and TestBufferPlacement.cpp. Differential Revision: https://reviews.llvm.org/D90037	2020-11-03 12:18:49 +01:00
Sean Silva	773ad135a3	[mlir][Bufferize] Rename TestBufferPlacement to TestFinalizingBufferize BufferPlacement is no longer part of bufferization. However, this test is an important test of "finalizing" bufferize passes. A "finalizing" bufferize conversion is one that performs a "full" conversion and expects all tensors to be gone from the program. This in particular involves rewriting funcs (including block arguments of the contained region), calls, and returns. The unique property of finalizing bufferization passes is that they cannot be done via a local transformation with suitable materializations to ensure composability (as other bufferization passes do). For example, if a call is rewritten, the callee needs to be rewritten otherwise the IR will end up invalid. Thus, finalizing bufferization passes require an atomic change to the entire program (e.g. the whole module). This new designation makes it clear also that it shouldn't be testing bufferization of linalg ops, so the tests have been updated to not use linalg.generic ops. (linalg.copy is still used as the "copy" op for copying into out-params) Differential Revision: https://reviews.llvm.org/D89979	2020-11-02 12:42:32 -08:00
Sean Silva	b866574246	[mlir] Add BufferResultsToOutParams pass. This pass allows removing getResultConversionKind from BufferizeTypeConverter. This pass replaces the AppendToArgumentsList functionality. As far as I could tell, the only use of this functionlity is to perform the transformation that is implemented in this pass. Future patches will remove the getResultConversionKind machinery from BufferizeTypeConverter, but sending this patch for individual review for clarity. Differential Revision: https://reviews.llvm.org/D90071	2020-10-30 14:06:14 -07:00
River Riddle	a463ea50a4	[mlir][ASM] Refactor how attribute/type aliases are specified. Previously they were separated into "instance" and "kind" aliases, and also required that the dialect know ahead of time all of the instances that would have a corresponding alias. This approach was very clunky and not ergonomic to interact with. The new approach is to provide the dialect with an instance of an attribute/type to provide an alias for, fully replacing the original split approach. Differential Revision: https://reviews.llvm.org/D89354	2020-10-30 00:39:46 -07:00
River Riddle	501fda0167	[mlir][Inliner] Add a new hook for checking if it is legal to inline a callable into a call In certain situations it isn't legal to inline a call operation, but this isn't something that is possible(at least not easily) to prevent with the current hooks. This revision adds a new hook so that dialects with call operations that shouldn't be inlined can prevent it. Differential Revision: https://reviews.llvm.org/D90359	2020-10-28 21:49:28 -07:00
Kazuaki Ishizaki	41b09f4eff	[mlir] NFC: fix trivial typos fix typos in comments and documents Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D90089	2020-10-29 04:05:22 +09:00
Julian Gross	0d1d363c51	[MLIR] Added PromoteBuffersToStackPass to convert heap- to stack-based allocations. Added optimization pass to convert heap-based allocs to stack-based allocas in buffer placement. Added the corresponding test file. Differential Revision: https://reviews.llvm.org/D89688	2020-10-23 12:02:25 +02:00
Marcel Koester	1b1c61ff47	[mlir] Refactored BufferPlacement transformation. The current BufferPlacement transformation contains several concepts for hoisting allocations. However, more advanced hoisting techniques should not be integrated into the BufferPlacement transformation. Hence, this CL refactors the current BufferPlacement pass into three separate pieces: BufferDeallocation and BufferAllocation(Loop)Hoisting. Moreover, it extends the hoisting functionality by allowing to move allocations out of loops. Differential Revision: https://reviews.llvm.org/D87756	2020-10-19 12:52:16 +02:00
River Riddle	a8feeee15f	[mlir] Add canonicalization for cond_br that feed into a cond_br on the same condition ``` ... cond_br %cond, ^bb1(...), ^bb2(...) ... ^bb1: // has single predecessor ... cond_br %cond, ^bb3(...), ^bb4(...) ``` -> ``` ... cond_br %cond, ^bb1(...), ^bb2(...) ... ^bb1: // has single predecessor ... br ^bb3(...) ``` Differential Revision: https://reviews.llvm.org/D89604	2020-10-18 13:51:02 -07:00
Stephan Herhut	307124535f	[mlir][standard] Fix parsing of scalar subview and canonicalize Parsing of a scalar subview did not create the required static_offsets attribute. This also adds support for folding scalar subviews away. Differential Revision: https://reviews.llvm.org/D89467	2020-10-15 16:41:54 +02:00
Sean Silva	9a14cb53cb	[mlir][bufferize] Rename BufferAssignment* to Bufferize* Part of the refactor discussed in: https://llvm.discourse.group/t/what-is-the-strategy-for-tensor-memref-conversion-bufferization/1938/17 Differential Revision: https://reviews.llvm.org/D89271	2020-10-14 12:39:16 -07:00
Alexander Belyaev	b98e5e0f7e	[mlir] Move Linalg tensors-to-buffers tests to Linalg tests. The buffer placement preparation tests in test/Transforms/buffer-placement-preparation* are using Linalg as a test dialect which leads to confusion and "copy-pasta", i.e. Linalg is being extended now and when TensorsToBuffers.cpp is changed, TestBufferPlacement is sometimes kept in-sync, which should not be the case. This has led to the unnoticed bug, because the tests were in a different directory and the patterns were slightly off. Differential Revision: https://reviews.llvm.org/D89209	2020-10-12 10:18:57 +02:00
Tatiana Shpeisman	9909ef292d	[mlir][scf] Fix a bug in scf::ForOp loop unroll with an epilogue Fixes a bug in formation and simplification of an epilogue loop generated during loop unroll of scf::ForOp (https://bugs.llvm.org/show_bug.cgi?id=46689) Differential Revision: https://reviews.llvm.org/D87583	2020-10-10 14:18:25 +05:30
Nicolas Vasilache	d8ee28b96e	[mlir][Linalg] Extend buffer allocation to support Linalg init tensors This revision adds init_tensors support to buffer allocation for Linalg on tensors. Currently makes the assumption that the init_tensors fold onto the first output tensors. This assumption is not currently enforced or cast in stone and requires experimenting with tiling linalg on tensors for ops without reductions. Still this allows progress towards the end-to-end goal.	2020-10-06 13:24:27 +00:00
Nicolas Vasilache	787bf5e383	[mlir] Add canonicalization for the `subtensor` op Differential revision: https://reviews.llvm.org/D88656	2020-10-02 06:05:52 -04:00
Haruki Imai	c1f8568031	[MLIR] Fix for updating function signature in normalizing memrefs Normalizing memrefs failed when a caller of symbolic use in a function can not be casted to `CallOp`. This patch avoids the failure by checking the result of the casting. If the caller can not be casted to `CallOp`, it is skipped. Differential Revision: https://reviews.llvm.org/D87746	2020-09-25 22:56:56 +05:30
Haruki Imai	ff00b58392	[MLIR] Normalize memrefs in LoadOp and StoreOp of Standard Ops Added a trait, `MemRefsNormalizable` in LoadOp and StoreOp of Standard Ops to normalize input memrefs in LoadOp and StoreOp. Related revision: https://reviews.llvm.org/D86236 Differential Revision: https://reviews.llvm.org/D88156	2020-09-24 18:57:15 +05:30
Nicolas Vasilache	ed229132f1	[mlir][Linalg] Uniformize linalg.generic with named ops. This revision allows representing a reduction at the level of linalg on tensors for generic ops by uniformizing with the named ops approach.	2020-09-22 04:13:22 -04:00
Tres Popp	ffdd4a46a9	[mlir] Shape.AssumingOp implements RegionBranchOpInterface. This adds support for the interface and provides unambigious information on the control flow as it is unconditional on any runtime values. The code is tested through confirming that buffer-placement behaves as expected. Differential Revision: https://reviews.llvm.org/D87894	2020-09-21 11:33:11 +02:00
Stephan Herhut	5e0ded2689	[mlir][Standard] Canonicalize chains of tensor_cast operations Adds a pattern that replaces a chain of two tensor_cast operations by a single tensor_cast operation if doing so will not remove constraints on the shapes.	2020-09-17 16:50:38 +02:00
Stephan Herhut	c897a7fb3e	[mlir][Standard] Add canonicalizer for dynamic_tensor_from_elements This add canonicalizer for - extracting an element from a dynamic_tensor_from_elements - propagating constant operands to the type of dynamic_tensor_from_elements Differential Revision: https://reviews.llvm.org/D87525	2020-09-15 15:38:14 +02:00
Marcel Koester	feb0b9c3bb	[mlir] Added support for loops to BufferPlacement transformation. The current BufferPlacement transformation cannot handle loops properly. Buffers passed via backedges will not be freed automatically introducing memory leaks. This CL adds support for loops to overcome these limitations. Differential Revision: https://reviews.llvm.org/D85513	2020-09-09 10:53:35 +02:00
Frederik Gossen	133322d2e3	[MLIR][Standard] Update `tensor_from_elements` assembly format Remove the redundant parenthesis that are used for none of the other operation formats. Differential Revision: https://reviews.llvm.org/D86287	2020-09-09 07:45:46 +00:00
Ehsan Toosi	4e9f4d0b9d	[mlir] Fix bug in copy removal A crash could happen due to copy removal. The bug is fixed and two more test cases are added. Differential Revision: https://reviews.llvm.org/D87128	2020-09-08 14:17:13 +02:00
Ehsan Toosi	39cf83cc78	[mlir] Extend BufferAssignmentTypeConverter with result conversion callbacks In this PR, the users of BufferPlacement can configure BufferAssginmentTypeConverter. These new configurations would give the user more freedom in the process of converting function signature, and return and call operation conversions. These are the new features: - Accepting callback functions for decomposing types (i.e. 1 to N type conversion such as unpacking tuple types). - Defining ResultConversionKind for specifying whether a function result with a certain type should be appended to the function arguments list or should be kept as function result. (Usage: converter.setResultConversionKind<MemRefType>(AppendToArgumentList)) - Accepting callback functions for composing or decomposing values (i.e. N to 1 and 1 to N value conversion). Differential Revision: https://reviews.llvm.org/D85133	2020-09-02 17:53:42 +02:00
Lei Zhang	1b88bbf5eb	Revert "[mlir] Extend BufferAssignmentTypeConverter with result conversion callbacks" This reverts commit `94f5d24877` because of failing the following tests: MLIR :: Dialect/Linalg/tensors-to-buffers.mlir MLIR :: Transforms/buffer-placement-preparation-allowed-memref-results.mlir MLIR :: Transforms/buffer-placement-preparation.mlir	2020-09-02 09:24:36 -04:00
Ehsan Toosi	94f5d24877	[mlir] Extend BufferAssignmentTypeConverter with result conversion callbacks In this PR, the users of BufferPlacement can configure BufferAssginmentTypeConverter. These new configurations would give the user more freedom in the process of converting function signature, and return and call operation conversions. These are the new features: - Accepting callback functions for decomposing types (i.e. 1 to N type conversion such as unpacking tuple types). - Defining ResultConversionKind for specifying whether a function result with a certain type should be appended to the function arguments list or should be kept as function result. (Usage: converter.setResultConversionKind<MemRefType>(AppendToArgumentList)) - Accepting callback functions for composing or decomposing values (i.e. N to 1 and 1 to N value conversion). Differential Revision: https://reviews.llvm.org/D85133	2020-09-02 13:26:55 +02:00
Vincent Zhao	28a7dfa33d	[MLIR] Fixed missing constraint append when adding an AffineIfOp domain The prior diff that introduced `addAffineIfOpDomain` missed appending constraints from the ifOp domain. This revision fixes this problem. Differential Revision: https://reviews.llvm.org/D86421	2020-08-28 00:34:23 +05:30
Alexandre E. Eichenberger	a14a2805b0	[MLIR] MemRef Normalization for Dialects When dealing with dialects that will results in function calls to external libraries, it is important to be able to handle maps as some dialects may require mapped data. Before this patch, the detection of whether normalization can apply or not, operations are compared to an explicit list of operations (`alloc`, `dealloc`, `return`) or to the presence of specific operation interfaces (`AffineReadOpInterface`, `AffineWriteOpInterface`, `AffineDMAStartOp`, or `AffineDMAWaitOp`). This patch add a trait, `MemRefsNormalizable` to determine if an operation can have its `memrefs` normalized. This trait can be used in turn by dialects to assert that such operations are compatible with normalization of `memrefs` with nontrivial memory layout specification. An example is given in the literal tests. Differential Revision: https://reviews.llvm.org/D86236	2020-08-27 20:26:59 +05:30
Kazuaki Ishizaki	a23d055912	[mlir] NFC: fix trivial typo under test and tools Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D86648	2020-08-27 15:37:42 +09:00
River Riddle	474f7639e3	[mlir] Fix bug in block merging when the types of the operands differ The merging algorithm was previously not checking for type equivalence. Fixes PR47314 Differential Revision: https://reviews.llvm.org/D86594	2020-08-26 01:17:20 -07:00
Alexander Belyaev	fed9ff5117	[mlir] Test CallOp STD->LLVM conversion. This exercises the corner case that was fixed in https://reviews.llvm.org/rG8979a9cdf226066196f1710903d13492e6929563. The bug can be reproduced when there is a @callee with a custom type argument and @caller has a producer of this argument passed to the @callee. Example: func @callee(!test.test_type) -> i32 func @caller() -> i32 { %arg = "test.type_producer"() : () -> !test.test_type %out = call @callee(%arg) : (!test.test_type) -> i32 return %out : i32 } Even though there is a type conversion for !test.test_type, the output IR (before the fix) contained a DialectCastOp: module { llvm.func @callee(!llvm.ptr<i8>) -> !llvm.i32 llvm.func @caller() -> !llvm.i32 { %0 = llvm.mlir.null : !llvm.ptr<i8> %1 = llvm.mlir.cast %0 : !llvm.ptr<i8> to !test.test_type %2 = llvm.call @callee(%1) : (!test.test_type) -> !llvm.i32 llvm.return %2 : !llvm.i32 } } instead of module { llvm.func @callee(!llvm.ptr<i8>) -> !llvm.i32 llvm.func @caller() -> !llvm.i32 { %0 = llvm.mlir.null : !llvm.ptr<i8> %1 = llvm.call @callee(%0) : (!llvm.ptr<i8>) -> !llvm.i32 llvm.return %1 : !llvm.i32 } } Differential Revision: https://reviews.llvm.org/D85914	2020-08-13 19:10:21 +02:00
avarmapml	6d4f7801b1	[MLIR] Support for ReturnOps in memref map layout normalization -- This commit handles the returnOp in memref map layout normalization. -- An initial filter is applied on FuncOps which helps us know which functions can be a suitable candidate for memref normalization which doesn't lead to invalid IR. -- Handles memref map normalization for external function assuming the external function is normalizable. Differential Revision: https://reviews.llvm.org/D85226	2020-08-13 19:10:47 +05:30
Vincent Zhao	654e8aadfd	[MLIR] Consider AffineIfOp when getting the index set of an Op wrapped in nested loops This diff attempts to resolve the TODO in `getOpIndexSet` (formerly known as `getInstIndexSet`), which states "Add support to handle IfInsts surronding `op`". Major changes in this diff: 1. Overload `getIndexSet`. The overloaded version considers both `AffineForOp` and `AffineIfOp`. 2. The `getInstIndexSet` is updated accordingly: its name is changed to `getOpIndexSet` and its implementation is based on a new API `getIVs` instead of `getLoopIVs`. 3. Add `addAffineIfOpDomain` to `FlatAffineConstraints`, which extracts new constraints from the integer set of `AffineIfOp` and merges it to the current constraint system. 4. Update how a `Value` is determined as dim or symbol for `ValuePositionMap` in `buildDimAndSymbolPositionMaps`. Differential Revision: https://reviews.llvm.org/D84698	2020-08-09 03:16:03 +05:30
Nicolas Vasilache	2a01d7f7b6	[mlir][SCF] Add utility to outline the then and else branches of an scf.IfOp Differential Revision: https://reviews.llvm.org/D85449	2020-08-07 14:49:49 -04:00
Diego Caballero	3bfbc5df87	[MLIR][Affine] Fix createPrivateMemRef in affine fusion Always define a remapping for the memref replacement (`indexRemap`) with the proper number of inputs, including all the `outerIVs`, so that the number of inputs and the operands provided for the map don't mismatch. Reviewed By: bondhugula, andydavis1 Differential Revision: https://reviews.llvm.org/D85177	2020-08-04 12:17:48 -07:00
MaheshRavishankar	e888886cc3	[mlir][DialectConversion] Add support for mergeBlocks in ConversionPatternRewriter. Differential Revision: https://reviews.llvm.org/D84795	2020-08-03 10:06:04 -07:00
Julian Gross	6d47431d7e	[mlir] Extended Buffer Assignment to support AllocaOps. Added support for AllocaOps in Buffer Assignment. Differential Revision: https://reviews.llvm.org/D85017	2020-08-03 11:20:30 +02:00
Abhishek Varma	76d07503f0	[MLIR] Introduce inter-procedural memref layout normalization -- Introduces a pass that normalizes the affine layout maps to the identity layout map both within and across functions by rewriting function arguments and call operands where necessary. -- Memref normalization is now implemented entirely in the module pass '-normalize-memrefs' and the limited intra-procedural version has been removed from '-simplify-affine-structures'. -- Run using -normalize-memrefs. -- Return ops are not handled and would be handled in the subsequent revisions. Signed-off-by: Abhishek Varma <abhishek.varma@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D84490	2020-07-30 18:12:56 +05:30
Stephan Herhut	823ffef009	[mlir][Standard] Allow unranked memrefs as operands to dim and rank `std.dim` currently only accepts ranked memrefs and `std.rank` is limited to tensors. Differential Revision: https://reviews.llvm.org/D84790	2020-07-29 14:42:58 +02:00
Anand Kodnani	834133c950	[MLIR] Vector store to load forwarding The MemRefDataFlow pass does store to load forwarding only for affine store/loads. This patch updates the pass to use affine read/write interface which enables vector forwarding. Reviewed By: dcaballe, bondhugula, ftynse Differential Revision: https://reviews.llvm.org/D84302	2020-07-28 11:30:54 -07:00
River Riddle	4589dd924d	[mlir][DialectConversion] Enable deeper integration of type conversions This revision adds support for much deeper type conversion integration into the conversion process, and enables auto-generating cast operations when necessary. Type conversions are now largely automatically managed by the conversion infra when using a ConversionPattern with a provided TypeConverter. This removes the need for patterns to do type cast wrapping themselves and moves the burden to the infra. This makes it much easier to perform partial lowerings when type conversions are involved, as any lingering type conversions will be automatically resolved/legalized by the conversion infra. To support this new integration, a few changes have been made to the type materialization API on TypeConverter. Materialization has been split into three separate categories: * Argument Materialization: This type of materialization is used when converting the type of block arguments when calling `convertRegionTypes`. This is useful for contextually inserting additional conversion operations when converting a block argument type, such as when converting the types of a function signature. * Source Materialization: This type of materialization is used to convert a legal type of the converter into a non-legal type, generally a source type. This may be called when uses of a non-legal type persist after the conversion process has finished. * Target Materialization: This type of materialization is used to convert a non-legal, or source, type into a legal, or target, type. This type of materialization is used when applying a pattern on an operation, but the types of the operands have not yet been converted. Differential Revision: https://reviews.llvm.org/D82831	2020-07-23 19:40:31 -07:00
Haruki Imai	7f44a7130b	[MLIR] Set alignment in AllocOp of normalizeMemref() AllocOp is updated in normalizeMemref(AllocOp allocOp), but, when the AllocOp has `alignment` attribute, it was ignored and updated AllocOp does not have `alignment` attribute. This patch fixes it. Differential Revision: https://reviews.llvm.org/D83656	2020-07-22 12:34:35 +05:30
River Riddle	b98f414a04	[mlir][DialectConversion] Emit an error if an operation marked as erased has live users after conversion Up until now, there has been an implicit agreement that when an operation is marked as "erased" all uses of that operation's results are guaranteed to be removed during conversion. How this works in practice is that there is either an assert/crash/asan failure/etc. This revision adds support for properly detecting when an erased operation has dangling users, emits and error and fails the conversion. Differential Revision: https://reviews.llvm.org/D82830	2020-07-14 13:06:08 -07:00
Alexander Belyaev	1a2ed71a8a	[mlir] Support unranked types in func signature conversion in BufferPlacement. Currently, only ranked tensor args and results can be converted to memref types. Differential Revision: https://reviews.llvm.org/D83324	2020-07-07 19:43:48 +02:00
River Riddle	9db53a1827	[mlir][NFC] Remove usernames and google bug numbers from TODO comments. These were largely leftover from when MLIR was a google project, and don't really follow LLVM guidelines.	2020-07-07 01:40:52 -07:00
Julian Gross	91c320e9d8	[mlir] Add check for ViewLikeOpInterface that creates additional aliases. ViewLikeOpInterfaces introduce new aliases that need to be added to the alias list. This is necessary to place deallocs in the right positions. Differential Revision: https://reviews.llvm.org/D83044	2020-07-03 16:38:21 +02:00
Ehsan Toosi	0f03b2bfda	[mlir] Add redundant copy removal transform This pass removes redundant dialect-independent Copy operations in different situations like the following: %from = ... %to = ... ... (no user/alias for %to) copy(%from, %to) ... (no user/alias for %from) dealloc %from use(%to) Differential Revision: https://reviews.llvm.org/D82757	2020-07-03 15:36:25 +02:00
Marcel Koester	6f5da84f7b	[mlir] Extended BufferPlacement to support nested region control flow. Summary: The current BufferPlacement implementation does not support nested region control flow. This CL adds support for nested regions via the RegionBranchOpInterface and the detection of branch-like (ReturnLike) terminators inside nested regions. Differential Revision: https://reviews.llvm.org/D81926	2020-06-30 12:10:01 +02:00
Tobias Gysi	652a79659a	[mlir] fix off-by-one error in collapseParallelLoops Summary: The patch fixes an off by one error in the method collapseParallelLoops. It ensures the same normalized bound is used for the computation of the division and the remainder. Reviewers: herhut Reviewed By: herhut Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, stephenneuendorffer, Joonsoo, grosul1, Kayjukh, jurahul, msifontes Tags: #mlir Differential Revision: https://reviews.llvm.org/D82634	2020-06-26 15:39:46 +02:00
Tung D. Le	2b5d1776ff	[MLIR][Affine-loop-fusion] Fix a bug in affine-loop-fusion pass when there are non-affine operations When there is a mix of affine load/store and non-affine operations (e.g. std.load, std.store), affine-loop-fusion ignores the present of non-affine ops, thus changing the program semantics. E.g. we have a program of three affine loops operating on the same memref in which one of them uses std.load and std.store, as follows. ``` affine.for affine.store %1 affine.for std.load %1 std.store %1 affine.for affine.load %1 affine.store %1 ``` affine-loop-fusion will produce the following result which changed the program semantics: ``` affine.for std.load %1 std.store %1 affine.for affine.store %1 affine.load %1 affine.store %1 ``` This patch is to fix the above problem by checking non-affine users of the memref that are between the source and destination nodes of interest. Differential Revision: https://reviews.llvm.org/D82158	2020-06-26 18:26:42 +05:30
Tobias Gysi	48f1d4fcd2	[mlir] parallel loop canonicalization Summary: The patch introduces a canonicalization pattern for parallel loops. The pattern removes single-iteration loop dimensions if the loop bounds and steps are constants. Reviewers: herhut, ftynse Reviewed By: herhut Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, stephenneuendorffer, Joonsoo, grosul1, Kayjukh, jurahul, msifontes Tags: #mlir Differential Revision: https://reviews.llvm.org/D82191	2020-06-26 09:57:08 +02:00
Uday Bondhugula	aec5344f48	[MLIR] Fix affine loop fusion private memref alloc Drop stale code that provided the wrong operands to alloc. Reported-by: rjnw on discourse Differential Revision: https://reviews.llvm.org/D82409	2020-06-24 22:19:29 +05:30
River Riddle	8d67d187ba	[mlir][DialectConversion] Refactor how block argument types get converted This revision removes the TypeConverter parameter passed to the apply* methods, and instead moves the responsibility of region type conversion to patterns. The types of a region can be converted using the 'convertRegionTypes' method, which acts similarly to the existing 'applySignatureConversion'. This method ensures that all blocks within, and including those moved into, a region will have the block argument types converted using the provided converter. This has the benefit of making more of the legalization logic controlled by patterns, instead of being handled explicitly by the driver. It also opens up the possibility to support multiple type conversions at some point in the future. This revision also adds a new utility class `FailureOr<T>` that provides a LogicalResult friendly facility for returning a failure or a valid result value. Differential Revision: https://reviews.llvm.org/D81681	2020-06-18 15:59:22 -07:00
River Riddle	80d7ac3bc7	[mlir] Allow for patterns to match any root kind. Traditionally patterns have always had the root operation kind hardcoded to a specific operation name. This has worked well for quite some time, but it has certain limitations that make it undesirable. For example, some lowering have the same implementation for many different operations types with a few lowering entire dialects using the same pattern implementation. This problem has led to several "solutions": a) Provide a template implementation to the user so that they can instantiate it for each operation combination, generally requiring the inclusion of the auto-generated operation definition file. b) Use a non-templated pattern that allows for providing the name of the operation to match - No one ever does this, because enumerating operation names can be cumbersome and so this quickly devolves into solution a. This revision removes the restriction that patterns have a hardcoded root type, and allows for a class patterns that could match "any" operation type. The major downside of root-agnostic patterns is that they make certain pattern analyses more difficult, so it is still very highly encouraged that an operation specific pattern be used whenever possible. Differential Revision: https://reviews.llvm.org/D82066	2020-06-18 13:58:47 -07:00
River Riddle	f4ef77cbb4	[mlir][Inliner] Properly handle callgraph node deletion We previously weren't properly updating the SCC iterator when nodes were removed, leading to asan failures in certain situations. This commit adds a CallGraphSCC class and defers operation deletion until inlining has finished. Differential Revision: https://reviews.llvm.org/D81984	2020-06-17 15:45:56 -07:00
Uday Bondhugula	7965dd79a3	[MLIR] Fix memref region compute for 0-d memref accesses Fix memref region compute for 0-d memref accesses in certain cases (when there are loops surrounding such 0-d accesses). Differential Revision: https://reviews.llvm.org/D81792	2020-06-16 13:59:53 +05:30
MaheshRavishankar	462e3ccdd0	[mlir][StandardDialect] Add some folding for operations in standard dialect. Add the following canonicalization - and(x, 1) -> x - subi(x, 0) -> x Differential Revision: https://reviews.llvm.org/D81534	2020-06-15 22:52:29 -07:00
Marcel Koester	ff4c510337	[mlir] Extended BufferPlacement to support more sophisticated scenarios in which allocations cannot be moved freely and can remain in divergent control flow. The current BufferPlacement pass does not support allocation nodes that carry additional dependencies (like in the case of dynamic shaped types). These allocations can often not be moved freely and in turn might remain in divergent control-flow branches. This requires a different strategy with respect to block arguments and aliases. This CL adds additinal functionality to support allocation nodes in divergent control flow while avoiding memory leaks. Differential Revision: https://reviews.llvm.org/D79850	2020-06-15 12:19:23 +02:00
Mehdi Amini	95371ce9c2	Enable FileCheck -enable-var-scope by default in MLIR test This option avoids to accidentally reuse variable across -LABEL match, it can be explicitly opted-in by prefixing the variable name with $ Differential Revision: https://reviews.llvm.org/D81531	2020-06-12 00:43:09 +00:00
Diego Caballero	2e7a084591	[mlir][Affine] Revisit fusion candidates after successful fusion This patch changes the fusion algorithm so that after fusing two loop nests we revisit previously visited nodes so that they are considered again for fusion in the context of the new fused loop nest. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D81609	2020-06-11 14:53:08 -07:00
Frederik Gossen	904f91db5f	[MLIR][Standard] Make the `dim` operation index an operand. Allow for dynamic indices in the `dim` operation. Rather than an attribute, the index is now an operand of type `index`. This allows to apply the operation to dynamically ranked tensors. The correct lowering of dynamic indices remains to be implemented. Differential Revision: https://reviews.llvm.org/D81551	2020-06-10 13:54:47 +00:00
Mehdi Amini	d31c9e5a46	Change filecheck default to dump input on failure Having the input dumped on failure seems like a better default: I debugged FileCheck tests for a while without knowing about this option, which really helps to understand failures. Remove `-dump-input-on-failure` and the environment variable FILECHECK_DUMP_INPUT_ON_FAILURE which are now obsolete. Differential Revision: https://reviews.llvm.org/D81422	2020-06-09 18:57:46 +00:00

1 2 3 4 5 ...

697 Commits