llvm-project

Commit Graph

Author	SHA1	Message	Date
River Riddle	ace01605e0	[mlir] Split out a new ControlFlow dialect from Standard This dialect is intended to model lower level/branch based control-flow constructs. The initial set of operations are: AssertOp, BranchOp, CondBranchOp, SwitchOp; all split out from the current standard dialect. See https://discourse.llvm.org/t/standard-dialect-the-final-chapter/6061 Differential Revision: https://reviews.llvm.org/D118966	2022-02-06 14:51:16 -08:00
Jacques Pienaar	88c525235b	[mlir] Add pass to privatize symbols unless excluded. Simple pass that changes all symbols to private unless symbol is excluded (and in which case there is no change to symbol's visibility). Differential Revision: https://reviews.llvm.org/D118752	2022-02-03 20:20:54 -08:00
River Riddle	dec8af701f	[mlir] Move SelectOp from Standard to Arithmetic This is part of splitting up the standard dialect. See https://llvm.discourse.group/t/standard-dialect-the-final-chapter/ for discussion. Differential Revision: https://reviews.llvm.org/D118648	2022-02-02 14:45:12 -08:00
River Riddle	6a8ba3186e	[mlir] Split std.splat into tensor.splat and vector.splat This is part of the larger effort to split the standard dialect. This will also allow for pruning some additional dependencies on Standard (done in a followup). Differential Revision: https://reviews.llvm.org/D118202	2022-02-02 14:45:12 -08:00
Alexander Belyaev	ebc8153786	Revert "Revert "[mlir] Purge `linalg.copy` and use `memref.copy` instead."" This reverts commit `25bf6a2a9b`.	2022-02-01 18:21:21 +01:00
Alexander Belyaev	25bf6a2a9b	Revert "[mlir] Purge `linalg.copy` and use `memref.copy` instead." This reverts commit `016956b680`. Reverting it to fix NVidia build without being in a hurry.	2022-01-31 18:51:39 +01:00
Alexander Belyaev	016956b680	[mlir] Purge `linalg.copy` and use `memref.copy` instead. Differential Revision: https://reviews.llvm.org/D118028	2022-01-31 18:25:56 +01:00
Uday Bondhugula	92ccb8cc50	[MLIR][NFC] Update SCF pass cmd line names to prefix scf Update SCF pass cmd line names to prefix `scf`. This is consistent with guidelines/convention on how to name dialect passes. This also avoids ambiguity on the context given the multiple `for` operations in the tree. NFC. Differential Revision: https://reviews.llvm.org/D118564	2022-01-31 07:09:30 +05:30
Benjamin Kramer	b70366c9c4	[mlir][BufferOptimization] Use datalayout instead of a flag to find index size This has the additional advantage of supporting more types. Differential Revision: https://reviews.llvm.org/D118348	2022-01-27 13:50:29 +01:00
Uday Bondhugula	fa5c5230d9	[MLIR] NFC. Rename pass cmd-line to prefix affine Prefix "affine-" to affine transform passes that were missing it -- to avoid ambiguity and for uniformity. Only two needed this. Move misplaced affine coalescing test case file. NFC. Differential Revision: https://reviews.llvm.org/D118314	2022-01-27 13:01:39 +05:30
Mogball	3628febcf8	[mlir] NFC control-flow sink cleanup	2022-01-24 23:34:42 +00:00
Mogball	572fa9642c	[mlir] Add a ControlFlowSink pass. Control-Flow Sink moves operations whose only uses are in conditionally-executed regions into those regions so that paths in which their results are not needed do not perform unnecessary computation. Depends on D115087 Reviewed By: jpienaar, rriddle, bondhugula Differential Revision: https://reviews.llvm.org/D115088	2022-01-24 23:08:34 +00:00
Mogball	5c36ee8d57	[mlir] Drop the leading space when printing regions The leading space that is always printed at the beginning of regions is not consistent with other parts of the printing API. Moreover, this leading space can lead to undesirable assembly formats: ``` attr-dict-with-keyword $region ``` Prints as: ``` // Two spaces between `}` and `{` attributes {foo} { ... } ``` Moreover, the leading space results in the odd generic op format: ``` "test.op"() ( {...}) : () -> () ``` Reviewed By: rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D117411	2022-01-18 16:52:34 +00:00
Dominik Grewe	5f782d25a7	Preserve argument locations when cloning a region. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D117403	2022-01-16 21:17:23 +00:00
Uday Bondhugula	fc61d07dc1	Add inliner interface for GPU dialect Add inliner interface for GPU dialect. The interface marks all GPU dialect ops legal to inline anywhere. Differential Revision: https://reviews.llvm.org/D116889	2022-01-12 12:55:02 +05:30
Tyler Augustine	87a9be2a74	Don't fail if unable to promote loops during unrolling When the unroll factor is 1, we should only fail "unrolling" when the trip count also is determined to be 1 and it is unable to be promoted. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D115365	2022-01-10 22:26:21 +00:00
Alex Zinenko	2f672e2ffa	[mlir] Don't inline calls from dead SCCs During iterative inlining of the functions in a multi-step call chain, the inliner could add the same call operation several times to the worklist, which led to use-after-free when this op was considered more than once. Closes #52887. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D116820	2022-01-10 12:07:14 +01:00
Mogball	4ca5e95c6f	[mlir] Symbol DCE ignores unknown symbols Instead of failing when it encounters a reference to an unknown symbol, Symbol DCE should ignore them. References to unknown symbols do not affect the overall function of Symbol DCE, so it should not need to fail when it encounters one. In general, requiring that symbol references always be valid rather than only when necessary can be overly conservative. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D116047	2022-01-05 20:48:30 +00:00
Nicolas Vasilache	bb2f87af0a	[mlir] Fix missing check on nested op values in LICM LICM checks that nested ops depend only on values defined outside before performing hoisting. However, it specifically omits to check for terminators which can lead to SSA violations. This revision fixes the incorrect behavior. Differential Revision: https://reviews.llvm.org/D116657	2022-01-05 09:31:23 -05:00
Mogball	41a64338cc	[mlir] Add getNumThreads to MLIRContext Querying threads directly from the thread pool fails if there is no thread pool or if multithreading is not enabled. Returns 1 by default. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D116259	2021-12-24 02:02:54 +00:00
Alexander Belyaev	15f8f3e20a	[mlir] Split std.rank into tensor.rank and memref.rank. Move `std.rank` similarly to how `std.dim` was moved to TensorOps and MemRefOps. Differential Revision: https://reviews.llvm.org/D115665	2021-12-14 10:15:55 +01:00
Alexander Belyaev	f89bb3c012	[mlir] Move bufferization-related passes to `bufferization` dialect. [RFC](https://llvm.discourse.group/t/rfc-dialect-for-bufferization-related-ops/4712) Differential Revision: https://reviews.llvm.org/D114698	2021-11-30 09:58:47 +01:00
Alexander Belyaev	57470abc41	[mlir] Move memref.[tensor_load|buffer_cast|clone] to "bufferization" dialect. https://llvm.discourse.group/t/rfc-dialect-for-bufferization-related-ops/4712 Differential Revision: https://reviews.llvm.org/D114552	2021-11-25 11:50:39 +01:00
Groverkss	98daa4e425	[MLIR] Fix incorrect removal of source loop in loop fusion This patch fixes a bug in the loop fusion pass where the source loop was removed even when the fused loop did not cover all iterations of the source loop. This was because the fast heuristic check for whether the source loop and the fused loop have the same iterations did not take loop steps into account. Reviewed By: dcaballe, bondhugula Differential Revision: https://reviews.llvm.org/D114164	2021-11-23 02:54:09 +05:30
Alex Zinenko	9c5982ef8e	[mlir] support recursive types in type conversion infra MLIR supports recursive types but they could not be handled by the conversion infrastructure directly as it would result in infinite recursion in `convertType` for elemental types. Support this case by keeping the "call stack" of nested type conversions in the TypeConverter class and by passing it as an optional argument to the individual conversion callback. The callback can then check if a specific type is present on the stack more than once to detect and handle the recursive case. This approach is preferred to the alternative approach of having a separate callback dedicated to handling only the recursive case as the latter was observed to introduce ~3% time overhead on a 50MB IR file even if it did not contain recursive types. This approach is also preferred to keeping a local stack in type converters that need to handle recursive types as that would compose poorly in case of out-of-tree or cross-project extensions. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D113579	2021-11-22 18:16:02 +01:00
lipracer	8165eaa885	[mlir](arithmetic) Add ceildivui to the arithmetic dialect The specific description is [[ https://llvm.discourse.group/t/adding-unsigned-integer-ceil-and-floor-in-std-dialect/4541 | Adding unsigned integer ceil in Std Dialect ]]. Lowering ceilDivOp generates the code below. Sometimes we know m and n are unsigned integers, in which case the checks on their signs are redundant. So we need to add some unsigned operations to simplify the instructions. ``` ceilDiv(n, m) x = (m > 0) ? -1 : 1 return (n*m>0) ? ((n+x) / m) + 1 : - (-n / m) ``` unsigned operations: ``` ceilDivU(n, m) return n == 0 ? 0 : ((n - 1) / m) + 1 ``` Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D113363	2021-11-11 01:49:14 +00:00
River Riddle	4070f305f9	[mlir][DialectConversion] Legalize all live argument conversions Previously we didn't materialize conversions for arguments in certain cases as the implicit type propagation was being heavily relied on by many patterns. Now that those patterns have been fixed to properly handle type conversions, we can drop the special behavior. Differential Revision: https://reviews.llvm.org/D113233	2021-11-05 18:43:56 +00:00
River Riddle	7f312f6d79	[mlir] Avoid folding in OpBuilder::tryFold when types change This was missed when tightening fold restrictions in https://reviews.llvm.org/D95991. Differential Revision: https://reviews.llvm.org/D113138	2021-11-03 20:35:46 +00:00
River Riddle	015192c634	[mlir:DialectConversion] Restructure how argument/target materializations get invoked The current implementation invokes materializations whenever an input operand does not have a mapping for the desired type, i.e. it requires materialization at the earliest possible point. This conflicts with the goal of dialect conversion (and also the current documentation), which states that a materialization is only required if the materialization is supposed to persist after the conversion process has finished. This revision refactors this such that whenever a target materialization "might" be necessary, we insert an unrealized_conversion_cast to act as a temporary materialization. This allows for deferring the invocation of the user materialization hooks until the end of the conversion process, where we actually have a better sense if it's actually necessary. This has several benefits: * In some cases a target materialization hook is no longer necessary When performing a full conversion, there are some situations where a temporary materialization is necessary. Moving forward, these users won't need to provide any target materializations, as the temporary materializations do not require the user to provide materialization hooks. * getRemappedValue can now handle values that haven't been converted yet Before this commit, it wasn't well supported to get the remapped value of a value that hadn't been converted yet (making it difficult/impossible to convert multiple operations in many situations). This commit updates getRemappedValue to properly handle this case by inserting temporary materializations when necessary. Another code-health related benefit is that with this change we can move a majority of the complexity related to materializations to the end of the conversion process, instead of handling it ad hoc while conversion is happening. Differential Revision: https://reviews.llvm.org/D111620	2021-10-27 02:09:04 +00:00
Mogball	a54f4eae0e	[MLIR] Replace std ops with arith dialect ops Precursor: https://reviews.llvm.org/D110200 Removed redundant ops from the standard dialect that were moved to the `arith` or `math` dialects. Renamed all instances of operations in the codebase and in tests. Reviewed By: rriddle, jpienaar Differential Revision: https://reviews.llvm.org/D110797	2021-10-13 03:07:03 +00:00
Uday Bondhugula	1e39d32c5a	[MLIR] Add OrOp folding rule for constant one operand Add folding rule for std.or op when an operand has all bits set. or(x, <all bits set>) -> <all bits set> Differential Revision: https://reviews.llvm.org/D111206	2021-10-07 08:05:39 +05:30
Stella Laurenzo	56272257f3	Return failure on failure in convertBlockSignature. This was causing a subsequent assert/crash when a type converter failed to convert a block argument. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D110985	2021-10-06 15:35:31 -07:00
Sumesh Udayakumaran	b2af2aeea6	[mlir] Mode for explicitly controlling the fusion kind New mode option that allows for either running the default fusion kind that happens today or doing either of producer-consumer or sibling fusion. This will also be helpful to minimize the compile-time of the fusion tests. Reviewed By: bondhugula, dcaballe Differential Revision: https://reviews.llvm.org/D110102	2021-09-27 20:37:42 +03:00
River Riddle	6e60bb6883	[mlir:DataFlowAnalysis] Reprocess the arguments of already executable edges This fixes a bug where we discover new information about the arguments of an already executable edge, but don't visit the arguments. We only visit the arguments, and not the block itself, so this commit shouldn't really affect performance at all. Fixes PR#51871 Differential Revision: https://reviews.llvm.org/D110197	2021-09-22 20:14:55 +00:00
Vladislav Vinogradov	ec03bbe8a7	[mlir] Fix bug in partial dialect conversion The discussion on forum: https://llvm.discourse.group/t/bug-in-partial-dialect-conversion/4115 The `applyPartialConversion` didn't handle operations that were marked as illegal inside a dynamic legality callback. Instead of reporting an error when such an operation was not converted to the legal set, the method just added it to `unconvertedSet` in the same way as unknown operations. This patch fixes that and handles dynamically illegal operations as well. The patch includes 2 fixes for existing passes: * `tensor-bufferize` - explicitly mark `std.return` as legal. * `convert-parallel-loops-to-gpu` - ugly fix with marking visited operations to avoid recursive legality checks. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D108505	2021-09-20 10:39:10 +03:00
Krzysztof Drewniak	121aab84d1	[MLIR][Affine] Simplify nested modulo operations when able It is the case that, for all positive a and b such that b divides a (e mod (a * b)) mod b = e mod b. For example, ((d0 mod 35) mod 5) can be simplified to (d0 mod 5), but ((d0 mod 35) mod 4) cannot be simplified further (x = 36 is a counterexample). This change enables more complex simplifications. For example, ((d0 * 72 + d1) mod 144) mod 9 can now simplify to (d0 * 72 + d1) mod 9 and thus to d1 mod 9. Expressions with chained modulus operators are reasonably common in tensor applications, and this change _should_ improve code generation for such expressions. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109930	2021-09-17 19:06:00 +00:00
cwz920716	500d4c45ba	[MLIR] Use memref.copy ops in BufferResultsToOutParams pass. Both copy/alloc ops are using memref dialect after this change. Reviewed By: silvas, mehdi_amini Differential Revision: https://reviews.llvm.org/D109480	2021-09-15 02:59:30 +00:00
Arnab Dutta	1524b01541	[MLIR] Add loop coalesce utility for affine.for Add loop coalesce utility for affine.for. This expects loops to have been normalized a priori. This works for both constant and non-constant upper bounds with single- or multiple-result upper bound affine maps. With contributions from Arnab Dutta and Uday Bondhugula. Reviewed By: bondhugula, ayzhuang Differential Revision: https://reviews.llvm.org/D108126	2021-09-08 18:02:23 +05:30
Mehdi Amini	387f95541b	Add a new interface allowing to set a default dialect to be used for printing/parsing regions Currently the builtin dialect is the default namespace used for parsing and printing. As such module and func don't need to be prefixed. In the case of some dialects that define new regions for their own purpose (like SpirV modules for example), it can be beneficial to change the default dialect in order to improve readability. Differential Revision: https://reviews.llvm.org/D107236	2021-08-31 17:52:40 +00:00
River Riddle	e4635e6328	[mlir][FoldUtils] Ensure the created constant dominates the replaced op This revision fixes a bug where an operation would get replaced with a pre-existing constant that didn't dominate it. This can occur when a pattern inserts operations to be folded at the beginning of the constants insertion block. This revision fixes the bug by moving the existing constant before the replaced operation in such cases. This is fine because if a constant didn't already exist, a new one would have been inserted before this operation anyways. Differential Revision: https://reviews.llvm.org/D108498	2021-08-23 18:48:24 +00:00
Haruki Imai	b34b1c6955	[mlir] Support normalizing memrefs with MemRef_ReinterpretCastOp This patch enables normalizing memrefs with MemRef_ReinterpretCastOp by adding MemRefsNormalizable trait in the Op definition. Signed-off-by: Haruki Imai <imaihal@jp.ibm.com> Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D107425	2021-08-11 01:15:18 +05:30
Matthias Springer	9102a16bef	[mlir] Support drawing control-flow graphs in ViewOpGraph.cpp * Add new pass option `print-data-flow-edges`, default value `true`. * Add new pass option `print-control-flow-edges`, default value `false`. * Remove `PrintCFGPass`. Same functionality now provided by `PrintOpPass`. Differential Revision: https://reviews.llvm.org/D106342	2021-08-04 20:45:15 +09:00
Matthias Springer	8d15b7dcba	[mlir] Improve Graphviz visualization in PrintOpPass * Visualize blocks and regions as subgraphs. * Generate DOT file directly instead of using `GraphTraits`. `GraphTraits` does not support subgraphs. Differential Revision: https://reviews.llvm.org/D106253	2021-08-04 11:56:26 +09:00
Sumesh Udayakumaran	24b0df8686	[NFC][MLIR] Split large fusion test file into 4 test files mlir/test/transforms/loop-fusion.mlir is too big and is split into mlir/test/transforms/loop-fusion.mlir, mlir/test/transforms/loop-fusion-2.mlir, mlir/test/transforms/loop-fusion-3.mlir and mlir/test/transforms/loop-fusion-4.mlir. Further tests can be added in mlir/test/transforms/loop-fusion-4.mlir Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D106473	2021-08-03 20:08:33 +03:00
Tung D. Le	a2186277be	[mlir][affine-loop-fusion] Fix a bug that AffineIfOp prevents fusion of the other loops The presence of AffineIfOp inside AffineFor prevents fusion of the other loops from happening. For example: ``` affine.for %i0 = 0 to 10 { affine.store %cf7, %a[%i0] : memref<10xf32> } affine.for %i1 = 0 to 10 { %v0 = affine.load %a[%i1] : memref<10xf32> affine.store %v0, %b[%i1] : memref<10xf32> } affine.for %i2 = 0 to 10 { affine.if #set(%i2) { %v0 = affine.load %b[%i2] : memref<10xf32> } } ``` The first two loops were not fused because of `affine.if` inside the last `affine.for`. The issue seems to come from a conservative constraint that does not allow fusion if there are ops whose number of regions != 0 (affine.if is one of them). This patch just removes such a constraint when `affine.if` is inside `affine.for`. The existing `canFuseLoops` method is able to handle `affine.if` correctly. Reviewed By: bondhugula, vinayaka-polymage Differential Revision: https://reviews.llvm.org/D105963	2021-07-30 15:22:46 +05:30
River Riddle	f8479d9de5	[mlir] Set the namespace of the BuiltinDialect to 'builtin' Historically the builtin dialect has had an empty namespace. This has unfortunately created a very awkward situation, where many utilities either have to special case the empty namespace, or just don't work at all right now. This revision adds a namespace to the builtin dialect, and starts to cleanup some of the utilities to no longer handle empty namespaces. For now, the assembly form of builtin operations does not require the `builtin.` prefix. (This should likely be re-evaluated though) Differential Revision: https://reviews.llvm.org/D105149	2021-07-28 21:00:10 +00:00
Marcel Koester	0425332015	[mlir] Added new RegionBranchTerminatorOpInterface and adapted uses of hasTrait<ReturnLike>. This CL adds a new RegionBranchTerminatorOpInterface to query information about operands that can be passed to successor regions. Similar to the BranchOpInterface, it allows to freely define the involved operands. However, in contrast to the BranchOpInterface, it expects an additional region number to distinguish between various use cases which might require different operands passed to different regions. Moreover, we added new utility functions (namely getMutableRegionBranchSuccessorOperands and getRegionBranchSuccessorOperands) to query (mutable) operand ranges for operations equipped with the ReturnLike trait and/or implementing the newly added interface. This simplifies reasoning about terminators in the scope of the nested regions. We also adjusted the SCF.ConditionOp to benefit from the newly added capabilities. Differential Revision: https://reviews.llvm.org/D105018	2021-07-26 06:39:31 +02:00
Rahul Joshi	0cc2346cbf	[MLIR][NFC] Minor cleanup for BufferDeallocation pass. - Change walkReturnOperations() to be a non-template and look at block terminator for ReturnLike trait. - Clarify description of validateSupportedControlFlow - Eliminate unused argument in Backedges::recurse. - Eliminate repeated calls to getFunction() - Fix wording for non-SCF loop failure Differential Revision: https://reviews.llvm.org/D106373	2021-07-20 09:43:22 -07:00
Sumesh Udayakumaran	ada580863f	[mlir] Enable cleanup of single iteration reduction loops being sibling-fused maximally Changes include the following: 1. Single iteration reduction loops being sibling fused at innermost insertion level are skipped from being considered as sequential loops. Otherwise, the slice bounds of these loops are reset. 2. Promote loops that are skipped in previous step into outer loops. 3. Two utility functions - buildSliceTripCountMap, getSliceIterationCount - are moved from mlir/lib/Transforms/Utils/LoopFusionUtils.cpp to mlir/lib/Analysis/Utils.cpp Reviewed By: bondhugula, vinayaka-polymage Differential Revision: https://reviews.llvm.org/D104249	2021-07-16 00:07:20 +03:00
Matthias Springer	c0a6318d96	[mlir][tensor] Add tensor.dim operation * Split memref.dim into two operations: memref.dim and tensor.dim. Both ops have the same builder interface and op argument names, so that they can be used with templates in patterns that apply to both tensors and memrefs (e.g., some patterns in Linalg). * Add constant materializer to TensorDialect (needed for folding in affine.apply etc.). * Remove some MemRefDialect dependencies, make some explicit. Differential Revision: https://reviews.llvm.org/D105165	2021-07-01 10:00:19 +09:00
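
Two of the commit messages above state small arithmetic identities: the `ceilDiv` and `ceilDivU` formulas in 8165eaa885, and the nested-modulo simplification in 121aab84d1. The following is a minimal Python sketch that checks those identities by brute force over small ranges. It is not code from the MLIR tree. The function names are hypothetical, the division in the quoted formulas is assumed to truncate toward zero, and all modulo operands are assumed non-negative.

```python
# Illustrative check of formulas quoted in the commit messages above.
# All names here are hypothetical; none of this is MLIR source code.


def trunc_div(a: int, b: int) -> int:
    """Integer division rounding toward zero (assumed semantics of '/')."""
    q = abs(a) // abs(b)
    return q if (a >= 0) == (b >= 0) else -q


def ceil_div_signed(n: int, m: int) -> int:
    """Signed formula quoted in 8165eaa885 (ceilDiv)."""
    x = -1 if m > 0 else 1
    if n * m > 0:
        return trunc_div(n + x, m) + 1
    return -trunc_div(-n, m)


def ceil_div_unsigned(n: int, m: int) -> int:
    """Unsigned formula quoted in 8165eaa885 (ceilDivU); needs n >= 0, m > 0."""
    return 0 if n == 0 else (n - 1) // m + 1


def reference_ceil_div(n: int, m: int) -> int:
    """Ceiling of n / m via Python floor division, for any nonzero m."""
    return -((-n) // m)


def inner_mod_is_redundant(a: int, b: int) -> bool:
    """(e mod a) mod b equals e mod b for every e >= 0 when b divides a."""
    return a % b == 0


def check_all(limit: int = 60) -> bool:
    """Brute-force the identities over small ranges; True if all hold."""
    for n in range(-limit, limit + 1):
        for m in range(-limit, limit + 1):
            if m != 0 and ceil_div_signed(n, m) != reference_ceil_div(n, m):
                return False
            if n >= 0 and m > 0:
                if ceil_div_unsigned(n, m) != reference_ceil_div(n, m):
                    return False
    # Examples from 121aab84d1: 5 divides 35, and 9 divides both 72 and 144.
    for d0 in range(limit):
        if (d0 % 35) % 5 != d0 % 5:
            return False
        for d1 in range(limit):
            if ((d0 * 72 + d1) % 144) % 9 != d1 % 9:
                return False
    return True
```

The counterexample mentioned in 121aab84d1 can be reproduced the same way: with `d0 = 36`, `(36 % 35) % 4` and `36 % 4` differ, which matches `inner_mod_is_redundant(35, 4)` being false.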

661 Commits