llvm-project

Commit Graph

Author	SHA1	Message	Date
Alexander Belyaev	ed5363a674	[MLIR] Add getBody() method to SingleImplicitBlockTerminator op trait. Many ops with this trait have `getBody()` and `getBodyBuilder()` methods defined in `extraClassDeclaration` in tablegen. `getBody()` implementation is the same accross all these ops, but `getBodyBuilder()` can return builders with varying insertion points set. In this PR, `getBody()` is moved into `SingleImplicitBlockTerminator` struct and `getBodyBuilder()` is replaced with `OpBuilder::atBlock(End\|Terminator)(op.getBody);`. Differential Revision: https://reviews.llvm.org/D78864	2020-04-27 21:48:52 +02:00
River Riddle	4dfd1b5fcb	[mlir] Optimize operand storage such that all operations can have resizable operand lists This revision refactors the structure of the operand storage such that there is no additional memory cost for resizable operand lists until it is required. This is done by using two different internal representations for the operand storage: * One using trailing operands * One using a dynamically allocated std::vector<OpOperand> This allows for removing the resizable operand list bit, and will free up APIs from needing to workaround non-resizable operand lists. Differential Revision: https://reviews.llvm.org/D78875	2020-04-26 21:34:01 -07:00
River Riddle	0816de167a	[mlir][DialectConversion] Add support for properly tracking replaceUsesOfBlockArgument The current implementation of this method performs the replacement directly, and thus doesn't support proper back tracking. Differential Revision: https://reviews.llvm.org/D78790	2020-04-24 12:37:32 -07:00
River Riddle	2eda87dfbe	[mlir][SCCP] Add support for propagating constants across inter-region control flow. This is possible by adding two new ControlFlowInterface additions: - A new interface, RegionBranchOpInterface This interface allows for region holding operations to describe how control flows between regions. This interface initially contains two methods: * getSuccessorEntryOperands Returns the operands of this operation used as the entry arguments when entering the region at `index`, which was specified as a successor by `getSuccessorRegions`. when entering. These operands should correspond 1-1 with the successor inputs specified in `getSuccessorRegions`, and may be a subset of the entry arguments for that region. * getSuccessorRegions Returns the viable successors of a region, or the possible successor when branching from the parent op. This allows for describing which regions may be executed when entering an operation, and which regions are executed after having executed another region of the parent op. For example, a structured loop operation may always enter into the loop body region. The loop body region may branch back to itself, or exit to the operation. - A trait, ReturnLike This trait signals that a terminator exits a region and forwards all of its operands as "exiting" values. These additions allow for performing more general dataflow analysis in the presence of region holding operations. Differential Revision: https://reviews.llvm.org/D78447	2020-04-21 02:59:25 -07:00
River Riddle	152d29cc74	[mlir][Transforms] Add pass to perform sparse conditional constant propagation This revision adds the initial pass for performing SCCP generically in MLIR. SCCP is an algorithm for propagating constants across control flow, and optimistically assumes all values to be constant unless proven otherwise. It currently supports branching control, with support for regions and inter-procedural propagation being added in followups. Differential Revision: https://reviews.llvm.org/D78397	2020-04-21 02:59:25 -07:00
Sean Silva	22219cfc6a	Fix inlining multi-block callees with type conversion. The previous code result a mismatch between block argument types and predecessor successor args when a type conversion was needed in a multiblock case. It was assuming the replaced result types matched the region result types. Also, slighly improve the debug output from the inliner. Differential Revision: https://reviews.llvm.org/D78415	2020-04-20 16:54:01 -07:00
Alexander Belyaev	def3e10eac	[MLIR] Add #include "llvm/ADT/SmallPtrSet.h" back to LoopUtils.h.	2020-04-20 10:21:18 +02:00
Alexander Belyaev	ad9988f4da	[MLIR] Move `replaceAllUsesExcept` from LoopUtil.h to Value.h. Differential Revision: https://reviews.llvm.org/D78426	2020-04-20 09:21:06 +02:00
Uday Bondhugula	ecddafd84a	[MLIR] NFC affine for op tiling cleanup / utility rename Rename mlir::tileCodeGen -> mlir::tilePerfectlyNested to be consistent. NFC clean up tiling utility code, drop dead code, better comments. Expose isPerfectlyNested and reuse. Differential Revision: https://reviews.llvm.org/D78423	2020-04-19 00:53:34 +05:30
Uday Bondhugula	f043677f6d	[MLIR] Make isPerfectlyNested check more efficient Make mlir::isPerfectlyNested more efficient; use O(1) check instead of O(N) size() method. Differential Revision: https://reviews.llvm.org/D78428	2020-04-18 23:34:49 +05:30
Stephen Neuendorffer	f061295732	[MLIR] Complete refactoring of Affine dialect into sub-libraries. There were some unused CMakeFiles for Affine/IR and Affine/EDSC. This change builds separate MLIRAffineOps and MLIRAffineEDSC libraries using those CMakeFiles. This combination replaces the old MLIRAffine library. Differential Revision: https://reviews.llvm.org/D78317	2020-04-16 13:41:17 -07:00
Alexander Belyaev	be9c3bdc44	[MLIR] Fix fusion of linalg.indexed_generic producer into tiled (Indexed)GenericOp. Differential Revision: https://reviews.llvm.org/D78209	2020-04-16 10:45:17 +02:00
Lorenzo Chelini	a60fdd2ba4	[MLIR] NFC after commit D77478. Remove leftovers 'applyPatternsGreedily' from the codebase. Differential Revision: https://reviews.llvm.org/D78274	2020-04-16 10:32:01 +02:00
River Riddle	4f37450b2c	[mlir][Inliner] Store the resolved call by-value instead of by-reference This avoids asan failures as more calls may be added during inlining, invalidating the reference. Differential Revision: https://reviews.llvm.org/D78258	2020-04-15 17:42:27 -07:00
Jeremy Bruestle	9f3ab92ec8	[MLIR] Improve support for 0-dimensional Affine Maps. Summary: Modified AffineMap::get to remove support for the overload which allowed an ArrayRef of AffineExpr but no context (and gathered the context from a presumed first entry, resulting in bugs when there were 0 results). Instead, we support only a ArrayRef and a context, and a version which takes a single AffineExpr. Additionally, removed some now needless case logic which previously special cased which call to AffineMap::get to use. Reviewers: flaub, bondhugula, rriddle!, nicolasvasilache, ftynse, ulysseB, mravishankar, antiagainst, aartbik Subscribers: mehdi_amini, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, bader, grosul1, frgossen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78226	2020-04-15 14:15:02 -07:00
Uday Bondhugula	04b5274ede	[MLIR] Introduce applyOpPatternsAndFold for op local rewrites Introduce mlir::applyOpPatternsAndFold which applies patterns as well as any folding only on a specified op (in contrast to applyPatternsAndFoldGreedily which applies patterns only on the regions of an op isolated from above). The caller is made aware of the op being folded away or erased. Depends on D77485. Differential Revision: https://reviews.llvm.org/D77487	2020-04-15 14:10:01 +05:30
River Riddle	92f1562f3d	[mlir][NFC] Remove the STLExtras.h header file now that it has been merged into LLVM. Now that no more utilities exist within, this file can be deleted. Differential Revision: https://reviews.llvm.org/D78079	2020-04-14 15:14:41 -07:00
River Riddle	ebf190fcda	[llvm][ADT] Move TypeSwitch class from MLIR to LLVM This class implements a switch-like dispatch statement for a value of 'T' using dyn_cast functionality. Each `Case<T>` takes a callable to be invoked if the root value isa<T>, the callable is invoked with the result of dyn_cast<T>() as a parameter. Differential Revision: https://reviews.llvm.org/D78070	2020-04-14 15:14:41 -07:00
River Riddle	2f21a57966	[llvm][STLExtras] Move the algorithm `interleave*` methods from MLIR to LLVM These have proved incredibly useful for interleaving values between a range w.r.t to streams. After this revision, the mlir/Support/STLExtras.h is empty. A followup revision will remove it from the tree. Differential Revision: https://reviews.llvm.org/D78067	2020-04-14 15:14:40 -07:00
River Riddle	204c3b5516	[llvm][STLExtras] Move various iterator/range utilities from MLIR to LLVM This revision moves the various range utilities present in MLIR to LLVM to enable greater reuse. This revision moves the following utilities: * indexed_accessor_* This is set of utility iterator/range base classes that allow for building a range class where the iterators are represented by an object+index pair. * make_second_range Given a range of pairs, returns a range iterating over the `second` elements. * hasSingleElement Returns if the given range has 1 element. size() == 1 checks end up being very common, but size() is not always O(1) (e.g., ilist). This method provides O(1) checks for those cases. Differential Revision: https://reviews.llvm.org/D78064	2020-04-14 15:14:40 -07:00
Uday Bondhugula	ac047d9fce	[MLIR] Remove dead affine.applys while generating pointwise copies This makes no impact on the test cases because affine-data-copy-generate runs whole function canonicalization at its end; however, the latter will be removed in a pending revision. It is thus useful to clean up these affine.applys right here, and eventually, not even generate these (when the right API to compose by construction is in place). Differential Revision: https://reviews.llvm.org/D78055	2020-04-14 09:47:14 +05:30
Uday Bondhugula	42ada5fee9	[MLIR] NFC cleanup/modernize memref-dataflow-opt / getNestingDepth Bring code to date with recent changes to the core infrastructure / coding style. Differential Revision: https://reviews.llvm.org/D77998	2020-04-14 00:03:06 +05:30
Uday Bondhugula	cbcb12fd44	[MLIR] Handle in-place folding properly in greedy pattern rewrite driver OperatioFolder::tryToFold performs both true folding and in a few instances in-place updates through op rewrites. In the latter case, we should still be applying the supplied pattern rewrites in the same iteration; however this wasn't the case since tryToFold returned success() for both true folding and in-place updates, and the patterns for the in-place updated ops were being applied only in the next iteration of the driver's outer loop. This fix would make it converge faster. Differential Revision: https://reviews.llvm.org/D77485	2020-04-11 19:57:29 +05:30
Uday Bondhugula	a5b9316b24	[MLIR][NFC] applyPatternsGreedily -> applyPatternsAndFoldGreedily Rename mlir::applyPatternsGreedily -> applyPatternsAndFoldGreedily. The new name is a more accurate description of the method - it performs both, application of the specified patterns and folding of all ops in the op's region irrespective of whether any patterns have been supplied. Differential Revision: https://reviews.llvm.org/D77478	2020-04-10 12:55:21 +05:30
River Riddle	bd1ccfe6df	[mlir] Add a new RewritePattern::hasBoundedRewriteRecursion hook. Summary: Some pattern rewriters, like dialect conversion, prohibit the unbounded recursion(or reapplication) of patterns on generated IR. Most patterns are not written with recursive application in mind, so will generally explode the stack if uncaught. This revision adds a hook to RewritePattern, `hasBoundedRewriteRecursion`, to signal that the pattern can safely be applied to the generated IR of a previous application of the same pattern. This allows for establishing a contract between the pattern and rewriter that the pattern knows and can handle the potential recursive application. Differential Revision: https://reviews.llvm.org/D77782	2020-04-09 12:42:28 -07:00
River Riddle	400ad6f95d	[mlir] Eliminate the remaining usages of cl::opt instead of PassOption. Summary: Pass options are a better choice for various reasons and avoid the need for static constructors. Differential Revision: https://reviews.llvm.org/D77707	2020-04-08 13:05:08 -07:00
River Riddle	1834ad4a69	[mlir][Pass] Update the PassGen to generate base classes instead of utilities Summary: This is much cleaner, and fits the same structure as many other tablegen backends. This was not done originally as the CRTP in the pass classes made it overly verbose/complex. Differential Revision: https://reviews.llvm.org/D77367	2020-04-07 14:08:52 -07:00
River Riddle	80aca1eaf7	[mlir][Pass] Remove the use of CRTP from the Pass classes This revision removes all of the CRTP from the pass hierarchy in preparation for using the tablegen backend instead. This creates a much cleaner interface in the C++ code, and naturally fits with the rest of the infrastructure. A new utility class, PassWrapper, is added to replicate the existing behavior for passes not suitable for using the tablegen backend. Differential Revision: https://reviews.llvm.org/D77350	2020-04-07 14:08:52 -07:00
River Riddle	722f909f7a	[mlir][Pass][NFC] Replace usages of ModulePass with OperationPass<ModuleOp> ModulePass doesn't provide any special utilities and thus doesn't give enough benefit to warrant a special pass class. This revision replaces all usages with the more general OperationPass. Differential Revision: https://reviews.llvm.org/D77339	2020-04-07 14:08:52 -07:00
Uday Bondhugula	70da33bf30	[MLIR] fix/update affine data copy utility for max/min bounds Fix point-wise copy generation to work with bounds that have max/min. Change structure of copy loop nest to use absolute loop indices and subtracting base from the indexes of the fast buffers. Update supporting utilities: Fix FlatAffineConstraints::getLowerAndUpperBound to look at equalities as well and for a missing division. Update unionBoundingBox to not discard common constraints (leads to a tighter system). Update MemRefRegion::getConstantBoundingSizeAndShape to add memref dimension constraints. Run removeTrivialRedundancy at the end of MemRefRegion::compute. Run single iteration loop promotion and load/store canonicalization after affine data copy (in its test pass as well). Differential Revision: https://reviews.llvm.org/D77320	2020-04-07 13:55:42 +05:30
Uday Bondhugula	3f9cdd44d7	[MLIR] Add pattern rewriter util to erase block; remove dead else Add a pattern rewriter utility to erase blocks (while notifying the pattern rewriting driver of the erased ops). Use this to remove trivial else blocks in affine.if ops. Differential Revision: https://reviews.llvm.org/D77083	2020-04-05 19:24:43 +05:30
Uday Bondhugula	cc6738949d	[MLIR][NFC] fix name operand -> userOp The wrong name was confusing to read. value.getUsers() yields Operation *s. Differential Revision: https://reviews.llvm.org/D77486	2020-04-05 19:17:15 +05:30
Uday Bondhugula	f875e55ba9	[MLIR] fix greedy pattern rewrite driver iteration on change Removing dead ops should make the outer loop of the pattern rewriting driver run again. Although its operands are added to the worklist, if no changes happenned to them or remaining ops in the worklist, the driver wouldn't run once again - but it should be. Differential Revision: https://reviews.llvm.org/D77483	2020-04-05 19:15:46 +05:30
Kazuaki Ishizaki	5aacce3db2	[mlir] NFC: Fix trivial typo Differential Revision: https://reviews.llvm.org/D77473	2020-04-05 11:30:30 +09:00
Alex Zinenko	f27f1e8c27	[mlir] DialectConversion: support block creation in ConversionPatternRewriter PatternRewriter and derived classes provide a set of virtual methods to manipulate blocks, which ConversionPatternRewriter overrides to keep track of the manipulations and undo them in case the conversion fails. However, one can currently create a block only by splitting another block into two. This not only makes the API inconsistent (`splitBlock` is allowed in conversion patterns, but `createBlock` is not), but it also make it impossible for one to create blocks with argument lists different from those of already existing blocks since in-place block updates are not supported either. Such functionality precludes dialect conversion infrastructure from being used more extensively on region-containing ops, for example, for value-returning "if" operations. At the same time, ConversionPatternRewriter already allows one to undo block creation as block creation is one of the primitive operations in already supported region inlining. Support block creation in conversion patterns by hooking `createBlock` on the block action undo mechanism. This requires to make `Builder::createBlock` virtual, similarly to Op insertion. This is a minimal change to the Builder infrastructure that will later help support additional use cases such as block signature changes. `createBlock` now additionally takes the types of the block arguments that are added immediately so as to avoid in-place argument list manipulation that would be illegal in conversion patterns.	2020-04-03 20:30:03 +02:00
Uday Bondhugula	5e8093134a	[MLIR] Add method to drop duplicate result exprs from AffineMap Add a method that given an affine map returns another with just its unique results. Use this to drop redundant bounds in max/min for affine.for. Update affine.for's canonicalization pattern and createCanonicalizedForOp to use this. Differential Revision: https://reviews.llvm.org/D77237	2020-04-02 03:00:19 +05:30
Mehdi Amini	0dd21130ef	Add LLVM_ATTRIBUTE_UNUSED to function used only in assert (NFC)	2020-04-01 17:21:07 +00:00
Uday Bondhugula	68316afb29	[MLIR][NFC] loop transforms/analyis utils cleanup / modernize Modernize/cleanup code in loop transforms utils - a lot of this code was written prior to the currently available IR support / code style. This patch also does some variable renames including inst -> op, comment updates, turns getCleanupLoopLowerBound into a local function. Differential Revision: https://reviews.llvm.org/D77175	2020-04-01 22:36:25 +05:30
River Riddle	9a277af2d4	[mlir][Pass] Add support for generating pass utilities via tablegen This revision adds support for generating utilities for passes such as options/statistics/etc. that can be inferred from the tablegen definition. This removes additional boilerplate from the pass, and also makes it easier to remove the reliance on the pass registry to provide certain things(e.g. the pass argument). Differential Revision: https://reviews.llvm.org/D76659	2020-04-01 02:10:46 -07:00
River Riddle	8155e41ac6	[mlir][Pass] Add a tablegen backend for defining Pass information This will greatly simplify a number of things related to passes: * Enables generation of pass registration * Enables generation of boiler plate pass utilities * Enables generation of pass documentation This revision focuses on adding the basic structure and adds support for generating the registration for passes in the Transforms/ directory. Future revisions will add more support and move more passes over. Differential Revision: https://reviews.llvm.org/D76656	2020-04-01 02:10:46 -07:00
Tres Popp	90b7bbffdd	[MLIR] Rename collapsePLoops -> collapseParallelLoops Summary: Additionally, NFC code cleanups were done. This is to address additional comments on https://reviews.llvm.org/D76363 Differential Revision: https://reviews.llvm.org/D77052	2020-04-01 10:15:13 +02:00
Uday Bondhugula	f273e5c507	[MLIR] Fix permuteLoops utility Rewrite mlir::permuteLoops (affine loop permutation utility) to fix incorrect approach. Avoiding using sinkLoops entirely - use single move approach. Add test pass. This fixes https://bugs.llvm.org/show_bug.cgi?id=45328 Depends on D77003. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D77004	2020-03-30 23:38:23 +05:30
scentini	3b20970de8	Fix unused-variable error when assertions are disabled	2020-03-30 13:55:43 +02:00
Uday Bondhugula	4e4ea2cde4	[MLIR] Add missing asserts in interchangeLoops util, doc comment update Add missing assert checks for input to mlir::interchangeLoops utility. Rename interchangeLoops -> permuteLoops; update doc comments to clarify inputs / return val. Other than the assert checks, this is NFC. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D77003	2020-03-30 00:03:12 +05:30
Uday Bondhugula	43a95a543f	[MLIR] Introduce full/partial tile separation using if/else This patch introduces a utility to separate full tiles from partial tiles when tiling affine loop nests where trip counts are unknown or where tile sizes don't divide trip counts. A conditional guard is generated to separate out the full tile (with constant trip count loops) into the then block of an 'affine.if' and the partial tile to the else block. The separation allows the 'then' block (which has constant trip count loops) to be optimized better subsequently: for eg. for unroll-and-jam, register tiling, vectorization without leading to cleanup code, or to offload to accelerators. Among techniques from the literature, the if/else based separation leads to the most compact cleanup code for multi-dimensional cases (because a single version is used to model all partial tiles). INPUT affine.for %i0 = 0 to %M { affine.for %i1 = 0 to %N { "foo"() : () -> () } } OUTPUT AFTER TILING W/O SEPARATION map0 = affine_map<(d0) -> (d0)> map1 = affine_map<(d0)[s0] -> (d0 + 32, s0)> affine.for %arg2 = 0 to %M step 32 { affine.for %arg3 = 0 to %N step 32 { affine.for %arg4 = #map0(%arg2) to min #map1(%arg2)[%M] { affine.for %arg5 = #map0(%arg3) to min #map1(%arg3)[%N] { "foo"() : () -> () } } } } OUTPUT AFTER TILING WITH SEPARATION map0 = affine_map<(d0) -> (d0)> map1 = affine_map<(d0) -> (d0 + 32)> map2 = affine_map<(d0)[s0] -> (d0 + 32, s0)> #set0 = affine_set<(d0, d1)[s0, s1] : (-d0 + s0 - 32 >= 0, -d1 + s1 - 32 >= 0)> affine.for %arg2 = 0 to %M step 32 { affine.for %arg3 = 0 to %N step 32 { affine.if #set0(%arg2, %arg3)[%M, %N] { // Full tile. affine.for %arg4 = #map0(%arg2) to #map1(%arg2) { affine.for %arg5 = #map0(%arg3) to #map1(%arg3) { "foo"() : () -> () } } } else { // Partial tile. affine.for %arg4 = #map0(%arg2) to min #map2(%arg2)[%M] { affine.for %arg5 = #map0(%arg3) to min #map2(%arg3)[%N] { "foo"() : () -> () } } } } } The separation is tested via a cmd line flag on the loop tiling pass. The utility itself allows one to pass in any band of contiguously nested loops, and can be used by other transforms/utilities. The current implementation works for hyperrectangular loop nests. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76700	2020-03-28 06:58:35 +05:30
Uday Bondhugula	ad4b4acbb0	[MLIR][NFC] drop some unnecessary includes Drop unnecessary includes Differential Revision: https://reviews.llvm.org/D76898	2020-03-27 09:17:27 +05:30
Tres Popp	27c201aa1d	[MLIR] Add parallel loop collapsing. This allows conversion of a ParallelLoop from N induction variables to some nuber of induction variables less than N. The first intended use of this is for the GPUDialect to convert ParallelLoops to iterate over 3 dimensions so they can be launched as GPU Kernels. To implement this: - Normalize each iteration space of the ParallelLoop - Use the same induction variable in a new ParallelLoop for multiple original iterations. - Split the new induction variable back into the original set of values inside the body of the ParallelLoop. Subscribers: mgorny, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76363	2020-03-26 09:32:52 +01:00
Uday Bondhugula	98fa615002	[MLIR] move loopUnrollJamBy*Factor to loop transforms utils The declarations for these were already part of transforms utils, but the definitions were left in affine transforms. Move definitions to loop transforms utils. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76633	2020-03-24 08:08:57 +05:30
MaheshRavishankar	04f2b717d2	[mlir] Fix unsafe create operation in GreedyPatternRewriter When trying to fold an operation during operation creation check that the operation folding succeeds before inserting the op. Differential Revision: https://reviews.llvm.org/D76415	2020-03-23 11:50:40 -07:00
Uday Bondhugula	b873761496	[MLIR][NFC] Move some of the affine transforms / tests to dialect dirs Move some of the affine transforms and their test cases to their respective dialect directory. This patch does not complete the move, but takes care of a good part. Renames: prefix 'affine' to affine loop tiling cl options, vectorize -> super-vectorize Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76565	2020-03-23 08:25:07 +05:30

1 2 3 4 5 ...

781 Commits