llvm-project

Commit Graph

Author	SHA1	Message	Date
River Riddle	037742cdf2	Add support for early exit walk methods. This is done by providing a walk callback that returns a WalkResult. This result is either `advance` or `interrupt`. `advance` means that the walk should continue, whereas `interrupt` signals that the walk should stop immediately. An example is shown below: auto result = op->walk([](Operation *op) { if (some_invariant) return WalkResult::interrupt(); return WalkResult::advance(); }); if (result.wasInterrupted()) ...; PiperOrigin-RevId: 266436700	2019-08-30 12:47:53 -07:00
River Riddle	4bfae66d70	Refactor the 'walk' methods for operations. This change refactors and cleans up the implementation of the operation walk methods. After this refactoring is that the explicit template parameter for the operation type is no longer needed for the explicit op walks. For example: op->walk<AffineForOp>([](AffineForOp op) { ... }); is now accomplished via: op->walk([](AffineForOp op) { ... }); PiperOrigin-RevId: 266209552	2019-08-29 13:04:50 -07:00
Uday Bondhugula	aa2cee9cf5	Refactor / improve replaceAllMemRefUsesWith Refactor replaceAllMemRefUsesWith to split it into two methods: the new method does the replacement on a single op, and is used by the existing one. - make the methods return LogicalResult instead of bool - Earlier, when replacement failed (due to non-deferencing uses of the memref), the set of ops that had already been processed would have been replaced leaving the IR in an inconsistent state. Now, a pass is made over all ops to first check for non-deferencing uses, and then replacement is performed. No test cases were affected because all clients of this method were first checking for non-deferencing uses before calling this method (for other reasons). This isn't true for a use case in another upcoming PR (scalar replacement); clients can now bail out with consistent IR on failure of replaceAllMemRefUsesWith. Add test case. - multiple deferencing uses of the same memref in a single op is possible (we have no such use cases/scenarios), and this has always remained unsupported. Add an assertion for this. - minor fix to another test pipeline-data-transfer case. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#87 PiperOrigin-RevId: 265808183	2019-08-27 17:56:56 -07:00
Andy Ly	6a501e3d1b	Support folding of ops with inner ops in GreedyPatternRewriteDriver. This fixes a bug when folding ops with inner ops and inner ops are still being visited. PiperOrigin-RevId: 265475780	2019-08-26 09:44:39 -07:00
River Riddle	32052c8417	NFC: Add a note to 'applyPatternsGreedily' that it also performs folding/dce. Fixes tensorflow/mlir#72 PiperOrigin-RevId: 265097597	2019-08-23 11:28:45 -07:00
River Riddle	ffde975e21	NFC: Move AffineOps dialect to the Dialect sub-directory. PiperOrigin-RevId: 264482571	2019-08-20 15:36:39 -07:00
River Riddle	ba0fa92524	NFC: Move LLVMIR, SDBM, and StandardOps to the Dialect/ directory. PiperOrigin-RevId: 264193915	2019-08-19 11:01:25 -07:00
Jacques Pienaar	79f53b0cf1	Change from llvm::make_unique to std::make_unique Switch to C++14 standard method as llvm::make_unique has been removed ( https://reviews.llvm.org/D66259). Also mark some targets as c++14 to ease next integrates. PiperOrigin-RevId: 263953918	2019-08-17 11:06:03 -07:00
River Riddle	5290e8c36d	NFC: Update pattern rewrite API to pass OwningRewritePatternList by const reference. The pattern list is not modified by any of these APIs and should thus be passed with const. PiperOrigin-RevId: 262844002	2019-08-11 18:34:14 -07:00
River Riddle	1e42954032	NFC: Standardize the terminology used for parent ops/regions/etc. There are currently several different terms used to refer to a parent IR unit in 'get' methods: getParent/getEnclosing/getContaining. This cl standardizes all of these methods to use 'getParent*'. PiperOrigin-RevId: 262680287	2019-08-09 20:07:52 -07:00
River Riddle	41968fb475	NFC: Update usages of OwningRewritePatternList to pass by & instead of &&. This will allow for reusing the same pattern list, which may be costly to continually reconstruct, on multiple invocations. PiperOrigin-RevId: 262664599	2019-08-09 17:20:29 -07:00
River Riddle	8089f93746	Add utility 'replaceAllUsesWith' methods to Operation. These methods will allow replacing the uses of results with an existing operation, with the same number of results, or a range of values. This removes a number of hand-rolled result replacement loops and simplifies replacement for operations with multiple results. PiperOrigin-RevId: 262206600	2019-08-07 13:48:52 -07:00
Andy Ly	55f2e24ab3	Remove ops in regions/blocks from worklist when parent op is being removed via GreedyPatternRewriteDriver::replaceOp. This fixes a bug where ops inside the parent op are visited even though the parent op has been removed. PiperOrigin-RevId: 261953580	2019-08-06 11:08:54 -07:00
River Riddle	a0df3ebd15	NFC: Implement OwningRewritePatternList as a class instead of a using directive. This allows for proper forward declaration, as opposed to leaking the internal implementation via a using directive. This also allows for all pattern building to go through 'insert' methods on the OwningRewritePatternList, replacing uses of 'push_back' and 'RewriteListBuilder'. PiperOrigin-RevId: 261816316	2019-08-05 18:38:22 -07:00
Nicolas Vasilache	48a1baeb8a	Refactor LoopParametricTiling as a test pass - NFC This CL moves LoopParametricTiling into test/lib as a pass for purely testing purposes. PiperOrigin-RevId: 259300264	2019-07-22 04:31:17 -07:00
Nicolas Vasilache	d2a872922f	Refactor stripmineSink for AffineForOp - NFC More moving less cloning. PiperOrigin-RevId: 258947575	2019-07-19 11:40:37 -07:00
Nicolas Vasilache	db4cd1c8dc	Utility function to map a loop on a parametric grid of virtual processors This CL introduces a simple loop utility function which rewrites the bounds and step of a loop so as to become mappable on a regular grid of processors whose identifiers are given by SSA values. A corresponding unit test is added. For example, using CUDA terminology, and assuming a 2-d grid with processorIds = [blockIdx.x, threadIdx.x] and numProcessors = [gridDim.x, blockDim.x], the loop: ``` loop.for %i = %lb to %ub step %step { ... } ``` is rewritten into a version resembling the following pseudo-IR: ``` loop.for %i = %lb + threadIdx.x + blockIdx.x * blockDim.x to %ub step %gridDim.x * blockDim.x { ... } ``` PiperOrigin-RevId: 258945942	2019-07-19 11:40:31 -07:00
Nicolas Vasilache	5bc344743c	Uniformize the API for the mlir::tile functions on AffineForOp and loop::ForOp This CL adapts the recently introduced parametric tiling to have an API matching the tiling of AffineForOp. The transformation using stripmineSink is more general and produces imperfectly nested loops. Perfect nesting invariants of the tiled version are obtained by selectively applying hoisting of ops to isolate perfectly nested bands. Such hoisting may fail to produce a perfect loop nest in cases where ForOp transitively depend on enclosing induction variables. In such cases, the API provides a LogicalResult return but the SimpleParametricLoopTilingPass does not currently use this result. A new unit test is added with a triangular loop for which the perfect nesting property does not hold. For this example, the old behavior was to produce IR that did not verify (some use was not dominated by its def). PiperOrigin-RevId: 258928309	2019-07-19 11:40:25 -07:00
Nicolas Vasilache	0002e2964d	Move affine.for and affine.if to ODS As the move to ODS is made, body and region names across affine and loop dialects are uniformized. PiperOrigin-RevId: 258416590	2019-07-16 13:45:47 -07:00
River Riddle	e7a2ef21f9	Update 'applyPatternsGreedily' to work on the regions of any operations. 'applyPatternsGreedily' is a useful utility outside of just function regions. PiperOrigin-RevId: 258182937	2019-07-16 13:44:39 -07:00
Alex Zinenko	fc044e8929	Introduce loop coalescing utility and a simple pass Multiple (perfectly) nested loops with independent bounds can be combined into a single loop and than subdivided into blocks of arbitrary size for load balancing or more efficient parallelism exploitation. However, MLIR wants to preserve the multi-dimensional multi-loop structure at higher levels of abstraction. Introduce a transformation that coalesces nested loops with independent bounds so that they can be further subdivided by tiling. PiperOrigin-RevId: 258151016	2019-07-16 13:43:44 -07:00
Nicolas Vasilache	cca53e8527	Extract std.for std.if and std.terminator in their own dialect These ops should not belong to the std dialect. This CL extracts them in their own dialect and updates the corresponding conversions and tests. PiperOrigin-RevId: 258123853	2019-07-16 13:43:18 -07:00
River Riddle	2566a72a21	Update the PatternRewriter constructor to take a context instead of a region. This will allow for cleanly using a rewriter for multiple different regions. PiperOrigin-RevId: 257845371	2019-07-12 17:42:52 -07:00
River Riddle	8e349a48b6	Remove the 'region' field from OpBuilder. This field wasn't updated as the insertion point changed, making it potentially dangerous given the multi-level of MLIR(e.g. 'createBlock' would always insert the new block in 'region'). This also allows for building an OpBuilder with just a context. PiperOrigin-RevId: 257829135	2019-07-12 17:42:41 -07:00
River Riddle	60a2983779	Fix a bug in the canonicalizer when replacing constants via patterns. The GreedyPatternRewriteDriver currently does not notify the OperationFolder when constants are removed as part of a pattern match. This materializes in a nasty bug where a different operation may be allocated to the same address. This causes an assertion in the OperationFolder when it gets notified of the new operations removal. PiperOrigin-RevId: 257817627	2019-07-12 17:42:24 -07:00
Nicolas Vasilache	cab671d166	Lower affine control flow to std control flow to LLVM dialect This CL splits the lowering of affine to LLVM into 2 parts: 1. affine -> std 2. std -> LLVM The conversions mostly consists of splitting concerns between the affine and non-affine worlds from existing conversions. Short-circuiting of affine `if` conditions was never tested or exercised and is removed in the process, it can be reintroduced later if needed. LoopParametricTiling.cpp is updated to reflect the newly added ForOp::build. PiperOrigin-RevId: 257794436	2019-07-12 08:44:28 -07:00
River Riddle	9dbef0bf96	Rename FunctionAttr to SymbolRefAttr. This allows for the attribute to hold symbolic references to other operations than FuncOp. This also allows for removing the dependence on FuncOp from the base Builder. PiperOrigin-RevId: 257650017	2019-07-12 08:43:42 -07:00
River Riddle	8c44367891	NFC: Rename Function to FuncOp. PiperOrigin-RevId: 257293379	2019-07-10 10:10:53 -07:00
Alex Zinenko	7a2e8726e8	Fix a test broken on some systems due to a mis-rebase. PiperOrigin-RevId: 257190161	2019-07-09 07:43:42 -07:00
Alex Zinenko	9d03f5674f	Implement parametric tiling on standard for loops Parametric tiling can be used to extract outer loops with fixed number of iterations. This in turn enables mapping to GPU kernels on a fixed grid independently of the range of the original loops, which may be unknown statically, making the kernel adaptable to different sizes. Provide a utility function that also computes the parametric tile size given the range of the loop. Exercise the utility function through a simple pass that applies it to all top-level loop nests. Permutability or parallelism checks must be performed before calling this utility function in actual passes. Note that parametric tiling cannot be implemented in a purely affine way, although it can be encoded using semi-affine maps. The choice to implement it on standard loops is guided by them being the common representation between Affine loops, Linalg and GPU kernels. PiperOrigin-RevId: 257180251	2019-07-09 06:37:41 -07:00
River Riddle	ce502af9cd	NFC: Remove the various "::getFunction" methods. These methods assume that a function is a valid builtin top-level operation, and removing these methods allows for decoupling FuncOp and IR/. Utility "getParentOfType" methods have been added to Operation/OpState to allow for querying the first parent operation of a given type. PiperOrigin-RevId: 257018913	2019-07-08 12:40:08 -07:00
River Riddle	474e354179	NFC: Remove Region::getContainingFunction as Functions are now Operations. PiperOrigin-RevId: 256579717	2019-07-04 13:23:10 -07:00
Andy Davis	2e1187dd25	Globally change load/store/dma_start/dma_wait operations over to affine.load/store/dma_start/dma_wait. In most places, this is just a name change (with the exception of affine.dma_start swapping the operand positions of its tag memref and num_elements operands). Significant code changes occur here: ) Vectorization: LoopAnalysis.cpp, Vectorize.cpp ) Affine Transforms: Transforms/Utils/Utils.cpp PiperOrigin-RevId: 256395088	2019-07-03 14:37:06 -07:00
River Riddle	54cd6a7e97	NFC: Refactor Function to be value typed. Move the data members out of Function and into a new impl storage class 'FunctionStorage'. This allows for Function to become value typed, which will greatly simplify the transition of Function to FuncOp(given that FuncOp is also value typed). PiperOrigin-RevId: 255983022	2019-07-01 11:39:00 -07:00
Nicolas Vasilache	e7f51ad08a	Add a folder-based EDSC intrinsics constructor (NFC) PiperOrigin-RevId: 255908660	2019-07-01 09:55:35 -07:00
River Riddle	66ed7d6d83	Update the OperationFolder to find a valid insertion point when materializing constants. The OperationFolder currently just inserts into the entry block of a Function, but regions may be isolated above, i.e. explicit capture only, and blindly inserting constants may break the invariants of these regions. PiperOrigin-RevId: 254987796	2019-06-25 09:43:21 -07:00
River Riddle	bcacef1a70	Add a new dialect hook 'materializeConstant' to create a constant operation that materializes an attribute value with the given type. This effectively adds support for dialect specific constant values that have different invariants than std.constant. 'OperationFolder' is updated to use this new hook, or attempt to default to std.constant when legal. PiperOrigin-RevId: 254570153	2019-06-22 13:05:27 -07:00
River Riddle	48d6cf1ced	NFC: Remove the 'context' parameter from OperationState. Now that Locations are Attributes they contain a direct reference to the MLIRContext, i.e. the context can be directly accessed from the given location instead of being explicitly passed in. PiperOrigin-RevId: 254568329	2019-06-22 13:05:10 -07:00
Nicolas Vasilache	0804750c9b	Uniformize usage of OpBuilder& (NFC) Historically the pointer-based version of builders was used. This CL uniformizes to OpBuilder & PiperOrigin-RevId: 254280885	2019-06-22 09:14:49 -07:00
Andy Davis	59b68146ff	Factor fusion compute cost calculation out of LoopFusion and into LoopFusionUtils (NFC). PiperOrigin-RevId: 253797886	2019-06-19 23:06:26 -07:00
Alex Zinenko	4291ae7431	Factor Region::getUsedValuesDefinedAbove into Transforms/RegionUtils Arguably, this function is only useful for transformations and should not pollute the main IR. Also make sure it accepts a the resulting container by-reference instead of returning it. PiperOrigin-RevId: 253622981	2019-06-19 23:03:51 -07:00
Andy Davis	898cf0e968	LoopFusion: adds support for computing forward computation slices, which will enable fusion of consumer loop nests into their producers in subsequent CLs. PiperOrigin-RevId: 253601994	2019-06-19 23:03:42 -07:00
Alex Zinenko	ee6f84aebd	Convert a nest affine loops to a GPU kernel This converts entire loops into threads/blocks. No check on the size of the block or grid, or on the validity of parallelization is performed, it is under the responsibility of the caller to strip-mine the loops and to perform the dependence analysis before calling the conversion. PiperOrigin-RevId: 253189268	2019-06-19 23:02:02 -07:00
River Riddle	0560f153b8	Add utility 'create' methods to OperationFolder that will create an operation with a given OpBuilder and automatically try to fold it, similarly to OpBuilder::createOrFold. The difference here is that these methods enable folding to constants in addition to existing values. This functionality is then used to replace linalg::FunctionConstants. PiperOrigin-RevId: 251716247	2019-06-09 16:19:51 -07:00
River Riddle	f1b848e470	NFC: Rename FuncBuilder to OpBuilder and refactor to take a top level region instead of a function. PiperOrigin-RevId: 251563898	2019-06-09 16:17:59 -07:00
River Riddle	9b4a02c1e9	NFC: Rename FoldHelper to OperationFolder and split a large function in two. PiperOrigin-RevId: 251485843	2019-06-09 16:17:11 -07:00
MLIR Team	5a91b9896c	Remove "size" property of affine maps. -- PiperOrigin-RevId: 250572818	2019-06-01 20:09:02 -07:00
Andy Davis	1de0f97fff	LoopFusionUtils CL 2/n: Factor out and generalize slice union computation. ) Factors slice union computation out of LoopFusion into Analysis/Utils (where other iteration slice utilities exist). ) Generalizes slice union computation to take the union of slices computed on all loads/stores pairs between source and destination loop nests. ) Fixes a bug in FlatAffineConstraints::addSliceBounds where redundant constraints were added. ) Takes care of a TODO to expose FlatAffineConstraints::mergeAndAlignIds as a public method. -- PiperOrigin-RevId: 250561529	2019-06-01 20:08:52 -07:00
Andy Davis	a560f2c646	Affine Loop Fusion Utility Module (1/n). ) Adds LoopFusionUtils which will expose a set of loop fusion utilities (e.g. dependence checks, fusion cost/storage reduction, loop fusion transformation) for use by loop fusion algorithms. Support for checking block-level fusion-preventing dependences is added in this CL (additional loop fusion utilities will be added in subsequent CLs). ) Adds TestLoopFusion test pass for testing LoopFusionUtils at a fine granularity. *) Adds unit test for testing dependence check for block-level fusion-preventing dependences. -- PiperOrigin-RevId: 249861071	2019-06-01 20:00:23 -07:00
River Riddle	c33862b0ed	Refactor FunctionAttr to hold the internal function reference by name instead of pointer. The one downside to this is that the function reference held by a FunctionAttr needs to be explicitly looked up from the parent module. This provides several benefits though: * There is no longer a need to explicitly remap function attrs. - This removes a potentially expensive call from the destructor of Function. - This will enable some interprocedural transformations to now run intraprocedurally. - This wasn't scalable and forces dialect defined attributes to override a virtual function. * Replacing a function is now a trivial operation. * This is a necessary first step to representing functions as operations. -- PiperOrigin-RevId: 249510802	2019-06-01 19:56:54 -07:00
River Riddle	8780d8d8eb	Add user iterators to IRObjects, i.e. Values. -- PiperOrigin-RevId: 248877752	2019-05-20 13:47:19 -07:00
River Riddle	3de0c7696b	Rewrite the DialectOpConversion patterns to inherit from RewritePattern instead of Pattern. This simplifies the infrastructure a bit by being able to reuse PatternRewriter and the RewritePatternMatcher, but also starts to lay the groundwork for a more generalized legalization framework that can operate on DialectOpConversions as well as normal RewritePatterns. -- PiperOrigin-RevId: 248836492	2019-05-20 13:47:01 -07:00
River Riddle	eb5ec03960	Refactor PatternRewriter to inherit from FuncBuilder instead of Builder. This is necessary for allowing more complicated rewrites in the future that may do things like update the insertion point (e.g. for rewrites involving regions). -- PiperOrigin-RevId: 248803153	2019-05-20 13:46:26 -07:00
River Riddle	1982afb145	Unify the 'constantFold' and 'fold' hooks on an operation into just 'fold'. This new unified fold hook will take constant attributes as operands, and may return an existing 'Value *' or a constant 'Attribute' when folding. This removes the awkward situation where a simple canonicalization like "sub(x,x)->0" had to be written as a canonicalization pattern as opposed to a fold. -- PiperOrigin-RevId: 248582024	2019-05-20 13:44:24 -07:00
Andy Davis	90d4023c9b	Factor out loop interchange code from LoopFusion into LoopUtils (NFC). -- PiperOrigin-RevId: 247926512	2019-05-20 13:38:12 -07:00
River Riddle	d5b60ee840	Replace Operation::isa with llvm::isa. -- PiperOrigin-RevId: 247789235	2019-05-20 13:37:52 -07:00
River Riddle	adca3c2edc	Replace Operation::cast with llvm::cast. -- PiperOrigin-RevId: 247785983	2019-05-20 13:37:42 -07:00
River Riddle	c5ecf9910a	Add support for using llvm::dyn_cast/cast/isa for operation casts and replace usages of Operation::dyn_cast with llvm::dyn_cast. -- PiperOrigin-RevId: 247780086	2019-05-20 13:37:31 -07:00
MLIR Team	41d90a85bd	Automated rollback of changelist 247778391. PiperOrigin-RevId: 247778691	2019-05-20 13:37:20 -07:00
River Riddle	02e03b9bf4	Add support for using llvm::dyn_cast/cast/isa for operation casts and replace usages of Operation::dyn_cast with llvm::dyn_cast. -- PiperOrigin-RevId: 247778391	2019-05-20 13:37:10 -07:00
Mehdi Amini	91f0781000	Remove extra `;` after function definition (NFC) Fix a GCC warning -- PiperOrigin-RevId: 247670176	2019-05-10 19:29:26 -07:00
River Riddle	983e0eea95	Simplify several usages of attributes now that they always have a type and, transitively, access to the context. This also fixes a bug where FunctionAttrs were not being remapped for function and function argument attributes. -- PiperOrigin-RevId: 246876924	2019-05-10 19:22:41 -07:00
Jacques Pienaar	2fe8ae4f6c	Fix up some mixed sign warnings. -- PiperOrigin-RevId: 246614498	2019-05-06 08:28:20 -07:00
River Riddle	b14c4b4ca8	Add support for basic remark diagnostics. This is the minimal functionality needed to separate notes from remarks. It also provides a starting point to start building out better remark infrastructure. -- PiperOrigin-RevId: 246175216	2019-05-06 08:24:02 -07:00
Feng Liu	5c757087c7	Apply patterns repeatly if the function is modified During the pattern rewrite, if the function is changed, i.e. ops created, deleted or swapped, the pattern rewriter needs to re-scan the function entirely and apply the patterns again, so the patterns whose root ops have been popped out from the working list nor an immediate users of the changed ops can be reconsidered. A command line flag is added to set the max number of iterations rescanning the function for pattern match. If the rewrite doesn' converge after this number, this compiling will continue and the result can be sub-optimal. One unit test is updated because this change fixed the missing optimization opportunities. -- PiperOrigin-RevId: 244754190	2019-04-23 22:02:16 -07:00
MLIR Team	0cd589c337	Create a LoopUtil function to return perfectly nested loop set -- PiperOrigin-RevId: 242019230	2019-04-05 07:42:01 -07:00
River Riddle	a8f4b9eeeb	Iterate on the operations to fold in TestConstantFold in reverse to remove the need for ConstantFoldHelper to have a flag for insertion at the head of the entry block. This also fixes an asan bug in TestConstantFold due to the iteration order of operations and ConstantFoldHelper's constant insertion placement. Note: This now means that we cannot fold chains of operations, i.e. where constant foldable operations feed into each other. Given that this is a testing pass solely for constant folding, this isn't really something that we want anyways. Constant fold tests should be simple and direct, with more advanced folding/feeding being tested with the canonicalizer. -- PiperOrigin-RevId: 242011744	2019-04-05 07:41:52 -07:00
Lei Zhang	4e40c83291	Deduplicate constant folding logic in ConstantFold and GreedyPatternRewriteDriver There are two places containing constant folding logic right now: the ConstantFold pass and the GreedyPatternRewriteDriver. The logic was not shared and started to drift apart. We were testing constant folding logic using the ConstantFold pass, but lagged behind the GreedyPatternRewriteDriver, where we really want the constant folding to happen. This CL pulled the logic into utility functions and classes for sharing between these two places. A new ConstantFoldHelper class is created to help constant fold and de-duplication. Also, renamed the ConstantFold pass to TestConstantFold to make it clear that it is intended for testing purpose. -- PiperOrigin-RevId: 241971681	2019-04-05 07:41:32 -07:00
River Riddle	6fa3181329	Remove the non-postorder walk functions from Function/Block/Instruction and rename walkPostOrder to walk. -- PiperOrigin-RevId: 241965239	2019-04-05 07:41:23 -07:00
River Riddle	213b8d4d3b	Rename InstOperand to OpOperand. PiperOrigin-RevId: 240814651	2019-03-29 17:50:41 -07:00
River Riddle	99b87c9707	Replace usages of Instruction with Operation in the Transforms/ directory. PiperOrigin-RevId: 240636130	2019-03-29 17:47:26 -07:00
Alex Zinenko	5a5bba0279	Introduce affine terminator Due to legacy reasons (ML/CFG function separation), regions in affine control flow operations require contained blocks not to have terminators. This is inconsistent with the notion of the block and may complicate code motion between regions of affine control operations and other regions. Introduce `affine.terminator`, a special terminator operation that must be used to terminate blocks inside affine operations and transfers the control back to he region enclosing the affine operation. For brevity and readability reasons, allow `affine.for` and `affine.if` to omit the `affine.terminator` in their regions when using custom printing and parsing format. The custom parser injects the `affine.terminator` if it is missing so as to always have it present in constructed operations. Update transformations to account for the presence of terminator. In particular, most code motion transformation between loops should leave the terminator in place, and code motion between loops and non-affine blocks should drop the terminator. PiperOrigin-RevId: 240536998	2019-03-29 17:44:24 -07:00
River Riddle	f9d91531df	Replace usages of Instruction with Operation in the /IR directory. This is step 2/N to renaming Instruction to Operation. PiperOrigin-RevId: 240459216	2019-03-29 17:43:37 -07:00
River Riddle	9ffdc930c0	Rename the Instruction class to Operation. This just renames the class, usages of Instruction will still refer to a typedef in the interim. This is step 1/N to renaming Instruction to Operation. PiperOrigin-RevId: 240431520	2019-03-29 17:42:50 -07:00
Chris Lattner	46ade282c8	Make FunctionPass::getFunction() return a reference to the function, instead of a pointer. This makes it consistent with all the other methods in FunctionPass, as well as with ModulePass::getModule(). NFC. PiperOrigin-RevId: 240257910	2019-03-29 17:40:44 -07:00
River Riddle	96ebde9cfd	Replace usages of "Op::operator->" with ".". This is step 2/N of removing the temporary operator-> method as part of the de-const transition. PiperOrigin-RevId: 240200792	2019-03-29 17:40:09 -07:00
River Riddle	5de726f493	Refactor the Pattern framework to allow for combined match/rewrite patterns. This is done by adding a new 'matchAndRewrite' function to RewritePattern that performs the match and rewrite in one step. The default behavior simply calls into the existing 'match' and 'rewrite' functions. The 'PatternMatcher' class has now been specialized for RewritePatterns and has been rewritten to make use of the new matchAndRewrite functionality. This combined match/rewrite functionality allows simplifying the majority of existing RewritePatterns, as they do not benefit from separate match and rewrite functions. Some of the existing canonicalization patterns in StandardOps have been modified to take advantage of this functionality. PiperOrigin-RevId: 240187856	2019-03-29 17:39:35 -07:00
River Riddle	af1abcc80b	Replace usages of "operator->" with "." for the AffineOps. Note: The "operator->" method is a temporary helper for the de-const transition and is gradually being phased out. PiperOrigin-RevId: 240179439	2019-03-29 17:39:19 -07:00
River Riddle	832567b379	NFC: Rename the 'for' operation in the AffineOps dialect to 'affine.for' and set the namespace of the AffineOps dialect to 'affine'. PiperOrigin-RevId: 240165792	2019-03-29 17:39:03 -07:00
Chris Lattner	d9b5bc8f55	Remove OpPointer, cleaning up a ton of code. This also moves Ops to using inherited constructors, which is cleaner and means you can now use DimOp() to get a null op, instead of having to use Instruction::getNull<DimOp>(). This removes another 200 lines of code. PiperOrigin-RevId: 240068113	2019-03-29 17:36:21 -07:00
Chris Lattner	dd2b2ec542	Push a bunch of 'consts' out of the *Op structure, in prep for removing OpPointer. PiperOrigin-RevId: 240044712	2019-03-29 17:35:35 -07:00
Chris Lattner	986310a68f	Remove const from Value, Instruction, Argument, and the various methods on the *Op classes. This is a net reduction by almost 400LOC. PiperOrigin-RevId: 239972443	2019-03-29 17:34:33 -07:00
Nicolas Vasilache	fc5bbdd6c8	Improve comment for `augmentMapAndBounds` Followup from a previous CL. PiperOrigin-RevId: 239591775	2019-03-29 17:27:57 -07:00
Chris Lattner	589df37142	Move to new `const` model, part 1: remove ConstOpPointer. This eliminate ConstOpPointer (but keeps OpPointer for now) by making OpPointer implicitly launder const in a const incorrect way. It will eventually go away entirely, this is a progressive step towards the new const model. PiperOrigin-RevId: 239512640	2019-03-29 17:26:56 -07:00
Nicolas Vasilache	d6c650cfb5	Properly propagate induction variable in tiling This CL fixes an issue where cloned loop induction variables were not properly propagated and beefs up the corresponding test. PiperOrigin-RevId: 239422961	2019-03-29 17:25:53 -07:00
Jacques Pienaar	57270a9a99	Remove some statements that required >C++11, add includes and qualify names. NFC. PiperOrigin-RevId: 239197784	2019-03-29 17:24:53 -07:00
Uday Bondhugula	075090f891	Extend loop unrolling and unroll-jamming to non-matching bound operands and multi-result upper bounds, complete TODOs, fix/improve test cases. - complete TODOs for loop unroll/unroll-and-jam. Something as simple as "for %i = 0 to %N" wasn't being unrolled earlier (unless it had been written as "for %i = ()[s0] -> (0)()[%N] to %N"; addressed now. - update/replace getTripCountExpr with buildTripCountMapAndOperands; makes it more powerful as it composes inputs into it - getCleanupLowerBound and getUnrolledLoopUpperBound actually needed the same code; refactor and remove one. - reorganize test cases, write previous ones better; most of these changes are "label replacements". - fix wrongly labeled test cases in unroll-jam.mlir PiperOrigin-RevId: 238014653	2019-03-29 17:14:12 -07:00
River Riddle	5e1f1d2cab	Update the constantFold/fold API to use LogicalResult instead of bool. PiperOrigin-RevId: 237719658	2019-03-29 17:10:50 -07:00
River Riddle	0310d49f46	Move the success/failure functions out of LogicalResult and into the mlir namespace. PiperOrigin-RevId: 237712180	2019-03-29 17:10:21 -07:00
River Riddle	80d3568c0a	Rename Status to LogicalResult to avoid conflictions with the Status in xla/tensorflow/etc. PiperOrigin-RevId: 237537341	2019-03-29 17:08:50 -07:00
River Riddle	ba6fdc8b01	Move UtilResult into the Support directory and rename it to Status. Status provides an unambiguous way to specify success/failure results. These can be generated by 'Status::success()' and Status::failure()'. Status provides no implicit conversion to bool and should be consumed by one of the following utility functions: * bool succeeded(Status) - Return if the status corresponds to a success value. * bool failed(Status) - Return if the status corresponds to a failure value. PiperOrigin-RevId: 237153884	2019-03-29 17:04:19 -07:00
Nicolas Vasilache	069c818f40	Fix lower/upper bound mismatch in stripmineSink Also beef up the corresponding test case. PiperOrigin-RevId: 236878818	2019-03-29 16:57:21 -07:00
River Riddle	f37651c708	NFC. Move all of the remaining operations left in BuiltinOps to StandardOps. The only thing left in BuiltinOps are the core MLIR types. The standard types can't be moved because they are referenced within the IR directory, e.g. in things like Builder. PiperOrigin-RevId: 236403665	2019-03-29 16:53:35 -07:00
Lei Zhang	85d9b6c8f7	Use consistent names for dialect op source files This CL changes dialect op source files (.h, .cpp, .td) to follow the following convention: <full-dialect-name>/<dialect-namespace>Ops.{h\|cpp\|td} Builtin and standard dialects are specially treated, though. Both of them do not have dialect namespace; the former is still named as BuiltinOps.* and the latter is named as Ops.*. Purely mechanical. NFC. PiperOrigin-RevId: 236371358	2019-03-29 16:53:19 -07:00
Uday Bondhugula	58889884a2	Change some of the debug messages to use emitError / emitWarning / emitNote - NFC PiperOrigin-RevId: 236169676	2019-03-29 16:50:29 -07:00
Nicolas Vasilache	62c54a2ec4	Add a stripmineSink and imperfectly nested tiling primitives. This CL adds a primitive to perform stripmining of a loop by a given factor and sinking it under multiple target loops. In turn this is used to implement imperfectly nested loop tiling (with interchange) by repeatedly calling the stripmineSink primitive. The API returns the point loops and allows repeated invocations of tiling to achieve declarative, multi-level, imperfectly-nested tiling. Note that this CL is only concerned with the mechanical aspects and does not worry about analysis and legality. The API is demonstrated in an example which creates an EDSC block, emits the corresponding MLIR and applies imperfectly-nested tiling: ```cpp auto block = edsc::block({ For(ArrayRef<edsc::Expr>{i, j}, {zero, zero}, {M, N}, {one, one}, { For(k1, zero, O, one, { C({i, j, k1}) = A({i, j, k1}) + B({i, j, k1}) }), For(k2, zero, O, one, { C({i, j, k2}) = A({i, j, k2}) + B({i, j, k2}) }), }), }); // clang-format on emitter.emitStmts(block.getBody()); auto l_i = emitter.getAffineForOp(i), l_j = emitter.getAffineForOp(j), l_k1 = emitter.getAffineForOp(k1), l_k2 = emitter.getAffineForOp(k2); auto indicesL1 = mlir::tile({l_i, l_j}, {512, 1024}, {l_k1, l_k2}); auto l_ii1 = indicesL1[0][0], l_jj1 = indicesL1[1][0]; mlir::tile({l_jj1, l_ii1}, {32, 16}, l_jj1); ``` The edsc::Expr for the induction variables (i, j, k_1, k_2) provide the programmatic hooks from which tiling can be applied declaratively. PiperOrigin-RevId: 235548228	2019-03-29 16:41:20 -07:00
Uday Bondhugula	dfe07b7bf6	Refactor AffineExprFlattener and move FlatAffineConstraints out of IR into Analysis - NFC - refactor AffineExprFlattener (-> SimpleAffineExprFlattener) so that it doesn't depend on FlatAffineConstraints, and so that FlatAffineConstraints could be moved out of IR/; the simplification that the IR needs for AffineExpr's doesn't depend on FlatAffineConstraints - have AffineExprFlattener derive from SimpleAffineExprFlattener to use for all Analysis/Transforms purposes; override addLocalFloorDivId in the derived class - turn addAffineForOpDomain into a method on FlatAffineConstraints - turn AffineForOp::getAsValueMap into an AffineValueMap ctor PiperOrigin-RevId: 235283610	2019-03-29 16:39:32 -07:00
River Riddle	da0ebe0670	Add a generic pattern matcher for matching constant values produced by an operation with zero operands and a single result. PiperOrigin-RevId: 234616691	2019-03-29 16:31:56 -07:00
MLIR Team	8f5f2c765d	LoopFusion: perform a series of loop interchanges to increase the loop depth at which slices of producer loop nests can be fused into constumer loop nests. ) Adds utility to LoopUtils to perform loop interchange of two AffineForOps. ) Adds utility to LoopUtils to sink a loop to a specified depth within a loop nest, using a series of loop interchanges. ) Computes dependences between all loads and stores in the loop nest, and classifies each loop as parallel or sequential. ) Computes loop interchange permutation required to sink sequential loops (and raise parallel loop nests) while preserving relative order among them. ) Checks each dependence against the permutation to make sure that dependences would not be violated by the loop interchange transformation. ) Calls loop interchange in LoopFusion pass on consumer loop nests before fusing in producers, sinking loops with loop carried dependences deeper into the consumer loop nest. *) Adds and updates related unit tests. PiperOrigin-RevId: 234158370	2019-03-29 16:29:26 -07:00
Uday Bondhugula	8b3f841daf	Generate dealloc's for the alloc's of dma-generate. - for the DMA buffers being allocated (and their tags), generate corresponding deallocs - minor related update to replaceAllMemRefUsesWith and PipelineDataTransfer pass Code generation for DMA transfers was being done with the initial simplifying assumption that the alloc's would map to scoped allocations, and so no deallocations would be necessary. Drop this assumption to generalize. Note that even with scoped allocations, unrolling loops that have scoped allocations could create a series of allocations and exhaustion of fast memory. Having a end of lifetime marker like a dealloc in fact allows creating new scopes if necessary when lowering to a backend and still utilize scoped allocation. DMA buffers created by -dma-generate are guaranteed to have either non-overlapping lifetimes or nested lifetimes. PiperOrigin-RevId: 233502632	2019-03-29 16:24:08 -07:00

1 2 3 4 5

225 Commits