llvm-project

Commit Graph

Author	SHA1	Message	Date
River Riddle	76181a7b38	Remove the LowerEDSCTestPass. Most of the tests have been ported to be unit-tests and this pass is problematic in the way it depends on TableGen-generated files. This pass is also non-deterministic during multi-threading and a blocker to turning it on by default. PiperOrigin-RevId: 240889154	2019-03-29 17:53:05 -07:00
River Riddle	909a63d8bf	Tidy up a few comments and error messages related to parsing multi-result operations. PiperOrigin-RevId: 240876306	2019-03-29 17:52:51 -07:00
River Riddle	01140bd137	Change the muli-return syntax for operations. The name of the operation result now contains the number of results that it refers to if the number of results is greater than 1. Example: %call:2 = call @multi_return() : () -> (f32, i32) use(%calltensorflow/mlir#0, %calltensorflow/mlir#1) This cl also adds parser support for uniquely named result values. This means that a test writer can now write something like: %foo, %bar = call @multi_return() : () -> (f32, i32) use(%foo, %bar) Note: The printer will still print the collapsed form. PiperOrigin-RevId: 240860058	2019-03-29 17:51:32 -07:00
MLIR Team	9d9675fc8f	Remove overly conservative check in LoopFusion pass (enables fusion in tutorial example). PiperOrigin-RevId: 240859227	2019-03-29 17:51:16 -07:00
Dimitrios Vytiniotis	79bd6badb2	Remove global LLVM CLI variables from library code Plus move parsing code into the MLIR CPU runner binary. PiperOrigin-RevId: 240786709	2019-03-29 17:50:23 -07:00
River Riddle	af9760fe18	Replace remaining usages of the Instruction class with Operation. PiperOrigin-RevId: 240777521	2019-03-29 17:50:04 -07:00
Nicolas Vasilache	31442a66ef	Cleanup vectorize_1d.mlir test - NFC This CL splits a large monolithic test function into smaller ones that are each CHECK-LABEL'd PiperOrigin-RevId: 240684979	2019-03-29 17:49:45 -07:00
Nicolas Vasilache	4dc7af9da8	Make vectorization aware of loop semantics Now that we have a dependence analysis, we can check that loops are indeed parallel and make vectorization correct. PiperOrigin-RevId: 240682727	2019-03-29 17:49:30 -07:00
River Riddle	21547ace87	Update the multi-threaded pass timing to not assume that total time will be different from user time. PiperOrigin-RevId: 240681618	2019-03-29 17:49:14 -07:00
Jacques Pienaar	8f1e744169	Move test of trait using dialect ops, to dialects of ops. PiperOrigin-RevId: 240680010	2019-03-29 17:48:59 -07:00
Jacques Pienaar	d7e386cea9	Move TF dialect test to dialect. PiperOrigin-RevId: 240646586	2019-03-29 17:48:28 -07:00
Nicolas Vasilache	c3742d20b5	Give the Vectorize pass a virtualVectorSize argument. This CL allows vectorization to be called and configured in other ways than just via command line arguments. This allows triggering vectorization programmatically. PiperOrigin-RevId: 240638208	2019-03-29 17:48:12 -07:00
Lei Zhang	d5524388ab	[TableGen] Change names for Builder* and OperationState* parameters to avoid collision The `Builder*` parameter is unused in both generated build() methods so that we can leave it unnamed. Changed stand-alone parameter build() to take `_tblgen_state` instead of `result` to allow `result` to avoid having name collisions with op operand, attribute, or result. PiperOrigin-RevId: 240637700	2019-03-29 17:47:57 -07:00
River Riddle	3a845be7d1	Add support for multi-threaded pass timing. When multi-threading is enabled in the pass manager the meaning of the display slightly changes. First, a new timing column is added, `User Time`, that displays the total time spent across all threads. Secondly, the `Wall Time` column displays the longest individual time spent amongst all of the threads. This means that the `Wall Time` column will continue to give an indicator on the perceived time, or clock time, whereas the `User Time` will display the total cpu time. Example: $ mlir-opt foo.mlir -experimental-mt-pm -cse -canonicalize -convert-to-llvmir -pass-timing ===-------------------------------------------------------------------------=== ... Pass execution timing report ... ===-------------------------------------------------------------------------=== Total Execution Time: 0.0078 seconds ---User Time--- ---Wall Time--- --- Name --- 0.0175 ( 88.3%) 0.0055 ( 70.4%) Function Pipeline 0.0018 ( 9.3%) 0.0006 ( 8.1%) CSE 0.0013 ( 6.3%) 0.0004 ( 5.8%) (A) DominanceInfo 0.0017 ( 8.7%) 0.0006 ( 7.1%) FunctionVerifier 0.0128 ( 64.6%) 0.0039 ( 50.5%) Canonicalizer 0.0011 ( 5.7%) 0.0004 ( 4.7%) FunctionVerifier 0.0004 ( 2.1%) 0.0004 ( 5.2%) ModuleVerifier 0.0010 ( 5.3%) 0.0010 ( 13.4%) LLVMLowering 0.0009 ( 4.3%) 0.0009 ( 11.0%) ModuleVerifier 0.0198 (100.0%) 0.0078 (100.0%) Total PiperOrigin-RevId: 240636269	2019-03-29 17:47:41 -07:00
Jacques Pienaar	810e95b861	Use dereference instead of implicit conversion for IndexedValue to Value*. Avoids ambiguous constructor error on some compilers. PiperOrigin-RevId: 240606838	2019-03-29 17:45:56 -07:00
Alex Zinenko	e2f9079a71	LLVM IR Conversion: support zero-dimensional memrefs The spec allows zero-dimensional memrefs to exist and treats them essentially as single-element buffers. Unlike single-dimensional memrefs of static shape <1xTy>, zero-dimensional memrefs do not require indices to access the only element they store. Add support of zero-dimensional memrefs to the LLVM IR conversion. In particular, such memrefs are converted into bare pointers, and accesses to them are converted to bare loads and stores, without the overhead of `getelementptr %buffer, 0`. PiperOrigin-RevId: 240579456	2019-03-29 17:45:26 -07:00
Nicolas Vasilache	04b925f1b8	Port api-test::tile_2d to the edsc::Builder API The AST-based EDSCs implementation will be retired soon, this test was missing from the builders API. PiperOrigin-RevId: 240547453	2019-03-29 17:44:40 -07:00
Alex Zinenko	5a5bba0279	Introduce affine terminator Due to legacy reasons (ML/CFG function separation), regions in affine control flow operations require contained blocks not to have terminators. This is inconsistent with the notion of the block and may complicate code motion between regions of affine control operations and other regions. Introduce `affine.terminator`, a special terminator operation that must be used to terminate blocks inside affine operations and transfers the control back to he region enclosing the affine operation. For brevity and readability reasons, allow `affine.for` and `affine.if` to omit the `affine.terminator` in their regions when using custom printing and parsing format. The custom parser injects the `affine.terminator` if it is missing so as to always have it present in constructed operations. Update transformations to account for the presence of terminator. In particular, most code motion transformation between loops should leave the terminator in place, and code motion between loops and non-affine blocks should drop the terminator. PiperOrigin-RevId: 240536998	2019-03-29 17:44:24 -07:00
River Riddle	f9d91531df	Replace usages of Instruction with Operation in the /IR directory. This is step 2/N to renaming Instruction to Operation. PiperOrigin-RevId: 240459216	2019-03-29 17:43:37 -07:00
Feng Liu	c489f50e6f	Add a trait to set the result type by attribute Before this CL, the result type of the pattern match results need to be as same as the first operand type, operand broadcast type or a generic tensor type. This CL adds a new trait to set the result type by attribute. For example, the TFL_ConstOp can use this to set the output type to its value attribute. PiperOrigin-RevId: 240441249	2019-03-29 17:43:06 -07:00
Nicolas Vasilache	8811e284e8	Add an IndexedValue::operator Value* This avoids the need to explicitly convert to a ValueHandle when using an Indexed where a Value* is expected. PiperOrigin-RevId: 240371014	2019-03-29 17:41:17 -07:00
Alex Zinenko	a7215a9032	Allow creating standalone Regions Currently, regions can only be constructed by passing in a `Function` or an `Instruction` pointer referencing the parent object, unlike `Function`s or `Instruction`s themselves that can be created without a parent. It leads to a rather complex flow in operation construction where one has to create the operation first before being able to work with its regions. It may be necessary to work with the regions before the operation is created. In particular, in `build` and `parse` functions that are executed _before_ the operation is created in cases where boilerplate region manipulation is required (for example, inserting the hypothetical default terminator in affine regions). Allow creating standalone regions. Such regions are meant to own a list of blocks and transfer them to other regions on demand. Each instruction stores a fixed number of regions as trailing objects and has ownership of them. This decreases the size of the Instruction object for the common case of instructions without regions. Keep this behavior intact. To allow some flexibility in construction, make OperationState store an owning vector of regions. When the Builder creates an Instruction from OperationState, the bodies of the regions are transferred into the instruction-owned regions to minimize copying. Thus, it becomes possible to fill standalone regions with blocks and move them to an operation when it is constructed, or move blocks from a region to an operation region, e.g., for inlining. PiperOrigin-RevId: 240368183	2019-03-29 17:40:59 -07:00
River Riddle	832567b379	NFC: Rename the 'for' operation in the AffineOps dialect to 'affine.for' and set the namespace of the AffineOps dialect to 'affine'. PiperOrigin-RevId: 240165792	2019-03-29 17:39:03 -07:00
River Riddle	9c6e92360c	NFC: Rename the 'if' operation in the AffineOps dialect to 'affine.if'. PiperOrigin-RevId: 240071154	2019-03-29 17:36:53 -07:00
Chris Lattner	d9b5bc8f55	Remove OpPointer, cleaning up a ton of code. This also moves Ops to using inherited constructors, which is cleaner and means you can now use DimOp() to get a null op, instead of having to use Instruction::getNull<DimOp>(). This removes another 200 lines of code. PiperOrigin-RevId: 240068113	2019-03-29 17:36:21 -07:00
Jacques Pienaar	7ab37aaf02	Fix missing parenthesis around negation. This should probably be changed to instead use the negated form (e.g., get predicate + negate it + get resulting template), but this fixes it locally. PiperOrigin-RevId: 240067116	2019-03-29 17:36:06 -07:00
Nicolas Vasilache	f26c7cd792	Cleanup ValueHandleArray We just need a way to unpack ArrayRef<ValueHandle> to ArrayRef<Value*>. No need to expose this to the user. This reduces the cognitive overhead for the tutorial. PiperOrigin-RevId: 240037425	2019-03-29 17:35:20 -07:00
Chris Lattner	986310a68f	Remove const from Value, Instruction, Argument, and the various methods on the *Op classes. This is a net reduction by almost 400LOC. PiperOrigin-RevId: 239972443	2019-03-29 17:34:33 -07:00
Jacques Pienaar	b236041b93	Return operand_range instead for generated variadic operands accessor. PiperOrigin-RevId: 239959381	2019-03-29 17:34:17 -07:00
Chris Lattner	5246bceee0	Now that ConstOpPointer is gone, we can change the various methods generated by tblgen be non-const. This requires introducing some const_cast's at the moment, but those (and lots more stuff) will disappear in subsequent patches. This significantly simplifies those patches because the various tblgen op emitters get adjusted. PiperOrigin-RevId: 239954566	2019-03-29 17:33:45 -07:00
River Riddle	fdef161592	Remove "<label>" from the llvm basic block CHECK names. PiperOrigin-RevId: 239898185	2019-03-29 17:32:06 -07:00
Jacques Pienaar	5546733ec4	Start elemental type constraint specification modelling. Enable users specifying operand type constraint combinations (e.g., considering multiple operands). Some of these will be refactored (particularly the OpBase change and that should also not be needed to be done by most users), but the focus is more on user side (shown in test). The generated code for this does not take any known facts into account or perform any simplification. Start with 2 primities to specify 1) whether an operand has a specific element type, and 2) whether an operand's element type matches another operands element type. PiperOrigin-RevId: 239875712	2019-03-29 17:31:50 -07:00
Nicolas Vasilache	071ca8da91	Support composition of symbols in AffineApplyOp This CL revisits the composition of AffineApplyOp for the special case where a symbol itself comes from an AffineApplyOp. This is achieved by rewriting such symbols into dims to allow composition to occur mathematically. The implementation is also refactored to improve readability. Rationale for locally rewriting symbols as dims: ================================================ The mathematical composition of AffineMap must always concatenate symbols because it does not have enough information to do otherwise. For example, composing `(d0)[s0] -> (d0 + s0)` with itself must produce `(d0)[s0, s1] -> (d0 + s0 + s1)`. The result is only equivalent to `(d0)[s0] -> (d0 + 2 * s0)` when applied to the same mlir::Value* for both s0 and s1. As a consequence mathematical composition of AffineMap always concatenates symbols. When AffineMaps are used in AffineApplyOp however, they may specify composition via symbols, which is ambiguous mathematically. This corner case is handled by locally rewriting such symbols that come from AffineApplyOp into dims and composing through dims. PiperOrigin-RevId: 239791597	2019-03-29 17:30:59 -07:00
Nicolas Vasilache	e21c101037	Add intrinsics for constants PiperOrigin-RevId: 239596595	2019-03-29 17:28:12 -07:00
Lei Zhang	a09dc8a491	[TableGen] Generate op declaration and definition into different files Previously we emit both op declaration and definition into one file and include it in *Ops.h. That pulls in lots of implementation details in the header file and we cannot hide symbols local to implementation. This CL splits them to provide a cleaner interface. The way how we define custom builders in TableGen is changed accordingly because now we need to distinguish signatures and implementation logic. Some custom builders with complicated logic now can be moved to be implemented in .cpp entirely. PiperOrigin-RevId: 239509594	2019-03-29 17:26:26 -07:00
Nicolas Vasilache	d6c650cfb5	Properly propagate induction variable in tiling This CL fixes an issue where cloned loop induction variables were not properly propagated and beefs up the corresponding test. PiperOrigin-RevId: 239422961	2019-03-29 17:25:53 -07:00
River Riddle	30e68230bd	Add support for a standard TupleType. Though this is a standard type, it merely provides a common mechanism for representing tuples in MLIR. It is up to dialect authors to provides operations for manipulating them, e.g. extract_tuple_element. TupleType has the following form: tuple-type ::= `tuple` `<` (type (`,` type)*)? `>` Example: // Empty tuple. tuple<> // Single element. tuple<i32> // Multi element. tuple<i32, tuple<f32>, i16> PiperOrigin-RevId: 239226021	2019-03-29 17:25:09 -07:00
Jacques Pienaar	57270a9a99	Remove some statements that required >C++11, add includes and qualify names. NFC. PiperOrigin-RevId: 239197784	2019-03-29 17:24:53 -07:00
River Riddle	6d6ff7298a	Add support for parsing true/false inside of a splat tensor literal. PiperOrigin-RevId: 239052061	2019-03-29 17:24:09 -07:00
Nicolas Vasilache	a89d8c0a1a	Port Tablegen'd reference implementation of Add to declarative builders. PiperOrigin-RevId: 238977252	2019-03-29 17:22:36 -07:00
Nicolas Vasilache	3a12bc5041	Remove LOAD/STORE/RETURN boilerplate in declarative builders. This CL introduces a ValueArrayHandle helper to manage the implicit conversion of ArrayRef<ValueHandle> -> ArrayRef<Value> by converting first to ValueArrayHandle. Without this, boilerplate operations that take ArrayRef<Value> cannot be removed easily. This all seems to boil down to decoupling Value from Type. Alternative solutions exist (e.g. MLIR using Value by value everywhere) but they would be very intrusive. This seems to be the lowest impedance change. Intrinsics are also lowercased by popular demand. PiperOrigin-RevId: 238974125	2019-03-29 17:22:20 -07:00
Nicolas Vasilache	f43388e4ce	Port LowerVectorTransfers from EDSC + AST to declarative builders This CL removes the dependency of LowerVectorTransfers on the AST version of EDSCs which will be retired. This exhibited a pretty fundamental staging difference in AST-based vs declarative based emission. Since the delayed creation with an AST was staged, the loop order came into existence after the clipping expressions were computed. This now changes as the loops first need to be created declaratively in fixed order and then the clipping expressions are created. Also, due to lack of staging, coalescing cannot be done on the fly anymore and needs to be done either as a pre-pass (current implementation) or as a local transformation on the generated IR (future work). Tests are updated accordingly. PiperOrigin-RevId: 238971631	2019-03-29 17:22:06 -07:00
River Riddle	076a7350e2	Add an instrumentation for conditionally printing the IR before and after pass execution. This instrumentation can be added directly to the PassManager via 'enableIRPrinting'. mlir-opt exposes access to this instrumentation via the following flags: * print-ir-before=(comma-separated-pass-list) - Print the IR before each of the passes provided within the pass list. * print-ir-before-all - Print the IR before every pass in the pipeline. * print-ir-after=(comma-separated-pass-list) - Print the IR after each of the passes provided within the pass list. * print-ir-after-all - Print the IR after every pass in the pipeline. * print-ir-module-scope - Always print the Module IR, even for non module passes. PiperOrigin-RevId: 238523649	2019-03-29 17:19:57 -07:00
Alex Zinenko	276fae1b0d	Rename BlockList into Region NFC. This is step 1/n to specifying regions as parts of any operation. PiperOrigin-RevId: 238472370	2019-03-29 17:18:04 -07:00
Uday Bondhugula	e1e455f7dd	Change parallelism detection test pass to emit a note - emit a note on the loop being parallel instead of setting a loop attribute - rename the pass -test-detect-parallel (from -detect-parallel) PiperOrigin-RevId: 238122847	2019-03-29 17:16:27 -07:00
Feng Liu	c52a812700	[TableGen] Support nested dag attributes arguments in the result pattern Add support to create a new attribute from multiple attributes. It extended the DagNode class to represent attribute creation dag. It also changed the RewriterGen::emitOpCreate method to support this nested dag emit. An unit test is added. PiperOrigin-RevId: 238090229	2019-03-29 17:15:57 -07:00
Uday Bondhugula	9f2781e8dd	Fix misc bugs / TODOs / other improvements to analysis utils - fix for getConstantBoundOnDimSize: floordiv -> ceildiv for extent - make getConstantBoundOnDimSize also return the identifier upper bound - fix unionBoundingBox to correctly use the divisor and upper bound identified by getConstantBoundOnDimSize - deal with loop step correctly in addAffineForOpDomain (covers most cases now) - fully compose bound map / operands and simplify/canonicalize before adding dim/symbol to FlatAffineConstraints; fixes false positives in -memref-bound-check; add test case there - expose mlir::isTopLevelSymbol from AffineOps PiperOrigin-RevId: 238050395	2019-03-29 17:15:27 -07:00
Uday Bondhugula	075090f891	Extend loop unrolling and unroll-jamming to non-matching bound operands and multi-result upper bounds, complete TODOs, fix/improve test cases. - complete TODOs for loop unroll/unroll-and-jam. Something as simple as "for %i = 0 to %N" wasn't being unrolled earlier (unless it had been written as "for %i = ()[s0] -> (0)()[%N] to %N"; addressed now. - update/replace getTripCountExpr with buildTripCountMapAndOperands; makes it more powerful as it composes inputs into it - getCleanupLowerBound and getUnrolledLoopUpperBound actually needed the same code; refactor and remove one. - reorganize test cases, write previous ones better; most of these changes are "label replacements". - fix wrongly labeled test cases in unroll-jam.mlir PiperOrigin-RevId: 238014653	2019-03-29 17:14:12 -07:00
MLIR Team	8d62a6092f	Clean up some stray mlfunc/cfgfunc leftovers. PiperOrigin-RevId: 237936610	2019-03-29 17:13:26 -07:00
Nicolas Vasilache	dfd904d4a9	Minor changes to the EDSC API NFC This CL makes some minor changes to the declarative builder Helpers: 1. adds lb, ub, step methods to MemRefView to avoid always having to go through std::get + range; 2. drops MemRefView& from IndexedValue which was just creating ownership concerns. Instead, an IndexedValue only needs to keep track of the ValueHandle from which a MemRefView can be constructed on-demand if necessary. PiperOrigin-RevId: 237861493	2019-03-29 17:12:41 -07:00

1 2 3 4 5 ...

540 Commits