llvm-project

Commit Graph

Author	SHA1	Message	Date
Uday Bondhugula	e1e455f7dd	Change parallelism detection test pass to emit a note - emit a note on the loop being parallel instead of setting a loop attribute - rename the pass -test-detect-parallel (from -detect-parallel) PiperOrigin-RevId: 238122847	2019-03-29 17:16:27 -07:00
Uday Bondhugula	a228b7d477	Change getMemoryFootprintBytes emitError to a warning - this is really not a hard error; emit a warning instead (for inability to compute footprint due to the union failing due to unimplemented cases) - remove a misleading warning from LoopFusion.cpp PiperOrigin-RevId: 238118711	2019-03-29 17:16:12 -07:00
Feng Liu	c52a812700	[TableGen] Support nested dag attributes arguments in the result pattern Add support to create a new attribute from multiple attributes. It extended the DagNode class to represent attribute creation dag. It also changed the RewriterGen::emitOpCreate method to support this nested dag emit. An unit test is added. PiperOrigin-RevId: 238090229	2019-03-29 17:15:57 -07:00
River Riddle	6558f80c8d	Refactor pass timing so that it is toggled on the passmanager via 'enableTiming'. This also makes the pipeline view the default display mode. PiperOrigin-RevId: 238079916	2019-03-29 17:15:42 -07:00
Uday Bondhugula	9f2781e8dd	Fix misc bugs / TODOs / other improvements to analysis utils - fix for getConstantBoundOnDimSize: floordiv -> ceildiv for extent - make getConstantBoundOnDimSize also return the identifier upper bound - fix unionBoundingBox to correctly use the divisor and upper bound identified by getConstantBoundOnDimSize - deal with loop step correctly in addAffineForOpDomain (covers most cases now) - fully compose bound map / operands and simplify/canonicalize before adding dim/symbol to FlatAffineConstraints; fixes false positives in -memref-bound-check; add test case there - expose mlir::isTopLevelSymbol from AffineOps PiperOrigin-RevId: 238050395	2019-03-29 17:15:27 -07:00
River Riddle	7eee76b84c	Give the TypeUniquer its own BumpPtrAllocator and a SmartRWMutex to make it thread-safe. This is step 1/N to making the MLIRContext thread-safe. PiperOrigin-RevId: 238037814	2019-03-29 17:15:11 -07:00
River Riddle	739f3ef7ee	NFC: Remove a stray print in mlir::buildTripCountMapAndOperands. PiperOrigin-RevId: 238033349	2019-03-29 17:14:56 -07:00
Lei Zhang	372a3a52b5	[TableGen] Sort OpBase.td attribute kinds and refine some comments This CL sorts attribute kinds in OpBase.td according to a logical order: simple cases ahead of complicated ones. The logic of attribute kinds involved are completely untouched. Comments on AttrConstraint and Attr are revised slightly. PiperOrigin-RevId: 238031275	2019-03-29 17:14:41 -07:00
Lei Zhang	f0998d589b	[TableGen] Add common I<n>Tensor, F<n>Tensor, and I64Attr definitions This CL also changes IntegerAttrBase to use APInt as return value to defer bitwidth handling to API call sites and be consistent with FloatAttrBase. Call sites are adjusted accordingly. PiperOrigin-RevId: 238030614	2019-03-29 17:14:27 -07:00
Uday Bondhugula	075090f891	Extend loop unrolling and unroll-jamming to non-matching bound operands and multi-result upper bounds, complete TODOs, fix/improve test cases. - complete TODOs for loop unroll/unroll-and-jam. Something as simple as "for %i = 0 to %N" wasn't being unrolled earlier (unless it had been written as "for %i = ()[s0] -> (0)()[%N] to %N"; addressed now. - update/replace getTripCountExpr with buildTripCountMapAndOperands; makes it more powerful as it composes inputs into it - getCleanupLowerBound and getUnrolledLoopUpperBound actually needed the same code; refactor and remove one. - reorganize test cases, write previous ones better; most of these changes are "label replacements". - fix wrongly labeled test cases in unroll-jam.mlir PiperOrigin-RevId: 238014653	2019-03-29 17:14:12 -07:00
Alex Zinenko	9abea4a466	Python bindings: provide context managers for the Blocks Expose EDSC block builders as Python context managers, similarly to loop builders. Note that blocks, unlike loops, are addressable and may need to be "declared" without necessarily filling their bodies with instructions. This is the case, for example, when branching to a new block from the existing block. Therefore, creating the block context manager immediately creates the block (unless the manager captures an existing block) by creating and destroying the block builder. With this approach, one can either fill in the block and refer to it later leveraging Python's dynamic variable lookup with BlockContext([indexType]) as b: op(...) # operation inside the block ret() op(...) # operation outside the block (in the function entry block) br(b, [...]) # branching to the block created above or declare the block contexts upfront and enter them on demand bb1 = BlockContext() # empty block created in the surrounding function bb2 = BlockContext() # context cond_br(bb1.handle, [], bb2.handle, []) # branch to blocks from here with bb1: op(...) # operation inside the first block with bb2: op(...) # operation inside the second block with bb1: op(...) # append operation to the first block Additionally, one can create multiple throw-away contexts that append to the same block with BlockContext() as b: op(...) # operation inside the block with BlockContext(appendTo(b)): op(...) # new context appends to the block which has a potential of being extended to control the insertion point of the block at a finer level of granularity. PiperOrigin-RevId: 238005298	2019-03-29 17:13:57 -07:00
Alex Zinenko	b0cc81883c	Python bindings: drop third_party/ in includes Historically, Python bindings were using full path including third_party for most headers but not all of them. This is inconsistent with the rest of MLIR. Drop the prefix path in #include directives. PiperOrigin-RevId: 237999346	2019-03-29 17:13:42 -07:00
MLIR Team	8d62a6092f	Clean up some stray mlfunc/cfgfunc leftovers. PiperOrigin-RevId: 237936610	2019-03-29 17:13:26 -07:00
River Riddle	fde5bcdae7	Add documentation for the pass instrumentation framework to the WritingAPass document. PiperOrigin-RevId: 237919897	2019-03-29 17:13:11 -07:00
River Riddle	59b0839206	NFC: Remove old comment referencing CFG/EXT/ML functions. PiperOrigin-RevId: 237902039	2019-03-29 17:12:56 -07:00
Nicolas Vasilache	dfd904d4a9	Minor changes to the EDSC API NFC This CL makes some minor changes to the declarative builder Helpers: 1. adds lb, ub, step methods to MemRefView to avoid always having to go through std::get + range; 2. drops MemRefView& from IndexedValue which was just creating ownership concerns. Instead, an IndexedValue only needs to keep track of the ValueHandle from which a MemRefView can be constructed on-demand if necessary. PiperOrigin-RevId: 237861493	2019-03-29 17:12:41 -07:00
Lei Zhang	e1595df1af	Allow input and output to have different element types for broadcastable ops TensorFlow comparison ops like tf.Less supports broadcast behavior but the result type have different element types as the input types. Extend broadcastable trait to allow such cases. Added tf.Less to demonstrate it. PiperOrigin-RevId: 237846127	2019-03-29 17:12:26 -07:00
Lei Zhang	7972dcef84	Pull shape broadcast out as a stand-alone utility function So that we can use this function to deduce broadcasted shapes elsewhere. Also added support for unknown dimensions, by following TensorFlow behavior. PiperOrigin-RevId: 237846065	2019-03-29 17:12:11 -07:00
River Riddle	0cc212f2b7	Ensure that pass timing is the last added pass instrumentation. This also updates the PassInstrumentor to iterate in reverse for the "after" hooks. This ensures that the instrumentations run in a stack like fashion. PiperOrigin-RevId: 237840808	2019-03-29 17:11:56 -07:00
River Riddle	dc141c307b	Tidy up some of the pass infrastructure g3doc. * Separate MyAnalysis into MyFunctionAnalysis/MyModuleAnalysis to avoid potential confusion. * Add an example of an inline lambda builder for PassPipelineRegistration. * Clarify the wording on a few of the pass restrictions. PiperOrigin-RevId: 237840325	2019-03-29 17:11:41 -07:00
River Riddle	e46ba31c66	Add a new instrumentation for timing pass and analysis execution. This is made available in mlir-opt via the 'pass-timing' and 'pass-timing-display' flags. The 'pass-timing-display' flag toggles between the different available display modes for the timing results. The current display modes are 'list' and 'pipeline', with 'list' representing the default. Below shows the output for an example mlir-opt command line. mlir-opt foo.mlir -verify-each=false -cse -canonicalize -cse -cse -pass-timing list view (-pass-timing-display=list): * In this mode the results are displayed in a list sorted by total time; with each pass/analysis instance aggregated into one unique result. This mode is similar to the output of 'time-passes' in llvm-opt. ===-------------------------------------------------------------------------=== ... Pass execution timing report ... ===-------------------------------------------------------------------------=== Total Execution Time: 0.0097 seconds (0.0096 wall clock) ---User Time--- --System Time-- --User+System-- ---Wall Time--- --- Name --- 0.0051 ( 58.3%) 0.0001 ( 12.2%) 0.0052 ( 53.8%) 0.0052 ( 53.8%) Canonicalizer 0.0025 ( 29.1%) 0.0005 ( 58.2%) 0.0031 ( 31.9%) 0.0031 ( 32.0%) CSE 0.0011 ( 12.6%) 0.0003 ( 29.7%) 0.0014 ( 14.3%) 0.0014 ( 14.2%) DominanceInfo 0.0087 (100.0%) 0.0009 (100.0%) 0.0097 (100.0%) 0.0096 (100.0%) Total pipeline view (-pass-timing-display=pipeline): * In this mode the results are displayed in a nested pipeline view that mirrors the internal pass pipeline that is being executed in the pass manager. This view is useful for understanding specifically which parts of the pipeline are taking the most time, and can also be used to identify when analyses are being invalidated and recomputed. ===-------------------------------------------------------------------------=== ... Pass execution timing report ... ===-------------------------------------------------------------------------=== Total Execution Time: 0.0082 seconds (0.0081 wall clock) ---User Time--- --System Time-- --User+System-- ---Wall Time--- --- Name --- 0.0042 (100.0%) 0.0039 (100.0%) 0.0082 (100.0%) 0.0081 (100.0%) Function Pipeline 0.0005 ( 11.6%) 0.0008 ( 21.1%) 0.0013 ( 16.1%) 0.0013 ( 16.2%) CSE 0.0002 ( 5.0%) 0.0004 ( 9.3%) 0.0006 ( 7.0%) 0.0006 ( 7.0%) (A) DominanceInfo 0.0026 ( 61.8%) 0.0018 ( 45.6%) 0.0044 ( 54.0%) 0.0044 ( 54.1%) Canonicalizer 0.0005 ( 11.7%) 0.0005 ( 13.0%) 0.0010 ( 12.3%) 0.0010 ( 12.4%) CSE 0.0003 ( 6.1%) 0.0003 ( 8.3%) 0.0006 ( 7.2%) 0.0006 ( 7.1%) (A) DominanceInfo 0.0002 ( 3.8%) 0.0001 ( 2.8%) 0.0003 ( 3.3%) 0.0003 ( 3.3%) CSE 0.0042 (100.0%) 0.0039 (100.0%) 0.0082 (100.0%) 0.0081 (100.0%) Total PiperOrigin-RevId: 237825367	2019-03-29 17:11:25 -07:00
Mehdi Amini	732160eaa5	Move `createConvertToLLVMIRPass()` to its own header matching the target library clients need to link PiperOrigin-RevId: 237723197	2019-03-29 17:11:07 -07:00
River Riddle	5e1f1d2cab	Update the constantFold/fold API to use LogicalResult instead of bool. PiperOrigin-RevId: 237719658	2019-03-29 17:10:50 -07:00
River Riddle	43d0ca8419	NFC: Move the PassExecutor and PassAdaptor classes into PassDetail.h so that they can be referenced throughout lib/Pass. PiperOrigin-RevId: 237712736	2019-03-29 17:10:36 -07:00
River Riddle	0310d49f46	Move the success/failure functions out of LogicalResult and into the mlir namespace. PiperOrigin-RevId: 237712180	2019-03-29 17:10:21 -07:00
River Riddle	2d2b40bce5	Add basic infrastructure for instrumenting pass execution and analysis computation. A virtual class, PassInstrumentation, is provided to allow for different parts of the pass manager infrastructure. The currently available hooks allow for instrumenting: * before/after pass execution * after a pass fails * before/after an analysis is computed After getting this infrastructure in place, we can start providing common developer utilities like pass timing, IR printing after pass execution, etc. PiperOrigin-RevId: 237709692	2019-03-29 17:10:06 -07:00
Nicolas Vasilache	861eb87471	[EDSC] Cleanup declarative builder insertion point with blocks Declarative builders want to provide the same nesting interface for blocks and loops. MLIR on the other hand has different behaviors: 1. when an AffineForOp is created the insertion point does not enter the loop body; 2. when a Block is created, the insertion point does enter the block body. Guard against the second behavior in EDSC to make the interface unsurprising. This also surfaces two places in the eager branch API where I was guarding against this behavior indirectly by creating a new ScopedContext. Instead, uniformize everything to properly reset the insertion point in the unique place that builds the mlir::Block*. PiperOrigin-RevId: 237619513	2019-03-29 17:09:51 -07:00
Jacques Pienaar	497d645337	Delete dead function. Can reintroduce when needed. PiperOrigin-RevId: 237599264	2019-03-29 17:09:35 -07:00
Nicolas Vasilache	0d925c5510	Follow up on custom instruction support. This CL addresses a few post-submit comments: 1. better comments, 2. check number of results before dyn_cast (which is a less common case) 3. test usage for multi-result InstructionHandle PiperOrigin-RevId: 237549333	2019-03-29 17:09:20 -07:00
Nicolas Vasilache	eb19b4eefc	Add support for custom ops in declarative builders. This CL adds support for named custom instructions in declarative builders. To allow this, it introduces a templated `CustomInstruction` class. This CL also splits ValueHandle which can capture only the value in single-valued instructions from InstructionHandle which can capture any instruction but provide no typing and sugaring to extract the potential Value*. PiperOrigin-RevId: 237543222	2019-03-29 17:09:05 -07:00
River Riddle	80d3568c0a	Rename Status to LogicalResult to avoid conflictions with the Status in xla/tensorflow/etc. PiperOrigin-RevId: 237537341	2019-03-29 17:08:50 -07:00
River Riddle	e2c301441e	Don't run verifyOperation in verifyDominance, as it is already run as part of verifyBlock. This caused the verifier to run in exponential time for nested regions. PiperOrigin-RevId: 237519751	2019-03-29 17:08:35 -07:00
Lei Zhang	d6afced006	[TF] Define tf.FusedBatchNormOp in TableGen Also fixed wrong epsilon attribute types for tf.FusedBatchNormOp in test cases. PiperOrigin-RevId: 237514017	2019-03-29 17:08:20 -07:00
Lei Zhang	684cc6e8da	[TableGen] Change to attach the name to DAG operator in result patterns There are two ways that we can attach a name to a DAG node: 1) (Op:$name ...) 2) (Op ...):$name The problem with 2) is that we cannot do it on the outmost DAG node in a tree. Switch from 2) to 1). PiperOrigin-RevId: 237513962	2019-03-29 17:08:05 -07:00
Lei Zhang	18fde7c9d8	[TableGen] Support multiple result patterns This CL added the ability to generate multiple ops using multiple result patterns, with each of them replacing one result of the matched source op. Specifically, the syntax is ``` def : Pattern<(SourceOp ...), [(ResultOp1 ...), (ResultOp2 ...), (ResultOp3 ...)]>; ``` Assuming `SourceOp` has three results. Currently we require that each result op must generate one result, which can be lifted later when use cases arise. To help with cases that certain output is unused and we don't care about it, this CL also introduces a new directive: `verifyUnusedValue`. Checks will be emitted in the `match()` method to make sure if the corresponding output is not unused, `match()` returns with `matchFailure()`. PiperOrigin-RevId: 237513904	2019-03-29 17:07:50 -07:00
Uday Bondhugula	87884ab4b6	Refactor and share common code across addAffineForOpDomain / addSliceBounds PiperOrigin-RevId: 237508755	2019-03-29 17:07:35 -07:00
Lei Zhang	999a0c8736	[TF] Improve verification for integer and floating-point tensor types TensorFlow does not allow integers of random bitwidths. It only accepts 8-, 16-, 32-, and 64-bit integer types. Similarly for floating point types, only half, single, double, and bfloat16 types. PiperOrigin-RevId: 237483913	2019-03-29 17:07:20 -07:00
River Riddle	2c78469a93	Introduce a TypeID class to provide unique identifiers for derived type classes. This removes the need for derived types to define a static typeID field. PiperOrigin-RevId: 237482890	2019-03-29 17:07:06 -07:00
Uday Bondhugula	ce7e59536c	Add a basic model to set tile sizes + some cleanup - compute tile sizes based on a simple model that looks at memory footprints (instead of using the hardcoded default value) - adjust tile sizes to make them factors of trip counts based on an option - update loop fusion CL options to allow setting maximal fusion at pass creation - change an emitError to emitWarning (since it's not a hard error unless the client treats it that way, in which case, it can emit one) $ mlir-opt -debug-only=loop-tile -loop-tile test/Transforms/loop-tiling.mlir test/Transforms/loop-tiling.mlir:81:3: note: using tile sizes [4 4 5 ] for %i = 0 to 256 { for %i0 = 0 to 256 step 4 { for %i1 = 0 to 256 step 4 { for %i2 = 0 to 250 step 5 { for %i3 = #map4(%i0) to #map11(%i0) { for %i4 = #map4(%i1) to #map11(%i1) { for %i5 = #map4(%i2) to #map12(%i2) { %0 = load %arg0[%i3, %i5] : memref<8x8xvector<64xf32>> %1 = load %arg1[%i5, %i4] : memref<8x8xvector<64xf32>> %2 = load %arg2[%i3, %i4] : memref<8x8xvector<64xf32>> %3 = mulf %0, %1 : vector<64xf32> %4 = addf %2, %3 : vector<64xf32> store %4, %arg2[%i3, %i4] : memref<8x8xvector<64xf32>> } } } } } } PiperOrigin-RevId: 237461836	2019-03-29 17:06:51 -07:00
Alex Zinenko	8b4b9b31f1	Python bindings: introduce loop and loop nest contexts Recently, EDSC introduced an eager mode for building IR in different contexts. Introduce Python bindings support for loop and loop nest contexts of EDSC builders. The eager mode is built around the notion of ValueHandle, which is convenience class for delayed initialization and operator overloads. Expose this class and overloads directly. The model of insertion contexts maps naturally to Python context manager mechanism, therefore new bindings are defined bypassing the C APIs. The bindings now provide three new context manager classes: FunctionContext, LoopContext and LoopNestContext. The last two can be used with the `with`-construct in Python to create loop (nests) and obtain handles to the loop induction variables seamlessly: with LoopContext(lhs, rhs, 1) as i: lhs + rhs + i with LoopContext(rhs, rhs + rhs, 2) as j: x = i + j Any statement within the Python context will trigger immediate emission of the corresponding IR constructs into the context owned by the nearest context manager. PiperOrigin-RevId: 237447732	2019-03-29 17:06:36 -07:00
River Riddle	1e55ae19a0	Convert ambiguous bool returns in /Analysis to use Status instead. PiperOrigin-RevId: 237390240	2019-03-29 17:06:21 -07:00
River Riddle	10ddae6d88	Use Status instead of bool in DialectConversion. PiperOrigin-RevId: 237339277	2019-03-29 17:06:06 -07:00
River Riddle	f427bddd06	Update the PassManager infrastructure to return Status instead of bool. PiperOrigin-RevId: 237261205	2019-03-29 17:05:51 -07:00
Uday Bondhugula	b5f7b7fd59	Fix unionBoundingBox bug introduced by cl/237141668 - add test case PiperOrigin-RevId: 237241598	2019-03-29 17:05:36 -07:00
Alex Zinenko	6621f39d19	LLVM IR Dialect conversion: use builder arguments instead of named attributes The first version of TableGen-defined LLVM IR Dialect did not include the mandatory or optional attributes of the operations due to the missing support for some of the relevant attribute types. This support has been recently introduced, along with named attributes as arguments in the TableGen operation definitions. With these changes, LLVM IR Dialect operations now have factory functions accepting (unnamed) attributes and attaching their canonical names. Use these factories instead of manually constructing named attributes in the dialect convreter to avoid hardcoded attribute names in unexpected places. PiperOrigin-RevId: 237237769	2019-03-29 17:05:20 -07:00
Alex Zinenko	b9724e98c2	Cleanups in the LLVM IR Dialect These cleanups reflects some recent changes to the LLVM IR Dialect and the infrastructure that affects it. In particular, add documentation on direct and indirect function calls as well as remove the `call` and `call0` separation. Change the prefix of custom types from `!llvm.type` to `!llvm` so that it matches the IR. Remove the verifier check disallowing conditional branches to the same block with arguments: identical arguments are now supported, and different arguments will be caught later. PiperOrigin-RevId: 237203452	2019-03-29 17:05:05 -07:00
Alex Zinenko	dbaab04a80	TableGen most of the LLVM IR Dialect to LLVM IR conversions The LLVM IR Dialect strives to be close to the original LLVM IR instructions. The conversion from the LLVM IR Dialect to LLVM IR proper is mostly mechanical and can be automated. Implement TableGen support for generating conversions from a concise pattern form in the TableGen definition of the LLVM IR Dialect operations. It is used for all operations except calls and branches. These operations need access to function and block remapping tables and would require significantly more code to generate the conversions from TableGen definitions than the current manually written conversions. This implementation is accompanied by various necessary changes to the TableGen operation definition infrastructure. In particular, operation definitions now contain named accessors to results as well as named accessors to the variadic operand (returning a vector of operands). The base operation support TableGen file now contains a FunctionAttr definition. The TableGen now allows to query the names of the operation results. PiperOrigin-RevId: 237203077	2019-03-29 17:04:50 -07:00
Mehdi Amini	056fc2fd09	Change assert message to mention `nullptr` instead of `sentinel`: this is likely more helpful to the user when it fires PiperOrigin-RevId: 237170067	2019-03-29 17:04:35 -07:00
River Riddle	ba6fdc8b01	Move UtilResult into the Support directory and rename it to Status. Status provides an unambiguous way to specify success/failure results. These can be generated by 'Status::success()' and Status::failure()'. Status provides no implicit conversion to bool and should be consumed by one of the following utility functions: * bool succeeded(Status) - Return if the status corresponds to a success value. * bool failed(Status) - Return if the status corresponds to a failure value. PiperOrigin-RevId: 237153884	2019-03-29 17:04:19 -07:00
River Riddle	157e3cdb19	Add documentation for the new pass infrastructure. PiperOrigin-RevId: 237153501	2019-03-29 17:04:03 -07:00
MLIR Team	11b099c012	Adds offset argument to specified range of ids know to be aligned when calling mergeAndAlignIds (used by FlatAffineConstraints). Supports use case where FlatAffineConstraints::composeMap adds dim identifiers with no SSA values (because the identifiers are the result of an AffineValueMap which is not materialized in the IR and thus has no SSA Value results). PiperOrigin-RevId: 237145506	2019-03-29 17:03:47 -07:00
MLIR Team	1678fd1584	Fix opt build. PiperOrigin-RevId: 237141751	2019-03-29 17:03:32 -07:00
Uday Bondhugula	b8b15c7700	Add FlatAffineConstraints::containsId to avoid using findId when position isn't needed + other cleanup - clean up unionBoundingBox (hoist SmallVector allocations out of loop). PiperOrigin-RevId: 237141668	2019-03-29 17:03:17 -07:00
Nicolas Vasilache	9e425a06f7	Fix an incorrect comment in builder-api-test. Also address post commit cleanups that were missed. PiperOrigin-RevId: 237122077	2019-03-29 17:03:00 -07:00
Nicolas Vasilache	7c0b9e8b62	Add helper classes to declarative builders to help write end-to-end custom ops. This CL adds the same helper classes that exist in the AST form of EDSCs to support a basic indexing notation and emit the proper load and store operations and capture MemRefViews as function arguments. This CL also adds a wrapper class LoopNestBuilder to allow generic rank-agnostic loops over indices. PiperOrigin-RevId: 237113755	2019-03-29 17:02:41 -07:00
Lei Zhang	4fc9b51727	[TableGen] Emit verification code for op results They can be verified using the same logic as operands. PiperOrigin-RevId: 237101461	2019-03-29 17:02:26 -07:00
River Riddle	d43f630de8	NFC: Remove 'Result' from the analysis manager api to better reflect the implementation. There is no distinction between analysis computation and result. PiperOrigin-RevId: 237093101	2019-03-29 17:02:12 -07:00
Dimitrios Vytiniotis	32943f5783	More graceful failure when verifying llvm.noalias. PiperOrigin-RevId: 237081778	2019-03-29 17:01:56 -07:00
River Riddle	1d87b62afe	Add support for preserving specific analyses in the analysis manager. Passes can now preserve specific analyses via 'markAnalysesPreserved'. Example: markAnalysesPreserved<DominanceInfo>(); markAnalysesPreserved<DominanceInfo, PostDominanceInfo>(); PiperOrigin-RevId: 237081454	2019-03-29 17:01:41 -07:00
Nicolas Vasilache	b2fe39977e	Hotfix for unused variable in opt mode PiperOrigin-RevId: 237073601	2019-03-29 17:01:26 -07:00
Dimitrios Vytiniotis	480cc2b063	Using llvm.noalias attribute when generating LLVMIR. PiperOrigin-RevId: 237063104	2019-03-29 17:01:11 -07:00
Nicolas Vasilache	421276e0d5	[EDSC] Hotfix: Avoid implicit OpPointer -> OpType* conversion An implicit OpPointer -> OpType* conversion results in AddressSanitizer triggering a stack-use-after-scope error (which may be a false positive). Avoid using such patterns to make life good again. PiperOrigin-RevId: 237053863	2019-03-29 17:00:56 -07:00
Alex Zinenko	76759395f2	Python bindinds: support functions with attributes and argument attributes Currently, Python bindings provide support for declarting and defining MLIR functions given a list of argument and result types. Extend the support for both function declaration and function definition to handle optional function attributes and function argument attributes. Function attributes are exposed as keyword arguments on function declaration and definition calls. Function argument attributes are exposed through a special object that combines the argument type and its list of attributes. Such objects can be passed instead of bare types into the type declaration and definition calls. They can be constructed from bare types and reused in different declarations. Note that, from the beginning, Python bindings did not pass through C bindings to declare and define functions. This commit keeps the direct interaction between Python and C++. PiperOrigin-RevId: 237047561	2019-03-29 17:00:41 -07:00
Nicolas Vasilache	ee4a80bbd6	Add an eager API version for BR and COND_BR When building unstructured control-flow there is a need to construct mlir::Block* before being able to fill them. This invites goto-style programming. This CL introduces an alternative eager API for BR and COND_BR in which blocks are created eagerly and captured on the fly. This allows reducing the number of calls to `BlockBuilder` from 4 to 2 in the `builder_blocks_eager` test and from 3 to 2 in the `builder_cond_branch_eager` test. PiperOrigin-RevId: 237046114	2019-03-29 17:00:26 -07:00
Nicolas Vasilache	38f1d2d77e	Add support for Branches in edsc::Builder This CL adds support for BranchHandle and BranchBuilder that require a slightly different abstraction since an mlir::Block is not an mlir::Value. This CL also adds support for the BR and COND_BR instructions and the relevant tests. PiperOrigin-RevId: 237034312	2019-03-29 17:00:09 -07:00
Nicolas Vasilache	af6c3f7a63	Start a new implementation for edsc::Builder This CL reworks the design of EDSCs from first principles. It introduces a ValueHandle which can hold either: 1. an eagerly typed, delayed Value* 2. a precomputed Value* A ValueHandle can be manipulated with intrinsic operations a nested within a NestedBuilder. These NestedBuilder are a more idiomatic nested builder abstraction that should feel intuitive to program in C++. Notably, this abstraction does not require an AST to stage the construction of MLIR snippets from C++. Instead, the abstraction makes use of orderings between definition and declaration of ValueHandles and provides a NestedBuilder and a LoopBuilder helper classes to handle those orderings. All instruction creations are meant to use the templated ValueHandle::create<> which directly calls mlir::Builder.create<>. For now the EDSC AST and the builders live side-by-side until the C API is ported. PiperOrigin-RevId: 237030945	2019-03-29 16:59:50 -07:00
Alex Zinenko	95949a0d09	TableGen: allow mixing attributes and operands in the Arguments DAG of Op definition The existing implementation of the Op definition generator assumes and relies on the fact that native Op Attributes appear after its value-based operands in the Arguments list. Furthermore, the same order is used in the generated `build` function for the operation. This is not desirable for some operations with mandatory attributes that would want the attribute to appear upfront for better consistency with their textual representation, for example `cmpi` would prefer the `predicate` attribute to be foremost in the argument list. Introduce support for using attributes and operands in the Arguments DAG in no particular order. This is achieved by maintaining a list of Arguments that point to either the value or the attribute and are used to generate the `build` method. PiperOrigin-RevId: 237002921	2019-03-29 16:59:35 -07:00
MLIR Team	c1ff9e866e	Use FlatAffineConstraints::unionBoundingBox to perform slice bounds union for loop fusion pass (WIP). Adds utility to convert slice bounds to a FlatAffineConstraints representation. Adds utility to FlatAffineConstraints to promote loop IV symbol identifiers to dim identifiers. PiperOrigin-RevId: 236973261	2019-03-29 16:59:21 -07:00
Uday Bondhugula	5836fae8a0	DMA generation CL flag update - allow mem capacity to be overridden by command-line flag - change default fast mem space to 2 PiperOrigin-RevId: 236951598	2019-03-29 16:59:05 -07:00
Uday Bondhugula	7e288e7c19	Add missing run command to fusion test cases - follow up to cl/236882988 PiperOrigin-RevId: 236947383	2019-03-29 16:58:50 -07:00
Uday Bondhugula	02af8c22df	Change Pass:getFunction() to return pointer instead of ref - NFC - change this for consistency - everything else similar takes/returns a Function pointer - the FuncBuilder ctor, Block/Value/Instruction::getFunction(), etc. - saves a whole bunch of &s everywhere PiperOrigin-RevId: 236928761	2019-03-29 16:58:35 -07:00
River Riddle	73e0297d36	Change the TensorFlow attribute prefix from "tf$" to "tf." to match the specification of dialect attributes. This also fixes tblgen generation of dialect attributes that used the sugared name "tf$attr" as c++ identifiers. PiperOrigin-RevId: 236924392	2019-03-29 16:58:20 -07:00
River Riddle	7b0b344c5e	Add a unittests directory for MLIR and start adding tests for OperandStorage from InstructionSupport.h. PiperOrigin-RevId: 236911640	2019-03-29 16:58:05 -07:00
River Riddle	89d42f15a7	NFC: Move OperandStorage into a new header file for instruction support utilities, InstructionSupport.h. PiperOrigin-RevId: 236892857	2019-03-29 16:57:50 -07:00
Uday Bondhugula	b34f8d3c83	Fix and improve detectAsMod - fix for the mod detection - simplify/avoid the mod at construction (if the dividend is already known to be less than the divisor), since the information is available at hand there PiperOrigin-RevId: 236882988	2019-03-29 16:57:36 -07:00
Nicolas Vasilache	069c818f40	Fix lower/upper bound mismatch in stripmineSink Also beef up the corresponding test case. PiperOrigin-RevId: 236878818	2019-03-29 16:57:21 -07:00
River Riddle	2dfefdafea	Fix dialect attribute hooks so that they accept a NamedAttribute instead of an Attribute. PiperOrigin-RevId: 236869321	2019-03-29 16:57:05 -07:00
Alex Zinenko	dd75675080	TableGen: fix builder generation for optional attributes The recently introduced support for generating MLIR Operations with optional attributes did not handle the formatted string emission properly, in particular it did not escape `{` and `}` in calls to `formatv` leading to assertions during TableGen op definition generation. Fix this by splitting out the unncessary braces from the format string. Additionally, fix the emission of the builder argument comment to correctly indicate which attributes are indeed optional and which are not. PiperOrigin-RevId: 236832230	2019-03-29 16:56:50 -07:00
Uday Bondhugula	a77734e185	Make sure that fusion test cases don't have out of bounds accesses - fix out of bounds test case - -memref-bound-check on the test/Transforms/loop-fusion.mlir no longer reports any errors, before or after -loop-fusion is run PiperOrigin-RevId: 236757658	2019-03-29 16:56:35 -07:00
MLIR Team	39a1ddeb1c	Adds loop attribute as a temporary work around to prevent slice fusion of loop nests containing instructions with side effects (the proper solution will be do use memref read/write regions in the future). PiperOrigin-RevId: 236733739	2019-03-29 16:56:20 -07:00
Uday Bondhugula	12b9dece8d	Bug fix for getConstantBoundOnDimSize - this was detected when memref-bound-check was run on the output of the loop-fusion pass - the addition (to represent ceildiv as a floordiv) had to be performed only for the constant term of the constraint - update test cases - memref-bound-check no longer returns an error on the output of this test case PiperOrigin-RevId: 236731137	2019-03-29 16:56:06 -07:00
Dimitrios Vytiniotis	a60ba7d908	Supporting conversion of argument attributes along their types. This fixes a bug: previously, during conversion function argument attributes were neither beings passed through nor converted. This fix extends DialectConversion to allow for simultaneous conversion of the function type and the argument attributes. This was important when lowering MLIR to LLVM where attribute information (e.g. noalias) needs to be preserved in MLIR(LLVMDialect). Longer run it seems reasonable that we want to convert both the function attribute and its type and the argument attributes, but that requires a small refactoring in Function.h to aggregate these three fields in an inner struct, which will require some discussion. PiperOrigin-RevId: 236709409	2019-03-29 16:55:51 -07:00
River Riddle	50efe0fc85	Add a 'verifyPasses' flag to the PassManager that specifies if the IR should be verified after each pass. This also adds a "verify-each" flag to mlir-opt to optionally disable running the verifier after each pass. PiperOrigin-RevId: 236703760	2019-03-29 16:55:35 -07:00
MLIR Team	d42ef78a75	Handle MemRefRegion::compute return value in loop fusion pass (NFC). PiperOrigin-RevId: 236685849	2019-03-29 16:55:20 -07:00
River Riddle	a495f960e0	Introduce the notion of dialect attributes and dependent attributes. A dialect attribute derives its context from a specific dialect, whereas a dependent attribute derives context from what it is attached to. Following this, we now enforce that functions and function arguments may only contain dialect specific attributes. These are generic entities and cannot provide any specific context for a dependent attribute. Dialect attributes are defined as: dialect-namespace `.` attr-name `:` attribute-value Dialects can override any of the following hooks to verify the validity of a given attribute: * verifyFunctionAttribute * verifyFunctionArgAttribute * verifyInstructionAttribute PiperOrigin-RevId: 236507970	2019-03-29 16:55:05 -07:00
River Riddle	485746f524	Implement the initial AnalysisManagement infrastructure, with the introduction of the FunctionAnalysisManager and ModuleAnalysisManager classes. These classes provide analysis computation, caching, and invalidation for a specific IR unit. The invalidation is currently limited to either all or none, i.e. you cannot yet preserve specific analyses. An analysis can be any class, but it must provide the following: * A constructor for a given IR unit. struct MyAnalysis { // Compute this analysis with the provided module. MyAnalysis(Module *module); }; Analyses can be accessed from a Pass by calling either the 'getAnalysisResult<AnalysisT>' or 'getCachedAnalysisResult<AnalysisT>' methods. A FunctionPass may query for a cached analysis on the parent module with 'getCachedModuleAnalysisResult'. Similary, a ModulePass may query an analysis, it doesn't need to be cached, on a child function with 'getFunctionAnalysisResult'. By default, when running a pass all cached analyses are set to be invalidated. If no transformation was performed, a pass can use the method 'markAllAnalysesPreserved' to preserve all analysis results. As noted above, preserving specific analyses is not yet supported. PiperOrigin-RevId: 236505642	2019-03-29 16:54:50 -07:00
Mehdi Amini	c1b02a17be	Add an assertion on the builder to ensure that a block is set before creating an operation This is more friendly for the user than a raw segfault PiperOrigin-RevId: 236504102	2019-03-29 16:54:35 -07:00
River Riddle	eeeef090ef	Set the namespace of the StandardOps dialect to "std", but add a special case to the parser to allow parsing standard operations without the "std" prefix. This will now allow for the standard dialect to be looked up dynamically by name. PiperOrigin-RevId: 236493865	2019-03-29 16:54:20 -07:00
Uday Bondhugula	eee85361bb	Remove hidden flag from fusion CL options PiperOrigin-RevId: 236409185	2019-03-29 16:54:05 -07:00
Uday Bondhugula	62e3e2c57c	Update addSliceBounds to deal with loops with floor's/mod's. - This change only impacts the cost model for fusion, given the way addSliceBounds was being used. It so happens that the output in spite of this CL's fix is the same; however, the assertions added no longer fail. (an invalid/inconsistent memref region was being used earlier). PiperOrigin-RevId: 236405030	2019-03-29 16:53:50 -07:00
River Riddle	f37651c708	NFC. Move all of the remaining operations left in BuiltinOps to StandardOps. The only thing left in BuiltinOps are the core MLIR types. The standard types can't be moved because they are referenced within the IR directory, e.g. in things like Builder. PiperOrigin-RevId: 236403665	2019-03-29 16:53:35 -07:00
Lei Zhang	85d9b6c8f7	Use consistent names for dialect op source files This CL changes dialect op source files (.h, .cpp, .td) to follow the following convention: <full-dialect-name>/<dialect-namespace>Ops.{h\|cpp\|td} Builtin and standard dialects are specially treated, though. Both of them do not have dialect namespace; the former is still named as BuiltinOps.* and the latter is named as Ops.*. Purely mechanical. NFC. PiperOrigin-RevId: 236371358	2019-03-29 16:53:19 -07:00
Uday Bondhugula	8254aabd4a	A simple pass to detect and mark all parallel loops - detect all parallel loops based on dep information and mark them with a "parallel" attribute - add mlir::isLoopParallel(OpPointer<AffineForOp> ...), and refactor an existing method to use that (reuse some code from @andydavis (cl/236007073) for this) - a simple/meaningful way to test memref dep test as well Ex: $ mlir-opt -detect-parallel test/Transforms/parallelism-detection.mlir #map1 = ()[s0] -> (s0) func @foo(%arg0: index) { %0 = alloc() : memref<1024x1024xvector<64xf32>> %1 = alloc() : memref<1024x1024xvector<64xf32>> %2 = alloc() : memref<1024x1024xvector<64xf32>> for %i0 = 0 to %arg0 { for %i1 = 0 to %arg0 { for %i2 = 0 to %arg0 { %3 = load %0[%i0, %i2] : memref<1024x1024xvector<64xf32>> %4 = load %1[%i2, %i1] : memref<1024x1024xvector<64xf32>> %5 = load %2[%i0, %i1] : memref<1024x1024xvector<64xf32>> %6 = mulf %3, %4 : vector<64xf32> %7 = addf %5, %6 : vector<64xf32> store %7, %2[%i0, %i1] : memref<1024x1024xvector<64xf32>> } {parallel: false} } {parallel: true} } {parallel: true} return } PiperOrigin-RevId: 236367368	2019-03-29 16:53:03 -07:00
MLIR Team	d038e34735	Loop fusion for input reuse. ) Breaks fusion pass into multiple sub passes over nodes in data dependence graph: - first pass fuses single-use producers into their unique consumer. - second pass enables fusing for input-reuse by fusing sibling nodes which read from the same memref, but which do not share dependence edges. - third pass fuses remaining producers into their consumers (Note that the sibling fusion pass may have transformed a producer with multiple uses into a single-use producer). ) Fusion for input reuse is enabled by computing a sibling node slice using the load/load accesses to the same memref, and fusion safety is guaranteed by checking that the sibling node memref write region (to a different memref) is preserved. ) Enables output vector and output matrix computations from KFAC patches-second-moment operation to fuse into a single loop nest and reuse input from the image patches operation. ) Adds a generic loop utilitiy for finding all sequential loops in a loop nest. *) Adds and updates unit tests. PiperOrigin-RevId: 236350987	2019-03-29 16:52:35 -07:00
River Riddle	269c872ee8	Add support for parsing and printing affine.if and affine.for attributes. The attribute dictionaries are printed after the final block list for both operations: for %i = 0 to 10 { ... } {some_attr: true} if () : () () { ... } {some_attr: true} if () : () () { ... } else { ... } {some_attr: true} PiperOrigin-RevId: 236346983	2019-03-29 16:52:19 -07:00
Uday Bondhugula	932e4fb29f	Analysis support for floordiv/mod's in loop bounds/ - handle floordiv/mod's in loop bounds for all analysis purposes - allows fusion slicing to be more powerful - add simple test cases based on -memref-bound-check - fusion based test cases in follow up CLs PiperOrigin-RevId: 236328551	2019-03-29 16:52:04 -07:00
Uday Bondhugula	6ef5fc582e	Method to align/merge dimensional/symbolic identifiers between two FlatAffineConstraints - add a method to merge and align the spaces (identifiers) of two FlatAffineConstraints (both get dimension-wise and symbol-wise unique columns) - this completes several TODOs, gets rid of previous assumptions/restrictions in composeMap, unionBoundingBox, and reuses common code - remove previous workarounds / duplicated funcitonality in FlatAffineConstraints::composeMap and unionBoundingBox, use mergeAlignIds from both PiperOrigin-RevId: 236320581	2019-03-29 16:51:47 -07:00
Alex Zinenko	4bd5d28391	EDSC bindings: expose generic Op construction interface EDSC Expressions can now be used to build arbitrary MLIR operations identified by their canonical name, i.e. the name obtained from `OpClass::getOperationName()` for registered operations. Expose this functionality to the C API and Python bindings. This exposes builder-level interface to Python and avoids the need for experimental Python code to implement EDSC free function calls for constructing each op type. This modification required exposing mlir::Attribute to the C API and Python bindings, which only supports integer attributes for now. This is step 4/n to making EDSCs more generalizable. PiperOrigin-RevId: 236306776	2019-03-29 16:51:32 -07:00
Uday Bondhugula	229bd9ba21	Use Instruction::isBeforeInBlock instead of a linear scan - use Instruction::isBeforeInBlock instead of a linear scan in AffineAnalysis.cpp PiperOrigin-RevId: 236235824	2019-03-29 16:51:16 -07:00
River Riddle	ddc6788cc7	Provide a Builder::getNamedAttr and (Instruction\|Function)::setAttr(StringRef, Attribute) to simplify attribute manipulation. PiperOrigin-RevId: 236222504	2019-03-29 16:50:59 -07:00
River Riddle	ed5fe2098b	Remove PassResult and have the runOnFunction/runOnModule functions return void instead. To signal a pass failure, passes should now invoke the 'signalPassFailure' method. This provides the equivalent functionality when needed, but isn't an intrusive part of the API like PassResult. PiperOrigin-RevId: 236202029	2019-03-29 16:50:44 -07:00
Uday Bondhugula	58889884a2	Change some of the debug messages to use emitError / emitWarning / emitNote - NFC PiperOrigin-RevId: 236169676	2019-03-29 16:50:29 -07:00
River Riddle	db1757f858	Add support for named function argument attributes. The attribute dictionary is printed after the argument type: func @arg_attrs(i32 {arg_attr: 10}) func @arg_attrs(%arg0: i32 {arg_attr: 10}) PiperOrigin-RevId: 236136830	2019-03-29 16:50:15 -07:00
Alex Zinenko	8cc50208a6	LLVM IR Dialect: unify call and call0 operations When the LLVM IR dialect was implemented, TableGen operation definition scheme did not support operations with variadic results. Therefore, the `call` instruction was split into `call` and `call0` for the single- and zero-result calls (LLVM does not support multi-result operations). Unify `call` and `call0` using the recently added TableGen support for operations with Variadic results. Explicitly verify that the new operation has 0 or 1 results. As a side effect, this change enables clean-ups in the conversion to the LLVM IR dialect that no longer needs to rely on wrapped LLVM IR void types when constructing zero-result calls. PiperOrigin-RevId: 236119197	2019-03-29 16:49:59 -07:00
Alex Zinenko	d9cc3c31cc	ExecutionEngine OptUtils: support -On flags in string-based initialization Original implementation of OutUtils provided two different LLVM IR module transformers to be used with the MLIR ExecutionEngine: OptimizingTransformer parameterized by the optimization levels (similar to -O3 flags) and LLVMPassesTransformer parameterized by the string formatted similarly to command line options of LLVM's "opt" tool without support for -O* flags. Introduce such support by declaring the flags inside the parser and by populating the pass managers similarly to what "opt" does. Remove the additional flags from mlir-cpu-runner as they can now be wrapped into `-llvm-opts` together with other LLVM-related flags. PiperOrigin-RevId: 236107292	2019-03-29 16:49:44 -07:00
River Riddle	0f8c3f4071	When parsing, check that a region operation is not referencing any of the entry arguments to its block lists. PiperOrigin-RevId: 236030438	2019-03-29 16:49:29 -07:00
River Riddle	300e4126c5	Move the PassExecutor and ModuleToFunctionPassAdaptor classes from PassManager.h to Pass.cpp. This allows for us to remove a dependency on Pass.h from PassManager.h. PiperOrigin-RevId: 236029339	2019-03-29 16:49:15 -07:00
River Riddle	303b768579	Add a generic getValue to ElementsAttr for accessing a value at a given index. PiperOrigin-RevId: 236013669	2019-03-29 16:48:59 -07:00
River Riddle	1c1767621c	Remove the stubs for getValue from DenseIntElementsAttr and DenseFPElementsAttr as they aren't implemented. The type for the index is also wrong. PiperOrigin-RevId: 236010720	2019-03-29 16:48:44 -07:00
River Riddle	091ff3dc3f	Add support for registering pass pipelines to the PassRegistry. This is done by providing a static registration facility PassPipelineRegistration that works similarly to PassRegistration except for it also takes a function that will add necessary passes to a provided PassManager. void pipelineBuilder(PassManager &pm) { pm.addPass(new MyPass()); pm.addPass(new MyOtherPass()); } static PassPipelineRegistration Unused("unused", "Unused pass", pipelineBuilder); This is also useful for registering specializations of existing passes: Pass *createFooPass10() { return new FooPass(10); } static PassPipelineRegistration Unused("unused", "Unused pass", createFooPass10); PiperOrigin-RevId: 235996282	2019-03-29 16:48:29 -07:00
Jacques Pienaar	e31c23853b	Fix incorrect line split in header guard. PiperOrigin-RevId: 235994785	2019-03-29 16:48:14 -07:00
Uday Bondhugula	a003179367	Detect more trivially redundant constraints better - detect more trivially redundant constraints in FlatAffineConstraints::removeTrivialRedundantConstraints. Redundancy due to constraints that only differ in the constant part (eg., 32i + 64j - 3 >= 0, 32 + 64j - 8 >= 0) is now detected. The method is still linear-time and does a single scan over the FlatAffineConstraints buffer. This detection is useful and needed to eliminate redundant constraints generated after FM elimination. - update GCDTightenInequalities so that we also normalize by the GCD while at it. This way more constraints will show up as redundant (232i - 203 >= 0 becomes i - 1 >= 0 instead of 232i - 232 >= 0) without having to call normalizeConstraintsByGCD. - In FourierMotzkinEliminate, call GCDTightenInequalities and normalizeConstraintsByGCD before calling removeTrivialRedundantConstraints() - so that more redundant constraints are detected. As a result, redundancy due to constraints like i - 5 >= 0, i - 7 >= 0, 2i - 5 >= 0, 232i - 203 >= 0 is now detected (here only i >= 7 is non-redundant). As a result of these, a -memref-bound-check on the added test case runs in 16ms instead of 1.35s (opt build) and no longer returns a conservative result. PiperOrigin-RevId: 235983550	2019-03-29 16:47:59 -07:00
MLIR Team	c2766f3760	Fix bug in memref region computation with slice loop bounds. Adds loop IV values to ComputationSliceState which are used in FlatAffineConstraints::addSliceBounds, to ensure that constraints are only added for loop IV values which are present in the constraint system. PiperOrigin-RevId: 235952912	2019-03-29 16:47:29 -07:00
River Riddle	c6c534493d	Port all of the existing passes over to the new pass manager infrastructure. This is largely NFC. PiperOrigin-RevId: 235952357	2019-03-29 16:47:14 -07:00
River Riddle	6067cdebaa	Implement the initial pass management functionality. The definitions of derived passes have now changed and passes must adhere to the following: * Inherit from a CRTP base class FunctionPass/ModulePass. - This class provides several necessary utilities for the transformation: . Access to the IR unit being transformed (getFunction/getModule) . Various utilities for pass identification and registration. * Provide a 'PassResult runOn(Function\|Module)()' method to transform the IR. - This replaces the runOn* functions from before. This patch also introduces the notion of the PassManager. This allows for simplified construction of pass pipelines and acts as the sole interface for executing passes. This is important as FunctionPass will no longer have a 'runOnModule' method. PiperOrigin-RevId: 235952008	2019-03-29 16:46:59 -07:00
Lei Zhang	493d46067b	[TableGen] Use result names in build() methods if possible This will make it clear which result's type we are expecting in the build() signature. PiperOrigin-RevId: 235925706	2019-03-29 16:46:41 -07:00
Lei Zhang	9e18783e41	[TableGen] Add more scalar integer and floating-point types PiperOrigin-RevId: 235918286	2019-03-29 16:46:26 -07:00
Alex Zinenko	486dde42c0	EDSC: move FileCheck tests into the source file EDSC provide APIs for constructing and modifying the IR. These APIs are currently tested by a "test" module pass that reads the dummy IR (empty functions), recognizes certain function names and injects the IR into those functions based on their name. This situation is unsatisfactory because the expected outcome of the test lives in a different file than the input to the test, i.e. the API calls. Create a new binary for tests that constructs the IR from scratch using EDSC APIs and prints it. Put FileCheck comments next to the printing. This removes the need to have a file with dummy inputs and assert on its contents in the test driver. The test source includes a simplistic test harness that runs all functions marked as TEST_FUNC but intentionally does not include any value-testing functionality. PiperOrigin-RevId: 235886629	2019-03-29 16:46:10 -07:00
Ben Vanik	d3918fc8cd	Adding an IREE type kind range definition. PiperOrigin-RevId: 235849609	2019-03-29 16:45:55 -07:00
River Riddle	302fb03961	Add a new class NamedAttributeList to deduplicate named attribute handling between Function and Instruction. PiperOrigin-RevId: 235830304	2019-03-29 16:45:40 -07:00
Uday Bondhugula	7aa60a383f	Temp change in FlatAffineConstraints::getSliceBounds() to deal with TODO in LoopFusion - getConstDifference in LoopFusion is pending a refactoring to handle bounds with min's and max's; it currently asserts on some useful test cases that we want to experiment with. This CL changes getSliceBounds to be more conservative so as to not trigger the assertion. Filed b/126426796 to track this. PiperOrigin-RevId: 235826538	2019-03-29 16:45:23 -07:00
River Riddle	03913698a8	Allow function names to have a leading underscore. This matches what is already defined in the spec, but not supported in the implementation. PiperOrigin-RevId: 235823663	2019-03-29 16:45:08 -07:00
River Riddle	3b3e11da93	Validate the names of attribute, dialect, and functions during verification. This essentially enforces the parsing rules upon their names. PiperOrigin-RevId: 235818842	2019-03-29 16:44:53 -07:00
Uday Bondhugula	d4b3ff1096	Loop fusion comand line options cleanup - clean up loop fusion CL options for promoting local buffers to fast memory space - add parameters to loop fusion pass instantiation PiperOrigin-RevId: 235813419	2019-03-29 16:44:38 -07:00
River Riddle	2d4b0e2c00	Add parser support for internal named attributes. These are attributes with names starting with ':'. PiperOrigin-RevId: 235774810	2019-03-29 16:44:22 -07:00
Lei Zhang	bac3eece66	[TableGen] Fix using rewrite()'s qualified name for a bound argument in match() PiperOrigin-RevId: 235767304	2019-03-29 16:44:05 -07:00
River Riddle	79944e5eef	Add a Function::isExternal utility to simplify checks for external functions. PiperOrigin-RevId: 235746553	2019-03-29 16:43:50 -07:00
River Riddle	cdbfd48471	Rewrite the dominance info classes to allow for operating on arbitrary control flow within operation regions. The CSE pass is also updated to properly handle nested dominance. PiperOrigin-RevId: 235742627	2019-03-29 16:43:35 -07:00
Dimitrios Vytiniotis	41c37c6246	Unboxing for static memrefs. When lowering to MLIR(LLVMDialect) we unbox the structs that result from converting static memrefs, that is, singleton structs that just contain a raw pointer. This allows us to get rid of all "extractvalue" instructions in the common case where shapes are fully known. PiperOrigin-RevId: 235706021	2019-03-29 16:43:20 -07:00
Alex Zinenko	1da1b4c321	LLVM IR dialect and translation: support conditional branches with arguments Since the goal of the LLVM IR dialect is to reflect LLVM IR in MLIR, the dialect and the conversion procedure must account for the differences betweeen block arguments and LLVM IR PHI nodes. In particular, LLVM IR disallows PHI nodes with different values coming from the same source. Therefore, the LLVM IR dialect now disallows `cond_br` operations that have identical successors accepting arguments, which would lead to invalid PHI nodes. The conversion process resolves the potential PHI source ambiguity by injecting dummy blocks if the same block is used more than once as a successor in an instruction. These dummy blocks branch unconditionally to the original successors, pass them the original operands (available in the dummy block because it is dominated by the original block) and are used instead of them in the original terminator operation. PiperOrigin-RevId: 235682798	2019-03-29 16:43:05 -07:00
Alex Zinenko	970715be9c	Update LLVM Dialect documentation Addressing post-submit comments. The `getelementptr` operation now supports non-constant indexes, similarly to LLVM, and this functionality is exercised by the lowering to the dialect. Update the documentation accordingly. List the values of integer comparison predicates, which currently correspond to those of CmpIOp in MLIR. Ideally, we would use strings instead, but it requires additional support for argument conversion in both the dialect lowering pass and the LLVM translator. PiperOrigin-RevId: 235678877	2019-03-29 16:42:50 -07:00
Smit Hinsu	fd3c2d156f	Verify IR produced by TranslateToMLIR functions TESTED with existing unit tests PiperOrigin-RevId: 235623059	2019-03-29 16:42:35 -07:00
Uday Bondhugula	b269481106	Cleanup post cl/235283610 - NFC - remove stale comments + cleanup - drop MLIRContext * field from expr flattener PiperOrigin-RevId: 235621178	2019-03-29 16:42:20 -07:00
River Riddle	b4f033f6c6	Convert the dialect type parse/print hooks into virtual functions on the Dialect class. PiperOrigin-RevId: 235589945	2019-03-29 16:42:05 -07:00
River Riddle	f1f86eac60	Add support for constructing DenseIntElementsAttr with an array of APInt and DenseFPElementsAttr with an array of APFloat. PiperOrigin-RevId: 235581794	2019-03-29 16:41:50 -07:00
Lei Zhang	3f644705eb	[TableGen] Use ArrayRef instead of SmallVectorImpl for suitable method PiperOrigin-RevId: 235577399	2019-03-29 16:41:35 -07:00
Nicolas Vasilache	62c54a2ec4	Add a stripmineSink and imperfectly nested tiling primitives. This CL adds a primitive to perform stripmining of a loop by a given factor and sinking it under multiple target loops. In turn this is used to implement imperfectly nested loop tiling (with interchange) by repeatedly calling the stripmineSink primitive. The API returns the point loops and allows repeated invocations of tiling to achieve declarative, multi-level, imperfectly-nested tiling. Note that this CL is only concerned with the mechanical aspects and does not worry about analysis and legality. The API is demonstrated in an example which creates an EDSC block, emits the corresponding MLIR and applies imperfectly-nested tiling: ```cpp auto block = edsc::block({ For(ArrayRef<edsc::Expr>{i, j}, {zero, zero}, {M, N}, {one, one}, { For(k1, zero, O, one, { C({i, j, k1}) = A({i, j, k1}) + B({i, j, k1}) }), For(k2, zero, O, one, { C({i, j, k2}) = A({i, j, k2}) + B({i, j, k2}) }), }), }); // clang-format on emitter.emitStmts(block.getBody()); auto l_i = emitter.getAffineForOp(i), l_j = emitter.getAffineForOp(j), l_k1 = emitter.getAffineForOp(k1), l_k2 = emitter.getAffineForOp(k2); auto indicesL1 = mlir::tile({l_i, l_j}, {512, 1024}, {l_k1, l_k2}); auto l_ii1 = indicesL1[0][0], l_jj1 = indicesL1[1][0]; mlir::tile({l_jj1, l_ii1}, {32, 16}, l_jj1); ``` The edsc::Expr for the induction variables (i, j, k_1, k_2) provide the programmatic hooks from which tiling can be applied declaratively. PiperOrigin-RevId: 235548228	2019-03-29 16:41:20 -07:00
Alex Zinenko	e7193a70f8	EDSC: support conditional branch instructions Leverage the recently introduced support for multiple argument groups and multiple destination blocks in EDSC Expressions to implement conditional branches in EDSC. Conditional branches have two successors and three argument groups. The first group contains a single expression of i1 type that corresponds to the condition of the branch. The two following groups contain arguments of the two successors of the conditional branch instruction, in the same order as the successors. Expose this instruction to the C API and Python bindings. PiperOrigin-RevId: 235542768	2019-03-29 16:41:05 -07:00
Alex Zinenko	83e8db2193	EDSC: support branch instructions The new implementation of blocks was designed to support blocks with arguments. More specifically, StmtBlock can be constructed with a list of Bindables that will be bound to block aguments upon construction. Leverage this functionality to implement branch instructions with arguments. This additionally requires the statement storage to have a list of successors, similarly to core IR operations. Becauase successor chains can form loops, we need a possibility to decouple block declaration, after which it becomes usable by branch instructions, from block body definition. This is achieved by creating an empty block and by resetting its body with a new list of instructions. Note that assigning a block from another block will not affect any instructions that may have designated this block as their successor (this behavior is necessary to make value-type semantics of EDSC types consistent). Combined, one can now write generators like EDSCContext context; Type indexType = ...; Bindable i(indexType), ii(indexType), zero(indexType), one(indexType); StmtBlock loopBlock({i}, {}); loopBlock.set({ii = i + one, Branch(loopBlock, {ii})}); MLIREmitter(&builder) .bindConstant<ConstantIndexOp>(zero, 0) .bindConstant<ConstantIndexOp>(one, 1) .emitStmt(Branch(loopBlock, {zero})); where the emitter will emit the statement and its successors, if present. PiperOrigin-RevId: 235541892	2019-03-29 16:40:50 -07:00
Tatiana Shpeisman	8b99d1bdbf	Use dialect hook registration for constant folding hook. Deletes specialized mechanism for registering constant folding hook and uses dialect hooks registration mechanism instead. PiperOrigin-RevId: 235535410	2019-03-29 16:40:35 -07:00
River Riddle	a51d21538c	Add constant folding for ExtractElementOp when the aggregate is an OpaqueElementsAttr. PiperOrigin-RevId: 235533283	2019-03-29 16:40:20 -07:00
Alex Zinenko	ec76f9c8c1	EDSC printing: handle integer attributes with bitwidth > 64 This came up in post-submit review. Use LLVM's support for outputting APInt values directly instead of obtaining a 64-bit integer value from APInt, which will not work for wider integers. PiperOrigin-RevId: 235531574	2019-03-29 16:40:05 -07:00
Lei Zhang	4887e45546	[TableGen] Fix infinite loop in SubstLeaves substitution Previously we have `auto pos = std::string::find(...) != std::string::npos` as if condition to control substring substitution. Instead of the position for the found substring, `pos` will be a boolean value indicating found nor not. Then used as the replace start position, we were always replacing starting from 0 or 1. If the replaced substring also has the pattern to be matched, we'll see an infinite loop. PiperOrigin-RevId: 235504681	2019-03-29 16:39:47 -07:00
Uday Bondhugula	dfe07b7bf6	Refactor AffineExprFlattener and move FlatAffineConstraints out of IR into Analysis - NFC - refactor AffineExprFlattener (-> SimpleAffineExprFlattener) so that it doesn't depend on FlatAffineConstraints, and so that FlatAffineConstraints could be moved out of IR/; the simplification that the IR needs for AffineExpr's doesn't depend on FlatAffineConstraints - have AffineExprFlattener derive from SimpleAffineExprFlattener to use for all Analysis/Transforms purposes; override addLocalFloorDivId in the derived class - turn addAffineForOpDomain into a method on FlatAffineConstraints - turn AffineForOp::getAsValueMap into an AffineValueMap ctor PiperOrigin-RevId: 235283610	2019-03-29 16:39:32 -07:00
Stella Laurenzo	c81b16e279	Spike to define real math ops and lowering of one variant of add to corresponding integer ops. The only reason in starting with a fixedpoint add is that it is the absolute simplest variant and illustrates the level of abstraction I'm aiming for. The overall flow would be: 1. Determine quantization parameters (out of scope of this cl). 2. Source dialect rules to lower supported math ops to the quantization dialect (out of scope of this cl). 3. Quantization passes: [-quant-convert-const, -quant-lower-uniform-real-math, -quant-lower-unsupported-to-float] (the last one not implemented yet) 4. Target specific lowering of the integral arithmetic ops (roughly at the level of gemmlowp) to more fundamental operations (i.e. calls to gemmlowp, simd instructions, DSP instructions, etc). How I'm doing this should facilitate implementation of just about any kind of backend except TFLite, which has a very course, adhoc surface area for its quantized kernels. Options there include (I'm not taking an opinion on this - just trying to provide options): a) Not using any of this: just match q/dbarrier + tf math ops to the supported TFLite quantized op set. b) Implement the more fundamental integer math ops on TFLite and convert to those instead of the current op set. Note that I've hand-waved over the process of choosing appropriate quantization parameters. Getting to that next. As you can see, different implementations will likely have different magic combinations of specific math support, and we will need the target system that has been discussed for some of the esoteric cases (i.e. many DSPs only support POT fixedpoint). Two unrelated changes to the overall goal of this CL and can be broken out of desired: - Adding optional attribute support to TabelGen - Allowing TableGen native rewrite hooks to return nullptr, signalling that no rewrite has been done. PiperOrigin-RevId: 235267229	2019-03-29 16:39:13 -07:00
River Riddle	f48716146e	NFC: Make DialectConversion not directly inherit from ModulePass. It is now just a utility class that performs dialect conversion on a provided module. PiperOrigin-RevId: 235194067	2019-03-29 16:38:57 -07:00
River Riddle	5410dff790	Rewrite MLPatternLoweringPass to no longer inherit from FunctionPass and just provide a utility function that applies ML patterns. PiperOrigin-RevId: 235194034	2019-03-29 16:38:41 -07:00
MLIR Team	8564b274db	Internal change PiperOrigin-RevId: 235191129	2019-03-29 16:38:24 -07:00
Alex Zinenko	51835e73e0	Document the conversion into the LLVM IR dialect Add a documentation page on the key points of the conversion to LLVM IR. This focuses on the aspects of conversion that are relevant for integration of the LLVM IR dialect (and produced LLVM IR that is mostly a one-to-one translation) into other projects. In particular, it describes the type conversion rules and the memref model supporting dynamic sizes. PiperOrigin-RevId: 235190772	2019-03-29 16:38:04 -07:00
Brian Patton	d52e631359	Add a test example of calling a builtin function. PiperOrigin-RevId: 235149430	2019-03-29 16:37:46 -07:00

1 2 3 4 5 ...

1215 Commits