llvm-project

Commit Graph

Author	SHA1	Message	Date
River Riddle	302fb03961	Add a new class NamedAttributeList to deduplicate named attribute handling between Function and Instruction. PiperOrigin-RevId: 235830304	2019-03-29 16:45:40 -07:00
River Riddle	3b3e11da93	Validate the names of attribute, dialect, and functions during verification. This essentially enforces the parsing rules upon their names. PiperOrigin-RevId: 235818842	2019-03-29 16:44:53 -07:00
Uday Bondhugula	d4b3ff1096	Loop fusion comand line options cleanup - clean up loop fusion CL options for promoting local buffers to fast memory space - add parameters to loop fusion pass instantiation PiperOrigin-RevId: 235813419	2019-03-29 16:44:38 -07:00
River Riddle	79944e5eef	Add a Function::isExternal utility to simplify checks for external functions. PiperOrigin-RevId: 235746553	2019-03-29 16:43:50 -07:00
River Riddle	cdbfd48471	Rewrite the dominance info classes to allow for operating on arbitrary control flow within operation regions. The CSE pass is also updated to properly handle nested dominance. PiperOrigin-RevId: 235742627	2019-03-29 16:43:35 -07:00
Alex Zinenko	1da1b4c321	LLVM IR dialect and translation: support conditional branches with arguments Since the goal of the LLVM IR dialect is to reflect LLVM IR in MLIR, the dialect and the conversion procedure must account for the differences betweeen block arguments and LLVM IR PHI nodes. In particular, LLVM IR disallows PHI nodes with different values coming from the same source. Therefore, the LLVM IR dialect now disallows `cond_br` operations that have identical successors accepting arguments, which would lead to invalid PHI nodes. The conversion process resolves the potential PHI source ambiguity by injecting dummy blocks if the same block is used more than once as a successor in an instruction. These dummy blocks branch unconditionally to the original successors, pass them the original operands (available in the dummy block because it is dominated by the original block) and are used instead of them in the original terminator operation. PiperOrigin-RevId: 235682798	2019-03-29 16:43:05 -07:00
Uday Bondhugula	b269481106	Cleanup post cl/235283610 - NFC - remove stale comments + cleanup - drop MLIRContext * field from expr flattener PiperOrigin-RevId: 235621178	2019-03-29 16:42:20 -07:00
River Riddle	b4f033f6c6	Convert the dialect type parse/print hooks into virtual functions on the Dialect class. PiperOrigin-RevId: 235589945	2019-03-29 16:42:05 -07:00
River Riddle	f1f86eac60	Add support for constructing DenseIntElementsAttr with an array of APInt and DenseFPElementsAttr with an array of APFloat. PiperOrigin-RevId: 235581794	2019-03-29 16:41:50 -07:00
Lei Zhang	3f644705eb	[TableGen] Use ArrayRef instead of SmallVectorImpl for suitable method PiperOrigin-RevId: 235577399	2019-03-29 16:41:35 -07:00
Nicolas Vasilache	62c54a2ec4	Add a stripmineSink and imperfectly nested tiling primitives. This CL adds a primitive to perform stripmining of a loop by a given factor and sinking it under multiple target loops. In turn this is used to implement imperfectly nested loop tiling (with interchange) by repeatedly calling the stripmineSink primitive. The API returns the point loops and allows repeated invocations of tiling to achieve declarative, multi-level, imperfectly-nested tiling. Note that this CL is only concerned with the mechanical aspects and does not worry about analysis and legality. The API is demonstrated in an example which creates an EDSC block, emits the corresponding MLIR and applies imperfectly-nested tiling: ```cpp auto block = edsc::block({ For(ArrayRef<edsc::Expr>{i, j}, {zero, zero}, {M, N}, {one, one}, { For(k1, zero, O, one, { C({i, j, k1}) = A({i, j, k1}) + B({i, j, k1}) }), For(k2, zero, O, one, { C({i, j, k2}) = A({i, j, k2}) + B({i, j, k2}) }), }), }); // clang-format on emitter.emitStmts(block.getBody()); auto l_i = emitter.getAffineForOp(i), l_j = emitter.getAffineForOp(j), l_k1 = emitter.getAffineForOp(k1), l_k2 = emitter.getAffineForOp(k2); auto indicesL1 = mlir::tile({l_i, l_j}, {512, 1024}, {l_k1, l_k2}); auto l_ii1 = indicesL1[0][0], l_jj1 = indicesL1[1][0]; mlir::tile({l_jj1, l_ii1}, {32, 16}, l_jj1); ``` The edsc::Expr for the induction variables (i, j, k_1, k_2) provide the programmatic hooks from which tiling can be applied declaratively. PiperOrigin-RevId: 235548228	2019-03-29 16:41:20 -07:00
Alex Zinenko	e7193a70f8	EDSC: support conditional branch instructions Leverage the recently introduced support for multiple argument groups and multiple destination blocks in EDSC Expressions to implement conditional branches in EDSC. Conditional branches have two successors and three argument groups. The first group contains a single expression of i1 type that corresponds to the condition of the branch. The two following groups contain arguments of the two successors of the conditional branch instruction, in the same order as the successors. Expose this instruction to the C API and Python bindings. PiperOrigin-RevId: 235542768	2019-03-29 16:41:05 -07:00
Alex Zinenko	83e8db2193	EDSC: support branch instructions The new implementation of blocks was designed to support blocks with arguments. More specifically, StmtBlock can be constructed with a list of Bindables that will be bound to block aguments upon construction. Leverage this functionality to implement branch instructions with arguments. This additionally requires the statement storage to have a list of successors, similarly to core IR operations. Becauase successor chains can form loops, we need a possibility to decouple block declaration, after which it becomes usable by branch instructions, from block body definition. This is achieved by creating an empty block and by resetting its body with a new list of instructions. Note that assigning a block from another block will not affect any instructions that may have designated this block as their successor (this behavior is necessary to make value-type semantics of EDSC types consistent). Combined, one can now write generators like EDSCContext context; Type indexType = ...; Bindable i(indexType), ii(indexType), zero(indexType), one(indexType); StmtBlock loopBlock({i}, {}); loopBlock.set({ii = i + one, Branch(loopBlock, {ii})}); MLIREmitter(&builder) .bindConstant<ConstantIndexOp>(zero, 0) .bindConstant<ConstantIndexOp>(one, 1) .emitStmt(Branch(loopBlock, {zero})); where the emitter will emit the statement and its successors, if present. PiperOrigin-RevId: 235541892	2019-03-29 16:40:50 -07:00
Tatiana Shpeisman	8b99d1bdbf	Use dialect hook registration for constant folding hook. Deletes specialized mechanism for registering constant folding hook and uses dialect hooks registration mechanism instead. PiperOrigin-RevId: 235535410	2019-03-29 16:40:35 -07:00
River Riddle	a51d21538c	Add constant folding for ExtractElementOp when the aggregate is an OpaqueElementsAttr. PiperOrigin-RevId: 235533283	2019-03-29 16:40:20 -07:00
Uday Bondhugula	dfe07b7bf6	Refactor AffineExprFlattener and move FlatAffineConstraints out of IR into Analysis - NFC - refactor AffineExprFlattener (-> SimpleAffineExprFlattener) so that it doesn't depend on FlatAffineConstraints, and so that FlatAffineConstraints could be moved out of IR/; the simplification that the IR needs for AffineExpr's doesn't depend on FlatAffineConstraints - have AffineExprFlattener derive from SimpleAffineExprFlattener to use for all Analysis/Transforms purposes; override addLocalFloorDivId in the derived class - turn addAffineForOpDomain into a method on FlatAffineConstraints - turn AffineForOp::getAsValueMap into an AffineValueMap ctor PiperOrigin-RevId: 235283610	2019-03-29 16:39:32 -07:00
Stella Laurenzo	c81b16e279	Spike to define real math ops and lowering of one variant of add to corresponding integer ops. The only reason in starting with a fixedpoint add is that it is the absolute simplest variant and illustrates the level of abstraction I'm aiming for. The overall flow would be: 1. Determine quantization parameters (out of scope of this cl). 2. Source dialect rules to lower supported math ops to the quantization dialect (out of scope of this cl). 3. Quantization passes: [-quant-convert-const, -quant-lower-uniform-real-math, -quant-lower-unsupported-to-float] (the last one not implemented yet) 4. Target specific lowering of the integral arithmetic ops (roughly at the level of gemmlowp) to more fundamental operations (i.e. calls to gemmlowp, simd instructions, DSP instructions, etc). How I'm doing this should facilitate implementation of just about any kind of backend except TFLite, which has a very course, adhoc surface area for its quantized kernels. Options there include (I'm not taking an opinion on this - just trying to provide options): a) Not using any of this: just match q/dbarrier + tf math ops to the supported TFLite quantized op set. b) Implement the more fundamental integer math ops on TFLite and convert to those instead of the current op set. Note that I've hand-waved over the process of choosing appropriate quantization parameters. Getting to that next. As you can see, different implementations will likely have different magic combinations of specific math support, and we will need the target system that has been discussed for some of the esoteric cases (i.e. many DSPs only support POT fixedpoint). Two unrelated changes to the overall goal of this CL and can be broken out of desired: - Adding optional attribute support to TabelGen - Allowing TableGen native rewrite hooks to return nullptr, signalling that no rewrite has been done. PiperOrigin-RevId: 235267229	2019-03-29 16:39:13 -07:00
River Riddle	f48716146e	NFC: Make DialectConversion not directly inherit from ModulePass. It is now just a utility class that performs dialect conversion on a provided module. PiperOrigin-RevId: 235194067	2019-03-29 16:38:57 -07:00
River Riddle	5410dff790	Rewrite MLPatternLoweringPass to no longer inherit from FunctionPass and just provide a utility function that applies ML patterns. PiperOrigin-RevId: 235194034	2019-03-29 16:38:41 -07:00
River Riddle	3e656599f1	Define a PassID class to use when defining a pass. This allows for the type used for the ID field to be self documenting. It also allows for the compiler to know the set alignment of the ID object, which is useful for storing pointer identifiers within llvm data structures. PiperOrigin-RevId: 235107957	2019-03-29 16:37:12 -07:00
Sergei Lebedev	1cc9305c71	Exposed division and remainder operations in EDSC This change introduces three new operators in EDSC: Div (also exposed via Expr.__div__ aka /) -- floating-point division, FloorDiv and CeilDiv for flooring/ceiling index division. The lowering to LLVM will be implemented in b/124872679. PiperOrigin-RevId: 234963217	2019-03-29 16:36:41 -07:00
Alex Zinenko	59a209721e	EDSC: support call instructions Introduce support for binding MLIR functions as constant expressions. Standard constant operation supports functions as possible constant values. Provide C APIs to look up existing named functions in an MLIR module and expose them to the Python bindings. Provide Python bindings to declare a function in an MLIR module without defining it and to add a definition given a function declaration. These declarations are useful when attempting to link MLIR modules with, e.g., the standard library. Introduce EDSC support for direct and indirect calls to other MLIR functions. Internally, an indirect call is always emitted to leverage existing support for delayed construction of MLIR Values using EDSC Exprs. If the expression is bound to a constant function (looked up or declared beforehand), MLIR constant folding will be able to replace an indirect call by a direct call. Currently, only zero- and one-result functions are supported since we don't have support for multi-valued expressions in EDSC. Expose function calling interface to Python bindings on expressions by defining a `__call__` function accepting a variable number of arguments. PiperOrigin-RevId: 234959444	2019-03-29 16:36:26 -07:00
Jacques Pienaar	1725b485eb	Create OpTrait base class & allow operation predicate OpTraits. * Introduce a OpTrait class in C++ to wrap the TableGen definition; * Introduce PredOpTrait and rename previous usage of OpTrait to NativeOpTrait; * PredOpTrait allows specifying a trait of the operation by way of predicate on the operation. This will be used in future to create reusable set of trait building blocks in the definition of operations. E.g., indicating whether to operands have the same type and allowing locally documenting op requirements by trait composition. - Some of these building blocks could later evolve into known fixed set as LLVMs backends do, but that can be considered with more data. * Use the modelling to address one verify TODO in a very local manner. This subsumes the current custom verify specification which will be removed in a separate mechanical CL. PiperOrigin-RevId: 234827169	2019-03-29 16:35:11 -07:00
Alex Zinenko	0a95aac7c7	Allow Builder to create function-type constants A recent change made ConstantOp::build accept a NumericAttr or assert that a generic Attribute is in fact a NumericAttr. The rationale behind the change was that NumericAttrs have a type that can be used as the result type of the constant operation. FunctionAttr also has a type, and it is valid to construct function-typed constants as exercised by the parser.mlir test. Relax ConstantOp::build back to take a generic Attribute. In the overload that only takes an attribute, assert that the Attribute is either a NumericAttr or a FunctionAttr, because it is necessary to extract the type. In the overload that takes both type type and the attribute, delegate the attribute type checking to ConstantOp::verify to prevent non-Builder-based Op construction mechanisms from creating invalid IR. PiperOrigin-RevId: 234798569	2019-03-29 16:34:41 -07:00
Alex Zinenko	21bd4540f3	EDSC: introduce min/max only usable inside for upper/lower bounds of a loop Introduce a type-safe way of building a 'for' loop with max/min bounds in EDSC. Define new types MaxExpr and MinExpr in C++ EDSC API and expose them to Python bindings. Use values of these type to construct 'for' loops with max/min in newly introduced overloads of the `edsc::For` factory function. Note that in C APIs, we still must expose MaxMinFor as a different function because C has no overloads. Also note that MaxExpr and MinExpr do _not_ derive from Expr because they are not allowed to be used in a regular Expr context (which may produce `affine.apply` instructions not expecting `min` or `max`). Factory functions `Min` and `Max` in Python can be further overloaded to produce chains of comparisons and selects on non-index types. This is not trivial in C++ since overloaded functions cannot differ by the return type only (`MaxExpr` or `Expr`) and making `MaxExpr` derive from `Expr` defies the purpose of type-safe construction. PiperOrigin-RevId: 234786131	2019-03-29 16:34:11 -07:00
Alex Zinenko	d055a4e100	EDSC: support multi-expression loop bounds MLIR supports 'for' loops with lower(upper) bound defined by taking a maximum(minimum) of a list of expressions, but does not have first-class affine constructs for the maximum(minimum). All these expressions must have affine provenance, similarly to a single-expression bound. Add support for constructing such loops using EDSC. The expression factory function is called `edsc::MaxMinFor` to (1) highlight that the maximum(minimum) operation is applied to the lower(upper) bound expressions and (2) differentiate it from a `edsc::For` that creates multiple perfectly nested loops (and should arguably be called `edsc::ForNest`). PiperOrigin-RevId: 234785996	2019-03-29 16:33:56 -07:00
Alex Zinenko	a2a433652d	EDSC: create constants as expressions Introduce a functionality to create EDSC expressions from typed constants. This complements the current functionality that uses "unbound" expressions and binds them to a specific constant before emission. It comes in handy in cases where we want to check if something is a constant early during construciton rather than late during emission, for example multiplications and divisions in affine expressions. This is also consistent with MLIR vision of constants being defined by an operation (rather than being special kinds of values in the IR) by exposing this operation as EDSC expression. PiperOrigin-RevId: 234758020	2019-03-29 16:33:41 -07:00
Uday Bondhugula	a1dad3a5d9	Extend/improve getSliceBounds() / complete TODO + update unionBoundingBox - compute slices precisely where the destination iteration depends on multiple source iterations (instead of over-approximating to the whole source loop extent) - update unionBoundingBox to deal with input with non-matching symbols - reenable disabled backend test case PiperOrigin-RevId: 234714069	2019-03-29 16:33:11 -07:00
River Riddle	48ccae2476	NFC: Refactor the files related to passes. * PassRegistry is split into its own source file. * Pass related files are moved to a new library 'Pass'. PiperOrigin-RevId: 234705771	2019-03-29 16:32:56 -07:00
River Riddle	da0ebe0670	Add a generic pattern matcher for matching constant values produced by an operation with zero operands and a single result. PiperOrigin-RevId: 234616691	2019-03-29 16:31:56 -07:00
Alex Zinenko	05f37d52d0	EDSC: clean up type casting mechanism Originally, edsc::Expr had a long enum edsc::ExprKind with all supported types of operations. Recent Expr extensibility support removed the need to specify supported types in advance. Replace the no-longer-used blocks of enum values reserved for unary/binary/ternary/variadic expressions with simple values (it is still useful to know if an expression is, e.g., binary to access it through a simpler API). Furthermore, wrap string-comparison now used to identify specific ops into an `Expr::is_op<>` function template, that acts similarly to `Instruction::isa<>`. Introduce `{Unary,Binary,Ternary,Variadic}Expr::make<> ` function template that creates a Expression emitting the MLIR Op specified as template argument. PiperOrigin-RevId: 234612916	2019-03-29 16:31:41 -07:00
Alex Zinenko	b4dba895a6	EDSC: make Expr typed and extensible Expose the result types of edsc::Expr, which are now stored for all types of Exprs and not only for the variadic ones. Require return types when an Expr is constructed, if it will ever have some. An empty return type list is interpreted as an Expr that does not create a value (e.g. `return` or `store`). Conceptually, all edss::Exprs are now typed, with the type being a (potentially empty) tuple of return types. Unbound expressions and Bindables must now be constructed with a specific type they will take. This makes EDSC less evidently type-polymorphic, but we can still write generic code such as Expr sumOfSquares(Expr lhs, Expr rhs) { return lhs * lhs + rhs * rhs; } and use it to construct different typed expressions as sumOfSquares(Bindable(IndexType::get(ctx)), Bindable(IndexType::get(ctx))); sumOfSquares(Bindable(FloatType::getF32(ctx)), Bindable(FloatType::getF32(ctx))); On the positive side, we get the following. 1. We can now perform type checking when constructing Exprs rather than during MLIR emission. Nevertheless, this is still duplicates the Op::verify() until we can factor out type checking from that. 2. MLIREmitter is significantly simplified. 3. ExprKind enum is only used for actual kinds of expressions. Data structures are converging with AbstractOperation, and the users can now create a VariadicExpr("canonical_op_name", {types}, {exprs}) for any operation, even an unregistered one without having to extend the enum and make pervasive changes to EDSCs. On the negative side, we get the following. 1. Typed bindables are more verbose, even in Python. 2. We lose the ability to do print debugging for higher-level EDSC abstractions that are implemented as multiple MLIR Ops, for example logical disjunction. This is the step 2/n towards making EDSC extensible. *** Move MLIR Op construction from MLIREmitter::emitExpr to Expr::build since Expr now has sufficient information to build itself. This is the step 3/n towards making EDSC extensible. Both of these strive to minimize the amount of irrelevant changes. In particular, this introduces more complex pretty-printing for affine and binary expression to make sure tests continue to pass. It also relies on string comparison to identify specific operations that an Expr produces. PiperOrigin-RevId: 234609882	2019-03-29 16:31:26 -07:00
Lei Zhang	e0fc503896	[TableGen] Support using Variadic<Type> in results This CL extended TableGen Operator class to provide accessors for information on op results. In OpDefinitionGen, added checks to make sure only the last result can be variadic, and adjusted traits and builders generation to consider variadic results. PiperOrigin-RevId: 234596124	2019-03-29 16:31:11 -07:00
Alex Zinenko	0a4c940c1b	EDSC: introduce support for blocks EDSC currently implement a block as a statement that is itself a list of statements. This suffers from two modeling problems: (1) these blocks are not addressable, i.e. one cannot create an instruction where thus constructed block is a successor; (2) they support block nesting, which is not supported by MLIR blocks. Furthermore, emitting such "compound statement" (misleadingly named `Block` in Python bindings) does not actually produce a new Block in the IR. Implement support for creating actual IR Blocks in EDSC. In particular, define a new StmtBlock EDSC class that is neither an Expr nor a Stmt but contains a list of Stmts. Additionally, StmtBlock may have (early-) typed arguments. These arguments are Bindable expressions that can be used inside the block. Provide two calls in the MLIREmitter, `emitBlock` that actually emits a new block and `emitBlockBody` that only emits the instructions contained in the block without creating a new block. In the latter case, the instructions must not use block arguments. Update Python bindings to make it clear when instruction emission happens without creating a new block. PiperOrigin-RevId: 234556474	2019-03-29 16:30:56 -07:00
Lei Zhang	911b9960ba	[TableGen] Fix discrepancy between parameter meaning and code logic The parameter to emitStandaloneParamBuilder() was renamed from hasResultType to isAllSameType, which is the opposite boolean value. The logic should be changed to make them consistent. Also re-ordered some methods in Operator. And few other tiny improvements. PiperOrigin-RevId: 234478316	2019-03-29 16:30:41 -07:00
Uday Bondhugula	f97c1c5b06	Misc. updates/fixes to analysis utils used for DMA generation; update DMA generation pass to make it drop certain assumptions, complete TODOs. - multiple fixes for getMemoryFootprintBytes - pass loopDepth correctly from getMemoryFootprintBytes() - use union while computing memory footprints - bug fixes for addAffineForOpDomain - take into account loop step - add domains of other loop IVs in turn that might have been used in the bounds - dma-generate: drop assumption of "non-unit stride loops being tile space loops and skipping those and recursing to inner depths"; DMA generation is now purely based on available fast mem capacity and memory footprint's calculated - handle memory region compute failures/bailouts correctly from dma-generate - loop tiling cleanup/NFC - update some debug and error messages to use emitNote/emitError in pipeline-data-transfer pass - NFC PiperOrigin-RevId: 234245969	2019-03-29 16:30:26 -07:00
Alex Zinenko	4bb31f7377	ExecutionEngine: provide utils for running CLI-configured LLVM passes A recent change introduced a possibility to run LLVM IR transformation during JIT-compilation in the ExecutionEngine. Provide helper functions that construct IR transformers given either clang-style optimization levels or a list passes to run. The latter wraps the LLVM command line option parser to parse strings rather than actual command line arguments. As a result, we can run either of mlir-cpu-runner -O3 input.mlir mlir-cpu-runner -some-mlir-pass -llvm-opts="-llvm-pass -other-llvm-pass" to combine different transformations. The transformer builder functions are provided as a separate library that depends on LLVM pass libraries unlike the main execution engine library. The library can be used for integrating MLIR execution engine into external frameworks. PiperOrigin-RevId: 234173493	2019-03-29 16:29:41 -07:00
MLIR Team	8f5f2c765d	LoopFusion: perform a series of loop interchanges to increase the loop depth at which slices of producer loop nests can be fused into constumer loop nests. ) Adds utility to LoopUtils to perform loop interchange of two AffineForOps. ) Adds utility to LoopUtils to sink a loop to a specified depth within a loop nest, using a series of loop interchanges. ) Computes dependences between all loads and stores in the loop nest, and classifies each loop as parallel or sequential. ) Computes loop interchange permutation required to sink sequential loops (and raise parallel loop nests) while preserving relative order among them. ) Checks each dependence against the permutation to make sure that dependences would not be violated by the loop interchange transformation. ) Calls loop interchange in LoopFusion pass on consumer loop nests before fusing in producers, sinking loops with loop carried dependences deeper into the consumer loop nest. *) Adds and updates related unit tests. PiperOrigin-RevId: 234158370	2019-03-29 16:29:26 -07:00
Lei Zhang	081299333b	[TableGen] Rename Operand to Value to prepare sharing between operand and result We specify op operands and results in TableGen op definition using the same syntax. They should be modelled similarly in TableGen driver wrapper classes. PiperOrigin-RevId: 234153332	2019-03-29 16:29:11 -07:00
Alex Zinenko	d7aa700ccb	Dialect conversion: decouple function signature conversion from type conversion Function types are built-in in MLIR and affect the validity of the IR itself. However, advanced target dialects such as the LLVM IR dialect may include custom function types. Until now, dialect conversion was expecting function types not to be converted to the custom type: although the signatures was allowed to change, the outer type must have been an mlir::FunctionType. This effectively prevented dialect conversion from creating instructions that operate on values of the custom function type. Dissociate function signature conversion from general type conversion. Function signature conversion must still produce an mlir::FunctionType and is used in places where built-in types are required to make IR valid. General type conversion is used for SSA values, including function and block arguments and function results. Exercise this behavior in the LLVM IR dialect conversion by converting function types to LLVM IR function pointer types. The pointer to a function is chosen to provide consistent lowering of higher-order functions: while it is possible to have a value of function type, it is not possible to create a function type accepting a returning another function type. PiperOrigin-RevId: 234124494	2019-03-29 16:28:41 -07:00
MLIR Team	affb2193cc	Update direction vector computation to use FlatAffineConstraints::getLower/UpperBounds. Update FlatAffineConstraints::getLower/UpperBounds to project to the identifier for which bounds are being computed. This change enables computing bounds on an identifier which were previously dependent on the bounds of another identifier. PiperOrigin-RevId: 234017514	2019-03-29 16:28:25 -07:00
Lei Zhang	93d8f14c0f	[TFLite] Fuse AddOp into preceding convolution ops If we see an add op adding a constant value to a convolution op with constant bias, we can fuse the add into the convolution op by constant folding the bias and the add op's constant operand. This CL also removes dangling RewriterGen check that prevents us from using nested DAG nodes in result patterns, which is already supported. PiperOrigin-RevId: 233989654	2019-03-29 16:27:55 -07:00
Lei Zhang	eb3f8dcb93	[TableGen] Use deduced result types for build() of suitable ops For ops with the SameOperandsAndResultType trait, we know that all result types should be the same as the first operand's type. So we can generate a build() method without requiring result types as parameters and also invoke this method when constructing such ops during expanding rewrite patterns. Similarly for ops have broadcast behavior, we can define build() method to use the deduced type as the result type. So we can also calling into this build() method when constructing ops in RewriterGen. PiperOrigin-RevId: 233988307	2019-03-29 16:27:40 -07:00
Jacques Pienaar	388fb3751e	Add pattern constraints. Enable matching pattern only if constraint is met. Start with type constraints and more general C++ constraints. PiperOrigin-RevId: 233830768	2019-03-29 16:26:53 -07:00
Alex Zinenko	bc184cff3f	EDSC: unify Expr storage EDSC expressions evolved to have different types of underlying storage. Separate classes are used for unary, binary, ternary and variadic expressions. The latter covers all the needs of the three special cases. Remove these special cases and use a single ExprStorage class everywhere while maintaining the same APIs at the Expr level (ExprStorage is an internal implementation class). This is step 1/n to converging EDSC expressions and Ops and making EDSCs support custom operations. PiperOrigin-RevId: 233704912	2019-03-29 16:26:37 -07:00
River Riddle	2f11f86846	Add langref descriptions for the attribute values supported in MLIR. PiperOrigin-RevId: 233661338	2019-03-29 16:26:08 -07:00
River Riddle	4755774d16	Make IndexType a standard type instead of a builtin. This also cleans up some unnecessary factory methods on the Type class. PiperOrigin-RevId: 233640730	2019-03-29 16:25:38 -07:00
Alex Zinenko	8de7f6c471	LLVM IR Dialect: add select op and lower standard select to it This is a similar one-to-one mapping. PiperOrigin-RevId: 233621006	2019-03-29 16:25:23 -07:00
Alex Zinenko	0e59e5c49b	EDSC: move Expr and Stmt construction operators to a namespace In the current state, edsc::Expr and edsc::Stmt overload operators to construct other Exprs and Stmts. This includes some unconventional overloads of the `operator==` to create a comparison expression and of the `operator!` to create a negation expression. This situation could lead to unpleasant surprises where the code does not behave like expected. Make all Expr and Stmt construction operators free functions and move them to the `edsc::op` namespace. Callers willing to use these operators must explicitly include them with the `using` declaration. This can be done in some local scope. Additionally, we currently emit signed comparisons for order-comparison operators. With namespaces, we can later introduce two sets of operators in different namespace, e.g. `edsc::op::sign` and `edsc::op::unsign` to clearly state which kind of comparison is implied. PiperOrigin-RevId: 233578674	2019-03-29 16:25:08 -07:00
Tatiana Shpeisman	2e6cd60d3b	Add dialect-specific decoding for opaque constants. Associates opaque constants with a particular dialect. Adds general mechanism to register dialect-specific hooks defined in external components. Adds hooks to decode opaque tensor constant and extract an element of an opaque tensor constant. This CL does not change the existing mechanism for registering constant folding hook yet. One thing at a time. PiperOrigin-RevId: 233544757	2019-03-29 16:24:38 -07:00
Jacques Pienaar	4b88e7a245	Fix incorrect type in iterator. PiperOrigin-RevId: 233542711	2019-03-29 16:24:23 -07:00
Uday Bondhugula	8b3f841daf	Generate dealloc's for the alloc's of dma-generate. - for the DMA buffers being allocated (and their tags), generate corresponding deallocs - minor related update to replaceAllMemRefUsesWith and PipelineDataTransfer pass Code generation for DMA transfers was being done with the initial simplifying assumption that the alloc's would map to scoped allocations, and so no deallocations would be necessary. Drop this assumption to generalize. Note that even with scoped allocations, unrolling loops that have scoped allocations could create a series of allocations and exhaustion of fast memory. Having a end of lifetime marker like a dealloc in fact allows creating new scopes if necessary when lowering to a backend and still utilize scoped allocation. DMA buffers created by -dma-generate are guaranteed to have either non-overlapping lifetimes or nested lifetimes. PiperOrigin-RevId: 233502632	2019-03-29 16:24:08 -07:00
Lei Zhang	a57b398906	[TableGen] Assign created ops to variables and rewrite with PatternRewriter::replaceOp() Previously we were using PatternRewrite::replaceOpWithNewOp() to both create the new op inline and rewrite the matched op. That does not work well if we want to generate multiple ops in a sequence. To support that, this CL changed to assign each newly created op to a separate variable. This CL also refactors how PatternEmitter performs the directive dispatch logic. PiperOrigin-RevId: 233206819	2019-03-29 16:22:53 -07:00
River Riddle	366ebcf6aa	Remove the restriction that only registered terminator operations may terminate a block and have block operands. This allows for any operation to hold block operands. It also introduces the notion that unregistered operations may terminate a block. As such, the 'isTerminator' api on Instruction has been split into 'isKnownTerminator' and 'isKnownNonTerminator'. PiperOrigin-RevId: 233076831	2019-03-29 16:22:23 -07:00
Uday Bondhugula	c419accea3	Automated rollback of changelist 232728977. PiperOrigin-RevId: 232944889	2019-03-29 16:21:38 -07:00
Smit Hinsu	c201e6ef05	Handle dynamic shapes in Broadcastable op trait That allows TensorFlow Add and Div ops to use Broadcastable op trait instead of more restrictive SameValueType op trait. That in turn allows TensorFlow ops to be registered by defining GET_OP_LIST and including the generated ops file. Currently, tf-raise-control-flow pass tests are using dynamic shapes in tf.Add op and AddOp can't be registered without supporting the dynamic shapes. TESTED with unit tests PiperOrigin-RevId: 232927998	2019-03-29 16:21:23 -07:00
Jacques Pienaar	351eed0dd1	Add tf.LeakyRelu. * Add tf.LeakyRelu op definition + folders (well one is really canonicalizer) * Change generated error message to use attribute description instead; * Change the return type of F32Attr to be APFloat - internally it is already stored as APFloat so let the caller decides if they want to convert it or not. I could see varying opinions here though :) (did not change i32attr similarly) PiperOrigin-RevId: 232923358	2019-03-29 16:20:53 -07:00
Alex Zinenko	36c0516c78	Disallow zero dimensions in vectors and memrefs Aggregate types where at least one dimension is zero do not fully make sense as they cannot contain any values (their total size is zero). However, TensorFlow and XLA support tensors with zero sizes, so we must support those too. This is relatively safe since, unlike vectors and memrefs, we don't have first-class element accessors for MLIR tensors. To support sparse element attributes of vector types that have no non-zero elements, make sure that index and value element attributes have tensor type so that we never need to create a zero vector type internally. Note that this is already consistent with the inline documentation of the sparse elements attribute. Users of the sparse elements attribute should not rely on the storage schema anyway. PiperOrigin-RevId: 232896707	2019-03-29 16:20:38 -07:00
River Riddle	a886625813	Modify the canonicalizations of select and muli to use the fold hook. This also extends the greedy pattern rewrite driver to add the operands of folded operations back to the worklist. PiperOrigin-RevId: 232878959	2019-03-29 16:20:06 -07:00
Alex Zinenko	8093f17a66	ExecutionEngine: provide a hook for LLVM IR passes The current ExecutionEngine flow generates the LLVM IR from MLIR and JIT-compiles it as is without any transformation. It thus misses the opportunity to perform optimizations supported by LLVM or collect statistics about the module. Modify the Orc JITter to perform transformations on the LLVM IR. Accept an optional LLVM module transformation function when constructing the ExecutionEngine and use it while JIT-compiling. This prevents MLIR ExecutionEngine from depending on LLVM passes; its clients should depend on the passes they require. PiperOrigin-RevId: 232877060	2019-03-29 16:19:49 -07:00
Uday Bondhugula	4ba8c9147d	Automated rollback of changelist 232717775. PiperOrigin-RevId: 232807986	2019-03-29 16:19:33 -07:00
River Riddle	fd2d7c857b	Rename the 'if' operation in the AffineOps dialect to 'affine.if' and namespace the AffineOps dialect with 'affine'. PiperOrigin-RevId: 232728977	2019-03-29 16:18:59 -07:00
Lei Zhang	888b9fa8a6	Add constant build() method not requiring result type Instead, we deduce the result type from the given attribute. This is in preparation for generating constant ops with TableGen. PiperOrigin-RevId: 232723467	2019-03-29 16:18:44 -07:00
Stella Laurenzo	c78d708487	Implement Quantization dialect and minimal UniformQuantizedType. PiperOrigin-RevId: 232723240	2019-03-29 16:18:29 -07:00
River Riddle	90d10b4e00	NFC: Rename the 'for' operation in the AffineOps dialect to 'affine.for'. The is the second step to adding a namespace to the AffineOps dialect. PiperOrigin-RevId: 232717775	2019-03-29 16:17:59 -07:00
River Riddle	3227dee15d	NFC: Rename affine_apply to affine.apply. This is the first step to adding a namespace to the affine dialect. PiperOrigin-RevId: 232707862	2019-03-29 16:17:29 -07:00
MLIR Team	b9dde91ea6	Adds the ability to compute the MemRefRegion of a sliced loop nest. Utilizes this feature during loop fusion cost computation, to compute what the write region of a fusion candidate loop nest slice would be (without having to materialize the slice or change the IR). ) Adds parameter to public API of MemRefRegion::compute for passing in the slice loop bounds to compute the memref region of the loop nest slice. ) Exposes public method MemRefRegion::getRegionSize for computing the size of the memref region in bytes. PiperOrigin-RevId: 232706165	2019-03-29 16:17:15 -07:00
River Riddle	42a2d7d6e1	Remove findInstPositionInBlock from the Block api. PiperOrigin-RevId: 232704766	2019-03-29 16:16:43 -07:00
Lei Zhang	1df6ca5053	[TableGen] Model variadic operands using Variadic<Type> Previously, we were using the trait mechanism to specify that an op has variadic operands. That led a discrepancy between how we handle ops with deterministic number of operands. Besides, we have no way to specify the constraints and match against the variadic operands. This CL introduced Variadic<Type> as a way to solve the above issues. PiperOrigin-RevId: 232656104	2019-03-29 16:16:28 -07:00
River Riddle	0c65cf283c	Move the AffineFor loop bound folding to a canonicalization pattern on the AffineForOp. PiperOrigin-RevId: 232610715	2019-03-29 16:16:11 -07:00
River Riddle	10237de8eb	Refactor the affine analysis by moving some functionality to IR and some to AffineOps. This is important for allowing the affine dialect to define canonicalizations directly on the operations instead of relying on transformation passes, e.g. ComposeAffineMaps. A summary of the refactoring: * AffineStructures has moved to IR. * simplifyAffineExpr/simplifyAffineMap/getFlattenedAffineExpr have moved to IR. * makeComposedAffineApply/fullyComposeAffineMapAndOperands have moved to AffineOps. * ComposeAffineMaps is replaced by AffineApplyOp::canonicalize and deleted. PiperOrigin-RevId: 232586468	2019-03-29 16:15:41 -07:00
Smit Hinsu	2927297a1c	Add derived type attributes for TensorFlow ops generated by TableGen Motivation for this change is to remove redundant TF type attributes for TensorFlow ops. For example, tf$T: "tfdtype$DT_FLOAT". Type attributes can be derived using the MLIR operand or result MLIR types, attribute names and their mapping. This will also allow constant folding of instructions generated within MLIR (and not imported from TensorFlow) without adding type attributes for the instruction. Derived attributes are populated while exporting MLIR to TF GraphDef using auto-generated populators. Populators are only available for the ops that are generated by the TableGen. Also, fixed Operator::getNumArgs method to exclude derived attributes as they are not part of the arguments. TESTED with unit test PiperOrigin-RevId: 232531561	2019-03-29 16:15:08 -07:00
Alex Zinenko	3fa22b88de	Print non-default attribute types in optional attr dictionary In optional attribute dictionary used, among others, in the generic form of the ops, attribute types for integers and floats are omitted. This could lead to inconsistencies when round-tripping the IR, in particular the attributes are created with incorrect types after parsing (integers default to i64, floats default to f64). Provide API to emit a trailing type after the attribute for integers and floats. Use it while printing the optional attribute dictionary. Omitting types for i64 and f64 is a pragmatic decision that minimizes changes in tests. We may want to reconsider in the future and always print types of attributes in the generic form. PiperOrigin-RevId: 232480116	2019-03-29 16:14:05 -07:00
Sergei Lebedev	52ec65c85e	Implemented __eq__ and __ne__ in EDSC Python bindings PiperOrigin-RevId: 232473201	2019-03-29 16:13:34 -07:00
River Riddle	bf9c381d1d	Remove InstWalker and move all instruction walking to the api facilities on Function/Block/Instruction. PiperOrigin-RevId: 232388113	2019-03-29 16:12:59 -07:00
River Riddle	c9ad4621ce	NFC: Move AffineApplyOp to the AffineOps dialect. This also moves the isValidDim/isValidSymbol methods from Value to the AffineOps dialect. PiperOrigin-RevId: 232386632	2019-03-29 16:12:40 -07:00
Uday Bondhugula	0f50414fa4	Refactor common code getting memref access in getMemRefRegion - NFC - use getAccessMap() instead of repeating it - fold getMemRefRegion into MemRefRegion ctor (more natural, avoid heap allocation and unique_ptr where possible) - change extractForInductionVars - MutableArrayRef -> ArrayRef for the arguments. Since the method is just returning copies of 'Value *', the client can't mutate the pointers themselves; it's fine to mutate the 'Value''s themselves, but that doesn't mutate the pointers to those. - change the way extractForInductionVars returns (see b/123437690) PiperOrigin-RevId: 232359277	2019-03-29 16:12:25 -07:00
River Riddle	74adaa5b31	Remove the OwnerTy template parameter of IROperandImpl and ValueUseIterator as it is no longer necessary now that all instructions are operations. PiperOrigin-RevId: 232356323	2019-03-29 16:11:53 -07:00
Jacques Pienaar	5e88422f1d	No need to specify default behavior. NFC. This avoids overriding the class members + setting the printer/parser hooks only to fall back to generic. PiperOrigin-RevId: 232348307	2019-03-29 16:11:23 -07:00
River Riddle	2d75501691	Remove the forward definition of OperationInst now that no references remain. PiperOrigin-RevId: 232325321	2019-03-29 16:11:08 -07:00
River Riddle	a3d9ccaecb	Replace the walkOps/visitOperationInst variants from the InstWalkers with the Instruction variants. PiperOrigin-RevId: 232322030	2019-03-29 16:10:24 -07:00
Dimitrios Vytiniotis	9ca0691b06	Exposing logical operators in EDSC all the way up to Python. PiperOrigin-RevId: 232299839	2019-03-29 16:10:08 -07:00
Uday Bondhugula	b26900dce5	Update dma-generate pass to (1) work on blocks of instructions (instead of just loops), (2) take into account fast memory space capacity and lower 'dmaDepth' to fit, (3) add location information for debug info / errors - change dma-generate pass to work on blocks of instructions (start/end iterators) instead of 'for' loops; complete TODOs - allows DMA generation for straightline blocks of operation instructions interspersed b/w loops - take into account fast memory capacity: check whether memory footprint fits in fastMemoryCapacity parameter, and recurse/lower the depth at which DMA generation is performed until it does fit in the provided memory - add location information to MemRefRegion; any insufficient fast memory capacity errors or debug info w.r.t dma generation shows location information - allow DMA generation pass to be instantiated with a fast memory capacity option (besides command line flag) - change getMemRefRegion to return unique_ptr's - change getMemRefFootprintBytes to work on a 'Block' instead of 'ForInst' - other helper methods; add postDomInstFilter option for replaceAllMemRefUsesWith; drop forInst->walkOps, add Block::walkOps methods Eg. output $ mlir-opt -dma-generate -dma-fast-mem-capacity=1 /tmp/single.mlir /tmp/single.mlir:9:13: error: Total size of all DMA buffers' for this block exceeds fast memory capacity for %i3 = (d0) -> (d0)(%i1) to (d0) -> (d0 + 32)(%i1) { ^ $ mlir-opt -debug-only=dma-generate -dma-generate -dma-fast-mem-capacity=400 /tmp/single.mlir /tmp/single.mlir:9:13: note: 8 KiB of DMA buffers in fast memory space for this block for %i3 = (d0) -> (d0)(%i1) to (d0) -> (d0 + 32)(%i1) { PiperOrigin-RevId: 232297044	2019-03-29 16:09:52 -07:00
River Riddle	870d778350	Begin the process of fully removing OperationInst. This patch cleans up references to OperationInst in the /include, /AffineOps, and lib/Analysis. PiperOrigin-RevId: 232199262	2019-03-29 16:09:36 -07:00
River Riddle	de2d0dfbca	Fold the functionality of OperationInst into Instruction. OperationInst still exists as a forward declaration and will be removed incrementally in a set of followup cleanup patches. PiperOrigin-RevId: 232198540	2019-03-29 16:09:19 -07:00
Lei Zhang	b2dbbdb704	Merge OpProperty and Traits into OpTrait They are essentially both modelling MLIR OpTrait; the former achieves the purpose via introducing corresponding symbols in TableGen, while the latter just uses plain strings. Unify them to provide a single mechanism to avoid confusion and to better reflect the definitions on MLIR C++ side. Ideally we should be able to deduce lots of these traits automatically via other bits of op definitions instead of manually specifying them; but not for now though. PiperOrigin-RevId: 232191401	2019-03-29 16:09:03 -07:00
Lei Zhang	8b75cc5741	Define NumericAttr as the base class for BoolAttr, IntegerAttr, FloatAttr, and ElementsAttr These attribute kinds are different from the rest in the sense that their types are defined in MLIR's type hierarchy and we can build constant op out of them. By defining this middle-level base class, we have a unified way to test and query the type of these attributes, which will be useful when constructing constant ops of various dialects. This CL also added asserts to reject non-NumericAttr in constant op's build() method. PiperOrigin-RevId: 232188178	2019-03-29 16:08:43 -07:00
River Riddle	dae0263e0b	Fold IROperandOwner into Instruction. PiperOrigin-RevId: 232159334	2019-03-29 16:08:11 -07:00
River Riddle	5052bd8582	Define the AffineForOp and replace ForInst with it. This patch is largely mechanical, i.e. changing usages of ForInst to OpPointer<AffineForOp>. An important difference is that upon construction an AffineForOp no longer automatically creates the body and induction variable. To generate the body/iv, 'createBody' can be called on an AffineForOp with no body. PiperOrigin-RevId: 232060516	2019-03-29 16:06:49 -07:00
Lei Zhang	e0774c008f	[TableGen] Use tblgen::DagLeaf to model DAG arguments This CL added a tblgen::DagLeaf wrapper class with several helper methods for handling DAG arguments. It helps to refactor the rewriter generation logic to be more higher level. This CL also added a tblgen::ConstantAttr wrapper class for constant attributes. PiperOrigin-RevId: 232050683	2019-03-29 16:06:31 -07:00
Nicolas Vasilache	0353ef99eb	Cleanup EDSCs and start a functional auto-generated library of custom Ops This CL applies the following simplifications to EDSCs: 1. Rename Block to StmtList because an MLIR Block is a different, not yet supported, notion; 2. Rework Bindable to drop specific storage and just use it as a simple wrapper around Expr. The only value of Bindable is to force a static cast when used by the user to bind into the emitter. For all intended purposes, Bindable is just a lightweight check that an Expr is Unbound. This simplifies usage and reduces the API footprint. After playing with it for some time, it wasn't worth the API cognition overhead; 3. Replace makeExprs and makeBindables by makeNewExprs and copyExprs which is more explicit and less easy to misuse; 4. Add generally useful functionality to MLIREmitter: a. expose zero and one for the ubiquitous common lower bounds and step; b. add support to create already bound Exprs for all function arguments as well as shapes and views for Exprs bound to memrefs. 5. Delete Stmt::operator= and replace by a `Stmt::set` method which is more explicit. 6. Make Stmt::operator Expr() explicit. 7. Indexed.indices assertions are removed to pave the way for expressing slices and views as well as to work with 0-D memrefs. The CL plugs those simplifications with TableGen and allows emitting a full MLIR function for pointwise add. This "x.add" op is both type and rank-agnostic (by allowing ArrayRef of Expr passed to For loops) and opens the door to spinning up a composable library of existing and custom ops that should automate a lot of the tedious work in TF/XLA -> MLIR. Testing needs to be significantly improved but can be done in a separate CL. PiperOrigin-RevId: 231982325	2019-03-29 16:05:23 -07:00
River Riddle	9f22a2391b	Define an detail::OperandStorage class to handle managing instruction operands. This class stores operands in a similar way to SmallVector except for two key differences. The first is the inline storage, which is a trailing objects array. The second is that being able to dynamically resize the operand list is optional. This means that we can enable the cases where operations need to change the number of operands after construction without losing the spatial locality benefits of the common case (operation instructions / non-control flow instructions with a lifetime fixed number of operands). PiperOrigin-RevId: 231910497	2019-03-29 16:05:08 -07:00
Jacques Pienaar	82dc6a878c	Add fallback to native code op builder specification for patterns. This allow for arbitrarily complex builder patterns which is meant to cover initial cases while the modelling is improved and long tail cases/cases for which expanding the DSL would result in worst overall system. NFC just sorting the emit replace methods alphabetical in the class and file body. PiperOrigin-RevId: 231890352	2019-03-29 16:04:53 -07:00
Jacques Pienaar	4161d44bd5	Enable using constant attribute as matchers. Straight roll-forward of cl/231322019 that got accidentally reverted in the move. PiperOrigin-RevId: 231791464	2019-03-29 16:04:38 -07:00
Nicolas Vasilache	d4921f4a96	Address Performance issue in NestedMatcher A performance issue was reported due to the usage of NestedMatcher in ComposeAffineMaps. The main culprit was the ubiquitous copies that were occuring when appending even a single element in `matchOne`. This CL generally simplifies the implementation and removes one level of indirection by getting rid of auxiliary storage as well as simplifying the API. The users of the API are updated accordingly. The implementation was tested on a heavily unrolled example with ComposeAffineMaps and is now close in performance with an implementation based on stateless InstWalker. As a reminder, the whole ComposeAffineMaps pass is slated to disappear but the bug report was very useful as a stress test for NestedMatchers. Lastly, the following cleanups reported by @aminim were addressed: 1. make NestedPatternContext scoped within runFunction rather than at the Pass level. This was caused by a previous misunderstanding of Pass lifetime; 2. use defensive assertions in the constructor of NestedPatternContext to make it clear a unique such locally scoped context is allowed to exist. PiperOrigin-RevId: 231781279	2019-03-29 16:04:07 -07:00
Lei Zhang	66647a313a	[tablegen] Use tblgen:: classes for NamedAttribute and Operand fields This is another step towards hiding raw TableGen API calls. PiperOrigin-RevId: 231580827	2019-03-29 16:02:23 -07:00
Lei Zhang	726dc08e4d	[doc] Generate more readable description for attributes This CL added "description" field to AttrConstraint and Attr, like what we have for type classes. PiperOrigin-RevId: 231579853	2019-03-29 16:01:53 -07:00
Lei Zhang	18219caeb2	[doc] Generate more readable description for operands This CL mandated TypeConstraint and Type to provide descriptions and fixed various subclasses and definitions to provide so. The purpose is to enforce good documentation; using empty string as the default just invites oversight. PiperOrigin-RevId: 231579629	2019-03-29 16:01:38 -07:00
River Riddle	994111238b	Fold CallIndirectOp to CallOp when the callee operand is a known constant function. PiperOrigin-RevId: 231511697	2019-03-29 16:01:23 -07:00
Lei Zhang	a759cf3190	Include op results in generate TensorFlow/TFLite op docs * Emitted result lists for ops. * Changed to allow empty summary and description for ops. * Avoided indenting description to allow proper MarkDown rendering of formatting markers inside description content. * Used fixed width font for operand/attribute names. * Massaged TensorFlow op docs and generated dialect op doc. PiperOrigin-RevId: 231427574	2019-03-29 16:00:53 -07:00
Lei Zhang	c224a518f5	TableGen: Use DAG for op results Similar to op operands and attributes, use DAG to specify operation's results. This will allow us to provide names and matchers for outputs. Also Defined `outs` as a marker to indicate the start of op result list. PiperOrigin-RevId: 231422455	2019-03-29 16:00:22 -07:00
Lei Zhang	1dfc3ac5ce	Prefix Operator getter methods with "get" to be consistent PiperOrigin-RevId: 231416230	2019-03-29 15:59:46 -07:00
River Riddle	755538328b	Recommit: Define a AffineOps dialect as well as an AffineIfOp operation. Replace all instances of IfInst with AffineIfOp and delete IfInst. PiperOrigin-RevId: 231342063	2019-03-29 15:59:30 -07:00
Nicolas Vasilache	0f9436e56a	Move google-mlir to google_mlir Python modules cannot be defined under a directory that has a `-` character in its name inside of Google code. Rename to `google_mlir` which circumvents this limitation. PiperOrigin-RevId: 231329321	2019-03-29 15:42:55 -07:00
Nicolas Vasilache	ae772b7965	Automated rollback of changelist 231318632. PiperOrigin-RevId: 231327161	2019-03-29 15:42:38 -07:00
Jacques Pienaar	ad637f3cce	Enable using constant attribute as matchers. Update to allow constant attribute values to be used to match or as result in rewrite rule. Define variable ctx in the matcher to allow matchers to refer to the context of the operation being matched. PiperOrigin-RevId: 231322019	2019-03-29 15:42:23 -07:00
River Riddle	5ecef2b3f6	Define a AffineOps dialect as well as an AffineIfOp operation. Replace all instances of IfInst with AffineIfOp and delete IfInst. PiperOrigin-RevId: 231318632	2019-03-29 15:42:08 -07:00
Nicolas Vasilache	cacf05892e	Add a C API for EDSCs in other languages + python This CL adds support for calling EDSCs from other languages than C++. Following the LLVM convention this CL: 1. declares simple opaque types and a C API in mlir-c/Core.h; 2. defines the implementation directly in lib/EDSC/Types.cpp and lib/EDSC/MLIREmitter.cpp. Unlike LLVM however the nomenclature for these types and API functions is not well-defined, naming suggestions are most welcome. To avoid the need for conversion functions, Types.h and MLIREmitter.h include mlir-c/Core.h and provide constructors and conversion operators between the mlir::edsc type and the corresponding C type. In this first commit, mlir-c/Core.h only contains the types for the C API to allow EDSCs to work from Python. This includes both a minimal set of core MLIR types (mlir_context_t, mlir_type_t, mlir_func_t) as well as the EDSC types (edsc_mlir_emitter_t, edsc_expr_t, edsc_stmt_t, edsc_indexed_t). This can be restructured in the future as concrete needs arise. For now, the API only supports: 1. scalar types; 2. memrefs of scalar types with static or symbolic shapes; 3. functions with input and output of these types. The C API is not complete wrt ownership semantics. This is in large part due to the fact that python bindings are written with Pybind11 which allows very idiomatic C++ bindings. An effort is made to write a large chunk of these bindings using the C API but some C++isms are used where the design benefits from this simplication. A fully isolated C API will make more sense once we also integrate with another language like Swift and have enough use cases to drive the design. Lastly, this CL also fixes a bug in mlir::ExecutionEngine were the order of declaration of llvmContext and the JIT result in an improper order of destructors (which used to crash before the fix). PiperOrigin-RevId: 231290250	2019-03-29 15:41:53 -07:00
Lei Zhang	eb753f4aec	Add tblgen::Pattern to model Patterns defined in TableGen Similar to other tblgen:: abstractions, tblgen::Pattern hides the native TableGen API and provides a nicer API that is more coherent with the TableGen definitions. PiperOrigin-RevId: 231285143	2019-03-29 15:41:38 -07:00
Jacques Pienaar	0fbf4ff232	Define mAttr in terms of AttrConstraint. * Matching an attribute and specifying a attribute constraint is the same thing executionally, so represent it such. * Extract AttrConstraint helper to match TypeConstraint and use that where mAttr was previously used in RewriterGen. PiperOrigin-RevId: 231213580	2019-03-29 15:41:23 -07:00
Jacques Pienaar	8c7f106e53	Add value member to constant attribute specification base. String specification of the default value is the common case so just make it so. PiperOrigin-RevId: 231204081	2019-03-29 15:40:53 -07:00
Chris Lattner	b42bea215a	Change AffineApplyOp to produce a single result, simplifying the code that works with it, and updating the g3docs. PiperOrigin-RevId: 231120927	2019-03-29 15:40:38 -07:00
River Riddle	36babbd781	Change the ForInst induction variable to be a block argument of the body instead of the ForInst itself. This is a necessary step in converting ForInst into an operation. PiperOrigin-RevId: 231064139	2019-03-29 15:40:23 -07:00
Nicolas Vasilache	0e7a8a9027	Drop AffineMap::Null and IntegerSet::Null Addresses b/122486036 This CL addresses some leftover crumbs in AffineMap and IntegerSet by removing the Null method and cleaning up the constructors. As the ::Null uses were tracked down, opportunities appeared to untangle some of the Parsing logic and make it explicit where AffineMap/IntegerSet have ambiguous syntax. Previously, ambiguous cases were hidden behind the implicit pointer values of AffineMap* and IntegerSet* that were passed as function parameters. Depending the values of those pointers one of 3 behaviors could occur. This parsing logic convolution is one of the rare cases where I would advocate for code duplication. The more proper fix would be to make the syntax unambiguous or to allow some lookahead. PiperOrigin-RevId: 231058512	2019-03-29 15:40:08 -07:00
Nicolas Vasilache	81c7f2e2f3	Cleanup resource management and rename recursive matchers This CL follows up on a memory leak issue related to SmallVector growth that escapes the BumpPtrAllocator. The fix is to properly use ArrayRef and placement new to define away the issue. The following renaming is also applied: 1. MLFunctionMatcher -> NestedPattern 2. MLFunctionMatches -> NestedMatch As a consequence all allocations are now guaranteed to live on the BumpPtrAllocator. PiperOrigin-RevId: 231047766	2019-03-29 15:39:53 -07:00
Nicolas Vasilache	629f5b7fcb	Add a simple arity-agnostic invocation of JIT-compiled functions. This is useful to call generic function with unspecified number of arguments e.g. when interfacing with ML frameworks. PiperOrigin-RevId: 230974736	2019-03-29 15:38:08 -07:00
Uday Bondhugula	b588d58c5f	Update createAffineComputationSlice to generate single result affine maps - Update createAffineComputationSlice to generate a sequence of single result affine apply ops instead of one multi-result affine apply - update pipeline-data-transfer test case; while on this, also update the test case to use only single result affine maps, and make it more robust to change. PiperOrigin-RevId: 230965478	2019-03-29 15:37:53 -07:00
River Riddle	c3424c3c75	Allow operations to hold a blocklist and add support for parsing/printing a block list for verbose printing. PiperOrigin-RevId: 230951462	2019-03-29 15:37:37 -07:00
Alex Zinenko	6d37a255e2	Generic dialect conversion pass exercised by LLVM IR lowering This commit introduces a generic dialect conversion/lowering/legalization pass and illustrates it on StandardOps->LLVMIR conversion. It partially reuses the PatternRewriter infrastructure and adds the following functionality: - an actual pass; - non-default pattern constructors; - one-to-many rewrites; - rewriting terminators with successors; - not applying patterns iteratively (unlike the existing greedy rewrite driver); - ability to change function signature; - ability to change basic block argument types. The latter two things required, given the existing API, to create new functions in the same module. Eventually, this should converge with the rest of PatternRewriter. However, we may want to keep two pass versions: "heavy" with function/block argument conversion and "light" that only touches operations. This pass creates new functions within a module as a means to change function signature, then creates new blocks with converted argument types in the new function. Then, it traverses the CFG in DFS-preorder to make sure defs are converted before uses in the dominated blocks. The generic pass has a minimal interface with two hooks: one to fill in the set of patterns, and another one to convert types for functions and blocks. The patterns are defined as separate classes that can be table-generated in the future. The LLVM IR lowering pass partially inherits from the existing LLVM IR translator, in particular for type conversion. It defines a conversion pattern template, instantiated for different operations, and is a good candidate for tablegen. The lowering does not yet support loads and stores and is not connected to the translator as it would have broken the existing flows. Future patches will add missing support before switching the translator in a single patch. PiperOrigin-RevId: 230951202	2019-03-29 15:37:23 -07:00
Mehdi Amini	d9ce382fc9	Use a unique_ptr instead of manual deletion for PIMPL idiom (NFC) PiperOrigin-RevId: 230930254	2019-03-29 15:37:07 -07:00
Lei Zhang	ba1715f407	Pull TableGen op argument definitions into their own files PiperOrigin-RevId: 230923050	2019-03-29 15:36:52 -07:00
Lei Zhang	2de5e9fd19	Support op removal patterns in TableGen This CL adds a new marker, replaceWithValue, to indicate that no new result op is generated by applying a pattern. Instead, the matched DAG is replaced by an existing SSA value. Converted the tf.Identity converter to use the pattern. PiperOrigin-RevId: 230922323	2019-03-29 15:36:37 -07:00
Alex Zinenko	5a4403787f	Simple CPU runner This implements a simple CPU runner based on LLVM Orc JIT. The base functionality is provided by the ExecutionEngine class that compiles and links the module, and provides an interface for obtaining function pointers to the JIT-compiled MLIR functions and for invoking those functions directly. Since function pointers need to be casted to the correct pointer type, the ExecutionEngine wraps LLVM IR functions obtained from MLIR into a helper function with the common signature `void (void **)` where the single argument is interpreted as a list of pointers to the actual arguments passed to the function, eventually followed by a pointer to the result of the function. Additionally, the ExecutionEngine is set up to resolve library functions to those available in the current process, enabling support for, e.g., simple C library calls. For integration purposes, this also provides a simplistic runtime for memref descriptors as expected by the LLVM IR code produced by MLIR translation. In particular, memrefs are transformed into LLVM structs (can be mapped to C structs) with a pointer to the data, followed by dynamic sizes. This implementation only supports statically-shaped memrefs of type float, but can be extened if necessary. Provide a binary for the runner and a test that exercises it. PiperOrigin-RevId: 230876363	2019-03-29 15:36:08 -07:00
Uday Bondhugula	f94b15c247	Update dma-generate: update for multiple load/store op's per memref - introduce a way to compute union using symbolic rectangular bounding boxes - handle multiple load/store op's to the same memref by taking a union of the regions - command-line argument to provide capacity of the fast memory space - minor change to replaceAllMemRefUsesWith to not generate affine_apply if the supplied index remap was identity PiperOrigin-RevId: 230848185	2019-03-29 15:35:38 -07:00
River Riddle	4a7dfa7882	Add order bit to instructions to lazily track dominance queries. This improves the performance of dominance queries, which are used quite often within the compiler(especially within the verifier). This reduced the execution time of a few internal tests from ~2 minutes to ~4 seconds. PiperOrigin-RevId: 230819723	2019-03-29 15:35:23 -07:00
Chris Lattner	934b6d125f	Introduce a new operation hook point for implementing simple local canonicalizations of operations. The ultimate important user of this is going to be a funcBuilder->foldOrCreate<YourOp>(...) API, but for now it is just a more convenient way to write certain classes of canonicalizations (see the change in StandardOps.cpp). NFC. PiperOrigin-RevId: 230770021	2019-03-29 15:34:35 -07:00
River Riddle	451869f394	Add cloning functionality to Block and Function, this also adds support for remapping successor block operands of terminator operations. We define a new BlockAndValueMapping class to simplify mapping between cloned values. PiperOrigin-RevId: 230768759	2019-03-29 15:34:20 -07:00
River Riddle	f319bbbd28	Add a function pass to strip debug info from functions and instructions. PiperOrigin-RevId: 230654315	2019-03-29 15:33:50 -07:00
River Riddle	6859f33292	Migrate VectorOrTensorType/MemRefType shape api to use int64_t instead of int. PiperOrigin-RevId: 230605756	2019-03-29 15:33:20 -07:00
Feng Liu	b64998a6b3	Add a method to construct a CallSiteLoc which represents a stack of locations. PiperOrigin-RevId: 230592860	2019-03-29 15:33:05 -07:00
River Riddle	1210e92d86	Add asmparser/printer support for locations to make them round-trippable. Location printing is currently behind a command line flag "mlir-print-debuginfo", we can rethink this when we have a pass for stripping debug info or when we have support for printer flags. Example inline notation: trailing-location ::= 'loc' '(' location ')' // FileLineCol Location. %1 = "foo"() : () -> i1 loc("mysource.cc":10:8) // Name Location return loc("foo") // CallSite Location return loc(callsite("foo" at "mysource.cc":19:9)) // Fused Location /// Without metadata func @inline_notation() loc(fused["foo", "mysource.cc":10:8]) /// With metadata return loc(fused<"myPass">["foo", "foo2"]) // Unknown location. return loc(unknown) Locations are currently only printed with inline notation at the line of each instruction. Further work is needed to allow for reference notation, e.g: ... return loc 1 } ... loc 1 = "source.cc":10:1 PiperOrigin-RevId: 230587621	2019-03-29 15:32:49 -07:00
Lei Zhang	5654450853	Unify terms regarding assembly form to use generic vs. custom This CL just changes various docs and comments to use the term "generic" and "custom" when mentioning assembly forms. To be consist, several methods are also renamed: * FunctionParser::parseVerboseOperation() -> parseGenericOperation() * ModuleState::hasShorthandForm() -> hasCustomForm() * OpAsmPrinter::printDefaultOp() -> printGenericOp() PiperOrigin-RevId: 230568819	2019-03-29 15:32:35 -07:00
Uday Bondhugula	864d9e02a1	Update fusion cost model + some additional infrastructure and debug information for -loop-fusion - update fusion cost model to fuse while tolerating a certain amount of redundant computation; add cl option -fusion-compute-tolerance evaluate memory footprint and intermediate memory reduction - emit debug info from -loop-fusion showing what was fused and why - introduce function to compute memory footprint for a loop nest - getMemRefRegion readability update - NFC PiperOrigin-RevId: 230541857	2019-03-29 15:32:06 -07:00
Nicolas Vasilache	e4020c2d1a	Add support for Return in EDSCs This CL adds the Return op to EDSCs types and emitter. This allows generating full function bodies that can be compiled all the way down to LLVMIR and executed on CPU. At this point, the MLIR lacks the testing infrastructure to exercise this. End-to-end testing of full functions written in EDSCs is left for a future CL. PiperOrigin-RevId: 230527530	2019-03-29 15:31:50 -07:00
Uday Bondhugula	1b735dfe27	Refactor -dma-generate walker - NFC - ForInst::walkOps will also be used in an upcoming CL (cl/229438679); better to have this instead of deriving from the InstWalker PiperOrigin-RevId: 230413820	2019-03-29 15:31:03 -07:00
Uday Bondhugula	7669204304	Improve / fix documentation for affine map composition utilities - NFC - improve/fix doc comments for affine apply composition related methods. - drop makeSingleValueComposedAffineApply - really redundant and out of line in a public API; it's just returning the first result of the composed affine apply op, and not making a single result affine map or an affine_apply op. PiperOrigin-RevId: 230406169	2019-03-29 15:30:47 -07:00
Uday Bondhugula	94a03f864f	Allocate private/local buffers for slices accurately during fusion - the size of the private memref created for the slice should be based on the memref region accessed at the depth at which the slice is being materialized, i.e., symbolic in the outer IVs up until that depth, as opposed to the region accessed based on the entire domain. - leads to a significant contraction of the temporary / intermediate memref whenever the memref isn't reduced to a single scalar (through store fwd'ing). Other changes - update to promoteIfSingleIteration - avoid introducing unnecessary identity map affine_apply from IV; makes it much easier to write and read test cases and pass output for all passes that use promoteIfSingleIteration; loop-fusion test cases become much simpler - fix replaceAllMemrefUsesWith bug that was exposed by the above update - 'domInstFilter' could be one of the ops erased due to a memref replacement in it. - fix getConstantBoundOnDimSize bug: a division by the coefficient of the identifier was missing (the latter need not always be 1); add lbFloorDivisors output argument - rename getBoundingConstantSizeAndShape -> getConstantBoundingSizeAndShape PiperOrigin-RevId: 230405218	2019-03-29 15:30:31 -07:00
Jacques Pienaar	34c6f8c6e4	Add default attr value & define tf.AvgPool op and use pattern for rewrite. Add default values to attributes, to allow attribute being left unspecified. The attr getter will always return an attribute so callers need not check for it, if the attribute is not set then the default will be returned (at present the default will be constructed upon query but this will be changed). Add op definition for tf.AvgPool in ops.td, rewrite matcher using pattern using attribute matching & transforms. Adding some helper functions to make it simpler. Handle attributes with dialect prefix and map them to getter without dialect prefix. Note: VerifyAvgPoolOp could probably be autogenerated by know given the predicate specification on attributes, but deferring that to a follow up. PiperOrigin-RevId: 230364857	2019-03-29 15:29:59 -07:00
Jacques Pienaar	a280e3997e	Start doc generation pass. Start doc generation pass that generates simple markdown output. The output is formatted simply[1] in markdown, but this allows seeing what info we have, where we can refine the op description (e.g., the inputs is probably redundant), what info is missing (e.g., the attributes could probably have a description). The formatting of the description is still left up to whatever was in the op definition (which luckily, due to the uniformity in the .td file, turned out well but relying on the indentation there is fragile). The mechanism to autogenerate these post changes has not been added yet either. The output file could be run through a markdown formatter too to remove extra spaces. [1]. This is not proposal for final style :) There could also be a discussion around single doc vs multiple (per dialect, per op), whether we want a TOC, whether operands/attributes should be headings or just formatted differently ... PiperOrigin-RevId: 230354538	2019-03-29 15:29:29 -07:00
Nicolas Vasilache	2553473618	Make MLIREmitter::bindConstant variadic This is needed to allow binding to more constant types. Tests that exercise this behavior will come in a followup CL. In the meantime this does not breaks things. PiperOrigin-RevId: 230320621	2019-03-29 15:29:13 -07:00
River Riddle	512d87cefc	Add a constant folding hook to ExtractElementOp to fold extracting the element of a constant. This also adds a 'getValue' function to DenseElementsAttr and SparseElementsAttr to get the element at a constant index. PiperOrigin-RevId: 230098938	2019-03-29 15:28:28 -07:00
Nicolas Vasilache	119af6712e	Cleanup spurious printing bits in EDSCs This CL also makes ScopedEDSCContexts to reset the Bindable numbering when creating a new context. This is useful to write minimal tests that don't use FileCheck pattern captures for now. PiperOrigin-RevId: 230079997	2019-03-29 15:28:13 -07:00
Nicolas Vasilache	9f3f39d61a	Cleanup EDSCs This CL performs a bunch of cleanups related to EDSCs that are generally useful in the context of using them with a simple wrapping C API (not in this CL) and with simple language bindings to Python and Swift. PiperOrigin-RevId: 230066505	2019-03-29 15:27:58 -07:00
River Riddle	174f66bc8a	Restructure FloatAttr::get(Type, double) to allow for loss of precision when converting the double value to the target type semantics. A comment is added to discourage the use of this method for non simple constants. The new handling also removes the direct use of the float constructor for APFloat to avoid runtime float cast asan errors. PiperOrigin-RevId: 230014696	2019-03-29 15:27:44 -07:00
River Riddle	122f710c6c	When constructing or hashing a key type in TypeUniquer first check if the derived storage type provides a 'getKey' or 'hashKey'. PiperOrigin-RevId: 229939463	2019-03-29 15:26:44 -07:00
Nicolas Vasilache	24e5a72dac	Fix AffineApply corner case This CL adds a test reported by andydavis@ and fixes the corner case that appears when operands do not come from an AffineApply and no Dim composition is needed. In such cases, we would need to create an empty map which is disallowed. The composition in such cases becomes trivial: there is no composition. This CL also updates the name AffineNormalizer to AffineApplyNormalizer. PiperOrigin-RevId: 229819234	2019-03-29 15:25:59 -07:00
River Riddle	0e81d7c420	[MLIR] Add functionality for constructing a DenseElementAttr from an array of attributes and rerwite DenseElementsAttr::writeBits/readBits to handle non uniform bitwidths. This fixes asan failures that happen when using non uniform bitwidths. PiperOrigin-RevId: 229815107	2019-03-29 15:25:45 -07:00
Jacques Pienaar	d6f84fa5d9	Add AttrConstraint to enable generating verification for attribute values. Change MinMaxAttr to match hasValidMinMaxAttribute behavior. Post rewriting the other users of that function it could be removed too. The currently generated error message is: error: 'tfl.fake_quant' op attribute 'minmax' failed to satisfy constraint of MinMaxAttr PiperOrigin-RevId: 229775631	2019-03-29 15:25:13 -07:00
Nicolas Vasilache	4573a8da9a	Fix improperly indexed DimOp in LowerVectorTransfers.cpp This CL fixes a misunderstanding in how to build DimOp which triggered execution issues in the CPU path. The problem is that, given a `memref<?x4x?x8x?xf32>`, the expressions to construct the dynamic dimensions should be: `dim %arg, 0 : memref<?x4x?x8x?xf32>` `dim %arg, 2 : memref<?x4x?x8x?xf32>` and `dim %arg, 4 : memref<?x4x?x8x?xf32>` Before this CL, we wold construct: `dim %arg, 0 : memref<?x4x?x8x?xf32>` `dim %arg, 1 : memref<?x4x?x8x?xf32>` `dim %arg, 2 : memref<?x4x?x8x?xf32>` and expect the other dimensions to be constants. This assumption seems consistent at first glance with the syntax of alloc: ``` %tensor = alloc(%M, %N, %O) : memref<?x4x?x8x?xf32> ``` But this was actuallyincorrect. This CL also makes the relevant functions available to EDSCs and removes duplication of the incorrect function. PiperOrigin-RevId: 229622766	2019-03-29 15:24:13 -07:00
Lei Zhang	3766332533	Change impl::printBinaryOp() to consider operand and result type The operand and result types of binary ops are not necessarily the same. For those binary ops, we cannot print in the short-form assembly. Enhance impl:::printBinaryOp to consider operand and result types to select which assembly form to use. PiperOrigin-RevId: 229608142	2019-03-29 15:23:28 -07:00
Alex Zinenko	05b02bb98e	TableGen: implement predicate tree and basic simplification A recent change in TableGen definitions allowed arbitrary AND/OR predicate compositions at the cost of removing known-true predicate simplification. Introduce a more advanced simplification mechanism instead. In particular, instead of folding predicate C++ expressions directly in TableGen, keep them as is and build a predicate tree in TableGen C++ library. The predicate expression-substitution mechanism, necessary to implement complex predicates for nested classes such as `ContainerType`, is replaced by a dedicated predicate. This predicate appears in the predicate tree and can be used for tree matching and separation. More specifically, subtrees defined below such predicate may be subject to different transformations than those that appear above. For example, a subtree known to be true above the substitution predicate is not necessarily true below it. Use the predicate tree structure to eliminate known-true and known-false predicates before code emission, as well as to collapse AND and OR predicates if their value can be deduced based on the value of one child. PiperOrigin-RevId: 229605997	2019-03-29 15:22:58 -07:00
Jacques Pienaar	a5827fc91d	Add attribute matching and transform to pattern rewrites. Start simple with single predicate match & transform rules for attributes. * Its unclear whether modelling Attr predicates will be needed so start with allowing matching attributes with a single predicate. * The input and output attr type often differs and so add ability to specify a transform between the input and output format. PiperOrigin-RevId: 229580879	2019-03-29 15:22:14 -07:00
MLIR Team	27d067e164	LoopFusion improvements: ) Adds support for fusing into consumer loop nests with multiple loads from the same memref. ) Adds support for reducing slice loop trip count by projecting out destination loop IVs greater than destination loop depth. *) Removes dependence on src loop depth and simplifies cost model computation. PiperOrigin-RevId: 229575126	2019-03-29 15:21:59 -07:00
Alex Zinenko	bd161ae5bc	TableGen: untie Attr from Type In TableGen definitions, the "Type" class has been used for types of things that can be stored in Attributes, but not necessarily present in the MLIR type system. As a consequence, records like "String" or "DerviedAttrBody" were of class "Type", which can be confusing. Furthermore, the "builderCall" field of the "Type" class serves only for attribute construction. Some TableGen "Type" subclasses that correspond to MLIR kinds of types do not have a canonical way of construction only from the data available in TableGen, e.g. MemRefType would require the list of affine maps. This leads to a conclusion that the entities that describe types of objects appearing in Attributes should be independent of "Type": they have some properties "Type"s don't and vice versa. Do not parameterize Tablegen "Attr" class by an instance of "Type". Instead, provide a "constBuilderCall" field that can be used to build an attribute from a constant value stored in TableGen instead of indirectly going through Attribute.Type.builderCall. Some attributes still don't have a "constBuilderCall" because they used to depend on types without a "builderCall". Drop definitions of class "Type" that don't correspond to MLIR Types. Provide infrastructure to define type-dependent attributes and string-backed attributes for convenience. PiperOrigin-RevId: 229570087	2019-03-29 15:21:28 -07:00
Lei Zhang	590012772d	Promote broadcast logic from TensorFlowLite to Dialect/ directory We also need the broadcast logic in the TensorFlow dialect. Move it to a Dialect/ directory for a broader scope. This Dialect/ directory is intended for code not in core IR, but can potentially be shared by multiple dialects. Apart from fixing TensorFlow op TableGen to use this trait, this CL only contains mechanical code shuffling. PiperOrigin-RevId: 229563911	2019-03-29 15:21:14 -07:00
Lei Zhang	254821d1db	Rename hasCanonicalizationPatterns to hasCanonicalizer The latter is shorter but still conveys the idea clearly. It is also more consistent with hasConstantFolder. PiperOrigin-RevId: 229561774	2019-03-29 15:20:44 -07:00
River Riddle	18fe1ffcd7	Move the storage of uniqued TypeStorage objects into TypeUniquer and give each context a unique TypeUniquer instance. PiperOrigin-RevId: 229460053	2019-03-29 15:19:56 -07:00
Uday Bondhugula	03e15e1b9f	Minor code cleanup - NFC. - readability changes PiperOrigin-RevId: 229443430	2019-03-29 15:19:41 -07:00
River Riddle	f9d2eb1c8c	Change derived type storage objects to define an 'operator==(const KeyTy &)' instead of converting to the KeyTy. This allows for handling cases where the KeyTy does not provide an equality operator on itself. PiperOrigin-RevId: 229423249	2019-03-29 15:19:11 -07:00
Alex Zinenko	0e58de70e7	Initial version of the LLVM IR dialect LLVM IR types are defined using MLIR's extendable type system. The dialect provides the only type kind, LLVMType, that wraps an llvm::Type*. Since LLVM IR types are pointer-unique, MLIR type systems relies on those pointers to perform its own type unique'ing. Type parsing and printing is delegated to LLVM libraries. Define MLIR operations for the LLVM IR instructions currently used by the translation to the LLVM IR Target to simplify eventual transition. Operations classes are defined using TableGen. LLVM IR instruction operands that are only allowed to take constant values are accepted as attributes instead. All operations are using verbose form for printing and parsing. PiperOrigin-RevId: 229400375	2019-03-29 15:18:37 -07:00
Alex Zinenko	44e9869f1a	TableGen: extract TypeConstraints from Type MLIR has support for type-polymorphic instructions, i.e. instructions that may take arguments of different types. For example, standard arithmetic operands take scalars, vectors or tensors. In order to express such instructions in TableGen, we need to be able to verify that a type object satisfies certain constraints, but we don't need to construct an instance of this type. The existing TableGen definition of Type requires both. Extract out a TypeConstraint TableGen class to define restrictions on types. Define the Type TableGen class as a subclass of TypeConstraint for consistency. Accept records of the TypeConstraint class instead of the Type class as values in the Arguments class when defining operators. Replace the predicate logic TableGen class based on conjunctive normal form with the predicate logic classes allowing for abitrary combinations of predicates using Boolean operators (AND/OR/NOT). The combination is implemented using simple string rewriting of C++ expressions and, therefore, respects the short-circuit evaluation order. No logic simplification is performed at the TableGen level so all expressions must be valid C++. Maintaining CNF using TableGen only would have been complicated when one needed to introduce top-level disjunction. It is also unclear if it could lead to a significantly simpler emitted C++ code. In the future, we may replace inplace predicate string combination with a tree structure that can be simplified in TableGen's C++ driver. Combined, these changes allow one to express traits like ArgumentsAreFloatLike directly in TableGen instead of relying on C++ trait classes. PiperOrigin-RevId: 229398247	2019-03-29 15:18:23 -07:00
Nicolas Vasilache	515ce1e68e	Add edsc::Indexed helper struct to act as syntactic sugar This CL adds edsc::Indexed. This helper class exists purely for sugaring purposes and allows writing expressions such as: ```mlir Indexed A(...), B(...), C(...); ForNest(ivs, zeros, shapeA, ones, { C[ivs] = A[ivs] + B[ivs] }); ``` PiperOrigin-RevId: 229388644	2019-03-29 15:17:37 -07:00
Nicolas Vasilache	424041ad58	Add EDSC sugar This allows load, store and ForNest to be used with both Expr and Bindable. This simplifies writing generic pieces of MLIR snippet. For instance, a generic pointwise add can now be written: ```cpp // Different Bindable ivs, one per loop in the loop nest. auto ivs = makeBindables(shapeA.size()); Bindable zero, one; // Same bindable, all equal to `zero`. SmallVector<Bindable, 8> zeros(ivs.size(), zero); // Same bindable, all equal to `one`. SmallVector<Bindable, 8> ones(ivs.size(), one); // clang-format off Bindable A, B, C; Stmt scalarA, scalarB, tmp; Stmt block = edsc::Block({ ForNest(ivs, zeros, shapeA, ones, { scalarA = load(A, ivs), scalarB = load(B, ivs), tmp = scalarA + scalarB, store(tmp, C, ivs) }), }); // clang-format on ``` This CL also adds some extra support for pretty printing that will be used in a future CL when we introduce standalone testing of EDSCs. At the momen twe are lacking the basic infrastructure to write such tests. PiperOrigin-RevId: 229375850	2019-03-29 15:16:53 -07:00
Uday Bondhugula	6e4f3e40c7	Fix outdated comments PiperOrigin-RevId: 229300301	2019-03-29 15:16:08 -07:00
River Riddle	3bb35ad0dc	Don't allocate a buffer for an empty ArrayRef in TypeStorageAllocator. PiperOrigin-RevId: 229290802	2019-03-29 15:15:52 -07:00
River Riddle	b9c791b96d	Change derived type storage objects to be constructed with an instance of the KeyTy. This will simplify the cases where a type can be constructed, and need to be verified, in multiple ways. PiperOrigin-RevId: 229279000	2019-03-29 15:15:37 -07:00
Lei Zhang	61ec6c0992	Swap the type and attribute parameter in ConstantOp::build() This is to keep consistent with other TableGen generated builders so that we can also use this builder in TableGen rules. PiperOrigin-RevId: 229244630	2019-03-29 15:14:52 -07:00
River Riddle	ed26dd0421	Add a canonicalization pattern for conditional branch to fold constant branch conditions. PiperOrigin-RevId: 229242007	2019-03-29 15:14:37 -07:00
MLIR Team	38c2fe3158	LoopFusion: automate selection of source loop nest slice depth and destination loop nest insertion depth based on a simple cost model (cost model can be extended/replaced at a later time). ) LoopFusion: Adds fusion cost function which compares the cost of the fused loop nest, with the cost of the two unfused loop nests to determine if it is profitable to fuse the candidate loop nests. The fusion cost function is run for various combinations for src/dst loop depths attempting find the minimum cost setting for src/dst loop depths which does not increase the computational cost when the loop nests are fused. Combinations of src/dst loop depth are evaluated attempting to maximize loop depth (i.e. take a bigger computation slice from the source loop nest, and insert it deeper in the destination loop nest for better locality). ) LoopFusion: Adds utility to compute op instance count for loop nests, sliced loop nests, and to compute the cost of a loop nest fused with another sliced loop nest. ) LoopFusion: canonicalizes slice bound AffineMaps (and updates related tests). ) Analysis::Utils: Splits getBackwardComputationSlice into two functions: one which calculates and returns the slice loop bounds for analysis by LoopFusion, and the other for insertion of the computation slice (ones fusion has calculated the min-cost src/dst loop depths). *) Test: Adds multiple unit tests to test the new functionality. PiperOrigin-RevId: 229219757	2019-03-29 15:13:53 -07:00
River Riddle	d6b71b0d57	Add a Block::dropAllReferences to drop all references from held instructions and call it when clearing the block. This fixes a bug where ForInst/IfInst instructions may still have references to values while being destroyed. PiperOrigin-RevId: 229207798	2019-03-29 15:13:39 -07:00
River Riddle	791049fb34	Add a FloatAttr::getChecked, and invoke it during Attribute parsing. PiperOrigin-RevId: 229167099	2019-03-29 15:13:10 -07:00
Nicolas Vasilache	1b171e9357	Add EDSC support for operator* PiperOrigin-RevId: 229097351	2019-03-29 15:12:55 -07:00
Nicolas Vasilache	362557e11c	Simplify compositions of AffineApply This CL is the 6th and last on the path to simplifying AffineMap composition. This removes `AffineValueMap::forwardSubstitutions` and replaces it by simple calls to `fullyComposeAffineMapAndOperands`. PiperOrigin-RevId: 228962580	2019-03-29 15:11:56 -07:00
River Riddle	ba9a544615	Simplify Attribute constructor definitions. PiperOrigin-RevId: 228926113	2019-03-29 15:11:41 -07:00
River Riddle	6985dc62b5	Make sure that type construction arguments are forwarded. PiperOrigin-RevId: 228910216	2019-03-29 15:10:55 -07:00
Lei Zhang	311af4abf3	Const fold splat vectors/tensors in standard add, sub, and mul ops The const folding logic is structurally similar, so use a template to abstract the common part. Moved mul(x, 0) to a legalization pattern to be consistent with mul(x, 1). Also promoted getZeroAttr() to be a method on Builder since it is expected to be frequently used. PiperOrigin-RevId: 228891989	2019-03-29 15:09:55 -07:00
Jacques Pienaar	4c0faef943	Avoid redundant predicate checking in type matching. Expand type matcher template generator to consider a set of predicates that are known to hold. This avoids inserting redundant checking for trivially true predicates (for example predicate that hold according to the op definition). This only targets predicates that trivially holds and does not attempt any logic equivalence proof. PiperOrigin-RevId: 228880468	2019-03-29 15:09:25 -07:00
Lei Zhang	ac5a50e1e4	Extract openInputFile() into Support/FileUtilities Multiple binaries have the needs to open input files. Use this function to de-duplicate the code. Also changed openOutputFile() to return errors using std::string since it is a library call and accessing I/O in library call is not friendly. PiperOrigin-RevId: 228878221	2019-03-29 15:09:11 -07:00
River Riddle	e8d0e1f72a	Provide dialect hooks for defining named aliases for AffineMap/IntegerSet/Type. The AsmPrinter will then query registered dialects for aliases of symbols used within the module and use them in place. PiperOrigin-RevId: 228831678	2019-03-29 15:08:55 -07:00
Nicolas Vasilache	cfa5831960	Uniformize composition of AffineApplyOp by construction This CL is the 5th on the path to simplifying AffineMap composition. This removes the distinction between normalized single-result AffineMap and more general composed multi-result map. One nice byproduct of making the implementation driven by single-result is that the multi-result extension is a trivial change: the implementation is still single-result and we just use: ``` unsigned idx = getIndexOf(...); map.getResult(idx); ``` This CL also fixes an AffineNormalizer implementation issue related to symbols. Namely it stops performing substitutions on symbols in AffineNormalizer and instead concatenates them all to be consistent with the call to `AffineMap::compose(AffineMap)`. This latter call to `compose` cannot perform simplifications of symbols coming from different maps based on positions only: i.e. dims are applied and renumbered but symbols must be concatenated. The only way to determine whether symbols from different AffineApply are the same is to look at the concrete values. The canonicalizeMapAndOperands is thus extended with behavior to support replacing operands that appear multiple times. Lastly, this CL demonstrates that the implementation is correct by rewriting ComposeAffineMaps using only `makeComposedAffineApply`. The implementation uses a matcher because AffineApplyOp are introduced as composed operations on the fly instead of iteratively forwardSubstituting. For this purpose, a walker would revisit freshly introduced AffineApplyOp. Regardless, ComposeAffineMaps is scheduled to disappear, this CL replaces the implementation based on iterative `forwardSubstitute` by a composed-by-construction `makeComposedAffineApply`. Remaining calls to `forwardSubstitute` will be removed in the next CL. PiperOrigin-RevId: 228830443	2019-03-29 15:08:40 -07:00
Jacques Pienaar	fdcfa0bb52	Remove unary, binary, ternary ops from op_base Previously these were all defined as operating on Tensors which is not true in general. These don't serve much now so just inline it and we can extract it out again. PiperOrigin-RevId: 228827011	2019-03-29 15:08:25 -07:00
Uday Bondhugula	2370c601ba	Add safeguard against FM explosion - FM has a worst case exponential complexity. For our purposes, this worst case is rarely expected, but could still appear due to improperly constructed constraints (a logical/memory error in other methods for eg.) or artificially created arbitrarily complex integer sets (adversarial / fuzz tests). Add a check to detect such an explosion in the number of constraints and conservatively return false from isEmpty() (instead of running out of memory or running for too long). - Add an artifical virus test case. PiperOrigin-RevId: 228753496	2019-03-29 15:07:55 -07:00
River Riddle	56b99b4045	Add a few utilities for terminator management: * Get a specific successor operand. * Iterator support for non successor operands. * Fix bug when removing the last operand from the operand list of an Instruction. * Get the argument number for a BlockArgument. PiperOrigin-RevId: 228660898	2019-03-29 15:07:25 -07:00
Lei Zhang	9b034f0bfd	Add tblgen::Attribute to wrap around TableGen Attr defs This CL added a tblgen::Attribute class to wrap around raw TableGen Record getValue*() calls on Attr defs, which will provide a nicer API for handling TableGen Record. PiperOrigin-RevId: 228581107	2019-03-29 15:06:41 -07:00
Uday Bondhugula	303c09299f	Fix affine expr flattener bug + improve simplification in a particular scenario - fix visitDivExpr: constraints constructed for localVarCst used the original divisor instead of the simplified divisor; fix this. Add a simple test case in memref-bound-check that reproduces this bug - although this was encountered in the context of slicing for fusion. - improve mod expr flattening: when flattening mod expressions, cancel out the GCD of the numerator and denominator so that we can get a simpler flattened form along with a simpler floordiv local var for it PiperOrigin-RevId: 228539928	2019-03-29 15:06:11 -07:00
Lei Zhang	3e5ee82b81	Put Operator and PredCNF into the tblgen namespace PiperOrigin-RevId: 228429130	2019-03-29 15:05:38 -07:00
Lei Zhang	b2cc2c344e	Add tblgen::Type to wrap around TableGen Type defs This CL added a tblgen::Type class to wrap around raw TableGen Record getValue*() calls on Type defs, which will provide a nicer API for handling TableGen Record. The PredCNF class is also updated to work together with tblgen::Type. PiperOrigin-RevId: 228429090	2019-03-29 15:05:23 -07:00
Chris Lattner	2b902f1288	Delete FuncBuilder::createChecked. It is perhaps still a good idea, but has no clients. Let's re-add it in the future if there is ever a reason to. NFC. Unrelatedly, add a use of a variable to unbreak the non-assert build. PiperOrigin-RevId: 228284026	2019-03-29 15:05:08 -07:00
Nicolas Vasilache	997415fa77	Extract BuiltinOps::canonicalizeMapAndOperands This CL is the 4th on the path to simplifying AffineMap composition. This CL extract canonicalizeMapAndOperands so it can be reused by other functions; in particular, this will be used in `makeNormalizedAffineApply`. PiperOrigin-RevId: 228277890	2019-03-29 15:04:52 -07:00
Nicolas Vasilache	00aac70159	Move makeNormalizedAffineApply This CL is the 3rd on the path to simplifying AffineMap composition. This CL just moves `makeNormalizedAffineApply` from VectorAnalysis to AffineAnalysis where it more naturally belongs. PiperOrigin-RevId: 228277182	2019-03-29 15:04:38 -07:00
Nicolas Vasilache	c6f798a976	Introduce AffineMap::compose(AffineMap) This CL is the 2nd on the path to simplifying AffineMap composition. This CL uses the now accepted `AffineExpr::compose(AffineMap)` to implement `AffineMap::compose(AffineMap)`. Implications of keeping the simplification function in Analysis are documented where relevant. PiperOrigin-RevId: 228276646	2019-03-29 15:04:20 -07:00
Nicolas Vasilache	c449e46ceb	Introduce AffineExpr::compose(AffineMap) This CL is the 1st on the path to simplifying AffineMap composition. This CL uses the now accepted AffineExpr.replaceDimsAndSymbols to implement `AffineExpr::compose(AffineMap)`. Arguably, `simplifyAffineExpr` should be part of IR and not Analysis but this CL does not yet pull the trigger on that. PiperOrigin-RevId: 228265845	2019-03-29 15:03:36 -07:00
Uday Bondhugula	21baf86a2f	Extend loop-fusion's slicing utility + other fixes / updates - refactor toAffineFromEq and the code surrounding it; refactor code into FlatAffineConstraints::getSliceBounds - add FlatAffineConstraints methods to detect identifiers as mod's and div's of other identifiers - add FlatAffineConstraints::getConstantLower/UpperBound - Address b/122118218 (don't assert on invalid fusion depths cmdline flags - instead, don't do anything; change cmdline flags src-loop-depth -> fusion-src-loop-depth - AffineExpr/Map print method update: don't fail on null instances (since we have a wrapper around a pointer, it's avoidable); rationale: dump/print methods should never fail if possible. - Update memref-dataflow-opt to add an optimization to avoid a unnecessary call to IsRangeOneToOne when it's trivially going to be true. - Add additional test cases to exercise the new support - update a few existing test cases since the maps are now generated uniformly with all destination loop operands appearing for the backward slice - Fix projectOut - fix wrong range for getBestElimCandidate. - Fix for getConstantBoundOnDimSize() - didn't show up in any test cases since we didn't have any non-hyperrectangular ones. PiperOrigin-RevId: 228265152	2019-03-29 15:03:20 -07:00
Uday Bondhugula	56b3640b94	Misc readability and doc / code comment related improvements - NFC - when SSAValue/MLValue existed, code at several places was forced to create additional aggregate temporaries of SmallVector<SSAValue/MLValue> to handle the conversion; get rid of such redundant code - use filling ctors instead of explicit loops - for smallvectors, change insert(list.end(), ...) -> append(... - improve comments at various places - turn getMemRefAccess into MemRefAccess ctor and drop duplicated getMemRefAccess. In the next CL, provide getAccess() accessors for load, store, DMA op's to return a MemRefAccess. PiperOrigin-RevId: 228243638	2019-03-29 15:02:41 -07:00
Jacques Pienaar	00ac2f6eb4	Verify the size of the vector in generated op verify. PiperOrigin-RevId: 228195756	2019-03-29 15:02:10 -07:00
Lei Zhang	f8bbe5deca	Various tiny refinements over TableGen Operator class Use "native" vs "derived" to differentiate attributes on ops: native ones are specified when creating the op as a part of defining the op, while derived ones are computed from properties of the op. PiperOrigin-RevId: 228186962	2019-03-29 15:01:56 -07:00
Jacques Pienaar	65fc8643ec	Addresing follow up comments from cl/227991412. PiperOrigin-RevId: 228185819	2019-03-29 15:01:40 -07:00
River Riddle	3b2c5600d9	Add support for types belonging to unknown dialects. This allows for types to be round tripped even if the dialect that defines them is not linked in. These types will be represented by a new "UnknownType" that uniques them based upon the dialect namespace and raw string type data. PiperOrigin-RevId: 228184629	2019-03-29 15:01:11 -07:00
Jacques Pienaar	aae85ddce1	Match attributes in input pattern. Bind attributes similar to operands. Use to rewrite leakyreulo and const rewrite pattern. The attribute type/attributes are not currently checked so should only be used where the attributes match due to the construction of the op. To support current attribute namespacing, convert __ in attribute name to "$" for matching purposes ('$' is not valid character in variable in TableGen). Some simplification to make it simpler to specify indented ostream and avoid so many spaces. The goal is not to have perfectly formatted code generated but good enough so that its still easy to read for a user. PiperOrigin-RevId: 228183639	2019-03-29 15:00:55 -07:00
Jacques Pienaar	8d849eb4b9	Add static shape tensor type and rewrite squeeze and reshape rewrites as patterns. This also moves the predicate declaration in op_base into one section. PiperOrigin-RevId: 228170793	2019-03-29 15:00:40 -07:00
Alex Zinenko	92a899f629	Drop all uses of the ForInst induction variable before deleting ForInst The `for` instruction defines the loop induction variable it uses. In the well-formed IR, the induction variable can only be used by the body of the `for` loop. Existing implementation was explicitly cleaning the body of the for loop to remove all uses of the induction variable before removing its definition. However, in ill-formed IR that may appear in some stages of parsing, there may be (invalid) users of the loop induction variable outside the loop body. In case of unsuccessful parsing, destructor of the ForInst-defined Value would assert because there are remaining though invalid users of this Value. Explicitly drop all uses of the loop induction Value when destroying a ForInst. It is no longer necessary to explicitly clean the body of the loop, destructor of the block will take care of this. PiperOrigin-RevId: 228168880	2019-03-29 15:00:26 -07:00
Uday Bondhugula	94c2d969ce	Rename getAffineBinaryExpr -> getAffineBinaryOpExpr, getBinaryAffineOpExpr -> getAffineBinaryOpExpr for consistency (NFC) - this is consistent with the name of the class and getAffineDimExpr/ConstantExpr, etc. PiperOrigin-RevId: 228164959	2019-03-29 14:59:52 -07:00
Alex Zinenko	c47ed53211	Add simple constant folding hook for CmpIOp Integer comparisons can be constant folded if both of their arguments are known constants, which we can compare in the compiler. This requires implementing all comparison predicates, but thanks to consistency between LLVM and MLIR comparison predicates, we have a one-to-one correspondence between predicates and llvm::APInt comparison functions. Constant folding of comparsions with maximum/minimum values of the integer type are left for future work. This will be used to test the lowering of mod/floordiv/ceildiv in affine expressions at compile time. PiperOrigin-RevId: 228077580	2019-03-29 14:59:22 -07:00
Alex Zinenko	bc04556cf8	Introduce integer division and remainder operations This adds signed/unsigned integer division and remainder operations to the StandardOps dialect. Two versions are required because MLIR integers are signless, but the meaning of the leading bit is important in division and affects the results. LLVM IR made a similar choice. Define the operations in the tablegen file and add simple constant folding hooks in the C++ implementation. Handle signed division overflow and division by zero errors in constant folding. Canonicalization is left for future work. These operations are necessary to lower affine_apply's down to LLVM IR. PiperOrigin-RevId: 228077549	2019-03-29 14:58:52 -07:00
Nicolas Vasilache	28cf580555	Cleanup spurious DenseMap include PiperOrigin-RevId: 228059305	2019-03-29 14:58:38 -07:00
Jacques Pienaar	8f24943826	Verify type of operands match those specifed in op registry. Expand type to include matcher predicates. Use CNF form to allow specifying combinations of constraints for type. The matching call for the type is used to verify the construction of the operation as well as in rewrite pattern generation. The matching initially includes redundant checks (e.g., even if the operand of the op is guaranteed to satisfy some requirement, it is still checked during matcher generation for now). As well as some of the traits specified now check what the generated code already checks. Some of the traits can be removed in future as the verify method will include the relevant checks based on the op definition already. More work is needed for variadic operands. CNF form is used so that in the follow up redundant checks in the rewrite patterns could be omitted (e.g., when matching a F32Tensor, one does not need to verify that op X's operand 0 is a Tensor if that is guaranteed by op X's definition). The alternative was to have single matcher function specified, but this would not allow for reasoning about what attributes already hold (at the level of PredAtoms). Use this new operand type restrictions to rewrite BiasAdd with floating point operands as declarative pattern. PiperOrigin-RevId: 227991412	2019-03-29 14:58:23 -07:00
Uday Bondhugula	8496f2c30b	Complete TODOs / cleanup for loop-fusion utility - this is CL 1/2 that does a clean up and gets rid of one limitation in an underlying method - as a result, fusion works for more cases. - fix bugs/incomplete impl. in toAffineMapFromEq - fusing across rank changing reshapes for example now just works For eg. given a rank 1 memref to rank 2 memref reshape (64 -> 8 x 8) like this, -loop-fusion -memref-dataflow-opt now completely fuses and inlines/store-forward to get rid of the temporary: INPUT // Rank 1 -> Rank 2 reshape for %i0 = 0 to 64 { %v = load %A[%i0] store %v, %B[%i0 floordiv 8, i0 mod 8] } for %i1 = 0 to 8 for %i2 = 0 to 8 %w = load %B[%i1, i2] "foo"(%w) : (f32) -> () OUTPUT $ mlir-opt -loop-fusion -memref-dataflow-opt fuse_reshape.mlir #map0 = (d0, d1) -> (d0 * 8 + d1) mlfunc @fuse_reshape(%arg0: memref<64xf32>) { for %i0 = 0 to 8 { for %i1 = 0 to 8 { %0 = affine_apply #map0(%i0, %i1) %1 = load %arg0[%0] : memref<64xf32> "foo"(%1) : (f32) -> () } } } AFAIK, there is no polyhedral tool / compiler that can perform such fusion - because it's not really standard loop fusion, but possible through a generalized slicing-based approach such as ours. PiperOrigin-RevId: 227918338	2019-03-29 14:57:22 -07:00
Nicolas Vasilache	618c6a74c6	[MLIR] Introduce normalized single-result unbounded AffineApplyOp Supervectorization does not plan on handling multi-result AffineMaps and non-canonical chains of > 1 AffineApplyOp. This CL introduces a simpler abstraction and composition of single-result unbounded AffineApplyOp by using the existing unbound AffineMap composition. This CL adds a simple API call and relevant tests: ```c++ OpPointer<AffineApplyOp> makeNormalizedAffineApply( FuncBuilder b, Location loc, AffineMap map, ArrayRef<Value> operands); ``` which creates a single-result unbounded AffineApplyOp. The operands of AffineApplyOp are not themselves results of AffineApplyOp by consrtuction. This represent the simplest possible interface to complement the composition of (mathematical) AffineMap, for the cases when we are interested in applying it to Value*. In this CL the composed AffineMap is not compressed (i.e. there exist operands that are not part of the result). A followup commit will compress to normal form. The single-result unbounded AffineApplyOp abstraction will be used in a followup CL to support the MaterializeVectors pass. PiperOrigin-RevId: 227879021	2019-03-29 14:56:37 -07:00
River Riddle	d2cd083f79	Introduce CRTP TypeBase class to simplify type construction and validation. This impl class currently provides the following: * auto definition of the 'ImplType = StorageClass' * get/getChecked wrappers around TypeUniquer * 'verifyConstructionInvariants' hook - This hook verifies that the arguments passed into get/getChecked are valid to construct a type instance with. With this, all non-generic type uniquing has been moved out of MLIRContext.cpp PiperOrigin-RevId: 227871108	2019-03-29 14:56:22 -07:00
Chris Lattner	7983bbc251	Introduce a simple canonicalization of affine_apply that drops unused dims and symbols. Included with this is some other infra: - Testcases for other canonicalizations that I will implement next. - Some helpers in AffineMap/Expr for doing simple walks without defining whole visitor classes. - A 'replaceDimsAndSymbols' facility that I'll be using to simplify maps and exprs, e.g. to fold one constant into a mapping and to drop/renumber unused dims. - Allow index (and everything else) to work in memref's, as we previously discussed, to make the testcase easier to write. - A "getAffineBinaryExpr" helper to produce a binop when you know the kind as an enum. This line of work will eventually subsume the ComposeAffineApply pass, but it is no where close to that yet :-) PiperOrigin-RevId: 227852951	2019-03-29 14:56:07 -07:00
Lei Zhang	ca88ea6f08	Fix format for empty method definition PiperOrigin-RevId: 227840511	2019-03-29 14:55:52 -07:00
Alex Zinenko	8281151c2a	TableGen standard arithmetic ops Use tablegen to generate definitions of the standard binary arithmetic operations. These operations share a lot of boilerplate that is better off generated by a tool. Using tablegen for standard binary arithmetic operations requires the following modifications. 1. Add a bit field `hasConstantFolder` to the base Op tablegen class; generate the `constantFold` method signature if the bit is set. Differentiate between single-result and zero/multi-result functions that use different signatures. The implementation of the method remains in C++, similarly to canonicalization patterns, since it may be large and non-trivial. 2. Define the `AnyType` record of class `Type` since `BinaryOp` currently provided in op_base.td is supposed to operate on tensors and other tablegen users may rely on this behavior. Note that this drops the inline documentation on the operation classes that was copy-pasted around anyway. Since we don't generate g3doc from tablegen yet, keep LangRef.md as it is. Eventually, the user documentation can move to the tablegen definition file as well. PiperOrigin-RevId: 227820815	2019-03-29 14:55:37 -07:00
Jacques Pienaar	dde5bf234d	Use Operator class in OpDefinitionsGen. Cleanup NFC. PiperOrigin-RevId: 227764826	2019-03-29 14:55:22 -07:00
Nicolas Vasilache	5b87a5ef4b	[MLIR] Drop strict super-vector requirement in MaterializeVector The strict requirement (i.e. at least 2 HW vectors in a super-vector) was a premature optimization to avoid interfering with other vector code potentially introduced via other means. This CL avoids this premature optimization and the spurious errors it causes when super-vector size == HW vector size (which is a possible corner case). This may be revisited in the future. PiperOrigin-RevId: 227763966	2019-03-29 14:54:49 -07:00
River Riddle	54948a4380	Split the standard types from builtin types and move them into separate source files(StandardTypes.cpp/h). After this cl only FunctionType and IndexType are builtin types, but IndexType will likely become a standard type when the ml/cfgfunc merger is done. Mechanical NFC. PiperOrigin-RevId: 227750918	2019-03-29 14:54:07 -07:00
Jacques Pienaar	ae1a6619df	Include both TF and TFL ops.td in legalize patterns. Need to do some ifdef jumps with TableGen to avoid errors due to including the base multiple times. The way TableGen flags repeated includes is by way of checking the include directive this necessitates that the guards are on the includes as well as around the classes/defines. PiperOrigin-RevId: 227692030	2019-03-29 14:53:52 -07:00
Jacques Pienaar	c396c044e6	Match the op via isa instead of string compare. * Match using isa - This limits the rewrite pattern to ops defined in op registry but that is probably better end state (esp. for additional verification). PiperOrigin-RevId: 227598946	2019-03-29 14:53:37 -07:00
River Riddle	8abc06f3d5	Implement initial support for dialect specific types. Dialect specific types are registered similarly to operations, i.e. registerType<...> within the dialect. Unlike operations, there is no notion of a "verbose" type, that is all types must be registered to a dialect. Casting support(isa/dyn_cast/etc.) is implemented by reserving a range of type kinds in the top level Type class as opposed to string comparison like operations. To support derived types a few hooks need to be implemented: In the concrete type class: - static char typeID; * A unique identifier for the type used during registration. In the Dialect: - typeParseHook and typePrintHook must be implemented to provide parser support. The syntax for dialect extended types is as follows: dialect-type: '!' dialect-namespace '<' '"' type-specific-data '"' '>' The 'type-specific-data' is information used to identify different types within the dialect, e.g: - !tf<"variant"> // Tensor Flow Variant Type - !tf<"string"> // Tensor Flow String Type TensorFlow/TensorFlowControl types are now implemented as dialect specific types as a proof of concept. PiperOrigin-RevId: 227580052	2019-03-29 14:53:07 -07:00
Alex Zinenko	0c4ee54198	Merge LowerAffineApplyPass into LowerIfAndForPass, rename to LowerAffinePass This change is mechanical and merges the LowerAffineApplyPass and LowerIfAndForPass into a single LowerAffinePass. It makes a step towards defining an "affine dialect" that would contain all polyhedral-related constructs. The motivation for merging these two passes is based on retiring MLFunctions and, eventually, transforming If and For statements into regular operations. After that happens, LowerAffinePass becomes yet another legalization. PiperOrigin-RevId: 227566113	2019-03-29 14:52:52 -07:00
Jacques Pienaar	3633becf8a	Add builderCall to Type and add constant attr class. With the builder to construct the type on the Type, the appropriate mlir::Type can be constructed where needed. Also add a constant attr class that has the attribute and value as members. PiperOrigin-RevId: 227564789	2019-03-29 14:52:37 -07:00
Alex Zinenko	fa710c17f4	LowerForAndIf: expand affine_apply's inplace Existing implementation was created before ML/CFG unification refactoring and did not concern itself with further lowering to separate concerns. As a result, it emitted `affine_apply` instructions to implement `for` loop bounds and `if` conditions and required a follow-up function pass to lower those `affine_apply` to arithmetic primitives. In the unified function world, LowerForAndIf is mostly a lowering pass with low complexity. As we move towards a dialect for affine operations (including `for` and `if`), it makes sense to lower `for` and `if` conditions directly to arithmetic primitives instead of relying on `affine_apply`. Expose `expandAffineExpr` function in LoweringUtils. Use this function together with `expandAffineMaps` to emit primitives that implement loop and branch conditions directly. Also remove tests that become unnecessary after transforming LowerForAndIf into a function pass. PiperOrigin-RevId: 227563608	2019-03-29 14:52:22 -07:00
Alex Zinenko	d64db86f20	Refactor LowerAffineApply In LoweringUtils, extract out `expandAffineMap`. This function takes an affine map and a list of values the map should be applied to and emits a sequence of arithmetic instructions that implement the affine map. It is independent of the AffineApplyOp and can be used in places where we need to insert an evaluation of an affine map without relying on a (temporary) `affine_apply` instruction. This prepares for a merge between LowerAffineApply and LowerForAndIf passes. Move the `expandAffineApply` function to the LowerAffineApply pass since it is the only place that must be aware of the `affine_apply` instructions. PiperOrigin-RevId: 227563439	2019-03-29 14:52:07 -07:00
Chris Lattner	bbf362b784	Eliminate extfunc/cfgfunc/mlfunc as a concept, and just use 'func' instead. The entire compiler now looks at structural properties of the function (e.g. does it have one block, does it contain an if/for stmt, etc) so the only thing holding up this difference is round tripping through the parser/printer syntax. Removing this shrinks the compile by ~140LOC. This is step 31/n towards merging instructions and statements. The last step is updating the docs, which I will do as a separate patch in order to split it from this mostly mechanical patch. PiperOrigin-RevId: 227540453	2019-03-29 14:51:37 -07:00
River Riddle	ae3f8a79ae	Rename OperationPrefix to Namespace in Dialect. This is important as dialects will soon be able to define more than just operations. Moving forward dialect namespaces cannot contain '.' characters. This cl also standardizes that operation names must begin with the dialect namespace followed by a '.'. PiperOrigin-RevId: 227532193	2019-03-29 14:51:22 -07:00
Chris Lattner	50a356d118	Simplify FunctionPass to only have a runOnFunction hook, instead of having a runOnCFG/MLFunction override locations. Passes that care can handle this filtering if they choose. Also, eliminate one needless difference between CFG/ML functions in the parser. This is step 30/n towards merging instructions and statements. PiperOrigin-RevId: 227515912	2019-03-29 14:50:53 -07:00
Nicolas Vasilache	73f5c9c380	[MLIR] Sketch a simple set of EDSCs to declaratively write MLIR This CL introduces a simple set of Embedded Domain-Specific Components (EDSCs) in MLIR components: 1. a `Type` system of shell classes that closely matches the MLIR type system. These types are subdivided into `Bindable` leaf expressions and non-bindable `Expr` expressions; 2. an `MLIREmitter` class whose purpose is to: a. maintain a map of `Bindable` leaf expressions to concrete SSAValue; b. provide helper functionality to specify bindings of `Bindable` classes to SSAValue while verifying comformable types; c. traverse the `Expr` and emit the MLIR. This is used on a concrete example to implement MemRef load/store with clipping in the LowerVectorTransfer pass. More specifically, the following pseudo-C++ code: ```c++ MLFuncBuilder *b = ...; Location location = ...; Bindable zero, one, expr, size; // EDSL expression auto access = select(expr < zero, zero, select(expr < size, expr, size - one)); auto ssaValue = MLIREmitter(b) .bind(zero, ...) .bind(one, ...) .bind(expr, ...) .bind(size, ...) .emit(location, access); ``` is used to emit all the MLIR for a clipped MemRef access. This simple EDSL can easily be extended to more powerful patterns and should serve as the counterpart to pattern matchers (and could potentially be unified once we get enough experience). In the future, most of this code should be TableGen'd but for now it has concrete valuable uses: make MLIR programmable in a declarative fashion. This CL also adds Stmt, proper supporting free functions and rewrites VectorTransferLowering fully using EDSCs. The code for creating the EDSCs emitting a VectorTransferReadOp as loops with clipped loads is: ```c++ Stmt block = Block({ tmpAlloc = alloc(tmpMemRefType), vectorView = vector_type_cast(tmpAlloc, vectorMemRefType), ForNest(ivs, lbs, ubs, steps, { scalarValue = load(scalarMemRef, accessInfo.clippedScalarAccessExprs), store(scalarValue, tmpAlloc, accessInfo.tmpAccessExprs), }), vectorValue = load(vectorView, zero), tmpDealloc = dealloc(tmpAlloc.getLHS())}); emitter.emitStmt(block); ``` where `accessInfo.clippedScalarAccessExprs)` is created with: ```c++ select(i + ii < zero, zero, select(i + ii < N, i + ii, N - one)); ``` The generated MLIR resembles: ```mlir %1 = dim %0, 0 : memref<?x?x?x?xf32> %2 = dim %0, 1 : memref<?x?x?x?xf32> %3 = dim %0, 2 : memref<?x?x?x?xf32> %4 = dim %0, 3 : memref<?x?x?x?xf32> %5 = alloc() : memref<5x4x3xf32> %6 = vector_type_cast %5 : memref<5x4x3xf32>, memref<1xvector<5x4x3xf32>> for %i4 = 0 to 3 { for %i5 = 0 to 4 { for %i6 = 0 to 5 { %7 = affine_apply #map0(%i0, %i4) %8 = cmpi "slt", %7, %c0 : index %9 = affine_apply #map0(%i0, %i4) %10 = cmpi "slt", %9, %1 : index %11 = affine_apply #map0(%i0, %i4) %12 = affine_apply #map1(%1, %c1) %13 = select %10, %11, %12 : index %14 = select %8, %c0, %13 : index %15 = affine_apply #map0(%i3, %i6) %16 = cmpi "slt", %15, %c0 : index %17 = affine_apply #map0(%i3, %i6) %18 = cmpi "slt", %17, %4 : index %19 = affine_apply #map0(%i3, %i6) %20 = affine_apply #map1(%4, %c1) %21 = select %18, %19, %20 : index %22 = select %16, %c0, %21 : index %23 = load %0[%14, %i1, %i2, %22] : memref<?x?x?x?xf32> store %23, %5[%i6, %i5, %i4] : memref<5x4x3xf32> } } } %24 = load %6[%c0] : memref<1xvector<5x4x3xf32>> dealloc %5 : memref<5x4x3xf32> ``` In particular notice that only 3 out of the 4-d accesses are clipped: this corresponds indeed to the number of dimensions in the super-vector. This CL also addresses the cleanups resulting from the review of the prevous CL and performs some refactoring to simplify the abstraction. PiperOrigin-RevId: 227367414	2019-03-29 14:50:23 -07:00
Chris Lattner	7974889f54	Update and generalize various passes to work on both CFG and ML functions, simplifying them in minor ways. The only significant cleanup here is the constant folding pass. All the other changes are simple and easy, but this is still enough to shrink the compiler by 45LOC. The one pass left to merge is the CSE pass, which will be move involved, so I'm splitting it out to its own patch (which I'll tackle right after this). This is step 28/n towards merging instructions and statements. PiperOrigin-RevId: 227328115	2019-03-29 14:49:52 -07:00
Uday Bondhugula	18fbc3e170	Drop unusued HyperRectangularSet.h/.cpp, given the new design being worked on. - drop these ununsed/incomplete sketches given the new design @albertcohen is working on, and given that FlatAffineConstraints is now stable and fast enough for all the analyses/transforms that depend on it. PiperOrigin-RevId: 227322739	2019-03-29 14:49:03 -07:00
Uday Bondhugula	f12182157e	Introduce PostDominanceInfo, fix properlyDominates() for Instructions - introduce PostDominanceInfo in the right/complete way and use that for post dominance check in store-load forwarding - replace all uses of Analysis/Utils::dominates/properlyDominates with DominanceInfo::dominates/properlyDominates - drop all redundant copies of dominance methods in Analysis/Utils/ - in pipeline-data-transfer, replace dominates call with a much less expensive check; similarly, substitute dominates() in checkMemRefAccessDependence with a simpler check suitable for that context - fix a bug in properlyDominates - improve doc for 'for' instruction 'body' PiperOrigin-RevId: 227320507	2019-03-29 14:48:44 -07:00
Uday Bondhugula	cea9f28a2c	Fix dominates() for block's. - dominates() for blocks was assuming that there was only a single block at the top level whenever there was a hierarchy of blocks (as in the case of 'for'/'if' instructions). - fix the comments as well PiperOrigin-RevId: 227319738	2019-03-29 14:48:28 -07:00
Chris Lattner	ae618428f6	Greatly simplify the ConvertToCFG pass, converting it from a module pass to a function pass, and eliminating the need to copy over code and do interprocedural updates. While here, also improve it to make fewer empty blocks, and rename it to "LowerIfAndFor" since that is what it does. This is a net reduction of ~170 lines of code. As drive-bys, change the splitBlock method to not insert an unconditional branch, since that behavior is annoying for all clients. Also improve the AsmPrinter to not crash when a block is referenced that isn't linked into a function. PiperOrigin-RevId: 227308856	2019-03-29 14:48:13 -07:00
Uday Bondhugula	b9fe6be6d4	Introduce memref store to load forwarding - a simple memref dataflow analysis - the load/store forwarding relies on memref dependence routines as well as SSA/dominance to identify the memref store instance uniquely supplying a value to a memref load, and replaces the result of that load with the value being stored. The memref is also deleted when possible if only stores remain. - add methods for post dominance for MLFunction blocks. - remove duplicated getLoopDepth/getNestingDepth - move getNestingDepth, getMemRefAccess, getNumCommonSurroundingLoops into Analysis/Utils (were earlier static) - add a helper method in FlatAffineConstraints - isRangeOneToOne. PiperOrigin-RevId: 227252907	2019-03-29 14:47:28 -07:00
Chris Lattner	dffc589ad2	Extend InstVisitor and Walker to handle arbitrary CFG functions, expand the Function::walk functionality into f->walkInsts/Ops which allows visiting all instructions, not just ops. Eliminate Function::getBody() and Function::getReturn() helpers which crash in CFG functions, and were only kept around as a bridge. This is step 25/n towards merging instructions and statements. PiperOrigin-RevId: 227243966	2019-03-29 14:46:58 -07:00
Jacques Pienaar	bbe3f4d9f5	Switch rewriters for relu, relu6, placeholder_input, softmax to patterns. Add a constant F32 attribute for use with softmax legalization. PiperOrigin-RevId: 227241643	2019-03-29 14:46:44 -07:00
Chris Lattner	56e2a6cc3b	Merge the verifier logic for all functions into a unified framework, this requires enhancing DominanceInfo to handle the structure of an ML function, which is required anyway. Along the way, this also fixes a const correctness problem with Instruction::getBlock(). This is step 24/n towards merging instructions and statements. PiperOrigin-RevId: 227228900	2019-03-29 14:45:43 -07:00
Jacques Pienaar	554848d617	Match multiple pattern nodes as input to rewrite. * Allow multi input node patterns in the rewrite; * Use number of nodes matched as benefit; * Rewrite relu(add(...)) matching using the new pattern; To allow for undefined ops, do string compare - will address soon! PiperOrigin-RevId: 227225425	2019-03-29 14:45:14 -07:00
Chris Lattner	5b9c3f7cdb	Tidy up references to "basic blocks" that should refer to blocks now. NFC. PiperOrigin-RevId: 227196077	2019-03-29 14:44:59 -07:00
Chris Lattner	456ad6a8e0	Standardize naming of statements -> instructions, revisting the code base to be consistent and moving the using declarations over. Hopefully this is the last truly massive patch in this refactoring. This is step 21/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227178245	2019-03-29 14:44:30 -07:00
Chris Lattner	315a466aed	Rename BasicBlock and StmtBlock to Block, and make a pass cleaning it up. I did not make an effort to rename all of the 'bb' names in the codebase, since they are still correct and any specific missed once can be fixed up on demand. The last major renaming is Statement -> Instruction, which is why Statement and Stmt still appears in various places. This is step 19/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227163082	2019-03-29 14:43:58 -07:00
Jacques Pienaar	2a463c36b1	Add convenience wrapper for operator in tblgen Add convenience wrapper to make it easier to iterate over attributes and operands of operator defined in TableGen file. Use this class in RewriterGen (not used in the op generator yet, will do shortly). Change the RewriterGen to pass the bound arguments explicitly, this is in preparation for multi-op matching. PiperOrigin-RevId: 227156748	2019-03-29 14:43:43 -07:00
Chris Lattner	69d9e990fa	Eliminate the using decls for MLFunction and CFGFunction standardizing on Function. This is step 18/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227139399	2019-03-29 14:43:13 -07:00
Jacques Pienaar	f845bc4542	Fix incorrect names due to merging of tblgen tools. PiperOrigin-RevId: 227131485	2019-03-29 14:42:57 -07:00
Chris Lattner	d798f9bad5	Rename BBArgument -> BlockArgument, Op::getOperation -> Op::getInst(), StmtResult -> InstResult, StmtOperand -> InstOperand, and remove the old names. This is step 17/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227121537	2019-03-29 14:42:40 -07:00
Chris Lattner	5187cfcf03	Merge Operation into OperationInst and standardize nomenclature around OperationInst. This is a big mechanical patch. This is step 16/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227093712	2019-03-29 14:42:23 -07:00
Chris Lattner	471c976413	Rework inherentance hierarchy: Operation now derives from Statement, and OperationInst derives from it. This allows eliminating some forwarding functions, other complex code handling multiple paths, and the 'isStatement' bit tracked by Operation. This is the last patch I think I can make before the big mechanical change merging Operation into OperationInst, coming next. This is step 15/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227077411	2019-03-29 14:41:49 -07:00
Feng Liu	9b20a4ccdf	add a method to get FloatAttr value as double Sometimes we have to get the raw value of the FloatAttr to invoke APIs from non-MLIR libraries (i.e. in the tpu_ops.inc and convert_tensor.cc files). Using `FloatAttr::getValue().convertToFloat()` and `FloatAttr::getValue().convertToDouble()` is not safe because interally they checke the semantics of the APFloat in the attribute, and the semantics is not always specified (the default value is f64 then convertToFloat will fail) or inferred incorrectly (for example, using 1.0 instead of 1.f for IEEEFloat). Calling these convert methods without knowing the semantics can usually crash the compiler. This new method converts the value of a FloatAttr to double even if it loses precision. Currently this method can be used to read in f32 data from arrays. PiperOrigin-RevId: 227076616	2019-03-29 14:41:34 -07:00
Chris Lattner	bd24a131d3	Fix an ASAN detected bug introduced by cr/227067644. While MLFunctions always have one block, CFG Functions don't necessarily have one (e.g. when they are being first constructed by the parser). PiperOrigin-RevId: 227075636	2019-03-29 14:41:19 -07:00
Chris Lattner	1b430f1d32	Delicately re-layer Operation, Statement, and OperationStmt, reworking #includes so Statements.h includes Operation.h but nothing else does. This is in preparation to eliminate the Operation class and the complexity it brings with it. I split this patch off because it is just moving stuff around, the next patch will be more complex. This is step 14/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227071777	2019-03-29 14:41:05 -07:00
Chris Lattner	4fbcd1ac52	Minor renamings: Trim the "Stmt" prefix off StmtSuccessorIterator/StmtSuccessorIterator, and rename and move the CFGFunctionViewGraph pass to ViewFunctionGraph. This is step 13/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227069438	2019-03-29 14:40:51 -07:00
Chris Lattner	4c05f8cac6	Merge CFGFuncBuilder/MLFuncBuilder/FuncBuilder together into a single new FuncBuilder class. Also rename SSAValue.cpp to Value.cpp This is step 12/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227067644	2019-03-29 14:40:22 -07:00

... 3 4 5 6 7 ...

854 Commits