llvm-project

Commit Graph

Author	SHA1	Message	Date
Aart Bik	67c019ddac	[VectorOps] remove redundant returns from invalid ops test PiperOrigin-RevId: 286640660	2019-12-20 14:27:42 -08:00
Uday Bondhugula	e5691c512f	fix isValidDim for block arg case - a block argument associated with an arbitrary op can't be a valid dimensional identifier; it has to be the block argument of either a function op or an affine.for. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#331 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/331 from bondhugula:valid_dim 3273b4fcbaa31fb7b6671d93c9e42a6b2a6a4e4c PiperOrigin-RevId: 286593693	2019-12-20 09:44:03 -08:00
Christian Sigg	42d46b4efa	Add gpu.shuffle op. This will allow us to lower most of gpu.all_reduce (when all_reduce doesn't exist in the target dialect) within the GPU dialect, and only do target-specific lowering for the shuffle op. PiperOrigin-RevId: 286548256	2019-12-20 02:52:52 -08:00
Frank Laub	7811ad3c2b	Allow dialect to create friendly names for region arguments This is the block argument equivalent of the existing `getAsmResultNames` hook. Closes tensorflow/mlir#329 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/329 from plaidml:flaub-region-arg-names fc7876f2d1335024e441083cd25263fd6247eb7d PiperOrigin-RevId: 286523299	2019-12-19 22:16:07 -08:00
Jacques Pienaar	b6d54a1ba3	Unique trait list during ODS Operator trait construction Concatting lists in TableGen is easy, creating unique lists less so. There is no reason for duplicated op traits so we could throw an error instead but duplicates could occur due to concatting different list of traits in ODS (e.g., for convenience reasons), so just dedup them during Operator trait construction instead. PiperOrigin-RevId: 286488423	2019-12-19 16:44:56 -08:00
Andy Davis	8020ad3e39	[VectorOps] Update vector transfer_read/write ops to operate on memrefs with vector element type. Update vector transfer_read/write ops to operate on memrefs with vector element type. This handles cases where the memref vector element type represents the minimal memory transfer unit (or a multiple of the minimal memory transfer unit). PiperOrigin-RevId: 286482115	2019-12-19 16:05:32 -08:00
Andy Davis	1d798b1d27	[VectorOps] Add vector ReshapeOp to the VectorOps dialect. Adds vector ReshapeOp to the VectorOps dialect. An aggregate vector reshape operation, which aggregates multiple hardware vectors, can enable optimizations during decomposition (e.g. loading one input hardware vector and performing multiple rotate and scatter store operations to the vector output). PiperOrigin-RevId: 286440658	2019-12-19 12:27:59 -08:00
Aart Bik	15f800f4bc	[VectorOps] minor cleanup: vector dialect "subscripts" are i32 Introduces some centralized methods to move towards consistent use of i32 as vector subscripts. Note: sizes/strides/offsets attributes are still i64 PiperOrigin-RevId: 286434133	2019-12-19 11:51:08 -08:00
Nicolas Vasilache	50f9be6d2d	Add runtime utils support for print_memref_i8 This CL adds print_memref_i8 along with a unit test. PiperOrigin-RevId: 286299237	2019-12-18 17:32:35 -08:00
Aart Bik	a1e84db66e	[VectorOps] Replace iostream with stdio in support lib for vector.print PiperOrigin-RevId: 286252829	2019-12-18 13:24:30 -08:00
Marcel Koester	6054610bbe	Added LLVM ops and lowering phases from standard dialect for FAbs, FCeil, Cos, FNeg, CopySign. Added test cases for the newly added LLVM operations and lowering features. Closes tensorflow/mlir#300 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/300 from dfki-jugr:std_to_llvm da6168bbc1a369ae2e99ad3881fdddd82f075dd4 PiperOrigin-RevId: 286231169	2019-12-18 11:42:43 -08:00
Aart Bik	d9b500d3bb	[VectorOps] Add vector.print definition, with lowering support Examples: vector.print %f : f32 vector.print %x : vector<4xf32> vector.print %y : vector<3x4xf32> vector.print %z : vector<2x3x4xf32> LLVM lowering replaces these with fully unrolled calls into a small runtime support library that provides some basic printing operations (single value, opening closing bracket, comma, newline). PiperOrigin-RevId: 286230325	2019-12-18 11:31:34 -08:00
River Riddle	29807ff5e4	Add support for providing a default implementation for an interface method. This enables providing a default implementation of an interface method. This method is defined on the Trait that is attached to the operation, and thus has all of the same constraints and properties as any other interface method. This allows for interface authors to provide a conservative default implementation for certain methods, without requiring that all users explicitly define it. The default implementation can be specified via the argument directly after the interface method body: StaticInterfaceMethod< /*desc=*/"Returns whether two array of types are compatible result types for an op.", /*retTy=*/"bool", /*methodName=*/"isCompatibleReturnTypes", /*args=*/(ins "ArrayRef<Type>":$lhs, "ArrayRef<Type>":$rhs), /*methodBody=*/[{ return ConcreteOp::isCompatibleReturnTypes(lhs, rhs); }], /*defaultImplementation=*/[{ /// Returns whether two arrays are equal as strongest check for /// compatibility by default. return lhs == rhs; }] PiperOrigin-RevId: 286226054	2019-12-18 11:09:11 -08:00
Uday Bondhugula	47034c4bc5	Introduce prefetch op: affine -> std -> llvm intrinsic Introduce affine.prefetch: op to prefetch using a multi-dimensional subscript on a memref; similar to affine.load but has no effect on semantics, but only on performance. Provide lowering through std.prefetch, llvm.prefetch and map to llvm's prefetch intrinsic. All attributes reflected through the lowering - locality hint, rw, and instr/data cache. affine.prefetch %0[%i, %j + 5], false, 3, true : memref<400x400xi32> Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#225 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/225 from bondhugula:prefetch 4c3b4e93bc64d9a5719504e6d6e1657818a2ead0 PiperOrigin-RevId: 286212997	2019-12-18 10:00:04 -08:00
Alex Zinenko	40ef46fba4	Harden the requirements to memory attribution types in gpu.func When memory attributions are present in `gpu.func`, require that they are of memref type and live in memoryspaces 3 and 5 for workgroup and private memory attributions, respectively. Adapt the conversion from the GPU dialect to the NVVM dialect to drop the private memory space from attributions as NVVM is able to model them as local `llvm.alloca`s in the default memory space. PiperOrigin-RevId: 286161763	2019-12-18 03:38:55 -08:00
Andy Davis	6fa3bd5b3e	Add pattern rewrite which splits a vector TransferWriteOp into slices according to the unrolling/slicing scheme of its InsertSlicesOp operand. PiperOrigin-RevId: 286042578	2019-12-17 13:17:10 -08:00
Mahesh Ravishankar	319cca3bbe	Add missing virtual inliner interface method in SPIR-V dialect. The inline interface uses two methods to check legality of inlining: 1) Can a region be inlined into another. 2) Can an operation be inlined into another. Setting the former to true allows the inliner to use the second for legality checks. Add this method to the SPIR-V dialect inlining interface. PiperOrigin-RevId: 286041734	2019-12-17 13:06:05 -08:00
Andy Davis	d1fb285b32	Add pattern rewrite to forward vector tuple elements to their users. User(TupleGetOp(ExtractSlicesOp(InsertSlicesOp(TupleOp(Producer))) -> User(Producer) PiperOrigin-RevId: 286020249	2019-12-17 11:21:45 -08:00
Andy Davis	038ad1d856	Add pattern rewrite which splits a vector TransferReadOp into slices according to the unrolling/slicing scheme of its ExtractSlicesOp user. PiperOrigin-RevId: 285975613	2019-12-17 07:29:06 -08:00
Andy Davis	4e825c59be	Update vector op unrolling transformation to generate ExtractSlicesOp and InsertSlicesOp (instead of less structured chain of StridedSliceOps and InsertStridedSliceOps). PiperOrigin-RevId: 285968051	2019-12-17 06:27:01 -08:00
Mahesh Ravishankar	80ec474a65	Add atomic operations to SPIR-V dialect. Some changes to the dialect generation script to allow specification of different base class to derive from in ODS. PiperOrigin-RevId: 285859230	2019-12-16 15:05:51 -08:00
Lei Zhang	659150b570	[spirv] Re-enable nested loop (de)serialization test PiperOrigin-RevId: 285849308	2019-12-16 14:21:52 -08:00
Nicolas Vasilache	3c179b6575	Add edsc::ops for pointwise, conv and dilated_conv This CL adds more Linalg EDSC ops and tests to support building pointwise operations along with conv and dilated_conv. This also fixes a bug in the existing linalg_matmul EDSC and beefs up the test. The current set of ops is already enough to build an interesting, albeit simple, model used internally. PiperOrigin-RevId: 285838012	2019-12-16 13:42:38 -08:00
Andy Davis	11e92875f0	Add InsertSlicesOp to the VectorOps dialect. PiperOrigin-RevId: 285830394	2019-12-16 12:56:38 -08:00
Alex Zinenko	6273fa0c6a	Plug gpu.func into the GPU lowering pipelines This updates the lowering pipelines from the GPU dialect to lower-level dialects (NVVM, SPIRV) to use the recently introduced gpu.func operation instead of a standard function annotated with an attribute. In particular, the kernel outlining is updated to produce gpu.func instead of std.func and the individual conversions are updated to consume gpu.funcs and disallow standard funcs after legalization, if necessary. The attribute "gpu.kernel" is preserved in the generic syntax, but can also be used with the custom syntax on gpu.funcs. The special kind of function for GPU allows one to use additional features such as memory attribution. PiperOrigin-RevId: 285822272	2019-12-16 12:12:48 -08:00
Jose Ignacio Gomez	3ae56c4135	[Linalg] Expose subview promotion as a declarative pattern This PR targets issue tensorflow/mlir#295. It exposes the already existing subview promotion pass as a declarative pattern Change-Id: If901ebef9fb53fcd0b12ecc536f6b174ce320b92 Closes tensorflow/mlir#315 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/315 from tetuante:issue295 8e5f268b6d85f31015c33505329dbd7a4db97ac5 PiperOrigin-RevId: 285801463	2019-12-16 10:50:45 -08:00
Aart Bik	cd5dab8ad7	[VectorOps] Add [insert/extract]element definition together with lowering to LLVM Similar to insert/extract vector instructions but (1) work on 1-D vectors only (2) allow for a dynamic index %c3 = constant 3 : index %0 = vector.insertelement %arg0, %arg1[%c3 : index] : vector<4xf32> %1 = vector.extractelement %arg0[%c3 : index] : vector<4xf32> PiperOrigin-RevId: 285792205	2019-12-16 09:52:46 -08:00
Andy Davis	73ec37c8bb	Adds ExtractSlicesOp to the VectorOps dialect. ExtractSlicesOp extracts slices of its vector operand and with a specified tiling scheme. This operation centralizes the tiling scheme around a single op, which simplifies vector op unrolling and subsequent pattern rewrite transformations. PiperOrigin-RevId: 285761129	2019-12-16 06:39:09 -08:00
Alex Zinenko	0684aa9a8b	Make memref promotion during std->LLVM lowering the default calling convention During the conversion from the standard dialect to the LLVM dialect, memref-typed arguments are promoted from registers to memory and passed into functions by pointer. This had been introduced into the lowering to work around the absence of calling convention modeling in MLIR to enable better interoperability with LLVM IR generated from C, and has been exercised for several months. Make this promotion the default calling convention when converting to the LLVM dialect. This adds the documentation, simplifies the code and makes the conversion consistent across function operations and function types used in other places, e.g. in high-order functions or attributes, which would not follow the same rule previously. PiperOrigin-RevId: 285751280	2019-12-16 05:17:14 -08:00
Nicolas Vasilache	200beb8446	Apply a level of sugaring to the linalg.generic EDSC - NFC Make the declarative C++ builder API simpler to use so we can start chaining these ops together. PiperOrigin-RevId: 285496266	2019-12-13 17:39:46 -08:00
Jing Pu	27ae92516b	Skip generating C++ for "DeclareOpInterfaceMethods" in op interface gen. This is needed for calling the generator on a .td file that contains both OpInterface definitions and op definitions with DeclareOpInterfaceMethods<...> Traits. PiperOrigin-RevId: 285465784	2019-12-13 17:08:33 -08:00
Nicolas Vasilache	7923abd357	Add a layer of EDSC for linalg.GenericOp This will be evolved into a simple programming model for custom ops and custom layers in followup CLs. This CL also deletes the obsolete tablegen's reference-impl.td that was using EDSCs. PiperOrigin-RevId: 285459545	2019-12-13 16:57:57 -08:00
River Riddle	b030e4a4ec	Try to fold operations in DialectConversion when trying to legalize. This change allows for DialectConversion to attempt folding as a mechanism to legalize illegal operations. This also expands folding support in OpBuilder::createOrFold to generate new constants when folding, and also enables it to work in the context of a PatternRewriter. PiperOrigin-RevId: 285448440	2019-12-13 16:47:26 -08:00
Jacques Pienaar	a50cb184a0	Fix logic on when to emit collective type but separate arg builder Got the comment right but the code wrong :/ PiperOrigin-RevId: 285270561	2019-12-12 14:23:14 -08:00
Aart Bik	1c81adf362	[VectorOps] Add lowering of vector.shuffle to LLVM IR For example, a shuffle %1 = vector.shuffle %arg0, %arg1 [0 : i32, 1 : i32] : vector<2xf32>, vector<2xf32> becomes a direct LLVM shuffle %0 = llvm.shufflevector %arg0, %arg1 [0 : i32, 1 : i32] : !llvm<"<2 x float>">, !llvm<"<2 x float>"> but %1 = vector.shuffle %a, %b[1 : i32, 0 : i32, 2 : i32] : vector<1x4xf32>, vector<2x4xf32> becomes the more elaborate (note the index permutation that drives argument selection for the extract operations) %0 = llvm.mlir.undef : !llvm<"[3 x <4 x float>]"> %1 = llvm.extractvalue %arg1[0] : !llvm<"[2 x <4 x float>]"> %2 = llvm.insertvalue %1, %0[0] : !llvm<"[3 x <4 x float>]"> %3 = llvm.extractvalue %arg0[0] : !llvm<"[1 x <4 x float>]"> %4 = llvm.insertvalue %3, %2[1] : !llvm<"[3 x <4 x float>]"> %5 = llvm.extractvalue %arg1[1] : !llvm<"[2 x <4 x float>]"> %6 = llvm.insertvalue %5, %4[2] : !llvm<"[3 x <4 x float>]"> PiperOrigin-RevId: 285268164	2019-12-12 14:11:56 -08:00
Jacques Pienaar	41a73ddce8	Add type inference variant for separate params builder generated Add variant that does invoke infer type op interface where defined. Also add entry function that invokes that different separate argument builders for wrapped, unwrapped and inference variant. PiperOrigin-RevId: 285220709	2019-12-12 10:36:14 -08:00
Nicolas Vasilache	782ae29678	Retire !linalg.buffer type - NFC This type is not used anymore now that Linalg view and subview have graduated to std and that alignment is supported on alloc. PiperOrigin-RevId: 285213424	2019-12-12 10:03:57 -08:00
Alexander Belyaev	1b579d998a	[Linalg] Add test for fusion of GenericOp with IndexedGenericOp. PiperOrigin-RevId: 285211797	2019-12-12 09:56:45 -08:00
Ehsan Toosi	f7bffad5a7	Added lowering of `std.tanh` to llvm function call to `tanh` and `tanhf`. Closes tensorflow/mlir#312 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/312 from dfki-ehna:tanh 9e89b072ff91ff390ad739501745114feb3ac856 PiperOrigin-RevId: 285205674	2019-12-12 09:25:15 -08:00
Nicolas Vasilache	95b5a4fd67	Move cpu runner utils templates to .h This allows reusing the implementation in various places by just including and permits more easily writing test functions without explicit template instantiations. This also modifies UnrankedMemRefType to take a template type parameter since it cannot be type agnostic atm. PiperOrigin-RevId: 285187711	2019-12-12 07:33:09 -08:00
Christian Sigg	9b85582682	Automated rollback of commit `f68ac464d8` PiperOrigin-RevId: 285162061	2019-12-12 03:48:38 -08:00
Christian Sigg	f68ac464d8	Switch from shfl.bfly to shfl.down. Both work for the current use case, but the latter allows implementing prefix sums and is a little easier to understand for partial warps. PiperOrigin-RevId: 285145287	2019-12-12 01:28:01 -08:00
Nicolas Vasilache	9dfa84a269	Add std.log* and llvm.intr.log* that correspond to the LLVMIR intrinsics PiperOrigin-RevId: 285073483	2019-12-11 15:25:34 -08:00
Mahesh Ravishankar	b909299d20	Add missing CMake dependency for MLIRTestIR. PiperOrigin-RevId: 285039153	2019-12-11 12:44:42 -08:00
Denis Khalikov	d968f9696d	[spirv] Add lowering for std.fdiv, std.frem, std.fsub Closes tensorflow/mlir#313 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/313 from denis0x0D:sandbox/lowering_std_farith 41715070a74d13bfa9401957478978c1bb8006c0 PiperOrigin-RevId: 285023586	2019-12-11 11:17:35 -08:00
Nicolas Vasilache	508d4e672e	Continue refactoring StructuredOps utilities This CL adds more common information to StructuredOpsUtils.h The n_view attribute is retired in favor of args_in + args_out but the CL is otherwise NFC. PiperOrigin-RevId: 285000621	2019-12-11 09:27:34 -08:00
Alexander Belyaev	4b0198acb5	Roll-forward initial liveness analysis including test cases. Fix the usage of the map size when appending to the map with []. PiperOrigin-RevId: 284985916	2019-12-11 08:13:43 -08:00
Alexander Belyaev	984fdde269	Automated rollback of commit `98fbf41044` PiperOrigin-RevId: 284979684	2019-12-11 07:17:21 -08:00
Alexander Belyaev	bae8a7a724	[Linalg] Add tiling for IndexedGenericOp with a region. PiperOrigin-RevId: 284949355	2019-12-11 02:56:40 -08:00
Marcel Koester	98fbf41044	Add initial liveness analysis including test cases. Closes tensorflow/mlir#255 PiperOrigin-RevId: 284935454	2019-12-11 01:03:25 -08:00
Aart Bik	9826fe5c9f	[VectorOps] Add lowering of vector.insert to LLVM IR For example, an insert %0 = vector.insert %arg0, %arg1[3 : i32] : f32 into vector<4xf32> becomes %0 = llvm.mlir.constant(3 : i32) : !llvm.i32 %1 = llvm.insertelement %arg0, %arg1[%0 : !llvm.i32] : !llvm<"<4 x float>"> A more elaborate example, inserting an element in a higher dimension vector %0 = vector.insert %arg0, %arg1[3 : i32, 7 : i32, 15 : i32] : f32 into vector<4x8x16xf32> becomes %0 = llvm.extractvalue %arg1[3 : i32, 7 : i32] : !llvm<"[4 x [8 x <16 x float>]]"> %1 = llvm.mlir.constant(15 : i32) : !llvm.i32 %2 = llvm.insertelement %arg0, %0[%1 : !llvm.i32] : !llvm<"<16 x float>"> %3 = llvm.insertvalue %2, %arg1[3 : i32, 7 : i32] : !llvm<"[4 x [8 x <16 x float>]]"> PiperOrigin-RevId: 284882443	2019-12-10 17:12:49 -08:00
Andy Davis	4d8ba88610	Add VectorOp transform pattern which splits vector TransferReadOps to target vector unroll size. PiperOrigin-RevId: 284880592	2019-12-10 17:02:51 -08:00
Uday Bondhugula	36a415bcc5	More affine expr simplifications for floordiv and mod Add one more simplification for floordiv and mod affine expressions. Examples: (2*d0 + 1) floordiv 2 is simplified to d0, (8*d0 + 4*d1 + d2) floordiv 4 simplified to 2*d0 + d1 + d2 floordiv 4, etc. Similarly, (4*d1 + 1) mod 2 is simplified to 1, (2*d0 + 8*d1) mod 8 simplified to 2*d0 mod 8. Change getLargestKnownDivisor to return int64_t to be consistent and to avoid casting at call sites (since the return value is used in expressions of int64_t/index type). Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#202 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/202 from bondhugula:affine b13fcb2f1c00a39ca5434613a02408e085a80e77 PiperOrigin-RevId: 284866710	2019-12-10 16:00:53 -08:00
Nicolas Vasilache	995048d7b7	Fold TestLinalgTilePermutePatterns into TestLinalgTransformPatterns - NFC Centralize all patterns that test Linalg transforms in a single pass. PiperOrigin-RevId: 284835938	2019-12-10 13:26:15 -08:00
Jose Ignacio Gomez	b19fed5415	[Linalg] Add a Linalg iterator permutation transformation This patch closes issue tensorflow/mlir#272 We add a standalone iterator permutation transformation to Linalg. This transformation composes a permutation map with the maps in the "indexing_maps" attribute. It also permutes "iterator_types" accordingly. Change-Id: I7c1e693b8203aeecc595a7c012e738ca1100c857 Closes tensorflow/mlir#307 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/307 from tetuante:issue272 f7908d58792f4111119721885e247045104f1131 PiperOrigin-RevId: 284824102	2019-12-10 12:25:43 -08:00
Nicolas Vasilache	ad38e49806	Uniformize Vector transforms as patterns on the model of Linalg - NFC This reorganizes the vector transformations to be more easily testable as patterns and more easily composable into fused passes in the future. PiperOrigin-RevId: 284817474	2019-12-10 11:54:33 -08:00
Aart Bik	1fe65688d4	[VectorOps] Add a ShuffleOp to the VectorOps dialect For example %0 = vector.shuffle %x, %y [3 : i32, 2 : i32, 1 : i32, 0 : i32] : vector<2xf32>, vector<2xf32> yields a vector<4xf32> result with a permutation of the elements of %x and %y PiperOrigin-RevId: 284657191	2019-12-09 16:15:41 -08:00
Aart Bik	0e963b9c42	[VectorOps] Fix off-by-one error in insert/extract validation PiperOrigin-RevId: 284652653	2019-12-09 15:54:23 -08:00
Denis Khalikov	34265dad65	[spirv] Add CompositeConstruct operation. Closes tensorflow/mlir#308 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/308 from denis0x0D:sandbox/composite_construct 9ef7180f77f9374bcd05afc4f9e6c1d2d72d02b7 PiperOrigin-RevId: 284613617	2019-12-09 12:43:53 -08:00
Lei Zhang	2c7e8ed7c6	[spirv] Add spv.IAdd, spv.ISub, and spv.IMul folders The patterns to be folded away can be commonly generated during lowering to SPIR-V. PiperOrigin-RevId: 284604855	2019-12-09 11:59:10 -08:00
Jacques Pienaar	89cef725f4	ODS: Generate named accessors for raw attributes Currently named accessors are generated for attributes returning a consumer friendly type. But sometimes the attributes are used while transforming an existing op and then the returned type has to be converted back into an attribute or the raw `getAttr` needs to be used. Generate raw named accessor for attributes to reference the raw attributes without having to use the string interface for better compile time verification. This allows calling `blahAttr()` instead of `getAttr("blah")`. Raw here refers to returning the underlying storage attribute. PiperOrigin-RevId: 284583426	2019-12-09 10:29:34 -08:00
Mahesh Ravishankar	4a62019eb8	Add lowering for module with gpu.kernel_module attribute. The existing GPU to SPIR-V lowering created a spv.module for every function with gpu.kernel attribute. A better approach is to lower the module that the function lives in (which has the attribute gpu.kernel_module) to a spv.module operation. This better captures the host-device separation modeled by GPU dialect and simplifies the lowering as well. PiperOrigin-RevId: 284574688	2019-12-09 09:52:21 -08:00
Andy Davis	312ccb1c0f	Unify vector op unrolling transformation. Unifies vector op unrolling transformation, by using the same unrolling implementation for contraction and elementwise operations. Removes fakefork/join operations which are no longer needed now that we have the InsertStridedSlice operation. PiperOrigin-RevId: 284570784	2019-12-09 09:35:15 -08:00
Kazuaki Ishizaki	ae05cf27c6	Minor spelling tweaks Closes tensorflow/mlir#304 PiperOrigin-RevId: 284568358	2019-12-09 09:23:48 -08:00
Nicolas Vasilache	91c0074624	[StructuredOps][Linalg] Add a primitive pattern to rewrite the linalg.generic form of matmul to vector form. This CL uses the newly expanded matcher support to easily detect when a linalg.generic has a multiply-accumulate body. A linalg.generic with such a body is rewritten as a vector contraction. This CL additionally limits the rewrite to the case of matrix multiplication on contiguous and statically shaped memrefs for now. Before expanding further, we should harden the infrastructure for expressing custom ops with the structured ops abstraction. PiperOrigin-RevId: 284566659	2019-12-09 09:14:39 -08:00
Jacques Pienaar	70aeb4566e	Add RegionRange for when need to abstract over different region iteration Follows ValueRange in representing a generic abstraction over the different ways to represent a range of Regions. This wrapper is not as ValueRange and only considers the current cases of interest: MutableArrayRef<Region> and ArrayRef<std::unique_ptr<Region>> as occurs during op construction vs op region querying. Note: ArrayRef<std::unique_ptr<Region>> allows for unset regions, so this range returns a pointer to a Region instead of a Region. PiperOrigin-RevId: 284563229	2019-12-09 08:57:56 -08:00
Nicolas Vasilache	7b19bd5411	Post-submit cleanups in RecursiveMatchers This CL addresses leftover cleanups and adds a test mixing RecursiveMatchers and m_Constant that captures properly. PiperOrigin-RevId: 284551567	2019-12-09 07:47:35 -08:00
Nicolas Vasilache	ade58a268c	Add a layer of recursive matchers that compose. This CL adds support for building matchers recursively. The following matchers are provided: 1. `m_any()` can match any value 2. `m_val(Value *)` binds to a value and must match it 3. `RecursivePatternMatcher<OpType, Matchers...>` n-arity pattern that matches `OpType` and whose operands must be matched exactly by `Matchers...`. This allows building expression templates for patterns, declaratively, in a very natural fashion. For example pattern `p9` defined as follows: ``` auto mul_of_muladd = m_Op<MulFOp>(m_Op<MulFOp>(), m_Op<AddFOp>()); auto mul_of_anyadd = m_Op<MulFOp>(m_any(), m_Op<AddFOp>()); auto p9 = m_Op<MulFOp>(m_Op<MulFOp>( mul_of_muladd, m_Op<MulFOp>()), m_Op<MulFOp>(mul_of_anyadd, mul_of_anyadd)); ``` Successfully matches `%6` in: ``` %0 = addf %a, %b: f32 %1 = addf %a, %c: f32 // matched %2 = addf %c, %b: f32 %3 = mulf %a, %2: f32 // matched %4 = mulf %3, %1: f32 // matched %5 = mulf %4, %4: f32 // matched %6 = mulf %5, %5: f32 // matched ``` Note that 0-ary matchers can be used as leaves in place of n-ary matchers. This alleviates from passing explicit `m_any()` leaves. In the future, we may add extra patterns to specify that operands may be matched in any order. PiperOrigin-RevId: 284469446	2019-12-08 18:09:40 -08:00
River Riddle	d6ee6a0310	Update the builder API to take ValueRange instead of ArrayRef<Value *> This allows for users to provide operand_range and result_range in builder.create<> calls, instead of requiring an explicit copy into a separate data structure like SmallVector/std::vector. PiperOrigin-RevId: 284360710	2019-12-07 10:35:41 -08:00
River Riddle	8904e91035	Add a flag to the IRPrinter instrumentation to only print after a pass if there is a change to the IR. This adds an additional filtering mode for printing after a pass that checks to see if the pass actually changed the IR before printing it. This "change" detection is implemented using a SHA1 hash of the current operation and its children. PiperOrigin-RevId: 284291089	2019-12-06 17:05:05 -08:00
Jacques Pienaar	4add9edd72	Change inferReturnTypes to return LogicalResult and values Previously the error case was using a sentinel in the error case which was bad. Also make the one `build` invoke the other `build` to reuse verification there. And follow up on suggestion to use formatv which I missed during previous review. PiperOrigin-RevId: 284265762	2019-12-06 14:42:45 -08:00
Aart Bik	d37f27251f	[VecOps] Rename vector.[insert|extract]element to just vector.[insert|extract] Since these operations lower to [insert|extract][element|value] at LLVM dialect level, neither element nor value would correctly reflect the meaning. PiperOrigin-RevId: 284240727	2019-12-06 12:39:25 -08:00
Aart Bik	b36aaeafb1	[VectorOps] Add lowering of vector.broadcast to LLVM IR For example, a scalar broadcast %0 = vector.broadcast %x : f32 to vector<2xf32> return %0 : vector<2xf32> which expands scalar x into vector [x,x] by lowering to the following LLVM IR dialect to implement the duplication over the leading dimension. %0 = llvm.mlir.undef : !llvm<"<2 x float>"> %1 = llvm.mlir.constant(0 : index) : !llvm.i64 %2 = llvm.insertelement %x, %0[%1 : !llvm.i64] : !llvm<"<2 x float>"> %3 = llvm.shufflevector %2, %0 [0 : i32, 0 : i32] : !llvm<"<2 x float>">, !llvm<"<2 x float>"> return %3 : vector<2xf32> In the trailing dimensions, the operand is simply "passed through", unless a more elaborate "stretch" is required. For example %0 = vector.broadcast %arg0 : vector<1xf32> to vector<4xf32> return %0 : vector<4xf32> becomes %0 = llvm.mlir.undef : !llvm<"<4 x float>"> %1 = llvm.mlir.constant(0 : index) : !llvm.i64 %2 = llvm.extractelement %arg0[%1 : !llvm.i64] : !llvm<"<1 x float>"> %3 = llvm.mlir.constant(0 : index) : !llvm.i64 %4 = llvm.insertelement %2, %0[%3 : !llvm.i64] : !llvm<"<4 x float>"> %5 = llvm.shufflevector %4, %0 [0 : i32, 0 : i32, 0 : i32, 0 : i32] : !llvm<"<4 x float>">, !llvm<"<4 x float>"> llvm.return %5 : !llvm<"<4 x float>"> PiperOrigin-RevId: 284219926	2019-12-06 11:02:29 -08:00
Jacques Pienaar	398f04aa49	Generate builder for ops that use InferTypeOpInterface trait in ODS For ops with infer type op interface defined, generate version that calls the inference method on build. This is intermediate step to removing special casing of SameOperandsAndResultType & FirstAttrDerivedResultType. After that would be generating the inference code, with the initial focus on shaped container types. In between I plan to refactor these a bit to reuse generated paths. The intention would not be to add the type inference trait in multiple places, but rather to take advantage of the current modelling in ODS where possible to emit it instead. Switch the `inferReturnTypes` method to be static. Skipping ops with regions here as I don't like the Region vs unique_ptr<Region> difference at the moment, and I want the infer return type trait to be useful for verification too. So instead, just skip it for now to avoid churn. PiperOrigin-RevId: 284217913	2019-12-06 10:53:06 -08:00
Alex Zinenko	e216a72ab8	Add conversions of GPU func with memory attributions to LLVM/NVVM GPU functions use memory attributions, a combination of Op attributes and region arguments, to specify function-wide buffers placed in workgroup or private memory spaces. Introduce a lowering pattern for GPU functions to be converted to LLVM functions taking into account memory attributions. Workgroup attributions get transformed into module-level globals with unique names derived from function names. Private attributions get converted into llvm.allocas inside the function body. In both cases, we inject at the beginning of the function the IR that obtains the raw pointer to the data and populates a MemRef descriptor based on the MemRef type of buffer, making attributions compose with the rest of the MemRef lowering and transparent for use with std.load and std.store. While using raw pointers instead of descriptors might have been more efficient, it is better implemented as a canonicalization or a separate transformation so that non-attribution memrefs could also benefit from it. PiperOrigin-RevId: 284208396	2019-12-06 10:08:43 -08:00
River Riddle	79047e1ab5	Use regex to fix failure when stats are disabled. It would be nice if we could detect if stats were enabled or not and use 'Requires', but this isn't possible to do at configure time. Fixes tensorflow/mlir#296 PiperOrigin-RevId: 284200271	2019-12-06 09:29:14 -08:00
Andy Davis	41f8e105fa	Unroll vector masks along with their associated vector arguments. Updates vector ContractionOp to use proper vector masks (produced by CreateMaskOp/ConstantMaskOp). Leverages the following canonicalizations in unrolling unit test: CreateMaskOp -> ConstantMaskOp, StridedSliceOp(ConstantMaskOp) -> ConstantMaskOp Removes IndexTupleOp (no longer needed now that we have vector mask ops). Updates all unit tests. PiperOrigin-RevId: 284182168	2019-12-06 07:37:28 -08:00
Uday Bondhugula	3ade6a7d15	DimOp folding for alloc/view dynamic dimensions Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#253 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/253 from bondhugula:dimop a4b464f24ae63fd259114558d87e11b8ee4dae86 PiperOrigin-RevId: 284169689	2019-12-06 06:00:54 -08:00
Kazuaki Ishizaki	84a6182ddd	minor spelling tweaks Closes tensorflow/mlir#290 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/290 from kiszk:spelling_tweaks_201912 9d9afd16a723dd65754a04698b3976f150a6054a PiperOrigin-RevId: 284169681	2019-12-06 05:59:30 -08:00
Alex Zinenko	58adf99ed1	LLVM::AddressOfOp: properly take into account the address space The AddressOf operation in the LLVM dialect returns a pointer to a global variable. The latter may be in a non-default address space as indicated by the "addr_space" attribute. Check that the address space of the pointer returned by AddressOfOp matches that of the referenced GlobalOp. Update the AddressOfOp builder to respect this constraint. PiperOrigin-RevId: 284138860	2019-12-06 01:09:13 -08:00
River Riddle	71999ff7f2	Add include path to the TestDialect to fix broken build. PiperOrigin-RevId: 284067891	2019-12-05 15:33:33 -08:00
Jose Ignacio Gomez	f60bbb6c3b	[Linalg] Add permutation information to tiling This patch closes issue tensorflow/mlir#271. It adds an optional permutation map to declarative tiling transformations. The map is expressed as a list of integers. Closes tensorflow/mlir#288 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/288 from tetuante:issue271 2df2938d6a1f01b3bc404ded08dea2dd1e10b588 PiperOrigin-RevId: 284064151	2019-12-05 15:14:59 -08:00
nmostafa	daff60cd68	Add UnrankedMemRef Type Closes tensorflow/mlir#261 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/261 from nmostafa:nmostafa/unranked 96b6e918f6ed64496f7573b2db33c0b02658ca45 PiperOrigin-RevId: 284037040	2019-12-05 13:13:20 -08:00
Denis Khalikov	e67acfa468	[spirv] Add CompositeInsertOp operation A CompositeInsertOp operation makes a copy of a composite object, while modifying one part of it. Closes tensorflow/mlir#292 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/292 from denis0x0D:sandbox/composite_insert 2200962b9057bda53cd2f2866b461e2797196380 PiperOrigin-RevId: 284036551	2019-12-05 13:10:44 -08:00
River Riddle	33a64540ad	Add support for instance specific pass statistics. Statistics are a way to keep track of what the compiler is doing and how effective various optimizations are. It is useful to see what optimizations are contributing to making a particular program run faster. Pass-instance specific statistics take this even further as you can see the effect of placing a particular pass at specific places within the pass pipeline, e.g. they could help answer questions like "what happens if I run CSE again here". Statistics can be added to a pass by simply adding members of type 'Pass::Statistics'. This class takes as a constructor arguments: the parent pass pointer, a name, and a description. Statistics can be dumped by the pass manager in a similar manner to how pass timing information is dumped, i.e. via PassManager::enableStatistics programmatically; or -pass-statistics and -pass-statistics-display via the command line pass manager options. Below is an example: struct MyPass : public OperationPass<MyPass> { Statistic testStat{this, "testStat", "A test statistic"}; void runOnOperation() { ... ++testStat; ... } }; $ mlir-opt -pass-pipeline='func(my-pass,my-pass)' foo.mlir -pass-statistics Pipeline Display: ===-------------------------------------------------------------------------=== ... Pass statistics report ... ===-------------------------------------------------------------------------=== 'func' Pipeline MyPass (S) 15 testStat - A test statistic MyPass (S) 6 testStat - A test statistic List Display: ===-------------------------------------------------------------------------=== ... Pass statistics report ... ===-------------------------------------------------------------------------=== MyPass (S) 21 testStat - A test statistic PiperOrigin-RevId: 284022014	2019-12-05 11:53:28 -08:00
Mahesh Ravishankar	4d61a79db4	Allow specification of the workgroup size for GPUToSPIRV lowering. SPIR-V/Vulkan spec requires the workgroups size to be specified with the spv.ExecutionMode operation. This was hard-wired to be set to a particular value. It is now changed to be configurable by clients of the pass or of the patterns that implement the lowering from GPU to SPIRV. PiperOrigin-RevId: 284017482	2019-12-05 11:31:57 -08:00
Lei Zhang	037044b0ae	Add spv.AtomicCompareExchangeWeak PiperOrigin-RevId: 283997917	2019-12-05 10:06:24 -08:00
Lei Zhang	c0a9de29ad	[spirv] Fix nested loop (de)serialization For serialization, when we have nested ops, the inner loop will create multiple SPIR-V blocks. If the outer loop has block arguments (which correspond to OpPhi instructions), we defer the handling of OpPhi's parent block until we have serialized all blocks and then fix it up with the result <id>. These two cases happening together was generating an invalid SPIR-V blob because we previously assumed the parent block to be the block containing the terminator. That is not true anymore when the block contains structured control flow ops. If that happens, it should be fixed to use the structured control flow op's merge block. For deserialization, we record a map from header blocks to their corresponding merge and continue blocks during the initial deserialization and then use the info to construct spv.selection/spv.loop. The existing implementation will also fall apart when we have nested loops. If so, we clone all blocks for the outer loop, including the ones for the inner loop, to the spv.loop's region. So the map for header blocks' merge info needs to be updated; otherwise we are operating on already deleted blocks. PiperOrigin-RevId: 283949230	2019-12-05 04:39:37 -08:00
Tres Popp	b8cd0c1486	Move ModuleManager functionality into mlir::SymbolTable. Note for broken code, the following transformations occurred: ModuleManager::insert(Block::iterator, Operation) -> SymbolTable::insert(Operation, Block::iterator) ModuleManager::lookupSymbol -> SymbolTable::lookup ModuleManager::getModule() -> SymbolTable::getOp() ModuleManager::getContext() -> SymbolTable::getOp()->getContext() ModuleManager::* -> SymbolTable::* PiperOrigin-RevId: 283944635	2019-12-05 03:56:46 -08:00
Nicolas Vasilache	b3f7cf80a7	Add a CL option to Standard to LLVM lowering to use alloca instead of malloc/free. In the future, a more configurable malloc and free interface should be used and exposed via extra parameters to the `createLowerToLLVMPass`. Until requirements are gathered, a simple CL flag allows generating code that runs successfully on hardware that cannot use the stdlib. PiperOrigin-RevId: 283833424	2019-12-04 14:16:00 -08:00
Andy Davis	d20d763241	Add canonicalization patterns for vector CreateMaskOp and StridedSliceOp to be used in the unroll vector op transformation. Adds a ConstantMaskOp to the vector ops dialect. Adds the following canonicalization patterns: CreateMaskOp -> ConstantMaskOp StridedSliceOp(ConstantMaskOp) -> ConstantMaskOp PiperOrigin-RevId: 283816752	2019-12-04 13:00:43 -08:00
Nicolas Vasilache	edfaf925cf	Drop MaterializeVectorTransfers in favor of simpler declarative unrolling Now that we have unrolling as a declarative pattern, we can drop a full pass that has gone stale. In the future we may want to add specific unrolling patterns for VectorTransferReadOp. PiperOrigin-RevId: 283806880	2019-12-04 12:11:42 -08:00
Sean Silva	26484bc0b6	Print out large elementsattr's such that they are parseable. I found that when running crash reproducers, the elided elementsattr's would prevent parsing the IR repro. I found myself manually going and replacing the "..." with some valid IR. With this change, we now print elided attrs as `opaque<"", "0xDEADBEEF">` to clearly delineate them as being elided while still being parseable. PiperOrigin-RevId: 283781806	2019-12-04 10:19:54 -08:00
Scott Todd	bf45ff6aab	[spirv] Adding sqrt op in the GLSL extension. PiperOrigin-RevId: 283769736	2019-12-04 09:16:23 -08:00
Alex Zinenko	75175134d4	Loop coalescing: fix pointer chasing in use-chain traversal In the replaceAllUsesExcept utility function called from loop coalescing the iteration over the use-chain is incorrect. The use list nodes (IROperands) have next/prev links, and bluntly resetting the use would make the loop continue on uses of the value that was replaced instead of the original one. As a result, it could miss the existing uses and update the wrong ones. Make sure we increment the iterator before updating the use in the loop body. Reported-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#291. PiperOrigin-RevId: 283754195	2019-12-04 07:42:29 -08:00
Julian Gross	f7c6bc70a9	Added new FAbs, FCeil, Cos, Neg, Sign, Tanh operations. Closes tensorflow/mlir#251 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/251 from dfki-jugr:new_ops 0398997bf9953016898f873068e22916a062eb2b PiperOrigin-RevId: 283750699	2019-12-04 07:17:30 -08:00
Andy Davis	34e1f4aa51	Adds support for unrolling single-result vector operations with iterator type lists and indexing maps to a target vector size. Adds unit tests for unrolling the vector ContractionOp with different iteration orders. PiperOrigin-RevId: 283747503	2019-12-04 06:53:37 -08:00
Nicolas Vasilache	5c0c51a997	Refactor dependencies to expose Vector transformations as patterns - NFC This CL refactors some of the MLIR vector dependencies to allow decoupling VectorOps, vector analysis, vector transformations and vector conversions from each other. This makes the system more modular and allows extracting VectorToVector into VectorTransforms that do not depend on vector conversions. This refactoring exhibited a bunch of cyclic library dependencies that have been cleaned up. PiperOrigin-RevId: 283660308	2019-12-03 17:52:10 -08:00
Lei Zhang	50b2b26e70	[spirv] Add spv.GroupNonUniformBallot This CL also did the following cleanup: - Moved the test for spv.SubgroupBallotKHR to its own file - Wrapped generated canonicalization patterns in anonymous namespace - Updated header comments in SPVOps.td PiperOrigin-RevId: 283650091	2019-12-03 16:44:09 -08:00
Mahesh Ravishankar	c5ba37b6ae	Add a pass to legalize operations before lowering to SPIR-V. Not all StandardOps can be lowered to SPIR-V. For example, subview op implementation requires use of pointer bitcasts which is not valid according to SPIR-V spec (or at least is ambiguous about it). Such ops need to be removed/transformed before lowering to SPIR-V. The SPIRVLegalizationPass is added as a place where such legalizations can be added. Current implementation folds the subview ops with load/stores so that the lowering itself does not have to convert a subview op. PiperOrigin-RevId: 283642981	2019-12-03 16:06:17 -08:00
Sean Silva	82f9f9d112	Make diagnostic a bit clearer. This prints out in case of any pass failure. Not just a crash. PiperOrigin-RevId: 283616719	2019-12-03 14:01:25 -08:00
Andy Davis	2c13fd9f17	Add CreateMaskOp to the VectorOps dialect. PiperOrigin-RevId: 283591888	2019-12-03 11:55:54 -08:00
Sean Silva	67515e8d7a	Verifier: Better error message in case of successor operand mismatch. In particular, print the successor number in the diagnostic. PiperOrigin-RevId: 283585084	2019-12-03 11:24:31 -08:00
Mahesh Ravishankar	353fb2bd38	Convert MemRefType to a linearized array in SPIR-V lowering. The SPIR-V lowering used nested !spv.arrays to represent multi-dimensional arrays, with the hope that in conjunction with the layout annotations, the shape and layout of memref can be represented directly. It is unclear though how portable this representation will end up being. It will rely on driver compilers implementing complex index computations faithfully. A more portable approach is to use linearized arrays to represent memrefs and explicitly instantiate all the index computation in SPIR-V. This gives added benefit that we can further optimize the generated code in MLIR before generating the SPIR-V binary. PiperOrigin-RevId: 283571167	2019-12-03 10:21:16 -08:00
Alex Zinenko	993e79e9bd	Fix ViewOp to have at most one offset operand As described in the documentation, ViewOp is expected to take an optional dynamic offset followed by a list of dynamic sizes. However, the ViewOp parser did not include a check for the offset being a single value and accepted a list of values instead. Furthermore, several tests have been exercising the wrong syntax of a ViewOp, passing multiple values to the dynamic stride list, which was not caught by the parser. The trailing values could have been erroneously interpreted as dynamic sizes. This is likely due to resyntaxing of the ViewOp, with the previous syntax taking the list of sizes before the offset. Update the tests to use the syntax with the offset preceding the sizes. Worse, the conversion of ViewOp to the LLVM dialect assumed the wrong order of operands with offset in the trailing position, and erroneously relied on the permissive parsing that interpreted trailing dynamic offset values as leading dynamic sizes. Fix the lowering to use the correct order of operands. PiperOrigin-RevId: 283532506	2019-12-03 06:23:04 -08:00
Diego Caballero	330d1ff00e	AffineLoopFusion: Prevent fusion of multi-out-edge producer loops tensorflow/mlir#162 introduced a bug that incorrectly allowed fusion of producer loops with multiple outgoing edges. This commit fixes that problem. It also introduces a new flag to disable sibling loop fusion so that we can test producer-consumer fusion in isolation. Closes tensorflow/mlir#259 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/259 from dcaballe:dcaballe/fix_multi_out_edge_producer_fusion 578d5661705fd5c56c555832d5e0528df88c5282 PiperOrigin-RevId: 283531105	2019-12-03 06:09:50 -08:00
Stephan Herhut	2125c0e3a8	Extend conversion of SubViewOp to llvm to also support cases where size and stride are constant (i.e., there are no size and stride operands). We recently added canonicalization that rewrites constant size and stride operands to SubViewOp into static information in the type, so these patterns now occur during code generation. PiperOrigin-RevId: 283524688	2019-12-03 05:11:49 -08:00
Lei Zhang	1af9633d85	[spirv] Add spv.SubgroupBallotKHROp PiperOrigin-RevId: 283522284	2019-12-03 04:49:56 -08:00
Alex Zinenko	fdbb99cd62	Add linkage support to LLVMFuncOp A recent commit introduced the Linkage attribute to the LLVM dialect and used it in the Global Op. Also use it in LLVMFuncOp. As per LLVM Language Reference, if the linkage attribute is omitted, the function is assumed to have external linkage. PiperOrigin-RevId: 283493299	2019-12-03 00:26:44 -08:00
Aart Bik	3126004a5a	[VectorOps] Add legality rules to broadcast PiperOrigin-RevId: 283360101	2019-12-02 09:57:27 -08:00
Lei Zhang	b41162b3af	[ODS] Generate builders taking unwrapped value and defaults for attributes Existing builders generated by ODS require attributes to be passed in as mlir::Attribute or its subclasses. This is okay for aggregate-parameter builders, which are primarily to be used by programmatic C++ code generation; it is inconvenient for separate-parameter builders meant to be called in manually written C++ code because it requires developers to wrap raw values into mlir::Attribute by themselves. This CL extends to generate additional builder methods that take raw values for attributes and handles the wrapping in the builder implementation. Additionally, if an attribute appears late in the arguments list and has a default value, the default value is supplied in the declaration if possible. PiperOrigin-RevId: 283355919	2019-12-02 09:33:57 -08:00
Lei Zhang	4982eaf87c	[DRR] Introduce `$_` to ignore op argument match Right now op argument matching in DRR is position-based, meaning we need to specify N arguments for an op with N ODS-declared argument. This can be annoying when we don't want to capture all the arguments. `$_` is to remedy the situation. PiperOrigin-RevId: 283339992	2019-12-02 07:54:50 -08:00
Alexander Belyaev	9630fcbc52	Lower linalg.indexed_generic with libcall to LLVM. PiperOrigin-RevId: 283328994	2019-12-02 06:30:52 -08:00
Alex Zinenko	d5e627f84b	Introduce Linkage attribute to the LLVM dialect LLVM IR supports linkage on global objects such as global variables and functions. Introduce the Linkage attribute into the LLVM dialect, backed by an integer storage. Use this attribute on LLVM::GlobalOp and make it mandatory. Implement parsing/printing of the attribute and conversion to LLVM IR. See tensorflow/mlir#277. PiperOrigin-RevId: 283309328	2019-12-02 03:28:10 -08:00
Denis Khalikov	cd556f25de	[spirv] Check that operand of `spirv::CompositeExtractOp` is constant while folding. Closes tensorflow/mlir#281 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/281 from denis0x0D:sandbox/composite_ex_fold d02d73658bd1b9eaa515eb4e0aee34bc41d4252b PiperOrigin-RevId: 282971563	2019-11-28 13:27:56 -08:00
Jose Ignacio Gomez	0494ef60f7	[Linalg] Change attribute n_loop_types to iterator This addresses issue tensorflow/mlir#270. Linalg is updated to take the same form of iterator_types as vector contraction. Closes tensorflow/mlir#280 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/280 from tetuante:PRissue270 d26d88d090d3765d3b9884bfabdd023143f27287 PiperOrigin-RevId: 282905396	2019-11-28 01:59:55 -08:00
Lei Zhang	d4e4387fbf	[spirv] Add folders for spv.IAdd and spv.IMul Adding zero and multiplying one can be common when generating code for index calculation. This CL also sorted canonicalize.mlir to alphabetical order. PiperOrigin-RevId: 282828055	2019-11-27 13:46:52 -08:00
Aart Bik	9f89c34f4b	Fixed typo in Toy tutorial (second var e -> var f) PiperOrigin-RevId: 282810649	2019-11-27 11:58:45 -08:00
Nicolas Vasilache	1fa8c8070b	Implement Linalg to loops lowering as a pattern This CL rewrites the linalg ops to loops transformations as patterns that can be targeted directly from Tablegen. Reliance on OpFolder is removed and to cope with it we introduce local folding patterns that are applied greedily. PiperOrigin-RevId: 282765550	2019-11-27 07:32:13 -08:00
Aart Bik	e2232fbcee	[VectorOps] Refine BroadcastOp in VectorOps dialect Since second argument is always fully overwritten and shape is defined in "to" clause, it is not needed. Also renamed "into" to "to" now that arg is dropped. PiperOrigin-RevId: 282686475	2019-11-26 19:52:38 -08:00
Aart Bik	cf97263cb8	[VectorOps] Add a BroadcastOp to the VectorOps dialect PiperOrigin-RevId: 282643305	2019-11-26 14:43:31 -08:00
Mahesh Ravishankar	03620fa70a	Misc changes to lowering to SPIR-V. These changes were made to the SPIR-V lowering while adding support for lowering SubViewOp, but are not directly related. - Change the lowering of MemRefType to !spv.ptr<!spv.struct<!spv.array<...>[offset]>, ..> This is consistent with the Vulkan spec. - To enable testing a simple pattern of lowering functions is added to ConvertStandardToSPIRVPass. This is just used to convert the type of the arguments of the function. The added function lowering itself is not meant to be the way functions are eventually lowered into SPIR-V dialect. PiperOrigin-RevId: 282589644	2019-11-26 10:11:34 -08:00
Nicolas Vasilache	9059cf392d	Automated rollback of commit `d60133f89b` PiperOrigin-RevId: 282574110	2019-11-26 08:47:48 -08:00
Nicolas Vasilache	109338085d	Relax restriction on affine_apply dim and symbol operands The affine_apply operation is currently "doubly" affine and conflates two things: 1. it applies an affine map to a list of values of type `index` that are defined as either dim or symbol 2. it restricts (and propagates constraints on) the provenance of dims and symbols to a small subset of ops for which more restrictive polyhedral constraints apply. Point 2. is related to the ability to form so-called static control parts and is related to dependence analysis and legality of transformations. Point 1. however is completely independent, the only local implication of dims and symbol for affine_apply is that dims compose while symbols concatenate as well as the structural constraint that dims may not be multiplied. The properties of composition and canonicalization in affine_apply are more generally useful. This CL relaxes the verifier on affine_apply so it can be used more generally. The relevant affine.for/if/load/store op verifiers already implement the dim and symbol checking. See this thread for the related discussion: https://groups.google.com/a/tensorflow.org/g/mlir/c/HkwCbV8D9N0/m/8srUNrX6CAAJ PiperOrigin-RevId: 282562517	2019-11-26 07:39:05 -08:00
Lei Zhang	13c6e419ca	Add support for AttrSizedOperandSegments/AttrSizedResultSegments Certain operations can have multiple variadic operands and their size relationship is not always known statically. For such cases, we need a per-op-instance specification to divide the operands into logical groups or segments. This can be modeled by attributes. This CL introduces C++ trait AttrSizedOperandSegments for operands and AttrSizedResultSegments for results. The C++ trait just guarantees such size attribute has the correct type (1D vector) and values (non-negative), etc. It serves as the basis for ODS sugaring that with ODS argument declarations we can further verify the number of elements match the number of ODS-declared operands and we can generate handy getter methods. PiperOrigin-RevId: 282467075	2019-11-25 17:26:50 -08:00
Nicolas Vasilache	174076a157	Use vector.InsertStridedSlice in Vector -> Vector unrolling This CL uses the recently added op to finish the implementation of Vector -> Vector unrolling by replacing the "fake join op" by a series of InsertStridedSliceOp. Test is updated accordingly PiperOrigin-RevId: 282451126	2019-11-25 15:56:37 -08:00
Nicolas Vasilache	36469f7d2a	Add a vector.InsertStridedSliceOp This new op is the counterpart of vector.StridedSliceOp and will be used in the pattern rewrites for vector unrolling. PiperOrigin-RevId: 282447414	2019-11-25 15:37:13 -08:00
MLIR Team	1012c492f0	Allow LLVM::ExtractElementOp to have non-i32 indices. Also change the text format a bit, so that indices are braced by squares. PiperOrigin-RevId: 282437095	2019-11-25 14:44:52 -08:00
Ben Vanik	38d7870ee5	Make std.divis and std.diviu support ElementsAttr folding. PiperOrigin-RevId: 282434465	2019-11-25 14:31:43 -08:00
Andy Davis	8fc44a4d13	Update VectorContractionOp to take iterator types and index mapping attributes compatible with linalg ops. PiperOrigin-RevId: 282412311	2019-11-25 12:40:00 -08:00
Christian Sigg	d60133f89b	Changing directory shortcut for CPU/GPU runner utils. Moving cuda-runtime-wrappers.so into subdirectory to match libmlir_runner_utils.so. Provide parent directory when running test and load .so from subdirectory. PiperOrigin-RevId: 282410749	2019-11-25 12:30:54 -08:00
Lei Zhang	9b6e6cef68	De-duplicate EnumAttr overrides by defining defaults EnumAttr should provide meaningful defaults so concrete instances do not need to duplicate the fields. PiperOrigin-RevId: 282398431	2019-11-25 11:29:55 -08:00
Mahesh Ravishankar	bd485afda0	Introduce attributes that specify the final ABI for a spirv::ModuleOp. To simplify the lowering into SPIR-V, while still respecting the ABI requirements of SPIR-V/Vulkan, split the process into two 1) While lowering a function to SPIR-V (when the function is an entry point function), allow specifying attributes on arguments and function itself that describe the ABI of the function. 2) Add a pass that materializes the ABI described in the function. Two attributes are needed. 1) Attribute on arguments of the entry point function that describe the descriptor_set, binding, storage class, etc, of the spv.globalVariable this argument will be replaced by 2) Attribute on function that specifies workgroup size, etc. (for now only workgroup size). Add the pass -spirv-lower-abi-attrs to materialize the ABI described by the attributes. This change makes the SPIRVBasicTypeConverter class unnecessary and is removed, further simplifying the SPIR-V lowering path. PiperOrigin-RevId: 282387587	2019-11-25 11:19:56 -08:00
Mahesh Ravishankar	1ea231bd39	Allow memref_cast from static strides to dynamic strides. Memref_cast supports cast from static shape to dynamic shape memrefs. The same should be true for strides as well, i.e., a memref with static strides can be cast to a memref with dynamic strides. PiperOrigin-RevId: 282381862	2019-11-25 11:08:56 -08:00
Nicolas Vasilache	01145544aa	Add vector.insertelement op This is the counterpart of vector.extractelement op and has the same limitations at the moment (static I64IntegerArrayAttr to express position). This restriction will be lifted in the future. LLVM lowering will be added in a subsequent commit. PiperOrigin-RevId: 282365760	2019-11-25 08:47:15 -08:00
Alex Zinenko	bf4692dc49	Introduce gpu.func Introduce a new function-like operation to the GPU dialect to provide a placeholder for the execution semantic description and to add support for GPU memory hierarchy. This aligns with the overall goal of the dialect to expose the common abstraction layer for GPU devices, in particular by providing an MLIR unit of semantics (i.e. an operation) for memory modeling. This proposal has been discussed in the mailing list: https://groups.google.com/a/tensorflow.org/d/msg/mlir/RfXNP7Hklsc/MBNN7KhjAgAJ As decided, the "convergence" aspect of the execution model will be factored out into a new discussion and therefore is not included in this commit. This commit only introduces the operation but does not hook it up with the remaining flow. The intention is to develop the new flow while keeping the old flow operational and do the switch in a simple, separately reversible commit. PiperOrigin-RevId: 282357599	2019-11-25 08:10:37 -08:00
Ben Vanik	d2284f1f0b	Support folding of StandardOps with DenseElementsAttr. PiperOrigin-RevId: 282270243	2019-11-24 19:23:38 -08:00
Lei Zhang	aaafeac89b	[spirv] NFC: rename test files and sort tests inside PiperOrigin-RevId: 282132339	2019-11-23 06:58:38 -08:00
Uday Bondhugula	6a101671b0	Make isValidSymbol more powerful The check in isValidSymbol, as far as a DimOp result went, checked if the dim op was on a top-level memref. However, any alloc'ed, view, or subview memref would be fine as long as the corresponding dimension of that memref is either a static one or was in turn created using a valid symbol in the case of dynamic dimensions. Reported-by: Jose Gomez Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#252 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/252 from bondhugula:symbol 7b57dc394df9375e651f497231c6e4525a32a662 PiperOrigin-RevId: 282097114	2019-11-22 22:09:31 -08:00
River Riddle	b8ee563449	NFC: Remove unnecessarily guarded tablegen includes. Support for including a file multiple times was added in tablegen, removing the need for these extra guards. This is because we already insert c/c++ style header guards within each of the specific .td files. PiperOrigin-RevId: 282076728	2019-11-22 18:01:57 -08:00
Denis Khalikov	a5cda4763f	[spirv] Add a canonicalizer for `spirv::LogicalNotOp`. Add a canonicalizer for `spirv::LogicalNotOp`. Converts: * spv.LogicalNot(spv.IEqual(...)) -> spv.INotEqual(...) * spv.LogicalNot(spv.INotEqual(...)) -> spv.IEqual(...) * spv.LogicalNot(spv.LogicalEqual(...)) -> spv.LogicalNotEqual(...) * spv.LogicalNot(spv.LogicalNotEqual(...)) -> spv.LogicalEqual(...) Also moved the test for spv.IMul to arithmetic tests. Closes tensorflow/mlir#256 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/256 from denis0x0D:sandbox/canon_logical_not 76ab5787b2c777f948c8978db061d99e76453d44 PiperOrigin-RevId: 282012356	2019-11-22 12:25:52 -08:00
Mahesh Ravishankar	6db8530c26	Add more canonicalizations for SubViewOp. Depending on which of the offsets, sizes, or strides are constant, the subview op can be canonicalized in different ways. Add such canonicalizations, which generalize the existing approach of canonicalizing subview op only if all of offsets, sizes and shapes are constants. PiperOrigin-RevId: 282010703	2019-11-22 12:14:18 -08:00
River Riddle	c35378003c	Add support for using the ODS result names as the Asm result names for multi-result operations. This changes the OpDefinitionsGen to automatically add the OpAsmOpInterface for operations with multiple result groups using the provided ODS names. We currently just limit the generation to multi-result ops as most single result operations don't have an interesting name (result/output/etc.). An example is shown below: // The following operation: def MyOp : ... { let results = (outs AnyType:$first, Variadic<AnyType>:$middle, AnyType); } // May now be printed as: %first, %middle:2, %0 = "my.op" ... PiperOrigin-RevId: 281834156	2019-11-21 14:55:46 -08:00
Christian Sigg	d7c17195a4	Change CUDA tests to use print_memref. Swap dimensions in all-reduce-op test. PiperOrigin-RevId: 281791744	2019-11-21 11:26:36 -08:00
Nicolas Vasilache	2c4985816f	Split Linalg declarative patterns from specific test patterns - NFC This will make it easier to scale out test patterns and build specific passes that do not interfere with independent testing. PiperOrigin-RevId: 281736335	2019-11-21 06:40:17 -08:00
Alex Zinenko	b5af3784a6	Don't force newline before function attributes Due to legacy reasons, a newline character followed by two spaces was always inserted before the attributes of the function Op in pretty form. This breaks formatting when functions are nested in some other operations. Don't print the newline and just put the attributes on the same line, which is also more consistent with module Op. Line breaking aware of indentation can be introduced separately into the parser if deemed useful. PiperOrigin-RevId: 281721793	2019-11-21 05:08:19 -08:00
MLIR Team	75379a684f	Correctly parse empty affine maps. Previously the test case crashed or produced an error. PiperOrigin-RevId: 281630540	2019-11-20 18:30:15 -08:00
River Riddle	fafb708b9a	Merge DCE and unreachable block elimination into a new utility 'simplifyRegions'. This moves the different canonicalizations of regions into one place and invokes them in the fixed-point iteration of the canonicalizer. PiperOrigin-RevId: 281617072	2019-11-20 15:53:19 -08:00
Andy Davis	d6a70b31be	Add VectorContractionOp to the VectorOps dialect. PiperOrigin-RevId: 281605471	2019-11-20 14:53:57 -08:00
Mahesh Ravishankar	1145cebdab	Verify subview op result has dynamic shape, when sizes are specified. If the sizes are specified as arguments to the subview op, then the shape must be dynamic as well. PiperOrigin-RevId: 281591608	2019-11-20 14:16:05 -08:00
Sean Silva	e4f83c6c26	Add multi-level DCE pass. This is a simple multi-level DCE pass that operates pretty generically on the IR. Its key feature compared to the existing peephole dead op folding that happens during canonicalization is being able to delete recursively dead cycles of the use-def graph, including block arguments. PiperOrigin-RevId: 281568202	2019-11-20 12:55:10 -08:00
Mahesh Ravishankar	19212105dd	Changes to SubViewOp to make it more amenable to canonicalization. The current SubViewOp specification allows for either all offsets, shape and stride to be dynamic or all of them to be static. There are opportunities for more fine-grained canonicalization based on which of these are static. For example, if the sizes are static, the result memref is of static shape. The specification of SubViewOp is modified to allow one or more of offsets, shapes and strides to be statically specified. The verification is updated to ensure that the result type of the subview op is consistent with which of these are static and which are dynamic. PiperOrigin-RevId: 281560457	2019-11-20 12:32:51 -08:00
Nicolas Vasilache	fa14d4f6ab	Implement unrolling of vector ops to finer-grained vector ops as a pattern. This CL uses the pattern rewrite infrastructure to implement a simple VectorOps -> VectorOps legalization strategy to unroll coarse-grained vector operations into finer-grained ones. The transformation is written using local pattern rewrites to allow composition with other rewrites. It proceeds by iteratively introducing fake cast ops and canonicalizing or lowering them away where appropriate. This is an example of writing transformations as compositions of local pattern rewrites that should enable us to make them significantly more declarative. PiperOrigin-RevId: 281555100	2019-11-20 11:49:36 -08:00
River Riddle	eb418559ef	Add a new OpAsmOpInterface to allow for ops to directly hook into the AsmPrinter. This interface provides more fine-grained hooks into the AsmPrinter than the dialect interface, allowing for operations to define the asm name to use for results directly on the operations themselves. The hook is also expanded to enable defining named result "groups". Get a special name to use when printing the results of this operation. The given callback is invoked with a specific result value that starts a result "pack", and the name to give this result pack. To signal that a result pack should use the default naming scheme, a None can be passed in instead of the name. For example, if you have an operation that has four results and you want to split these into three distinct groups you could do the following: setNameFn(getResult(0), "first_result"); setNameFn(getResult(1), "middle_results"); setNameFn(getResult(3), ""); // use the default numbering. This would print the operation as follows: %first_result, %middle_results:2, %0 = "my.op" ... PiperOrigin-RevId: 281546873	2019-11-20 10:45:45 -08:00
Nicolas Vasilache	3c055957de	Add StridedMemRef<>::operator[] - NFC This operator is used for internal debugging purposes. PiperOrigin-RevId: 281544152	2019-11-20 10:17:13 -08:00
Alexander Belyaev	e50261657f	Fix 'the the' typo. PiperOrigin-RevId: 281501234	2019-11-20 05:38:14 -08:00
Stephan Herhut	abb626686d	Extend kernel outlining to also consider dim worth inlining. PiperOrigin-RevId: 281483447	2019-11-20 02:59:35 -08:00
Christian Sigg	f868adafee	Make type and rank explicit in mcuMemHostRegister function. Fix registered size of indirect MemRefType kernel arguments. PiperOrigin-RevId: 281362940	2019-11-19 13:13:02 -08:00
Nicolas Vasilache	ee95f6f259	Add VectorOps.StridedSliceOp The `vector.strided_slice` takes an n-D vector, k-D `offsets` integer array attribute, a k-D `sizes` integer array attribute, a k-D `strides` integer array attribute and extracts the n-D subvector at the proper offset. Returns an n-D vector where the first k-D dimensions match the `sizes` attribute. The returned subvector contains the elements starting at offset `offsets` and ending at `offsets + sizes`. Example: ``` %1 = vector.strided_slice %0 {offsets : [0, 2], sizes : [2, 4], strides : [1, 1]}: vector<4x8x16xf32> // returns a vector<2x4x16xf32> ``` This op will be useful for progressive lowering within the VectorOp dialect. PiperOrigin-RevId: 281352749	2019-11-19 12:22:34 -08:00
Nicolas Vasilache	3732ba4def	Fix pretty printer corner case in mlir_runner_utils.cpp. In the particular case where the size of a memref dimension is 1, double printing would happen because printLast was called unconditionally. This CL fixes the print and updates an incorrect test that should have caught this in the first place. PiperOrigin-RevId: 281345142	2019-11-19 11:52:27 -08:00
Diego Caballero	dd5a7cb488	Add getRemappedValue to ConversionPatternRewriter This method is needed for N->1 conversion patterns to retrieve remapped Values used in the original N operations. Closes tensorflow/mlir#237 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/237 from dcaballe:dcaballe/getRemappedValue 1f64fadcf2b203f7b336ff0c5838b116ae3625db PiperOrigin-RevId: 281321881	2019-11-19 11:09:39 -08:00
Alex Zinenko	8961d8e32f	Change conversion CLI flag from -lower-to-llvm to -convert-std-to-llvm The command-line flag name `lower-to-llvm` for the pass performing dialect conversion from the Standard dialect to the LLVM dialect is misleading and inconsistent with most of the conversion passes. It leads the user to believe that there are no restrictions on what can be converted, while in fact only a subset of the Standard dialect can be converted (with operations from other dialects converted by separate passes). Use `convert-std-to-llvm` that better reflects what the pass does and is consistent with most other conversions. PiperOrigin-RevId: 281238797	2019-11-19 00:34:51 -08:00
Hanhan Wang	c614c92fdc	Support SPIR-V constant op to take DenseElementsAttr as input. Iterates each element to build the array. This includes a little refactor to combine bool/int/float into a function, since they are similar. The only difference is calling different function in the end. PiperOrigin-RevId: 281210288	2019-11-18 20:02:05 -08:00
Alexander Belyaev	8c6a5233d5	Lower linalg.indexed_generic to loops. PiperOrigin-RevId: 281169885	2019-11-18 16:55:15 -08:00
Andy Davis	a6a287335d	Fix SubViewOp stride calculation in constant folding. Adds unit tests for subview offset and stride argument constant folding. PiperOrigin-RevId: 281161041	2019-11-18 15:01:08 -08:00
River Riddle	9873a29817	Add a parseAttribute<AttrType> overload for the non-type case. The variant that accepts a type will check that the parsed attribute is a valid instance of AttrType. The non-type variant would silently fail in this case, leading to garbage attribute values. PiperOrigin-RevId: 281136528	2019-11-18 13:11:36 -08:00
Denis Khalikov	6c77e59bfd	[spirv] Add a canonicalizer for BitcastOp. Convert chained `spirv::BitcastOp` operations into one `spirv::BitcastOp` operation. Closes tensorflow/mlir#238 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/238 from denis0x0D:sandbox/canon_bitcast 4352ed4f81b959ec92f849c599e733b62a99c010 PiperOrigin-RevId: 281129234	2019-11-18 12:37:00 -08:00
Andy Davis	68a8da4a93	Fix Affine Loop Fusion test case reported on github. This CL utilizes the more robust fusion feasibility analysis being built out in LoopFusionUtils, which will eventually be used to replace the current affine loop fusion pass. PiperOrigin-RevId: 281112340	2019-11-18 11:20:37 -08:00
Stephan Herhut	f0f3b71d67	Implement folding of pattern dim(subview(_)[...][s1, ..., sn][...], i) -> si. PiperOrigin-RevId: 281042016	2019-11-18 04:31:33 -08:00
Alex Zinenko	b8dc3fd812	Rename CLI flags -lower-gpu-ops-to--ops to -convert-gpu-to- This makes the flags consistent with the naming scheme used elsewhere in the codebase for dialect conversions. PiperOrigin-RevId: 281027517	2019-11-18 02:43:10 -08:00
Denis Khalikov	68e48ba111	[spirv] Add bit ops This CL added op definitions for a few bit operations: * OpBitFieldInsert * OpBitFieldSExtract * OpBitFieldUExtract Closes tensorflow/mlir#233 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/233 from denis0x0D:sandbox/bit_field_ops e7fd85b00d72d483d7992dc42b9cc4d673903455 PiperOrigin-RevId: 280691816	2019-11-15 11:03:19 -08:00
Lei Zhang	a0986bf43d	NFC: Convert CmpIPredicate in StandardOps to use EnumAttr This turns several hand-written functions to auto-generated ones. PiperOrigin-RevId: 280684326	2019-11-15 10:17:31 -08:00
Lei Zhang	88843ae37c	Use aggregate-parameter builder for ops having autogen type-deduction builder Thus far DRR always invokes the separate-parameter builder (i.e., requiring a separate parameter for each result-type/operand/attribute) for creating ops, no matter whether we can auto-generate a builder with type-deduction ability or not. This CL changes the path for ops that we can auto-generate type-deduction builders, i.e., with SameOperandsAndResultType/FirstAttrDerivedResultType traits. Now they are going through an aggregate-parameter builder (i.e., requiring one parameter for all result-types/operands/attributes). It is expected this approach will be more friendly for future shape inference function autogen and calling those autogen'd shape inference functions without excessive packing and repacking operand/attribute lists. Also, it would enable better support for creating ops with optional attributes because we are not required to provide an Attribute() as placeholder for an optional attribute anymore. PiperOrigin-RevId: 280654800	2019-11-15 07:33:54 -08:00
Stephan Herhut	57bafc674e	Mark std.view as no-sideeffect. The same reasoning as for std.subview applies. PiperOrigin-RevId: 280639308	2019-11-15 05:28:31 -08:00
Stephan Herhut	9c7bceb4fe	Mark std.subview as no-sideeffect. In essence, std.subview is just an abstract indexing transformation (somewhat akin to a gep in llvm) and by itself has no effect. From a practical perspective this helps, as it allows to remove dead subview operations. PiperOrigin-RevId: 280630046	2019-11-15 04:00:31 -08:00
Nicolas Vasilache	0b271b7dfe	Refactor the LowerVectorTransfers pass to use the RewritePattern infra - NFC This is step 1/n in refactoring infrastructure along the Vector dialect to make it ready for retargetability and composable progressive lowering. PiperOrigin-RevId: 280529784	2019-11-14 15:40:07 -08:00
Andy Davis	a4669cd3b4	Adds canonicalizer to SubViewOp which folds constants from base memref and operands into the subview result memref type. Changes SubViewOp to support zero operands case, when offset, strides and sizes are all constant. PiperOrigin-RevId: 280485075	2019-11-14 12:23:04 -08:00
Lei Zhang	796ca609eb	[ODS] Fix operation argument population to avoid crash The `Operator` class keeps an `arguments` field, which contains pointers to `operands` and `attributes` elements. Thus it must be populated after `operands` and `attributes` are finalized so to have stable pointers. SmallVector may re-allocate when still having new elements added, which will invalidate pointers. PiperOrigin-RevId: 280466896	2019-11-14 11:03:29 -08:00
Alex Zinenko	bf5916e7a4	Use MemRefDescriptor in Vector-to-LLVM conversion Following up on the consolidation of MemRef descriptor conversion, update Vector-to-LLVM conversion to use the helper class that abstracts away the implementation details of the MemRef descriptor. This also makes the types of the attributes in emitted llvm.insert/extractelement operations consistently i64 instead of a mix of index and i64. PiperOrigin-RevId: 280441451	2019-11-14 09:05:42 -08:00
Nicolas Vasilache	f2b6ae9991	Move VectorOps to Tablegen - (almost) NFC This CL moves VectorOps to Tablegen and cleans up the implementation. This is almost NFC but 2 changes occur: 1. an interface change occurs in the padding value specification in vector_transfer_read: the value becomes non-optional. As a shortcut we currently use %f0 for all paddings. This should become an OpInterface for vectorization in the future. 2. the return type of vector.type_cast is trivial and simplified to `memref<vector<...>>` Relevant roundtrip and invalid tests that used to sit in core are moved to the vector dialect. The op documentation is moved to the .td file. PiperOrigin-RevId: 280430869	2019-11-14 08:15:23 -08:00
Jacques Pienaar	d1c99e10d0	Do not emit aliases when printing local form Expand local scope printing to skip printing aliases as aliases are printed out at the top of a module and may not be part of the output generated by local scope print. PiperOrigin-RevId: 280278617	2019-11-13 14:21:49 -08:00
Nicolas Vasilache	0bd6390b54	Deprecate linalg.subview in favor of std.subview This CL uses the now standard std.subview in linalg. Two shortcuts are currently taken to allow this port: 1. the type resulting from a view is currently degraded to fully dynamic to pass the SubViewOp verifier. 2. indexing into SubViewOp may access out of bounds since lowering to LLVM does not currently enforce it by construction. These will be fixed in subsequent commits after discussions. PiperOrigin-RevId: 280250129	2019-11-13 12:10:09 -08:00
Sean Silva	486f2122cd	Add FuncOp::eraseArgument This is a quite complex operation that users are likely to attempt to write themselves and get wrong (citation: users=me). Ideally, we could pull this into FunctionLike, but for now, the FunctionType rewriting makes it FuncOp specific. We would need some hook for rewriting the function type (which for LLVM's func op, would need to rewrite the underlying LLVM type). PiperOrigin-RevId: 280234164	2019-11-13 10:59:55 -08:00
River Riddle	d985c74883	NFC: Refactor block signature conversion to not erase the original arguments. This refactors the implementation of block signature(type) conversion to not insert fake cast operations to perform the type conversion, but to instead create a new block containing the proper signature. This has the benefit of enabling the use of pre-computed analyses that rely on mapping values. It also leads to a much cleaner implementation overall. The major user facing change is that applySignatureConversion will now replace the entry block of the region, meaning that blocks generally shouldn't be cached over calls to applySignatureConversion. PiperOrigin-RevId: 280226936	2019-11-13 10:27:53 -08:00
River Riddle	6df8369941	Rename the current parseSymbolName to parseOptionalSymbolName The current implementation silently fails if the '@' identifier isn't present, making it similar to the 'optional' parse methods. This change renames the current implementation to 'Optional' and adds a new 'parseSymbolName' that emits an error. PiperOrigin-RevId: 280214610	2019-11-13 09:32:20 -08:00
Hanhan Wang	85d7fb3324	Make VariableOp instructions be in the first block in the function. Since VariableOp is serialized during processBlock, we add two more fields, `functionHeader` and `functionBody`, to collect instructions for a function. After all the blocks have been processed, we append them to the `functions`. Also, fix a bug in processGlobalVariableOp. The global variables should be encoded into `typesGlobalValues`. PiperOrigin-RevId: 280105366	2019-11-12 18:59:15 -08:00
Mahesh Ravishankar	2be53603e9	Add operations needed to support lowering of AffineExpr to SPIR-V. Lowering of CmpIOp, DivISOp, RemISOp, SubIOp and SelectOp to SPIR-V dialect enables the lowering of operations generated by AffineExpr -> StandardOps conversion into the SPIR-V dialect. PiperOrigin-RevId: 280039204	2019-11-12 13:20:06 -08:00
Lei Zhang	b259c26eb0	Add support for OpPhi in loop header block During deserialization, the loop header block will be moved into the spv.loop's region. If the loop header block has block arguments, we need to make sure it is correctly carried over to the block where the new spv.loop resides. During serialization, we need to make sure block arguments from the spv.loop's entry block are not silently dropped. PiperOrigin-RevId: 280021777	2019-11-12 12:00:28 -08:00
River Riddle	626e1fd95e	Add an option to print an operation if a diagnostic is emitted on it It is often helpful to inspect the operation that the error/warning/remark/etc. originated from, especially in the context of debugging or in the case of a verifier failure. This change adds an option 'mlir-print-op-on-diagnostic' that attaches the operation as a note to any diagnostic that is emitted on it via Operation::emit(Error\|Warning\|Remark). In the case of an error, the operation is printed in the generic form. PiperOrigin-RevId: 280021438	2019-11-12 11:59:19 -08:00
Mahesh Ravishankar	104af84f4c	Add Conversion to lower loop::ForOp to spirv::LoopOp. loop::ForOp can be lowered to the structured control flow represented by spirv::LoopOp by making the continue block of the spirv::LoopOp the loop latch and the merge block the exit block. The resulting spirv::LoopOp has a single back edge from the continue to header block, and a single exit from header to merge. PiperOrigin-RevId: 280015614	2019-11-12 11:33:27 -08:00
Nicolas Vasilache	51de3f688e	Add LLVM lowering of std.subview A followup CL will replace usage of linalg.subview by std.subview. PiperOrigin-RevId: 279961981	2019-11-12 07:23:18 -08:00
Andy Davis	82d2c43eca	Adds affine.min operation which returns the minimum value from a multi-result affine map. This operation is useful for things like computing the dynamic value of affine loop bounds, and is trivial to constant fold. PiperOrigin-RevId: 279959714	2019-11-12 07:08:49 -08:00
Nicolas Vasilache	f51a155337	Add support for alignment attribute in std.alloc. This CL adds an extra pointer to the memref descriptor to allow specifying alignment. In a previous implementation, we used 2 types: `linalg.buffer` and `view` where the buffer type was the unit of allocation/deallocation/alignment and `view` was the unit of indexing. After multiple discussions it was decided to use a single type, which conflates both, so the memref descriptor now needs to carry both pointers. This is consistent with the [RFC-Proposed Changes to MemRef and Tensor MLIR Types](https://groups.google.com/a/tensorflow.org/forum/#!searchin/mlir/std.view%7Csort:date/mlir/-wKHANzDNTg/4K6nUAp8AAAJ). PiperOrigin-RevId: 279959463	2019-11-12 07:06:54 -08:00
River Riddle	9b9c647cef	Add support for nested symbol references. This change allows for adding additional nested references to a SymbolRefAttr to allow for further resolving a symbol if that symbol also defines a SymbolTable. If a referenced symbol also defines a symbol table, a nested reference can be used to refer to a symbol within that table. Nested references are printed after the main reference in the following form: symbol-ref-attribute ::= symbol-ref-id (`::` symbol-ref-id)* Example: module @reference { func @nested_reference() } my_reference_op @reference::@nested_reference Given that SymbolRefAttr is now more general, the existing functionality centered around a single reference is moved to a derived class FlatSymbolRefAttr. Followup commits will add support to lookups, rauw, etc. for scoped references. PiperOrigin-RevId: 279860501	2019-11-11 18:18:31 -08:00
Andy Davis	5cf6e0ce7f	Adds std.subview operation which takes dynamic offsets, sizes and strides and returns a memref type which represents sub/reduced-size view of its memref argument. This operation is a companion operation to the std.view operation added as proposed in "Updates to the MLIR MemRefType" RFC. PiperOrigin-RevId: 279766410	2019-11-11 10:33:27 -08:00
Stephan Herhut	e04d4bf865	Also consider index constants when folding integer arithmetics with constants. PiperOrigin-RevId: 279698088	2019-11-11 02:34:21 -08:00
MLIR Team	9fbf52e330	Look for SymbolRefAttr in KernelOutlining instead of hard-coding CallOp This code should be exercised using the existing kernel outlining unit test, but let me know if I should add a dedicated unit test using a fake call instruction as well. PiperOrigin-RevId: 279436321	2019-11-08 19:13:13 -08:00
Denis Khalikov	4697d657b7	[spirv] Add bit ops This CL added op definitions for a few bit operations: * OpShiftLeftLogical * OpShiftRightArithmetic * OpShiftRightLogical * OpBitCount * OpBitReverse * OpNot Also moved the definition of spv.BitwiseAnd to follow the lexicographical order. Closes tensorflow/mlir#215 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/215 from denis0x0D:sandbox/bit_ops d9b0852b689ac6c4879a9740b1740a2357f44d24 PiperOrigin-RevId: 279350470	2019-11-08 11:17:05 -08:00
Alex Zinenko	09e8e7107a	mlir-translate: support -verify-diagnostics MLIR translation tools can emit diagnostics and we want to be able to check if it is indeed the case in tests. Reuse the source manager error handlers provided for mlir-opt to support the verification in mlir-translate. This requires us to change the signature of the functions that are registered to translate sources to MLIR: it now takes a source manager instead of a memory buffer. PiperOrigin-RevId: 279132972	2019-11-07 11:42:46 -08:00
Uday Bondhugula	eb47d5ee66	Fix asm printer for affine expr - fixes tensorflow/mlir#201 Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#204 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/204 from bondhugula:printfix 3f8a5b65391f45598258b2735fecaa409fbde848 PiperOrigin-RevId: 279115720	2019-11-07 10:27:27 -08:00
Andy Davis	8f00b4494d	Swap operand order in std.view operation so that offset appears before dynamic sizes in the operand list. PiperOrigin-RevId: 279114236	2019-11-07 10:20:23 -08:00
River Riddle	6b4e30b7c8	Add Ch-7 of the toy tutorial detailing how to define new types. This chapter adds a new composite type to Toy, and shows the process of adding a new type to the IR, adding and updating operations to use it, and constant folding operations producing it. PiperOrigin-RevId: 279107885	2019-11-07 09:54:04 -08:00
Andy Davis	5fbdb67b0a	Add canonicalizer for ViewOp which folds constants into the ViewOp memref shape and layout map strides and offset. PiperOrigin-RevId: 279088023	2019-11-07 08:05:03 -08:00
Jacques Pienaar	7af61f6bcd	Add compatible query method to infer type interface A return type that differs from the inferred return type need not indicate that an operation is invalid (e.g., tensor<*xf32> vs tensor<10xf32>) but they should be compatible for the operation to be considered valid. Add method to query if inferred type is compatible with return type. Also add InferTypeOpIntefaceDefault trait that considers equality and compatibility as the same. Currently an op has to opt in to using it explicitly. PiperOrigin-RevId: 279085639	2019-11-07 07:51:45 -08:00
Nicolas Vasilache	72040bf7c8	Update Linalg to use std.view Now that a view op has graduated to the std dialect, we can update Linalg to use it and remove ops that have become obsolete. As a byproduct, the linalg buffer and associated ops can also disappear. PiperOrigin-RevId: 279073591	2019-11-07 06:33:10 -08:00
Alexander Belyaev	eee9cbdeb7	Add IndexedGenericOp to Linalg. PiperOrigin-RevId: 279013404	2019-11-06 22:36:25 -08:00
Nicolas Vasilache	ffebc8ce1d	Drop spurious test file PiperOrigin-RevId: 278959717	2019-11-06 16:00:57 -08:00
Nicolas Vasilache	7f6c6084b5	Add lowering of std.view to LLVM This CL ports the lowering of linalg.view to the newly introduced std.view. Differences in implementation relate to std.view having slightly different semantics: 1. a static or dynamic offset can be specified. 2. the size of the (contiguous) shape is passed instead of a range. 3. static size and stride information is extracted from the memref type rather than the range. Besides these differences, lowering behaves the same. A future CL will update Linalg to use this unified infrastructure. PiperOrigin-RevId: 278948853	2019-11-06 15:06:16 -08:00
Andy Davis	b5654d1311	Add ViewOp verification for dynamic strides, and address some comments from previous change. PiperOrigin-RevId: 278903187	2019-11-06 11:25:54 -08:00
Andy Davis	c38dca7f4b	Add ViewOp to the StandardOps dialect, which casts a 1D/i8 element type memref type to an N-D memref type. Proposed in RFC: https://groups.google.com/a/tensorflow.org/forum/#!searchin/mlir/std.view%7Csort:date/mlir/-wKHANzDNTg/4K6nUAp8AAAJ Supports creating the N-D memref type with dynamic sizes and at a dynamic offset within the 1D base memref. This change contains op definition/parsing/printing and tests. Follow up changes will handle constant shape/layout map folding and llvm lowering. PiperOrigin-RevId: 278869990	2019-11-06 08:54:12 -08:00
Eric Schweitz	0d545921ea	Add support for the LLVM FNeg instruction Closes tensorflow/mlir#216 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/216 from schweitzpgi:llvmir-fneg-op f9b5f185845d671b745ab6fc213d5d9aff044b34 PiperOrigin-RevId: 278795325	2019-11-06 00:02:10 -08:00
James Molloy	250a11ae0f	[llvm] Allow GlobalOp to take a region for complex initializers This allows GlobalOp to either take a value attribute (for simple constants) or a region that can contain IR instructions (that must be constant-foldable) to create a ConstantExpr initializer. Example: // A complex initializer is constructed with an initializer region. llvm.mlir.global constant @int_gep() : !llvm<"i32*"> { %0 = llvm.mlir.addressof @g2 : !llvm<"i32*"> %1 = llvm.mlir.constant(2 : i32) : !llvm.i32 %2 = llvm.getelementptr %0[%1] : (!llvm<"i32*">, !llvm.i32) -> !llvm<"i32*"> llvm.return %2 : !llvm<"i32*"> } PiperOrigin-RevId: 278717836	2019-11-05 15:11:01 -08:00
James Molloy	6b534ecbcb	[llvm] Add initial import of LLVM modules to mlir-translate This adds an importer from LLVM IR or bitcode to the LLVM dialect. The importer is registered with mlir-translate. Known issues exposed by this patch but not yet fixed: * Globals' initializers are attributes, which makes it impossible to represent a ConstantExpr. This will be fixed in a followup. * icmp returns i32 rather than i1. * select and a couple of other instructions aren't implemented. * llvm.cond_br takes its successors in a weird order. The testing here is known to be non-exhaustive. I'd appreciate feedback on where this functionality should live. It looks like the translator from MLIR to LLVM lives in Target/, but the SPIR-V deserializer lives in Dialect/ which is why I've put this here too. PiperOrigin-RevId: 278711683	2019-11-05 14:41:38 -08:00
River Riddle	2366561a39	Add a PatternRewriter hook to merge blocks, and use it to support folding branches. A pattern rewriter hook, mergeBlock, is added that allows for merging the operations of one block into the end of another. This is used to support a canonicalization pattern for branch operations that folds the branch when the successor has a single predecessor (the branch block). Example: ^bb0: %c0_i32 = constant 0 : i32 br ^bb1(%c0_i32 : i32) ^bb1(%x : i32): return %x : i32 becomes: ^bb0: %c0_i32 = constant 0 : i32 return %c0_i32 : i32 PiperOrigin-RevId: 278677825	2019-11-05 11:57:38 -08:00
MLIR Team	1f43d0d000	[NVVM] Add mma.sync operation. PiperOrigin-RevId: 278440547	2019-11-04 12:36:37 -08:00
River Riddle	e4a912eb5a	Update the SPV dialect type parser to use the methods on DialectAsmParser directly. This simplifies the implementation quite a bit, and removes the need for explicit string munging. One change is made to some of the enum elements of SPV_DimAttr to ensure that they are proper identifiers; The string form is now prefixed with 'Dim'. PiperOrigin-RevId: 278027132	2019-11-01 16:55:25 -07:00
River Riddle	68cfc89a0d	Refactor LinalgDialect::parseType to use the DialectAsmParser methods directly. This simplifies the implementation, and removes the need to do explicit string manipulation. A utility method 'parseDimensionList' is added to the DialectAsmParser to simplify defining types and attributes that contain shapes. PiperOrigin-RevId: 278020604	2019-11-01 16:14:10 -07:00
River Riddle	e94a8bfca8	Refactor QuantOps TypeParser to use the DialectAsmParser methods directly. This greatly simplifies the implementation and removes custom parser functionality. The necessary methods are added to the DialectAsmParser. PiperOrigin-RevId: 278015983	2019-11-01 15:47:03 -07:00
Lei Zhang	f143fbfa77	Add ReferToOp attribute constraint for SymbolRefAttr This constraint can be used to limit a SymbolRefAttr to point to a specific kind of op in the closest parent with a symbol table. PiperOrigin-RevId: 278001364	2019-11-01 14:26:36 -07:00
Nicolas Vasilache	e20a2aa9f2	Delete spurious file PiperOrigin-RevId: 277967079	2019-11-01 11:28:15 -07:00
Mahesh Ravishankar	9cbbd8f4df	Support lowering of imperfectly nested loops into GPU dialect. The current lowering of loops to GPU only supports lowering of loop nests where the loops mapped to workgroups and workitems are perfectly nested. Here a new lowering is added to handle lowering of imperfectly nested loop body with the following properties 1) The loops partitioned to workgroups are perfectly nested. 2) The loop body of the innermost loop partitioned to workgroups can contain one or more loop nests that are to be partitioned across workitems. Each individual loop nest partitioned to workitems should also be perfectly nested. 3) The number of workgroups and workitems are not deduced from the loop bounds but are passed in by the caller of the lowering as values. 4) For statements within the perfectly nested loop nest partitioned across workgroups that are not loops, it is valid to have all threads execute that statement. This is NOT verified. PiperOrigin-RevId: 277958868	2019-11-01 10:52:06 -07:00
Nicolas Vasilache	bd94a10c02	Add Linalg pattern for producer-consumer fusion This CL adds a simple pattern for specifying producer-consumer fusion on Linalg operations. Implementing such an extension reveals some interesting properties. Since Linalg operates on a buffer abstraction, the output buffers are specified as in/out parameters to the ops. As a consequence, there are no SSA use-def chains and one cannot specify complex dag input patterns with the current infrastructure. Instead this CL uses constraints based on the existing linalg dependence analysis to focus the pattern and refine patterns based on the type of op that last wrote in a buffer. This is a very local property and is less powerful than the generic dag specification based on SSA use-def chains. This will be generalized in the future. PiperOrigin-RevId: 277931503	2019-11-01 08:30:38 -07:00
James Molloy	96531e2f87	[mlir][llvm] Add missing cast ops Also adds a builder method for fcmp, identical to that for icmp. PiperOrigin-RevId: 277923158	2019-11-01 07:32:09 -07:00
Lei Zhang	7432234f3c	NFC: Use #ifndef in various .td files instead of #ifdef and #else Upstream LLVM gained support for #ifndef with https://reviews.llvm.org/D61888 This is changed mechanically via the following command: find . -name "*.td" -exec sed -i -e ':a' -e 'N' -e '$!ba' -e 's/#ifdef \([A-Z_]*\)\n#else/#ifndef \1/g' {} \; PiperOrigin-RevId: 277789427	2019-10-31 13:29:50 -07:00
Mehdi Amini	ce9477934a	Add a test for lowering GPU ops that cover cases where the symbol table isn't held by a ModuleOp (NFC) PiperOrigin-RevId: 277752004	2019-10-31 10:35:15 -07:00
Mehdi Amini	07b4ce7409	Add a test.symbol_scope operation that has the SymbolTable Traits to the Test dialect PiperOrigin-RevId: 277741687	2019-10-31 09:49:42 -07:00
Denis Khalikov	d423d4a338	[spirv] Add cast operations This CL added op definitions for a few cast operations: * OpConvertFToU * OpConvertFToS * OpConvertSToF * OpConvertUToF * OpUConvert * OpSConvert * OpFConvert Also moved the definition of spv.Bitcast to the new file. Closes tensorflow/mlir#208 and tensorflow/mlir#174 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/208 from denis0x0D:sandbox/cast_ops 79bc9b37398aafddee6cf6beb301807988fe67f9 PiperOrigin-RevId: 277587891	2019-10-30 14:53:04 -07:00
Lei Zhang	d024b68e6b	Use `not` to invert return code in expected to fail tests Windows does not like the RUN command of `(... \|\| true) \| ...`. PiperOrigin-RevId: 277587031	2019-10-30 14:38:18 -07:00
River Riddle	a32f0dcb5d	Add support to GreedyPatternRewriter for erasing unreachable blocks. Rewrite patterns may make modifications to the CFG, including dropping edges between blocks. This change adds a simple unreachable block elimination run at the end of each iteration to ensure that the CFG remains valid. PiperOrigin-RevId: 277545805	2019-10-30 11:19:24 -07:00
Lei Zhang	cb40e36d3b	Fix segfault when no symbol is given to a constraint operand This fixed the segfault when we see the following pattern: Pat<(...), (...), [(... 1, 2, 3), ...]> PiperOrigin-RevId: 277544300	2019-10-30 11:12:57 -07:00
Nicolas Vasilache	05a5a41416	Add basic support for declarative Linalg transformations Linalg ops provide a good anchor for pattern matching/rewriting transformations. This CL adds a simple example of how multi-level tiling may be specified by attaching a simple StringAttr to ops as they are transformed so we can easily specify partial lowering to control transformation application. This is a first stab at taking advantage of higher-level information contained in Linalg ops and will evolve in the future. PiperOrigin-RevId: 277497958	2019-10-30 07:12:33 -07:00
Lei Zhang	80213ba5f0	[spirv] Fix gen_spirv_dialect.py and add spv.Unreachable This CL fixed gen_spirv_dialect.py to support nested delimiters when chunking existing ODS entries in .td files and to allow ops without correspondence in the spec. This is needed to pull in the definition of OpUnreachable. PiperOrigin-RevId: 277486465	2019-10-30 05:41:18 -07:00
Lei Zhang	ca2538e9a7	[spirv] Support OpPhi using block arguments This CL adds another control flow instruction in SPIR-V: OpPhi. It is modelled as block arguments to be idiomatic with MLIR. See the rationale.md doc for "Block Arguments vs PHI nodes". Serialization and deserialization is updated to convert between block arguments and SPIR-V OpPhi instructions. PiperOrigin-RevId: 277161545	2019-10-28 15:58:42 -07:00
Sean Silva	66ec24d833	Parse locations in parseGenericOperation For ops that recursively re-enter the parser to parse an operation (such as ops with a "wraps" pretty form), this ensures that the wrapped op will parse its location, which can then be used for the locations of the wrapping op and any other implicit ops. PiperOrigin-RevId: 277152636	2019-10-28 15:11:26 -07:00
River Riddle	2f4d0c085a	Add support for marking an operation as recursively legal. In some cases, it may be desirable to mark entire regions of operations as legal. This provides an additional granularity of context to the concept of "legal". The `ConversionTarget` supports marking operations, that were previously added as `Legal` or `Dynamic`, as `recursively` legal. Recursive legality means that if an operation instance is legal, either statically or dynamically, all of the operations nested within are also considered legal. An operation can be marked via `markOpRecursivelyLegal<>`: ```c++ ConversionTarget &target = ...; /// The operation must first be marked as `Legal` or `Dynamic`. target.addLegalOp<MyOp>(...); target.addDynamicallyLegalOp<MySecondOp>(...); /// Mark the operation as always recursively legal. target.markOpRecursivelyLegal<MyOp>(); /// Mark optionally with a callback to allow selective marking. target.markOpRecursivelyLegal<MyOp, MySecondOp>([](Operation *op) { ... }); /// Mark optionally with a callback to allow selective marking. target.markOpRecursivelyLegal<MyOp>([](MyOp op) { ... }); ``` PiperOrigin-RevId: 277086382	2019-10-28 10:04:34 -07:00
Alexander Belyaev	780a108d31	Fix include guards and add tests for OpToFuncCallLowering. PiperOrigin-RevId: 276859463	2019-10-26 08:21:36 -07:00
Smit Hinsu	cde337cfde	Define AnyRankedTensor Type in TableGen PiperOrigin-RevId: 276714649	2019-10-25 10:31:56 -07:00
River Riddle	b69e8ee049	Add support for parsing multiple result name groups. This allows for parsing things like: %name_1, %name_2:5, %name_3:2 = "my.op" ... This is useful for operations that have groups of variadic result values. The total number of results is expected to match the number of results defined by the operation. PiperOrigin-RevId: 276703280	2019-10-25 09:34:02 -07:00
Denis Khalikov	dd2e444325	[spirv] AccessChainOp canonicalization. Combine chained `spirv::AccessChainOp` operations into one `spirv::AccessChainOp` operation. Closes tensorflow/mlir#198 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/198 from denis0x0D:sandbox/canon_access_chain 0cb87955a85511071143d62637ff939d0dabc2bd PiperOrigin-RevId: 276609345	2019-10-24 18:41:34 -07:00
River Riddle	2b61b7979e	Convert the Canonicalize and CSE passes to generic Operation Passes. This allows for them to be used on other non-function, or even other function-like, operations. The algorithms are already generic, so this is simply changing the derived pass type. The majority of this change is just ensuring that the nesting of these passes remains the same, as the pass manager won't auto-nest them anymore. PiperOrigin-RevId: 276573038	2019-10-24 15:01:09 -07:00
River Riddle	ef43b56538	Add support for replacing all uses of a symbol. This requires reconstructing the attribute dictionary of each operation containing a use. PiperOrigin-RevId: 276520544	2019-10-24 10:47:27 -07:00
River Riddle	21ee4e987f	Add @below and @above directives to verify-diagnostics. This simplifies defining expected-* directives when there are multiple that apply to the next or previous line. @below applies the directive to the next non-designator line, i.e. the next line that does not contain an expected-* designator. @above applies to the previous non designator line. Examples: // Expect an error on the next line that does not contain a designator. // expected-remark@below {{remark on function below}} // expected-remark@below {{another remark on function below}} func @bar(%a : f32) // Expect an error on the previous line that does not contain a designator. func @baz(%a : f32) // expected-remark@above {{remark on function above}} // expected-remark@above {{another remark on function above}} PiperOrigin-RevId: 276369085	2019-10-23 15:56:29 -07:00
Uday Bondhugula	ad6925f479	Update loop.for verifier message fix: nonnegative -> positive Closes tensorflow/mlir#206 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/206 from bondhugula:bondhugula-patch-1 9a47ca7dfd230180a9df33e9a64b33d02252d30a PiperOrigin-RevId: 276060885	2019-10-22 07:34:56 -07:00
Lei Zhang	020f9eb68c	[DRR] Allow interleaved operands and attributes Previously DRR assumes attributes to appear after operands. This was the previous requirements on ODS, but that has changed some time ago. Fix DRR to also support interleaved operands and attributes. PiperOrigin-RevId: 275983485	2019-10-21 20:48:17 -07:00
Lei Zhang	d9fe892e42	[spirv] Allow block arguments on spv.Branch(Conditional) We will use block arguments as the way to model SPIR-V OpPhi in the SPIR-V dialect. This CL also adds a few useful helper methods to both ops to get the block arguments. Also added tests for branch weight (de)serialization. PiperOrigin-RevId: 275960797	2019-10-21 17:32:00 -07:00
Alex Zinenko	5f867d26b4	Use LLVM_Type instead of AnyType in the definition of LLVM_CallOp The type constraint had to be relaxed due to the order of lowering passes in the examples, that since has been fixed. The relaxed version was still used by the CUDA lowering for launch sizes of `index` type. This is not necessary since the GPU dialect does not restrict the type of the launch size operands. Use an LLVM type instead and restore the check in the LLVM_CallOp definition. PiperOrigin-RevId: 275920109	2019-10-21 14:12:19 -07:00
River Riddle	4514cdd5eb	Cleanup and rewrite Ch-4.md. This change rewrites Ch-4.md to introduce interfaces in a detailed step-by-step manner, adds examples, and fixes some errors. PiperOrigin-RevId: 275887017	2019-10-21 11:32:39 -07:00
River Riddle	941a1c4332	NFC: Fix remaining usages of MulOp as matrix multiplication. MulOp now represents an element-wise multiplication instead of a matrix multiplication. PiperOrigin-RevId: 275886774	2019-10-21 11:31:32 -07:00
River Riddle	03d7be2aca	NFC: Elide the value of a UnitAttr within nested attribute dictionaries. This matches the behavior of the top level attribute dictionary. PiperOrigin-RevId: 275879828	2019-10-21 11:02:07 -07:00
River Riddle	9ac459e871	Add a Symbol trait to simplify defining operations that represent symbols. This trait provides accessors for the name, symbol use list methods, verification, with more to be added. PiperOrigin-RevId: 275864554	2019-10-21 09:58:59 -07:00

1715 Commits