llvm-project

Commit Graph

Author	SHA1	Message	Date
Andy Davis	1d798b1d27	[VectorOps] Add vector ReshapeOp to the VectorOps dialect. Adds vector ReshapeOp to the VectorOps dialect. An aggregate vector reshape operation, which aggregates multiple hardware vectors, can enable optimizations during decomposition (e.g. loading one input hardware vector and performing multiple rotate and scatter store operations to the vector output). PiperOrigin-RevId: 286440658	2019-12-19 12:27:59 -08:00
Aart Bik	15f800f4bc	[VectorOps] minor cleanup: vector dialect "subscripts" are i32 Introduces some centralized methods to move towards consistent use of i32 as vector subscripts. Note: sizes/strides/offsets attributes are still i64 PiperOrigin-RevId: 286434133	2019-12-19 11:51:08 -08:00
Aart Bik	d9b500d3bb	[VectorOps] Add vector.print definition, with lowering support Examples: vector.print %f : f32 vector.print %x : vector<4xf32> vector.print %y : vector<3x4xf32> vector.print %z : vector<2x3x4xf32> LLVM lowering replaces these with fully unrolled calls into a small runtime support library that provides some basic printing operations (single value, opening closing bracket, comma, newline). PiperOrigin-RevId: 286230325	2019-12-18 11:31:34 -08:00
Alex Zinenko	40ef46fba4	Harden the requirements to memory attribution types in gpu.func When memory attributions are present in `gpu.func`, require that they are of memref type and live in memoryspaces 3 and 5 for workgroup and private memory attributions, respectively. Adapt the conversion from the GPU dialect to the NVVM dialect to drop the private memory space from attributions as NVVM is able to model them as local `llvm.alloca`s in the default memory space. PiperOrigin-RevId: 286161763	2019-12-18 03:38:55 -08:00
Andy Davis	6fa3bd5b3e	Add pattern rewrite which splits a vector TransferWriteOp into slices according to the unrolling/slicing scheme of its InsertSlicesOp operand. PiperOrigin-RevId: 286042578	2019-12-17 13:17:10 -08:00
Mahesh Ravishankar	319cca3bbe	Add missing virtual inliner interface method in SPIR-V dialect. The inline interface uses two methods to check legality of inling: 1) Can a region be inlined into another. 2) Can an operation be inlined into another. Setting the former to true, allows the inliner to use the second for legality checks. Add this method to the SPIR-V dialect inlining interface. PiperOrigin-RevId: 286041734	2019-12-17 13:06:05 -08:00
Andy Davis	d1fb285b32	Add pattern rewrite to forward vector tuple elements to their users. User(TupleGetOp(ExtractSlicesOp(InsertSlicesOp(TupleOp(Producer))) -> User(Producer) PiperOrigin-RevId: 286020249	2019-12-17 11:21:45 -08:00
Andy Davis	038ad1d856	Add pattern rewrite which splits a vector TransferReadOp into slices according to the unrolling/slicing scheme of its ExtractSlicesOp user. PiperOrigin-RevId: 285975613	2019-12-17 07:29:06 -08:00
Andy Davis	4e825c59be	Update vector op unrolling transformation to generate ExtractSlicesOp and InsertSlicesOp (instead of less structured chain of StridedSliceOps and InsertStridedSliceOps). PiperOrigin-RevId: 285968051	2019-12-17 06:27:01 -08:00
Mahesh Ravishankar	80ec474a65	Add atomic operations to SPIR-V dialect. Some changes to the dialect generation script to allow specification of different base class to derive from in ODS. PiperOrigin-RevId: 285859230	2019-12-16 15:05:51 -08:00
Lei Zhang	659150b570	[spirv] Re-enable nested loop (de)serialization test PiperOrigin-RevId: 285849308	2019-12-16 14:21:52 -08:00
Andy Davis	11e92875f0	Add InsertSlicesOp to the VectorOps dialect. PiperOrigin-RevId: 285830394	2019-12-16 12:56:38 -08:00
Alex Zinenko	6273fa0c6a	Plug gpu.func into the GPU lowering pipelines This updates the lowering pipelines from the GPU dialect to lower-level dialects (NVVM, SPIRV) to use the recently introduced gpu.func operation instead of a standard function annotated with an attribute. In particular, the kernel outlining is updated to produce gpu.func instead of std.func and the individual conversions are updated to consume gpu.funcs and disallow standard funcs after legalization, if necessary. The attribute "gpu.kernel" is preserved in the generic syntax, but can also be used with the custom syntax on gpu.funcs. The special kind of function for GPU allows one to use additional features such as memory attribution. PiperOrigin-RevId: 285822272	2019-12-16 12:12:48 -08:00
Jose Ignacio Gomez	3ae56c4135	[Linalg] Expose subview promotion as a declarative pattern This PR targest issue tensorflow/mlir#295. It exposes the already existing subiew promotion pass as a declarative pattern Change-Id: If901ebef9fb53fcd0b12ecc536f6b174ce320b92 Closes tensorflow/mlir#315 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/315 from tetuante:issue295 8e5f268b6d85f31015c33505329dbd7a4db97ac5 PiperOrigin-RevId: 285801463	2019-12-16 10:50:45 -08:00
Aart Bik	cd5dab8ad7	[VectorOps] Add [insert/extract]element definition together with lowering to LLVM Similar to insert/extract vector instructions but (1) work on 1-D vectors only (2) allow for a dynamic index %c3 = constant 3 : index %0 = vector.insertelement %arg0, %arg1[%c : index] : vector<4xf32> %1 = vector.extractelement %arg0[%c3 : index] : vector<4xf32> PiperOrigin-RevId: 285792205	2019-12-16 09:52:46 -08:00
Andy Davis	73ec37c8bb	Adds ExtractSlicesOp to the VectorOps dialect. ExtractSlicesOp extracts slices of its vector operand and with a specified tiling scheme. This operation centralizes the tiling scheme around a single op, which simplifies vector op unrolling and subsequent pattern rewrite transformations. PiperOrigin-RevId: 285761129	2019-12-16 06:39:09 -08:00
Alexander Belyaev	1b579d998a	[Linalg] Add test for fusion of GenericOp with IndexedGenericOp. PiperOrigin-RevId: 285211797	2019-12-12 09:56:45 -08:00
Christian Sigg	9b85582682	Automated rollback of commit `f68ac464d8` PiperOrigin-RevId: 285162061	2019-12-12 03:48:38 -08:00
Christian Sigg	f68ac464d8	Switch from shfl.bfly to shfl.down. Both work for the current use case, but the latter allows implementing prefix sums and is a little easier to understand for partial warps. PiperOrigin-RevId: 285145287	2019-12-12 01:28:01 -08:00
Nicolas Vasilache	508d4e672e	Continue refactoring StructuredOps utilities This CL adds more common information to StructuredOpsUtils.h The n_view attribute is retired in favor of args_in + args_out but the CL is otherwise NFC. PiperOrigin-RevId: 285000621	2019-12-11 09:27:34 -08:00
Alexander Belyaev	bae8a7a724	[Linalg] Add tiling for IndexedGenericOp with a region. PiperOrigin-RevId: 284949355	2019-12-11 02:56:40 -08:00
Andy Davis	4d8ba88610	Add VectorOp transform pattern which splits vector TransferReadOps to target vector unroll size. PiperOrigin-RevId: 284880592	2019-12-10 17:02:51 -08:00
Nicolas Vasilache	995048d7b7	Fold TestLinalgTilePermutePatterns into TestLinalgTransformPatterns - NFC Centralize all patterns that test Linalg transforms in a single pass. PiperOrigin-RevId: 284835938	2019-12-10 13:26:15 -08:00
Jose Ignacio Gomez	b19fed5415	[Linalg] Add a Linalg iterator permutation transformation This patch closes issue tensorflow/mlir#272 We add a standalone iterator permutation transformation to Linalg. This transformation composes a permutation map with the maps in the "indexing_maps" attribute. It also permutes "iterator_types" accordingly. Change-Id: I7c1e693b8203aeecc595a7c012e738ca1100c857 Closes tensorflow/mlir#307 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/307 from tetuante:issue272 f7908d58792f4111119721885e247045104f1131 PiperOrigin-RevId: 284824102	2019-12-10 12:25:43 -08:00
Nicolas Vasilache	ad38e49806	Uniformize Vector transforms as patterns on the model of Linalg - NFC This reorganizes the vector transformations to be more easily testable as patterns and more easily composable into fused passes in the future. PiperOrigin-RevId: 284817474	2019-12-10 11:54:33 -08:00
Aart Bik	1fe65688d4	[VectorOps] Add a ShuffleOp to the VectorOps dialect For example %0 = vector.shuffle %x, %y [3 : i32, 2 : i32, 1 : i32, 0 : i32] : vector<2xf32>, vector<2xf32> yields a vector<4xf32> result with a permutation of the elements of %x and %y PiperOrigin-RevId: 284657191	2019-12-09 16:15:41 -08:00
Aart Bik	0e963b9c42	[VectorOps] Fix off-by-one error in insert/extract validation PiperOrigin-RevId: 284652653	2019-12-09 15:54:23 -08:00
Denis Khalikov	34265dad65	[spirv] Add CompositeConstruct operation. Closes tensorflow/mlir#308 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/308 from denis0x0D:sandbox/composite_construct 9ef7180f77f9374bcd05afc4f9e6c1d2d72d02b7 PiperOrigin-RevId: 284613617	2019-12-09 12:43:53 -08:00
Lei Zhang	2c7e8ed7c6	[spirv] Add spv.IAdd, spv.ISub, and spv.IMul folders The patterns to be folded away can be commonly generated during lowering to SPIR-V. PiperOrigin-RevId: 284604855	2019-12-09 11:59:10 -08:00
Kazuaki Ishizaki	ae05cf27c6	Minor spelling tweaks Closes tensorflow/mlir#304 PiperOrigin-RevId: 284568358	2019-12-09 09:23:48 -08:00
Nicolas Vasilache	91c0074624	[StructuredOps][Linalg] Add a primitive pattern to rewrite the linalg.generic form of matmul to vector form. This CL uses the newly expanded matcher support to easily detect when a linalg.generic has a multiply-accumulate body. A linalg.generic with such a body is rewritten as a vector contraction. This CL additionally limits the rewrite to the case of matrix multiplication on contiguous and statically shaped memrefs for now. Before expanding further, we should harden the infrastructure for expressing custom ops with the structured ops abstraction. PiperOrigin-RevId: 284566659	2019-12-09 09:14:39 -08:00
Aart Bik	d37f27251f	[VecOps] Rename vector.[insert\|extract]element to just vector.[insert\|extract] Since these operations lower to [insert\|extract][element\|value] at LLVM dialect level, neither element nor value would correctly reflect the meaning. PiperOrigin-RevId: 284240727	2019-12-06 12:39:25 -08:00
Aart Bik	b36aaeafb1	[VectorOps] Add lowering of vector.broadcast to LLVM IR For example, a scalar broadcast %0 = vector.broadcast %x : f32 to vector<2xf32> return %0 : vector<2xf32> which expands scalar x into vector [x,x] by lowering to the following LLVM IR dialect to implement the duplication over the leading dimension. %0 = llvm.mlir.undef : !llvm<"<2 x float>"> %1 = llvm.mlir.constant(0 : index) : !llvm.i64 %2 = llvm.insertelement %x, %0[%1 : !llvm.i64] : !llvm<"<2 x float>"> %3 = llvm.shufflevector %2, %0 [0 : i32, 0 : i32] : !llvm<"<2 x float>">, !llvm<"<2 x float>"> return %3 : vector<2xf32> In the trailing dimensions, the operand is simply "passed through", unless a more elaborate "stretch" is required. For example %0 = vector.broadcast %arg0 : vector<1xf32> to vector<4xf32> return %0 : vector<4xf32> becomes %0 = llvm.mlir.undef : !llvm<"<4 x float>"> %1 = llvm.mlir.constant(0 : index) : !llvm.i64 %2 = llvm.extractelement %arg0[%1 : !llvm.i64] : !llvm<"<1 x float>"> %3 = llvm.mlir.constant(0 : index) : !llvm.i64 %4 = llvm.insertelement %2, %0[%3 : !llvm.i64] : !llvm<"<4 x float>"> %5 = llvm.shufflevector %4, %0 [0 : i32, 0 : i32, 0 : i32, 0 : i32] : !llvm<"<4 x float>">, !llvm<"<4 x float>"> llvm.return %5 : !llvm<"<4 x float>"> PiperOrigin-RevId: 284219926	2019-12-06 11:02:29 -08:00
Andy Davis	41f8e105fa	Unroll vector masks along with their associated vector arguments. Updates vector ContractionOp to use proper vector masks (produced by CreateMaskOp/ConstantMaskOp). Leverages the following canonicalizations in unrolling unit test: CreateMaskOp -> ConstantMaskOp, StridedSliceOp(ConstantMaskOp) -> ConstantMaskOp Removes IndexTupleOp (no longer needed now that we have vector mask ops). Updates all unit tests. PiperOrigin-RevId: 284182168	2019-12-06 07:37:28 -08:00
Uday Bondhugula	3ade6a7d15	DimOp folding for alloc/view dynamic dimensions Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#253 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/253 from bondhugula:dimop a4b464f24ae63fd259114558d87e11b8ee4dae86 PiperOrigin-RevId: 284169689	2019-12-06 06:00:54 -08:00
Alex Zinenko	58adf99ed1	LLVM::AddressOfOp: properly take into account the address space The AddressOf operation in the LLVM dialect return a pointer to a global variable. The latter may be in a non-default address space as indicated by the "addr_space" attribute. Check that the address space of the pointer returned by AddressOfOp matches that of the referenced GlobalOp. Update the AddressOfOp builder to respect this constraint. PiperOrigin-RevId: 284138860	2019-12-06 01:09:13 -08:00
Jose Ignacio Gomez	f60bbb6c3b	[Linalg] Add permutation information to tiling This patch closes issue tensorflow/mlir#271. It adds an optional permutation map to declarative tiling transformations. The map is expressed as a list of integers. Closes tensorflow/mlir#288 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/288 from tetuante:issue271 2df2938d6a1f01b3bc404ded08dea2dd1e10b588 PiperOrigin-RevId: 284064151	2019-12-05 15:14:59 -08:00
Denis Khalikov	e67acfa468	[spirv] Add CompositeInsertOp operation A CompositeInsertOp operation make a copy of a composite object, while modifying one part of it. Closes tensorflow/mlir#292 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/292 from denis0x0D:sandbox/composite_insert 2200962b9057bda53cd2f2866b461e2797196380 PiperOrigin-RevId: 284036551	2019-12-05 13:10:44 -08:00
Lei Zhang	037044b0ae	Add spv.AtomicCompareExchangeWeak PiperOrigin-RevId: 283997917	2019-12-05 10:06:24 -08:00
Lei Zhang	c0a9de29ad	[spirv] Fix nested loop (de)serialization For serialization, when we have nested ops, the inner loop will create multiple SPIR-V blocks. If the outer loop has block arguments (which corresponds to OpPhi instructions), we defer the handling of OpPhi's parent block handling until we serialized all blocks and then fix it up with the result <id>. These two cases happening together was generating invalid SPIR-V blob because we previously assume the parent block to be the block containing the terminator. That is not true anymore when the block contains structured control flow ops. If that happens, it should be fixed to use the structured control flow op's merge block. For deserialization, we record a map from header blocks to their corresponding merge and continue blocks during the initial deserialization and then use the info to construct spv.selection/spv.loop. The existing implementation will also fall apart when we have nested loops. If so, we clone all blocks for the outer loop, including the ones for the inner loop, to the spv.loop's region. So the map for header blocks' merge info need to be updated; otherwise we are operating on already deleted blocks. PiperOrigin-RevId: 283949230	2019-12-05 04:39:37 -08:00
Andy Davis	d20d763241	Add canonicalization patterns for vector CreateMaskOp and StridedSliceOp to be used in the unroll vector op transformation. Adds a ConstantMaskOp to the vector ops dialect. Adds the following canonicalization patterns: CreateMaskOp -> ConstantMaskOp StridedSliceOp(ConstantMaskOp) -> ConstantMaskOp PiperOrigin-RevId: 283816752	2019-12-04 13:00:43 -08:00
Scott Todd	bf45ff6aab	[spirv] Adding sqrt op in the GLSL extension. PiperOrigin-RevId: 283769736	2019-12-04 09:16:23 -08:00
Lei Zhang	50b2b26e70	[spirv] Add spv.GroupNonUniformBallot This CL also did the following cleanup: - Moved the test for spv.SubgroupBallotKHR to its own file - Wrapped generated canonicalization patterns in anonymous namespace - Updated header comments in SPVOps.td PiperOrigin-RevId: 283650091	2019-12-03 16:44:09 -08:00
Andy Davis	2c13fd9f17	Add CreateMaskOp to the VectorOps dialect. PiperOrigin-RevId: 283591888	2019-12-03 11:55:54 -08:00
Alex Zinenko	993e79e9bd	Fix ViewOp to have at most one offset operand As described in the documentation, ViewOp is expected to take an optional dynamic offset followed by a list of dynamic sizes. However, the ViewOp parser did not include a check for the offset being a single value and accepeted a list of values instead. Furthermore, several tests have been exercising the wrong syntax of a ViewOp, passing multiple values to the dyanmic stride list, which was not caught by the parser. The trailing values could have been erronously interpreted as dynamic sizes. This is likely due to resyntaxing of the ViewOp, with the previous syntax taking the list of sizes before the offset. Update the tests to use the syntax with the offset preceding the sizes. Worse, the conversion of ViewOp to the LLVM dialect assumed the wrong order of operands with offset in the trailing position, and erronously relied on the permissive parsing that interpreted trailing dynamic offset values as leading dynamic sizes. Fix the lowering to use the correct order of operands. PiperOrigin-RevId: 283532506	2019-12-03 06:23:04 -08:00
Lei Zhang	1af9633d85	[spirv] Add spv.SubgroupBallotKHROp PiperOrigin-RevId: 283522284	2019-12-03 04:49:56 -08:00
Alex Zinenko	fdbb99cd62	Add linkage support to LLVMFuncOp A recent commit introduced the Linkage attribute to the LLVM dialect and used it in the Global Op. Also use it in LLVMFuncOp. As per LLVM Language Reference, if the linkage attribute is omitted, the function is assumed to have external linkage. PiperOrigin-RevId: 283493299	2019-12-03 00:26:44 -08:00
Aart Bik	3126004a5a	[VectorOps] Add legality rules to broadcast PiperOrigin-RevId: 283360101	2019-12-02 09:57:27 -08:00
Alexander Belyaev	9630fcbc52	Lower linalg.indexed_generic with libcall to LLVM. PiperOrigin-RevId: 283328994	2019-12-02 06:30:52 -08:00
Alex Zinenko	d5e627f84b	Introduce Linkage attribute to the LLVM dialect LLVM IR supports linkage on global objects such as global variables and functions. Introduce the Linkage attribute into the LLVM dialect, backed by an integer storage. Use this attribute on LLVM::GlobalOp and make it mandatory. Implement parsing/printing of the attribute and conversion to LLVM IR. See tensorflow/mlir#277. PiperOrigin-RevId: 283309328	2019-12-02 03:28:10 -08:00
Denis Khalikov	cd556f25de	[spirv] Check that operand of `spirv::CompositeExtractOp` is constant while folding. Closes tensorflow/mlir#281 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/281 from denis0x0D:sandbox/composite_ex_fold d02d73658bd1b9eaa515eb4e0aee34bc41d4252b PiperOrigin-RevId: 282971563	2019-11-28 13:27:56 -08:00
Jose Ignacio Gomez	0494ef60f7	[Linalg] Change attribute n_loop_types to iterator This addresses issue tensorflow/mlir#270. Linalg is updated to take the same form of iterator_types than vector contraction. Closes tensorflow/mlir#280 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/280 from tetuante:PRissue270 d26d88d090d3765d3b9884bfabdd023143f27287 PiperOrigin-RevId: 282905396	2019-11-28 01:59:55 -08:00
Lei Zhang	d4e4387fbf	[spirv] Add folders for spv.IAdd and spv.IMul Adding zero and multiplying one can be common when generating code for index calculation. This CL also sorted canonicalize.mlir to alphabetical order. PiperOrigin-RevId: 282828055	2019-11-27 13:46:52 -08:00
Nicolas Vasilache	1fa8c8070b	Implement Linalg to loops lowering as a pattern This CL rewrites the linalg ops to loops transformations as patterns that can be targeted directly from Tablegen. Reliance on OpFolder is removed and to cope with it we introduce local folding patterns that are applied greedily. PiperOrigin-RevId: 282765550	2019-11-27 07:32:13 -08:00
Aart Bik	e2232fbcee	[VectorOps] Refine BroadcastOp in VectorOps dialect Since second argument is always fully overwritten and shape is define in "to" clause, it is not needed. Also renamed "into" to "to" now that arg is dropped. PiperOrigin-RevId: 282686475	2019-11-26 19:52:38 -08:00
Aart Bik	cf97263cb8	[VectorOps] Add a BroadcastOp to the VectorOps dialect PiperOrigin-RevId: 282643305	2019-11-26 14:43:31 -08:00
Mahesh Ravishankar	03620fa70a	Misc changes to lowering to SPIR-V. These changes to SPIR-V lowering while adding support for lowering SUbViewOp, but are not directly related. - Change the lowering of MemRefType to !spv.ptr<!spv.struct<!spv.array<...>[offset]>, ..> This is consistent with the Vulkan spec. - To enable testing a simple pattern of lowering functions is added to ConvertStandardToSPIRVPass. This is just used to convert the type of the arguments of the function. The added function lowering itself is not meant to be the way functions are eventually lowered into SPIR-V dialect. PiperOrigin-RevId: 282589644	2019-11-26 10:11:34 -08:00
Nicolas Vasilache	36469f7d2a	Add a vector.InsertStridedSliceOp This new op is the counterpart of vector.StridedSliceOp and will be used for in the pattern rewrites for vector unrolling. PiperOrigin-RevId: 282447414	2019-11-25 15:37:13 -08:00
MLIR Team	1012c492f0	Allow LLVM::ExtractElementOp to have non-i32 indices. Also change the text format a bit, so that indices are braced by squares. PiperOrigin-RevId: 282437095	2019-11-25 14:44:52 -08:00
Andy Davis	8fc44a4d13	Update VectorContractionOp to take iterator types and index mapping attributes compatible with linalg ops. PiperOrigin-RevId: 282412311	2019-11-25 12:40:00 -08:00
Mahesh Ravishankar	bd485afda0	Introduce attributes that specify the final ABI for a spirv::ModuleOp. To simplify the lowering into SPIR-V, while still respecting the ABI requirements of SPIR-V/Vulkan, split the process into two 1) While lowering a function to SPIR-V (when the function is an entry point function), allow specifying attributes on arguments and function itself that describe the ABI of the function. 2) Add a pass that materializes the ABI described in the function. Two attributes are needed. 1) Attribute on arguments of the entry point function that describe the descriptor_set, binding, storage class, etc, of the spv.globalVariable this argument will be replaced by 2) Attribute on function that specifies workgroup size, etc. (for now only workgroup size). Add the pass -spirv-lower-abi-attrs to materialize the ABI described by the attributes. This change makes the SPIRVBasicTypeConverter class unnecessary and is removed, further simplifying the SPIR-V lowering path. PiperOrigin-RevId: 282387587	2019-11-25 11:19:56 -08:00
Nicolas Vasilache	01145544aa	Add vector.insertelement op This is the counterpart of vector.extractelement op and has the same limitations at the moment (static I64IntegerArrayAttr to express position). This restriction will be filterd in the future. LLVM lowering will be added in a subsequent commit. PiperOrigin-RevId: 282365760	2019-11-25 08:47:15 -08:00
Alex Zinenko	bf4692dc49	Introduce gpu.func Introduce a new function-like operation to the GPU dialect to provide a placeholder for the execution semantic description and to add support for GPU memory hierarchy. This aligns with the overall goal of the dialect to expose the common abstraction layer for GPU devices, in particular by providing an MLIR unit of semantics (i.e. an operation) for memory modeling. This proposal has been discussed in the mailing list: https://groups.google.com/a/tensorflow.org/d/msg/mlir/RfXNP7Hklsc/MBNN7KhjAgAJ As decided, the "convergence" aspect of the execution model will be factored out into a new discussion and therefore is not included in this commit. This commit only introduces the operation but does not hook it up with the remaining flow. The intention is to develop the new flow while keeping the old flow operational and do the switch in a simple, separately reversible commit. PiperOrigin-RevId: 282357599	2019-11-25 08:10:37 -08:00
Denis Khalikov	a5cda4763f	[spirv] Add a canonicalizer for `spirv::LogicalNotOp`. Add a canonicalizer for `spirv::LogicalNotOp`. Converts: * spv.LogicalNot(spv.IEqual(...)) -> spv.INotEqual(...) * spv.LogicalNot(spv.INotEqual(...)) -> spv.IEqual(...) * spv.LogicalNot(spv.LogicalEqual(...)) -> spv.LogicalNotEqual(...) * spv.LogicalNot(spv.LogicalNotEqual(...)) -> spv.LogicalEqual(...) Also moved the test for spv.IMul to arithemtic tests. Closes tensorflow/mlir#256 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/256 from denis0x0D:sandbox/canon_logical_not 76ab5787b2c777f948c8978db061d99e76453d44 PiperOrigin-RevId: 282012356	2019-11-22 12:25:52 -08:00
Alex Zinenko	b5af3784a6	Don't force newline before function attributes Due to legacy reasons, a newline character followed by two spaces was always inserted before the attributes of the function Op in pretty form. This breaks formatting when functions are nested in some other operations. Don't print the newline and just put the attributes on the same line, which is also more consistent with module Op. Line breaking aware of indentation can be introduced separately into the parser if deemed useful. PiperOrigin-RevId: 281721793	2019-11-21 05:08:19 -08:00
Andy Davis	d6a70b31be	Add VectorContractionOp to the VectorOps dialect. PiperOrigin-RevId: 281605471	2019-11-20 14:53:57 -08:00
Stephan Herhut	abb626686d	Extend kernel outlining to also consider dim worth inlining. PiperOrigin-RevId: 281483447	2019-11-20 02:59:35 -08:00
Nicolas Vasilache	ee95f6f259	Add VectorOps.StridedSliceOp The `vector.strided_slice` takes an n-D vector, k-D `offsets` integer array attribute, a k-D `sizes` integer array attribute, a k-D `strides` integer array attribute and extracts the n-D subvector at the proper offset. Returns an n-D vector where the first k-D dimensions match the `sizes` attribute. The returned subvector contains the elements starting at offset `offsets` and ending at `offsets + sizes`. Example: ``` %1 = vector.strided_slice %0 {offsets : [0, 2], sizes : [2, 4], strides : [1, 1]}: vector<4x8x16xf32> // returns a vector<2x4x16xf32> ``` This op will be useful for progressive lowering within the VectorOp dialect. PiperOrigin-RevId: 281352749	2019-11-19 12:22:34 -08:00
Hanhan Wang	c614c92fdc	Support SPIR-V constant op to take DenseElementsAttr as input. Iterates each element to build the array. This includes a little refactor to combine bool/int/float into a function, since they are similar. The only difference is calling different function in the end. PiperOrigin-RevId: 281210288	2019-11-18 20:02:05 -08:00
Alexander Belyaev	8c6a5233d5	Lower linalg.indexed_generic to loops. PiperOrigin-RevId: 281169885	2019-11-18 16:55:15 -08:00
River Riddle	9873a29817	Add a parseAttribute<AttrType> overload for the non-type case. The variant that accepts a type will check that the parsed attribute is a valid instance of AttrType. The non-type variant would silently fail in this case, leading to garbage attribute values. PiperOrigin-RevId: 281136528	2019-11-18 13:11:36 -08:00
Denis Khalikov	6c77e59bfd	[spirv] Add a canonicalizer for BitcastOp. Convert chained `spirv::BitcastOp` operations into one `spirv::BitcastOp` operation. Closes tensorflow/mlir#238 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/238 from denis0x0D:sandbox/canon_bitcast 4352ed4f81b959ec92f849c599e733b62a99c010 PiperOrigin-RevId: 281129234	2019-11-18 12:37:00 -08:00
Denis Khalikov	68e48ba111	[spirv] Add bit ops This CL added op definitions for a few bit operations: * OpBitFieldInsert * OpBitFieldSExtract * OpBitFieldUExtract Closes tensorflow/mlir#233 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/233 from denis0x0D:sandbox/bit_field_ops e7fd85b00d72d483d7992dc42b9cc4d673903455 PiperOrigin-RevId: 280691816	2019-11-15 11:03:19 -08:00
Nicolas Vasilache	f2b6ae9991	Move VectorOps to Tablegen - (almost) NFC This CL moves VectorOps to Tablegen and cleans up the implementation. This is almost NFC but 2 changes occur: 1. an interface change occurs in the padding value specification in vector_transfer_read: the value becomes non-optional. As a shortcut we currently use %f0 for all paddings. This should become an OpInterface for vectorization in the future. 2. the return type of vector.type_cast is trivial and simplified to `memref<vector<...>>` Relevant roundtrip and invalid tests that used to sit in core are moved to the vector dialect. The op documentation is moved to the .td file. PiperOrigin-RevId: 280430869	2019-11-14 08:15:23 -08:00
Nicolas Vasilache	0bd6390b54	Deprecate linalg.subview in favor of std.subview This CL uses the now standard std.subview in linalg. Two shortcuts are currently taken to allow this port: 1. the type resulting from a view is currently degraded to fully dynamic to pass the SubViewOp verifier. 2. indexing into SubViewOp may access out of bounds since lowering to LLVM does not currently enforce it by construction. These will be fixed in subsequent commits after discussions. PiperOrigin-RevId: 280250129	2019-11-13 12:10:09 -08:00
Hanhan Wang	85d7fb3324	Make VariableOp instructions be in the first block in the function. Since VariableOp is serialized during processBlock, we add two more fields, `functionHeader` and `functionBody`, to collect instructions for a function. After all the blocks have been processed, we append them to the `functions`. Also, fix a bug in processGlobalVariableOp. The global variables should be encoded into `typesGlobalValues`. PiperOrigin-RevId: 280105366	2019-11-12 18:59:15 -08:00
Lei Zhang	b259c26eb0	Add support for OpPhi in loop header block During deserialization, the loop header block will be moved into the spv.loop's region. If the loop header block has block arguments, we need to make sure it is correctly carried over to the block where the new spv.loop resides. During serialization, we need to make sure block arguments from the spv.loop's entry block are not silently dropped. PiperOrigin-RevId: 280021777	2019-11-12 12:00:28 -08:00
Nicolas Vasilache	f51a155337	Add support for alignment attribute in std.alloc. This CL adds an extra pointer to the memref descriptor to allow specifying alignment. In a previous implementation, we used 2 types: `linalg.buffer` and `view` where the buffer type was the unit of allocation/deallocation/alignment and `view` was the unit of indexing. After multiple discussions it was decided to use a single type, which conflates both, so the memref descriptor now needs to carry both pointers. This is consistent with the [RFC-Proposed Changes to MemRef and Tensor MLIR Types](https://groups.google.com/a/tensorflow.org/forum/#!searchin/mlir/std.view%7Csort:date/mlir/-wKHANzDNTg/4K6nUAp8AAAJ). PiperOrigin-RevId: 279959463	2019-11-12 07:06:54 -08:00
MLIR Team	9fbf52e330	Look for SymbolRefAttr in KernelOutlining instead of hard-coding CallOp This code should be exercised using the existing kernel outlining unit test, but let me know if I should add a dedicated unit test using a fake call instruction as well. PiperOrigin-RevId: 279436321	2019-11-08 19:13:13 -08:00
Denis Khalikov	4697d657b7	[spirv] Add bit ops This CL added op definitions for a few bit operations: * OpShiftLeftLogical * OpShiftRightArithmetic * OpShiftRightLogical * OpBitCount * OpBitReverse * OpNot Also moved the definition of spv.BitwiseAnd to follow the lexicographical order. Closes tensorflow/mlir#215 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/215 from denis0x0D:sandbox/bit_ops d9b0852b689ac6c4879a9740b1740a2357f44d24 PiperOrigin-RevId: 279350470	2019-11-08 11:17:05 -08:00
Andy Davis	8f00b4494d	Swap operand order in std.view operation so that offset appears before dynamic sizes in the operand list. PiperOrigin-RevId: 279114236	2019-11-07 10:20:23 -08:00
Nicolas Vasilache	72040bf7c8	Update Linalg to use std.view Now that a view op has graduated to the std dialect, we can update Linalg to use it and remove ops that have become obsolete. As a byproduct, the linalg buffer and associated ops can also disappear. PiperOrigin-RevId: 279073591	2019-11-07 06:33:10 -08:00
Alexander Belyaev	eee9cbdeb7	Add IndexedGenericOp to Linalg. PiperOrigin-RevId: 279013404	2019-11-06 22:36:25 -08:00
Eric Schweitz	0d545921ea	Add support for the LLVM FNeg instruction Closes tensorflow/mlir#216 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/216 from schweitzpgi:llvmir-fneg-op f9b5f185845d671b745ab6fc213d5d9aff044b34 PiperOrigin-RevId: 278795325	2019-11-06 00:02:10 -08:00
James Molloy	250a11ae0f	[llvm] Allow GlobalOp to take a region for complex initializers This allows GlobalOp to either take a value attribute (for simple constants) or a region that can contain IR instructions (that must be constant-foldable) to create a ConstantExpr initializer. Example: // A complex initializer is constructed with an initializer region. llvm.mlir.global constant @int_gep() : !llvm<"i32"> { %0 = llvm.mlir.addressof @g2 : !llvm<"i32"> %1 = llvm.mlir.constant(2 : i32) : !llvm.i32 %2 = llvm.getelementptr %0[%1] : (!llvm<"i32">, !llvm.i32) -> !llvm<"i32"> llvm.return %2 : !llvm<"i32*"> } PiperOrigin-RevId: 278717836	2019-11-05 15:11:01 -08:00
MLIR Team	1f43d0d000	[NVVM] Add mma.sync operation. PiperOrigin-RevId: 278440547	2019-11-04 12:36:37 -08:00
River Riddle	e4a912eb5a	Update the SPV dialect type parser to use the methods on DialectAsmParser directly. This simplifies the implementation quite a bit, and removes the need for explicit string munging. One change is made to some of the enum elements of SPV_DimAttr to ensure that they are proper identifiers; The string form is now prefixed with 'Dim'. PiperOrigin-RevId: 278027132	2019-11-01 16:55:25 -07:00
River Riddle	68cfc89a0d	Refactor LinalgDialect::parseType to use the DialectAsmParser methods directly. This simplifies the implementation, and removes the need to do explicit string manipulation. A utility method 'parseDimensionList' is added to the DialectAsmParser to simplify defining types and attributes that contain shapes. PiperOrigin-RevId: 278020604	2019-11-01 16:14:10 -07:00
River Riddle	e94a8bfca8	Refactor QuantOps TypeParser to use the DialectAsmParser methods directly. This greatly simplifies the implementation and removes custom parser functionality. The necessary methods are added to the DialectAsmParser. PiperOrigin-RevId: 278015983	2019-11-01 15:47:03 -07:00
Nicolas Vasilache	e20a2aa9f2	Delete spurious file PiperOrigin-RevId: 277967079	2019-11-01 11:28:15 -07:00
Nicolas Vasilache	bd94a10c02	Add Linalg pattern for producer-consumer fusion This CL adds a simple pattern for specifying producer-consumer fusion on Linalg operations. Implementing such an extension reveals some interesting properties. Since Linalg operates on a buffer abstraction, the output buffers are specified as in/out parameters to the ops. As a consequence, there are no SSA use-def chains and one cannot specify complex dag input patterns with the current infrastructure. Instead this CL uses constraints based on the existing linalg dependence analysis to focus the pattern and refine patterns based on the type of op that last wrote in a buffer. This is a very local property and is less powerful than the generic dag specification based on SSA use-def chains. This will be generalized in the future. PiperOrigin-RevId: 277931503	2019-11-01 08:30:38 -07:00
James Molloy	96531e2f87	[mlir][llvm] Add missing cast ops Also adds a builder method for fcmp, identical to that for icmp. PiperOrigin-RevId: 277923158	2019-11-01 07:32:09 -07:00
Denis Khalikov	d423d4a338	[spirv] Add cast operations This CL added op definitions for a few cast operations: * OpConvertFToU * OpConvertFToS * OpConvertSToF * OpConvertUToF * OpUConvert * OpSConvert * OpFConvert Also moved the definition of spv.Bitcast to the new file. Closes tensorflow/mlir#208 and tensorflow/mlir#174 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/208 from denis0x0D:sandbox/cast_ops 79bc9b37398aafddee6cf6beb301807988fe67f9 PiperOrigin-RevId: 277587891	2019-10-30 14:53:04 -07:00
Nicolas Vasilache	05a5a41416	Add basic support for declarative Linalg transformations Linalg ops provide a good anchor for pattern matching/rewriting transformations. This CL adds a simple example of how multi-level tiling may be specified by attaching a simple StringAttr to ops as they are transformed so we can easily specify partial lowering to control transformation application. This is a first stab at taking advantage of higher-level information contained in Linalg ops and will evolve in the future. PiperOrigin-RevId: 277497958	2019-10-30 07:12:33 -07:00
Lei Zhang	80213ba5f0	[spirv] Fix gen_spirv_dialect.py and add spv.Unreachable This CL fixed gen_spirv_dialect.py to support nested delimiters when chunking existing ODS entries in .td files and to allow ops without correspondence in the spec. This is needed to pull in the definition of OpUnreachable. PiperOrigin-RevId: 277486465	2019-10-30 05:41:18 -07:00
Lei Zhang	ca2538e9a7	[spirv] Support OpPhi using block arguments This CL adds another control flow instruction in SPIR-V: OpPhi. It is modelled as block arguments to be idiomatic with MLIR. See the rationale.md doc for "Block Arguments vs PHI nodes". Serialization and deserialization is updated to convert between block arguments and SPIR-V OpPhi instructions. PiperOrigin-RevId: 277161545	2019-10-28 15:58:42 -07:00
Denis Khalikov	dd2e444325	[spirv] AccessChainOp canonicalization. Combine chained `spirv::AccessChainOp` operations into one `spirv::AccessChainOp` operation. Closes tensorflow/mlir#198 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/198 from denis0x0D:sandbox/canon_access_chain 0cb87955a85511071143d62637ff939d0dabc2bd PiperOrigin-RevId: 276609345	2019-10-24 18:41:34 -07:00
River Riddle	2b61b7979e	Convert the Canonicalize and CSE passes to generic Operation Passes. This allows for them to be used on other non-function, or even other function-like, operations. The algorithms are already generic, so this is simply changing the derived pass type. The majority of this change is just ensuring that the nesting of these passes remains the same, as the pass manager won't auto-nest them anymore. PiperOrigin-RevId: 276573038	2019-10-24 15:01:09 -07:00
Uday Bondhugula	ad6925f479	Update loop.for verifier message fix: nonnegative -> positive Closes tensorflow/mlir#206 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/206 from bondhugula:bondhugula-patch-1 9a47ca7dfd230180a9df33e9a64b33d02252d30a PiperOrigin-RevId: 276060885	2019-10-22 07:34:56 -07:00
Lei Zhang	d9fe892e42	[spirv] Allow block arguments on spv.Branch(Conditional) We will use block arguments as the way to model SPIR-V OpPhi in the SPIR-V dialect. This CL also adds a few useful helper methods to both ops to get the block arguments. Also added tests for branch weight (de)serialization. PiperOrigin-RevId: 275960797	2019-10-21 17:32:00 -07:00

1 2 3 4 5 ...

276 Commits