llvm-project

Commit Graph

Author	SHA1	Message	Date
Feng Liu	cf0a782339	Remove the constraint that min / max should stride zero Since we apply nudging for the zero point to make sure the nudged zerop points can be in the range of [qmin, qmax], the constraint that rmin / rmax should stride zero isn't necessary. This also matches the documentation of tensorflow's FakeQuantWithMinMaxArgs op, where min and max don't need to stride zero: https://www.tensorflow.org/api_docs/python/tf/quantization/fake_quant_with_min_max_args PiperOrigin-RevId: 268296285	2019-09-10 13:26:46 -07:00
Feng Liu	c68d5467d6	Convert ConstFakeQuantPerAxis to qcast and dcast pair This is also to add the test to the fakeQuantAttrsToType for per-channel fake quant. PiperOrigin-RevId: 268260032	2019-09-10 10:50:57 -07:00
Feng Liu	f4ae4762bf	Add quant.const_fake_quant_per_axis op Comparing to the existing quant.const_fake_quant op, the min and max attributes of this new op is for each channel of last dimension of the input. PiperOrigin-RevId: 268093722	2019-09-09 15:42:37 -07:00
MLIR Team	5e65dafbfa	Add warpsize and laneid intrinsics to NVVM dialect. PiperOrigin-RevId: 268041263	2019-09-09 11:38:03 -07:00
River Riddle	e702875d16	Add support for coalescing adjacent nested pass pipelines. This allows for parallelizing across pipelines of multiple operation types. AdaptorPasses can now hold pass managers for multiple operation types and will dispatch based upon the operation being operated on. PiperOrigin-RevId: 268017344	2019-09-09 09:52:25 -07:00
Stephan Herhut	318ff019cf	Addressing some late review comments on kernel inlining. Just formatting and better lit tests, no functional change. PiperOrigin-RevId: 267942907	2019-09-09 01:15:47 -07:00
Mehdi Amini	42b60d34fc	Add `parseGenericOperation()` to the OpAsmParser This method parses an operation in its generic form, from the current parser state. This is the symmetric of OpAsmPrinter::printGenericOp(). An immediate use case is illustrated in the test dialect, where an operation wraps another one in its region and makes use of a single-line pretty-print form. PiperOrigin-RevId: 267930869	2019-09-08 23:40:12 -07:00
River Riddle	120509a6b2	Refactor PassTiming to support nested pipelines. This is done via a new set of instrumentation hooks runBeforePipeline/runAfterPipeline, that signal the lifetime of a pass pipeline on a specific operation type. These hooks also provide the parent thread of the pipeline, allowing for accurate merging of timers running on different threads. PiperOrigin-RevId: 267909193	2019-09-08 19:58:13 -07:00
Mehdi Amini	cbb6f09ce8	Use "final" instead of marking method virtual in override (NFC) This is the only example for overriding this interface in the repo, let's try to make it right as it may be taken as a reference when implemented in other dialects PiperOrigin-RevId: 267811123	2019-09-07 18:57:19 -07:00
Nagy Mostafa	8154370b49	Add custom builder for AffineIfOp Closes tensorflow/mlir#109 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/109 from nmostafa:nmostafa/AffineIfOp 7dbf2115f0092ffab26381ea8704aa05a0253971 PiperOrigin-RevId: 267633077	2019-09-06 11:03:03 -07:00
Nicolas Vasilache	1b8eff8fcd	Simplify Linalg ABI integration with external function calls. View descriptors are converted to pointer to LLVM struct to avoid ABI issues related to C struct packing. This creates unnecessary complexity and hampers unification with memrefs. Instead, this CL makes view descriptors convert to LLVM struct (as it was originally) and promotes all structs to pointers right before calling an external function. PiperOrigin-RevId: 267602693	2019-09-06 08:31:19 -07:00
Uday Bondhugula	854a384f50	Integer set + operands / affine if op canonicalization - turn canonicalizeMapAndOperands into a template that works on both sets and maps, and use it to introduce a utility to canonicalize an affine integer set and its operands - add pattern to canonicalize affine if op's. - rename IntegerSet::getNumOperands -> IntegerSet::getNumInputs to be consistent with AffineMap - add missing accessors for IntegerSet Doesn't need extensive testing since canonicalizeSetAndOperands just reuses canonicalizeMapAndOperands' logic, and the latter is tested on affine.apply map + operands; the new method works the same way on an integer set + operands of an affine if op for example. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#112 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/112 from bondhugula:set-canonicalize eff72f23250b96fa7d9f5caff3877440f5de2cec PiperOrigin-RevId: 267532876	2019-09-05 23:12:35 -07:00
River Riddle	85bc4889b3	Add support for conservatively inlining Affine operations. This commit defines an initial implementation of the DialectInlinerInterface for the AffineOps dialect. This change allows for affine operations to be inlined into any region that is not an affine region. Inlining into affine regions requires special handling for dimension/symbol identifiers that will be added in followups. PiperOrigin-RevId: 267467078	2019-09-05 15:20:25 -07:00
Lei Zhang	916eb980b0	[spirv] Add spv.loop SPIR-V can explicitly declare structured control-flow constructs using merge instructions. These explicitly declare a header block before the control flow diverges and a merge block where control flow subsequently converges. These blocks delimit constructs that must nest, and can only be entered and exited in structured ways. Instead of having a `spv.LoopMerge` op to directly model loop merge instruction for indicating the merge and continue target, we use regions to delimit the boundary of the loop: the merge target is the next op following the `spv.loop` op and the continue target is the block that has a back-edge pointing to the entry block inside the `spv.loop`'s region. This way it's easier to discover all blocks belonging to a construct and it plays nicer with the MLIR system. Updated the SPIR-V.md doc. PiperOrigin-RevId: 267431010	2019-09-05 12:45:53 -07:00
River Riddle	0ba0087887	Add the initial inlining infrastructure. This defines a set of initial utilities for inlining a region(or a FuncOp), and defines a simple inliner pass for testing purposes. A new dialect interface is defined, DialectInlinerInterface, that allows for dialects to override hooks controlling inlining legality. The interface currently provides the following hooks, but these are just premilinary and should be changed/added to/modified as necessary: * isLegalToInline - Determine if a region can be inlined into one of this dialect, or if an operation of this dialect can be inlined into a given region. * shouldAnalyzeRecursively - Determine if an operation with regions should be analyzed recursively for legality. This allows for child operations to be closed off from the legality checks for operations like lambdas. * handleTerminator - Process a terminator that has been inlined. This cl adds support for inlining StandardOps, but other dialects will be added in followups as necessary. PiperOrigin-RevId: 267426759	2019-09-05 12:24:13 -07:00
Stephan Herhut	7eb25cd367	Make GPU kernel outlining test independent of value names. PiperOrigin-RevId: 267323604	2019-09-05 01:46:26 -07:00
Smit Hinsu	33ac6f043b	Generalize I32ElementsAttr definition and introduce I64ElementsAttr Also, fix constBuilderCall to return attribute of the storage class DenseIntElementsAttr PiperOrigin-RevId: 267305813	2019-09-04 23:16:01 -07:00
MLIR Team	b5652720c1	Retain address space during MLIR > LLVM conversion. PiperOrigin-RevId: 267206460	2019-09-04 12:26:52 -07:00
Alex Zinenko	c6f8adad8e	Move LLVMIR dialect tests from test/LLVMIR to test/Dialect and test/Conversion This follows up on the recent restructuring that moved the dialects under lib/Dialect and inter-dialect conversions to lib/Conversion. Originally, the tests for both the LLVMIR dialect itself and the conversion from Standard to LLVMIR dialect lived under test/LLVMIR. This no longer reflects the code structure. Move the tests to either test/Dialect/LLVMIR or test/Conversion/StandardToLLVM depending on the features they exercise. PiperOrigin-RevId: 267159219	2019-09-04 08:38:18 -07:00
Uday Bondhugula	8c9dc690eb	pipeline-data-transfer: remove dead tag alloc's and improve test coverage for replaceMemRefUsesWith / pipeline-data-transfer - address remaining comments from PR tensorflow/mlir#87 for better test coverage for pipeline-data-transfer/replaceAllMemRefUsesWith - remove dead tag allocs the same way they are removed for the replaced buffers Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#106 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/106 from bondhugula:followup 9e868666d047e8d43e5f82f43e4093b838c710fa PiperOrigin-RevId: 267144774	2019-09-04 06:59:09 -07:00
Alex Zinenko	6395229509	Move Linalg dialect tests to test/Dialect/Linalg This was missing from the commit that moved the Linalg dialect to lib/Dialect. PiperOrigin-RevId: 267141176	2019-09-04 06:33:23 -07:00
Stephan Herhut	dfd06af562	Make GPU kernel outlining inline constants. It is generally beneficial to pass less arguments to a kernel, so cloning constants into the kernel is beneficial. PiperOrigin-RevId: 267139084	2019-09-04 06:16:07 -07:00
MLIR Team	2f13df13b0	Add support for array-typed constants. PiperOrigin-RevId: 267121729	2019-09-04 03:46:06 -07:00
Alex Zinenko	71d27dfc3b	Update the syntax of splat attribute in LLVM.md The syntax for splat attributes changed, but was not updated in the description of the LLVM dialect constant operations in LLVM.md. Update the document to use the correct syntax. Also add a dialect roundtrip test for such attribute, which was previously missing. PiperOrigin-RevId: 267116305	2019-09-04 03:01:54 -07:00
Nicolas Vasilache	0c8ad3aafb	Properly clone Linalg ops with regions This CL adds support for proper cloning of Linalg ops that have regions (i.e. the generic linalg op). This is used to properly implement tiling and fusion for such ops. Adequate tests are added. PiperOrigin-RevId: 267027176	2019-09-03 15:28:47 -07:00
Uday Bondhugula	54d674f51e	Utility to normalize memrefs with non-identity layout maps - introduce utility to convert memrefs with non-identity layout maps to ones with identity layout maps: convert the type and rewrite/remap all its uses - add this utility to -simplify-affine-structures pass for testing purposes Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#104 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/104 from bondhugula:memref-normalize f2c914aa1890e8860326c9e33f9aa160b3d65e6d PiperOrigin-RevId: 266985317	2019-09-03 12:14:28 -07:00
Lei Zhang	5593e005c6	Add folding rule and dialect materialization hook for spv.constant This will allow us to use MLIR's folding infrastructure to deduplicate SPIR-V constants. This CL also changed isValidSPIRVType in SPIRVDialect to a static method. PiperOrigin-RevId: 266984403	2019-09-03 12:09:58 -07:00
Uday Bondhugula	b1ef9dc22c	Fix affine data copy generation corner cases/bugs - the [begin, end) range identified for copying could end in between the block, which makes hoisting invalid in some cases. Change the range identification to always end with end of block. - add test case to exercise these (with fast mem capacity set to minimal so that single element memref buffers are generated at the innermost loop) - the location of begin/end of the block range for data copying was being confused with the insert points for copy in and copy out code. In cases, where we choose to hoist transfers, these are separate. - when copy loops are single iteration ones, promote their bodies at the end of the pass. - change default fast mem space to 1 (setting it to zero made it generate DMA op's that won't verify in the default case - since the DMA ops have a check for src/dest memref spaces being different). Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Co-Authored-By: Mehdi Amini <joker.eph@gmail.com> Closes tensorflow/mlir#88 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/88 from bondhugula:datacopy 88697267c45e850c3ced87671e16e4a930c02a42 PiperOrigin-RevId: 266980911	2019-09-03 11:53:16 -07:00
River Riddle	61ee7d640c	Fix an invalid assert when processing escaped strings. The assert assumed that the escaped character could not appear at the end of the string. Fixes tensorflow/mlir#117 PiperOrigin-RevId: 266975471	2019-09-03 11:27:39 -07:00
Alex Zinenko	c335d9d313	LLVM dialect: prefix auxiliary operations with "mlir." Some of the operations in the LLVM dialect are required to model the LLVM IR in MLIR, for example "constant" operations are needed to declare a constant value since MLIR, unlike LLVM, does not support immediate values as operands. To avoid confusion with actual LLVM operations, we prefix such axuiliary operations with "mlir.". PiperOrigin-RevId: 266942838	2019-09-03 09:10:56 -07:00
Mahesh Ravishankar	2acd0dbf05	Add Select operation to SPIR-V dialect. The SelectOp models the semantics of OpSelect from SPIR-V spec. PiperOrigin-RevId: 266849559	2019-09-02 21:07:18 -07:00
River Riddle	5c036e682d	Refactor the pass manager to support operations other than FuncOp/ModuleOp. This change generalizes the structure of the pass manager to allow arbitrary nesting pass managers for other operations, at any level. The only user visible change to existing code is the fact that a PassManager must now provide an MLIRContext on construction. A new class `OpPassManager` has been added that represents a pass manager on a specific operation type. `PassManager` will remain the top-level entry point into the pipeline, with OpPassManagers being nested underneath. OpPassManagers will still be implicitly nested if the operation type on the pass differs from the pass manager. To explicitly build a pipeline, the 'nest' methods on OpPassManager may be used: // Pass manager for the top-level module. PassManager pm(ctx); // Nest a pipeline operating on FuncOp. OpPassManager &fpm = pm.nest<FuncOp>(); fpm.addPass(...); // Nest a pipeline under the FuncOp pipeline that operates on spirv::ModuleOp OpPassManager &spvModulePM = pm.nest<spirv::ModuleOp>(); // Nest a pipeline on FuncOps inside of the spirv::ModuleOp. OpPassManager &spvFuncPM = spvModulePM.nest<FuncOp>(); To help accomplish this a new general OperationPass is added that operates on opaque Operations. This pass can be inserted in a pass manager of any type to operate on any operation opaquely. An example of this opaque OperationPass is a VerifierPass, that simply runs the verifier opaquely on the current operation. /// Pass to verify an operation and signal failure if necessary. class VerifierPass : public OperationPass<VerifierPass> { void runOnOperation() override { Operation *op = getOperation(); if (failed(verify(op))) signalPassFailure(); markAllAnalysesPreserved(); } }; PiperOrigin-RevId: 266840344	2019-09-02 19:25:26 -07:00
River Riddle	6563b1c446	Add a new dialect interface for the OperationFolder `OpFolderDialectInterface`. This interface will allow for providing hooks to interrop with operation folding. The first hook, 'shouldMaterializeInto', will allow for controlling which region to insert materialized constants into. The folder will generally materialize constants into the top-level isolated region, this allows for materializing into a lower level ancestor region if it is more profitable/correct. PiperOrigin-RevId: 266702972	2019-09-01 20:07:08 -07:00
Mehdi Amini	765d60fd4d	Add missing lowering to CFG in mlir-cpu-runner + related cleanup - the list of passes run by mlir-cpu-runner included -lower-affine and -lower-to-llvm but was missing -lower-to-cfg (because -lower-affine at some point used to lower straight to CFG); add -lower-to-cfg in between. IR with affine ops can now be run by mlir-cpu-runner. - update -lower-to-cfg to be consistent with other passes (create*Pass methods were changed to return unique ptrs, but -lower-to-cfg appears to have been missed). - mlir-cpu-runner was unable to parse custom form of affine op's - fix link options - drop unnecessary run options from test/mlir-cpu-runner/simple.mlir (none of the test cases had loops) - -convert-to-llvmir was changed to -lower-to-llvm at some point, but the create pass method name wasn't updated (this pass converts/lowers to LLVM dialect as opposed to LLVM IR). Fix this. (If we prefer "convert", the cmd-line options could be changed to "-convert-to-llvm/cfg" then.) Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#115 PiperOrigin-RevId: 266666909	2019-09-01 11:33:22 -07:00
River Riddle	9c8a8a7d0d	Add a canonicalization to erase empty AffineForOps. AffineForOp themselves are pure and can be removed if there are no internal operations. PiperOrigin-RevId: 266481293	2019-08-30 16:49:32 -07:00
Logan Chien	6b1d7f51ef	Add TensorRankOf for ranked tensor types with specific ranks This commit adds `TensorRankOf<types, typeNames, ranks>` to specify ranked tensor types with the specified types and ranks. For example, `TensorRankOf<[I32, F32], ["i32", "F32"], [0, 1]>` matches `tensor<i32>`, `tensor<?xi32>`, `tensor<f32>`, or `tensor<?xf32>`. PiperOrigin-RevId: 266461256	2019-08-30 14:54:14 -07:00
Jacques Pienaar	06e8101034	Add mechanism to dump JIT-compiled objects to files This commit introduces the bits to be able to dump JIT-compile objects to external files by passing an object cache to OrcJit. The new functionality is tested in mlir-cpu-runner under the flag `dump-object-file`. Closes tensorflow/mlir#95 PiperOrigin-RevId: 266439265	2019-08-30 13:02:10 -07:00
Lei Zhang	4f6c29223e	Add spv.Branch and spv.BranchConditional This CL just covers the op definition, its parsing, printing, and verification. (De)serialization is to be implemented in a subsequent CL. PiperOrigin-RevId: 266431077	2019-08-30 12:17:53 -07:00
River Riddle	4bfae66d70	Refactor the 'walk' methods for operations. This change refactors and cleans up the implementation of the operation walk methods. After this refactoring is that the explicit template parameter for the operation type is no longer needed for the explicit op walks. For example: op->walk<AffineForOp>([](AffineForOp op) { ... }); is now accomplished via: op->walk([](AffineForOp op) { ... }); PiperOrigin-RevId: 266209552	2019-08-29 13:04:50 -07:00
Feng Liu	6de6c2c138	Add tests to verify 0.0 is quantized correctly We should consider both signed and narrow_range cases. PiperOrigin-RevId: 266167366	2019-08-29 10:09:22 -07:00
Uday Bondhugula	4bb6f8ecdb	Extend map canonicalization to propagate constant operands - extend canonicalizeMapAndOperands to propagate constant operands into the map's expressions (and thus drop those operands). - canonicalizeMapAndOperands previously only dropped duplicate and unused operands; however, operands that were constants were retained. This change makes IR maps/expressions generated by various utilities/passes even simpler; also makes some of the test checks more accurate and simpler -- for eg., 0' instead of symbol(%{{.*}}). Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#107 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/107 from bondhugula:canonicalize-maps c889a51486d14fbf7db489f224f881e7e1ff7d72 PiperOrigin-RevId: 266085289	2019-08-29 01:13:29 -07:00
Uday Bondhugula	bc2a543225	fix loop unroll and jam - operand mapping - imperfect nest case - fix operand mapping while cloning sub-blocks to jam - was incorrect for imperfect nests where def/use was across sub-blocks - strengthen/generalize the first test case to cover the previously missed scenario - clean up the other cases while on this. Previously, unroll-jamming the following nest ``` affine.for %arg0 = 0 to 2048 { %0 = alloc() : memref<512x10xf32> affine.for %arg1 = 0 to 10 { %1 = affine.load %0[%arg0, %arg1] : memref<512x10xf32> } dealloc %0 : memref<512x10xf32> } ``` would yield ``` %0 = alloc() : memref<512x10xf32> %1 = affine.apply #map0(%arg0) %2 = alloc() : memref<512x10xf32> affine.for %arg1 = 0 to 10 { %4 = affine.load %0[%arg0, %arg1] : memref<512x10xf32> %5 = affine.apply #map0(%arg0) %6 = affine.load %0[%5, %arg1] : memref<512x10xf32> } dealloc %0 : memref<512x10xf32> %3 = affine.apply #map0(%arg0) dealloc %0 : memref<512x10xf32> ``` instead of ``` module { affine.for %arg0 = 0 to 2048 step 2 { %0 = alloc() : memref<512x10xf32> %1 = affine.apply #map0(%arg0) %2 = alloc() : memref<512x10xf32> affine.for %arg1 = 0 to 10 { %4 = affine.load %0[%arg0, %arg1] : memref<512x10xf32> %5 = affine.apply #map0(%arg0) %6 = affine.load %2[%5, %arg1] : memref<512x10xf32> } dealloc %0 : memref<512x10xf32> %3 = affine.apply #map0(%arg0) dealloc %2 : memref<512x10xf32> } ``` Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#98 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/98 from bondhugula:ujam ddbc853f69b5608b3e8ff9b5ac1f6a5a0bb315a4 PiperOrigin-RevId: 266073460	2019-08-28 23:42:50 -07:00
Stephan Herhut	e90542c03b	Add verification for dimension attribute on GPUDialect index operations. PiperOrigin-RevId: 266073204	2019-08-28 23:39:57 -07:00
Eric Schweitz	2225411690	Tweak to the pretty type parser to recognize that `->` is a special token. Tweak to the pretty type parser to recognize that `->` is a special token that shouldn't be split into two characters. This change allows dialect types to wrap function types as in `!my.ptr_type<(i32) -> i32>`. Closes tensorflow/mlir#105 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/105 from schweitzpgi:parse-arrow 8b2d768053f419daae5a1a864121a44c4319acbe PiperOrigin-RevId: 265986240	2019-08-28 13:55:42 -07:00
Stephan Herhut	c60c490356	Add implementation for tensor_load and tensor_store operations. This change adds definitions, parsing and verification for both ops. PiperOrigin-RevId: 265954051	2019-08-28 11:25:52 -07:00
Uday Bondhugula	aa2cee9cf5	Refactor / improve replaceAllMemRefUsesWith Refactor replaceAllMemRefUsesWith to split it into two methods: the new method does the replacement on a single op, and is used by the existing one. - make the methods return LogicalResult instead of bool - Earlier, when replacement failed (due to non-deferencing uses of the memref), the set of ops that had already been processed would have been replaced leaving the IR in an inconsistent state. Now, a pass is made over all ops to first check for non-deferencing uses, and then replacement is performed. No test cases were affected because all clients of this method were first checking for non-deferencing uses before calling this method (for other reasons). This isn't true for a use case in another upcoming PR (scalar replacement); clients can now bail out with consistent IR on failure of replaceAllMemRefUsesWith. Add test case. - multiple deferencing uses of the same memref in a single op is possible (we have no such use cases/scenarios), and this has always remained unsupported. Add an assertion for this. - minor fix to another test pipeline-data-transfer case. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#87 PiperOrigin-RevId: 265808183	2019-08-27 17:56:56 -07:00
River Riddle	6f68def852	Update Ch.2 of the Toy tutorial. The code and documentation for this chapter of the tutorial have been updated to follow the new flow. The toy 'array' type has been replaced by usages of the MLIR tensor type. The code has also been cleaned up and modernized. Closes tensorflow/mlir#101 PiperOrigin-RevId: 265744086	2019-08-27 12:44:27 -07:00
MLIR Team	696fcb7520	Add 3 additional intrinsic ops to NVVM dialect, in preparation to implement block-wide reduce. PiperOrigin-RevId: 265720077	2019-08-27 10:56:18 -07:00
Mahesh Ravishankar	4ced99c085	Enhance GPU To SPIR-V conversion to support builtins and load/store ops. To support a conversion of a simple load-compute-store kernel from GPU dialect to SPIR-V dialect, the conversion of operations like "gpu.block_dim", "gpu.thread_id" which allow threads to get the launch conversion is needed. In SPIR-V these are specified as global variables with builin attributes. This CL adds support to specify builtin variables in SPIR-V conversion framework. This is used to convert the relevant operations from GPU dialect to SPIR-V dialect. Also add support for conversion of load/store operation in Standard dialect to SPIR-V dialect. To simplify the conversion add a method to build a spv.AccessChain operation that automatically determines the return type based on the base pointer type and the indices provided. PiperOrigin-RevId: 265718525	2019-08-27 10:50:23 -07:00
River Riddle	2f59f76876	NFC: Remove the explicit context from Operation::create and OperationState. The context can easily be recovered from the Location in these situations. PiperOrigin-RevId: 265578574	2019-08-26 17:34:48 -07:00
Eric Schweitz	761d57d187	Add FPToSI/FPExt/FPTrunc cast ops to the LLVM dialect. Closes tensorflow/mlir#99 PiperOrigin-RevId: 265538731	2019-08-26 14:19:19 -07:00
Andy Ly	6a501e3d1b	Support folding of ops with inner ops in GreedyPatternRewriteDriver. This fixes a bug when folding ops with inner ops and inner ops are still being visited. PiperOrigin-RevId: 265475780	2019-08-26 09:44:39 -07:00
Nicolas Vasilache	46084c03f4	Lower linalg.copy to LLVM dialect in the presence of transposes. Add an extra RewritePattern that does not convert types to rewrite a CopyOp that has non-identity permutations into a sequence of TransposeOp followed by a CopyOp without such permutations. This RewitePattern is made to fail in the non-permutation case so that the conversion pattern can kick in to lower to LLVM. This is an instance of A->A->B lowering where A->A is done by a RewritePattern in case_1 and A->B is done by a ConversionPatternRewriter when not(case_1). PiperOrigin-RevId: 265171380	2019-08-23 17:45:19 -07:00
Nicolas Vasilache	28e28e5d65	Lower linalg.transpose to LLVM dialect Add a conversion pattern that transforms a linalg.transpose op into: 1. A function entry `alloca` operation to allocate a ViewDescriptor. 2. A load of the ViewDescriptor from the pointer allocated in 1. 3. Updates to the ViewDescriptor to introduce the data ptr, offset, size and stride. Size and stride are permutations of the original values. 4. A store of the resulting ViewDescriptor to the alloca'ed pointer. The linalg.transpose op is replaced by the alloca'ed pointer. PiperOrigin-RevId: 265169112	2019-08-23 17:29:26 -07:00
Nicolas Vasilache	2c2c9ffd80	Add a linalg.transpose op A linalg.transpose op is a pure metadata operation that takes a view + permutation map and produces another view of the same underlying data, with a different reindexing. This is a pure metadata operation that does not touch the underlying data. Example: ``` %t = linalg.transpose %v (i, j) -> (j, i) : !linalg.view<?x?xf32> ``` PiperOrigin-RevId: 265139429	2019-08-23 14:48:13 -07:00
Nicolas Vasilache	7f42b3d721	Add lowering of linalg.copy to an external C++ library and a test. This CL extends support for lowering of linalg to external C++ libraries with CopyOp. Currently this can only work when the permutation maps in the copies are identity. Future support for permutations will be added later. PiperOrigin-RevId: 265093025	2019-08-23 11:09:53 -07:00
Lei Zhang	0e131d83fe	[spirv] NFC: move SPIR-V control flow ops to a separate file This CL is also purely moving code around for better file organization. PiperOrigin-RevId: 265092566	2019-08-23 11:07:52 -07:00
Chris Lattner	31a003dc3c	Introduce the ability for "isolated from above" ops to introduce shadowing names for the basic block arguments in their body. PiperOrigin-RevId: 265084627	2019-08-23 10:35:49 -07:00
Lei Zhang	21b77fc11f	[spirv] NFC: move arithmetic and logical ops to separate files This is purely moving code around for better file organization. PiperOrigin-RevId: 265082517	2019-08-23 10:26:45 -07:00
MLIR Team	a329d33b4f	Add I32ElementsAttr to OpBase PiperOrigin-RevId: 264969142	2019-08-22 19:05:40 -07:00
River Riddle	d906f84b52	Add iterator support to ElementsAttr and SparseElementsAttr. This will allow iterating the values of a non-opaque ElementsAttr, with all of the types currently supported by DenseElementsAttr. This should help reduce the amount of specialization on DenseElementsAttr. PiperOrigin-RevId: 264968151	2019-08-22 18:59:24 -07:00
Lei Zhang	51cbf97b53	[spirv] Add support for extension (de)serialization Only a few important KHR extensions are registered to the SPIR-V dialect for now. PiperOrigin-RevId: 264939428	2019-08-22 16:01:35 -07:00
Nicolas Vasilache	6f1d4bb8df	Avoid overflow when lowering linalg.slice linalg.subview used to lower to a slice with a bounded range resulting in correct bounded accesses. However linalg.slice could still index out of bounds. This CL moves the bounding to linalg.slice. LLVM select and cmp ops gain a more idiomatic builder. PiperOrigin-RevId: 264897125	2019-08-22 12:47:01 -07:00
Lei Zhang	27ed82f99c	[spirv] Add support for capability (de)serialization This CL pulls in capabilities defined in the spec and adds support for (de)serialize capabilities of a spv.module. PiperOrigin-RevId: 264877413	2019-08-22 11:15:41 -07:00
Logan Chien	b1ce4df505	Add Positive{I32,I64}Attr and HasAnyRankOfPred This commit adds `PositiveI32Attr` and `PositiveI64Attr` to match positive integers but not zero nor negative integers. This commit also adds `HasAnyRankOfPred` to match tensors with the specified ranks. PiperOrigin-RevId: 264867046	2019-08-22 10:36:32 -07:00
Lei Zhang	1d10eb162c	Point to spv.AccessChain when reporting spv.AccessChain errors PiperOrigin-RevId: 264742130	2019-08-21 18:54:06 -07:00
Lei Zhang	748edce6b8	Remove the wrapping function in SPIR-V (de)serialization Previously Module and Function are builtinn constructs in MLIR. Due to the structural requirements we must wrap the SPIR-V module inside a Function inside a Module. Now the requirement is lifted and we can remove the wrapping function! :) PiperOrigin-RevId: 264736051	2019-08-21 18:05:24 -07:00
MLIR Team	d661eda811	[TableGen] Add a `StaticShapeMemRefOf` trait. The trait specifies that the `MemRefOf` has to have a static shape. PiperOrigin-RevId: 264692758	2019-08-21 14:28:41 -07:00
River Riddle	b618221350	Automated rollback of commit `b9dc2e4818` PiperOrigin-RevId: 264672975	2019-08-21 13:01:03 -07:00
River Riddle	b9dc2e4818	Add iterator support to ElementsAttr and SparseElementsAttr. This will allow iterating the values of a non-opaque ElementsAttr, with all of the types currently supported by DenseElementsAttr. This should help reduce the amount of specialization on DenseElementsAttr. PiperOrigin-RevId: 264637293	2019-08-21 10:23:44 -07:00
Lei Zhang	8d18fdf2d3	[spirv] Support i1 as bool type PiperOrigin-RevId: 264612014	2019-08-21 08:17:50 -07:00
Lei Zhang	31cfee6077	Support variadic ops in declarative rewrite rules This CL extends declarative rewrite rules to support matching and generating ops with variadic operands/results. For this, the generated `matchAndRewrite()` method for each pattern now are changed to * Use "range" types for the local variables used to store captured values (`operand_range` for operands, `ArrayRef<Value >` for values, Op for results). This allows us to have a unified way of handling both single values and value ranges. * Create local variables for each operand for op creation. If the operand is variadic, then a `SmallVector<Value>` will be created to collect all values for that operand; otherwise a `Value` will be created. * Use a collective result type builder. All result types are specified via a single parameter to the builder. We can use one result pattern to replace multiple results of the matched root op. When that happens, it will require specifying types for multiple results. Add a new collective-type builder. PiperOrigin-RevId: 264588559	2019-08-21 05:35:32 -07:00
Lei Zhang	69cf811d5b	Materialize spv.constants at use sites In SPIR-V binary format, constants are placed at the module level and referenced by instructions inside functions using their result <id>s. To model this natively (using SSA values for result <id>s), it means we need to have implicit capturing functions. We will lose the ability to have function passes if going down that path. Instead, this CL changes to materialize constants at their use sites in deserialization. It's cheap to copy constants in MLIR given that attributes is uniqued to MLIRContext. By localizing constants into functions, we can preserve isolated functions. PiperOrigin-RevId: 264582532	2019-08-21 04:45:49 -07:00
River Riddle	ffde975e21	NFC: Move AffineOps dialect to the Dialect sub-directory. PiperOrigin-RevId: 264482571	2019-08-20 15:36:39 -07:00
Lei Zhang	f4934bcc3e	Add spv.specConstant and spv._reference_of Similar to global variables, specialization constants also live in the module scope and can be referenced by instructions in functions in native SPIR-V. A direct modelling would be to allow functions in the SPIR-V dialect to implicit capture, but it means we are losing the ability to write passes for Functions. While in SPIR-V normally we want to process the module as a whole, it's not common to see multiple functions get used so we'd like to leave the door open for those cases. Therefore, similar to global variables, we introduce spv.specConstant to model three SPIR-V instructions: OpSpecConstantTrue, OpSpecConstantFalse, and OpSpecConstant. They do not return SSA value results; instead they have symbols and can only be referenced by the symbols. To use it in a function, we need to have another op spv._reference_of to turn the symbol into an SSA value. This breaks the tie and makes functions still explicit capture. Previously specialization constants were handled similarly as normal constants. That is incorrect given that specialization constant actually acts more like variable (without need to load and store). E.g., they cannot be de-duplicated like normal constants. This CL also refines various documents and comments. PiperOrigin-RevId: 264455172	2019-08-20 13:34:13 -07:00
Denis Khalikov	82cf6051ee	[spirv] Support (de)serialization of spv.struct Support (de)serialization of spv.struct with offset decorations. Closes tensorflow/mlir#94 PiperOrigin-RevId: 264421427	2019-08-20 11:03:42 -07:00
Diego Caballero	9e6cf0d025	Fix build of affine load/store with empty map tensorflow/mlir#58 fixed and exercised verification of load/store ops using empty affine maps. Unfortunately, it didn't exercise the creation of them. This PR addresses that aspect. It removes the assumption of AffineMap having at least one result and stores a pointer to MLIRContext as member of AffineMap. * Add empty map support to affine.store + test * Move MLIRContext to AffineMapStorage Closes tensorflow/mlir#74 PiperOrigin-RevId: 264416260	2019-08-20 10:44:18 -07:00
Alex Zinenko	006fcce44a	ConvertLaunchFuncToCudaCalls: use LLVM dialect globals This conversion has been using a stack-allocated array of i8 to store the null-terminated kernel name in order to pass it to the CUDA wrappers expecting a C string because the LLVM dialect was missing support for globals. Now that the suport is introduced, use a global instead. Refactor global string construction from GenerateCubinAccessors into a common utility function living in the LLVM namespace. PiperOrigin-RevId: 264382489	2019-08-20 07:52:01 -07:00
Alex Zinenko	0d82a292b0	JitRunner: support entry functions returning void JitRunner can use as entry points functions that produce either a single '!llvm.f32' value or a list of memrefs. Memref support is legacy and was introduced before MLIR could lower memref allocation and deallocation to malloc/free calls so as to allocate the memory externally, and is likely to be dropped in the future since it unconditionally runs affine+standard-to-llvm lowering on the module instead of accepting the LLVM dialect. CUDA runner relies on memref-based flow in the runner without actually returning anything. Introduce a runner flow to use functions that return void as entry points. PiperOrigin-RevId: 264381686	2019-08-20 07:46:17 -07:00
Alex Zinenko	0f974817b5	LLVM dialect: prefix operations that correspond to intrinsics with "intr." LLVM intrinsics have an open name space and their names can potentially overlap with names of LLVM instructions (LLVM intrinsics are functions, not instructions). In MLIR, LLVM intrinsics are modeled as operations, so it needs to make sure their names cannot clash with the instructions. Use the "intr." prefix for intrinsics in the LLVM dialect. PiperOrigin-RevId: 264372173	2019-08-20 06:38:52 -07:00
Nicolas Vasilache	f55ac5c076	Add support for LLVM lowering of binary ops on n-D vector types This CL allows binary operations on n-D vector types to be lowered to LLVMIR by performing an (n-1)-D extractvalue, 1-D vector operation and an (n-1)-D insertvalue. PiperOrigin-RevId: 264339118	2019-08-20 02:00:22 -07:00
Uday Bondhugula	07ecb011a7	Fix AffineExpr::simplifyAdd bug - fix missing check while simplifying an expression with floordiv to a mod - fixes issue tensorflow/mlir#82 Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#84 PiperOrigin-RevId: 264338353	2019-08-20 01:53:07 -07:00
Nicolas Vasilache	b628194013	Move Linalg and VectorOps dialects to the Dialect subdir - NFC PiperOrigin-RevId: 264277760	2019-08-19 17:11:38 -07:00
River Riddle	305516fcd3	Allow isolated regions to form isolated SSA name scopes in the printer. This will allow for naming values the same as existing SSA values for regions attached to operations that are isolated from above. This fits in with how the system already allows separate name scopes for sibling regions. This name shadowing can be enabled in the custom parser of operations by setting the 'enableNameShadowing' flag to true when calling 'parseRegion'. %arg = constant 10 : i32 foo.op { %arg = constant 10 : i32 } PiperOrigin-RevId: 264255999	2019-08-19 15:27:10 -07:00
Nicolas Vasilache	36f48063dd	Add alignment support to linalg.buffer_alloc This CL adds an integer attribute to linalg.buffer_alloc and lowering to LLVM. The alignment is constrained to be a positive power of 2. Lowering to LLVM produces the pattern: ``` %[[alloc:.]] = llvm.call @malloc(%[[s]]) : (!llvm.i64) -> !llvm<"i8"> %[[cast:.]] = llvm.bitcast %[[alloc]] : !llvm<"i8"> to !llvm.i64 %[[rem:.]] = llvm.urem %[[cast]], %[[c16]] : !llvm.i64 %[[drem:.]] = llvm.sub %[[c16]], %[[rem]] : !llvm.i64 %[[off:.]] = llvm.urem %[[drem]], %[[c16]] : !llvm.i64 llvm.getelementptr %{{.}}[%[[off]]] : (!llvm<"i8">, !llvm.i64) -> !llvm<"i8"> ``` where `ptr` is aligned on `align` by computing the address `ptr + (align - ptr % align) % align`. To allow dealloc op to still be able to free memory, additional information is needed in the buffer type. The buffer type is thus extended with an extra i8* for the base allocation address. PiperOrigin-RevId: 264244455	2019-08-19 14:37:18 -07:00
Mahesh Ravishankar	377bfb3a14	Fix parsing/printing of spv.globalVariable and spv._address_of Change the prining/parsing of spv.globalVariable to print the type of the variable after the ':' to be consistent with MLIR convention. The spv._address_of should print the variable type after the ':'. It was mistakenly printing the address of the return value. Add a (missing) test that should have caught that. Also move spv.globalVariable and spv._address_of tests to structure-ops.mlir. PiperOrigin-RevId: 264204686	2019-08-19 11:39:25 -07:00
River Riddle	ba0fa92524	NFC: Move LLVMIR, SDBM, and StandardOps to the Dialect/ directory. PiperOrigin-RevId: 264193915	2019-08-19 11:01:25 -07:00
Lei Zhang	64abcd983d	[spirv] Add spv.ReturnValue This CL adds the spv.ReturnValue op and its tests. Also adds a InFunctionScope trait to make sure that the op stays inside a function. To be consistent, ModuleOnly trait is changed to InModuleScope. PiperOrigin-RevId: 264193081	2019-08-19 10:58:10 -07:00
Nicolas Vasilache	9bf69e6a2e	Refactor linalg lowering to LLVM The linalg.view type used to be lowered to a struct containing a data pointer, offset, sizes/strides information. This was problematic when passing to external functions due to ABI, struct padding and alignment issues. The linalg.view type is now lowered to LLVMIR as a pointer to a struct containing the data pointer, offset and sizes/strides. This simplifies the interfacing with external library functions and makes it trivial to add new functions without creating a shim that would go from a value type struct to a pointer type. The consequences are that: 1. lowering explicitly uses llvm.alloca in lieu of llvm.undef and performs the proper llvm.load/llvm.store where relevant. 2. the shim creation function `getLLVMLibraryCallDefinition` disappears. 3. views are passed by pointer, scalars are passed by value. In the future, other structs will be passed by pointer (on a per-need basis). PiperOrigin-RevId: 264183671	2019-08-19 10:21:40 -07:00
Nicolas Vasilache	c9f37fca37	Add alignment support for llvm.alloca Extend the LLVM dialect AllocaOp with an alignment attribute. PiperOrigin-RevId: 264068306	2019-08-18 18:55:14 -07:00
Jacques Pienaar	79f53b0cf1	Change from llvm::make_unique to std::make_unique Switch to C++14 standard method as llvm::make_unique has been removed ( https://reviews.llvm.org/D66259). Also mark some targets as c++14 to ease next integrates. PiperOrigin-RevId: 263953918	2019-08-17 11:06:03 -07:00
Mahesh Ravishankar	d745101339	Add spirv::GlobalVariableOp that allows module level definition of variables FuncOps in MLIR use explicit capture. So global variables defined in module scope need to have a symbol name and this should be used to refer to the variable within the function. This deviates from SPIR-V spec, which assigns an SSA value to variables at all scopes that can be used to refer to the variable, which requires SPIR-V functions to allow implicit capture. To handle this add a new op, spirv::GlobalVariableOp that can be used to define module scope variables. Since instructions need an SSA value, an new spirv::AddressOfOp is added to convert a symbol reference to an SSA value for use with other instructions. This also means the spirv::EntryPointOp instruction needs to change to allow initializers to be specified using symbol reference instead of SSA value The current spirv::VariableOp which returns an SSA value (as defined by SPIR-V spec) can still be used to define function-scope variables. PiperOrigin-RevId: 263951109	2019-08-17 10:20:13 -07:00
Denis Khalikov	cf358017e6	[spirv] Extend spv.array with Layoutinfo Extend spv.array with Layoutinfo to support (de)serialization. Closes tensorflow/mlir#80 PiperOrigin-RevId: 263795304	2019-08-16 10:18:14 -07:00
Nicolas Vasilache	f826ceef3c	Extend vector.outerproduct with an optional 3rd argument This CL adds an optional third argument to the vector.outerproduct instruction. When such a third argument is specified, it is added to the result of the outerproduct and is lowered to FMA intrinsic when the lowering supports it. In the future, we can add an attribute on the `vector.outerproduct` instruction to modify the operations for which to emit code (e.g. "+/", "max/+", "min/+", "log/exp" ...). This CL additionally performs minor cleanups in the vector lowering and adds tests to improve coverage. This has been independently verified to result in proper fma instructions for haswell as follows. Input: ``` func @outerproduct_add(%arg0: vector<17xf32>, %arg1: vector<8xf32>, %arg2: vector<17x8xf32>) -> vector<17x8xf32> { %2 = vector.outerproduct %arg0, %arg1, %arg2 : vector<17xf32>, vector<8xf32> return %2 : vector<17x8xf32> } } ``` Command: ``` mlir-opt vector-to-llvm.mlir -vector-lower-to-llvm-dialect --disable-pass-threading \| mlir-opt -lower-to-cfg -lower-to-llvm \| mlir-translate --mlir-to-llvmir \| opt -O3 \| llc -O3 -march=x86-64 -mcpu=haswell -mattr=fma,avx2 ``` Output: ``` outerproduct_add: # @outerproduct_add # %bb.0: ... vmovaps 112(%rbp), %ymm8 vbroadcastss %xmm0, %ymm0 ... vbroadcastss 64(%rbp), %ymm15 vfmadd213ps 144(%rbp), %ymm8, %ymm0 # ymm0 = (ymm8 ymm0) + mem ... vfmadd213ps 400(%rbp), %ymm8, %ymm9 # ymm9 = (ymm8 * ymm9) + mem ... ``` PiperOrigin-RevId: 263743359	2019-08-16 03:53:26 -07:00
Mahesh Ravishankar	d71915420b	Add BuiltIn EnumAttr to SPIR-V dialect Generate the EnumAttr to represent BuiltIns in SPIR-V dialect. The builtIn can be specified as a StringAttr with value being the name of the builtin. Extend Decoration (de)serialization to handle BuiltIns. Also fix an error in the SPIR-V dialect generator script. PiperOrigin-RevId: 263596624	2019-08-15 10:52:59 -07:00
River Riddle	a481032a33	Refactor ElementsAttr::getValue and DenseElementsAttr::getSplatValue. All 'getValue' variants now require that the index is valid, queryable via 'isValidIndex'. 'getSplatValue' now requires that the attribute is a proper splat. This allows for querying these methods on DenseElementAttr with all possible value types; e.g. float, int, APInt, etc. This also allows for removing unnecessary conversions to Attribute that really want the underlying value. PiperOrigin-RevId: 263437337	2019-08-14 15:03:53 -07:00
Nicolas Vasilache	f32f291b37	Move remaining linalg ops to ODS - NFC This CL moves the linalg.load/range/store ops to ODS. Minor cleanups are performed. Additional invalid IR tests are added for coverage. PiperOrigin-RevId: 263432110	2019-08-14 14:40:52 -07:00
Nicolas Vasilache	4f10c9b1ce	Refactor linalg.view lowering to LLVM - NFC This CL fuses the emission of size and stride information and makes it clearer which indexings are stepped over when querying the positions. This refactor was motivated by an index calculation bug in the stride computation. PiperOrigin-RevId: 263341610	2019-08-14 07:01:41 -07:00
Nicolas Vasilache	4286c52ae8	Move linalg.slice to ODS PiperOrigin-RevId: 263334168	2019-08-14 06:03:12 -07:00
Nicolas Vasilache	b09dfcb1d7	Fix indexing issue in lowering of linalg.slice This CL fixes the stepping through operands when emitting the view sizes of linalg.slice to LLVMIR. This is now consistent with the strides emission. A relevant test is added. Fix suggested by Alex Zinenko, thanks! PiperOrigin-RevId: 263150922	2019-08-13 09:20:32 -07:00
Alex Zinenko	5f0a843144	LLVM dialect: introduce fmuladd intrinsic as operation This operation is important to achieve decent performance in computational kernels. In LLVM, it is implemented as an intrinsic (through function declaration and function call). Thanks to MLIR's extendable set of operations, it does not have to differentiate between built-ins and intrinsics, so fmuladd is introduced as a general type-polymorphic operation. Custom printing and parsing will be added later. PiperOrigin-RevId: 263106305	2019-08-13 03:40:51 -07:00
Alex Zinenko	88de8b2a2b	GenerateCubinAccessors: use LLVM dialect constants The GenerateCubinAccessors was generating functions that fill dynamically-allocated memory with the binary constant of a CUBIN attached as a stirng attribute to the GPU kernel. This approach was taken to circumvent the missing support for global constants in the LLVM dialect (and MLIR in general). Global constants were recently added to the LLVM dialect. Change the GenerateCubinAccessors pass to emit a global constant array of characters and a function that returns a pointer to the first character in the array. PiperOrigin-RevId: 263092052	2019-08-13 01:39:21 -07:00
Mehdi Amini	926fb685de	Express ownership transfer in PassManager API through std::unique_ptr (NFC) Since raw pointers are always passed around for IR construct without implying any ownership transfer, it can be error prone to have implicit ownership transferred the same way. For example this code can seem harmless: Pass *pass = .... pm.addPass(pass); pm.addPass(pass); pm.run(module); PiperOrigin-RevId: 263053082	2019-08-12 19:13:12 -07:00
Alex Zinenko	2dd38b09c1	LLVM dialect: introduce llvm.addressof to access globals This instruction is a local counterpart of llvm.global that takes a symbol reference to a global and produces an SSA value containing the pointer to it. Used in combination, these two operations allow one to use globals with other operations expecting SSA values. At a cost of IR indirection, we make sure the functions don't implicitly capture the surrounding SSA values and remain suitable for parallel processing. PiperOrigin-RevId: 262908622	2019-08-12 06:10:54 -07:00
Nicolas Vasilache	252ada4932	Add lowering of vector dialect to LLVM dialect. This CL is step 3/n towards building a simple, programmable and portable vector abstraction in MLIR that can go all the way down to generating assembly vector code via LLVM's opt and llc tools. This CL adds support for converting MLIR n-D vector types to (n-1)-D arrays of 1-D LLVM vectors and a conversion VectorToLLVM that lowers the `vector.extractelement` and `vector.outerproduct` instructions to the proper mix of `llvm.vectorshuffle`, `llvm.extractelement` and `llvm.mulf`. This has been independently verified to produce proper avx2 code. Input: ``` func @vec_1d(%arg0: vector<4xf32>, %arg1: vector<8xf32>) -> vector<8xf32> { %2 = vector.outerproduct %arg0, %arg1 : vector<4xf32>, vector<8xf32> %3 = vector.extractelement %2[0 : i32]: vector<4x8xf32> return %3 : vector<8xf32> } ``` Command: ``` mlir-opt vector-to-llvm.mlir -vector-lower-to-llvm-dialect --disable-pass-threading \| mlir-opt -lower-to-cfg -lower-to-llvm \| mlir-translate --mlir-to-llvmir \| opt -O3 \| llc -O3 -march=x86-64 -mcpu=haswell -mattr=fma,avx2 ``` Output: ``` vec_1d: # @vec_1d # %bb.0: vbroadcastss %xmm0, %ymm0 vmulps %ymm1, %ymm0, %ymm0 retq ``` PiperOrigin-RevId: 262895929	2019-08-12 04:08:57 -07:00
River Riddle	1e42954032	NFC: Standardize the terminology used for parent ops/regions/etc. There are currently several different terms used to refer to a parent IR unit in 'get' methods: getParent/getEnclosing/getContaining. This cl standardizes all of these methods to use 'getParent*'. PiperOrigin-RevId: 262680287	2019-08-09 20:07:52 -07:00
River Riddle	41968fb475	NFC: Update usages of OwningRewritePatternList to pass by & instead of &&. This will allow for reusing the same pattern list, which may be costly to continually reconstruct, on multiple invocations. PiperOrigin-RevId: 262664599	2019-08-09 17:20:29 -07:00
Alex Zinenko	68451df267	LLVM dialect and translation: support global strings Unlike regular constant values, strings must be placed in some memory and referred to through a pointer to that memory. Until now, they were not supported in function-local constant declarations with `llvm.constant`. Introduce support for global strings using `llvm.global`, which would translate them into global arrays in LLVM IR and thus make sure they have some memory allocated for storage. PiperOrigin-RevId: 262569316	2019-08-09 09:00:13 -07:00
Alex Zinenko	b9ff2dd87e	Translation to LLVM: support llvm.global Add support for translating recently introduced llvm.global operations to global variables in the LLVM IR proper. PiperOrigin-RevId: 262564700	2019-08-09 08:30:42 -07:00
Nicolas Vasilache	59b473c231	External library name mangling support for linalg. This CL introduces the ability to generate the external library name for Linalg operations. The problem is that neither mlir or C support overloading and we want a simplified form of name mangling that is still reasonable to read. This CL creates the name of the external call that Linalg expects from the operation name and the type of its arguments. The interface library names are updated and use new cases are added for FillOp. PiperOrigin-RevId: 262556833	2019-08-09 07:33:58 -07:00
Nicolas Vasilache	20f2d3b598	Allow linalg.view to change the underlying elemental type. This CL adds the ability for linalg.view to act as a bitcast operation. This will be used when promoting views into faster memory and casting to vector types. In the process, linalg.view is moved to ODS. PiperOrigin-RevId: 262556246	2019-08-09 07:29:21 -07:00
Nicolas Vasilache	d2aba89f2e	Add a higher-order vector.outerproduct operation in MLIR This CL is step 2/n towards building a simple, programmable and portable vector abstraction in MLIR that can go all the way down to generating assembly vector code via LLVM's opt and llc tools. This CL adds the vector.outerproduct operation to the MLIR vector dialect as well as the appropriate roundtrip test. Lowering to LLVM will occur in the following CL. PiperOrigin-RevId: 262552027	2019-08-09 06:55:36 -07:00
Nicolas Vasilache	39f1b9a053	Add a higher-order vector.extractelement operation in MLIR This CL is step 2/n towards building a simple, programmable and portable vector abstraction in MLIR that can go all the way down to generating assembly vector code via LLVM's opt and llc tools. This CL adds the vector.extractelement operation to the MLIR vector dialect as well as the appropriate roundtrip test. Lowering to LLVM will occur in the following CL. PiperOrigin-RevId: 262545089	2019-08-09 05:58:47 -07:00
Nicolas Vasilache	92dc127ab3	Add support for vector ops in the LLVM dialect This CL is step 1/n towards building a simple, programmable and portable vector abstraction in MLIR that can go all the way down to generating assembly vector code via LLVM's opt and llc tools. This CL adds the 3 instructions `llvm.extractelement`, `llvm.insertelement` and `llvm.shufflevector` as documented in the LLVM LangRef "Vector Instructions" section. The "Experimental Vector Reduction Intrinsics" are left out for now and can be added in the future on a per-need basis. Appropriate roundtrip and LLVM Target tests are added. PiperOrigin-RevId: 262542095	2019-08-09 05:25:31 -07:00
Alex Zinenko	6d8611b38f	LLVM Dialect: introduce llvm.global Introduce an operation that defines global constants and variables in the LLVM dialect, to reflect the corresponding LLVM IR capability. This operation is expected to live in the top-level module and behaves similarly to llvm.constant. It currently does not model many of the attributes supported by the LLVM IR for global values (memory space, alignment, thread-local, linkage) and will be extended as the relevant use cases appear. PiperOrigin-RevId: 262539445	2019-08-09 05:01:52 -07:00
Nagy Mostafa	48fdc8d7a3	Add support for floating-point comparison 'fcmp' to the LLVM dialect. This adds support for fcmp to the LLVM dialect and adds any necessary lowerings, as well as support for EDSCs. Closes tensorflow/mlir#69 PiperOrigin-RevId: 262475255	2019-08-08 18:29:48 -07:00
Alex Zinenko	0126dcf1f0	Introduce support for variadic function signatures for the LLVM dialect LLVM function type has first-class support for variadic functions. In the current lowering pipeline, it is emulated using an attribute on functions of standard function type. In LLVMFuncOp that has LLVM function type, this can be modeled directly. Introduce parsing support for variadic arguments to the function and use it to support variadic function declarations in LLVMFuncOp. Function definitions are currently not supported as that would require modeling va_start/va_end LLVM intrinsics in the dialect and we don't yet have a consistent story for LLVM intrinsics. PiperOrigin-RevId: 262372651	2019-08-08 09:42:16 -07:00
Alex Zinenko	70ca59ac50	Parser: treat implicit top-level module as an SSA name scope Now that modules are also operations, nothing prevents one from defining SSA values in the module. Doing so in an implicit top-level module, i.e. outside of a `module` operation, was leading to a crash because the implicit module was not associated with an SSA name scope. Create a name scope before parsing the top-level module to fix this. PiperOrigin-RevId: 262366891	2019-08-08 09:14:46 -07:00
Nicolas Vasilache	b0ea33a7c6	Add canonicalization pattern for linalg.dim This CL introduces canonicalization patterns for linalg.dim. This allows the dimenions of chains of view, slice and subview operations to simplify. Down the line, when mixed with cse, this also allows better composition of linalg tiling and fusion by tracking operations that give the same result (not in this CL). PiperOrigin-RevId: 262365865	2019-08-08 09:09:58 -07:00
Eric Schweitz	881b238d7e	Add the LLVM IR unreachable instruction to the LLVMIR dialect. http://llvm.org/docs/LangRef.html#unreachable-instruction Closes tensorflow/mlir#64 PiperOrigin-RevId: 262301557	2019-08-08 01:06:00 -07:00
Diego Caballero	c6a006d4c7	Fix verification of zero-dim memref in affine.load/affine.store/std.load/std.store Verification complained when using zero-dimensional memrefs in affine.load, affine.store, std.load and std.store. This PR extends verification so that those memrefs can be used. Closes tensorflow/mlir#58 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/58 from dcaballe:dcaballe/zero-dim 49bcdcd45c52c48beca776431328e5ce551dfa9e PiperOrigin-RevId: 262164916	2019-08-07 10:31:49 -07:00
River Riddle	8920afb0a6	NFC: Simplify ModuleTerminatorOp by using the HasParent trait. PiperOrigin-RevId: 261962104	2019-08-06 11:46:32 -07:00
Andy Ly	55f2e24ab3	Remove ops in regions/blocks from worklist when parent op is being removed via GreedyPatternRewriteDriver::replaceOp. This fixes a bug where ops inside the parent op are visited even though the parent op has been removed. PiperOrigin-RevId: 261953580	2019-08-06 11:08:54 -07:00
River Riddle	641fc7007c	NFC: Simplify ModuleOp by using the SingleBlockImplicitTerminator trait. PiperOrigin-RevId: 261944712	2019-08-06 10:33:45 -07:00
Nicolas Vasilache	4b422a51ed	Add a region to linalg.generic This CL extends the Linalg GenericOp with an alternative way of specifying the body of the computation based on a single block region. The "fun" attribute becomes optional. Either a SymbolRef "fun" attribute or a single block region must be specified to describe the side-effect-free computation. Upon lowering to loops, the new region body is inlined in the innermost loop. The parser, verifier and pretty printer are extended. Appropriate roundtrip, negative and lowering to loop tests are added. PiperOrigin-RevId: 261895568	2019-08-06 05:50:36 -07:00
Nicolas Vasilache	24647750d4	Refactor Linalg ops to loop lowering (NFC) This CL modifies the LowerLinalgToLoopsPass to use RewritePattern. This will make it easier to inline Linalg generic functions and regions when emitting to loops in a subsequent CL. PiperOrigin-RevId: 261894120	2019-08-06 05:38:16 -07:00
River Riddle	a0df3ebd15	NFC: Implement OwningRewritePatternList as a class instead of a using directive. This allows for proper forward declaration, as opposed to leaking the internal implementation via a using directive. This also allows for all pattern building to go through 'insert' methods on the OwningRewritePatternList, replacing uses of 'push_back' and 'RewriteListBuilder'. PiperOrigin-RevId: 261816316	2019-08-05 18:38:22 -07:00
Nicolas Vasilache	ceb8d2d20e	Drop linalg.range_intersect op This op is not useful. PiperOrigin-RevId: 261665736	2019-08-05 05:26:20 -07:00
Lei Zhang	496a42f291	Use SingleBlockImplicitTerminator trait for spv.module This trait provides the ensureTerminator() utility function and the checks to make sure a spv.module is indeed terminated with spv._module_end. PiperOrigin-RevId: 261664153	2019-08-05 05:10:05 -07:00
Alex Zinenko	6059122601	Introduce custom syntax for llvm.func Similar to all LLVM dialect operations, llvm.func needs to have the custom syntax. Use the generic FunctionLike printer and parser to implement it. PiperOrigin-RevId: 261641755	2019-08-05 01:57:54 -07:00
Denis Khalikov	b36e3be3fc	[mlir-translate] Fix test suite. llvm ir printer was changed at LLVM r367755. Prints value numbers for unnamed functions argument. Closes tensorflow/mlir#67 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/67 from denis0x0D:sandbox/fix_mlir_translate ae46844e66f34a02e0cf86782ddadc5bce58b30d PiperOrigin-RevId: 261640048	2019-08-05 01:39:55 -07:00
Lei Zhang	9d7655677f	[ODS] Add new definitions for non-negative integer attributes This CL added a new NonNegativeIntAttrBase class and two instantiations, one for I32 and the other for I64. PiperOrigin-RevId: 261513292	2019-08-03 16:58:52 -07:00
Nicolas Vasilache	600c47e77b	Add a generic Linalg op This CL introduces a linalg.generic op to represent generic tensor contraction operations on views. A linalg.generic operation requires a numbers of attributes that are sufficient to emit the computation in scalar form as well as compute the appropriate subviews to enable tiling and fusion. These attributes are very similar to the attributes for existing operations such as linalg.matmul etc and existing operations can be implemented with the generic form. In the future, most existing operations can be implemented using the generic form. This CL starts by splitting out most of the functionality of the linalg::NInputsAndOutputs trait into a ViewTrait that queries the per-instance properties of the op. This allows using the attribute informations. This exposes an ordering of verifiers issue where ViewTrait::verify uses attributes but the verifiers for those attributes have not been run. The desired behavior would be for the verifiers of the attributes specified in the builder to execute first but it is not the case atm. As a consequence, to emit proper error messages and avoid crashing, some of the linalg.generic methods are defensive as such: ``` unsigned getNumInputs() { // This is redundant with the `n_views` attribute verifier but ordering of verifiers // may exhibit cases where we crash instead of emitting an error message. if (!getAttr("n_views") \|\| n_views().getValue().size() != 2) return 0; ``` In pretty-printed form, the specific attributes required for linalg.generic are factored out in an independent dictionary named "_". When parsing its content is flattened and the "_name" is dropped. This allows using aliasing for reducing boilerplate at each linalg.generic invocation while benefiting from the Tablegen'd verifier form for each named attribute in the dictionary. For instance, implementing linalg.matmul in terms of linalg.generic resembles: ``` func @mac(%a: f32, %b: f32, %c: f32) -> f32 { %d = mulf %a, %b: f32 %e = addf %c, %d: f32 return %e: f32 } #matmul_accesses = [ (m, n, k) -> (m, k), (m, n, k) -> (k, n), (m, n, k) -> (m, n) ] #matmul_trait = { doc = "C(m, n) += A(m, k) * B(k, n)", fun = @mac, indexing_maps = #matmul_accesses, library_call = "linalg_matmul", n_views = [2, 1], n_loop_types = [2, 1, 0] } ``` And can be used in multiple places as: ``` linalg.generic #matmul_trait %A, %B, %C [other-attributes] : !linalg.view<?x?xf32>, !linalg.view<?x?xf32>, !linalg.view<?x?xf32> ``` In the future it would be great to have a mechanism to alias / register a new linalg.op as a pair of linalg.generic, #trait. Also, note that with one could theoretically only specify the `doc` string and parse all the attributes from it. PiperOrigin-RevId: 261338740	2019-08-02 09:53:41 -07:00
Diego Caballero	c19b72d3f3	Add StdIndexedValue to EDSC helpers Add StdIndexedValue to EDSC helper so that we can use it to generated std.load and std.store in EDSC. Closes tensorflow/mlir#59 PiperOrigin-RevId: 261324965	2019-08-02 08:24:17 -07:00
Uday Bondhugula	18b8d4352b	Introduce explicit copying optimization by generalizing the DMA generation pass Explicit copying to contiguous buffers is a standard technique to avoid conflict misses and TLB misses, and improve hardware prefetching performance. When done in conjunction with cache tiling, it nearly eliminates all cache conflict and TLB misses, and a single hardware prefetch stream is needed per data tile. - generalize/extend DMA generation pass (renamed data copying pass) to perform either point-wise explicit copies to fast memory buffers or DMAs (depending on a cmd line option). All logic is the same as erstwhile -dma-generate. - -affine-dma-generate is now renamed -affine-data-copy; when -dma flag is provided, DMAs are generated, or else explicit copy loops are generated (point-wise) by default. - point-wise copying could be used for CPUs (or GPUs); some indicative performance numbers with a "C" version of the MLIR when compiled with and without this optimization (about 2x improvement here). With a matmul on 4096^2 matrices on a single core of an Intel Core i7 Skylake i7-8700K with clang 8.0.0: clang -O3: 518s clang -O3 with MLIR tiling (128x128): 24.5s clang -O3 with MLIR tiling + data copying 12.4s (code equivalent to test/Transforms/data-copy.mlir func @matmul) - fix some misleading comments. - change default fast-mem space to 0 (more intuitive now with the default copy generation using point-wise copies instead of DMAs) On a simple 3-d matmul loop nest, code generated with -affine-data-copy: ``` affine.for %arg3 = 0 to 4096 step 128 { affine.for %arg4 = 0 to 4096 step 128 { %0 = affine.apply #map0(%arg3, %arg4) %1 = affine.apply #map1(%arg3, %arg4) %2 = alloc() : memref<128x128xf32, 2> // Copy-in Out matrix. affine.for %arg5 = 0 to 128 { %5 = affine.apply #map2(%arg3, %arg5) affine.for %arg6 = 0 to 128 { %6 = affine.apply #map2(%arg4, %arg6) %7 = load %arg2[%5, %6] : memref<4096x4096xf32> affine.store %7, %2[%arg5, %arg6] : memref<128x128xf32, 2> } } affine.for %arg5 = 0 to 4096 step 128 { %5 = affine.apply #map0(%arg3, %arg5) %6 = affine.apply #map1(%arg3, %arg5) %7 = alloc() : memref<128x128xf32, 2> // Copy-in LHS. affine.for %arg6 = 0 to 128 { %11 = affine.apply #map2(%arg3, %arg6) affine.for %arg7 = 0 to 128 { %12 = affine.apply #map2(%arg5, %arg7) %13 = load %arg0[%11, %12] : memref<4096x4096xf32> affine.store %13, %7[%arg6, %arg7] : memref<128x128xf32, 2> } } %8 = affine.apply #map0(%arg5, %arg4) %9 = affine.apply #map1(%arg5, %arg4) %10 = alloc() : memref<128x128xf32, 2> // Copy-in RHS. affine.for %arg6 = 0 to 128 { %11 = affine.apply #map2(%arg5, %arg6) affine.for %arg7 = 0 to 128 { %12 = affine.apply #map2(%arg4, %arg7) %13 = load %arg1[%11, %12] : memref<4096x4096xf32> affine.store %13, %10[%arg6, %arg7] : memref<128x128xf32, 2> } } // Compute. affine.for %arg6 = #map7(%arg3) to #map8(%arg3) { affine.for %arg7 = #map7(%arg4) to #map8(%arg4) { affine.for %arg8 = #map7(%arg5) to #map8(%arg5) { %11 = affine.load %7[-%arg3 + %arg6, -%arg5 + %arg8] : memref<128x128xf32, 2> %12 = affine.load %10[-%arg5 + %arg8, -%arg4 + %arg7] : memref<128x128xf32, 2> %13 = affine.load %2[-%arg3 + %arg6, -%arg4 + %arg7] : memref<128x128xf32, 2> %14 = mulf %11, %12 : f32 %15 = addf %13, %14 : f32 affine.store %15, %2[-%arg3 + %arg6, -%arg4 + %arg7] : memref<128x128xf32, 2> } } } dealloc %10 : memref<128x128xf32, 2> dealloc %7 : memref<128x128xf32, 2> } %3 = affine.apply #map0(%arg3, %arg4) %4 = affine.apply #map1(%arg3, %arg4) // Copy out result matrix. affine.for %arg5 = 0 to 128 { %5 = affine.apply #map2(%arg3, %arg5) affine.for %arg6 = 0 to 128 { %6 = affine.apply #map2(%arg4, %arg6) %7 = affine.load %2[%arg5, %arg6] : memref<128x128xf32, 2> store %7, %arg2[%5, %6] : memref<4096x4096xf32> } } dealloc %2 : memref<128x128xf32, 2> } } ``` With -affine-data-copy -dma: ``` affine.for %arg3 = 0 to 4096 step 128 { %0 = affine.apply #map3(%arg3) %1 = alloc() : memref<128xf32, 2> %2 = alloc() : memref<1xi32> affine.dma_start %arg2[%arg3], %1[%c0], %2[%c0], %c128_0 : memref<4096xf32>, memref<128xf32, 2>, memref<1xi32> affine.dma_wait %2[%c0], %c128_0 : memref<1xi32> %3 = alloc() : memref<1xi32> affine.for %arg4 = 0 to 4096 step 128 { %5 = affine.apply #map0(%arg3, %arg4) %6 = affine.apply #map1(%arg3, %arg4) %7 = alloc() : memref<128x128xf32, 2> %8 = alloc() : memref<1xi32> affine.dma_start %arg0[%arg3, %arg4], %7[%c0, %c0], %8[%c0], %c16384, %c4096, %c128_2 : memref<4096x4096xf32>, memref<128x128xf32, 2>, memref<1xi32> affine.dma_wait %8[%c0], %c16384 : memref<1xi32> %9 = affine.apply #map3(%arg4) %10 = alloc() : memref<128xf32, 2> %11 = alloc() : memref<1xi32> affine.dma_start %arg1[%arg4], %10[%c0], %11[%c0], %c128_1 : memref<4096xf32>, memref<128xf32, 2>, memref<1xi32> affine.dma_wait %11[%c0], %c128_1 : memref<1xi32> affine.for %arg5 = #map3(%arg3) to #map5(%arg3) { affine.for %arg6 = #map3(%arg4) to #map5(%arg4) { %12 = affine.load %7[-%arg3 + %arg5, -%arg4 + %arg6] : memref<128x128xf32, 2> %13 = affine.load %10[-%arg4 + %arg6] : memref<128xf32, 2> %14 = affine.load %1[-%arg3 + %arg5] : memref<128xf32, 2> %15 = mulf %12, %13 : f32 %16 = addf %14, %15 : f32 affine.store %16, %1[-%arg3 + %arg5] : memref<128xf32, 2> } } dealloc %11 : memref<1xi32> dealloc %10 : memref<128xf32, 2> dealloc %8 : memref<1xi32> dealloc %7 : memref<128x128xf32, 2> } %4 = affine.apply #map3(%arg3) affine.dma_start %1[%c0], %arg2[%arg3], %3[%c0], %c128 : memref<128xf32, 2>, memref<4096xf32>, memref<1xi32> affine.dma_wait %3[%c0], %c128 : memref<1xi32> dealloc %3 : memref<1xi32> dealloc %2 : memref<1xi32> dealloc %1 : memref<128xf32, 2> } ``` Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#50 PiperOrigin-RevId: 261221903	2019-08-01 16:31:58 -07:00
Lei Zhang	00a7b6706d	[spirv] Add support for specialization constant This CL extends the existing spv.constant op to also support specialization constant by adding an extra unit attribute on it. PiperOrigin-RevId: 261194869	2019-08-01 14:13:37 -07:00
Denis Khalikov	08ae08cbee	[spirv] Add binary logical operations. Add binary logical operations regarding to the spec section 3.32.15: OpIEqual, OpINotEqual, OpUGreaterThan, OpSGreaterThan, OpUGreaterThanEqual, OpSGreaterThanEqual, OpULessThan, OpSLessThan, OpULessThanEqual, OpSLessThanEqual. Closes tensorflow/mlir#61 PiperOrigin-RevId: 261181281	2019-08-01 13:06:02 -07:00
Lei Zhang	c72d849eb9	Replace the verifyUnusedValue directive with HasNoUseOf constraint verifyUnusedValue is a bit strange given that it is specified in a result pattern but used to generate match statements. Now we are able to support multi-result ops better, we can retire it and replace it with a HasNoUseOf constraint. This reduces the number of mechanisms. PiperOrigin-RevId: 261166863	2019-08-01 11:51:15 -07:00
Lei Zhang	88b175eea5	Migrate pattern symbol binding tests to use TestDialect PiperOrigin-RevId: 261045611	2019-07-31 19:29:07 -07:00
Lei Zhang	e032d0dc63	Fix support for auxiliary ops in declarative rewrite rules We allow to generate more ops than what are needed for replacing the matched root op. Only the last N static values generated are used as replacement; the others serve as auxiliary ops/values for building the replacement. With the introduction of multi-result op support, an op, if used as a whole, may be used to replace multiple static values of the matched root op. We need to consider this when calculating the result range an generated op is to replace. For example, we can have the following pattern: ```tblgen def : Pattern<(ThreeResultOp ...), [(OneResultOp ...), (OneResultOp ...), (OneResultOp ...)]>; // Two op to replace all three results def : Pattern<(ThreeResultOp ...), [(TwoResultOp ...), (OneResultOp ...)]>; // One op to replace all three results def : Pat<(ThreeResultOp ...), (ThreeResultOp ...)>; def : Pattern<(ThreeResultOp ...), [(AuxiliaryOp ...), (ThreeResultOp ...)]>; ``` PiperOrigin-RevId: 261017235	2019-07-31 16:03:42 -07:00
Lei Zhang	e44ba1f8bf	NFC: refactor ODS builder generation Previously we use one single method with lots of branches to generate multiple builders. This makes the method difficult to follow and modify. This CL splits the method into multiple dedicated ones, by extracting common logic into helper methods while leaving logic specific to each builder in their own methods. PiperOrigin-RevId: 261011082	2019-07-31 15:31:13 -07:00
Mahesh Ravishankar	cf66d7bb74	Use operand number during serialization to get the <id>s of the operands During serialization, the operand number must be used to get the values assocaited with an operand. Using the argument number in Op specification was wrong since some of the elements in the arguments list might be attributes on the operation. This resulted in a segfault during serialization. Add a test that exercise that path. PiperOrigin-RevId: 260977758	2019-07-31 12:34:51 -07:00
Denis Khalikov	ce358f9b37	[spirv] Add binary arithmetic operations tensorflow/mlir#2 . Add binary operations such as: OpUdiv, OpSDiv, OpUMod, OpSRem, OpSMod. Closes tensorflow/mlir#56 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/56 from denis0x0D:sandbox/bin_ops_int 4959325a693b4658b978a8b97f79b8237eb39764 PiperOrigin-RevId: 260961681	2019-07-31 11:10:50 -07:00
Alex Zinenko	206be96e63	Support hexadecimal floats in tensor literals Extend the recently introduced support for hexadecimal float literals to tensor literals, which may also contain special floating point values such as infinities and NaNs. Modify TensorLiteralParser to store the list of tokens representing values until the type is parsed instead of trying to guess the tensor element type from the token kinds (hexadecimal values can be either integers or floats, and can be mixed with both). Maintain the error reports as close as possible to the existing implementation to avoid disturbing the tests. They can be improved in a separate clean-up if deemed necessary. PiperOrigin-RevId: 260794716	2019-07-30 14:24:59 -07:00
Mahesh Ravishankar	1de519a753	Add support for (de)serialization of SPIR-V Op Decorations All non-argument attributes specified for an operation are treated as decorations on the result value and (de)serialized using OpDecorate instruction. An error is generated if an attribute is not an argument, and the name doesn't correspond to a Decoration enum. Name of the attributes that represent decoerations are to be the snake-case-ified version of the Decoration name. Add utility methods to convert to snake-case and camel-case. PiperOrigin-RevId: 260792638	2019-07-30 14:15:03 -07:00
Alex Zinenko	3b207d3691	Add support for hexadecimal float literals MLIR does not have support for parsing special floating point values such as infinities and NaNs. If programmatically constructed, these values are printed as NaN and (+-)Inf and cannot be parsed back. Add parser support for hexadecimal literals in float attributes, following LLVM IR. The literal corresponds to the in-memory representation of the floating point value. IEEE 754 defines a range of possible values for NaNs, storing the bitwise representation allows MLIR to properly roundtrip NaNs with different bit values of significands. The initial version of this commit was missing support for float literals that used to be printed in decimal notation as a fallback, but ended up being printed in hexadecimal format which became the fallback for special values. The decimal fallback behavior was not exercised by tests. It is currently reinstated and tested by the newly added test @f32_potential_precision_loss in parser.mlir. PiperOrigin-RevId: 260790900	2019-07-30 14:06:26 -07:00
Mahesh Ravishankar	ea56025f1e	Initial implementation to translate kernel fn in GPU Dialect to SPIR-V Dialect This CL adds an initial implementation for translation of kernel function in GPU Dialect (used with a gpu.launch_kernel) op to a spv.Module. The original function is translated into an entry function. Most of the heavy lifting is done by adding TypeConversion and other utility functions/classes that provide most of the functionality to translate from Standard Dialect to SPIR-V Dialect. These are intended to be reusable in implementation of different dialect conversion pipelines. Note : Some of the files for have been renamed to be consistent with the norm used by the other Conversion frameworks. PiperOrigin-RevId: 260759165	2019-07-30 11:55:55 -07:00
Denis Khalikov	4598c04dfe	[spirv] Add binary arithmetic operations. Add binary operations such as: OpIAdd, OpFAdd, OpISub, OpFSub, OpIMul, OpFDiv, OpFRem, OpFMod. Closes tensorflow/mlir#54 PiperOrigin-RevId: 260734166	2019-07-30 11:55:12 -07:00
Alex Zinenko	c7dab559ba	RewriterGen: properly handle zero-result ops RewriterGen was emitting invalid C++ code if the pattern required to create a zero-result operation due to the absence of a special case that would avoid generating a spurious comma. Handle this case. Also add rewriter tests for zero-argument operations. PiperOrigin-RevId: 260576998	2019-07-30 06:17:50 -07:00
Mehdi Amini	395c70c600	Fix SingleBlockImplicitTerminator traits to catch empty blocks The code was written with the assumption that on failure an error would be issued by another verifier. However verification is stopping on the first failure which lead to an empty output. Instead we make sure an error is displayed. Also add tests in the test dialect for this trait. PiperOrigin-RevId: 260541290	2019-07-30 06:17:35 -07:00
Mahesh Ravishankar	673bb7cbbe	Enable (de)serialization support for spirv::AccessChainOp Automatic generation of spirv::AccessChainOp (de)serialization needs the (de)serialization emitters to handle argument specified as Variadic<...>. To handle this correctly, this argument can only be the last entry in the arguments list. Add a test to (de)serialize spirv::AccessChainOp PiperOrigin-RevId: 260532598	2019-07-30 06:17:19 -07:00
Mehdi Amini	d5a02fcd96	Add a `HasParent` operation trait to enforce a specific parent on an operation (NFC) PiperOrigin-RevId: 260532592	2019-07-30 06:17:11 -07:00
Mehdi Amini	b2c2b4bb1d	[mlir-tblgen] Emit forward declarations for all the classes before the definitions This allows classes to refer to each other in the ODS file, for instance for traits. PiperOrigin-RevId: 260532419	2019-07-30 06:17:03 -07:00
Andy Davis	f4c6c23ac0	Verify that affine.load/store/dma_start/dma_wait operands are valid dimension or symbol identifiers. PiperOrigin-RevId: 260197567	2019-07-27 08:20:38 -07:00
Nicolas Vasilache	1304331926	Automated rollback of commit `3708f53219` PiperOrigin-RevId: 260136255	2019-07-26 11:05:17 -07:00
Nicolas Vasilache	3708f53219	Add sgemm specializations - NFC This CL adds a few specializations for sgemm. A minor change to alpha is made in cblas_interface.cpp to be compatible with actual BLAS calls. For now this is for internal testing purposes only. PiperOrigin-RevId: 260129027	2019-07-26 05:41:23 -07:00
Lei Zhang	9f02e88946	Support referencing a single value generated by a matched multi-result op It's quite common that we want to put further constraints on the matched multi-result op's specific results. This CL enables referencing symbols bound to source op with the `__N` syntax. PiperOrigin-RevId: 260122401	2019-07-26 04:31:46 -07:00
Nicolas Vasilache	54175c240a	Fix backward slice corner case In the backward slice computation, BlockArgument coming from function arguments represent a natural boundary for the traversal and should not trigger llvm_unreachable. This CL also improves the error message and adds a relevant test. PiperOrigin-RevId: 260118630	2019-07-26 03:49:17 -07:00
Nicolas Vasilache	00b48e1a9f	Fix linalg_matmul_impl interfacing with sgemm This CL provides a fix that makes linal_matmul_impl compliant with the BLAS interface. Before this CL it would compute either C += A * B when called with cblas.cpp:cblas_sgemm implementation and C = A * B with other implementations. PiperOrigin-RevId: 260117367	2019-07-26 03:34:21 -07:00
Nicolas Vasilache	fae4d94990	Use "standard" load and stores in LowerVectorTransfers Clipping creates non-affine memory accesses, use std_load and std_store instead of affine_load and affine_store. In the future we may also want a fill with the neutral element rather than clip, this would make the accesses affine if we wanted more analyses and transformations to happen post lowering to pointwise copies. PiperOrigin-RevId: 260110503	2019-07-26 02:34:24 -07:00
Mehdi Amini	387638f9c2	Automated rollback of commit `fc194a4f22` PiperOrigin-RevId: 260037115	2019-07-25 19:48:17 -07:00
Denis Khalikov	6552025736	[spirv] Add AccessChainOp operation. AccessChainOp creates a pointer into a composite object that can be used with OpLoad and OpStore. Closes tensorflow/mlir#52 PiperOrigin-RevId: 260035676	2019-07-25 15:43:12 -07:00
Alex Zinenko	fc194a4f22	Add support for hexadecimal float literals MLIR does not have support for parsing special floating point values such as infinities and NaNs. If programmatically constructed, these values are printed as NaN and (+-)Inf and cannot be parsed back. Add parser support for hexadecimal literals in float attributes, following LLVM IR. The literal corresponds to the in-memory representation of the floating point value. IEEE 754 defines a range of possible values for NaNs, storing the bitwise representation allows MLIR to properly roundtrip NaNs with different bit values of significands. PiperOrigin-RevId: 260018802	2019-07-25 14:16:02 -07:00
River Riddle	1293708473	Add support for an analysis mode to DialectConversion. This mode analyzes which operations are legalizable to the given target if a conversion were to be applied, i.e. no rewrites are ever performed even on success. This mode is useful for device partitioning or other utilities that may want to analyze the effect of conversion to different targets before performing it. The analysis method currently just fills a provided set with the operations that were found to be legalizable. This can be extended in the future to capture more information as necessary. PiperOrigin-RevId: 259987105	2019-07-25 11:31:07 -07:00
Nicolas Vasilache	dd652ce9cc	Fix backward slice computation to iterate through known control flow This CL fixes an oversight with dealing with loops in slicing analysis. The forward slice computation properly propagates through loops but not the backward slice. Add relevant unit tests. PiperOrigin-RevId: 259903396	2019-07-25 01:33:35 -07:00
Alex Zinenko	60965b4612	Move GPU dialect to {lib,include/mlir}/Dialect Per tacit agreement, individual dialects should now live in lib/Dialect/Name with headers in include/mlir/Dialect/Name and tests in test/Dialect/Name. PiperOrigin-RevId: 259896851	2019-07-25 00:41:17 -07:00
Alex Zinenko	055d7dedcb	Move SPIRV dialect tests under test/Dialect This was overlooked when the dialect code was moved under {lib,include/mlir}/Dialect. NFC. PiperOrigin-RevId: 259801927	2019-07-24 13:13:39 -07:00
Nicolas Vasilache	8ebb4281aa	Cleanup slicing test. Remove hardcoded SSA names and make use of CHECK-LABEL directives. PiperOrigin-RevId: 259767803	2019-07-24 10:28:33 -07:00
Nicolas Vasilache	1ff95e5120	Enable multi-level Linalg fusion This CL adds support for SubViewOp in the alias analysis to permit multiple Linalg fusion passes to compose. The debugging messages are also improved for better readability. The readability benefits came in handy when tracking this issue. A 2-level fusion test is added to capture the new behavior. PiperOrigin-RevId: 259720246	2019-07-24 05:10:54 -07:00
MLIR Team	8cb82c9478	Add sitofp to the standard dialect Conversion from integers (window or input size, padding etc) to floating point is required to express many ML kernels, for example average pooling. PiperOrigin-RevId: 259575284	2019-07-23 11:23:40 -07:00
Alex Zinenko	480d68f8de	Affine loop parallelism detection: conservatively handle unknown ops The loop parallelism detection utility only collects the affine.load and affine.store operations appearing inside the loop to analyze the access patterns for the absence of dependences. However, any operation, including unregistered operations, can appear in a body of an affine loop. If such operation has side effects, the result of parallelism analysis is incorrect. Conservatively assume affine loops are not parallel in presence of operations other than affine.load, affine.store, affine.for, affine.terminator that may have side effects. This required to update the loop-fusion unit test that relies on parallelism analysis and was exercising loop fusion in presence of an unregistered operation. PiperOrigin-RevId: 259560935	2019-07-23 10:18:46 -07:00
Alex Zinenko	0aed5222d8	ODS: support UnitAttr in Operation definitions A recent commit introduced UnitAttr into the ODS but did not include the support for using UnitAttrs in operation definitions (only patterns were supported). Extend the ODS definition of UnitAttr to be usable in operation definition by providing a trivial builder and an accessor that returns "true" if the unit attribute is present since the attribute presence itself has meaning. Additionally, test that unit attributes are effectively rewritten in patterns in addition to the already available FileCheck tests of the generated rewriter code. PiperOrigin-RevId: 259560653	2019-07-23 10:17:29 -07:00
Alex Zinenko	8543f8aaba	Introduce LLVMFuncOp Originally, MLIR only supported functions of the built-in FunctionType. On the conversion path to LLVM IR, we were creating MLIR functions that contained LLVM dialect operations and used LLVM IR types for everything expect top-level functions (e.g., a second-order function would have a FunctionType that consume or produces a wrapped LLVM function pointer type). With MLIR functions becoming operations, it is now possible to introduce non-built-in function operations. This will let us use conversion patterns for function conversion, simplify the MLIR-to-LLVM translation by removing the knowledge of the MLIR built-in function types, and provide stronger correctness verifications (e.g. LLVM functions only accept LLVM types). Furthermore, we can currently construct a situation where the same function is used with two different types: () -> () when its specified and called directly, and !llvm<"void ()"> when it's passed somewhere on called indirectly. Having a special function-op that is always of !llvm<"void ()"> type makes the function model and the llvm dialect type system more consistent. Introduce LLVMFuncOp to represent a function in the LLVM dialect. Unlike standard FuncOp, this function has an LLVMType wrapping an LLVM IR function type. Generalize the common behavior of function-defining operations (functions live in a symbol table of a module, contain a single region, are iterable as a list of blocks, and support argument attributes). This only defines the operation. Custom syntax, conversion and translation rules will be added in follow-ups. The operation name mentions LLVM explicitly to avoid confusion with standard FuncOp, especially in multiple files that use both `mlir` and `mlir::LLVM` namespaces. PiperOrigin-RevId: 259550940	2019-07-23 09:26:39 -07:00
River Riddle	42a767b23d	Allow std.constant to hold a boolean value. This was an oversight in the original implementation, std.constant already supports IntegerAttr just not BoolAttr. PiperOrigin-RevId: 259467710	2019-07-22 21:43:37 -07:00
Uday Bondhugula	b5f8a4be27	Introduce parser library method to parse list of region arguments - introduce parseRegionArgumentList (similar to parseOperandList) to parse a list of region arguments with a delimiter - allows defining custom parse for op's with multiple/variadic number of region arguments - use this on the gpu.launch op (although the latter has a fixed number of region arguments) - add a test dialect op to test region argument list parsing (with the no delimiter case) Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#40 PiperOrigin-RevId: 259442536	2019-07-22 17:42:08 -07:00
River Riddle	40493a07a3	Emit an error for missing '[' when parsing an AffineMapOfSSAIds. Fixes tensorflow/mlir#51 PiperOrigin-RevId: 259415034	2019-07-22 15:06:44 -07:00
Lei Zhang	8b8caa888d	(De)serialize composite spv.constant This CL covers the case of composite spv.constant. We encode/decode them into/from OpConstantComposite/OpConstantNull. PiperOrigin-RevId: 259394700	2019-07-22 13:31:00 -07:00
Lei Zhang	83c97a6784	(De)serialize float scalar spv.constant This CL adds support for float scalar spv.constant in (de)serialization. PiperOrigin-RevId: 259311776	2019-07-22 06:02:32 -07:00
Lei Zhang	c1844220cd	(De)serialize bool and integer scalar spv.constant SPIR-V has multiple constant instructions covering different constant types: * `OpConstantTrue` and `OpConstantFalse` for boolean constants * `OpConstant` for scalar constants * `OpConstantComposite` for composite constants * `OpConstantNull` for null constants * ... We model them all with a single spv.constant op for uniformity and friendliness to transformations. This does mean that when doing (de)serialization, we need to poke spv.constant's type to determine which SPIR-V binary instruction to use. This CL only covers the case of bool and integer spv.constant. The rest will follow. PiperOrigin-RevId: 259311698	2019-07-22 06:02:08 -07:00
Nicolas Vasilache	48a1baeb8a	Refactor LoopParametricTiling as a test pass - NFC This CL moves LoopParametricTiling into test/lib as a pass for purely testing purposes. PiperOrigin-RevId: 259300264	2019-07-22 04:31:17 -07:00
Alex Zinenko	fa86c89917	SingleBlockImplicitTerminator: report the wrong terminator op found In the trait verifier of SingleBlockImplicitTerminator, report the name of the unexpected terminator op found in the end of the block in addition to the name of the expected terminator op. This may simplify debugging, especially in cases where the terminator is omitted for brevity and/or after a long series of conversions. PiperOrigin-RevId: 259287452	2019-07-22 02:42:06 -07:00
River Riddle	00bdc8e070	Refactor region type signature conversion to be explicit via patterns. This cl enforces that the conversion of the type signatures for regions, and thus their entry blocks, is handled via ConversionPatterns. A new hook 'applySignatureConversion' is added to the ConversionPatternRewriter to perform the desired conversion on a region. This also means that the handling of rewriting the signature of a FuncOp is moved to a pattern. A default implementation is provided via 'mlir::populateFuncOpTypeConversionPattern'. This removes the hacky implicit 'dynamically legal' status of FuncOp that was present previously, and leaves it up to the user to decide when/how to convert the signature of a function. PiperOrigin-RevId: 259161999	2019-07-20 19:06:07 -07:00
Mahesh Ravishankar	2fb53e65ab	Add (de)serialization of EntryPointOp and ExecutionModeOp Since the serialization of EntryPointOp contains the name of the function as well, the function serialization emits the function name using OpName instruction, which is used during deserialization to get the correct function name. PiperOrigin-RevId: 259158784	2019-07-20 18:12:05 -07:00
Alex Zinenko	6b436eacbc	Merge TypeUtilities library into the IR library The TypeUtilities.{cpp,h}, currently living in {lib,include/mlir}/Support, do not belong to the Support library. Instead, they form a separate utility library that depends on the IR library. The operations it provides relate to standard types (tensors, memrefs) as well as to operation manipulation, making them a better fit for the main IR library. PiperOrigin-RevId: 259108314	2019-07-20 03:04:22 -07:00
Lei Zhang	05ab648740	[spirv] Avoid printing duplicate trailing type When printing the value attribute in spv.constant, OpAsmPrinter already attaches a trailing type. So we don't need to duplicate it again unless it's an array attribute, which does not have type by default but we use it for spirv::ArrayType. PiperOrigin-RevId: 258994197	2019-07-19 11:41:49 -07:00
Alex Zinenko	6fe99662aa	Move loop dialect tests into separate files - NFC This was overlooked when moving out loop operations from Standard to a separate dialect. PiperOrigin-RevId: 258970115	2019-07-19 11:41:12 -07:00
Lei Zhang	89a10b73fb	Add missing MLIRDialect dependency for MLIRDialect Tests for the Broadcastable trait was added in a previous CL, which makes MLIRDialect depend on MLIRDialect now. PiperOrigin-RevId: 258967332	2019-07-19 11:41:06 -07:00
Mahesh Ravishankar	03c8303a12	Make SPIR-V spv.EntryPoint and spv.ExecutionMode consistent with SPIR-V spec This CL changes the Op definition of spirv::EntryPointOp and spirv::ExecutionModeOp to be consistent with the SPIR-V spec. 1) The EntryPointOp doesn't return a value 2) The ExecutionModeOp takes as argument, the SymbolRefAttr to refer to the function, instead of the result of the EntryPointOp. Following this, the spirv::EntryPointType is no longer necessary, and is removed. PiperOrigin-RevId: 258964027	2019-07-19 11:40:58 -07:00
Alex Zinenko	287d111023	Generalize implicit terminator into an OpTrait Several groups of operations in different dialects (e.g. AffineForOp, AffineIfOp; loop::ForOp, loop::IfOp) share the requirement for their regions to contain 0 or 1 block, and for blocks to always have a specific terminator type. Furthermore, this terminator may be omitted from the custom syntax. Generalize this behavior into OpTrait::SingleBlockImplicitTerminator, parameterized by the terminator operation type. This trait provides the verifier that checks the presence of the terminator, and utility functions adding the terminator in case of absence. PiperOrigin-RevId: 258957180	2019-07-19 11:40:51 -07:00
Nicolas Vasilache	6204acacc7	Uniformize test name - NFC PiperOrigin-RevId: 258956693	2019-07-19 11:40:43 -07:00
Nicolas Vasilache	db4cd1c8dc	Utility function to map a loop on a parametric grid of virtual processors This CL introduces a simple loop utility function which rewrites the bounds and step of a loop so as to become mappable on a regular grid of processors whose identifiers are given by SSA values. A corresponding unit test is added. For example, using CUDA terminology, and assuming a 2-d grid with processorIds = [blockIdx.x, threadIdx.x] and numProcessors = [gridDim.x, blockDim.x], the loop: ``` loop.for %i = %lb to %ub step %step { ... } ``` is rewritten into a version resembling the following pseudo-IR: ``` loop.for %i = %lb + threadIdx.x + blockIdx.x * blockDim.x to %ub step %gridDim.x * blockDim.x { ... } ``` PiperOrigin-RevId: 258945942	2019-07-19 11:40:31 -07:00
Nicolas Vasilache	5bc344743c	Uniformize the API for the mlir::tile functions on AffineForOp and loop::ForOp This CL adapts the recently introduced parametric tiling to have an API matching the tiling of AffineForOp. The transformation using stripmineSink is more general and produces imperfectly nested loops. Perfect nesting invariants of the tiled version are obtained by selectively applying hoisting of ops to isolate perfectly nested bands. Such hoisting may fail to produce a perfect loop nest in cases where ForOp transitively depend on enclosing induction variables. In such cases, the API provides a LogicalResult return but the SimpleParametricLoopTilingPass does not currently use this result. A new unit test is added with a triangular loop for which the perfect nesting property does not hold. For this example, the old behavior was to produce IR that did not verify (some use was not dominated by its def). PiperOrigin-RevId: 258928309	2019-07-19 11:40:25 -07:00
River Riddle	28057ff3da	Add support for providing a legality callback for dynamic legality in DialectConversion. This allows for providing specific handling for dynamically legal operations/dialects without overriding the general 'isDynamicallyLegal' hook. This also means that a derived ConversionTarget class need not always be defined when some operations are dynamically legal. Example usage: ConversionTarget target(...); target.addDynamicallyLegalOp<ReturnOp>([](ReturnOp op) { return ... }; target.addDynamicallyLegalDialect<StandardOpsDialect>([](Operation *op) { return ... }; PiperOrigin-RevId: 258884753	2019-07-19 11:40:19 -07:00
River Riddle	8b447b6cad	NFC: Expose a ConversionPatternRewriter for use with ConversionPatterns. This specific PatternRewriter will allow for exposing hooks in the future that are only useful for the conversion framework, e.g. type conversions. PiperOrigin-RevId: 258818122	2019-07-19 11:40:00 -07:00
Feng Liu	701266c47a	Add an "is_signed" attribute to the quant_ConstFakeQuant op Some TensorFlow simulated quantize ops such as QuantizeAndDequantizeV2Op have attribute for the sign of the quantization, so quant_ConstFakeQuant should be able to represent it with the new attribute is added. The method for converting these attributes to an QuantizedType is updated to handle this new argument. PiperOrigin-RevId: 258810290	2019-07-19 11:39:54 -07:00
Lei Zhang	9f498f921b	Print boolean values in ElementsAttr as "true"/"false" We already parse boolean "true"/"false" as ElementsAttr elements. This CL makes it round-trippable that we are printing the same way. PiperOrigin-RevId: 258784962	2019-07-19 11:39:35 -07:00
Jing Pu	e558c040aa	Add UnitAttr in OpBase.td. Note that UnitAttr cannot be used for op definition yet. PiperOrigin-RevId: 258693338	2019-07-19 11:39:29 -07:00
Mahesh Ravishankar	c6cfebf1af	Automatically generate (de)serialization methods for SPIR-V ops For ops in SPIR-V dialect that are a direct mirror of SPIR-V operations, the serialization/deserialization methods can be automatically generated from the Op specification. To enable this an 'autogenSerialization' field is added to SPV_Ops. When set to non-zero, this will enable the automatic (de)serialization function generation Also adding tests that verify the spv.Load, spv.Store and spv.Variable ops are serialized and deserialized correctly. To fully support these tests also add serialization and deserialization of float types and spv.ptr types PiperOrigin-RevId: 258684764	2019-07-19 11:39:22 -07:00
Smit Hinsu	68c409238e	Simplify broadcastable traits We only verify broadcastable trait verifier and don't care about mutations so removed all CHECK statements and FileCheck invocation. PiperOrigin-RevId: 258662882	2019-07-19 11:39:10 -07:00
River Riddle	d097cc6119	Add support for parsing/printing the trailing type of a dialect attribute. This cl standardizes the printing of the type of dialect attributes to work the same as other attribute kinds. The type of dialect attributes will trail the dialect specific portion: `#` dialect-namespace `<` attr-data `>` `:` type The attribute parsing hooks on Dialect have been updated to take an optionally null expected type for the attribute. This matches the respective parseAttribute hooks in the OpAsmParser. PiperOrigin-RevId: 258661298	2019-07-19 11:39:04 -07:00

... 2 3 4 5 6 ...

1227 Commits