Now that linalg.view and strided memrefs are unified, there is no reason to
disallow AllocOp in alias analysis. This CL adds support for AllocOp, which
allows writing shorter tests that do not require explicitly creating a view
for each operation.
PiperOrigin-RevId: 273303060
Add a new `typeDescription` field (`description` was already used by the base constraint class) to allow writing longer descriptions about a type being defined. This allows providing additional information/rationale for a defined type. `description` is currently used as the heading/name for the type in the generated documentation.
PiperOrigin-RevId: 273299332
See RFC: https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/xE2IzfhE3Wg.
Opaque location stores two pointers: one of them points to some data structure that is external to MLIR, and the other one is unique for each type and represents the type id of that data structure. OpaqueLoc also stores an optional fallback location that can be used if the first one is not suitable.
OpaqueLoc is managed similarly to FileLineColLoc. It is passed around by MLIR transformations and can be used in compound locations like CallSiteLoc.
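A minimal sketch of the intended use, assuming a hypothetical frontend node type `MyNode` (accessor names follow this change):
```
struct MyNode { int id; };

Location makeLoc(MyNode *node, MLIRContext *context) {
  // Fallback used whenever a consumer cannot interpret the opaque pointer.
  Location fallback =
      FileLineColLoc::get("input.dsl", /*line=*/12, /*column=*/3, context);
  return OpaqueLoc::get<MyNode *>(node, fallback);
}

MyNode *getNode(Location loc) {
  // Each instantiated pointer type gets its own type id, so the isa check
  // distinguishes MyNode locations from other opaque locations.
  if (OpaqueLoc::isa<MyNode *>(loc))
    return OpaqueLoc::getUnderlyingLocation<MyNode *>(loc);
  return nullptr;
}
```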
PiperOrigin-RevId: 273266510
This allows confirming that a scalar argument has the same element type as a shaped one. It's easy to validate a type is shaped on its own if that's desirable, so this shouldn't make that use case harder. This matches the behavior of other traits that operate on element type (e.g. AllElementTypesMatch). Also this makes the code simpler because now we just use getElementTypeOrSelf.
Verified that all uses in core already check the type is shaped in another way.
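As an illustration, the relaxed check amounts to the following sketch (assuming two arbitrary operand types):
```
static bool elementTypesMatch(Type lhs, Type rhs) {
  // For shaped types this compares element types; for any other type it
  // compares the type itself, so a scalar f32 matches tensor<?xf32>.
  return getElementTypeOrSelf(lhs) == getElementTypeOrSelf(rhs);
}
```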
PiperOrigin-RevId: 273068507
Use `getParentOfType<FunctionOp>()` instead of `cast<FuncOp>(getParentOp())`
to avoid crash when return ops are used inside spv.selection/spv.loop.
PiperOrigin-RevId: 273006041
Add support for the OpUndef instruction. Update the dialect
generation script to fix a few bugs in the instruction spec
generation.
PiperOrigin-RevId: 272975685
Add builder functions for spv._address_of, spv.EntryPoint,
spv.ExecutionMode and spv.Load to make it easier to create these
operations.
Fix a minor bug in printing of spv.EntryPoint
Add a utility function to get the attribute name associated with a
decoration.
PiperOrigin-RevId: 272952846
Certain lowering patterns were reported as [missing](https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/dkdmHa77sSQ).
This CL adds them and allows Linalg/roundtrip.mlir and Linalg/loops.mlir to lower to LLVM directly. Those 2 tests are updated to additionally check that the direct lowering to LLVM does not crash.
The following points, left as TODOs still need to be addressed for correct end-to-end execution:
1. the lowering for ConvOp needs to pass attributes such as strides and dilations; the external library call needs to support it.
2. the lowering for GenericOp needs to support lowering to loops as a DialectConversion pattern. This is blocked on the DialectConversion infrastructure accepting an OperationFolder.
PiperOrigin-RevId: 272878131
The GPUIndexIntrinsicOpLowering template is currently used by the code in both the GPUToNVVM and GPUToROCDL dirs.
Moving it to a common location to remove code duplication.
Closes tensorflow/mlir#163
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/163 from deven-amd:deven-refactor-gpu-index-ops-lowering b8dc2a5f5353df196039b6ff2ad42106028693ed
PiperOrigin-RevId: 272863297
Some dialects have implicit conversions inherent in their modeling, meaning that a call may have a different type than the type that the callable expects. To support this, a hook is added to the dialect interface that allows for materializing conversion operations during inlining when there is a mismatch. A hook is also added to the callable interface to allow for introspecting the expected result types.
PiperOrigin-RevId: 272814379
This allows for the inliner to work on arbitrary call operations. The updated inliner will also work bottom-up through the callgraph enabling support for multiple levels of inlining.
PiperOrigin-RevId: 272813876
The first dim length of the axisStats attribute should equal the slice size
of the input argument when split along the axis dimension.
PiperOrigin-RevId: 272798042
This CL implements the last remaining bit of the [strided memref proposal](https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/MaL8m2nXuio).
The syntax is a bit more explicit than what was originally proposed and resembles:
`memref<?x?xf32, offset: 0 strides: [?, 1]>`
Nonnegative strides and offsets are currently supported. Future extensions will include negative strides.
This also gives a concrete example of syntactic sugar for the [[RFC] Proposed Changes to MemRef and Tensor MLIR Types](https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/-wKHANzDNTg).
The underlying implementation still uses AffineMap layout.
PiperOrigin-RevId: 272717437
Module names are optional so it makes more sense to take and return an optional
any time the name is involved. Also update the language reference to reflect
the module names.
PiperOrigin-RevId: 272684698
Modules are now Ops and, as such, can be nested. They do not produce an SSA
value so there is no possibility to refer to them in the IR. Introduce support
for symbol names attached to the module Op so that it can be referred to using
SymbolRefAttrs. The name is optional, for example the implicit top-level module
does not have a name.
PiperOrigin-RevId: 272671600
This makes the name of the conversion pass more consistent with the naming
scheme, since it actually converts from the Loop dialect to the Standard
dialect rather than working with arbitrary control flow operations.
PiperOrigin-RevId: 272612112
As specified in the MLIR language reference and rationale documents, `memref`
types should not be allowed to have `index` as element types. As observed in
https://groups.google.com/a/tensorflow.org/forum/#!msg/mlir/P49hVWqTMNc/nW89a4i_AgAJ
this restriction was lifted when canonicalization unit tests for affine
operations were introduced, without sufficient motivation to lift the
restriction itself. The test in question can be trivially rewritten (return
the value from a function instead of storing it to prevent DCE from removing
the producer operation) and the restriction put back in place.
If `memref<...x index>` is relevant for some use cases, the relaxation of the
type system can be implemented separately with appropriate modifications to the
documentation.
PiperOrigin-RevId: 272607043
This also adds coverage with a missing test, which uncovered a bug in the conditional for testing whether an offset is dynamic or not.
PiperOrigin-RevId: 272505798
Similar to spv.loop, spv.selection is another op for modelling
SPIR-V structured control flow. It covers both OpBranchConditional
and OpSwitch with OpSelectionMerge.
Instead of having a `spv.SelectionMerge` op to directly model
selection merge instruction for indicating the merge target,
we use regions to delimit the boundary of the selection: the
merge target is the next op following the `spv.selection` op.
This way it's easier to discover all blocks belonging to
the selection and it plays nicer with the MLIR system.
PiperOrigin-RevId: 272475006
This is a follow-up to the PR tensorflow/mlir#146 which introduced the ROCDL Dialect. This PR introduces a pass to lower GPU Dialect to the ROCDL Dialect. As with the previous PR, this one builds on the work done by @whchung, and addresses most of the review comments in the original PR.
Closes tensorflow/mlir#154
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/154 from deven-amd:deven-lower-gpu-to-rocdl 809893e08236da5ab6a38e3459692fa04247773d
PiperOrigin-RevId: 272390729
This exposes hooks for accessing internal dominance nodes, and updating the internal DFS numbers.
Closes tensorflow/mlir#151
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/151 from schweitzpgi:dominance_hooks 69d14214a423b816cbd59feffcacdd02f3b5f921
PiperOrigin-RevId: 272287352
A recent ABI compatibility change affected the conversion from standard
CallOp/CallIndirectOp to LLVM::CallOp by changing its signature. In order to
analyze the signature, the code was looking up the callee symbol in the module.
This is incorrect since, during the conversion, the module may contain both the
original and the converted function op that have the same symbol name. There is
no strict guarantee on which of the two symbols will be found by the lookup.
The conversion was not failing because the type legalizer converts the LLVM
types to themselves making the original and the converted function signatures
ultimately produce the same type.
Instead of looking up the function signature to get the list of result types,
use the types of the CallOp/CallIndirectOp results which must match those of
the function in valid IR. These types are guaranteed to be the original,
unconverted types when converting the operation. Furthermore, this avoids the
need to perform a lookup of a symbol name in the module which may be expensive.
Finally, propagate attributes as-is from the original op to the converted op
since they share the attribute name for the callee of direct calls and the rest
of attributes are not affected by the conversion. This removes the need for
additional contortions between direct and indirect calls to extract the name of
the optional callee attribute only to insert it back. This also prevents the
conversion from unintentionally dropping the other attributes of the op.
PiperOrigin-RevId: 272218871
This CL finishes the implementation of the Linalg + Affine type unification of the [strided memref RFC](https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/MaL8m2nXuio).
As a consequence, the !linalg.view type, linalg::DimOp, linalg::LoadOp and linalg::StoreOp can now disappear and Linalg can use standard types everywhere.
PiperOrigin-RevId: 272187165
Perform the second reduce only with the first warp. This requires an additional __syncthreads(), but doesn't need special handling when the last warp is small. This simplifies support for block sizes that are not a multiple of 32.
Supporting partial warp reduce will be done in a separate CL.
PiperOrigin-RevId: 272168917
For the cases where there are multiple levels of nested pass managers, the parent thread ID is not enough to distinguish the parent of a given pass pipeline. Passing in the parent pass gives an exact anchor point.
PiperOrigin-RevId: 272105461
According to the SPIR-V spec:
"Length is the number of elements in the array. It must be at least 1."
Closes tensorflow/mlir#160
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/160 from denis0x0D:sandbox/array_len 0840dc0986ad0088a3aa7d5d8d3e97d489377ed9
PiperOrigin-RevId: 272094669
Add DeclareOpInterfaceFunctions to enable specifying whether OpInterfaceMethods
for an OpInterface should be generated automatically. This avoids needing to
declare the extra methods, while still allowing function declarations to be added by way of traits/inheritance.
Most of this change is mechanical/extracting classes to be reusable.
PiperOrigin-RevId: 272042739
This CL finishes the implementation of the lowering part of the [strided memref RFC](https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/MaL8m2nXuio).
Strided memrefs correspond conceptually to the following templated C++ struct:
```
template <typename Elem, size_t Rank>
struct {
  Elem *ptr;
  int64_t offset;
  int64_t sizes[Rank];
  int64_t strides[Rank];
};
```
The linearization procedure for address calculation for strided memrefs is the same as for linalg views:
`base_offset + SUM_i index_i * stride_i`.
The following CL will unify Linalg and Standard by removing !linalg.view in favor of strided memrefs.
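For illustration, the same linearization written out on plain integers (names are illustrative; the actual lowering emits the equivalent LLVM dialect arithmetic):
```
int64_t linearAddress(int64_t offset, ArrayRef<int64_t> indices,
                      ArrayRef<int64_t> strides) {
  int64_t address = offset; // base_offset
  for (unsigned i = 0, e = indices.size(); i < e; ++i)
    address += indices[i] * strides[i]; // SUM_i index_i * stride_i
  return address;
}
```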
PiperOrigin-RevId: 272033399
Add operations corresponding to OpLogicalAnd, OpLogicalNot,
OpLogicalEqual, OpLogicalNotEqual and OpLogicalOr instructions in
SPIR-V dialect. This needs changes to class hierarchy in SPIR-V
TableGen files to split SPIRVLogicalOp into SPIRVLogicalUnaryOp and
SPIRVLogicalBinaryOp. All derived classes of SPIRVLogicalOp are
updated accordingly.
Update the spirv dialect generation script to
1) Allow specifying, separately, the base class to use for instruction spec
generation and the file name to generate the specification in.
2) Use the existing descriptions for operations.
3) Update define_inst.sh to also invoke define_opcode.sh so that the
corresponding SPIR-V instruction opcode enum is defined.
PiperOrigin-RevId: 272014876
MemRefType::getStrides uses AffineExpr::walk which operates in post-order from the leaves. In order to compute strides properly, it needs to escape on terminal nodes and analyze binary ops only. This did not work for AffineExpr that consist of a single term (i.e. without a binary op).
This CL fixes the corner case and adds relevant tests.
PiperOrigin-RevId: 271975746
Use OpInterfaces to add an interface for ops defining a return type function.
This change does not use this trait in any meaningful way; I'll use it in a
follow-up to generalize and unify some of the op type traits/constraints. Also,
currently the infer type function can only be manually specified in C++; that should rather be the fallback in the future.
PiperOrigin-RevId: 271883746
The generated build methods now have the result type before the arguments (operands and attributes, which are also now adjacent in the explicit create method). This also changes the create method's ordering to match most build methods' ordering.
PiperOrigin-RevId: 271755054
This is more consistent with other dump methods. Otherwise, successive Value dumps are concatenated on the same line, hurting readability.
PiperOrigin-RevId: 271669846
- also remove stale terminology/references in docs
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#148
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/148 from bondhugula:cleanup e846b641a3c2936e874138aff480a23cdbf66591
PiperOrigin-RevId: 271618279
The strided MemRef RFC discusses a normalized descriptor and interaction with library calls (https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/MaL8m2nXuio).
Lowering of nested LLVM structs as value types does not play nicely with externally compiled C/C++ functions due to ABI issues.
Solving the ABI problem in general is a very complex task that most likely involves taking
a dependence on clang, which we do not want at the moment.
A simple workaround is to pass pointers to memref descriptors at function boundaries, which this CL implements.
PiperOrigin-RevId: 271591708
linalg_integration_test.mlir and simple.mlir were temporarily disabled due to an OSS-only failure.
The issue is that, once created, an llvm::Error must be explicitly checked before it can be discarded or overwritten.
This CL fixes the issue and reenables the tests.
PiperOrigin-RevId: 271589651
This commit introduces the ROCDL Dialect (i.e. the ROCDL ops + the code to lower those ROCDL ops to LLVM intrinsics/functions). Think of the ROCDL Dialect as analogous to the NVVM Dialect, but for AMD GPUs. This patch contains just the essentials needed to get a simple example up and running. We expect to make further additions to the ROCDL Dialect.
This is the first of 3 commits; the follow-ups will be:
* add a pass that lowers GPU Dialect to ROCDL Dialect
* add a "mlir-rocm-runner" utility
Closes tensorflow/mlir#146
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/146 from deven-amd:deven-rocdl-dialect e78e8005c75a78912631116c78dc844fcc4b0de9
PiperOrigin-RevId: 271511259
This CL modifies the linalg-fusion pass such that it does not tile anymore as part of the pass. Tiling is a separate concern that enables linalg fusion but should happen before.
This makes fusion more composable with other decisions.
In particular the fusion pass now becomes greedy and only applies the transformation on a best-effort basis.
This should also let fusion work in a multi-hop fashion with chains of producer/consumers.
Since the fusion pass does not perform tiling anymore, tests are rewritten to be in pretiled form and make the intent of the test clearer (albeit more verbose).
PiperOrigin-RevId: 271357741
The support for functions taking and returning memrefs of floats was introduced
in the first version of the runner, created before MLIR had reliable lowering
of allocation/deallocation to library calls. It forcibly runs MLIR
transformations converting the affine, loop and standard dialects into the LLVM
dialect, unlike the other runner flows that accept the LLVM dialect directly.
Memref support leads to more complex layering and is generally fragile. Drop
it in favor of functions returning a scalar, or library-based function calls to
print memrefs and other data structures.
PiperOrigin-RevId: 271330839
The reduction operation is currently fixed to "add", and the scope is fixed to "workgroup".
The implementation is currently limited to sizes that are a multiple of 32 (the warp size) and no larger than 1024.
PiperOrigin-RevId: 271290265
Support the OpBitcast instruction of SPIR-V using the spv.Bitcast
operation. The semantics implemented in the dialect differ from the
SPIR-V spec in that the dialect does not allow conversion to/from
pointer types from/to non-pointer types.
PiperOrigin-RevId: 271255957
1) Process and ignore the following debug instructions: OpSource,
OpSourceContinued, OpSourceExtension, OpString, OpModuleProcessed.
2) While processing the OpTypeInt instruction, ignore the signedness
specification. Currently MLIR doesn't make a distinction between signed
and unsigned integer types.
3) Process and ignore the BufferBlock decoration (similar to the Buffer
decoration). StructType needs to be enhanced to track this attribute
since it's needed for proper validation checks.
4) Report better error for unhandled instruction during
deserialization.
PiperOrigin-RevId: 271057060
- introduce splat op in standard dialect (currently for int/float/index input
type, output type can be vector or statically shaped tensor)
- implement LLVM lowering (when result type is 1-d vector)
- add constant folding hook for it
- while on Ops.cpp, fix some stale names
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#141
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/141 from bondhugula:splat 48976a6aa0a75be6d91187db6418de989e03eb51
PiperOrigin-RevId: 270965304
The RFC for unifying Linalg and Affine compilation passes into an end-to-end flow with a predictable ABI and linkage to external function calls raised the question of why we have variable sized descriptors for memrefs depending on whether they have static or dynamic dimensions (https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/MaL8m2nXuio).
This CL standardizes the ABI on the rank of the memrefs.
The LLVM struct for a memref becomes equivalent to:
```
template <typename Elem, size_t Rank>
struct {
  Elem *ptr;
  int64_t sizes[Rank];
};
```
PiperOrigin-RevId: 270947276
According to the SPIR-V spec, spirv::CompositeType includes
spirv::RuntimeArrayType. This allows using objects of
spirv::RuntimeArrayType with spirv::AccessChainOp.
PiperOrigin-RevId: 270809492
Similar to mlir-opt, having a -split-input-file mode is quite useful
in mlir-translate. It allows putting logically related tests in the
same test file for better organization.
PiperOrigin-RevId: 270805467
Add support in the deserializer for the OpMemberName instruction. For now
the name is just processed and not associated with the
spirv::StructType being built. That needs an enhancement to
spirv::StructType itself.
Add tests to check for errors reported during deserialization with
some refactoring to common out some utility functions.
PiperOrigin-RevId: 270794524
The existing logic to parse spirv::StructTypes is very brittle. This
change simplifies the parsing logic a lot. The simplification also
allows member decorations to be separated by commas instead of
spaces (which was an artifact of the existing parsing logic). The
change also needs a modification to mlir::parseType to return the
number of chars parsed; a new parseType method is added to do so.
Also allow specification of spirv::StructType with no members.
PiperOrigin-RevId: 270739672
Using the two call interfaces, CallOpInterface and CallableOpInterface, this change adds support for an initial multi-level CallGraph. This call graph builds a set of nodes for each callable region, and connects them via edges. An edge may be any of the following types:
* Abstract
- An edge not produced by a call operation, used for connecting to internal nodes from external nodes.
* Call
- A call edge is an edge defined via a call-like operation.
* Child
- This is an artificial edge connecting nested callgraph nodes.
This callgraph will be used, and improved upon, to begin supporting more interesting interprocedural analyses and transformations. In a followup, this callgraph will be used to support more complex inlining.
PiperOrigin-RevId: 270724968
These two operation interfaces will be used in a followup to support building a callgraph:
* CallOpInterface
- Operations providing this interface are call-like, and have a "call" target. A call target may be a symbol reference, via SymbolRefAttr, or a SSA value.
* CallableOpInterface
- Operations providing this interfaces define destinations to call-like operations, e.g. FuncOp. These operations may define any number of callable regions.
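A hedged sketch of how a consumer might dispatch on the two kinds of call targets named above (the accessor name is assumed from the interface definition; this era of MLIR used Value pointers):
```
void visitCall(Operation *op) {
  auto call = dyn_cast<CallOpInterface>(op);
  if (!call)
    return;
  CallInterfaceCallable callee = call.getCallableForCallee();
  if (auto symbol = callee.dyn_cast<SymbolRefAttr>()) {
    // Direct call: resolve the symbol to the defining callable operation.
  } else {
    // Indirect call: the target is the SSA value stored in the union.
  }
}
```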
PiperOrigin-RevId: 270723300
This fixes a problem with current save-restore pattern of diagnostics handlers, as there may be a thread race between when the previous handler is destroyed. For example, this occurs when using multiple ParallelDiagnosticHandlers asynchronously:
Handler A
Handler B   |--- LifeTime ---|               Restore A here.
Handler C        |------ LifeTime ------|    Restore B after it has been destroyed.
The new design allows for multiple handlers to be registered in a stack like fashion. Handlers can return success() to signal that they have fully processed a diagnostic, or failure to propagate otherwise.
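A minimal sketch of the stack-like registration (API shape assumed from the description):
```
DiagnosticEngine &engine = context.getDiagEngine();
auto handlerID = engine.registerHandler([](Diagnostic &diag) -> LogicalResult {
  // success() marks the diagnostic as fully processed; failure() lets it
  // propagate to the handler registered below this one.
  return failure();
});
// Handlers can now be erased by id, independent of registration order.
engine.eraseHandler(handlerID);
```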
PiperOrigin-RevId: 270720625
Roll forward of commit 5684a12.
When outlining GPU kernels, put the kernel function inside a nested module. Then use a nested pipeline to generate the cubins, independently per kernel. In a final pass, move the cubins back to the parent module.
PiperOrigin-RevId: 270639748
This adds sign- and zero-extension and truncation of integer types to the
standard dialect. This makes it possible to perform integer type conversions
without having to go to the LLVM dialect and introduce custom type casts
(between standard and LLVM integer types).
Closes tensorflow/mlir#134
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/134 from ombre5733:sext-zext-trunc-in-std c7657bc84c0ca66b304e53ec03797e09152e4d31
PiperOrigin-RevId: 270479722
- fix store to load forwarding for a certain set of cases (where
forwarding shouldn't have happened); use AffineValueMap difference
based MemRefAccess equality checking; utility logic is also greatly
simplified
- add missing equality/inequality operators for AffineExpr ==/!= ints
- add == != operators on MemRefAccess
Closes tensorflow/mlir#136
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/136 from bondhugula:store-load-forwarding d79fd1add8bcfbd9fa71d841a6a9905340dcd792
PiperOrigin-RevId: 270457011
Make GlobalOp's value attribute an OptionalAttr. Change code that uses the value to handle 'nullopt'. Translate an uninitialized value attribute to llvm::UndefValue.
PiperOrigin-RevId: 270423646
computeDepth calls itself recursively, which may insert into minPatternDepth. minPatternDepth is a DenseMap, which invalidates iterators on insertion, so this may lead to asan failures.
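A hypothetical reduction of the bug pattern (names illustrative): DenseMap invalidates iterators on insertion, so an iterator captured before a recursive call must not be dereferenced afterwards.
```
void buggy(llvm::DenseMap<int, int> &minPatternDepth, int key) {
  auto it = minPatternDepth.find(key);      // iterator into the map
  minPatternDepth.try_emplace(key + 1, 0);  // stands in for the recursion
  // (void)it->second; // UB: 'it' may have been invalidated by the insert.
  int depth = minPatternDepth.lookup(key);  // fix: re-query after inserting
  (void)depth;
}
```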
PiperOrigin-RevId: 270374203
The RFC for unifying Linalg and Affine compilation passes into an end-to-end flow discusses the notion of a strided MemRef (https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/MaL8m2nXuio).
This CL adds helper functions to extract strides from the layout map which in turn will allow converting between a strided form of the type and a layout map.
For now strides are only computed on a single affine map with a single result (i.e. the closed subset of linearization maps that are compatible with striding semantics). This restriction will be reevaluated / lifted in the future based on concrete use cases.
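A hedged sketch of the intended use; the helper name is taken from the `MemRefType::getStrides` mention earlier in this log, and its exact signature is assumed:
```
SmallVector<int64_t, 4> strides;
int64_t offset;
if (failed(memrefType.getStrides(strides, offset)))
  return failure(); // the layout map is not a compatible linearization map
```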
PiperOrigin-RevId: 270284686
Allow specification of decorators on SPIR-V StructType members. If the
struct has layout information, these decorations are to be specified
after the offset specification of the member. These decorations are
emitted as OpMemberDecorate instructions on the struct <id>. Update
(de)serialization to handle these decorations.
PiperOrigin-RevId: 270130136
A new converter with per-axis quantization parameters is added to quantize a
dense elements attribute. For each slice along the quantization axis, it
creates a uniform quantized value converter, with a different scale and zero
point, and quantizes the values in the slice.
The current implementation doesn't handle sparse elements attributes.
PiperOrigin-RevId: 270121986
When outlining GPU kernels, put the kernel function inside a nested module. Then use a nested pipeline to generate the cubins, independently per kernel. In a final pass, move the cubins back to the parent module.
PiperOrigin-RevId: 269987720
This modifies DominanceInfo::properlyDominates(Value *value, Operation *op) to return false if the value is defined by a parent operation of 'op'. This prevents using values defined by the parent operation from within any child regions.
PiperOrigin-RevId: 269934920
- allow symbols in index remapping provided for memref replacement
- fix memref normalize crash on cases with layout maps with symbols
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Reported by: Alex Zinenko
Closes tensorflow/mlir#139
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/139 from bondhugula:memref-rep-symbols 2f48c1fdb5d4c58915bbddbd9f07b18541819233
PiperOrigin-RevId: 269851182
Introduce support for applying the stripe operator to sum expressions, as in
(x + A) # B = x + A - (x + A) mod B.
For example, with A = 2, B = 4 and x = 5, this gives 5 + 2 - (7 mod 4) = 4.
This is required to represent a combination of tiling and padding in the SDBM
framework, and is a valid SDBM construct that was not originally supported.
PiperOrigin-RevId: 269758807
Extend SDBM simplification patterns to support more cases where the addition of
two expressions each involving one or two variables would result in a sum
expression that only contains one variable and thus remains in the SDBM domain.
This is made possible by the new canonical structure of SDBM where the constant
term appears once. This simplification will be necessary to support
round-tripping of stripe expressions containing constant terms on the LHS
through affine expressions.
PiperOrigin-RevId: 269757732
This is useful in several cases; for example, a user may want to sugar the syntax of a string (as we do with custom operation syntax), or avoid many nested ifs for parsing a set of known keywords.
PiperOrigin-RevId: 269695451
This CL registers a new mlir-translate hook, -test-spirv-roundtrip,
for testing SPIR-V serialization and deserialization round-trip.
This CL also moves the existing -serialize-spirv and
-deserialize-spirv hooks to one source file.
PiperOrigin-RevId: 269659528
Existing translations are either from MLIR or to MLIR. To support
cases like round-tripping some external format via MLIR, one must
chain two mlir-translate invocations together using pipes. This
makes it problematic to support -split-input-file in mlir-translate,
given that it won't work across pipes.
Motivated by the above, this CL adds another translation category
that allows file to file. This gives users more freedom.
PiperOrigin-RevId: 269636438
This CL changes translation functions to take MemoryBuffer
as input and raw_ostream as output. It is generally better to
avoid handling files directly in a library (unless the library
is specifically for file manipulation) and we can unify all
file handling to the mlir-translate binary itself.
PiperOrigin-RevId: 269625911
- add canonicalization pattern to compose maps into affine loads/stores;
templatize the pattern and reuse it for affine.apply as well
- rename getIndices -> getMapOperands() (getIndices is confusing since
these are no longer the indices themselves but operands to the map
whose results are the indices). This also makes the accessor uniform
across affine.apply/load/store. Change arg names on the affine
load/store builder to avoid confusion. Drop an unused confusing build
method on AffineStoreOp.
- update incomplete doc comment for canonicalizeMapAndOperands (this was
missed from a previous update).
Addresses issue tensorflow/mlir#121
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#122
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/122 from bondhugula:compose-load-store e71de1771e56a85c4282c10cb43f30cef0701c4f
PiperOrigin-RevId: 269619540
A generic mechanism for (de)serialization of extended instruction sets
is added with this CL. To facilitate this, a new class
"SPV_ExtendedInstSetOp" is added which is a base class for all
operations corresponding to extended instruction sets. The methods to
(de)serialize such ops, as well as their dispatch, are generated
automatically.
The behavior controlled by autogenSerialization and hasOpcode is also
slightly modified to enable this. They are now decoupled.
1) Setting hasOpcode=1 means the operation has a corresponding
opcode in SPIR-V binary format, and its dispatch for
(de)serialization is automatically generated.
2) Setting autogenSerialization=1 generates the function for
(de)serialization automatically.
So now it is possible to have hasOpcode=0 and autogenSerialization=1
(for example SPV_ExtendedInstSetOp).
Since the dispatch functions are also auto-generated, the input file
needs to contain all operations. To this effect, SPIRVGLSLOps.td is
included into SPIRVOps.td. This makes the previously added
SPIRVGLSLOps.h and SPIRVGLSLOps.cpp unnecessary; they are deleted.
SPIRVUtilsGen.cpp is also changed to make better use of
formatv, making the code more readable.
PiperOrigin-RevId: 269456263
When performing A->B->C conversion, an operation may still refer to an operand of A. This makes it necessary to unmap through multiple levels of replacement for a specific value.
PiperOrigin-RevId: 269367859
Certain enum classes in SPIR-V, like function/loop control and memory
access, are bitmasks. This CL introduces a BitEnumAttr to properly
model this and drive auto-generation of verification code and utility
functions. We still store the attribute using a 32-bit IntegerAttr
for minimal memory footprint and easy (de)serialization. But utility
conversion functions are adjusted to inspect each bit and generate
"|"-concatenated strings for the bits, and vice versa.
Each such enum class has a "None" case that means no bit is set. We
need special handling for "None". Because of this, the logic is not
general anymore. So right now the definition is placed in the SPIR-V
dialect. If later this turns out to be useful for other dialects,
then we can see how to properly adjust it and move to OpBase.td.
Added tests for SPV_MemoryAccess to check and demonstrate.
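A hand-written sketch of what such a generated utility does for spirv::MemoryAccess (bit values per the SPIR-V spec; the generated code's exact shape may differ):
```
std::string stringifyMemoryAccess(uint32_t value) {
  if (value == 0)
    return "None"; // the special no-bits-set case
  llvm::SmallVector<llvm::StringRef, 3> strs;
  if (value & 0x1)
    strs.push_back("Volatile");
  if (value & 0x2)
    strs.push_back("Aligned");
  if (value & 0x4)
    strs.push_back("Nontemporal");
  return llvm::join(strs, "|");
}
```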
PiperOrigin-RevId: 269350620
Swap the allowed nesting of sum and diff expressions: now a diff expression can
contain a sum expression, but only on the left hand side. A difference of two
sum expressions must be canonicalized by grouping their constant terms in a
single expression. This change of structure became possible thanks to the
introduction of the "direct" super-kind. It is necessary to enable support of
sum expressions on the left hand side of the stripe expression.
SDBM expressions are now grouped into the following structure
- expression
  - varying
    - direct
      - sum <- (term, constant)
      - term
        - symbol
        - dimension
        - stripe <- (term, constant)
    - negation <- (direct)
    - difference <- (direct, term)
  - constant
The notation <- (...) denotes the types of subexpressions a compound
expression can combine.
PiperOrigin-RevId: 269337222
The helper functions makePositionAttr() and positionAttr() were originally
introduced in the lowering-to-LLVM-dialect pass to construct integer array
attributes that are used for static positions in extract/insertelement.
Constructing an integer array attribute being fairly common, a utility function
Builder::getI64ArrayAttr was later introduced into the Builder API. Drop
makePositionAttr and similar homegrown functions and use that API instead.
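For illustration (the old helper's signature is approximated), the change amounts to:
```
// Before: ArrayAttr position = makePositionAttr(builder, {0, 1});
// After: use the Builder API directly.
ArrayAttr position = builder.getI64ArrayAttr({0, 1});
```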
PiperOrigin-RevId: 269295836
Add support for specifying extended instruction sets. The operations
in the SPIR-V dialect are named 'spv.<extension-name>.<op-name>'. Use
this mechanism to define an 'Exp' operation from the GLSL(450)
instructions.
Later CLs will add support for (de)serialization of these operations,
and update the dialect generation scripts to auto-generate the
specification using the spec directly.
Additional changes:
Add a type constraint to OpBase.td to check for vectors of specified
lengths. This is used to check that the vector types used in the SPIR-V
dialect have length 2, 3 or 4.
Update SPIRVBase.td to use this type constraint for vectors.
PiperOrigin-RevId: 269234377
An OperationPass is defined exactly the same way as before:
  class DerivedPass : public OperationPass<DerivedPass>;
An OpPass is now defined as an OperationPass with an additional template parameter for the operation type:
  class DerivedPass : public OperationPass<DerivedPass, FuncOp>;
PiperOrigin-RevId: 269122410
- turn copy/dma generation method into a utility in LoopUtils, allowing
it to be reused elsewhere.
- no functional/logic change to the pass/utility
- trim down header includes in files affected
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#124
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/124 from bondhugula:datacopy 9f346e62e5bd9dd1986720a30a35f302eb4d3252
PiperOrigin-RevId: 269106088
- take care of symbolic operands with alloc
- add missing check for compose map failure and a test case
- add test cases on strides
- drop incorrect check for one-to-one'ness
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#132
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/132 from bondhugula:normalize-memrefs 8aebf285fb0d7c19269d85255aed644657e327b7
PiperOrigin-RevId: 269105947
- NFC - on any pass/utility logic/output.
- Resolve TODO; the method building loop trip count maps was
creating and deleting affine.apply ops (transforming the IR from under
analysis, strictly speaking!). Introduce AffineValueMap::difference to
do this correctly (without the need to create any IR).
- Move AffineApplyNormalizer out so that its methods are reusable from
AffineStructures.cpp; add a helper method 'normalize' to it. Fix
AffineApplyNormalizer::renumberOneDim (Issue tensorflow/mlir#89).
- Trim includes on files touched.
- add test case on a scenario previously not covered
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#133
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/133 from bondhugula:trip-count-build 7fc34d857f7788f98b641792cafad6f5bd50e47b
PiperOrigin-RevId: 269101118
- add missing canonicalization pattern to fold memref_cast + dim to
dim (needed to propagate constant when folding a dynamic shape to
a static one)
- also fix an outdated/inconsistent comment in StandardOps/Ops.td
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#126
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/126 from bondhugula:quickfix 4566e75e49685c532faffff91d64c5d83d4da524
PiperOrigin-RevId: 269020058
This allows for users other than those on the command line to apply a textual description of a pipeline to a given pass manager.
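A hedged sketch of programmatic use; the entry point's exact shape is assumed, and the pipeline string follows the textual form described in the 'pass-pipeline' entry below:
```
PassManager pm(&context);
if (failed(parsePassPipeline("func(cse, canonicalize), lower-to-llvm", pm)))
  return failure(); // unknown pass name or malformed nesting
if (failed(pm.run(module)))
  return failure();
```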
PiperOrigin-RevId: 269017028
SPIR-V recently published v1.5, which brings a bunch of symbols
into core. So the suffix "KHR"/"EXT"/etc. is removed from the
symbols. We use a script to pull information from the spec
directly.
Also changed conversion and tests to use GLSL450 instead of
VulkanKHR memory model. GLSL450 is still the main memory model
supported by Vulkan shaders and it does not require extra
capability to enable.
PiperOrigin-RevId: 268992661
This allows for explicitly specifying the pipeline to add to the pass manager. This includes the nesting structure, as well as the passes/pipelines to run. A textual pipeline string is defined as a series of names, each of which may in itself recursively contain a nested pipeline description. A name is either the name of a registered pass, or pass pipeline, (e.g. "cse") or the name of an operation type (e.g. "func").
For example, the following pipeline:
$ mlir-opt foo.mlir -cse -canonicalize -lower-to-llvm
Could now be specified as:
$ mlir-opt foo.mlir -pass-pipeline='func(cse, canonicalize), lower-to-llvm'
This will allow for running pipelines on nested operations, like, say, SPIR-V modules. This does not remove any of the current functionality, and in fact can be used in unison. The new option is available via 'pass-pipeline'.
PiperOrigin-RevId: 268954279
This CL adds support for serializing and deserializing spv.loop ops.
This adds support for spv.Branch and spv.BranchConditional op
(de)serialization, too, because they are needed for spv.loop.
PiperOrigin-RevId: 268536962
This better reflects how this kind of expression is used and avoids the
potential confusion since the expression can take negative values. Term
expressions comprise dimensions, symbols and stripe expressions. In an SDBM
domain, a stripe expression always corresponds to a variable, input or
temporary. This expression can appear anywhere an input variable can,
including on the LHS of other stripe expressions.
PiperOrigin-RevId: 268486066
If the composite is a constant, we can fold it away. This only
supports vector and array constants for now, given that struct
constants are not supported in spv.constant yet.
PiperOrigin-RevId: 268350340
Since we apply nudging to the zero point to make sure the nudged zero points
can be in the range of [qmin, qmax], the constraint that rmin / rmax should
straddle zero isn't necessary.
This also matches the documentation of tensorflow's FakeQuantWithMinMaxArgs op,
where min and max don't need to straddle zero:
https://www.tensorflow.org/api_docs/python/tf/quantization/fake_quant_with_min_max_args
PiperOrigin-RevId: 268296285
* Add GraphTraits that treat a block as a graph, with Operation* as nodes and use-relationships as edges;
- Just basic graph output;
* Add a use iterator to iterate over all uses of an Operation;
* Add a testing pass to generate the op graph;
This does not yet support operations other than functions, nor nested regions.
PiperOrigin-RevId: 268121782
For per channel fake quant attributes, the returned type should be
UniformQuantizedPerAxisType. Currently, this method isn't under test because we
haven't added the quant_ConstFakeQuantPerAxis op and the convert method.
PiperOrigin-RevId: 268084017
Some compilers will try to auto-generate the destructor, instead of using the user provided destructor, when creating a default move constructor.
PiperOrigin-RevId: 268067367
This allows for parallelizing across pipelines of multiple operation types. AdaptorPasses can now hold pass managers for multiple operation types and will dispatch based upon the operation being operated on.
PiperOrigin-RevId: 268017344
This method parses an operation in its generic form, from the current parser
state. This is the symmetric counterpart of OpAsmPrinter::printGenericOp(). An immediate
use case is illustrated in the test dialect, where an operation wraps another
one in its region and makes use of a single-line pretty-print form.
PiperOrigin-RevId: 267930869
This is done via a new set of instrumentation hooks runBeforePipeline/runAfterPipeline, that signal the lifetime of a pass pipeline on a specific operation type. These hooks also provide the parent thread of the pipeline, allowing for accurate merging of timers running on different threads.
PiperOrigin-RevId: 267909193
This will allow clients to implement a different collection strategy on these
values, including collecting each uses within the region for example.
PiperOrigin-RevId: 267803978
- the JIT codegen was being run at the default -O0 level; instead,
propagate the opt level from the cmd line.
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#123
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/123 from bondhugula:jit-runner 3b055e47f94c9a48bf487f6400787478738cda02
PiperOrigin-RevId: 267778586
The current restrictions on dim/symbols require a top-level symbol for the conservative case of a non-affine region. This should be relaxed in the future.
PiperOrigin-RevId: 267641838
View descriptors are converted to *pointer to* LLVM struct to avoid ABI issues related to C struct packing. This creates unnecessary complexity and hampers unification with memrefs.
Instead, this CL makes view descriptors convert to LLVM struct (as it was originally) and promotes all structs to pointers right before calling an external function.
PiperOrigin-RevId: 267602693
- turn canonicalizeMapAndOperands into a template that works on both
sets and maps, and use it to introduce a utility to canonicalize an
affine integer set and its operands
- add a pattern to canonicalize affine if ops.
- rename IntegerSet::getNumOperands -> IntegerSet::getNumInputs to be
consistent with AffineMap
- add missing accessors for IntegerSet
Doesn't need extensive testing since canonicalizeSetAndOperands just
reuses canonicalizeMapAndOperands' logic, and the latter is tested on
affine.apply map + operands; the new method works the same way on an
integer set + operands of an affine if op for example.
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#112
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/112 from bondhugula:set-canonicalize eff72f23250b96fa7d9f5caff3877440f5de2cec
PiperOrigin-RevId: 267532876
This commit defines an initial implementation of the DialectInlinerInterface for the AffineOps dialect. This change allows for affine operations to be inlined into any region that is not an affine region. Inlining into affine regions requires special handling for dimension/symbol identifiers that will be added in followups.
PiperOrigin-RevId: 267467078
SPIR-V can explicitly declare structured control-flow constructs using merge
instructions. These explicitly declare a header block before the control
flow diverges and a merge block where control flow subsequently converges.
These blocks delimit constructs that must nest, and can only be entered
and exited in structured ways.
Instead of having a `spv.LoopMerge` op to directly model loop merge
instruction for indicating the merge and continue target, we use regions
to delimit the boundary of the loop: the merge target is the next op
following the `spv.loop` op and the continue target is the block that
has a back-edge pointing to the entry block inside the `spv.loop`'s region.
This way it's easier to discover all blocks belonging to a construct and
it plays nicer with the MLIR system.
Updated the SPIR-V.md doc.
PiperOrigin-RevId: 267431010
This defines a set of initial utilities for inlining a region (or a FuncOp), and defines a simple inliner pass for testing purposes.
A new dialect interface is defined, DialectInlinerInterface, that allows dialects to override hooks controlling inlining legality. The interface currently provides the following hooks, but these are just preliminary and should be changed/added to/modified as necessary:
* isLegalToInline
- Determine if a region can be inlined into one of this dialect, *or* if an operation of this dialect can be inlined into a given region.
* shouldAnalyzeRecursively
- Determine if an operation with regions should be analyzed recursively for legality. This allows for child operations to be closed off from the legality checks for operations like lambdas.
* handleTerminator
- Process a terminator that has been inlined.
This CL adds support for inlining StandardOps, but other dialects will be added in followups as necessary. A hedged sketch of a dialect opting in is shown below.
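The hook shape follows the list above; remapping used BlockAndValueMapping at the time:
```
struct MyDialectInlinerInterface : public DialectInlinerInterface {
  using DialectInlinerInterface::DialectInlinerInterface;

  /// Claim that every operation of this dialect may be inlined anywhere.
  bool isLegalToInline(Operation *op, Region *dest,
                       BlockAndValueMapping &valueMapping) const final {
    return true;
  }
};
```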
PiperOrigin-RevId: 267426759
The refactoring of ExecutionEngine dropped the usage of the irTransform function used to pass -O3 and other options to LLVM. As a consequence, the proper optimizations do not kick in in LLVM-land.
This CL makes use of the transform function and allows producing avx512 instructions, on an internal example, when using:
`mlir-cpu-runner -dump-object-file=1 -object-filename=foo.o` combined with `objdump -D foo.o`.
Assembly produced resembles:
```
2b2e: 62 72 7d 48 18 04 0e vbroadcastss (%rsi,%rcx,1),%zmm8
2b35: 62 71 7c 48 28 ce vmovaps %zmm6,%zmm9
2b3b: 62 72 3d 48 a8 c9 vfmadd213ps %zmm1,%zmm8,%zmm9
2b41: 62 f1 7c 48 28 cf vmovaps %zmm7,%zmm1
2b47: 62 f2 3d 48 a8 c8 vfmadd213ps %zmm0,%zmm8,%zmm1
2b4d: 62 f2 7d 48 18 44 0e vbroadcastss 0x4(%rsi,%rcx,1),%zmm0
2b54: 01
2b55: 62 71 7c 48 28 c6 vmovaps %zmm6,%zmm8
2b5b: 62 72 7d 48 a8 c3 vfmadd213ps %zmm3,%zmm0,%zmm8
2b61: 62 f1 7c 48 28 df vmovaps %zmm7,%zmm3
2b67: 62 f2 7d 48 a8 da vfmadd213ps %zmm2,%zmm0,%zmm3
2b6d: 62 f2 7d 48 18 44 0e vbroadcastss 0x8(%rsi,%rcx,1),%zmm0
2b74: 02
2b75: 62 f2 7d 48 a8 f5 vfmadd213ps %zmm5,%zmm0,%zmm6
2b7b: 62 f2 7d 48 a8 fc vfmadd213ps %zmm4,%zmm0,%zmm7
```
etc.
Fixes tensorflow/mlir#120
PiperOrigin-RevId: 267281097
- address remaining comments from PR tensorflow/mlir#87 for better test coverage for
pipeline-data-transfer/replaceAllMemRefUsesWith
- remove dead tag allocs the same way they are removed for the replaced buffers
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#106
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/106 from bondhugula:followup 9e868666d047e8d43e5f82f43e4093b838c710fa
PiperOrigin-RevId: 267144774
This CL adds support for proper cloning of Linalg ops that have regions (i.e. the generic linalg op). This is used to properly implement tiling and fusion for such ops. Adequate tests are added.
PiperOrigin-RevId: 267027176
- introduce utility to convert memrefs with non-identity layout maps to
ones with identity layout maps: convert the type and rewrite/remap all
its uses
- add this utility to -simplify-affine-structures pass for testing
purposes
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#104
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/104 from bondhugula:memref-normalize f2c914aa1890e8860326c9e33f9aa160b3d65e6d
PiperOrigin-RevId: 266985317
This will allow us to use MLIR's folding infrastructure to deduplicate
SPIR-V constants.
This CL also changed isValidSPIRVType in SPIRVDialect to a static method.
PiperOrigin-RevId: 266984403
- the [begin, end) range identified for copying could end in the middle of a
block, which makes hoisting invalid in some cases. Change the range
identification to always end at the end of a block.
- add test case to exercise these (with fast mem capacity set to minimal so
that single element memref buffers are generated at the innermost loop)
- the location of the begin/end of the block range for data copying was
being confused with the insert points for the copy-in and copy-out code.
In cases where we choose to hoist transfers, these are separate.
- when copy loops are single iteration ones, promote their bodies at
the end of the pass.
- change default fast mem space to 1 (setting it to zero made it
generate DMA op's that won't verify in the default case - since the
DMA ops have a check for src/dest memref spaces being different).
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Co-Authored-By: Mehdi Amini <joker.eph@gmail.com>
Closes tensorflow/mlir#88
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/88 from bondhugula:datacopy 88697267c45e850c3ced87671e16e4a930c02a42
PiperOrigin-RevId: 266980911
Remove unused variables and attributes from BaseViewConversionHelper
in mlir/lib/Dialect/Linalg/Transforms/LowerToLLVMDialect.cpp.
Closes tensorflow/mlir#116
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/116 from alexst07:fix-warnings 5f638e4677492cf71a9cc040eeb6b57427d32e06
PiperOrigin-RevId: 266972082
Some of the operations in the LLVM dialect are required to model the LLVM IR in
MLIR, for example "constant" operations are needed to declare a constant value
since MLIR, unlike LLVM, does not support immediate values as operands. To
avoid confusion with actual LLVM operations, we prefix such auxiliary
operations with "mlir.".
PiperOrigin-RevId: 266942838
This change generalizes the structure of the pass manager to allow arbitrary nesting pass managers for other operations, at any level. The only user visible change to existing code is the fact that a PassManager must now provide an MLIRContext on construction. A new class `OpPassManager` has been added that represents a pass manager on a specific operation type. `PassManager` will remain the top-level entry point into the pipeline, with OpPassManagers being nested underneath. OpPassManagers will still be implicitly nested if the operation type on the pass differs from the pass manager. To explicitly build a pipeline, the 'nest' methods on OpPassManager may be used:
```
// Pass manager for the top-level module.
PassManager pm(ctx);
// Nest a pipeline operating on FuncOp.
OpPassManager &fpm = pm.nest<FuncOp>();
fpm.addPass(...);
// Nest a pipeline under the FuncOp pipeline that operates on spirv::ModuleOp.
OpPassManager &spvModulePM = pm.nest<spirv::ModuleOp>();
// Nest a pipeline on FuncOps inside of the spirv::ModuleOp.
OpPassManager &spvFuncPM = spvModulePM.nest<FuncOp>();
```
To help accomplish this a new general OperationPass is added that operates on opaque Operations. This pass can be inserted in a pass manager of any type to operate on any operation opaquely. An example of this opaque OperationPass is a VerifierPass, that simply runs the verifier opaquely on the current operation.
```
/// Pass to verify an operation and signal failure if necessary.
class VerifierPass : public OperationPass<VerifierPass> {
  void runOnOperation() override {
    Operation *op = getOperation();
    if (failed(verify(op)))
      signalPassFailure();
    markAllAnalysesPreserved();
  }
};
```
PiperOrigin-RevId: 266840344
This interface will allow for providing hooks to interop with operation folding. The first hook, 'shouldMaterializeInto', will allow for controlling which region materialized constants are inserted into. The folder will generally materialize constants into the top-level isolated region; this hook allows for materializing into a lower level ancestor region if it is more profitable/correct.
PiperOrigin-RevId: 266702972
- the list of passes run by mlir-cpu-runner included -lower-affine and
-lower-to-llvm but was missing -lower-to-cfg (because -lower-affine at
some point used to lower straight to CFG); add -lower-to-cfg in
between. IR with affine ops can now be run by mlir-cpu-runner.
- update -lower-to-cfg to be consistent with other passes (create*Pass methods
were changed to return unique ptrs, but -lower-to-cfg appears to have been
missed).
- mlir-cpu-runner was unable to parse the custom form of affine ops; fix
link options
- drop unnecessary run options from test/mlir-cpu-runner/simple.mlir
(none of the test cases had loops)
- -convert-to-llvmir was changed to -lower-to-llvm at some point, but the
create pass method name wasn't updated (this pass converts/lowers to LLVM
dialect as opposed to LLVM IR). Fix this.
(If we prefer "convert", the cmd-line options could be changed to
"-convert-to-llvm/cfg" then.)
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#115
PiperOrigin-RevId: 266666909
This pass class generalizes the current functionality between FunctionPass and ModulePass, and allows for operating on any operation type. The pass manager currently only supports OpPasses operating on FuncOp and ModuleOp, but this restriction will be relaxed in follow-up changes. A utility class OpPassBase<OpT> allows for generically referring to operation specific passes: e.g. FunctionPassBase == OpPassBase<FuncOp>.
PiperOrigin-RevId: 266442239
This commit introduces the bits to be able to dump JIT-compiled
objects to external files by passing an object cache to OrcJit.
The new functionality is tested in mlir-cpu-runner under the flag
`dump-object-file`.
Closes tensorflow/mlir#95
PiperOrigin-RevId: 266439265
Similar to enums, added a generator for structured data. This provides a dictionary that stores a fixed set of values and guarantees the values are valid. It is intended to store a fixed number of values by a given name.
PiperOrigin-RevId: 266437460
This is done by providing a walk callback that returns a WalkResult. This result is either `advance` or `interrupt`. `advance` means that the walk should continue, whereas `interrupt` signals that the walk should stop immediately. An example is shown below:
```
auto result = op->walk([](Operation *op) {
  if (some_invariant)
    return WalkResult::interrupt();
  return WalkResult::advance();
});
if (result.wasInterrupted())
  ...;
```
PiperOrigin-RevId: 266436700
This CL just covers the op definition, its parsing, printing,
and verification. (De)serialization is to be implemented
in a subsequent CL.
PiperOrigin-RevId: 266431077
This change refactors and cleans up the implementation of the operation walk methods. After this refactoring, the explicit template parameter for the operation type is no longer needed for explicit op walks. For example:
op->walk<AffineForOp>([](AffineForOp op) { ... });
is now accomplished via:
op->walk([](AffineForOp op) { ... });
PiperOrigin-RevId: 266209552
- extend canonicalizeMapAndOperands to propagate constant operands into
the map's expressions (and thus drop those operands).
- canonicalizeMapAndOperands previously only dropped duplicate and
unused operands; however, operands that were constants were
retained.
This change makes IR maps/expressions generated by various
utilities/passes even simpler; it also makes some of the test checks more
accurate and simpler -- e.g., '0' instead of symbol(%{{.*}}).
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#107
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/107 from bondhugula:canonicalize-maps c889a51486d14fbf7db489f224f881e7e1ff7d72
PiperOrigin-RevId: 266085289
The pass manager is moving towards being able to run on operations at arbitrary nesting. An operation may have both parent and child operations, and the AnalysisManager must be able to handle this generalization. The AnalysisManager class now contains generic 'getCachedParentAnalysis' and 'getChildAnalysis/getCachedChildAnalysis' functions to query analyses on parent/child operations. This removes the hard coded nesting relationship between Module/Function.
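A hedged sketch of the generalized queries from inside a pass, with a hypothetical analysis type `SomeAnalysis` and a nested operation `nestedOp`:
```
AnalysisManager am = getAnalysisManager();
// Query an analysis cached on an ancestor operation, if present.
auto cachedParent =
    am.getCachedParentAnalysis<SomeAnalysis>(getOperation()->getParentOp());
// Compute (or fetch) an analysis on a nested operation.
SomeAnalysis &child = am.getChildAnalysis<SomeAnalysis>(nestedOp);
```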
PiperOrigin-RevId: 266003636
Tweak to the pretty type parser to recognize that `->` is a special token that
shouldn't be split into two characters. This change allows dialect
types to wrap function types as in `!my.ptr_type<(i32) -> i32>`.
Closes tensorflow/mlir#105
COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/105 from schweitzpgi:parse-arrow 8b2d768053f419daae5a1a864121a44c4319acbe
PiperOrigin-RevId: 265986240
Instead of lowering the program in two steps (Standard->LLVM followed
by GPU->NVVM), leading to invalid IR in between, the runner now uses
one pattern-based rewrite step to go directly from Standard+GPU to
LLVM+NVVM.
PiperOrigin-RevId: 265861934
Refactor replaceAllMemRefUsesWith to split it into two methods: the new
method does the replacement on a single op, and is used by the existing
one.
- make the methods return LogicalResult instead of bool
- Earlier, when replacement failed (due to non-dereferencing uses of the
memref), the set of ops that had already been processed would have
been replaced leaving the IR in an inconsistent state. Now, a
pass is made over all ops to first check for non-dereferencing
uses, and then replacement is performed. No test cases were affected
because all clients of this method were first checking for
non-dereferencing uses before calling this method (for other reasons).
This isn't true for a use case in another upcoming PR (scalar
replacement); clients can now bail out with consistent IR on failure
of replaceAllMemRefUsesWith. Add test case.
- multiple dereferencing uses of the same memref in a single op are
possible (we have no such use cases/scenarios), and this has always
remained unsupported. Add an assertion for this.
- minor fix to another test pipeline-data-transfer case.
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#87
PiperOrigin-RevId: 265808183
Each basic block in SPIR-V must start with an OpLabel instruction.
We don't support control flow yet, so this CL just makes sure that
the entry block follows this rule and is valid.
PiperOrigin-RevId: 265718841
To support the conversion of a simple load-compute-store kernel from the
GPU dialect to the SPIR-V dialect, we need to convert operations like
"gpu.block_dim" and "gpu.thread_id", which let threads query the launch
configuration. In SPIR-V these are specified as global variables with
builtin attributes. This CL adds support for specifying builtin
variables in the SPIR-V conversion framework, and uses it to convert the
relevant operations from the GPU dialect to the SPIR-V dialect. Also add
support for converting load/store operations in the Standard dialect to
the SPIR-V dialect.
To simplify the conversion, add a method to build a spv.AccessChain
operation that automatically determines the return type based on the
base pointer type and the indices provided.
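A sketch of the builder use (the wrapper function is hypothetical):
```
// Build a spv.AccessChain whose result type is inferred from the base
// pointer type and the indices, rather than spelled out by the caller.
Value *buildAccessChain(OpBuilder &b, Location loc, Value *basePtr,
                        ArrayRef<Value *> indices) {
  return b.create<spirv::AccessChainOp>(loc, basePtr, indices);
}
```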
PiperOrigin-RevId: 265718525
Add an extra RewritePattern that, without converting types, rewrites a CopyOp that has non-identity permutations into a sequence of TransposeOp followed by a CopyOp without such permutations.
This RewritePattern is made to fail in the non-permutation case so that the conversion pattern can kick in to lower to LLVM.
This is an instance of A->A->B lowering where A->A is done by a RewritePattern in case_1 and A->B is done by a ConversionPatternRewriter when not(case_1).
PiperOrigin-RevId: 265171380
Add a conversion pattern that transforms a linalg.transpose op into:
1. A function entry `alloca` operation to allocate a ViewDescriptor.
2. A load of the ViewDescriptor from the pointer allocated in 1.
3. Updates to the ViewDescriptor to introduce the data ptr, offset, size
and stride. Size and stride are permutations of the original values.
4. A store of the resulting ViewDescriptor to the alloca'ed pointer.
The linalg.transpose op is replaced by the alloca'ed pointer.
PiperOrigin-RevId: 265169112
A linalg.transpose op is a pure metadata operation: it takes a view and a
permutation map and produces another view of the same underlying data with a
different reindexing, without touching the underlying data.
Example:
```
%t = linalg.transpose %v (i, j) -> (j, i) : !linalg.view<?x?xf32>
```
PiperOrigin-RevId: 265139429
This CL extends support for lowering Linalg to external C++ libraries with CopyOp. Currently this only works when the permutation maps in the copies are the identity; support for non-identity permutations will be added later.
PiperOrigin-RevId: 265093025
This will allow iterating the values of a non-opaque ElementsAttr, with all of the types currently supported by DenseElementsAttr. This should help reduce the amount of specialization on DenseElementsAttr.
PiperOrigin-RevId: 264968151
linalg.subview used to lower to a slice with a bounded range, resulting in correct bounded accesses. However, linalg.slice could still index out of bounds. This CL moves the bounding to linalg.slice.
LLVM select and cmp ops gain more idiomatic builders.
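Roughly, the more idiomatic builders read like this (a hedged sketch; exact overloads may differ):
```
// Compute min(x, y) without spelling out result types.
Value *lt = b.create<LLVM::ICmpOp>(loc, LLVM::ICmpPredicate::slt, x, y);
Value *minVal = b.create<LLVM::SelectOp>(loc, lt, x, y);
```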
PiperOrigin-RevId: 264897125
Split out method into specialized instances + add an early exit. Should be NFC, but simplifies reading the logic slightly IMHO.
PiperOrigin-RevId: 264855529
Operation interfaces generally require a bit of boilerplate code to connect all of the pieces together. This CL introduces mechanisms in the ODS to allow for generating operation interfaces via the 'OpInterface' class.
Providing a definition of the `OpInterface` class will auto-generate the C++
classes for the interface. An `OpInterface` includes a name for the C++ class,
along with a list of interface methods. There are two types of methods that can be used with an interface, `InterfaceMethod` and `StaticInterfaceMethod`. They are both comprised of the same core components, with the distinction that `StaticInterfaceMethod` models a static method on the derived operation.
An `InterfaceMethod` is comprised of the following components:
* ReturnType
  - A string corresponding to the C++ return type of the method.
* MethodName
  - A string corresponding to the desired name of the method.
* Arguments
  - A dag of strings that correspond to a C++ type and variable name
    respectively.
* MethodBody (Optional)
  - An optional explicit implementation of the interface method.
For example:
```
def MyInterface : OpInterface<"MyInterface"> {
  let methods = [
    // A simple non-static method with no inputs.
    InterfaceMethod<"unsigned", "foo">,
    // A new non-static method accepting an input argument.
    InterfaceMethod<"Value *", "bar", (ins "unsigned":$i)>,
    // Query a static property of the derived operation.
    StaticInterfaceMethod<"unsigned", "fooStatic">,
    // Provide the definition of a static interface method.
    // Note: `ConcreteOp` corresponds to the derived operation typename.
    StaticInterfaceMethod<"Operation *", "create",
      (ins "OpBuilder &":$builder, "Location":$loc), [{
        return builder.create<ConcreteOp>(loc);
    }]>,
    // Provide a definition of the non-static method.
    // Note: `op` corresponds to the derived operation variable.
    InterfaceMethod<"unsigned", "getNumInputsAndOutputs", (ins), [{
      return op.getNumInputs() + op.getNumOutputs();
    }]>,
  ];
}
```
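A sketch of how the generated interface might then be used from C++ (assuming the `MyInterface` definition above; the cast idiom is the usual one for op interfaces):
```
// The ODS definition auto-generates a C++ `MyInterface` class that any
// op implementing the interface can be cast to.
if (auto iface = dyn_cast<MyInterface>(op)) {
  // Call a (non-static) interface method on the concrete op.
  unsigned numInputsAndOutputs = iface.getNumInputsAndOutputs();
  (void)numInputsAndOutputs;
}
```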
PiperOrigin-RevId: 264754898
This CL makes use of the standard LLVM LLJIT and removes the need for a custom JIT implementation within MLIR.
To achieve this, one needs to clone (i.e. serialize and deserialize) the produced llvm::Module into a new LLVMContext. This is currently necessary because the llvm::LLVMContext is owned by the LLVMDialect, somewhat deep in the call hierarchy.
In the future we should remove the need for this round-trip of the llvm::Module by allowing the injection of an LLVMContext from the top level. Unfortunately this will require deeper API changes and impact multiple places; it is therefore left for future work.
PiperOrigin-RevId: 264737459
Previously Module and Function were built-in constructs in MLIR.
Due to the structural requirements we had to wrap the SPIR-V
module inside a Function inside a Module. Now the requirement
is lifted and we can remove the wrapping function! :)
PiperOrigin-RevId: 264736051
This will allow iterating the values of a non-opaque ElementsAttr, with all of the types currently supported by DenseElementsAttr. This should help reduce the amount of specialization on DenseElementsAttr.
PiperOrigin-RevId: 264637293
This will allow for adding more hooks for controlling parser behavior without bloating Dialect in the common case. This CL also adds iteration support to the DialectInterfaceCollection.
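A sketch of the added iteration support (the interface type is hypothetical):
```
// Collect all registered instances of a dialect interface and walk them.
DialectInterfaceCollection<MyDialectInterface> interfaces(context);
for (const MyDialectInterface &iface : interfaces)
  (void)iface; // e.g. consult a per-dialect parsing hook
```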
PiperOrigin-RevId: 264627846
This CL extends declarative rewrite rules to support matching and
generating ops with variadic operands/results. For this, the
generated `matchAndRewrite()` method for each pattern is changed
to:
* Use "range" types for the local variables used to store captured
values (`operand_range` for operands, `ArrayRef<Value *>` for
values, *Op for results). This allows us to have a unified way
of handling both single values and value ranges.
* Create local variables for each operand for op creation. If the
operand is variadic, then a `SmallVector<Value*>` will be created
to collect all values for that operand; otherwise a `Value*` will
be created.
* Use a collective result type builder. All result types are
specified via a single parameter to the builder.
We can use one result pattern to replace multiple results of the
matched root op; when that happens, types must be specified for
multiple results. Add a new collective-type builder for this case.
PiperOrigin-RevId: 264588559
In SPIR-V binary format, constants are placed at the module level
and referenced by instructions inside functions using their result
<id>s. To model this natively (using SSA values for result <id>s),
it means we need to have implicit capturing functions. We will
lose the ability to have function passes if going down that path.
Instead, this CL changes to materialize constants at their use
sites in deserialization. It's cheap to copy constants in MLIR
given that attributes are uniqued in the MLIRContext. By localizing
constants into functions, we can preserve isolated functions.
PiperOrigin-RevId: 264582532
Most dialects are initialized statically, which does not have a guaranteed initialization order. By keeping the dialect list sorted, we can guarantee a deterministic iteration order of dialects.
PiperOrigin-RevId: 264522875
Similar to global variables, specialization constants also live
in the module scope and can be referenced by instructions in
functions in native SPIR-V. A direct modelling would be to allow
functions in the SPIR-V dialect to implicitly capture, but that
would cost us the ability to write passes over Functions. While
in SPIR-V we normally want to process the module as a whole, we'd
like to leave the door open for function-level processing.
Therefore, similar to global variables, we introduce
spv.specConstant to model three SPIR-V instructions:
OpSpecConstantTrue, OpSpecConstantFalse, and OpSpecConstant. They
do not return SSA value results; instead they define symbols and
can only be referenced via those symbols. To use one in a
function, we need another op, spv._reference_of, to turn the
symbol into an SSA value. This breaks the tie and keeps functions
free of implicit captures.
Previously specialization constants were handled like normal
constants. That is incorrect: a specialization constant actually
acts more like a variable (without the need to load and store);
e.g., they cannot be de-duplicated like normal constants.
This CL also refines various documents and comments.
PiperOrigin-RevId: 264455172
tensorflow/mlir#58 fixed and exercised
verification of load/store ops using empty affine maps. Unfortunately,
it didn't exercise the creation of them. This PR addresses that aspect.
It removes the assumption that an AffineMap has at least one result and
stores a pointer to the MLIRContext as a member of AffineMap.
* Add empty map support to affine.store + test
* Move MLIRContext to AffineMapStorage
Closes tensorflow/mlir#74
PiperOrigin-RevId: 264416260
This conversion has been using a stack-allocated array of i8 to store the
null-terminated kernel name in order to pass it to the CUDA wrappers expecting
a C string because the LLVM dialect was missing support for globals. Now that
the support is introduced, use a global instead.
Refactor global string construction from GenerateCubinAccessors into a common
utility function living in the LLVM namespace.
PiperOrigin-RevId: 264382489
JitRunner can use as entry points functions that produce either a single
'!llvm.f32' value or a list of memrefs. Memref support is legacy: it was
introduced before MLIR could lower memref allocation and deallocation to
malloc/free calls, so the memory is allocated externally. It is likely to be
dropped in the future since it unconditionally runs affine+standard-to-llvm
lowering on the module instead of accepting the LLVM dialect. The CUDA runner
relies on the memref-based flow without actually returning anything.
Introduce a runner flow to use functions that return void as entry points.
PiperOrigin-RevId: 264381686
This CL allows binary operations on n-D vector types to be lowered to LLVMIR by performing an (n-1)-D extractvalue, 1-D vector operation and an (n-1)-D insertvalue.
PiperOrigin-RevId: 264339118
- fix a missing check while simplifying an expression with floordiv to a
  mod
- fixes issue tensorflow/mlir#82
Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>
Closes tensorflow/mlir#84
PiperOrigin-RevId: 264338353
This will allow for naming values the same as existing SSA values for regions attached to operations that are isolated from above. This fits in with how the system already allows separate name scopes for sibling regions. This name shadowing can be enabled in the custom parser of operations by setting the 'enableNameShadowing' flag to true when calling 'parseRegion'.
```
%arg = constant 10 : i32
foo.op {
  %arg = constant 10 : i32
}
```
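A sketch of enabling this from a custom op parser (the op and parse function are hypothetical; the flag is the one described above):
```
// Custom parser for a hypothetical `foo.op` that wants region-local
// SSA names to be allowed to shadow names from the enclosing scope.
static ParseResult parseFooOp(OpAsmParser &parser, OperationState &result) {
  Region *body = result.addRegion();
  return parser.parseRegion(*body, /*arguments=*/{}, /*argTypes=*/{},
                            /*enableNameShadowing=*/true);
}
```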
PiperOrigin-RevId: 264255999
This CL adds an integer alignment attribute to linalg.buffer_alloc and its lowering to LLVM.
The alignment is constrained to be a positive power of 2.
Lowering to LLVM produces the pattern:
```
%[[alloc:.*]] = llvm.call @malloc(%[[s]]) : (!llvm.i64) -> !llvm<"i8*">
%[[cast:.*]] = llvm.ptrtoint %[[alloc]] : !llvm<"i8*"> to !llvm.i64
%[[rem:.*]] = llvm.urem %[[cast]], %[[c16]] : !llvm.i64
%[[drem:.*]] = llvm.sub %[[c16]], %[[rem]] : !llvm.i64
%[[off:.*]] = llvm.urem %[[drem]], %[[c16]] : !llvm.i64
llvm.getelementptr %{{.*}}[%[[off]]] : (!llvm<"i8*">, !llvm.i64) -> !llvm<"i8*">
```
where `ptr` is aligned on `align` by computing the address
`ptr + (align - ptr % align) % align`.
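For instance, with a hypothetical `ptr = 19` and `align = 16`: `19 % 16 = 3`, `(16 - 3) % 16 = 13`, and `19 + 13 = 32`, which is 16-aligned.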
To allow the dealloc op to still be able to free the memory, additional information is needed in
the buffer type. The buffer type is thus extended with an extra i8* carrying the base allocation address.
PiperOrigin-RevId: 264244455
Change the printing/parsing of spv.globalVariable to print the type of
the variable after the ':' to be consistent with MLIR convention.
The spv._address_of should print the variable type after the ':'. It was
mistakenly printing the address of the return value. Add a (missing)
test that should have caught that.
Also move spv.globalVariable and spv._address_of tests to
structure-ops.mlir.
PiperOrigin-RevId: 264204686