llvm-project

Commit Graph

Author	SHA1	Message	Date
Lei Zhang	0117865412	[mlir][spirv] NFC: Shuffle code around to better follow convention This commit shuffles SPIR-V code around to better follow MLIR convention. Specifically, * Created IR/, Transforms/, Linking/, and Utils/ subdirectories and moved suitable code inside. * Created SPIRVEnums.{h\|cpp} for SPIR-V C/C++ enums generated from SPIR-V spec. Previously they are cluttered inside SPIRVTypes.{h\|cpp}. * Fixed include guards in various header files (both .h and .td). * Moved serialization tests under test/Target/SPIRV. * Renamed TableGen backend -gen-spirv-op-utils into -gen-spirv-attr-utils as it is only generating utility functions for attributes. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D93407	2020-12-17 11:03:26 -05:00
Alex Zinenko	96076a2edb	[mlir] Support index and memref types in llvm.mlir.cast This operation is designed to support partial conversion, more specifically the IR state in which some operations expect or produce built-in types and some operations produce and expect LLVM dialect types. It is reasonable for it to support cast between built-in types and any equivalent that could be produced by the type conversion. (At the same time, we don't want the dialect to depend on the type conversion as it could lead to a dependency cycle). Introduce support for casting from index to any integer type and back, and from memref to bare pointer or memref descriptor type and back. Contrary to what the TODO in the code stated, there are no particular precautions necessary to handle the bare pointer conversion for memerfs. This conversion applies exclusively to statically-shaped memrefs, so we can always recover the full descriptor contents from the type. This patch simultaneously tightens the verification for other types to only accept matching pairs of types, e.g., i64 and !llvm.i64, as opposed to the previous implementation that only checked if the types were generally allowed byt not for matching, e.g. i64 could be "casted" to !llvm.bfloat, which is not the intended semantics. Move the relevant test under test/Dialect/LLVMIR because it is not specific to the conversion pass, but rather exercises an op in the dialect. If we decide this op does not belong to the LLVM dialect, both the dialect and the op should move together. Reviewed By: silvas, ezhulenev Differential Revision: https://reviews.llvm.org/D93405	2020-12-17 09:21:42 +01:00
Tres Popp	b17a181563	[mlir] Modify linalg loops test to have nested regions Differential Revision: https://reviews.llvm.org/D93418	2020-12-17 01:19:46 +01:00
Mehdi Amini	c21ee1a942	Improve the verifier diagnostic on dominance error Address PR47937 Differential Revision: https://reviews.llvm.org/D93361	2020-12-16 22:05:17 +00:00
Eugene Zhulenev	900d71a851	[mlir] Async: re-enable tests after fixing fkakines Test flakiness was fixed by: `9edcedf7f2` Runs these tests to verify that all parts of the lowering work correctly. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D93384	2020-12-16 11:07:03 -08:00
Christian Sigg	a79b26db0e	[mlir] Fix for gpu-async-region pass. - the !gpu.async.token is the second result of 'gpu.alloc async', not the first. - async.execute construction takes operand types not yet wrapped in !async.value. - fix typo Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D93156	2020-12-16 19:08:10 +01:00
ergawy	6551c9ac36	[mlir][spirv] Add parsing and printing support for SpecConstantOperation Adds more support for `SpecConstantOperation` by defining a custom syntax for the op and implementing its parsing and printing. Reviewed By: mravishankar, antiagainst Differential Revision: https://reviews.llvm.org/D92919	2020-12-16 08:26:48 -05:00
Alex Zinenko	20d0cbd3fa	[mlir] Tighten type verifiers for LLVM dialect ops results Now that we have predicates for LLVM dialect types in ODS, we can use them to restrict the types allowed in results of LLVM dialect operations. This also serves as additional documentation for these operations. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D93329	2020-12-15 23:50:02 +01:00
River Riddle	95019de8a1	[mlir][IR] Define the singleton builtin types in ODS instead of C++ This exposes several issues with the current generation that this revision also fixes. * TypeDef now allows specifying the base class to use when generating. * TypeDef now inherits from DialectType, which allows for using it as a TypeConstraint * Parser/Printers are now no longer generated in the header(removing duplicate symbols), and are now only generated when necessary. - Now that generatedTypeParser/Printer are only generated in the definition file, existing users will need to manually expose this functionality when necessary. * ::get() is no longer generated for singleton types, because it isn't necessary. Differential Revision: https://reviews.llvm.org/D93270	2020-12-15 13:42:19 -08:00
Sean Silva	caf4f2e372	[mlir] Handle unknown ops in dynamic_tensor_from_elements bufferization Due to how the conversion infra works, the "clone" call that this pattern was using required all the cloned ops to be immediately legalized as part of this dialect conversion invocation. That was previously working due to a couple factors: - In the test case, there was scf.if, which we happen to mark as legal as part of marking the entire SCF dialect as legal for the scf.parallel we generate here. - Originally, this test case had std.extract_element in the body, which we happened to have a pattern for in this pass. After I migrated that to `tensor.extract` (which removed the tensor.extract bufferization from here), I hacked this up to use `std.dim` which we still have patterns for in this pass. This patch updates the test case to use a truly opaque op `test.source` that properly stresses this aspect of the pattern. (this also removes a stray dependency on the `tensor` dialect that I must have left behind as part of my hacking this pass up when migrating to `tensor.extract`) Differential Revision: https://reviews.llvm.org/D93262	2020-12-15 12:50:56 -08:00
Tres Popp	c77ea40528	[mlir] Add std.pow lowering to LLVMIR Differential Revision: https://reviews.llvm.org/D93311	2020-12-15 18:54:29 +01:00
Tres Popp	9adc64539f	[mlir] Add std.powf to ROCDL lowering. Differential Revision: https://reviews.llvm.org/D93313	2020-12-15 18:47:49 +01:00
Tres Popp	f3e8f27ca1	[mlir] Fix GPUToNVVM test	2020-12-15 18:41:16 +01:00
Tres Popp	e04785b131	[mlir] Add NVVM lowering for std.pow Differential Revision: https://reviews.llvm.org/D93303	2020-12-15 18:28:23 +01:00
Tres Popp	73c580405f	[mlir] Add std op for X raised to the power of Y Proposal: https://llvm.discourse.group/t/rfc-standard-add-powop-to-std-dialect/2377 Differential Revision: https://reviews.llvm.org/D93119	2020-12-15 17:06:26 +01:00
River Riddle	d7eba20052	[mlir][Inliner] Refactor the inliner to use nested pass pipelines instead of just canonicalization Now that passes have support for running nested pipelines, the inliner can now allow for users to provide proper nested pipelines to use for optimization during inlining. This revision also changes the behavior of optimization during inlining to optimize before attempting to inline, which should lead to a more accurate cost model and prevents the need for users to schedule additional duplicate cleanup passes before/after the inliner that would already be run during inlining. Differential Revision: https://reviews.llvm.org/D91211	2020-12-14 18:09:47 -08:00
River Riddle	b3ee7f1f31	[mlir][OpDefGen] Add support for generating local functions for shared utilities This revision adds a new `StaticVerifierFunctionEmitter` class that emits local static functions in the .cpp file for shared operation verification. This class deduplicates shared operation verification code by emitting static functions alongside the op definitions. These methods are local to the definition file, and are invoked within the operation verify methods. The first bit of shared verification is for the type constraints used when verifying operands and results. An example is shown below: ``` static LogicalResult localVerify(...) { ... } LogicalResult OpA::verify(...) { if (failed(localVerify(...))) return failure(); ... } LogicalResult OpB::verify(...) { if (failed(localVerify(...))) return failure(); ... } ``` This allowed for saving >400kb of code size from a downstream TensorFlow project (~15% of MLIR code size). Differential Revision: https://reviews.llvm.org/D91381	2020-12-14 14:21:30 -08:00
Javier Setoain	aece4e2793	[mlir][ArmSVE][RFC] Add an ArmSVE dialect This revision starts an Arm-specific ArmSVE dialect discussed in the discourse RFC thread: https://llvm.discourse.group/t/rfc-vector-dialects-neon-and-sve/2284 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D92172	2020-12-14 21:35:01 +00:00
River Riddle	6bc9439f59	[mlir][OpAsmParser] Add support for parsing integer literals without going through IntegerAttr Some operations use integer literals as part of their custom format that don't necessarily map to an internal IntegerAttr. This revision exposes the same `parseInteger` functions as the DialectAsmParser to allow for these operations to parse integer literals without incurring the otherwise unnecessary roundtrip through IntegerAttr. Differential Revision: https://reviews.llvm.org/D93152	2020-12-14 12:00:43 -08:00
River Riddle	c234b65cef	[mlir][OpFormat] Add support for emitting newlines from the custom format of an operation This revision adds a new `printNewline` hook to OpAsmPrinter that allows for printing a newline within the custom format of an operation, that is then indented to the start of the operation. Support for the declarative assembly format is also added, in the form of a `\n` literal. Differential Revision: https://reviews.llvm.org/D93151	2020-12-14 12:00:43 -08:00
Christian Sigg	a1eb154421	[flang] Use mlir::OpState::operator->() to get to methods of mlir::Operation. This is a preparation step to remove those methods from OpState. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D93194	2020-12-14 20:04:53 +01:00
Thomas Raoux	8955e9f6b7	[mlir][linalg] Fix bug in elementwise vectorization Fix a bug causing to pick the wrong vector size to broadcast to when the source vectors have different ranks. Differential Revision: https://reviews.llvm.org/D93118	2020-12-14 10:44:36 -08:00
Frederik Gossen	75d9a46090	[MLIR] Add atan and atan2 lowerings to CUDA intrinsics Differential Revision: https://reviews.llvm.org/D93124	2020-12-14 10:45:28 +01:00
Frederik Gossen	1c6bc2c0b5	[MLIR] Add lowerings for atan and atan2 to ROCDL intrinsics Differential Revision: https://reviews.llvm.org/D93123	2020-12-14 10:43:19 +01:00
ergawy	076f87a867	[MLIR][SPIRV] Add support for GLSL F/U/SClamp. Adds support for 3 ternary ops from SPIR-V extended instructions for GLSL. Namely, adds support for FClamp, UClamp, and SClamp. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D92859	2020-12-13 09:56:46 -05:00
Christian Sigg	1ffc1aaa09	[mlir] Use mlir::OpState::operator->() to get to methods of mlir::Operation. This is a preparation step to remove those methods from OpState. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D93098	2020-12-13 09:58:16 +01:00
Chris Lattner	a44e630353	[AsmParser] Fix support for zero bit integer types. Zero bit integer types are supported by IntegerType for consistency, but the asmparser never got updated. Allow them to be parsed, as required to fix CIRCT issue #316 Differential Revision: https://reviews.llvm.org/D93089	2020-12-12 21:24:18 -08:00
kweisamx	c84b53ca9b	[mlir] Add Python binding for MLIR Dict Attribute Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D93004	2020-12-13 04:30:35 +00:00
Brian Gesiak	09b0e0884a	[mlir] Print bad size in AttrSizedOperandSegments When printing verification errors for ops with the incorrect number of operand segments, print the required number as well as the actual number. Split off from D93005. Differential Revision: https://reviews.llvm.org/D93145	2020-12-12 13:12:31 -05:00
Mehdi Amini	aadcb26ee1	Store a MlirIdentifier instead of a MlirStringRef in MlirNameAttribute This mirror the C++ API for NamedAttribute, and has the advantage or internalizing earlier in the Context and not requiring the caller to keep the StringRef alive beyong this call. Differential Revision: https://reviews.llvm.org/D93133	2020-12-11 22:38:48 +00:00
Sean Silva	444822d77a	Revert "Revert "[mlir] Start splitting the `tensor` dialect out of `std`."" This reverts commit `0d48d265db`. This reapplies the following commit, with a fix for CAPI/ir.c: [mlir] Start splitting the `tensor` dialect out of `std`. This starts by moving `std.extract_element` to `tensor.extract` (this mirrors the naming of `vector.extract`). Curiously, `std.extract_element` supposedly works on vectors as well, and this patch removes that functionality. I would tend to do that in separate patch, but I couldn't find any downstream users relying on this, and the fact that we have `vector.extract` made it seem safe enough to lump in here. This also sets up the `tensor` dialect as a dependency of the `std` dialect, as some ops that currently live in `std` depend on `tensor.extract` via their canonicalization patterns. Part of RFC: https://llvm.discourse.group/t/rfc-split-the-tensor-dialect-from-std/2347/2 Differential Revision: https://reviews.llvm.org/D92991	2020-12-11 14:30:50 -08:00
Sean Silva	0d48d265db	Revert "[mlir] Start splitting the `tensor` dialect out of `std`." This reverts commit `cab8dda90f`. I mistakenly thought that CAPI/ir.c failure was unrelated to this change. Need to debug it.	2020-12-11 14:15:41 -08:00
Sean Silva	cab8dda90f	[mlir] Start splitting the `tensor` dialect out of `std`. This starts by moving `std.extract_element` to `tensor.extract` (this mirrors the naming of `vector.extract`). Curiously, `std.extract_element` supposedly works on vectors as well, and this patch removes that functionality. I would tend to do that in separate patch, but I couldn't find any downstream users relying on this, and the fact that we have `vector.extract` made it seem safe enough to lump in here. This also sets up the `tensor` dialect as a dependency of the `std` dialect, as some ops that currently live in `std` depend on `tensor.extract` via their canonicalization patterns. Part of RFC: https://llvm.discourse.group/t/rfc-split-the-tensor-dialect-from-std/2347/2 Differential Revision: https://reviews.llvm.org/D92991	2020-12-11 13:50:55 -08:00
Alex Zinenko	dacfb24b30	[mlir] Support inlining into affine operations Introduce support for inlining into affine operations. This uses the generic inline infrastructure and boils down to checking that, if applied, the inlining doesn't violate the affine dimension/symbol value categorization. Given valid IR, only the values that are valid dimensions/symbols thanks to being top-level in their affine scope need special handling. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D92770	2020-12-11 16:24:27 +01:00
Nicolas Vasilache	7310501f74	[mlir][ArmNeon][RFC] Add a Neon dialect This revision starts an Arm-specific ArmNeon dialect discussed in the [discourse RFC thread](https://llvm.discourse.group/t/rfc-vector-dialects-neon-and-sve/2284). Differential Revision: https://reviews.llvm.org/D92171	2020-12-11 13:49:40 +00:00
Adrian Kuegel	ada4c7a351	Add rsqrt lowering from standard to ROCDL. Add a lowering for rsqrt from standard dialect to ROCDL. Differential Revision: https://reviews.llvm.org/D93011	2020-12-11 13:18:57 +01:00
River Riddle	186c154991	[mlir] Remove the dependency on StandardOps from FoldUtils OperationFolder currently uses ConstantOp as a backup when trying to materialize a constant after an operation is folded. This dependency isn't really useful or necessary given that dialects can/should provide a `materializeConstant` implementation. Fixes PR#44866 Differential Revision: https://reviews.llvm.org/D92980	2020-12-10 14:13:57 -08:00
River Riddle	c24f88b4db	[mlir][SCCP] Don't visit private callables unless they are used when tracking interprocedural arguments/results This fixes a subtle bug where SCCP could incorrectly optimize a private callable while waiting for its arguments to be resolved. Fixes PR#48457 Differential Revision: https://reviews.llvm.org/D92976	2020-12-10 12:53:27 -08:00
Mehdi Amini	285c0aa262	Add MLIR Python binding for Array Attribute Differential Revision: https://reviews.llvm.org/D92948	2020-12-10 20:51:34 +00:00
River Riddle	75eca67c1c	[mlir][Parser] Fix crash in DenseElementsAttr parser when no elements are parsed This fixes a crash when no elements are parsed, but the type expects at least one. Fixes PR#47763 Differential Revision: https://reviews.llvm.org/D92982	2020-12-10 12:48:37 -08:00
River Riddle	1f5f006d9d	[mlir][StandardOps] Verify that the result of an integer constant is signless This was missed when supported for unsigned/signed integer types was first added, and results in crashes if a user tries to create/print a constant with the incorrect integer type. Fixes PR#46222 Differential Revision: https://reviews.llvm.org/D92981	2020-12-10 12:40:10 -08:00
Benjamin Kramer	1d00508c5b	[mlir][Shape] Make sure tensor_cast(constant_shape) folding uses the correct type This is still subtle, but I think the test cases are sufficient to show that it works. Differential Revision: https://reviews.llvm.org/D92927	2020-12-10 10:49:25 +01:00
Adrian Kuegel	09f717b929	Add sqrt lowering from standard to ROCDL Add a lowering for sqrt from standard dialect to ROCDL. Differential Revision: https://reviews.llvm.org/D92921	2020-12-10 09:47:37 +01:00
Sergei Grechanik	2d3b9fdc19	[mlir][Affine] Fix vectorizability check for multiple load/stores This patch fixes a bug that allowed vectorizing of loops with loads and stores having indexing functions varying along different memory dimensions. Reviewed By: aartbik, dcaballe Differential Revision: https://reviews.llvm.org/D92702	2020-12-09 12:19:34 -08:00
Christian Sigg	0bf4a82a5a	[mlir] Use mlir::OpState::operator->() to get to methods of mlir::Operation. This is a preparation step to remove the corresponding methods from OpState. Reviewed By: silvas, rriddle Differential Revision: https://reviews.llvm.org/D92878	2020-12-09 12:11:32 +01:00
Frederik Gossen	b4750f58d8	Add sqrt lowering from standard to NVVM Differential Revision: https://reviews.llvm.org/D92850	2020-12-08 17:08:27 +01:00
Benjamin Kramer	5844bc540c	[mlir][Shape] Canonicalize assume_all with one input and tensor_cast of constant_shape This allows simplifying some more complicated shape expressions Differential Revision: https://reviews.llvm.org/D92843	2020-12-08 17:07:24 +01:00
ergawy	6c69d3d68e	[MLIR][SPIRV] Add initial support for OpSpecConstantOp. This commit adds initial support for SPIR-V OpSpecConstantOp instruction. The following is introdcued: - A new `spv.specConstantOperation` operation consisting of a single region and of 2 operations within that regions (more details in the docs of the op itself). - A new `spv.yield` instruction that acts a terminator for `spv.specConstantOperation`. For now, the generic form of the new op is supported (i.e. no custom parsing or printing). This will be done in a follow up patch. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D92232	2020-12-08 09:07:52 -05:00
Frederik Gossen	bb7d43e7d5	Add rsqrt lowering from standard to NVVM Differential Revision: https://reviews.llvm.org/D92838	2020-12-08 14:33:58 +01:00
Alex Zinenko	80766ecc65	[mlir] Add an option to control the number of loops in affine parallelizer Add a pass option to control the number of nested parallel loops produced by the parallelization passes. This is useful to build end-to-end passes targeting systems that don't need multiple parallel dimensions (e.g., CPUs typically need only one). Reviewed By: wsmoses, chelini Differential Revision: https://reviews.llvm.org/D92765	2020-12-08 10:44:37 +01:00
Alex Zinenko	2fe30a3534	[mlir] properly support min/max in affine parallelization The existing implementation of the affine parallelization silently copies over the lower and upper bound maps from affine.for to affine.parallel. However, the semantics of these maps differ between these two ops: in affine.for, a max(min) of results is taken for the lower(upper) bound; in affine.parallel, multiple induction variables can be defined an each result corresponds to one induction variable. Thus the existing implementation could generate invalid IR or IR that passes the verifier but has different semantics than the original code. Fix the parallelization utility to emit dedicated min/max operations before the affine.parallel in such cases. Disallow parallelization if min/max would have been in an operation without the AffineScope trait, e.g., in another loop, since the result of these operations is not considered a valid affine dimension identifier and may not be properly handled by the affine analyses. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D92763	2020-12-08 10:43:35 +01:00
Mehdi Amini	e56f398dd3	Add Python binding for MLIR Type Attribute Differential Revision: https://reviews.llvm.org/D92711	2020-12-07 23:06:58 +00:00
Mehdi Amini	e15ae454b4	Customize exception thrown from mlir.Operation.create() python bindings The default exception handling isn't very user friendly and does not point accurately to the issue. Instead we can indicate which of the operands isn't valid and provide contextual information in the error message. Differential Revision: https://reviews.llvm.org/D92710	2020-12-07 23:06:58 +00:00
Aart Bik	74cd9e587d	[mlir][sparse] hoist loop invariant tensor loads in sparse compiler After bufferization, the backend has much more trouble hoisting loop invariant loads from the loops generated by the sparse compiler. Therefore, this is done during sparse code generation. Note that we don't bother hoisting derived invariant expressions on SSA values, since the backend does that very well. Still TBD: scalarize reductions to avoid load-add-store cycles Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D92534	2020-12-07 11:59:48 -08:00
Navdeep Kumar	dc930e5f2f	[MLIR][Affine] Add affine.for normalization support Add support to normalize affine.for ops i.e., convert the lower bound to zero and loop step to one. The Upper bound is set to the trip count of the loop. The exact value of loopIV is calculated just inside the body of affine.for. Currently loops with lower bounds having single result are supported. No such restriction exists on upper bounds. Differential Revision: https://reviews.llvm.org/D92233	2020-12-07 22:04:07 +05:30
Sourabh Singh Tomar	c11d868a39	[MLIR,OpenMP] Added support for lowering MasterOp to LLVMIR Some Ops in OMP dialect have regions associated with them i.e `ParallelOp` `MasterOp`. Lowering of these regions involves interfacing with `OMPIRBuilder` using callbacks, yet there still exist opportunities for sharing common code in between. This patch factors out common code into a separate function and adds support for lowering `MasterOp` using that. Lowering of `ParallelOp` is also modified appropriately. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D87247	2020-12-07 10:23:54 +05:30
River Riddle	7924fb34f3	[mlir][OpFormatGen] Add support for optional enum attributes The check for formatting enum attributes was missing a call to get the base attribute, which is necessary to strip off the top-level OptionalAttr<> wrapper. Differential Revision: https://reviews.llvm.org/D92713	2020-12-04 21:00:44 -08:00
Thomas Raoux	3e3e276d22	[mlir][vector][NFC] Change UnrollVectorPattern to not be statically dependent on an op type Make UnrollVectorPattern inherit from RewritePattern instead of OpRewritePattern so that we don't need to create many patterns when applying to many different type of ops. Since we may want to apply the pattern to all arithmetic op, it is more convenient to filter dynamically. Differential Revision: https://reviews.llvm.org/D92635	2020-12-04 09:53:01 -08:00
Rahul Joshi	fe7fdcac87	[MLIR] Fix parseFunctionLikeOp() to fail parsing empty regions - Change parseOptionalRegion to return an OptionalParseResult. - Change parseFunctionLikeOp() to fail parsing if the function body was parsed but was empty. - See https://llvm.discourse.group/t/funcop-parsing-bug/2164 Differential Revision: https://reviews.llvm.org/D91886	2020-12-04 09:09:59 -08:00
Nicolas Vasilache	a1cd559ce5	[mlir][Linalg] Properly use distribution options. Let tiling to scf.for actually use the distribution method. For now only Cyclic is supported. Differential Revision: https://reviews.llvm.org/D92653	2020-12-04 14:00:54 +00:00
Hanhan Wang	f5f1a5c244	[mlir][Linalg] Handle fusion on tensors for projected permutation. In the past, the reshape op can be folded only if the indexing map is permutation in consumer's usage. We can relax to condition to be projected permutation. This patch still limits the fusion for scalar cases. Scalar case is a corner case, because we need to decide where to put extra dims. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D92466	2020-12-03 23:11:29 -08:00
River Riddle	c7cae0e4fa	[mlir][Attributes][NFC] Move all builtin Attribute classes to BuiltinAttributes.h This mirrors the file structure of Types. Differential Revision: https://reviews.llvm.org/D92499	2020-12-03 18:02:11 -08:00
River Riddle	09f7a55fad	[mlir][Types][NFC] Move all of the builtin Type classes to BuiltinTypes.h This is part of a larger refactoring the better congregates the builtin structures under the BuiltinDialect. This also removes the problematic "standard" naming that clashes with the "standard" dialect, which is not defined within IR/. A temporary forward is placed in StandardTypes.h to allow time for downstream users to replaced references. Differential Revision: https://reviews.llvm.org/D92435	2020-12-03 18:02:10 -08:00
Aart Bik	c95acf052b	[mlir][vector][avx512] move avx512 lowering pass into general vector lowering A separate AVX512 lowering pass does not compose well with the regular vector lowering pass. As such, it is at risk of code duplication and lowering inconsistencies. This change removes the separate AVX512 lowering pass and makes it an "option" in the regular vector lowering pass (viz. vector dialect "augmented" with AVX512 dialect). Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D92614	2020-12-03 17:23:46 -08:00
George	5f65c4a8e6	Use MlirStringRef in StandardAttributes.h	2020-12-03 16:12:01 -08:00
River Riddle	672cc75cce	[mlir][IR] Remove references to BuiltinOps from IR/ There isn't a good reason for anything within IR to specifically reference any of the builtin operations. The only place that had a good reason in the past was AsmPrinter, but the behavior there doesn't need to hardcode ModuleOp anymore. Differential Revision: https://reviews.llvm.org/D92448	2020-12-03 15:47:01 -08:00
Thomas Raoux	c503dc1b8a	[mlir][linalg] Add vectorization for element-wise linalg ops Add support for vectorization for linalg.generic representing element-wise ops. Those are converted to transfer_read + vector ops + transfer_write. Also re-organize the vectorization tests to be together. Implementation derived from the work of @burmako, @agrue and @fedelebron. Differential Revision: https://reviews.llvm.org/D92540	2020-12-03 15:31:13 -08:00
David Blaikie	0fd0f885eb	[mlir] Use long rather than int to address pointer-to-int narrowing warning	2020-12-03 13:09:36 -08:00
Mehdi Amini	1c2159494d	Use the generic form when printing from the python bindings and the verifier fails This reduces the chances of segfault. While it is a good practice to ensure robust custom printers, it is unfortunately common to have them crash on invalid input. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D92536	2020-12-03 18:45:00 +00:00
Max Kudryavtsev	636db7f87c	[MLIR] Fix vector::TransferWriteOp builder losing permutation map Supervectorizer pass uses this builder and loses the permutation map. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D92145	2020-12-03 09:53:08 -08:00
Haruki Imai	b2391d5f0d	[MLIR] Normalize the results of normalizable operations Memrefs with affine_map in the results of normalizable operation were not normalized by `--normalize-memrefs` option. This patch normalizes them. Differential Revision: https://reviews.llvm.org/D88719	2020-12-03 19:34:07 +05:30
Julian Gross	8aeca73702	[MLIR] Added support for dynamic shaped allocas to promote-buffers-to-stack pass. Extended promote buffers to stack pass to support dynamically shaped allocas. The conversion is limited by the rank of the underlying tensor. An option is added to the pass to adjust the given rank. Differential Revision: https://reviews.llvm.org/D91969	2020-12-03 11:47:49 +01:00
Christian Sigg	d9adde5ae2	[mlir][gpu] Move gpu.wait ops from async.execute regions to its dependencies. This can prevent unnecessary host synchronization. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D90346	2020-12-03 08:52:28 +01:00
Uday Bondhugula	b276bf5a57	[MLIR][NFC] Fix mix up between dialect attribute values and names Clear up documentation on dialect attribute values. Fix/improve ModuleOp verifier error message on dialect prefixed attribute names. Additional discussion is here: https://llvm.discourse.group/t/moduleop-attributes/2325 Differential Revision: https://reviews.llvm.org/D92502	2020-12-03 02:34:15 +05:30
Christian Sigg	c4a0405902	Add `Operation* OpState::operator->()` to provide more convenient access to members of Operation. Given that OpState already implicit converts to Operator*, this seems reasonable. The alternative would be to add more functions to OpState which forward to Operation. Reviewed By: rriddle, ftynse Differential Revision: https://reviews.llvm.org/D92266	2020-12-02 15:46:20 +01:00
River Riddle	abfd1a8b3b	[mlir][PDL] Add support for PDL bytecode and expose PDL support to OwningRewritePatternList PDL patterns are now supported via a new `PDLPatternModule` class. This class contains a ModuleOp with the pdl::PatternOp operations representing the patterns, as well as a collection of registered C++ functions for native constraints/creations/rewrites/etc. that may be invoked via the pdl patterns. Instances of this class are added to an OwningRewritePatternList in the same fashion as C++ RewritePatterns, i.e. via the `insert` method. The PDL bytecode is an in-memory representation of the PDL interpreter dialect that can be efficiently interpreted/executed. The representation of the bytecode boils down to a code array(for opcodes/memory locations/etc) and a memory buffer(for storing attributes/operations/values/any other data necessary). The bytecode operations are effectively a 1-1 mapping to the PDLInterp dialect operations, with a few exceptions in cases where the in-memory representation of the bytecode can be more efficient than the MLIR representation. For example, a generic `AreEqual` bytecode op can be used to represent AreEqualOp, CheckAttributeOp, and CheckTypeOp. The execution of the bytecode is split into two phases: matching and rewriting. When matching, all of the matched patterns are collected to avoid the overhead of re-running parts of the matcher. These matched patterns are then considered alongside the native C++ patterns, which rewrite immediately in-place via `RewritePattern::matchAndRewrite`, for the given root operation. When a PDL pattern is matched and has the highest benefit, it is passed back to the bytecode to execute its rewriter. Differential Revision: https://reviews.llvm.org/D89107	2020-12-01 15:05:50 -08:00
Rahul Joshi	6b043ecdb7	[MLIR] Fix genTypeInterfaceMethods() to work correctly with InferTypeOpInterface - Change InferTypeOpInterface::inferResultTypes to use fully qualified types matching the ones generated by genTypeInterfaceMethods, so the redundancy can be detected. - Move genTypeInterfaceMethods() before genOpInterfaceMethods() so that the inferResultTypes method generated by genTypeInterfaceMethods() takes precedence over the declaration that might be generated by genOpInterfaceMethods() - Modified an op in the test dialect to exercise this (the modified op would fail to generate valid C++ code due to duplicate inferResultTypes methods). Differential Revision: https://reviews.llvm.org/D92414	2020-12-01 13:36:25 -08:00
Ray (I-Jui) Sung	ff2e22853f	Don't count attributes when addressing operands. Fixes out-of-bound access in generated nested DAG rewriter matching code. Reviewed By: tpopp Differential Revision: https://reviews.llvm.org/D92075	2020-12-01 01:21:36 +00:00
Sean Silva	774f1d3ffd	[mlir] Small cleanups to func-bufferize/finalizing-bufferize - Address TODO in scf-bufferize: the argument materialization issue is now fixed and the code is now in Transforms/Bufferize.cpp - Tighten up finalizing-bufferize to avoid creating invalid IR when operand types potentially change - Tidy up the testing of func-bufferize, and move appropriate tests to a new finalizing-bufferize.mlir - The new stricter checking in finalizing-bufferize revealed that we needed a DimOp conversion pattern (found when integrating into npcomp). Previously, the converion infrastructure was blindly changing the operand type during finalization, which happened to work due to DimOp's tensor/memref polymorphism, but is generally not encouraged (the new pattern is the way to tell the conversion infrastructure that it is legal to change that type).	2020-11-30 17:04:14 -08:00
Nicolas Vasilache	047400ed82	[mlir][LLVMIR] Add support for InlineAsmOp The InlineAsmOp mirrors the underlying LLVM semantics with a notable exception: the embedded `asm_string` is not allowed to define or reference any symbol or any global variable: only the operands of the op may be read, written, or referenced. Attempting to define or reference any symbol or any global behavior is considered undefined behavior at this time. The asm dialect syntax is currently specified with an integer (0 [default] for the "att dialect", 1 for the intel dialect) to circumvent the ODS limitation on string enums. Translation to LLVM is provided and raises the fact that the asm constraints string must be well-formed with respect to in/out operands. No check is performed on the asm_string. An InlineAsm instruction in LLVM is a special call operation to a function that is constructed on the fly. It does not fit the current model of MLIR calls with symbols. As a consequence, the current implementation constructs the function type in ModuleTranslation.cpp. This should be refactored in the future. The mlir-cpu-runner is augmented with the global initialization of the X86 asm parser to allow proper execution in JIT mode. Previously, only the X86 asm printer was initialized. Differential revision: https://reviews.llvm.org/D92166	2020-11-30 08:32:02 +00:00
Stella Laurenzo	62195b7548	[mlir][CAPI] Convert the rest of the API int -> bool. * Follows on https://reviews.llvm.org/D92193 * I had a mid-air collision with some additional occurrences and then noticed that there were a lot more. Think I got them all. Differential Revision: https://reviews.llvm.org/D92292	2020-11-29 20:36:42 -08:00
Stella Laurenzo	ba0fe76b7e	[mlir][Python] Add an Operation.result property. * If ODS redefines this, it is fine, but I have found this accessor to be universally useful in the old npcomp bindings and I'm closing gaps that will let me switch. Differential Revision: https://reviews.llvm.org/D92287	2020-11-29 18:09:07 -08:00
Stella Laurenzo	bd2083c2fa	[mlir][Python] Python API cleanups and additions found during code audit. * Add capsule get/create for Attribute and Type, which already had capsule interop defined. * Add capsule interop and get/create for Location. * Add Location __eq__. * Use get() and implicit cast to go from PyAttribute, PyType, PyLocation to MlirAttribute, MlirType, MlirLocation (bundled with this change because I didn't want to continue the pattern one more time). Differential Revision: https://reviews.llvm.org/D92283	2020-11-29 18:09:07 -08:00
Jacques Pienaar	e534cee26a	[mlir] Add a shape function library op Op with mapping from ops to corresponding shape functions for those op in the library and mechanism to associate shape functions to functions. The mapping of operand to shape function is kept separate from the shape functions themselves as the operation is associated to the shape function and not vice versa, and one could have a common library of shape functions that can be used in different contexts. Use fully qualified names and require a name for shape fn lib ops for now and an explicit print/parse (based around the generated one & GPU module op ones). This commit reverts `d9da4c3e73`. Fixes missing headers (don't know how that was working locally). Differential Revision: https://reviews.llvm.org/D91672	2020-11-29 11:15:30 -08:00
Mehdi Amini	d9da4c3e73	Revert "[mlir] Add a shape function library op" This reverts commit `6dd9596b19`. Build is broken.	2020-11-29 05:28:42 +00:00
Jacques Pienaar	6dd9596b19	[mlir] Add a shape function library op Op with mapping from ops to corresponding shape functions for those op in the library and mechanism to associate shape functions to functions. The mapping of operand to shape function is kept separate from the shape functions themselves as the operation is associated to the shape function and not vice versa, and one could have a common library of shape functions that can be used in different contexts. Use fully qualified names and require a name for shape fn lib ops for now and an explicit print/parse (based around the generated one & GPU module op ones). Differential Revision: https://reviews.llvm.org/D91672	2020-11-28 15:53:59 -08:00
Christian Sigg	acb69f3b7c	[mlir] Change ConvertOpToLLVMPattern::matchAndRewrite argument to concrete operand type. Reviewed By: herhut, ftynse Differential Revision: https://reviews.llvm.org/D92111	2020-11-28 13:09:25 +01:00
Tamas Berghammer	e4c74fd9dd	Don't elide splat attributes during printing A splat attribute have a single element during printing so we should treat it as such when we decide if we elide it or not based on the flag intended to elide large attributes. Reviewed By: rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D92165	2020-11-27 14:42:26 +00:00
Frederik Gossen	6484567f14	[MLIR][SCF] Find all innermost loops for parallel loop tiling Overcome the assumption that parallel loops are only nested in other parallel loops. Differential Revision: https://reviews.llvm.org/D92188	2020-11-27 10:08:56 +01:00
Christian Sigg	5535696c38	[mlir] Add gpu.allocate, gpu.deallocate ops with LLVM lowering to runtime function calls. The ops are very similar to the std variants, but support async GPU execution. gpu.alloc does not currently support an alignment attribute, and the new ops do not have canonicalizers/folders like their std siblings do. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D91698	2020-11-27 09:40:59 +01:00
Nicolas Vasilache	5dd5a08363	[mlir] Let ModuleTranslate propagate LLVM triple This adds LLVM triple propagation and updates the test that did not check it properly. Differential Revision: https://reviews.llvm.org/D92182	2020-11-27 08:01:44 +00:00
Stephan Herhut	20c926e079	[mlir][DialectConversion] Do not prematurely drop unused cast operations The rewrite logic has an optimization to drop a cast operation after rewriting block arguments if the cast operation has no users. This is unsafe as there might be a pending rewrite that replaced the cast operation itself and hence would trigger a second free. Instead, do not remove the casts and leave it up to a later canonicalization to do so. Differential Revision: https://reviews.llvm.org/D92184	2020-11-26 17:39:14 +01:00
Benjamin Kramer	9549abcbb8	Remove stray debug-only from test	2020-11-26 15:37:18 +01:00
Stephan Herhut	4dd5f79f07	[mlir][bufferize] Add argument materialization for bufferization This enables partial bufferization that includes function signatures. To test this, this change also makes the func-bufferize partial and adds a dedicated finalizing-bufferize pass. Differential Revision: https://reviews.llvm.org/D92032	2020-11-26 13:43:44 +01:00
Aart Bik	d5f0d0c0c4	[mlir][sparse] add ability to select pointer/index storage type This change gives sparse compiler clients more control over selecting individual types for the pointers and indices in the sparse storage schemes. Narrower width obviously results in smaller memory footprints, but the range should always suffice for the maximum number of entries or index value. Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D92126	2020-11-25 17:32:44 -08:00
Aart Bik	5c4e397e6c	[mlir][sparse] add parallelization strategies to sparse compiler This CL adds the ability to request different parallelization strategies for the generate code. Every "parallel" loop is a candidate, and converted to a parallel op if it is an actual for-loop (not a while) and the strategy allows dense/sparse outer/inner parallelization. This will connect directly with the work of @ezhulenev on parallel loops. Still TBD: vectorization strategy Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D91978	2020-11-24 17:17:13 -08:00
Sean Silva	dfbb5a087e	[mlir] Remove SameOperandsAndResultShape when redundant with ElementwiseMappable SameOperandsAndResultShape and ElementwiseMappable have similar verification, but in general neither is strictly redundant with the other. Examples: - SameOperandsAndResultShape allows `"foo"(%0) : tensor<2xf32> -> tensor<?xf32> but ElementwiseMappable does not. - ElementwiseMappable allows `select %scalar_pred, %true_tensor, %false_tensor` but SameOperandsAndResultShape does not. SameOperandsAndResultShape is redundant with ElementwiseMappable when we can prove that the mixed scalar/non-scalar case cannot happen. In those situations, `ElementwiseMappable & SameOperandsAndResultShape == ElementwiseMappable`: - Ops with 1 operand: the case of mixed scalar and non-scalar operands cannot happen since there is only one operand. - When SameTypeOperands is also present, the mixed scalar/non-scalar operand case cannot happen. Differential Revision: https://reviews.llvm.org/D91396	2020-11-24 13:53:22 -08:00
Aart Bik	b228e2bd92	[mlir][sparse] generalize invariant expression handling in sparse compiler Generalizes invariant handling to anything defined outside the Linalg op (parameters and SSA computations). Fixes bug that was using parameter number as tensor number. Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D91985	2020-11-24 13:41:14 -08:00
Alex Zinenko	119545f433	[mlir] Add conversion from SCF parallel loops to OpenMP Introduce a conversion pass from SCF parallel loops to OpenMP dialect constructs - parallel region and workshare loop. Loops with reductions are not supported because the OpenMP dialect cannot model them yet. The conversion currently targets only one level of parallelism, i.e. only one top-level `omp.parallel` operation is produced even if there are nested `scf.parallel` operations that could be mapped to `omp.wsloop`. Nested parallelism support is left for future work. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D91982	2020-11-24 21:12:56 +01:00
Tei Jeong	760063267c	Fix CalibratedQuantizedType's print function to match parser Reviewed By: liufengdb Differential Revision: https://reviews.llvm.org/D92034	2020-11-24 09:35:35 -08:00
Stella Laurenzo	db9713cd77	[mlir] Add Tosa dialect const folder for tosa.const. * Was missed in the initial submission and is required for a ConstantLike op. * Also adds a materializeConstant hook to preserve it. * Tightens up the argument constraint on tosa.const to match what is actually legal. Differential Revision: https://reviews.llvm.org/D92040	2020-11-24 17:33:00 +00:00
Nicolas Vasilache	a8de412f51	[mlir] NFC - Expose an OffsetSizeAndStrideOpInterface This revision will make it easier to create new ops base on the strided memref abstraction outside of the std dialect. OffsetSizeAndStrideOpInterface is an interface for ops that allow specifying mixed dynamic and static offsets, sizes and strides variadic operands. Ops that implement this interface need to expose the following methods: 1. `getArrayAttrRanks` to specify the length of static integer attributes. 2. `offsets`, `sizes` and `strides` variadic operands. 3. `static_offsets`, resp. `static_sizes` and `static_strides` integer array attributes. The invariants of this interface are: 1. `static_offsets`, `static_sizes` and `static_strides` have length exactly `getArrayAttrRanks()`[0] (resp. [1], [2]). 2. `offsets`, `sizes` and `strides` have each length at most `getArrayAttrRanks()`[0] (resp. [1], [2]). 3. if an entry of `static_offsets` (resp. `static_sizes`, `static_strides`) is equal to a special sentinel value, namely `ShapedType::kDynamicStrideOrOffset` (resp. `ShapedType::kDynamicSize`, `ShapedType::kDynamicStrideOrOffset`), then the corresponding entry is a dynamic offset (resp. size, stride). 4. a variadic `offset` (resp. `sizes`, `strides`) operand must be present for each dynamic offset (resp. size, stride). This interface is useful to factor out common behavior and provide support for carrying or injecting static behavior through the use of the static attributes. Differential Revision: https://reviews.llvm.org/D92011	2020-11-24 14:42:47 +00:00
Alexander Belyaev	fd92c5dbee	[mlir][linalg] Add bufferization pattern for `linalg.indexed_generic`. Differential Revision: https://reviews.llvm.org/D92014	2020-11-24 11:14:21 +01:00
Alex Zinenko	ee6255d207	[mlir] move lib/Bindings/Python/Attributes.td to include/mlir/Bindings/Python This file is intended to be included by other files, including out-of-tree dialects, and makes more sense in `include` than in `lib`. Depends On D91652 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D91961	2020-11-24 09:19:01 +01:00
Alex Zinenko	029e199dbf	[mlir] Make attributes mutable in Python bindings Attributes represent additional data about an operation and are intended to be modifiable during the lifetime of the operation. In the dialect-specific Python bindings, attributes are exposed as properties on the operation class. Allow for assigning values to these properties. Also support creating new and deleting existing attributes through the generic "attributes" property of an operation. Any validity checking must be performed by the op verifier after the mutation, similarly to C++. Operations are not invalidated in the process: no dangling pointers can be created as all attributes are owned by the context and will remain live even if they are not used in any operation. Introduce a Python Test dialect by analogy with the Test dialect and to avoid polluting the latter with Python-specific constructs. Use this dialect to implement a test for the attribute access and mutation API. Reviewed By: stellaraccident, mehdi_amini Differential Revision: https://reviews.llvm.org/D91652	2020-11-24 09:16:25 +01:00
Alex Zinenko	f7d033f4d8	[mlir] Support WsLoopOp in OpenMP to LLVM dialect conversion It is a simple conversion that only requires to change the region argument types, generalize it from ParallelOp. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D91989	2020-11-23 23:28:02 +01:00
George	df9ae59928	Use MlirStringRef throughout the C API While this makes the unit tests a bit more verbose, this simplifies the creation of bindings because only the bidirectional mapping between the host language's string type and MlirStringRef need to be implemented. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D91905	2020-11-23 14:07:30 -08:00
MaheshRavishankar	e65a5e5b00	[mlir][Linalg] Fuse sequence of Linalg operation (on buffers) Enhance the tile+fuse logic to allow fusing a sequence of operations. Make sure the value used to obtain tile shape is a SubViewOp/SubTensorOp. Current logic used to get the bounds of loop depends on the use of `getOrCreateRange` method on `SubViewOp` and `SubTensorOp`. Make sure that the value/dim used to compute the range is from such ops. This fix is a reasonable WAR, but a btter fix would be to make `getOrCreateRange` method be a method of `ViewInterface`. Differential Revision: https://reviews.llvm.org/D90991	2020-11-23 10:30:51 -08:00
George	0c5cff300f	Add userData to the diagnostic handler C API Previously, there was no way to add context to the diagnostic engine via the C API. Adding this ability makes it much easier to reason about memory ownership, particularly in reference-counted languages such as Swift. There are more details in the review comments. Reviewed By: ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D91738	2020-11-23 09:52:45 -08:00
Alex Zinenko	619630f997	[mlir] Temporarily disable flaky mlir-cpu-runner async tests These tests fail sporadically on irrelevant commits, e.g. http://lab.llvm.org:8011/#/builders/61/builds/1777 as well as in local builds.	2020-11-23 16:53:15 +01:00
Alex Zinenko	31a233d463	[mlir] canonicalize away zero-iteration SCF for loops An SCF 'for' loop does not iterate if its lower bound is equal to its upper bound. Remove loops where both bounds are the same SSA value as such bounds are guaranteed to be equal. Similarly, remove 'parallel' loops where at least one pair of respective lower/upper bounds is specified by the same SSA value. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D91880	2020-11-23 15:04:31 +01:00
Alex Zinenko	1ec60862d7	[mlir] Avoid cloning ops in SCF parallel conversion to CFG The existing implementation of the conversion from SCF Parallel operation to SCF "for" loops in order to further convert those loops to branch-based CFG has been cloning the loop and reduction body operations into the new loop because ConversionPatternRewriter was missing support for moving blocks while replacing their arguments. This functionality now available, use it to implement the conversion and avoid cloning operations, which may lead to doubling of the IR size during the conversion. In addition, this fixes an issue with converting nested SCF "if" conditionals present in "parallel" operations that would cause the conversion infrastructure to stop because of the repeated application of the pattern converting "newly" created "if"s (which were in fact just moved). Arguably, this should be fixed at the infrastructure level and this fix is a workaround. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D91955	2020-11-23 14:01:22 +01:00
Nicolas Vasilache	9ac0b314a4	[mlir][Linalg] Drop symbol_source abstraction which does not pay for itself. Differential Revision: https://reviews.llvm.org/D91956	2020-11-23 12:43:02 +00:00
Nicolas Vasilache	01c4418544	[mlir][Linalg] NFC - Factor out Linalg functionality for shape and loop bounds computation This revision refactors code used in various Linalg transformations and makes it a first class citizen to the LinalgStructureOpInterface. This is in preparation to allowing more advanced Linalg behavior but is otherwise NFC. Differential revision: https://reviews.llvm.org/D91863	2020-11-23 10:17:18 +00:00
John Demme	95956c1c9a	[MLIR] ODS typedef gen fixes & improvements - Fixes bug 48242 point 3 crash. - Makes the improvments from points 1 & 2. https://bugs.llvm.org/show_bug.cgi?id=48262 ``` def RTLValueType : Type<CPred<"isRTLValueType($_self)">, "Type"> { string cppType = "::mlir::Type"; } ``` Works now, but merely by happenstance. Parameters expects a `TypeParameter` class def or a string representing a c++ type but doesn't enforce it. Reviewed By: lattner Differential Revision: https://reviews.llvm.org/D91939	2020-11-22 16:06:14 -08:00
Aart Bik	af42550523	[mlir][sparse] refine optimization, add few more test cases Adds tests for full sum reduction (tensors summed up into scalars) and the well-known sampled-dense-dense-matrix-product. Refines the optimizations rules slightly to handle the summation better. Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D91818	2020-11-20 17:01:59 -08:00
Thomas Raoux	369c51a74b	[mlir][vector] Add transfer_op LoadToStore forwarding and deadStore optimizations Add transformation to be able to forward transfer_write into transfer_read operation and to be able to remove dead transfer_write when a transfer_write is overwritten before being read. Differential Revision: https://reviews.llvm.org/D91321	2020-11-20 11:59:01 -08:00
William S. Moses	f5c5fd1c50	[MLIR] Correct block merge bug Block merging in MLIR will incorrectly merge blocks with operations whose values are used outside of that block. This change forbids this behavior and provides a test where it is illegal to perform such a merge. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D91745	2020-11-20 19:12:59 +01:00
Alex Zinenko	18d0f7d5c3	[mlir] add canonicalization patterns for trivial SCF 'for' and 'if' Add canoncalization patterns to remove zero-iteration 'for' loops, replace single-iteration 'for' loops with their bodies; remove known-false conditionals with no 'else' branch and replace conditionals with known value by the respective region. Although similar transformations are performed at the CFG level, not all flows reach that level, e.g., the GPU flow may want to remove single-iteration loops before deciding on loop mapping to thread dimensions. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D91865	2020-11-20 19:04:39 +01:00
Stephan Herhut	a89e55ca57	[mlir][std] Canonicalize a dim(memref_reshape) into a load from the shape operand This canonicalization helps propagate shape information through the program. Differential Revision: https://reviews.llvm.org/D91854	2020-11-20 14:03:02 +01:00
Stephan Herhut	6af81ea1d6	[mlir][std] Fold load(tensor_to_memref) into extract_element This canonicalization is useful to resolve loads into scalar values when doing partial bufferization. Differential Revision: https://reviews.llvm.org/D91855	2020-11-20 13:42:11 +01:00
Stephan Herhut	cb778c3423	[mlir][std] Fold comparisons when the operands are equal For equal operands, comparisons can be decided statically. Differential Revision: https://reviews.llvm.org/D91856	2020-11-20 13:26:41 +01:00
Mikhail Goncharov	0caa82e2ac	Revert "[mlir][Linalg] Fuse sequence of Linalg operation (on buffers)" This reverts commit `f8284d21a8`. Revert "[mlir][Linalg] NFC: Expose some utility functions used for promotion." This reverts commit `0c59f51592`. Revert "Remove unused isZero function" This reverts commit `0f9f0a4046`. Change `f8284d21` led to multiple failures in IREE compilation.	2020-11-20 13:12:54 +01:00
Eugene Zhulenev	a86a9b5ef7	[mlir] Automatic reference counting for Async values + runtime support for ref counted objects Depends On D89963 Automatic reference counting algorithm outline: 1. `ReturnLike` operations forward the reference counted values without modifying the reference count. 2. Use liveness analysis to find blocks in the CFG where the lifetime of reference counted values ends, and insert `drop_ref` operations after the last use of the value. 3. Insert `add_ref` before the `async.execute` operation capturing the value, and pairing `drop_ref` before the async body region terminator, to release the captured reference counted value when execution completes. 4. If the reference counted value is passed only to some of the block successors, insert `drop_ref` operations in the beginning of the blocks that do not have reference coutned value uses. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D90716	2020-11-20 03:08:44 -08:00
MaheshRavishankar	f8284d21a8	[mlir][Linalg] Fuse sequence of Linalg operation (on buffers) Enhance the tile+fuse logic to allow fusing a sequence of operations. Differential Revision: https://reviews.llvm.org/D90991	2020-11-19 19:03:06 -08:00
Alex Zinenko	9bb5bff570	[mlir] Add an assertion on creating an Operation with null result types Null types are commonly used as an error marker. Catch them in the constructor of Operation if they are present in the result type list, as otherwise this could lead to further surprising behavior when querying op result types. Fix AsyncToLLVM and StandardToLLVM that were using null types when constructing operations. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D91770	2020-11-19 22:28:38 +01:00
Tres Popp	b0750e2df6	Fix rollback of first block erasure in a region. Differential Revision: https://reviews.llvm.org/D91788	2020-11-19 21:24:10 +01:00
River Riddle	65fcddff24	[mlir][BuiltinDialect] Resolve comments from D91571 * Move ops to a BuiltinOps.h * Add file comments	2020-11-19 11:12:49 -08:00
ergawy	2f3adc54b5	[MLIR][SPIRV] Rename `spv._module_end` to `spv.mlir.endmodule` This commit does the renaming mentioned in the title in order to bring 'spv' dialect closer to the MLIR naming conventions. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D91792	2020-11-19 13:25:13 -05:00
Lei Zhang	5b7bd89b35	Revert "Reorder linalg.conv indexing_maps loop order" This reverts commit `9b47525824` and falls back to the original parallel-iterators-as-leading- dimensions convention. We can control the loop order by first converting the named op into linalg.generic and then performing interchange. Reviewed By: nicolasvasilache, asaadaldien Differential Revision: https://reviews.llvm.org/D91796	2020-11-19 13:16:16 -05:00
ergawy	341f3c1120	[MLIR][SPIRV] ModuleCombiner: deduplicate global vars, spec consts, and funcs. This commit extends the functionality of the SPIR-V module combiner library by adding new deduplication capabilities. In particular, implementation of deduplication of global variables and specialization constants, and functions is introduced. For global variables, 2 variables are considered duplicate if they either have the same descriptor set + binding or the same built_in attribute. For specialization constants, 2 spec constants are considered duplicate if they have the same spec_id attribute. 2 functions are deduplicated if they are identical. 2 functions are identical if they have the same prototype, attributes, and body. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D90951	2020-11-19 10:06:04 -05:00
ergawy	9bd50abc4c	[MLIR][SPIRV] Rename `spv._merge` to `spv.mlir.merge` This commit does the renaming mentioned in the title in order to bring 'spv' dialect closer to the MLIR naming conventions. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D91797	2020-11-19 10:04:35 -05:00
Lei Zhang	9e39a5d9a6	[mlir][linalg] Start a named ops to generic ops pass This commit starts a new pass and patterns for converting Linalg named ops to generic ops. This enables us to leverage the flexbility from generic ops during transformations. Right now only linalg.conv is supported; others will be added when useful. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D91357	2020-11-19 09:21:06 -05:00
Ji Kim	58ce4a8b11	[mlir][TableGen] Support intrinsics with multiple returns and overloaded operands. For intrinsics with multiple returns where one or more operands are overloaded, the overloaded type is inferred from the corresponding field of the resulting struct, instead of accessing the result directly. As such, the hasResult parameter of LLVM_IntrOpBase (and derived classes) is replaced with numResults. TableGen for intrinsics also updated to populate this field with the total number of results. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D91680	2020-11-19 09:59:42 +01:00
River Riddle	c0958b7b4c	[mlir] Add support for referencing a SymbolRefAttr in a SideEffectInstance This allows for operations that exclusively affect symbol operations to better describe their side effects. Differential Revision: https://reviews.llvm.org/D91581	2020-11-18 18:38:43 -08:00
Aart Bik	9ad62f62b9	[mlir][sparse] remove a few rewriting failures Rationale: Make sure preconditions are tested already during verfication. Currently, the only way a sparse rewriting rule can fail is if (1) the linalg op does not have sparse annotations, or (2) a yet to be handled operation is encounted inside the op Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D91748	2020-11-18 17:29:40 -08:00
Diego Caballero	c1ba9c43ad	[mlir][Affine] Refactor affine fusion code in pass to utilities Refactoring/clean-up step needed to add support for producer-consumer fusion with multi-store producer loops and, in general, to implement more general loop fusion strategies in Affine. It introduces the following changes: - AffineLoopFusion pass now uses loop fusion utilities more broadly to compute fusion legality (canFuseLoops utility) and perform the fusion transformation (fuseLoops utility). - Loop fusion utilities have been extended to deal with AffineLoopFusion requirements and assumptions while preserving both loop fusion utilities and AffineLoopFusion current functionality within a unified implementation. 'FusionStrategy' has been introduced for this purpose and, in the future, it will allow us to have a single loop fusion core implementation that will produce different fusion outputs depending on the strategy used. - Improve separation of concerns for legality and profitability analysis: 'isFusionProfitable' no longer filters out illegal scenarios that 'canFuse' didn't detect, or the other way around. 'canFuse' now takes loop dependences into account to determine the fusion loop depth (producer-consumer fusion only). - As a result, maximal fusion now doesn't require any profitability analysis. - Slices are now computed only once and reused across the legality, profitability and fusion transformation steps (producer-consumer). - Refactor some utilities and remove redundant copies of them. This patch is NFCI and should preserve the existing functionality of both the AffineLoopFusion pass and the affine fusion utilities. Reviewed By: andydavis1, bondhugula Differential Revision: https://reviews.llvm.org/D90798	2020-11-18 13:50:32 -08:00
ergawy	adf9f64a02	[MLIR][SPIRV] Rename `spv._reference_of` to `spv.mlir.referenceof` This commit does the renaming mentioned in the title in order to bring 'spv' dialect closer to the MLIR naming conventions. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D91715	2020-11-18 13:27:29 -05:00
Christian Sigg	8b97e17d16	[mlir] Simplify code generated by ConvertToLLVMPattern::getStridedElementPtr(). Make the interface match the one of ConvertToLLVMPattern::getDataPtr() (to be removed in a separate change). Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D91599	2020-11-18 11:52:09 +01:00
Alex Zinenko	052d24af29	[mlir] Introduce support for parametric side-effects The side effect infrastructure is based on the Effect and Resource class templates, instances of instantiations of which are constructed as thread-local singletons. With this scheme, it is impossible to further parameterize either of those, or the EffectInstance class that contains pointers to an Effect and Resource instances. Such a parameterization is necessary to express more detailed side effects, e.g. those of a loop or a function call with affine operations inside where it is possible to precisely specify the slices of accessed buffers. Include an additional Attribute to EffectInstance class for further parameterization. This allows to leverage the dialect-specific registration and uniquing capabilities of the attribute infrastructure without requiring Effect or Resource instantiations to be attached to a dialect themselves. Split out the generic part of the side effect Tablegen classes into a separate file to avoid generating built-in MemoryEffect interfaces when processing any .td file that includes SideEffectInterfaceBase.td. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D91493	2020-11-18 10:52:17 +01:00
zhanghb97	77133b29b9	[mlir] Get array from the dense elements attribute with buffer protocol. - Add `mlirElementsAttrGetType` C API. - Add `def_buffer` binding to PyDenseElementsAttribute. - Implement the protocol to access the buffer. Differential Revision: https://reviews.llvm.org/D91021	2020-11-18 15:50:59 +08:00
Tei Jeong	94e4ec6499	Add CalibratedQuantizedType to quant dialect This type supports a calibrated type with min, max provided. This will be used for importing calibration values of intermediate tensors (e.g. LSTM) which can't be imported with QuantStats op. This type was initially suggested in the following RFC: https://llvm.discourse.group/t/rfc-a-proposal-for-implementing-quantization-transformations-in-mlir/655 Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D91584	2020-11-17 22:14:54 -08:00
Aart Bik	eced4a8e6f	[mlir] [sparse] start of sparse tensor compiler support As discussed in https://llvm.discourse.group/t/mlir-support-for-sparse-tensors/2020 this CL is the start of sparse tensor compiler support in MLIR. Starting with a "dense" kernel expressed in the Linalg dialect together with per-dimension sparsity annotations on the tensors, the compiler automatically lowers the kernel to sparse code using the methods described in Fredrik Kjolstad's thesis. Many details are still TBD. For example, the sparse "bufferization" is purely done locally since we don't have a global solution for propagating sparsity yet. Furthermore, code to input and output the sparse tensors is missing. Nevertheless, with some hand modifications, the generated MLIR can be easily converted into runnable code already. Reviewed By: nicolasvasilache, ftynse Differential Revision: https://reviews.llvm.org/D90994	2020-11-17 13:10:42 -08:00
Christian Sigg	bedaad4495	[mlir] Simplify std.alloc lowering to LLVM. std.alloc only supports memrefs with identity layout, which means we can simplify the lowering to LLVM and compute strides only from (static and dynamic) sizes. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D91549	2020-11-17 18:55:34 +01:00
Rahul Joshi	8a4fe75d70	[NFC] Add unit tests for printing/parsing of variadic operands and results. Differential Revision: https://reviews.llvm.org/D91557	2020-11-17 09:21:46 -08:00
ergawy	9793edd5bf	[MLIR][SPIRV] Rename `spv._address_of` to `spv.mlir.addressof` This commit does the renaming mentioned in the title in order to bring `spv` dialect closer to the MLIR naming conventions. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D91609	2020-11-17 12:12:27 -05:00
Alex Zinenko	f3dab16dc7	[mlir] Add a _get_default_loc_context utility to Python bindings This utility function is helpful for dialect-specific builders that need to access the context through location, and the location itself may be either provided as an argument or expected to be recovered from the implicit location stack. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D91623	2020-11-17 17:55:47 +01:00
Stephan Herhut	c4472f8b4c	[mlir][std] Canonicalize extract_element(tensor_cast). Canonicalize extract_element(tensor_cast(v)) to just extract_element(v). Differential Revision: https://reviews.llvm.org/D91621	2020-11-17 14:41:39 +01:00
Stephan Herhut	3598605c0b	[mlir][std] Fold dim(dynamic_tensor_from_elements, %cst) The shape of the result of a dynamic_tensor_from_elements is defined via its result type and operands. We already fold dim operations when they reference one of the statically sized dimensions. Now, also fold dim on the dynamically sized dimensions by picking the corresponding operand. Differential Revision: https://reviews.llvm.org/D91616	2020-11-17 14:39:59 +01:00
Alex Zinenko	88f25bda13	[mlir] Allow for using interface class name in ODS interface definitions It may be necessary for interface methods to process or return variables with the interface class type, in particular for attribute and type interfaces that can return modified attributes and types that implement the same interface. However, the code generated by ODS in this case would not compile because the signature (and the body if provided) appear in the definition of the Model class and before the interface class, which derives from the Model. Change the ODS interface method generator to emit only method declarations in the Model class itself, and emit method definitions after the interface class. Mark as "inline" since their definitions are still emitted in the header and are no longer implicitly inline. Add a forward declaration of the interface class before the Concept+Model classes to make the class name usable in declarations. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D91499	2020-11-17 14:28:55 +01:00
Alex Zinenko	ef8e859c0b	[mlir] Fix Python tests after "module_terminator" migrated to ODS The "module_terminator" op now has a custom syntax and therefore is printed without quotes. Adapt Python tests to check for this syntax.	2020-11-17 14:16:31 +01:00
Alex Zinenko	c5a6712f8c	[mlir] Add basic support for attributes in ODS-generated Python bindings In ODS, attributes of an operation can be provided as a part of the "arguments" field, together with operands. Such attributes are accepted by the op builder and have accessors generated. Implement similar functionality for ODS-generated op-specific Python bindings: the `__init__` method now accepts arguments together with operands, in the same order as in the ODS `arguments` field; the instance properties are introduced to OpView classes to access the attributes. This initial implementation accepts and returns instances of the corresponding attribute class, and not the underlying values since the mapping scheme of the value types between C++, C and Python is not yet clear. Default-valued attributes are not supported as that would require Python to be able to parse C++ literals. Since attributes in ODS are tightely related to the actual C++ type system, provide a separate Tablegen file with the mapping between ODS storage type for attributes (typically, the underlying C++ attribute class), and the corresponding class name. So far, this might look unnecessary since all names match exactly, but this is not necessarily the cases for non-standard, out-of-tree attributes, which may also be placed in non-default namespaces or Python modules. This also allows out-of-tree users to generate Python bindings without having to modify the bindings generator itself. Storage type was preferred over the Tablegen "def" of the attribute class because ODS essentially encodes attribute _constraints_ rather than classes, e.g. there may be many Tablegen "def"s in the ODS that correspond to the same attribute type with additional constraints The presence of the explicit mapping requires the change in the .td file structure: instead of just calling the bindings generator directly on the main ODS file of the dialect, it becomes necessary to create a new file that includes the main ODS file of the dialect and provides the mapping for attribute types. Arguably, this approach offers better separability of the Python bindings in the build system as the main dialect no longer needs to know that it is being processed by the bindings generator. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D91542	2020-11-17 11:47:37 +01:00
River Riddle	73ca690df8	[mlir][NFC] Remove references to Module.h and Function.h These includes have been deprecated in favor of BuiltinDialect.h, which contains the definitions of ModuleOp and FuncOp. Differential Revision: https://reviews.llvm.org/D91572	2020-11-17 00:55:47 -08:00
Mehdi Amini	74207e78cf	Fix python bindings tests after change in visibility requirement for symbol declarations	2020-11-17 04:09:35 +00:00
Rahul Joshi	b7382ed3fe	[MLIR] Extend Symbol verification to reject public symbol declarations. - Extend the Symbol interface with `isDeclaration` to identify operations that declare a symbol as opposed to define it. - Extend verification to disallow public declarations as per the discussion in https://llvm.discourse.group/t/rfc-symbol-definition-declaration-x-visibility-checks/2140 - Adopt the new interface for `FuncOp` and fix test and code to not have/create public function declarations. Differential Revision: https://reviews.llvm.org/D91456	2020-11-16 16:05:32 -08:00
Sean Silva	7c62c6313b	[mlir] Add DecomposeCallGraphTypes pass. This replaces the old type decomposition logic that was previously mixed into bufferization, and makes it easily accessible. This also deletes TestFinalizingBufferize, because after we remove the type decomposition, it doesn't do anything that is not already provided by func-bufferize. Differential Revision: https://reviews.llvm.org/D90899	2020-11-16 12:25:35 -08:00
Christian Sigg	04481f26fa	[mlir] Require std.alloc() ops to have canonical layout during LLVM lowering. The current code allows strided layouts, but the number of elements allocated is ambiguous. It could be either the number of elements in the shape (the current implementation), or the amount of elements required to not index out-of-bounds with the given maps (which would require evaluating the layout map). If we require the canonical layouts, the two will be the same. Reviewed By: nicolasvasilache, ftynse Differential Revision: https://reviews.llvm.org/D91523	2020-11-16 17:29:36 +01:00
David Truby	843525075b	[MLIR][OpenMP] Add omp.wsloop operation This adds a simple definition of a "workshare loop" operation for the OpenMP MLIR dialect, excluding the "reduction" and "allocate" clauses and without a custom parser and pretty printer. The schedule clause also does not yet accept the modifiers that are permitted in OpenMP 5.0. Co-authored-by: Kiran Chandramohan <kiran.chandramohan@arm.com> Reviewed By: ftynse, clementval Differential Revision: https://reviews.llvm.org/D86071	2020-11-16 15:24:57 +00:00
Hanhan Wang	47fd19f22e	[mlir][StandardToSPIRV] Extend support for lowering cmpi to SPIRV. The logic of vector on boolean was missed. This patch adds the logic and test on it. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D91403	2020-11-16 06:51:05 -08:00
Nicolas Vasilache	7625742237	[mlir][Linalg] Add support for tileAndDistribute on tensors. scf.parallel is currently not a good fit for tiling on tensors. Instead provide a path to parallelism directly through scf.for. For now, this transformation ignores the distribution scheme and always does a block-cyclic mapping (where block is the tile size). Differential revision: https://reviews.llvm.org/D90475	2020-11-16 11:12:50 +00:00
Thomas Raoux	6ad31c0f4a	[mlir][vector] Support N-D vector in InsertMap/ExtractMap op Support multi-dimension vector for InsertMap/ExtractMap op and update the transformations. Currently the relation between IDs and dimension is implicitly deduced from the types. We can then calculate an AffineMap based on it. In the future the AffineMap could be part of the operation itself. Differential Revision: https://reviews.llvm.org/D90995	2020-11-13 12:40:17 -08:00
MaheshRavishankar	bf3861bf71	[mlir][Linalg] Change LinalgDependenceGraph to use LinalgOp. Using LinalgOp will reduce the repeated conversion from Operation <-> LinalgOp. Differential Revision: https://reviews.llvm.org/D91101	2020-11-13 12:34:38 -08:00
Scott Todd	c9e9cc3fe7	[MLIR] Allow setting "CodeView" flag in LLVMIR translation on MSVC. Reviewed By: ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D91365	2020-11-13 17:31:18 +01:00
Eugene Zhulenev	c30ab6c2a3	[mlir] Transform scf.parallel to scf.for + async.execute Depends On D89958 1. Adds `async.group`/`async.awaitall` to group together multiple async tokens/values 2. Rewrite scf.parallel operation into multiple concurrent async.execute operations over non overlapping subranges of the original loop. Example: ``` scf.for (%i, %j) = (%lbi, %lbj) to (%ubi, %ubj) step (%si, %sj) { "do_some_compute"(%i, %j): () -> () } ``` Converted to: ``` %c0 = constant 0 : index %c1 = constant 1 : index // Compute blocks sizes for each induction variable. %num_blocks_i = ... : index %num_blocks_j = ... : index %block_size_i = ... : index %block_size_j = ... : index // Create an async group to track async execute ops. %group = async.create_group scf.for %bi = %c0 to %num_blocks_i step %c1 { %block_start_i = ... : index %block_end_i = ... : index scf.for %bj = %c0 t0 %num_blocks_j step %c1 { %block_start_j = ... : index %block_end_j = ... : index // Execute the body of original parallel operation for the current // block. %token = async.execute { scf.for %i = %block_start_i to %block_end_i step %si { scf.for %j = %block_start_j to %block_end_j step %sj { "do_some_compute"(%i, %j): () -> () } } } // Add produced async token to the group. async.add_to_group %token, %group } } // Await completion of all async.execute operations. async.await_all %group ``` In this example outer loop launches inner block level loops as separate async execute operations which will be executed concurrently. At the end it waits for the completiom of all async execute operations. Reviewed By: ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D89963	2020-11-13 04:02:56 -08:00
Stephan Herhut	4a771108ac	[mlir][bufferize] Fix buffer promotion to stack for index types The index type does not have a bitsize and hence the size of corresponding allocations cannot be computed. Instead, the promotion pass now has an explicit option to specify the size of index. Differential Revision: https://reviews.llvm.org/D91360	2020-11-13 09:23:36 +01:00
Stephan Herhut	5da2423bc0	[mlir][gpu] Only transform mapped parallel loops to GPU. This exposes a hook to configure legality of operations such that only `scf.parallel` operations that have mapping attributes are marked as illegal. Consequently, the transformation can now also be applied to mixed forms. Differential Revision: https://reviews.llvm.org/D91340	2020-11-13 09:15:17 +01:00
River Riddle	48e8129edf	[mlir][Asm] Add support for resolving operation locations after parsing has finished This revision adds support in the parser/printer for "deferrable" aliases, i.e. those that can be resolved after printing has finished. This allows for printing aliases for operation locations after the module instead of before, i.e. this is now supported: ``` "foo.op"() : () -> () loc(#loc) #loc = loc("some_location") ``` Differential Revision: https://reviews.llvm.org/D91227	2020-11-12 23:34:36 -08:00
Mehdi Amini	a9386bb0f9	Fix MLIR lit test configuration after cmake Python detection change `07f1047f41` changed the CMake detection to use find_package(Python3 ... but didn't update the lit configuration to use the expected Python3_EXECUTABLE cmake variable to point to the interpreter path. This resulted in an empty path on MacOS.	2020-11-13 04:44:45 +00:00
Sean Silva	faa66b1b2c	[mlir] Bufferize tensor constant ops We lower them to a std.global_memref (uniqued by constant value) + a std.get_global_memref to produce the corresponding memref value. This allows removing Linalg's somewhat hacky lowering of tensor constants, now that std properly supports this. Differential Revision: https://reviews.llvm.org/D91306	2020-11-12 14:56:10 -08:00
Sean Silva	ad2f9f6745	[mlir] Fix subtensor_insert bufferization. It was incorrect in the presence of a tensor argument with multiple uses. The bufferization of subtensor_insert was writing into a converted memref operand, but there is no guarantee that the converted memref for that operand is safe to write into. In this case, the same converted memref is written to in-place by the subtensor_insert bufferization, violating the tensor-level semantics. I left some comments in a TODO about ways forward on this. I will be working actively on this problem in the coming days. Differential Revision: https://reviews.llvm.org/D91371	2020-11-12 14:56:09 -08:00
Jean-Michel Gorius	e47805c995	[mlir] Add plus, star and optional less/greater parsing The tokens are already handled by the lexer. This revision exposes them through the parser interface. This revision also adds missing functions for question mark parsing and completes the list of valid punctuation tokens in the documentation. Differential Revision: https://reviews.llvm.org/D90907	2020-11-12 13:28:31 +01:00
Alex Zinenko	f9265de8c6	[mlir] Generate Op builders for Python bindings Add an ODS-backed generator of default builders. This currently does not support operation with attribute arguments, for which the builder is just ignored. Attribute support will be introduced separately for builders and accessors. Default builders are always generated with the same number of result and operand groups as the ODS specification, i.e. one group per each operand or result. Optional elements accept None but cannot be omitted. Variadic groups accept iterable objects and cannot be replaced with a single object. For some operations, it is possible to infer the result type given the traits, but most traits rely on inline pieces of C++ that we cannot (yet) forward to Python bindings. Since the Ops where the inference is possible (having the `SameOperandAndResultTypes` trait or `TypeMatchesWith` without transform field) are a small minority, they also require the result type to make the builder syntax more consistent. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D91190	2020-11-12 11:29:23 +01:00
MaheshRavishankar	5ca20851e4	[mlir][Linalg] Improve the logic to perform tile and fuse with better dependence tracking. This change does two main things 1) An operation might have multiple dependences to the same producer. Not tracking them correctly can result in incorrect code generation with fusion. To rectify this the dependence tracking needs to also have the operand number in the consumer. 2) Improve the logic used to find the fused loops making it easier to follow. The only constraint for fusion is that linalg ops (on buffers) have update semantics for the result. Fusion should be such that only one iteration of the fused loop (which is also a tiled loop) must touch only one (disjoint) tile of the output. This could be relaxed by allowing for recomputation that is the default when oeprands are tensors, or can be made legal with promotion of the fused view (in future). Differential Revision: https://reviews.llvm.org/D90579	2020-11-12 00:25:24 -08:00
Aart Bik	e1dbc25ee2	[mlir][sparse] integrate sparse annotation into generic linalg op This CL integrates the new sparse annotations (hereto merely added as fully transparent attributes) more tightly to the generic linalg op in order to add verification of the annotations' consistency as well as to make make other passes more aware of their presence (in the long run, rewriting rules must preserve the integrity of the annotations). Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D91224	2020-11-11 17:26:30 -08:00
Mehdi Amini	a62d38a90d	Disable implicit nesting on parsing textual pass pipeline Previous the textual form of the pass pipeline would implicitly nest, instead we opt for the explicit form here: this has less surprise. This also avoids asserting in the bindings when passing a pass pipeline with incorrect nesting. Differential Revision: https://reviews.llvm.org/D91233	2020-11-11 19:21:51 +00:00
Thomas Raoux	023f2400f2	[mlir] Fix post-dominance between blocks of different regions. If block A and B are in different regions and region of A is not an ancestor of B, either A is included in region of B or the two regions are disjoint. In both case A doesn't post-dominate B. Differential Revision: https://reviews.llvm.org/D91225	2020-11-11 11:20:53 -08:00
Stella Laurenzo	5fef6ce0cc	[mlir][Python] Allow PassManager to interop with the capsule APIs. * Used in npcomp to cast Python objects via the C-API. Differential Revision: https://reviews.llvm.org/D91232	2020-11-11 10:37:21 -08:00
Eugene Zhulenev	bb0d5f767d	[mlir] Add NumberOfExecutions analysis + update RegionBranchOpInterface interface to query number of region invocations Implements RFC discussed in: https://llvm.discourse.group/t/rfc-operationinstancesinterface-or-any-better-name/2158/10 Reviewed By: silvas, ftynse, rriddle Differential Revision: https://reviews.llvm.org/D90922	2020-11-11 01:43:17 -08:00
Tres Popp	cc5b4a8603	[mlir] Rework DialectConversion inlineRegionBefore The previous logic for inlining a region A with N blocks into region B would produce incorrect results on rollback for N greater than 1. This rollback logic would leave blocks 1..N in region B and only move block 0 to region A. The new inlining action recording stores the block move actions from N-1 to 0. Now on roll back, block 0 is moved to region A and then 1..N is appended to the list of blocks in region A. Differential Revision: https://reviews.llvm.org/D91185	2020-11-11 10:42:33 +01:00
Christian Sigg	5bdb21df21	[mlir] Use assemblyFormat in AllocLikeOp. Split operands into dynamicSizes and symbolOperands. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D90589	2020-11-11 10:27:20 +01:00
Christian Sigg	5dfe6545d4	[mlir] Allow omitting spaces in assemblyFormat with a `` literal. I would like to use this for D90589 to switch std.alloc to assemblyFormat. Hopefully it will be useful in other places as well. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D91068	2020-11-11 09:34:43 +01:00
Sean Silva	53a0d45db6	[mlir] Add pass to convert elementwise ops to linalg. This patch converts elementwise ops on tensors to linalg.generic ops with the same elementwise op in the payload (except rewritten to operate on scalars, obviously). This is a great form for later fusion to clean up. E.g. ``` // Compute: %arg0 + %arg1 - %arg2 func @f(%arg0: tensor<?xf32>, %arg1: tensor<?xf32>, %arg2: tensor<?xf32>) -> tensor<?xf32> { %0 = addf %arg0, %arg1 : tensor<?xf32> %1 = subf %0, %arg2 : tensor<?xf32> return %1 : tensor<?xf32> } ``` Running this through `mlir-opt -convert-std-to-linalg -linalg-fusion-for-tensor-ops` we get: ``` func @f(%arg0: tensor<?xf32>, %arg1: tensor<?xf32>, %arg2: tensor<?xf32>) -> tensor<?xf32> { %0 = linalg.generic {indexing_maps = [#map0, #map0, #map0, #map0], iterator_types = ["parallel"]} ins(%arg0, %arg1, %arg2 : tensor<?xf32>, tensor<?xf32>, tensor<?xf32>) { ^bb0(%arg3: f32, %arg4: f32, %arg5: f32): // no predecessors %1 = addf %arg3, %arg4 : f32 %2 = subf %1, %arg5 : f32 linalg.yield %2 : f32 } -> tensor<?xf32> return %0 : tensor<?xf32> } ``` So the elementwise ops on tensors have nicely collapsed into a single linalg.generic, which is the form we want for further transformations. Differential Revision: https://reviews.llvm.org/D90354	2020-11-10 13:44:44 -08:00
Sean Silva	b4fa28b408	[mlir] Add ElementwiseMappable trait and apply it to std elementwise ops. This patch adds an `ElementwiseMappable` trait as discussed in the RFC here: https://llvm.discourse.group/t/rfc-std-elementwise-ops-on-tensors/2113/23 This trait can power a number of transformations and analyses. A subsequent patch adds a convert-elementwise-to-linalg pass exhibits how this trait allows writing generic transformations. See https://reviews.llvm.org/D90354 for that patch. This trait slightly changes some verifier messages, but the diagnostics are usually about as good. I fiddled with the ordering of the trait in the .td file trait lists to minimize the changes here. Differential Revision: https://reviews.llvm.org/D90731	2020-11-10 13:44:44 -08:00
Mehdi Amini	6cb1c0cae0	Add Python binding to run a PassManager on a MLIR Module Reviewed By: ftynse, stellaraccident Differential Revision: https://reviews.llvm.org/D90823	2020-11-10 20:06:23 +00:00
Mehdi Amini	dc43f78565	Add basic Python bindings for the PassManager and bind libTransforms This only exposes the ability to round-trip a textual pipeline at the moment. To exercise it, we also bind the libTransforms in a new Python extension. This does not include any interesting bindings, but it includes all the mechanism to add separate native extensions and load them dynamically. As such passes in libTransforms are only registered after `import mlir.transforms`. To support this global registration, the TableGen backend is also extended to bind to the C API the group registration for passes. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D90819	2020-11-10 19:55:21 +00:00
George Mitenkov	de3ad5bb09	[MLIR][SPIRVToLLVM] Enhanced conversion for execution mode This patch introduces a new conversion pattern for `spv.ExecutionMode`. `spv.ExecutionMode` may contain important information about the entry point, which we want to preserve. For example, `LocalSize` provides information about the work-group size that can be reused. Hence, the pattern for entry-point ops changes to the following: - `spv.EntryPoint` is still simply removed - Info from `spv.ExecutionMode` is used to create a global struct variable, which looks like: ``` struct { int32_t executionMode; int32_t values[]; // optional values }; ``` Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D89989	2020-11-10 18:33:54 +03:00
Alex Zinenko	fd407e1f1e	[mlir] ODS-backed python binding generator for custom op classes Introduce an ODS/Tablegen backend producing Op wrappers for Python bindings based on the ODS operation definition. Usage: mlir-tblgen -gen-python-op-bindings -Iinclude <path/to/Ops.td> \ -bind-dialect=<dialect-name> Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D90960	2020-11-10 10:58:29 +01:00
Alex Zinenko	6c7e6b2c9a	[mlir] Support slicing for operands in results in Python bindings Slicing, that is element access with `[being🔚step]` structure, is a common Python idiom for sequence-like containers. It is also necessary to support custom accessor for operations with variadic operands and results (an operation an return a slice of its operands that correspond to the given variadic group). Add generic utility to support slicing in Python bindings and use it for operation operands and results. Depends On D90923 Reviewed By: stellaraccident, mehdi_amini Differential Revision: https://reviews.llvm.org/D90936	2020-11-10 10:46:21 +01:00
Artur Bialas	3035e676a3	[mlir][spirv] Add VectorInsertDynamicOp and vector.insertelement lowering VectorInsertDynamicOp in SPIRV dialect conversion from vector.insertelement to spirv VectorInsertDynamicOp Differential Revision: https://reviews.llvm.org/D90927	2020-11-10 09:49:12 +01:00
River Riddle	892605b449	[mlir][Asm] Add support for using an alias for trailing operation locations Locations often get very long and clutter up operations when printed inline with them. This revision adds support for using aliases with trailing operation locations, and makes printing with aliases the default behavior. Aliases in the trailing location take the form `loc(<alias>)`, such as `loc(#loc0)`. As with all aliases, using `mlir-print-local-scope` can be used to disable them and get the inline behavior. Differential Revision: https://reviews.llvm.org/D90652	2020-11-09 21:54:47 -08:00
River Riddle	ebcc022507	[mlir][AsmPrinter] Refactor printing to only print aliases for attributes/types that will exist in the output. This revision refactors the way that attributes/types are considered when generating aliases. Instead of considering all of the attributes/types of every operation, we perform a "fake" print step that prints the operations using a dummy printer to collect the attributes and types that would actually be printed during the real process. This removes a lot of attributes/types from consideration that generally won't end up in the final output, e.g. affine map attributes in an `affine.apply`/`affine.for`. This resolves a long standing TODO w.r.t aliases, and helps to have a much cleaner textual output format. As a datapoint to the latter, as part of this change several tests were identified as testing for the presence of attributes aliases that weren't actually referenced by the custom form of any operation. To ensure that this wouldn't cause a large degradation in compile time due to the second full print, I benchmarked this change on a very large module with a lot of operations(The file is ~673M/~4.7 million lines long). This file before this change take ~6.9 seconds to print in the custom form, and ~7 seconds after this change. In the custom assembly case, this added an average of a little over ~100 miliseconds to the compile time. This increase was due to the way that argument attributes on functions are structured and how they get printed; i.e. with a better representation the negative impact here can be greatly decreased. When printing in the generic form, this revision had no observable impact on the compile time. This benchmarking leads me to believe that the impact of this change on compile time w.r.t printing is closely related to `print` methods that perform a lot of additional/complex processing outside of the OpAsmPrinter. Differential Revision: https://reviews.llvm.org/D90512	2020-11-09 21:54:47 -08:00
Alexander Belyaev	9d02e0e38d	[mlir][std] Add ExpandOps pass. The pass combines patterns of ExpandAtomic, ExpandMemRefReshape, StdExpandDivs passes. The pass is meant to legalize STD for conversion to LLVM. Differential Revision: https://reviews.llvm.org/D91082	2020-11-09 21:58:28 +01:00
Rahul Joshi	8b5a3e4632	[MLIR] Change FuncOp assembly syntax to print visibility inline instead of in attrib dict. - Change syntax for FuncOp to be `func <visibility>? @name` instead of printing the visibility in the attribute dictionary. - Since printFunctionLikeOp() and parseFunctionLikeOp() are also used by other operations, make the "inline visibility" an opt-in feature. - Updated unit test to use and check the new syntax. Differential Revision: https://reviews.llvm.org/D90859	2020-11-09 11:08:08 -08:00
Rahul Joshi	a97e357e8e	[MLIR] Support `global_memref` and `get_global_memref` in standard -> LLVM conversion. - Convert `global_memref` to LLVM::GlobalOp. - Convert `get_global_memref` to a memref descriptor with a pointer to the first element of the global stashed in it. - Extend unit test and a mlir-cpu-runner test to validate the generated LLVM IR. Differential Revision: https://reviews.llvm.org/D90803	2020-11-09 10:54:21 -08:00
Rahul Joshi	c96168975b	[MLIR] Flag no-terminator error on the last operation of non-empty blocks - When a block is not empty and does not end with a terminator, flag the error on the last operation of the block instead of the start of the block. Differential Revision: https://reviews.llvm.org/D90988	2020-11-09 09:42:11 -08:00
Nicolas Vasilache	6fc3a44394	[mlir][Linalg] Add support for bufferization of SubTensorOp and SubTensorInsertOp This revision adds support for bufferization by using a mix of `tensor_load`, `subview`, `linalg.copy` and `tensor_to_memref`.	2020-11-09 16:55:36 +00:00
Alex Zinenko	4669ea3bd8	[mlir] Add initial Python bindings for DenseInt/FPElementsAttr Enumerating elements in these classes is necessary to enable custom operand accessors for variadic operands. Depends On D90919 Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D90923	2020-11-09 15:23:54 +01:00
Alex Zinenko	c3a6e7c9b7	[mlir] Expose operation attributes to Python bindings Operations in a MLIR have a dictionary of attributes attached. Expose those to Python bindings through a pseudo-container that can be indexed either by attribute name, producing a PyAttribute, or by a contiguous index for enumeration purposes, producing a PyNamedAttribute. Depends On D90917 Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D90919	2020-11-09 14:59:56 +01:00
Stella Laurenzo	08c1a0dda4	[mlir][CAPI] Proposal: Always building a libMLIRPublicAPI.so (re-apply). Re-applies the reverted https://reviews.llvm.org/D90824 now that the link issue on BFD has been resolved. This reverts commit `bb9b5d3971`. Differential Revision: https://reviews.llvm.org/D91044	2020-11-08 16:57:51 -08:00
Stella Laurenzo	86b011777e	Remove TOSA test passes from non test registration. * Wires them in the same way that peer-dialect test passes are registered. * Fixes the build for -DLLVM_INCLUDE_TESTS=OFF. Differential Revision: https://reviews.llvm.org/D91022	2020-11-07 18:34:11 -08:00
Suraj Sudhir	b28121133d	TOSA MLIR Dialect This is the TOSA MLIR Dialect described in the following MLIR RFC: https://llvm.discourse.group/t/rfc-tosa-dialect-in-mlir/1971/24 Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D90411	2020-11-07 08:38:09 -08:00
George Mitenkov	89eed79c1f	[MLIR][SPIRVToLLVM] Added module name conversion Since SPIR-V module has an optional name, this patch makes a change to pass it to `ModuleOp` during conversion. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D90904	2020-11-07 12:27:44 +03:00
Mehdi Amini	e6f3ec6ebb	Don't link any LLVM/MLIR library to the C API unit-test The tests are intended to exercise the public C API and will link to a specific shared library exposing only the C API, this library itself may link to libMLIR.so. If we link some LLVM library statically in the test themselves, we end up with duplicated cl::opt registrations in LLVM. A possible setup if these libraries were needed could be to link libMLIR.so directly when available and link statically when it isn't available (in which case the libary exposing the C API would be statically link and isolated from the cl::opt registry, hopefully). Differential Revision: https://reviews.llvm.org/D90993	2020-11-07 01:54:31 +00:00
Sean Silva	e6e9e7eedf	[mlir][Linalg] Canonicalize duplicate args. I ran into this pattern when converting elementwise ops like `addf %arg0, %arg : tensor<?xf32>` to linalg. Redundant arguments can also easily arise from linalg-fusion-for-tensor-ops. Also, fix some small bugs in the logic in LinalgStructuredOpsInterface.td. Differential Revision: https://reviews.llvm.org/D90812	2020-11-06 14:40:51 -08:00
Alex Zinenko	b9c353fabb	[mlir] Use PyValue instead of PyOpResult in Python operand container The PyOpOperands container was erroneously constructing objects for individual operands as PyOpResult. Operands in fact are just values, which may or may not be results of another operation. The code would eventually crash if the operand was a block argument. Add a test that exercises the behavior that previously led to crashes. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D90917	2020-11-06 19:02:35 +01:00
Alex Zinenko	bb9b5d3971	Revert "[mlir][CAPI] Proposal: Always building a libMLIRPublicAPI.so." This reverts commit `80fe2f61fa`. Broke linkage with GNU ld. See original review thread for more details.	2020-11-06 18:59:58 +01:00
Stella Laurenzo	80fe2f61fa	[mlir][CAPI] Proposal: Always building a libMLIRPublicAPI.so. We were discussing on discord regarding the need for extension-based systems like Python to dynamically link against MLIR (or else you can only have one extension that depends on it). Currently, when I set that up, I piggy-backed off of the flag that enables build libLLVM.so and libMLIR.so and depended on libMLIR.so from the python extension if shared library building was enabled. However, this is less than ideal. In the current setup, libMLIR.so exports both all symbols from the C++ API and the C-API. The former is a kitchen sink and the latter is curated. We should be splitting them and for things that are properly factored to depend on the C-API, they should have the option to only depend on the C-API, and we should build that shared library no matter what. Its presence isn't just an optimization: it is a key part of the system. To do this right, I needed to: * Introduce visibility macros into mlir-c/Support.h. These should work on both nix and windows as-is. Create a new libMLIRPublicAPI.so with just the mlir-c object files. * Compile the C-API with -fvisibility=hidden. * Conditionally depend on the libMLIR.so from libMLIRPublicAPI.so if building libMLIR.so (otherwise, also links against the static libs and will produce a mondo libMLIRPublicAPI.so). * Disable re-exporting of static library symbols that come in as transitive deps. This gives us a dynamic linked C-API layer that is minimal and should work as-is on all platforms. Since we don't support libMLIR.so building on Windows yet (and it is not very DLL friendly), this will fall back to a mondo build of libMLIRPublicAPI.so, which has its uses (it is also the most size conscious way to go if you happen to know exactly what you need). Sizes (release/stripped, Ubuntu 20.04): Shared library build: libMLIRPublicAPI.so: 121Kb _mlir.cpython-38-x86_64-linux-gnu.so: 1.4Mb mlir-capi-ir-test: 135Kb libMLIR.so: 21Mb Static build: libMLIRPublicAPI.so: 5.5Mb (since this is a "static" build, this includes the MLIR implementation as non-exported code). _mlir.cpython-38-x86_64-linux-gnu.so: 1.4Mb mlir-capi-ir-test: 44Kb Things like npcomp and circt which bring their own dialects/transforms/etc would still need the shared library build and code that links against libMLIR.so (since it is all C++ interop stuff), but hopefully things that only depend on the public C-API can just have the one narrow dep. I spot checked everything with nm, and it looks good in terms of what is exporting/importing from each layer. I'm not in a hurry to land this, but if it is controversial, I'll probably split off the Support.h and API visibility macro changes, since we should set that pattern regardless. Reviewed By: mehdi_amini, benvanik Differential Revision: https://reviews.llvm.org/D90824	2020-11-06 09:00:56 -08:00
Alex Zinenko	0c782c214b	[mlir] Add folding of memref_cast inside another memref_cast There exists a generic folding facility that folds the operand of a memref_cast into users of memref_cast that support this. However, it was not used for the memref_cast itself. Fix it to enable elimination of memref_cast chains such as %1 = memref_cast %0 : A to B %2 = memref_cast %1 : B to A that is achieved by combining the folding with the existing "A to A" cast elimination. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D90910	2020-11-06 10:42:40 +01:00
Sean Silva	f7bc568266	[mlir] Remove AppendToArgumentsList functionality from BufferizeTypeConverter. This functionality is superceded by BufferResultsToOutParams pass (see https://reviews.llvm.org/D90071) for users the require buffers to be out-params. That pass should be run immediately after all tensors are gone from the program (before buffer optimizations and deallocation insertion), such as immediately after a "finalizing" bufferize pass. The -test-finalizing-bufferize pass now defaults to what used to be the `allowMemrefFunctionResults=true` flag. and the finalizing-bufferize-allowed-memref-results.mlir file is moved to test/Transforms/finalizing-bufferize.mlir. Differential Revision: https://reviews.llvm.org/D90778	2020-11-05 11:20:09 -08:00
Alexander Belyaev	72c65b698e	[mlir] Move TestDialect and its passes to mlir::test namespace. TestDialect has many operations and they all live in ::mlir namespace. Sometimes it is not clear whether the ops used in the code for the test passes belong to Standard or to Test dialects. Also, with this change it is easier to understand what test passes registered in mlir-opt are actually passes in mlir/test. Differential Revision: https://reviews.llvm.org/D90794	2020-11-05 15:29:15 +01:00
Alex Zinenko	b715fa330d	[mlir] Restructure C API tests for IR The test file is a long list of functions, followed by equally long FileCheck comments inside "main". Distribute FileCheck comments closer to the functions that produce the output we are checking. Reviewed By: mehdi_amini, stellaraccident Differential Revision: https://reviews.llvm.org/D90743	2020-11-05 10:12:46 +01:00
Nicolas Vasilache	ecca7852d9	[mlir][Linalg] Side effects interface for Linalg ops The LinalgDependenceGraph and alias analysis provide the necessary analysis for the Linalg fusion on buffers case. However this is not enough for linalg on tensors which require proper memory effects to play nicely with DCE and other transformations. This revision adds side effects to Linalg ops that were previously missing and has 2 consequences: 1. one example in the copy removal pass now fails since the linalg.generic op has side effects and the pass does not perform alias analysis / distinguish between reads and writes. 2. a few examples in fusion-tensor.mlir need to return the resulting tensor otherwise DCE automatically kicks in as part of greedy pattern application. Differential Revision: https://reviews.llvm.org/D90762	2020-11-05 09:00:28 +00:00
Artur Bialas	f9dca1039a	[mlir][spirv] Add VectorExtractDynamicOp and vector.extractelement lowering VectorExtractDynamicOp in SPIRV dialect conversion from vector.extractelement to spirv VectorExtractDynamicOp Differential Revision: https://reviews.llvm.org/D90679	2020-11-05 08:26:54 +01:00
Artur Bialas	1938b61bda	[mlir][spirv] Allow usage of vector size 8 and 16 with Vector16 capability Per spec, vector sizes 8 and 16 are allowed when Vector16 capability is present. This change expands the limitation of vector sizes to accept these sizes. Differential Revision: https://reviews.llvm.org/D90683	2020-11-05 08:26:15 +01:00
Alexandre Eichenberger	0795715616	[mlir][std] Add SignedCeilDivIOp and SignedFloorDivIOp with std to std lowering triggered by -std-expand-divs option. The new operations support positive/negative nominator/denominator numbers. Differential Revision: https://reviews.llvm.org/D89726 Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>	2020-11-04 14:16:23 -05:00
Sean Silva	eb8d386d51	[mlir] Make linalg-bufferize a composable bufferization pass Previously, linalg-bufferize was a "finalizing" bufferization pass (it did a "full" conversion). This wasn't great because it couldn't be used composably with other bufferization passes like std-bufferize and scf-bufferize. This patch makes linalg-bufferize a composable bufferization pass. Notice that the integration tests are switched over to using a pipeline of std-bufferize, linalg-bufferize, and (to finalize the conversion) func-bufferize. It all "just works" together. While doing this transition, I ran into a nasty bug in the 1-use special case logic for forwarding init tensors. That logic, while well-intentioned, was fundamentally flawed, because it assumed that if the original tensor value had one use, then the converted memref could be mutated in place. That assumption is wrong in many cases. For example: ``` %0 = some_tensor : tensor<4xf32> br ^bb0(%0, %0: tensor<4xf32>, tensor<4xf32>) ^bb0(%bbarg0: tensor<4xf32>, %bbarg1: tensor<4xf32>) // %bbarg0 is an alias of %bbarg1. We cannot safely write // to it without analyzing uses of %bbarg1. linalg.generic ... init(%bbarg0) {...} ``` A similar example can happen in many scenarios with function arguments. Even more sinister, if the converted memref is produced by a `std.get_global_memref` of a constant global memref, then we might attempt to write into read-only statically allocated storage! Not all memrefs are writable! Clearly, this 1-use check is not a local transformation that we can do on the fly in this pattern, so I removed it. The test is now drastically shorter and I basically rewrote the CHECK lines from scratch because: - the new composable linalg-bufferize just doesn't do as much, so there is less to test - a lot of the tests were related to the 1-use check, which is now gone, so there is less to test - the `-buffer-hoisting -buffer-deallocation` is no longer mixed in, so the checks related to that had to be rewritten Differential Revision: https://reviews.llvm.org/D90657	2020-11-04 10:16:55 -08:00
Sean Silva	f556af965f	[mlir] Fix materializations for unranked tensors. Differential Revision: https://reviews.llvm.org/D90656	2020-11-04 10:16:55 -08:00
Mehdi Amini	c7994bd939	Switch from C-style comments `/* ... /` to C++ style `//` (NFC) This is mostly a scripted update, it may not be perfect. function replace() { FROM=$1 TO=$2 git grep "$FROM" $REPO_PATH \|cut -f 1 -d : \| sort -u \| \ while read file; do sed -i "s#$FROM#$TO#" $file ; done } replace '\|\===----------------------------------------------------------------------===\\|$' '//===----------------------------------------------------------------------===//' replace '^/\ =' '//==' replace '^/\=' '//=' replace '^\\\=' '//=' replace '^\|\' '//' replace ' \\|$' '' replace '=\\\$' '=//' replace '== \/$' '===//' replace '==\/$' '==//' replace '^/\\$.$\/$' '///\1' replace '^/\$.$\/$' '//\1' replace '//============================================================================//' '//===----------------------------------------------------------------------===//' Differential Revision: https://reviews.llvm.org/D90732	2020-11-04 18:11:13 +00:00
Mehdi Amini	aeb4b1a9d8	Add facilities to print/parse a pass pipeline through the C API This also includes and exercise a register function for individual passes. Differential Revision: https://reviews.llvm.org/D90728	2020-11-04 17:29:49 +00:00
Nicolas Vasilache	85ff2705cd	[mlir][std] Add DimOp folding for dim(tensor_load(m)) -> dim(m). Differential Revision: https://reviews.llvm.org/D90755	2020-11-04 13:06:22 +00:00
Nicolas Vasilache	f202d32216	[mlir][SCF] Add canonicalization pattern for scf::For to eliminate yields that just forward. For instance: ``` func @for_yields_3(%lb : index, %ub : index, %step : index) -> (i32, i32, i32) { %a = call @make_i32() : () -> (i32) %b = call @make_i32() : () -> (i32) %r:3 = scf.for %i = %lb to %ub step %step iter_args(%0 = %a, %1 = %a, %2 = %b) -> (i32, i32, i32) { %c = call @make_i32() : () -> (i32) scf.yield %0, %c, %2 : i32, i32, i32 } return %r#0, %r#1, %r#2 : i32, i32, i32 } ``` Canonicalizes as: ``` func @for_yields_3(%arg0: index, %arg1: index, %arg2: index) -> (i32, i32, i32) { %0 = call @make_i32() : () -> i32 %1 = call @make_i32() : () -> i32 %2 = scf.for %arg3 = %arg0 to %arg1 step %arg2 iter_args(%arg4 = %0) -> (i32) { %3 = call @make_i32() : () -> i32 scf.yield %3 : i32 } return %0, %2, %1 : i32, i32, i32 } ``` Differential Revision: https://reviews.llvm.org/D90745	2020-11-04 11:36:27 +00:00
Alex Zinenko	8475fa6ed6	[mlir] Add a simpler lowering pattern for WhileOp representing a do-while loop When the "after" region of a WhileOp is merely forwarding its arguments back to the "before" region, i.e. WhileOp is a canonical do-while loop, a simpler CFG subgraph that omits the "after" region with its extra branch operation can be produced. Loop rotation from general "while" to "if { do-while }" is left for a future canonicalization pattern when it becomes necessary. Differential Revision: https://reviews.llvm.org/D90604	2020-11-04 09:43:13 +01:00
Alex Zinenko	4c0e255c98	[mlir] Add lowering to CFG for WhileOp The lowering is a straightforward inlining of the "before" and "after" regions connected by (conditional) branches. This plugs the WhileOp into the progressive lowering scheme. Future commits may choose to target WhileOp instead of CFG when lowering ForOp. Differential Revision: https://reviews.llvm.org/D90603	2020-11-04 09:43:13 +01:00
Alex Zinenko	79716559b5	[mlir] Add a generic while/do-while loop to the SCF dialect The new construct represents a generic loop with two regions: one executed before the loop condition is verifier and another after that. This construct can be used to express both a "while" loop and a "do-while" loop, depending on where the main payload is located. It is intended as an intermediate abstraction for lowering, which will be added later. This form is relatively easy to target from higher-level abstractions and supports transformations such as loop rotation and LICM. Differential Revision: https://reviews.llvm.org/D90255	2020-11-04 09:43:13 +01:00
Stella Laurenzo	8260db752c	[mlir][Python] Return and accept OpView for all functions. * All functions that return an Operation now return an OpView. * All functions that accept an Operation now accept an _OperationBase, which both Operation and OpView extend and can resolve to the backing Operation. * Moves user-facing instance methods from Operation -> _OperationBase so that both can have the same API. * Concretely, this means that if there are custom op classes defined (i.e. in Python), any iteration or creation will return the appropriate instance (i.e. if you get/create an std.addf, you will get an instance of the mlir.dialects.std.AddFOp class, getting full access to any custom API it exposes). * Refactors all __eq__ methods after realizing the proper way to do this for _OperationBase. Differential Revision: https://reviews.llvm.org/D90584	2020-11-03 22:48:34 -08:00
Mehdi Amini	f61d1028fa	Add a basic C API for the MLIR PassManager as well as a basic TableGen backend for creating passes This is exposing the basic functionalities (create, nest, addPass, run) of the PassManager through the C API in the new header: `include/mlir-c/Pass.h`. In order to exercise it in the unit-test, a basic TableGen backend is also provided to generate a simple C wrapper around the pass constructor. It is used to expose the libTransforms passes to the C API. Reviewed By: stellaraccident, ftynse Differential Revision: https://reviews.llvm.org/D90667	2020-11-04 06:36:31 +00:00
Rahul Joshi	c298824f9c	[MLIR] Check for duplicate entries in attribute dictionary during custom parsing - Verify that attributes parsed using a custom parser do not have duplicates. - If there are duplicated in the attribute dictionary in the input, they get caught during the dictionary parsing. - This check verifies that there is no duplication between the parsed dictionary and any attributes that might be added by the custom parser (or when the custom parsing code adds duplicate attributes). - Fixes https://bugs.llvm.org/show_bug.cgi?id=48025 Differential Revision: https://reviews.llvm.org/D90502	2020-11-03 16:40:46 -08:00
Thomas Raoux	29d1fba7b5	[mlir][vector] Make linalg FillOp vectorization use Transfer op Differential Revision: https://reviews.llvm.org/D90474	2020-11-03 14:35:26 -08:00
Thomas Raoux	36480657d8	[mlir][vector] Add canonicalization patterns for ExtractStride/ShapeCast + Splat constant Differential Revision: https://reviews.llvm.org/D90567	2020-11-03 11:29:54 -08:00
Lei Zhang	d5bf727bcd	[mlir][spirv] Support for a few more decorations in (de)serialization Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D90655	2020-11-03 08:11:19 -05:00
Alexander Bosch	5452fa6a59	[MLIR] Added test operations to replace linalg dependency for BufferizeTests. Summary: Added test operations to replace the LinalgDialect dependency in tests which use the buffer-deallocation, buffer-hoisting, buffer-loop-hoisting, promote-buffers-to-stack, buffer-placement-preparation-allowed-memref-resutls and buffer-placement-preparation pass. Adapted the corresponding tests cases and TestBufferPlacement.cpp. Differential Revision: https://reviews.llvm.org/D90037	2020-11-03 12:18:49 +01:00
Mehdi Amini	008b9d97cb	Make the implicit nesting behavior of the PassManager user-controllable and default to false This is an error prone behavior, I frequently have ~20 min debugging sessions when I hit an unexpected implicit nesting. This default makes the C++ API safer for users. Depends On D90669 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D90671	2020-11-03 11:17:44 +00:00
Mehdi Amini	cd7107a62b	Handle the verifier at run() time in the PassManager instead of build time This simplifies a few parts of the pass manager, but in particular we don't add as many verifierpass as there are passes in the pipeline, and we can now enable/disable the verifier after the fact on an already built PassManager. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D90669	2020-11-03 11:17:14 +00:00
Alexander Belyaev	9925168576	[mlir] Convert `memref_reshape` to LLVM. https://llvm.discourse.group/t/rfc-standard-memref-cast-ops/1454/15 Differential Revision: https://reviews.llvm.org/D90377	2020-11-03 11:39:08 +01:00
Tres Popp	d05d42199f	[mlir] Add partial lowering of shape.cstr_broadcastable. Because cstr operations allow more instruction reordering than asserts, we only lower cstr_broadcastable to std ops with cstr_require. This ensures that the more drastic lowering to asserts can happen specifically with the user's desire. Differential Revision: https://reviews.llvm.org/D89325	2020-11-03 09:57:23 +01:00
Diego Caballero	f82d307c98	[mlir][Affine] Remove single iteration affine.for ops in AffineLoopNormalize This patch renames AffineParallelNormalize to AffineLoopNormalize to make it more generic and be able to hold more loop normalization transformations in the future for affine.for and affine.parallel ops. Eventually, it could also be extended to support scf.for and scf.parallel. As a starting point for affine.for, the patch also adds support for removing single iteration affine.for ops to the the pass. Differential Revision: https://reviews.llvm.org/D90267	2020-11-02 16:44:04 -08:00
Rahul Joshi	549eac9d87	[MLIR] Remove unnecessary CHECK's from tests for which we do not run FileCheck. Differential Revision: https://reviews.llvm.org/D90651	2020-11-02 15:21:33 -08:00
Rahul Joshi	c254b0bb69	[MLIR] Introduce std.global_memref and std.get_global_memref operations. - Add standard dialect operations to define global variables with memref types and to retrieve the memref for to a named global variable - Extend unit tests to test verification for these operations. Differential Revision: https://reviews.llvm.org/D90337	2020-11-02 13:43:04 -08:00
Sean Silva	773ad135a3	[mlir][Bufferize] Rename TestBufferPlacement to TestFinalizingBufferize BufferPlacement is no longer part of bufferization. However, this test is an important test of "finalizing" bufferize passes. A "finalizing" bufferize conversion is one that performs a "full" conversion and expects all tensors to be gone from the program. This in particular involves rewriting funcs (including block arguments of the contained region), calls, and returns. The unique property of finalizing bufferization passes is that they cannot be done via a local transformation with suitable materializations to ensure composability (as other bufferization passes do). For example, if a call is rewritten, the callee needs to be rewritten otherwise the IR will end up invalid. Thus, finalizing bufferization passes require an atomic change to the entire program (e.g. the whole module). This new designation makes it clear also that it shouldn't be testing bufferization of linalg ops, so the tests have been updated to not use linalg.generic ops. (linalg.copy is still used as the "copy" op for copying into out-params) Differential Revision: https://reviews.llvm.org/D89979	2020-11-02 12:42:32 -08:00
Sean Silva	52b0fe6404	[mlir] Add func-bufferize pass. This is the most basic possible finalizing bufferization pass, which I also think is sufficient for most new use cases. The more concentrated nature of this pass also greatly clarifies the invariants that it requires on its input to safely transform the program (see the pass description in Passes.td). With this pass, I have now upstreamed practically all of the bufferizations from npcomp (the exception being std.constant, which can be upstreamed when std.global_memref lands: https://llvm.discourse.group/t/rfc-global-variables-in-mlir/2076/16 ) Differential Revision: https://reviews.llvm.org/D90205	2020-11-02 12:42:32 -08:00
Thomas Raoux	9081e7594d	[mlir][vector] Address post-commit review comments on vector ops folding patterns Differential Revision: https://reviews.llvm.org/D90183	2020-11-02 10:57:32 -08:00
Stella Laurenzo	b85f2f5c5f	[mlir][CAPI] Add APIs for mlirOperationGetName and Identifier. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D90583	2020-11-02 18:52:13 +00:00
Stella Laurenzo	af66cd173f	[mlir][Python] Context managers for Context, InsertionPoint, Location. * Finishes support for Context, InsertionPoint and Location to be carried by the thread using context managers. * Introduces type casters and utilities so that DefaultPyMlirContext and DefaultPyLocation in method signatures does the right thing (allows explicit or gets from the thread context). * Extend the rules for the thread context stack to handle nesting, appropriately inheriting and clearing depending on whether the context is the same. * Refactors all method signatures to follow the new convention on trailing parameters for defaulting parameters (loc, ip, context). When the objects are carried in the thread context, this allows most explicit uses of these values to be elided. * Removes the style guide section on putting accessors to construct global objects on the PyMlirContext: this style fails to make good use of the new facility since it is often the only thing remaining needing an MlirContext. * Moves Module parse/creation from mlir.ir.Context to static methods on mlir.ir.Module. * Moves Context.create_operation to a static Operation.create method. * Moves Type parsing from mlir.ir.Context to static methods on mlir.ir.Type. * Moves Attribute parsing from mlir.ir.Context to static methods on mlir.ir.Attribute. * Move Location factory methods from mlir.ir.Context to static methods on mlir.ir.Location. * Refactors the std dialect fake "ODS" generated code to take advantage of the new scheme. Differential Revision: https://reviews.llvm.org/D90547	2020-11-01 19:00:39 -08:00
Arthur Eubanks	5c31b8b94f	Revert "Use uint64_t for branch weights instead of uint32_t" This reverts commit `10f2a0d662`. More uint64_t overflows.	2020-10-31 00:25:32 -07:00
Mehdi Amini	72ddd559b8	Use `--allow-unused-prefixes=false` by default for FileCheck in MLIR testsuite This option catches unexpected mismatch when a prefix is given to FileCheck on the command line but never matches a single line in the test. See http://lists.llvm.org/pipermail/llvm-dev/2020-October/146162.html for more info. Differential Revision: https://reviews.llvm.org/D90501	2020-10-30 21:46:15 +00:00
Sean Silva	b866574246	[mlir] Add BufferResultsToOutParams pass. This pass allows removing getResultConversionKind from BufferizeTypeConverter. This pass replaces the AppendToArgumentsList functionality. As far as I could tell, the only use of this functionlity is to perform the transformation that is implemented in this pass. Future patches will remove the getResultConversionKind machinery from BufferizeTypeConverter, but sending this patch for individual review for clarity. Differential Revision: https://reviews.llvm.org/D90071	2020-10-30 14:06:14 -07:00
ergawy	90a8260cb4	[MLIR][SPIRV] Start module combiner. This commit adds a new library that merges/combines a number of spv modules into a combined one. The library has a single entry point: combine(...). To combine a number of MLIR spv modules, we move all the module-level ops from all the input modules into one big combined module. To that end, the combination process can proceed in 2 phases: (1) resolving conflicts between pairs of ops from different modules (2) deduplicate equivalent ops/sub-ops in the merged module. (TODO) This patch implements only the first phase. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D90477	2020-10-30 16:55:43 -04:00
Geoffrey Martin-Noble	1142eaed9d	Revert "[MLIR][SPIRV] Start module combiner." This reverts commit `27324f2855`. Shared libs build is broken linking lib/libMLIRSPIRVModuleCombiner.so: ``` ModuleCombiner.cpp: undefined reference to `mlir::spirv::ModuleOp::addressing_model() ``` https://buildkite.com/mlir/mlir-core/builds/8988#e3d966b9-ea43-492e-a192-b28e71e9a15b	2020-10-30 13:34:15 -07:00
ergawy	27324f2855	[MLIR][SPIRV] Start module combiner. This commit adds a new library that merges/combines a number of spv modules into a combined one. The library has a single entry point: combine(...). To combine a number of MLIR spv modules, we move all the module-level ops from all the input modules into one big combined module. To that end, the combination process can proceed in 2 phases: (1) resolving conflicts between pairs of ops from different modules (2) deduplicate equivalent ops/sub-ops in the merged module. (TODO) This patch implements only the first phase. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D90477	2020-10-30 14:58:17 -04:00
Arthur Eubanks	10f2a0d662	Use uint64_t for branch weights instead of uint32_t CallInst::updateProfWeight() creates branch_weights with i64 instead of i32. To be more consistent everywhere and remove lots of casts from uint64_t to uint32_t, use i64 for branch_weights. Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D88609	2020-10-30 10:03:46 -07:00

... 3 4 5 6 7 ...

3367 Commits