llvm-project

Commit Graph

Author	SHA1	Message	Date
Suraj Sudhir	b28121133d	TOSA MLIR Dialect This is the TOSA MLIR Dialect described in the following MLIR RFC: https://llvm.discourse.group/t/rfc-tosa-dialect-in-mlir/1971/24 Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D90411	2020-11-07 08:38:09 -08:00
Sean Silva	e6e9e7eedf	[mlir][Linalg] Canonicalize duplicate args. I ran into this pattern when converting elementwise ops like `addf %arg0, %arg : tensor<?xf32>` to linalg. Redundant arguments can also easily arise from linalg-fusion-for-tensor-ops. Also, fix some small bugs in the logic in LinalgStructuredOpsInterface.td. Differential Revision: https://reviews.llvm.org/D90812	2020-11-06 14:40:51 -08:00
Alex Zinenko	bb9b5d3971	Revert "[mlir][CAPI] Proposal: Always building a libMLIRPublicAPI.so." This reverts commit `80fe2f61fa`. Broke linkage with GNU ld. See original review thread for more details.	2020-11-06 18:59:58 +01:00
Stella Laurenzo	80fe2f61fa	[mlir][CAPI] Proposal: Always building a libMLIRPublicAPI.so. We were discussing on discord regarding the need for extension-based systems like Python to dynamically link against MLIR (or else you can only have one extension that depends on it). Currently, when I set that up, I piggy-backed off of the flag that enables building libLLVM.so and libMLIR.so and depended on libMLIR.so from the python extension if shared library building was enabled. However, this is less than ideal. In the current setup, libMLIR.so exports both all symbols from the C++ API and the C-API. The former is a kitchen sink and the latter is curated. We should be splitting them and for things that are properly factored to depend on the C-API, they should have the option to only depend on the C-API, and we should build that shared library no matter what. Its presence isn't just an optimization: it is a key part of the system. To do this right, I needed to: * Introduce visibility macros into mlir-c/Support.h. These should work on both *nix and windows as-is. * Create a new libMLIRPublicAPI.so with just the mlir-c object files. * Compile the C-API with -fvisibility=hidden. * Conditionally depend on the libMLIR.so from libMLIRPublicAPI.so if building libMLIR.so (otherwise, also links against the static libs and will produce a mondo libMLIRPublicAPI.so). * Disable re-exporting of static library symbols that come in as transitive deps. This gives us a dynamic linked C-API layer that is minimal and should work as-is on all platforms. Since we don't support libMLIR.so building on Windows yet (and it is not very DLL friendly), this will fall back to a mondo build of libMLIRPublicAPI.so, which has its uses (it is also the most size conscious way to go if you happen to know exactly what you need). Sizes (release/stripped, Ubuntu 20.04): Shared library build: libMLIRPublicAPI.so: 121Kb _mlir.cpython-38-x86_64-linux-gnu.so: 1.4Mb mlir-capi-ir-test: 135Kb libMLIR.so: 21Mb Static build: libMLIRPublicAPI.so: 5.5Mb (since this is a "static" build, this includes the MLIR implementation as non-exported code). _mlir.cpython-38-x86_64-linux-gnu.so: 1.4Mb mlir-capi-ir-test: 44Kb Things like npcomp and circt which bring their own dialects/transforms/etc would still need the shared library build and code that links against libMLIR.so (since it is all C++ interop stuff), but hopefully things that only depend on the public C-API can just have the one narrow dep. I spot checked everything with nm, and it looks good in terms of what is exporting/importing from each layer. I'm not in a hurry to land this, but if it is controversial, I'll probably split off the Support.h and API visibility macro changes, since we should set that pattern regardless. Reviewed By: mehdi_amini, benvanik Differential Revision: https://reviews.llvm.org/D90824	2020-11-06 09:00:56 -08:00
Stella Laurenzo	60e2c5b03b	[mlir][CAPI] Add missing 'static' to inline C function. * Asked to submit separately from https://reviews.llvm.org/D90824	2020-11-05 21:47:55 -08:00
Sean Silva	f7bc568266	[mlir] Remove AppendToArgumentsList functionality from BufferizeTypeConverter. This functionality is superseded by BufferResultsToOutParams pass (see https://reviews.llvm.org/D90071) for users that require buffers to be out-params. That pass should be run immediately after all tensors are gone from the program (before buffer optimizations and deallocation insertion), such as immediately after a "finalizing" bufferize pass. The -test-finalizing-bufferize pass now defaults to what used to be the `allowMemrefFunctionResults=true` flag, and the finalizing-bufferize-allowed-memref-results.mlir file is moved to test/Transforms/finalizing-bufferize.mlir. Differential Revision: https://reviews.llvm.org/D90778	2020-11-05 11:20:09 -08:00
Nicolas Vasilache	ecca7852d9	[mlir][Linalg] Side effects interface for Linalg ops The LinalgDependenceGraph and alias analysis provide the necessary analysis for the Linalg fusion on buffers case. However this is not enough for linalg on tensors which require proper memory effects to play nicely with DCE and other transformations. This revision adds side effects to Linalg ops that were previously missing and has 2 consequences: 1. one example in the copy removal pass now fails since the linalg.generic op has side effects and the pass does not perform alias analysis / distinguish between reads and writes. 2. a few examples in fusion-tensor.mlir need to return the resulting tensor otherwise DCE automatically kicks in as part of greedy pattern application. Differential Revision: https://reviews.llvm.org/D90762	2020-11-05 09:00:28 +00:00
Artur Bialas	f9dca1039a	[mlir][spirv] Add VectorExtractDynamicOp and vector.extractelement lowering VectorExtractDynamicOp in SPIRV dialect conversion from vector.extractelement to spirv VectorExtractDynamicOp Differential Revision: https://reviews.llvm.org/D90679	2020-11-05 08:26:54 +01:00
Artur Bialas	1938b61bda	[mlir][spirv] Allow usage of vector size 8 and 16 with Vector16 capability Per spec, vector sizes 8 and 16 are allowed when Vector16 capability is present. This change expands the limitation of vector sizes to accept these sizes. Differential Revision: https://reviews.llvm.org/D90683	2020-11-05 08:26:15 +01:00
Rahul Joshi	8e466f69cf	[MLIR][NFC] Update syntax of global_memref in ODS description. - The ODS description was using an old syntax that was updated during the review. This fixes the ODS description to match the current syntax. Differential Revision: https://reviews.llvm.org/D90797	2020-11-04 15:58:46 -08:00
Alexandre Eichenberger	0795715616	[mlir][std] Add SignedCeilDivIOp and SignedFloorDivIOp with std to std lowering triggered by -std-expand-divs option. The new operations support positive/negative numerator/denominator numbers. Differential Revision: https://reviews.llvm.org/D89726 Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>	2020-11-04 14:16:23 -05:00
Mehdi Amini	bf5c8625c4	Move MlirStringCallback declaration from mlir-c/IR.h to mlir-c/Support.h (NFC) This is a generic utility that can be reused beyond the IR bindings. Differential Revision: https://reviews.llvm.org/D90736	2020-11-04 18:46:36 +00:00
Rahul Joshi	8c2025cc61	[MLIR] Refactor memref type -> LLVM Type conversion - Eliminate duplicated information about mapping from memref -> its descriptor fields by consolidating that mapping in two functions: getMemRefDescriptorFields and getUnrankedMemRefDescriptorFields. - Change convertMemRefType() and convertUnrankedMemRefType() to use these functions. - Remove convertMemrefSignature and convertUnrankedMemrefSignature. Differential Revision: https://reviews.llvm.org/D90707	2020-11-04 10:32:56 -08:00
Rahul Joshi	63e72aa4f5	[MLIR] Remove NoSideEffect from std.global_memref op. - Also spell "isUninitialized" correctly. Differential Revision: https://reviews.llvm.org/D90768	2020-11-04 10:31:19 -08:00
Mehdi Amini	c7994bd939	Switch from C-style comments `/* ... */` to C++ style `//` (NFC) This is mostly a scripted update, it may not be perfect. function replace() { FROM=$1 TO=$2 git grep "$FROM" $REPO_PATH |cut -f 1 -d : | sort -u | \ while read file; do sed -i "s#$FROM#$TO#" $file ; done } replace '\|\===----------------------------------------------------------------------===\\|$' '//===----------------------------------------------------------------------===//' replace '^/\ =' '//==' replace '^/\=' '//=' replace '^\\\=' '//=' replace '^\|\' '//' replace ' \\|$' '' replace '=\\\$' '=//' replace '== \/$' '===//' replace '==\/$' '==//' replace '^/\\$.$\/$' '///\1' replace '^/\$.$\/$' '//\1' replace '//============================================================================//' '//===----------------------------------------------------------------------===//' Differential Revision: https://reviews.llvm.org/D90732	2020-11-04 18:11:13 +00:00
Mehdi Amini	aeb4b1a9d8	Add facilities to print/parse a pass pipeline through the C API This also includes and exercises a register function for individual passes. Differential Revision: https://reviews.llvm.org/D90728	2020-11-04 17:29:49 +00:00
Paul C. Anagnostopoulos	d56cd4291e	[TableGen] Add !interleave operator to concatenate a list of values with delimiters Add a test. Use it in some TableGen files. Differential Revision: https://reviews.llvm.org/D90469	2020-11-04 09:23:54 -05:00
Frederik Gossen	1664462d70	[MLIR] Support walks over regions and blocks Relands - [MLIR] Support walks over regions and blocks (`dbae3d50f1`) - [MLIR] Use llvm::is_one_of in walk templates (`56299b1e58`) Differential Revision: https://reviews.llvm.org/D90753	2020-11-04 12:50:05 +00:00
Nicolas Vasilache	f202d32216	[mlir][SCF] Add canonicalization pattern for scf::For to eliminate yields that just forward. For instance: ``` func @for_yields_3(%lb : index, %ub : index, %step : index) -> (i32, i32, i32) { %a = call @make_i32() : () -> (i32) %b = call @make_i32() : () -> (i32) %r:3 = scf.for %i = %lb to %ub step %step iter_args(%0 = %a, %1 = %a, %2 = %b) -> (i32, i32, i32) { %c = call @make_i32() : () -> (i32) scf.yield %0, %c, %2 : i32, i32, i32 } return %r#0, %r#1, %r#2 : i32, i32, i32 } ``` Canonicalizes as: ``` func @for_yields_3(%arg0: index, %arg1: index, %arg2: index) -> (i32, i32, i32) { %0 = call @make_i32() : () -> i32 %1 = call @make_i32() : () -> i32 %2 = scf.for %arg3 = %arg0 to %arg1 step %arg2 iter_args(%arg4 = %0) -> (i32) { %3 = call @make_i32() : () -> i32 scf.yield %3 : i32 } return %0, %2, %1 : i32, i32, i32 } ``` Differential Revision: https://reviews.llvm.org/D90745	2020-11-04 11:36:27 +00:00
Alex Zinenko	79716559b5	[mlir] Add a generic while/do-while loop to the SCF dialect The new construct represents a generic loop with two regions: one executed before the loop condition is verified and another after that. This construct can be used to express both a "while" loop and a "do-while" loop, depending on where the main payload is located. It is intended as an intermediate abstraction for lowering, which will be added later. This form is relatively easy to target from higher-level abstractions and supports transformations such as loop rotation and LICM. Differential Revision: https://reviews.llvm.org/D90255	2020-11-04 09:43:13 +01:00
Stella Laurenzo	ebe12df896	Fix linkage error on mlirLogicalResultIsFailure. * For C, this needs to be inline static like the others. Differential Revision: https://reviews.llvm.org/D90740	2020-11-03 22:47:07 -08:00
Mehdi Amini	b4fa6d3e13	Switch the CallbackOstream wrapper in the MLIR C API to an Unbuffered stream This delegates the control of the buffering to the user of the API. This seems like a safer option as messages are immediately propagated to the user, which may lead to less surprising behavior during debugging for instance. In terms of performance, a user can add a buffered stream on the other side of the callback. Differential Revision: https://reviews.llvm.org/D90726	2020-11-04 06:36:32 +00:00
Mehdi Amini	f61d1028fa	Add a basic C API for the MLIR PassManager as well as a basic TableGen backend for creating passes This is exposing the basic functionalities (create, nest, addPass, run) of the PassManager through the C API in the new header: `include/mlir-c/Pass.h`. In order to exercise it in the unit-test, a basic TableGen backend is also provided to generate a simple C wrapper around the pass constructor. It is used to expose the libTransforms passes to the C API. Reviewed By: stellaraccident, ftynse Differential Revision: https://reviews.llvm.org/D90667	2020-11-04 06:36:31 +00:00
Rahul Joshi	c298824f9c	[MLIR] Check for duplicate entries in attribute dictionary during custom parsing - Verify that attributes parsed using a custom parser do not have duplicates. - If there are duplicates in the attribute dictionary in the input, they get caught during the dictionary parsing. - This check verifies that there is no duplication between the parsed dictionary and any attributes that might be added by the custom parser (or when the custom parsing code adds duplicate attributes). - Fixes https://bugs.llvm.org/show_bug.cgi?id=48025 Differential Revision: https://reviews.llvm.org/D90502	2020-11-03 16:40:46 -08:00
mikeurbach	2e36e0dad5	[MLIR] Move eraseArguments and eraseResults to FunctionLike Previously, they were only defined for `FuncOp`. To support this, `FunctionLike` needs a way to get an updated type from the concrete operation. This adds a new hook for that purpose, called `getTypeWithoutArgsAndResults`. For now, `FunctionLike` continues to assume the type is `FunctionType`, and concrete operations that use another type can hide the `getType`, `setType`, and `getTypeWithoutArgsAndResults` methods. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D90363	2020-11-03 16:53:46 -07:00
Mehdi Amini	bd156fee05	Remove extra comma after macro, fix GCC warning (NFC)	2020-11-03 22:22:13 +00:00
Kiran Chandramohan	ab8a4cec55	[MLIR] NFC : Move OpenMP dialect include to translation The OpenMP dialect include is only needed for translation and is not required in LLVM dialect. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D90510	2020-11-03 22:12:10 +00:00
Thomas Raoux	36480657d8	[mlir][vector] Add canonicalization patterns for ExtractStride/ShapeCast + Splat constant Differential Revision: https://reviews.llvm.org/D90567	2020-11-03 11:29:54 -08:00
Mehdi Amini	008b9d97cb	Make the implicit nesting behavior of the PassManager user-controllable and default to false This is an error prone behavior, I frequently have ~20 min debugging sessions when I hit an unexpected implicit nesting. This default makes the C++ API safer for users. Depends On D90669 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D90671	2020-11-03 11:17:44 +00:00
Mehdi Amini	cd7107a62b	Handle the verifier at run() time in the PassManager instead of build time This simplifies a few parts of the pass manager, but in particular we don't add as many verifier passes as there are passes in the pipeline, and we can now enable/disable the verifier after the fact on an already built PassManager. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D90669	2020-11-03 11:17:14 +00:00
Mehdi Amini	bf523186fb	Change the PrintOpStatsPass to operate on any operation instead of just ModuleOp This allows using it on other operations, like a GPUModule for example.	2020-11-03 11:15:32 +00:00
Mehdi Amini	0aaa2a4cb1	Remove mlir-c/Core.h which is superseded by the new API in mlir-c/IR.h This header was an initial early attempt at a crude C API for bindings, but it isn't used and redundant with the new API. At this point it only contributes to more confusion. Differential Revision: https://reviews.llvm.org/D90643	2020-11-03 11:15:32 +00:00
Alexander Belyaev	9925168576	[mlir] Convert `memref_reshape` to LLVM. https://llvm.discourse.group/t/rfc-standard-memref-cast-ops/1454/15 Differential Revision: https://reviews.llvm.org/D90377	2020-11-03 11:39:08 +01:00
Tres Popp	ca1bcdff4b	[mlir] Add to shape.is_broadcastable description	2020-11-03 10:23:55 +01:00
Diego Caballero	f82d307c98	[mlir][Affine] Remove single iteration affine.for ops in AffineLoopNormalize This patch renames AffineParallelNormalize to AffineLoopNormalize to make it more generic and be able to hold more loop normalization transformations in the future for affine.for and affine.parallel ops. Eventually, it could also be extended to support scf.for and scf.parallel. As a starting point for affine.for, the patch also adds support for removing single iteration affine.for ops to the pass. Differential Revision: https://reviews.llvm.org/D90267	2020-11-02 16:44:04 -08:00
MaheshRavishankar	04776bd0ed	[mlir][Linalg] Add more utility functions to LinalgDependenceGraph. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D90582	2020-11-02 16:35:20 -08:00
River Riddle	b870d9ec83	[mlir] Optimize Op definitions and registration to optimize for code size This revision refactors the base Op/AbstractOperation classes to reduce the amount of generated code size when defining a new operation. The current scheme involves taking the address of functions defined directly on Op and Trait classes. This is problematic because even when these functions are empty/unused we still result in these functions being defined in the main executable. In this revision, we switch to using SFINAE and template type filtering to remove functions that are not needed/used. For example, if an operation does not define a custom `print` method we shouldn't define a templated `printAssembly` method for it. The same applies to parsing/folding/verification/etc. This dropped MLIR code size for a large downstream library by ~10%(~1 mb in an opt build). Differential Revision: https://reviews.llvm.org/D90196	2020-11-02 14:39:43 -08:00
Rahul Joshi	c254b0bb69	[MLIR] Introduce std.global_memref and std.get_global_memref operations. - Add standard dialect operations to define global variables with memref types and to retrieve the memref for a named global variable - Extend unit tests to test verification for these operations. Differential Revision: https://reviews.llvm.org/D90337	2020-11-02 13:43:04 -08:00
Sean Silva	52b0fe6404	[mlir] Add func-bufferize pass. This is the most basic possible finalizing bufferization pass, which I also think is sufficient for most new use cases. The more concentrated nature of this pass also greatly clarifies the invariants that it requires on its input to safely transform the program (see the pass description in Passes.td). With this pass, I have now upstreamed practically all of the bufferizations from npcomp (the exception being std.constant, which can be upstreamed when std.global_memref lands: https://llvm.discourse.group/t/rfc-global-variables-in-mlir/2076/16 ) Differential Revision: https://reviews.llvm.org/D90205	2020-11-02 12:42:32 -08:00
Mehdi Amini	9be3c01eb9	Undef the `DEFINE_C_API_STRUCT` macro after using it in the MLIR C API header (NFC) Leaking macros isn't a good practice when defining headers. This requires duplicating the macro definition in every header though, but that seems like a better tradeoff right now. Differential Revision: https://reviews.llvm.org/D90633	2020-11-02 19:18:32 +00:00
Stella Laurenzo	b85f2f5c5f	[mlir][CAPI] Add APIs for mlirOperationGetName and Identifier. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D90583	2020-11-02 18:52:13 +00:00
Frederik Gossen	327bf5c2d9	Revert "[MLIR] Support walks over regions and blocks" This reverts commit `dbae3d50f1`. Cannot build with gcc/g++ 7.5.0.	2020-11-02 16:21:29 +00:00
Frederik Gossen	6b74a5aab1	Revert "[MLIR] Use `llvm::is_one_of` in walk templates" This reverts commit `56299b1e58`. Cannot build with gcc/g++ 7.5.0.	2020-11-02 16:21:29 +00:00
Sean Silva	b866574246	[mlir] Add BufferResultsToOutParams pass. This pass allows removing getResultConversionKind from BufferizeTypeConverter. This pass replaces the AppendToArgumentsList functionality. As far as I could tell, the only use of this functionality is to perform the transformation that is implemented in this pass. Future patches will remove the getResultConversionKind machinery from BufferizeTypeConverter, but sending this patch for individual review for clarity. Differential Revision: https://reviews.llvm.org/D90071	2020-10-30 14:06:14 -07:00
ergawy	90a8260cb4	[MLIR][SPIRV] Start module combiner. This commit adds a new library that merges/combines a number of spv modules into a combined one. The library has a single entry point: combine(...). To combine a number of MLIR spv modules, we move all the module-level ops from all the input modules into one big combined module. To that end, the combination process can proceed in 2 phases: (1) resolving conflicts between pairs of ops from different modules (2) deduplicate equivalent ops/sub-ops in the merged module. (TODO) This patch implements only the first phase. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D90477	2020-10-30 16:55:43 -04:00
Sean Silva	30e130c3ed	[mlir] Move some linalg patterns around. The bufferization patterns are moved to the .cpp file, which is preferred in the codebase when it makes sense. The LinalgToStandard patterns are kept in a header because they are expected to be used individually. However, they are moved to LinalgToStandard.h which is the file corresponding to where they are defined. This also removes TensorCastOpConverter, which is handled by populateStdBufferizePatterns now. Eventually, the constant op lowering will be handled as well, but there are currently holdups on moving it (see https://reviews.llvm.org/D89916). Differential Revision: https://reviews.llvm.org/D90254	2020-10-30 13:48:03 -07:00
Geoffrey Martin-Noble	1142eaed9d	Revert "[MLIR][SPIRV] Start module combiner." This reverts commit `27324f2855`. Shared libs build is broken linking lib/libMLIRSPIRVModuleCombiner.so: ``` ModuleCombiner.cpp: undefined reference to `mlir::spirv::ModuleOp::addressing_model() ``` https://buildkite.com/mlir/mlir-core/builds/8988#e3d966b9-ea43-492e-a192-b28e71e9a15b	2020-10-30 13:34:15 -07:00
ergawy	27324f2855	[MLIR][SPIRV] Start module combiner. This commit adds a new library that merges/combines a number of spv modules into a combined one. The library has a single entry point: combine(...). To combine a number of MLIR spv modules, we move all the module-level ops from all the input modules into one big combined module. To that end, the combination process can proceed in 2 phases: (1) resolving conflicts between pairs of ops from different modules (2) deduplicate equivalent ops/sub-ops in the merged module. (TODO) This patch implements only the first phase. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D90477	2020-10-30 14:58:17 -04:00
Mehdi Amini	b3430ed05f	Revert "[MLIR][SPIRV] Start module combiner" This reverts commit `316593ce83`. Build is broken with: TestModuleCombiner.cpp:(.text._ZN12_GLOBAL__N_122TestModuleCombinerPass14runOnOperationEv+0x195): undefined reference to `mlir::spirv::combine(llvm::MutableArrayRef<mlir::spirv::ModuleOp>, mlir::OpBuilder&, llvm::function_ref<void (mlir::spirv::ModuleOp, llvm::StringRef, llvm::StringRef)>)'	2020-10-30 15:09:21 +00:00
Frederik Gossen	56299b1e58	[MLIR] Use `llvm::is_one_of` in walk templates Differential Revision: https://reviews.llvm.org/D90449	2020-10-30 14:42:34 +00:00
ergawy	316593ce83	[MLIR][SPIRV] Start module combiner This commit adds a new library that merges/combines a number of spv modules into a combined one. The library has a single entry point: combine(...). To combine a number of MLIR spv modules, we move all the module-level ops from all the input modules into one big combined module. To that end, the combination process can proceed in 2 phases: (1) resolving conflicts between pairs of ops from different modules (2) deduplicate equivalent ops/sub-ops in the merged module. (TODO) This patch implements only the first phase. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D90022	2020-10-30 09:37:28 -04:00
Tres Popp	d2abbc17b2	[mlir] Add shape.is_broadcastable. This op returns a boolean value indicating whether 2 ops are broadcastable or not. This follows the same logic as the other ops with broadcast in their names in the shape dialect. Concretely, shape.is_broadcastable returning true implies that shape.broadcast will not give an error, and shape.cstr_broadcastable will not result in an assertion failure. Similarly, false implies an error or assertion failure.	2020-10-30 09:46:35 +01:00
River Riddle	a463ea50a4	[mlir][ASM] Refactor how attribute/type aliases are specified. Previously they were separated into "instance" and "kind" aliases, and also required that the dialect know ahead of time all of the instances that would have a corresponding alias. This approach was very clunky and not ergonomic to interact with. The new approach is to provide the dialect with an instance of an attribute/type to provide an alias for, fully replacing the original split approach. Differential Revision: https://reviews.llvm.org/D89354	2020-10-30 00:39:46 -07:00
Stella Laurenzo	c645ea5e29	Add InsertionPoint and context managers to the Python API. * Removes index based insertion. All insertion now happens through the insertion point. * Introduces thread local context managers for implicit creation relative to an insertion point. * Introduces (but does not yet use) binding the Context to the thread local context stack. Intent is to refactor all methods to take context optionally and have them use the default if available. * Adds C APIs for mlirOperationGetParentOperation(), mlirOperationGetBlock() and mlirBlockGetTerminator(). * Removes an assert in PyOperation creation that was incorrectly constraining. There is already a TODO to rework the keepAlive field that it was guarding and without the assert, it is no worse than the current state. Differential Revision: https://reviews.llvm.org/D90368	2020-10-29 17:50:13 -07:00
Christian Sigg	db7129a005	[mlir][gpu] Add pass to make GPU ops within a region execute asynchronously. Do not use the pass yet, except in a test. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D89937	2020-10-29 22:17:50 +01:00
Christian Sigg	3556114083	[mlir][gpu] Allow gpu.launch_func to be async. This is a roll-forward of rGec7780ebdab4, now that the remaining gpu.launch_func have been converted to custom form in rGb22f111023ba. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D90420	2020-10-29 21:48:38 +01:00
Mehdi Amini	834618a2ff	Revert "[mlir][gpu] Allow gpu.launch_func to be async." This reverts commit `ec7780ebda`. One of the bots is crashing in a test related to this change.	2020-10-29 17:30:27 +00:00
Christian Sigg	ec7780ebda	[mlir][gpu] Allow gpu.launch_func to be async. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D89324	2020-10-29 17:54:56 +01:00
Nicolas Vasilache	9b17bf2e54	[mlir][Linalg] Make Linalg fusion a test pass Linalg "tile-and-fuse" is currently exposed as a Linalg pass "-linalg-fusion" but only the mechanics of the transformation are currently relevant. Instead turn it into a "-test-linalg-greedy-fusion" pass which performs canonicalizations to enable more fusions to compose. This allows dropping the OperationFolder which is not meant to be used with the pattern rewrite infrastructure. Differential Revision: https://reviews.llvm.org/D90394	2020-10-29 15:18:51 +00:00
Frederik Gossen	dbae3d50f1	[MLIR] Support walks over regions and blocks Add specializations for `walk` to allow traversal of regions and blocks. Differential Revision: https://reviews.llvm.org/D90379	2020-10-29 14:34:22 +00:00
Valentin Clement	1ce5f8bbb6	[mlir][openacc] Add if and device_type to update op Update op is modelling the update directive (2.14.4) from the OpenACC specs. An if condition and a device_type list can be attached to the directive. This patch adds these two pieces of information to the current op. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D90310	2020-10-29 09:54:44 -04:00
River Riddle	73547b08de	[mlir][SymbolTable] Small optimization to walking symbol references * Check region count for unknown symbol tables first, as it is a faster check * Add an accessor to MutableDictionaryAttr to get the internal dictionary without creating a new one if it is empty. This avoids an otherwise unnecessary lookup of an MLIRContext.	2020-10-28 22:01:10 -07:00
River Riddle	fa4174792a	[mlir][Inliner] Add a `wouldBeCloned` flag to each of the `isLegalToInline` hooks. Often times the legality of inlining can change depending on if the callable is going to be inlined in-place, or cloned. For example, some operations are not allowed to be duplicated and can only be inlined if the original callable will cease to exist afterwards. The new `wouldBeCloned` flag allows for dialects to hook into this when determining legality. Differential Revision: https://reviews.llvm.org/D90360	2020-10-28 21:49:28 -07:00
River Riddle	501fda0167	[mlir][Inliner] Add a new hook for checking if it is legal to inline a callable into a call In certain situations it isn't legal to inline a call operation, but this isn't something that is possible(at least not easily) to prevent with the current hooks. This revision adds a new hook so that dialects with call operations that shouldn't be inlined can prevent it. Differential Revision: https://reviews.llvm.org/D90359	2020-10-28 21:49:28 -07:00
Haruki Imai	a66e334ceb	[mlir] Convert raw data in dense element attributes for big-endian machines. This patch fixes bug 46091 (https://bugs.llvm.org/show_bug.cgi?id=46091). Raw data for the `dense-element attribute` is written in little endian (LE) format. This commit converts the format to big endian (BE) in `Attribute Parser` on the BE machine. Also, when outputting on a BE machine, the BE format is converted to LE in "AsmPrinter". Differential Revision: https://reviews.llvm.org/D80695	2020-10-28 17:06:16 -07:00
Alexander Belyaev	67760bb2d6	[mlir] Use OpBuilderDAG for MemRefReinterpretCastOp.	2020-10-28 21:42:14 +01:00
Alexander Belyaev	7a996027b9	[mlir] Convert memref_reshape to memref_reinterpret_cast. Differential Revision: https://reviews.llvm.org/D90235	2020-10-28 21:15:32 +01:00
Kazuaki Ishizaki	41b09f4eff	[mlir] NFC: fix trivial typos fix typos in comments and documents Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D90089	2020-10-29 04:05:22 +09:00
Mehdi Amini	72023442c1	Add a `mlirModuleGetBody()` accessor to the C API and bind it in Python Getting the body of a Module is a common need which justifies a dedicated accessor instead of forcing users to go through the region->blocks->front unwrapping manually. Differential Revision: https://reviews.llvm.org/D90287	2020-10-28 17:53:52 +00:00
Lei Zhang	b1b0ddbb67	[mlir] NFC: small fixes to LinalgTilingOptions API This commit changes to use plain values instead of references. We need to copy it anyway. References forbid using temporary values generated from expressions. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D90277	2020-10-28 08:28:58 -04:00
River Riddle	c09d10437f	[mlir][NFC] Fix incorrect header comments. Resolves missed comments in D89103	2020-10-27 16:22:31 -07:00
River Riddle	935d708568	[mlir][NFC] Remove unnecessary PatternRewriter::create methods At this point, these methods are just carbon copies of OpBuilder::create and aren't necessary given that PatternRewriter inherits from OpBuilder. Differential Revision: https://reviews.llvm.org/D90087	2020-10-27 16:16:51 -07:00
River Riddle	eacac2679d	[mlir][Interfaces] Optimize the implementation of InterfaceMap to reduce generated code size. An InterfaceMap is generated for every single operation type, and is responsible for a large amount of the code size from MLIR given that its internals highly utilize templates. This revision refactors the internal implementation to use bare malloc/free for interface instances as opposed to static variables and moves as much code out of templates as possible. This led to a decrease of over >1mb (~12% of total MLIR related code size) for a downstream MLIR library with a large amount of operations. Differential Revision: https://reviews.llvm.org/D90086	2020-10-27 16:16:51 -07:00
River Riddle	d989ae9069	[mlir][SideEffectInterface][NFC] Move several InterfaceMethods to the extraClassDeclarations instead All InterfaceMethods will have a corresponding entry in the interface model, and by extension have an implementation generated for every operation type. This can result in large binary size increases when a large amount of operations use an interface, such as the side effect interface. Differential Revision: https://reviews.llvm.org/D90084	2020-10-27 16:16:51 -07:00
Eugene Zhulenev	f6c9f6eccd	[mlir] JitRunner: add a config option to register symbols with ExecutionEngine at runtime Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D90264	2020-10-27 15:57:34 -07:00
Nicolai Hähnle	e025d09b21	Revert multiple patches based on "Introduce CfgTraits abstraction" These logically belong together since it's a base commit plus followup fixes to less common build configurations. The patches are: Revert "CfgInterface: rename interface() to getInterface()" This reverts commit `a74fc48158`. Revert "Wrap CfgTraitsFor in namespace llvm to please GCC 5" This reverts commit `f2a06875b6`. Revert "Try to make GCC5 happy about the CfgTraits thing" This reverts commit `03a5f7ce12`. Revert "Introduce CfgTraits abstraction" This reverts commit `c0cdd22c72`.	2020-10-27 20:33:30 +01:00
Alex Zinenko	89eab30e5c	[mlir] use OpBuilderDAG instead of OpBuilder A recent commit introduced a new syntax for specifying builder arguments in ODS, which is better amenable to automated processing, and deprecated the old form. Transition all dialects as well as Linalg ODS generator to use the new syntax. Add a deprecation notice to ODS generator. Reviewed By: rriddle, jpienaar Differential Revision: https://reviews.llvm.org/D90038	2020-10-27 10:21:49 +01:00
River Riddle	67f52f35d6	[mlir][StorageUniquer] Refactor parametric storage to use sharded dense sets This revision implements sharding in the storage of parametric instances to decrease lock contention by sharding out the allocator/mutex/etc. to use for a specific storage instance based on the hash key. This is a somewhat common approach to reducing lock contention on data structures, and is used by the concurrent hashmaps provided by folly/java/etc. For several compilations tested, this removed all/most lock contention from profiles and reduced compile time by several seconds. Differential Revision: https://reviews.llvm.org/D89659	2020-10-26 19:40:19 -07:00
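The sharding idea described in the commit above can be illustrated with a minimal Python sketch. This is not the MLIR `StorageUniquer` implementation; the class `ShardedUniquer`, its `get_or_create` method, and the shard count are hypothetical names chosen for illustration. The sketch only shows the general technique: pick a shard from the hash of the key, so that each lookup takes one shard's lock instead of a single global lock.

```python
import threading


class ShardedUniquer:
    """Hypothetical uniquer: one lock and one table per shard."""

    def __init__(self, num_shards=8):
        # Each shard owns its own lock and storage table.
        self._shards = [(threading.Lock(), {}) for _ in range(num_shards)]

    def get_or_create(self, key, ctor):
        # The hash of the key selects the shard, so unrelated keys
        # usually contend on different locks.
        lock, table = self._shards[hash(key) % len(self._shards)]
        with lock:
            if key not in table:
                table[key] = ctor(key)
            return table[key]


uniquer = ShardedUniquer(num_shards=4)
first = uniquer.get_or_create(("integer", 32), lambda k: object())
second = uniquer.get_or_create(("integer", 32), lambda k: object())
other = uniquer.get_or_create(("integer", 64), lambda k: object())
```

Here `first` and `second` refer to the same stored instance, while `other` is a distinct one. Threads that request keys landing in different shards do not block each other.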
River Riddle	3fffffa882	[mlir][Pattern] Add a new FrozenRewritePatternList class This class represents a rewrite pattern list that has been frozen, and thus immutable. This replaces the uses of OwningRewritePatternList in pattern driver related API, such as dialect conversion. When PDL becomes more prevalent, this API will allow for optimizing a set of patterns once without the need to do this per run of a pass. Differential Revision: https://reviews.llvm.org/D89104	2020-10-26 18:01:06 -07:00
River Riddle	b6eb26fd0e	[mlir][NFC] Move around the code related to PatternRewriting to improve layering There are several pieces of pattern rewriting infra in IR/ that really shouldn't be there. This revision moves those pieces to a better location such that they are easier to evolve in the future(e.g. with PDL). More concretely this revision does the following: * Create a Transforms/GreedyPatternRewriteDriver.h and move the applyandFold methods there. The definitions for these methods are already in Transforms/ so it doesn't make sense for the declarations to be in IR. Create a new lib/Rewrite library and move PatternApplicator there. This new library will be focused on applying rewrites, and will also include compiling rewrites with PDL. Differential Revision: https://reviews.llvm.org/D89103	2020-10-26 18:01:06 -07:00
River Riddle	b99bd77162	[mlir][Pattern] Refactor the Pattern class into a "metadata only" class The Pattern class was originally intended to be used for solely matching operations, but that use never materialized. All of the pattern infrastructure uses RewritePattern, and the infrastructure for pure matching(Matchers.h) is implemented inline. This means that this class isn't a useful abstraction at the moment, so this revision refactors it to solely encapsulate the "metadata" of a pattern. The metadata includes the various state describing a pattern; benefit, root operation, etc. The API on PatternApplicator is updated to now operate on `Pattern`s as nothing special from `RewritePattern` is necessary. This refactoring is also necessary for the upcoming use of PDL patterns alongside C++ rewrite patterns. Differential Revision: https://reviews.llvm.org/D86258	2020-10-26 18:01:06 -07:00
River Riddle	8a1ca2cd34	[mlir] Add a conversion pass between PDL and the PDL Interpreter Dialect The conversion between PDL and the interpreter is split into several different parts. ** The Matcher: The matching section of all incoming pdl.pattern operations is converted into a predicate tree and merged. Each pattern is first converted into an ordered list of predicates starting from the root operation. A predicate is composed of three distinct parts: * Position - A position refers to a specific location on the input DAG, i.e. an existing MLIR entity being matched. These can be attributes, operands, operations, results, and types. Each position also defines a relation to its parent. For example, the operand `[0] -> 1` has a parent operation position `[0]` (the root). * Question - A question refers to a query on a specific positional value. For example, an operation name question checks the name of an operation position. * Answer - An answer is the expected result of a question. For example, when matching an operation with the name "foo.op". The question would be an operation name question, with an expected answer of "foo.op". After the predicate lists have been created and ordered(based on occurrence of common predicates and other factors), they are formed into a tree of nodes that represent the branching flow of a pattern match. This structure allows for efficient construction and merging of the input patterns. There are currently only 4 simple nodes in the tree: * ExitNode: Represents the termination of a match * SuccessNode: Represents a successful match of a specific pattern * BoolNode/SwitchNode: Branch to a specific child node based on the expected answer to a predicate question. Once the matcher tree has been generated, this tree is walked to generate the corresponding interpreter operations. ** The Rewriter: The rewriter portion of a pattern is generated in a very straightforward manner, similarly to lowerings in other dialects. 
Each PDL operation that may exist within a rewrite has a mapping into the interpreter dialect. The code for the rewriter is generated within a FuncOp, that is invoked by the interpreter on a successful pattern match. Referenced values defined in the matcher become inputs to the generated rewriter function. An example lowering is shown below:
```mlir
// The following high level PDL pattern:
pdl.pattern : benefit(1) {
  %resultType = pdl.type
  %inputOperand = pdl.input
  %root, %results = pdl.operation "foo.op"(%inputOperand) -> %resultType
  pdl.rewrite %root {
    pdl.replace %root with (%inputOperand)
  }
}

// is lowered to the following:
module {
  // The matcher function takes the root operation as an input.
  func @matcher(%arg0: !pdl.operation) {
    pdl_interp.check_operation_name of %arg0 is "foo.op" -> ^bb2, ^bb1
  ^bb1:
    pdl_interp.return
  ^bb2:
    pdl_interp.check_operand_count of %arg0 is 1 -> ^bb3, ^bb1
  ^bb3:
    pdl_interp.check_result_count of %arg0 is 1 -> ^bb4, ^bb1
  ^bb4:
    %0 = pdl_interp.get_operand 0 of %arg0
    pdl_interp.is_not_null %0 : !pdl.value -> ^bb5, ^bb1
  ^bb5:
    %1 = pdl_interp.get_result 0 of %arg0
    pdl_interp.is_not_null %1 : !pdl.value -> ^bb6, ^bb1
  ^bb6:
    // This operation corresponds to a successful pattern match.
    pdl_interp.record_match @rewriters::@rewriter(%0, %arg0 : !pdl.value, !pdl.operation) : benefit(1), loc([%arg0]), root("foo.op") -> ^bb1
  }
  module @rewriters {
    // The inputs to the rewriter from the matcher are passed as arguments.
    func @rewriter(%arg0: !pdl.value, %arg1: !pdl.operation) {
      pdl_interp.replace %arg1 with(%arg0)
      pdl_interp.return
    }
  }
}
```
Differential Revision: https://reviews.llvm.org/D84580	2020-10-26 18:01:06 -07:00
Ulysse Beaugnon	db4863ffd1	[MLIR] Fix AttributeInterface declaration. Substitutes `Type` by `Attribute` in the declaration of AttributeInterface. It looks like the code was written by copy-pasting the definition of TypeInterface, but the substitution of Type by Attribute was missing at some places. Reviewed By: rriddle, ftynse Differential Revision: https://reviews.llvm.org/D90138	2020-10-26 23:41:39 +01:00
Alex Zinenko	03e6f40cdb	[mlir] Do not print back 0 alignment in LLVM dialect 'alloca' op The alignment attribute in the 'alloca' op treats the '0' value as 'unset'. When parsing the custom form of the 'alloca' op, ignore the alignment attribute if its value is '0' instead of actually creating it and producing a textually slightly different yet semantically equivalent form in the output. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D90179	2020-10-26 23:19:20 +01:00
Lei Zhang	f52b4a65f0	[mlir] NFC: properly align IR in comments Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D90164	2020-10-26 17:58:00 -04:00
Thomas Raoux	bd07be4f3f	[mlir][vector] Update doc strings for insert_map/extract_map and fix insert_map semantic Based on discourse discussion, fix the doc string and remove examples with wrong semantic. Also fix insert_map semantic by adding missing operand for vector we are inserting into. Differential Revision: https://reviews.llvm.org/D89563	2020-10-26 10:47:01 -07:00
Nicolas Vasilache	37e0fdd072	[mlir][Linalg] Add basic support for TileAndFuse on Linalg on tensors. This revision allows the fusion of the producer of input tensors in the consumer under a tiling transformation (which produces subtensors). Many pieces are still missing (e.g. support init_tensors, better refactor LinalgStructuredOp interface support, try to merge implementations and reuse code) but this still allows getting started. The greedy pass itself is just for testing purposes and will be extracted in a separate test pass. Differential revision: https://reviews.llvm.org/D89491	2020-10-26 17:19:08 +00:00
George Mitenkov	89808ce734	[MLIR][mlir-spirv-cpu-runner] A SPIR-V cpu runner prototype This patch introduces a SPIR-V runner. The aim is to run a gpu kernel on a CPU via GPU -> SPIRV -> LLVM conversions. This is a first prototype, so more features will be added in due time. - Overview The runner follows similar flow as the other runners in-tree. However, having converted the kernel to SPIR-V, we encode the bind attributes of global variables that represent kernel arguments. Then SPIR-V module is converted to LLVM. On the host side, we emulate passing the data to device by creating in main module globals with the same symbolic name as in kernel module. These global variables are later linked with ones from the nested module. We copy data from kernel arguments to globals, call the kernel function from nested module and then copy the data back. - Current state At the moment, the runner is capable of running 2 modules, nested one in another. The kernel module must contain exactly one kernel function. Also, the runner supports rank 1 integer memref types as arguments (to be scaled). - Enhancement of JitRunner and ExecutionEngine To translate nested modules to LLVM IR, JitRunner and ExecutionEngine were altered to take an optional (default to `nullptr`) function reference that is a custom LLVM IR module builder. This allows to customize LLVM IR module creation from MLIR modules. Reviewed By: ftynse, mravishankar Differential Revision: https://reviews.llvm.org/D86108	2020-10-26 09:09:29 -04:00
George Mitenkov	cae4067ec1	[MLIR][mlir-spirv-cpu-runner] A pass to emulate a call to kernel in LLVM This patch introduces a pass for running `mlir-spirv-cpu-runner` - LowerHostCodeToLLVMPass. This pass emulates `gpu.launch_func` call in LLVM dialect and lowers the host module code to LLVM. It removes the `gpu.module`, creates a sequence of global variables that are later linked to the variables in the kernel module, as well as a series of copies to/from them to emulate the memory transfer to/from the host or to/from the device sides. It also converts the remaining Standard dialect into LLVM dialect, emitting C wrappers. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D86112	2020-10-26 08:11:04 -04:00
Mehdi Amini	e7021232e6	Remove global dialect registration This has been deprecated for >1month now and removal was announced in: https://llvm.discourse.group/t/rfc-revamp-dialect-registration/1559/11 Differential Revision: https://reviews.llvm.org/D86356	2020-10-24 00:35:55 +00:00
Mehdi Amini	035a6b95c3	Fix a few warnings from GCC (NFC)	2020-10-24 00:35:55 +00:00
Mehdi Amini	6a72635881	Revert "Remove global dialect registration" This reverts commit `b22e2e4c6e`. Investigating broken builds	2020-10-23 21:26:48 +00:00
MaheshRavishankar	b6204b995e	[mlir][Vector] Introduce UnrollVectorOptions to control vector unrolling. The current pattern for vector unrolling takes the native shape to unroll to at pattern instantiation time, but the native shape might differ based on the types of the operand. Introduce a UnrollVectorOptions struct which allows for using a function that will return the native shape based on the operation. Move other options of unrolling like `filterConstraints` into this struct. Differential Revision: https://reviews.llvm.org/D89744	2020-10-23 13:52:26 -07:00
Mehdi Amini	b22e2e4c6e	Remove global dialect registration This has been deprecated for >1month now and removal was announced in: https://llvm.discourse.group/t/rfc-revamp-dialect-registration/1559/11 Differential Revision: https://reviews.llvm.org/D86356	2020-10-23 20:41:44 +00:00
Thomas Raoux	ea6a60a9a6	[mlir][vector] Add folder for ExtractStridedSliceOp Add folder for the case where ExtractStridedSliceOp source comes from a chain of InsertStridedSliceOp. Also add a folder for the trivial case where the ExtractStridedSliceOp is a no-op. Differential Revision: https://reviews.llvm.org/D89850	2020-10-23 12:18:09 -07:00
Sean Silva	1253c40727	[mlir] Add FuncOp::eraseResults I just found I needed this in an upcoming patch, and it seems generally useful to have. Differential Revision: https://reviews.llvm.org/D90000	2020-10-23 11:03:42 -07:00
Frederik Gossen	6d83e3b443	[MLIR] Extract buffer alias analysis for reuse Extract buffer alias analysis from buffer placement. Differential Revision: https://reviews.llvm.org/D89902	2020-10-23 13:23:32 +00:00
zhanghb97	448f25c86b	[mlir] Expose affine expression to C API This patch provides C API for MLIR affine expression. - Implement C API for methods of AffineExpr class. - Implement C API for methods of derived classes (AffineBinaryOpExpr, AffineDimExpr, AffineSymbolExpr, and AffineConstantExpr). Differential Revision: https://reviews.llvm.org/D89856	2020-10-23 20:06:32 +08:00
Julian Gross	0d1d363c51	[MLIR] Added PromoteBuffersToStackPass to convert heap- to stack-based allocations. Added optimization pass to convert heap-based allocs to stack-based allocas in buffer placement. Added the corresponding test file. Differential Revision: https://reviews.llvm.org/D89688	2020-10-23 12:02:25 +02:00
Lei Zhang	36ce915ac5	Revert "Revert "[mlir] Convert from Async dialect to LLVM coroutines"" This reverts commit `4986d5eaff` with proper patches to CMakeLists.txt: - Add MLIRAsync as a dependency to MLIRAsyncToLLVM - Add Coroutines as a dependency to MLIRExecutionEngine	2020-10-22 15:23:11 -04:00
Mehdi Amini	4986d5eaff	Revert "[mlir] Convert from Async dialect to LLVM coroutines" This reverts commit `a8b0ae3bdd` and commit `f8fcff5a9d`. The build with SHARED_LIBRARY=ON is broken.	2020-10-22 19:12:19 +00:00
Eugene Zhulenev	f8fcff5a9d	[mlir] Convert from Async dialect to LLVM coroutines Lower from Async dialect to LLVM by converting async regions attached to `async.execute` operations into LLVM coroutines (https://llvm.org/docs/Coroutines.html): 1. Outline all async regions to functions 2. Add LLVM coro intrinsics to mark coroutine begin/end 3. Use MLIR conversion framework to convert all remaining async types and ops to LLVM + Async runtime function calls All `async.await` operations inside async regions converted to coroutine suspension points. Await operation outside of a coroutine converted to the blocking wait operations. Implement simple runtime to support concurrent execution of coroutines. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D89292	2020-10-22 06:30:46 -07:00
Alexander Belyaev	461605c418	[mlir] Add MemRefReinterpretCastOp definition to Standard. Reuse most code for printing/parsing/verification from SubViewOp. https://llvm.discourse.group/t/rfc-standard-memref-cast-ops/1454/15 Differential Revision: https://reviews.llvm.org/D89720	2020-10-22 15:17:22 +02:00
Alexander Belyaev	d2ed2f16b8	[mlir] Add MemRefReshapeOp definition to Standard. https://llvm.discourse.group/t/rfc-standard-memref-cast-ops/1454/15 Differential Revision: https://reviews.llvm.org/D89784	2020-10-22 13:29:13 +02:00
rdzhabarov	281e0f3636	[mlir] Simplify DRR matching patterns with equal operands for operators where it's applicable. Added documentation. This https://reviews.llvm.org/D89254 diff introduced implicit matching between same name operands. Differential Revision: https://reviews.llvm.org/D89598	2020-10-21 21:31:39 +00:00
Stella Laurenzo	74a58ec9c2	[mlir][CAPI][Python] Plumb OpPrintingFlags to C and Python APIs. * Adds a new MlirOpPrintingFlags type and supporting accessors. * Adds a new mlirOperationPrintWithFlags function. * Adds a full featured python Operation.print method with all options and the ability to print directly to files/stdout in text or binary. * Adds an Operation.get_asm which delegates to print and returns a str or bytes. * Reworks Operation.__str__ to be based on get_asm. Differential Revision: https://reviews.llvm.org/D89848	2020-10-21 12:14:06 -07:00
Sean Silva	57b338c08a	[mlir][shape] Split out structural type conversions for shape dialect. A "structural" type conversion is one where the underlying ops are completely agnostic to the actual types involved and simply need to update their types. An example of this is shape.assuming -- the shape.assuming op and the corresponding shape.assuming_yield op need to update their types accordingly to the TypeConverter, but otherwise don't care what type conversions are happening. Also, the previous conversion code would not correctly materialize conversions for the shape.assuming_yield op. This should have caused a verification failure, but shape.assuming's verifier wasn't calling RegionBranchOpInterface::verifyTypes (which for reasons can't be called automatically as part of the trait verification, and requires being called manually). This patch also adds that verification. Differential Revision: https://reviews.llvm.org/D89833	2020-10-21 11:58:27 -07:00
Sean Silva	f0292ede9b	[mlir] Add structural type conversions for SCF dialect. A "structural" type conversion is one where the underlying ops are completely agnostic to the actual types involved and simply need to update their types. An example of this is scf.if -- the scf.if op and the corresponding scf.yield ops need to update their types accordingly to the TypeConverter, but otherwise don't care what type conversions are happening. To test the structural type conversions, it is convenient to define a bufferize pass for a dialect, which exercises them nicely. Differential Revision: https://reviews.llvm.org/D89757	2020-10-21 11:58:27 -07:00
Christian Sigg	3ac561d8c3	[mlir][gpu] Add lowering to LLVM for `gpu.wait` and `gpu.wait async`. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D89686	2020-10-21 18:20:42 +02:00
Christian Sigg	1c1803dbb0	[mlir][gpu] Add custom printer/parser for gpu.launch_func. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D89262	2020-10-21 18:19:00 +02:00
Frej Drejhammar	4b7dafd904	[mlir]: Clarify docs for external OpTrait::FunctionLike ops The documentation claims that an op with the trait FunctionLike has a single region containing the blocks that correspond to the body of the function. It then goes on to say that the absence of a region corresponds to an external function when, in fact, this is represented by a single empty region. This patch changes the wording in the documentation to match the implementation. Signed-off-by: Frej Drejhammar <frej.drejhammar@gmail.com> Co-authored-by: Frej Drejhammar <frej.drejhammar@gmail.com> Co-authored-by: Klas Segeljakt <klasseg@kth.se> Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D89868	2020-10-21 18:08:10 +02:00
Alex Zinenko	6ec3872845	[mlir] ODS: support TableGen dag objects to specify OpBuilder parameters Historically, custom builder specification in OpBuilder has been accepting the formal parameter list for the builder method as a raw string containing C++. While this worked well to connect the signature and the body, this became problematic when ODS needs to manipulate the parameter list, e.g. to inject OpBuilder or to trim default values when generating the definition. This has also become inconsistent with other method declarations, in particular in interface definitions. Introduce the possibility to define OpBuilder formal parameters using a TableGen dag similarly to other methods. Additionally, introduce a mechanism to declare parameters with default values using an additional class. This mechanism can be reused in other methods. The string-based builder signature declaration is deprecated and will be removed after a transition period. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D89470	2020-10-21 11:42:50 +02:00
Alex Zinenko	580915d6a2	[mlir] Expose Value hierarchy to Python bindings Values are ubiquitous in the IR, in particular block argument and operation results are Values. Define Python classes for BlockArgument, OpResult and their common ancestor Value. Define pseudo-container classes for lists of block arguments and operation results, and use these containers to access the corresponding values in blocks and operations. Differential Revision: https://reviews.llvm.org/D89778	2020-10-21 09:49:22 +02:00
Lei Zhang	f2a06875b6	Wrap CfgTraitsFor in namespace llvm to please GCC 5	2020-10-20 13:04:02 -04:00
Nicolai Hähnle	c0cdd22c72	Introduce CfgTraits abstraction The CfgTraits abstraction simplifies writing algorithms that are generic over the type of CFG, and enables writing such algorithms as regular non-template code that operates on opaque references to CFG blocks and values. Implementations of CfgTraits provide operations on the concrete CFG types, e.g. `IrCfgTraits::BlockRef` is `BasicBlock *`. CfgInterface is an abstract base class which provides operations on opaque types CfgBlockRef and CfgValueRef. Those opaque types encapsulate a `void *`, but the meaning depends on the concrete CFG type. For example, MachineCfgTraits -- for use with MachineIR in SSA form -- encodes a Register inside CfgValueRef. Converting between concrete references and opaque/generic ones is done by CfgTraits::{fromGeneric,toGeneric}. Convenience methods CfgTraits::{un}wrap{Iterator,Range} are available as well. Writing algorithms in terms of CfgInterface adds some overhead (virtual method calls, plus in some cases it removes the opportunity to inline iterators), but can be much more convenient since generic algorithms can be written as non-templates. This patch adds implementations of CfgTraits for all CFGs on which dominator trees are calculated, so that the dominator tree can be ported to this machinery. Only IrCfgTraits (LLVM IR) and MachineCfgTraits (Machine IR in SSA form) are complete, the other implementations are limited to the absolute minimum required to make the upcoming dominator tree changes work. 
v5: - fix MachineCfgTraits::blockdef_iterator and allow it to iterate over the instructions in a bundle - use MachineBasicBlock::printName v6: - implement predecessors/successors for all CfgTraits implementations - fix error in unwrapRange - rename toGeneric/fromGeneric into wrapRef/unwrapRef to have naming that is consistent with {wrap,unwrap}{Iterator,Range} - use getVRegDef instead of getUniqueVRegDef v7: - std::forward fix in wrapping_iterator - fix typos v8: - cleanup operators on CfgOpaqueType - address other review comments Change-Id: Ia75f4f268fded33fca11218a7d578c9aec1f3f4d Differential Revision: https://reviews.llvm.org/D83088	2020-10-20 13:50:52 +02:00
Alex Zinenko	39613c2cbc	[mlir] Expose Value hierarchy to C API The Value hierarchy consists of BlockArgument and OpResult, both of which derive Value. Introduce IsA functions and functions specific to each class, similarly to other class hierarchies. Also, introduce functions for pointer-comparison of Block and Operation that are necessary for testing and are generally useful. Reviewed By: stellaraccident, mehdi_amini Differential Revision: https://reviews.llvm.org/D89714	2020-10-20 09:39:08 +02:00
Sean Silva	7885bf8b78	[mlir][DialectConversion] Fix recursive `clone` calls. The framework was not tracking ops created in any regions of the cloned op. Differential Revision: https://reviews.llvm.org/D89668	2020-10-19 15:51:46 -07:00
Sean Silva	f4abd3ed6d	[mlir] Add std.dynamic_tensor_from_elements bufferization. It's unfortunate that this requires adding a dependency on scf dialect to std bufferization (and hence all of std transforms). This is a bit perilous. We might want a lib/Transforms/Bufferize/ with a separate bufferization library per dialect? Differential Revision: https://reviews.llvm.org/D89667	2020-10-19 15:51:45 -07:00
Alexander Belyaev	1e1dd13034	[mlir][nfc] Move BaseOpWithOffsetSizesAndStrides to the beginning of Ops.td. Move the class to where all base classes are defined. Also remove all the builders since they are defined in subclasses anyway. Differential Revision: https://reviews.llvm.org/D89620	2020-10-19 13:36:03 +02:00
Marcel Koester	1b1c61ff47	[mlir] Refactored BufferPlacement transformation. The current BufferPlacement transformation contains several concepts for hoisting allocations. However, more advanced hoisting techniques should not be integrated into the BufferPlacement transformation. Hence, this CL refactors the current BufferPlacement pass into three separate pieces: BufferDeallocation and BufferAllocation(Loop)Hoisting. Moreover, it extends the hoisting functionality by allowing to move allocations out of loops. Differential Revision: https://reviews.llvm.org/D87756	2020-10-19 12:52:16 +02:00
Alex Zinenko	e7c90418fc	[mlir] Use `let arguments =` syntax instead of inheritance in LLVM dialect LLVM dialect has been defining Op arguments by deriving the `Arguments` ODS class. This has arguably worse readability due to large indentation caused by multiple derivations, and is inconsistent with other ODS files. Use the `let arguments` form instead. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D89560	2020-10-19 11:16:04 +02:00
Kiran Chandramohan	a71a0d6d21	[OpenMP][MLIR] Fix for nested parallel regions Usage of nested parallel regions was not working correctly and led to assertion failures. The fix contains the following changes: 1) Don't set the insertion point in the body callback. 2) Save the continuation IP in a stack and set the branch to continuationIP at the terminator. Reviewed By: SouraVX, jdoerfert, ftynse Differential Revision: https://reviews.llvm.org/D88720	2020-10-19 08:45:50 +01:00
Christian Sigg	ad3ecc24b1	[mlir][gpu] NFC: Make room for more than one GPU rewrite pattern. AllReduceLowering is currently the only GPU rewrite pattern, but more are coming. This is a preparation change. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D89370	2020-10-19 07:52:47 +02:00
River Riddle	a5ea60456c	[mlir] Update SCCP and the Inliner to use SymbolTableCollection for symbol lookups This transforms the symbol lookups to O(1) from O(NM), greatly speeding up both passes. For a large MLIR module this shaved seconds off of the compilation time. Differential Revision: https://reviews.llvm.org/D89522	2020-10-16 12:08:48 -07:00
River Riddle	71eeb5ec4d	[mlir] Add a new SymbolUserOpInterface class The initial goal of this interface is to fix the current problems with verifying symbol user operations, but can extend beyond that in the future. The current problems with the verification of symbol uses are: * Extremely inefficient: Most current symbol users perform the symbol lookup using the slow O(N) string compare methods, which can lead to extremely long verification times in large modules. * Invalid/breaks the constraints of the verification pass: If the symbol reference is not flat (and even if it is flat in some cases), a verifier for an operation is not permitted to touch the referenced operation because it may be in the process of being mutated by a different thread within the pass manager. The new SymbolUserOpInterface exposes a method `verifySymbolUses` that will be invoked from the parent symbol table to allow for verifying the constraints of any referenced symbols. This method is passed a `SymbolTableCollection` to allow for O(1) lookups of any necessary symbol operation. Differential Revision: https://reviews.llvm.org/D89512	2020-10-16 12:08:48 -07:00
River Riddle	7bc7d0ac7a	[mlir] Optimize symbol related checks in SymbolDCE This revision contains two optimizations related to symbol checking: * Optimize SymbolOpInterface to only check for a name attribute if the operation is an optional symbol. This removes an otherwise unnecessary attribute lookup from a majority of symbols. * Add a new SymbolTableCollection class to represent a collection of SymbolTables. This allows for performing non-flat symbol lookups in O(1) time by caching SymbolTables for symbol table operations. This class is very useful for algorithms that operate on multiple symbol tables, either recursively or not. Differential Revision: https://reviews.llvm.org/D89505	2020-10-16 12:08:48 -07:00
River Riddle	f3df3b58e7	[mlir] Add a utility class, ThreadLocalCache, for storing non static thread local objects. (Note: This is a reland of D82597) This class allows for defining thread local objects that have a set non-static lifetime. The internals of the cache use a static thread_local map between the various different non-static objects and the desired value type. When a non-static object destructs, it simply nulls out the entry in the static map. This will leave an entry in the map, but erase any of the data for the associated value. The current use cases for this are in the MLIRContext, meaning that the number of items in the static map is ~1-2 which aren't particularly costly enough to warrant the complexity of pruning. If a use case arises that requires pruning of the map, the functionality can be added. This is especially useful in the context of MLIR for implementing thread-local caching of context level objects that would otherwise have very high lock contention. This revision adds a thread local cache in the MLIRContext for attributes, identifiers, and types to reduce some of the locking burden. This led to a speedup of several seconds when compiling a somewhat large mlir module. Differential Revision: https://reviews.llvm.org/D89504	2020-10-16 12:08:48 -07:00
ahmedsabie	7dff6b818b	[MLIR] Add idempotent trait folding This trait simply adds a fold of f(f(x)) = f(x) when an operation is labelled as idempotent Reviewed By: rriddle, andyly Differential Revision: https://reviews.llvm.org/D89421	2020-10-16 15:51:04 +00:00
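The fold rule named in the commit above, f(f(x)) = f(x), can be sketched in a few lines of Python. This is only an illustration of the rewrite, not MLIR's trait implementation: the `Op` class, its `idempotent` flag, and the function `fold_idempotent` are hypothetical names introduced here.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Op:
    """Hypothetical single-operand operation; `operand` is another Op or a value name."""
    name: str
    operand: Union["Op", str]
    idempotent: bool = False


def fold_idempotent(op):
    # f(f(x)) folds to the inner f(x) when f is marked idempotent
    # and the operand is produced by the same kind of operation.
    inner = op.operand
    if op.idempotent and isinstance(inner, Op) and inner.name == op.name:
        return inner
    return op


nested = Op("abs", Op("abs", "x", idempotent=True), idempotent=True)
folded = fold_idempotent(nested)

# An operation not marked idempotent is left untouched.
negation = Op("neg", Op("neg", "x"))
unfolded = fold_idempotent(negation)
```

In this sketch `folded` is the inner `abs` applied to `x`, and `unfolded` is the original nested `neg`, since `neg` carries no idempotent marker.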
Stella Laurenzo	6771b98c4e	[mlir][CAPI] Add mlirAttributeGetType function. * Also fixes the const-ness of the various DenseElementsAttr construction functions. * Both issues identified when trying to use the DenseElementsAttr functions. Differential Revision: https://reviews.llvm.org/D89517	2020-10-15 18:33:50 -07:00
Rob Suderman	2bf423b021	[mlir] RewriterGen NativeCodeCall matcher with ConstantOp matcher Added an underlying matcher for generic constant ops. This included a rewriter of RewriterGen to make variable use more clear. Differential Revision: https://reviews.llvm.org/D89161	2020-10-15 16:32:20 -07:00
Thomas Raoux	edbdea7466	[mlir][vector] Add unrolling patterns for Transfer read/write Adding unroll support for transfer read and transfer write operation. This allows to pick the ideal size for the memory access for a given target. Differential Revision: https://reviews.llvm.org/D89289	2020-10-15 15:17:36 -07:00
Sean Silva	ee491ac91e	[mlir] Add std.tensor_to_memref op and teach the infra about it The opposite of tensor_to_memref is tensor_load. - Add some basic tensor_load/tensor_to_memref folding. - Add source/target materializations to BufferizeTypeConverter. - Add an example std bufferization pattern/pass that shows how the materializations work together (more std bufferization patterns to come in subsequent commits). - In coming commits, I'll document how to write composable bufferization passes/patterns and update the other in-tree bufferization passes to match this convention. The populate* functions will of course continue to be exposed for power users. The naming on tensor_load/tensor_to_memref and their pretty forms are not very intuitive. I'm open to any suggestions here. One key observation is that the memref type must always be the one specified in the pretty form, since the tensor type can be inferred from the memref type but not vice-versa. With this, I've been able to replace all my custom bufferization type converters in npcomp with BufferizeTypeConverter! Part of the plan discussed in: https://llvm.discourse.group/t/what-is-the-strategy-for-tensor-memref-conversion-bufferization/1938/17 Differential Revision: https://reviews.llvm.org/D89437	2020-10-15 12:19:20 -07:00
Stephan Herhut	307124535f	[mlir][standard] Fix parsing of scalar subview and canonicalize Parsing of a scalar subview did not create the required static_offsets attribute. This also adds support for folding scalar subviews away. Differential Revision: https://reviews.llvm.org/D89467	2020-10-15 16:41:54 +02:00
MaheshRavishankar	6d9a72ec80	[mlir][SPIRV] Adding an attribute to capture configuration for cooperative matrix operations. Each hardware that supports SPV_C_CooperativeMatrixNV has a list of configurations that are supported natively. Add an attribute to specify the configurations supported to the `spv.target_env`. Reviewed By: antiagainst, ThomasRaoux Differential Revision: https://reviews.llvm.org/D89364	2020-10-14 22:33:11 -07:00
MaheshRavishankar	de2568aab8	[mlir][Linalg] Rethink fusion of linalg ops with reshape ops. The current fusion on tensors fuses reshape ops with generic ops by linearizing the indexing maps of the fused tensor in the generic op. This has some limitations - It only works for static shapes - The resulting indexing map has a linearization that would potentially prevent fusion later on (for ex. tile + fuse). Instead, try to fuse the reshape consumer (producer) with generic op producer (consumer) by expanding the dimensionality of the generic op when the reshape is expanding (folding). This approach conflicts with the linearization approach. The expansion method is used instead of the linearization method. Further refactoring that changes the fusion on tensors to be a collection of patterns. Differential Revision: https://reviews.llvm.org/D89002	2020-10-14 13:50:31 -07:00
Sean Silva	9a14cb53cb	[mlir][bufferize] Rename BufferAssignment* to Bufferize* Part of the refactor discussed in: https://llvm.discourse.group/t/what-is-the-strategy-for-tensor-memref-conversion-bufferization/1938/17 Differential Revision: https://reviews.llvm.org/D89271	2020-10-14 12:39:16 -07:00
Sean Silva	1cca0f323e	[mlir] Refactor code out of BufferPlacement.cpp Now BufferPlacement.cpp doesn't depend on Bufferize.h. Part of the refactor discussed in: https://llvm.discourse.group/t/what-is-the-strategy-for-tensor-memref-conversion-bufferization/1938/17 Differential Revision: https://reviews.llvm.org/D89268	2020-10-14 12:39:16 -07:00
Sean Silva	6b30fb7653	[mlir] Rename ShapeTypeConversion to ShapeBufferize Once we have tensor_to_memref ops suitable for type materializations, this pass can be split into a generic type conversion pattern. Part of the refactor discussed in: https://llvm.discourse.group/t/what-is-the-strategy-for-tensor-memref-conversion-bufferization/1938/17 Differential Revision: https://reviews.llvm.org/D89258	2020-10-14 12:39:16 -07:00
Sean Silva	9ca97cde85	[mlir] Linalg refactor for using "bufferize" terminology. Part of the refactor discussed in: https://llvm.discourse.group/t/what-is-the-strategy-for-tensor-memref-conversion-bufferization/1938/17 Differential Revision: https://reviews.llvm.org/D89261	2020-10-14 12:39:15 -07:00
rdzhabarov	008c0ea6a4	[DDR] Introduce implicit equality check for the source pattern operands with the same name. This CL allows user to specify the same name for the operands in the source pattern which implicitly enforces equality on operands with the same name. E.g., Pat<(OpA $a, $b, $a) ... > would create a matching rule for checking equality for the first and the last operands. Equality of the operands is enforced at any depth, e.g., OpA ($a, $b, OpB($a, $c, OpC ($a))). Example usage: Pat<(Reshape $arg0, (Shape $arg0)), (replaceWithValue $arg0)> Note, this feature only covers operands but not attributes. Current use cases are based on the operand equality and explicitly add the constraint into the pattern. Attribute equality will be worked out on the different CL. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D89254	2020-10-14 11:05:13 -07:00
Jacques Pienaar	f4ad76deb8	[mlir] More changes to avoid args now inserted. NFC Migrates a bit more from the old/to be deprecated form.	2020-10-14 10:47:45 -07:00
Irina Dobrescu	65b9b9aa50	Add Allocate Clause to MLIR Parallel Operation Definition Differential Revision: https://reviews.llvm.org/D87684	2020-10-14 17:13:48 +01:00
Eric Schweitz	3ea4ccd857	[mlir] expand the legal floating-point types in the LLVM IR dialect type check This patch adds a couple missing LLVM IR dialect floating point types to the legality check. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D89350	2020-10-14 06:56:26 -07:00
Nicolas Vasilache	af5be38a01	[mlir][Linalg] Make a Linalg CodegenStrategy available. This revision adds a programmable codegen strategy from linalg based on staged rewrite patterns. Testing is exercised on a simple linalg.matmul op. Differential Revision: https://reviews.llvm.org/D89374	2020-10-14 11:11:26 +00:00
Aden Grue	2b60291285	Fix typos in the documentation of dynamic values in subview ops Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D89338	2020-10-14 08:29:47 +00:00
Mehdi Amini	0b793c4be0	Revert "[DDR] Introduce implicit equality check for the source pattern operands with the same name." This reverts commit `7271c1bcb9`. This broke the gcc-5 build: /usr/include/c++/5/ext/new_allocator.h:120:4: error: no matching function for call to 'std::pair<const std::__cxx11::basic_string<char>, mlir::tblgen::SymbolInfoMap::SymbolInfo>::pair(llvm::StringRef&, mlir::tblgen::SymbolInfoMap::SymbolInfo)' { ::new((void *)__p) _Up(std::forward<_Args>(__args)...); } ^ In file included from /usr/include/c++/5/utility:70:0, from llvm/include/llvm/Support/type_traits.h:18, from llvm/include/llvm/Support/Casting.h:18, from mlir/include/mlir/Support/LLVM.h:24, from mlir/include/mlir/TableGen/Pattern.h:17, from mlir/lib/TableGen/Pattern.cpp:14: /usr/include/c++/5/bits/stl_pair.h:206:9: note: candidate: template<class ... _Args1, long unsigned int ..._Indexes1, class ... _Args2, long unsigned int ..._Indexes2> std::pair<_T1, _T2>::pair(std::tuple<_Args1 ...>&, std::tuple<_Args2 ...>&, std::_Index_tuple<_Indexes1 ...>, std::_Index_tuple<_Indexes2 ...>) pair(tuple<_Args1...>&, tuple<_Args2...>&, ^	2020-10-14 00:37:10 +00:00
John Demme	5fe53c4128	[MLIR] Add support for defining Types in tblgen Adds a TypeDef class to OpBase and backing generation code. Allows one to define the Type, its parameters, and printer/parser methods in ODS. Can generate the Type C++ class, accessors, storage class, per-parameter custom allocators (for the storage constructor), and documentation. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D86904	2020-10-14 00:32:18 +00:00
rdzhabarov	7271c1bcb9	[DDR] Introduce implicit equality check for the source pattern operands with the same name. This CL allows user to specify the same name for the operands in the source pattern which implicitly enforces equality on operands with the same name. E.g., Pat<(OpA $a, $b, $a) ... > would create a matching rule for checking equality for the first and the last operands. Equality of the operands is enforced at any depth, e.g., OpA ($a, $b, OpB($a, $c, OpC ($a))). Example usage: Pat<(Reshape $arg0, (Shape $arg0)), (replaceWithValue $arg0)> Note, this feature only covers operands but not attributes. Current use cases are based on the operand equality and explicitly add the constraint into the pattern. Attribute equality will be worked out on the different CL. Differential Revision: https://reviews.llvm.org/D89254	2020-10-13 16:05:14 -07:00
ahmedsabie	c0b3abd19a	[MLIR] Add a foldTrait() mechanism to allow traits to define folding and test it with an Involution trait This is the same diff as https://reviews.llvm.org/D88809/ except side effect free check is removed for involution and a FIXME is added until the dependency is resolved for shared builds. The old diff has more details on possible fixes. Reviewed By: rriddle, andyly Differential Revision: https://reviews.llvm.org/D89333	2020-10-13 21:26:21 +00:00
Stella Laurenzo	ad958f648e	[mlir][Python] Add missing capsule->module and Context.create_module. * Extends Context/Operation interning to cover Module as well. * Implements Module.context, Attribute.context, Type.context, and Location.context back-references (facilitated testing and also on the TODO list). * Adds method to create an empty Module. * Discovered missing in npcomp. Differential Revision: https://reviews.llvm.org/D89294	2020-10-13 13:10:33 -07:00

3560 Commits