llvm-project

Commit Graph

Author	SHA1	Message	Date
Abid Malik	c2fd2b5194	[MLIR][OpenMP]Basic OpenMP target operation This includes a basic implementation for the OpenMP target operation. Currently, the if, thread_limit, private, shared, device, and nowait clauses are included in this implementation. Co-authored-by: Kiran Chandramohan <kiran.chandramohan@arm.com> Reviewed By: ftynse, kiranchandramohan Differential Revision: https://reviews.llvm.org/D102816	2021-06-24 09:58:25 +01:00
Anthony Canino	3f429e82d3	Implement an scf.for range folding optimization pass. In cases where arithmetic (addi/muli) ops are performed on an scf.for loop's induction variable with a single use, we can fold those ops directly into the scf.for loop. For example, in the following code: ``` scf.for %i = %c0 to %arg1 step %c1 { %0 = addi %arg2, %i : index %1 = muli %0, %c4 : index %2 = memref.load %arg0[%1] : memref<?xi32> %3 = muli %2, %2 : i32 memref.store %3, %arg0[%1] : memref<?xi32> } ``` we can lift `%0` up into the scf.for loop range, as it is the only user of %i: ``` %lb = addi %arg2, %c0 : index %ub = addi %arg2, %arg1 : index scf.for %i = %lb to %ub step %c1 { %1 = muli %i, %c4 : index %2 = memref.load %arg0[%1] : memref<?xi32> %3 = muli %2, %2 : i32 memref.store %3, %arg0[%1] : memref<?xi32> } ``` Reviewed By: mehdi_amini, ftynse, Anthony Differential Revision: https://reviews.llvm.org/D104289	2021-06-24 01:07:28 +00:00
William S. Moses	71f6f7e00a	Revert "[MLIR][LLVM] Expose type translator from LLVM to MLIR Type" This reverts commit `5616a79398`.	2021-06-23 13:27:13 -04:00
William S. Moses	5616a79398	[MLIR][LLVM] Expose type translator from LLVM to MLIR Type This commit moves the type translator from LLVM to MLIR to a public header for use by external projects or other code Differential Revision: https://reviews.llvm.org/D104726	2021-06-23 13:22:50 -04:00
Nicolas Vasilache	f0d43a29e3	[mlir][LLVMIR] Fold ExtractValueOp coming from InsertValueOp Differential Revision: https://reviews.llvm.org/D104769	2021-06-23 10:04:24 +00:00
Tobias Gysi	f1844f15c1	[mlir][linalg] Change the FillOp library call signature. Adapt the FillOp library call signature to the updated operand order introduced in https://reviews.llvm.org/D10412. The patch reverts the special treatment of FillOp in LinalgToStandard. Differential Revision: https://reviews.llvm.org/D104360	2021-06-23 09:37:14 +00:00
Tobias Gysi	7cef24ee83	[mlir][linalg] Adapt the FillOp builder signature. Change the build operand order from output, value to value, output. The patch makes the argument order consistent with the pretty printed order updated by https://reviews.llvm.org/D104356. Differential Revision: https://reviews.llvm.org/D104359	2021-06-23 08:06:43 +00:00
Tobias Gysi	a21a6f51bc	[mlir][linalg] Change the pretty printed FillOp operand order. The patch changes the pretty printed FillOp operand order from output, value to value, output. The change is a follow up to https://reviews.llvm.org/D104121 that passes the fill value using a scalar input instead of the former capture semantics. Differential Revision: https://reviews.llvm.org/D104356	2021-06-23 07:03:00 +00:00
Vinayaka Bandishti	a873b6d466	[MLIR] Generalize detecting mods during slice computing During slice computation of affine loop fusion, detect one id as the mod of another id w.r.t. a constant in a more generic way. Restrictions on coefficients of the ids are removed. Also, information from the previously calculated ids is used for simplification of affine expressions, e.g., if `id1` = `id2`, `id_n - divisor * id_q - id_r + id1 - id2 = 0` is simplified to `id_n - divisor * id_q - id_r = 0`. If `c` is a non-zero integer, `c * id_n - c * divisor * id_q - c * id_r = 0` is simplified to `id_n - divisor * id_q - id_r = 0`. Reviewed By: bondhugula, ayzhuang Differential Revision: https://reviews.llvm.org/D104614	2021-06-23 12:29:34 +05:30
Vinayaka Bandishti	0e55112242	[NFC][PDL] Fix documentation typo, redundant test Correct a documentation typo, and delete a duplicate test in `pdl-to-pdl-interp-rewriter.mlir`. Reviewed By: pr4tgpt, bondhugula, rriddle Differential Revision: https://reviews.llvm.org/D104688	2021-06-23 12:27:12 +05:30
Jack Xia	4666f309df	Fix typo in Toy Tutorial Ch-4 multiple_transpose -> multiply_transpose	2021-06-23 03:33:34 +00:00
River Riddle	84bd07aff9	[mlir] Fix GCC5 build after D104516 GCC5 isn't able to implicitly capture `this` properly in an `auto` lambda.	2021-06-23 03:16:34 +00:00
River Riddle	c43e8c0eef	[mlir][OpDefGen] Don't emit attribute name getters when there are no attributes This avoids generating otherwise unnecessary methods.	2021-06-23 03:03:54 +00:00
River Riddle	0246dd3004	[mlir] Fix slicing-utils.mlir test after D104516 Remove the duplicate unnecessary CHECK labels at the bottom of the file.	2021-06-23 02:52:17 +00:00
River Riddle	6569cf2a44	[mlir] Add a ThreadPool to MLIRContext and refactor MLIR threading usage This revision refactors the usage of multithreaded utilities in MLIR to use a common thread pool within the MLIR context, in addition to a new utility that makes writing multi-threaded code in MLIR less error prone. Using a unified thread pool brings about several advantages: * Better thread usage and more control We currently use the static llvm threading utilities, which do not allow multiple levels of asynchronous scheduling (even if there are open threads). This is due to how the current TaskGroup structure works, which only allows one truly multithreaded instance at a time. By having our own ThreadPool we gain more control and flexibility over our job/thread scheduling, and in a followup can enable threading more parts of the compiler. * The static nature of TaskGroup causes issues in certain configurations Due to the static nature of TaskGroup, there have been quite a few problems related to destruction that have caused several downstream projects to disable threading. See D104207 for discussion on some related fallout. By having a ThreadPool scoped to the context, we don't have to worry about destruction and can ensure that any additional MLIR thread usage ends when the context is destroyed. Differential Revision: https://reviews.llvm.org/D104516	2021-06-23 01:29:24 +00:00
River Riddle	18465bcf4d	[mlir][NFC] Cleanup the MLIRTestReducer pass	2021-06-23 01:29:24 +00:00
River Riddle	36b538f583	[mlir][NFC] Move several small methods from .cpp to .h to allow more aggressive inlining Differential Revision: https://reviews.llvm.org/D104756	2021-06-23 00:52:26 +00:00
Aart Bik	b13cbf537f	[mlir][sparse] integration test for "simply dynamic" sparse output tensors Reviewed By: gussmith23 Differential Revision: https://reviews.llvm.org/D104583	2021-06-22 14:28:02 -07:00
River Riddle	6000749804	[mlir] Fix build on gcc-5 after D104167	2021-06-22 21:16:02 +00:00
Aart Bik	36b66ab9ed	[mlir][sparse] add support for "simply dynamic" sparse tensor expressions Slowly we are moving toward full support of sparse tensor outputs. First step was support for all-dense annotated "sparse" tensors. This step adds support for truly sparse tensors, but only for operations in which the values of a tensor change, but not the nonzero structure (this was referred to as "simply dynamic" in the [Bik96] thesis). Some background text was posted on discourse: https://llvm.discourse.group/t/sparse-tensors-in-mlir/3389/25 Reviewed By: gussmith23 Differential Revision: https://reviews.llvm.org/D104577	2021-06-22 13:37:32 -07:00
River Riddle	87e59e47e9	[mlir] Remove the Identifier ThreadLocalCache from MLIRContext This used to be important for reducing lock contention when accessing identifiers, but the cost of the cache can be quite large if parsing in a multi-threaded context. After D104167, the win of keeping a cache is not worth the cost. Differential Revision: https://reviews.llvm.org/D104737	2021-06-22 19:56:05 +00:00
River Riddle	e4e31e19bb	[mlir][OpGen] Cache Identifiers for known attribute names in AbstractOperation. Operations currently rely on the string name of attributes during attribute lookup/removal/replacement, in build methods, and more. This unfortunately means that some of the most used APIs in MLIR require string comparisons, additional hashing(+mutex locking) to construct Identifiers, and more. This revision remedies this by caching identifiers for all of the attributes of the operation in its corresponding AbstractOperation. Just updating the autogenerated usages brings up to a 15% reduction in compile time, greatly reducing the cost of interacting with the attributes of an operation. This number can grow even higher as we use these methods in handwritten C++ code. Methods for accessing these cached identifiers are exposed via `<attr-name>AttrName` methods on the derived operation class. Moving forward, users should generally use these methods over raw strings when an attribute name is necessary. Differential Revision: https://reviews.llvm.org/D104167	2021-06-22 19:56:05 +00:00
Butygin	82c1fb5750	[mlir] Fix invalid handling of AllocOp symbolOperands by SimplifyAllocConst. symbolOperands were completely ignored by SimplifyAllocConst. Also, slightly improved diagnostic message for verifyAllocLikeOp. Differential Revision: https://reviews.llvm.org/D104260	2021-06-22 15:39:53 +03:00
Stephan Herhut	bb6afc69b2	[mlir][memref] Add memref.copy operation As the name suggests, it copies from one memref to another. Differential Revision: https://reviews.llvm.org/D104657	2021-06-22 13:21:44 +02:00
Matthias Springer	060208b4c8	[mlir][NFC] Move SubTensorOp and SubTensorInsertOp to TensorDialect The main goal of this commit is to remove the dependency of Standard dialect on the Tensor dialect. * Rename SubTensorOp -> tensor.extract_slice, SubTensorInsertOp -> tensor.insert_slice. * Some helper functions are (already) duplicated between the Tensor dialect and the MemRef dialect. To keep this commit smaller, this will be cleaned up in a separate commit. * Additional dialect dependencies: Shape --> Tensor, Tensor --> Standard * Remove dialect dependencies: Standard --> Tensor * Move canonicalization test cases to correct dialect (Tensor/MemRef). Note: This is a fixed version of https://reviews.llvm.org/D104499, which was reverted due to a missing update to two CMakeLists.txt files. Differential Revision: https://reviews.llvm.org/D104676	2021-06-22 17:55:53 +09:00
Tobias Gysi	4882cacf12	[mlir][linalg] Adapt FillOp to use a scalar operand. Adapt the FillOp definition to use a scalar operand instead of a capture. This patch is a follow up to https://reviews.llvm.org/D104109. As the input operands are in front of the output operands the patch changes the internal operand order of the FillOp. The pretty printed version of the operation remains unchanged though. The patch also adapts the linalg to standard lowering to ensure the c signature of the FillOp remains unchanged as well. Differential Revision: https://reviews.llvm.org/D104121	2021-06-22 06:44:52 +00:00
Matthias Springer	2ba387a316	[mlir][linalg] Fusion of PadTensorOp Note: This commit (and previous ones) implements the same functionality as https://reviews.llvm.org/D103243 (which is abandoned). Differential Revision: https://reviews.llvm.org/D104683	2021-06-22 11:48:49 +09:00
Rob Suderman	ad1a9d629b	[mlir][tosa] Enable tosa.div for TosaMakeBroadcastable TosaMakeBroadcastable needs to include tosa.div, which was added later in the specification. Reviewed By: sjarus, NatashaKnk Differential Revision: https://reviews.llvm.org/D104157	2021-06-21 16:12:11 -07:00
Ahmed S. Taei	7e2d672a67	Add polynomial approximation for trigonometric sine and cosine functions The approximation relies on a range-reduced version y \in [0, pi/2]. An input x will have the property that sin(x) = sin(y), -sin(y), cos(y), or -cos(y), depending on which quadrant x is in, where sin(y) and cos(y) are approximated with a 5th degree polynomial (of x^2). As a result a single pattern can be used to compute the approximation for both sine and cosine. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D104582	2021-06-21 13:00:33 -07:00
thomasraoux	1244bca53f	[mlir][vector] Support distributing transfer op with permutation map Differential Revision: https://reviews.llvm.org/D104263	2021-06-21 12:56:08 -07:00
Mehdi Amini	60d97fb4cf	Revert "[mlir][NFC] Move SubTensorOp and SubTensorInsertOp to TensorDialect" This reverts commit `83bf801f5f`. This breaks the build with -DBUILD_SHARED_LIBS=ON	2021-06-21 16:39:24 +00:00
Matthias Springer	83bf801f5f	[mlir][NFC] Move SubTensorOp and SubTensorInsertOp to TensorDialect The main goal of this commit is to remove the dependency of Standard dialect on the Tensor dialect. * Rename ops: SubTensorOp --> ExtractTensorOp, SubTensorInsertOp --> InsertTensorOp * Some helper functions are (already) duplicated between the Tensor dialect and the MemRef dialect. To keep this commit smaller, this will be cleaned up in a separate commit. * Additional dialect dependencies: Shape --> Tensor, Tensor --> Standard * Remove dialect dependencies: Standard --> Tensor * Move canonicalization test cases to correct dialect (Tensor/MemRef). Differential Revision: https://reviews.llvm.org/D104499	2021-06-22 00:11:21 +09:00
Alexander Belyaev	2e972e366a	[mlir] Remove "getNumPayloadInductionVariables". This method always returns 0 after https://reviews.llvm.org/rG7cddf56d608f07b8e49f7e2eeb4a20082611adb6 Differential Revision: https://reviews.llvm.org/D104645	2021-06-21 16:38:47 +02:00
Vladislav Vinogradov	628e136738	[mlir][ODS] Fix copy ctor for generated Pass classes Redirect the copy ctor to the actual class instead of overwriting it with a `TypeID` based ctor. This allows the final Pass classes to have extra fields and logic for their copy. Reviewed By: lattner Differential Revision: https://reviews.llvm.org/D104302	2021-06-21 14:07:31 +03:00
Benjamin Kramer	596989da65	[mlir][Linalg] Silence warnings in Release builds. NFC. mlir/lib/Dialect/Linalg/Transforms/ComprehensiveBufferize.cpp:940:8: warning: unused variable 'opProducesRootRead' [-Wunused-variable] bool opProducesRootRead = ^ mlir/lib/Dialect/Linalg/Transforms/ComprehensiveBufferize.cpp:942:8: warning: unused variable 'opProducesRootWrite' [-Wunused-variable] bool opProducesRootWrite = ^ mlir/lib/Dialect/Linalg/Transforms/ComprehensiveBufferize.cpp:1498:11: warning: unused variable 'resultNumber' [-Wunused-variable] int64_t resultNumber = result.getResultNumber(); ^ mlir/lib/Dialect/Linalg/Transforms/ComprehensiveBufferize.cpp:1497:11: warning: unused variable 'operandNumber' [-Wunused-variable] int64_t operandNumber = operand.getOperandNumber(); ^ mlir/lib/Dialect/Linalg/Transforms/ComprehensiveBufferize.cpp:267:20: warning: unused function 'getInPlace' [-Wunused-function] static InPlaceSpec getInPlace(Value v) { ^	2021-06-21 12:56:41 +02:00
Matthias Springer	66f878cee9	[mlir][NFC] Remove Standard dialect dependency on MemRef dialect * Remove dependency: Standard --> MemRef * Add dependencies: GPUToNVVMTransforms --> MemRef, Linalg --> MemRef, MemRef --> Tensor * Note: The `subtensor_insert_propagate_dest_cast` test case in MemRef/canonicalize.mlir will be moved to Tensor/canonicalize.mlir in a subsequent commit, which moves over the remaining Tensor ops from the Standard dialect to the Tensor dialect. Differential Revision: https://reviews.llvm.org/D104506	2021-06-21 17:55:23 +09:00
Matthias Springer	225b960cfc	[mlir][linalg] Support low padding in subtensor(pad_tensor) lowering Differential Revision: https://reviews.llvm.org/D104591	2021-06-21 16:34:26 +09:00
Nicolas Vasilache	11e9a72dfc	[mlir][Linalg] NFC - Drop unused variable definition.	2021-06-21 07:08:02 +00:00
Nicolas Vasilache	e04533d38a	[mlir][Linalg] Introduce a BufferizationAliasInfo (6/n) This revision adds a BufferizationAliasInfo which maintains and updates information about which tensors will alias once bufferized, which bufferized tensors are equivalent to others and how to handle clobbers. Bufferization greedily tries to bufferize inplace by: 1. first trying to bufferize SubTensorInsertOp inplace, in reverse order (these are deemed the most expensive). 2. then trying to bufferize all non SubTensorOp / SubTensorInsertOp, in reverse order. 3. lastly trying to bufferize all SubTensorOp in reverse order. Reverse order is a heuristic that seems to work nicely because structured tensor codegen very often proceeds by: 1. take a subset of a tensor 2. compute on that subset 3. insert the result subset into the full tensor and yield a new tensor. BufferizationAliasInfo + equivalence sets + clobber analysis allows bufferizing nested subtensor/compute/subtensor_insert sequences inplace to a certain extent. To fully realize inplace bufferization, additional container-containee analysis will be necessary and is left for a subsequent commit. Differential revision: https://reviews.llvm.org/D104110	2021-06-21 06:59:42 +00:00
Fangrui Song	558ee5843f	[mlir] Fix -Wunused-but-set-variable in -DLLVM_ENABLE_ASSERTIONS=off build. NFC	2021-06-20 11:55:00 -07:00
Marius Brehler	876de062f9	[mlir] Add EmitC dialect This upstreams the EmitC dialect and the corresponding Cpp target, both initially presented with [1], from [2] to MLIR core. For the related discussion, see [3]. [1] https://reviews.llvm.org/D76571 [2] https://github.com/iml130/mlir-emitc [3] https://llvm.discourse.group/t/emitc-generating-c-c-from-mlir/3388 Co-authored-by: Jacques Pienaar <jpienaar@google.com> Co-authored-by: Simon Camphausen <simon.camphausen@iml.fraunhofer.de> Co-authored-by: Oliver Scherf <oliver.scherf@iml.fraunhofer.de> Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D103969	2021-06-19 09:51:17 +02:00
Matthias Springer	24199f534f	[mlir][linalg] Lower subtensor(pad_tensor) to pad_tensor(subtensor) Only high padding is supported at the moment. Low padding will be added in a separate commit. Differential Revision: https://reviews.llvm.org/D104357	2021-06-19 13:44:47 +09:00
Stella Laurenzo	4b9d28bd53	Partial rollback: Disable MLIR verifier parallelism. Deadlocks have been found in several downstream projects as noted on the original patch: https://reviews.llvm.org/D104207 Disabling pending full root cause analysis. Differential Revision: https://reviews.llvm.org/D104570	2021-06-18 21:58:03 +00:00
River Riddle	d1baf28954	[mlir] Add support to SourceMgrDiagnosticHandler for filtering FileLineColLocs This revision adds support for passing a functor to SourceMgrDiagnosticHandler for filtering out FileLineColLocs when emitting a diagnostic. More specifically, this can be useful in situations where there may be large CallSiteLocs with locations that aren't necessarily important/useful for users. For now the filtering support is limited to FileLineColLocs, but conceptually we could allow filtering for all locations types if a need arises in the future. Differential Revision: https://reviews.llvm.org/D103649	2021-06-18 21:12:28 +00:00
Sean Silva	7f7be19e6a	[mlir] Add notes about using external interface application. Differential Revision: https://reviews.llvm.org/D104489	2021-06-18 07:42:47 -07:00
Uday Bondhugula	18c8c934d8	[MLIR] Introduce scf.execute_region op Introduce the execute_region op that is able to hold a region which it executes exactly once. The op encapsulates a CFG within itself while isolating it from the surrounding control flow. Proposal discussed here: https://llvm.discourse.group/t/introduce-std-inlined-call-op-proposal/282 execute_region enables one to inline a function without lowering out all other higher level control flow constructs (affine.for/if, scf.for/if) to the flat list of blocks / CFG form. It thus allows the benefit of transforms on higher level control flow ops available in the presence of the inlined calls. The inlined calls continue to benefit from propagation of SSA values across their top boundary. Functions won’t have to remain outlined until later than desired. Abstractions like affine execute_regions, lambdas with implicit captures could be lowered to this without first lowering out structured loops/ifs or outlining. But two potential early use cases are of: (1) an early inliner (which can inline functions by introducing execute_region ops), (2) lowering of an affine.execute_region, which cleanly maps to an scf.execute_region when going from the affine dialect to the scf dialect. Differential Revision: https://reviews.llvm.org/D75837	2021-06-18 15:22:33 +05:30
Matthias Springer	6f665cd53d	[mlir][linalg] Fix PadTensorOp constructor Differential Revision: https://reviews.llvm.org/D104510	2021-06-18 17:35:08 +09:00
Gus Smith	22911585bb	[mlir][sparse] Add Matricized Tensor Times Khatri-Rao Product (MTTKRP) integration test See this documentation from taco: http://tensor-compiler.org/docs/data_analytics/index.html Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D104417	2021-06-17 16:53:12 +00:00
Alexander Belyaev	7cddf56d60	[mlir] Remove linalg.indexed_generic forward decl.	2021-06-17 16:04:06 +02:00
Benjamin Kramer	c878d03d60	[mlir] Split things dependent on LLVM_DEBUG into a .cpp file LLVM_DEBUG in headers is awkward, better avoid it. DEBUG_TYPE in a header results in a lot of macro redefinition warnings.	2021-06-17 15:06:40 +02:00
Alexander Belyaev	5b3cb31edb	[mlir][linalg] Purge linalg.indexed_generic. Differential Revision: https://reviews.llvm.org/D104449	2021-06-17 14:45:37 +02:00
Alex Zinenko	6b6338195c	[mlir] define a customized DEBUG_TYPE in InterfaceSupport.h	2021-06-17 13:24:32 +02:00
Alex Zinenko	d7e8912134	[mlir] Enable delayed registration of attribute/operation/type interfaces This functionality is similar to delayed registration of dialect interfaces. It allows external interface models to be registered before the dialect containing the attribute/operation/type interface is loaded, or even before the context is created. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D104397	2021-06-17 13:19:24 +02:00
Alex Zinenko	23cdf7b6ed	[mlir] separable registration of operation interfaces This is similar to attribute and type interfaces and mostly the same mechanism (FallbackModel / ExternalModel, ODS generation). There are minor differences in how the concept-based polymorphism is implemented for operations that are accounted for by ODS backends, and this essentially adds a test and exposes the API. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D104294	2021-06-17 12:00:31 +02:00
Alex Zinenko	a4f81b2054	[mlir] ODS: emit interface traits outside of the interface class ODS currently emits the interface trait class as a nested class inside the interface class. As an unintended consequence, the default implementations of interface methods have implicit access to static fields of the interface class, e.g. those declared in `extraClassDeclaration`, including private methods (!), or in the parent class. This may break the use of default implementations for external models, which are not defined in the interface class, and generally complicates the abstraction. Emit interface traits outside of the interface class itself to avoid accidental implicit visibility. Public static fields can still be accessed via explicit qualification with a class name, e.g., `MyOpInterface::staticMethod()` instead of `staticMethod`. Update the documentation to clarify the role of `extraClassDeclaration` in interfaces. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D104384	2021-06-17 10:25:35 +02:00
MaheshRavishankar	3ed3e438a7	[mlir] Move `memref.dim` canonicalization using `InferShapedTypeOpInterface` to a separate pass. Based on discussion in [this](https://llvm.discourse.group/t/remove-canonicalizer-for-memref-dim-via-shapedtypeopinterface/3641) thread, the pattern to resolve the `memref.dim` of a value that is a result of an operation that implements the `InferShapedTypeOpInterface` is moved to a separate pass instead of running it as a canonicalization pass. This allows shape resolution to happen when explicitly required, instead of automatically through a canonicalization. Differential Revision: https://reviews.llvm.org/D104321	2021-06-16 22:13:11 -07:00
Mehdi Amini	6a071e535f	Improve error reporting on pass registration collision (NFC) Differential Revision: https://reviews.llvm.org/D104430	2021-06-17 02:42:43 +00:00
Haruki Imai	5a55205bb3	[mlir] Fixed dynamic operand storage on big-endian machines. Many tests fail after D101969 (https://reviews.llvm.org/D101969) on big-endian machines. This patch changes the bit order of TrailingOperandStorage on big-endian machines. This patch works on System Z (Triple = "s390x-ibm-linux", CPU = "z14"). Signed-off-by: Haruki Imai <imaihal@jp.ibm.com> Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D104225	2021-06-16 18:38:08 -07:00
River Riddle	854ef875b9	[mlir-vscode] Add a link to mlir.llvm.org at the top of the vscode extension doc	2021-06-16 18:22:02 -07:00
River Riddle	d3c895a870	[mlir-lsp-server] Add an explicit blurb on where to send code contributions. When the vscode extension is published, it may be unclear how to contribute improvements to the extension. This revision makes it clear that contributions should follow the traditional LLVM guidelines.	2021-06-16 18:22:01 -07:00
River Riddle	fcecfcb92c	[mlir-lsp-server][Docs] Tweak the documentation for the visual studio code extension This revision updates the feature set, and cleans up the contributing section a little.	2021-06-16 17:58:53 -07:00
Mehdi Amini	a6e8ed7776	Improve error message on pass registration failures to include the faulty pass name	2021-06-17 00:24:50 +00:00
Mehdi Amini	b5e22e6d42	Migrate MLIR test passes to the new registration API Make sure they all define getArgument()/getDescription(). Depends On D104421 Differential Revision: https://reviews.llvm.org/D104426	2021-06-16 23:42:17 +00:00
Mehdi Amini	c8a3f561eb	Decouple registering passes from specifying argument/description This patch changes the (not recommended) static registration API from: static PassRegistration<MyPass> reg("my-pass", "My Pass Description."); to: static PassRegistration<MyPass> reg; And the explicit registration from: void registerPass("my-pass", "My Pass Description.", [] { return createMyPass(); }); to: void registerPass([] { return createMyPass(); }); It is expected that Pass implementations override the getArgument() method instead. This will ensure that the pipeline description can be printed and parsed back. Differential Revision: https://reviews.llvm.org/D104421	2021-06-16 23:41:50 +00:00
Robert David	7cfb7a67c5	[mlir] Make Type::print and Type::dump const	2021-06-16 15:31:20 -07:00
Gus Smith	f9a6d47c36	Add sparse matrix multiplication integration test Adds an integration test for the SPMM (sparse matrix multiplication) kernel, which multiplies a sparse matrix by a dense matrix, resulting in a dense matrix. This is just a simple modification on the existing matrix-vector multiplication kernel. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D104334	2021-06-16 13:20:20 -07:00
Uday Bondhugula	54384d1723	[MLIR] Make store to load fwd condition less conservative Make store to load fwd condition for -memref-dataflow-opt less conservative. Post dominance info is not really needed. Add additional check for common cases. Differential Revision: https://reviews.llvm.org/D104174	2021-06-17 01:26:38 +05:30
Prashant Kumar	51d43bbc46	[MLIR] Fix affine parallelize pass. To control the number of outer parallel loops, we need to process the outer loops first and hence pre-order walk fixes the issue. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D104361	2021-06-17 01:25:24 +05:30
Jacques Pienaar	0e760a0870	Add hook for dialect specializing processing blocks post inlining calls This allows for dialects to do different post-processing depending on operations with the inliner (my use case requires different attribute propagation rules depending on call op). This hook runs before the regular processInlinedBlocks method. Differential Revision: https://reviews.llvm.org/D104399	2021-06-16 12:53:21 -07:00
Mehdi Amini	066b320723	Use early exit and simplify a condition in Block SuccessorRange (NFC)	2021-06-16 19:42:41 +00:00
Mehdi Amini	a6559b42ce	Fix verifier crashing on some invalid IR In a region with multiple blocks the verifier will try to look for dominance and may get successor list for blocks, even though a block may be empty or does not end with a terminator. Differential Revision: https://reviews.llvm.org/D104411	2021-06-16 19:36:28 +00:00
Aart Bik	619bfe8bd2	[mlir][sparse] support new kind of scalar in sparse linalg generic op We have several ways of introducing a scalar invariant value into linalg generic ops (should we limit this somewhat?). This revision makes sure we handle all of them correctly in the sparse compiler. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D104335	2021-06-16 11:00:49 -07:00
Nicolas Vasilache	a1678987d9	[mlir] NFC - Drop newline from BlockArgument printing. Differential Revision: https://reviews.llvm.org/D104368	2021-06-16 13:28:43 +00:00
Alex Zinenko	61cf7c7258	[mlir] ODS: temporarily disable external model in presence of extra class declarations Default implementations of interfaces may rely on extra class declarations, which aren't currently generated in the external model, that in turn may rely on functions defined in the main Attribute/Type class, which wouldn't be available on the external model.	2021-06-16 12:18:56 +02:00
Aart Bik	ec8910c4ad	[mlir][sparse] integration test for all-dense annotated "sparse" output Reviewed By: gussmith23 Differential Revision: https://reviews.llvm.org/D104277	2021-06-15 15:44:11 -07:00
MaheshRavishankar	621d93d263	[mlir][SCF] Remove empty else blocks of `scf.if` operations. Differential Revision: https://reviews.llvm.org/D104273	2021-06-15 15:07:20 -07:00
Aart Bik	727a63e0d9	[mlir][sparse] allow all-dense annotated "sparse" tensor output This is a very careful start with allowing sparse tensors at the left-hand-side of tensor index expressions (viz. sparse output). Note that there is a subtle difference between non-annotated tensors (dense, remain n-dim, handled by classic bufferization) and all-dense annotated "sparse" tensors (linearized to 1-dim without overhead storage, bufferized by sparse compiler, backed by runtime support library). This revision gently introduces some new IR to facilitate annotated outputs, to be generalized to truly sparse tensors in the future. Reviewed By: gussmith23, bixia Differential Revision: https://reviews.llvm.org/D104074	2021-06-15 14:55:07 -07:00
Arpith C. Jacob	dd1992efd3	Support lowering of index-cast on vector types. The index cast operation accepts vector types. Implement its lowering in this patch. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D104280	2021-06-15 12:51:30 -07:00
Tobias Gysi	ff2ef4d684	[mlir][linalg] Adapt yaml codegen to support scalar parameters. The patch updates the C++ yaml code generation to support scalar operands as added in https://reviews.llvm.org/D104220. Differential Revision: https://reviews.llvm.org/D104224	2021-06-15 15:20:48 +00:00
Tobias Gysi	e893708aac	[mlir] Avoid GCC naming conflict. Rename AbstractType to AbstractTy to avoid a GCC naming conflict after https://reviews.llvm.org/D104234. Differential Revision: https://reviews.llvm.org/D104297	2021-06-15 14:48:30 +00:00
Adrian Kuegel	f112bd61eb	[mlir] Add SignOp to complex dialect. Also add a conversion pattern from Complex Dialect to Standard/Math Dialect. Differential Revision: https://reviews.llvm.org/D104292	2021-06-15 15:22:31 +02:00
Alex Zinenko	9b2a1bcf6f	[mlir] separable registration of attribute and type interfaces It may be desirable to provide an interface implementation for an attribute or a type without modifying the definition of said attribute or type. Notably, this allows to implement interfaces for attributes and types outside of the dialect that defines them and, in particular, provide interfaces for built-in types. Provide the mechanism to do so. Currently, separable registration requires the attribute or type to have been registered with the context, i.e. for the dialect containing the attribute or type to be loaded. This can be relaxed in the future using a mechanism similar to delayed dialect interface registration. See https://llvm.discourse.group/t/rfc-separable-attribute-type-interfaces/3637 Depends On D104233 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D104234	2021-06-15 15:20:27 +02:00
Tobias Gysi	662f9bff33	[mlir][linalg][python] Adapt the OpDSL to use scalars. The patch replaces the existing capture functionality by scalar operands that have been introduced by https://reviews.llvm.org/D104109. Scalar operands behave as tensor operands except for the fact that they are not indexed. As a result ScalarDefs can be accessed directly as no indexing expression is needed. The patch only updates the OpDSL. The C++ side is updated by a follow up patch. Differential Revision: https://reviews.llvm.org/D104220	2021-06-15 12:54:00 +00:00
Alex Zinenko	8ed8855958	[mlir] ODS: do not emit trailing const for static methods The generated C++ would be invalid. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D104233	2021-06-15 13:33:39 +02:00
Benjamin Kramer	cd93935146	[mlir][MemRef] Make sure types match when folding dim(reshape) Reshape can take integer types in addition to index, but dim always returns index. Differential Revision: https://reviews.llvm.org/D104287	2021-06-15 12:33:44 +02:00
Adrian Kuegel	662e074d90	[mlir] Add NegOp to complex dialect. Also add a lowering pattern from complex dialect to standard dialect. Differential Revision: https://reviews.llvm.org/D104284	2021-06-15 12:16:22 +02:00
Matthias Springer	b6ab4f1a8b	[mlir][linalg] Fold linalg.pad_tensor if src type == result type Fold PadTensorOp to source if source type and result type have static shape and are equal. Differential Revision: https://reviews.llvm.org/D103778	2021-06-15 17:25:12 +09:00
Tres Popp	6c7be41767	Support buffers in LinalgFoldUnitExtentDims This doesn't add any canonicalizations, but executes the same simplification on bufferSemantic linalg.generic ops by using linalg::ReshapeOp instead of linalg::TensorReshapeOp. Differential Revision: https://reviews.llvm.org/D103513	2021-06-15 08:22:22 +02:00
Sean Silva	853a614864	[mlir:OpFormatGen] Add Support for `$_ctxt` in the transformer. This is useful for "build tuple" type ops. In my case, in npcomp, I have an op: ``` // Result type is `!torch.tuple<!torch.tensor, !torch.tensor>`. torch.prim.TupleConstruct %0, %1 : !torch.tensor, !torch.tensor ``` and the context is required for the `Torch::TupleType::get` call (for the case of an empty tuple). The handling of these FmtContext's in the code is pretty ad-hoc -- I didn't attempt to rationalize it and just made a targeted fix. As someone unfamiliar with the code I had a hard time seeing how to more broadly fix the situation. Differential Revision: https://reviews.llvm.org/D104274	2021-06-14 18:02:55 -07:00
Hanhan Wang	e3bc4dbe8e	[mlir][Linalg] Make printer/parser have the same behavior. The parser of generic op did not recognize the output from mlir-opt when there are multiple outputs. One would wrap the result types with braces, and one would not. The patch makes the behavior the same. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D104256	2021-06-14 13:38:30 -07:00
Chris Lattner	a490ca8e01	[PassManager] Save compile time by not running the verifier unnecessarily. NFC This changes the pass manager to not rerun the verifier when a pass says it didn't change anything or after an OpToOpPassAdaptor, since neither of those cases need verification (and if the pass lied, then there will be much larger semantic problems than will be caught by the verifier). This maintains behavior in EXPENSIVE_CHECKS mode. Differential Revision: https://reviews.llvm.org/D104243	2021-06-14 11:43:52 -07:00
River Riddle	66e2708205	[mlir:Linalg] Populate LinalgOp patterns on LinalgDialect as opposed to each op Interface patterns are unique in that they get added to every operation that also implements that interface, given that they aren't tied to individual operations. When the same interface pattern gets added to multiple operations (such as the current behavior with Linalg), a reference to each of these patterns is added to every op (meaning that an operation will now have N references to effectively the same pattern). This revision fixes this problematic behavior in Linalg, and can bring upwards of a 25% reduction in compile time in Linalg based workloads. Differential Revision: https://reviews.llvm.org/D104160	2021-06-14 11:20:15 -07:00
Chris Lattner	ce77039596	[Verifier] Parallelize verification and dom checking. NFC. This changes the outer verification loop to not recurse into IsolatedFromAbove operations - instead return them up to a place where a parallel for loop can process them all in parallel. This also changes Dominance checking to happen on IsolatedFromAbove chunks of the region tree, which makes it easy to fold operation and dominance verification into a single simple parallel regime. This speeds up firtool in CIRCT from ~40s to 31s on a large testcase in -verify-each mode (the default). The .fir parser and module passes in particular benefit from this - FModule passes (roughly analogous to function passes) were already running the verifier in parallel as part of the pass manager. This allows the whole-module passes to verify their enclosed functions / FModules in parallel. -verify-each mode is still faster (26.3s on the same testcase), but we do expect the verifier to take some time. Differential Revision: https://reviews.llvm.org/D104207	2021-06-14 10:03:07 -07:00
Christian Sigg	abe501f240	[mlir] Mark gpu dialect illegal in gpu-to-llvm conversion Reviewed By: herhut, bondhugula Differential Revision: https://reviews.llvm.org/D104208	2021-06-14 17:45:44 +02:00
Denys Shabalin	c83e696732	Add AutomaticAllocationScope to memref.alloca_scope This change adds `AutomaticAllocationScope` to the memref.alloca_scope op. Additionally, it also clarifies that alloca_scope is conceptually a passthrough operation. Reviewed By: ftynse, bondhugula Differential Revision: https://reviews.llvm.org/D104227	2021-06-14 16:09:06 +02:00
Uday Bondhugula	88e4aae57d	[MLIR][NFC] Rename MemRefDataFlow -> AffineScalarReplacement NFC. Rename MemRefDataFlow -> AffineScalarReplacement and move to AffineTransforms library. Pass command line rename: -memref-dataflow-opt -> affine-scalrep. Update outdated pass documentation. Rationale: https://llvm.discourse.group/t/move-and-rename-memref-dataflow-opt-lib-transforms-lib-affine-dialect-transforms/3640 Differential Revision: https://reviews.llvm.org/D104190	2021-06-14 17:52:53 +05:30
Guillaume Chatelet	1d49e5352f	[llvm] remove Sequence::asSmallVector() There's no need for `toSmallVector()` as `SmallVector.h` already provides a `to_vector` free function that takes a range. Reviewed By: Quuxplusone Differential Revision: https://reviews.llvm.org/D104024	2021-06-14 08:28:05 +00:00
Adrian Kuegel	2116eadf09	[mlir] Remove traits that require vector type support from ops in Complex dialect. Actually, no vector types are supported so far. We should add the traits once the vector types are supported (e.g. ElementwiseMappable.traits). Instead add Elementwise trait to each op. Differential Revision: https://reviews.llvm.org/D104103	2021-06-14 08:30:22 +02:00
Tobias Gysi	046922e100	[mlir][linalg] Add support for scalar input operands. Up to now all structured op operands are assumed to be shaped. The patch relaxes this assumption and allows scalar input operands. In contrast to shaped operands scalar operands are not indexed and directly forwarded to the body of the operation. As all other operands, scalar operands are associated to an indexing map that in case of a scalar or a 0D-operand has an empty range. We will use scalar operands as a replacement for the capture mechanism. In contrast to captures, the approach ensures we can generate the function signature from the operand list and it prevents outdated capture values in case a transformation updates only the capture operand but not the hidden body of a named operation. Removing captures and updating existing operations such as linalg.fill is left for a later patch. The patch depends on https://reviews.llvm.org/D103891 and https://reviews.llvm.org/D103890. Differential Revision: https://reviews.llvm.org/D104109	2021-06-14 06:27:16 +00:00
Matthias Springer	ddda52ce3c	[mlir][linalg] Lower PadTensorOps with non-constant pad value The padding of such ops is not generated in a vectorized way. Instead, emit a tensor::GenerateOp. We may vectorize GenerateOps in the future. Differential Revision: https://reviews.llvm.org/D103879	2021-06-14 15:11:13 +09:00

7991 Commits