llvm-project

Commit Graph

Author	SHA1	Message	Date
Alex Zinenko	b84f95fe53	[mlir] Fix -Wunused-private-field in the Transform dialect Classes derived from TransformState::Extension may need access to the parent state.	2022-04-26 14:05:24 +02:00
Alex Zinenko	2b985a7ae8	[mlir] Add a title to the Transform Dialect doc	2022-04-26 13:04:41 +02:00
Krzysztof Drewniak	d35f7f254f	[mlir] Allow data flow analysis of non-control flow branch arguments This commit adds the visitNonControlFlowArguments method to DataFlowAnalysis, allowing analyses to provide lattice values for the arguments to a RegionSuccessor block that aren't directly tied to an op's inputs. For example, integer range interface can use this method to infer bounds for the step values in loops. This method has a default implementation that keeps the old behavior of assigning a pessimistic fixedpoint state to all such arguments. Reviewed By: Mogball, rriddle Differential Revision: https://reviews.llvm.org/D124021	2022-04-25 20:19:34 +00:00
jfurtek	c4caa90b15	[mlir][tblgen] Generate builders with inferred return types and unwrapped attributes This diff causes mlir-tblgen to generate code for an additional builder for an operation argument with a return type that can be inferred AND an attribute in the argument list can be "unwrapped." (Previously, the unwrapped build function was only generated for builders with explicit return types in separate or aggregate form.) As an example, this builder might be used by code that creates operations that implement the `SameOperandsAndResultType` interface. A test case was created. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D124043	2022-04-25 19:00:44 +00:00
Jeremy Furtek	a266a21000	[mlir][ods] Extend the EnumAttr tablegen class to support BitEnum attributes This diff allows the EnumAttr class to be used for bit enum attributes (in addition to previously supported integer enum attributes). While integer and bit enum attributes share many common implementation aspects, parsing bit enum values requires a separate implementation. This is accomplished by creating empty parser and printer strings in the EnumAttrInfo record, and having derived classes (specific to bit and integer enums) override with an appropriate parser/printer string. To support existing bit enums that may use a vertical bar separator, the parser is modified to support the | token. Tests were added for bit enums alongside integer enums. Future diffs for fastmath attributes in the arithmetic dialect will use these changes. (resubmission of earlier abandoned diff, updated to reflect subsequent changes in the repository) Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D123880	2022-04-25 19:00:00 +00:00
jfurtek	4e5dee2f30	[mlir][ods] Add tablegen field for concise printing of BitEnum attributes This diff introduces a tablegen field for bit enum attributes (`printBitEnumPrimaryGroups`) to control printing when the enum uses "group" cases. An example would be an implementation that uses a `fastmath` enum value as an alias for individual fastmath flags. The proposed field would allow printing of simply `fast` for the enum value, instead of the more verbose list that would include `fast` as well as the individual flags (e.g. `reassoc,nnan,ninf,nsz,arcp,contract,afn,fast`). Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D123871	2022-04-25 18:48:35 +00:00
Markus Böck	12a2716953	[mlir][LLVM] Support opaque pointers in `llvm.mlir.addressof` The verifier of llvm.mlir.addressof did not properly account for opaque pointers, that is, the pointer type not having an element type equal to the type of the referenced global or function. This patch fixes that by skipping the test for the element type if the pointer is opaque. Differential Revision: https://reviews.llvm.org/D124333	2022-04-25 12:23:16 +02:00
Alex Zinenko	4c807f2f57	[mlir][vector] insert `alloca`s outside of loops After https://reviews.llvm.org/D119743 added the `AutomaticAllocationScope` trait to loop-like constructs, the vector transfer full/partial splitting pass started inserting allocations for temporaries within the closest loop rather than the closest function (or other allocation scope such as `async.execute`). While this is correct as long as the lowered code takes care of automatic deallocation at the end of each iteration of the loop, this interferes with downstream optimizations that expect `alloca`s to be at the function level. Step over loops when looking for the closest allocation scope in vector transfer full/partial splitting pass thus restoring the original behavior. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D124366	2022-04-25 10:49:09 +02:00
Markus Böck	34312f1f0c	[mlir][LLVM] Support opaque pointers in data layout entries This is likely preferable to having it crash if one were to specify an opaque pointer type, and the actual element type is unused either way. Differential Revision: https://reviews.llvm.org/D124334	2022-04-25 09:14:33 +02:00
Nick Kreeger	4620032ee3	Revert "[mlir][sparse] Expose SparseTensor passes as enums instead of opaque numbers for vectorization and parallelization options." This reverts commit `d59cf901cb`. Build fails on NVIDIA Sparse tests: https://lab.llvm.org/buildbot/#/builders/61/builds/25447	2022-04-23 20:14:48 -05:00
Nick Kreeger	d59cf901cb	[mlir][sparse] Expose SparseTensor passes as enums instead of opaque numbers for vectorization and parallelization options. The SparseTensor passes currently use opaque numbers for the CLI, despite using an enum internally. This patch exposes the enums instead of numbered items that are matched back to the enum. Fixes GitHub issue #53389 Reviewed by: aartbik, mehdi_amini Differential Revision: https://reviews.llvm.org/D123876	2022-04-23 19:16:57 -05:00
Matthias Springer	48b8edac1c	[mlir][bufferize][NFC] Remove old references to Comprehensive Bufferize Differential Revision: https://reviews.llvm.org/D124324	2022-04-23 18:01:05 +09:00
Matthias Springer	940a3f6b3d	[mlir][bufferize][NFC] Clean up test cases Run `one-shot-bufferize` instead of `linalg-comprehensive-module-bufferize` and move some test cases to their respective dialects. Differential Revision: https://reviews.llvm.org/D124323	2022-04-23 18:00:55 +09:00
River Riddle	eda6f907d2	[mlir][NFC] Shift a bunch of dialect includes from the .h to the .cpp Now that dialect constructors are generated in the .cpp file, we can drop all of the dependent dialect includes from the .h file. Differential Revision: https://reviews.llvm.org/D124298	2022-04-23 01:09:29 -07:00
River Riddle	f3ebf828dc	[mlir] Generate Dialect constructors in .cpp instead of .h By generating in the .h file, we were forcing dialects to include a lot of additional header files because: * Fields of the dialect, e.g. std::unique_ptr<>, were unable to use forward declarations. * Dependent dialects are loaded in the constructor, requiring the full definition of each dependent dialect (which, depending on the file structure of the dialect, may include the operations). By generating in the .cpp we get much faster builds, and also better align with the rest of the code base. Fixes #55044 Differential Revision: https://reviews.llvm.org/D124297	2022-04-23 00:44:54 -07:00
Markus Böck	8ed2bd1e74	[mlir][LLVM] Fix `DataLayoutTypeInterface` for opaque pointers with non-default address space As a fallback mechanism, if no entry was supplied for a given address space, the size or alignment for a pointer type with the default address space is returned instead. This code currently crashes with opaque pointers, as it tries to construct a typed pointer type from the opaque pointer type, leading to a null pointer dereference when fetching the element type. This patch fixes the issue by handling the opaque pointer cases explicitly. Differential Revision: https://reviews.llvm.org/D124290	2022-04-23 00:10:31 +02:00
Markus Böck	bab3d3778d	[mlir][LLVM] Fix crash when using opaque pointers in function signatures Using opaque pointers in function signatures leads to an attempt to recursively convert all types, including sub types in LLVM types. In the case of LLVM pointers, it may not have a subtype aka element type if it is opaque which would then lead to a null pointer dereference. Differential Revision: https://reviews.llvm.org/D124291	2022-04-23 00:10:31 +02:00
Yi Zhang	1cddcfdc3c	Fix CollapsedLayoutMap for dim size 1 case This change fixes `CollapsedLayoutMap` for cases where the collapsed dims are size 1. The cases where the innermost dims are size 1 and noncontiguous can be represented by the strided form and therefore can be allowed. For such cases, the new stride should be that of the next entry in an association whose dimension is not size 1. If the next entry is dynamic, it's not possible to decide which stride to use at compilation time and the stride is set to dynamic. Differential Revision: https://reviews.llvm.org/D124137	2022-04-22 17:48:24 -04:00
Alex Zinenko	40a8bd635b	[mlir] use side effects in the Transform dialect Currently, the sequence of Transform dialect operations only supports a single use of each operand (verified by the `transform.sequence` operation). This was originally motivated by the need to guard against accessing a payload IR operation associated with a transform IR value after this operation has likely been rewritten by a transformation. However, not all Transform dialect operations rewrite payload IR, in particular the "navigation" operations such as `transform.pdl_match` do not. Introduce memory effects to the Transform dialect operations to describe their effect on the payload IR and the mapping between payload IR operations and transform IR values. Use these effects to replace the single-use rule, allowing repeated reads and disallowing use-after-free, where operations with the "free" effect are considered to "consume" the transform IR value and rewrite the corresponding payload IR operations. As an additional improvement, this enables code motion transformation on the transform IR itself. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D124181	2022-04-22 23:29:11 +02:00
Okwan Kwon	ee285faed2	[mlir] Do not bubble up extract slice when it is rank-reducing. The bubble up logic was written by assuming the slice operation is always a normal slice that outputs a tensor with the same rank. Differential Revision: https://reviews.llvm.org/D124283	2022-04-22 12:21:47 -07:00
cpillmayer	3e8560f890	[MLIR] Add option to print users of an operation as comment in the printer This allows printing the users of an operation as proposed in the git issue #53286. To be able to refer to operations with no result, these operations are assigned an ID in SSANameState. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D124048	2022-04-22 18:58:10 +00:00
Jacques Pienaar	9bae20b528	[mlir] Add shape.func Add shape func op for use (primarily) in shape function_library op. Allows setting default dialect for some simpler authoring. This is a minimal version of the ops needed. Differential Revision: https://reviews.llvm.org/D124055	2022-04-22 11:35:35 -07:00
Lei Zhang	6f28fd0bf7	[mlir][vector] Fold 1-element reduction into extract or arith ops If there is only one single element in the vector, then we can just extract the element to compute the final result. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D124129	2022-04-22 14:24:46 -04:00
Matthias Springer	d6dab38ae4	[mlir][bufferize][NFC] Add function boundary bufferization flag to BufferizationOptions This makes the API easier to use. Also allows us to check for incorrect API usage for easier debugging. Differential Revision: https://reviews.llvm.org/D124265	2022-04-23 01:11:37 +09:00
Matthias Springer	b0b19fae81	[mlir][bufferize][NFC] Rewrite op filter logic The `hasFilter` field is not needed. Instead, the filter accepts ops by default if no ALLOW rule was specified. Differential Revision: https://reviews.llvm.org/D124264	2022-04-23 00:25:24 +09:00
Lei Zhang	fc760c0260	[mlir][vector] Fold cancelling vector.shape_cast(vector.broadcast) vector.broadcast can inject all size one dimensions. If it's followed by a vector.shape_cast to the original type, we can cancel the op pair, like cancelling consecutive shape_cast ops. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D124094	2022-04-22 08:58:26 -04:00
Matthias Springer	494505f39f	[mlir][bufferize][NFC] Move SCF test cases to SCF dialect Differential Revision: https://reviews.llvm.org/D124249	2022-04-22 20:35:20 +09:00
Matthias Springer	e07a7fd5c0	[mlir][bufferization] Move ModuleBufferization to bufferization dialect * Move Module Bufferization to the bufferization dialect. The implementation is split into `OneShotModuleBufferize.cpp` and `FuncBufferizableOpInterfaceImpl.cpp`, so that the external model implementation can be easily moved to the func dialect in the future. * Split and clean up test cases. A few test cases are still remaining in Linalg and will be updated separately. * `linalg.inplaceable` is renamed to `bufferization.writable` to accurately reflect its current usage. * Attributes and their verifiers are moved from the Linalg dialect to the Bufferization dialect. * Expand documentation. * Add a new flag to One-Shot Bufferize to allow for function boundary bufferization. Differential Revision: https://reviews.llvm.org/D122229	2022-04-22 19:37:28 +09:00
Matthias Springer	bd1d87e3d1	[mlir][bufferization][NFC] Remove layout post processing step The layout postprocessing step was removed and is now part of the FuncOp bufferization. If the user specified a certain layout map for a tensor function arg, use that layout map directly when bufferizing the function signature. Previously, the bufferization used a generic layout map for every tensor function arg and then updated function signatures and CallOps in a separate step. Differential Revision: https://reviews.llvm.org/D122228	2022-04-22 18:49:47 +09:00
Matthias Springer	70777d967f	[mlir][bufferize][NFC] Move FuncOp bufferization to BufferizableOpInterface impl FuncOps are now less special. They must still be analyzed + bufferized in a certain order, but they are now bufferized same as other ops that have a region: Bufferize the op first (`bufferize` interface method), then bufferize the region body with other bufferization patterns. In the case of FuncOps, the function signature is bufferized together with ReturnOps. Similar to how, e.g., scf.for ops are bufferized together with scf.yield ops. This change is essentially a reimplementation of the FuncOp bufferization, but mostly NFC from a user's perspective (apart from error messages). This change is in preparation of moving the code to the bufferization dialect. Differential Revision: https://reviews.llvm.org/D123214	2022-04-22 18:47:12 +09:00
Matthias Springer	d820acdde1	[mlir][bufferize][NFC] Use custom walk instead of GreedyPatternRewriter The bufferization driver was previously using a GreedyPatternRewriter. This was problematic because bufferization must traverse ops top-to-bottom. The GreedyPatternRewriter was previously configured via `useTopDownTraversal`, but this was a hack; this API was just meant for performance improvements and should not affect the result of the rewrite. Differential Revision: https://reviews.llvm.org/D123618	2022-04-22 18:23:09 +09:00
jacquesguan	9b32886e7e	[mlir][Arithmetic] Use common constant fold function in RemSI and RemUI to cover splat. This patch replaces the current fold function with the common constant fold function in order to cover the case of constant splats. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D124236	2022-04-22 09:20:18 +00:00
jacquesguan	abc17a6751	[mlir][Arithmetic] Use matchPattern to simplify code. This patch replaces some code with matchPattern and moves it before the constant folder function in order to avoid redundant invocations. Differential Revision: https://reviews.llvm.org/D124235	2022-04-22 08:42:51 +00:00
Adrian Kuegel	a74e5a89b9	[mlir] Move isGuaranteedCollapsible to CollapseShapeOp (NFC). It seems more natural than to have it as a static method of ExpandShapeOp. Also fix a typo ("the the" -> "the"). Differential Revision: https://reviews.llvm.org/D124234	2022-04-22 10:31:25 +02:00
Will Dietz	bb8c8751cf	[MLIR] prefer /bin/sh over /bin/bash for simple test scripts These scripts do not appear to require bash, and while /bin/sh is not guaranteed either it's more commonly available. Fixes tests on NixOS and in certain sandbox build environments. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D124205	2022-04-21 20:25:17 -05:00
Amy Zhuang	5bd4bcfc04	[mlir] Modify SuperVectorize to generate select op->combiner op Insert the select op before the combiner op when vectorizing a reduction loop that needs a mask, so the vectorized reduction loop can pass isLoopParallel check and be transformed correctly in later passes. Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D124047	2022-04-21 17:09:13 -07:00
Mahesh Ravishankar	0c090dcc8a	[mlir][Linalg] Deprecate legacy reshape + generic op folding patterns. These patterns have been superseded by the fusion by collapsing patterns. Differential Revision: https://reviews.llvm.org/D124145	2022-04-21 22:25:23 +00:00
Chris Lattner	31c8abc3f1	[AsmParser/Printer] Rework sourceloc support for function arguments. When Location tracking support for block arguments was added, we discussed various approaches to threading support for this through function-like argument parsing. At the time, we added a parallel array of locations that could hold this. It turns out that that approach was verbose and error prone; roughly no one adopted it. This patch takes a different approach, adding an optional source locator to the UnresolvedOperand class. This fits much more naturally into the standard structure we use for representing locators, and gives all the function-like dialects locator support for free (e.g. see the test adding an example for the LLVM dialect). Differential Revision: https://reviews.llvm.org/D124188	2022-04-21 12:43:36 -07:00
Frederik Gossen	673e9828be	[MLIR] Fix iteration counting in greedy pattern application Previously, checking that a fix point is reached was counted as a full iteration. As this "iteration" never changes the IR, this seems counterintuitive. Differential Revision: https://reviews.llvm.org/D123641	2022-04-21 15:17:28 -04:00
Alex Zinenko	0edb262d91	[mlir] enable doc generation for the transform dialect	2022-04-21 18:52:08 +02:00
Fangrui Song	ae46b3e01f	Revert D121279 "[MLIR][GPU] Add canonicalizer for gpu.memcpy" This reverts commit `12f55cac69`. Causes miscompile. Will follow up with a reproducer.	2022-04-21 08:55:13 -07:00
Alex Zinenko	30f22429d3	[mlir] Connect Transform dialect to PDL This introduces a pair of ops to the Transform dialect that connect it to PDL patterns. Transform dialect relies on PDL for matching the Payload IR ops that are about to be transformed. For this purpose, it provides a container op for patterns, a "pdl_match" op and transform interface implementations that call into the pattern matching infrastructure. To enable the caching of compiled patterns, this also provides the extension mechanism for TransformState. Extensions allow one to store additional information in the TransformState and thus communicate it between different Transform dialect operations when they are applied. They can be added and removed when applying transform ops. An extension containing a symbol table in which the pattern names are resolved and a pattern compilation cache is introduced as the first client. Depends On D123664 Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D124007	2022-04-21 16:23:10 +02:00
Markus Böck	850b2c6b3c	[mlir] Fix `Region`s `takeBody` method if the region is not empty The current implementation of takeBody first clears the Region, before then taking ownership of the blocks of the other regions. The issue here, however, is that when clearing the region, it does not take into account references of operations to each other. In particular, blocks are deleted from front to back, and operations within a block are very likely to be deleted despite still having uses, causing an assertion to trigger [0]. This patch fixes that issue by simply calling dropAllReferences() before clearing the blocks. [0] `9a8bb4bc63/mlir/lib/IR/Operation.cpp (L154)` Differential Revision: https://reviews.llvm.org/D123913	2022-04-21 15:32:59 +02:00
Markus Böck	a41aaf166f	[mlir] Make `Region`s `cloneInto` multithread-readable Prior to this patch, `cloneInto` would do a simple walk over the blocks and contained operations and clone and map them as it encounters them. As a finishing touch it then remaps any successor and operands it has remapped during that process. This is generally fine, but sadly leads to a lot of uses of both operations and blocks from the source region, in the cloned operations in the target region. Those uses lead to writes in the use-def list of the operations, making `cloneInto` never thread safe. This patch reimplements `cloneInto` in three steps to avoid ever creating any extra uses on elements in the source region: * It first creates the mapping of all blocks and block operands * It then clones all operations to create the mapping of all operation results, but does not yet clone any regions or set the operands * After all operation results have been mapped, it now sets the operations' operands and clones their regions. That way it is now possible to call `cloneInto` from multiple threads if the Region or Operation is isolated-from-above. This allows creating copies of functions or using `mlir::inlineCall` with the same source region from multiple threads. In the general case, the method is thread-safe if through cloning, no new uses of `Value`s from outside the cloned Operation/Region are created. This can be ensured by mapping any outside operands via the `BlockAndValueMapping` to `Value`s owned by the caller thread. While I was at it, I also reworked the `clone` method of `Operation` a little bit and added a proper options class to avoid having a `cloneWithoutRegionsAndOperands` method, and be more extensible in the future. `cloneWithoutRegions` is now also a simple wrapper that calls `clone` with the proper options set. That way all the operation cloning code is now contained solely within `clone`. Differential Revision: https://reviews.llvm.org/D123917	2022-04-21 13:43:00 +02:00
Uday Bondhugula	f47a38f517	Add async dependencies support for gpu.launch op Add async dependencies support for gpu.launch op: this allows specifying a list of async tokens ("streams") as dependencies for the launch. Update the GPU kernel outlining pass lowering to propagate async dependencies from gpu.launch to gpu.launch_func op. Previously, a new stream was being created and destroyed for a kernel launch. The async deps support allows the kernel launch to be serialized on an existing stream. Differential Revision: https://reviews.llvm.org/D123499	2022-04-21 16:25:59 +05:30
Nimish Mishra	00c511b351	Added lowering support for atomic read and write constructs This patch adds lowering support for atomic read and write constructs. Also added is pointer modelling code to allow FIR pointer like types to be inferred and converted while lowering. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D122725 Co-authored-by: Kiran Chandramohan <kiran.chandramohan@arm.com>	2022-04-21 12:19:13 +05:30
River Riddle	0fd3a1ce60	[mlir][NFC] Update remaining textual references of un-namespaced `func` operations The special case parsing of operations in the `func` dialect is being removed, and operations will require the dialect namespace prefix.	2022-04-20 22:17:31 -07:00
River Riddle	cda6aa78f8	[mlir][NFC] Update textual references of `func` to `func.func` in Transform tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:30 -07:00
River Riddle	a4936cb3e8	[mlir][NFC] Update textual references of `func` to `func.func` in Pass/Target tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:30 -07:00
River Riddle	63237cddc1	[mlir][NFC] Update textual references of `func` to `func.func` in tool/runner tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:30 -07:00

11093 Commits