llvm-project

Commit Graph

Author	SHA1	Message	Date
Bixia Zheng	69edacbcf0	[mlir][sparse] Add support for complex.im and complex.re to the sparse compiler. Add a test. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D125834	2022-05-18 15:53:07 +00:00
Benjamin Kramer	e497871356	[mlir][complex] Add pow/sqrt/tanh ops and lowering to libm Lowering through libm gives us a baseline version, even though it's not going to be particularly fast. This is similar to what we do for some math dialect ops. Differential Revision: https://reviews.llvm.org/D125550	2022-05-18 14:03:14 +02:00
Frederik Gossen	6d36cfed3b	[MLIR] Make `parseDimensionListRanked` configurable wrt parsing a trailing `x` Differential Revision: https://reviews.llvm.org/D125797	2022-05-18 05:42:35 -04:00
River Riddle	aa568e082b	[mlir:GreedyDriver] Return WalkResult::skip after deleting a known constant This avoids use-after-free when trying to access the regions after visiting the operation.	2022-05-18 02:14:02 -07:00
rkayaith	7814b559bd	[GreedyPatternRewriter] Avoid reversing constant order The previous fix from `af371f9f98` only applied when using a bottom-up traversal. The change here applies the constant preprocessing logic to the top-down case as well. This resolves the issue with the canonicalizer pass still reordering constants, since it uses a top-down traversal by default. Fixes #51892 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D125623	2022-05-18 00:55:59 -07:00
rkayaith	ebad5fb309	[mlir][Canonicalize] Fix command-line options The canonicalize command-line options currently have no effect, as the pass is reading the pass options in its constructor, before they're actually initialized. This results in the default values of the options always being used. The change here moves the initialization of the `GreedyRewriteConfig` out of the constructor, so that it runs after the pass options have been parsed. Fixes #55466 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D125621	2022-05-18 00:28:18 -07:00
River Riddle	17e2e7b788	[mlir:PDLL] Don't append / for directory code completion This allows for properly using / as a trigger character, i.e. more easily allows chaining include directory completions.	2022-05-18 00:23:47 -07:00
River Riddle	6d4471efb0	[mlir:PDLL] Improve the location ranges of several expressions during parsing This allows for the range to encompass more of the source associated with the full expression, making diagnostics easier to see/tooling easier/etc.	2022-05-18 00:23:47 -07:00
River Riddle	e213e5a999	[mlir:PDLL] Drop space as a completion commit character This causes annoyances when attempting to use space as a trigger character (to start a different completion).	2022-05-18 00:23:47 -07:00
Groverkss	e00cbbec06	[MLIR][Presburger] Cleanup getMaybeValues in FACV This patch cleans up multiple getMaybeValue functions to take an IdKind instead of special functions. Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D125617	2022-05-18 09:44:14 +05:30
Groverkss	862b5a5233	[MLIR][Presburger] Attach values only to non-local identifiers in FAVC This patch changes `FlatAffineValueConstraints` to only allow attaching values to non-local identifiers. The reasoning for this change is: 1. Information attached to local identifiers can be lost since local identifiers can be removed for output size optimizations. 2. There are no current use cases for attaching values to Local identifiers. 3. Attaching a value to a local identifier does not make sense since a local identifier represents existential quantification. This patch also adds some additional asserts to the affected functions. Reviewed By: arjunp, bondhugula Differential Revision: https://reviews.llvm.org/D125613	2022-05-18 09:17:23 +05:30
Robert Suderman	9294a1e9a8	[mlir][tosa] Rework tosa.apply_scale lowering for 32-bit Added handling rounding behavior in 32-bits for when possible. This avoids kernel compilation generating scalarized code on platforms where 64-bit vectors are not available. As the 48-bit lowering requires 64-bit anyway, we added a full 64-bit solution simplifying the old path. Reviewed By: dcaballe, mravishankar Differential Revision: https://reviews.llvm.org/D125583	2022-05-17 16:01:12 -07:00
Matthias Springer	996834e681	[mlir][SCF] Fix scf.while bufferization Before this fix, the bufferization implementation made the incorrect assumption that the values yielded from the "before" region must match with the values yielded from the "after" region. Differential Revision: https://reviews.llvm.org/D125835	2022-05-18 00:35:50 +02:00
jfurtek	5c3b20520b	[mlir] Update LLVMIR Fastmath flags use of MLIR BitEnum functionality This diff updates the LLVMIR dialect Fastmath flags attribute to use recently added features of `BitEnum` attributes. Specifically, this diff uses the bit enum "group" case to represent the `fast` value as an alias for a combination of other values (`ninf`, `nnan`, ...), instead of using a separate integer value. (This is in line with LLVM's fastmath flags representation.) This diff also leverages the `printBitEnumPrimaryGroups` `tblgen` field for concise enum printing. The `BitEnum` features were developed for an upcoming diff that adds `fastmath` support to the arithmetic dialect. This diff simply applies some of the relevant new features to the LLVM dialect attribute. Reviewed By: ftynse, Mogball Differential Revision: https://reviews.llvm.org/D124720	2022-05-17 18:19:14 +00:00
Min-Yih Hsu	0b168a49bf	[mlir][LLVMIR] Use a new way to verify GEPOp indices Previously, GEPOp relies on `findKnownStructIndices` to check if a GEP index should be static. The truth is, `findKnownStructIndices` can only tell you a GEP index _might_ be indexing into a struct (which should use a static GEP index). But GEPOp::build and GEPOp::verify are falsely taking this information as a certain answer, which creates many false alarms like the one depicted in `test/Target/LLVMIR/Import/dynamic-gep-index.ll`. The solution presented here adopts a new verification scheme: When we're recursively checking the child element types of a struct type, instead of checking every child types, we only check the one dictated by the (static) GEP index value. We also combine "refinement" logics -- refine/promote struct index mlir::Value into constants -- into the very verification process since they have lots of logics in common. The resulting code is more concise and less brittle. We also hide GEPOp::findKnownStructIndices since most of the aforementioned logics are already encapsulated within GEPOp::build and GEPOp::verify, we found little reason for findKnownStructIndices (or the new findStructIndices) to be public. Differential Revision: https://reviews.llvm.org/D124935	2022-05-17 10:28:44 -07:00
Cullen Rhodes	6ad6b00f6a	[mlir] vim: add bf16 type	2022-05-17 13:28:31 +00:00
Cullen Rhodes	9bb0f4616a	[mlir][licm] Fix debug output with newlines	2022-05-17 13:28:30 +00:00
David Spickett	9e469ced42	[mlir][Tablegen-LSP] Don't link with llvm dylib This updates `5de12bb703` to not link with the dylib since that does not include the tablegen library. Should fix flang dylib build failures: https://lab.llvm.org/buildbot/#/builders/177/builds/5120	2022-05-17 11:03:01 +00:00
Alex Zinenko	1075c8ca49	[mlir] support isa/cast/dyn_cast<Operation >(operation) again The support for this has been added by `946311b893` but then ignored by `bc22b5c9a2`. This enables one to write generic code that can be instantiated for both specific operation classes and the common base class without specialization. Examples include functions that take/return ops, such as: ```mlir template <typename FnTy> void applyIf(FnTy &&lambda, ...) { for (Operation op : ...) { auto specific = dyn_cast<function_traits<FnTy>::template arg_t<0>>(op); if (specific) lambda(specific); } } ``` that would otherwise need to rely on template specialization to support lambdas that take specific operations and those that take `Operation *`. Differential Revision: https://reviews.llvm.org/D125543 Reviewed by: rriddle	2022-05-17 11:15:17 +02:00
jacquesguan	9b519f416b	[mlir][LLVMIR] Add support for translating insertelement/extractelement. Add support for translating llvm::InsertElement and llvm::ExtractElement. Differential Revision: https://reviews.llvm.org/D125674	2022-05-17 03:18:31 +00:00
wren romano	bfadd13df4	[mlir][sparse] Moved _mlir_ciface_newSparseTensor closer to its macros This is a followup to D125431, to keep from confusing the machinery that generates diffs (since combining these two changes into one would obfuscate the changes actually made in the previous differential). Depends On D125431 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D125432	2022-05-16 17:53:25 -07:00
River Riddle	6593886a35	[mlir][NFC] Fix the tags for various doc code blocks	2022-05-16 16:46:13 -07:00
River Riddle	1febbd67aa	[mlir][PDLL] Tweak the grammar to highlight partial code better This commit enables proper highlighting when inner statements are outside of a constraint/pattern/etc. This shouldn't really happen in actual code, but can happen in documentation (which uses the same syntax grammar).	2022-05-16 16:46:13 -07:00
wren romano	1313f5d307	[mlir][sparse] Restyling macros in the runtime library In addition to reducing code repetition, this also helps ensure that the various API functions follow the naming convention of mlir::sparse_tensor::primaryTypeFunctionSuffix (e.g., due to typos in the repetitious code). Depends On D125428 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D125431	2022-05-16 16:43:39 -07:00
River Riddle	9f39867b10	[mlir][NFC] Fix a few langref typos	2022-05-16 16:23:01 -07:00
River Riddle	5de12bb703	[mlir][Tablegen-LSP] Add support for a basic TableGen language server This follows the same general structure of the MLIR and PDLL language servers. This commits adds the basic functionality for setting up the server, and initially only supports providing diagnostics. Followon commits will build out more comprehensive behavior. Realistically this should eventually live in llvm/, but building in MLIR is an easier initial step given that: * All of the necessary LSP functionality is already here * It allows for proving out useful language features (e.g. compilation databases) without affecting wider scale tablegen users * MLIR has a vscode extension that can immediately take advantage of it Differential Revision: https://reviews.llvm.org/D125440	2022-05-16 16:03:51 -07:00
wren romano	7694442011	[mlir][sparse] Adding "final" keyword wherever appropriate This enables the compiler to perform devirtualization. And benchmarks indicate devirtualization can sometimes give considerable speedup. Depends On D122061 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D125428	2022-05-16 15:43:37 -07:00
wren romano	8cb332406c	[mlir][sparse] Enhancing sparse=>sparse conversion. Fixes: https://github.com/llvm/llvm-project/issues/51652 Depends On D122060 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D122061	2022-05-16 15:42:19 -07:00
River Riddle	e0c3b94c80	[mlir] Restrict dialect doc gen to a single dialect In the overwhelmingly majority of cases only one dialect is generated at a time anyways, and this restriction more easily catches user error when multiple dialects might be generated. We hit this semi-recently with the PDL dialect, and circt+other downstream users are also actively hitting this as well. Differential Revision: https://reviews.llvm.org/D125651	2022-05-16 15:35:07 -07:00
Alex Zinenko	18fc395909	[mlir] allow for re-registering extension ops Op registration mechanism does not allow for ops with the same name to be re-registered. This is okay to avoid name conflicts and debug double-registration, but may be problematic for dialect extensions that may get registered several times (unlike dialects that are deduplicated in the registry). When registering ops through the Transform dialect extension mechanism, check first if the ops are already registered and only complain in the case of repeated registration with the same name but different TypeID. Differential Revision: https://reviews.llvm.org/D125554	2022-05-17 00:03:40 +02:00
Matthias Springer	0b293bf045	[mlir][bufferize] Better propagation of errors Return immediately when an op bufferization patterns fails. Differential Revision: https://reviews.llvm.org/D125087	2022-05-16 23:17:01 +02:00
Mogball	67f0e8eec3	[mlir][ods] Fix verification of attribute + colon type ambiguity An attribute without a type builder followed by a colon in an assembly format is potentially ambiguous because the parser will read ahead to parse the colon-type and pass this as the type argument to the attribute's constructor. However, the previous verifier that checks for this ambiguity erroneously produces an error in the case of ``` let assemblyFormat = "( `(` $attr `)` )? `:`"; ``` This patch fixes the bug by implementing a checker that correctly handles all edge cases, including very strange assembly formats like: ``` let assemblyFormat = "( `(` $attr ) : (`>`)? attr-dict (`>` $a^) : (`<`)? `:`"; ``` Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D125445	2022-05-16 21:15:27 +00:00
River Riddle	a6cef03f66	[mlir] Remove the `type` keyword from type alias definitions This was carry over from LLVM IR where the alias definition can be ambiguous, but MLIR type aliases have no such problems. Having the `type` keyword is superfluous and doesn't add anything. This commit drops it, which also nicely aligns with the syntax for attribute aliases (which doesn't have a keyword). Differential Revision: https://reviews.llvm.org/D125501	2022-05-16 13:54:02 -07:00
Mogball	c8457eb532	[mlir][transforms] Add a topological sort utility and pass This patch adds a topological sort utility and pass. A topological sort reorders the operations in a block without SSA dominance such that, as much as possible, users of values come after their producers. The utility function sorts topologically the operation range in a given block with an optional user-provided callback that can be used to virtually break cycles. The toposort pass itself recursively sorts graph regions under the target op. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D125063	2022-05-16 20:47:30 +00:00
Mogball	0533253d81	[mlir][ods] Ignore AttributeSelfTypeParameter in assembly formats The attribute self type parameter is currently treated like any other attribute parameter in the assembly format. The self type parameter should be handled by the operation parser and printer and play no role in the generated parsers and printers of attributes. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D125724	2022-05-16 20:23:54 +00:00
Aart Bik	736c1b66ef	[mlir][sparse] introduce complex type to sparse tensor support This is the first implementation of complex (f64 and f32) support in the sparse compiler, with complex add/mul as first operations. Note that various features are still TBD, such as other ops, and reading in complex values from file. Also, note that the std::complex<float> had a bit of an ABI issue when passed as single argument. It is still TBD if better solutions are possible. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D125596	2022-05-16 13:17:36 -07:00
Robert Suderman	cb4a5eae1e	[mlir][tosa] Use math.ctlz intrinsic for tosa.clz We were custom counting per bit for the clz instruction. Math dialect now has an intrinsic to do this in one instruction. Migrated to this instruction and fixed a minor bug math-to-llvm for the intrinsic. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D125592	2022-05-16 11:31:35 -07:00
Jakub Kuderski	ffc3a0db00	[mlir:toy][NFC] Remove unnecessary trailing return type In this instance, the trailing return type does not improve readability as it repeats what is returned in the same line. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D125697	2022-05-16 13:46:00 -04:00
Matthias Springer	f287da8a15	[mlir][bufferize] Better user control of layout maps This changes replaces the `fully-dynamic-layout-maps` options (which was badly named) with two new options: * `unknown-type-conversion` controls the layout maps on buffer types for which no layout map can be inferred. * `function-boundary-type-conversion` controls the layout maps on buffer types inside of function signatures. Differential Revision: https://reviews.llvm.org/D125615	2022-05-16 18:06:13 +02:00
Mehdi Amini	08482fa058	Apply clang-tidy fixes for llvm-qualified-auto in LinalgInterfaces.cpp (NFC)	2022-05-16 13:58:49 +00:00
Mehdi Amini	59c3be748f	Apply clang-tidy fixes for performance-move-const-arg in SerializeToHsaco.cpp (NFC)	2022-05-16 13:58:49 +00:00
Matthias Springer	12e41d9264	[mlir][bufferize] Infer memref types when possible Instead of recomputing memref types from tensor types, try to infer them when possible. This results in more precise layout maps. Differential Revision: https://reviews.llvm.org/D125614	2022-05-16 02:02:08 +02:00
Min-Yih Hsu	3da65c4c0b	[mlir][LLVMIR] Add support for translating shufflevector Add support for translating llvm::ShuffleVectorInst Differential Revision: https://reviews.llvm.org/D125030	2022-05-14 15:14:40 -07:00
Min-Yih Hsu	b8f52c08f8	[mlir][LLVMIR] Add support for translating insert/extractvalue Add support for translating llvm::InsertValue and llvm::ExtractValue. Differential Revision: https://reviews.llvm.org/D125028	2022-05-14 15:14:40 -07:00
Arnab Dutta	16219f8c94	[MLIR][GPU] Add canonicalizer for gpu.memcpy Erase gpu.memcpy op when only uses of dest are the memcpy op in question, its allocation and deallocation ops. Reviewed By: bondhugula, csigg Differential Revision: https://reviews.llvm.org/D124257	2022-05-14 19:01:04 +05:30
Christian Sigg	0e3d1ca54a	[MLIR][GPU] NFC: simplify kernel operand accessor implementations. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D125112	2022-05-14 14:14:42 +02:00
Chris Lattner	5ac9d66209	[DenseElementsAttr] Teach isValidRawBuffer that 1-elt values are splats. We want getRaw() on tensors with i1 element type with a zero or 1 value to be treated as a splat. This fixes: https://github.com/llvm/llvm-project/issues/55440	2022-05-14 11:49:43 +01:00
Mogball	bf8049dc48	[mlir][ods] (NFC) remove erroneous trait	2022-05-14 00:37:34 +00:00
Mogball	70b69c54fa	[mlir] Rename Zero* traits to Zero*s Rename ZeroResult -> ZeroResults ZeroSuccessor -> ZeroSuccessors ZeroRegion -> ZeroRegions to be in line with ZeroOperands and grammatically correct.	2022-05-14 00:20:28 +00:00
Chris Lattner	27478872fd	[ParseResult] Fix warning in flang build, incorporate feedback from River. The warning caused build errors on a couple flang testers that are building with -Werror. The diagnostic change makes the generated error correct. This is a followup to https://reviews.llvm.org/D125549 Differential Revision: https://reviews.llvm.org/D125587	2022-05-13 23:30:27 +01:00
Chris Lattner	1d7b5cd5bf	[ParseResult] Mark this as LLVM_NODISCARD (like LogicalResult) and fix issues. There are a lot of cases where we accidentally ignored the result of some parsing hook. Mark ParseResult as LLVM_NODISCARD just like ParseResult is. This exposed some stuff to clean up, so do. Differential Revision: https://reviews.llvm.org/D125549	2022-05-13 16:28:53 +01:00
Denys Shabalin	89d4904541	[mlir] Fix declaration of nano time function in benchmark infra In `d4555698f8`, the name of nano precision timer function has changed from `nano_time` to `nanoTime`, but benchmarks were not updated to reflect that. This change addresses the discrepancy. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D125217	2022-05-13 13:22:18 +02:00
Groverkss	7b323af52a	[MLIR] Fix areIdsUnique in AffineStructures This patch fixes a bug in areIdsUnique where it ignores the [start, end] range. No test case is added since there are no use cases through IR from where it can be tested, and it is hard to create a unittest since we do not currently have Values in unittests. Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D124735	2022-05-13 16:15:13 +05:30
Tres Popp	1de73629aa	Add cmake dependency for TensorToLinalg	2022-05-13 12:33:14 +02:00
Tres Popp	1dce51b888	[mlir] Add TensorToLinalgPass This pass is to handle computationally complex operations like tensor.pad which are not simply lowered to the exact same operation in the memref dialect. Differential Revision: https://reviews.llvm.org/D125384	2022-05-13 12:17:22 +02:00
Alex Zinenko	7deed49ab9	[mlir] use dynamic sections in MLIR Doxygen Due to an apparent bug in the Doxygen version <1.8.16 used to generate documentation for MLIR, parts of the navigation (specifically, the lists of inherited methods for classes) are unusable due to dynsections.js missing from the output generated by Doxygen. Setting this flag makes Doxygen always produce the file.	2022-05-13 11:43:52 +02:00
Matthias Springer	e9fa559097	[mlir][sparse][NFC] Use RewriterBase/OpBuilder when possible Most functions do not need a PatternRewriter or ConversionPatternRewriter. Differential Revision: https://reviews.llvm.org/D125466	2022-05-13 11:37:26 +02:00
Matthias Springer	8f42939a07	[mlir][bufferize][NFC] Make getContiguousMemRefType a static function No need to expose this as public API anymore. Differential Revision: https://reviews.llvm.org/D125361	2022-05-13 11:27:43 +02:00
wren romano	753fe330c1	[mlir][sparse] Factoring out an enumerator over elements of SparseTensorStorage Work towards fixing: https://github.com/llvm/llvm-project/issues/51652 Depends On D122928 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D122060	2022-05-12 17:05:56 -07:00
Aart Bik	6f3c7dfb77	[mlir][sparse] add sparse sign integration test Implements a floating-point sign operator (using the new semi-ring ops) that accomodates +/-Inf and +/-NaN in consistent way. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D125494	2022-05-12 15:56:36 -07:00
River Riddle	80c28a400c	[mlir] Bump mlir-vscode to 0.0.7 The syntax highlighting for mlir has gotten a significant facelift, we also have some recent bug fixes for server launches.	2022-05-12 15:52:46 -07:00
River Riddle	86f5caeee9	[mlir] Significantly overhaul the textmate grammar The current grammar is really crusty, only supports a handful of cases, and is also out-of-date after various refactorings. This commit refactors the textmate grammar to handle significantly more cases, and now provides proper coloring for a majority of cases (including dialect attributes, operations, types, etc.) Differential Revision: https://reviews.llvm.org/D125458	2022-05-12 15:50:32 -07:00
River Riddle	86e1c2f097	[mlir] Fix pipeline-parsing.mlir on windows We shouldn't be making assumptions about the result of llvm::getTypeName, which may have different results for anonymous namespaces depending on the platform.	2022-05-12 13:40:16 -07:00
River Riddle	c2fb9c29b4	[mlir:Pass] Add support for op-agnostic pass managers This commit refactors the current pass manager support to allow for operation agnostic pass managers. This allows for a series of passes to be executed on any viable pass manager root operation, instead of one specific operation type. Op-agnostic/generic pass managers only allow for adding op-agnostic passes. These types of pass managers are extremely useful when constructing pass pipelines that can apply to many different types of operations, e.g., the default inliner simplification pipeline. With the advent of interface/trait passes, this support can be used to define FunctionOpInterface pass managers, or other pass managers that effectively operate on specific interfaces/traits/etc (see #52916 for an example). Differential Revision: https://reviews.llvm.org/D123536	2022-05-12 13:12:59 -07:00
Ashay Rane	5380e30e04	[mlir] translate memref.reshape ops that have static shapes This patch references code for translating memref.reinterpret_cast ops to add translation rules for memref.reshape ops that have a static shape argument. Since reshape ops don't have offsets, sizes, or strides, this patch simply sets the allocated and aligned pointers of the MemRef descriptor. Reviewed By: ftynse, cathyzhyi Differential Revision: https://reviews.llvm.org/D125039	2022-05-12 11:57:20 -07:00
Benjamin Kramer	434385ba41	[DenseElementAttr] Silence warning in -DNDEBUG builds. NFC.	2022-05-12 17:59:39 +02:00
Chris Lattner	3cce374ee6	Various improvements suggested by river NFC. Differential Revision: https://reviews.llvm.org/D125471	2022-05-12 16:18:23 +01:00
Chris Lattner	f21896f2c6	[DenseElementAttr] Simplify the public API for creating these. Instead of requiring the client to compute the "isSplat" bit, compute it internally. This makes the logic more consistent and defines away a lot of "elements.size()==1" in the clients. This addresses Issue #55185 Differential Revision: https://reviews.llvm.org/D125447	2022-05-12 16:18:23 +01:00
Thomas Raoux	d02f10d96d	[mlir][vector] Add lowering pattern for vector.warp_execute_on_lane_0 op Add lowering of the vector.warp_execute_on_lane_0 into scf.if plus memory transfer for the operands and yield values. This also add an integration test running on GPU warp. The same tests can be later re-used with different comment lines to tests distribution transformations. This is mostly from @springerm contribution. Differential Revision: https://reviews.llvm.org/D125430	2022-05-12 13:27:43 +00:00
Benjamin Kramer	303638248a	[mlir][linalg] Add lowering of named ops on complex numbers This lets linalg.dot and friends lower to a complex muladd using ops from the complex dialect. Differential Revision: https://reviews.llvm.org/D125461	2022-05-12 13:37:34 +02:00
Benjamin Kramer	27dad99622	[mlir][LLVM] Make the nested type restriction on complex constants less aggressive Complex nested in other types is perfectly fine, just nested structs aren't supported. Instead of checking whether there's nesting just check whether the struct we're dealing with is a complex number. Differential Revision: https://reviews.llvm.org/D125381	2022-05-12 11:47:01 +02:00
Daniil Dudkin	70c463efc8	[mlir][NFC] Fix `GpuKernelOutliningPass` copy constructor warnings 1. Call copy constructor of the base class 2. Assign value of the option directly Reviewed By: dcaballe, rriddle Differential Revision: https://reviews.llvm.org/D125101	2022-05-12 11:41:18 +03:00
Nikita Popov	f02716a806	[MLIR] Fix build without native arch D125214 split off a MLIRExecutionEngineUtils library that is used by MLIRGPUTransforms. However, currently the entire ExecutionEngine directory is skipped if the LLVM_NATIVE_ARCH target is not available. Move the check for LLVM_NATIVE_ARCH, such that MLIRExecutionEngineUtils always gets built, and only the JIT-related libraries are omitted without native arch. Differential Revision: https://reviews.llvm.org/D125357	2022-05-12 09:50:51 +02:00
Matthias Springer	82ea0d8b82	[mlir][bufferize] Support alloc hoisting across function boundaries This change integrates the BufferResultsToOutParamsPass into One-Shot Module Bufferization. This improves memory management (deallocation) when buffers are returned from a function. Note: This currently only works with statically-sized tensors. The generated code is not very efficient yet and there are opportunities for improvment (fewer copies). By default, this new functionality is deactivated. Differential Revision: https://reviews.llvm.org/D125376	2022-05-12 09:44:07 +02:00
Matthias Springer	2fe40c34ea	[mlir][bufferize] Fix op filter Bufferization has an optional filter to exclude certain ops from analysis+bufferization. There were a few remaining places in the codebase where the filter was not checked. Differential Revision: https://reviews.llvm.org/D125356	2022-05-12 09:33:07 +02:00
Matthias Springer	011f1b1c1f	[mlir][bufferize] Add helpers for templatized DENY filters We already have templatized ALLOW filters but the DENY filters were missing. Differential Revision: https://reviews.llvm.org/D125358	2022-05-12 09:18:21 +02:00
River Riddle	1155c1fe65	[mlir:Parser] Emit a better diagnostic when a custom operation is unknown When a custom operation is unknown and does not have a dialect prefix, we currently emit an error using the name of the operation with the default dialect prefix. This leads to a confusing error message, especially when operations get moved between dialects. For example, `func` was recently moved out of `builtin` and to the `func` dialect. The current error message we get is: ``` func @foo() ^ custom op 'builtin.func' is unknown ``` This could lead users to believe that there is supposed to be a `builtin.func`, because there used to be. This commit adds a better error message that does not assume that the operation is supposed to be in the default dialect: ``` func @foo() ^ custom op 'func' is unknown (tried 'builtin.func' as well) ``` Differential Revision: https://reviews.llvm.org/D125351	2022-05-11 22:54:44 -07:00
Mahesh Ravishankar	8be7e6f56a	[mlir][Linalg] Combine canonicalizers that deal with removing dead/redundant args. `linalg.generic` ops have canonicalizers that either remove arguments not used in the payload, or redundant arguments. Combine these and enhance the canonicalization to also remove results that have no use. This is effectively dead code elimination for Linalg ops. Differential Revision: https://reviews.llvm.org/D123632	2022-05-12 05:22:30 +00:00
Mogball	0ffef0c23b	[mlir][ods] (NFC) don't use std::function for map_range	2022-05-12 05:15:03 +00:00
Mogball	19906262c9	[mlir] (NFC) Use assembly format for test.graph_region	2022-05-12 04:19:25 +00:00
bzcheeseman	bc22b5c9a2	[MLIR][Operation] Simplify Operation casting, NFC We can simplify the code needed to implement dyn_cast/cast/isa support for MLIR operations with documented interfaces via the CastInfo structures. This will also provide an example of how to use CastInfo. Depends on D123901 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D124963	2022-05-12 00:17:01 -04:00
grosul1	a4b227c28a	[mlir] Fix loop unrolling: properly replace the arguments of the epilogue loop. Using "replaceUsesOfWith" is incorrect because the same initializer value may appear multiple times. For example, if the epilogue is needed when this loop is unrolled ``` %x:2 = scf.for ... iter_args(%arg1 = %c1, %arg2 = %c1) { ... } ``` then both epilogue's arguments will be incorrectly renamed to use the same result index (note #1 in both cases): ``` %x_unrolled:2 = scf.for ... iter_args(%arg1 = %c1, %arg2 = %c1) { ... } %x_epilogue:2 = scf.for ... iter_args(%arg1 = %x_unrolled#1, %arg2 = %x_unrolled#1) { ... } ```	2022-05-12 01:54:39 +00:00
Chris Lattner	86445e8c63	[AsmParser] Adopt emitWrongTokenError more, improving QoI This is a full audit of emitError calls, I took the opportunity to remove extranous parens and fix a couple cases where we'd generate multiple diagnostics for the same error. Differential Revision: https://reviews.llvm.org/D125355	2022-05-11 20:41:12 +01:00
River Riddle	5a9a438a54	[TableGen] Refactor TableGenParseFile to no longer use a callback Now that TableGen no longer relies on global Record state, we can allow for the client to own the RecordKeeper and SourceMgr. Given that TableGen internally still relies on the global llvm::SrcMgr, this method unfortunately still isn't thread-safe. Differential Revision: https://reviews.llvm.org/D125277	2022-05-11 11:55:33 -07:00
Matthias Springer	248e113e9f	[mlir][bufferize][NFC] Move helper functions to BufferizationOptions Move helper functions for creating allocs/deallocs/memcpys to BufferizationOptions. Differential Revision: https://reviews.llvm.org/D125375	2022-05-11 16:23:22 +02:00
Chris Lattner	34b6f206cb	[AsmParser] Improve error recovery again. Change the parsing logic to use StringRef instead of lower level char* logic. Also, if emitting a diagnostic on the first token in the file, we make sure to use that position instead of the very start of the file. Differential Revision: https://reviews.llvm.org/D125353	2022-05-11 08:25:36 +01:00
Chia-hung Duan	96e642652b	[mlir] Print some message for op-printing verification Before dump, Insetad of switching to generic form silently after verification failure. Print some debug logs to help identify why an op may be printed in a different way. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D125136	2022-05-10 22:48:47 +00:00
Thomas Raoux	15bcc36eed	[mlir][gpu] Move async copy ops to NVGPU and add caching hints Move async copy operations to NVGPU as they only exist on NV target and are designed to match ptx semantic. This allows us to also add more fine grain caching hint attribute to the op. Add hint to bypass L1 and hook it up to NVVM op. Differential Revision: https://reviews.llvm.org/D125244	2022-05-10 22:30:24 +00:00
Nicolas Vasilache	1f23211cb1	[mlir][SCF] Retire `cloneWithNewYields` helper function. This is now subsumed by `replaceLoopWithNewYields`. Differential Revision: https://reviews.llvm.org/D125309	2022-05-10 18:44:11 +00:00
Mahesh Ravishankar	567fd523bf	[mlir][SCF] Add utility method to add new yield values to a loop. The current implementation of `cloneWithNewYields` has a few issues - It clones the loop body of the original loop to create a new loop. This is very expensive. - It performs `erase` operations which are incompatible when this method is called from within a pattern rewrite. All erases need to go through `PatternRewriter`. To address these a new utility method `replaceLoopWithNewYields` is added which - moves the operations from the original loop into the new loop. - replaces all uses of the original loop with the corresponding results of the new loop - use a call back to allow caller to generate the new yield values. - the original loop is modified to just yield the basic block arguments corresponding to the iter_args of the loop. This represents a no-op loop. The loop itself is dead (since all its uses are replaced), but is not removed. The caller is expected to erase the op. Consequently, this method can be called from within a `matchAndRewrite` method of a `PatternRewriter`. The `cloneWithNewYields` could be replaces with `replaceLoopWithNewYields`, but that seems to trigger a failure during walks, potentially due to the operations being moved. That is left as a TODO. Differential Revision: https://reviews.llvm.org/D125147	2022-05-10 18:44:11 +00:00
Krzysztof Drewniak	814b605095	[mlir][AMDGPU] Add AMDGPU conversion patterns to ConvertGPUToROCDL This ensures that attributes such as the index bitwidth propagate correctly to the AMDGPUToROCDL patterns. Differential Revision: https://reviews.llvm.org/D125320	2022-05-10 16:49:11 +00:00
Ashay Rane	53ff0daa7e	[mlir] Fail early if AnalysisState::getBuffer() returns failure This patch updates calls to AnalysisState::getBuffer() so that we return early with a failure if the call does not succeed. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D125251	2022-05-10 08:08:38 -07:00
Krzysztof Drewniak	f1f05a91ca	[MLIR][AMDGPU] Add AMDGPU dialect, wrappers around raw buffer intrinsics By analogy with the NVGPU dialect, introduce an AMDGPU dialect for AMD-specific intrinsic wrappers. The dialect initially includes wrappers around the raw buffer intrinsics. On AMD GPUs, a memref can be converted to a "buffer descriptor" that allows more precise control of memory access, such as by allowing for out of bounds loads/stores to be replaced by 0/ignored without adding additional conditional logic, which is important for performance. The repository currently contains a limited conversion from transfer_read/transfer_write to Mubuf intrinsics, which are an older, deprecated intrinsic for the same functionality. The new amdgpu.raw_buffer_* ops allow these operations to be used explicitly and for including metadata such as whether the target chipset is an RDNA chip or not (which impacts the interpretation of some bits in the buffer descriptor), while still maintaining an MLIR-like interface. (This change also exposes the floating-point atomic add intrinsic.) Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D122765	2022-05-10 14:59:58 +00:00
Chris Lattner	ad3b358180	[MLIR Parser] Improve QoI for "expected token" errors A typical problem with missing a token is that the missing token is at the end of a line. The problem with this is that the error message gets reported on the start of the following line (which is where the next / invalid token is) which can be confusing. Handle this by noticing this case and backing up to the end of the previous line. Differential Revision: https://reviews.llvm.org/D125295	2022-05-10 15:44:17 +01:00
Adrian Kuegel	64c8574209	[mlir] Remove unused using declaration (NFC)	2022-05-10 12:58:01 +02:00
Nikita Popov	03ab30686d	[MLIR] Split off MLIRExecutionEngineUtils to fix libMLIR.so build (PR54242) Building libMLIR.so currently fails with: > /usr/bin/ld: /tmp/ccNzulEA.ltrans39.ltrans.o: in function `(anonymous namespace)::SerializeToHsacoPass::optimizeLlvm(llvm::Module&, llvm::TargetMachine&)': > /builddir/build/BUILD/llvm-project-15.0.0.src/mlir/lib/Dialect/GPU/Transforms/SerializeToHsaco.cpp:328: undefined reference to `mlir::makeOptimizingTransformer(unsigned int, unsigned int, llvm::TargetMachine*)' This is because MLIRGPUTransforms depends on MLIRExecutionEngine in `61bb2e4ea8/mlir/lib/Dialect/GPU/Transforms/SerializeToHsaco.cpp (L328)`, but MLIRExecutionEngine is marked as excluded from libMLIR.so. However, this code doesn't require the full execution engine: It only performs middle-end optimization, and does not need any of the JIT/codegen infrastructure. As such, split off a separate library MLIRExecutionEngineUtils, which only contains that part and is not excluded from libMLIR.so. Fixes https://github.com/llvm/llvm-project/issues/54242. Differential Revision: https://reviews.llvm.org/D125214	2022-05-10 10:17:52 +02:00
Stella Stamenova	784a5bccfd	[mlir] Fix python bindings build on Windows in Debug Currently, building mlir with the python bindings enabled on Windows in Debug is broken because pybind11, python and cmake don't like to play together. This change normalizes how the three interact, so that the builds can now run and succeed. The main issue is that python and cmake both make assumptions about which libraries are needed in a Windows build based on the flavor. - cmake assumes that a debug (or a debug-like) flavor of the build will always require pythonX_d.lib and provides no option/hint to tell it to use a different library. cmake does find both the debug and release versions, but then uses the debug library. - python (specifically pyconfig.h and by extension python.h) hardcodes the dependency on pythonX_d.lib or pythonX.lib depending on whether `_DEBUG` is defined. This is NOT transparent - it does not show up anywhere in the build logs until the link step fails with `pythonX_d.lib is missing` (or `pythonX.lib is missing`) - pybind11 tries to "fix" this by implementing a workaround - unless Py_DEBUG is defined, `_DEBUG` is explicitly undefined right before including python headers. This also requires some windows headers to be included differently, so while clever, this is a non-trivial workaround. mlir itself includes the pybind11 headers (which contain the workaround) AS WELL AS python.h, essentially always requiring both pythonX.lib and pythonX_d.lib for linking. cmake explicitly only adds one or the other, so the build fails. This change does a couple of things: - In the cmake files, explicitly add the release version of the python library on Windows builds regardless of flavor. Since Py_DEBUG is not defined, pybind11 will always require release and it will be satisfied - To satisfy python as well, this change removes any explicit inclusions of Python.h on Windows instead relying on the fact that pybind11 headers will bring in what is needed There are a few additional things that we could do but I rejected as unnecessary at this time: - define Py_DEBUG based on the CMAKE_BUILD_TYPE - this will mostly work, we'd have to think through multiconfig generators like VS, but it's possible. There doesn't seem to be a need to link against debug python at the moment, so I chose not to overcomplicate the build and always default to release - similar to above, but define Py_DEBUG based on the CMAKE_BUILD_TYPE as well as the presence of the debug python library (`Python3_LIBRARY_DEBUG`). Similar to above, this seems unnecessary right now. I think it's slightly better than above because most people don't actually have the debug version of python installed, so this would prevent breaks in that case. - similar to the two above, but add a cmake variable to control the logic - implement the pybind11 workaround directly in mlir (specifically in Interop.h) so that Python.h can still be included directly. This seems prone to error and a pain to maintain in lock step with pybind11 - reorganize how the pybind11 headers are included and place at least one of them in Interop.h directly, so that the header has all of its dependencies included as was the original intention. I decided against this because it really doesn't need pybind11 logic and it's always included after pybind11 is, so we don't necessarily need the python includes Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D125284	2022-05-09 19:46:47 -07:00
Mathieu Fehr	67d0bc27c0	[mlir][doc] Move documentation of extensible dialects Merge the documentation of the definition of extensible dialects with the definition of dialects. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D125200	2022-05-09 16:01:12 -07:00
River Riddle	867cd5007d	[mlir-LSP] Ensure existing documents are process synchronously This prevents races where we accidentally launched multiple servers.	2022-05-09 15:23:23 -07:00
Thomas Raoux	09fc685ce6	[mlir][nvvm] Add attribute to nvvm.cpAsyncOp to control l1 bypass Add attribute to be able to generate the intrinsic version of async copy generating a copy with l1 bypass. This correspond to cp.async.cg.shared.global in ptx. Differential Revision: https://reviews.llvm.org/D125241	2022-05-09 19:34:48 +00:00
Stella Stamenova	057863a9bc	[mlir] Fix build & test of mlir python bindings on Windows There are a couple of issues with the python bindings on Windows: - `create_symlink` requires special permissions on Windows - using `copy_if_different` instead allows the build to complete and then be usable - the path to the `python_executable` is likely to contain spaces if python is installed in Program Files. llvm's python substitution adds extra quotes in order to account for this case, but mlir's own python substitution does not - the location of the shared libraries is different on windows - if the type is not specified for numpy arrays, they appear to be treated as strings I've implemented the smallest possible changes for each of these in the patch, but I would actually prefer a slightly more comprehensive fix for the python_executable and the shared libraries. For the python substitution, I think it makes sense to leverage the existing %python instead of adding %PYTHON and instead add a new variable for the case when preloading is needed. This would also make it clearer which tests are which and should be skipped on platforms where the preloading won't work. For the shared libraries, I think it would make sense to pass the correct path and extension (possibly even the names) to the python script since these are known by lit and don't have to be hardcoded in the test at all. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D125122	2022-05-09 11:10:20 -07:00
Jakub Tucholski	167bbfcb9d	[mlir] Refactoring dialect and test code to use parseCommaSeparatedList Issue #55173 Reviewed By: lattner, rriddle Differential Revision: https://reviews.llvm.org/D124791	2022-05-09 12:36:54 -04:00
Jerry Wu	ad7c49bef7	[mlir][linalg] Fix padding size calculation for Conv2d ops. This patch fixed the padding size calculation for Conv2d ops when the stride > 1. It contains the changes below: - Use addBound to add constraint for AffineApplyOp in getUpperBoundForIndex. So the result value can be mapped and retrieved later. - Fixed the bound from AffineMinOp by adding as a closed bound. Originally the bound was added as an open upper bound, which results in the incorrect bounds when we multiply the values. For example: ``` %0 = affine.min affine_map<()[s0] -> (4, -s0 + 11)>()[iv0] %1 = affine.apply affine_map<()[s0] -> (s0 * 2)>()[%0] If we add the affine.min as an open bound, addBound will internally transform it into the close bound "%0 <= 3". The following sliceBounds will derive the bound of %1 as "%1 <= 6" and return the open bound "%1 < 7", while the correct bound should be "%1 <= 8". ``` - In addition to addBound, I also changed sliceBounds to support returning closed upper bound, since for the size computation, we usually care about the closed bounds. - Change the getUpperBoundForIndex to favor constant bounds when required. The sliceBounds will return a tighter but non-constant bounds, which can't be used for padding. The constantRequired option requires getUpperBoundForIndex to get the constant bounds when possible. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D124821	2022-05-09 08:45:37 -07:00
Ashay Rane	e287d647c6	[mlir] Add translation from tensor.reshape to memref.reshape This patch augments the `tensor-bufferize` pass by adding a conversion rule to translate ReshapeOp from the `tensor` dialect to the `memref` dialect, in addition to adding a unit test to validate the translation. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D125031	2022-05-09 17:45:07 +02:00
Benjamin Kramer	a48adc5658	[mlir][math] Promote (b)f16 to f32 when lowering to libm calls libm doesn't have overloads for the small types, so promote them to a bigger type and use the f32 function. Differential Revision: https://reviews.llvm.org/D125093	2022-05-09 11:59:55 +02:00
Christopher Bate	9879807393	[mlir][NvGpu] Fix nvgpu.mma.sync lowering to NVVM for f32, tf32 types Adds missing logic in the lowering from NvGPU to NVVM to support fp32 (in an accumulator operand) and tf32 (in multiplicand operand) types. Fixes logic in one of the helper functions for converting the result of a mma.sync operation with multiple 8x256bit output tiles, which is the case for f32 outputs. Differential Revision: https://reviews.llvm.org/D124533	2022-05-08 21:49:42 -06:00
Sam McCall	e571e1a6c3	Reland "[FuzzMutate] Split out FuzzerCLI library that doesn't depend on IR." This reverts commit `a1bb952e83`. I'd somehow missed updating llvm-yaml-parser-fuzzer, now fixed.	2022-05-07 13:49:54 +02:00
Aaron Ballman	a1bb952e83	Revert "[FuzzMutate] Split out FuzzerCLI library that doesn't depend on IR." This reverts commit `1c5e85b3da`. It broke a lot of bots with a link error: https://lab.llvm.org/buildbot/#/builders/171/builds/14222 https://lab.llvm.org/buildbot/#/builders/188/builds/13748 https://lab.llvm.org/buildbot/#/builders/109/builds/38127	2022-05-07 07:29:57 -04:00
Sam McCall	1c5e85b3da	[FuzzMutate] Split out FuzzerCLI library that doesn't depend on IR. All llvm-project fuzzers use this library to parse command-line arguments. Many of them don't deal with LLVM IR or modules in any way. Bundling those functions in one library forces build dependencies that don't need to be there. Among other things, this means check-clang-pseudo no longer depends on most of LLVM. Differential Revision: https://reviews.llvm.org/D125081	2022-05-07 12:11:51 +02:00
Mehdi Amini	25cd6fba98	Fix MLIR integration test after `a8308020` (`func.` prefix is required bythe parser now)	2022-05-07 09:09:24 +00:00
River Riddle	a8308020ac	[mlir] Remove special case parsing/printing of `func` operations This was leftover from when the standard dialect was destroyed, and when FuncOp moved to the func dialect. Now that these transitions have settled a bit we can drop these. Most updates were handled using a simple regex: replace `^( *)func` with `$1func.func` Differential Revision: https://reviews.llvm.org/D124146	2022-05-06 13:36:15 -07:00
Mehdi Amini	6a9c1029f8	Fix build with shared libs: add missing CMake dep to MLIR sparse pipeline	2022-05-06 20:20:03 +00:00
Mehdi Amini	b37d158f71	Apply clang-tidy fixes for bugprone-copy-constructor-init in TestPassManager.cpp (NFC)	2022-05-06 20:19:19 +00:00
Mehdi Amini	298d2fa1c5	Apply clang-tidy fixes for readability-identifier-naming in SparseTensorUtils.cpp (NFC)	2022-05-06 20:19:19 +00:00
Mehdi Amini	90c2af57af	Apply clang-tidy fixes for llvm-include-order in Merger.cpp (NFC)	2022-05-06 20:19:19 +00:00
Mehdi Amini	072e0aabbc	Enable the use of ThreadPoolTaskGroup in MLIR threading helper to enable nested parallelism The LLVM ThreadPool recently got the addition of the concept of ThreadPoolTaskGroup: this is a way to "partition" the threadpool into a group of tasks and enable nested parallelism through this grouping at every level of nesting. We make use of this feature in MLIR threading abstraction to fix a long lasting TODO and enable nested parallelism. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D124902	2022-05-06 19:40:22 +00:00
Mehdi Amini	c5ea8d509c	Apply clang-tidy fixes for llvm-else-after-return in Merger.cpp (NFC)	2022-05-06 19:38:03 +00:00
Mehdi Amini	061f253e13	Apply clang-tidy fixes for llvm-prefer-isa-or-dyn-cast-in-conditionals in OpenMPDialect.cpp (NFC)	2022-05-06 19:38:02 +00:00
Aart Bik	5b122a7310	[mlir][sparse] integration test for zero preserving math op Also fixes omission in lowering math ops that require lib support Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D125104	2022-05-06 10:42:33 -07:00
Nikita Popov	686bd6dd2b	[MLIR] Fix build with make https://reviews.llvm.org/D124075 causes MLIR to no longer build when using make rather than ninja, due to a tablegen-generated header being used before it is created. It seems that this is related to the use of LLVM_ENABLE_OBJLIB when using add_tablgen with a non-Ninja/Xcode generator. In that case an intermediate objlib target is generated. This patch fixes the issue by a) declaring dependencies in add_tablegen for mlir-pdll and b) making sure those dependencies are added to the objlib target. Differential Revision: https://reviews.llvm.org/D125010	2022-05-06 14:52:35 +02:00
Matthias Springer	9785eb1b98	[mlir][bufferize] Disallow adding new bufferizable ops during bufferization Ops that are created during the bufferization were not analyzed (when run with One-Shot Bufferize), and users should instead create memref ops directly. Futhermore, this fixes an issue where an op was erased (and put on the `erasedOps` list), but subsequently a new tensor op was created at the same memory location. This op was then not bufferized. Disallowing the creation of new tensor ops simplifies the bufferization and fixes such issues. Differential Revision: https://reviews.llvm.org/D125017	2022-05-06 18:41:49 +09:00
Matthias Springer	988748c077	[mlir][bufferize] Do not copy buffers with undefined contents Buffers with undefined contents (e.g., the result of an init_tensor) are no longer copied. Differential Revision: https://reviews.llvm.org/D125015	2022-05-06 17:31:01 +09:00
Matthias Springer	a5d09c6372	[mlir][scf] Implement BufferizableOpInterface for scf::WhileOp This follows the same implementation strategy as scf::ForOp and common functionality is extracted into helper functions. This implementation works well in cases where each yielded value (from either body/condition region) is equivalent to the corresponding bbArg of the parent block. In that case, each OpResult of the loop may be aliasing with the corresponding OpOperand of the loop (and with no other OpOperand). In the absence of said equivalence relationship, new buffer copies must be inserted, so that the aliasing OpOperand/OpResult contract of scf::WhileOp is honored. In essence, by yielding a newly allocated buffer, we can enforce the specified may-alias relationship. (Newly allocated buffers cannot alias with any OpOperands of the loop.) Differential Revision: https://reviews.llvm.org/D124929	2022-05-06 17:24:33 +09:00
Mehdi Amini	be310632d0	Apply clang-tidy fixes for llvm-else-after-return in OpenMPDialect.cpp (NFC)	2022-05-06 05:40:02 +00:00
Mehdi Amini	6e32535078	Apply clang-tidy fixes for bugprone-argument-comment in AffineOps.cpp (NFC)	2022-05-06 05:40:02 +00:00
Chengji Yao	e5a4cf6743	[mlir] Fix printer when it is a DenseElementsAttr of i1 A large DenseElementsAttr of i1could trigger a bug in printer/parser roundtrip. Ex. A DenseElementsAttr of i1 with 200 elements will print as Hex format of length 400 before the fix. However, when parsing the printed text, an error will be triggered. After fix, the printed length will be 50. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D122925	2022-05-05 16:56:43 -07:00
Aart Bik	952fa3018e	[mlir][sparse] add more zero-preserving unary ops to sparse compiler Although we now have semi-rings to deal with arbitrary ops, it is still good to convey zero-preserving semantics of ops to the sparse compiler. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D125043	2022-05-05 15:35:19 -07:00
River Riddle	6609c1cc59	[mlir] Add a better error message when failing to parse an attribute The fallback attribute parse path is parsing a Type attribute, but this results in a really unintuitive error message: `expected non-function type`, which doesn't really hint at tall that we were trying to parse an attribute. This commit fixes this by trying to optionally parse a type, and on failure emitting an error that we were expecting an attribute. Differential Revision: https://reviews.llvm.org/D124870	2022-05-05 15:06:11 -07:00
River Riddle	8bb5b657fe	[mlir:ExecutionEngine] Update use of getAddress now that lookup returns ExecutorAddr This was changed in `16dcbb53dc`	2022-05-05 14:24:32 -07:00
Stella Stamenova	d4555698f8	[mlir] Fix the names of exported functions The names of the functions that are supposed to be exported do not match the implementations. This is due in part to `cac7aabbd8`. This change makes the implementations and declarations match and adds a couple missing declarations. The new names follow the pattern of the existing `verify` functions where the prefix is maintained as `_mlir_ciface_` but the suffix follows the new naming convention. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D124891	2022-05-05 13:46:15 -07:00
Christopher Bate	22c6e7b277	[mlir][nvvm] Fix support for tf32 data type in mma.sync The NVVM dialect test coverage for all possible type/shape combinations in the `nvvm.mma.sync` op is mostly complete. However, there were tests missing for TF32 datatype support. This change adds tests for the one relevant shape/type combination. This uncovered a small bug in the op verifier, which this change also fixes. Differential Revision: https://reviews.llvm.org/D124975	2022-05-05 11:02:03 -06:00
Matthias Springer	e300682597	[mlir][scf][bufferize] Update verifyAnalysis error message The previous error message was technically incorrect. We do not compare equivalence of YieldOp operands and ForOp operands. Differential Revision: https://reviews.llvm.org/D124934	2022-05-05 16:56:50 +09:00
Matthias Springer	417e1c7d52	[mlir][scf][bufferize][NFC] Split ForOp bufferization into smaller functions This is in preparation of WhileOp bufferization, which reuses these functions. Differential Revision: https://reviews.llvm.org/D124933	2022-05-05 16:55:44 +09:00
Matthias Springer	f178c386f5	[mlir][scf][bufferize][NFC] Simplify verifyAnalysis implementation Differential Revision: https://reviews.llvm.org/D124928	2022-05-05 16:51:10 +09:00
Min-Yih Hsu	794c4218a6	[mlir][LLVMIR] Do not update instMap via assignments to entry references Inside processInstruction, we assign the translated mlir::Value to a reference previously taken from the corresponding entry in instMap. However, instMap (a DenseMap) might resize after the entry reference was taken, rendering the assignment useless since it's assigning to a dangling reference. Here is a (pseudo) snippet that shows the concept: ``` // inst has type llvm::Instruction * Value &v = instMap[inst]; ... // op is one of the operands of inst, has type llvm::Value * processValue(op); // instMap resizes inside processValue ... translatedValue = b.createOp<Foo>(...); // v is already a dangling reference at this point! // The following assignment is bogus. v = translatedValue; ``` Nevertheless, after we stop caching llvm::Constant into instMap, there is only one case that can cause processValue to resize instMap: If the operand is a llvm::ConstantExpr. In which case we will insert the derived llvm::Instruction into instMap. To trigger instMap to resize, which is a DenseMap, the threshold depends on the ratio between # of map entries and # of (hash) buckets. More specifically, it resizes if (# of map entries / # of buckets) >= 0.75. In this case # of map entries is equal to # of LLVM instructions, and # of buckets is the power-of-two upperbound of # of map entries. Thus, eventually in the attaching test case (test/Target/LLVMIR/Import/incorrect-instmap-assignment.ll), we picked 96 and 128 for the # of map entries and # of buckets, respectively. (We can't pick numbers that are too small since DenseMap used inlined storage for small number of entries). Therefore, the ConstantExpr in the said test case (i.e. a GEP) is the 96-th llvm::Value cached into the instMap, triggering the issue we're discussing here on its enclosing instruction (i.e. a load). This patch fixes this issue by calling `operator[]` everytime we need to update an entry. Differential Revision: https://reviews.llvm.org/D124627	2022-05-04 13:16:51 -07:00
Bixia Zheng	1cd13e6e98	[mlir][sparse][taco] Support more data types. Support int8, int16, int32 and int32. Also fix source code format in mlir_pytaco_utils.py. Add tests. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D124925	2022-05-04 10:05:20 -07:00
Alexander Belyaev	e8f7d019fc	[mlir] Add a flag to allow equivalent results. Differential Revision: https://reviews.llvm.org/D124931	2022-05-04 17:48:18 +02:00
Matthias Springer	b34ea97f55	[mlir][linalg][bufferize][NFC] Remove remaining Comprehensive Bufferize code This commit removes the Linalg Comprehensive Bufferize pass. Differential Revision: https://reviews.llvm.org/D124854	2022-05-04 17:19:44 +09:00
Matthias Springer	5f60c4825b	[mlir][linalg][bufferize][NFC] Make init_tensor elimination a separate pre-processing pass This commit decouples init_tensor elimination from the rest of the bufferization. Differential Revision: https://reviews.llvm.org/D124853	2022-05-04 17:17:27 +09:00
Matthias Springer	37a1473524	[mlir][bufferize] Allow in-place bufferization for writes to init_tensors in loops This commit relaxes the rules around ops that define a value but do not specify the tensor's contents. (The only such op at the moment is init_tensor.) When such a tensor is written in a loop, it should not cause out-of-place bufferization. Differential Revision: https://reviews.llvm.org/D124849	2022-05-04 16:43:43 +09:00
Marius Brehler	63aaf9a6e7	[mlir] Add missing CMake deps to mlir-pdll Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D124851	2022-05-04 06:17:13 +00:00
Aart Bik	2617f2f708	[mlir][sparse] fix build issue with unused local under opt builds Reviewed By: rdzhabarov Differential Revision: https://reviews.llvm.org/D124883	2022-05-03 14:55:32 -07:00
Aart Bik	1abcdc677c	[mlir][sparse] add missing types to from/to-MLIR conversion routines This will enable our usual set of element types in external environments, such as PyTACO support. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D124875	2022-05-03 14:36:37 -07:00
Stella Stamenova	5f14aee3bb	[mlir] Fix Visual Studio warnings There are only a couple of warnings when compiling with VS on Windows. This fixes the last remaining warnings so that we can enable LLVM_ENABLE_WERROR on the mlir windows bot. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D124862	2022-05-03 14:12:15 -07:00
Jim Kitchen	2c33266084	[mlir][sparse] Add lowering for unary and binary ops Adding lowering for Unary and Binary required several changes due to their unique nature of containing custom code for different "regions" of the sparse structure being operated on. Along with a Kind, a pointer to the Operation is passed along to be merged once the lattice structure is figured out. The original operation is maintained, as it is required for subsequent lattice decisions. However, sparse_tensor.binary has some branches are considered as fully handled and therefore are marked with as kBinaryBranch to distinguish them. A unique aspect of the custom code is that sometimes the desired result is no result at all -- i.e. a user wants overlapping sparse entries to become empty in the output. The solution to this is to return an uninitialized Value(), which is checked and handled elsewhere in the code and results in nothing being written to the output tensor for that case. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D123057	2022-05-03 15:50:26 -05:00
Goran Flegar	672b908bca	[mlir] Add sin & cos ops to complex dialect Also adds conversions for those ops to math + arith. Differential Revision: https://reviews.llvm.org/D124773	2022-05-03 19:36:12 +02:00
Min-Yih Hsu	857eb4a152	[mlir][LLVMIR] Add support for translating Switch instruction Add support for translating llvm::SwitchInst. Differential Revision: https://reviews.llvm.org/D124628	2022-05-03 09:41:40 -07:00
Hanhan Wang	919e459f1b	[Linalg] Remove Optional from getStaticLoopRanges interface method. It is very wrong if the ranges can't be infered. It's also checked in verifyStructuredOpInterface, so we don't need the Optional return type. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D124596	2022-05-03 05:12:54 -07:00
Alex Zinenko	6c57b0debe	[mlir] improve and test TransformState::Extension Add the mechanism for TransformState extensions to update the mapping between Transform IR values and Payload IR operations held by the state. The mechanism is intentionally restrictive, similarly to how results of the transform op are handled. Introduce test ops that exercise a simple extension that maintains information across the application of multiple transform ops. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D124778	2022-05-03 11:33:00 +02:00
Benjamin Kramer	dccb73318a	[mlir][MemRef] Return `0` for the canonical strided layout expr of a 0d memref There can't be any strides, and the offset for the canonical expr is always 0. Fixes #55229. Differential Revision: https://reviews.llvm.org/D124795	2022-05-03 10:58:44 +02:00
Vitaly Buka	8611909572	[mlir] Create lbOperands before op.setLowerBound To fix msan report like https://reviews.llvm.org/P8284 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D124575	2022-05-02 22:23:51 -07:00
Min-Yih Hsu	e927a336a5	[mlir][LLVMIR] Add support for translating FCmp & FP constants This patch add supports for translating FCmp and more kinds of FP constants in addition to 32 & 64-bit ones. However, we can't express ppc_fp128 constants right now because the semantics for its underlying APFloat is `S_PPCDoubleDouble` but mlir::FloatType doesn't support such semantics right now. Differential Revision: https://reviews.llvm.org/D124630	2022-05-02 16:22:35 -07:00
Raghu Maddhipatla	c685f82126	[mlir][OpenMP] Add omp.cancel and omp.cancellationpoint. Reviewed By: kiranchandramohan, peixin, shraiysh Differential Revision: https://reviews.llvm.org/D123828	2022-05-02 12:23:38 -05:00
Eugene Zhulenev	38d0df5577	[mlir] CRunnerUtils: qualify UnrankedMemRefType to avoid collisions with mlir::UnrankedMemRefType When CRunnerUtils included together with MLIR IR headers, it can lead to compilation errors. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D124744	2022-05-02 10:19:02 -07:00
Shraiysh Vaishay	a60fda59dc	[mlir][OpenMP] Restrict types for omp.parallel args This patch restricts the value of `if` clause expression to an I1 value. It also restricts the value of `num_threads` clause expression to an I32 value. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D124142	2022-05-02 14:17:34 +05:30
Alex Zinenko	946311b893	[mlir] support isa/cast/dyn_cast<Operation >(operation) This enables one to write generic code that can be instantiated for both specific operation classes and the common base class without specialization. Examples include functions that take/return ops, such as: ```mlir template <typename FnTy> void applyIf(FnTy &&lambda, ...) { for (Operation op : ...) { auto specific = dyn_cast<function_traits<FnTy>::template arg_t<0>>(op); if (specific) lambda(specific); } } ``` that would otherwise need to rely on template specialization to support lambdas that take specific operations and those that take `Operation *`. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D124675	2022-05-02 10:07:09 +02:00
River Riddle	3c75228991	[mlir:PDLInterp] Refactor the implementation of result type inferrence The current implementation uses a discrete "pdl_interp.inferred_types" operation, which acts as a "fake" handle to a type range. This op is used as a signal to pdl_interp.create_operation that types should be inferred. This is terribly awkward and clunky though: * This op doesn't have a byte code representation, and its conversion to bytecode kind of assumes that it is only used in a certain way. The current lowering is also broken and seemingly untested. * Given that this is a different operation, it gives off the assumption that it can be used multiple times, or that after the first use the value contains the inferred types. This isn't the case though, the resultant type range can never actually be used as a type range. This commit refactors the representation by removing the discrete InferredTypesOp, and instead adds a UnitAttr to pdl_interp.CreateOperation that signals when the created operations should infer their types. This leads to a much much cleaner abstraction, a more optimal bytecode lowering, and also allows for better error handling and diagnostics when a created operation doesn't actually support type inferrence. Differential Revision: https://reviews.llvm.org/D124587	2022-05-01 12:25:05 -07:00
Arjun P	ebbfe0136e	[MLIR][Presburger] subtraction: add support for divs defined by equalties Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D124668	2022-04-30 14:45:13 +01:00
Chris Lattner	d85eb4e2d6	[AsmParser] Introduce a new "Argument" abstraction + supporting logic MLIR has a common pattern for "arguments" that uses syntax like `%x : i32 {attrs} loc("sourceloc")` which is implemented in adhoc ways throughout the codebase. The approach this uses is verbose (because it is implemented with parallel arrays) and inconsistent (e.g. lots of things drop source location info). Solve this by introducing OpAsmParser::Argument and make addRegion (which sets up BlockArguments for the region) take it. Convert the world to propagating this down. This means that we correctly capture and propagate source location information in a lot more cases (e.g. see the affine.for testcase example), and it also simplifies much code. Differential Revision: https://reviews.llvm.org/D124649	2022-04-29 12:19:34 -07:00
Vitaly Buka	f735b3a2b0	[mlir] Prevent argStorage relocations This fixes msan reports like https://reviews.llvm.org/P8285 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D124576	2022-04-29 11:09:32 -07:00
Matthias Springer	3c2a74a3ae	[mlir][linalg][transform] Add TileOp to transform dialect This commit adds a tiling op to the transform dialect as an external op. Differential Revision: https://reviews.llvm.org/D124661	2022-04-29 21:35:31 +09:00
River Riddle	651d9f70ed	[mlir:PDLL] Fix the import of native constraints from ODS We weren't properly returning the result of the constraint, which leads to errors when actually trying to use the generated C++. Differential Revision: https://reviews.llvm.org/D124586	2022-04-28 12:58:00 -07:00
River Riddle	ebb1e900d3	[mlir:PDLL] Fix error handling of eof within a string literal We currently aren't handling this properly, and in the case of a string block just crash. This commit adds proper error handling and detection for eof. Differential Revision: https://reviews.llvm.org/D124585	2022-04-28 12:58:00 -07:00
River Riddle	32bf1f1d57	[mlir:LSP] Improve conversion between SourceMgr and LSP locations SourceMgr generally uses 1-based locations, whereas the LSP is zero based. This commit corrects this conversion and also enhances the conversion from SMLoc to SMRange to support string tokens. Differential Revision: https://reviews.llvm.org/D124584	2022-04-28 12:58:00 -07:00
River Riddle	9613a850b6	[mlir:PDL] Rework errors for pdl.operations with non-inferrable results We currently emit an error during verification if a pdl.operation with non-inferrable results is used within a rewrite. This allows for catching some errors during compile time, but is slightly broken. For one, the verification at the PDL level assumes that all dialects have been loaded, which is true at run time, but may not be true when the PDL is generated (such as via PDLL). This commit fixes this by not emitting the error if the operation isn't registered, i.e. it uses the `mightHave` variant of trait/interface methods. Secondly, we currently don't verify when a pdl.operation has no explicit results, but the operation being created is known to expect at least one. This commit adds a heuristic error to detect these cases when possible and fail. We can't always capture when the user made an error, but we can capture the most common case where the user expected an operation to infer its result types (when it actually isn't possible). Differential Revision: https://reviews.llvm.org/D124583	2022-04-28 12:58:00 -07:00
River Riddle	d4381b3f93	[mlir:PDL] Fix a syntax ambiguity in pdl.attribute pdl.attribute currently has a syntax ambiguity that leads to the incorrect parsing of pdl.attribute operations with locations that don't also have a constant value. For example: ``` pdl.attribute loc("foo") ``` The above IR is treated as being a pdl.attribute with a constant value containing the location, `loc("foo")`, which is incorrect. This commit changes the syntax to use `= <constant-value>` to clearly distinguish when the constant value is present, as opposed to just trying to parse an attribute. Differential Revision: https://reviews.llvm.org/D124582	2022-04-28 12:57:59 -07:00
River Riddle	92a836da07	[mlir] Attach InferTypeOpInterface on SameOperandsAndResultType operations when possible This allows for inferring the result types of operations in certain situations by using the type of an operand. This commit allowed for automatically supporting type inference for many more operations with no additional effort, e.g. nearly all Arithmetic operations now support result type inferrence with no additional changes. Differential Revision: https://reviews.llvm.org/D124581	2022-04-28 12:57:59 -07:00
River Riddle	1bd1edaf40	[mlir:ODS] Support using attributes in AllTypesMatch to automatically add InferTypeOpInterface This allows for using attribute types in result type inference for use with InferTypeOpInterface. This was a TODO before, but it isn't much additional work to properly support this. After this commit, arith::ConstantOp can now have its InferTypeOpInterface implementation automatically generated. Differential Revision: https://reviews.llvm.org/D124580	2022-04-28 12:57:59 -07:00
Chris Lattner	99499c3ea7	[OpAsmParser] Simplify logic for requiredOperandCount in parseOperandList. I would ideally like to eliminate 'requiredOperandCount' as a bit of verification that should be in the client side, but it is much more widely used than I expected. Just tidy some pieces up around it given we can't drop it immediately. NFC. Differential Revision: https://reviews.llvm.org/D124629	2022-04-28 12:05:10 -07:00
Jacques Pienaar	9a4472c56c	[mlir] Add basic tree-sitter grammar file tree-sitter grammar file that tries to closely matches LangRef (it could use some tweaking and cleanup, but kept fairly basic). Also updated LangRef in places where found some issues while doing the nearly direct transcription. This only adds a grammar file, not all the other parts (npm etc) that accompanies it. Those I'll propose for separate repo like we do for vscode extension. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D124352	2022-04-28 11:42:46 -07:00
Chris Lattner	5dedf911de	[AsmParser] Rework logic around "region argument parsing" The asm parser had a notional distinction between parsing an operand (like "%foo" or "%4#3") and parsing a region argument (which isn't supposed to allow a result number like #3). Unfortunately the implementation has two problems: 1) It didn't actually check for the result number and reject it. parseRegionArgument and parseOperand were identical. 2) It had a lot of machinery built up around it that paralleled operand parsing. This also was functionally identical, but also had some subtle differences (e.g. the parseOptional stuff had a different result type). I thought about just removing all of this, but decided that the missing error checking was important, so I reimplemented it with a `allowResultNumber` flag on parseOperand. This keeps the codepaths unified and adds the missing error checks. Differential Revision: https://reviews.llvm.org/D124470	2022-04-28 11:12:44 -07:00
Vitaly Buka	6e1ac68a0c	[mlir] Don't iterate mutable user list executeOp.operandsMutable().append(asyncTokens) in addAsyncDependencyAfter can resize and invalidate iterators. Fixes reports like https://reviews.llvm.org/P8286 Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D124577	2022-04-28 08:59:55 -07:00
Vitaly Buka	9f235a88f1	[mlir][msan] Don't access destroyed node	2022-04-28 08:58:27 -07:00
Marius Brehler	84fe39a45b	[mlir][emitc] Add a cast op This adds a cast operation that allows to perform an explicit type conversion. The cast op is emitted as a C-style cast. It can be applied to integer, float, index and EmitC types. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D123514	2022-04-28 15:50:59 +00:00
Vitaly Buka	0d70bc990b	[mlir][msan][test] Disable jit tests I am going to enable MLIR test on msan bot https://lab.llvm.org/buildbot/#/builders/sanitizer-x86_64-linux-bootstrap-msan Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D124574	2022-04-28 08:50:13 -07:00
Marius Brehler	50d648b40e	[mlir][emitc] Replace !emitc.opaque pointers Replaces using !emitc.opaque pointers which using !emitc.ptr types.	2022-04-28 15:20:39 +00:00
Marius Brehler	39dd29736f	[mlir][emitc] Disallow !emitc.opaque pointers Fordbids to express pointer via the `!emitc.opaque` type. Point the user to use the `!emitc.ptr` type instead. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D124002	2022-04-28 15:08:21 +00:00
Lei Zhang	bbffece383	[mlir][spirv] Remove layout decoration on unneeded storage classes Per SPIR-V validation rules, explict layout decorations are only needed for StorageBuffer, PhysicalStorageBuffer, Uniform, and PushConstant storage classes. (And even that is for Shader capabilities). So we don't need such decorations on the rest. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D124543	2022-04-28 08:18:23 -04:00
Lei Zhang	8854b73606	[mlir][spirv] Convert memref.alloca to spv.Variable Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D124542	2022-04-28 08:13:40 -04:00
Javier Setoain	6301574206	[mlir][SparseTensor] Enable VLA ops in index value generation Current index value generation uses fixed-length vector ops, this patch adds an alterantive codegen path compatible with scalable vectors by using `LLVM::StepVectorOp`. Differential Revision: https://reviews.llvm.org/D124454	2022-04-28 09:39:07 +01:00
Aart Bik	ccd047cba4	[mlir][sparse] optimize COO index handling By using a shared index pool, we reduce the footprint of each "Element" in the COO scheme and, in addition, reduce the overhead of allocating indices (trading many allocations of vectors for allocations in a single vector only). When the capacity is known, this means all allocation can be done in advance. This is a big win. For example, reading matrix SK-2005, with dimensions 50,636,154 x 50,636,154 and 1,949,412,601 nonzero elements improves as follows (time in ms), or about 3.5x faster overall ``` SK-2005 before after speedup --------------------------------------------- read 305,086.65 180,318.12 1.69 sort 2,836,096.23 510,492.87 5.56 pack 364,485.67 312,009.96 1.17 --------------------------------------------- TOTAL 3,505,668.56 1,002,820.95 3.50 ``` Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D124502	2022-04-27 10:20:47 -07:00
Min-Yih Hsu	a75657d66a	[mlir][LLVMIR] Do not cache llvm::Constant into instMap Constants in MLIR are not globally unique, unlike that in LLVM IR. Therefore, reusing previous-translated constants might cause the user operations not being dominated by the constant (because the previous-translated ones can be placed in arbitrary place) This indeed misses some opportunities where we actually can reuse a previous-translated constants, but verbosity is not our first priority here. Differential Revision: https://reviews.llvm.org/D124404	2022-04-27 09:43:49 -07:00
Min-Yih Hsu	ea9bcb8b27	[mlir][LLVMIR] Do not cache Instruction generated on-the-fly More specifically, the llvm::Instruction generated by llvm::ConstantExpr::getAsInstruction. Such Instruction will be deleted right away, but it's possible that when getAsInstruction is called again, it will create a new Instruction that has the same address with the one we just deleted. Thus, we shouldn't keep it in the `instMap` to avoid a conflicting index that triggers an assertion in processInstruction. Differential Revision: https://reviews.llvm.org/D124402	2022-04-27 09:42:59 -07:00
Min-Yih Hsu	00fcf9e95a	[mlir][LLVMIR] Add support for importing struct-type ConstantAggregate(Zero) And move importer test files from `test/Target/LLVMIR` into `test/Target/LLVMIR/Import`. We simply translate struct-type ConstantAggregate(Zero) into a serious of `llvm.insertvalue` operations against a `llvm.undef` root. Note that this doesn't affect the original logics on translating vector/array-type ConstantAggregate values. Differential Revision: https://reviews.llvm.org/D124399	2022-04-27 09:42:26 -07:00
Mathieu Fehr	88bc24a7e3	[mlir] Allow setting operation legality with an OperationName This is necessary to handle conversions of operations defined at runtime in extensible dialects. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D124353	2022-04-27 08:54:51 -07:00
Lei Zhang	d137c05fc9	[mlir][spirv] Add conversion from vector.reduction Only supports addition and multiplication for now; other cases to be implemented. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D124380	2022-04-27 10:29:46 -04:00
Lei Zhang	38e802a09d	[mlir][spirv] Allow converting from index type in unsigned ops `index` type is converted to `i32` in SPIR-V. This is fine to support for all signed/unsigned ops. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D124451	2022-04-27 10:13:50 -04:00
Mathieu Fehr	9e0b553359	[mlir] Add extensible dialects Depends on D104534 Add support for extensible dialects, which are dialects that can be extended at runtime with new operations and types. These operations and types cannot at the moment implement traits or interfaces. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D104554	2022-04-26 19:48:22 -07:00
River Riddle	71aad31c0b	[mlir:PDLL] Use normalized paths in compilation database test This fixes issues with the compilation database when the file path isn't in the correct form.	2022-04-26 19:45:30 -07:00
River Riddle	021b254547	[mlir:PDLL] Fix build on windows related to different file paths This fixes issues with the compilation database when the file path isn't in the correct form.	2022-04-26 19:40:41 -07:00
River Riddle	41d2c6df5c	[mlir][PDLL-LSP] Add code completion for include file paths This allows for providing completion results for include directive file paths by searching the set of include directories for the current file. Differential Revision: https://reviews.llvm.org/D124112	2022-04-26 18:33:17 -07:00
River Riddle	09af7fefc8	[mlir][PDLL] Add document link and hover support to mlir-pdll-lsp-server This allows for navigating to included files on click, and also provides hover information about the include file (similarly to clangd). Differential Revision: https://reviews.llvm.org/D124077	2022-04-26 18:33:17 -07:00
River Riddle	fb5a59f6e1	[mlir][PDLL] Add initial support for a PDLL compilation database The compilation database acts in a similar way to the compilation database (compile_commands.json) used by clang-tidy, i.e. it provides additional information about the compilation of project files to help the language server. The main piece of information provided by the PDLL compilation database in this commit is the set of include directories used when processing the input .pdll file. This allows for the server to properly process .pdll files that use includes anchored by the include directories set up in the build system. The structure of the textual form of a compilation database is a yaml file containing documents of the following form: ``` --- !FileInfo: filepath: <string> - Absolute file path of the file. includes: <string> - Semi-colon delimited list of include directories. ``` This commit also adds support to cmake for automatically generating a `pdll_compile_commands.yml` file at the top-level of the build directory. Differential Revision: https://reviews.llvm.org/D124076	2022-04-26 18:33:17 -07:00
River Riddle	597fde54a8	[mlir][PDLL] Add support for generating PDL patterns from PDLL at build time This essentially sets up mlir-pdll to function in a similar manner to mlir-tblgen. Aside from the boilerplate of configuring CMake and setting up a basic initial test, two new options are added to mlir-pdll to mirror options provided by tblgen: * -d This option generates a dependency file (i.e. a set of build time dependencies) while processing the input file. * --write-if-changed This option only writes to the output file if the data would have changed, which for the build system prevents unnecesarry rebuilds if the file was touched but not actually changed. Differential Revision: https://reviews.llvm.org/D124075	2022-04-26 18:33:16 -07:00
River Riddle	b3fc0fa84a	[mlir][PDLL] Don't use the result of `Constraint::getDefName()` when uniquing In the case of anonymous defs this may return the name of the base def class, which can lead to two different defs with the same name (which hits an assert). This commit adds a new `getUniqueDefName` method that returns a unique name for the constraint. Differential Revision: https://reviews.llvm.org/D124074	2022-04-26 18:33:16 -07:00
Aart Bik	33e8ab8ea0	[mlir][sparse] support pattern-only matrices from Matrix Market We simply set nonzero entries to the value "1" in this case. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D124475	2022-04-26 15:50:21 -07:00
Michael Kruse	ff289feeba	[OpenMPIRBuilder] Remove ContinuationBB argument from Body callback. The callback is expected to create a branch to the ContinuationBB (sometimes called FiniBB in some lambdas) argument when finishing. This creates problems: 1. The InsertPoint used for CodeGenIP does not need to be the end of a block. If it is not, a naive callback will insert a branch instruction into the middle of the block. 2. The BasicBlock the CodeGenIP is pointing to may or may not have a terminator. There is an conflict where to branch to if the block already has a terminator. 3. Some API functions work only with block having a terminator. Some workarounds have been used to insert a temporary terminator that is removed again. 4. Some callbacks are sensitive to whether the BasicBlock has a terminator or not. This creates a callback ordering problem where different callback may have different behaviour depending on whether a previous callback created a terminator or not. The problem also exists for FinalizeCallbackTy where some callbacks do create branch to another "continue" block, but unlike BodyGenCallbackTy does not receive the target as argument. This is not addressed in this patch. With this patch, the callback receives an CodeGenIP into a BasicBlock where to insert instructions. If it has to insert control flow, it can split the block at that position as needed but otherwise no separate ContinuationBB is needed. In particular, a callback can be empty without breaking the emitted IR. If the caller needs the control flow to branch to a specific target, it can insert the branch instruction itself and pass an InsertPoint before the terminator to the callback. Certain frontends such as Clang may expect the current IRBuilder position to be at the end of a basic block. In this case its callbacks must split the block at CodeGenIP before setting the IRBuilder position such that the instructions after CodeGenIP are moved to another basic block and before returning create a new branch instruction to the split block. Some utility functions such as `splitBB` are supporting correct splitting of BasicBlocks, independent of whether they have a terminator or not, returning/setting the InsertPoint of an IRBuilder to the end of split predecessor block, and optionally omitting creating a branch to the split successor block to be added later. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D118409	2022-04-26 16:35:01 -05:00
Yi Zhang	e1318078a4	Support non identity layout map for reshape ops in MemRefToLLVM lowering This change borrows the ideas from `computeExpanded/CollapsedLayoutMap` and computes the dynamic strides at runtime for the memref descriptors. Differential Revision: https://reviews.llvm.org/D124001	2022-04-26 13:03:53 -04:00
Alex Zinenko	b84f95fe53	[mlir] Fix -Wunused-private-field in the Transform dialect Classes derived from TransformState::Extension may need access to the parent state.	2022-04-26 14:05:24 +02:00
Alex Zinenko	2b985a7ae8	[mlir] Add a title to the Transform Dialect doc	2022-04-26 13:04:41 +02:00
Krzysztof Drewniak	d35f7f254f	[mlir] Allow data flow analysis of non-control flow branch arguments This commit adds the visitNonControlFlowArguments method to DataFlowAnalysis, allowing analyses to provide lattice values for the arguments to a RegionSuccessor block that aren't directly tied to an op's inputs. For example, integer range interface can use this method to infer bounds for the step values in loops. This method has a default implementation that keeps the old behavior of assigning a pessimistic fixedpoint state to all such arguments. Reviewed By: Mogball, rriddle Differential Revision: https://reviews.llvm.org/D124021	2022-04-25 20:19:34 +00:00
jfurtek	c4caa90b15	[mlir][tblgen] Generate builders with inferred return types and unwrapped attributes This diff causes mlir-tblgen to generate code for an additional builder for an operation argument with a return type that can be inferred AND an attribute in the argument list can be "unwrapped." (Previously, the unwrapped build function was only generated for builders with explicit return types in separate or aggregate form.) As an example, this builder might be used by code that creates operations that implement the `SameOperandsAndResultType` interface. A test case was created. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D124043	2022-04-25 19:00:44 +00:00
Jeremy Furtek	a266a21000	[mlir][ods] Extend the EnumAttr tablegen class to support BitEnum attributes This diff allows the EnumAttr class to be used for bit enum attributes (in addition to previously supported integer enum attributes). While integer and bit enum attributes share many common implementation aspects, parsing bit enum values requires a separate implementation. This is accomplished by creating empty parser and printer strings in the EnumAttrInfo record, and having derived classes (specific to bit and integer enums) override with an appropriate parser/printer string. To support existing bit enums that may use a vertical bar separator, the parser is modified to support the \| token. Tests were added for bit enums alongside integer enums. Future diffs for fastmath attributes in the arithmetic dialect will use these changes. (resubmission of earlier abaondoned diff, updated to reflect subsequent changes in the repository) Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D123880	2022-04-25 19:00:00 +00:00
jfurtek	4e5dee2f30	[mlir][ods] Add tablegen field for concise printing of BitEnum attributes This diff introduces a tablegen field for bit enum attributes (`printBitEnumPrimaryGroups`) to control printing when the enum uses "group" cases. An example would be an implementation that uses a `fastmath` enum value as an alias for individual fastmath flags. The proposed field would allow printing of simply `fast` for the enum value, instead of the more verbose list that would include `fast` as well as the individual flags (e.g. `reassoc,nnan, ninf,nsz,arcp,contract,afn,fast`). Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D123871	2022-04-25 18:48:35 +00:00
Markus Böck	12a2716953	[mlir][LLVM] Support opaque pointers in `llvm.mlir.addressof` The verifier of llvm.mlir.addressof did not properly account for opaque pointers, that is, the pointer type not having an element type equal to the type of the referenced global or function. This patch fixes that by skipping the test for the element type if the pointer is opaque. Differential Revision: https://reviews.llvm.org/D124333	2022-04-25 12:23:16 +02:00
Alex Zinenko	4c807f2f57	[mlir][vector] insert `alloca`s outside of loops After https://reviews.llvm.org/D119743 added the `AutomaticAllocationScope` trait to loop-like constructs, the vector transfer full/partial splitting pass started inserting allocations for temporaries within the closest loop rather than the closest function (or other allocation scope such as `async.execute`). While this is correct as long as the lowered code takes care of automatic deallocation at the end of each iteration of the loop, this interferes with downstream optimizations that expect `alloca`s to be at the function level. Step over loops when looking for the closest allocation scope in vector transfer full/partial splitting pass thus restoring the original behavior. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D124366	2022-04-25 10:49:09 +02:00
Markus Böck	34312f1f0c	[mlir][LLVM] Support opaque pointers in data layout entries This is likely preferable to having it crash if one were to specify an opaque pointer type, and the actual element type is unused either way. Differential Revision: https://reviews.llvm.org/D124334	2022-04-25 09:14:33 +02:00
Nick Kreeger	4620032ee3	Revert "[mlir][sparse] Expose SpareTensor passes as enums instead of opaque numbers for vectorization and parallelization options." This reverts commit `d59cf901cb`. Build fails on NVIDIA Sparse tests: https://lab.llvm.org/buildbot/#/builders/61/builds/25447	2022-04-23 20:14:48 -05:00
Nick Kreeger	d59cf901cb	[mlir][sparse] Expose SpareTensor passes as enums instead of opaque numbers for vectorization and parallelization options. The SparseTensor passes currently use opaque numbers for the CLI, despite using an enum internally. This patch exposes the enums instead of numbered items that are matched back to the enum. Fixes GitHub issue #53389 Reviewed by: aartbik, mehdi_amini Differential Revision: https://reviews.llvm.org/D123876	2022-04-23 19:16:57 -05:00
Matthias Springer	48b8edac1c	[mlir][bufferize][NFC] Remove old references to Comprehensive Bufferize Differential Revision: https://reviews.llvm.org/D124324	2022-04-23 18:01:05 +09:00
Matthias Springer	940a3f6b3d	[mlir][bufferize][NFC] Clean up test cases Run `one-shot-bufferize` instead of `linalg-comprehensive-module-bufferize` and move some test cases to their respective dialects. Differential Revision: https://reviews.llvm.org/D124323	2022-04-23 18:00:55 +09:00
River Riddle	eda6f907d2	[mlir][NFC] Shift a bunch of dialect includes from the .h to the .cpp Now that dialect constructors are generated in the .cpp file, we can drop all of the dependent dialect includes from the .h file. Differential Revision: https://reviews.llvm.org/D124298	2022-04-23 01:09:29 -07:00
River Riddle	f3ebf828dc	[mlir] Generate Dialect constructors in .cpp instead of .h By generating in the .h file, we were forcing dialects to include a lot of additional header files because: * Fields of the dialect, e.g. std::unique_ptr<>, were unable to use forward declarations. * Dependent dialects are loaded in the constructor, requiring the full definition of each dependent dialect (which, depending on the file structure of the dialect, may include the operations). By generating in the .cpp we get much faster builds, and also better align with the rest of the code base. Fixes #55044 Differential Revision: https://reviews.llvm.org/D124297	2022-04-23 00:44:54 -07:00
Markus Böck	8ed2bd1e74	[mlir][LLVM] Fix `DataLayoutTypeInterface` for opqaue pointers with non-default address space As a fallback mechanism, if no entry was supplied for a given address space, the size or alignment for a pointer type with the default address space is returned instead. This code currently crashes with opaque pointers, as it tries to construct a typed pointer type from the opaque pointer type, leading to a null pointer dereference when fetching the element type. This patch fixes the issue by handling the opaque pointer cases explicitly. Differential Revision: https://reviews.llvm.org/D124290	2022-04-23 00:10:31 +02:00
Markus Böck	bab3d3778d	[mlir][LLVM] Fix crash when using opaque pointers in function signatures Using opaque pointers in function signatures leads to an attempt to recursively convert all types, including sub types in LLVM types. In the case of LLVM pointers, it may not have a subtype aka element type if it is opaque which would then lead to a null pointer dereference. Differential Revision: https://reviews.llvm.org/D124291	2022-04-23 00:10:31 +02:00
Yi Zhang	1cddcfdc3c	Fix CollapsedLayoutMap for dim size 1 case This change fixes `CollapsedLayoutMap` for cases where the collapsed dims are size 1. The cases where inner most dims are size 1 and noncontiguous can be represented by the strided form and therefore can be allowed. For such cases, the new stride should be of the next entry in an association whose dimension is not size 1. If the next entry is dynamic, it's not possible to decide which stride to use at compilation time and the stride is set to dynamic. Differential Revision: https://reviews.llvm.org/D124137	2022-04-22 17:48:24 -04:00
Alex Zinenko	40a8bd635b	[mlir] use side effects in the Transform dialect Currently, the sequence of Transform dialect operations only supports a single use of each operand (verified by the `transform.sequence` operation). This was originally motivated by the need to guard against accessing a payload IR operation associated with a transform IR value after this operation has likely been rewritten by a transformation. However, not all Transform dialect operations rewrite payload IR, in particular the "navigation" operation such as `transform.pdl_match` do not. Introduce memory effects to the Transform dialect operations to describe their effect on the payload IR and the mapping between payload IR opreations and transform IR values. Use these effects to replace the single-use rule, allowing repeated reads and disallowing use-after-free, where operations with the "free" effect are considered to "consume" the transform IR value and rewrite the corresponding payload IR operations). As an additional improvement, this enables code motion transformation on the transform IR itself. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D124181	2022-04-22 23:29:11 +02:00
Okwan Kwon	ee285faed2	[mlir] Do not bubble up extract slice when it is rank-reducing. The bubble up logic was written by assuming the slice operation is always a normal slice that outputs a tensor with the same rank. Differential Revision: https://reviews.llvm.org/D124283	2022-04-22 12:21:47 -07:00
cpillmayer	3e8560f890	[MLIR] Add option to print users of an operation as comment in the printer This allows printing the users of an operation as proposed in the git issue #53286. To be able to refer to operations with no result, these operations are assigned an ID in SSANameState. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D124048	2022-04-22 18:58:10 +00:00
Jacques Pienaar	9bae20b528	[mlir] Add shape.func Add shape func op for use (primarily) in shape function_library op. Allows setting default dialect for some simpler authoring. This is a minimal version of the ops needed. Differential Revision: https://reviews.llvm.org/D124055	2022-04-22 11:35:35 -07:00
Lei Zhang	6f28fd0bf7	[mlir][vector] Fold 1-element reduction into extract or arith ops If there is only one single element in the vector, then we can just extract the element to compute the final result. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D124129	2022-04-22 14:24:46 -04:00
Matthias Springer	d6dab38ae4	[mlir][bufferize][NFC] Add function boundary bufferization flag to BufferizationOptions This makes the API easier to use. Also allows us to check for incorrect API usage for easier debugging. Differential Revision: https://reviews.llvm.org/D124265	2022-04-23 01:11:37 +09:00
Matthias Springer	b0b19fae81	[mlir][bufferize][NFC] Rewrite op filter logic The `hasFilter` field is not needed. Instead, the filter accepts ops by default if no ALLOW rule was specified. Differential Revision: https://reviews.llvm.org/D124264	2022-04-23 00:25:24 +09:00
Lei Zhang	fc760c0260	[mlir][vector] Fold cancelling vector.shape_cast(vector.broadcast) vector.broadcast can inject all size one dimensions. If it's followed by a vector.shape_cast to the original type, we can cancel the op pair, like cancelling consecutive shape_cast ops. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D124094	2022-04-22 08:58:26 -04:00
Matthias Springer	494505f39f	[mlir][bufferize][NFC] Move SCF test cases to SCF dialect Differential Revision: https://reviews.llvm.org/D124249	2022-04-22 20:35:20 +09:00
Matthias Springer	e07a7fd5c0	[mlir][bufferization] Move ModuleBufferization to bufferization dialect * Move Module Bufferization to the bufferization dialect. The implementation is split into `OneShotModuleBufferize.cpp` and `FuncBufferizableOpInterfaceImpl.cpp`, so that the external model implementation can be easily moved to the func dialect in the future. * Split and clean up test cases. A few test cases are still remaining in Linalg and will be updated separately. * `linalg.inplaceable` is renamed to `bufferization.writable` to accurately reflect its current usage. * Attributes and their verifiers are moved from the Linalg dialect to the Bufferization dialect. * Expand documentation. * Add a new flag to One-Shot Bufferize to allow for function boundary bufferization. Differential Revision: https://reviews.llvm.org/D122229	2022-04-22 19:37:28 +09:00
Matthias Springer	bd1d87e3d1	[mlir][bufferization][NFC] Remove layout post processing step The layout postprocessing step was removed and is now part of the FuncOp bufferization. If the user specified a certain layout map for a tensor function arg, use that layout map directly when bufferizing the function signature. Previously, the bufferization used a generic layout map for every tensor function arg and then updated function signatures and CallOps in a separate step. Differential Revision: https://reviews.llvm.org/D122228	2022-04-22 18:49:47 +09:00
Matthias Springer	70777d967f	[mlir][bufferize][NFC] Move FuncOp bufferization to BufferizableOpInterface impl FuncOps are now less special. They must still be analyzed + bufferized in a certain order, but they are now bufferized same as other ops that have a region: Bufferize the op first (`bufferize` interface method), then bufferize the region body with other bufferization patterns. In the case of FuncOps, the function signature is bufferized together with ReturnOps. Similar to how, e.g., scf.for ops are bufferized together with scf.yield ops. This change is essentially a reimplementation of the FuncOp bufferization, but mostly NFC from a user's perspective (apart from error messages). This change is in preparation of moving the code to the bufferization dialect. Differential Revision: https://reviews.llvm.org/D123214	2022-04-22 18:47:12 +09:00
Matthias Springer	d820acdde1	[mlir][bufferize][NFC] Use custom walk instead of GreedyPatternRewriter The bufferization driver was previously using a GreedyPatternRewriter. This was problematic because bufferization must traverse ops top-to-bottom. The GreedyPatternRewriter was previously configured via `useTopDownTraversal`, but this was a hack; this API was just meant for performance improvements and should not affect the result of the rewrite. BEGIN_PUBLIC No public commit message needed. END_PUBLIC Differential Revision: https://reviews.llvm.org/D123618	2022-04-22 18:23:09 +09:00
jacquesguan	9b32886e7e	[mlir][Arithmetic] Use common constant fold function in RemSI and RemUI to cover splat. This patch replaces current fold function with the common constant fold funtion in order to cover the situation of constant splat. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D124236	2022-04-22 09:20:18 +00:00
jacquesguan	abc17a6751	[mlir][Arithmetic] Use matchPattern to simplify code. This patch replaces some code with matchPattern and move them before the constant folder function in order to avoid redundant invoking. Differential Revision: https://reviews.llvm.org/D124235	2022-04-22 08:42:51 +00:00
Adrian Kuegel	a74e5a89b9	[mlir] Move isGuaranteedCollapsible to CollapseShapeOp (NFC). It seems more natural than to have it as a static method of ExpandShapeOp. Also fix a typo ("the the" -> "the"). Differential Revision: https://reviews.llvm.org/D124234	2022-04-22 10:31:25 +02:00
Will Dietz	bb8c8751cf	[MLIR] prefer /bin/sh over /bin/bash for simple test scripts These scripts do not appear to require bash, and while /bin/sh is not guaranteed either it's more commonly available. Fixes tests on NixOS and in certain sandbox build environments. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D124205	2022-04-21 20:25:17 -05:00
Amy Zhuang	5bd4bcfc04	[mlir] Modify SuperVectorize to generate select op->combiner op Insert the select op before the combiner op when vectorizing a reduction loop that needs a mask, so the vectorized reduction loop can pass isLoopParallel check and be transformed correctly in later passes. Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D124047	2022-04-21 17:09:13 -07:00
Mahesh Ravishankar	0c090dcc8a	[mlir][Linalg] Deprecate legacy reshape + generic op folding patterns. These patterns have been superceded by the fusion by collapsing patterns. Differential Revision: https://reviews.llvm.org/D124145	2022-04-21 22:25:23 +00:00
Chris Lattner	31c8abc3f1	[AsmParser/Printer] Rework sourceloc support for function arguments. When Location tracking support for block arguments was added, we discussed various approaches to threading support for this through function-like argument parsing. At the time, we added a parallel array of locations that could hold this. It turns out that that approach was verbose and error prone, roughly no one adopted it. This patch takes a different approach, adding an optional source locator to the UnresolvedOperand class. This fits much more naturally into the standard structure we use for representing locators, and gives all the function like dialects locator support for free (e.g. see the test adding an example for the LLVM dialect). Differential Revision: https://reviews.llvm.org/D124188	2022-04-21 12:43:36 -07:00
Frederik Gossen	673e9828be	[MLIR] Fix iteration counting in greedy pattern application Previously, checking that a fix point is reached was counted as a full iteration. As this "iteration" never changes the IR, this seems counter- intuitive. Differential Revision: https://reviews.llvm.org/D123641	2022-04-21 15:17:28 -04:00
Alex Zinenko	0edb262d91	[mlir] enable doc generation for the transform dialect	2022-04-21 18:52:08 +02:00
Fangrui Song	ae46b3e01f	Revert D121279 "[MLIR][GPU] Add canonicalizer for gpu.memcpy" This reverts commit `12f55cac69`. Causes miscompile. Will follow up with a reproduce.	2022-04-21 08:55:13 -07:00
Alex Zinenko	30f22429d3	[mlir] Connect Transform dialect to PDL This introduces a pair of ops to the Transform dialect that connect it to PDL patterns. Transform dialect relies on PDL for matching the Payload IR ops that are about to be transformed. For this purpose, it provides a container op for patterns, a "pdl_match" op and transform interface implementations that call into the pattern matching infrastructure. To enable the caching of compiled patterns, this also provides the extension mechanism for TransformState. Extensions allow one to store additional information in the TransformState and thus communicate it between different Transform dialect operations when they are applied. They can be added and removed when applying transform ops. An extension containing a symbol table in which the pattern names are resolved and a pattern compilation cache is introduced as the first client. Depends On D123664 Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D124007	2022-04-21 16:23:10 +02:00
Markus Böck	850b2c6b3c	[mlir] Fix `Region`s `takeBody` method if the region is not empty The current implementation of takeBody first clears the Region, before then taking ownership of the blocks of the other regions. The issue here however, is that when clearing the region, it does not take into account references of operations to each other. In particular, blocks are deleted from front to back, and operations within a block are very likely to be deleted despite still having uses, causing an assertion to trigger [0]. This patch fixes that issue by simply calling dropAllReferences()before clearing the blocks. [0] `9a8bb4bc63/mlir/lib/IR/Operation.cpp (L154)` Differential Revision: https://reviews.llvm.org/D123913	2022-04-21 15:32:59 +02:00
Markus Böck	a41aaf166f	[mlir] Make `Regions`s `cloneInto` multithread-readable Prior to this patch, `cloneInto` would do a simple walk over the blocks and contained operations and clone and map them as it encounters them. As finishing touch it then remaps any successor and operands it has remapped during that process. This is generally fine, but sadly leads to a lot of uses of both operations and blocks from the source region, in the cloned operations in the target region. Those uses lead to writes in the use-def list of the operations, making `cloneInto` never thread safe. This patch reimplements `cloneInto` in three steps to avoid ever creating any extra uses on elements in the source region: * It first creates the mapping of all blocks and block operands * It then clones all operations to create the mapping of all operation results, but does not yet clone any regions or set the operands * After all operation results have been mapped, it now sets the operations operands and clones their regions. That way it is now possible to call `cloneInto` from multiple threads if the Region or Operation is isolated-from-above. This allows creating copies of functions or to use `mlir::inlineCall` with the same source region from multiple threads. In the general case, the method is thread-safe if through cloning, no new uses of `Value`s from outside the cloned Operation/Region are created. This can be ensured by mapping any outside operands via the `BlockAndValueMapping` to `Value`s owned by the caller thread. While I was at it, I also reworked the `clone` method of `Operation` a little bit and added a proper options class to avoid having a `cloneWithoutRegionsAndOperands` method, and be more extensible in the future. `cloneWithoutRegions` is now also a simple wrapper that calls `clone` with the proper options set. That way all the operation cloning code is now contained solely within `clone`. Differential Revision: https://reviews.llvm.org/D123917	2022-04-21 13:43:00 +02:00
Uday Bondhugula	f47a38f517	Add async dependencies support for gpu.launch op Add async dependencies support for gpu.launch op: this allows specifying a list of async tokens ("streams") as dependencies for the launch. Update the GPU kernel outlining pass lowering to propagate async dependencies from gpu.launch to gpu.launch_func op. Previously, a new stream was being created and destroyed for a kernel launch. The async deps support allows the kernel launch to be serialized on an existing stream. Differential Revision: https://reviews.llvm.org/D123499	2022-04-21 16:25:59 +05:30
Nimish Mishra	00c511b351	Added lowering support for atomic read and write constructs This patch adds lowering support for atomic read and write constructs. Also added is pointer modelling code to allow FIR pointer like types to be inferred and converted while lowering. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D122725 Co-authored-by: Kiran Chandramohan <kiran.chandramohan@arm.com>	2022-04-21 12:19:13 +05:30
River Riddle	0fd3a1ce60	[mlir][NFC] Update remaining textual references of un-namespaced `func` operations The special case parsing of operations in the `func` dialect is being removed, and operations will require the dialect namespace prefix.	2022-04-20 22:17:31 -07:00
River Riddle	cda6aa78f8	[mlir][NFC] Update textual references of `func` to `func.func` in Transform tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:30 -07:00
River Riddle	a4936cb3e8	[mlir][NFC] Update textual references of `func` to `func.func` in Pass/Target tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:30 -07:00
River Riddle	63237cddc1	[mlir][NFC] Update textual references of `func` to `func.func` in tool/runner tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:30 -07:00
River Riddle	6a99d29022	[mlir][NFC] Update textual references of `func` to `func.func` in IR/Interface tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:30 -07:00
River Riddle	87db8e4439	[mlir][NFC] Update textual references of `func` to `func.func` in Integration tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:29 -07:00

... 3 4 5 6 7 ...

11491 Commits