llvm-project

Commit Graph

Author	SHA1	Message	Date
Rob Suderman	28e6420744	[mlir][tosa] Add tosa.argmax to linalg lowering Tosa's argmax lowering is representable as a linalg.indexed_generic operation. Include the lowering to this type for both integer and floating point types. Differential Revision: https://reviews.llvm.org/D99137	2021-03-23 16:06:55 -07:00
Rob Suderman	4157a079af	[mlir][tosa] Add tosa.pad to linalg.pad operation Lowers from tosa's pad op to the linalg equivalent for floating, integer, and quantized values. Differential Revision: https://reviews.llvm.org/D98990	2021-03-23 14:15:48 -07:00
River Riddle	76f3c2f3f3	[mlir][Pattern] Add better support for using interfaces/traits to match root operations in rewrite patterns To match an interface or trait, users currently have to use the `MatchAny` tag. This tag can be quite problematic for compile time for things like the canonicalizer, as the `MatchAny` patterns may get applied to every operation. This revision adds better support by bucketing interface/trait patterns based on which registered operations have them registered. This means that moving forward we will only attempt to match these patterns to operations that have this interface registered. Two simplify defining patterns that match traits and interfaces, two new utility classes have been added: OpTraitRewritePattern and OpInterfaceRewritePattern. Differential Revision: https://reviews.llvm.org/D98986	2021-03-23 14:05:33 -07:00
Chris Lattner	782c534117	[ODS] Implement a new 'hasCanonicalizeMethod' bit for cann patterns. This provides a simplified way to implement 'matchAndRewrite' style canonicalization patterns for ops that don't need the full power of RewritePatterns. Using this style, you can implement a static method with a signature like: ``` LogicalResult AssertOp::canonicalize(AssertOp op, PatternRewriter &rewriter) { return success(); } ``` instead of dealing with defining RewritePattern subclasses. This also adopts this for a few canonicalization patterns in the std dialect to show how it works. Differential Revision: https://reviews.llvm.org/D99143	2021-03-23 13:45:45 -07:00
Rob Suderman	2d72b675d5	[mlir][tosa] Add tosa.tile to linalg.generic lowering Tiling operations are generic operations with modified indexing. Updated to to linalg lowerings to perform this lowering. Differential Revision: https://reviews.llvm.org/D99113	2021-03-23 13:13:54 -07:00
natashaknk	e20911b5c0	[mlir][tosa] Add tosa.matmul and tosa.fully_connected lowering Adds lowerings for matmul and fully_connected. Only supports 2D tensors for inputs and weights, and 1D tensors for bias. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D99211	2021-03-23 13:09:53 -07:00
Alex Zinenko	20c68d9441	[mlir] silence -Wunused-variable in release mode in Linalg transforms	2021-03-23 18:59:12 +01:00
Nicolas Vasilache	2240568579	[MLIR][Linalg] Hoist padding across multiple levels of tiling This revision introduces proper backward slice computation during the hoisting of PadTensorOp. This allows hoisting padding even across multiple levels of tiling. Such hoisting requires the proper handling of loop bounds that may depend on enclosing loop variables. Differential revision: https://reviews.llvm.org/D98965	2021-03-23 17:47:32 +00:00
Alex Zinenko	5fac87d1bc	[mlir] verify that operand/result_segment_sizes attributes have i32 element This is an assumption that is made in numerous places in the code. In particular, in the code generated by mlir-tblgen for operand/result accessors in ops with attr-sized operand or result lists. Make sure to verify this assumption. Note that the operation traits are verified before running the custom op verifier, which can expect the trait verifier to have passed, but some traits may be verified before the AttrSizedOperand/ResultTrait and should not make such assumptions. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D99183	2021-03-23 18:26:31 +01:00
Frederik Gossen	94ef248d7b	Revert "[MLIR] Canonicalize `shape.assuming` op to yield only inner values" This reverts commit `5f8acd4fd2`.	2021-03-23 16:05:55 +01:00
Frederik Gossen	5f8acd4fd2	[MLIR] Canonicalize `shape.assuming` op to yield only inner values Differential Revision: https://reviews.llvm.org/D99156	2021-03-23 12:34:50 +01:00
Frederik Gossen	f368b3a029	[MLIR][Shape] Canonicalize duplicate operands in `shape.cstr_broadcastable` Differential Revision: https://reviews.llvm.org/D99159	2021-03-23 12:23:22 +01:00
Frederik Gossen	d78374b2d3	[MLIR] Add callback builder for `shape.assuming` op Differential Revision: https://reviews.llvm.org/D99153	2021-03-23 11:46:01 +01:00
Christian Sigg	ddae61dfef	[mlir] Remove deprecated methods from mlir::OpState Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D99150	2021-03-23 11:08:04 +01:00
River Riddle	6d6fe9ccc4	[mlir][OpAsmFormat] Add support for an "else" group on optional elements The "else" group of an optional element is a collection of elements that get parsed/printed when the anchor of the main element group is not present. This is useful when there is a special syntax when an element is not present. The new syntax for an optional element is shown below: ``` optional-group: `(` elements `)` (`:` `(` else-elements `)`)? `?` ``` An example of how this might be used is shown below: ```tablegen def FooOp : ... { let arguments = (ins UnitAttr:$foo); let assemblyFormat = "attr-dict (`foo_is_present` $foo^):(`foo_is_absent`)?"; } ``` would be formatted as such: ```mlir // When the `foo` attribute is present: foo.op foo_is_present // When the `foo` attribute is not present: foo.op foo_is_absent ``` Differential Revision: https://reviews.llvm.org/D99129	2021-03-22 18:19:23 -07:00
Sean Silva	0524a09cc7	[mlir] Tune error message for assertion. This assertion can fire in the case of different contexts as well, which is not difficult to do from Python bindings, for example.	2021-03-22 18:10:18 -07:00
Chris Lattner	79d7f618af	Rename FrozenRewritePatternList -> FrozenRewritePatternSet; NFC. This nicely aligns the naming with RewritePatternSet. This type isn't as widely used, but we keep a using declaration in to help with downstream consumption of this change. Differential Revision: https://reviews.llvm.org/D99131	2021-03-22 17:40:45 -07:00
Mehdi Amini	a0c776fc94	Add a mechanism for Dialects to customize printing/parsing operations when they are unregistered Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D99007	2021-03-23 00:40:03 +00:00
Chris Lattner	289ecccadd	Tidy up some docs. Depends on D99127. Differential Revision: https://reviews.llvm.org/D99130	2021-03-22 17:20:50 -07:00
Chris Lattner	dc4e913be9	[PatternMatch] Big mechanical rename OwningRewritePatternList -> RewritePatternSet and insert -> add. NFC This doesn't change APIs, this just cleans up the many in-tree uses of these names to use the new preferred names. We'll keep the old names around for a couple weeks to help transitions. Differential Revision: https://reviews.llvm.org/D99127	2021-03-22 17:20:50 -07:00
Chris Lattner	549e190236	[PatternRewriter] Rename OwningRewritePatternList -> RewritePatternSet and insert -> add This maintains the old name to have minimal source impact on downstream codes, and does not do the huge mechanical patch. I expect the huge mechanical patch to land sometime this week, but we can keep around the old names for a couple weeks to reduce impact on downstream projects. Differential Revision: https://reviews.llvm.org/D99119	2021-03-22 16:33:18 -07:00
Chris Lattner	6874726610	[PatternMatching] Add convenience insert method to OwningRewritePatternList. NFC. This allows adding a C function pointer as a matchAndRewrite style pattern, which is a very common case. This adopts it in ExpandTanh to show how it reduces a level of nesting. We could allow C++ lambdas here, but that doesn't work as well with type inference in the common case. Instead of: patterns.insert(convertTanhOp); you need to specify: patterns.insert<math::TanhOp>(convertTanhOp); which is boilerplate'y. Capturing state like this is very uncommon, so we choose to require clients to define their own structs and use the non-convenience method when they need to do so. Differential Revision: https://reviews.llvm.org/D99039	2021-03-22 11:18:21 -07:00
Chia-hung Duan	cec244354b	Fix the order of directives and the target string In the original structure, it will try to match CHECK-LABEL first then see if the subsequent doesn't have the target strings. This is not what we are expected. We are expecting the two functions which will be deleted should be matched before CHECK-LABEL. Also fixed the function names. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D99060	2021-03-22 11:10:12 -07:00
Rob Suderman	d7c44a5c78	[mlir][tosa] Fix tosa.mul to use tosa.apply_scale Multiply-shift requires wider compute types or CPU specific code to avoid premature truncation, apply_shift fixes this issue Also, Tosa's mul op supports different input / output types. Added path that sign-extends input values to int-32 values before multiplying. Differential Revision: https://reviews.llvm.org/D99011	2021-03-22 11:01:35 -07:00
Nicolas Vasilache	bcd6424f9b	[mlir][Linalg] Fix linalg on tensor fusion - Drop unnecessary occurrences of rewriter.eraseOp: dead linalg ops on tensors should be cleaned up by DCE. - reimplement the part of Linalg on fusion that constructs the body and block arguments: the previous implementation had too much magic. Instead this spells out all cases explicitly and asserts / introduces TODOs for incorrect cases. As a consequence, we can use the default traversal order for this pattern. Differential Revision: https://reviews.llvm.org/D99070	2021-03-22 13:29:40 +00:00
Adrian Kuegel	c691b9686b	[mlir] Add an option to still use bottom-up traversal GreedyPatternRewriteDriver was changed from bottom-up traversal to top-down traversal. Not all passes work yet with that change for traversal order. To give some time for fixing, add an option to allow to switch back to bottom-up traversal. Use this option in FusionOfTensorOpsPass which fails otherwise. Differential Revision: https://reviews.llvm.org/D99059	2021-03-22 09:49:44 +01:00
Stella Laurenzo	bdf4e93b2c	Fix extraneous context parameter in templated helper function. (missed in lattner's overall updates related to D99028)	2021-03-22 05:08:44 +00:00
Jacques Pienaar	113baa2b9f	Update examples post OwningRewritePatternList change	2021-03-21 15:15:54 -07:00
Chris Lattner	1d909c9a35	Remove the extraneous MLIRContext argument from populateWithGenerated. NFC.	2021-03-21 10:38:35 -07:00
Chris Lattner	ffde3acb1b	[ShapeDialect] Silence a build warning, NFC mlir/lib/Dialect/Shape/IR/Shape.cpp:573:26: warning: loop variable 'shape' is always a copy because the range of type '::mlir::Operation::operand_range' (aka 'mlir::OperandRange') does not return a reference [-Wrange-loop-analysis] for (const auto &shape : shapes()) { ^	2021-03-21 10:10:38 -07:00
Chris Lattner	3a506b31a3	Change OwningRewritePatternList to carry an MLIRContext with it. This updates the codebase to pass the context when creating an instance of OwningRewritePatternList, and starts removing extraneous MLIRContext parameters. There are many many more to be removed. Differential Revision: https://reviews.llvm.org/D99028	2021-03-21 10:06:31 -07:00
Chris Lattner	361b7d125b	[Canonicalizer] Process regions top-down instead of bottom up & reuse existing constants. This reapplies `b5d9a3c` / https://reviews.llvm.org/D98609 with a one line fix in processExistingConstants to skip() when erasing a constant we've already seen. Original commit message: 1) Change the canonicalizer to walk the function in top-down order instead of bottom-up order. This composes well with the "top down" nature of constant folding and simplification, reducing iterations and re-evaluation of ops in simple cases. 2) Explicitly enter existing constants into the OperationFolder table before canonicalizing. Previously we would "constant fold" them and rematerialize them, wastefully recreating a bunch fo constants, which lead to pointless memory traffic. Both changes together provide a 33% speedup for canonicalize on some mid-size CIRCT examples. One artifact of this change is that the constants generated in normal pattern application get inserted at the top of the function as the patterns are applied. Because of this, we get "inverted" constants more often, which is an aethetic change to the IR but does permute some testcases. Differential Revision: https://reviews.llvm.org/D99006	2021-03-20 16:30:15 -07:00
Butygin	7219b31d40	[mlir] Additional folding for SelectOp * Fold SelectOp when both true and false args are same SSA value * Fold some cmp + select patterns Differential Revision: https://reviews.llvm.org/D98576	2021-03-20 13:40:42 +03:00
Butygin	5657f93e78	[mlir] Canonicalize IfOp with trivial `then` and `else` bodies to list of SelectOp's * Do we need a threshold on maximum number of Yeild arguments processed (maximum number of SelectOp's to be generated)? * Had to modify some old IfOp tests to not get optimized by this pattern Differential Revision: https://reviews.llvm.org/D98592	2021-03-20 12:18:49 +03:00
Rob Suderman	e990fa2170	[mlir][tosa] Add tosa.reverse lowering to linalg.generic Reverse lowers to a linalg.generic op by reversing the read order in the index map. Differential Revision: https://reviews.llvm.org/D98997	2021-03-19 21:46:47 -07:00
Mehdi Amini	cdb6eb7e83	Update syntax for amx.tile_muli to use two Unit attr to mark the zext case This makes the annotation tied to the operand and the use of a keyword more explicit/readable on what it means. Differential Revision: https://reviews.llvm.org/D99001	2021-03-20 04:12:24 +00:00
Stella Laurenzo	8d05a28887	[mlir][python] Adapt to `segment_sizes` attribute type change. * Broken by https://reviews.llvm.org/rG1a75be0023cd80fd8560d689999a63d4368c90e6	2021-03-19 18:47:00 -07:00
Stella Laurenzo	d9343e6153	[mlir][python] Function decorator for capturing a FuncOp from a python function. * Moves this out of a test case where it was being developed to good effect and generalizes it. * Having tried a number of things like this, I think this balances concerns reasonably well. Differential Revision: https://reviews.llvm.org/D98989	2021-03-19 18:27:21 -07:00
River Riddle	caddfbd2a9	[mlir][docs] Remove the BuiltinDialect documentation from langref and generate it from ODS Now that all of the builtin dialect is generated from ODS, its documentation in LangRef can be split out and replaced with references to Dialects/Builtin.md. LangRef is quite crusty right now and should really have a full cleanup done in a followup. Differential Revision: https://reviews.llvm.org/D98562	2021-03-19 18:21:33 -07:00
Chris Lattner	b2f232b830	[testsuite] Make testsuite more stable vs canonicalization change. NFC. Differential Revision: https://reviews.llvm.org/D98998	2021-03-19 18:11:12 -07:00
River Riddle	1a75be0023	[mlir][NFC] Use the native range instead of APInt when computing operand ranges This removes the need to construct an APInt for each value, given that it is guaranteed to contain 32 bit elements. BEGIN_PUBLIC ...text exposed to open source public git repo... END_PUBLIC	2021-03-19 17:11:46 -07:00
River Riddle	d75a611afb	[mlir] Update `simplifyRegions` to use RewriterBase for erasure notifications This allows for notifying callers when operations/blocks get erased, which is especially useful for the greedy pattern driver. The current greedy pattern driver "throws away" all information on constants in the operation folder because it doesn't know if they get erased or not. By passing in RewriterBase, we can directly track this and prevent the need for the pattern driver to rediscover all of the existing constants. In some situations this cuts the compile time of the canonicalizer in half. Differential Revision: https://reviews.llvm.org/D98755	2021-03-19 16:33:54 -07:00
River Riddle	cde203e0f9	[mlir][Pass] Coalesce dynamic pass pipelines before running This was missed when dynamic pass pipelines were added, and is necessary for maximizing the performance/parallelism potential of the pass pipeline.	2021-03-19 14:35:42 -07:00
Stella Laurenzo	436c6c9c20	NFC: Break up the mlir python bindings into individual sources. * IRModules.cpp -> (IRCore.cpp, IRAffine.cpp, IRAttributes.cpp, IRTypes.cpp). * The individual pieces now compile in the 5-15s range whereas IRModules.cpp was starting to approach a minute (didn't capture a before time). * More fine grained splitting is possible, but this represents the most obvious. Differential Revision: https://reviews.llvm.org/D98978	2021-03-19 13:33:51 -07:00
Butygin	a531bbd9ad	[MLIR] Test pattern benefit sorting between operation specific and operation agnostic patterns. Previously low benefit op-specific patterns never had a chance to match even if high benefit op-agnostic pattern failed to match. This was already fixed upstream, this commit just adds testscase Differential Revision: https://reviews.llvm.org/D98513	2021-03-19 23:11:56 +03:00
Benjamin Kramer	6327a7cfd7	[mlir][Linalg] Make LLVM_DEBUG region bigger to avoid warnings in Release builds Transforms.cpp:586:16: error: unused variable 'v' [-Werror,-Wunused-variable] for (Value v : operands) ^	2021-03-19 20:56:59 +01:00
Rob Suderman	47286fc530	[mlir][tosa] Add tosa.cast to linalg lowering Handles lowering from the tosa CastOp to the equivalent linalg lowering. It includes support for interchange between bool, int, and floating point. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D98828	2021-03-19 11:48:37 -07:00
Rob Suderman	1b7498120d	[mlir][tosa] Add tosa.logical_* to linalg lowerings Adds lowerings for logical_* boolean operations. Each of these ops only operate on booleans allowing simple lowerings. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D98910	2021-03-19 11:30:42 -07:00
Stella Laurenzo	d4cba4a188	[mlir][linalg] Add structured op builders from python opdsl. * Makes the wrapped functions of the `@linalg_structured_op` decorator callable such that they emit IR imperatively when invoked. * There are numerous TODOs that I will keep working through to achieve generality. * Will true up exception handling tests as the feature progresses (for things that are actually errors once everything is implemented). * Includes the addition of an `isinstance` method on concrete types in the Python API. Differential Revision: https://reviews.llvm.org/D98754	2021-03-19 11:20:36 -07:00
thomasraoux	3587728ed5	[mlir] Fix cuda integration test failure	2021-03-19 10:33:55 -07:00
Nicolas Vasilache	5b2d8503d1	[mlir][Linalg] NFC - Expose helper function `substituteMin`.	2021-03-19 16:26:52 +00:00
Christian Sigg	a5f9cda173	[mlir] Rename gpu-to-llvm pass implementation file Also remove populate patterns function and binary annotation name option. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98930	2021-03-19 13:58:13 +01:00
Alexander Belyaev	628f5c9da2	[mlir] Add a roundtrip test for 'linalg.tiled_loop' on buffers. https://llvm.discourse.group/t/rfc-add-linalg-tileop/2833 Differential Revision: https://reviews.llvm.org/D98900	2021-03-19 09:38:20 +01:00
Christian Sigg	74ffe8dc59	[mlir] Remove ConvertKernelFuncToBlob All users have been converted to gpu::SerializeToBlobPass. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98928	2021-03-19 09:33:47 +01:00
Christian Sigg	a825fb2c07	[mlir] Remove mlir-rocm-runner This change combines for ROCm what was done for CUDA in D97463, D98203, D98360, and D98396. I did not try to compile SerializeToHsaco.cpp or test mlir/test/Integration/GPU/ROCM because I don't have an AMD card. I fixed the things that had obvious bit-rot though. Reviewed By: whchung Differential Revision: https://reviews.llvm.org/D98447	2021-03-19 00:24:10 -07:00
Andrew Young	f178c13fa8	[mlir] Support use-def cycles in graph regions during regionDCE When deleting operations in DCE, the algorithm uses a post-order walk of the IR to ensure that value uses were erased before value defs. Graph regions do not have the same structural invariants as SSA CFG, and this post order walk could delete value defs before uses. This problem is guaranteed to occur when there is a cycle in the use-def graph. This change stops DCE from visiting the operations and blocks in any meaningful order. Instead, we rely on explicitly dropping all uses of a value before deleting it. Reviewed By: mehdi_amini, rriddle Differential Revision: https://reviews.llvm.org/D98919	2021-03-18 23:06:45 -07:00
Vladislav Vinogradov	270a336ff4	[mlir] Fix Python bindings tests failure in Debug mode after D98474 Add extra `type.isa<FloatType>()` check to `FloatAttr::get(Type, double)` method. Otherwise it tries to call `type.cast<FloatType>()`, which fails with assertion in Debug mode. The `!type.isa<FloatType>()` case just redirercts the call to `FloatAttr::get(Type, APFloat)`, which will perform the actual check and emit appropriate error. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D98764	2021-03-19 05:32:32 +00:00
Rob Suderman	286a9d467e	[mlir][tosa] Add lowering for tosa.rescale to linalg.generic This adds a tosa.apply_scale operation that handles the scaling operation common to quantized operatons. This scalar operation is lowered in TosaToStandard. We use a separate ApplyScale factorization as this is a replicable pattern within TOSA. ApplyScale can be reused within pool/convolution/mul/matmul for their quantized variants. Tests are added to both tosa-to-standard and tosa-to-linalg-on-tensors that verify each pass is correct. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D98753	2021-03-18 16:14:05 -07:00
Rob Suderman	5627564fe0	[mlir][tosa] Add tosa.concat to subtensor inserts lowering Includes lowering for tosa.concat with indice computation with subtensor insert operations. Includes tests along two different indices. Differential Revision: https://reviews.llvm.org/D98813	2021-03-18 15:59:07 -07:00
thomasraoux	44f24f3996	[mlir] Fix build failure due to `1a572f4`	2021-03-18 14:58:32 -07:00
Lei Zhang	fcc1ce0093	Revert "Revert "[mlir] Add linalg.fill bufferization conversion"" This reverts commit `c69550c132` with proper fix applied.	2021-03-18 17:21:58 -04:00
Mehdi Amini	c69550c132	Revert "[mlir] Add linalg.fill bufferization conversion" This reverts commit `32a744ab20`. CI is broken: test/Dialect/Linalg/bufferize.mlir:274:12: error: CHECK: expected string not found in input // CHECK: %[[MEMREF:.*]] = tensor_to_memref %[[IN]] : memref<?xf32> ^	2021-03-18 21:18:07 +00:00
Eugene Zhulenev	32a744ab20	[mlir] Add linalg.fill bufferization conversion `BufferizeAnyLinalgOp` fails because `FillOp` is not a `LinalgGenericOp` and it fails while reading operand sizes attribute. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D98671	2021-03-18 13:41:16 -07:00
thomasraoux	1a572f4509	[mlir] Add vector op support to cuda-runner including vector.print Differential Revision: https://reviews.llvm.org/D97346	2021-03-18 13:03:08 -07:00
thomasraoux	16947650d5	[mlir][linalg] Extend linalg vectorization to support non-identity input maps This propagates the affine map to transfer_read op in case it is not a minor identity map. Differential Revision: https://reviews.llvm.org/D98523	2021-03-18 12:32:35 -07:00
lorenzo chelini	4c782a24d9	[mlir] Fix typo in SCF.cpp (NFC)	2021-03-18 19:15:33 +01:00
Alexander Belyaev	283799157e	[mlir][linalg] Add support for memref inputs/outputs for `linalg.tiled_loop`. Also use `ArrayAttr` to pass iterator pass to the TiledLoopOp builder. Differential Revision: https://reviews.llvm.org/D98871	2021-03-18 16:11:03 +01:00
David Truby	de155f4af2	[MLIR][OpenMP] Pretty printer and parser for omp.wsloop Co-authored-by: Kiran Chandramohan <kiran.chandramohan@arm.com> Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D92327	2021-03-18 13:37:01 +00:00
Vladislav Vinogradov	02834e1bd9	[mlir][ODS] Get rid of limitations in rewriters generator Do not limit the number of arguments in rewriter pattern. Introduce separate `FmtStrVecObject` class to handle format of variadic `std::string` array. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97839	2021-03-18 12:21:06 +03:00
Frederik Gossen	1ce70c15ed	[MLIR] Canonicalize broadcast operations on single shapes This covers cases that are not folded away because the extent tensor type becomes more concrete in the process. Differential Revision: https://reviews.llvm.org/D98782	2021-03-18 08:59:50 +01:00
River Riddle	5a8d5a2859	[mlir][Toy] Tidy up the first half of Chapter 2. This performs a few rewordings, expands on a few parts, etc.	2021-03-17 17:37:28 -07:00
River Riddle	ee74860597	[mlir][Toy] Update the tutorial to use tablegen for dialect declarations This was missed when the feature was originally added. Differential Revision: https://reviews.llvm.org/D87060	2021-03-17 17:37:28 -07:00
Rob Suderman	f4bb076a44	[mlir][tosa] Add tosa.slice to std.subtensor lowering Lowering to subtensor is added for tosa.slice operator. Differential Revision: https://reviews.llvm.org/D98825	2021-03-17 17:28:18 -07:00
River Riddle	d70185ec48	[mlir][IR] Support parsing hex float values in the DialectSymbolParser This has been a TODO for a while, and prevents breakages for attributes/types that contain floats that can't roundtrip outside of the hex format. Differential Revision: https://reviews.llvm.org/D98808	2021-03-17 13:52:32 -07:00
Aart Bik	f2557cf7ed	[mlir][cpu-runner] register all llvm ir dialects This fixes broken JIT functionality on emulator platforms. With Alex' recent movement towards squashing llvm ir dialects into target specific dialects, we now must ensure these dialects are registered to the cpu runner to ensure JIT can lower this to proper LLVM IR before handing this off to the backend. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98727	2021-03-17 10:05:46 -07:00
Aart Bik	9705cafc0f	[mlir][amx] regression test for tile-muli (all zero/sign-extension combinations) Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98742	2021-03-17 10:04:04 -07:00
Ahmed Taei	f5963944d9	Add arm_neon.sdot operation Differential Revision: https://reviews.llvm.org/D98198	2021-03-17 08:24:58 -07:00
Vladislav Vinogradov	fee9054232	[mlir][ODS] Support specialized Attribute class for Enums Add a feature to `EnumAttr` definition to generate specialized Attribute class for the particular enumeration. This class will inherit `StringAttr` or `IntegerAttr` and will override `classof` and `getValue` methods. With this class the enumeration predicate can be checked with simple RTTI calls (`isa`, `dyn_cast`) and it will return the typed enumeration directly instead of raw string/integer. Based on the following discussion: https://llvm.discourse.group/t/rfc-add-enum-attribute-decorator-class/2252 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97836	2021-03-17 16:44:24 +03:00
Adrian Kuegel	4a8c01a02b	Move BaseOpWithOffsetSizesAndStrides to OpBase.td It is used both by the Standard dialect and the MemRef dialect. Differential Revision: https://reviews.llvm.org/D98777	2021-03-17 13:54:04 +01:00
lorenzo chelini	0a74a7161b	[mlir] scf::ForOp: Drop iter arguments (and corresponding result) with no use 'ForOpIterArgsFolder' can now remove iterator arguments (and corresponding results) with no use. Example: ``` %cst = constant 32 : i32 %0:2 = scf.for %arg1 = %lb to %ub step %step iter_args(%arg2 = %arg0, %arg3 = %cst) -> (i32, i32) { %1 = addu %arg2, %cst : i32 scf.yield %1, %1 : i32, i32 } use(%0#0) ``` %arg3 is not used in the block, and its corresponding result `%0#1` has no use, thus remove the iter argument. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D98711	2021-03-17 12:06:17 +00:00
Stephan Herhut	5837fdc4cc	[mlir][llvm] Pass struct results as parameter in c wrapper Returning structs directly in LLVM does not necessarily align with the C ABI of the platform. This might happen to work on Linux but for small structs this breaks on Windows. With this change, the wrappers work platform independently. Differential Revision: https://reviews.llvm.org/D98725	2021-03-17 12:58:52 +01:00
Julian Gross	ea51e7d4f8	Added documentation for SSA like property in Bufferization. Added additional information about the SSA like properties that has to be fulfilled in the bufferization steps. Differential Revision: https://reviews.llvm.org/D95522	2021-03-17 12:49:37 +01:00
Gaurav Shukla	8e3075c2b0	[MLIR] Fix lowering of Affine IfOp in the presence of yield values. This commit fixes the lowering of `Affine.IfOp` to `SCF.IfOp` in the presence of yield values. These changes have been made as a part of `-lower-affine` pass. Differential Revision: https://reviews.llvm.org/D98760	2021-03-17 16:33:32 +05:30
River Riddle	e60d57451e	[mlir][Python] Fix test broken after D98474	2021-03-16 16:52:20 -07:00
River Riddle	caa7038a89	[mlir][IR] Move the remaining builtin attributes to ODS. With this revision, all builtin attributes and types will have been moved to the ODS generator. Differential Revision: https://reviews.llvm.org/D98474	2021-03-16 16:31:53 -07:00
River Riddle	425e11eea1	[mlir][AttrTypeDefGen] Add support for custom parameter comparators Some parameters to attributes and types rely on special comparison routines other than operator== to ensure equality. This revision adds support for those parameters by allowing them to specify a `comparator` code block that determines if `$_lhs` and `$_rhs` are equal. An example of one of these paramters is APFloat, which requires `bitwiseIsEqual` for bitwise comparison (which we want for attribute equality). Differential Revision: https://reviews.llvm.org/D98473	2021-03-16 16:31:53 -07:00
River Riddle	1f13963ec1	[mlir][pdl] Cast the OperationPosition to Position to fix MSVC miscompile If we don't cast, MSVC picks an overload that hasn't been defined yet(not sure why) and miscompiles.	2021-03-16 16:11:14 -07:00
Eugene Zhulenev	74f6138bd9	[mlir] Add lowering from math::Log1p to LLVM [mlir] Add lowering from math::Log1p to LLVM Reviewed By: cota Differential Revision: https://reviews.llvm.org/D98662	2021-03-16 15:59:09 -07:00
River Riddle	85ab413b53	[mlir][PDL] Add support for variadic operands and results in the PDL byte code Supporting ranges in the byte code requires additional complexity, given that a range can't be easily representable as an opaque void , as is possible with the existing bytecode value types (Attribute, Type, Value, etc.). To enable representing a range with void , an auxillary storage is used for the actual range itself, with the pointer being passed around in the normal byte code memory. For type ranges, a TypeRange is stored. For value ranges, a ValueRange is stored. The above problem represents a majority of the complexity involved in this revision, the rest is adapting/adding byte code operations to support the changes made to the PDL interpreter in the parent revision. After this revision, PDL will have initial end-to-end support for variadic operands/results. Differential Revision: https://reviews.llvm.org/D95723	2021-03-16 13:20:19 -07:00
River Riddle	3a833a0e0e	[mlir][PDL] Add support for variadic operands and results in the PDL Interpreter This revision extends the PDL Interpreter dialect to add support for variadic operands and results, with ranges of these values represented via the recently added !pdl.range type. To support this extension, three new operations have been added that closely match the single variant: * pdl_interp.check_types : Compare a range of types with a known range. * pdl_interp.create_types : Create a constant range of types. * pdl_interp.get_operands : Get a range of operands from an operation. * pdl_interp.get_results : Get a range of results from an operation. * pdl_interp.switch_types : Switch on a range of types. This revision handles adding support in the interpreter dialect and the conversion from PDL to PDLInterp. Support for variadic operands and results in the bytecode will be added in a followup revision. Differential Revision: https://reviews.llvm.org/D95722	2021-03-16 13:20:19 -07:00
River Riddle	1eb6994d6a	[mlir][PDL] Add support for variadic operands and results in PDL This revision extends the PDL dialect to add support for variadic operands and results, with ranges of these values represented via the recently added !pdl.range type. To support this extension, three new operations have been added that closely match the single variant: * pdl.operands : Define a range of input operands. * pdl.results : Extract a result group from an operation. * pdl.types : Define a handle to a range of types. Support for these in the pdl interpreter dialect and byte code will be added in followup revisions. Differential Revision: https://reviews.llvm.org/D95721	2021-03-16 13:20:18 -07:00
River Riddle	02c4c0d5b2	[mlir][pdl] Remove CreateNativeOp in favor of a more general ApplyNativeRewriteOp. This has a numerous amount of benefits, given the overly clunky nature of CreateNativeOp: * Users can now call into arbitrary rewrite functions from inside of PDL, allowing for more natural interleaving of PDL/C++ and enabling for more of the pattern to be in PDL. * Removes the need for an additional set of C++ functions/registry/etc. The new ApplyNativeRewriteOp will use the same PDLRewriteFunction as the existing RewriteOp. This reduces the API surface area exposed to users. This revision also introduces a new PDLResultList class. This class is used to provide results of native rewrite functions back to PDL. We introduce a new class instead of using a SmallVector to simplify the work necessary for variadics, given that ranges will require some changes to the structure of PDLValue. Differential Revision: https://reviews.llvm.org/D95720	2021-03-16 13:20:18 -07:00
River Riddle	242762c9a3	[mlir][pdl] Restructure how results are represented. Up until now, results have been represented as additional results to a pdl.operation. This is fairly clunky, as it mismatches the representation of the rest of the IR constructs(e.g. pdl.operand) and also isn't a viable representation for operations returned by pdl.create_native. This representation also creates much more difficult problems when factoring in support for variadic result groups, optional results, etc. To resolve some of these problems, and simplify adding support for variable length results, this revision extracts the representation for results out of pdl.operation in the form of a new `pdl.result` operation. This operation returns the result of an operation at a given index, e.g.: ``` %root = pdl.operation ... %result = pdl.result 0 of %root ``` Differential Revision: https://reviews.llvm.org/D95719	2021-03-16 13:20:18 -07:00
Aart Bik	b85d3e27ad	[mlir][amx] reformatted examples Examples were missing the underscore of the actual ops format. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98723	2021-03-16 10:24:57 -07:00
Aart Bik	b388bbd3f9	[mlir][amx] blocked tilezero integration test This adds a new integration test. However, it also adapts to a recent memref.XXX change for existing tests Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98680	2021-03-16 08:49:31 -07:00
Nicolas Vasilache	b661788b77	[mlir] NFC - Expose GlobalCreator so it can be reused.	2021-03-16 12:29:04 +00:00
Adrian Kuegel	2995e161b0	[mlir]: Add canonicalization for dim of 1D alloc of size rank. Differential Revision: https://reviews.llvm.org/D97542	2021-03-16 10:38:57 +01:00
David Zarzycki	1d297f9064	[lit] Sort test start times based on prior test timing data Lit as it exists today has three hacks that allow users to run tests earlier: 1) An entire test suite can set the `is_early` boolean. 2) A very recently introduced "early_tests" feature. 3) The `--incremental` flag forces failing tests to run first. All of these approaches have problems. 1) The `is_early` feature was until very recently undocumented. Nevertheless it still lacks testing and is a imprecise way of optimizing test starting times. 2) The `early_tests` feature requires manual updates and doesn't scale. 3) `--incremental` is undocumented, untested, and it requires modifying the source file system by "touching" the file. This "touch" based approach is arguably a hack because it confuses editors (because it looks like the test was modified behind the back of the editor) and "touching" the test source file doesn't work if the test suite is read only from the perspective of `lit` (via advanced filesystem/build tricks). This patch attempts to simplify and address all of the above problems. This patch formalizes, documents, tests, and defaults lit to recording the execution time of tests and then reordering all tests during the next execution. By reordering the tests, high core count machines run faster, sometimes significantly so. This patch also always runs failing tests first, which is a positive user experience win for those that didn't know about the hidden `--incremental` flag. Finally, if users want, they can _optionally_ commit the test timing data (or a subset thereof) back to the repository to accelerate bots and first-time runs of the test suite. Reviewed By: jhenderson, yln Differential Revision: https://reviews.llvm.org/D98179	2021-03-16 05:23:04 -04:00
Lorenzo Chelini	fd7eee64c5	scf::ForOp: Fold away iterator arguments with no use and for which the corresponding input is yielded Enhance 'ForOpIterArgsFolder' to remove unused iteration arguments in a scf::ForOp. If the block argument corresponding to the given iterator has no use and the yielded value equals the input, we fold it away. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D98503	2021-03-16 07:01:25 +00:00
Aart Bik	6ad7b97e20	[mlir][amx] Add Intel AMX dialect (architectural-specific vector dialect) The Intel Advanced Matrix Extensions (AMX) provides a tile matrix multiply unit (TMUL), a tile control register (TILECFG), and eight tile registers TMM0 through TMM7 (TILEDATA). This new MLIR dialect provides a bridge between MLIR concepts like vectors and memrefs and the lower level LLVM IR details of AMX. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D98470	2021-03-15 17:59:05 -07:00
Alex Zinenko	b868a3edad	[mlir] fix SPIR-V CPU and Vulkan runners after `e2310704d8` The commit in question changed the syntax but did not update the runner tests. This also required registering the MemRef dialect for custom parser to work correctly.	2021-03-15 18:36:58 +01:00
Christopher Tetreault	39970764af	[CMake] Require python 3.6 if enabling LLVM test targets The lit test suite uses python 3.6 features. Rather than a strange python syntax error upon running the lit tests, we will require the correct version in CMake. Reviewed By: serge-sans-paille, yln Differential Revision: https://reviews.llvm.org/D95635	2021-03-15 09:50:39 -07:00
Alex Zinenko	0aceb61665	[mlir] make memref.cast implement ViewLikeOpInterface This was seemingly dropped in `e2310704d8`, potentially due to a misrebase. The absence of this trait makes aliasing analysis incorrect, leading to, e.g., buffer deallocation pass inserting deallocations too early.	2021-03-15 17:21:27 +01:00
Alex Zinenko	7aa6f3aa0c	[mlir] fix integration tests post `e2310704d8` The commit in question moved some ops across dialects but did not update some of the target-specific integration tests that use these ops, presumably because the corresponding target hardware was not available. Fix these tests.	2021-03-15 14:41:27 +01:00
Alex Zinenko	e82a30bdce	[mlir] enable Python bindings for the MemRef dialect A previous commit moved multiple ops from Standard to MemRef dialect. Some of these ops are exercised in Python bindings. Enable bindings for the newly created MemRef dialect and update a test accordingly.	2021-03-15 14:07:51 +01:00
Alex Zinenko	0fb4a201c0	[mlir] fix shared-lib build fallout of `e2310704d8` The patch in question broke the build with shared libraries due to missing dependencies, one of which would have been circular between MLIRStandard and MLIRMemRef if added. Fix this by moving more code around and swapping the dependency direction. MLIRMemRef now depends on MLIRStandard, but MLIRStandard does _not_ depend on MLIRMemRef. Arguably, this is the right direction anyway since numerous libraries depend on MLIRStandard and don't necessarily need to depend on MLIRMemref. Other otable changes include: - some EDSC code is moved inline to MemRef/EDSC/Intrinsics.h because it creates MemRef dialect operations; - a utility function related to shape moved to BuiltinTypes.h/cpp because it only realtes to shaped types and not any particular dialect (standard dialect is erroneously believed to contain MemRefType); - a Python test for the standard dialect is disabled completely because the ops it tests moved to the new MemRef dialect, but it is not exposed to Python bindings, and the change for that is non-trivial.	2021-03-15 13:41:38 +01:00
Julian Gross	e2310704d8	[MLIR] Create memref dialect and move dialect-specific ops from std. Create the memref dialect and move dialect-specific ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp AssumeAlignmentOp -> MemRef_AssumeAlignmentOp DeallocOp -> MemRef_DeallocOp DimOp -> MemRef_DimOp MemRefCastOp -> MemRef_CastOp MemRefReinterpretCastOp -> MemRef_ReinterpretCastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp LoadOp -> MemRef_LoadOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp SubViewOp -> MemRef_SubViewOp TransposeOp -> MemRef_TransposeOp TensorLoadOp -> MemRef_TensorLoadOp TensorStoreOp -> MemRef_TensorStoreOp TensorToMemRefOp -> MemRef_BufferCastOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D98041	2021-03-15 11:14:09 +01:00
Alex Zinenko	a88371490d	[mlir] better formatting in interface docs Start the description from a new line instead of putting the first paragraph in the section header. Wrap the class name in backticks to make it clear that it relates to the code.	2021-03-15 11:10:32 +01:00
Alex Zinenko	03085156ec	[mlir] fix cmake for generating data layout documentation	2021-03-15 11:02:03 +01:00
Alex Zinenko	40d8e4d3f9	Revert "[Canonicalizer] Process regions top-down instead of bottom up & reuse existing constants." This reverts commit `b5d9a3c923`. The commit introduced a memory error in canonicalization/operation walking that is exposed when compiled with ASAN. It leads to crashes in some "release" configurations.	2021-03-15 10:27:55 +01:00
Frederik Gossen	b55f424ffc	[MLIR] Add canonicalization for `shape.broadcast` Remove redundant operands and fold if only one left. Differential Revision: https://reviews.llvm.org/D98402	2021-03-15 10:11:28 +01:00
Frederik Gossen	2a71f95767	[MLIR] Allow compatible shapes in `Elementwise` operations Differential Revision: https://reviews.llvm.org/D98186	2021-03-15 09:56:20 +01:00
Matthias Springer	581672be04	[mlir][AVX512] Add while loop-based sparse vector-vector dot product variants. Differential Revision: https://reviews.llvm.org/D98480	2021-03-15 16:59:10 +09:00
Chris Lattner	91a6ad5ad8	[m_Constant] Check #operands/results before hasTrait() We know that all ConstantLike operations have one result and no operands, so check this first before doing the trait check. This change speeds up Canonicalize on a CIRCT testcase by ~5%. Differential Revision: https://reviews.llvm.org/D98615	2021-03-14 20:14:19 -07:00
Chris Lattner	b5d9a3c923	[Canonicalizer] Process regions top-down instead of bottom up & reuse existing constants. Two changes: 1) Change the canonicalizer to walk the function in top-down order instead of bottom-up order. This composes well with the "top down" nature of constant folding and simplification, reducing iterations and re-evaluation of ops in simple cases. 2) Explicitly enter existing constants into the OperationFolder table before canonicalizing. Previously we would "constant fold" them and rematerialize them, wastefully recreating a bunch fo constants, which lead to pointless memory traffic. Both changes together provide a 33% speedup for canonicalize on some mid-size CIRCT examples. One artifact of this change is that the constants generated in normal pattern application get inserted at the top of the function as the patterns are applied. Because of this, we get "inverted" constants more often, which is an aethetic change to the IR but does permute some testcases. Differential Revision: https://reviews.llvm.org/D98609	2021-03-14 18:21:42 -07:00
Aart Bik	e7ee4eaaf7	[mlir][sparse] disable nonunit stride dense vectorization This is a temporary work-around to get our all-annotations-all-flags stress testing effort run clean. In the long run, we want to provide efficient implementations of strided loads and stores though Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D98563	2021-03-12 16:49:32 -08:00
Nikita Popov	42eb658f65	[OpaquePtrs] Remove some uses of type-less CreateGEP() (NFC) This removes some (but not all) uses of type-less CreateGEP() and CreateInBoundsGEP() APIs, which are incompatible with opaque pointers. There are a still a number of tricky uses left, as well as many more variation APIs for CreateGEP.	2021-03-12 21:01:16 +01:00
Eugene Zhulenev	39b2cd4009	[mlir] Annotate functions used only in debug mode with LLVM_ATTRIBUTE_UNUSED Functions used only in `assert` cause warnings in release mode Reviewed By: mehdi_amini, dcaballe, ftynse Differential Revision: https://reviews.llvm.org/D98476	2021-03-12 11:25:46 -08:00
Alex Zinenko	4affd0c40e	[mlir] fix a memory leak in NestedPattern NestedPattern uses a BumpPtrAllocator to store child (nested) pattern objects to decrease the overhead of dynamic allocation. This assumes all allocations happen inside the allocator that will be freed as a whole. However, NestedPattern contains `std::function` as a member, which allocates internally using `new`, unaware of the BumpPtrAllocator. Since NestedPattern only holds pointers to the nested patterns allocated in the BumpPtrAllocator, it never calls their destructors, so the destructor of the `std::function`s they contain are never called either, leaking the allocated memory. Make NestedPattern explicitly call destructors of nested patterns. This additionally requires to actually copy the nested patterns in copy-construction and copy-assignment instead of just sharing the pointer to the arena-allocated list of children to avoid double-free. An alternative solution would be to add reference counting to the list of arena-allocated list of children. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D98485	2021-03-12 18:52:14 +01:00
Christian Sigg	1ef544d4a9	[mlir] Remove mlir-cuda-runner Change CUDA integration tests to use mlir-opt + mlir-cpu-runner instead. Depends On D98203 Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D98396	2021-03-12 14:06:43 +01:00
Alex Zinenko	be5b844a35	[mlir] fix memory leak on failure path in parser Forward references to blocks lead to `Block`s being allocated in the parser, but they are not necessarily included into a region if parsing fails, leading to a leak. Clean them up in parser destructor. Reviewed By: rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D98403	2021-03-12 09:24:08 +01:00
Marius Brehler	849f8183fb	[mlir] Fix ConstantOp verifier This restricts the attributes to integers for constants of type IndexType. So far an attribute like StringAttr as in %c1 = constant "" : index is valid. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D98216	2021-03-12 08:49:25 +01:00
Sergei Grechanik	fd2b08969b	[mlir][Vector] Lowering of transfer_read/write to vector.load/store This patch introduces progressive lowering patterns for rewriting vector.transfer_read/write to vector.load/store and vector.broadcast in certain supported cases. Reviewed By: dcaballe, nicolasvasilache Differential Revision: https://reviews.llvm.org/D97822	2021-03-11 18:17:51 -08:00
Sergei Grechanik	46ef6ffdaf	[NFC] Test commit. Add empty lines.	2021-03-11 17:31:20 -08:00
Mehdi Amini	e1364f1068	Replace use of OperationState with builder::create in GPU Kernel Outlining (NFC) OperationState is a low level API that is rarely indicated, the builder API convenient wrapper is preferred when possible.	2021-03-12 00:14:02 +00:00
Diego Caballero	0fd0fb5329	Reland: [mlir][Affine][Vector] Add initial support for 'iter_args' to Affine vectorizer. This patch adds support for vectorizing loops with 'iter_args' when those loops are not a vector dimension. This allows vectorizing outer loops with an inner 'iter_args' loop (e.g., reductions). Vectorizing scenarios where 'iter_args' loops are vector dimensions would require more work (e.g., analysis, generating horizontal reduction, etc.) not included in this patch. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97892	2021-03-12 01:08:28 +02:00
Diego Caballero	96891f0418	Reland: [mlir][Vector][Affine] Improve affine vectorizer algorithm This patch replaces the root-terminal vectorization approach implemented in the Affine vectorizer with a topological order approach that vectorizes all the operations within the target loop nest. These are the most important changes introduced by the new algorithm: * Removed tracking of root and terminal ops. Existing vectorization functionality is preserved and extended so that loop nests without root-terminal chains can be vectorized. * Vectorizing a loop nest now only requires a single topological traversal. * A new vector loop nest is incrementally built along the vectorization process. The original scalar loop is kept intact. No cloning guard is needed to recover the scalar loop if vectorization fails. This approach also simplifies the challenging task of replacing a loop operation amid the vectorization process without invalidating the analysis information that depends on the original loop. * Vectorization of specific operations has been implemented as independent, preparing them to be moved to a potential vectorization interface. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97442	2021-03-12 00:19:50 +02:00
River Riddle	31bb8efd69	[mlir][StorageUniquer] Properly call the destructor on non-trivially destructible storage instances This allows for storage instances to store data that isn't uniqued in the context, or contain otherwise non-trivial logic, in the rare situations that they occur. Storage instances with trivial destructors will still have their destructor skipped. A consequence of this is that the storage instance definition must be visible from the place that registers the type. Differential Revision: https://reviews.llvm.org/D98311	2021-03-11 11:35:32 -08:00
Diego Caballero	ed193bce9d	[mlir][Vector][Affine] Fix heap-use-after-free in vectorizer This patch fixes a heap-use-after-free introduced by the recent changes in the vectorizer: https://reviews.llvm.org/rG95db7b4aeaad590f37720898e339a6d54313422f The problem is due to the way candidate loops are visited. All candidate loops are pattern-matched beforehand using the 'NestedMatch' utility. These matches may intersect with each other so it may happen that we try to vectorize a loop that was previously vectorized. The new vectorization algorithm replaces the original loops that are vectorized with new loops and, therefore, any reference to the original loops in the pre-computed matches becomes invalid. This patch fixes the problem by classifying the candidate matches into buckets before vectorization. Each bucket contains all the matches that intersect. The vectorizer uses these buckets to make sure that we only vectorize one match from each bucket, at most. Differential Revision: https://reviews.llvm.org/D98382	2021-03-11 20:44:07 +02:00
Nikita Popov	f3f0c6cd47	[mlir] Remove uses of type-less CreateLoad() APIs (NFC) For the use in LLVMOps.td I used the getPointerElementType() escape hatch, as it's not obvious to me how the load type should be properly obtained here.	2021-03-11 18:39:20 +01:00
Alex Zinenko	27104390e8	[mlir] fix cmake build	2021-03-11 18:22:00 +01:00
Alex Zinenko	3ba14fa0ce	[mlir] Introduce data layout modeling subsystem Data layout information allows to answer questions about the size and alignment properties of a type. It enables, among others, the generation of various linear memory addressing schemes for containers of abstract types and deeper reasoning about vectors. This introduces the subsystem for modeling data layouts in MLIR. The data layout subsystem is designed to scale to MLIR's open type and operation system. At the top level, it consists of attribute interfaces that can be implemented by concrete data layout specifications; type interfaces that should be implemented by types subject to data layout; operation interfaces that must be implemented by operations that can serve as data layout scopes (e.g., modules); and dialect interfaces for data layout properties unrelated to specific types. Built-in types are handled specially to decrease the overall query cost. A concrete default implementation of these interfaces is provided in the new Target dialect. Defaults for built-in types that match the current behavior are also provided. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97067	2021-03-11 16:54:47 +01:00
Arpith C. Jacob	b4a516cc43	[mlir] Add LLVM loop codegen options to control software pipelining Support specifying the II and disabling pipelining. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98420	2021-03-11 16:46:44 +01:00
Tres Popp	25a20b8aa6	[mlir] Correct verifyCompatibleShapes verifyCompatibleShapes is not transitive. Create an n-ary version and update SameOperandShapes and SameOperandAndResultShapes traits to use it. Differential Revision: https://reviews.llvm.org/D98331	2021-03-11 13:04:10 +01:00
Julian Gross	2aef202981	[mlir] Fix invalid hoisting of dependent allocs in buffer hoisting pass. Buffer hoisting moves allocs upwards although it has dependency within its nested region. This patch fixes this issue. https://bugs.llvm.org/show_bug.cgi?id=49142 Differential Revision: https://reviews.llvm.org/D98248	2021-03-11 11:46:16 +01:00
Christian Sigg	bafe418d12	[mlir] Change test-gpu-to-cubin to derive from SerializeToBlobPass Clean-up after D98279, remove one call to createConvertGPUKernelToBlobPass(). Depends On D98203 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D98360	2021-03-11 10:42:20 +01:00
Frederik Gossen	b975e3b5aa	[MLIR] Add canoncalization for `shape.is_broadcastable` Canonicalize `is_broadcastable` to constant true if fewer than 2 unique shape operands. Eliminate redundant operands, otherwise. Differential Revision: https://reviews.llvm.org/D98361	2021-03-11 10:10:34 +01:00
Christian Sigg	2224221fb3	[mlir] Add NVVM to CUBIN conversion to mlir-opt If MLIR_CUDA_RUNNER_ENABLED, register a 'gpu-to-cubin' conversion pass to mlir-opt. The next step is to switch CUDA integration tests from mlir-cuda-runner to mlir-opt + mlir-cpu-runner and remove mlir-cuda-runner. Depends On D98279 Reviewed By: herhut, rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D98203	2021-03-11 10:07:11 +01:00
Matthias Springer	c40e0d7609	[mlir][AVX512] Implement sparse vector dot product integration test. This test operates on two hardware-vector-sized vectors and utilizes vp2intersect and mask.compress. PHAB_REVIEW=D98099	2021-03-11 13:00:17 +09:00
River Riddle	4e02eb8014	[mlir] Optimize the implementation of RegionDCE The current implementation has some inefficiencies that become noticeable when running on large modules. This revision optimizes the code, and updates some out-dated idioms with newer utilities. The main components of this optimization include: * Add an overload of Block::eraseArguments that allows for O(N) erasure of disjoint arguments. * Don't process entry block arguments given that we don't erase them at this point. * Don't track individual operation results, given that we don't erase them. We can just track the parent operation. Differential Revision: https://reviews.llvm.org/D98309	2021-03-10 16:39:50 -08:00
Emilio Cota	c0891706bc	[mlir] Add polynomial approximation for math::Log2 ``` name old cpu/op new cpu/op delta BM_mlir_Log2_f32/10 134ns ±15% 45ns ± 4% -66.39% (p=0.000 n=20+17) BM_mlir_Log2_f32/100 1.03µs ±16% 0.12µs ±10% -88.78% (p=0.000 n=20+18) BM_mlir_Log2_f32/1k 10.3µs ±16% 0.7µs ± 5% -93.24% (p=0.000 n=20+17) BM_mlir_Log2_f32/10k 104µs ±15% 7µs ±14% -93.25% (p=0.000 n=20+20) BM_eigen_s_Log2_f32/10 95.3ns ±17% 90.9ns ± 6% ~ (p=0.228 n=20+18) BM_eigen_s_Log2_f32/100 907ns ± 3% 911ns ± 6% ~ (p=0.539 n=16+20) BM_eigen_s_Log2_f32/1k 9.88µs ± 4% 9.85µs ± 3% ~ (p=0.790 n=16+17) BM_eigen_s_Log2_f32/10k 105µs ±10% 110µs ±16% ~ (p=0.459 n=16+20) BM_eigen_v_Log2_f32/10 32.5ns ±31% 33.9ns ±14% +4.31% (p=0.028 n=17+20) BM_eigen_v_Log2_f32/100 176ns ± 8% 180ns ± 7% +2.19% (p=0.045 n=16+17) BM_eigen_v_Log2_f32/1k 1.44µs ± 4% 1.50µs ± 9% +3.91% (p=0.001 n=16+17) BM_eigen_v_Log2_f32/10k 14.5µs ±10% 15.0µs ± 8% +3.92% (p=0.002 n=16+19) ``` Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D98282	2021-03-10 14:49:22 -08:00
Christian Sigg	6a291ed0f0	[mlir] Remove unnecessary copying of pass options I missed a comment in D98279 that you don't need to copy pass options. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D98366	2021-03-10 21:55:28 +01:00
Weiwei Li	619c1505f9	[mlir][spirv] Define spv.Image Operation co-authered-by: Alan Liu <alanliu.yf@gmail.com> Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D98270	2021-03-10 15:48:04 -05:00
Alex Zinenko	79da91c59a	Revert "[mlir][Vector][Affine] Improve affine vectorizer algorithm" This reverts commit `95db7b4aea`. This breaks vectorize_2d.mlir and vectorize_3d.mlir test under ASAN (use after free).	2021-03-10 20:25:49 +01:00
Alex Zinenko	ed715536f1	Revert "[mlir][Affine][Vector] Add initial support for 'iter_args' to Affine vectorizer." This reverts commit `77a9d1549f`. Parent commit is broken.	2021-03-10 20:25:32 +01:00
Diego Caballero	77a9d1549f	[mlir][Affine][Vector] Add initial support for 'iter_args' to Affine vectorizer. This patch adds support for vectorizing loops with 'iter_args' when those loops are not a vector dimension. This allows vectorizing outer loops with an inner 'iter_args' loop (e.g., reductions). Vectorizing scenarios where 'iter_args' loops are vector dimensions would require more work (e.g., analysis, generating horizontal reduction, etc.) not included in this patch. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97892	2021-03-10 20:40:21 +02:00
Diego Caballero	95db7b4aea	[mlir][Vector][Affine] Improve affine vectorizer algorithm This patch replaces the root-terminal vectorization approach implemented in the Affine vectorizer with a topological order approach that vectorizes all the operations within the target loop nest. These are the most important changes introduced by the new algorithm: * Removed tracking of root and terminal ops. Existing vectorization functionality is preserved and extended so that loop nests without root-terminal chains can be vectorized. * Vectorizing a loop nest now only requires a single topological traversal. * A new vector loop nest is incrementally built along the vectorization process. The original scalar loop is kept intact. No cloning guard is needed to recover the scalar loop if vectorization fails. This approach also simplifies the challenging task of replacing a loop operation amid the vectorization process without invalidating the analysis information that depends on the original loop. * Vectorization of specific operations has been implemented as independent, preparing them to be moved to a potential vectorization interface. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97442	2021-03-10 20:29:58 +02:00
Vladislav Vinogradov	b599f464d4	[mlir][CMAKE] Fix build with BUILD_SHARED_LIBS=ON Link `MLIRStandardToLLVM` to `MLIRAVX512Transforms`, since the latter uses `LLVMTypeConverter` defined in the first one. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98336	2021-03-10 14:52:36 +01:00
Alex Zinenko	e02dd790b1	[mlir] fix typo in OpDefinitions.md	2021-03-10 14:44:08 +01:00
Alex Zinenko	78f3fb4f46	[mlir] Update comments in ArmNeon dialect. NFC These were not updated when squashing LLVMArmNeon and ArmNeon dialects.	2021-03-10 13:35:57 +01:00
Alex Zinenko	a776942ba1	[mlir] squash LLVM_AVX512 dialect into AVX512 The dialect separation was introduced to demarkate ops operating in different type systems. This is no longer the case after the LLVM dialect has migrated to using built-in vector types, so the original reason for separation is no longer valid. Squash the two dialects into one. The code size decrease isn't quite large: the ops originally in LLVM_AVX512 are preserved because they match LLVM IR intrinsics specialized for vector element bitwidth. However, it is still conceptually beneficial to have only one dialect. I originally considered to use Tablegen multiclasses to define both the type-polymorphic op and its two intrinsic-related instantiations, but decided against it given both the complexity of the required Tablegen input and its dissimilarity with the rest of ODS-defined ops, both potentially resulting in very poor maintainability. Depends On D98327 Reviewed By: nicolasvasilache, springerm Differential Revision: https://reviews.llvm.org/D98328	2021-03-10 13:07:26 +01:00
Alex Zinenko	0af53de369	[mlir] simplify type constraints in AVX512 dialect VectorOfLengthAndType accepts a cartesian product of given lengths and types rather than types produced by co-indexed values in the corresponding lists. Update the definitions accordingly. The type validity is already enforced by op traits. Reviewed By: nicolasvasilache, springerm Differential Revision: https://reviews.llvm.org/D98327	2021-03-10 13:07:25 +01:00
Inho Seo	2ce4caf414	Moved getStaticLoopRanges and getStaticShape methods to LinalgInterfaces.td to add static shape verification It is to use the methods in LinalgInterfaces.cpp for additional static shape verification to match the shaped operands and loop on linalgOps. If I used the existing methods, I would face circular dependency linking issue. Now we can use them as methods of LinalgOp. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D98163	2021-03-10 04:06:22 -08:00
Christian Sigg	4d295cf5b5	[mlir] Add base class for GpuKernelToBlobPass Instead of configuring kernel-to-cubin/rocdl lowering through callbacks, introduce a base class that target-specific passes can derive from. Put the base class in GPU/Transforms, according to the discussion in D98203. The mlir-cuda-runner will go away shortly, and the mlir-rocdl-runner as well at some point. I therefore kept the existing code path working and will remove it in a separate step. Depends On D98168 Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D98279	2021-03-10 12:14:43 +01:00
Vladislav Vinogradov	f3bf5c053b	[mlir] Model MemRef memory space as Attribute Based on the following discussion: https://llvm.discourse.group/t/rfc-memref-memory-shape-as-attribute/2229 The goal of the change is to make memory space property to have more expressive representation, rather then "magic" integer values. It will allow to have more clean ASM form: ``` gpu.func @test(%arg0: memref<100xf32, "workgroup">) // instead of gpu.func @test(%arg0: memref<100xf32, 3>) ``` Explanation for `Attribute` choice instead of plain `string`: * `Attribute` classes allow to use more type safe API based on RTTI. * `Attribute` classes provides faster comparison operator based on pointer comparison in contrast to generic string comparison. * `Attribute` allows to store more complex things, like structs or dictionaries. It will allows to have more complex memory space hierarchy. This commit preserve old integer-based API and implements it on top of the new one. Depends on D97476 Reviewed By: rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D96145	2021-03-10 12:57:27 +03:00
Hanhan Wang	d5d4fb635e	[mlir][linalg] Add support for using scalar attributes in TC ops. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97876	2021-03-10 01:51:12 -08:00
Mehdi Amini	75f3f77805	Fix MLIR test post `890afad954`	2021-03-09 23:30:51 +00:00
Mehdi Amini	890afad954	Fix Flang build after MLIR API changes around `generatedTypeParser`	2021-03-09 23:19:30 +00:00
River Riddle	a776ecb6c2	[mlir][IR] Add an Operation::eraseOperands that supports batch erasure This method allows for removing multiple disjoint operands at once, reducing the need to erase operands individually (which results in shifting the operand list). Differential Revision: https://reviews.llvm.org/D98290	2021-03-09 15:07:53 -08:00
River Riddle	4a7aed4ee7	[mlir][IR] Add a new SymbolUserMap class This class provides efficient implementations of symbol queries related to uses, such as collecting the users of a symbol, replacing all uses, etc. This provides similar benefits to use related queries, as SymbolTableCollection did for lookup queries. Differential Revision: https://reviews.llvm.org/D98071	2021-03-09 15:07:52 -08:00
Mehdi Amini	cd9a69289c	Fix LLVM Dialect LoopOptionsAttr round-tripping: the keywords were missing in the output This indicated some missing test coverage, which are now added to the roundtrip test.	2021-03-09 22:00:22 +00:00
Mehdi Amini	fe81e8f3b5	Add default LoopOptionsAttrBuilder constructor and method to check if empty() (NFC) Also move setters out-of-line to make sure the templated helper is actually instantiated.	2021-03-09 21:12:15 +00:00
Christian Sigg	840ff84d33	[mlir] Default for gpu-binary-annotation option. Provide default for gpuBinaryAnnotation so that we don't need to specify it in tests. The annotation likely only needs to be target specific if we want to lower to e.g. both CUDA and ROCDL. Reviewed By: herhut, bondhugula Differential Revision: https://reviews.llvm.org/D98168	2021-03-09 21:01:50 +01:00
Mehdi Amini	79f736c150	Switch generatedTypeParser/generatedAttributeParser to return an OptionalParseResult This allows the caller to distinguish between a parse error or an unmatched keyword. It fixes the redundant error that was emitted by the caller when the generated parser would fail. Differential Revision: https://reviews.llvm.org/D98162	2021-03-09 19:43:45 +00:00
Mehdi Amini	8205c1a90a	Rework LLVM Dialect LoopOptions attribute Instead of storing an array of LoopOpt attributes, which were just wrapping std::pair<enum, int> anyway, we can have an attribute storing a sorted ArrayRef<std::pair<enum, int>> as a single unit. This improves here the textual format and the general API. Note that we're limiting the options to fit into an int64_t by design, but this isn't a new constraint. Building the LoopOptions attribute is likely worth a specific builder for efficient reason, that'll be the subject of a future patch. Differential Revision: https://reviews.llvm.org/D98105	2021-03-09 19:43:45 +00:00
Lei Zhang	50000abe3c	[mlir] Use affine.apply when distributing to processors This makes it easy to compose the distribution computation with other affine computations. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D98171	2021-03-09 08:37:20 -05:00
Alex Zinenko	8184247f0b	[mlir] move LLVM target import header and tests Move Target/LLVMIR.h to target/LLVMIR/Import.h to better reflect the purpose of this file. Also move all LLVM IR target tests under the LLVMIR directory. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D98178	2021-03-09 09:22:14 +01:00
Alex Zinenko	90fec5ed65	[mlir] make MLIRPresburger depend on MLIRIR The analysis library uses Location, which is defined in the MLIRIR library.	2021-03-09 09:19:53 +01:00
Vladislav Vinogradov	2241b3986c	[mlir][CMAKE] Fix cross-compilation build Use `MLIR_LINALG_ODS_GEN` and `MLIR_LINALG_ODS_YAML_GEN` variables instead of `MLIR_LINALG_ODS_GEN_EXE` and `MLIR_LINALG_ODS_YAML_GEN_EXE`. The former are defined in PARENT SCOPE only, so the `if` condition is never evaluates to `TRUE`. The logic should be the following (taken from tblgen part): 1. `TOOL_NAME` - CACHE variable (default equal to target name). User can override it to actual executable path. 2. `TOOL_NAME_EXE` - internal variable, initialized to `${TOOL_NAME}` first. In case of cross-compilation (`LLVM_USE_HOST_TOOLS == TRUE`) if user didn't set own path to native executable via `TOOL_NAME` variable, CMake will create separate targets to build native tool and will override `TOOL_NAME_EXE` to the executable produced by this target. 3. `TOOL_NAME_TARGET` - internal variable, which points to tool target name. If the native tool is built as described above, it will point to the target correspondant to that native tool. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D98025	2021-03-09 10:51:56 +03:00
Tobias Gysi	c1a4cd551f	[mlir][linalg] refactor the result handling during vectorization. Return the vectorization results using a vector passed by reference instead of returning them embedded in a structure. Differential Revision: https://reviews.llvm.org/D98182	2021-03-09 07:11:57 +00:00
Stella Laurenzo	e31c77b182	[mlir][python] Reorganize MLIR python into namespace packages. * Only leaf packages are non-namespace packages. This allows most of the top levels to be split into different directories or deployment packages. In the previous state, the presence of __init__.py files at each level meant that the entire tree could only ever exist in one physical directory on the path. * This changes the API usage slightly: `import mlir` will no longer do a deep import of `mlir.ir`, etc. This may necessitate some client code changes. * Dialect gen code was restructured so that the user is responsible for providing the `my_dialect.py` file, which then must import its peer `_my_dialect_ops_gen`. This gives complete control of the dialect namespace to the user instead of to tablegen code, allowing further dialect-specific python APIs. * Correspondingly, the previous extension modules `_my_dialect.py` are now `_my_dialect_ops_ext.py`. * Now that the `linalg` namespace is open, moved the `linalg_opdsl` tool into it. * This may require some corresponding downstream adjustments to npcomp, circt, et al: * Probably some shallow imports need to be converted to deep imports (i.e. not `import mlir` brings in the world). * Each tablegen generated dialect now needs an explicit `foo.py` which does a `from ._foo_ops_gen import `. This is similar to the way that generated code operates in the C++ world. If providing dialect op extensions, those need to be moved from `_foo.py` -> `_foo_ops_ext.py`. Differential Revision: https://reviews.llvm.org/D98096	2021-03-08 23:01:34 -08:00
Mehdi Amini	038f2a337d	Move LLVM::FMFAttr definition to TableGen (NFC) This is using the new Attribute storage generation support in TableGen to define the LLVM FastMathFlags. Differential Revision: https://reviews.llvm.org/D98007	2021-03-09 05:29:54 +00:00
River Riddle	0d01dfbc37	[mlir][IR][NFC] Move the remaining builtin types to ODS This will allow for removing the duplicated type documentation from LangRef and instead link to the builtin dialect documentation. Differential Revision: https://reviews.llvm.org/D98093	2021-03-08 14:32:40 -08:00
River Riddle	a4bb667d83	[mlir][IR][NFC] Define the Location classes in ODS instead of C++ This also removes the need for LocationDetail.h. Differential Revision: https://reviews.llvm.org/D98092	2021-03-08 14:32:40 -08:00
Rob Suderman	cb3542e1ca	[MLIR][TOSA] Added lowerings for Reduce operations to Linalg Lowerings for min, max, prod, and sum reduction operations on int and float values. This includes reduction tests for both cases. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D97893	2021-03-08 10:57:19 -08:00
Christian Sigg	7cdcb4a3b9	[mlir] NFC: Add #endif comment.	2021-03-08 19:25:24 +01:00
Benjamin Kramer	42c195f0ec	[mlir][Shape] Allow shape.split_at to return extent tensors and lower it to std.subtensor split_at can return an error if the split index is out of bounds. If the user knows that the index can never be out of bounds it's safe to use extent tensors. This has a straight-forward lowering to std.subtensor. Differential Revision: https://reviews.llvm.org/D98177	2021-03-08 16:48:05 +01:00
Frederik Gossen	3b9667a84c	Clarify documentation for `Elementwise`, `Scalarizable`, `Vectorizable`, and `Tensorizable` traits. Differential Revision: https://reviews.llvm.org/D97841	2021-03-08 10:35:22 +01:00
Mehdi Amini	e94e55712c	Forward the `LLVM_ENABLE_LIBCXX` CMake parameter to the mlir standalone test This allows to build and test MLIR with `-DLLVM_ENABLE_LIBCXX=ON`.	2021-03-08 05:07:26 +00:00
KareemErgawy-TomTom	3fb384d50e	[MLIR][SPIRV] Rename `spv.selection` to `spv.mlir.selection`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from spv.camelCase to spv.CamelCase everywhere. For ops that don't have a SPIR-V spec counterpart, we use spv.mlir.snake_case. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D98014	2021-03-06 16:05:31 +01:00
Lei Zhang	bb6f5c8314	[mlir][spirv] Convert tensor.extract for very small tensors Normally tensors will be stored in buffers before converting to SPIR-V, given that is how a large amount of data is sent to the GPU. However, SPIR-V supports converting from tensors directly too. This is for the cases where the tensor just contains a small amount of elements and it makes sense to directly inline them as a small data array in the shader. To handle this, internally the conversion might create new local variables. SPIR-V consumers in GPU drivers may or may not optimize that away. So this has implications over register pressure. Therefore, a threshold is used to control when the patterns should kick in. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D98052	2021-03-06 08:03:36 -05:00
Mehdi Amini	f8fe6d9f3f	Use gen-dialect-doc instead of gen-op-doc for the Builtin dialect This is fixing the missing title and menu entry on the MLIR website.	2021-03-06 05:32:46 +00:00
Matthias Springer	acce0ea70c	[mlir][AVX512] Add mask.compress to AVX512 dialect. Adds mask.compress to the AVX512 dialect and defines a lowering to the LLVM dialect. Differential Revision: https://reviews.llvm.org/D97611	2021-03-06 10:02:48 +09:00
Mehdi Amini	a7cac0d9a5	Fix Dialect doc generation to special case for the Builtin dialect empty name This should fix the issue with an empty entry for the builtin dialect on the website. Differential Revision: https://reviews.llvm.org/D98074	2021-03-05 23:47:50 +00:00
Alex Zinenko	6410ee0d09	[mlir] Squash LLVM_ArmNeon dialect into ArmNeon The two dialects are largely redundant. The former was introduced as a mirror of the latter operating on LLVM dialect types. This is no longer necessary since the LLVM dialect operates on built-in types. Combine the two dialects. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D98060	2021-03-05 23:33:32 +01:00
Aart Bik	e5c8fc776f	[mlir][vector] canonicalize unmasked gather/scatter/compress/expand directly into l/s With the new vector.load/store operations, there is no need to go through unmasked transfer operations (which will canonicalized to l/s anyway). Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D98056	2021-03-05 14:23:50 -08:00
Diego Caballero	2de6dbda66	[mlir] Add 'Skip' result to Operation visitor This patch is a follow-up on D97217. It adds a new 'Skip' result to the Operation visitor so that a callback can stop the ongoing visit of an operation/block/region and continue visiting the next one without fully interrupting the walk. Skipping is needed to be able to erase an operation/block in pre-order and do not continue visiting the internals of that operation/block. Related to the skipping mechanism, the patch also introduces the following changes: * Added new TestIRVisitors pass with basic testing for the IR visitors. * Fixed missing early increment ranges in visitor implementation. * Updated documentation of walk methods to include erasure information and walk order information. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97820	2021-03-06 00:02:20 +02:00
Diego Caballero	71a86245ca	[mlir] Extend Operation visitor with pre-order traversal This patch extends the Region, Block and Operation visitors to also support pre-order walks. We introduce a new template argument that dictates the walk order (only pre-order and post-order are supported for now). The default order for Regions, Blocks and Operations is post-order. Mixed orders (e.g., Region/Block pre-order + Operation post-order) could easily be implemented, as shown in NumberOfExecutions.cpp. Reviewed By: rriddle, frgossen, bondhugula Differential Revision: https://reviews.llvm.org/D97217	2021-03-06 00:02:20 +02:00
Diego Caballero	b635492c3f	[mlir][Affine][NFC] Return BlockArgument in AffineForOp::getInductionVar This avoids unnecessary casts when a BlockArgument is required. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D97879	2021-03-06 00:02:19 +02:00
KareemErgawy-TomTom	d48ceb45e3	[MLIR][SPIRV] Rename `spv.undef` to `spv.Undef`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from spv.camelCase to spv.CamelCase everywhere. For ops that don't have a SPIR-V spec counterpart, we use spv.mlir.snake_case. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D98016	2021-03-05 15:49:44 -05:00
River Riddle	f175ba4a54	[mlir][AsmPrinter] Don't use string comparison when filtering list attributes In .mlir modules with larges amounts of attributes, e.g. a function with a larger number of argument attributes, the string comparison filtering greatly affects compile time. This revision switches to using a SmallDenseSet in these situations, resulting in over a 10x speed up in some situations. Differential Revision: https://reviews.llvm.org/D97980	2021-03-05 12:47:05 -08:00
KareemErgawy-TomTom	29812a6195	[MLIR][SPIRV] Rename `spv.loop` to `spv.mlir.loop`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from spv.camelCase to spv.CamelCase everywhere. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97918	2021-03-05 15:44:30 -05:00
Stella Laurenzo	0b5f1b859f	[mlir][linalg] Add linalg_opdsl tool first draft. * Mostly imported from experimental repo as-is with cosmetic changes. * Temporarily left out emission code (for building ops at runtime) to keep review size down. * Documentation and lit tests added fresh. * Sample op library that represents current Linalg named ops included. Differential Revision: https://reviews.llvm.org/D97995	2021-03-05 11:45:09 -08:00
Stella Laurenzo	a9ccdfbc7d	NFC: Glob all python sources in the MLIR Python bindings. * Also switches to use symlinks vs copy as that enables edit-and-continue python development. * Broken out of https://reviews.llvm.org/D97995 per request from reviewer. Differential Revision: https://reviews.llvm.org/D98005	2021-03-05 10:21:02 -08:00
Aart Bik	adc35b689f	[mlir][sparse] mask reduction update Reduction updates should be masked, just like the load and stores. Note that alternatively, we could use the fact that masked values are zero of += updates and mask invariants to get this working but that would not work for *= updates. Masking the update itself is cleanest. This change also replaces the constant mask with a broadcast of "true" since this constant folds much better for various folding patterns. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D98000	2021-03-05 08:56:10 -08:00
Nicolas Vasilache	c86d3c1a38	[mlir][Linalg] Fix order of dimensions in hoistPaddingOnTensors.	2021-03-05 15:11:35 +00:00
Christian Sigg	5fedf30748	[mlir] Make cuInit() call thread-safe. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D98024	2021-03-05 16:06:15 +01:00
Nicolas Vasilache	35908406dc	[mlir][scf] Canonicalize scf.for last tensor iteration result. Canonicalize the iter_args of an scf::ForOp that involve a tensor_load and for which only the last loop iteration is actually visible outside of the loop. The canonicalization looks for a pattern such as: ``` %t0 = ... : tensor_type %0 = scf.for ... iter_args(%bb0 : %t0) -> (tensor_type) { ... // %m is either tensor_to_memref(%bb00) or defined above the loop %m... : memref_type ... // uses of %m with potential inplace updates %new_tensor = tensor_load %m : memref_type ... scf.yield %new_tensor : tensor_type } ``` `%bb0` may have either 0 or 1 use. If it has 1 use it must be exactly a `%m = tensor_to_memref %bb0` op that feeds into the yielded `tensor_load` op. If no aliasing write of `%new_tensor` occurs between tensor_load and yield then the value %0 visible outside of the loop is the last `tensor_load` produced in the loop. For now, we approximate the absence of aliasing by only supporting the case when the tensor_load is the operation immediately preceding the yield. The canonicalization rewrites the pattern as: ``` // %m is either a tensor_to_memref or defined above %m... : memref_type scf.for ... { // no iter_args ... // uses of %m with potential inplace updates } %0 = tensor_load %m : memref_type ``` Differential revision: https://reviews.llvm.org/D97953	2021-03-05 09:42:19 +00:00
KareemErgawy-TomTom	c74eb466d2	[MLIR][SPIRV] Rename `spv.globalVariable` to `spv.GlobalVariable`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from spv.camelCase to spv.CamelCase everywhere. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97919	2021-03-04 16:24:59 -05:00
KareemErgawy-TomTom	5abdca47b3	[MLIR][SPIRV] Rename `spv.constant` to `spv.Constant`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from `spv.camelCase` to `spv.CamelCase` everywhere. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97917	2021-03-04 16:15:56 -05:00

... 2 3 4 5 6 ...

7268 Commits