llvm-project

Commit Graph

Author	SHA1	Message	Date
KareemErgawy-TomTom	4d90e460bc	[MLIR][SPIRV] Rename `spv.spcConstant...` to `spv.SpcConstant...`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from spv.camelCase to spv.CamelCase everywhere. Differential Revision: https://reviews.llvm.org/D97920	2021-03-04 16:07:41 -05:00
River Riddle	2f37cdd569	[mlir][IR][NFC] Move a majority of the builtin attributes to ODS Now that attributes can be generated using ODS, we can move the builtin attributes as well. This revision removes a majority of the builtin attributes with a few left for followup revisions. The attributes moved to ODS in this revision are: AffineMapAttr, ArrayAttr, DictionaryAttr, IntegerSetAttr, StringAttr, SymbolRefAttr, TypeAttr, and UnitAttr. Differential Revision: https://reviews.llvm.org/D97591	2021-03-04 13:04:06 -08:00
River Riddle	1447ec5182	[mlir][AttrDefGen] Add support for specifying the value type of an attribute The value type of the attribute can be specified by either overriding the typeBuilder field on the AttrDef, or by providing a parameter of type `AttributeSelfTypeParameter`. This removes the need to define custom storage class constructors for attributes that have a value type other than NoneType. Differential Revision: https://reviews.llvm.org/D97590	2021-03-04 13:04:05 -08:00
River Riddle	6bc767cd07	[mlir] Add a DialectAsmParser::getChecked method This function simplifies calling the getChecked methods on Attributes and Types from within the parser, and removes any need to use `getEncodedSourceLocation` for these methods (by using an SMLoc instead). This is much more efficient than using an mlir::Location, as the encoding process to produce an mlir::Location is inefficient and undesirable for parsing (locations used during parsing should not persist afterwards unless otherwise necessary). Differential Revision: https://reviews.llvm.org/D97900	2021-03-04 11:53:24 -08:00
Ahmed Taei	da1e37a8b0	Fold full-size subview of static shapes. Differential Revision: https://reviews.llvm.org/D97429	2021-03-04 09:52:06 -08:00
Nicolas Vasilache	f21d78633a	[mlir] Tighten the rules around folding TensorLoadOp `tensor_load(tensor_to_memref(x)) -> x` is an incorrect folding because it ignores potential aliasing. This revision approximates no-aliasing by restricting the folding to occur only when tensor_to_memref is immediately preceded by tensor_load in the same block. This is a conservative step back towards correctness until better alias analysis becomes available. Context: https://llvm.discourse.group/t/properly-using-bufferization-related-passes/2913/6 Differential Revision: https://reviews.llvm.org/D97957	2021-03-04 17:48:09 +00:00
Arpith C. Jacob	4e393350c5	[mlir] Add an AccessGroup attribute to load/store LLVM dialect ops and generate the access_group LLVM metadata. This also includes LLVM dialect ops created from intrinsics. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D97944	2021-03-04 18:17:23 +01:00
Hanhan Wang	b47c6c686c	[mlir][linalg] Add suffix "Op" to pooling TC ops. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97946	2021-03-04 07:08:30 -08:00
Nicolas Vasilache	f3cc854364	[mlir][Vector] Add folding of vector transfers from/into tensor producing ops. Add a folder to rewrite a sequence such as: ``` %t1 = ... %v = vector.transfer_read %t0[%c0...], {masked = [false...]} : tensor<static_sizesxf32>, vector<static_sizesxf32> %t2 = vector.transfer_write %v, %t1[%c0...] {masked = [false...]} : vector<static_sizesxf32>, tensor<static_sizesxf32> ``` into: ``` %t0 ``` The producer of t1 may or may not be DCE'd depending on whether it is a block argument or has side effects. Differential revision: https://reviews.llvm.org/D97934	2021-03-04 14:17:42 +00:00
Nicolas Vasilache	a756f12b4d	[mlir][Linalg] Add folding of linalg.copy that are in fact identities. Differential Revision: https://reviews.llvm.org/D97939	2021-03-04 13:37:26 +00:00
Nicolas Vasilache	4f4f3f1e59	[mlir] NFC - Add runner util functions to only print MemRef metadata. These are useful to debug execution, without having to print the whole content of a memref.	2021-03-04 12:35:45 +00:00
Nicolas Vasilache	05882157db	[mlir][Linalg] NFC - Add isOutputTensor to LinalgInterfaces.td	2021-03-04 12:33:21 +00:00
Christian Sigg	f69d5a7fc7	[mlir] Initialize CUDA context lazily. So we can remove the ignore-warning pragma again. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D97864	2021-03-04 13:07:56 +01:00
Alex Zinenko	32c49c7d73	[mlir] ODS: change OpBuilderDAG to OpBuilder We no longer have the non-DAG version. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97856	2021-03-04 10:55:02 +01:00
Alex Zinenko	19db802e7b	[mlir] make implementations of translation to LLVM IR interfaces private There is no need for the interface implementations to be exposed, opaque registration functions are sufficient for all users, similarly to passes. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97852	2021-03-04 09:16:32 +01:00
Arpith C. Jacob	4a2930f495	[mlir] Add loop codegen options to some LLVM dialect ops. Add a Loop Option attribute and generate llvm metadata attached to branch instructions to control code generation. Reviewed By: ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D96820	2021-03-04 09:01:57 +01:00
Aart Bik	553cb6d473	[mlir][sparse] fix bug in reduction chain Found with exhaustive testing, it is possible that a while loop appears in between chainable for loops. As long as we don't scalarize reductions in while loops, this means we need to terminate the chain at the while. This also refactors the reduction code into more readable helper methods. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D97886	2021-03-03 17:38:22 -08:00
River Riddle	83ef862fad	[mlir] Add support for generating Attribute classes for ODS The support for attributes closely maps that of Types (basically 1-1) given that Attributes are defined in exactly the same way as Types. All of the current ODS TypeDef classes get an Attr equivalent. The generation of the attribute classes themselves share the same generator as types. Differential Revision: https://reviews.llvm.org/D97589	2021-03-03 16:41:49 -08:00
River Riddle	e07c968a6d	[mlir][pdl][NFC] Rename InputOp to OperandOp This better matches the actual IR concept that is being modeled, and is consistent with how the rest of PDL is structured. Differential Revision: https://reviews.llvm.org/D95718	2021-03-03 15:48:00 -08:00
River Riddle	55f878bad9	[mlir][pdl] Add a new !pdl.range<> type This type represents a range of positional values. It will be used in followup revisions to add support for variadic constructs to PDL, such as operand and result ranges. Differential Revision: https://reviews.llvm.org/D95717	2021-03-03 15:48:00 -08:00
River Riddle	3dfa86149e	[mlir][IR] Refactor the internal implementation of Value The current implementation of Value involves a pointer int pair with several different kinds of owners, i.e. BlockArgumentImpl, Operation , TrailingOpResult. This design arose from the desire to save memory overhead for operations that have a very small number of results (generally 0-2). There are, unfortunately, many problematic aspects of the current implementation that make Values difficult to work with or just inefficient. Operation result types are stored as a separate array on the Operation. This is very inefficient for many reasons: we use TupleType for multiple results, which can lead to huge amounts of memory usage if multi-result operations change types frequently(they do). It also means that simple methods like Value::getType/Value::setType now require complex logic to get to the desired type. Value only has one pointer bit free, severely limiting the ability to use it in things like PointerUnion/PointerIntPair. Given that we store the kind of a Value along with the "owner" pointer, we only leave one bit free for users of Value. This creates situations where we end up nesting PointerUnions to be able to use Value in one. As noted above, most of the methods in Value need to branch on at least 3 different cases which is both inefficient, possibly error prone, and verbose. The current storage of results also creates problems for utilities like ValueRange/TypeRange, which want to efficiently store base pointers to ranges (of which Operation isn't really useful as one). This revision greatly simplifies the implementation of Value by the introduction of a new ValueImpl class. This class contains all of the state shared between all of the various derived value classes; i.e. the use list, the type, and the kind. This shared implementation class provides several large benefits: * Most of the methods on value are now branchless, and often one-liners. * The "kind" of the value is now stored in ValueImpl instead of Value This frees up all of Value's pointer bits, allowing for users to take full advantage of PointerUnion/PointerIntPair/etc. It also allows for storing more operation results as "inline", 6 now instead of 2, freeing up 1 word per new inline result. * Operation result types are now stored in the result, instead of a side array This drops the size of zero-result operations by 1 word. It also removes the memory crushing use of TupleType for operations results (which could lead up to hundreds of megabytes of "dead" TupleTypes in the context). This also allowed restructured ValueRange, making it simpler and one word smaller. This revision does come with two conceptual downsides: * Operation::getResultTypes no longer returns an ArrayRef<Type> This conceptually makes some usages slower, as the iterator increment is slightly more complex. * OpResult::getOwner is slightly more expensive, as it now requires a little bit of arithmetic From profiling, neither of the conceptual downsides have resulted in any perceivable hit to performance. Given the advantages of the new design, most compiles are slightly faster. Differential Revision: https://reviews.llvm.org/D97804	2021-03-03 14:33:37 -08:00
David Blaikie	4fda0dc14b	Fix use of deprecated API	2021-03-03 14:07:28 -08:00
MaheshRavishankar	c118fdcd59	[mlir] Remove incorrect folding for SubTensorInsertOp The SubTensorInsertOp has a requirement that dest type and result type match. Just folding the tensor.cast operation violates this and creates verification errors during canonicalization. Also fix other canonicalization methods that werent inserting casts properly. Differential Revision: https://reviews.llvm.org/D97800	2021-03-03 13:58:05 -08:00
Hanhan Wang	83c56aa4ee	[mlir][linalg] Add depthwise_conv_2d_input_nhwc_filter_hwcf to Linalg TC ops. Different from the definition in Tensorflow and TOSA, the output is [N,H,W,C,M]. This can make transforms easier in LinAlg because the indexing maps are plain. E.g., to determine if the fill op has dependency between the depthwise conv op, the current pipeline only recognizes the dep if they are all projected affine map. Reviewed By: asaadaldien Differential Revision: https://reviews.llvm.org/D97798	2021-03-03 11:47:02 -08:00
Mehdi Amini	13cb431719	Add basic JIT Python Bindings This offers the ability to create a JIT and invoke a function by passing ctypes pointers to the argument and the result. Differential Revision: https://reviews.llvm.org/D97523	2021-03-03 18:19:40 +00:00
Mehdi Amini	86c8a7857d	Add C bindings for mlir::ExecutionEngine This adds minimalistic bindings for the execution engine, allowing to invoke the JIT from the C API. This is still quite early and experimental and shouldn't be considered stable in any way. Differential Revision: https://reviews.llvm.org/D96651	2021-03-03 18:19:40 +00:00
Hanhan Wang	497b7b8c00	[mlir][linalg] Delete unused vars if there are shaped-only operands. Reviewed By: stella.stamenova Differential Revision: https://reviews.llvm.org/D97851	2021-03-03 09:36:08 -08:00
MaheshRavishankar	5d7e0a23c6	[mlir] Add LinalgInterface method to clone with a given BlockAndValueMapping. Since Linalg operations have regions by default which are not isolated from above, add an another method to the interface that will take a BlockAndValueMapping to remap the values within the region as well. Differential Revision: https://reviews.llvm.org/D97709	2021-03-03 09:25:20 -08:00
Benjamin Kramer	73cb58dc48	[mlir][Shape] Lower cstr_eq to shape_eq + assert Differential Revision: https://reviews.llvm.org/D97860	2021-03-03 17:22:28 +01:00
Benjamin Kramer	24acadef8a	[mlir][Shape] Make shape_eq nary This gets rid of a dubious shape_eq %a, %a fold, that folds shape_eq even if %a is not an Attribute. Differential Revision: https://reviews.llvm.org/D97728	2021-03-03 16:26:40 +01:00
Benjamin Kramer	c714b441ef	[mlir][Shape] Make cstr_eq more like cstr_broadcastable This includes allowing extents and not just shapes. Differential Revision: https://reviews.llvm.org/D97716	2021-03-03 16:20:05 +01:00
Vladislav Vinogradov	5d613e42d3	[mlir][ODS] Use StringLiteral instead of StringRef when applicable Use `StringLiteral` for function return type if it is known to return constant string literals only. This will make it visible to API users, that such values can be safely stored, since they refers to constant data, which will never be deallocated. `StringRef` is general is not safe to store for a long term, since it might refer to temporal data allocated in heap. Add `inline` and `constexpr` methods support to `OpMethod`. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97390	2021-03-03 16:15:12 +03:00
Vladislav Vinogradov	e3b350ce83	[mlir][NFC] Use `BaseMemRefType::isValidElementType` in Parser Instead of hard-coded check. Reviewed By: ftynse, rriddle Differential Revision: https://reviews.llvm.org/D97612	2021-03-03 16:00:54 +03:00
Christian Sigg	b6ac26fce5	[mlir] Silence -Wglobal-constructors error in CudaRuntimeWrapper.cpp Until I have a better solution with dynamic initialization, to get the nvidia build bot green again.	2021-03-03 13:48:03 +01:00
Christian Sigg	9d7be77bf9	[mlir] Move cuda tests Move test inputs to test/Integration directory. Move runtime wrappers to ExecutionEngine. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97463	2021-03-03 13:16:51 +01:00
Aart Bik	5b333d3449	[mlir][sparse] do not ignore ordering for "dense" tensor linked with sparse type Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D97795	2021-03-02 15:21:51 -08:00
Hanhan Wang	c0f8115c73	[mlir][linalg] Only generate one var for an attrUse. Some variables are unused after D97383 landed. We should generate one symbol for one attrUse. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D97794	2021-03-02 12:48:20 -08:00
Frederik Gossen	bcc9b371e4	Split `ElementwiseMappable` trait into four more precise traits. Some elementwise operations are not scalarizable, vectorizable, or tensorizable. Split `ElementwiseMappable` trait into the following, more precise traits. - `Elementwise` - `Scalarizable` - `Vectorizable` - `Tensorizable` This allows for reuse of `Elementwise` in dialects like HLO. Differential Revision: https://reviews.llvm.org/D97674	2021-03-02 15:31:19 +01:00
Christian Sigg	caa5144d56	[mlir] Use mlir::OpState::operator->() to get to Operation::getAttrs(). This is a preparation step to remove getAttrs() from OpState.	2021-03-02 13:29:27 +01:00
KareemErgawy-TomTom	3b021fbdc0	[MLIR][LinAlg] Detensorize interal function control flow. This patch continues detensorizing implementation by detensoring internal control flow in functions. In order to detensorize functions, all the non-entry block's arguments are detensored and branches between such blocks are properly updated to reflect the detensored types as well. Function entry block (signature) is left intact. This continues work towards handling github/google/iree#1159. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D97148	2021-03-02 11:46:20 +01:00
Vladislav Vinogradov	37eca08e5b	[mlir][NFC] Rename `MemRefType::getMemorySpace` to `getMemorySpaceAsInt` Just a pure method renaming. It is a preparation step for replacing "memory space as raw integer" with more generic "memory space as attribute", which will be done in separate commit. The `MemRefType::getMemorySpace` method will return `Attribute` and become the main API, while `getMemorySpaceAsInt` will be declared as deprecated and will be replaced in all in-tree dialects (also in separate commits). Reviewed By: mehdi_amini, rriddle Differential Revision: https://reviews.llvm.org/D97476	2021-03-02 11:08:54 +03:00
Stella Laurenzo	6d2fd3d9cd	[mlir][linalg] Replace monomorphic contration ops with polymorphic variants. * Moves `batch_matmul`, `matmul`, `matvec`, `vectmat`, `dot` to the new mechanism. * This is not just an NFC change, in addition to using a new code generation mechanism, it also activates symbolic casting, allowing mixed precision operands and results. * These definitions were generated from DSL by the tool: https://github.com/stellaraccident/mlir-linalgpy/blob/main/mlir_linalg/oplib/core.py (will be upstreamed in a subsequent set of changes). Reviewed By: nicolasvasilache, ThomasRaoux Differential Revision: https://reviews.llvm.org/D97719	2021-03-01 21:19:53 -08:00
Stella Laurenzo	d36a15de1f	[mlir][linalg] Memoize indexing map generation. Differential Revision: https://reviews.llvm.org/D97602	2021-03-01 21:15:40 -08:00
MaheshRavishankar	a9e68db973	[mlir] Add canonicaliations for subtensor_insert operation. Add canonicalizers to subtensor_insert operations need canonicalizers that propagate the constant arguments within offsets, sizes and strides. Also add pattern to propogate tensor_cast operations. Differential Revision: https://reviews.llvm.org/D97704	2021-03-01 14:59:18 -08:00
Stella Stamenova	801067f4c0	[mlir][lldb] Fix several gcc warnings in mlir and lldb These warnings are raised when compiling with gcc due to either having too few or too many commas, or in the case of lldb, the possibility of a nullptr. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97586	2021-03-01 13:48:22 -08:00
Rob Suderman	087bc20fe4	[MLIR][TOSA] Lower tosa.transpose to linalg.generic Lowers the transpose operation to a generic linalg op when permutations is a constant value. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D97508	2021-03-01 11:09:49 -08:00
Ahmed Taei	ea7f211b2e	[mlir] Add polynomial approximation for math::ExpOp Similar to fast_exp in https://github.com/boulos/syrah Differential Revision: https://reviews.llvm.org/D97599	2021-03-01 11:02:07 -08:00
Jacques Pienaar	87e05eb03b	Revert "Remove use of tuple for multiresult type storage" This reverts commit `08f0764ff5`.	2021-03-01 10:39:41 -08:00
Jacques Pienaar	08f0764ff5	Remove use of tuple for multiresult type storage Move the results in line with the op instead. This results in each operation having its own types recorded vs single tuple type, but comes at benefit that every mutation doesn't incurs uniquing. Ran into cases where updating result type of operation led to very large memory usage. Differential Revision: https://reviews.llvm.org/D97652	2021-03-01 09:30:24 -08:00
Jacques Pienaar	2f0b4db5ea	[mlir] Add convenience grouping for tensor type inference For ops that produces tensor types and implement the shaped type component interface, the type inference interface can be used. Create a grouping of these together to make it easier to specify (it cannot be added into a list of traits, but must rather be appended/concated to one as it isn't a trait but a list of traits). Differential Revision: https://reviews.llvm.org/D97636	2021-03-01 05:21:08 -08:00
Aart Bik	6afaea6682	[mlir][sparse] fixed inaccury in maintaining universal index The universal index was maintained if dense indices were still in place, and lattice points followed. However, it should only be kept if any of those following lattice points actually consumes the universal index. This change also fixes an inaccuracy with a missing broadcast around vector invariant. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D97594	2021-02-27 17:32:57 -08:00
Stella Laurenzo	769cd20bdb	NFC: Adds labels to a linalg test and extend a missing case.	2021-02-27 16:00:18 -08:00
Stella Laurenzo	2ceedc3a20	[mlir][linalg] Add symbolic type conversion to linalg named ops. This enables this kind of construct in the DSL to generate a named op that is polymorphic over numeric type variables `T` and `U`, generating the correct arithmetic casts at construction time: ``` @tc_def_op def polymorphic_matmul(A=TensorDef(T1, S.M, S.K), B=TensorDef(T2, S.K, S.N), C=TensorDef(U, S.M, S.N, output=True)): implements(ContractionOpInterface) C[D.m, D.n] += cast(U, A[D.m, D.k]) * cast(U, B[D.k, D.n]) ``` Presently, this only supports type variables that are bound to the element type of one of the arguments, although a further extension that allows binding a type variable to an attribute would allow some more expressiveness and may be useful for some formulations. This is left to a future patch. In addition, this patch does not yet materialize the verifier support which ensures that types are bound correctly (for such simple examples, failing to do so will yield IR that fails verification, it just won't yet fail with a precise error). Note that the full grid of extensions/truncation/int<->float conversions are supported, but many of them are lossy and higher level code needs to be mindful of numerics (it is not the job of this level). As-is, this should be sufficient for most integer matmul scenarios we work with in typical quantization schemes. Differential Revision: https://reviews.llvm.org/D97603	2021-02-27 15:52:35 -08:00
Stella Laurenzo	5867c18e2c	[mlir][linalg] Generate additional interfaces for named ops. * Adds ContractionOpInterface to polymorphic_matmul. Differential Revision: https://reviews.llvm.org/D97601	2021-02-27 15:43:41 -08:00
Mehdi Amini	014575932f	Fix Block::eraseArguments: keep track the first removed element while removing Not only this is likely more efficient than BitVector::find_first(), but also if the BitVector is empty find_first() returns -1, which llvm::drop_begin isn't robust against.	2021-02-27 19:18:09 +00:00
Mehdi Amini	7b06786de2	Fix Block::eraseArguments to properly update the cached positions This is fixing correctness and ASAN failure post-ee90bb3486948.	2021-02-27 19:04:12 +00:00
Mehdi Amini	a6d76cf610	Change the error message when `-split-input-file` is used with mlir-opt to make it recognizable by IDEs By adding the line number of the split point immediately after the file name (separated by `:`) this is recognized by various tool as a proper location. Ideally we would want to point to the line of the error, but that would require some very invasive changes I suspect. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D93363	2021-02-27 18:32:34 +00:00
Jacques Pienaar	dd2f50a4d0	[mlir] Improve test coverage for print-op-graph	2021-02-27 10:18:38 -08:00
Mehdi Amini	ee90bb3486	Store (cache) the Argument number (index in the argument list) inside the BlockArgumentImpl This avoids linear search in BlockArgument::getArgNumber(). Differential Revision: https://reviews.llvm.org/D97596	2021-02-27 17:21:08 +00:00
River Riddle	e6260ad043	[mlir] Simplify various pieces of code now that Identifier has access to the Context/Dialect This also exposed a bug in Dialect loading where it was not correctly identifying identifiers that had the dialect namespace as a prefix. Differential Revision: https://reviews.llvm.org/D97431	2021-02-26 18:00:05 -08:00
Rob Suderman	16abacaea9	[MLIR][TOSA] Resubmit Tosa to Standard/SCF Lowerings (const, if, while)" Includes a lowering for tosa.const, tosa.if, and tosa.while to Standard/SCF dialects. TosaToStandard is used for constant lowerings and TosaToSCF handles the if/while ops. Resubmission of https://reviews.llvm.org/D97518 with ASAN fixes. Differential Revision: https://reviews.llvm.org/D97529	2021-02-26 17:44:12 -08:00
Jacques Pienaar	91ab48ea6b	[mlir] Add regions to OpAdaptor Allows querying regions too via OpAdaptor's generated. This does not yet move region verification to adaptor nor require regions for ops where needed. Differential Revision: https://reviews.llvm.org/D97519	2021-02-26 16:25:00 -08:00
Rob Suderman	f685c9ac86	[MLIR][TOSA] Lower tosa.identity and tosa.identitiyn to linalg Both identity ops can be loweried by replacing their results with their inputs. We keep this as a linalg lowering as other backends may choose to create copies. Differential Revision: https://reviews.llvm.org/D97517	2021-02-26 15:45:07 -08:00
Aart Bik	df5ccf5a94	[mlir][vector] add higher dimensional support to gather/scatter Similar to mask-load/store and compress/expand, the gather and scatter operation now allow for higher dimension uses. Note that to support the mixed-type index, the new syntax is: vector.gather %base [%i,%j] [%kvector] .... The first client of this generalization is the sparse compiler, which needs to define scatter and gathers on dense operands of higher dimensions too. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D97422	2021-02-26 14:20:19 -08:00
Rob Suderman	caccddc52a	[MLIR][TOSA] Lower tosa.reshape to linalg.reshape Lowering from the tosa.reshape op to linalg.reshape. For same-rank or non-collapsed/expanded cases two linalg.reshapes are inserted. Differential Revision: https://reviews.llvm.org/D97439	2021-02-26 12:57:57 -08:00
Geoffrey Martin-Noble	21bb63893e	[MLIR][linalg] Make integer matmul ops cast before multiplying Right now they multiply before casting which means they would frequently overflow. There are various reasonable ways to do this, but until we have robust op description infra, this is a simple and safe default. More careful treatments are likely to be hardware specific, as well (e.g. using an i8*i8->i16 mul instruction). Reviewed By: nicolasvasilache, mravishankar Differential Revision: https://reviews.llvm.org/D97505	2021-02-26 08:36:31 -08:00
Hanhan Wang	bfd3771c9e	[mlir][linalg] Add pooling ops to Linalg TC ops. - Add EDSC builders for std_cmpf_ogt and std_cmpf_olt. - Add pooling_nhwc_min/max/sum ops Depends On D97384 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97385	2021-02-26 07:18:03 -08:00
Hanhan Wang	855a119604	[mlir][linalg] Allow TC ops taking an unused shaped operand. If one operand is not used in the formula, it will be considered a shaped operand. And the result of indexing map of the operand will be the first reduction dims. Depends On D97383 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97384	2021-02-26 06:45:56 -08:00
Benjamin Kramer	4941fef9c4	[mlir] Silence some deprecation warnings after `dffc487b07`	2021-02-26 15:15:56 +01:00
Marius Brehler	56774bdda5	[mlir] Replace deprecated 'getAttrs' 'getAttrs' has been explicitly marked deprecated. This patch refactors to use Operation::getAttrs(). Reviewed By: csigg Differential Revision: https://reviews.llvm.org/D97546	2021-02-26 14:52:40 +01:00
Christian Sigg	dffc487b07	[mlir] Mark OpState::removeAttr() deprecated. Fix call sites. The method will be removed 2 weeks later. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97530	2021-02-26 12:04:41 +01:00
Vinayaka Bandishti	ce0f10a1d1	[MLIR][affine] Certain Call Ops to prevent fusion Fixes a bug in affine fusion pipeline where an incorrect fusion is performed despite a Call Op that potentially modifies memrefs under consideration exists between source and target. Fixes part of https://bugs.llvm.org/show_bug.cgi?id=49220 Reviewed By: bondhugula, dcaballe Differential Revision: https://reviews.llvm.org/D97252	2021-02-26 15:27:41 +05:30
Christian Sigg	0b05908feb	[mlir] Remove some rarely used OpState members and use Operation members instead. Skipping the deprecation dance here. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97494	2021-02-26 08:37:11 +01:00
Rob Suderman	c47aa3c8de	Revert [MLIR][TOSA] Added Tosa to Standard/SCF Lowerings (const, if, while) This reverts commit `a813e9be5b`. Results in an ASAN failure due to bypassing rewriter. Differential Revision: https://reviews.llvm.org/D97518	2021-02-25 18:05:16 -08:00
James Y Knight	24539f1ef2	Add Alignment argument to IRBuilder CreateAtomicRMW and CreateAtomicCmpXchg. And then push those change throughout LLVM. Keep the old signature in Clang's CGBuilder for now -- that will be updated in a follow-on patch (D97224). The MLIR LLVM-IR dialect is not updated to support the new alignment attribute, but preserves its existing behavior. Differential Revision: https://reviews.llvm.org/D97223	2021-02-25 18:29:42 -05:00
Rob Suderman	a813e9be5b	[MLIR][TOSA] Added Tosa to Standard/SCF Lowerings (const, if, while) Includes a lowering for tosa.const, tosa.if, and tosa.while to Standard/SCF dialects. TosaToStandard is used for constant lowerings and TosaToSCF handles the if/while ops. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D97352	2021-02-25 14:35:21 -08:00
Christian Sigg	8c074cb0b7	[mlir] Mark OpState::getAttrs() deprecated. Fix call sites. The method will be removed 2 weeks later. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97464	2021-02-25 20:54:42 +01:00
Diego Caballero	ebca222b65	[mlir] Check 'iter_args' in 'isLoopParallel' utility Fix 'isLoopParallel' utility so that 'iter_args' is taken into account and loops with loop-carried dependences are not classified as parallel. Reviewed By: tungld, vinayaka-polymage Differential Revision: https://reviews.llvm.org/D97347	2021-02-25 18:12:34 +02:00
Tung D. Le	203d5eeec5	[MLIR][affine-loop-fusion] Handle defining ops between the source and dest loops This patch handles defining ops between the source and dest loop nests, and prevents loop nests with `iter_args` from being fused. If there is any SSA value in the dest loop nest whose defining op has dependence from the source loop nest, we cannot fuse the loop nests. If there is a `affine.for` with `iter_args`, prevent it from being fused. Reviewed By: dcaballe, bondhugula Differential Revision: https://reviews.llvm.org/D97030	2021-02-25 18:12:34 +02:00
Vladislav Vinogradov	f4d78a5e3a	[mlir][NFC] Add missing namespace qualifier to ODS generated code Use `::mlir::Region` inside array ref for `VariadicRegion`. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97376	2021-02-25 17:48:32 +03:00
Christian Sigg	f03826f896	Pass GPU events instead of streams across async regions. Lower !gpu.async.tokens returned from async.execute regions to events instead of streams. Make !gpu.async.token returned from !async.execute single-use. This allows creating one event per use and destroying them without leaking or ref-counting. Technically we only need this for stream/event-based lowering. I kept the code separate from the rest of the gpu-async-region pass so that we can make this optional or move to a separate pass as needed. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D96965	2021-02-25 13:18:18 +01:00
Marius Brehler	2d870a2f55	[mlir][nfc] Fix typo in documentation comment	2021-02-25 08:32:14 +01:00
Marius Brehler	699041123e	[mlir] Fix emitting attribute documentation This fixes the documentation emitted for type parameters. Also adds a missing empty line, rendered as line break in mark down. Co-authored-by: Simon Camphausen <simon.camphausen@iml.fraunhofer.de> Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97267	2021-02-25 08:23:50 +01:00
Jing Pu	c519460745	Allow !shape.size type operands in "shape.from_extents" op. This expands the op to support error propagation and also makes it symmetric with "shape.get_extent" op. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D97261	2021-02-24 14:50:07 -08:00
River Riddle	e79cd47e16	[mlir][docs] Small fix to local Pass Manager reproduction documentation	2021-02-24 11:47:34 -08:00
Hanhan Wang	21895a2bef	[mlir][linalg] Reuse the symbol if attribute uses are identical. Depends On D97312 Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97383	2021-02-24 11:42:13 -08:00
Hanhan Wang	705068cb8c	[mlir][linalg] Support for using output values in TC definitions. This will allow us to define select(pred, in, out) for TC ops, which is useful for pooling ops. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97312	2021-02-24 11:37:45 -08:00
Weiwei Li	ce2ad938ff	[mlir][spirv] Define spv.GLSL.Ldexp co-authored-by: Alan Liu <alanliu.yf@gmail.com> Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97228	2021-02-24 13:07:46 -05:00
Lei Zhang	5f8a80882b	[mlir] Add constBuilderCall to TypeAttr to simplify builders Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97344	2021-02-24 13:04:03 -05:00
Eugene Zhulenev	ce976d2db3	[mlir] Add polynomial approximation for math::LogOp (using builders API) Replace math::LogOp with an approximations from the the Julien Pommier's SSE math library Link: http://gruntthepeon.free.fr/ssemath Reviewed By: asaadaldien Differential Revision: https://reviews.llvm.org/D97304	2021-02-24 07:50:25 -08:00
Alexander Belyaev	7377ef9357	[mlir] Add a builder to `linalg.tiled_loop`. https://llvm.discourse.group/t/rfc-add-linalg-tileop/2833 Differential Revision: https://reviews.llvm.org/D97372	2021-02-24 14:47:27 +01:00
Christian Sigg	eb8d6af5e4	[mlir] Specify cuda-runner pass pipeline as command line options. The cuda-runner registers two pass pipelines for nested passes, so that we don't have to use verbose textual pass pipeline specification. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D97091	2021-02-24 14:36:52 +01:00
Alexander Belyaev	945b76d428	[mlir][linalg] Fix Linalg roundtrip test. The test did not check whether the operations can be parsed again after printing them once. Differential Revision: https://reviews.llvm.org/D97368	2021-02-24 11:31:09 +01:00
River Riddle	59f0e4627a	[mlir][Inliner] Don't optimize callees in async mode if there is only one to optimize This avoids unnecessary async overhead in situations that won't benefit from it.	2021-02-23 18:44:09 -08:00
Kern Handa	3c4cdd0b6a	[mlir] ExecutionEngine needs special handling for COFF binaries Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97141	2021-02-23 17:34:19 -08:00
River Riddle	16a50c9e64	[mlir][Inliner] Keep the number of async pass managers constant This prevents a bug in the pass instrumentation implementation where the main thread would end up with a different pass manager in different runs of the pass.	2021-02-23 16:40:24 -08:00
River Riddle	abd3c6f24c	[mlir][Inliner] Use llvm::parallelForEach instead of llvm::parallelTransformReduce llvm::parallelTransformReduce does not schedule work on the caller thread, which becomes very costly for the inliner where a majority of SCCs are small, often ~1 element. The switch to llvm::parallelForEach solves this, and also aligns the implementation with the PassManager (which realistically should share the same implementation). This change dropped compile time on an internal benchmark by ~1(25%) second. Differential Revision: https://reviews.llvm.org/D96086	2021-02-23 14:36:45 -08:00
River Riddle	65a3197a8f	[mlir] Refactor InterfaceMap to use a sorted vector of interfaces, as opposed to a DenseMap A majority of operations have a very small number of interfaces, which means that the cost of using a hash map is generally larger for interface lookups than just a binary search. In the future when there are a number of operations with large amounts of interfaces, we can switch to a hybrid approach that optimizes lookups based on the number of interfaces. For now, however, a binary search is the best approach. This dropped compile time on a largish TF MLIR module by 20%(half a second). Differential Revision: https://reviews.llvm.org/D96085	2021-02-23 14:36:45 -08:00
Aart Bik	17fa919847	[mlir][sparse] incorporate vector index into address computation When computing dense address, a vectorized index must be accounted for properly. This bug was formerly undetected because we get 0 * prev + i in most cases, which folds away the scalar part. Now it works for all cases. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D97317	2021-02-23 13:25:51 -08:00
Adam Straw	af8adea155	make Affine parallel and yield ops MemRefsNormalizable Affine parallel ops may contain and yield results from MemRefsNormalizable ops in the loop body. Thus, both affine.parallel and affine.yield should have the MemRefsNormalizable trait. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D96821	2021-02-23 10:16:47 -08:00
Nicolas Vasilache	8cf14b8dec	[mlir][Linalg] Retire hoistViewAllocOps. This transformation was only used for quick experimentation and is not general enough. Retire it. Differential Revision: https://reviews.llvm.org/D97266	2021-02-23 11:45:19 +00:00
Nicolas Vasilache	551ba72760	[mlir] NFC - Use declarative assembly for scf::YieldOp	2021-02-23 11:17:30 +00:00
Frederik Gossen	1fff7c8924	Fix unused variable	2021-02-23 11:19:35 +01:00
River Riddle	dc6a84fce6	[mlir] Add support for DebugCounters using the new DebugAction infrastructure DebugCounters allow for selectively enabling the execution of a debug action based upon a "counter". This counter is comprised of two components that are used in the control of execution of an action, a "skip" value and a "count" value. The "skip" value is used to skip a certain number of initial executions of a debug action. The "count" value is used to prevent a debug action from executing after it has executed for a set number of times (not including any executions that have been skipped). For example, a counter for a debug action with `skip=47` and `count=2`, would skip the first 47 executions, then execute twice, and finally prevent any further executions. This is effectively the same as the DebugCounter infrastructure in LLVM, but using the DebugAction infrastructure in MLIR. We can't simply reuse the DebugCounter support already present in LLVM due to its heavy reliance on global constructors (which are not allowed in MLIR). The DebugAction infrastructure already nicely supports the debug counter use case, and promotes the separation of policy and mechanism design philosophy. Differential Revision: https://reviews.llvm.org/D96395	2021-02-23 01:01:17 -08:00
River Riddle	72d5afa4ac	[mlir] Add a new debug action framework. This revision adds the infrastructure for `Debug Actions`. This is a DEBUG only API that allows for external entities to control various aspects of compiler execution. This is conceptually similar to something like DebugCounters in LLVM, but at a lower level. This framework doesn't make any assumptions about how the higher level driver is controlling the execution, it merely provides a framework for connecting the two together. This means that on top of DebugCounter functionality, we could also provide more interesting drivers such as interactive execution. A high level overview of the workflow surrounding debug actions is shown below: * Compiler developer defines an `action` that is taken by the a pass, transformation, utility that they are developing. * Depending on the needs, the developer dispatches various queries, pertaining to this action, to an `action manager` that will provide an answer as to what behavior the action should do. * An external entity registers an `action handler` with the action manager, and provides the logic to resolve queries on actions. The exact definition of an `external entity` is left opaque, to allow for more interesting handlers. This framework was proposed here: https://llvm.discourse.group/t/rfc-debug-actions-in-mlir-debug-counters-for-the-modern-world Differential Revision: https://reviews.llvm.org/D84986	2021-02-23 00:52:17 -08:00
KareemErgawy-TomTom	67e0d58de4	[MLIR][LinAlg] Start detensoring implementation. This commit is the first baby step towards detensoring in linalg-on-tensors. Detensoring is the process through which a tensor value is convereted to one or potentially more primitive value(s). During this process, operations with such detensored operands are also converted to an equivalen form that works on primitives. The detensoring process is driven by linalg-on-tensor ops. In particular, a linalg-on-tensor op is checked to see whether all its operands can be detensored. If so, those operands are converted to thier primitive counterparts and the linalg op is replaced by an equivalent op that takes those new primitive values as operands. This works towards handling github/google/iree#1159. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96271	2021-02-23 08:27:58 +01:00
Mehdi Amini	99b0032ce0	Move the MLIR integration tests as a subdirectory of test (NFC) This does not change the behavior directly: the tests only run when `-DMLIR_INCLUDE_INTEGRATION_TESTS=ON` is configured. However running `ninja check-mlir` will not run all the tests within a single lit invocation. The previous behavior would wait for all the integration tests to complete before starting to run the first regular test. The test results were also reported separately. This change is unifying all of this and allow concurrent execution of the integration tests with regular non-regression and unit-tests. Differential Revision: https://reviews.llvm.org/D97241	2021-02-23 05:55:47 +00:00
River Riddle	154cabe722	[mlir][pdl][NFC] Extract the execution of each bytecode operation into its own function This makes the implementation of each bytecode operation much easier to reason about, and lets the compiler decide which implementations are beneficial to inline into the main switch. Differential Revision: https://reviews.llvm.org/D95716	2021-02-22 19:02:48 -08:00
River Riddle	ddd556f10e	[mlir][pdl] Fix bug when ordering predicates We should be ordering predicates with higher primary/secondary sums first, but we are currently ordering them last. This allows for predicates more frequently encountered to be checked first. Differential Revision: https://reviews.llvm.org/D95715	2021-02-22 19:02:48 -08:00
River Riddle	06e25d5645	[mlir][IR] Refactor the `getChecked` and `verifyConstructionInvariants` methods on Attributes/Types `verifyConstructionInvariants` is intended to allow for verifying the invariants of an attribute/type on construction, and `getChecked` is intended to enable more graceful error handling aside from an assert. There are a few problems with the current implementation of these methods: * `verifyConstructionInvariants` requires an mlir::Location for emitting errors, which is prohibitively costly in the situations that would most likely use them, e.g. the parser. This creates an unfortunate code duplication between the verifier code and the parser code, given that the parser operates on llvm::SMLoc and it is an undesirable overhead to pre-emptively convert from that to an mlir::Location. * `getChecked` effectively requires duplicating the definition of the `get` method, creating a quite clunky workflow due to the subtle different in its signature. This revision aims to talk the above problems by refactoring the implementation to use a callback for error emission. Using a callback allows for deferring the costly part of error emission until it is actually necessary. Due to the necessary signature change in each instance of these methods, this revision also takes this opportunity to cleanup the definition of these methods by: * restructuring the signature of `getChecked` such that it can be generated from the same code block as the `get` method. * renaming `verifyConstructionInvariants` to `verify` to match the naming scheme of the rest of the compiler. Differential Revision: https://reviews.llvm.org/D97100	2021-02-22 17:37:49 -08:00
Aart Bik	0df59f234b	[sparse][mlir] simplify lattice optimization logic Simplifies the way lattices are optimized with less, but more powerful rules. This also fixes an inaccuracy where too many lattices resulted (expecting a non-existing universal index). Also puts no-side-effects on all proper getters and unifies bufferization flags order in integration tests (for future, more complex use cases). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D97134	2021-02-22 16:52:06 -08:00
Geoffrey Martin-Noble	e2224dd753	Fix typo introduced in https://reviews.llvm.org/D97006 Differential Revision: https://reviews.llvm.org/D97220	2021-02-22 13:11:37 -08:00
Geoffrey Martin-Noble	54529c4be6	Add missing dep to fix shared libs build Followup to https://reviews.llvm.org/D97006 which broke the shared libs build because of a missing dependency. Differential Revision: https://reviews.llvm.org/D97213	2021-02-22 11:36:48 -08:00
Vivek	817d343fb0	[MLIR] Fix tilePerfectlyNested utility for handling non-unit step size The current implementation of tilePerfectlyNested utility doesn't handle the non-unit step size. We have added support to perform tiling correctly even if the step size of the loop to be tiled is non-unit. Fixes https://bugs.llvm.org/show_bug.cgi?id=49188. Differential Revision: https://reviews.llvm.org/D97037	2021-02-23 00:50:04 +05:30
Geoffrey Martin-Noble	2ce6a42cc9	[MLIR] Add Linalg support for integer (generalized) matmuls This patch adds Linalg named ops for various types of integer matmuls. Due to limitations in the tc spec/linalg-ods-gen ops cannot be type polymorphic, so this instead creates new ops (improvements to the methods for defining Linalg named ops are underway with a prototype at https://github.com/stellaraccident/mlir-linalgpy). To avoid the necessity of directly referencing these many new ops, this adds additional methods to ContractionOpInterface to allow classifying types of operations based on their indexing maps. Reviewed By: nicolasvasilache, mravishankar Differential Revision: https://reviews.llvm.org/D97006	2021-02-22 11:13:26 -08:00
Benjamin Kramer	ed4d12c2ce	[mlir][Shape] Fix a crash when folding nary broadcast ops operands[2] can be nullptr here. I'm not able to build a lit test for this because of the commutative reordering of operands. It's possible to trigger this with a createOrFold<BroadcastOp> though. Differential Revision: https://reviews.llvm.org/D97206	2021-02-22 20:06:37 +01:00
Vinayaka Bandishti	15332982c3	[MLIR][affine] Prevent fusion when ops with memory effect free are present between producer and consumer This commit fixes a bug in affine fusion pipeline where an incorrect fusion is performed despite a dealloc op is present between a producer and a consumer. This is done by creating a node for dealloc op in the MDG. Reviewed By: bondhugula, dcaballe Differential Revision: https://reviews.llvm.org/D97032	2021-02-22 23:21:02 +05:30
Tres Popp	5b20d80a03	[mlir] Mark std.subview as NoSideEffect Differential Revision: https://reviews.llvm.org/D96951	2021-02-22 09:34:38 +01:00
Kern Handa	2d62212b06	[mlir] Export CUDA and Vulkan runtime wrappers on Windows Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97140	2021-02-21 22:58:55 -08:00
Jacques Pienaar	04c66edd32	[mlir] Add simple jupyter kernel Simple jupyter kernel using mlir-opt and reproducer to run passes. Useful for local experimentation & generating examples. The export to markdown from here is not immediately useful nor did I define a CodeMirror synax to make the HTML output prettier. It only supports one level of history (e.g., `_`) as I was mostly using with expanding a pipeline one pass at a time and so was all I needed. I placed this in utils directory next to editor & debugger utils. Differential Revision: https://reviews.llvm.org/D95742	2021-02-21 18:16:06 -08:00
Stella Laurenzo	6c9541d4dd	Implement simple type polymorphism for linalg named ops. * It was decided that this was the end of the line for the existing custom tc parser/generator, and this is the first step to replacing it with a declarative format that maps well to mathy source languages. * One such source language is implemented here: https://github.com/stellaraccident/mlir-linalgpy/blob/main/samples/mm.py * In fact, this is the exact source of the declarative `polymorphic_matmul` in this change. * I am working separately to clean this python implementation up and add it to MLIR (probably as `mlir.tools.linalg_opgen` or equiv). The scope of the python side is greater than just generating named ops: the ops are callable and directly emit `linalg.generic` ops fully dynamically, and this is intended to be a feature for frontends like npcomp to define custom linear algebra ops at runtime. * There is more work required to handle full type polymorphism, especially with respect to integer formulations, since they require more specificity wrt types. * Followups to this change will bring the new generator to feature parity with the current one and delete the current. Roughly, this involves adding support for interface declarations and attribute symbol bindings. Differential Revision: https://reviews.llvm.org/D97135	2021-02-21 14:30:31 -08:00
Jacques Pienaar	fa211f3ce9	Update test error string post pass registration change	2021-02-20 15:54:52 -08:00
Jacques Pienaar	02d7b260c6	[mlir] Register the print-op-graph pass using ODS Move over to ODS & use pass options.	2021-02-20 15:42:02 -08:00
Aart Bik	f32b3401e1	[mlir][sparse] convert function pass to module pass Rationale: Touching function level information can only be done within a module pass. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97102	2021-02-19 20:04:02 -08:00
Mehdi Amini	f8c1f3b14a	Revert "Revert "Fix MLIR Toy tutorial JIT example and add a test to cover it"" This reverts commit `f36060417a` and reapply commit `ae15b1e7ad`. JIT test must be annotated to not run on Windows.	2021-02-19 23:54:52 +00:00
Stella Stamenova	f36060417a	Revert "Fix MLIR Toy tutorial JIT example and add a test to cover it" This reverts commit `ae15b1e7ad`. This commit caused failures on the mlir windows buildbot	2021-02-19 13:38:43 -08:00
Eugene Zhulenev	f99ccf6516	[mlir] Add math polynomial approximation pass This gives ~30x speedup compared to expanding Tanh into exp operations: ``` name old cpu/op new cpu/op delta BM_mlir_Tanh_f32/10 253ns ± 3% 55ns ± 7% -78.35% (p=0.000 n=44+41) BM_mlir_Tanh_f32/100 2.21µs ± 4% 0.14µs ± 8% -93.85% (p=0.000 n=48+49) BM_mlir_Tanh_f32/1k 22.6µs ± 4% 0.7µs ± 5% -96.68% (p=0.000 n=32+42) BM_mlir_Tanh_f32/10k 225µs ± 5% 7µs ± 6% -96.88% (p=0.000 n=49+55) name old time/op new time/op delta BM_mlir_Tanh_f32/10 259ns ± 1% 56ns ± 2% -78.31% (p=0.000 n=41+39) BM_mlir_Tanh_f32/100 2.27µs ± 1% 0.14µs ± 5% -93.89% (p=0.000 n=46+49) BM_mlir_Tanh_f32/1k 22.9µs ± 1% 0.8µs ± 4% -96.67% (p=0.000 n=30+42) BM_mlir_Tanh_f32/10k 230µs ± 0% 7µs ± 3% -96.88% (p=0.000 n=37+55) ``` This approximations is based on Eigen::generic_fast_tanh function Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96739	2021-02-19 12:43:36 -08:00
Nicolas Vasilache	0ee4bf151c	[mlir] Add folding of tensor.cast -> subtensor_insert Differential Revision: https://reviews.llvm.org/D97059	2021-02-19 17:24:16 +00:00
Geoffrey Martin-Noble	236aab0b0c	[MLIR] Delete unused functions getCollapsedInitTensor and getExpandedInitTensor These are unused since https://reviews.llvm.org/rG81264dfbe80df08668a325a61613b64243b99c01 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97014	2021-02-19 09:23:54 -08:00
Nicolas Vasilache	62f5c46eec	[mlir][Linalg] NFC - Expose more options to the CodegenStrategy	2021-02-19 14:01:44 +00:00
Alexander Belyaev	53367b8fe1	[mlir][nfc] Fix indentation in LinalgOps.td.	2021-02-19 13:02:58 +01:00
Nicolas Vasilache	d12fa33d73	[mlir] Add a TensorLoadToMemref canonicalization A folder of `tensor_load + tensor_to_memref` exists but it only applies when source and destination memref types are the same. This revision adds a canonicalize `tensor_load + tensor_to_memref` to `memref_cast` when type mismatches prevent folding to kick in. Differential Revision: https://reviews.llvm.org/D97038	2021-02-19 09:38:33 +00:00
Nicolas Vasilache	b3c227a25a	[mlir] Better support for rank-reducing subview / subtensor type inference. Differential Revision: https://reviews.llvm.org/D96995	2021-02-19 08:30:50 +00:00
Aart Bik	2556d62282	[mlir][sparse] assert fail on mismatch between rank and annotations array Rationale: Providing the wrong number of sparse/dense annotations was silently ignored or caused unrelated crashes. This minor change verifies that the provided number matches the rank. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D97034	2021-02-18 23:22:14 -08:00
Christian Sigg	c86c96a710	[mlir] Load dynamic libraries in JitRunner from absolute paths so that GDB can find the symbol tables. Reviewed By: mehdi_amini, ftynse Differential Revision: https://reviews.llvm.org/D96759	2021-02-19 07:33:35 +01:00
Geoffrey Martin-Noble	db011775e4	Reland "[MLIR] Make structured op tests permutation invariant" Relands with fix swapping DEPENDS for LINK_LIBS. This reverts commit `cd8cc00b9e`. Differential Revision: https://reviews.llvm.org/D97011	2021-02-18 18:09:49 -08:00
Mehdi Amini	ae15b1e7ad	Fix MLIR Toy tutorial JIT example and add a test to cover it	2021-02-19 01:53:36 +00:00
Jing Pu	d690cbf821	Add DivOp to the Shape dialect Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D96907	2021-02-18 16:58:47 -08:00
Mehdi Amini	cd8cc00b9e	Revert "[MLIR] Make structured op tests permutation invariant" This reverts commit `b9ff67099a`. The build is broken with -DBUILD_SHARED_LIBS=ON	2021-02-19 00:16:45 +00:00
Geoffrey Martin-Noble	b9ff67099a	[MLIR] Make structured op tests permutation invariant Extracts the relevant dimensions from the map under test to build up the maps to test against in a permutation-invariant way. This also includes a fix to the indexing maps used by isColumnMajorMatmul. The maps as currently written do not describe a column-major matmul. The linalg named op column_major_matmul has the correct maps (and notably fails the current test). If `C = matmul(A, B)` we want an operation that given A in column major format and B in column major format produces C in column major format. Given that for a matrix, faux column major is just transpose. `column_major_matmul(transpose(A), transpose(B)) = transpose(C)`. If `A` is `NxK` and `B` is `KxM`, then `C` is `NxM`, so `transpose(A)` is `KxN`, `transpose(B)` is `MxK` and `transpose(C)` is `MxN`, not `NxM` as these maps currently have. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96984	2021-02-18 14:36:07 -08:00
Nicolas Vasilache	b006902b2d	[mlir] Fold trivial subtensor / subtensor_insert ops. Static subtensor / subtensor_insert of the same size as the source / destination tensor and root @[0..0] with strides [1..1] are folded away. Differential revision: https://reviews.llvm.org/D96991	2021-02-18 21:34:55 +00:00
Nicolas Vasilache	8e01e2ec0f	[mlir][Vector] Fold tensor_cast + vector.transfer_read Differential Revision: https://reviews.llvm.org/D96988	2021-02-18 20:47:16 +00:00
Andrew Pritchard	08c681f645	Perform memory accesses in the same addrspace as the corresponding memref. It's not necessarily the case on all architectures that all memory is addressable in addrspace 0, so casting the pointer to addrspace 0 is liable to cause problems. Reviewed By: aartbik, ftynse, nicolasvasilache Differential Revision: https://reviews.llvm.org/D96380	2021-02-18 12:36:16 -08:00
natashaknk	25b4a6a7f0	[MLIR][TOSA] Add lowering from TOSA to Linalg for math-based and elementwise ops This patch adds lowering to Linalg for the following TOSA ops: negate, rsqrt, mul, select, clamp and reluN and includes support for signless integer and floating point types Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D96924	2021-02-18 12:10:10 -08:00
Alexander Belyaev	624fccba87	[mlir] Add `linalg.tiled_loop` op. `subtensor_insert` was used instead of `linalg.subtensor_yield` to make this PR smaller. Verification will be added in a follow-up PR. Differential Revision: https://reviews.llvm.org/D96943	2021-02-18 13:23:00 +01:00
Alexander Belyaev	a89035d750	Revert "[MLIR] Create memref dialect and move several dialect-specific ops from std." This commit introduced a cyclic dependency: Memref dialect depends on Standard because it used ConstantIndexOp. Std depends on the MemRef dialect in its EDSC/Intrinsics.h Working on a fix. This reverts commit `8aa6c3765b`.	2021-02-18 12:49:52 +01:00
Julian Gross	8aa6c3765b	[MLIR] Create memref dialect and move several dialect-specific ops from std. Create the memref dialect and move several dialect-specific ops without dependencies to other ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp DeallocOp -> MemRef_DeallocOp MemRefCastOp -> MemRef_CastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp TransposeOp -> MemRef_TransposeOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D96425	2021-02-18 11:29:39 +01:00
Alex Zinenko	12875ed976	[mlir] generate enum translation functions with unused attribute The functions translating enums to LLVM IR are generated in a single file included in many places, not all of which use all translations. Generate functions with "unused" attribute to silence compiler warnings. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96880	2021-02-18 09:44:40 +01:00
Aart Bik	ff6c84b803	[mlir][sparse] generalize sparse storage format to many more types Rationale: Narrower types for overhead storage yield a smaller memory footprint for sparse tensors and thus needs to be supported. Also, more value types need to be supported to deal with all kinds of kernels. Since the "one-size-fits-all" sparse storage scheme implementation is used instead of actual codegen, the library needs to be able to support all combinations of desired types. With some crafty templating and overloading, the actual code for this is kept reasonably sized though. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D96819	2021-02-17 18:20:23 -08:00
Rob Suderman	55756f32f7	[MLIR][TOSA] Expand Tosa int types to I8 and I16 Tosa integers should include I8 and I16 values. Differential Revision: https://reviews.llvm.org/D96900	2021-02-17 14:18:38 -08:00
Alex Zinenko	4a3473ff3b	[mlir] silence unused-function warnings in table-generated code Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96695	2021-02-17 19:28:31 +01:00
Eugene Zhulenev	519f5917b4	[mlir] Add fma operation to std dialect Will remove `vector.fma` operation in the followup CLs. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96801	2021-02-17 10:06:01 -08:00
Hanhan Wang	c80484e16e	[mlir][StandardToSPIRV] Add support for lowering trunci to SPIR-V to i1 types. Add a pattern to converting some value to a boolean. spirv.S/UConvert does not work on i1 types. Thus, the pattern is lowered to cmpi + select. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D96851	2021-02-17 07:23:41 -08:00
Weiwei Li	7742620620	[mlir][spirv] Add spv.GLSL.FrexpStruct co-authored-by: Alan Liu <alanliu.yf@gmail.com> Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D96527	2021-02-17 09:02:03 -05:00
Benjamin Kramer	63a35f35ec	[mlir][Shape] Generalize cstr_broadcastable folding for n-ary broadcasts This is still fairly tricky code, but I tried to untangle it a bit. Differential Revision: https://reviews.llvm.org/D96800	2021-02-17 11:44:52 +01:00
Benjamin Kramer	82b692e546	[mlir][Shape] Mark BroadcastOp as not having side effects This allows it to be dead code eliminated when unused. Differential Revision: https://reviews.llvm.org/D96797	2021-02-17 10:26:14 +01:00
Stella Laurenzo	4c3f1be84f	[mlir][python] Add python binding for AffineMapAttribute. Differential Revision: https://reviews.llvm.org/D96815	2021-02-16 15:43:30 -08:00
MaheshRavishankar	81264dfbe8	[mlir][Linalg] Add utility method to reshape ops to express output shape in terms of input shape. Resolving the dim of outputs of a tensor_reshape op in terms of its input shape allows the op to be eliminated when its used only in its dims. The init_tensor -> tensor_reshape canonicalization can be simplified to use the dims of the output of the tensor_reshape which gets canonicalized away later making the tensor_reshape dead. Differential Revision: https://reviews.llvm.org/D96635	2021-02-16 13:42:08 -08:00
Adam Straw	99c0458f2f	separate AffineMapAccessInterface from AffineRead/WriteOpInterface Separating the AffineMapAccessInterface from AffineRead/WriteOp interface so that dialects which extend Affine capabilities (e.g. PlaidML PXA = parallel extensions for Affine) can utilize relevant passes (e.g. MemRef normalization). Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D96284	2021-02-16 13:05:27 -08:00
Thomas Raoux	adfd3c7083	[mlir] Fix memref_cast + subview folder when reducing rank When the destination of the subview has a lower rank than its source we need to fix the result type of the new subview op. Differential Revision: https://reviews.llvm.org/D96804	2021-02-16 12:00:59 -08:00
Alex Zinenko	ce8f10d6cb	[mlir] Simplify ModuleTranslation for LLVM IR A series of preceding patches changed the mechanism for translating MLIR to LLVM IR to use dialect interface with delayed registration. It is no longer necessary for specific dialects to derive from ModuleTranslation. Remove all virtual methods from ModuleTranslation and factor out the entry point to be a free function. Also perform some cleanups in ModuleTranslation internals. Depends On D96774 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96775	2021-02-16 18:42:52 +01:00
Alex Zinenko	2ab57c503e	[mlir] tighten LLVM dialect verifiers to generate valid LLVM IR Verification of the LLVM IR produced when translating various MLIR dialects was only active when calling the translation programmatically. This has led to several cases of invalid LLVM IR being generated that could not be caught with textual mlir-translate tests. Add verifiers for these cases and fix the tests in preparation for enforcing the validation of LLVM IR. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96774	2021-02-16 18:18:21 +01:00
Alex Zinenko	9cd47a26d5	[mlir] add verifiers for NVVM and ROCDL kernel attributes Make sure they can only be attached to LLVM functions as a result of converting GPU functions to the LLVM Dialect.	2021-02-16 18:06:54 +01:00
Thomas Raoux	397336dcab	[mlir][vector] Add missing support for contract of integer lowering. Some of the lowering of vector.contract didn't support integer case. Since reduction of integer cannot accumulate we always break up the reduction op, it should be merged by a separate canonicalization if possible. Differential Revision: https://reviews.llvm.org/D96461	2021-02-16 07:13:30 -08:00
Thomas Raoux	807e5467f3	[mlir] Add canonicalization for tensor_cast + tensor_to_memref This helps bufferization passes by removing tensor_cast operations. Differential Revision: https://reviews.llvm.org/D96745	2021-02-16 07:11:09 -08:00
Lei Zhang	cb1a42359b	[mlir][vector] Move splitting transfer ops into a separate entry point These patterns unrolls transfer read/write ops if the vector consumers/ producers are extract/insert slices op. Transfer ops can map to hardware load/store functionalities, where the vector size matters for bandwidth considerations. So these patterns should be collected separately, instead of being generic canonicalization patterns. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96782	2021-02-16 10:04:34 -05:00
Lei Zhang	d8c7f442ea	[mlir][vector] Add support for unrolling vector.fma Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96781	2021-02-16 09:56:25 -05:00
Tres Popp	787d771dce	[mlir] Don't return nullptrs from scf::IfOp::getSuccessorRegions Previously this might happen if there was no elseRegion and the method was asked for all successor regions. Differential Revision: https://reviews.llvm.org/D96764	2021-02-16 12:06:30 +01:00
Nicolas Vasilache	21debeae78	[mlir][Linalg] Generalize vector::transfer hoisting on tensors. This revision adds support for hoisting "subtensor + vector.transfer_read" / "subtensor_insert + vector.transfer_write pairs" across scf.for. The unit of hoisting becomes a HoistableRead / HoistableWrite struct which contains a pair of "vector.transfer_read + optional subtensor" / "vector.transfer_write + optional subtensor_insert". scf::ForOp canonicalization patterns are applied greedily on the successful application of the transformation to cleanup the IR more eagerly and potentially expose more transformation opportunities. Differential revision: https://reviews.llvm.org/D96731	2021-02-16 09:45:14 +00:00
Adrian Kuegel	07cc77187a	Lower math.expm1 to intrinsics in the GPUToNVVM and GPUToROCDL conversions. This adds the lowering for expm1 for GPU backends. Differential Revision: https://reviews.llvm.org/D96756	2021-02-16 10:23:42 +01:00
Adrian Kuegel	9f581815ae	Add Expm1 op to the math dialect. Differential Revision: https://reviews.llvm.org/D96704	2021-02-16 08:33:37 +01:00
Nicolas Vasilache	d01ea0edaa	[mlir] Drop reliance of SliceAnalysis on specific ops. SliceAnalysis originally was developed in the context of affine.for within mlfunc. It predates the notion of region. This revision updates it to not hardcode specific ops like scf::ForOp. When rooted at an op, the behavior of the slice computation changes as it recurses into the regions of the op. This does not support gathering all values transitively depending on a loop induction variable anymore. Additional variants rooted at a Value are added to also support the existing behavior. Differential revision: https://reviews.llvm.org/D96702	2021-02-16 06:34:32 +00:00
Nicolas Vasilache	02d053ed2d	[mlir][Vector] Add a canonicalization pattern for vector.contract + add Differential Revision: https://reviews.llvm.org/D96701	2021-02-15 21:22:36 +00:00
Jacques Pienaar	381a65fa06	[mlir] Add clone method to ShapedType Allow clients to create a new ShapedType of the same "container" type but with different element or shape. First use case is when refining shape during shape inference without needing to consider which ShapedType is being refined. Differential Revision: https://reviews.llvm.org/D96682	2021-02-15 11:04:16 -08:00
Tres Popp	3842d4b679	Make shape.is_broadcastable/shape.cstr_broadcastable nary This corresponds with the previous work to make shape.broadcast nary. Additionally, simplify the ConvertShapeConstraints pass. It now doesn't lower an implicit shape.is_broadcastable. This is still the same in combination with shape-to-standard when the 2 passes are used in either order. Differential Revision: https://reviews.llvm.org/D96401	2021-02-15 16:05:32 +01:00
Alex Zinenko	1d6f08e61d	[mlir] use new cmake targets in mlir-*-runner	2021-02-15 15:04:00 +01:00
Alex Zinenko	176379e0c8	[mlir] Use the interface-based translation for LLVM "intrinsic" dialects Port the translation of five dialects that define LLVM IR intrinsics (LLVMAVX512, LLVMArmNeon, LLVMArmSVE, NVVM, ROCDL) to the new dialect interface-based mechanism. This allows us to remove individual translations that were created for each of these dialects and just use one common MLIR-to-LLVM-IR translation that potentially supports all dialects instead, based on what is registered and including any combination of translatable dialects. This removal was one of the main goals of the refactoring. To support the addition of GPU-related metadata, the translation interface is extended with the `amendOperation` function that allows the interface implementation to post-process any translated operation with dialect attributes from the dialect for which the interface is implemented regardless of the operation's dialect. This is currently applied to "kernel" functions, but can be used to construct other metadata in dialect-specific ways without necessarily affecting operations. Depends On D96591, D96504 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96592	2021-02-15 14:43:07 +01:00
Tres Popp	89d900b2a1	[mlir] Add error message on shape.broadcast verification failure	2021-02-15 10:58:53 +01:00
Alex Zinenko	34ea608a47	[mlir] Support repeated delayed registration of dialect interfaces Dialects themselves do not support repeated addition of interfaces with the same TypeID. However, in case of delayed registration, the registry may contain such an interface, or have the same interface registered several times due to, e.g., dependencies. Make sure we delayed registration does not attempt to add an interface with the same TypeID more than once. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96606	2021-02-15 10:46:26 +01:00
Tobias Gysi	99f3510b41	Reland "[mlir] add support for verification in integration tests" The patch extends the runner utils by verification methods that compare two memrefs. The methods compare the content of the two memrefs and print success if the data is identical up to a small numerical error. The methods are meant to simplify the development of integration tests that compare the results against a reference implementation (cf. the updates to the linalg matmul integration tests). Originally landed in `5fa893c` (https://reviews.llvm.org/D96326) and reverted in `dd719fd` due to a Windows build failure. Changes: - Remove the max function that requires the "algorithm" header on Windows - Eliminate the truncation warning in the float specialization of verifyElem by using a float constant Reviewed By: Kayjukh Differential Revision: https://reviews.llvm.org/D96593	2021-02-14 20:30:05 +01:00
Nicolas Vasilache	428bc6feed	[mlir][Linalg] Fix constant detection in linalg.pad_tensor vectorization.	2021-02-14 15:53:39 +00:00
Fangrui Song	3643828b51	[CMake][mlir] Fix mlir-linalg-ods-gen/CMakeLists.txt after D96645	2021-02-13 14:16:38 -08:00
daquexian	6e31a6b7c2	fix linalg ods gen cross compiling like other gen executables Signed-off-by: daquexian <daquexian566@gmail.com> Reviewed By: vinograd47 Differential Revision: https://reviews.llvm.org/D96645	2021-02-13 19:17:46 +00:00
Praveen Narayanan	a65fb1916c	Add a "kind" attribute to ContractionOp and OuterProductOp. Currently, vector.contract joins the intermediate result and the accumulator argument (of ranks K) using summation. We desire more joining operations --- such as max --- to help vector.contract express reductions. This change extends Vector_ContractionOp to take an optional attribute (called "kind", of enum type CombiningKind) specifying the joining operation to be add/mul/min/max for int/fp , and and/or/xor for int only. By default this attribute has value "add". To implement this we also need to extend vector.outerproduct, since vector.contract gets transformed to vector.outerproduct (and that to vector.fma). The extension for vector.outerproduct is also an optional kind attribute that uses the same enum type and possible values. The default is "add". In case of max/min we transform vector.outerproduct to a combination of compare and select. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D93280	2021-02-12 20:23:59 +00:00
Mehdi Amini	aa4e466caa	[mlir][Linalg] Improve region support in Linalg ops This revision takes advantage of the newly extended `ref` directive in assembly format to allow better region handling for LinalgOps. Specifically, FillOp and CopyOp now build their regions explicitly which allows retiring older behavior that relied on specific op knowledge in both lowering to loops and vectorization. This reverts commit `3f22547fd1` and reland `973e133b76` with a workaround for a gcc bug that does not accept lambda default parameters: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=59949 Differential Revision: https://reviews.llvm.org/D96598	2021-02-12 19:11:24 +00:00
Diego Caballero	656674a7c4	[mlir][Vector] Align gather/scatter/expand/compress API Align the vector gather/scatter/expand/compress API with the vector load/store/maskedload/maskedstore API. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D96396	2021-02-12 20:48:38 +02:00
Diego Caballero	ee66e43a96	[mlir][Vector] Introduce 'vector.load' and 'vector.store' ops This patch adds the 'vector.load' and 'vector.store' ops to the Vector dialect [1]. These operations model contiguous vector loads and stores from/to memory. Their semantics are similar to the 'affine.vector_load' and 'affine.vector_store' counterparts but without the affine constraints. The most relevant feature is that these new vector operations may perform a vector load/store on memrefs with a non-vector element type, unlike 'std.load' and 'std.store' ops. This opens the representation to model more generic vector load/store scenarios: unaligned vector loads/stores, perform scalar and vector memory access on the same memref, decouple memory allocation constraints from memory accesses, etc [1]. These operations will also facilitate the progressive lowering of both Affine vector loads/stores and Vector transfer reads/writes for those that read/write contiguous slices from/to memory. In particular, this patch adds the 'vector.load' and 'vector.store' ops to the Vector dialect, implements their lowering to the LLVM dialect, and changes the lowering of 'affine.vector_load' and 'affine.vector_store' ops to the new vector ops. The lowering of Vector transfer reads/writes will be implemented in the future, probably as an independent pass. The API of 'vector.maskedload' and 'vector.maskedstore' has also been changed slightly to align it with the transfer read/write ops and the vector new ops. This will improve reusability among all these operations. For example, the lowering of 'vector.load', 'vector.store', 'vector.maskedload' and 'vector.maskedstore' to the LLVM dialect is implemented with a single template conversion pattern. [1] https://llvm.discourse.group/t/memref-type-and-data-layout/ Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96185	2021-02-12 20:48:37 +02:00
Mehdi Amini	3f22547fd1	Revert "[mlir][Linalg] Improve region support in Linalg ops." This reverts commit `973e133b76`. It triggers an issue in gcc5 that require investigation, the build is broken with: /tmp/ccdpj3B9.s: Assembler messages: /tmp/ccdpj3B9.s:5821: Error: symbol `_ZNSt17_Function_handlerIFvjjEUljjE2_E9_M_invokeERKSt9_Any_dataOjS6_' is already defined /tmp/ccdpj3B9.s:5860: Error: symbol `_ZNSt14_Function_base13_Base_managerIUljjE2_E10_M_managerERSt9_Any_dataRKS3_St18_Manager_operation' is already defined	2021-02-12 18:15:51 +00:00
Alex Zinenko	66900b3eae	[mlir] Use dialect interfaces to translate OpenMP dialect to LLVM IR Migrate the translation of the OpenMP dialect operations to LLVM IR to the new dialect-based mechanism. Depends On D96503 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96504	2021-02-12 18:37:47 +01:00
Alex Zinenko	b77bac0572	[mlir] Introduce dialect interfaces for translation to LLVM IR The existing approach to translation to the LLVM IR relies on a single translation supporting the base LLVM dialect, extensible through inheritance to support intrinsic-based dialects also derived from LLVM IR such as NVVM and AVX512. This approach does not scale well as it requires additional translations to be created for each new intrinsic-based dialect and does not allow them to mix in the same module, contrary to the rest of the MLIR infrastructure. Furthermore, OpenMP translation ingrained itself into the main translation mechanism. Start refactoring the translation to LLVM IR to operate using dialect interfaces. Each dialect that contains ops translatable to LLVM IR can implement the interface for translating them, and the top-level translation driver can operate on interfaces without knowing about specific dialects. Furthermore, the delayed dialect registration mechanism allows one to avoid a dependency on LLVM IR in the dialect that is translated to it by implementing the translation as a separate library and only registering it at the client level. This change introduces the new mechanism and factors out the translation of the "main" LLVM dialect. The remaining dialects will follow suit. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96503	2021-02-12 17:49:44 +01:00
Nicolas Vasilache	f3fb2dd147	[mlir][Linalg] NFC - Add an OpFoldResult-based builder for InitTensorOp	2021-02-12 16:03:51 +00:00
Nicolas Vasilache	973e133b76	[mlir][Linalg] Improve region support in Linalg ops. This revision takes advantage of the newly extended `ref` directive in assembly format to allow better region handling for LinalgOps. Specifically, FillOp and CopyOp now build their regions explicitly which allows retiring older behavior that relied on specific op knowledge in both lowering to loops and vectorization. Differential Revision: https://reviews.llvm.org/D96598	2021-02-12 14:51:03 +00:00
Benjamin Kramer	530d6ea97b	[mlir][spirv] Lower sexti -> SConvert	2021-02-12 15:04:12 +01:00
Alex Zinenko	4c4876c314	[mlir] Use target-specific GPU kernel attributes in lowering pipelines Until now, the GPU translation to NVVM or ROCDL intrinsics relied on the presence of the generic `gpu.kernel` attribute to attach additional LLVM IR metadata to the relevant functions. This would be problematic if each dialect were to handle the conversion of its own options, which is the intended direction for the translation infrastructure. Introduce `nvvm.kernel` and `rocdl.kernel` in addition to `gpu.kernel` and base translation on these new attributes instead. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D96591	2021-02-12 14:09:24 +01:00
Stephan Herhut	2bfe27da17	[mlir][math] Fix cmake files after dialect splitting. This fixes some missing dependencies that broke the shared library build.	2021-02-12 11:25:15 +01:00
Stephan Herhut	4348d8ab7f	[mlir][math] Split off the math dialect. This does not split transformations, yet. Those will be done as future clean ups. Differential Revision: https://reviews.llvm.org/D96272	2021-02-12 10:55:12 +01:00
Alexander Belyaev	891e769ab6	[mlir] Initialize `isSmallAlloc` in `initialize` method.	2021-02-12 10:28:58 +01:00
Alexander Belyaev	16213e1f50	[mlir] Allow users of `PromoteBuffersToStackPass` to customize `small buffer` func. Differential Revision: https://reviews.llvm.org/D96579	2021-02-12 10:11:58 +01:00
Aart Bik	5f022ad6ed	[mlir] detect integer overflow in debug mode Rationale: This computation failed ASAN for the following input (integer overflow during 4032000000000000000 * 100): tensor<100x200x300x400x500x600x700x800xf32> This change adds a simple overflow detection during debug mode (which we run more regularly than ASAN). Arguably this is an unrealistic tensor input, but in the context of sparse tensors, we may start to see cases like this. Bug: https://bugs.llvm.org/show_bug.cgi?id=49136 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96530	2021-02-11 18:20:40 -08:00
Nicolas Vasilache	5bc4f8846c	s[mlir] Tighten computation of inferred SubView result type. The AffineMap in the MemRef inferred by SubViewOp may have uncompressed symbols which result in type mismatch on otherwise unused symbols. Make the computation of the AffineMap compress those unused symbols which results in better canonical types. Additionally, improve the error message to report which inferred type was expected. Differential Revision: https://reviews.llvm.org/D96551	2021-02-11 22:38:16 +00:00
Stella Stamenova	ed98676fa4	Support multi-configuration generators correctly in several config files Multi-configuration generators (such as Visual Studio and Xcode) allow the specification of a build flavor at build time instead of config time, so the lit configuration files need to support that - and they do for the most part. There are several places that had one of two issues (or both!): 1) Paths had %(build_mode)s set up, but then not configured, resulting in values that would not work correctly e.g. D:/llvm-build/%(build_mode)s/bin/dsymutil.exe 2) Paths did not have %(build_mode)s set up, but instead contained $(Configuration) (which is the value for Visual Studio at configuration time, for Xcode they would have had the equivalent) e.g. "D:/llvm-build/$(Configuration)/lib". This seems to indicate that we still have a lot of fragility in the configurations, but also that a number of these paths are never used (at least on Windows) since the errors appear to have been there a while. This patch fixes the configurations and it has been tested with Ninja and Visual Studio to generate the correct paths. We should consider removing some of these settings altogether. Reviewed By: JDevlieghere, mehdi_amini Differential Revision: https://reviews.llvm.org/D96427	2021-02-11 09:32:20 -08:00
Nicolas Vasilache	e332c22cdf	[mlir][LLVM] NFC - Refactor a lookupOrCreateFn to reuse common function creation. Differential revision: https://reviews.llvm.org/D96488	2021-02-11 15:52:33 +00:00
Alex Zinenko	0881a4f1bf	[mlir] make ModuleTranslation mapping fields private ModuleTranslation contains multiple fields that keep track of the mappings between various MLIR and LLVM IR components. The original ModuleTranslation extension model was based on inheritance, with these fields being protected and thus accessible in the ModuleTranslation and derived classes. The inheritance-based model doesn't scale to translation of more than one derived dialect and will be progressively replaced with a more flexible one based on dialect interfaces and a translation state that is separate from ModuleTranslation. This change prepares the replacement by making the mappings private and providing public methods to access them. Depends On D96436 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96437	2021-02-11 14:50:49 +01:00
Alex Zinenko	9a08f760fe	[mlir] Make JitRunnerMain main take a DialectRegistry Historically, JitRunner has been registering all available dialects with the context and depending on them without the real need. Make it take a registry that contains only the dialects that are expected in the input and stop linking in all dialects. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96436	2021-02-11 14:50:48 +01:00
Stephan Herhut	33a58c1c5c	[mlir][gpu] Allow all dialects in SCF to GPU conversion. With the standard dialect being split up, the set of dialects that are used when converting to GPU is growing. This change modifies the SCFToGpu pass to allow all operations inside launch bodies. Differential Revision: https://reviews.llvm.org/D96480	2021-02-11 10:02:26 +01:00
Hanhan Wang	9325b8da17	[mlir][Linalg] Add conv ops with TF definition. The dimension order of a filter in tensorflow is [filter_height, filter_width, in_channels, out_channels], which is different from current definition. The current definition follows TOSA spec. Add TF version conv ops to .tc, so we do not have to insert a transpose op around a conv op. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D96038	2021-02-10 22:59:38 -08:00
Sanjoy Das	bac1f12727	NFC; fix typo in comment This should have gone in with `a76761cf0d`.	2021-02-10 21:34:29 -08:00
Sanjoy Das	a76761cf0d	NFC comment-only cleanups - Remove leftover comment from `de2568aab8` - Fix a typo in a comment	2021-02-10 21:30:52 -08:00
Aart Bik	11bec2a81c	[mlir][sparse] reduce tensor dimensions in sparse test Rationale: BuiltinTypes.cpp observed overflow when computing size of tensor<100x200x300x400x500x600x700x800xf32>. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D96475	2021-02-10 17:59:19 -08:00
Mehdi Amini	b1aaed023e	Enable `Pass::initialize()` to fail by returning a LogicalResult Differential Revision: https://reviews.llvm.org/D96474	2021-02-11 01:51:53 +00:00
Mehdi Amini	09cfec6243	Fix CMake configuration for MLIR unittests The CMake changes in `2aa1af9b1d` to make it possible to build MLIR as a standalone project unfortunately disabled all unit-tests from the regular in-tree build.	2021-02-11 01:17:49 +00:00
Rob Suderman	c19a412809	[MLIR][TOSA] Tosa elementwise broadcasting Added support for broadcasting size-1 dimensions for TOSA elemtnwise operations. Differential Revision: https://reviews.llvm.org/D96190	2021-02-10 15:28:18 -08:00
Sean Silva	b83361b82b	Minor fixes to Type ODS docs. Differential Revision: https://reviews.llvm.org/D96391	2021-02-10 15:22:59 -08:00
Sean Silva	6b07a97835	[mlir] Introduce more intuitive wording for attributes. After discussion, it seems like we want to go with "inherent/discardable". These seem to best capture the relationship with the op semantics and don't conflict with other terms. Please let me know your preferences. Some of the other contenders are: ``` "intrinsic" side \| "annotation" side -----------------+------------------ characteristic \| annotation closed \| open definitional \| advisory essential \| discardable expected \| unexpected innate \| acquired internal \| external intrinsic \| extrinsic known \| unknown local \| global native \| foreign inherent \| acquired ``` Rationale: - discardable: good. discourages use for stable data. - inherent: good - annotation: redundant and doesn't convey difference - intrinsic: confusable with "compiler intrinsics". - definitional: too much of a mounthful - extrinsic: too exotic of a word and hard to say - acquired: doesn't convey the relationship to the semantics - internal/external: not immediately obvious: what is internal to what? - innate: similar to intrinsic but worse - acquired: we don't typically think of an op as "acquiring" things - known/unknown: by who? - local/global: to what? - native/foreign: to where? - advisory: confusing distinction: is the attribute itself advisory or is the information it provides advisory? - essential: an intrinsic attribute need not be present. - expected: same issue as essential - unexpected: by who/what? - closed/open: whether the set is open or closed doesn't seem essential to the attribute being intrinsic. Also, in theory an op can have an unbounded set of intrinsic attributes (e.g. `arg<N>` for func). - characteristic: unless you have a math background this probably doesn't make as much sense Differential Revision: https://reviews.llvm.org/D96093	2021-02-10 15:22:59 -08:00
Nicolas Vasilache	24db783938	[mlir] NFC - Extend inferResultType API for SubViewOp and SubTensorOp	2021-02-10 22:55:28 +00:00
Nicolas Vasilache	4643fd27c8	[mlir][Linalg] Fix crash when tileSizeComputationFunction is left unspecified	2021-02-10 22:47:05 +00:00
Mehdi Amini	81987396ac	Fix StridedMemRefType operator[] SFINAE to allow correctly selecting the `int64_t` overload for non-container operands	2021-02-10 20:02:11 +00:00
Jing Pu	544cebd619	Change type constraint of the "index" in "shape.split_at" to Shape_SizeOrIndexType Make the type contraint consistent with other shape dialect operations. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D96377	2021-02-10 11:58:19 -08:00
Aart Bik	0b1764a3d7	[mlir][sparse] sparse tensor storage implementation This revision connects the generated sparse code with an actual sparse storage scheme, which can be initialized from a test file. Lacking a first-class citizen SparseTensor type (with buffer), the storage is hidden behind an opaque pointer with some "glue" to bring the pointer back to tensor land. Rather than generating sparse setup code for each different annotated tensor (viz. the "pack" methods in TACO), a single "one-size-fits-all" implementation has been added to the runtime support library. Many details and abstractions need to be refined in the future, but this revision allows full end-to-end integration testing and performance benchmarking (with on one end, an annotated Lingalg op and, on the other end, a JIT/AOT executable). Reviewed By: nicolasvasilache, bixia Differential Revision: https://reviews.llvm.org/D95847	2021-02-10 11:57:24 -08:00
Mehdi Amini	9680ea5c98	Add convenience C++ helper to manipulate ranked strided memref Reland `11f32a41c2` that was reverted in `e49967fbd9` after fixing the build. Differential Revision: https://reviews.llvm.org/D96192	2021-02-10 18:58:05 +00:00
Mehdi Amini	e49967fbd9	Revert "Add convenience C++ helper to manipulate ranked strided memref" This reverts commit `11f32a41c2`. The build is broken because this commit conflits with the refactoring of the DialectRegistry APIs in the context. It'll reland shortly after fixing the API usage.	2021-02-10 18:09:38 +00:00
Mehdi Amini	11f32a41c2	Add convenience C++ helper to manipulate ranked strided memref Differential Revision: https://reviews.llvm.org/D96192	2021-02-10 17:40:36 +00:00
Nicolas Vasilache	0ac3d97bf4	[mlir][Linalg] Fix pad hoisting. This revision fixes the indexing logic into the packed tensor that result from hoisting padding. Previously, the index was incorrectly set to the loop induction variable when in fact we need to compute the iteration count (i.e. `(iv - lb).ceilDiv(step)`). Differential Revision: https://reviews.llvm.org/D96417	2021-02-10 16:49:38 +00:00
Nicolas Vasilache	bb69de3f41	[mlir][Linalg] Add a vectorization pattern for linalg::PadTensorOp The new pattern is exercised from the TestLinalgTransforms pass. Differential Revision: https://reviews.llvm.org/D96410	2021-02-10 14:13:49 +00:00
Alex Zinenko	2996a8d675	[mlir] avoid exposing mutable DialectRegistry from MLIRContext MLIRContext allows its users to access directly to the DialectRegistry it contains. While sometimes useful for registering additional dialects on an already existing context, this breaks the encapsulation by essentially giving raw accesses to a part of the context's internal state. Remove this mutable access and instead provide a method to append a given DialectRegistry to the one already contained in the context. Also provide a shortcut mechanism to construct a context from an already existing registry, which seems to be a common use case in the wild. Keep read-only access to the registry contained in the context in case it needs to be copied or used for constructing another context. With this change, DialectRegistry is no longer concerned with loading the dialects and deciding whether to invoke delayed interface registration. Loading is concentrated in the MLIRContext, and the functionality of the registry better reflects its name. Depends On D96137 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96331	2021-02-10 12:07:34 +01:00
Alex Zinenko	3da51522fb	[mlir] enable delayed registration of dialect interfaces This introduces a mechanism to register interfaces for a dialect without making the dialect itself depend on the interface. The registration request happens on DialectRegistry and, if the dialect has not been loaded yet, the actual registration is delayed until the dialect is loaded. It requires DialectRegistry to become aware of the context that contains it and the context to expose methods for querying if a dialect is loaded. This mechanism will enable a simple extension mechanism for dialects that can have interfaces defined outside of the dialect code. It is particularly helpful for, e.g., translation to LLVM IR where we don't want the dialect itself to depend on LLVM IR libraries. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96137	2021-02-10 12:07:32 +01:00
Tres Popp	f30f347da1	[mlir][shape] Generalize broadcast to a variadic number of shapes Previously broadcast was a binary op. Now it can support more inputs. This has been changed in such a way that for now, this is an NFC for all broadcast operations that were previously legal. Differential Revision: https://reviews.llvm.org/D95777	2021-02-10 08:31:28 +01:00
Uday Bondhugula	fdfd647837	[MLIR] NFC Fix vector transforms build warnings Fix build warnings from VectorTransforms.cpp.	2021-02-10 10:42:56 +05:30
Uday Bondhugula	5400f602cd	[MLIR] Update affine.for unroll utility for iter_args support Update affine.for loop unroll utility for iteration arguments support. Fix promoteIfSingleIteration as well. Fixes PR49084: https://bugs.llvm.org/show_bug.cgi?id=49084 Differential Revision: https://reviews.llvm.org/D96383	2021-02-10 10:38:47 +05:30
Andrew Pritchard	74c3615997	Add LLVMIR Dialect counterparts of @llvm.maximum and @llvm.minimum. These are similar to maxnum and minnum, but they're defined to treat -0 as less than +0. This behavior can't be expressed using float comparisons and selects, since comparisons are defined to treat different-signed zeros as equal. So, the only way to communicate this behavior into LLVM IR without defining target-specific intrinsics is to add the corresponding ops. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96373	2021-02-10 00:57:43 +00:00
Andrew Pritchard	018645b81b	Fix side-effect detection in LLVMIRIntrinsicGen. Previously it reported an op had side-effects iff it declared that it didn't have any side-effects. This had the undesirable result that canonicalization would always delete any intrinsic calls that did memory stores and returned void. Reviewed By: ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D96369	2021-02-10 00:48:16 +00:00
Jing Pu	4bd68f238c	Add NoSideEffect trait to shape.split_at and shape.concat Reviewed By: jpienaar, silvas Differential Revision: https://reviews.llvm.org/D96358	2021-02-09 15:19:53 -08:00
River Riddle	6e3292b0b7	[mlir][OpFormatGen] Refactor `type_ref` into a more general `ref` directive This allows for referencing nearly every component of an operation from within a custom directive. It also fixes a bug with the current type_ref implementation, PR48478 Differential Revision: https://reviews.llvm.org/D96189	2021-02-09 14:33:48 -08:00
River Riddle	b9c876bd7e	[mlir] Add initial support for an alias analysis framework in MLIR This revision adds a new `AliasAnalysis` class that represents the main alias analysis interface in MLIR. The purpose of this class is not to hold the aliasing logic itself, but to provide an interface into various different alias analysis implementations. As it evolves this should allow for users to plug in specialized alias analysis implementations for their own needs, and have them immediately usable by other analyses and transformations. This revision also adds an initial simple generic alias, LocalAliasAnalysis, that provides support for performing stateless local alias queries between values. This class is similar in scope to LLVM's BasicAA. Differential Revision: https://reviews.llvm.org/D92343	2021-02-09 14:21:27 -08:00
George	6962bd68f1	[MLIR] Add context accessor to identifier I knew I would miss one... Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D96321	2021-02-09 13:21:30 -08:00
River Riddle	fe7c0d90b2	[mlir][IR] Remove the concept of `OperationProperties` These properties were useful for a few things before traits had a better integration story, but don't really carry their weight well these days. Most of these properties are already checked via traits in most of the code. It is better to align the system around traits, and improve the performance/cost of traits in general. Differential Revision: https://reviews.llvm.org/D96088	2021-02-09 12:00:15 -08:00
Weiwei Li	2ef24139fc	[mlir][spirv] Add support for sampled image type co-authored-by: Alan Liu <alanliu.yf@gmail.com> Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D96169	2021-02-09 14:14:07 -05:00
Hanhan Wang	e8d31754a2	[mlir][Linalg] Add a build method for linalg.pad_tensor Add a build method that pads the source with a scalar value. Reviewed By: nicolasvasilache, antiagainst Differential Revision: https://reviews.llvm.org/D96343	2021-02-09 10:19:57 -08:00
Tobias Gysi	dd719fda76	Revert "[mlir] add support for verification in integration tests" This reverts commit `5fa893c`. Windows build bot fails due to missing header https://reviews.llvm.org/D96326	2021-02-09 19:16:02 +01:00
George	5099a48a3b	[MLIR] Replace dialect registration hooks with dialect handle Replace MlirDialectRegistrationHooks with MlirDialectHandle, which under-the-hood is an opaque pointer to MlirDialectRegistrationHooks. Then we expose the functionality previously directly on MlirDialectRegistrationHooks, as functions which take the opaque MlirDialectHandle struct. This makes the actual structure of the registration hooks an implementation detail, and happens to avoid this issue: https://llvm.discourse.group/t/strange-swift-issues-with-dialect-registration-hooks/2759/3 Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D96229	2021-02-09 09:02:16 -08:00
Tobias Gysi	5fa893cc38	[mlir] add support for verification in integration tests The patch extends the runner utils by verification methods that compare two memrefs. The methods compare the content of the two memrefs and print success if the data is identical up to a small numerical error. The methods are meant to simplify the development of integration tests that for example compare optimized and unoptimized code paths (cf. the updates to the linalg matmul integration tests). Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96326	2021-02-09 17:43:11 +01:00
Lei Zhang	7784ce078d	[mlir][linalg] Fix depthwise conv C++ symbol to be consistent The assembly mnemonic includes information about input/filter data format. The C++ symbol should be consistent.	2021-02-09 10:06:32 -05:00
Thomas Raoux	bfa508efd5	[mlir][linalg] Fix one more missing NoSideEffect in linalg tensor op Differential Revision: https://reviews.llvm.org/D96314	2021-02-09 07:04:50 -08:00
Denys Shabalin	fa581f9438	[mlir] Add stacksave, stackrestore to llvm dialect Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D96333	2021-02-09 15:13:16 +01:00
Lei Zhang	068bf9e802	[mlir][linalg] Define a depthwise 2-D convolution op This commit defines linalg.depthwise_conv_2d_nhwc for depthwise 2-D convolution with NHWC input/output data format. This op right now only support channel multiplier == 1, which is the most common case. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D94966	2021-02-09 08:55:20 -05:00
Lei Zhang	4c640e49c9	[mlir][linalg] Verify indexing map required attributes Indexing maps for named ops can reference attributes so that we can synthesize the indexing map dynamically. This supports cases like strides for convolution ops. However, it does cause an issue: now the indexing_maps() function call is dependent on those attributes. Linalg ops inherit LinalgOpInterfaceTraits, which calls verifyStructuredOpInterface() to verify the interface. verifyStructuredOpInterface() further calls indexing_maps(). Note that trait verification is done before the op itself, where ODS generates the verification for those attributes. So we can have indexing_maps() referencing non-existing or invalid attribute, before the ODS-generated verification kick in. There isn't a dependency handling mechansim for traits. This commit adds new interface methods to query whether an op hasDynamicIndexingMaps() and then perform verifyIndexingMapRequiredAttributes() in verifyStructuredOpInterface() to handle the dependency issue. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96297	2021-02-09 08:48:29 -05:00
Matthias Springer	b6910fd31d	[MLIR][AVX512] Add integration test for vp2intersect Differential Revision: https://reviews.llvm.org/D96306	2021-02-09 16:43:37 +09:00
George	8f130f108f	[MLIR] Add C API for navigating up the IR tree Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96301	2021-02-08 19:54:38 -08:00
Uday Bondhugula	333d2cfc70	[MLIR][NFC] Fix std.copysign op documentation Fix std.copysign op documentation. NFC. Differential Revision: https://reviews.llvm.org/D96217	2021-02-09 05:59:05 +05:30
Nicolas Vasilache	d57a305fdf	[mlir][Linalg] Fix padding related bugs. This revision fixes the fact that the padding transformation did not have enough information to set the proper type for the padding value. Additionally, the verifier for Yield in the presence of PadTensorOp is fixed to properly report incorrect number of results or operands. Previously, the error would be silently ignored which made the core issue difficult to debug. Differential Revision: https://reviews.llvm.org/D96264	2021-02-08 18:59:24 +00:00

... 3 4 5 6 7 ...

7118 Commits