llvm-project

Commit Graph

Author	SHA1	Message	Date
River Riddle	b8ffcb12e2	[mlir:Pass] Generate a reproducer as early as possible This avoids keeping references to passes that may be freed by the time that the pass manager has finished executing (in the non-crash case). Fixes PR#52069 Differential Revision: https://reviews.llvm.org/D111106	2021-10-05 18:11:26 +00:00
Rob Suderman	d5a4c86d14	[mlir][tosa] tosa.cast support for unsigned integers Unsigned integers need to be handled for cast to floating point. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D111102	2021-10-05 10:57:16 -07:00
Aart Bik	16b8f4ddae	[mlir][sparse] add a "release" operation to sparse tensor dialect We have several ways to materialize sparse tensors (new and convert) but no explicit operation to release the underlying sparse storage scheme at runtime (other than making an explicit delSparseTensor() library call). To simplify memory management, a sparse_tensor.release operation has been introduced that lowers to the runtime library call while keeping tensors, opague pointers, and memrefs transparent in the initial IR. Note There is obviously some tension between the concept of immutable tensors and memory management methods. This tension is addressed by simply stating that after the "release" call, no further memref related operations are allowed on the tensor value. We expect the design to evolve over time, however, and arrive at a more satisfactory view of tensors and buffers eventually. Bug: http://llvm.org/pr52046 Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D111099	2021-10-05 09:35:59 -07:00
Nicolas Vasilache	af9dce18bf	[mlir][Linalg] Allow operand-less scf::ExecuteRegionOp to encapsulate scf::YieldOp These are considered noops. Buferization will still fail on scf.execute_region which yield values. This is used to make comprehensive bufferization interoperate better with external clients. Differential Revision: https://reviews.llvm.org/D111130	2021-10-05 11:34:53 +00:00
Adrian Kuegel	d009f6e51c	[mlir] Convert ConstShapeOp to a static tensor type. ConstShapeOp knows its shape, so it should also have a static tensor type. Differential Revision: https://reviews.llvm.org/D111127	2021-10-05 12:14:43 +02:00
Alex Zinenko	01d696e563	[mlir] rename the "packing" flag of linalg.pad_tensor to "nofold" The discussion in https://reviews.llvm.org/D110425 demonstrated that "packing" may be a confusing term to define the behavior of this op in presence of the attribute. Instead, indicate the intended effect of preventing the folder from being applied. Reviewed By: nicolasvasilache, silvas Differential Revision: https://reviews.llvm.org/D111046	2021-10-04 21:28:11 +02:00
Weiwei Li	1e4cfe5e4f	[mlir][SPIRVToLLVM] Propagate location attribute from spv.GlobalVariable to llvm.mlir.global This patch is mainly to propogate location attribute from spv.GlobalVariable to llvm.mlir.global. It also contains three small changes. 1. Remove the restriction on UniformConstant In SPIRVToLLVM.cpp; 2. Remove the errorCheck on relaxedPrecision when deserializering SPIR-V in Deserializer.cpp 3. In SPIRVOps.cpp, let ConstantOp take signedInteger too. Co-authered: Alan Liu <alanliu.yf@gmail.com> and Xinyi Liu <xyliuhelen@gmail.com> Reviewed by:antiagainst Differential revision: https://reviews.llvm.org/D110207	2021-10-05 00:09:09 +08:00
Nicolas Vasilache	fab634b4e2	[mlir] Tighten strided layout specification. Clarify that the strided layout specification is represented by a single semi-affine map. Differential Revision: https://reviews.llvm.org/D110921	2021-10-04 10:37:05 +00:00
Alex Zinenko	255a690971	[mlir][python] Provide more convenient constructors for std.CallOp The new constructor relies on type-based dynamic dispatch and allows one to construct call operations given an object representing a FuncOp or its name as a string, as opposed to requiring an explicitly constructed attribute. Depends On D110947 Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D110948	2021-10-04 11:45:29 +02:00
Alex Zinenko	3a3a09f654	[mlir][python] Provide more convenient wrappers for std.ConstantOp Constructing a ConstantOp using the default-generated API is verbose and requires to specify the constant type twice: for the result type of the operation and for the type of the attribute. It also requires to explicitly construct the attribute. Provide custom constructors that take the type once and accept a raw value instead of the attribute. This requires dynamic dispatch based on type in the constructor. Also provide the corresponding accessors to raw values. In addition, provide a "refinement" class ConstantIndexOp similar to what exists in C++. Unlike other "op view" Python classes, operations cannot be automatically downcasted to this class since it does not correspond to a specific operation name. It only exists to simplify construction of the operation. Depends On D110946 Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D110947	2021-10-04 11:45:27 +02:00
Alex Zinenko	ed9e52f3af	[mlir][python] Usability improvements for Python bindings Provide a couple of quality-of-life usability improvements for Python bindings, in particular: * give access to the list of types for the list of op results or block arguments, similarly to ValueRange->TypeRange, * allow for constructing empty dictionary arrays, * support construction of array attributes by concatenating an existing attribute with a Python list of attributes. All these are required for the upcoming customization of builtin and standard ops. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D110946	2021-10-04 11:45:25 +02:00
Tobias Gysi	32a7d60516	[mli][linalg] Change tensor size in unit test (NFC). As a follow up to https://reviews.llvm.org/D110849, adapt the input tensor size to match the iteration space. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D110906	2021-10-04 06:43:35 +00:00
Michał Górny	0f567f0e3e	[mlir] [test] Add missing tool substitutions Add missing mlir-capi-*-test tool substitutions in order to fix CAPI test failures when mlir is not installed yet. Differential Revision: https://reviews.llvm.org/D110991	2021-10-03 21:28:13 +02:00
Michał Górny	93769e81ed	[mlir] [test] Include mlir_tools_dir in PATH to fix mlir-reduce Include mlir_tools_dir in the PATH used in test environment, as otherwise mlir-reduce is unable to find mlir-opt when building standalone (and hence mlir_tools_dir != llvm_tools_dir). Differential Revision: https://reviews.llvm.org/D110992	2021-10-03 08:48:59 +02:00
Mehdi Amini	bce0c6429e	Fix ASAN execution for the MLIR Python tests First the leak sanitizer has to be disabled, as even an empty script leads to leak detection with Python. Then we need to preload the ASAN runtime, as the main binary (python) won't be linked against it. This will only work on Linux right now. Differential Revision: https://reviews.llvm.org/D111004	2021-10-03 05:07:32 +00:00
Mehdi Amini	86f5028898	Exclude MLIR python binding tests from Sanitizer tests for now This requires more config to work reliably during lit execution. But also I see many leaks when running manually right now.	2021-10-03 05:07:01 +00:00
Mehdi Amini	cb2e0eb68e	Fix last leaky MLIR integration test (NFC)	2021-10-03 05:04:34 +00:00
Mehdi Amini	903facd96b	Disable leak check for the MLIR Linalg CPU integration tests (NFC) See http://llvm.org/pr52047 for tracking.	2021-10-03 03:42:45 +00:00
Mehdi Amini	5de44d2521	Disable leak check for the MLIR Sparse CPU integration tests (NFC) See http://llvm.org/pr52046 for tracking.	2021-10-03 03:35:31 +00:00
Mehdi Amini	51b9f0b82a	Fix memory leaks in MLIR integration tests for vector dialect (NFC)	2021-10-03 03:28:24 +00:00
Mehdi Amini	bac4529b43	Fix/disable more MLIR tests exposing leaks in ASAN builds (NFC)	2021-10-02 23:53:02 +00:00
Mehdi Amini	4b28638bcc	Fix multiple memory leaks in mlir-cpu-runner tests (NFC)	2021-10-02 23:16:35 +00:00
Mehdi Amini	fe48ecb047	Fix memory leak in mlir-cpu-runner/sgemm_naive_codegen.mlir (NFC)	2021-10-02 23:07:49 +00:00
Mehdi Amini	57d9adefa0	Fix memory leaks in MLIR unit-tests (NFC)	2021-10-02 21:31:46 +00:00
Mehdi Amini	237d18a61a	Fix memory leaks in mlir/test/CAPI/ir.c	2021-10-02 04:45:40 +00:00
Mehdi Amini	a1d1c31746	Add a `check-mlir-build-only` build target that only builds the dependencies of the `check-mlir` test target (NFC)	2021-10-02 04:06:17 +00:00
Daniel Resnick	782a97a977	[mlir][capi] Add TypeID to MLIR C-API Exposes mlir::TypeID to the C API as MlirTypeID along with various accessors and helper functions. Differential Revision: https://reviews.llvm.org/D110897	2021-10-01 14:21:18 -06:00
Tobias Gysi	bf28849745	[mlir][linalg] Retire PoolingMaxOp/PoolingMinOp/PoolingSumOp. The pooling ops are among the last remaining hard coded Linalg operations that have no region attached. They got obsolete due to the OpDSL pooling operations. Removing them allows us to delete specialized code and tests that are not needed for the OpDSL counterparts that rely on the standard code paths. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110909	2021-10-01 13:51:56 +00:00
Uday Bondhugula	08b63db8bb	[MLIR][GPU] Add GPU launch op support for dynamic shared memory Add support for dynamic shared memory for GPU launch ops: add an optional operand to gpu.launch and gpu.launch_func ops to specify the amount of "dynamic" shared memory to use. Update lowerings to connect this operand to the GPU runtime. Differential Revision: https://reviews.llvm.org/D110800	2021-10-01 16:46:07 +05:30
Lei Zhang	cb2e651800	[mlir][linalg] Fix incorrect bound calculation for tiling conv For convolution, the input window dimension's access affine map is of the form `(d0 * s0 + d1)`, where `d0`/`d1` is the output/ filter window dimension, and `s0` is the stride. When tiling, https://reviews.llvm.org/D109267 changed how the way dimensions are acquired. Instead of directly querying using `.dim` ops on the original convolution op, we now get it by applying the access affine map to the loop upper bounds. This is fine for dimensions having single-dimension affine maps, like matmul, but not for convolution input. It will cause incorrect compuation and out of bound. A concrete example, say we have 1x225x225x3 (NHWC) input, 3x3x3x32 (HWCF) filter, and 1x112x112x3 (NHWC) output with stride 2, (112 2 + 3) would be 227, which is different from the correct input window dimension size 225. Instead, we should first calculate the max indices for each loop, and apply the affine map to them, and then plus one to get the dimension size. Note this makes no difference for matmul-like ops given they will have `d0 - 1 + 1` effectively. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110849	2021-09-30 13:50:57 -04:00
Stella Laurenzo	267bb194f3	[mlir] Remove old "tc" linalg ods generator. * This could have been removed some time ago as it only had one op left in it, which is redundant with the new approach. * `matmul_i8_i8_i32` (the remaining op) can be trivially replaced by `matmul`, which natively supports mixed precision. Differential Revision: https://reviews.llvm.org/D110792	2021-09-30 16:30:06 +00:00
Alex Zinenko	8c1b785ce1	[mlir][python] provide bindings for the SCF dialect This is an important core dialect that has not been exposed previously. Set up the default bindings generation and provide a nicer wrapper for the `for` loop with access to the loop configuration and body. Depends On D110758 Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D110759	2021-09-30 09:38:15 +02:00
Alex Zinenko	afeda4b9ed	[mlir][python] provide access to function argument/result attributes Without this change, these attributes can only be accessed through the generic operation attribute dictionary provided the caller knows the special operation attribute names used for this purpose. Add some Python wrapping to support this use case. Also provide access to function arguments usable inside the function along with a couple of quality-of-life improvements in using block arguments (function arguments being the arguments of its entry block). Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D110758	2021-09-30 09:38:13 +02:00
Chris Lattner	fb093c8314	[ODS/AsmParser] Don't pass MLIRContext with DialectAsmParser. The former is redundant because the later carries it as part of its builder. Add a getContext() helper method to DialectAsmParser to make this more convenient, and stop passing the context around explicitly. This simplifies ODS generated parser hooks for attrs and types. This resolves PR51985 Recommit `4b32f8bac4` after fixing a dependency. Differential Revision: https://reviews.llvm.org/D110796	2021-09-30 05:10:28 +00:00
Mehdi Amini	3310e0020c	Revert "[ODS/AsmParser] Don't pass MLIRContext with DialectAsmParser." This reverts commit `4b32f8bac4`. Seems like the build is broken with -DDBUILD_SHARED_LIBS=ON	2021-09-30 05:01:17 +00:00
Chris Lattner	4b32f8bac4	[ODS/AsmParser] Don't pass MLIRContext with DialectAsmParser. The former is redundant because the later carries it as part of its builder. Add a getContext() helper method to DialectAsmParser to make this more convenient, and stop passing the context around explicitly. This simplifies ODS generated parser hooks for attrs and types. This resolves PR51985 Differential Revision: https://reviews.llvm.org/D110796	2021-09-29 21:36:05 -07:00
Matthias Springer	27451a05ed	[mlir][vector] Fold transfer ops and tensor.extract/insert_slice. * Fold vector.transfer_read and tensor.extract_slice. * Fold vector.transfer_write and tensor.insert_slice. Differential Revision: https://reviews.llvm.org/D110627	2021-09-30 09:28:00 +09:00
Nicolas Vasilache	92ea624a13	[mlir][Linalg] Rewrite CodegenStrategy to populate a pass pipeline. This revision retires a good portion of the complexity of the codegen strategy and puts the logic behind pass logic. Differential revision: https://reviews.llvm.org/D110678	2021-09-29 13:35:45 +00:00
bakhtiyar	bdde959533	Remove unnecessary async group creates and awaits. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D110605	2021-09-28 14:52:08 -07:00
bakhtiyar	55dfab39a2	Rename target block size to min task size for clarity. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D110604	2021-09-28 14:51:55 -07:00
Amy Zhuang	7ab14b8886	[mlir] Unroll-and-jam loops with iter_args. Unroll-and-jam currently doesn't work when the loop being unroll-and-jammed or any of its inner loops has iter_args. This patch modifies the unroll-and-jam utility to support loops with iter_args. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D110085	2021-09-28 14:13:27 -07:00
thomasraoux	b12e4c17e0	[mlir] Fix bug in FoldSubview with rank reducing subview Fix how we calculate the new permutation map of the transfer ops. Differential Revision: https://reviews.llvm.org/D110638	2021-09-28 13:18:29 -07:00
Alexander Belyaev	9fb57c8c1d	[mlir] Add min/max operations to Standard. [RFC: Add min/max ops](https://llvm.discourse.group/t/rfc-add-min-max-operations/4353) I was following the naming style for Arith dialect in https://reviews.llvm.org/D110200, i.e. similar to DivSIOp and DivUIOp I defined MaxSIOp, MaxUIOp. When Arith PR is landed, I will migrate these ops as well. Differential Revision: https://reviews.llvm.org/D110540	2021-09-28 09:40:22 +02:00
Tobias Gysi	d20d0e145d	[mlir][linalg] Finer-grained padding control. Adapt the signature of the PaddingValueComputationFunction callback to either return the padding value or failure to signal padding is not desired. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110572	2021-09-27 19:21:37 +00:00
Aart Bik	06e2a0684e	[mlir][sparse] sampled matrix multiplication fusion test This integration tests runs a fused and non-fused version of sampled matrix multiplication. Both should eventually have the same performance! NOTE: relies on pending tensor.init fix! Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D110444	2021-09-27 11:50:49 -07:00
Aart Bik	ec97a205c3	[mlir][sparse] preserve zero-initialization for materializing buffers This revision makes sure that when the output buffer materializes locally (in contrast with the passing in of output tensors either in-place or not in-place), the zero initialization assumption is preserved. This also adds a bit more documentation on our sparse kernel assumption (viz. TACO assumptions). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D110442	2021-09-27 11:22:05 -07:00
Sumesh Udayakumaran	b2af2aeea6	[mlir] Mode for explicitly controlling the fusion kind New mode option that allows for either running the default fusion kind that happens today or doing either of producer-consumer or sibling fusion. This will also be helpful to minimize the compile-time of the fusion tests. Reviewed By: bondhugula, dcaballe Differential Revision: https://reviews.llvm.org/D110102	2021-09-27 20:37:42 +03:00
William S. Moses	6dd5b1e33e	[MLIR][LLVM] Add error if using incorrect attribute type for specifying LLVM linkage Address post-commit review in https://reviews.llvm.org/D108524 to add appropriate diagnostics. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D110566	2021-09-27 13:24:05 -04:00
Bixia Zheng	fbd5821c6f	Implement the conversion from sparse constant to sparse tensors. The sparse constant provides a constant tensor in coordinate format. We first split the sparse constant into a constant tensor for indices and a constant tensor for values. We then generate a loop to fill a sparse tensor in coordinate format using the tensors for the indices and the values. Finally, we convert the sparse tensor in coordinate format to the destination sparse tensor format. Add tests. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D110373	2021-09-27 09:47:29 -07:00
Eugene Zhulenev	92db09cde0	[mlir] AsyncRuntime: use int64_t for ref counting operations Workaround for SystemZ ABI problem: https://bugs.llvm.org/show_bug.cgi?id=51898 Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D110550	2021-09-27 07:55:01 -07:00
Nicolas Vasilache	b74493ecea	[mlir][Linalg] Refactor padding hoisting - NFC This revision extracts padding hoisting in a new file and cleans it up in prevision of future improvements and extensions. Differential Revision: https://reviews.llvm.org/D110414	2021-09-27 09:50:31 +00:00
Matthias Springer	ffdf0a370d	[mlir][vector] Fix bug in vector-transfer-full-partial-split When splitting with linalg.copy, cannot write into the destination alloc directly. Instead, write into a subview of the alloc. Differential Revision: https://reviews.llvm.org/D110512	2021-09-27 18:12:17 +09:00
Mehdi Amini	b3891f28a3	Fix ClangTidyLegacy warning: "'virtual' is redundant since the function is already declared 'final' " (NFC)	2021-09-26 22:02:23 +00:00
River Riddle	ef764eeeb9	[mlir:ElementsAttr] Avoid crash on empty contiguous ranges We currently, incorrectly, assume that a range always has at least one element when building a contiguous range. This commit adds a proper empty check to avoid crashing. Differential Revision: https://reviews.llvm.org/D110457	2021-09-24 23:48:51 +00:00
Lei Zhang	b45476c94c	[mlir][tosa] Do not fold transpose with quantized types For such cases, the type of the constant DenseElementsAttr is different from the transpose op return type. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D110446	2021-09-24 16:57:55 -04:00
Diego Caballero	2a876a711d	[mlir] Create a generic reduction detection utility This patch introduces a generic reduction detection utility that works across different dialecs. It is mostly a generalization of the reduction detection algorithm in Affine. The reduction detection logic in Affine, Linalg and SCFToOpenMP have been replaced with this new generic utility. The utility takes some basic components of the potential reduction and returns: 1) the reduced value, and 2) a list with the combiner operations. The logic to match reductions involving multiple combiner operations disabled until we can properly test it. Reviewed By: ftynse, bondhugula, nicolasvasilache, pifon2a Differential Revision: https://reviews.llvm.org/D110303	2021-09-24 20:45:59 +00:00
River Riddle	aca9bea199	[mlir:MemRef] Move DmaStartOp/DmaWaitOp to ODS These are among the last operations still defined explicitly in C++. I've tried to keep this commit as NFC as possible, but these ops definitely need a non-NFC cleanup at some point. Differential Revision: https://reviews.llvm.org/D110440	2021-09-24 19:35:28 +00:00
Lei Zhang	e325ebb9c7	[mlir][tosa] Add some transpose folders * If the input is a constant splat value, we just need to reshape it. * If the input is a general constant with one user, we can also constant fold it, without bloating the IR. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D110439	2021-09-24 15:25:14 -04:00
River Riddle	ef976337f5	[mlir:OpConversion] Remove the remaing usages of the deprecated matchAndRewrite methods This commits updates the remaining usages of the ArrayRef<Value> based matchAndRewrite/rewrite methods in favor of the new OpAdaptor overload. Differential Revision: https://reviews.llvm.org/D110360	2021-09-24 17:51:41 +00:00
Alex Zinenko	5988a3b7a0	[mlir] Linalg: ensure tile-and-pad always creates padding as requested Initially, the padding transformation and the related operation were only used to guarantee static shapes of subtensors in tiled operations. The transformation would not insert the padding operation if the shapes were already static, and the overall code generation would actively remove such "noop" pads. However, this transformation can be also used to pack data into smaller tensors and marshall them into faster memory, regardless of the size mismatches. In context of expert-driven transformation, we should assume that, if padding is requested, a potentially padded tensor must be always created. Update the transformation accordingly. To do this, introduce an optional `packing` attribute to the `pad_tensor` op that serves as an indication that the padding is an intentional choice (as opposed to side effect of type normalization) and should be left alone by cleanups. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110425	2021-09-24 18:40:13 +02:00
Alex Zinenko	3f89e339bb	[mlir] add pad_tensor(tensor.cast) -> pad_tensor canonicalizer This canonicalization pattern complements the tensor.cast(pad_tensor) one in propagating constant type information when possible. It contributes to the feasibility of pad hoisting. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110343	2021-09-24 12:03:47 +02:00
Matthias Springer	f3f25ffc04	[mlir][linalg] Fix result type in FoldSourceTensorCast * Do not discard static result type information that cannot be inferred from lower/upper padding. * Add optional argument to `PadTensorOp::inferResultType` for specifying known result dimensions. Differential Revision: https://reviews.llvm.org/D110380	2021-09-24 16:47:18 +09:00
Mehdi Amini	83f3c615dd	Add missing storageType to AttrDef to ODS This is only noticeable when using an attribute across dialects I think. Previously the namespace would be ommited, but it wouldn't matter as long as the generated code stays within a single namespace. Differential Revision: https://reviews.llvm.org/D110367	2021-09-24 01:30:29 +00:00
Matthias Springer	2190f8a8b1	[mlir][linalg] Support tile+peel with TiledLoopOp Only scf.for was supported until now. Differential Revision: https://reviews.llvm.org/D110220	2021-09-24 10:23:31 +09:00
Matthias Springer	8dc16ba8d2	[mlir][linalg] Merge all tiling passes into a single one. Passes such as `linalg-tile-to-tiled-loop` are merged into `linalg-tile`. Differential Revision: https://reviews.llvm.org/D110214	2021-09-24 10:16:46 +09:00
John Demme	47cc166bc0	[MLIR] [Python] Make Attribute and Type hashable Enables putting types and attributes in sets and in dicts as keys. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D110301	2021-09-22 19:59:03 -07:00
Aart Bik	a924fcc7c3	[mlir][sparse] add sparse kernels test to sparse compiler test suite This test makes sure kernels map to efficient sparse code, i.e. all compressed for-loops, no co-iterating while loops. In addition, this revision removes the special constant folding inside the sparse compiler in favor of Mahesh' new generic linalg folding. Thanks! NOTE: relies on Mahesh fix, which needs to be rebased first Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D110001	2021-09-22 14:56:39 -07:00
Tyler Augustine	cd36bab4ca	Fix bug for Ops with default valued attributes and successors/variadic regions. When both a DefaultValuedAttr and a successor or variadic region was specified, this would generate invalid C++ declaration. There would be the parameter with a default value, followed by the successors/regions, which don't have a default, which is invalid. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D110205	2021-09-22 21:22:31 +00:00
MaheshRavishankar	a40a08ed98	[mlir][Linalg] Teach constant -> generic op fusion to handle scalar constants. The current folder of constant -> generic op only handles splat constants. The same logic holds for scalar constants. Teach the pattern to handle such cases. Differential Revision: https://reviews.llvm.org/D109982	2021-09-22 13:41:47 -07:00
River Riddle	6e60bb6883	[mlir:DataFlowAnalysis] Reprocess the arguments of already executable edges This fixes a bug where we discover new information about the arguments of an already executable edge, but don't visit the arguments. We only visit the arguments, and not the block itself, so this commit shouldn't really affect performance at all. Fixes PR#51871 Differential Revision: https://reviews.llvm.org/D110197	2021-09-22 20:14:55 +00:00
Aart Bik	5da21338bc	[mlir][sparse] generalize reduction support in sparse compiler Now not just SUM, but also PRODUCT, AND, OR, XOR. The reductions MIN and MAX are still to be done (also depends on recognizing these operations in cmp-select constructs). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D110203	2021-09-22 12:36:46 -07:00
Alex Zinenko	bdaf038266	[mlir] Always create a list of alias scopes when emitting LLVM IR Previously, the translation to LLVM IR would emit IR that directly uses a scope metadata node in case only one scope was in use in alias.scopes or noalias metadata. It should always be a list of scopes. The verifier change in `8700f2bd36` enforced this and broke the test. Fix the translation to always create a list of scopes using a new metadata node, update and reenable the respective test. Fixes PR51919. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D110140	2021-09-22 00:00:46 +02:00
Tobias Gysi	8b5236def5	[mlir][linalg] Simplify slice dim computation for fusion on tensors (NFC). Compute the tiled producer slice dimensions directly starting from the consumer not using the producer at all. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110147	2021-09-21 15:09:46 +00:00
Nicolas Vasilache	101d017a64	[mlir][Linalg] Revisit heuristic ordering of tensor.insert_slice in comprehensive bufferize. It was previously assumed that tensor.insert_slice should be bufferized first in a greedy fashion to avoid out-of-place bufferization of the large tensor. This heuristic does not hold upon further inspection. This CL removes the special handling of such ops and adds a test that exhibits better behavior and appears in real use cases. The only test adversely affected is an artificial test which results in a returned memref: this pattern is not allowed by comprehensive bufferization in real scenarios anyway and the offending test is deleted. Differential Revision: https://reviews.llvm.org/D110072	2021-09-21 14:22:45 +00:00
Nicolas Vasilache	0d2c54e851	[mlir][Linalg] Revisit RAW dependence interference in comprehensive bufferize. Previously, comprehensive bufferize would consider all aliasing reads and writes to the result buffer and matching operand. This resulted in spurious dependences being considered and resulted in too many unnecessary copies. Instead, this revision revisits the gathering of read and write alias sets. This results in fewer alloc and copies. An exhaustive test cases is added that considers all possible permutations of `matmul(extract_slice(fill), extract_slice(fill), ...)`.	2021-09-21 14:22:22 +00:00
Morten Borup Petersen	032cb1650f	[MLIR][SCF] Add for-to-while loop transformation pass This pass transforms SCF.ForOp operations to SCF.WhileOp. The For loop condition is placed in the 'before' region of the while operation, and indctuion variable incrementation + the loop body in the 'after' region. The loop carried values of the while op are the induction variable (IV) of the for-loop + any iter_args specified for the for-loop. Any 'yield' ops in the for-loop are rewritten to additionally yield the (incremented) induction variable. This transformation is useful for passes where we want to consider structured control flow solely on the basis of a loop body and the computation of a loop condition. As an example, when doing high-level synthesis in CIRCT, the incrementation of an IV in a for-loop is "just another part" of a circuit datapath, and what we really care about is the distinction between our datapath and our control logic (the condition variable). Differential Revision: https://reviews.llvm.org/D108454	2021-09-21 09:09:54 +01:00
Chris Lattner	58abc8c34b	[OpAsmParser] Add a parseCommaSeparatedList helper and beef up Delimeter. Lots of custom ops have hand-rolled comma-delimited parsing loops, as does the MLIR parser itself. Provides a standard interface for doing this that is less error prone and less boilerplate. While here, extend Delimiter to support <> and {} delimited sequences as well (I have a use for <> in CIRCT specifically). Differential Revision: https://reviews.llvm.org/D110122	2021-09-20 20:59:11 -07:00
River Riddle	d80d3a358f	[mlir] Refactor ElementsAttr into an AttrInterface This revision refactors ElementsAttr into an Attribute Interface. This enables a common interface with which to interact with element attributes, without needing to modify the builtin dialect. It also removes a majority (if not all?) of the need for the current OpaqueElementsAttr, which was originally intended as a way to opaquely represent data that was not representable by the other builtin constructs. The new ElementsAttr interface not only allows for users to natively represent their data in the way that best suits them, it also allows for efficient opaque access and iteration of the underlying data. Attributes using the ElementsAttr interface can directly expose support for interacting with the held elements using any C++ data type they claim to support. For example, DenseIntOrFpElementsAttr supports iteration using various native C++ integer/float data types, as well as APInt/APFloat, and more. ElementsAttr instances that refer to DenseIntOrFpElementsAttr can use all of these data types for iteration: ```c++ DenseIntOrFpElementsAttr intElementsAttr = ...; ElementsAttr attr = intElementsAttr; for (uint64_t value : attr.getValues<uint64_t>()) ...; for (APInt value : attr.getValues<APInt>()) ...; for (IntegerAttr value : attr.getValues<IntegerAttr>()) ...; ``` ElementsAttr also supports failable range/iterator access, allowing for selective code paths depending on data type support: ```c++ ElementsAttr attr = ...; if (auto range = attr.tryGetValues<uint64_t>()) { for (uint64_t value : *range) ...; } ``` Differential Revision: https://reviews.llvm.org/D109190	2021-09-21 01:57:43 +00:00
River Riddle	4f21152af1	[mlir] Tighten verification of SparseElementsAttr SparseElementsAttr currently does not perform any verfication on construction, with the only verification existing within the parser. This revision moves the parser verification to SparseElementsAttr, and also adds additional verification for when a sparse index is not valid. Differential Revision: https://reviews.llvm.org/D109189	2021-09-21 01:57:42 +00:00
Chia-hung Duan	bb2506061b	[mlir-tblgen] Add DagNode StaticMatcher. Some patterns may share the common DAG structures. Generate a static function to do the match logic to reduce the binary size. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D105797	2021-09-20 23:37:42 +00:00
MaheshRavishankar	4cf9bf6c9f	[mlir][MemRef] Compute unused dimensions of a rank-reducing subviews using strides as well. For `memref.subview` operations, when there are more than one unit-dimensions, the strides need to be used to figure out which of the unit-dims are actually dropped. Differential Revision: https://reviews.llvm.org/D109418	2021-09-20 11:05:30 -07:00
MaheshRavishankar	0b33890f45	[mlir][Linalg] Add ConvolutionOpInterface. Add an interface that allows grouping together all covolution and pooling ops within Linalg named ops. The interface currently - the indexing map used for input/image access is valid - the filter and output are accessed using projected permutations - that all loops are charecterizable as one iterating over - batch dimension, - output image dimensions, - filter convolved dimensions, - output channel dimensions, - input channel dimensions, - depth multiplier (for depthwise convolutions) Differential Revision: https://reviews.llvm.org/D109793	2021-09-20 10:41:10 -07:00
Mehdi Amini	5edd79fc97	Revert "[MLIR][SCF] Add for-to-while loop transformation pass" This reverts commit `644b55d57e`. The added test is failing the bots.	2021-09-20 17:21:59 +00:00
Mehdi Amini	f18f1ab4fd	Temporarily XFAIL MLIR test that fails the LLVM verifier after `8700f2bd3`	2021-09-20 17:20:11 +00:00
Tobias Gysi	7be28d82b4	[mlir][linalg] Add IndexOp support to fusion on tensors. This revision depends on https://reviews.llvm.org/D109761 and https://reviews.llvm.org/D109766. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109774	2021-09-20 15:59:35 +00:00
Morten Borup Petersen	644b55d57e	[MLIR][SCF] Add for-to-while loop transformation pass This pass transforms SCF.ForOp operations to SCF.WhileOp. The For loop condition is placed in the 'before' region of the while operation, and indctuion variable incrementation + the loop body in the 'after' region. The loop carried values of the while op are the induction variable (IV) of the for-loop + any iter_args specified for the for-loop. Any 'yield' ops in the for-loop are rewritten to additionally yield the (incremented) induction variable. This transformation is useful for passes where we want to consider structured control flow solely on the basis of a loop body and the computation of a loop condition. As an example, when doing high-level synthesis in CIRCT, the incrementation of an IV in a for-loop is "just another part" of a circuit datapath, and what we really care about is the distinction between our datapath and our control logic (the condition variable). Differential Revision: https://reviews.llvm.org/D108454	2021-09-20 16:57:50 +01:00
Tobias Gysi	6db928b8f3	[mlir][linalg] Fusion on tensors. Add a new version of fusion on tensors that supports the following scenarios: - support input and output operand fusion - fuse a producer result passed in via tile loop iteration arguments (update the tile loop iteration arguments) - supports only linalg operations on tensors - supports only scf::for - cannot add an output to the tile loop nest The LinalgTileAndFuseOnTensors pass tiles the root operation and fuses its producers. Reviewed By: nicolasvasilache, mravishankar Differential Revision: https://reviews.llvm.org/D109766	2021-09-20 14:45:34 +00:00
Valentin Clement	d6929aaa67	[mlir][openacc] Make use of the second counter extension in DataOp translation Make use of runtime extension for the second reference counter used in structured data region. This extension is implemented in D106510 and D106509. Differential Revision: https://reviews.llvm.org/D106517	2021-09-20 13:43:50 +02:00
KareemErgawy-TomTom	bdcf4b9b96	[MLIR][Linalg] Make detensoring cost-model more flexible. So far, the CF cost-model for detensoring was limited to discovering pure CF structures. This means, if while discovering the CF component, the cost-model found any op that is not detensorable, it gives up on detensoring altogether. This patch makes it a bit more flexible by cleaning-up the detensorable component from non-detensorable ops without giving up entirely. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D109965	2021-09-20 10:21:31 +02:00
Vladislav Vinogradov	ec03bbe8a7	[mlir] Fix bug in partial dialect conversion The discussion on forum: https://llvm.discourse.group/t/bug-in-partial-dialect-conversion/4115 The `applyPartialConversion` didn't handle the operations, that were marked as illegal inside dynamic legality callback. Instead of reporting error, if such operation was not converted to legal set, the method just added it to `unconvertedSet` in the same way as unknown operations. This patch fixes that and handle dynamically illegal operations as well. The patch includes 2 fixes for existing passes: * `tensor-bufferize` - explicitly mark `std.return` as legal. * `convert-parallel-loops-to-gpu` - ugly fix with marking visited operations to avoid recursive legality checks. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D108505	2021-09-20 10:39:10 +03:00
Jacques Pienaar	0a1e569d37	[mlir-c] Add getting fused loc For creating a fused loc using array of locations and metadata. Differential Revision: https://reviews.llvm.org/D110022	2021-09-18 06:57:51 -07:00
Uday Bondhugula	57eda9becc	[MLIR][GPU] Add constant propagator for gpu.launch op Add a constant propagator for gpu.launch op in cases where the grid/thread IDs can be trivially determined to take a single constant value of zero. Differential Revision: https://reviews.llvm.org/D109994	2021-09-18 12:02:46 +05:30
Aart Bik	46e77b5d10	[mlir][sparse] add a sparse quantized_matmul example to integration test Note that this revision adds a very tiny bit of constant folding in the sparse compiler lattice construction. Although I am generally trying to avoid such canonicalizations (and rely on other passes to fix this instead), the benefits of avoiding a very expensive disjunction lattice construction justify having this special code (at least for now). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109939	2021-09-17 13:04:44 -07:00
Aart Bik	d4e16171e8	[mlir][sparse] add dce test for all sparse tensor ops Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D109992	2021-09-17 13:03:42 -07:00
Krzysztof Drewniak	121aab84d1	[MLIR][Affine] Simplify nested modulo operations when able It is the case that, for all positive a and b such that b divides a (e mod (a * b)) mod b = e mod b. For example, ((d0 mod 35) mod 5) can be simplified to (d0 mod 5), but ((d0 mod 35) mod 4) cannot be simplified further (x = 36 is a counterexample). This change enables more complex simplifications. For example, ((d0 * 72 + d1) mod 144) mod 9 can now simplify to (d0 * 72 + d1) mod 9 and thus to d1 mod 9. Expressions with chained modulus operators are reasonably common in tensor applications, and this change _should_ improve code generation for such expressions. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109930	2021-09-17 19:06:00 +00:00
thomasraoux	08f0cb7719	[mlir] Prevent crash in DropUnitDim pattern due to tensor with encoding Differential Revision: https://reviews.llvm.org/D109984	2021-09-17 12:03:16 -07:00
thomasraoux	36aac53b36	[mlir][linalg] Extend drop unit dim pattern to all cases of reduction Even with all parallel loops reading the output value is still allowed so we don't have to handle reduction loops differently. Differential Revision: https://reviews.llvm.org/D109851	2021-09-17 10:09:57 -07:00
thomasraoux	416679615d	[mlir] Linalg hoisting should ignore uses outside the loop Differential Revision: https://reviews.llvm.org/D109859	2021-09-17 10:06:57 -07:00
Aart Bik	233b42a8bb	[mlir][sparse] remove unused TENSOR environment Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109919	2021-09-16 14:32:09 -07:00
Aart Bik	b1d44e5902	[mlir][sparse] add affine subscripts to sparse compilation pass This enables the sparsification of more kernels, such as convolutions where there is a x(i+j) subscript. It also enables more tensor invariants such as x(1) or other affine subscripts such as x(i+1). Currently, we reject sparsity altogether for such tensors. Despite this restriction, however, we can already handle a lot more kernels with compound subscripts for dense access (viz. convolution with dense input and sparse filter). Some unit tests and an integration test demonstrate new capability. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109783	2021-09-15 20:28:04 -07:00
Mogball	cb8c30d35d	[DRR] Explicit Return Types in Rewrites Adds a new rewrite directive returnType that can be added at the end of an op's argument list to explicitly specify return types. ``` (OpX $v0, $v1, (returnType "$_builder.getI32Type()")) ``` Pass in a bound value to copy its return type, or pass a native code call to dynamically create new types. ``` (OpX $v0, $v1, (returnType $v0, (NativeCodeCall<"..."> $v1))) ``` Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D109472	2021-09-15 14:25:29 -07:00
Rob Suderman	1ac2d195ec	[mlir][linalg] Add canonicalizers for depthwise conv There are two main versions of depthwise conv depending whether the multiplier is 1 or not. In cases where m == 1 we should use the version without the multiplier channel as it can perform greater optimization. Add lowering for the quantized/float versions to have a multiplier of one. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D108959	2021-09-15 14:09:15 -07:00
Simon Camphausen	1b79efdc72	[mlir] Fix printing of EmitC attrs/types with escape characters Attributes and types were not escaped when printing. Reviewed By: jpienaar, marbre Differential Revision: https://reviews.llvm.org/D109143	2021-09-15 18:15:38 +00:00
Nicolas Vasilache	96ec0ff2b7	[mlir][Linalg] Revisit insertion points in comprehensive bufferization. This revision fixes a corner case that could appear due to incorrect insertion point behavior in comprehensive bufferization. Differential Revision: https://reviews.llvm.org/D109830	2021-09-15 18:11:38 +00:00
Nicolas Vasilache	6fe77b1051	[mlir][Linalg] Fail comprehensive bufferization if a memref is returned. Summary: Reviewers: Subscribers: Differential revision: https://reviews.llvm.org/D109824	2021-09-15 15:11:17 +00:00
Nicolas Vasilache	660f281b5e	[mlir][Linalg] Make codegen strategy late transformations opt-in Summary: Making the late transformations opt-in results in less surprising behavior when composing multiple calls to the codegen strategy. Reviewers: Subscribers: Differential revision: https://reviews.llvm.org/D109820	2021-09-15 11:02:14 +00:00
cwz920716	500d4c45ba	[MLIR] Use memref.copy ops in BufferResultsToOutParams pass. Both copy/alloc ops are using memref dialect after this change. Reviewed By: silvas, mehdi_amini Differential Revision: https://reviews.llvm.org/D109480	2021-09-15 02:59:30 +00:00
Tobias Gysi	44a889778c	[mlir][linalg] Fold ExtractSliceOps during tiling. Add the makeComposedExtractSliceOp method that creates an ExtractSliceOp and folds chains of ExtractSliceOps by computing the sum of their offsets and by multiplying their strides. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109601	2021-09-14 11:43:52 +00:00
Matthias Springer	62883459cd	[mlir][linalg] makeTiledShape: No affine.min if tile size == 1 This improves codegen (more static type information) with `scalarize-dynamic-dims`. Differential Revision: https://reviews.llvm.org/D109415	2021-09-14 10:48:20 +09:00
Matthias Springer	fb1def9c66	[mlir][linalg] New tiling option: Scalarize dynamic dims This tiling option scalarizes all dynamic dimensions, i.e., it tiles all dynamic dimensions by 1. This option is useful for linalg ops with partly dynamic tensor dimensions. E.g., such ops can appear in the partial iteration after loop peeling. After scalarizing dynamic dims, those ops can be vectorized. Differential Revision: https://reviews.llvm.org/D109268	2021-09-14 10:40:50 +09:00
Matthias Springer	8faf35c0a5	[mlir][linalg] Add scf.for loop peeling to codegen strategy Only scf.for loops are supported at the moment. linalg.tiled_loop support will be added in a subsequent commit. Only static tensor sizes are supported. Loops for dynamic tensor sizes can be peeled, but the generated code is not optimal due to a missing canonicalization pattern. Differential Revision: https://reviews.llvm.org/D109043	2021-09-14 10:35:01 +09:00
Matthias Springer	a4a654d301	[mlir][linalg] TiledLoopOp peeling: Do not peel partial iterations Extend the unit test with an option for skipping partial iterations during loop peeling. Differential Revision: https://reviews.llvm.org/D109640	2021-09-14 10:01:46 +09:00
Nicolas Vasilache	181d18ef53	[mlir][Linalg] Insert static buffers as high as possible during ComprehensiveBufferization. This revision allows hoisting static alloc/dealloc pairs as high as possible during ComprehensiveBufferization. This also aligns such allocated buffers to 128B by default. This change exhibited some issues wrt insertion points and a missing copy that are also fixed in this revision; tests are updated accordingly. Differential Revision: https://reviews.llvm.org/D109684	2021-09-13 15:59:03 +00:00
Simon Camphausen	ec92f788f3	[mlir][emitc] Print signed integers properly Previously negative integers were printed as large unsigned values. Reviewed By: marbre Differential Revision: https://reviews.llvm.org/D109690	2021-09-13 15:29:30 +00:00
Jonas Paulsson	5f781ddffc	[MLIR] Mark test case XFAIL on SystemZ for now. mlir-cpu-runner/math_polynomial_approx.mlir This test case is currently failing on SystemZ, but it does not appear to necessarily be a target specific problem. See discussion at https://bugs.llvm.org/show_bug.cgi?id=51204.	2021-09-13 16:48:31 +02:00
Nicolas Vasilache	b01d223faf	[mlir][Linalg] Use reify for padded op shape derivation. Previously, we would insert a DimOp and rely on later canonicalizations. Unfortunately, reifyShape kind of rewrites are not canonicalizations anymore. This introduces undesirable pass dependencies. Instead, immediately reify the result shape and avoid the DimOp altogether. This is akin to a local folding, which avoids introducing more reliance on `-resolve-shaped-type-result-dims` (similar to compositions of `affine.apply` by construction to avoid chains of size > 1). It does not completely get rid of the reliance on the pass as the process is merely local: calling the pass may still be necessary for global effects. Indeed, one of the tests still requires the pass. Differential Revision: https://reviews.llvm.org/D109571	2021-09-13 11:54:59 +00:00
Mathieu Fehr	802bf02a73	[mlir] Allows to query traits from types and attributes Types and attributes now have a `hasTrait` function that allow users to check if a type defines a trait. Also, AbstractType and AbstractAttribute has now a `hasTraitFn` field to carry the implementation of the `hasTrait` function of the concrete type or attribute. This patch also adds the remaining functions to access type and attribute traits in TableGen. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D105202	2021-09-13 06:26:45 +00:00
Mehdi Amini	7fb2394a4f	Add sanity check in MLIR ODS to catch case where an arguments/results/regions/successors names overlap This is making a tablegen crash with a more friendly error. Differential Revision: https://reviews.llvm.org/D109474	2021-09-13 06:21:25 +00:00
Kiran Chandramohan	187d9f8cd9	[OpenMP][MLIR] Add a conversion pattern for the master op The conversion pattern is particularly useful for conversion of block arguments in the master op. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D109610	2021-09-12 10:13:40 +00:00
Rob Suderman	b0532286fe	[mlir][tosa] Add shape inference for tosa.while Tosa.while shape inference requires repeatedly running shape inference across the body of the loop until the types become static as we do not know the number of iterations required by the loop body. Once the least specific arguments are known they are propagated to both regions. To determine the final end type, the least restrictive types are determined from all yields. Differential Revision: https://reviews.llvm.org/D108801	2021-09-10 13:11:53 -07:00
Stephan Herhut	5e6c170b3f	[mlir][linalg] Fix bufferize pattern to allow unknown operations in body of generic The original version of the bufferization pattern for linalg.generic would manually clone operations within the region to the bufferized clone of the operation. This triggers legality requirements on those operations in the conversion infra. Instead, this now uses the rewriter to inline the region instead, avoiding those legality requirements. Differential Revision: https://reviews.llvm.org/D109581	2021-09-10 13:37:42 +02:00
Matthias Springer	0f3544d185	[mlir][scf] Loop peeling: Use scf.for for partial iteration Generate an scf.for instead of an scf.if for the partial iteration. This is for consistency reasons: The peeling of linalg.tiled_loop also uses another loop for the partial iteration. Note: Canonicalizations patterns may rewrite partial iterations to scf.if afterwards. Differential Revision: https://reviews.llvm.org/D109568	2021-09-10 19:07:09 +09:00
Nicolas Vasilache	5f1a1af4bf	[mlir][Linalg] Properly order extract_slice traversal in comprehensive bufferization This revision fixes the traversal order of extract_slice during the inplace analysis. It was previously thought that such ops could be analyzed at the very end. This is unfortunately not true as the AliasInfo for dependents of these ops need to be updated. This change allows the aliases introduced by the bufferization of extract_slice to be properly propagated. Differential Revision: https://reviews.llvm.org/D109519	2021-09-10 07:10:06 +00:00
Marius Brehler	6593cd3fe9	[mlir] Replace `include_directories` Switches to adding target specific, private includes instead of adding global includes. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D109494	2021-09-10 07:06:27 +00:00
natashaknk	d4d50e4710	[mlir][tosa] Add lowering for tosa.clz using scf::whileOp Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D109540	2021-09-09 15:57:35 -07:00
Aart Bik	066d786ce0	[mlir][sparse] add folding to sparse_tensor.convert folds conversion between identical types (with tests) Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D109545	2021-09-09 15:45:19 -07:00
Aart Bik	c34f3780a7	[mlir][sparse] fix broken test new flag requirements crossed the checkin of this new test Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D109524	2021-09-09 09:40:00 -07:00
Aart Bik	e2d3db42e5	[mlir][sparse] add casts to operations to lattice and exp builders Further enhance the set of operations that can be handled by the sparse compiler Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109413	2021-09-09 08:49:50 -07:00
Alex Zinenko	8b58ab8ccd	[mlir] Factor type reconciliation out of Standard-to-LLVM conversion Conversion to the LLVM dialect is being refactored to be more progressive and is now performed as a series of independent passes converting different dialects. These passes may produce `unrealized_conversion_cast` operations that represent pending conversions between built-in and LLVM dialect types. Historically, a more monolithic Standard-to-LLVM conversion pass did not need these casts as all operations were converted in one shot. Previous refactorings have led to the requirement of running the Standard-to-LLVM conversion pass to clean up `unrealized_conversion_cast`s even though the IR had no standard operations in it. The pass must have been also run the last among all to-LLVM passes, in contradiction with the partial conversion logic. Additionally, the way it was set up could produce invalid operations by removing casts between LLVM and built-in types even when the consumer did not accept the uncasted type, or could lead to cryptic conversion errors (recursive application of the rewrite pattern on `unrealized_conversion_cast` as a means to indicate failure to eliminate casts). In fact, the need to eliminate A->B->A `unrealized_conversion_cast`s is not specific to to-LLVM conversions and can be factored out into a separate type reconciliation pass, which is achieved in this commit. While the cast operation itself has a folder pattern, it is insufficient in most conversion passes as the folder only applies to the second cast. Without complex legality setup in the conversion target, the conversion infra will either consider the cast operations valid and not fold them (a separate canonicalization would be necessary to trigger the folding), or consider the first cast invalid upon generation and stop with error. The pattern provided by the reconciliation pass applies to the first cast operation instead. Furthermore, having a separate pass makes it clear when `unrealized_conversion_cast`s could not have been eliminated since it is the only reason why this pass can fail. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109507	2021-09-09 16:51:24 +02:00
Alex Zinenko	1ce752b741	[mlir] support reductions in SCF to OpenMP conversion OpenMP reductions need a neutral element, so we match some known reduction kinds (integer add/mul/or/and/xor, float add/mul, integer and float min/max) to define the neutral element and the atomic version when possible to express using atomicrmw (everything except float mul). The SCF-to-OpenMP pass becomes a module pass because it now needs to introduce new symbols for reduction declarations in the module. Reviewed By: chelini Differential Revision: https://reviews.llvm.org/D107549	2021-09-09 13:04:27 +02:00
Matthias Springer	c7d569b8f7	[mlir][scf] Fold dim(scf.for) to dim(iter_arg) Fold dim ops of scf.for results to dim ops of the respective iter args if the loop is shape preserving. Differential Revision: https://reviews.llvm.org/D109430	2021-09-09 13:47:13 +09:00
Matthias Springer	e2c8fcb9d0	[mlir][linalg] Fold dim(linalg.tiled_loop) to dim(output_arg) Fold dim ops of linalg.tiled_loop results to dim ops of the respective iter args if the loop is shape preserving. Differential Revision: https://reviews.llvm.org/D109431	2021-09-09 13:37:28 +09:00
Matthias Springer	f7137da174	[mlir][linalg] Fix dim(iter_arg) canonicalization Run a small analysis to see if the runtime type of the iter_arg is changing. Fold only if the runtime type stays the same. (Same as `DimOfIterArgFolder` in SCF.) Differential Revision: https://reviews.llvm.org/D109299	2021-09-09 12:13:05 +09:00
Matthias Springer	c95a7246a3	[mlir][linalg] Tiling: Use loop ub in extract_slice size computation if possible When tiling a LinalgOp, extract_slice/insert_slice pairs are inserted. To avoid going out-of-bounds when the tile size does not divide the shape size evenly (at the boundary), AffineMin ops are inserted. Some ops have assumptions regarding the dimensions of inputs/outputs. E.g., in a `A * B` matmul, `dim(A, 1) == dim(B, 0)`. However, loop bounds use either `dim(A, 1)` or `dim(B, 0)`. With this change, AffineMin ops are expressed in terms of loop bounds instead of tensor sizes. (Both have the same runtime value.) This simplifies canonicalizations. Differential Revision: https://reviews.llvm.org/D109267	2021-09-09 11:06:22 +09:00
Mehdi Amini	4eaaf05394	Add sanity check in MLIR ODS to catch case where two results have the same name This is making a tablegen crash with a more friendly error. Differential Revision: https://reviews.llvm.org/D109456	2021-09-08 23:38:50 +00:00
Chris Lattner	42431b8207	[tests] Make testsuite more resilient to "order of constant" changes. NFC.	2021-09-08 10:10:10 -07:00
Arnab Dutta	1524b01541	[MLIR] Add loop coalesce utility for affine.for Add loop coalesce utility for affine.for. This expects loops to have been normalized a-priori. This works for both constant as well non constant upper bounds having single/multiple result upper bound affine map. With contributions from Arnab Dutta and Uday Bondhugula. Reviewed By: bondhugula, ayzhuang Differential Revision: https://reviews.llvm.org/D108126	2021-09-08 18:02:23 +05:30
Matthias Springer	c57c4f888c	[mlir][linalg] linalg.tiled_loop peeling Differential Revision: https://reviews.llvm.org/D108270	2021-09-07 09:50:08 +09:00
Eugene Zhulenev	fd52b4357a	[mlir] Async: check awaited operand error state after sync await Previously only await inside the async function (coroutine after lowering to async runtime) would check the error state Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D109229	2021-09-04 05:00:17 -07:00
Loren Maggiore	361458b1ce	[mlir] create gpu memset op Create a gpu memset op and corresponding CUDA and ROCm wrappers. Reviewed By: herhut, lorenrose1013 Differential Revision: https://reviews.llvm.org/D107548	2021-09-04 08:13:04 +02:00
William S. Moses	21d43daf8f	[MLIR] Primitive linkage lowering of FuncOp FuncOp always lowers to an LLVM external linkage presently. This makes it impossible to define functions in mlir which are local to the current module. Until MLIR FuncOps have a more formal linkage specification, this commit allows funcop's to have an optionally specified llvm.linkage attribute, whose value will be used as the linkage of the llvm funcop when lowered. Differential Revision: https://reviews.llvm.org/D108524 Support LLVM linkage	2021-09-03 20:41:39 -04:00
Mehdi Amini	78accf9f35	Make LLVM Linkage a first class attribute instead of using an integer attribute This makes the IR more readable, in particular when this will be used on the builtin func outside of the LLVM dialect. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D109209	2021-09-03 21:21:46 +00:00
Alexander Belyaev	5ee5bbd0ff	[mlir][linalg] Extend tiled_loop to SCF conversion to generate scf.parallel. Differential Revision: https://reviews.llvm.org/D109230	2021-09-03 18:05:54 +02:00
Aart Bik	b6d1a31c1b	[mlir][sparse] refine heuristic for iteration graph topsort The sparse index order must always be satisfied, but this may give a choice in topsorts for several cases. We broke ties in favor of any dense index order, since this gives good locality. However, breaking ties in favor of pushing unrelated indices into sparse iteration spaces gives better asymptotic complexity. This revision improves the heuristic. Note that in the long run, we are really interested in using ML for ML to find the best loop ordering as a replacement for such heuristics. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D109100	2021-09-03 08:37:15 -07:00
Jean Perier	49af2a6275	[mlir][flang] Do not prevent integer types from being parsed as MLIR keywords DialectAsmParser::parseKeyword is rejecting `'i' digit+` while it is a valid identifier according to mlir/docs/LangRef.md. Integer types actually used to be TOK_KEYWORD a while back before the change: `6af866c58d`. This patch Modifies `isCurrentTokenAKeyword` to return true for tokens that match integer types too. The motivation for this change is the parsing of `!fir.type<{` `component-name: component-type,`+ `}>` type in FIR that represent Fortran derived types. The component-names are parsed as keywords, and can very well be i32 or any ixxx (which are valid Fortran derived type component names). The Quant dialect type parser had to be modified since it relied on `iw` not being parsed as keywords. Differential Revision: https://reviews.llvm.org/D108913	2021-09-03 08:20:49 +02:00
Matthias Springer	4fa6c2734c	[mlir][scf] Allow runtime type of iter_args to change The limitation on iter_args introduced with D108806 is too restricting. Changes of the runtime type should be allowed. Extends the dim op canonicalization with a simple analysis to determine when it is safe to canonicalize. Differential Revision: https://reviews.llvm.org/D109125	2021-09-03 10:03:05 +09:00
Alex Zinenko	f9be7a7afd	[mlir] speed up construction of LLVM IR constants when possible The translation to LLVM IR used to construct sequential constants by recurring down to individual elements, creating constant values for them, and wrapping them into aggregate constants in post-order. This is highly inefficient for large constants with known data such as DenseElementsAttr. Use LLVM's ConstantData for the innermost dimension instead. LLVM does seem to support data constants for nested sequential constants so the outer dimensions are still handled recursively. Nevertheless, this speeds up the translation of large constants with equal dimensions by up to 30x. Users are advised to rewrite large constants to use flat types before translating to LLVM IR if more efficiency in translation is necessary. This is not done automatically as the translation is not aware of the expectations of the overall compilation flow about type changes and indexing, in particular for global constants with external linkage. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D109152	2021-09-02 23:07:30 +02:00
Kiran Chandramohan	711aa35759	[MLIR][OpenMP] Add support for declaring critical construct names Add an operation omp.critical.declare to declare names/symbols of critical sections. Named omp.critical operations should use symbols declared by omp.critical.declare. Having a declare operation ensures that the names of critical sections are global and unique. In the lowering flow to LLVM IR, the OpenMP IRBuilder creates unique names for critical sections. Reviewed By: ftynse, jeanPerier Differential Revision: https://reviews.llvm.org/D108713	2021-09-02 14:31:19 +00:00
Marius Brehler	2f0750dd2e	[mlir] Add Cpp emitter This upstreams the Cpp emitter, initially presented with [1], from [2] to MLIR core. Together with the previously upstreamed EmitC dialect [3], the target allows to translate MLIR to C/C++. [1] https://reviews.llvm.org/D76571 [2] https://github.com/iml130/mlir-emitc [3] https://reviews.llvm.org/D103969 Co-authored-by: Jacques Pienaar <jpienaar@google.com> Co-authored-by: Simon Camphausen <simon.camphausen@iml.fraunhofer.de> Co-authored-by: Oliver Scherf <oliver.scherf@iml.fraunhofer.de> Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D104632	2021-09-02 13:51:05 +00:00
Alex Zinenko	8647e4c3a0	[mlir] support translating OpenMP loops with reductions Use the recently introduced OpenMPIRBuilder facility to transate OpenMP workshare loops with reductions to LLVM IR calling OpenMP runtime. Most of the heavy lifting is done at the OpenMPIRBuilder. When other OpenMP dialect constructs grow support for reductions, the translation can be updated to operate on, e.g., an operation interface for all reduction containers instead of workshare loops specifically. Designing such a generic translation for the single operation that currently supports reductions is premature since we don't know how the reduction modeling itself will be generalized. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D107343	2021-09-02 15:38:20 +02:00
Jacques Pienaar	f7bf8a8658	[mlir][capi] Add NameLoc Add method to get NameLoc. Treat null child location as unknown to avoid needing to create UnknownLoc in C API where child loc is not needed. Differential Revision: https://reviews.llvm.org/D108678	2021-09-01 16:16:35 -07:00
Weiwei Li	a79d7c2c85	[mlir][SPIRV] Add Image Operands for Image Instructions This patch is to add Image Operands in SPIR-V Dialect and also let ImageDrefGather to use Image Operands. Image Operands are used in many image instructions. "Image Operands encodes what oprands follow, as per Image Operands". And ususally, they are optional to image instructions. The format of image operands looks like: %0 = spv.ImageXXXX %1, ... %3 : f32 ["Bias\|Lod"](%4, %5 : f32, f32) -> ... This patch doesn’t implement all operands (see Section 3.14 in SPIR-V Spec) but provides a skeleton of it. There is TODO in verifyImageOperands function. Co-authored: Alan Liu <alanliu.yf@gmail.com> Reviewed by: antiagainst Differential Revision: https://reviews.llvm.org/D108501	2021-09-02 04:14:17 +08:00
natashaknk	f596acc74d	[mlir][tosa] Small refactor to the functionality of Depthwise_Conv2D to add the bias at the end of the convolution Follow-up to the Conv2d and fully_connected lowering adjustments Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D108949	2021-09-01 10:01:00 -07:00
Tyler Augustine	7105512a34	Support alias.scope and noalias metadata lowering on intrinsics. Builds on https://reviews.llvm.org/D107870 to support annotating intrinsics with alias.scope and noalias metadata. Reviewed By: arpith-jacob, ftynse Differential Revision: https://reviews.llvm.org/D109025	2021-09-01 16:54:20 +00:00
Mehdi Amini	c7515a49b1	Fix MLIR python binding test after changes in ASM printer	2021-08-31 18:50:22 +00:00
MaheshRavishankar	b686fdbf92	[mlir][Linalg] Drop output tensor from `linalg.pad_tensor` op. The output tensor was added for tiling purposes. With use of `TilingInterface` for tiling pad operations, there is no need for an explicit operand for the shape of result of `linalg.pad_tensor` op. The interface allows the tiling pattern to query the value that can be used for the "init" needed for tiling dynamically. Differential Revision: https://reviews.llvm.org/D108613	2021-08-31 11:12:24 -07:00
Mehdi Amini	387f95541b	Add a new interface allowing to set a default dialect to be used for printing/parsing regions Currently the builtin dialect is the default namespace used for parsing and printing. As such module and func don't need to be prefixed. In the case of some dialects that defines new regions for their own purpose (like SpirV modules for example), it can be beneficial to change the default dialect in order to improve readability. Differential Revision: https://reviews.llvm.org/D107236	2021-08-31 17:52:40 +00:00
Mehdi Amini	c41b16c26b	Change ASM Op printer to print the operation name in the framework instead of leaving it up to each individual operation This aligns the printer with the parser contract: the operation isn't part of the user-controllable part of the syntax. Differential Revision: https://reviews.llvm.org/D108804	2021-08-31 17:52:40 +00:00
Mehdi Amini	fd87963eee	Change dialect `printOperation()` hook to `getOperationPrinter()` This makes the hook return a printer if available, instead of using LogicalResult to indicate if a printer was available (and invoked). This allows the caller to detect that the dialect has a printer for a given operation without actually invoking the printer. It'll be leveraged in a future revision to move printing the op name itself under control of the ASMPrinter. Differential Revision: https://reviews.llvm.org/D108803	2021-08-31 17:52:39 +00:00
Tres Popp	44485fcd97	[mlir] Prevent assertion failure in DropUnitDims Don't assert fail on strided memrefs when dropping unit dims. Instead just leave them unchanged. Differential Revision: https://reviews.llvm.org/D108205	2021-08-31 12:15:13 +02:00
marina kolpakova a.k.a. geexie	0080d2aa55	[mlir][gpu] folds memref.dim of gpu.alloc implements canonicalization which folds memref.dim(gpu.alloc(%size), %idx) -> %size Differential Revision: https://reviews.llvm.org/D108892	2021-08-31 12:33:10 +03:00
MaheshRavishankar	ba72cfe734	[mlir] Add an interface to allow operations to specify how they can be tiled. An interface to allow for tiling of operations is introduced. The tiling of the linalg.pad_tensor operation is modified to use this interface. Differential Revision: https://reviews.llvm.org/D108611	2021-08-30 16:31:18 -07:00
natashaknk	203d38b234	[mlir][tosa] Small refactor to the functionality of Conv2D and Fully_connected to add the bias at the end of the convolution Made to adjust for a modification to the tiling algorithm Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D108746	2021-08-30 13:18:43 -07:00
Stella Laurenzo	8e6c55c92c	[mlir][python] Extend C/Python API to be usable for CFG construction. * It is pretty clear that no one has tried this yet since it was both incomplete and broken. * Fixes a symbol hiding issues keeping even the generic builder from constructing an operation with successors. * Adds ODS support for successors. * Adds CAPI `mlirBlockGetParentRegion`, `mlirRegionEqual` + tests (and missing test for `mlirBlockGetParentOperation`). * Adds Python property: `Block.region`. * Adds Python methods: `Block.create_before` and `Block.create_after`. * Adds Python property: `InsertionPoint.block`. * Adds new blocks.py test to verify a plausible CFG construction case. Differential Revision: https://reviews.llvm.org/D108898	2021-08-30 08:28:00 -07:00
Chris Lattner	41d4aa7de6	[SymbolRefAttr] Revise SymbolRefAttr to hold a StringAttr. SymbolRefAttr is fundamentally a base string plus a sequence of nested references. Instead of storing the string data as a copies StringRef, store it as an already-uniqued StringAttr. This makes a lot of things simpler and more efficient because: 1) references to the symbol are already stored as StringAttr's: there is no need to copy the string data into MLIRContext multiple times. 2) This allows pointer comparisons instead of string comparisons (or redundant uniquing) within SymbolTable.cpp. 3) This allows SymbolTable to hold a DenseMap instead of a StringMap (which again copies the string data and slows lookup). This is a moderately invasive patch, so I kept a lot of compatibility APIs around. It would be nice to explore changing getName() to return a StringAttr for example (right now you have to use getNameAttr()), and eliminate things like the StringRef version of getSymbol. Differential Revision: https://reviews.llvm.org/D108899	2021-08-29 21:54:47 -07:00
Matthias Springer	d18ffd61d4	[mlir][SCF] Canonicalize dim(x) where x is an iter_arg * Add `DimOfIterArgFolder`. * Move existing cross-dialect canonicalization patterns to `LoopCanonicalization.cpp`. * Rename `SCFAffineOpCanonicalization` pass to `SCFForLoopCanonicalization`. * Expand documentaton of scf.for: The type of loop-carried variables may not change with iterations. (Not even the dynamic type.) Differential Revision: https://reviews.llvm.org/D108806	2021-08-30 01:39:56 +00:00
Aart Bik	0a7b8cc5dd	[mlir][sparse] fully implement sparse tensor to sparse tensor conversions with rigorous integration test Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108721	2021-08-27 15:08:18 -07:00
Rob Suderman	90478251c7	[mlir][tosa] Tosa reverse to linalg supporting dynamic shapes Needed to switch to extract to support tosa.reverse using dynamic shapes. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D108744	2021-08-26 13:23:59 -07:00
Rob Suderman	0600bb4d18	[mlir][tosa] Elementwise operation dynamic shape support Added dynamic shape support for elementwise operations. This assumes equal sizes (broadcasting 1-length dynamic is problematic). Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D108730	2021-08-26 11:18:58 -07:00
Aart Bik	d5f7f356ce	[mlir][sparse] add sparse-dense cases to storage integration test Reviewed By: grosul1 Differential Revision: https://reviews.llvm.org/D108685	2021-08-25 11:33:20 -07:00
River Riddle	c8d9e1ce43	[mlir][AttrTypeGen] Add support for specifying a "accessor" type of a parameter This allows for using a different type when accessing a parameter than the one used for storage. This allows for returning parameters by reference, enables using more optimized/convient reference results, and more. Differential Revision: https://reviews.llvm.org/D108593	2021-08-25 09:27:36 +00:00
Rob Suderman	5541a05d6a	[mlir][tosa] Quantized tosa.avg_pool2d lowering to linalg Includes the quantized version of average pool lowering to linalg dialect. This includes a lit test for the transform. It is not 100% correct as the multiplier / shift should be done in i64 however this is negligable rounding difference. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D108676	2021-08-24 18:54:23 -07:00
Rob Suderman	4ef1770abd	[mlir][tosa] Table did not apply offset before extract on i8 input Lowering to table was incorrect as it did not apply a 128 offset before extracting the value from the table. Fixed and correct tensor length on input table. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D108436	2021-08-24 18:52:33 -07:00
Matthias Springer	a9cff97f94	[mlir][SCF] Generalize AffineMinSCFCanonicalization to min/max ops * Add support for affine.max ops to SCF loop peeling pattern. * Add support for affine.max ops to `AffineMinSCFCanonicalizationPattern`. * Rename `AffineMinSCFCanonicalizationPattern` to `AffineOpSCFCanonicalizationPattern`. * Rename `AffineMinSCFCanonicalization` pass to `SCFAffineOpCanonicalization`. Differential Revision: https://reviews.llvm.org/D108009	2021-08-25 10:40:34 +09:00
Rob Suderman	a7bf93807b	[mlir][tosa] Fix conv/depthwise conv padding for quantized values When padding quantized operations, the padding needs to equal the zero point of the input value. Corrected the pass to change the padding value if quantized. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D108440	2021-08-24 18:13:22 -07:00
Matthias Springer	2de2dbef2a	[mlir][linalg] Replace AffineMinSCFCanonicalizationPattern with SCF reimplementation Use the new canonicalization pattern in the SCF dialect. Differential Revision: https://reviews.llvm.org/D107732	2021-08-25 08:52:56 +09:00
Aart Bik	c5735fada4	[mlir][sparse] enable a few vectorized runs in integration tests Recent changes outside sparse compiler exposed the requirement of running a new pass (lower-affine) but this only became apparent with private testing. By adding some vectorized runs to integration test, we will detect the need for such changes earlier and also widen codegen coverage of course. Reviewed By: gussmith23 Differential Revision: https://reviews.llvm.org/D108667	2021-08-24 16:08:01 -07:00
Matthias Springer	98aa694d0d	[mlir][scf] Add general affine.min canonicalization pattern This canonicalization simplifies affine.min operations inside "for loop"-like operations (e.g., scf.for and scf.parallel) based on two invariants: * iv >= lb * iv < lb + step * ((ub - lb - 1) floorDiv step) + 1 This commit adds a new pass `canonicalize-scf-affine-min` (instead of being a canonicalization pattern) to avoid dependencies between the Affine dialect and the SCF dialect. Differential Revision: https://reviews.llvm.org/D107731	2021-08-25 07:32:30 +09:00
Tyler Augustine	d25e91d7f6	Support alias.scope and noalias metadata Introduces new Ops to represent 1. alias.scope metadata in LLVM, and 2. domains for these scopes. These correspond to the metadata described in https://llvm.org/docs/LangRef.html#noalias-and-alias-scope-metadata. Lists of scopes are modeled the same way as access groups - as an ArrayAttr on the Op (added in https://reviews.llvm.org/D97944). Lowering 'noalias' attributes on function parameters is already supported. However, lowering `noalias` metadata on individual Ops is not, which is added in this change. LLVM uses the same keyword for these, but this change introduces a separate attribute name 'noalias_scopes' to represent this distinct concept. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D107870	2021-08-24 20:42:59 +02:00
Matthias Springer	ebf35370ff	[mlir][tensor] Insert explicit tensor.cast ops for insert_slice src If additional static type information can be deduced from a insert_slice's size operands, insert an explicit cast of the op's source operand. This enables other canonicalization patterns that are matching for tensor_cast ops such as `ForOpTensorCastFolder` in SCF. Differential Revision: https://reviews.llvm.org/D108617	2021-08-24 19:45:04 +09:00
MaheshRavishankar	b546f4347b	[mlir]Linalg] Allow controlling fusion of linalg.generic -> linalg.tensor_expand_shape. Differential Revision: https://reviews.llvm.org/D108565	2021-08-23 16:28:10 -07:00
Aart Bik	236a90802d	[mlir][sparse] replace support lib conversion with actual MLIR codegen Rationale: Passing in a pointer to the memref data in order to implement the dense to sparse conversion was a bit too low-level. This revision improves upon that approach with a cleaner solution of generating a loop nest in MLIR code itself that prepares the COO object before passing it to our "swiss army knife" setup. This is much more intuitive and now also allows for dynamic shapes. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108491	2021-08-23 14:26:05 -07:00
River Riddle	4e103a12d9	[mlir] Add support for VariadicOfVariadic operands This revision adds native ODS support for VariadicOfVariadic operand groups. An example of this is the SwitchOp, which has a variadic number of nested operand ranges for each of the case statements, where the number of case statements is variadic. Builtin ODS support allows for generating proper accessors for the nested operand ranges, builder support, and declarative format support. VariadicOfVariadic operands are supported by providing a segment attribute to use to store the operand groups, mapping similarly to the AttrSizedOperand trait (but with a user defined attribute name). `build` methods for VariadicOfVariadic operand expect inputs of the form `ArrayRef<ValueRange>`. Accessors for the variadic ranges return a new `OperandRangeRange` type, which represents a contiguous range of `OperandRange`. In the declarative assembly format, VariadicOfVariadic operands and types are by default formatted as a comma delimited list of value lists: `(<value>, <value>), (), (<value>)`. Differential Revision: https://reviews.llvm.org/D107774	2021-08-23 20:32:31 +00:00
MaheshRavishankar	4aeeb91a92	[mlir][Linalg] Allow all build methods of Structured ops to specify additional attributes. Differential Revision: https://reviews.llvm.org/D108338	2021-08-23 13:06:34 -07:00
River Riddle	e4635e6328	[mlir][FoldUtils] Ensure the created constant dominates the replaced op This revision fixes a bug where an operation would get replaced with a pre-existing constant that didn't dominate it. This can occur when a pattern inserts operations to be folded at the beginning of the constants insertion block. This revision fixes the bug by moving the existing constant before the replaced operation in such cases. This is fine because if a constant didn't already exist, a new one would have been inserted before this operation anyways. Differential Revision: https://reviews.llvm.org/D108498	2021-08-23 18:48:24 +00:00
Matthias Springer	bc194a5bb5	[mlir][SCF] Do not peel loops inside partial iterations Do not apply loop peeling to loops that are contained in the partial iteration of an already peeled loop. This is to avoid code explosion when dealing with large loop nests. Can be controlled with a new pass option `skip-partial`. Differential Revision: https://reviews.llvm.org/D108542	2021-08-23 21:35:46 +09:00
William S. Moses	973cb2c326	[MLIR][OMP] Ensure nested scf.parallel execute all iterations Presently, the lowering of nested scf.parallel loops to OpenMP creates one omp.parallel region, with two (nested) OpenMP worksharing loops on the inside. When lowered to LLVM and executed, this results in incorrect results. The reason for this is as follows: An OpenMP parallel region results in the code being run with whatever number of threads available to OpenMP. Within a parallel region a worksharing loop divides up the total number of requested iterations by the available number of threads, and distributes accordingly. For a single ws loop in a parallel region, this works as intended. Now consider nested ws loops as follows: omp.parallel { A: omp.ws %i = 0...10 { B: omp.ws %j = 0...10 { code(%i, %j) } } } Suppose we ran this on two threads. The first workshare loop would decide to execute iterations 0, 1, 2, 3, 4 on thread 0, and iterations 5, 6, 7, 8, 9 on thread 1. The second workshare loop would decide the same for its iteration. This means thread 0 would execute i \in [0, 5) and j \in [0, 5). Thread 1 would execute i \in [5, 10) and j \in [5, 10). This means that iterations i in [5, 10), j in [0, 5) and i in [0, 5), j in [5, 10) never get executed, which is clearly wrong. This permits two options for a remedy: 1) Change the semantics of the omp.wsloop to be distinct from that of the OpenMP runtime call or equivalently #pragma omp for. This could then allow some lowering transformation to remedy the aforementioned issue. I don't think this is desirable for an abstraction standpoint. 2) When lowering an scf.parallel always surround the wsloop with a new parallel region (thereby causing the innermost wsloop to use the number of threads available only to it). This PR implements the latter change. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D108426	2021-08-20 19:06:28 -04:00
Rob Suderman	871c812483	[mlir][linalg] Finish refactor of TC ops to YAML Multiple operations were still defined as TC ops that had equivalent versions as YAML operations. Reducing to a single compilation path guarantees that frontends can lower to their equivalent operations without missing the optimized fastpath. Some operations are maintained purely for testing purposes (mainly conv{1,2,3}D as they are included as sole tests in the vectorizaiton transforms. Differential Revision: https://reviews.llvm.org/D108169	2021-08-20 12:35:04 -07:00
Aart Bik	758ccf8506	[mlir][sparse] add test for DimOp folding Folding in the MLIR uses the order of the type directly but folding in the underlying implementation must take the dim ordering into account. These tests clarify that behavior and verify it is done right. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108474	2021-08-20 11:24:09 -07:00
Aart Bik	24ea94ad0c	[mlir][sparse][python] migrate more code from boilerplate into proper numpy land The boilerplate was setting up some arrays for testing. To fully illustrate python - MLIR potential, however, this data should also come from numpy land. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108336	2021-08-20 09:18:17 -07:00
Jacques Pienaar	a232a48dca	[mlir][ods] Skip adding TOC in doc gen when present Enables adding a TOC in the description to be able to interleave documentation before and after the TOC.	2021-08-20 07:01:54 -07:00
Rob Suderman	3205ee7e81	[mlir][tosa] Support UInt8 inputs and outputs for tosa.rescale Tosa rescale can contain uint8 types. Added support for these types using an unrealized conversion cast. Optimistically it would be better to use bitcast however it does not support unsigned integers. Differential Revision: https://reviews.llvm.org/D108427	2021-08-19 18:58:44 -07:00
Morten Borup Petersen	6c1436a9b0	[MLIR][SCF] Parenthesize multiple return types in scf.execute_region asm op Previously, ExecuteRegionOps with multiple return values would fail a round-trip test due to missing parenthesis around the types. Differential Revision: https://reviews.llvm.org/D108402	2021-08-19 21:31:51 +01:00
MaheshRavishankar	16ffb283c5	Revert "[mlir][Linalg] Allow all build methods of Structured ops to specify additional attributes." This reverts commit `95ddc8341a`. Differential Revision: https://reviews.llvm.org/D108396	2021-08-19 11:53:41 -07:00
MaheshRavishankar	95ddc8341a	[mlir][Linalg] Allow all build methods of Structured ops to specify additional attributes. Differential Revision: https://reviews.llvm.org/D108338	2021-08-19 11:14:35 -07:00
Matthias Springer	76a1861816	[mlir][SparseTensor] Split scf.for loop into masked/unmasked parts Apply the "for loop peeling" pattern from SCF dialect transforms. This pattern splits scf.for loops into full and partial iterations. In the full iteration, all masked loads/stores are canonicalized to unmasked loads/stores. Differential Revision: https://reviews.llvm.org/D107733	2021-08-19 21:53:11 +09:00
Matthias Springer	8e8b70aa84	[mlir][scf] Simplify affine.min ops after loop peeling Simplify affine.min ops, enabling various other canonicalizations inside the peeled loop body. affine.min ops such as: ``` map = affine_map<(d0)[s0, s1] -> (s0, -d0 + s1)> %r = affine.min #affine.min #map(%iv)[%step, %ub] ``` are rewritten them into (in the case the peeled loop): ``` %r = %step ``` To determine how an affine.min op should be rewritten and to prove its correctness, FlatAffineConstraints is utilized. Differential Revision: https://reviews.llvm.org/D107222	2021-08-19 17:24:53 +09:00
Tobias Gysi	234c4d2362	[mlir][linalg] Set result types in all builders. Add code to set the result types in all yaml op builders. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D108273	2021-08-19 06:19:12 +00:00
Matthias Springer	08dbed8a57	[mlir][linalg] Canonicalize dim ops of tiled_loop block args E.g.: ``` %y = ... : tensor<...> linalg.tiled_loop ... ins(%x = %y : tensor<...>) { tensor.dim %x, %c0 : tensor<...> } ``` is rewritten to: ``` %y = ... : tensor<...> linalg.tiled_loop ... ins(%x = %y : tensor<...>) { tensor.dim %y, %c0 : tensor<...> } ``` Differential Revision: https://reviews.llvm.org/D108272	2021-08-19 11:24:33 +09:00
Aart Bik	d37d72eaf8	[mlir][sparse] use shared util for DimOp generation This shares more code with existing utilities. Also, to be consistent, we moved dimension permutation on the DimOp to the tensor lowering phase. This way, both pre-existing DimOps on sparse tensors (not likely but possible) as well as compiler generated DimOps are handled consistently. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108309	2021-08-18 17:12:32 -07:00
Chia-hung Duan	41e5dbe0fa	Enables inferring return types for Shape op if possible Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D102565	2021-08-18 21:36:55 +00:00
Robert Suderman	76c9712196	[mlir][tosa] Fix clamp to restrict only within valid bitwidth range Its possible for the clamp to have invalid min/max values on its range. To fix this we validate the range of the min/max and clamp to a valid range. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D108256	2021-08-18 12:14:01 -07:00
William S. Moses	8c2ff7b69e	[MLIR] Correct linkage of lowered globalop LLVM considers global variables marked as externals to be defined within the module if it is initialized (including to an undef). Other external globals are considered as being defined externally and imported into the current translation unit. Lowering of MLIR Global Ops does not properly propagate undefined initializers, resulting in a global which is expected to be defined within the current TU, not being defined. Differential Revision: https://reviews.llvm.org/D108252	2021-08-18 11:09:43 -04:00
Butygin	ddc3d51d58	[mlir][spirv] Add (InBounds)PtrAccessChain ops Differential Revision: https://reviews.llvm.org/D108070	2021-08-18 17:59:21 +03:00
Jacques Pienaar	b41bfb819d	[mlir][ods] Fix packing in OperandOrAttribute Wrong combiner was used which led to information loss.	2021-08-17 20:55:48 -07:00
Lei Zhang	4c15ad2321	[mlir][linalg] Don't drop existing attributes when creating ops Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D108219	2021-08-17 15:44:56 -04:00
Matthias Springer	4c4ab673f1	[mlir][Analysis][NFC] Split FlatAffineConstraints class * Extract "value" functionality of `FlatAffineConstraints` into a new derived `FlatAffineValueConstraints` class. Current users of `FlatAffineConstraints` can use `FlatAffineValueConstraints` without additional code changes, thus NFC. * `FlatAffineConstraints` no longer associates dimensions with SSA Values. All functionality that requires this, is moved to `FlatAffineValueConstraints`. * `FlatAffineConstraints` no longer makes assumptions about where Values associated with dimensions are coming from. Differential Revision: https://reviews.llvm.org/D107725	2021-08-17 10:09:17 +09:00
Robert Suderman	65532ea6dd	[mlir][linalg] Clear unused linalg tc operations These operations are not lowered to from any source dialect and are only used for redundant tests. Removing these named ops, along with their associated tests, will make migration to YAML operations much more convenient. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D107993	2021-08-16 11:55:45 -07:00
Aart Bik	19a906f372	[mlir][sparse][python] make imports more selective Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108055	2021-08-16 11:53:29 -07:00
tashuang.zk	2d45e332ba	[MLIR][DISC] Revise ParallelLoopTilingPass with inbound_check mode Expand ParallelLoopTilingPass with an inbound_check mode. In default mode, the upper bound of the inner loop is from the min op; in inbound_check mode, the upper bound of the inner loop is the step of the outer loop and an additional inbound check will be emitted inside of the inner loop. This was 'FIXME' in the original codes and a typical usage is for GPU backends, thus the outer loop and inner loop can be mapped to blocks/threads in seperate. Differential Revision: https://reviews.llvm.org/D105455	2021-08-16 14:02:53 +02:00
Stephen Neuendorffer	7776b19eed	[MLIR] Move TestDialect to ::test namespace While the changes are extensive, they basically fall into a few categories: 1) Moving the TestDialect itself. 2) Updating C++ code in tablegen to explicitly use ::mlir, since it will be put in a headers that shouldn't expect a 'using'. 3) Updating some generic MLIR Interface definitions to do the same thing. 4) Updating the Tablegen generator in a few places to be explicit about namespaces 5) Doing the same thing for llvm references, since we no longer pick up the definitions from mlir/Support/LLVM.h Differential Revision: https://reviews.llvm.org/D88251	2021-08-14 13:24:41 -07:00
harsh-nod	e33f301ec2	[mlir] Add support for moving reductions to outer most dimensions in vector.multi_reduction The approach for handling reductions in the outer most dimension follows that for inner most dimensions, outlined below First, transpose to move reduction dims, if needed Convert reduction from n-d to 2-d canonical form Then, for outer reductions, we emit the appropriate op (add/mul/min/max/or/and/xor) and combine the results. Differential Revision: https://reviews.llvm.org/D107675	2021-08-13 12:59:50 -07:00
Adrian Kuegel	3c6f115ffc	[mlir] Remove unused header include. Also adjust BUILD.bazel and remove an unused dependency. Differential Revision: https://reviews.llvm.org/D108027	2021-08-13 14:23:14 +02:00
natashaknk	ba0997ca09	[mlir][tosa] Fix depthwise_conv2D strides/dilation and name Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D107997	2021-08-12 15:43:41 -07:00
Aart Bik	56d607006d	[mlir][sparse][python] add an "exhaustive" sparse test using python Using the python API to easily set up sparse kernels, this test exhaustively builds, compilers, and runs SpMM for all annotations on a sparse tensor, making sure every version generates the correct result. This test also illustrates using the python API to set up a sparse kernel and sparse compilation. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D107943	2021-08-12 11:13:04 -07:00
Florian Hahn	f999312872	Recommit "[Matrix] Overload stride arg in matrix.columnwise.load/store." This reverts the revert `28c04794df`. The failing MLIR test that caused the revert should be fixed in this version. Also includes a PPC test fix previously in `1f87c7c478`.	2021-08-12 18:31:57 +01:00
Tyler Augustine	3a2ff982d7	Support post-processing Ops in unrolled loop iterations This can be useful when one needs to know which unrolled iteration an Op belongs to, for example, conveying noalias information among memory-affecting ops in parallel-access loops. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D107789	2021-08-11 23:11:10 +00:00
Benjamin Kramer	35d6e75aba	[mlir] Drop LLVM dialect from TestPolynomialApproximation No longer needed after `c1ebefdf77`	2021-08-12 00:58:52 +02:00
Mehdi Amini	93e084e7e8	Add missing cmake dep to fix MLIR build with BUILD_SHARED_LIBS=ON (NFC)	2021-08-11 22:51:57 +00:00
Rob Suderman	7de439b2be	[mlir][tosa] Migrate tosa to more efficient linalg.conv Existing linalg.conv2d is not well optimized for performance. Changed to a version that is more aligned for optimziation. Include the corresponding transposes to use this optimized version. This also splits the conv and depthwise conv into separate implementations to avoid overly complex lowerings. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D107504	2021-08-11 11:05:12 -07:00
Benjamin Kramer	c1ebefdf77	[mlir] Make polynomial approximation emit std instead of LLVM ops This is a bit cleaner and removes issues with 2d vectors. It also has a big impact on constant folding, hence the test changes. Differential Revision: https://reviews.llvm.org/D107896	2021-08-11 16:37:21 +02:00
Alex Zinenko	a0d8a08e3e	[mlir] Add std.bitcast -> llvm.bitcast conversion The conversion is a straightforward one-to-one mapping with optional unrolling for nD vectors, similarly to other cast operations. Depends On D107889 Reviewed By: cota, akuegel Differential Revision: https://reviews.llvm.org/D107891	2021-08-11 16:30:21 +02:00
Alex Zinenko	79b0576dd4	[mlir] Tighten LLVM_AnyNonAggregate ODS type constraint The constraint was checking that the type is not an LLVM structure or array type, but was not checking that it is an LLVM-compatible type, making it accept incorrect types. As a result, some LLVM dialect ops could process values that are not compatible with the LLVM dialect leading to further issues with conversions and translations that assume all values are LLVM-compatible. Make LLVM_AnyNonAggregate only accept LLVM-compatible types. Reviewed By: cota, akuegel Differential Revision: https://reviews.llvm.org/D107889	2021-08-11 16:30:19 +02:00
Alexander Belyaev	1e733a8c04	Revert "Bufferization for tiled loop." This reverts commit `edaffebcb2`.	2021-08-11 10:04:12 +02:00
Alexander Belyaev	967578f0b8	Revert "[mlir] Change the pattern for TiledLoopOp bufferization." This reverts commit `2f946eaa9d`.	2021-08-11 10:01:36 +02:00
Rob Suderman	2b2ebb6f98	[mlir][tosa] Add folders for trivial tosa operation cases Some folding cases are trivial to fold away, specifically no-op cases where an operation's input and output are the same. Canonicalizing these away removes unneeded operations. The current version includes tensor cast operations to resolve shape discreprencies that occur when an operation's result type differs from the input type. These are resolved during a tosa shape propagation pass. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D107321	2021-08-10 14:43:00 -07:00
Rob Suderman	86858c62ba	[mlir][tosa] Add dilation to tosa.transpose_conv2d lowering Dilation only requires increasing the padding on the left/right side of the input, and including dilation in the convolution. This implementation still lacks support for strided convolutions. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D107680	2021-08-10 14:36:11 -07:00
natashaknk	a1f46569a1	[mlir][tosa] Add quantized and unquantized versions for tosa.depthwise_conv2d lowering Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D107855	2021-08-10 14:29:26 -07:00
Haruki Imai	b34b1c6955	[mlir] Support normalizing memrefs with MemRef_ReinterpretCastOp This patch enables normalizing memrefs with MemRef_ReinterpretCastOp by adding MemRefsNormalizable trait in the Op definition. Signed-off-by: Haruki Imai <imaihal@jp.ibm.com> Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D107425	2021-08-11 01:15:18 +05:30
Alexander Belyaev	2f946eaa9d	[mlir] Change the pattern for TiledLoopOp bufferization. This version is does not affect the patterns for Extract/InsertSliceOp and LinalgOps. Differential Revision: https://reviews.llvm.org/D107858	2021-08-10 21:27:02 +02:00
bakhtiyar	391456f33c	Fix a bug in algebraic simplification, and enable the tests. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D107788	2021-08-10 04:15:56 -07:00
Alexander Belyaev	edaffebcb2	Cloned from CL 389610703 by 'g4 patch'. Original change by pifon@pifon:tfrt_clean:6896:citc on 2021/08/09 05:30:17. Ad b Differential Revision: https://reviews.llvm.org/D107762	2021-08-09 21:57:06 +02:00
Aart Bik	8cf8349eaa	[mlir][sparse] add an elaborate sparse storage scheme integration test Looks "under the hood" of the sparse stogage schemes. Users should typically not be interested in these details (hey, that is why we have "sparse compilers"!) but this test makes sure the compact contents are as expected. Reviewed By: ThomasRaoux, bixia Differential Revision: https://reviews.llvm.org/D107683	2021-08-09 12:54:15 -07:00
Aart Bik	05c7f450df	[mlir][sparse] add dense to sparse conversion implementation Implements lowering dense to sparse conversion, for static tensor types only. First step towards general sparse_tensor.convert support. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D107681	2021-08-09 12:12:39 -07:00
Alex Zinenko	8a7c657c4d	[mlir] support nD vector forms of shifts in std-to-llvm conversion These ops were not ported to the nD vector conversion when it was introduced and nobody needed them so far. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D107750	2021-08-09 12:00:41 +02:00
Max Kudryavtsev	0b8cb87e0d	[MLIR][STD] Add safe scalar constant propagation for FPTruncOp Perform scalar constant propagation for FPTruncOp only if the resulting value can be represented without precision loss or rounding. Example: %cst = constant 1.000000e+00 : f32 %0 = fptrunc %cst : f32 to bf16 --> %cst = constant 1.000000e+00 : bf16 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D107518	2021-08-06 16:31:29 -07:00
Alexander Belyaev	a552debdcf	[mlir] Add patterns for vector.transfer_read/write to Linalg bufferization. Differential Revision: https://reviews.llvm.org/D107643	2021-08-06 20:24:44 +02:00
Geoffrey Martin-Noble	ca6baf1e1d	[MLIR][std] Introduce bitcast operation This patch introduces a bitcast operation to the standard dialect. RFC: https://llvm.discourse.group/t/rfc-introduce-a-bitcast-op/3774 Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D105376	2021-08-06 08:47:51 -07:00
Alex Zinenko	c4c1030976	[mlir] support collapsed loops in OpenMP-to-LLVM translation Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D105706	2021-08-06 17:13:12 +02:00
Vladislav Vinogradov	59f59d1c62	[mlir] Allow to override type/attr aliases from various hooks Use new return type for `OpAsmDialectInterface::getAlias`: * `AliasResult::NoAlias` if an alias was not provided. * `AliasResult::OverridableAlias` if an alias was provided, but it might be overriden by other hook. * `AliasResult::FinalAlias` if an alias was provided and it should be used (no other hooks will be checked). In that case `AsmPrinter` will use either the first alias with `FinalAlias` result or the last alias with `OverridableAlias` result (it depends on dialect array order). Used `OverridableAlias` result for `BuiltinOpAsmDialectInterface`. Use case: provide more informative alias for built-in attributes like `AffineMapAttr` instead of generic "map<N>". Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D107437	2021-08-06 12:05:31 +03:00
Adrian Kuegel	d6b4993736	[mlir][MemRef] Fix canonicalization of BufferCast(TensorLoad). CastOp::areCastCompatible does not check whether casts are definitely compatible. When going from dynamic to static offset or stride, the canonicalization cannot know whether it is really cast compatible. In that case, it can only canonicalize to an alloc plus copy. Differential Revision: https://reviews.llvm.org/D107545	2021-08-06 08:32:35 +02:00
Jacques Pienaar	9d10be70a8	[mlir] std.call reference function return types in failure Makes it easier to see type mismatch from failure locally. Differential Revision: https://reviews.llvm.org/D107288	2021-08-05 19:51:48 -07:00
Gus Smith	0bd2d4c4b1	[mlir][sparse] Remove comment w/ code in it Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D107484	2021-08-04 21:41:36 +00:00
Matthias Springer	9102a16bef	[mlir] Support drawing control-flow graphs in ViewOpGraph.cpp * Add new pass option `print-data-flow-edges`, default value `true`. * Add new pass option `print-control-flow-edges`, default value `false`. * Remove `PrintCFGPass`. Same functionality now provided by `PrintOpPass`. Differential Revision: https://reviews.llvm.org/D106342	2021-08-04 20:45:15 +09:00
Stephen Neuendorffer	432341d8a8	[mlir] Handle cases where transfer_read should turn into a scalar load The existing vector transforms reduce the dimension of transfer_read ops. However, beyond a certain point, the vector op actually has to be reduced to a scalar load, since we can't load a zero-dimension vector. This handles this case. Note that in the longer term, it may be preferaby to support zero-dimension vectors. see https://llvm.discourse.group/t/should-we-have-0-d-vectors/3097. Differential Revision: https://reviews.llvm.org/D103432	2021-08-03 22:53:40 -07:00
Matthias Springer	faeb7ec68b	[mlir] Fix broken build in pass_manager.py This test ensures that an error is generated from the Python side when running a module pass on a function. The test used to instantiate ViewOpGraph, however, this pass was changed into a general "any op" pass in D106253. Therefore, a different pass must be used in this test. Differential Revision: https://reviews.llvm.org/D107424	2021-08-04 13:12:17 +09:00
Matthias Springer	8d15b7dcba	[mlir] Improve Graphviz visualization in PrintOpPass * Visualize blocks and regions as subgraphs. * Generate DOT file directly instead of using `GraphTraits`. `GraphTraits` does not support subgraphs. Differential Revision: https://reviews.llvm.org/D106253	2021-08-04 11:56:26 +09:00
Rob Suderman	1b00b94ffc	[mlir][tosa] Tosa shape propagation for tosa.cond_if We can propagate the shape from tosa.cond_if operands into the true/false regions then through the connected blocks. Then, using the tosa.yield ops we can determine what all possible return types are. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D105940	2021-08-03 17:54:54 -07:00
Rob Suderman	143edeca6d	[mlir][tosa] Shape inference for a few remaining easy cases: Handles shape inference for identity, cast, and rescale. These were missed during the initialy elementwise work. This includes resize shape propagation which includes both attribute and input type based propagation. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D105845	2021-08-03 17:20:32 -07:00
Aart Bik	817303ef34	[mlir][sparse] fix bug in permuting data structure Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D107379	2021-08-03 14:27:43 -07:00

... 3 4 5 6 7 ...

4826 Commits