llvm-project

Commit Graph

Author	SHA1	Message	Date
Jacques Pienaar	efb7727a96	[mlir] Flag near misses in file splitting Flags some potential cases where splitting isn't happening and so could result in confusing results. Also update some test files where there were near misses in splitting that seemed unintentional. Differential Revision: https://reviews.llvm.org/D109636	2021-12-12 08:03:30 -08:00
Nicolas Vasilache	408553dd96	[mlir][Vector] Support 0-D vectors in `CreateMaskOp` The 0-D case gets lowered in almost the same way that the 1-D case does in VectorCreateMaskOpConversion. I also had to slightly update the verifier for the op to always require exactly 1 operand in the 0-D case. Depends On D115220 Reviewed by: ftynse Differential revision: https://reviews.llvm.org/D115221	2021-12-12 13:32:29 +00:00
Michal Terepeta	a0c930d312	[mlir][Vector] Support 0-D vectors in `CmpIOp` Following the example of `VectorOfAnyRankOf`, I've done a few changes in the `.td` files to help with adding the support for the 0-D case gradually. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D115220	2021-12-12 13:28:26 +00:00
Lei Zhang	731676b10d	[mlir][spirv] Fix nested control flow serialization If we have a `spv.mlir.selection` op nested in a `spv.mlir.loop` op, when serializing the loop's block, we might need to jump from the selection op's merge block, which might be different than the immediate MLIR IR predecessor block. But we still need to get the block argument from the MLIR IR predecessor block. Also, if the `spv.mlir.selection` is in the `spv.mlir.loop`'s header block, we need to make sure `OpLoopMerge` is emitted in the current block before start processing the nested selection op. Otherwise we'll see the LoopMerge in the wrong SPIR-V basic block. Reviewed By: Hardcode84 Differential Revision: https://reviews.llvm.org/D115560	2021-12-11 14:47:19 -05:00
Jacques Pienaar	1ab3efac41	[mlir][python] Add fused location	2021-12-11 10:16:13 -08:00
Nicolas Vasilache	f2e945a393	Revert "[mlir][tensor] Fix insert_slice + tensor cast overflow" This reverts commit `5601821dae`. The prefix + canonical complete behavior is actually obsolete and should not be reintroduced. Reverting.	2021-12-10 22:53:52 +00:00
Nicolas Vasilache	5601821dae	[mlir][tensor] Fix insert_slice + tensor cast overflow InsertSliceOp may have subprefix semantics where missing trailing dimensions are automatically inferred directly from the operand shape. This revision fixes an overflow that occurs in such cases when the impl is based on the op rank. Differential Revision: https://reviews.llvm.org/D115549	2021-12-10 21:41:26 +00:00
River Riddle	233e9476d8	[mlir:PDL] Allow non-bound pdl.attribute/pdl.type operations that create constants This allows for passing in these attributes/types to constraints/rewrites as arguments. Differential Revision: https://reviews.llvm.org/D114817	2021-12-10 19:38:43 +00:00
River Riddle	06c3b9c7be	[mlir:PDL] Fix bugs in PDLPatternModule merging * Constraints/Rewrites registered before a pattern was added were dropped * Constraints/Rewrites may be registered multiple times (if different pattern sets depend on them) * ModuleOp no longer has a terminator, so we shouldn't be removing the terminator from it Differential Revision: https://reviews.llvm.org/D114816	2021-12-10 19:38:43 +00:00
River Riddle	98f5bd3489	[mlir:PDL] Adjust the assembly format for AttributeOp to avoid conflicts with DictionaryAttr Switch the attribute creation operations to use attr-dict-with- keyword to avoid conflicts (in the case of pdl.attribute) and confusion(in the case of pdl_interp.create_attribute) with having a DictionaryAttr as a value and specifying the attributes of the operation itself (as a dictionary). Differential Revision: https://reviews.llvm.org/D114815	2021-12-10 19:38:42 +00:00
River Riddle	9debc35f02	[mlir:PDL] Fix assembly format for pdl.apply_native_rewrite The results of a rewrite are optional, but we currently require them to be present in the assembly format. This commit makes the results component in the format optional. Differential Revision: https://reviews.llvm.org/D114814	2021-12-10 19:38:42 +00:00
Mogball	e40624ae60	[mlir][ods] Fix OpFormatGen sometimes not calling inferReturnTypes Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D115522	2021-12-10 19:35:56 +00:00
Mogball	0845635eda	[mlir][ir] Custom ops' parse/print fall back to dialect hooks Custom ops that have no parser or printer should fall back to the dialect's parser and/or printer hooks. This avoids the need to define parsers and printers that simply dispatch to the dialect hook. Reviewed By: mehdi_amini, rriddle Differential Revision: https://reviews.llvm.org/D115481	2021-12-10 19:34:25 +00:00
Alexander Belyaev	b618880e7b	[mlir] Move `linalg.tensor_expand/collapse_shape` to TensorDialect. RFC: https://llvm.discourse.group/t/rfc-reshape-ops-restructuring/3310 linalg.fill gets a canonicalizer, because `FoldFillWithTensorReshape` cannot be moved to tensorops (it uses linalg::FillOp inside). Before it was listed as a canonicalization pattern for the reshape operations, now it became a canonicalization for FillOp. Differential Revision: https://reviews.llvm.org/D115502	2021-12-10 12:11:48 +01:00
Rob Suderman	46c96fca0e	[mlir][tosa] Fix quantized type for tosa.conv2d canonicalization Wrong type was used for the result type in the tosa.conv_2d canonicalization. The type should match the result element type should match the result type not the input element type. Differential Revision: https://reviews.llvm.org/D115463	2021-12-09 12:39:23 -08:00
Aart Bik	880021df13	[mlir][sparse] reenable asan for sampled mm integration test Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D115364	2021-12-09 12:07:56 -08:00
MaheshRavishankar	9cfd8d7c6c	[mlir][Vector] Avoid infinite loop in InnerOuterDimReductionConversion. This patterns tries to convert an inner (outer) dim reduction to an outer (inner) dim reduction. Doing this on a 1D or 0D vector results in an infinite loop since the converted op is same as the original operation. Just returning failure when source rank <= 1 fixes the issue. Differential Revision: https://reviews.llvm.org/D115426	2021-12-09 09:30:05 -08:00
Bixia Zheng	64e171c2d0	Avoid unnecessary output buffer allocation and initialization. The sparse tensor code generator allocates memory for the output tensor. As such, we only need to allocate a MemRefDescriptor to receive the output tensor and do not need to allocate and initialize the storage for the tensor. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D115292	2021-12-09 08:29:02 -08:00
Krzysztof Drewniak	e1da62910e	[MLIR][GPU] Define gpu.printf op and its lowerings - Define a gpu.printf op, which can be lowered to any GPU printf() support (which is present in CUDA, HIP, and OpenCL). This op only supports constant format strings and scalar arguments - Define the lowering of gpu.pirntf to a call to printf() (which is what is required for AMD GPUs when using OpenCL) as well as to the hostcall interface present in the AMD Open Compute device library, which is the interface present when kernels are running under HIP. - Add a "runtime" enum that allows specifying which of the possible runtimes a ROCDL kernel will be executed under or that the runtime is unknown. This enum controls how gpu.printf is lowered This change does not enable lowering for Nvidia GPUs, but such a lowering should be possible in principle. And: [MLIR][AMDGPU] Always set amdgpu-implicitarg-num-bytes=56 on kernels This is something that Clang always sets on both OpenCL and HIP kernels, and failing to include it causes mysterious crashes with printf() support. In addition, revert the max-flat-work-group-size to (1, 256) to avoid triggering bugs in the AMDGPU backend. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D110448	2021-12-09 15:54:31 +00:00
Eugene Zhulenev	49ce40e9ab	[mlir] AsyncParallelFor: align block size to be a multiple of inner loops iterations Depends On D115263 By aligning block size to inner loop iterations parallel_compute_fn LLVM can later unroll and vectorize some of the inner loops with small number of trip counts. Up to 2x speedup in multiple benchmarks. Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D115436	2021-12-09 06:50:50 -08:00
Eugene Zhulenev	9f151b784b	[mlir] AsyncParallelFor: sink constants into the parallel compute function With complex recursive structure of async dispatch function LLVM can't always propagate constants to the parallel_compute_fn and it often prevents optimizations like loop unrolling and vectorization. We help LLVM by pushing known constants into the parallel_compute_fn explicitly. Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D115263	2021-12-09 06:48:23 -08:00
Matthias Springer	cc45a13422	[mlir][linalg][bufferize] LinalgOps can bufferize inplace with input args LinalgOp results usually bufferize inplace with output args. With this change, they may buffer inplace with input args if the value of the output arg is not used in the computation. Differential Revision: https://reviews.llvm.org/D115022	2021-12-09 21:54:54 +09:00
Shraiysh Vaishay	d82c1f4e4b	[MLIR][OpenMP] Added omp.atomic.update This patch supports the atomic construct (update) following section 2.17.7 of OpenMP 5.0 standard. Also added tests and verifier for the same. Reviewed By: kiranchandramohan, peixin Differential Revision: https://reviews.llvm.org/D112982	2021-12-09 15:21:24 +05:30
Nicolas Vasilache	d69f5e197c	[mlir][memref] Fix subview offset verification. Offset-specific verification seems to have been lost in one of the recent refactorings. Also add proper tests that would have caught this omission. This addresses the immediate issues discussed in: https://llvm.discourse.group/t/memref-subview-affine-map-and-symbols/4851 Differential Revision: https://reviews.llvm.org/D115427	2021-12-09 07:44:51 +00:00
MaheshRavishankar	6d7c9c3d0e	[mlir][Linalg] Bufferize the region of LinalgOps as well. The region of `linalg.generic` might contain `tensor` operations. For example, current lowering of `gather` uses a `tensor.extract` in the body of the `LinalgOp`. Bufferize the ops within a `LinalgOp` region as well to catch such cases. Differential Revision: https://reviews.llvm.org/D115322	2021-12-08 22:36:01 -08:00
Rob Suderman	23149d522b	[mlir] Added ctlz and cttz to math dialect and LLVM dialect Count leading/trailing zeros are an existing LLVM intrinsic. Added LLVM support for the intrinsics with lowerings from the math dialect to LLVM dialect. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D115206	2021-12-08 14:32:15 -08:00
Butygin	d8fce785de	[mlir][spirv] math.erf OpenCL lowering Differential Revision: https://reviews.llvm.org/D115335	2021-12-08 21:59:46 +03:00
Thomas Raoux	579c1ff67d	[mlir][nvvm] Add async copy ops to nvvm dialect Differential Revision: https://reviews.llvm.org/D115314	2021-12-08 09:42:20 -08:00
Matthias Springer	847710f7b7	[mlir][linalg][bufferize] Add dialect filter to BufferizationOptions This adds a new option `dialectFilter` to BufferizationOptions. Only ops from dialects that are allow-listed in the filter are bufferized. Other ops are left unbufferized. Note: This option requires `allowUnknownOps = true`. To make use of `dialectFilter`, BufferizationOptions or BufferizationState must be passed to various helper functions. The purpose of this change is to provide a better infrastructure for partial bufferization, which will be fully activated in a subsequent change. Differential Revision: https://reviews.llvm.org/D114691	2021-12-08 23:51:18 +09:00
Mehdi Amini	be0a7e9f27	Adjust "end namespace" comment in MLIR to match new agree'd coding style See D115115 and this mailing list discussion: https://lists.llvm.org/pipermail/llvm-dev/2021-December/154199.html Differential Revision: https://reviews.llvm.org/D115309	2021-12-08 06:05:26 +00:00
Mehdi Amini	ee0908703d	Change the printing/parsing behavior for Attributes used in declarative assembly format The new form of printing attribute in the declarative assembly is eliding the `#dialect.mnemonic` prefix to only keep the `<....>` part. Differential Revision: https://reviews.llvm.org/D113873	2021-12-08 02:02:37 +00:00
Aart Bik	e1b9d80532	[mlir][sparse] add a few more sparse output tests (for generated IR) also fixes two typos in IR doc Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D115288	2021-12-07 15:31:29 -08:00
Aart Bik	4f2ec7f983	[mlir][sparse] finalize sparse output in the presence of reductions This revision implements sparse outputs (from scratch) in all cases where the loops can be reordered with all but one parallel loops outer. If the inner parallel loop appears inside one or more reductions loops, then an access pattern expansion is required (aka. workspaces in TACO speak). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D115091	2021-12-07 10:54:29 -08:00
Rob Suderman	e9fae0f19e	[mlir][tosa] Disable tosa.depthwise_conv2d canonicalizer for quantized case Quantized case needs to include zero-point corrections before the tosa.mul. Disabled for the quantized use-case. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D115264	2021-12-07 10:16:12 -08:00
Matthias Springer	8a232632c5	[mlir][linalg][bufferize] Add FuncOp bufferization pass This passes bufferizes FuncOp bodies, but not FuncOp boundaries. Differential Revision: https://reviews.llvm.org/D114671	2021-12-07 21:44:26 +09:00
Shraiysh Vaishay	31cf42bd9a	[mlir][OpenMP] Added omp.atomic.read lowering This patch adds lowering from omp.atomic.read to LLVM IR along with the memory ordering clause. Tests for the same are also added. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D115134	2021-12-07 11:17:30 +05:30
not-jenni	5911a29aa9	[mlir][tosa] Add tosa.depthwise_conv2d as tosa.mul canonicalization For a 1x1 weight and stride of 1, the input/weight can be reshaped and multiplied elementwise then reshaped back Reviewed By: rsuderman, KoolJBlack Differential Revision: https://reviews.llvm.org/D115207	2021-12-06 17:28:52 -08:00
Rob Suderman	05e33d846f	[mlir][tosa] Resubmit add tosa.conv2d as tosa.fully_connected canonicalization Fixed the tosa.conv2d to tosa.fully_connected canonicalization for incorrect output channels. Included uptes to tests to include checks for the result shapes during canonicalization. This allows conv2d to transform to the simpler fully_connected operation. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D115170	2021-12-06 15:33:07 -08:00
Eugene Zhulenev	68a7c001ad	[mlir] Improve async parallel for tests + fix typos Do load and store to verify that we process each element of the iteration space once. Reviewed By: cota Differential Revision: https://reviews.llvm.org/D115152	2021-12-06 13:27:54 -08:00
Rob Suderman	c5fef77bc3	[mlir] Add CtPop to MathOps with lowering to LLVM math.ctpop maths to the llvm.ctpop intrinsic. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D114998	2021-12-06 11:54:20 -08:00
Alex Zinenko	d64b3e47ba	[mlir] Avoid needlessly converting LLVM named structs with compatible elements Conversion of LLVM named structs leads to them being renamed since we cannot modify the body of the struct type once it is set. Previously, this applied to all named struct types, even if their element types were not affected by the conversion. Make this behvaior only applicable when element types are changed. This requires making the LLVM dialect type-compatibility check recursively look at the element types (arguably, it should have been doing than since the moment the LLVM dialect type system stopped being closed). In addition, have a more lax check for outer types only to avoid repeated check when necessary (e.g., parser, verifiers that are going to also look at the inner type). Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D115037	2021-12-06 13:42:11 +01:00
Matthias Springer	e9fb4dc9e9	[mlir][linalg][bufferize] Remove buffer equivalence from bufferize Remove all function calls related to buffer equivalence from bufferize implementations. Add a new PostAnalysisStep for scf.for that ensures that yielded values are equivalent to the corresponding BBArgs. (This was previously checked in `bufferize`.) This will be relaxed in a subsequent commit. Note: This commit changes two test cases. These were broken by design and should not have passed. With the new scf.for PostAnalysisStep, this bug was fixed. Differential Revision: https://reviews.llvm.org/D114927	2021-12-06 17:48:31 +09:00
Matthias Springer	cb4d0bf997	[mlir][linalg][bufferize][NFC] Collect equivalent FuncOp BBArgs in PostAnalysisStep Collect equivalent BBArgs right after the equivalence analysis of the FuncOp and before bufferizing. This is in preparation of decoupling bufferization from aliasInfo. Also gather equivalence info for CallOps, which was missing in the previous commit. Differential Revision: https://reviews.llvm.org/D114847	2021-12-06 17:31:39 +09:00
Michal Terepeta	caf89c0db6	[mlir][Vector] Support 0-D vectors in `ConstantMaskOp` To support creating both a mask with just a single `true` and `false` values, I had to relax the restriction in the verifier that the rank is always equal to the length of the attribute array, in other words, we now allow: - `vector.constant_mask [0] : vector<i1>` which gets lowered to `arith.constant dense<false> : vector<i1>` - `vector.constant_mask [1] : vector<i1>` which gets lowered to `arith.constant dense<true> : vector<i1>` (the attribute list for the 0-D case must be a singleton containing either `0` or `1`) Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D115023	2021-12-06 08:03:04 +00:00
Mehdi Amini	afb0582325	Fix TOSA verifier to emit verbose errors Also as a test for invalid ops which was missing.	2021-12-05 19:16:54 +00:00
Butygin	91072b74f8	[mlir] Add InlinerInterface to bufferization dialect Differential Revision: https://reviews.llvm.org/D115080	2021-12-04 23:45:56 +03:00
Hugo Pompougnac	5d49511b30	Apply the permutation map on each affine nest When using -test-loop-permutation="permutation-map=...", applies the permutation map on each affine nest in the function (and not only the first one). If the size of the permutation map and the size of a nest are not consistent, do nothing on this particular nest (instead of making MLIR crash). Differential Revision: https://reviews.llvm.org/D112947	2021-12-04 17:48:34 +05:30
Uday Bondhugula	2108ed0671	[MLIR] Fix affine.for unroll for multi-result upper bound maps Fix affine.for unroll for multi-result upper bound maps: these can't be unrolled/unroll-and-jammed in cases where the trip count isn't known to be a multiple of the unroll factor. Fix and clean up repeated/unnecessary checks/comments at helper callees. Also, fix clang-tidy variable naming warnings and redundant includes. Differential Revision: https://reviews.llvm.org/D114662	2021-12-04 07:20:26 +05:30
River Riddle	7169996159	[mlir] Allow shape dimensions larger than 2^32 Internally we use int64_t to hold shapes, but for some reason the parser was limiting shapes to unsigned. This change updates the parser to properly handle int64_t shape dimensions. Differential Revision: https://reviews.llvm.org/D115086	2021-12-04 01:29:50 +00:00
Uday Bondhugula	d20249fde6	[MLIR] NFC. Rename test cases in test/mlir-cpu-runner per convention Test case files at most places in MLIR uses hyphens and not underscores. A counter-pattern was somehow started to use underscores in some places. Rename test cases in test/mlir-cpu-runner to use hyphens so that it's consistent at least within its directory. Differential Revision: https://reviews.llvm.org/D114672	2021-12-04 06:53:39 +05:30
Mehdi Amini	48fb79effb	Improve error message when declarativeAssembly contains invalid literals Differential Revision: https://reviews.llvm.org/D115085	2021-12-04 00:27:32 +00:00
wren romano	4748cc6931	[mlir][sparse] Adding a stress test Addresses https://bugs.llvm.org/show_bug.cgi?id=52410 Depends on D114192 Reviewed By: aartbik, mehdi_amini Differential Revision: https://reviews.llvm.org/D114118	2021-12-03 14:59:39 -08:00
natashaknk	e2d8b60742	Revert "[mlir][tosa] Add tosa.conv2d as fully_connected canonicalization" This reverts commit `13bdb7ab4a`. The commit introduced/uncovered an unintended bug in models containing Conv2D. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D115079	2021-12-03 14:35:48 -08:00
Alex Zinenko	9dd1f8dfdd	[mlir] support recursive type conversion of named LLVM structs A previous commit added support for converting elemental types contained in LLVM dialect types in case they were not compatible with the LLVM dialect. It was missing support for named structs as they could be recursive, which was not supported by the conversion infra. Now that it is, add support for converting such named structs. Depends On D113579 Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D113580	2021-12-03 12:41:40 +01:00
Matthias Springer	ad1ba42f68	[mlir][linalg][bufferize] Allow unbufferizable ops in input Allow ops that are not bufferizable in the input IR. (Deactivated by default.) bufferization::ToMemrefOp and bufferization::ToTensorOp are generated at the bufferization boundaries. Differential Revision: https://reviews.llvm.org/D114669	2021-12-03 20:20:46 +09:00
Michal Terepeta	1423e8bf5d	[mlir][Vector] Support 0-D vectors in `BitCastOp` The implementation only allows to bit-cast between two 0-D vectors. We could probably support casting from/to vectors like `vector<1xf32>`, but I wasn't convinced that this would be important and it would require breaking the invariant that `BitCastOp` works only on vectors with equal rank. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114854	2021-12-03 08:55:59 +00:00
Michal Terepeta	8e2b373396	[mlir][Vector] Add some missing tests for `broadcast` and `splat` Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114853	2021-12-03 08:52:51 +00:00
Matthias Springer	d30fcadf07	[mlir][linalg][bufferize] Op interface implementation for Bufferization dialect ops This change provides `BufferizableOpInterface` implementations for ops from the Bufferization dialects. These ops are needed at the bufferization boundaries for partial bufferization. Differential Revision: https://reviews.llvm.org/D114618	2021-12-03 16:25:44 +09:00
Matthias Springer	4479138de8	[mlir][linalg][bufferize] Bufferization of tensor.insert This is a lightweight operation, useful for writing unit tests. It will be utilized for testing in subsequent commits. Differential Revision: https://reviews.llvm.org/D114693	2021-12-02 11:58:01 +09:00
Mogball	71668a9367	[mlir][ods][nfc] fixing test cases	2021-12-01 18:50:02 +00:00
Mogball	ca6bd9cd43	[mlir][ods] AttrOrTypeGen uses Class AttrOrType def generator uses `Class` code gen helper, instead of naked raw_ostream. Depends on D113714 and D114807 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D113715	2021-12-01 16:53:23 +00:00
Nicolas Vasilache	c537a94334	[mlir][Vector] Thread 0-d vectors through vector.transfer ops This revision adds 0-d vector support to vector.transfer ops. In the process, numerous cleanups are applied, in particular around normalizing and reducing the number of builders. Reviewed By: ThomasRaoux, springerm Differential Revision: https://reviews.llvm.org/D114803	2021-12-01 16:49:43 +00:00
Stephan Herhut	9fce961d2f	[mlir][linalg] Disable tensor-matmul test under asan The test is currently leaky. Disabling it to make the bots green. Differential Revision: https://reviews.llvm.org/D114857	2021-12-01 16:25:31 +01:00
Matthias Springer	2fd0ea960c	[mlir][linalg][bufferize] CallOps do not bufferize to memory writes However, since CallOps have no aliasing OpResults, their OpOperands always bufferize out-of-place. This change removes `bufferizesToMemoryWrite` from `CallOpInterface`. This method was called, but its return value did not matter. Differential Revision: https://reviews.llvm.org/D114616	2021-12-01 18:47:28 +09:00
Thomas Raoux	69a8a7cf2d	[mlir] Make sure linearizeCollapsedDims doesn't drop input map dims The new affine map generated by linearizeCollapsedDims should not drop dimensions. We need to make sure we create a map with at least as many dimensions as the source map. This prevents FoldProducerReshapeOpByLinearization from generating invalid IR. This solves regression in IREE due to `e4e4da86af` Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D114838 This reverts commit `9a844c2a9b`.	2021-11-30 22:51:56 -08:00
MaheshRavishankar	9a844c2a9b	Revert "[mlir] Make sure linearizeCollapsedDims doesn't drop input map dims" This reverts commit `bc38673e4d`.	2021-11-30 22:43:46 -08:00
MaheshRavishankar	bc38673e4d	[mlir] Make sure linearizeCollapsedDims doesn't drop input map dims The new affine map generated by linearizeCollapsedDims should not drop dimensions. We need to make sure we create a map with at least as many dimensions as the source map. This prevents FoldProducerReshapeOpByLinearization from generating invalid IR. This solves regression in IREE due to `e4e4da86af` Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D114838	2021-11-30 22:37:53 -08:00
Aart Bik	61e353e0b6	[mlir][sparse] added sparse out element wise mult integration test Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D114822	2021-11-30 16:44:38 -08:00
Aart Bik	fe0508dc9d	[mlir][sparse] fix typos in integration tests Reviewed By: bixia, wrengr Differential Revision: https://reviews.llvm.org/D114820	2021-11-30 15:32:20 -08:00
Stephen Neuendorffer	7386364889	Revert "[MLIR] Update Vector To LLVM conversion to be aware of assume_alignment" This reverts commit `29a50c5864`. After LLVM lowering, the original patch incorrectly moved alignment information across an unconstrained GEP operation. This is only correct for some index offsets in the GEP. It seems that the best approach is, in fact, to rely on LLVM to propagate information from the llvm.assume() to users. Thanks to Thomas Raoux for catching this.	2021-11-30 15:18:22 -08:00
Aart Bik	0e85232fa3	[mlir][sparse] refine simply dynamic sparse tensor outputs Proper test for sparse tensor outputs is a single condition throughout the whole tensor index expression (not a general conjunction, since this may include other conditions that cause cancellation). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D114810	2021-11-30 13:45:58 -08:00
Nicolas Vasilache	a08b750ce9	[mlir][tensor] InsertSliceOp verification. This revision reintroduces tensor.insert_slice verification which seems to have vanished over time: a verifier was initially introduced in `cf9503c1b7` but for some reason the invalid.mlir was not properly updated; as time passed the verifier was not called anymore and later the code was deleted. As a consequence, a non-negligible portion of tests has run astray using invalid tensor.insert_slice semantics and needed to be fixed. Also, extract isRankReducedType from TensorOps for better reuse Originally, this facility was used by both tensor and memref forms but it got copied around as dialects were split. Differential Revision: https://reviews.llvm.org/D114715	2021-11-30 20:37:06 +00:00
MaheshRavishankar	311dd55c9e	[mlir][MemRef] Fix SubViewOp canonicalization when a subset of unit-dims are dropped. The canonical type of the result of the `memref.subview` needs to make sure that the previously dropped unit-dimensions are the ones dropped for the canonicalized type as well. This means the generic `inferRankReducedResultType` cannot be used. Instead the current dropped dimensions need to be querried and the same need to be dropped. Reviewed By: nicolasvasilache, ThomasRaoux Differential Revision: https://reviews.llvm.org/D114751	2021-11-30 20:37:06 +00:00
not-jenni	13bdb7ab4a	[mlir][tosa] Add tosa.conv2d as fully_connected canonicalization For a 1x1 weight and stride of 1, the input/weight can be reshaped and passed into a fully connected op then reshaped back Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D114757	2021-11-30 12:01:14 -08:00
gysit	c8f2139eb0	[mlir][linalg] Add decompose to CodegenStrategy. Add the decompose patterns that lower higher dimensional convolutions to lower dimensional ones to CodegenStrategy and use CodegenStrategy to test the decompose patterns. Additionally, remove the assertion that checks the anchor op name is set in the CodegenStrategyTest pass. Removing the assertion allows us to simplify the pipelines used in the interchange and decompose tests. Depends On D114797 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114798	2021-11-30 15:48:29 +00:00
gysit	316e627c2b	[mlir][linalg] Support the empty anchor op string when padding. Add support for an empty anchor op string in vectorization. An empty anchor op string is useful after fusion when there are multiple different operations to vectorize. Depends On D114689 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114690	2021-11-30 15:32:13 +00:00
gysit	7f7103cd06	[mlir][linalg] Use top down traversal for padding. Pad the operation using a top down traversal. The top down traversal unlocks folding opportunities and dim op canonicalizations due to the introduced extract slice operation after the padded operation. Depends On D114585 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114689	2021-11-30 15:30:45 +00:00
gysit	914e72d400	[mlir][linalg] Run CSE after every CodegenStrategy transformation. Add CSE after every transformation. Transformations such as tiling introduce redundant computation, for example, one AffineMinOp for every operand dimension pair. Follow up transformations such as Padding and Hoisting benefit from CSE since comparing slice sizes simplifies to comparing SSA values instead of analyzing affine expressions. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114585	2021-11-30 15:07:51 +00:00
Julian Gross	ae1ea0bead	[mlir] Decompose Bufferization Clone operation into Memref Alloc and Copy. This patch introduces a new conversion to convert bufferization.clone operations into a memref.alloc and a memref.copy operation. This transformation is needed to transform all remaining clones which "survive" all previous transformations, before a given program is lowered further (to LLVM e.g.). Otherwise, these operations cannot be handled anymore and lead to compile errors. See: https://llvm.discourse.group/t/bufferization-error-related-to-memref-clone/4665 Differential Revision: https://reviews.llvm.org/D114233	2021-11-30 10:15:56 +01:00
Alexander Belyaev	f89bb3c012	[mlir] Move bufferization-related passes to `bufferization` dialect. [RFC](https://llvm.discourse.group/t/rfc-dialect-for-bufferization-related-ops/4712) Differential Revision: https://reviews.llvm.org/D114698	2021-11-30 09:58:47 +01:00
gysit	0d0371f58f	[mlir][OpDSL] Fix OpDSL tests after https://reviews.llvm.org/D114680 . Update the shapes of the convolution / pooling tests that where detected after enabling verification during printing (https://reviews.llvm.org/D114680). Also split the emit_structured_generic.py file that previously contained all tests into multiple separate files to simplify debugging. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D114731	2021-11-30 08:57:28 +00:00
Stella Laurenzo	bdc3183742	[mlir][python] Implement more SymbolTable methods. * set_symbol_name, get_symbol_name, set_visibility, get_visibility, replace_all_symbol_uses, walk_symbol_tables * In integrations I've been doing, I've been reaching for all of these to do both general IR manipulation and module merging. * I don't love the replace_all_symbol_uses underlying APIs since they necessitate SYMBOL_COUNT walks and have various sharp edges. I'm hoping that whatever emerges eventually for this can still retain this simple API as a one-shot. Differential Revision: https://reviews.llvm.org/D114687	2021-11-29 20:31:13 -08:00
Aart Bik	7d4da4e1ab	[mlir][sparse] generalize sparse tensor output implementation Moves sparse tensor output support forward by generalizing from injective insertions only to include reductions. This revision accepts the case with all parallel outer and all reduction inner loops, since that can be handled with an injective insertion still. Next revision will allow the inner parallel loop to move inward (but that will require "access pattern expansion" aka "workspace"). Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D114399	2021-11-29 16:15:53 -08:00
Benjamin Kramer	8d474f1d15	[mlir] Handle an edge case when folding reshapes with multiple trailing 1 dimensions We would exit early and miss this case. Differential Revision: https://reviews.llvm.org/D114711	2021-11-29 18:31:43 +01:00
Stephan Herhut	95f34e318c	[mlir][memref] Fix bug in verification of memref.collapse_shape The verifier computed an illegal type with negative dimension size when collapsing partially static memrefs. Differential Revision: https://reviews.llvm.org/D114702	2021-11-29 15:47:12 +01:00
Stella Laurenzo	ace1d0ad3d	[mlir][python] Normalize asm-printing IR behavior. While working on an integration, I found a lot of inconsistencies on IR printing and verification. It turns out that we were: * Only doing "soft fail" verification on IR printing of Operation, not of a Module. * Failed verification was interacting badly with binary=True IR printing (causing a TypeError trying to pass an `str` to a `bytes` based handle). * For systematic integrations, it is often desirable to control verification yourself so that you can explicitly handle errors. This patch: * Trues up the "soft fail" semantics by having `Module.__str__` delegate to `Operation.__str__` vs having a shortcut implementation. * Fixes soft fail in the presence of binary=True (and adds an additional happy path test case to make sure the binary functionality works). * Adds an `assume_verified` boolean flag to the `print`/`get_asm` methods which disables internal verification, presupposing that the caller has taken care of it. It turns out that we had a number of tests which were generating illegal IR but it wasn't being caught because they were doing a print on the `Module` vs operation. All except two were trivially fixed: * linalg/ops.py : Had two tests for direct constructing a Matmul incorrectly. Fixing them made them just like the next two tests so just deleted (no need to test the verifier only at this level). * linalg/opdsl/emit_structured_generic.py : Hand coded conv and pooling tests appear to be using illegal shaped inputs/outputs, causing a verification failure. I just used the `assume_verified=` flag to restore the original behavior and left a TODO. Will get someone who owns that to fix it properly in a followup (would also be nice to break this file up into multiple test modules as it is hard to tell exactly what is failing). Notes to downstreams: * If, like some of our tests, you get verification failures after this patch, it is likely that your IR was always invalid and you will need to fix the root cause. To temporarily revert to prior (broken) behavior, replace calls like `print(module)` with `print(module.operation.get_asm(assume_verified=True))`. Differential Revision: https://reviews.llvm.org/D114680	2021-11-28 18:02:01 -08:00
Nicolas Vasilache	f5a9bfdf8f	[mlir] NFC - Move invalid.mlir tests to the proper dialects	2021-11-28 21:30:40 +00:00
Chris Jones	344eee6f38	[MLIR] Allow `Idempotent` trait to be applied to binary ops. Add `Idempotent` trait to `arith.{andi,ori}`. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D114574	2021-11-26 18:22:49 +00:00
Michal Terepeta	7e65fc9a60	[mlir][Vector] Support 0-D vectors in `BroadcastOp` This changes the op to produce `AnyVectorOfAnyRank` following mostly the code for 1-D vectors. Depends On D114598 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114550	2021-11-26 17:17:18 +00:00
Michal Terepeta	d0f927121e	[mlir][Standard] Support 0-D vectors in `SplatOp` This changes the op to produce `AnyVectorOfAnyRank` and implements this by just inserting the element (skipping the shuffle that we do for the 1-D case). Depends On D114549 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114598	2021-11-26 17:05:15 +00:00
Mats Petersson	30238c3676	[mlir][OpenMP] Add support for SIMD modifier Add support for SIMD modifier in OpenMP worksharing loops. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D111051	2021-11-26 14:04:46 +00:00
Stanislav Funiak	a76ee58f3c	Multi-root PDL matching using upward traversals. This is commit 4 of 4 for the multi-root matching in PDL, discussed in https://llvm.discourse.group/t/rfc-multi-root-pdl-patterns-for-kernel-matching/4148 (topic flagged for review). This PR integrates the various components (root ordering algorithm, nondeterministic execution of PDL bytecode) to implement multi-root PDL matching. The main idea is for the pattern to specify mulitple candidate roots. The PDL-to-PDLInterp lowering selects one of these roots and "hangs" the pattern from this root, traversing the edges downwards (from operation to its operands) when possible and upwards (from values to its uses) when needed. The root is selected by invoking the optimal matching multiple times, once for each candidate root, and the connectors are determined form the optimal matching. The costs in the directed graph are equal to the number of upward edges that need to be traversed when connecting the given two candidate roots. It can be shown that, for this choice of the cost function, "hanging" the pattern an inner node is no better than from the optimal root. The following three main additions were implemented as a part of this PR: 1. OperationPos predicate has been extended to allow tracing the operation accepting a value (the opposite of operation defining a value). 2. Predicate checking if two values are not equal - this is useful to ensure that we do not traverse the edge back downwards after we traversed it upwards. 3. Function for for building the cost graph among the candidate roots. 4. Updated buildPredicateList, building the predicates optimal branching has been determined. Testing: unit tests (an integration test to follow once the stack of commits has landed) Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D108550	2021-11-26 18:11:37 +05:30
Stanislav Funiak	3eb1647af0	Introduced iterative bytecode execution. This is commit 2 of 4 for the multi-root matching in PDL, discussed in https://llvm.discourse.group/t/rfc-multi-root-pdl-patterns-for-kernel-matching/4148 (topic flagged for review). This commit implements the features needed for the execution of the new operations pdl_interp.get_accepting_ops, pdl_interp.choose_op: 1. The implementation of the generation and execution of the two ops. 2. The addition of Stack of bytecode positions within the ByteCodeExecutor. This is needed because in pdl_interp.choose_op, we iterate over the values returned by pdl_interp.get_accepting_ops until we reach finalize. When we reach finalize, we need to return back to the position marked in the stack. 3. The functionality to extend the lifetime of values that cross the nondeterministic choice. The existing bytecode generator allocates the values to memory positions by representing the liveness of values as a collection of disjoint intervals over the matcher positions. This is akin to register allocation, and substantially reduces the footprint of the bytecode executor. However, because with iterative operation pdl_interp.choose_op, execution "returns" back, so any values whose original liveness cross the nondeterminstic choice must have their lifetime executed until finalize. Testing: pdl-bytecode.mlir test Reviewed By: rriddle, Mogball Differential Revision: https://reviews.llvm.org/D108547	2021-11-26 18:11:37 +05:30
Stanislav Funiak	842b6861c0	Defines new PDLInterp operations needed for multi-root matching in PDL. This is commit 1 of 4 for the multi-root matching in PDL, discussed in https://llvm.discourse.group/t/rfc-multi-root-pdl-patterns-for-kernel-matching/4148 (topic flagged for review). These operations are: * pdl.get_accepting_ops: Returns a list of operations accepting the given value or a range of values at the specified position. Thus if there are two operations `%op1 = "foo"(%val)` and `%op2 = "bar"(%val)` accepting a value at position 0, `%ops = pdl_interp.get_accepting_ops of %val : !pdl.value at 0` will return both of them. This allows us to traverse upwards from a value to operations accepting the value. * pdl.choose_op: Iteratively chooses one operation from a range of operations. Therefore, writing `%op = pdl_interp.choose_op from %ops` in the example above will select either `%op1`or `%op2`. Testing: Added the corresponding test cases to mlir/test/Dialect/PDLInterp/ops.mlir. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D108543	2021-11-26 17:59:22 +05:30
Tobias Gysi	8d07ba817c	[mlir][linalg] Simplify the hoist padding tests. Use primarily matvec instead of matmul to test hoist padding. Test the hoisting only starting from already padded IR. Use one-dimensional tiling only except for the tile_and_fuse test that exercises hoisting on a larger loop nest with fill and pad tensor operations in the backward slice. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114608	2021-11-26 07:40:22 +00:00
Matthias Springer	c94b80b438	[mlir][linalg][bufferize][NFC] Allow returning arbitrary memrefs If `allowReturnMemref` is set to true, arbitrary memrefs may be returned from FuncOps. Also remove allocation hoisting code, which is only partly implemented at the moment. The purpose of this commit is to untangle `bufferize` from `aliasInfo`. (Even with this change, they are not fully untangled yet.) Differential Revision: https://reviews.llvm.org/D114507	2021-11-26 11:26:46 +09:00
Michal Terepeta	cc311a155a	[mlir][Vector] Support 0-D vectors in `VectorPrintOpConversion` Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114549	2021-11-25 20:12:18 +00:00
Uday Bondhugula	c89fc1eec3	[MLIR] NFC. Rename MLIR CAPI ExecutionEngine target for consistency Rename MLIR CAPI ExecutionEngine target for consistency: MLIRCEXECUTIONENGINE -> MLIRCAPIExecutionEngine in line with other targets. Differential Revision: https://reviews.llvm.org/D114596	2021-11-26 00:23:17 +05:30
Alexander Belyaev	57470abc41	[mlir] Move memref.[tensor_load\|buffer_cast\|clone] to "bufferization" dialect. https://llvm.discourse.group/t/rfc-dialect-for-bufferization-related-ops/4712 Differential Revision: https://reviews.llvm.org/D114552	2021-11-25 11:50:39 +01:00
Tobias Gysi	43dc6d5d57	[mlir][linalg] Cleanup hoisting test (NFC). Rename the check prefixes to HOIST21 and HOIST32 to clarify the different flag configurations. Depends On D114438 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114442	2021-11-25 10:42:24 +00:00
Tobias Gysi	4b03906346	[mlir][linalg] Perform checks early in hoist padding. Instead of checking for unexpected operations (any operation with a region except for scf::For and `padTensorOp` or operations with a memory effect) while cloning the packing loop nest perform the checks early. Update `dropNonIndexDependencies` to check for unexpected operations. Additionally, check all of these operations have index type operands only. Depends On D114428 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114438	2021-11-25 10:37:12 +00:00
Tobias Gysi	fd723eaa92	[mlir][linalg] Limit hoist padding to constant paddings. Limit hoist padding to pad tensor ops that depend only on a constant value. Supporting arbitrary padding values that depend on computations part of the backward slice to hoist require complex analysis to ensure the computation can be hoisted. Depends On D114420 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114428	2021-11-25 10:31:39 +00:00
Tobias Gysi	ed7c1fb9b0	[mlir][linalg] Add backward slice filtering in hoist padding. Adapt hoist padding to filter the backward slice before cloning the packing loop nest. The filtering removes all operations that are not used to index the hoisted pad tensor op and its extract slice op. The filtering is needed to support the more complex loop nests created after fusion. For example, fusing the producer of an output operand can added linalg ops and pad tensor ops to the backward slice. These operations have regions and currently prevent hoisting. The following example demonstrates the effect of the newly introduced `dropNonIndexDependencies` method that filters the backward slice: ``` %source = linalg.fill(%cst, %arg0) scf.for %i %unrelated = linalg.fill(%cst, %arg1) // not used to index %source! scf.for %j (%arg2 = %unrelated) scf.for %k // not used to index %source! %ubi = affine.min #map(%i) %ubj = affine.min #map(%j) %slice = tensor.extract_slice %source [%i, %j] [%ubi, %ubj] %padded_slice = linalg.pad_tensor %slice ``` dropNonIndexDependencies(%padded_slice, %slice) removes [scf.for %k, linalg.fill(%cst, %arg1)] from backwardSlice. Depends On D114175 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114420	2021-11-25 10:30:10 +00:00
Alexander Belyaev	3c228573bc	Revert "[mlir][SCF] Further simplify affine maps during `for-loop-canonicalization`" This reverts commit `ee1bf18672`. It breaks IREE lowering. Reverting the commit for now while we investigate what's going on.	2021-11-25 10:54:52 +01:00
Butygin	8dae0b6b6c	[mlir][spirv] arith::RemSIOp OpenCL lowering Differential Revision: https://reviews.llvm.org/D114524	2021-11-25 12:44:06 +03:00
Uday Bondhugula	25d173499e	[MLIR] Rename test/python/dialects/math.py -> math_dialect.py Rename test/python/dialects/math.py -> math_dialect.py to avoid a collision with a Python standard package of the same name. These test scripts are run by path and are not part of a package. Python apparently implicitly adds the containing directory to its PYTHONPATH. As such, test scripts with common names run the risk of conflicting with global names and resolution of an import for the latter happens to the former. Differential Revision: https://reviews.llvm.org/D114568	2021-11-25 09:51:49 +05:30
Matthias Springer	ee1bf18672	[mlir][SCF] Further simplify affine maps during `for-loop-canonicalization` * Implement `FlatAffineConstraints::getConstantBound(EQ)`. * Inject a simpler constraint for loops that have at most 1 iteration. * Taking into account constant EQ bounds of FlatAffineConstraint dims/symbols during canonicalization of the resulting affine map in `canonicalizeMinMaxOp`. Differential Revision: https://reviews.llvm.org/D114138	2021-11-25 12:44:19 +09:00
bakhtiyar	7bd87a03fd	Promote readability by factoring out creation of min/max operation. Remove unnecessary divisions. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D110680	2021-11-24 16:17:23 -08:00
Lei Zhang	cb395f66ac	[mlir][spirv] Change the return type for {Min\|Max}VersionBase For synthesizing an op's implementation of the generated interface from {Min\|Max}Version, we need to define an `initializer` and `mergeAction`. The `initializer` specifies the initial version, and `mergeAction` specifies how version specifications from different parts of the op should be merged to generate a final version requirements. Previously we use the specified version enum as the type for both the initializer and thus the final return type. This means we need to perform `static_cast` over some hopefully not used number (`~0u`) as the initializer. This is quite opaque and sort of not guaranteed to work. Also, there are ops that have an enum attribute where some values declare version requirements (e.g., enumerant `B` requires v1.1+) but some not (e.g., enumerant `A` requires nothing). Then a concrete op instance with `A` will still declare it implements the version interface (because interface implementation is static for an op) but actually theirs no requirements for version. So this commit changes to use an more explicit `llvm::Optional` to wrap around the returned version enum. This should make it more clear. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D108312	2021-11-24 17:33:01 -05:00
Tobias Gysi	b6e7b1be73	[mlir][linalg] Simplify padding test (NFC). The padding tests previously contained the tile loops. This revision removes the tile loops since padding itself does not consider the loops. Instead the induction variables are passed in as function arguments which promotes them to symbols in the affine expressions. Note that the pad-and-hoist.mlir test still exercises padding in the context of the full loop nest. Depends On D114175 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114227	2021-11-24 19:21:50 +00:00
Tobias Gysi	86f186efea	[mlir][linalg] Add makeComposedPadHighOp. Add the makeComposedPadHighOp method which creates a new PadTensorOp if necessary. If the source to pad is actually the result of a sequence of padded LinalgOps, the method checks if padding is needed or if we can use the padded result of the padded LinalgOp sequence directly. Example: ``` %0 = tensor.extract_slice %arg0 [%iv0, %iv1] [%sz0, %sz1] %1 = linalg.pad_tensor %0 low[0, 0] high[...] { linalg.yield %cst } %2 = linalg.matmul ins(...) outs(%1) %3 = tensor.extract_slice %2 [0, 0] [%sz0, %sz1] ``` when padding %3 return %2 instead of introducing ``` %4 = linalg.pad_tensor %3 low[0, 0] high[...] { linalg.yield %cst } ``` Depends On D114161 Reviewed By: nicolasvasilache, pifon2a Differential Revision: https://reviews.llvm.org/D114175	2021-11-24 19:18:59 +00:00
Tobias Gysi	a4fd8cb76f	[mlir][linalg] Update failure conditions for padOperandToSmallestStaticBoundingBox. Change the failure condition of padOperandToSmallestStaticBoundingBox to never fail if the operand is already statically sized. In particular: - if the padding value computation fails -> return failure if the operand shape is dynamic and success if it is static. - if there is no extract slice op -> return failure if the operand shape is dynamic and success if it is static. The latter change prevents padding from failure if the output operand passed by iteration argument is statically sized since in this case the extract / insert slice pairs are removed by canonicalization. Depends On D114153 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114161	2021-11-24 19:10:50 +00:00
Butygin	7f5d9bf13a	[mlir][scf] Canonicalize scf.while with unused results Differential Revision: https://reviews.llvm.org/D114291	2021-11-24 11:11:22 +03:00
Bixia Zheng	02710413a3	Accept symmetric sparse matrix in Matrix Market Exchange Format. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D114402	2021-11-23 19:53:17 -08:00
Butygin	75a1bee05d	[mlir][spirv] Add math to OpenCL conversion Differential Revision: https://reviews.llvm.org/D113780	2021-11-24 02:31:21 +03:00
Rob Suderman	0f1e52afa9	[mlir][tosa] Materialize tosa.pad value and fold noop pads Padding now can explicitly specify the padding value when non-zero is wanted. This also includes bypassing pads when the pad does nothing. Differential Revision: https://reviews.llvm.org/D113611	2021-11-23 12:23:42 -08:00
Rob Suderman	54eec7cafc	[mlir][tosa] Separate tosa.transpose_conv decomposition and added stride support Transpose convolution decomposition is now performed in a separate pass. This allows padding / constant propagation to be performed at the TOSA level. It also adds support for striding when there is no dilation. Differential Revision: https://reviews.llvm.org/D114409	2021-11-23 12:16:44 -08:00
wren romano	286248db2c	[mlir][sparse] Moving integration tests that merely use the Python API Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D114192	2021-11-23 10:59:38 -08:00
Nicolas Vasilache	3ff4e5f2a4	[mlir][Vector] Thread 0-d vectors through InsertElementOp. This revision makes concrete use of 0-d vectors to extend the semantics of InsertElementOp. Reviewed By: dcaballe, pifon2a Differential Revision: https://reviews.llvm.org/D114388	2021-11-23 12:55:11 +00:00
Nicolas Vasilache	e7026aba00	[mlir][Vector] Thread 0-d vectors through ExtractElementOp. This revision starts making concrete use of 0-d vectors to extend the semantics of ExtractElementOp. In the process a new VectorOfAnyRank Tablegen OpBase.td is added to allow progressive transition to supporting 0-d vectors by gradually opting in. Differential Revision: https://reviews.llvm.org/D114387	2021-11-23 12:39:44 +00:00
Nicolas Vasilache	b2729fda60	[mlir][Vector] Add a vblendps-based impl for transpose8x8 (both intrin and inline_asm) This revision follows up on the conversation titled: ```[llvm-dev] Understanding and controlling some of the AVX shuffle emission paths``` The revision adds a vblendps-based implementation for transpose8x8 and further distinguishes between and intrinsics and an inline_asm implementation. This results in roughly 20% fewer cycles as reported by llvm-mca: After this revision (intrinsic version, resolves to virtually identical assembly as per the llvm-dev discussion, no vblendps instruction is emitted): ``` Iterations: 100 Instructions: 5900 Total Cycles: 2415 Total uOps: 7300 Dispatch Width: 6 uOps Per Cycle: 3.02 IPC: 2.44 Block RThroughput: 24.0 Cycles with backend pressure increase [ 89.90% ] Throughput Bottlenecks: Resource Pressure [ 89.65% ] - SKXPort1 [ 0.04% ] - SKXPort2 [ 12.42% ] - SKXPort3 [ 12.42% ] - SKXPort5 [ 89.52% ] Data Dependencies: [ 37.06% ] - Register Dependencies [ 37.06% ] - Memory Dependencies [ 0.00% ] ``` After this revision (inline_asm version, vblendps instructions are indeed emitted): ``` Iterations: 100 Instructions: 6300 Total Cycles: 2015 Total uOps: 7700 Dispatch Width: 6 uOps Per Cycle: 3.82 IPC: 3.13 Block RThroughput: 20.0 Cycles with backend pressure increase [ 83.47% ] Throughput Bottlenecks: Resource Pressure [ 83.18% ] - SKXPort0 [ 14.49% ] - SKXPort1 [ 14.54% ] - SKXPort2 [ 19.70% ] - SKXPort3 [ 19.70% ] - SKXPort5 [ 83.03% ] - SKXPort6 [ 14.49% ] Data Dependencies: [ 39.75% ] - Register Dependencies [ 39.75% ] - Memory Dependencies [ 0.00% ] ``` An accessible copy of the conversation is available [here](https://gist.github.com/nicolasvasilache/68c7f34012584b0e00f335bcb374ede0). Differential Revision: https://reviews.llvm.org/D114393	2021-11-23 07:31:22 +00:00
Sandeep Dasgupta	e5a8c8c883	[mlir] Refactoring a few Parser APIs Refactored two new parser APIs parseGenericOperationAfterOperands and parseCustomOperationName out of parseGenericOperation and parseCustomOperation. Motivation: Sometimes an op can be printed in a special way if certain criteria is met. While parsing, we need to handle all the versions. `parseGenericOperationAfterOperands` is handy in situation where we already parsed the operands and decide to fall back to default parsing. `parseCustomOperationName` is useful when we need to know details (dialect, operation name etc.) about a parsed token meant to be an mlir operation. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D113719	2021-11-23 06:11:01 +00:00
Matthias Springer	fb99686bfd	[mlir][linalg][bufferize] Limited support for scf.execute_region Add support for analysis only. Differential Revision: https://reviews.llvm.org/D114055	2021-11-23 12:20:39 +09:00
Benjamin Kramer	966b720983	[mlir][memref] Fix expanded shape ops memref.cast folding with changed type `memref.expand_shape` has verification logic to make sure result dim must be static if all the collapsing src dims are static. This can be relaxed once expand_shape supports more dynamism. Differential Revision: https://reviews.llvm.org/D114391	2021-11-22 22:56:15 +01:00
Groverkss	98daa4e425	[MLIR] Fix incorrect removal of source loop in loop fusion This patch fixes a bug in loop fusion pass where the source loop was removed even when the fused loop did not cover all iterations of the source loop. This was because the fast hueristic check for checking if source loop and fused loop have same iterations did not take into account steps in loop. Reviewed By: dcaballe, bondhugula Differential Revision: https://reviews.llvm.org/D114164	2021-11-23 02:54:09 +05:30
Mehdi Amini	e0b7bee7cf	Revert "[mlir][Vector] Add a vblendps-based impl for transpose8x8 (both intrin and inline_asm)" This reverts commit `a9e236bed8`. This broke the Windows build: mlir\include\mlir/Dialect/X86Vector/Transforms.h(28): error C2061: syntax error: identifier 'uint'	2021-11-22 19:23:18 +00:00
Lei Zhang	93284120f2	[mlir][vector] Fix TransferOpReduceRank for 0-D tensors We cannot unconditionally generate memref.load ops for such cases; need to check the source's type. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114376	2021-11-22 12:30:46 -05:00
Alex Zinenko	9c5982ef8e	[mlir] support recursive types in type conversion infra MLIR supports recursive types but they could not be handled by the conversion infrastructure directly as it would result in infinite recursion in `convertType` for elemental types. Support this case by keeping the "call stack" of nested type conversions in the TypeConverter class and by passing it as an optional argument to the individual conversion callback. The callback can then check if a specific type is present on the stack more than once to detect and handle the recursive case. This approach is preferred to the alternative approach of having a separate callback dedicated to handling only the recursive case as the latter was observed to introduce ~3% time overhead on a 50MB IR file even if it did not contain recursive types. This approach is also preferred to keeping a local stack in type converters that need to handle recursive types as that would compose poorly in case of out-of-tree or cross-project extensions. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D113579	2021-11-22 18:16:02 +01:00
Tobias Gysi	32c43241e7	[mlir][linalg] Always generate an extract/insert slice pair when tiling output tensors. Adapt tiling to always generate an extract/insert slice pair for output tensors even if the tensor is not tiled. Having an explicit extract/insert slice pair simplifies followup transformations such as padding and bufferization. In particular, it makes read and written iteration argument slices explicit. Depends On D114067 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114085	2021-11-22 13:12:43 +00:00
Tobias Gysi	e3d386ea27	[mlir][linalg] Add a tile and fuse on tensors pattern. Add a pattern to apply the new tile and fuse on tensors method. Integrate the pattern into the CodegenStrategy and use the CodegenStrategy to implement the tests. Depends On D114012 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114067	2021-11-22 11:13:21 +00:00
Tobias Gysi	0ccc44cec0	[mlir][linalg] Fix tile and fuse for outermost reduction. Tile and fuse failed if the outermost tile loop is a reduction dimension. Add the necessary check to handle outermost reductions and introduce a test case to verify the change. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114012	2021-11-22 10:44:15 +00:00
Nicolas Vasilache	a9e236bed8	[mlir][Vector] Add a vblendps-based impl for transpose8x8 (both intrin and inline_asm) This revision follows up on the conversation titled: ```[llvm-dev] Understanding and controlling some of the AVX shuffle emission paths``` The revision adds a vblendps-based implementation for transpose8x8 and further distinguishes between and intrinsics and an inline_asm implementation. This results in roughly 20% fewer cycles as reported by llvm-mca: After this revision (intrinsic version, resolves to virtually identical assembly as per the llvm-dev discussion, no vblendps instruction is emitted): ``` Iterations: 100 Instructions: 5900 Total Cycles: 2415 Total uOps: 7300 Dispatch Width: 6 uOps Per Cycle: 3.02 IPC: 2.44 Block RThroughput: 24.0 Cycles with backend pressure increase [ 89.90% ] Throughput Bottlenecks: Resource Pressure [ 89.65% ] - SKXPort1 [ 0.04% ] - SKXPort2 [ 12.42% ] - SKXPort3 [ 12.42% ] - SKXPort5 [ 89.52% ] Data Dependencies: [ 37.06% ] - Register Dependencies [ 37.06% ] - Memory Dependencies [ 0.00% ] ``` After this revision (inline_asm version, vblendps instructions are indeed emitted): ``` Iterations: 100 Instructions: 6300 Total Cycles: 2015 Total uOps: 7700 Dispatch Width: 6 uOps Per Cycle: 3.82 IPC: 3.13 Block RThroughput: 20.0 Cycles with backend pressure increase [ 83.47% ] Throughput Bottlenecks: Resource Pressure [ 83.18% ] - SKXPort0 [ 14.49% ] - SKXPort1 [ 14.54% ] - SKXPort2 [ 19.70% ] - SKXPort3 [ 19.70% ] - SKXPort5 [ 83.03% ] - SKXPort6 [ 14.49% ] Data Dependencies: [ 39.75% ] - Register Dependencies [ 39.75% ] - Memory Dependencies [ 0.00% ] ``` An accessible copy of the conversation is available [here](https://gist.github.com/nicolasvasilache/68c7f34012584b0e00f335bcb374ede0). Reviewed By: ftynse, dcaballe Differential Revision: https://reviews.llvm.org/D114335	2021-11-22 10:32:34 +00:00
Jacques Pienaar	6f9cceb775	[mlir] Move trait to InferTypeOpInterface Step towards removing the hard coded behavior for this trait and to instead use common interface. Differential Revision: https://reviews.llvm.org/D114208	2021-11-21 14:41:12 -08:00
Arnab Dutta	ec7b0d4d34	[MLIR] Simplify Semi-affine expressions by rule based matching and replacing "expr - q * (expr floordiv q)" with "expr mod q" expression. Add rule based matching for detecting and transforming "expr - q * (expr floordiv q)" to "expr mod q", where q is a symbolic exxpression, in simplifyAdd function. Reviewed By: bondhugula, dcaballe Differential Revision: https://reviews.llvm.org/D112985	2021-11-20 21:05:36 +05:30
Thomas Raoux	47555d73f6	[mlir][gpu] Extend shuffle op modes and add nvvm lowering Add up, down and idx modes to gpu shuffle ops, also change the mode from string to enum Differential Revision: https://reviews.llvm.org/D114188	2021-11-19 11:14:31 -08:00
Thomas Raoux	06dbb28569	[mlir][vector] Remove usage of shapecast to remove unit dim Instead of using shape_cast op in the pattern removing leading unit dimensions we use extract/broadcast ops. This is part of the effort to restrict ShapeCastOp fuirther in the future and only allow them to convert to or from 1D vector. This also adds extra canonicalization to fill the gaps in simplifying broadcast/extract ops. Differential Revision: https://reviews.llvm.org/D114205	2021-11-19 10:25:21 -08:00
Krzysztof Drewniak	f849640a0c	[MLIR] Make the ROCM integration tests runnable - Move the #define s to the GPU Transform library from GPU Ops so that SerializeToHsaco is non-trivially compiled - Add required includes to SerializeToHsaco - Move MCSubtargetInfo creation to the correct point in the compilation process - Change mlir in ROCM tests to account for renamed/moved ops Differential Revision: https://reviews.llvm.org/D114184	2021-11-19 17:09:53 +00:00
Mogball	7c5ecc8b7e	[mlir][vector] Insert/extract element can accept index `vector::InsertElementOp` and `vector::ExtractElementOp` have had their `position` operand changed to accept `AnySignlessIntegerOrIndex` for better operability with operations that use `index`, such as affine loops. LLVM's `extractelement` and `insertelement` can also accept `i64`, so lowering directly to these operations without explicitly inserting casts is allowed. SPIRV's equivalent ops can also accept `i64`. Reviewed By: nicolasvasilache, jpienaar Differential Revision: https://reviews.llvm.org/D114139	2021-11-18 22:40:29 +00:00
Markus Böck	0a8a5902a6	[mlir] Fully qualify default generated type/attribute printer and parser This patch makes it possible to use the newly added useDefaultAttributePrinterParser and useDefaultTypePrinterParser dialect options without any using namespace declarations. Two things had to be done to make this possible: * Fully qualify any type usages or functions from the mlir namespace in the generated C++ code * Makes sure to emit the printers and parsers inside the same namespace as the Dialect Differential Revision: https://reviews.llvm.org/D114168	2021-11-18 20:24:00 +01:00
MaheshRavishankar	526dfe3f4d	[mlir][Linalg] Do not return failure when all tile sizes are zero. Returning failure when tile sizes are all zero prevents the change in the marker. This makes pattern rewriter run the pattern multiple times only to exit when it hits a limit. Instead just clone the operation (since tiling is essentially cloning in this case). Then the transformation filter kicks in to avoid the pattern rewriter to be invoked many times. Differential Revision: https://reviews.llvm.org/D113949	2021-11-18 09:28:25 -08:00
Krzysztof Drewniak	fb1a06aa13	[MLIR][GPU] Add target arguments to SerializeToHsaco Compiling code for AMD GPUs requires knowledge of which chipset is being targeted, especially if the code uses chipset-specific intrinsics (which is the case in a downstream convolution generator). This commit adds `target`, `chipset` and `features` arguments to the SerializeToHsaco constructor to enable passing in this required information. It also amends the ROCm integration tests to pass in the target chipset, which is set to the chipset of the first GPU on the system executing the tests. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D114107	2021-11-18 16:28:44 +00:00
Michal Terepeta	54c9984207	[mlir][Python] Fix generation of accessors for Optional Previously, in case there was only one `Optional` operand/result within the list, we would always return `None` from the accessor, e.g., for a single optional result we would generate: ``` return self.operation.results[0] if len(self.operation.results) > 1 else None ``` But what we really want is to return `None` only if the length of `results` is smaller than the total number of element groups (i.e., the optional operand/result is in fact missing). This commit also renames a few local variables in the generator to make the distinction between `isVariadic()` and `isVariableLength()` a bit more clear. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D113855	2021-11-18 09:42:57 +01:00
Matthias Springer	ebf8d74e92	[mlir][linalg][bufferize] Fix bufferize bug where non-tensor ops are not skipped `BufferizableOpInterface::bufferize` will only be called on ops that have tensor operands and/or results. Differential Revision: https://reviews.llvm.org/D113962	2021-11-18 16:20:22 +09:00
River Riddle	0c7890c844	[mlir] Convert NamedAttribute to be a class NamedAttribute is currently represented as an std::pair, but this creates an extremely clunky .first/.second API. This commit converts it to a class, with better accessors (getName/getValue) and also opens the door for more convenient API in the future. Differential Revision: https://reviews.llvm.org/D113956	2021-11-18 05:39:29 +00:00
Aart Bik	1ce77b562d	[mlir][sparse] refine lexicographic insertion to any tensor First version was vectors only. With some clever "path" insertion, we now support any d-dimensional tensor. Up next: reductions too Reviewed By: bixia, wrengr Differential Revision: https://reviews.llvm.org/D114024	2021-11-17 18:08:42 -08:00
Robert Suderman	6e41a06911	[mlir][tosa] Revert add-0 canonicalization for floating-point Floating point optimization can produce incorrect numerical resutls for -0.0 + 0.0 optimization as result needs to be -0.0. Reviewed By: eric-k256 Differential Revision: https://reviews.llvm.org/D114127	2021-11-17 17:29:57 -08:00
Rob Suderman	044e7e013e	[mlir][tosa] Fixed shape inference for tosa.transpose_conv2d Transpose conv2d shape inference was incorrect, tests did not properly validate that the shape inference was executing. Corrected shape inference, and extended tests to actually execute. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D114026	2021-11-17 14:59:52 -08:00
Alex Zinenko	bca003dea8	[mlir] Fix wrong variable name in Linalg OpDSL The name seems to have been left over from a renaming effort on an unexercised codepaths that are difficult to catch in Python. Fix it and add a test that exercises the codepath. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D114004	2021-11-17 22:55:35 +01:00
Michal Terepeta	ddf2d62c7d	[mlir][Vector] First step for 0D vector type There seems to be a consensus that we should allow 0D vectors: https://llvm.discourse.group/t/should-we-have-0-d-vectors/3097 This commit is only the first step: it changes the verifier and the parser to allow vectors like `vector<f32>` (but does not allow explicit 0 dimensions, i.e., `vector<0xf32>` is not allowed). Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114086	2021-11-17 14:58:24 +00:00
River Riddle	195730a650	[mlir][NFC] Replace references to Identifier with StringAttr This is part of the replacement of Identifier with StringAttr. Differential Revision: https://reviews.llvm.org/D113953	2021-11-16 17:36:26 +00:00
William S. Moses	30d87d4a5d	[MLIR][LLVM] Permit integer types in switch other than i32 LLVM switchop currently only permits i32. Both LLVM IR and MLIR Standard switch permit other integer types leading to an illegal state when lowering an i8 switch from MLIR standard Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D113955	2021-11-16 12:00:37 -05:00
Nicolas Vasilache	b377807a76	[mlir][LLVM] Fix folding of LLVM::ExtractValueOp Limit the backtracking along def-use chains when a prefix is encountered as it would generate incorrect foldings. Differential Revision: https://reviews.llvm.org/D113975	2021-11-16 14:49:05 +00:00
Butygin	6c48f6aafe	[mlir][spirv] add AtomicFAddEXTOp Differential Revision: https://reviews.llvm.org/D113764	2021-11-16 14:24:22 +03:00
Butygin	526b71e44a	[mlir] spirv: Add scf.while spirv conversion * It works similar to scf.for coversion, but convert condition and yield ops as part of scf.whille pattern so it don't need to maintain external state Differential Revision: https://reviews.llvm.org/D113007	2021-11-16 13:19:34 +03:00
Adrian Kuegel	921d91f3ac	[mlir] Support multi-dimensional vectors in MathToLibm conversion. Differential Revision: https://reviews.llvm.org/D113969	2021-11-16 11:13:52 +01:00
Arnab Dutta	1402299271	[MLIR] Simplify semi-affine expressions using flattening For the semi affine expressions, whenever rhs of a floordiv, ceildiv, mod or product expression is a symbolic expression, we introduce a local variable representing the result, and store the floordiv/ceildiv, mod or product affine expression in LocalExprs. In this way the expression is flattened, and trivial addition and subtraction related simplifications are performed. Also rule based matching for detecting and transforming "expr - q * (expr floordiv q)" to "expr mod q", where q is a symbolic exxpression, in simplifyAdd function. Differential Revision: https://reviews.llvm.org/D112808	2021-11-16 15:42:22 +05:30
Mehdi Amini	1585b13024	Revert "[MLIR][LLVM] Permit integer types in switch other than i32" This reverts commit `94992670fc`. Build is broken with: tools/mlir/include/mlir/Dialect/LLVMIR/LLVMOps.cpp.inc:23996:3: error: no matching function for call to 'printSwitchOpCases' printSwitchOpCases(_odsPrinter, *this, getValue().getType(), getCaseValuesAttr(), getCaseDestinations(), getCaseOperands(), getCaseOperands().getTypes()); ^~~~~~~~~~~~~~~~~~	2021-11-16 05:59:12 +00:00
William S. Moses	94992670fc	[MLIR][LLVM] Permit integer types in switch other than i32 LLVM switchop currently only permits i32. Both LLVM IR and MLIR Standard switch permit other integer types leading to an illegal state when lowering an i8 switch from MLIR standard Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D113955	2021-11-16 00:46:25 -05:00
Aart Bik	f66e5769d4	[mlir][sparse] first version of "truly" dynamic sparse tensors as outputs of kernels This revision contains all "sparsification" ops and rewriting necessary to support sparse output tensors when the kernel has no reduction (viz. insertions occur in lexicographic order and are "injective"). This will be later generalized to allow reductions too. Also, this first revision only supports sparse 1-d tensors (viz. vectors) as output in the runtime support library. This will be generalized to n-d tensors shortly. But this way, the revision is kept to a manageable size. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D113705	2021-11-15 15:33:32 -08:00
natashaknk	381677dfbf	[tosa][mlir] Refactor tosa.reshape lowering to linalg for dynamic cases. Split tosa.reshape into three individual lowerings: collapse, expand and a combination of both. Add simple dynamic shape support. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D113936	2021-11-15 15:31:37 -08:00
not-jenni	cdb0623ad8	[mlir][tosa] Add tosa.mul by one canonicalization Multiply by one can be removed during canonicalization. This optimizes away unneeded operations. Differential Revision: https://reviews.llvm.org/D113807	2021-11-15 14:52:16 -08:00
Nicolas Vasilache	0b17336f79	[mlir][Vector] Make vector.shape_cast based size-1 foldings opt-in and separate. This is in prevision of dropping them altogether and using insert/extract based patterns. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D113928	2021-11-15 21:17:57 +00:00
Nicolas Vasilache	b828506eca	[mlir][Linalg] Add a DownscaleDepthwiseConv2DNhwcHwcOp decomposition pattern. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D113907	2021-11-15 20:48:16 +00:00
Nicolas Vasilache	641fe70776	[mlir][Linalg] Fix and improve vectorization of depthwise convolutions. When trying to connect the vectorization of depthwise convolutions to e2e execution a number of problems surfaced. Fix an off-by-one error on the size of the input vector (similary to what was previously done for regular conv). Rewrite the lowering to vector.fma instead of vector.contract: the KW reduction dimension has already been unrolled and vector.contract requires a reduction dimension to be valid. Differential Revision: https://reviews.llvm.org/D113884	2021-11-15 12:58:05 +00:00
Nicolas Vasilache	ee80ffbf9a	[mlir][Linalg] Add bounded recursion declaration to FMAOp -> LLVM conversion. FMAOp -> LLVM conversion is done progressively by peeling off 1 dimension from FMAOp at each pattern iteration. Add the recursively bounded property declaration to the pattern so that the rewriter can apply it multiple times. Without this, FMAOps with 3+D do not lower to LLVM. Differential Revision: https://reviews.llvm.org/D113886	2021-11-15 12:41:52 +00:00
Alexander Belyaev	9b1d90e8ac	[mlir] Move min/max ops from Std to Arith. Differential Revision: https://reviews.llvm.org/D113881	2021-11-15 13:19:17 +01:00
Nicolas Vasilache	f1c86b8354	[mlir][Linalg] Fix off-by-one error in conv vector size computation. Differential Revision: https://reviews.llvm.org/D113877	2021-11-15 11:37:44 +00:00
Matthias Springer	542a8cfba7	[mlir][linalg][bufferize] Fix insertion point of result buffers Differential Revision: https://reviews.llvm.org/D113723	2021-11-15 19:27:33 +09:00
Nicolas Vasilache	f67171ac58	[mlir][Linalg] Make depthwise convolution naming scheme consistent. Names should be consistent across all operations otherwise painful bugs will surface. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D113762	2021-11-15 07:54:29 +00:00
Stella Laurenzo	132bc6e2d4	Re-apply "[mlir] Allow out-of-tree python building from installed MLIR." Re-applies D111513: * Adds a full-fledged Python example dialect and tests to the Standalone example (need to do a bit of tweaking in the top level CMake and lit tests to adapt better to if not building with Python enabled). * Rips out remnants of custom extension building in favor of pybind11_add_module which does the right thing. * Makes python and extension sources installable (outputs to src/python/${name} in the install tree): Both Python and C++ extension sources get installed as downstreams need all of this in order to build a derived version of the API. * Exports sources targets (with our properties that make everything work) by converting them to INTERFACE libraries (which have export support), as recommended for the forseeable future by CMake devs. Renames custom properties to start with lower-case letter, as also recommended/required (groan). * Adds a ROOT_DIR argument to declare_mlir_python_extension since now all C++ sources for an extension must be under the same directory (to line up at install time). * Downstreams will need to adapt by: * Remove absolute paths from any SOURCES for declare_mlir_python_extension (I believe all downstreams are just using ${CMAKE_CURRENT_SOURCE_DIR} here, which can just be ommitted). May need to set ROOT_DIR if not relative to the current source directory. * To allow further downstreams to install/build, will need to make sure that all C++ extension headers are also listed under SOURCES for declare_mlir_python_extension. This reverts commit `1a6c26d1f5`. Reviewed By: stephenneuendorffer Differential Revision: https://reviews.llvm.org/D113732	2021-11-14 20:31:34 -08:00
Mogball	d259594be9	[mlir][ods] AttrOrTypeDef format: parse types Add template specialization to `FieldParser` for parsing types. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D113867	2021-11-14 23:24:29 +00:00
Nicolas Vasilache	99ff697bf7	[mlir][Vector] Add support for 1D depthwise conv vectorization At this time the 2 flavors of conv are a little too different to allow significant code sharing and other will likely come up. so we go the easy route first by duplicating and adapting. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D113758	2021-11-12 13:14:09 +00:00
Nicolas Vasilache	aa37318067	[mlir][Linalg] Rewrite DownscaleSizeOneWindowed2DConvolution to use rank-reducing insert/extract slices. This rewriting enables better bufferization and canonicalizations. Differential Revision: https://reviews.llvm.org/D113745	2021-11-12 11:57:12 +00:00
Mehdi Amini	f5f11e6b16	Add a cppType string in AttrDef to make it possible to use them as parameters in other attributes Differential Revision: https://reviews.llvm.org/D113737	2021-11-12 07:26:06 +00:00
Stella Laurenzo	c265170110	[mlir] Add MLIR-C dylib. Per discussion on discord and various feature requests across bindings (Haskell and Rust bindings authors have asked me directly), we should be building a link-ready MLIR-C dylib which exports the C API and can be used without linking to anything else. This patch: * Adds a new MLIR-C aggregate shared library (libMLIR-C.so), which is similar in name and function to libLLVM-C.so. * It is guarded by the new CMake option MLIR_BUILD_MLIR_C_DYLIB, which has a similar purpose/name to the LLVM_BUILD_LLVM_C_DYLIB option. * On all platforms, this will work with both static, BUILD_SHARED_LIBS, and libMLIR builds, if supported: * In static builds: libMLIR-C.so will export the CAPI symbols and statically link all dependencies into itself. * In BUILD_SHARED_LIBS: libMLIR-C.so will export the CAPI symbols and have dynamic dependencies on implementation shared libraries. * In libMLIR.so mode: same as static. libMLIR.so was not finished for actual linking use within the project. An eventual relayering so that libMLIR-C.so depends on libMLIR.so is possible but requires first re-engineering the latter to use the aggregate facility. * On Linux, exported symbols are filtered to only the CAPI. On others (MacOS, Windows), all symbols are exported. A CMake status is printed unless if global visibility is hidden indicating that this has not yet been implemented. The library should still work, but it will be larger and more likely to conflict until fixed. Someone should look at lifting the corresponding support from libLLVM-C.so and adapting. Or, for special uses, just build with `-DCMAKE_CXX_VISIBILITY_PRESET=hidden -DCMAKE_C_VISIBILITY_PRESET=hidden`. * Includes fixes to execution engine symbol export macros to enable default visibility. Without this, the advice to use hidden visibility would have resulted in test failures and unusable execution engine support libraries. Differential Revision: https://reviews.llvm.org/D113731	2021-11-11 22:58:13 -08:00
Mehdi Amini	1a6c26d1f5	Revert "[mlir] Allow out-of-tree python building from installed MLIR." This reverts commit `c7be8b7539`. Build is broken (multiple buildbots)	2021-11-12 02:30:53 +00:00
Stella Laurenzo	c7be8b7539	[mlir] Allow out-of-tree python building from installed MLIR. * Depends on D111504, which provides the boilerplate for building aggregate shared libraries from installed MLIR. * Adds a full-fledged Python example dialect and tests to the Standalone example (need to do a bit of tweaking in the top level CMake and lit tests to adapt better to if not building with Python enabled). * Rips out remnants of custom extension building in favor of `pybind11_add_module` which does the right thing. * Makes python and extension sources installable (outputs to src/python/${name} in the install tree): Both Python and C++ extension sources get installed as downstreams need all of this in order to build a derived version of the API. * Exports sources targets (with our properties that make everything work) by converting them to INTERFACE libraries (which have export support), as recommended for the forseeable future by CMake devs. Renames custom properties to start with lower-case letter, as also recommended/required (groan). * Adds a ROOT_DIR argument to `declare_mlir_python_extension` since now all C++ sources for an extension must be under the same directory (to line up at install time). * Need to validate against a downstream or two and adjust, prior to submitting. Downstreams will need to adapt by: * Remove absolute paths from any SOURCES for `declare_mlir_python_extension` (I believe all downstreams are just using `${CMAKE_CURRENT_SOURCE_DIR}` here, which can just be ommitted). May need to set `ROOT_DIR` if not relative to the current source directory. * To allow further downstreams to install/build, will need to make sure that all C++ extension headers are also listed under SOURCES for `declare_mlir_python_extension`. Reviewed By: stephenneuendorffer, mikeurbach Differential Revision: https://reviews.llvm.org/D111513	2021-11-11 18:04:31 -08:00
Mogball	b8186b313c	[mlir][ods] Unique attribute, successor, region constraints With `-Os` turned on, results in 2-5% binary size reduction (depends on the original binary). Without it, the binary size is essentially unchanged. Depends on D113128 Differential Revision: https://reviews.llvm.org/D113331	2021-11-12 01:04:08 +00:00
Thomas Raoux	e7969240dc	[mlir][VectorToGPU] Support more cases in conversion to MMA ops Support load with broadcast, elementwise divf op and remove the hardcoded restriction on the vector size. Picking the right size should be enfored by user and will fail conversion to llvm/spirv if it is not supported. Differential Revision: https://reviews.llvm.org/D113618	2021-11-11 13:10:38 -08:00
Nicolas Vasilache	8fd2f56c99	[mlir][Linalg] Add 1-d depthwise conv with opdsl Differential Revision: https://reviews.llvm.org/D113686	2021-11-11 17:49:26 +00:00
Nicolas Vasilache	800694a697	[mlir][Linalg] Make a LinalgStrategyDecomposePass available. Differential Revision: https://reviews.llvm.org/D113684	2021-11-11 17:47:27 +00:00
Stephan Herhut	b241226aec	[mlir][linalg] Avoid illegal elementwise fusion into reductions Fusing into a reduction is only valid if doing so does not erase information on a reduction dimensions size. Differential Revision: https://reviews.llvm.org/D113500	2021-11-11 15:56:12 +01:00
Nicolas Vasilache	74d9c4a7d8	[mlir] Fix build post `34ff857350`	2021-11-11 08:05:39 +00:00
Nicolas Vasilache	34ff857350	[mlir][X86Vector] Add specialized vector.transpose lowering patterns for AVX2 This revision adds an implementation of 2-D vector.transpose for 4x8 and 8x8 for AVX2 and surfaces it to the Linalg level of control. Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D113347	2021-11-11 07:33:31 +00:00
Mehdi Amini	f97e72aaca	Use base class AsmParser/AsmPrinter in Types and Attribute print/parse method (NFC) This decouples the printing/parsing from the "context" in which the parsing occurs. This will allow to invoke these methods directly using an OpAsmParser/OpAsmPrinter. Differential Revision: https://reviews.llvm.org/D113637	2021-11-11 06:26:33 +00:00
River Riddle	120591e126	[mlir] Replace usages of Identifier with StringAttr Identifier and StringAttr essentially serve the same purpose, i.e. to hold a string value. Keeping these seemingly identical pieces of functionality separate has caused problems in certain situations: * Identifier has nice accessors that StringAttr doesn't * Identifier can't be used as an Attribute, meaning strings are often duplicated between Identifier/StringAttr (e.g. in PDL) The only thing that Identifier has that StringAttr doesn't is support for caching a dialect that is referenced by the string (e.g. dialect.foo). This functionality is added to StringAttr, as this is useful for StringAttr in generally the same ways it was useful for Identifier. Differential Revision: https://reviews.llvm.org/D113536	2021-11-11 02:02:24 +00:00
lipracer	8165eaa885	[mlir](arithmetic) Add ceildivui to the arithmetic dialect The specific description is [[ https://llvm.discourse.group/t/adding-unsigned-integer-ceil-and-floor-in-std-dialect/4541 \| Adding unsigned integer ceil in Std Dialect ]] . When we lower ceilDivOp this will generate below code, sometimes we know m and n are unsigned intergal.Here are some redundant judgments about positive and negative. So we need to add some unsigned operations to simplify the instructions. ``` ceilDiv(n, m) x = (m > 0) ? -1 : 1 return (n*m>0) ? ((n+x) / m) + 1 : - (-n / m) ``` unsigned operations: ``` ceilDivU(n, m) return n ==0 ? 0 : ((n - 1) / m) + 1 ``` Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D113363	2021-11-11 01:49:14 +00:00
Matthias Springer	2e0d821bd5	[mlir][linalg][bufferize] Store analysis results in BufferizationAliasInfo * Store inplace bufferization decisions in `inplaceBufferized`. * Remove `InPlaceSpec`. Use a bool instead. * Use `BufferizableOpInterface::bufferizesToWritableMemory` and `bufferizesToWritableMemory` instead of `getInPlace(BlockArgument)`. The analysis does not care about inplacability of block arguments. It only cares whether the buffer can be written to or not. * The `kInPlaceResultsAttrName` op attribute is for testing purposes only. This commit further decouples BufferizationAliasInfo from other dialects such as SCF. Differential Revision: https://reviews.llvm.org/D113375	2021-11-11 10:36:49 +09:00
Matthias Springer	996d4ffe30	[mlir][linalg][bufferize] Fix bug in InitTensor elimination After replacing then init_tensor with a new value, the new value must be inserted into the corresponding union/equivalence sets. Differential Revision: https://reviews.llvm.org/D113374	2021-11-11 10:28:17 +09:00
Jacques Pienaar	7b9dea634e	[mlir] Fix predicate.td ODS test case	2021-11-10 17:11:55 -08:00
Jacques Pienaar	32b327e4ed	[mlir][ods] Use lambda in element type check pred rather than repeated casts Avoids multiple cast & getElementType calls. Just a local change for ShapedType containers but reduces one model case from 24.7 to 24.04s. Resultant code generated change: https://gist.github.com/jpienaar/7ffd2e9b0737134ba2ea2729b91c9572 Differential Revision: https://reviews.llvm.org/D113621	2021-11-10 16:27:37 -08:00
Jacques Pienaar	ec0b53d4e4	[mlir] Add traits, interfaces, effects to generated docs Simply emit traits, interfaces & effects (with some minimal formatting) to the generated docs to make this information easier to find in the docs. Differential Revision: https://reviews.llvm.org/D113539	2021-11-10 16:09:43 -08:00
Rob Suderman	860d3811a9	[mlir][tosa] Add lowering for tosa.pad with explicit value New TOSA pad operation can support explicitly specifying the pad value. Added lowering to linalg that uses the explicit value. Differential Revision: https://reviews.llvm.org/D113515	2021-11-10 14:15:20 -08:00
Uday Bondhugula	51ae78a6d6	[MLIR][Affine][NFC] affine.store op verifier message fix and check Fix typo in affine.store op verifier message and test case. Differential Revision: https://reviews.llvm.org/D113360	2021-11-11 01:52:23 +05:30
Kevin Cheng	bef966eb37	tosa-make-broadcatable pass now supports numpy style broadcasting only. - fix bug that in [c,1] + [a, b, c, d] broadcast - add test [3,3,4,1] + [4,5] Signed-off-by: Kevin Cheng <kevin.cheng@arm.com> Change-Id: Iaed2f04df8775f655c82c740271395274163d147 Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D113596	2021-11-10 11:48:35 -08:00
thomasraoux	5aa6038a40	[mlir] Make topologicalSort iterative and consider op regions When doing topological sort we need to make sure an op is scheduled before any of the ops within its regions. Also change the algorithm to not be recursive in order to prevent potential stack overflow. Differential Revision: https://reviews.llvm.org/D113423	2021-11-10 10:05:01 -08:00
thomasraoux	f309939d06	[mlir][nvvm] Remove special case ptr arithmetic lowering in gpu to nvvm Use existing helper instead of handling only a subset of indices lowering arithmetic. Also relax the restriction on the memref rank for the GPU mma ops as we can now support any rank. Differential Revision: https://reviews.llvm.org/D113383	2021-11-10 10:00:12 -08:00
Alex Zinenko	e64c76672f	[mlir] recursively convert builtin types to LLVM when possible Given that LLVM dialect types may now optionally contain types from other dialects, which itself is motivated by dialect interoperability and progressive lowering, the conversion should no longer assume that the outermost LLVM dialect type can be left as is. Instead, it should inspect the types it contains and attempt to convert them to the LLVM dialect. Introduce this capability for LLVM array, pointer and structure types. Only literal structures are currently supported as handling identified structures requires the converison infrastructure to have a mechanism for avoiding infite recursion in case of recursive types. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D112550	2021-11-10 18:11:00 +01:00
Tobias Gysi	b326eb64fd	[mli][linalg] Use CodegenStrategy to test interchange (NFC). Use CodegenStrategy instead of a separate test pass to test iterator interchange. Depends On D113409 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D113550	2021-11-10 15:44:44 +00:00
Tobias Gysi	659586bf19	[mlir][linalg] Remove padding test pass (NFC). Remove padding test pass that was replaced by CodegenStrategy. Depends On D113411 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D113412	2021-11-10 15:33:26 +00:00
Tobias Gysi	b676a67092	[mlir][linalg] Use CodegenStrategy to test hoisting (NFC). Use CodegenStrategy instead of a separate test pass to test hoisting. Depends On D113410 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D113411	2021-11-10 15:06:31 +00:00
Tobias Gysi	0c7c532643	[mli][linalg] Use CodegenStrategy to test padding (NFC). Use CodegenStrategy instead of a separate test pass to test padding. Depends On D113409 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D113410	2021-11-10 15:00:06 +00:00
Tobias Gysi	b86b2309ce	[mlir][linalg] Use AffineApplyOp to compute padding width (NFC). Use AffineApplyOp instead of SubIOp to compute the padding width when creating a pad tensor operation. Depends On D113382 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D113404	2021-11-10 14:53:52 +00:00
Tobias Gysi	ba2ac9c97c	[mli][linalg] Add flag to control CodegenStrategy enable pass. Add a flag to control if CodegenStrategy runs the EnablePass between the transformations. Depends On D113382 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D113409	2021-11-10 14:11:40 +00:00
Tobias Gysi	0609eb1b32	[mlir][linalg] Remove padding from tiling options. Remove the padding options from the tiling options since padding is now implemented by a separate pattern/pass introduced in https://reviews.llvm.org/D112412. The revsion remove the tile-and-pad-tensors.mlir and replaces it with the pad.mlir that tests padding in isolation (without tiling). Similarly, hoist-padding.mlir is replaced by pad-and-hoist.mlir introduced in https://reviews.llvm.org/D112713. Depends On D112838 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D113382	2021-11-10 13:33:28 +00:00
Denys Shabalin	aaea92e1cd	[mlir] Reintroduce nano time to execution_engine Prior change had a broken test that wasn't run by accident. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D113488	2021-11-10 13:14:18 +01:00
Matthias Springer	c3eb967e2a	[mlir][linalg][bufferize] Bufferize ops via PreOrder traversal The existing PostOrder traversal with special rules for certain ops was complicated and had a bug. Switch to PreOrder traversal. Differential Revision: https://reviews.llvm.org/D113338	2021-11-10 18:51:39 +09:00
Matthias Springer	99ad2079d4	[mlir][linalg][bufferize] Fix buffer equivalence around scf.if ops Also extend the comments for aliasInfo and equivalenceInfo. Differential Revision: https://reviews.llvm.org/D113340	2021-11-10 18:33:08 +09:00
Matthias Springer	f74f09128b	[mlir][linalg][bufferize] Relax tensor.insert_slice conflict rules A tensor.insert_slice write does not conflict with a subsequent read of the source if the source is originating from a matching tensor.extract_slice. Differential Revision: https://reviews.llvm.org/D113446	2021-11-10 18:23:29 +09:00
Stephan Herhut	c0cad9d535	[mlir][linalg] Enable insertion of dealloc for end2end tests This uses the buffer-deallocation pass or, in case the test case does not use bufferization, has added explicit deallocs. Differential Revision: https://reviews.llvm.org/D111059	2021-11-10 09:50:41 +01:00
Jacques Pienaar	d1a688ce0e	[mlir-c] Add Region iterators matching Block & Operation ones Enables using the same iterator interface to these even though underlying storage is different. Differential Revision: https://reviews.llvm.org/D113512	2021-11-09 17:52:56 -08:00
Mehdi Amini	1370f52bb7	Fix ODS Attribute/Type declarative assembly generator after API change for Attribute/Type print The change in `f30a8a6f67` conflicted with the recently landed feature on ODS assembly format for Attribute/Type.	2021-11-10 01:08:35 +00:00
Mehdi Amini	f30a8a6f67	Change the contract with the type/attribute parsing to let the dispatch handle the mnemonic This breaking change requires to remove printing the mnemonic in the print() method on Type/Attribute classes. This makes it consistent with the parsing code which alread handles the mnemonic outside of the parsing method. This likely won't break the build for anyone, but tests will start failing for dialects downstream. The fix is trivial and look like going from: void emitc::OpaqueType::print(DialectAsmPrinter &printer) const { printer << "opaque<\""; to: void emitc::OpaqueAttr::print(DialectAsmPrinter &printer) const { printer << "<\""; Reviewed By: rriddle, aartbik Differential Revision: https://reviews.llvm.org/D113334	2021-11-10 00:47:15 +00:00
Mehdi Amini	fd6b404183	Emit the boilerplate for Attribute printer/parser dialect dispatching from ODS Add a new `useDefaultAttributePrinterParser` boolean settings on the dialect (default to false for now) that emits the boilerplate to dispatch attribute parsing/printing to the auto-generated method. We will likely turn this on by default in the future. Differential Revision: https://reviews.llvm.org/D113329	2021-11-10 00:38:19 +00:00
Mehdi Amini	9d506ae0f6	Restructure the Test dialect ODS to include the AttrDef in TestOps.td (NFC) This structure is necessary to be able to use AttrDef as arguments on operations. Differential Revision: https://reviews.llvm.org/D113327	2021-11-10 00:38:19 +00:00
Mogball	2dd00c17e0	[mlir][ods] Cleanup of handling Op vs OpAdaptor In preparation for implementation subrange lookup on attributes. Depends on D113039 Reviewed By: jpienaar, Chia-hungDuan Differential Revision: https://reviews.llvm.org/D113128	2021-11-09 20:09:21 +00:00
Mehdi Amini	c296609b68	Revert "[mlir] Add nano precision clock to execution engine" This reverts commit `48d1f099d4`. Broke the MLIR buildbots	2021-11-09 18:12:42 +00:00
Denys Shabalin	48d1f099d4	[mlir] Add nano precision clock to execution engine Reviewed By: ftynse, nicolasvasilache Differential Revision: https://reviews.llvm.org/D113476	2021-11-09 14:32:36 +01:00
River Riddle	ae40d62541	[mlir] Refactor ElementsAttr's value access API There are several aspects of the API that either aren't easy to use, or are deceptively easy to do the wrong thing. The main change of this commit is to remove all of the `getValue<T>`/`getFlatValue<T>` from ElementsAttr and instead provide operator[] methods on the ranges returned by `getValues<T>`. This provides a much more convenient API for the value ranges. It also removes the easy-to-be-inefficient nature of getValue/getFlatValue, which under the hood would construct a new range for the type `T`. Constructing a range is not necessarily cheap in all cases, and could lead to very poor performance if used within a loop; i.e. if you were to naively write something like: ``` DenseElementsAttr attr = ...; for (int i = 0; i < size; ++i) { // We are internally rebuilding the APFloat value range on each iteration!! APFloat it = attr.getFlatValue<APFloat>(i); } ``` Differential Revision: https://reviews.llvm.org/D113229	2021-11-09 00:15:08 +00:00
Chia-hung Duan	2d99c815d7	[mlir-tblgen] Support `either` in Tablegen DRR. Add a new directive `either` to specify the operands can be matched in either order Reviewed By: jpienaar, Mogball Differential Revision: https://reviews.llvm.org/D110666	2021-11-08 23:16:03 +00:00
Chia-hung Duan	f3798ad5fa	Static verifier for type/attribute in DRR Generate static function for matching the type/attribute to reduce the memory footprint. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D110199	2021-11-08 21:34:17 +00:00
Suraj Sudhir	82568021dd	[mlir][tosa] Spec v0.23 updates Add pad_const field to tosa.pad. Add builders to enable optional construction of pad_const in pad op. Update documentation of tosa.clamp to match spec wording. Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com> Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D113322	2021-11-08 10:13:54 -08:00
Jeff Niu	9a2fdc369d	[MLIR] Attribute and type formats in ODS Declarative attribute and type formats with assembly formats. Define an `assemblyFormat` field in attribute and type defs with a `mnemonic` to generate a parser and printer. ```tablegen def MyAttr : AttrDef<MyDialect, "MyAttr"> { let parameters = (ins "int64_t":$count, "AffineMap":$map); let mnemonic = "my_attr"; let assemblyFormat = "`<` $count `,` $map `>`"; } ``` Use `struct` to define a comma-separated list of key-value pairs: ```tablegen def MyType : TypeDef<MyDialect, "MyType"> { let parameters = (ins "int":$one, "int":$two, "int":$three); let mnemonic = "my_attr"; let assemblyFormat = "`<` $three `:` struct($one, $two) `>`"; } ``` Use `struct(*)` to capture all parameters. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D111594	2021-11-08 17:38:28 +00:00
Tobias Gysi	1726c956ae	[mlir][linalg] Improve hoist padding buffer size computation. Adapt the Fourier Motzkin elimination to take into account affine computations happening outside of the cloned loop nest. Depends On D112713 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D112838	2021-11-08 12:02:57 +00:00
Tobias Gysi	9fbcad3298	[mlir][linalg] Improve the padding packing loop computation. The revision updates the packing loop search in hoist padding. Instead of considering all loops in the backward slice, we now compute a separate backward slice containing the index computations only. This modification ensures we do not add packing loops that are not used to index the packed buffer due to spurious dependencies. One instance where such spurious dependencies can appear is the extract slice operation introduced between the tile loops of a double tiling. Depends On D112412 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D112713	2021-11-08 10:20:33 +00:00
Shraiysh Vaishay	19a7e4729d	[MLIR][OpenMP] Added omp.sections and omp.section Added omp.sections and omp.section operation according to the section 2.8.1 of OpenMP Standard 5.0. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D110844	2021-11-06 19:27:35 +05:30
Aart Bik	38c366e467	[mlir][sparse] run more integration tests with and without SIMD Reviewed By: wrengr Differential Revision: https://reviews.llvm.org/D113205	2021-11-05 12:51:38 -07:00
River Riddle	4070f305f9	[mlir][DialectConversion] Legalize all live argument conversions Previously we didn't materialize conversions for arguments in certain cases as the implicit type propagation was being heavily relied on by many patterns. Now that those patterns have been fixed to properly handle type conversions, we can drop the special behavior. Differential Revision: https://reviews.llvm.org/D113233	2021-11-05 18:43:56 +00:00
Aart Bik	2f0ee17017	[mlir][sparse] test for SIMD reduction chaining in consecutive vector loops Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D113197	2021-11-05 10:14:17 -07:00
Alex Zinenko	6981e5ec91	[mlir][python] fix constructor generation for optional operands in presence of segment attribute The ODS-based Python op bindings generator has been generating incorrect specification of the operand segment in presence if both optional and variadic operand groups: optional groups were treated as variadic whereas they require separate treatement. Make sure it is the case. Also harden the tests around generated op constructors as they could hitherto accept the code for both optional and variadic arguments. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D113259	2021-11-05 12:40:27 +01:00
Christian Sigg	fce529fc6e	Fix `insertFunctionArguments()` block argument order. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D113171	2021-11-05 10:08:20 +01:00
Aart Bik	7373cabcda	[mlir][sparse] implement full reduction "scalarization" across loop nests The earlier reduction "scalarization" was only applied to a chain of innermost and for loops. This revision generalizes this to any nesting of for- and while-loops. This implies that reductions can be implemented with a lot less load and store operations. The chaining is implemented with a forest of yield statements (but not as bad as when we would also include the while-induction). Fixes https://bugs.llvm.org/show_bug.cgi?id=52311 Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D113078	2021-11-04 17:38:47 -07:00
not-jenni	07a029c057	Canonicalization for add to no-op if one of the inputs is zero Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D113207	2021-11-04 16:52:47 -07:00
Mogball	8129b04b8a	[mlir][ods] Op::verify should not call OpAdaptor::verify OpAdaptor::verify performs string lookups on an attribute dictionary. By calling OpAdaptor::verify, Op::verify is not able to use cached attribute identifiers for faster lookups. Reviewed By: jpienaar, rriddle Differential Revision: https://reviews.llvm.org/D113039	2021-11-04 19:12:55 +00:00
Aart Bik	4aa9b39824	[mlir][sparse] reject sparsity annotation in "scalar" tensors Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D113152	2021-11-04 09:49:05 -07:00
Tobias Gysi	29c31cb79b	[mlir][linalg] Add support for transitive fusion. Extend fusion on tensors to fuse producers greedily. Reviewed By: nicolasvasilache, hanchung Differential Revision: https://reviews.llvm.org/D110262	2021-11-04 16:25:06 +00:00
River Riddle	7f312f6d79	[mlir] Avoid folding in OpBuilder::tryFold when types change This was missed when tightening fold restrictions in https://reviews.llvm.org/D95991. Differential Revision: https://reviews.llvm.org/D113138	2021-11-03 20:35:46 +00:00
Butygin	1cb13fddb9	[mlir] spirv: Add some atomic ops Differential Revision: https://reviews.llvm.org/D112812	2021-11-03 14:47:12 +03:00
Alex Zinenko	34f72d9125	[mlir][python] expose the shape property of shaped types This has been missing in the original definition of shaped types. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D113025	2021-11-03 10:49:12 +01:00
Alex Zinenko	fc7594cc4a	[mlir][python] improve usability of Python affine construct bindings - Provide the operator overloads for constructing (semi-)affine expressions in Python by combining existing expressions with constants. - Make AffineExpr, AffineMap and IntegerSet hashable in Python. - Expose the AffineExpr composition functionality. Reviewed By: gysit, aoyal Differential Revision: https://reviews.llvm.org/D113010	2021-11-03 10:48:01 +01:00
rkayaith	f78fe0b7b8	[mlir][python] Make Operation and Value hashable This allows operations and values to be used as dict keys Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D112669	2021-11-03 10:40:03 +01:00
Nicolas Vasilache	9c4971740b	[mlir][Linalg] Refactor vectorization of conv1d more aggressively. This better decouples transfer read/write from vector-only rewrite of conv. This form is close to ready to plop into a new vector.conv op and the vector.transfer operations to be generalized as part of generic vectorization once the properties ConvolutionOpInterface are inferred from the indexing maps. This also results in a nice perf boost in the dw == 1 cases. Differential revision: https://reviews.llvm.org/D112822	2021-11-03 08:18:01 +00:00
Nicolas Vasilache	7b09f157e1	[mlir][Linalg] Refactor conv vectorization to decouple memory from vector ops. This refactoring prepares conv1d vectorization for a future integration into the generic codegen path. Once transfer_read / transfer_write vectorization also supports sliding windows, the special pattern for conv can disappear. This will also likely need a vector.conv operation. Differential Revision: https://reviews.llvm.org/D112797	2021-11-03 08:03:40 +00:00
Nicolas Vasilache	885072820c	[mlir][Vector] Add a pattern to lower 2-D vector.transpose to shape_cast+shuffle. The 2-D case can be rewritten to generate quite fewer instructions and a single vector.shuffle which seems to provide a nice performance boost. Add this arrow to our quiver by exposing it with a new vector transform option. Differential Revision: https://reviews.llvm.org/D113062	2021-11-02 22:12:46 +00:00
thomasraoux	7fbb0678fa	[mlir][VectorToGPU] Add support for elementwise mma to vector to GPU Differential Revision: https://reviews.llvm.org/D112960	2021-11-02 08:01:04 -07:00
Lei Zhang	7b615a87dc	[mlir][linalg] Rewrite `linalg.conv_2d_nhwc_hwcf` into 1-D We'd like to take a progressive approach towards Fconvolution op CodeGen, by 1) tiling it to fit compute hierarchy first, and then 2) tiling along window dimensions with size 1 to reduce the problem to be matmul-like. After that, we can 3) downscale high-D convolution ops to low-D by removing the size-1 window dimensions. The final step would be 4) vectorizing the low-D convolution op directly. We have patterns for 1), 2), and 4). This commit adds a pattern for 3) for `linalg.conv_2d_nhwc_hwcf` ops as a starter. Supporting other high-D convolution ops should be similar and mechanical. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D112928	2021-11-02 09:56:26 -04:00
Alex Zinenko	30d61893fb	[mlir] provide C API and Python bindings for symbol tables Symbol tables are a largely useful top-level IR construct, for example, they make it easy to access functions in a module by name instead of traversing the list of module's operations to find the corresponding function. Depends On D112886 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D112821	2021-11-02 14:22:58 +01:00
thomasraoux	8a992b20db	[mlir][gpu] Add basic support to do elementwise ops on mma matrix type In order to support fusion with mma matrix type we need to be able to execute elementwise operations on them. This add an op to be able to support some basic elementwise operations. This is a is not a full solution as it only supports a limited scope or operations. Ideally we would want to be able to fuse with more kind of operations. Differential Revision: https://reviews.llvm.org/D112857	2021-11-01 11:51:19 -07:00
MaheshRavishankar	d115a48e90	[mlir][python] Add test for tensor dialect. Differential Revision: https://reviews.llvm.org/D112781	2021-11-01 10:59:31 -07:00
thomasraoux	77eafb8430	[mlir][nvvm] Generalize wmma ops to handle more types and shapes wmma intrinsics have a large number of combinations, ideally we want to be able to target all the different variants. To avoid a combinatorial explosion in the number of mlir op we use attributes to represent the different variation of load/store/mma ops. We also can generate with tablegen helpers to know which combinations are available. Using this we can avoid having too hardcode a path for specific shapes and can support more types. This patch also adds boiler plates for tf32 op support. Differential Revision: https://reviews.llvm.org/D112689	2021-11-01 10:27:26 -07:00
Weiwei Li	3483fc5a31	[mlir][SPIRVToLLVM] Add shufflevector conversion Add the shufflevector conversion. It only handles the static, i.e., IntegerAttr, index. Co-authored: Xinyi Liu <xyliuhelen@gmail.com> Reviewed by: antiagainst Differential revision: https://reviews.llvm.org/D112161	2021-11-01 23:05:37 +08:00
Alex Zinenko	24685aaeb7	[mlir][python] allow for detaching operations from a block Provide support for removing an operation from the block that contains it and moving it back to detached state. This allows for the operation to be moved to a different block, a common IR manipulation for, e.g., module merging. Also fix a potential one-past-end iterator dereference in Operation::moveAfter discovered in the process. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D112700	2021-10-31 09:42:15 +01:00
xndcn	6e2c0e6931	[mlir][spirv] Add conversions from arith.bitcast, std.br, std.cond_br to spirv. Differential Revision: https://reviews.llvm.org/D112819	2021-10-31 00:40:35 +08:00
wren romano	6be36fd794	[mlir][sparse] Improve handling of dynamic-sizes for sparse=>dense conversion Allows the result to be more dynamically-sized than the source. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D112854	2021-10-29 17:44:40 -07:00
Aart Bik	0121c96f37	[mlir][sparse] refine the mixed width sparse conversion test Added a type with different pointer/index bit width. Also added some sanity CHECKs on the stored indices. Reviewed By: wrengr Differential Revision: https://reviews.llvm.org/D112778	2021-10-29 13:31:04 -07:00
Ahmed Taei	813fa79c15	Don't drop in_bounds when vector-transfer-collapse-inner-most-dims When operand is a subview we don't infer in_bounds and some default cases (e.g case in the tests) will crash with `operand is NULL` when converting to LLVM Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D112772	2021-10-29 09:07:57 -07:00
Tobias Gysi	6638112b42	[mlir][linalg] Add padding pass to strategy passes. Add a strategy pass that pads and hoists after tiling and fusion. Depends On D112412 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D112480	2021-10-29 15:30:42 +00:00
Tobias Gysi	d0ec4a8ed9	[mlir][linalg] Add pad and hoist test pass. Adding a padding and hoisting pattern, a test pass, and tests. The patch prepares the split of tiling/fusion and padding. Depends On D112255 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D112412	2021-10-29 15:08:16 +00:00
Adrian Kuegel	9fb1086b94	[mlir][python] Add a __contains__ method to the python bindings for DictionaryAttr. This makes it easier to check in python whether a certain attribute is there. Differential Revision: https://reviews.llvm.org/D112814	2021-10-29 15:19:16 +02:00
Tobias Gysi	e83d8466fb	[mlir][linalg] Adapt hoistPaddingOnTensors signature to support patterns (NFC). Adapt hoistPaddingOnTensors to leave replacing and erasing the old pad tensor operation to the caller. This change makes the function pattern friendly. Depends On D112003 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D112255	2021-10-29 06:51:38 +00:00
Aart Bik	185960dc8d	[mlir][sparse] fix conversion bug when changing pointer/index sizes Reviewed By: wrengr Differential Revision: https://reviews.llvm.org/D112770	2021-10-28 17:24:38 -07:00
wren romano	5389cdc8f6	[mlir][sparse] Adding dynamic-size support for sparse=>dense conversion Depends On D110790 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D112674	2021-10-28 16:56:18 -07:00
wren romano	28882b6575	[mlir][sparse] Implementing sparse=>dense conversion. Depends On D110882, D110883, D110884 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D110790	2021-10-28 15:27:35 -07:00
Eugene Zhulenev	627fa0b9a8	[mlir] MathApproximations: unroll virtual vectors into hardware vectors for ISA specific operation Reviewed By: cota Differential Revision: https://reviews.llvm.org/D112736	2021-10-28 12:52:04 -07:00
Markus Böck	10a80c4413	[mlir] Implement replacement of SymbolRefAttrs in Dialect attributes using SubElementAttr interface This patch extends the SubElementAttr interface to allow replacing a contained sub attribute. The attribute that should be replaced is identified by an index which denotes the n-th element returned by the accompanying walkImmediateSubElements method. Using this addition the patch implements replacing SymbolRefAttrs contained within any dialect attributes. Differential Revision: https://reviews.llvm.org/D111357	2021-10-28 19:08:20 +02:00
Aart Bik	947e14be98	[mlir][sparse] move conversion test back to original CHECK testing Rationale: The silent exit(1) gives little clues on where the error occurs on failure and may even be confusing at first. The CHECK testing of all computed values and indices may be a little bit more elaborate, but it directly pinpoints where errors happen if they occur. This style is also consistent with the other tests, which I actually prefer. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D112688	2021-10-28 09:03:26 -07:00
Uday Bondhugula	57b9b29649	[MLIR][LLVM] Add llvm.mlir.global_ctors/dtors and translation support Add llvm.mlir.global_ctors and global_dtors ops and their translation support to LLVM global_ctors/global_dtors global variables. Differential Revision: https://reviews.llvm.org/D112524	2021-10-28 18:09:34 +05:30
Shraiysh Vaishay	30bd11fab4	[MLIR][OpenMP] Fixed the missing inclusive clause in omp.wsloop and fix order clause This patch adds the inclusive clause (which was missed in previous reorganization - https://reviews.llvm.org/D110903) in omp.wsloop operation. Added a test for validating it. Also fixes the order clause, which was not accepting any values. It now accepts "concurrent" as a value, as specified in the standard. Reviewed By: kiranchandramohan, peixin, clementval Differential Revision: https://reviews.llvm.org/D112198	2021-10-28 14:18:05 +05:30
thomasraoux	eacd6e1ebe	[mlir][GPUtoNVVM] Relax restriction on wmma op lowering Allow lowering of wmma ops with 64bits indexes. Change the default version of the test to use default layout. Differential Revision: https://reviews.llvm.org/D112479	2021-10-27 21:31:55 -07:00
Roman Lebedev	42712698fd	Revert "[IR] `IRBuilderBase::CreateAdd()`: short-circuit `x + 0` --> `x`" Clang OpenMP codegen tests are failing. This reverts commit `288f1f8abe`. This reverts commit `cb90e5356a`.	2021-10-27 22:21:37 +03:00
Roman Lebedev	288f1f8abe	Fix MLIR LLVMIR test after `4723c9b3c6`	2021-10-27 21:52:56 +03:00
Eugene Zhulenev	0d9b478932	[mlir] Reduce the number of iterations in async microbenchmarks Differential Revision: https://reviews.llvm.org/D112609	2021-10-27 03:20:06 -07:00
Matthias Springer	5b98e4ed16	[mlir][linalg][bufferize] Add analysis fuzzer option Analyze ops in a pseudo-random order to see if any assertions are triggered. Randomizing the order of analysis likely worsens the quality of the bufferization result (more out-of-place bufferizations). However, assertions should never fail, as that would indicate a problem with our implementation. Differential Revision: https://reviews.llvm.org/D112581	2021-10-27 17:37:56 +09:00
Shraiysh Vaishay	9fb52cb3f1	[MLIR][OpenMP] Added omp.atomic.read and omp.atomic.write This patch supports the atomic construct (read and write) following section 2.17.7 of OpenMP 5.0 standard. Also added tests and verifier for the same. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D111992	2021-10-27 14:05:44 +05:30
Nicolas Vasilache	00ac874ff6	[mlir][Vector] Add InsertStridedSliceOp -> ShuffleOp for the rank-1 cases. This also fixes the vector.shuffle C++ builder which had an incorrect type assumption that triggers with this new rewrite. The vector.shuffle semantics were correct though. Differential revision: https://reviews.llvm.org/D112578	2021-10-27 07:57:17 +00:00
River Riddle	015192c634	[mlir:DialectConversion] Restructure how argument/target materializations get invoked The current implementation invokes materializations whenever an input operand does not have a mapping for the desired type, i.e. it requires materialization at the earliest possible point. This conflicts with goal of dialect conversion (and also the current documentation) which states that a materialization is only required if the materialization is supposed to persist after the conversion process has finished. This revision refactors this such that whenever a target materialization "might" be necessary, we insert an unrealized_conversion_cast to act as a temporary materialization. This allows for deferring the invocation of the user materialization hooks until the end of the conversion process, where we actually have a better sense if it's actually necessary. This has several benefits: * In some cases a target materialization hook is no longer necessary When performing a full conversion, there are some situations where a temporary materialization is necessary. Moving forward, these users won't need to provide any target materializations, as the temporary materializations do not require the user to provide materialization hooks. * getRemappedValue can now handle values that haven't been converted yet Before this commit, it wasn't well supported to get the remapped value of a value that hadn't been converted yet (making it difficult/impossible to convert multiple operations in many situations). This commit updates getRemappedValue to properly handle this case by inserting temporary materializations when necessary. Another code-health related benefit is that with this change we can move a majority of the complexity related to materializations to the end of the conversion process, instead of handling adhoc while conversion is happening. Differential Revision: https://reviews.llvm.org/D111620	2021-10-27 02:09:04 +00:00
Jacques Pienaar	0ef217d8e1	[mlir] Fix missing prefix for region accessor on OpAdaptor Also flip op-decl-and-defs test to _Prefixed to test more.	2021-10-26 17:35:16 -07:00
Aart Bik	1e6ef0cfb0	[mlir][sparse] refine trait of sparse_tensor.convert Rationale: The currently used trait was demanding that all types are the same which is not true (since the sparse part may change and the dim sizes may be relaxed). This revision uses the correct trait and makes the rank match test explicit in the verify method. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D112576	2021-10-26 14:36:49 -07:00
Alexander Belyaev	96cee29762	[mlir] Allow polynomial approximations for N-d vectors. Polynomial approximation can be extented to support N-d vectors. N-dimensional vectors are useful when vectorizing operations on N-dimensional tiles. Before lowering to LLVM these vectors are usually unrolled or flattened to 1-dimensional vectors. Differential Revision: https://reviews.llvm.org/D112566	2021-10-26 20:50:00 +02:00
Stella Laurenzo	d86688fb1f	[mlir][python] Segment MLIR Python test dialect to avoid testonly dependency. With https://reviews.llvm.org/rG14c9207063bb00823a5126131e50c93f6e288bd3, the build is broken with -DMLIR_INCLUDE_TESTS=OFF. This patch fixes the build and we may want to do a better fix to the layering in a followup. Differential Revision: https://reviews.llvm.org/D112560	2021-10-26 18:47:36 +00:00
Amy Zhuang	b9ae741d3e	[mlir] Fix getVectorReductionOp 1.Combining kind min/max of Vector reduction op has been changed to minf/maxf, minsi/maxsi, and minui/maxui. Modify getVectorReductionOp accordingly. 2.Add min/max to supported reductions. Reviewed By: dcaballe, nicolasvasilache Differential Revision: https://reviews.llvm.org/D112246	2021-10-26 08:42:34 -07:00
Uday Bondhugula	41a8b46007	[MLIR] Fix AffineExpr getLargestKnownDivisor for ceildiv and floordiv Fix AffineExpr `getLargestKnownDivisor` for ceil/floor div cases. In these cases, nothing can be inferred on the divisor of the result. Add test case for `mod` as well. Differential Revision: https://reviews.llvm.org/D112523	2021-10-26 16:21:29 +05:30
Mehdi Amini	f431d3878a	Make Python MLIR Operation not iterable The current behavior is conveniently allowing to iterate on the regions of an operation implicitly by exposing an operation as Iterable. However this is also error prone and code that may intend to iterate on the results or the operands could end up "working" apparently instead of throwing a runtime error. The lack of static type checking in Python contributes to the ambiguity here, it seems safer to not do this and require and explicit qualification to iterate (`op.results`, `op.regions`, ...). Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D111697	2021-10-26 07:21:09 +00:00
Matthias Kramm	16e530d43b	When generating C++ code, use C++ string escaping. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D112468	2021-10-25 22:53:44 +00:00
Robert Suderman	58901a5a29	[mlir][tosa] Correct tosa.avg_pool2d for specification error Specification specified the output type for quantized average pool should be an i32. Only accumulator should be an i32, result type should match the input type. Caused in https://reviews.llvm.org/D111590 Reviewed By: sjarus, GMNGeoffrey Differential Revision: https://reviews.llvm.org/D112484	2021-10-25 14:41:16 -07:00
MaheshRavishankar	2f572818b0	[mlir][Linalg] Allow comprehensive bufferization to use callbacks for alloc/dealloc. Using callbacks for allocation/deallocation allows users to override the default. Also add an option to comprehensive bufferization pass to use `alloca` instead of `alloc`s. Note that this option is just for testing. The option to use `alloca` does not work well with the option to allow for returning memrefs.	2021-10-25 12:43:10 -07:00
Boian Petkantchin	f1b922188e	[MLIR][Math] Add erf to math dialect Add math.erf lowering to libm call. Add math.erf polynomial approximation. Reviewed By: silvas, ezhulenev Differential Revision: https://reviews.llvm.org/D112200	2021-10-25 18:30:17 +00:00
Aart Bik	1b15160ef3	[mlir][sparse] lower trivial tensor.cast on identical sparse tensors Even though tensor.cast is not part of the sparse tensor dialect, it may be used to cast static dimension sizes to dynamic dimension sizes for sparse tensors without changing the actual sparse tensor itself. Those cases should be lowered properly when replacing sparse tensor types with their opaque pointers. Likewise, no op sparse conversions are handled by this revision in a similar manner. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D112173	2021-10-25 10:30:19 -07:00
MaheshRavishankar	5fb46a9fa3	Revert "[mlir][Linalg] Allow comprehensive bufferization to use callbacks for alloc/dealloc." This reverts commit `c86f218fe4`. Revert because it causes build failure.	2021-10-25 08:57:53 -07:00
MaheshRavishankar	c86f218fe4	[mlir][Linalg] Allow comprehensive bufferization to use callbacks for alloc/dealloc. Using callbacks for allocation/deallocation allows users to override the default. Also add an option to comprehensive bufferization pass to use `alloca` instead of `alloc`s. Note that this option is just for testing. The option to use `alloca` does not work well with the option to allow for returning memrefs. Differential Revision: https://reviews.llvm.org/D112166	2021-10-25 08:50:25 -07:00
Nicolas Vasilache	d054b80bd3	[mlir][Vector] NFC - Add option to hook vector.transpose lowering to strategies. This revision also moves some code around to improve overall structure. Differential Revision: https://reviews.llvm.org/D112437	2021-10-25 12:26:33 +00:00
Nicolas Vasilache	176a0ea535	[mlr][Linalg] NFC - Add option to hook vector.multi_reduction lowering to strategies. Differential Revision: https://reviews.llvm.org/D112414	2021-10-25 11:31:39 +00:00
Alex Zinenko	2995d29bb4	[mlir][python] Infer result types in generated constructors whenever possible In several cases, operation result types can be unambiguously inferred from operands and attributes at operation construction time. Stop requiring the user to provide these types as arguments in the ODS-generated constructors in Python bindings. In particular, handle the SameOperandAndResultTypes and FirstAttrDerivedResultType traits as well as InferTypeOpInterface using the recently added interface support. This is a significant usability improvement for IR construction, similar to what C++ ODS provides. Depends On D111656 Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D111811	2021-10-25 12:50:44 +02:00
Alex Zinenko	14c9207063	[mlir] support interfaces in Python bindings Introduce the initial support for operation interfaces in C API and Python bindings. Interfaces are a key component of MLIR's extensibility and should be available in bindings to make use of full potential of MLIR. This initial implementation exposes InferTypeOpInterface all the way to the Python bindings since it can be later used to simplify the operation construction methods by inferring their return types instead of requiring the user to do so. The general infrastructure for binding interfaces is defined and InferTypeOpInterface can be used as an example for binding other interfaces. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D111656	2021-10-25 12:50:42 +02:00
Shraiysh Vaishay	a81672b31a	[NFC][MLIR][OpenMP] Splitting the WsLoop tests. Splitting the WsLoop tests they were getting harder to debug with the offsets over 100 for some of them. Reviewed By: clementval Differential Revision: https://reviews.llvm.org/D112407	2021-10-25 14:00:36 +05:30
Nicolas Vasilache	1b702eea94	[mlir][Linalg] NFC - Reorganize options nesting. This removes duplication and makes nesting more clear. It also reduces the amount of changes necessary for exposing future options. Differential revision: https://reviews.llvm.org/D112344	2021-10-25 06:21:30 +00:00
Jacques Pienaar	42e9af9e8f	[mlir] Rename to avoid overlap in accessor prefixing Split out renaming from D112383 into standalone change.	2021-10-24 18:17:09 -07:00
Nicolas Vasilache	e03b443113	Revert "[mlir][Linalg] NFC - Reorganize options nesting." This reverts commit `4703a07e6c`. Didnt' mean to push this yet, sorry about the noise.	2021-10-23 13:46:22 +00:00
Nicolas Vasilache	4703a07e6c	[mlir][Linalg] NFC - Reorganize options nesting. This removes duplication and makes nesting more clear. It also reduces the amount of changes necessary for exposing future options. Differential revision: https://reviews.llvm.org/D112344	2021-10-23 13:03:01 +00:00
Emilio Cota	35553d452b	[mlir] Add polynomial approximation for vectorized math::Rsqrt This patch adds a polynomial approximation that matches the approximation in Eigen. Note that the approximation only applies to vectorized inputs; the scalar rsqrt is left unmodified. The approximation is protected with a flag since it emits an AVX2 intrinsic (generated via the X86Vector). This is the only reasonably clean way that I could find to generate the exact approximation that I wanted (i.e. an identical one to Eigen's). I considered two alternatives: 1. Introduce a Rsqrt intrinsic in LLVM, which doesn't exist yet. I believe this is because there is no definition of Rsqrt that all backends could agree on, since hardware instructions that implement it have widely varying degrees of precision. This is something that the standard could mandate, but Rsqrt is not part of IEEE754, so I don't think this option is feasible. 2. Emit fdiv(1.0, sqrt) with fast math flags to allow reciprocal transformations. Although portable, this doesn't allow us to generate exactly the code we want; it is the LLVM backend, and not MLIR, who controls what code is generated based on the target CPU. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D112192	2021-10-23 04:56:12 -07:00
Nicolas Vasilache	89d55d3c86	[mlir][Linalg] Retire CodegenStrategy::transform Instead each pass should constructed a nested OpPassManager and runPipeline on that. Differential Revision: https://reviews.llvm.org/D112308	2021-10-22 20:27:14 +00:00
Nicolas Vasilache	489fec2777	[mlir][Linalg] NFC - Drop Optional in favor of FailureOr Differential revision: https://reviews.llvm.org/D112332	2021-10-22 19:28:18 +00:00
Mats Petersson	3f00e10bdd	[mlir][OpenMP]Support for modifiers in workshare loops Pass the modifiers from the Flang parser to FIR/MLIR workshare loop operation. Not yet supporting the SIMD modifier, which is a bit more work than just adding it to the list of modifiers, so will go in a separate patch. This adds a new field to the WsLoopOp. Also add test for dynamic WSLoop, checking that dynamic schedule calls the init and next functions as expected. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D111053	2021-10-22 14:19:33 +01:00
Matthias Springer	3bbc869e2e	[mlir][linalg][bufferize] Support scf::IfOp This commit adds support for scf::IfOp to comprehensive bufferization. Support is currently limited to cases where both branches yield tensors that bufferize to the same buffer. To keep the analysis simple, scf::IfOp are treated as memory writes for analysis purposes, even if no op inside any branch is writing. (scf::ForOps are handled in the same way.) Differential Revision: https://reviews.llvm.org/D111929	2021-10-22 10:12:55 +09:00
Mogball	516884f58b	[MLIR] Fix FloorDivSIOpConverter that was failing for index type after the arithmetic op refactor ConstantOp should be used instead of ConstantIntOp to be able to support index type. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D112191	2021-10-21 21:42:30 +00:00
thomasraoux	93d0ade17c	[mlir][linalg] Remove special case for contraction vectorization Handle contraction op like all the other generic op reductions. This simpifies the code. We now rely on contractionOp canonicalization to keep the same code quality. Differential Revision: https://reviews.llvm.org/D112171	2021-10-21 14:10:54 -07:00
thomasraoux	1d8cc45b0e	[mlir][vector] Add patterns to convert multidimreduce to vector.contract add several patterns that will simplify contraction vectorization in the future. With those canonicalizationns we will be able to remove the special case for contration during vectorization and rely on those transformations to avoid materizalizing broadcast ops. Differential Revision: https://reviews.llvm.org/D112121	2021-10-21 14:03:32 -07:00
Ahmed Taei	21f9e4a1ed	Avoid infinity arithmetics when computing exp approximations Otherwise this can result a poison value on some platforms see https://bugs.llvm.org/show_bug.cgi?id=51204 Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D112115	2021-10-21 10:09:18 -07:00
Nicolas Vasilache	203accf0bd	[mlir][Linalg] Improve conv vectorization for the stride==1 case. In the stride == 1 case, conv1d reads contiguous data along the input dimension. This can be advantageaously used to bulk memory transfers and compute while avoiding unrolling. Experimentally, this can yield speedups of up to 50%. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D112139	2021-10-21 15:18:28 +00:00
Matthias Springer	7a7e93f122	[mlir][linalg][bufferize] Avoid creating copies that are never read Differential Revision: https://reviews.llvm.org/D111956	2021-10-21 21:47:00 +09:00
Matthias Springer	c5501a7a5c	[mlir][linalg][bufferize] Eliminate InitTensorOps of InsertSliceOp sources An InitTensorOp is replaced with an ExtractSliceOp on the InsertSliceOp's destination. This optimization is applied after analysis and only to InsertSliceOps that were decided to bufferize inplace. Another analysis on the new ExtractSliceOp is needed after the rewrite. Differential Revision: https://reviews.llvm.org/D111955	2021-10-21 21:33:45 +09:00
Benjamin Kramer	898e80964c	[mlir] Fix a crash when creating a 1d zero element LLVM constant Fixes a regression introduced in `f9be7a7afd` Differential Revision: https://reviews.llvm.org/D112208	2021-10-21 12:55:08 +02:00
Peixin-Qiao	b37e5187f2	[MLIR][OpenMP] Add support for ordered construct This patch supports the ordered construct in OpenMP dialect following Section 2.19.9 of the OpenMP 5.1 standard. Also lowering to LLVM IR using OpenMP IRBduiler. Lowering to LLVM IR for ordered simd directive is not supported yet since LLVM optimization passes do not support it for now. Reviewed By: kiranchandramohan, clementval, ftynse, shraiysh Differential Revision: https://reviews.llvm.org/D110015	2021-10-21 16:30:46 +08:00
Matthias Springer	9c55e718f5	[mlir][linalg][bufferize] Bufferize using PostOrder traversal This is required for bufferization of scf::IfOp, which is added in a subsequent commit. Some ops (scf::ForOp, TiledLoopOp) require PreOrder traversal to make sure that bbArgs are mapped before bufferizing the loop body. Differential Revision: https://reviews.llvm.org/D111924	2021-10-21 17:21:52 +09:00
Mehdi Amini	cb11ddb96c	Revert "[MLIR][OpenMP] Add support for ordered construct" This reverts commit `dc2be87ecf`. Seems like this broke all the CI bots.	2021-10-21 04:53:45 +00:00
Peixin-Qiao	dc2be87ecf	[MLIR][OpenMP] Add support for ordered construct This patch supports the ordered construct in OpenMP dialect following Section 2.19.9 of the OpenMP 5.1 standard. Also lowering to LLVM IR using OpenMP IRBduiler. Lowering to LLVM IR for ordered simd directive is not supported yet since LLVM optimization passes do not support it for now. Reviewed By: kiranchandramohan, clementval, ftynse, shraiysh Differential Revision: https://reviews.llvm.org/D110015	2021-10-21 09:16:04 +08:00
Aart Bik	bd5494d127	[mlir][sparse] make index type explicit in public API of support library The current implementation used explicit index->int64_t casts for some, but not all instances of passing values of type "index" in and from the sparse support library. This revision makes the situation more consistent by using new "index_t" type at all such places (which allows for less trivial casting in the generated MLIR code). Note that the current revision still assumes that "index" is 64-bit wide. If we want to support targets with alternative "index" bit widths, we need to build the support library different. But the current revision is a step forward by making this requirement explicit and more visible. Reviewed By: wrengr Differential Revision: https://reviews.llvm.org/D112122	2021-10-20 12:46:31 -07:00
Ahmed S. Taei	a3dd4e7770	Drop transfer_read inner most unit dimensions Add a pattern to take a rank-reducing subview and drop inner most contiguous unit dim. This is useful when lowering vector to backends with 1d vector types. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D111561	2021-10-20 19:27:04 +00:00
Alex Zinenko	310736e098	[mlir] fix region property generation in python bindings	2021-10-20 19:00:59 +02:00
Shraiysh Vaishay	c4c7e06bd7	[MLIR][OpenMP] Shifted hint from CriticalOp to CriticalDeclareOp According to the OpenMP 5.0 standard, names and hints of critical operation are closely related. The following are the restrictions on them: - Unless the effect is as if `hint(omp_sync_hint_none)` was specified, the critical construct must specify a name. - If the hint clause is specified, each of the critical constructs with the same name must have a hint clause for which the hint-expression evaluates to the same value. These restrictions will be enforced by design if the hint expression is a part of the `omp.critical.declare` operation. - Any operation with no "name" will be considered to have `hint(omp_sync_hint_none)`. - All the operations with the same "name" will have the same hint value. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D112134	2021-10-20 21:36:09 +05:30
Jacques Pienaar	6a99423390	[mlir] Expand prefixing to OpFormatGen Follow up to also use the prefixed emitters in OpFormatGen (moved getGetterName(s) and getSetterName(s) to Operator as that is most convenient usage wise even though it just depends on Dialect). Prefix accessors in Test dialect and follow up on missed changes in OpDefinitionsGen. Differential Revision: https://reviews.llvm.org/D112118	2021-10-20 07:08:37 -07:00
Nicolas Vasilache	6bb7d2474f	[mlir][Linalg] Add a first vectorization pattern for conv1d in NWCxWCF format. This revision uses the newly refactored StructuredGenerator to create a simple vectorization for conv1d_nwc_wcf. Note that the pattern is not specific to the op and is technically not even specific to the ConvolutionOpInterface (modulo minor details related to dilations and strides). The overall design follows the same ideas as the lowering of vector::ContractionOp -> vector::OuterProduct: it seeks to be minimally complex, composable and extensible while avoiding inference analysis. Instead, we metaprogram the maps/indexings we expect and we match against them. This is just a first stab and still needs to be evaluated for performance. Other tradeoffs are possible that should be explored. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D111894	2021-10-20 13:54:18 +00:00
Kojo Acquah	9c62bb55f4	Implementation of `ReshapeNoopOptimization` canonicalizer. This canonicalizer replaces reshapes of constant tensors that contain the updated shape (skipping the reshape operation). Differential Revision: https://reviews.llvm.org/D112038	2021-10-19 16:07:34 -07:00
bakhtiyar	f97f946839	Canonicalize max/min operations on integers. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D112051	2021-10-19 05:25:59 -07:00
Shraiysh Vaishay	d576f45014	[MLIR][OpenMP] Added parseClauses Code reorganized in OpenMPDialect.cpp to have all functions corresponding to an operation together. Added parseClauses function to avoid code duplication while parsing clauses in OpenMP operations. Also added printers and verifiers for clauses, which are being used for multiple operations. Reviewed By: kiranchandramohan, peixin Differential Revision: https://reviews.llvm.org/D110903	2021-10-19 17:31:36 +05:30
Vladislav Vinogradov	e41ebbecf9	[mlir][RFC] Refactor layout representation in MemRefType The change is based on the proposal from the following discussion: https://llvm.discourse.group/t/rfc-memreftype-affine-maps-list-vs-single-item/3968 * Introduce `MemRefLayoutAttr` interface to get `AffineMap` from an `Attribute` (`AffineMapAttr` implements this interface). * Store layout as a single generic `MemRefLayoutAttr`. This change removes the affine map composition feature and related API. Actually, while the `MemRefType` itself supported it, almost none of the upstream can work with more than 1 affine map in `MemRefType`. The introduced `MemRefLayoutAttr` allows to re-implement this feature in a more stable way - via separate attribute class. Also the interface allows to use different layout representations rather than affine maps. For example, the described "stride + offset" form, which is currently supported in ASM parser only, can now be expressed as separate attribute. Reviewed By: ftynse, bondhugula Differential Revision: https://reviews.llvm.org/D111553	2021-10-19 12:31:15 +03:00
not-jenni	4ada6c2aaf	[mlir][tosa] Adds a canonicalization to the transpose op if the perms are a no op Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D112037	2021-10-18 16:30:53 -07:00
wren romano	bd0cae6d16	[mlir][sparse] Renaming variables for consistency/clarity Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D112029	2021-10-18 15:12:03 -07:00
Aart Bik	9d1db3d4a1	[mlir][sparse] generalize sparse_tensor.convert on static/dynamic dimension sizes This revison lifts the artificial restriction on having exact matches between source and destination type shapes. A static size may become dynamic. We still reject changing a dynamic size into a static size to avoid the need for a runtime "assert" on the conversion. This revision also refactors some of the conversion code to share same-content buffers. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D111915	2021-10-18 13:54:03 -07:00
Eugene Zhulenev	bf32bb7e05	[mlir] Update approximation range for Tanh operation Use wider range for approximating Tanh to match results computed in Eigen with AVX. Reviewed By: cota Differential Revision: https://reviews.llvm.org/D112011	2021-10-18 10:57:31 -07:00
Caitlyn Cano	2ea5e7ba57	[mlir] SPIR-V: add sin, cos, log, sqrt OCL ops Differential Revision: https://reviews.llvm.org/D111884	2021-10-18 20:48:59 +03:00
Jacques Pienaar	62bf850910	[mlir] Flipping Test dialect to prefixed form _Both Starting with a mostly NFC change to be able to differentiate between mechanical changes from ones that require more detailed review. This will be used to flush out flow before flipping dialects used outside local testing. As this dialect is not intended to be used generally rather than in tests in core, I will not be following 2 week staging approach here.	2021-10-18 10:00:37 -07:00
Mathieu Fehr	d78136121e	[mlir] Add AnyAttrOf tablegen attribute constraint AnyAttrOf, similar to AnyTypeOf, expects the attribute to be one of the given attributes. For instance, `AnyAttrOf<[I32Attr, StrAttr]>` expects either a `I32Attr`, or a `StrAttr`. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D111739	2021-10-18 15:45:25 +00:00
Ahmed Taei	b0c4aaff24	Allow only valid vector.shape_cast transitive folding When folding A->B->C => A->C only accept A->C that is valid shape cast Reviewed By: ThomasRaoux, nicolasvasilache Differential Revision: https://reviews.llvm.org/D111473	2021-10-18 07:57:55 -07:00
rkayaith	d5429a13da	[mlir][python] Add 'loc' property to ops Add a read-only `loc` property to Operation and OpView Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D111972	2021-10-18 16:01:12 +02:00
William S. Moses	40b9c39db1	[MLIR][LLVM] Add memset intrinsic Add memset intrinsic into LLVM dialect Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D111906	2021-10-16 18:20:48 -04:00
Matthias Springer	e7bb8dd929	[mlir][linalg][bufferize] Relax rules for extract_slice/insert_slice matching The rules were too restrictive, causing out-of-place bufferization when the result of two ExtractSliceOp is fed into an InsertSliceOp. Differential Revision: https://reviews.llvm.org/D111861	2021-10-16 17:08:47 +09:00
Jacques Pienaar	965ec6dbe7	[mlir] Add folder for shape.add	2021-10-15 17:30:17 -07:00
Aart Bik	e9b1c974be	[mlir][sparse] run less combinations of SpMM in test (to reduce runtime) This revision also adds a few passes to the sparse compiler part to unify the transformation sequence with all other paths we currently use. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D111900	2021-10-15 16:04:01 -07:00
Geoffrey Martin-Noble	efc6fe963c	[MLIR][TOSA] Drop "OnTensors" suffix This is the only lowering to Linalg Tosa has, so it's needlessly verbose. Likely this was a carry over from IREE's usage where we originally lowered to linalg on buffers (the only linalg that existed at the time), so the everything on tensors needed the suffix. We're dropping it in IREE also, having transitioned entirely to using Linalg on tensors. Reviewed By: sjarus Differential Revision: https://reviews.llvm.org/D111911	2021-10-15 16:01:19 -07:00
Aart Bik	b24788abd8	[mlir][sparse] implement sparse tensor init operation Next step towards supporting sparse tensors outputs. Also some minor refactoring of enum constants as well as replacing tensor arguments with proper buffer arguments (latter is required for more general sizes arguments for the sparse_tensor.init operation, as well as more general spares_tensor.convert operations later) Reviewed By: wrengr Differential Revision: https://reviews.llvm.org/D111771	2021-10-15 09:33:16 -07:00
Mogball	44610c01ae	[MLIR][ODS] default-valued strings should be in quotes `DefaultValuedAttr<StrAttr, "">` and `ConstantAttr<StrAttr, "">` result in bugs in which TableGen will not recognize that the attribute has a default value, because `""` is an empty TableGen string. Strings no longer have special treatment. Instead, string values must be wrapped in quotes: "\"foo\"". Two helpers, `DefaultValuedStrAttr` and `ConstantStrAttr` have been added to keep code clean. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D111855	2021-10-15 03:00:41 +00:00
Matthias Springer	7dd7078760	[mlir][linalg][bufferize] Handle scf::ForOp correctly in bufferizesToMemoryRead From the perspective of analysis, scf::ForOp is treated as a black box. Basic block arguments do not alias with their respective OpOperands on the ForOp, so they do not participate in conflict analysis with ops defined outside of the loop. However, bufferizesToMemoryRead and bufferizesToMemoryWrite on the scf::ForOp itself are used to determine how the scf::ForOp interacts with its surrounding ops. Differential Revision: https://reviews.llvm.org/D111775	2021-10-15 11:24:21 +09:00
Matthias Springer	d3cb6bf2d4	[mlir][linalg][bufferize] Rewrite conflict detection For each memory read, follow SSA use-def chains to find the op that produces the data being read (i.e., the most recent write). A memory write to an alias is a conflict if it takes places after the "most recent write" but before the read. This CL introduces two main changes: * There is a concise definition of a conflict. Given a piece of IR with InPlaceSpec annotations and a computes alias set, it is easy to compute whether this program has a conflict. No need to consider multiple cases such as "read of operand after in-place write" etc. * No need to check for clobbering. Differential Revision: https://reviews.llvm.org/D111287	2021-10-15 10:31:02 +09:00
Jacques Pienaar	65c9907c80	[mlir][ods] Enable emitting getter/setter prefix Allow emitting get & set prefix for accessors generated for ops. If enabled, then the argument/return/region name gets converted from snake_case to UpperCamel and prefix added. The attribute also allows generating both the current "raw" method along with the prefix'd one to make it easier to stage changes. The option is added on the dialect and currently defaults to existing raw behavior. The expectation is that the staging where both are generated would be short lived and so optimized to keeping the changes local/less invasive (it just generates two functions for each accessor with the same body - most of these internally again call a helper function). But generation can be optimized if needed. I'm unsure about OpAdaptor classes as there it is all get methods (it is a named view into raw data structures), so prefix doesn't add much. This starts with emitting raw-only form (as current behavior) as default, then one can opt-in to raw & prefixed, then just prefixed. The default in OpBase will switch to prefixed-only to be consistent with MLIR style guide. And the option potentially removed later (considered enabling specifying prefix but current discussion more pro keeping it limited and stuck with that). Also add more explicit checking for pruned functions to avoid emitting where no function was added (and so avoiding dereferencing nullptr) during op def/decl generation. See https://bugs.llvm.org/show_bug.cgi?id=51916 for further discussion. Differential Revision: https://reviews.llvm.org/D111033	2021-10-14 15:58:44 -07:00
Mogball	cb3aa49ec0	[MLIR][arith] fix references to std.constant in comments Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D111820	2021-10-14 20:38:47 +00:00
thomasraoux	afad0cdf31	[mlir][vector] Refactor linalg vectorization for reductions Emit reduction during op vectorization instead of doing it when creating the transfer write. This allow us to not broadcast output arguments for reduction initial value. Differential Revision: https://reviews.llvm.org/D111825	2021-10-14 13:37:56 -07:00
Rob Suderman	59dd418e89	[mlir][tosa] Fix tosa.cast UiToFp32 for tosa-to-linalg Part of the arith update broke UiToFp32. Fixed the lowering and included a new test to detect a regression. Differential Revision: https://reviews.llvm.org/D111772	2021-10-14 11:34:10 -07:00
Nicolas Vasilache	82dd977baf	[mlir][Linalg] Tighten canonicalization of InsertSliceOp that triggers infinite loop I am unclear this is reproducible with correct IR but atm the verifier for InsertSliceOp is not powerful enough and this triggers an infinite loop that is worth fixing independently. Differential Revision: https://reviews.llvm.org/D111812	2021-10-14 15:26:03 +00:00
Nicolas Vasilache	0eeaad3012	[mlir][Linalg] Fix insertion point in comprehensive bufferization	2021-10-14 15:24:09 +00:00
Alex Zinenko	18fbd5fe34	[mlir][python] Better support for variadic regions in Python bindings Improve support for variadic regions in ODS-generated operation view classes. In particular, make generated constructors take an extra argument that specifies the number of variadic regions if the operation has them. Previously, there was no mechanism to specify a non-zero number of variadic regions. Also generate named accessors to regions. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D111783	2021-10-14 13:15:13 +02:00
Alex Zinenko	a04c0b7ed2	[mlir][python] Fix MemRefType IsAFunction in Python bindings MemRefType was using a wrong `isa` function in the bindings code, which could lead to invalid IR being constructed. Also run the verifier in memref dialect tests. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D111784	2021-10-14 13:12:37 +02:00
Uday Bondhugula	05fb26062c	[MLIR] Fix assert crash when an unregistered dialect op is encountered Fix assert crash when an unregistered dialect op is encountered during parsing and `-allow-unregistered-dialect' isn't on. Instead, emit an error. While on this, clean up "registered" vs "loaded" on `getDialect()` and local clang-tidy warnings. https://llvm.discourse.group/t/assert-behavior-on-unregistered-dialect-ops/4402 Differential Revision: https://reviews.llvm.org/D111628	2021-10-14 15:43:53 +05:30
Tobias Gysi	a8f69be61f	[mlir][linalg] Expose flag to control nofold attribute when padding. Setting the nofold attribute enables packing an operand. At the moment, the attribute is set by default. The pack introduces a callback to control the flag. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D111718	2021-10-14 10:07:07 +00:00
Tobias Gysi	eaa52750ce	[mlir][linalg] Verify every LinalgOp has a body. After removing the last LinalgOps that have no region attached we can verify there is a region. The patch performs the following changes: - Move the SingleBlockImplicitTerminator trait further up the the structured op base class. - Adapt the LinalgOp verification since the trait only check if there is 0 or 1 block. - Introduce a getBlock method on the LinalgOp interface. - Access the LinalgOp body using either getBlock() or getBody() if the concrete operation type is known. This patch is a follow up to https://reviews.llvm.org/D111233. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D111393	2021-10-14 09:08:39 +00:00
Stella Laurenzo	fe6d9937b3	[mlir] Ability to build CAPI dylibs from out of tree projects against installed LLVM. * Incorporates a reworked version of D106419 (which I have closed but has comments on it). * Extends the standalone example to include a minimal CAPI (for registering its dialect) and a test which, from out of tree, creates an aggregate dylib and links a little sample program against it. This will likely only work today in static MLIR builds (until the TypeID fiasco is finally put to bed). It should work on all platforms, though (including Windows - albeit I haven't tried this exact incarnation there). * This is the biggest pre-requisite to being able to build out of tree MLIR Python-based projects from an installed MLIR/LLVM. * I am rather nauseated by the CMake shenanigans I had to endure to get this working. The primary complexity, above and beyond the previous patch is because (with no reason given), it is impossible to export target properties that contain generator expressions... because, of course it isn't. In this case, the primary reason we use generator expressions on the individual embedded libraries is to support arbitrary ordering. Since that need doesn't apply to out of tree (which import everything via FindPackage at the outset), we fall back to a more imperative way of doing the same thing if we detect that the target was imported. Gross, but I don't expect it to need a lot of maintenance. * There should be a relatively straight-forward path from here to rebase libMLIR.so on top of this facility and also make it include the CAPI. Differential Revision: https://reviews.llvm.org/D111504	2021-10-13 18:45:55 -07:00
Aart Bik	a652e5b53a	[mlir][sparse] emergency fix after constant -> arith.constant change Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D111743	2021-10-13 10:26:17 -07:00
Aart Bik	35517a251d	[mlir][sparse] add init sparse tensor operation This is the first step towards supporting general sparse tensors as output of operations. The init sparse tensor is used to materialize an empty sparse tensor of given shape and sparsity into a subsequent computation (similar to the dense tensor init operation counterpart). Example: %c = sparse_tensor.init %d1, %d2 : tensor<?x?xf32, #SparseMatrix> %0 = linalg.matmul ins(%a, %b: tensor<?x?xf32>, tensor<?x?xf32>) outs(%c: tensor<?x?xf32, #SparseMatrix>) -> tensor<?x?xf32, #SparseMatrix> Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D111684	2021-10-13 09:47:56 -07:00
xndcn	8c1553f0d7	[mlir][spirv] Add memory semantics verify for atomic operations Differential Revision: https://reviews.llvm.org/D111510	2021-10-14 00:00:55 +08:00
Alex Zinenko	7fd6f40dbd	[mlir][python] Add custom constructor for memref load The type can be inferred trivially, but it is currently done as string stitching between ODS and C++ and is not easily exposed to Python. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D111712	2021-10-13 17:11:02 +02:00
thomasraoux	cc83c2444f	[mlir][vector] Add canonicalization extract + splat Make canonicalization working on broadcast also work on splat op. Differential Revision: https://reviews.llvm.org/D111690	2021-10-13 08:08:46 -07:00
Alex Zinenko	78f2dae00d	[mlir][python] Provide some methods and properties for API completeness When writing the user-facing documentation, I noticed several inconsistencies and asymmetries in the Python API we provide. Fix them by adding: - the `owner` property to regions, similarly to blocks; - the `isinstance` method to any class derived from `PyConcreteAttr`, `PyConcreteValue` and `PyConreteAffineExpr`, similar to `PyConcreteType` to enable `isa`-like calls without having to handle exceptions; - a mechanism to create the first block in the region as we could only create blocks relative to other blocks, with is impossible in an empty region. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D111556	2021-10-13 14:30:55 +02:00
Jacques Pienaar	e67cbbef03	[mlir][python] Expose CallSiteLoc Python side This exposes creating a CallSiteLoc with a callee & list of frames for callers. Follows the creation approach in C++ side where a list of frames may be provided. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D111670	2021-10-13 10:25:40 +02:00
Mogball	a54f4eae0e	[MLIR] Replace std ops with arith dialect ops Precursor: https://reviews.llvm.org/D110200 Removed redundant ops from the standard dialect that were moved to the `arith` or `math` dialects. Renamed all instances of operations in the codebase and in tests. Reviewed By: rriddle, jpienaar Differential Revision: https://reviews.llvm.org/D110797	2021-10-13 03:07:03 +00:00
Weiwei Li	c0a6381e49	[mlir][SPIRVToLLVM] Solve ExecutionModeOp redefinition and add OpTypeSampledImage into SPV_Type 1. To avoid two ExecutionModeOp using the same name, adding the value of execution mode in name when converting to LLVM dialect. 2. To avoid syntax error in spv.OpLoad, add OpTypeSampledImage into SPV_Type. Reviewed by:antiagainst Differential revision:https://reviews.llvm.org/D111193	2021-10-13 10:03:25 +08:00
thomasraoux	aa71f487f3	[mlir] update new linalg vectorization tests after vectorization fix	2021-10-12 16:10:30 -07:00
thomasraoux	7c97e328b3	[mlir][linalg] Fix generic reduction vectorization We shouldn't broadcast the original value when doing reduction. Instead we compute the reduction and then combine it with the original value. Differential Revision: https://reviews.llvm.org/D111666	2021-10-12 15:46:04 -07:00
Diego Caballero	eeb09fd646	[mlir][Linalg] Enable vectorization of 'mul', 'and', 'or' and 'xor' reductions This patch adds support for vectorizing 'mul', 'and', 'or' anx 'xor' reductions to Linalg. Reviewed By: pifon2a, ThomasRaoux, aartbik Differential Revision: https://reviews.llvm.org/D111565	2021-10-12 21:08:23 +00:00
Diego Caballero	5c1d356c18	[mlir][Linalg] Enable vectorization of explicit broadcasts This patch teaches `isProjectedPermutation` and `inverseAndBroadcastProjectedPermutation` utilities to deal with maps representing an explicit broadcast, e.g., (d0, d1) -> (d0, 0). This extension is needed to enable vectorization of such explicit broadcast in Linalg. Reviewed By: pifon2a, nicolasvasilache Differential Revision: https://reviews.llvm.org/D111563	2021-10-12 21:08:22 +00:00
Rob Suderman	95e4b71519	[mlir][tosa] Fix tosa average_pool2d to linalg type issue Average pool assumed the same input/output type. Result type for integers is always an i32, should be updated appropriately. Reviewed By: GMNGeoffrey Differential Revision: https://reviews.llvm.org/D111590	2021-10-12 13:09:21 -07:00
Jacques Pienaar	04d76d3694	[mlir][python] Add nameloc getter Expose the nameloc getter to Python API. Differential Revision: https://reviews.llvm.org/D111663	2021-10-12 12:45:57 -07:00
Benjamin Kramer	f67d57c95f	[mlir][Shape] Add a pattern to turn extract from shape_of into tensor.dim If I remember correctly this wasn't done previously because dim used to be in the memref dialect. Differential Revision: https://reviews.llvm.org/D111651	2021-10-12 19:09:21 +02:00
Lei Zhang	519b350de0	[mlir][vector] Add folder for no-op InsertStridedSliceOp Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D111636	2021-10-12 11:41:35 -04:00
Nicolas Vasilache	b24c91fffc	[mlir][Vector][Bigfix] Fix vector transfer to store lowering to insert a proper ExtractOp Differential Revision: https://reviews.llvm.org/D111641	2021-10-12 13:28:12 +00:00
Nicolas Vasilache	0a7f81a451	mlir][Vector] Fix spuriously disabled test.	2021-10-12 12:56:40 +00:00
Nicolas Vasilache	753a67b5c9	[mlir][Linalg] Refactor and improve vectorization to add support for reduction into 0-d tensors. This revision takes advantage of the recently added support for 0-d transfers and vector.multi_reduction that return a scalar. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D111626	2021-10-12 12:47:36 +00:00
Lei Zhang	bdd37c9f49	[mlir][tensor] Add some folders for insert/extract slice ops * Fold extract_slice immediately after insert_slice. * Fold overlapping insert_slice. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D111439	2021-10-12 08:40:54 -04:00
Nicolas Vasilache	0c74b12a2e	[mlir][Vector] NFC - Add test to exercise lowering of vector.transfer to scf This revision also renames and moves some tests around. Differential Revision: https://reviews.llvm.org/D111606	2021-10-12 12:38:33 +00:00
Nicolas Vasilache	47f7938a94	[mlir][Vector] Add support for lowering 0-d transfers to load/store. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D111603	2021-10-12 12:35:19 +00:00
Nicolas Vasilache	67b10532c6	[mlir][Vector] Allow a 0-d for for vector transfer ops. This revision updates the op semantics, printer, parser and verifier to allow 0-d transfers. Until 0-d vectors are available, such transfers have a special form that transits through vector<1xt>. This is a stepping stone towards the longer term work of adding 0-d vectors and will help significantly reduce corner cases in vectorization. Transformations and lowerings do not yet support this form, extensions will follow. Differential Revision: https://reviews.llvm.org/D111559	2021-10-12 11:48:42 +00:00
Nicolas Vasilache	8f1650cb65	[mlir][Linalg] NFC - Refactor vector.broadcast op verification logic and make it available as a precondition in Linalg vectorization. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D111558	2021-10-12 11:35:34 +00:00
Nicolas Vasilache	31270eb165	[mlir][Vector] Let vector.multi_reduction reduce down to a scalar. vector.multi_reduction currently does not allow reducing down to a scalar. This creates corner cases that are hard to handle during vectorization. This revision extends the semantics and adds the proper transforms, lowerings and canonicalizations to allow lowering out of vector.multi_reduction to other abstractions all the way to LLVM. In a future, where we will also allow 0-d vectors, scalars will still be relevant: 0-d vector and scalars are not equivalent on all hardware. In the process, splice out the implementation patterns related to vector.multi_reduce into a new file. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D111442	2021-10-12 11:03:54 +00:00
Shraiysh Vaishay	7a79c6afea	[mlir][OpenMP] OpenMP Synchronization Hints stored as IntegerAttr `hint-expression` is an IntegerAttr, because it can be a combination of multiple values from the enum `omp_sync_hint_t` (Section 2.17.12 of OpenMP 5.0) Reviewed By: ftynse, kiranchandramohan Differential Revision: https://reviews.llvm.org/D111360	2021-10-12 11:01:19 +00:00
Vladislav Vinogradov	c6390f19f2	[mlir] Fix AsmPrinter for types with sub elements Call `printType(subElemType)` instead of `os << subElemType` for them. It allows to handle type aliases inside complex types. As a side effect, fixed `test.int` parsing. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D111536	2021-10-12 12:08:16 +03:00
Vladislav Vinogradov	505afd1e64	[mlir] Clean up boolean flags usage in LIT tests * Call `llvm_canonicalize_cmake_booleans` for all CMake options, which are propagated to `lit.local.cfg` files. * Use Python native boolean values instead of strings for such options. This fixes the cases, when CMake variables have values other than `ON` (like `TRUE`). This might happen due to IDE integration or due to CMake preset usage. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D110073	2021-10-12 11:44:48 +03:00
Daniel Resnick	1760d8b36b	[mlir][ODS] Support result type inference in custom assembly format Operations that have the InferTypeOpInterface trait can now omit the return types in their custom assembly formats. Differential Revision: https://reviews.llvm.org/D111326	2021-10-11 14:07:56 -06:00
Aart Bik	849f016ce8	[mlir][sparse] accept affine subscripts in outer dimensions of dense memrefs This relaxes vectorization of dense memrefs a bit so that affine expressions are allowed in more outer dimensions. Vectorization of non unit stride references is disabled though, since this seems ineffective anyway. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D111469	2021-10-11 11:45:14 -07:00
Uday Bondhugula	b2217b36fe	[MLIR] Fix affine loop unroll corner case for full unroll Fix affine loop unroll for zero trip count loops. Add missing check. Differential Revision: https://reviews.llvm.org/D111375	2021-10-11 10:22:24 +05:30
Amy Zhuang	5ce368cfe2	[mlir] Vectorize induction variables 1. Add support to vectorize induction variables of loops that are not mapped to any vector dimension in SuperVectorize pass. 2. Fix a bug in getForInductionVarOwner. Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D111370	2021-10-09 12:40:24 -07:00
Mehdi Amini	8c9f506d8c	Disable mlir/test/mlir-cpu-runner/async-group.mlir with ASAN This test is crashing 9 out of 10 runs in CI, but I can't reproduce locally right now. Disabling to get the CI back to green and avoid backsliding with more ASAN issues that would go unnoticed.	2021-10-09 03:02:53 +00:00
Emilio Cota	57c56cf20c	X86Vector: relax checks in rsqrt's integration test Instead of hard-coding results for both Intel and AMD, let's relax the checks to simplify the test while supporting both implementations. Note that: - If a new hardware implementation comes up in the future, it is likely to pass the relaxed tests, i.e. no future maintenance burden for us. - If something terribly wrong happens (e.g. instead of rsqrt we execute 1/sqrt), the tests will probably catch it, since the relaxed tests expect low precision (e.g. rsqrt(1) != 1.0). Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D111461	2021-10-08 13:59:18 -07:00
Stella Laurenzo	a201829a20	Fix parsing of hex-format index dense tensor attributes. TensorLiteralParser::getHexAttr does a isIntOrIndexOrFloat check and properly handles index elements, but TensorLiteralParser::getAttr that calls into it has a mismatched check. This just makes the checks match so that index element attrs can parse when of type tensor. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D111374	2021-10-08 15:44:02 +00:00
Matthias Springer	f8453ea75f	[mlir][linalg][bufferize] Rewrite "write into non-writable memory" detection The purpose of this revision is to make "write into non-writable memory" conflict detection easier to understand. The main idea is that there is a conflict in the case of inplace bufferization if: 1. Someone writes to (an alias of) opOperand, opResult or the to-be-bufferized op writes itself. 2. And, opOperand or opResult aliases a non-writable buffer. Differential Revision: https://reviews.llvm.org/D111379	2021-10-08 21:27:49 +09:00
Lei Zhang	4cd7ff6728	[mlir][linalg] Constant fold linalg.generic that are transposes This commit adds a pattern to perform constant folding on linalg generic ops which are essentially transposes. We see real cases where model importers may generate such patterns. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D110597	2021-10-08 08:09:13 -04:00
Eugene Zhulenev	e2a37bb540	[mlir] Add alignment option to constant tensor bufferization pass Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D111364	2021-10-08 03:17:20 -07:00
Alex Zinenko	b164f23c29	[mlir][python] support taking ops instead of values in op constructors Introduce support for accepting ops instead of values when constructing ops. A single-result op can be used instead of a value, including in lists of values, and any op can be used instead of a list of values. This is similar to, but more powerful, than the C++ API that allows for implicitly casting an OpType to Value if it is statically known to have a single result - the cast in Python is based on the op dynamically having a single result, and also handles the multi-result case. This allows to build IR in a more concise way: op = dialect.produce_multiple_results() other = dialect.produce_single_result() dialect.consume_multiple_results(other, op) instead of having to access the results manually op = dialect.produce.multiple_results() other = dialect.produce_single_result() dialect.consume_multiple_results(other.result, op.operation.results) The dispatch is implemented directly in Python and is triggered automatically for autogenerated OpView subclasses. Extension OpView classes should use the functions provided in ods_common.py if they want to implement this behavior. An alternative could be to implement the dispatch in the C++ bindings code, but it would require to forward opaque types through all Python functions down to a binding call, which makes it hard to inspect them in Python, e.g., to obtain the types of values. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D111306	2021-10-08 09:49:48 +02:00
Tobias Gysi	8ed2e8e04f	[mlir][linalg] Retire Linalg ConvOp. The convolution op is one of the remaining hard coded Linalg operations that have no region attached. It got obsolete due to the OpDSL convolution operations. Removing it allows us to delete specialized code and tests that are not needed for the OpDSL counterparts that rely on the standard code paths. Test needed due to specialized implementations are removed. Tiling and fusion tests are replaced by variants using linalg.conv_2d. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D111233	2021-10-08 06:56:37 +00:00
Tobias Gysi	23800b05be	[mlir][linalg] Add loop interchange to CodegenStrategy. Add a loop interchange pass and integrate it with CodegenStrategy. This patch depends on https://reviews.llvm.org/D110728 and https://reviews.llvm.org/D110746. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110748	2021-10-08 06:39:22 +00:00
Tobias Gysi	1ebd197bc5	[mlir][linalg] Add generalization to CodegenStrategy. Add a generalization pass and integrate it with CodegenStrategy. This patch depends on https://reviews.llvm.org/D110728. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110746	2021-10-08 06:31:19 +00:00
Mehdi Amini	82cd8b81aa	Fix test-rsqrt.mlir to accept AMD's approximation of rsqrt as well These kind of function can behave differently on these X86 chips, there isn't really "one true answer" so we'll accept both. Also remove spurious passes and use mattr="avx" to match the instruction used here. Differential Revision: https://reviews.llvm.org/D111373	2021-10-08 04:24:24 +00:00

... 6 7 8 9 10 ...

5393 Commits