llvm-project

Commit Graph

Author	SHA1	Message	Date
natashaknk	b9b10c0e61	[tosa][mlir] Lowering for dynamic shapes in the reduce_x ops in tosa-to-linalg Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D117691	2022-01-19 11:15:14 -08:00
Thomas Raoux	d9edc1a585	[mlir][spirv] Add math.fma lowering to spirv Differential Revision: https://reviews.llvm.org/D117704	2022-01-19 10:57:05 -08:00
Mogball	aae5125550	[mlir] Replace StrEnumAttr -> EnumAttr in core dialects Removes uses of `StrEnumAttr` in core dialects Reviewed By: mehdi_amini, rriddle Differential Revision: https://reviews.llvm.org/D117514	2022-01-18 17:15:00 +00:00
Mogball	5c36ee8d57	[mlir] Drop the leading space when printing regions The leading space that is always printed at the beginning of regions is not consistent with other parts of the printing API. Moreover, this leading space can lead to undesirable assembly formats: ``` attr-dict-with-keyword $region ``` Prints as: ``` // Two spaces between `}` and `{` attributes {foo} { ... } ``` Moreover, the leading space results in the odd generic op format: ``` "test.op"() ( {...}) : () -> () ``` Reviewed By: rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D117411	2022-01-18 16:52:34 +00:00
Nicolas Vasilache	cc0d208805	[mlir][Linalg] Drop deprecated convolution vectorization patterns Differential revision: https://reviews.llvm.org/D117326	2022-01-18 09:26:50 +00:00
Benjamin Kramer	964dc368e7	[AsyncToLLVM] aligned_alloc requires the size to be a multiple of aignment, so round up Fixes a crash with debug malloc.	2022-01-17 21:48:00 +01:00
Nicolas Vasilache	f40a579bea	Revert "[mlir][Linalg] NFC - Drop vectorization reliance on ConvolutionOpInterface" This reverts commit `c8f5735301`. The integration tests are broken.	2022-01-17 19:38:07 +00:00
Benjamin Kramer	5acd6e0522	[AsyncToLLVM] Align frames to 64 bytes Coroutine lowering always takes the natural alignment when spilling to the frame (issue #53148) so using AVX2 or AVX512 in a coroutine doesn't work. Always overalign to 64 bytes to avoid this issue until we have a better solution. Differential Revision: https://reviews.llvm.org/D117501	2022-01-17 18:51:42 +01:00
Nicolas Vasilache	c8f5735301	[mlir][Linalg] NFC - Drop vectorization reliance on ConvolutionOpInterface Differential Revision: https://reviews.llvm.org/D117323	2022-01-17 17:01:36 +00:00
Benoit Jacob	499703e9c0	Enable ReassociatingReshapeOpConversion with "non-identity" layouts. Enable ReassociatingReshapeOpConversion with "non-identity" layouts. This removes an early-return in this function, which seems unnecessary and is preventing some memref.collapse_shape from converting to LLVM (see included lit test). It seems unnecessary because the return message says "only empty layout map is supported" but there actually is code in this function to deal with non-empty layout maps. Maybe it refers to an earlier state of implementation and is just out of date? Though, there is another concern about this early return: the condition that it actually checks, `{src,dst}MemrefType.getLayout().isIdentity()`, is not quite the same as what the return message says, "only empty layout map is supported". Stepping through this `getLayout().isIdentity()` code in GDB, I found that it evaluates to `.getAffineMap().isIdentity()` which does (AffineMap.cpp:271): ``` if (getNumDims() != getNumResults()) return false; ``` This seems that it would always return false for memrefs of rank greater than 1 ? Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114808	2022-01-13 17:46:20 +00:00
natashaknk	310e9636ca	[tosa][mlir] Support dynamic batch dimension for ops where the batch dim is explicit Dynamic batch for rescale, gather, max_pool, avg_pool, conv2D and depthwise_conv2D. Split helper functions into a separate header file. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D117031	2022-01-12 14:16:50 -08:00
Rob Suderman	aa1c533a4e	[mlir][tosa] Expand tosa.apply_scale lowering for vectors Apply scale may encounter scalar, tensor, or vector operations. Expand the lowering so that it can lower arbitrary of container types. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D117080	2022-01-12 14:07:52 -08:00
Alex Zinenko	f50cfc44d6	[mlir] Require struct indices in LLVM::GEPOp to be constant Recent commits added a possibility for indices in LLVM dialect GEP operations to be supplied directly as constant attributes to ensure they remain such until translation to LLVM IR happens. Make this required for indexing into LLVM struct types to match LLVM IR requirements, otherwise the translation would assert on constructing such IR. For better compatibility with MLIR-style operation construction interface, allow GEP operations to be constructed programmatically using Values pointing to known constant operations as struct indices. Depends On D116758 Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D116759	2022-01-07 09:56:05 +01:00
Stanislav Funiak	2692eae574	[MLIR][PDL] Refactor the positions for multi-root patterns. When the original version of multi-root patterns was reviewed, several improvements were made to the pdl_interp operations during the review process. Specifically, the "get users of a value at the specified operand index" was split up into "get users" and "compare the users' operands with that value". The iterative execution was also cleaned up to `pdl_interp.foreach`. However, the positions in the pdl-to-pdl_interp lowering were not similarly refactored. This introduced several problems, including hard-to-detect bugs in the lowering and duplicate evaluation of `pdl_interp.get_users`. This diff cleans up the positions. The "upward" `OperationPosition` was split-out into `UsersPosition` and `ForEachPosition`, and the operand comparison was replaced with a simple predicate. In the process, I fixed three bugs: 1. When multiple roots were had the same connector (i.e., a node that they shared with a subtree at the previously visited root), we would generate a single foreach loop rather than one foreach loop for each such root. The reason for this is that such connectors shared the position. The solution for this is to add root index as an id to the newly introduced `ForEachPosition`. 2. Previously, we would use `pdl_interp.get_operands` indiscriminately, whether or not the operand was variadic. We now correctly detect variadic operands and insert `pdl_interp.get_operand` when needed. 3. In certain corner cases, we would trigger the "connector has not been traversed yet" assertion. This was caused by not inserting the values during the upward traversal correctly. This has now been fixed. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D116080	2022-01-04 08:03:44 +05:30
William S. Moses	a6a583dae4	[MLIR] Move AtomicRMW into MemRef dialect and enum into Arith Per the discussion in https://reviews.llvm.org/D116345 it makes sense to move AtomicRMWOp out of the standard dialect. This was accentuated by the need to add a fold op with a memref::cast. The only dialect that would permit this is the memref dialect (keeping it in the standard dialect or moving it to the arithmetic dialect would require those dialects to have a dependency on the memref dialect, which breaks linking). As the AtomicRMWKind enum is used throughout, this has been moved to Arith. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D116392	2021-12-30 14:31:33 -05:00
MaheshRavishankar	7df7586a0b	[mlir][MemRef] Deprecate unspecified trailing offset, size, and strides semantics of `OffsetSizeAndStrideOpInterface`. The semantics of the ops that implement the `OffsetSizeAndStrideOpInterface` is that if the number of offsets, sizes or strides are less than the rank of the source, then some default values are filled along the trailing dimensions (0 for offset, source dimension of sizes, and 1 for strides). This is confusing, especially with rank-reducing semantics. Immediate issue here is that the methods of `OffsetSizeAndStridesOpInterface` assumes that the number of values is same as the source rank. This cause out-of-bounds errors. So simplifying the specification of `OffsetSizeAndStridesOpInterface` to make it invalid to specify number of offsets/sizes/strides not equal to the source rank. Differential Revision: https://reviews.llvm.org/D115677	2021-12-29 11:18:29 -08:00
William S. Moses	99fc000c87	[MLIR] Expose atomicrmw and/or LLVM (dialect and IR) have atomics for and/or. This patch enables atomic_rmw ops in the standard dialect for and/or that lower to these (in addition to the existing atomics such as addi, etc). Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D116345	2021-12-29 00:23:28 -05:00
Rob Suderman	f0cb77d7d5	[mlir][tosa] Resubmit split tosa-to-linalg named ops out of pass Includes dependency fix that resulted in canonicalizer pass not linking in. Linalg named ops lowering are moved to a separate pass. This allows TOSA canonicalizers to run between named-ops lowerings and the general TOSA lowerings. This allows the TOSA canonicalizers to run between lowerings. Differential Revision: https://reviews.llvm.org/D116057	2021-12-28 11:22:58 -08:00
Mehdi Amini	735fe1da6b	Revert "[mlir][tosa] Split tosa-to-linalg named ops out of pass" This reverts commit `313de31fbb`. There is a missing CMake dependency, building with shared libraries is broken: 55.509 [45/4/3061] Linking CXX shared library lib/libMLIRTosaToLinalg.so.14git FAILED: lib/libMLIRTosaToLinalg.so.14git ... TosaToLinalgPass.cpp: undefined reference to `mlir::createCanonicalizerPass()'	2021-12-24 00:09:15 +00:00
Rob Suderman	313de31fbb	[mlir][tosa] Split tosa-to-linalg named ops out of pass Linalg named ops lowering are moved to a separate pass. This allows TOSA canonicalizers to run between named-ops lowerings and the general TOSA lowerings. This allows the TOSA canonicalizers to run between lowerings. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D116057	2021-12-23 12:23:19 -08:00
Rob Suderman	0763f12213	[mlir][tosa] Handle rescale case where shift > 63 It is possible for the shift value to exceed the number of bits. In these cases we can just multiply by zero. This is relatively rare occurence but should be handled. Reviewed By: not-jenni Differential Revision: https://reviews.llvm.org/D115779	2021-12-16 15:30:48 -08:00
Lei Zhang	223be5f630	[mlir][spirv] Perform partial conversion in VectorToSPIRVPass This allows the pass to participate in progressive lowering and it also allows us to write tests better. Along the way, cleaned up the tests. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D115756	2021-12-16 09:35:56 -05:00
Lei Zhang	96130b5dc7	[mlir][spirv] Support size-1 vector/tensor constant during conversion Reviewed By: ThomasRaoux, mravishankar Differential Revision: https://reviews.llvm.org/D115518	2021-12-14 15:58:08 -05:00
Alexander Belyaev	15f8f3e20a	[mlir] Split std.rank into tensor.rank and memref.rank. Move `std.rank` similarly to how `std.dim` was moved to TensorOps and MemRefOps. Differential Revision: https://reviews.llvm.org/D115665	2021-12-14 10:15:55 +01:00
Jacques Pienaar	efb7727a96	[mlir] Flag near misses in file splitting Flags some potential cases where splitting isn't happening and so could result in confusing results. Also update some test files where there were near misses in splitting that seemed unintentional. Differential Revision: https://reviews.llvm.org/D109636	2021-12-12 08:03:30 -08:00
Nicolas Vasilache	408553dd96	[mlir][Vector] Support 0-D vectors in `CreateMaskOp` The 0-D case gets lowered in almost the same way that the 1-D case does in VectorCreateMaskOpConversion. I also had to slightly update the verifier for the op to always require exactly 1 operand in the 0-D case. Depends On D115220 Reviewed by: ftynse Differential revision: https://reviews.llvm.org/D115221	2021-12-12 13:32:29 +00:00
Michal Terepeta	a0c930d312	[mlir][Vector] Support 0-D vectors in `CmpIOp` Following the example of `VectorOfAnyRankOf`, I've done a few changes in the `.td` files to help with adding the support for the 0-D case gradually. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D115220	2021-12-12 13:28:26 +00:00
River Riddle	233e9476d8	[mlir:PDL] Allow non-bound pdl.attribute/pdl.type operations that create constants This allows for passing in these attributes/types to constraints/rewrites as arguments. Differential Revision: https://reviews.llvm.org/D114817	2021-12-10 19:38:43 +00:00
Alexander Belyaev	b618880e7b	[mlir] Move `linalg.tensor_expand/collapse_shape` to TensorDialect. RFC: https://llvm.discourse.group/t/rfc-reshape-ops-restructuring/3310 linalg.fill gets a canonicalizer, because `FoldFillWithTensorReshape` cannot be moved to tensorops (it uses linalg::FillOp inside). Before it was listed as a canonicalization pattern for the reshape operations, now it became a canonicalization for FillOp. Differential Revision: https://reviews.llvm.org/D115502	2021-12-10 12:11:48 +01:00
Krzysztof Drewniak	e1da62910e	[MLIR][GPU] Define gpu.printf op and its lowerings - Define a gpu.printf op, which can be lowered to any GPU printf() support (which is present in CUDA, HIP, and OpenCL). This op only supports constant format strings and scalar arguments - Define the lowering of gpu.pirntf to a call to printf() (which is what is required for AMD GPUs when using OpenCL) as well as to the hostcall interface present in the AMD Open Compute device library, which is the interface present when kernels are running under HIP. - Add a "runtime" enum that allows specifying which of the possible runtimes a ROCDL kernel will be executed under or that the runtime is unknown. This enum controls how gpu.printf is lowered This change does not enable lowering for Nvidia GPUs, but such a lowering should be possible in principle. And: [MLIR][AMDGPU] Always set amdgpu-implicitarg-num-bytes=56 on kernels This is something that Clang always sets on both OpenCL and HIP kernels, and failing to include it causes mysterious crashes with printf() support. In addition, revert the max-flat-work-group-size to (1, 256) to avoid triggering bugs in the AMDGPU backend. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D110448	2021-12-09 15:54:31 +00:00
Rob Suderman	23149d522b	[mlir] Added ctlz and cttz to math dialect and LLVM dialect Count leading/trailing zeros are an existing LLVM intrinsic. Added LLVM support for the intrinsics with lowerings from the math dialect to LLVM dialect. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D115206	2021-12-08 14:32:15 -08:00
Butygin	d8fce785de	[mlir][spirv] math.erf OpenCL lowering Differential Revision: https://reviews.llvm.org/D115335	2021-12-08 21:59:46 +03:00
Rob Suderman	c5fef77bc3	[mlir] Add CtPop to MathOps with lowering to LLVM math.ctpop maths to the llvm.ctpop intrinsic. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D114998	2021-12-06 11:54:20 -08:00
Alex Zinenko	d64b3e47ba	[mlir] Avoid needlessly converting LLVM named structs with compatible elements Conversion of LLVM named structs leads to them being renamed since we cannot modify the body of the struct type once it is set. Previously, this applied to all named struct types, even if their element types were not affected by the conversion. Make this behvaior only applicable when element types are changed. This requires making the LLVM dialect type-compatibility check recursively look at the element types (arguably, it should have been doing than since the moment the LLVM dialect type system stopped being closed). In addition, have a more lax check for outer types only to avoid repeated check when necessary (e.g., parser, verifiers that are going to also look at the inner type). Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D115037	2021-12-06 13:42:11 +01:00
Michal Terepeta	caf89c0db6	[mlir][Vector] Support 0-D vectors in `ConstantMaskOp` To support creating both a mask with just a single `true` and `false` values, I had to relax the restriction in the verifier that the rank is always equal to the length of the attribute array, in other words, we now allow: - `vector.constant_mask [0] : vector<i1>` which gets lowered to `arith.constant dense<false> : vector<i1>` - `vector.constant_mask [1] : vector<i1>` which gets lowered to `arith.constant dense<true> : vector<i1>` (the attribute list for the 0-D case must be a singleton containing either `0` or `1`) Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D115023	2021-12-06 08:03:04 +00:00
Alex Zinenko	9dd1f8dfdd	[mlir] support recursive type conversion of named LLVM structs A previous commit added support for converting elemental types contained in LLVM dialect types in case they were not compatible with the LLVM dialect. It was missing support for named structs as they could be recursive, which was not supported by the conversion infra. Now that it is, add support for converting such named structs. Depends On D113579 Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D113580	2021-12-03 12:41:40 +01:00
Michal Terepeta	1423e8bf5d	[mlir][Vector] Support 0-D vectors in `BitCastOp` The implementation only allows to bit-cast between two 0-D vectors. We could probably support casting from/to vectors like `vector<1xf32>`, but I wasn't convinced that this would be important and it would require breaking the invariant that `BitCastOp` works only on vectors with equal rank. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114854	2021-12-03 08:55:59 +00:00
Nicolas Vasilache	c537a94334	[mlir][Vector] Thread 0-d vectors through vector.transfer ops This revision adds 0-d vector support to vector.transfer ops. In the process, numerous cleanups are applied, in particular around normalizing and reducing the number of builders. Reviewed By: ThomasRaoux, springerm Differential Revision: https://reviews.llvm.org/D114803	2021-12-01 16:49:43 +00:00
Stephen Neuendorffer	7386364889	Revert "[MLIR] Update Vector To LLVM conversion to be aware of assume_alignment" This reverts commit `29a50c5864`. After LLVM lowering, the original patch incorrectly moved alignment information across an unconstrained GEP operation. This is only correct for some index offsets in the GEP. It seems that the best approach is, in fact, to rely on LLVM to propagate information from the llvm.assume() to users. Thanks to Thomas Raoux for catching this.	2021-11-30 15:18:22 -08:00
Nicolas Vasilache	a08b750ce9	[mlir][tensor] InsertSliceOp verification. This revision reintroduces tensor.insert_slice verification which seems to have vanished over time: a verifier was initially introduced in `cf9503c1b7` but for some reason the invalid.mlir was not properly updated; as time passed the verifier was not called anymore and later the code was deleted. As a consequence, a non-negligible portion of tests has run astray using invalid tensor.insert_slice semantics and needed to be fixed. Also, extract isRankReducedType from TensorOps for better reuse Originally, this facility was used by both tensor and memref forms but it got copied around as dialects were split. Differential Revision: https://reviews.llvm.org/D114715	2021-11-30 20:37:06 +00:00
Julian Gross	ae1ea0bead	[mlir] Decompose Bufferization Clone operation into Memref Alloc and Copy. This patch introduces a new conversion to convert bufferization.clone operations into a memref.alloc and a memref.copy operation. This transformation is needed to transform all remaining clones which "survive" all previous transformations, before a given program is lowered further (to LLVM e.g.). Otherwise, these operations cannot be handled anymore and lead to compile errors. See: https://llvm.discourse.group/t/bufferization-error-related-to-memref-clone/4665 Differential Revision: https://reviews.llvm.org/D114233	2021-11-30 10:15:56 +01:00
Michal Terepeta	7e65fc9a60	[mlir][Vector] Support 0-D vectors in `BroadcastOp` This changes the op to produce `AnyVectorOfAnyRank` following mostly the code for 1-D vectors. Depends On D114598 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114550	2021-11-26 17:17:18 +00:00
Michal Terepeta	d0f927121e	[mlir][Standard] Support 0-D vectors in `SplatOp` This changes the op to produce `AnyVectorOfAnyRank` and implements this by just inserting the element (skipping the shuffle that we do for the 1-D case). Depends On D114549 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114598	2021-11-26 17:05:15 +00:00
Stanislav Funiak	a76ee58f3c	Multi-root PDL matching using upward traversals. This is commit 4 of 4 for the multi-root matching in PDL, discussed in https://llvm.discourse.group/t/rfc-multi-root-pdl-patterns-for-kernel-matching/4148 (topic flagged for review). This PR integrates the various components (root ordering algorithm, nondeterministic execution of PDL bytecode) to implement multi-root PDL matching. The main idea is for the pattern to specify mulitple candidate roots. The PDL-to-PDLInterp lowering selects one of these roots and "hangs" the pattern from this root, traversing the edges downwards (from operation to its operands) when possible and upwards (from values to its uses) when needed. The root is selected by invoking the optimal matching multiple times, once for each candidate root, and the connectors are determined form the optimal matching. The costs in the directed graph are equal to the number of upward edges that need to be traversed when connecting the given two candidate roots. It can be shown that, for this choice of the cost function, "hanging" the pattern an inner node is no better than from the optimal root. The following three main additions were implemented as a part of this PR: 1. OperationPos predicate has been extended to allow tracing the operation accepting a value (the opposite of operation defining a value). 2. Predicate checking if two values are not equal - this is useful to ensure that we do not traverse the edge back downwards after we traversed it upwards. 3. Function for for building the cost graph among the candidate roots. 4. Updated buildPredicateList, building the predicates optimal branching has been determined. Testing: unit tests (an integration test to follow once the stack of commits has landed) Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D108550	2021-11-26 18:11:37 +05:30
Michal Terepeta	cc311a155a	[mlir][Vector] Support 0-D vectors in `VectorPrintOpConversion` Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114549	2021-11-25 20:12:18 +00:00
Butygin	8dae0b6b6c	[mlir][spirv] arith::RemSIOp OpenCL lowering Differential Revision: https://reviews.llvm.org/D114524	2021-11-25 12:44:06 +03:00
Butygin	75a1bee05d	[mlir][spirv] Add math to OpenCL conversion Differential Revision: https://reviews.llvm.org/D113780	2021-11-24 02:31:21 +03:00
Rob Suderman	54eec7cafc	[mlir][tosa] Separate tosa.transpose_conv decomposition and added stride support Transpose convolution decomposition is now performed in a separate pass. This allows padding / constant propagation to be performed at the TOSA level. It also adds support for striding when there is no dilation. Differential Revision: https://reviews.llvm.org/D114409	2021-11-23 12:16:44 -08:00
Nicolas Vasilache	3ff4e5f2a4	[mlir][Vector] Thread 0-d vectors through InsertElementOp. This revision makes concrete use of 0-d vectors to extend the semantics of InsertElementOp. Reviewed By: dcaballe, pifon2a Differential Revision: https://reviews.llvm.org/D114388	2021-11-23 12:55:11 +00:00
Nicolas Vasilache	e7026aba00	[mlir][Vector] Thread 0-d vectors through ExtractElementOp. This revision starts making concrete use of 0-d vectors to extend the semantics of ExtractElementOp. In the process a new VectorOfAnyRank Tablegen OpBase.td is added to allow progressive transition to supporting 0-d vectors by gradually opting in. Differential Revision: https://reviews.llvm.org/D114387	2021-11-23 12:39:44 +00:00
Thomas Raoux	47555d73f6	[mlir][gpu] Extend shuffle op modes and add nvvm lowering Add up, down and idx modes to gpu shuffle ops, also change the mode from string to enum Differential Revision: https://reviews.llvm.org/D114188	2021-11-19 11:14:31 -08:00
Mogball	7c5ecc8b7e	[mlir][vector] Insert/extract element can accept index `vector::InsertElementOp` and `vector::ExtractElementOp` have had their `position` operand changed to accept `AnySignlessIntegerOrIndex` for better operability with operations that use `index`, such as affine loops. LLVM's `extractelement` and `insertelement` can also accept `i64`, so lowering directly to these operations without explicitly inserting casts is allowed. SPIRV's equivalent ops can also accept `i64`. Reviewed By: nicolasvasilache, jpienaar Differential Revision: https://reviews.llvm.org/D114139	2021-11-18 22:40:29 +00:00
William S. Moses	30d87d4a5d	[MLIR][LLVM] Permit integer types in switch other than i32 LLVM switchop currently only permits i32. Both LLVM IR and MLIR Standard switch permit other integer types leading to an illegal state when lowering an i8 switch from MLIR standard Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D113955	2021-11-16 12:00:37 -05:00
Butygin	526b71e44a	[mlir] spirv: Add scf.while spirv conversion * It works similar to scf.for coversion, but convert condition and yield ops as part of scf.whille pattern so it don't need to maintain external state Differential Revision: https://reviews.llvm.org/D113007	2021-11-16 13:19:34 +03:00
Adrian Kuegel	921d91f3ac	[mlir] Support multi-dimensional vectors in MathToLibm conversion. Differential Revision: https://reviews.llvm.org/D113969	2021-11-16 11:13:52 +01:00
Mehdi Amini	1585b13024	Revert "[MLIR][LLVM] Permit integer types in switch other than i32" This reverts commit `94992670fc`. Build is broken with: tools/mlir/include/mlir/Dialect/LLVMIR/LLVMOps.cpp.inc:23996:3: error: no matching function for call to 'printSwitchOpCases' printSwitchOpCases(_odsPrinter, *this, getValue().getType(), getCaseValuesAttr(), getCaseDestinations(), getCaseOperands(), getCaseOperands().getTypes()); ^~~~~~~~~~~~~~~~~~	2021-11-16 05:59:12 +00:00
William S. Moses	94992670fc	[MLIR][LLVM] Permit integer types in switch other than i32 LLVM switchop currently only permits i32. Both LLVM IR and MLIR Standard switch permit other integer types leading to an illegal state when lowering an i8 switch from MLIR standard Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D113955	2021-11-16 00:46:25 -05:00
natashaknk	381677dfbf	[tosa][mlir] Refactor tosa.reshape lowering to linalg for dynamic cases. Split tosa.reshape into three individual lowerings: collapse, expand and a combination of both. Add simple dynamic shape support. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D113936	2021-11-15 15:31:37 -08:00
Nicolas Vasilache	ee80ffbf9a	[mlir][Linalg] Add bounded recursion declaration to FMAOp -> LLVM conversion. FMAOp -> LLVM conversion is done progressively by peeling off 1 dimension from FMAOp at each pattern iteration. Add the recursively bounded property declaration to the pattern so that the rewriter can apply it multiple times. Without this, FMAOps with 3+D do not lower to LLVM. Differential Revision: https://reviews.llvm.org/D113886	2021-11-15 12:41:52 +00:00
Alexander Belyaev	9b1d90e8ac	[mlir] Move min/max ops from Std to Arith. Differential Revision: https://reviews.llvm.org/D113881	2021-11-15 13:19:17 +01:00
Nicolas Vasilache	f67171ac58	[mlir][Linalg] Make depthwise convolution naming scheme consistent. Names should be consistent across all operations otherwise painful bugs will surface. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D113762	2021-11-15 07:54:29 +00:00
Thomas Raoux	e7969240dc	[mlir][VectorToGPU] Support more cases in conversion to MMA ops Support load with broadcast, elementwise divf op and remove the hardcoded restriction on the vector size. Picking the right size should be enfored by user and will fail conversion to llvm/spirv if it is not supported. Differential Revision: https://reviews.llvm.org/D113618	2021-11-11 13:10:38 -08:00
Rob Suderman	860d3811a9	[mlir][tosa] Add lowering for tosa.pad with explicit value New TOSA pad operation can support explicitly specifying the pad value. Added lowering to linalg that uses the explicit value. Differential Revision: https://reviews.llvm.org/D113515	2021-11-10 14:15:20 -08:00
thomasraoux	f309939d06	[mlir][nvvm] Remove special case ptr arithmetic lowering in gpu to nvvm Use existing helper instead of handling only a subset of indices lowering arithmetic. Also relax the restriction on the memref rank for the GPU mma ops as we can now support any rank. Differential Revision: https://reviews.llvm.org/D113383	2021-11-10 10:00:12 -08:00
Alex Zinenko	e64c76672f	[mlir] recursively convert builtin types to LLVM when possible Given that LLVM dialect types may now optionally contain types from other dialects, which itself is motivated by dialect interoperability and progressive lowering, the conversion should no longer assume that the outermost LLVM dialect type can be left as is. Instead, it should inspect the types it contains and attempt to convert them to the LLVM dialect. Introduce this capability for LLVM array, pointer and structure types. Only literal structures are currently supported as handling identified structures requires the converison infrastructure to have a mechanism for avoiding infite recursion in case of recursive types. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D112550	2021-11-10 18:11:00 +01:00
thomasraoux	7fbb0678fa	[mlir][VectorToGPU] Add support for elementwise mma to vector to GPU Differential Revision: https://reviews.llvm.org/D112960	2021-11-02 08:01:04 -07:00
thomasraoux	8a992b20db	[mlir][gpu] Add basic support to do elementwise ops on mma matrix type In order to support fusion with mma matrix type we need to be able to execute elementwise operations on them. This add an op to be able to support some basic elementwise operations. This is a is not a full solution as it only supports a limited scope or operations. Ideally we would want to be able to fuse with more kind of operations. Differential Revision: https://reviews.llvm.org/D112857	2021-11-01 11:51:19 -07:00
thomasraoux	77eafb8430	[mlir][nvvm] Generalize wmma ops to handle more types and shapes wmma intrinsics have a large number of combinations, ideally we want to be able to target all the different variants. To avoid a combinatorial explosion in the number of mlir op we use attributes to represent the different variation of load/store/mma ops. We also can generate with tablegen helpers to know which combinations are available. Using this we can avoid having too hardcode a path for specific shapes and can support more types. This patch also adds boiler plates for tf32 op support. Differential Revision: https://reviews.llvm.org/D112689	2021-11-01 10:27:26 -07:00
Weiwei Li	3483fc5a31	[mlir][SPIRVToLLVM] Add shufflevector conversion Add the shufflevector conversion. It only handles the static, i.e., IntegerAttr, index. Co-authored: Xinyi Liu <xyliuhelen@gmail.com> Reviewed by: antiagainst Differential revision: https://reviews.llvm.org/D112161	2021-11-01 23:05:37 +08:00
xndcn	6e2c0e6931	[mlir][spirv] Add conversions from arith.bitcast, std.br, std.cond_br to spirv. Differential Revision: https://reviews.llvm.org/D112819	2021-10-31 00:40:35 +08:00
thomasraoux	eacd6e1ebe	[mlir][GPUtoNVVM] Relax restriction on wmma op lowering Allow lowering of wmma ops with 64bits indexes. Change the default version of the test to use default layout. Differential Revision: https://reviews.llvm.org/D112479	2021-10-27 21:31:55 -07:00
Nicolas Vasilache	00ac874ff6	[mlir][Vector] Add InsertStridedSliceOp -> ShuffleOp for the rank-1 cases. This also fixes the vector.shuffle C++ builder which had an incorrect type assumption that triggers with this new rewrite. The vector.shuffle semantics were correct though. Differential revision: https://reviews.llvm.org/D112578	2021-10-27 07:57:17 +00:00
River Riddle	015192c634	[mlir:DialectConversion] Restructure how argument/target materializations get invoked The current implementation invokes materializations whenever an input operand does not have a mapping for the desired type, i.e. it requires materialization at the earliest possible point. This conflicts with goal of dialect conversion (and also the current documentation) which states that a materialization is only required if the materialization is supposed to persist after the conversion process has finished. This revision refactors this such that whenever a target materialization "might" be necessary, we insert an unrealized_conversion_cast to act as a temporary materialization. This allows for deferring the invocation of the user materialization hooks until the end of the conversion process, where we actually have a better sense if it's actually necessary. This has several benefits: * In some cases a target materialization hook is no longer necessary When performing a full conversion, there are some situations where a temporary materialization is necessary. Moving forward, these users won't need to provide any target materializations, as the temporary materializations do not require the user to provide materialization hooks. * getRemappedValue can now handle values that haven't been converted yet Before this commit, it wasn't well supported to get the remapped value of a value that hadn't been converted yet (making it difficult/impossible to convert multiple operations in many situations). This commit updates getRemappedValue to properly handle this case by inserting temporary materializations when necessary. Another code-health related benefit is that with this change we can move a majority of the complexity related to materializations to the end of the conversion process, instead of handling adhoc while conversion is happening. Differential Revision: https://reviews.llvm.org/D111620	2021-10-27 02:09:04 +00:00
Robert Suderman	58901a5a29	[mlir][tosa] Correct tosa.avg_pool2d for specification error Specification specified the output type for quantized average pool should be an i32. Only accumulator should be an i32, result type should match the input type. Caused in https://reviews.llvm.org/D111590 Reviewed By: sjarus, GMNGeoffrey Differential Revision: https://reviews.llvm.org/D112484	2021-10-25 14:41:16 -07:00
Boian Petkantchin	f1b922188e	[MLIR][Math] Add erf to math dialect Add math.erf lowering to libm call. Add math.erf polynomial approximation. Reviewed By: silvas, ezhulenev Differential Revision: https://reviews.llvm.org/D112200	2021-10-25 18:30:17 +00:00
Geoffrey Martin-Noble	efc6fe963c	[MLIR][TOSA] Drop "OnTensors" suffix This is the only lowering to Linalg Tosa has, so it's needlessly verbose. Likely this was a carry over from IREE's usage where we originally lowered to linalg on buffers (the only linalg that existed at the time), so the everything on tensors needed the suffix. We're dropping it in IREE also, having transitioned entirely to using Linalg on tensors. Reviewed By: sjarus Differential Revision: https://reviews.llvm.org/D111911	2021-10-15 16:01:19 -07:00
Rob Suderman	59dd418e89	[mlir][tosa] Fix tosa.cast UiToFp32 for tosa-to-linalg Part of the arith update broke UiToFp32. Fixed the lowering and included a new test to detect a regression. Differential Revision: https://reviews.llvm.org/D111772	2021-10-14 11:34:10 -07:00
Mogball	a54f4eae0e	[MLIR] Replace std ops with arith dialect ops Precursor: https://reviews.llvm.org/D110200 Removed redundant ops from the standard dialect that were moved to the `arith` or `math` dialects. Renamed all instances of operations in the codebase and in tests. Reviewed By: rriddle, jpienaar Differential Revision: https://reviews.llvm.org/D110797	2021-10-13 03:07:03 +00:00
Weiwei Li	c0a6381e49	[mlir][SPIRVToLLVM] Solve ExecutionModeOp redefinition and add OpTypeSampledImage into SPV_Type 1. To avoid two ExecutionModeOp using the same name, adding the value of execution mode in name when converting to LLVM dialect. 2. To avoid syntax error in spv.OpLoad, add OpTypeSampledImage into SPV_Type. Reviewed by:antiagainst Differential revision:https://reviews.llvm.org/D111193	2021-10-13 10:03:25 +08:00
Rob Suderman	95e4b71519	[mlir][tosa] Fix tosa average_pool2d to linalg type issue Average pool assumed the same input/output type. Result type for integers is always an i32, should be updated appropriately. Reviewed By: GMNGeoffrey Differential Revision: https://reviews.llvm.org/D111590	2021-10-12 13:09:21 -07:00
Nicolas Vasilache	0a7f81a451	mlir][Vector] Fix spuriously disabled test.	2021-10-12 12:56:40 +00:00
Nicolas Vasilache	0c74b12a2e	[mlir][Vector] NFC - Add test to exercise lowering of vector.transfer to scf This revision also renames and moves some tests around. Differential Revision: https://reviews.llvm.org/D111606	2021-10-12 12:38:33 +00:00
Eugene Zhulenev	8276ac13e9	[mlir] Add alignment attribute to memref.global Revived https://reviews.llvm.org/D102435 Add alignment attribute to `memref.global` and propagate it to llvm global in memref->llvm lowering Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D111309	2021-10-07 06:21:57 -07:00
natashaknk	4c48f7e29b	[mlir][tosa] Create basic dynamic shape support for several ops. Transpose, Matmul and Fully-connected dynamic shape support Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D111167	2021-10-06 10:36:04 -07:00
Lei Zhang	7a89444cd9	[mlir][spirv] Add ops and patterns for lowering standard max/min ops Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D111143	2021-10-05 14:27:32 -04:00
Rob Suderman	d5a4c86d14	[mlir][tosa] tosa.cast support for unsigned integers Unsigned integers need to be handled for cast to floating point. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D111102	2021-10-05 10:57:16 -07:00
Adrian Kuegel	d009f6e51c	[mlir] Convert ConstShapeOp to a static tensor type. ConstShapeOp knows its shape, so it should also have a static tensor type. Differential Revision: https://reviews.llvm.org/D111127	2021-10-05 12:14:43 +02:00
Weiwei Li	1e4cfe5e4f	[mlir][SPIRVToLLVM] Propagate location attribute from spv.GlobalVariable to llvm.mlir.global This patch is mainly to propogate location attribute from spv.GlobalVariable to llvm.mlir.global. It also contains three small changes. 1. Remove the restriction on UniformConstant In SPIRVToLLVM.cpp; 2. Remove the errorCheck on relaxedPrecision when deserializering SPIR-V in Deserializer.cpp 3. In SPIRVOps.cpp, let ConstantOp take signedInteger too. Co-authered: Alan Liu <alanliu.yf@gmail.com> and Xinyi Liu <xyliuhelen@gmail.com> Reviewed by:antiagainst Differential revision: https://reviews.llvm.org/D110207	2021-10-05 00:09:09 +08:00
Uday Bondhugula	08b63db8bb	[MLIR][GPU] Add GPU launch op support for dynamic shared memory Add support for dynamic shared memory for GPU launch ops: add an optional operand to gpu.launch and gpu.launch_func ops to specify the amount of "dynamic" shared memory to use. Update lowerings to connect this operand to the GPU runtime. Differential Revision: https://reviews.llvm.org/D110800	2021-10-01 16:46:07 +05:30
William S. Moses	6dd5b1e33e	[MLIR][LLVM] Add error if using incorrect attribute type for specifying LLVM linkage Address post-commit review in https://reviews.llvm.org/D108524 to add appropriate diagnostics. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D110566	2021-09-27 13:24:05 -04:00
Eugene Zhulenev	92db09cde0	[mlir] AsyncRuntime: use int64_t for ref counting operations Workaround for SystemZ ABI problem: https://bugs.llvm.org/show_bug.cgi?id=51898 Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D110550	2021-09-27 07:55:01 -07:00
Kiran Chandramohan	187d9f8cd9	[OpenMP][MLIR] Add a conversion pattern for the master op The conversion pattern is particularly useful for conversion of block arguments in the master op. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D109610	2021-09-12 10:13:40 +00:00
natashaknk	d4d50e4710	[mlir][tosa] Add lowering for tosa.clz using scf::whileOp Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D109540	2021-09-09 15:57:35 -07:00
Alex Zinenko	8b58ab8ccd	[mlir] Factor type reconciliation out of Standard-to-LLVM conversion Conversion to the LLVM dialect is being refactored to be more progressive and is now performed as a series of independent passes converting different dialects. These passes may produce `unrealized_conversion_cast` operations that represent pending conversions between built-in and LLVM dialect types. Historically, a more monolithic Standard-to-LLVM conversion pass did not need these casts as all operations were converted in one shot. Previous refactorings have led to the requirement of running the Standard-to-LLVM conversion pass to clean up `unrealized_conversion_cast`s even though the IR had no standard operations in it. The pass must have been also run the last among all to-LLVM passes, in contradiction with the partial conversion logic. Additionally, the way it was set up could produce invalid operations by removing casts between LLVM and built-in types even when the consumer did not accept the uncasted type, or could lead to cryptic conversion errors (recursive application of the rewrite pattern on `unrealized_conversion_cast` as a means to indicate failure to eliminate casts). In fact, the need to eliminate A->B->A `unrealized_conversion_cast`s is not specific to to-LLVM conversions and can be factored out into a separate type reconciliation pass, which is achieved in this commit. While the cast operation itself has a folder pattern, it is insufficient in most conversion passes as the folder only applies to the second cast. Without complex legality setup in the conversion target, the conversion infra will either consider the cast operations valid and not fold them (a separate canonicalization would be necessary to trigger the folding), or consider the first cast invalid upon generation and stop with error. The pattern provided by the reconciliation pass applies to the first cast operation instead. Furthermore, having a separate pass makes it clear when `unrealized_conversion_cast`s could not have been eliminated since it is the only reason why this pass can fail. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D109507	2021-09-09 16:51:24 +02:00
Alex Zinenko	1ce752b741	[mlir] support reductions in SCF to OpenMP conversion OpenMP reductions need a neutral element, so we match some known reduction kinds (integer add/mul/or/and/xor, float add/mul, integer and float min/max) to define the neutral element and the atomic version when possible to express using atomicrmw (everything except float mul). The SCF-to-OpenMP pass becomes a module pass because it now needs to introduce new symbols for reduction declarations in the module. Reviewed By: chelini Differential Revision: https://reviews.llvm.org/D107549	2021-09-09 13:04:27 +02:00
Eugene Zhulenev	fd52b4357a	[mlir] Async: check awaited operand error state after sync await Previously only await inside the async function (coroutine after lowering to async runtime) would check the error state Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D109229	2021-09-04 05:00:17 -07:00
Loren Maggiore	361458b1ce	[mlir] create gpu memset op Create a gpu memset op and corresponding CUDA and ROCm wrappers. Reviewed By: herhut, lorenrose1013 Differential Revision: https://reviews.llvm.org/D107548	2021-09-04 08:13:04 +02:00
William S. Moses	21d43daf8f	[MLIR] Primitive linkage lowering of FuncOp FuncOp always lowers to an LLVM external linkage presently. This makes it impossible to define functions in mlir which are local to the current module. Until MLIR FuncOps have a more formal linkage specification, this commit allows funcop's to have an optionally specified llvm.linkage attribute, whose value will be used as the linkage of the llvm funcop when lowered. Differential Revision: https://reviews.llvm.org/D108524 Support LLVM linkage	2021-09-03 20:41:39 -04:00
natashaknk	f596acc74d	[mlir][tosa] Small refactor to the functionality of Depthwise_Conv2D to add the bias at the end of the convolution Follow-up to the Conv2d and fully_connected lowering adjustments Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D108949	2021-09-01 10:01:00 -07:00
Mehdi Amini	387f95541b	Add a new interface allowing to set a default dialect to be used for printing/parsing regions Currently the builtin dialect is the default namespace used for parsing and printing. As such module and func don't need to be prefixed. In the case of some dialects that defines new regions for their own purpose (like SpirV modules for example), it can be beneficial to change the default dialect in order to improve readability. Differential Revision: https://reviews.llvm.org/D107236	2021-08-31 17:52:40 +00:00

1 2 3 4 5 ...

904 Commits