llvm-project

Commit Graph

Author	SHA1	Message	Date
Nicolas Vasilache	a664c14001	[mlir][LLVM] Revert bareptr calling convention handling as an argument materialization. Type conversion and argument materialization are context-free: there is no available information on which op / branch is currently being converted. As a consequence, bare ptr convention cannot be handled as an argument materialization: it would apply irrespectively of the parent op. This doesn't typecheck in the case of non-funcOp and we would see cases where a memref descriptor would be inserted in place of the pointer in another memref descriptor. For now the proper behavior is to revert to a specific BarePtrFunc implementation and drop the blanket argument materialization logic. This reverts the relevant piece of the conversion to LLVM to what it was before https://reviews.llvm.org/D105880 and adds a relevant test and documentation to avoid the mistake by whomever attempts this again in the future. Reviewed By: arpith-jacob Differential Revision: https://reviews.llvm.org/D106495	2021-07-21 22:06:50 +00:00
Rob Suderman	40a02fae87	[mlir][tosa] Added tosa to linalg lowering to unstrided transposed conv The unstrided transposed conv can be represented as a regular convolution. Lower to this variant to handle the basic case. This includes transitioning from the TC defined convolution operation and a yaml defined one. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D106389	2021-07-20 15:07:08 -07:00
Rob Suderman	6bf0f6a4f7	[mlir][tosa] Add quantized lowering for matmul and fully_connected Added the named op variants for quantized matmul and quantized batch matmul with the necessary lowerings/tests from tosa's matmul/fully connected ops. Current version does not use the contraction op interface as its verifiers are not compatible with scalar operations. Differential Revision: https://reviews.llvm.org/D105063	2021-07-20 12:58:02 -07:00
Yi Zhang	381c3b9299	Dyanamic shape support for memref reassociation reshape ops Only memref with identity layout map is supported for now. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D106180	2021-07-19 15:14:36 -07:00
Hanhan Wang	9c49195330	[mlir][Linalg] Migrate 2D pooling ops from tc definition to yaml definition. This deletes all the pooling ops in LinalgNamedStructuredOpsSpec.tc. All the uses are replaced with the yaml pooling ops. Reviewed By: gysit, rsuderman Differential Revision: https://reviews.llvm.org/D106181	2021-07-19 09:24:02 -07:00
Matthias Springer	d1a9e9a7cb	[mlir][vector] Remove vector.transfer_read/write to LLVM lowering This simplifies the vector to LLVM lowering. Previously, both vector.load/store and vector.transfer_read/write lowered directly to LLVM. With this commit, there is a single path to LLVM vector load/store instructions and vector.transfer_read/write ops must first be lowered to vector.load/store ops. * Remove vector.transfer_read/write to LLVM lowering. * Allow non-unit memref strides on all but the most minor dimension for vector.load/store ops. * Add maxTransferRank option to populateVectorTransferLoweringPatterns. * vector.transfer_reads with changing element type can no longer be lowered to LLVM. (This functionality is needed only for SPIRV.) Differential Revision: https://reviews.llvm.org/D106118	2021-07-17 14:07:27 +09:00
Alex Zinenko	a24e020d1a	[mlir] add missing build dependency	2021-07-16 15:20:07 +02:00
Alex Zinenko	881dc34f73	[mlir] replace llvm.mlir.cast with unrealized_conversion_cast The dialect-specific cast between builtin (ex-standard) types and LLVM dialect types was introduced long time before built-in support for unrealized_conversion_cast. It has a similar purpose, but is restricted to compatible builtin and LLVM dialect types, which may hamper progressive lowering and composition with types from other dialects. Replace llvm.mlir.cast with unrealized_conversion_cast, and drop the operation that became unnecessary. Also make unrealized_conversion_cast legal by default in LLVMConversionTarget as the majority of convesions using it are partial conversions that actually want the casts to persist in the IR. The standard-to-llvm conversion, which is still expected to run last, cleans up the remaining casts standard-to-llvm conversion, which is still expected to run last, cleans up the remaining casts Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D105880	2021-07-16 15:14:09 +02:00
Alexander Belyaev	46ef86b5d8	[mlir] Move linalg::Expand/CollapseShapeOp to memref dialect. RFC: https://llvm.discourse.group/t/rfc-reshape-ops-restructuring/3310 Differential Revision: https://reviews.llvm.org/D106141	2021-07-16 13:32:17 +02:00
Adrian Kuegel	74b88807ae	[mlir][rocdl] Add math::Exp2Op lowering to ROCDL Differential Revision: https://reviews.llvm.org/D106057	2021-07-15 14:33:04 +02:00
Adrian Kuegel	ffe6a58325	[mlir][nvvm]: Add math::Exp2Op lowering to NVVM. Differential Revision: https://reviews.llvm.org/D106050	2021-07-15 13:06:30 +02:00
Frederik Gossen	ed1f149b54	[MLIR][StandardToLLVM] Move `copyUnrankedDescriptors` to pattern Make the function `copyUnrankedDescriptors` accessible in `ConvertToLLVMPattern`. Differential Revision: https://reviews.llvm.org/D105810	2021-07-12 15:35:07 +02:00
Alex Zinenko	26e59cc19f	[mlir] factor math-to-llvm out of standard-to-llvm After the Math has been split out of the Standard dialect, the conversion to the LLVM dialect remained as a huge monolithic pass. This is undesirable for the same complexity management reasons as having a huge Standard dialect itself, and is even more confusing given the existence of a separate dialect. Extract the conversion of the Math dialect operations to LLVM into a separate library and a separate conversion pass. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D105702	2021-07-12 11:09:42 +02:00
Alex Zinenko	75e5f0aac9	[mlir] factor memref-to-llvm lowering out of std-to-llvm After the MemRef has been split out of the Standard dialect, the conversion to the LLVM dialect remained as a huge monolithic pass. This is undesirable for the same complexity management reasons as having a huge Standard dialect itself, and is even more confusing given the existence of a separate dialect. Extract the conversion of the MemRef dialect operations to LLVM into a separate library and a separate conversion pass. Reviewed By: herhut, silvas Differential Revision: https://reviews.llvm.org/D105625	2021-07-09 14:49:52 +02:00
Alex Zinenko	684dfe8adb	[mlir] factor out ConvertToLLVMPattern This class and classes that extend it are general utilities for any dialect that is being converted into the LLVM dialect. They are in no way specific to Standard-to-LLVM conversion and should not make their users depend on it. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D105542	2021-07-08 10:27:05 +02:00
William S. Moses	9a11c70c18	[SCF] Handle lowering of Execute region to Standard CFG Lower SCF.executeregionop to llvm by essentially inlining the region and replacing the return Differential Revision: https://reviews.llvm.org/D105567	2021-07-07 15:27:21 -04:00
William S. Moses	eaf22ba011	[MLIR] Provide lowering of std switch to llvm switch This patch allows lowering of std switch to llvm switch Differential Revision: https://reviews.llvm.org/D105580	2021-07-07 15:25:55 -04:00
thomasraoux	291025389c	[mlir][vector] Refactor Vector Unrolling and remove Tuple ops Simplify vector unrolling pattern to be more aligned with rest of the patterns and be closer to vector distribution. The new implementation uses ExtractStridedSlice/InsertStridedSlice instead of the Tuple ops. After this change the ops based on Tuple don't have any more used so they can be removed. This allows removing signifcant amount of dead code and will allow extending the unrolling code going forward. Differential Revision: https://reviews.llvm.org/D105381	2021-07-07 11:11:26 -07:00
Alexander Belyaev	6412a13539	[mlir] Move common reshapeops-related code to ReshapeOpsUtils.h. This is a first step to move (Tensor)Expand/CollapseShapeOp to tensor/memref dialects. Differential Revision: https://reviews.llvm.org/D105547	2021-07-07 14:56:16 +02:00
Adrian Kuegel	6e80e3bd1b	Add Log1pOp to complex dialect. Also add a lowering pattern from Complex to Standard/Math dialect. Differential Revision: https://reviews.llvm.org/D105538	2021-07-07 11:33:54 +02:00
Alex Zinenko	b5d847b1b9	[mlir] factor out common parts of the converstion to the LLVM dialect "Standard-to-LLVM" conversion is one of the oldest passes in existence. It has become quite large due to the size of the Standard dialect itself, which is being split into multiple smaller dialects. Furthermore, several conversion features are useful for any dialect that is being converted to the LLVM dialect, which, without this refactoring, creates a dependency from those conversions to the "standard-to-llvm" one. Put several of the reusable utilities from this conversion to a separate library, namely: - type converter from builtin to LLVM dialect types; - utility for building and accessing values of LLVM structure type; - utility for building and accessing values that represent memref in the LLVM dialect; - lowering options applicable everywhere. Additionally, remove the type wrapping/unwrapping notion from the type converter that is no longer relevant since LLVM types has been reimplemented as first-class MLIR types. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D105534	2021-07-07 10:51:08 +02:00
Uday Bondhugula	4acf3807e3	[MLIR] Split out GPU ops library from Transforms Split out GPU ops library from GPU transforms. This allows libraries to depend on GPU Ops without needing/building its transforms. Differential Revision: https://reviews.llvm.org/D105472	2021-07-07 11:26:49 +05:30
Adrian Kuegel	bf17ee1950	Add MulOp lowering from Complex dialect to Standard/Math dialect. The lowering handles special cases with NaN or infinity like C++. Differential Revision: https://reviews.llvm.org/D105270	2021-07-05 12:51:51 +02:00
Adrian Kuegel	380fa71fb0	[mlir] Add LogOp lowering from Complex dialect to Standard/Math dialect. Differential Revision: https://reviews.llvm.org/D105342	2021-07-05 09:33:45 +02:00
Matthias Springer	2c115ecc41	[mlir][NFC] MemRef cleanup: Remove helper functions Remove `getDynOperands` and `createOrFoldDimOp` from MemRef.h to decouple MemRef a bit from Tensor. These two functions are used in other dialects/transforms. Differential Revision: https://reviews.llvm.org/D105260	2021-07-05 10:10:21 +09:00
Matthias Springer	c0a6318d96	[mlir][tensor] Add tensor.dim operation * Split memref.dim into two operations: memref.dim and tensor.dim. Both ops have the same builder interface and op argument names, so that they can be used with templates in patterns that apply to both tensors and memrefs (e.g., some patterns in Linalg). * Add constant materializer to TensorDialect (needed for folding in affine.apply etc.). * Remove some MemRefDialect dependencies, make some explicit. Differential Revision: https://reviews.llvm.org/D105165	2021-07-01 10:00:19 +09:00
thomasraoux	0298f2cfb1	[mlir] Fix wrong type in WmmaConstantOpToNVVMLowering InsertElement takes a scalar integer attribute not an array of integer. Differential Revision: https://reviews.llvm.org/D105174	2021-06-30 09:10:02 -07:00
thomasraoux	4392841949	[mlir][VectorToGPU] Support converting vetor.broadcast to MMA op Differential Revision: https://reviews.llvm.org/D105175	2021-06-30 09:08:55 -07:00
Felipe de Azevedo Piovezan	8ca04b0513	[mlir] Add support for LLVM's dso_local attr This patch brings support for setting runtime preemption specifiers of LLVM's GlobalValues. In LLVM semantics, if the `dso_local` attribute is not explicitly requested, then it is inferred based on linkage and visibility. We model this same behavior with a UnitAttribute: if it is present, then we explicitly request the GlobalValue to marked as `dso_local`, otherwise we rely on the GlobalValue itself to make this decision. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D104983	2021-06-29 15:00:48 +02:00
Stephan Herhut	88d5eba139	Revert "Revert "[mlir][memref] Implement lowering of memref.copy to llvm"" This reverts commit `7d6e589fc8`. Windows build was unbroken.	2021-06-28 18:48:00 +02:00
Jacques Pienaar	7d6e589fc8	Revert "[mlir][memref] Implement lowering of memref.copy to llvm" This reverts commit `e939644977`. Breaks Windows build.	2021-06-28 07:50:11 -07:00
Stephan Herhut	e939644977	[mlir][memref] Implement lowering of memref.copy to llvm This lowering uses a library call to implement copying in the general case, i.e., supporting arbitrary rank and strided layouts.	2021-06-28 14:52:07 +02:00
Matthias Springer	0813700de1	[mlir][NFC] Cleanup: Move helper functions to StaticValueUtils Reduce code duplication: Move various helper functions, that are duplicated in TensorDialect, MemRefDialect, LinalgDialect, StandardDialect, into a new StaticValueUtils.cpp. Differential Revision: https://reviews.llvm.org/D104687	2021-06-27 15:56:48 +09:00
Eugene Zhulenev	34a164c938	[mlir:Async] Submit accidentally omitted changes Accidentally pushed old branches that did not include all the changes discussed in the PRs. https://reviews.llvm.org/rGd43b23608ad664f02f56e965ca78916bde220950 https://reviews.llvm.org/rG86ad0af87054c3cccd68d32e103a6f1f6c6194c7 Differential Revision: https://reviews.llvm.org/D104943	2021-06-25 12:23:02 -07:00
Eugene Zhulenev	d43b23608a	[mlir:Async] Add the size parameter to the async.group Specify the `!async.group` size (the number of tokens that will be added to it) at construction time. `async.await_all` operation can potentially race with `async.execute` operations that keep updating the group, for this reason it is required to know upfront how many tokens will be added to the group. Reviewed By: ftynse, herhut Differential Revision: https://reviews.llvm.org/D104780	2021-06-25 10:26:50 -07:00
thomasraoux	1a86559276	[mlir][VectorToGPU] Add conversion for scf::For op with Matrix operands Differential Revision: https://reviews.llvm.org/D104134	2021-06-24 15:42:28 -07:00
thomasraoux	6413226dce	[mlir][VectorToGPU] Add conversion for splat constant to MMA const matrix Differential Revision: https://reviews.llvm.org/D104133	2021-06-24 15:38:12 -07:00
William S. Moses	929189a499	[MLIR][LLVM] Expose type translator from LLVM to MLIR Type This commit moves the type translator from LLVM to MLIR to a public header for use by external projects or other code. Unlike a previous attempt (https://reviews.llvm.org/D104726), this patch moves the type conversion into separate files which remedies the linker error which was only caught by CI. Differential Revision: https://reviews.llvm.org/D104834	2021-06-24 12:06:34 -04:00
Tobias Gysi	f1844f15c1	[mlir][linalg] Change the FillOp library call signature. Adapt the FillOp library call signature to the updated operand order introduced in https://reviews.llvm.org/D10412. The patch reverts the special treatment of FillOp in LinalgToStandard. Differential Revision: https://reviews.llvm.org/D104360	2021-06-23 09:37:14 +00:00
Tobias Gysi	7cef24ee83	[mlir][linalg] Adapt the FillOp builder signature. Change the build operand order from output, value to value, output. The patch makes the argument order consistent with the pretty printed order updated by https://reviews.llvm.org/D104356. Differential Revision: https://reviews.llvm.org/D104359	2021-06-23 08:06:43 +00:00
Matthias Springer	060208b4c8	[mlir][NFC] Move SubTensorOp and SubTensorInsertOp to TensorDialect The main goal of this commit is to remove the dependency of Standard dialect on the Tensor dialect. * Rename SubTensorOp -> tensor.extract_slice, SubTensorInsertOp -> tensor.insert_slice. * Some helper functions are (already) duplicated between the Tensor dialect and the MemRef dialect. To keep this commit smaller, this will be cleaned up in a separate commit. * Additional dialect dependencies: Shape --> Tensor, Tensor --> Standard * Remove dialect dependencies: Standard --> Tensor * Move canonicalization test cases to correct dialect (Tensor/MemRef). Note: This is a fixed version of https://reviews.llvm.org/D104499, which was reverted due to a missing update to two CMakeFile.txt. Differential Revision: https://reviews.llvm.org/D104676	2021-06-22 17:55:53 +09:00
Tobias Gysi	4882cacf12	[mlir][linalg] Adapt FillOp to use a scalar operand. Adapt the FillOp definition to use a scalar operand instead of a capture. This patch is a follow up to https://reviews.llvm.org/D104109. As the input operands are in front of the output operands the patch changes the internal operand order of the FillOp. The pretty printed version of the operation remains unchanged though. The patch also adapts the linalg to standard lowering to ensure the c signature of the FillOp remains unchanged as well. Differential Revision: https://reviews.llvm.org/D104121	2021-06-22 06:44:52 +00:00
Mehdi Amini	60d97fb4cf	Revert "[mlir][NFC] Move SubTensorOp and SubTensorInsertOp to TensorDialect" This reverts commit `83bf801f5f`. This breaks the build with -DBUILD_SHARED_LIBS=ON	2021-06-21 16:39:24 +00:00
Matthias Springer	83bf801f5f	[mlir][NFC] Move SubTensorOp and SubTensorInsertOp to TensorDialect The main goal of this commit is to remove the dependency of Standard dialect on the Tensor dialect. * Rename ops: SubTensorOp --> ExtractTensorOp, SubTensorInsertOp --> InsertTensorOp * Some helper functions are (already) duplicated between the Tensor dialect and the MemRef dialect. To keep this commit smaller, this will be cleaned up in a separate commit. * Additional dialect dependencies: Shape --> Tensor, Tensor --> Standard * Remove dialect dependencies: Standard --> Tensor * Move canonicalization test cases to correct dialect (Tensor/MemRef). Differential Revision: https://reviews.llvm.org/D104499	2021-06-22 00:11:21 +09:00
Matthias Springer	66f878cee9	[mlir][NFC] Remove Standard dialect dependency on MemRef dialect * Remove dependency: Standard --> MemRef * Add dependencies: GPUToNVVMTransforms --> MemRef, Linalg --> MemRef, MemRef --> Tensor * Note: The `subtensor_insert_propagate_dest_cast` test case in MemRef/canonicalize.mlir will be moved to Tensor/canonicalize.mlir in a subsequent commit, which moves over the remaining Tensor ops from the Standard dialect to the Tensor dialect. Differential Revision: https://reviews.llvm.org/D104506	2021-06-21 17:55:23 +09:00
Alexander Belyaev	5b3cb31edb	[mlir][linalg] Purge linalg.indexed_generic. Differential Revision: https://reviews.llvm.org/D104449	2021-06-17 14:45:37 +02:00
Arpith C. Jacob	dd1992efd3	Support lowering of index-cast on vector types. The index cast operation accepts vector types. Implement its lowering in this patch. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D104280	2021-06-15 12:51:30 -07:00
Adrian Kuegel	f112bd61eb	[mlir] Add SignOp to complex dialect. Also add a conversion pattern from Complex Dialect to Standard/Math Dialect. Differential Revision: https://reviews.llvm.org/D104292	2021-06-15 15:22:31 +02:00
Adrian Kuegel	662e074d90	[mlir] Add NegOp to complex dialect. Also add a lowering pattern from complex dialect to standard dialect. Differential Revision: https://reviews.llvm.org/D104284	2021-06-15 12:16:22 +02:00
Christian Sigg	abe501f240	[mlir] Mark gpu dialect illegal in gpu-to-llvm conversion Reviewed By: herhut, bondhugula Differential Revision: https://reviews.llvm.org/D104208	2021-06-14 17:45:44 +02:00
Guillaume Chatelet	1d49e5352f	[llvm] remove Sequence::asSmallVector() There's no need for `toSmallVector()` as `SmallVector.h` already provides a `to_vector` free function that takes a range. Reviewed By: Quuxplusone Differential Revision: https://reviews.llvm.org/D104024	2021-06-14 08:28:05 +00:00
Adrian Kuegel	73cbc91c93	[mlir] Add ExpOp to Complex dialect. Also add a conversion pattern from Complex to Standard/Math dialect. Differential Revision: https://reviews.llvm.org/D104108	2021-06-14 08:08:53 +02:00
Denys Shabalin	fdc0d4360b	Introduce alloca_scope op ## Introduction This proposal describes the new op to be added to the `std` (and later moved `memref`) dialect called `alloca_scope`. ## Motivation Alloca operations are easy to misuse, especially if one relies on it while doing rewriting/conversion passes. For example let's consider a simple example of two independent dialects, one defines an op that wants to allocate on-stack and another defines a construct that corresponds to some form of looping: ``` dialect1.looping_op { %x = dialect2.stack_allocating_op } ``` Since the dialects might not know about each other they are going to define a lowering to std/scf/etc independently: ``` scf.for … { %x_temp = std.alloca … … // do some domain-specific work using %x_temp buffer … // and store the result into %result %x = %result } ``` Later on the scf and `std.alloca` is going to be lowered to llvm using a combination of `llvm.alloca` and unstructured control flow. At this point the use of `%x_temp` is bound to either be either optimized by llvm (for example using mem2reg) or in the worst case: perform an independent stack allocation on each iteration of the loop. While the llvm optimizations are likely to succeed they are not guaranteed to do so, and they provide opportunities for surprising issues with unexpected use of stack size. ## Proposal We propose a new operation that defines a finer-grain allocation scope for the alloca-allocated memory called `alloca_scope`: ``` alloca_scope { %x_temp = alloca … ... } ``` Here the lifetime of `%x_temp` is going to be bound to the narrow annotated region within `alloca_scope`. Moreover, one can also return values out of the alloca_scope with an accompanying `alloca_scope.return` op (that behaves similarly to `scf.yield`): ``` %result = alloca_scope { %x_temp = alloca … … alloca_scope.return %myvalue } ``` Under the hood the `alloca_scope` is going to lowered to a combination of `llvm.intr.stacksave` and `llvm.intr.strackrestore` that are going to be invoked automatically as control-flow enters and leaves the body of the `alloca_scope`. The key value of the new op is to allow deterministic guaranteed stack use through an explicit annotation in the code which is finer-grain than the function-level scope of `AutomaticAllocationScope` interface. `alloca_scope` can be inserted at arbitrary locations and doesn’t require non-trivial transformations such as outlining. ## Which dialect Before memref dialect is split, `alloca_scope` can temporarily reside in `std` dialect, and later on be moved to `memref` together with the rest of memory-related operations. ## Implementation An implementation of the op is available [here](https://reviews.llvm.org/D97768). Original commits: * Add initial scaffolding for alloca_scope op * Add alloca_scope.return op * Add no region arguments and variadic results * Add op descriptions * Add failing test case * Add another failing test * Initial implementation of lowering for std.alloca_scope * Fix backticks * Fix getSuccessorRegions implementation Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D97768	2021-06-11 19:28:41 +02:00
thomasraoux	edd9515bd1	[mlir][VectorToGPU] First step to convert vector ops to GPU MMA ops This is the first step to convert vector ops to MMA operations in order to target GPUs tensor core ops. This currently only support simple cases, transpose and element-wise operation will be added later. Differential Revision: https://reviews.llvm.org/D102962	2021-06-11 07:52:32 -07:00
Benoit Jacob	20daedacca	2d Arm Neon sdot op, and lowering to the intrinsic. This adds Sdot2d op, which is similar to the usual Neon intrinsic except that it takes 2d vector operands, reflecting the structure of the arithmetic that it's performing: 4 separate 4-dimensional dot products, whence the vector<4x4xi8> shape. This also adds a new pass, arm-neon-2d-to-intr, lowering this new 2d op to the 1d intrinsic. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D102504	2021-06-10 14:36:39 -07:00
thomasraoux	428a62f65f	[mlir][gpu] Add op to create MMA constant matrix This allow creating a matrix with all elements set to a given value. This is needed to be able to implement a simple dot op. Differential Revision: https://reviews.llvm.org/D103870	2021-06-10 08:34:04 -07:00
Guillaume Chatelet	e0569033e2	[llvm] Make Sequence reverse-iterable This is a roll forward of D102679. This patch simplifies the implementation of Sequence and makes it compatible with llvm::reverse. It exposes the reverse iterators through rbegin/rend which prevents a dangling reference in std::reverse_iterator::operator++(). Note: Compared to D102679, this patch introduces a `asSmallVector()` member function and fixes compilation issue with GCC 5. Differential Revision: https://reviews.llvm.org/D103948	2021-06-10 11:15:28 +00:00
Rob Suderman	0e083cef70	[mlir][tosa] Update tosa.matmul lowering to linalg.batch_matmul tosa.matmul is a batched matmul, update the lowering for linalg with the tests. Reviewed By: sjarus Differential Revision: https://reviews.llvm.org/D103937	2021-06-09 11:05:36 -07:00
Lei Zhang	56f60a1ce7	[mlir][spirv] Use SingleBlock + NoTerminator for spv.module This allows us to remove the `spv.mlir.endmodule` op and all the code associated with it. Along the way, tightened the APIs for `spv.module` a bit by removing some aliases. Now we use `getRegion` to get the only region, and `getBody` to get the region's only block. Reviewed By: mravishankar, hanchung Differential Revision: https://reviews.llvm.org/D103265	2021-06-09 14:00:06 -04:00
thomasraoux	9b496c2373	[mlir][gpu][NFC] Simplify conversion of MMA type to NVVM Consolidate the type conversion in a single function to make it simpler to use. This allow to re-use the type conversion for up coming ops. Differential Revision: https://reviews.llvm.org/D103868	2021-06-09 09:33:38 -07:00
Mehdi Amini	a4e2cf712a	Revert "[llvm] Make Sequence reverse-iterable" This reverts commit `e772216e70` (and fixup `7f6c878a2c`). The build is broken with gcc5 host compiler: In file included from from mlir/lib/Dialect/Utils/StructuredOpsUtils.cpp:9: tools/mlir/include/mlir/IR/BuiltinAttributes.h.inc:424:57: error: type/value mismatch at argument 1 in template parameter list for 'template<class ItTy, class FuncTy, class FuncReturnTy> class llvm::mapped_iterator' std::function<T(ptrdiff_t)>>; ^ tools/mlir/include/mlir/IR/BuiltinAttributes.h.inc:424:57: note: expected a type, got 'decltype (seq<ptrdiff_t>(0, 0))::const_iterator'	2021-06-08 17:03:10 +00:00
Chris Lattner	92a79dbe91	[Core] Add Twine support for StringAttr and Identifier. NFC. This is both more efficient and more ergonomic than going through an std::string, e.g. when using llvm::utostr and in string concat cases. Unfortunately we can't just overload ::get(). This causes an ambiguity because both twine and stringref implicitly convert from std::string. Differential Revision: https://reviews.llvm.org/D103754	2021-06-08 09:47:07 -07:00
Guillaume Chatelet	e772216e70	[llvm] Make Sequence reverse-iterable This patch simplifies the implementation of Sequence and makes it compatible with llvm::reverse. It exposes the reverse iterators through rbegin/rend which prevents a dangling reference in std::reverse_iterator::operator++(). Differential Revision: https://reviews.llvm.org/D102679	2021-06-08 13:18:57 +00:00
Alex Zinenko	c59ce1f625	[mlir] support memref of memref in standard-to-llvm conversion Now that memref supports arbitrary element types, add support for memref of memref and make sure it is properly converted to the LLVM dialect. The type support itself avoids adding the interface to the memref type itself similarly to other built-in types. This allows the shape, and therefore byte size, of the memref descriptor to remain a lowering aspect that is easier to customize and evolve as opposed to sanctifying it in the data layout specification for the memref type itself. Factor out the code previously in a testing pass to live in a dedicated data layout analysis and use that analysis in the conversion to compute the allocation size for memref of memref. Other conversions will be ported separately. Depends On D103827 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D103828	2021-06-08 11:11:31 +02:00
Alex Zinenko	ada9aa5a22	[mlir] Make MemRef element type extensible Historically, MemRef only supported a restricted list of element types that were known to be storable in memory. This is unnecessarily restrictive given the open nature of MLIR's type system. Allow types to opt into being used as MemRef elements by implementing a type interface. For now, the interface is merely a declaration with no methods. Later, methods to query, e.g., the type size or whether a type can alias elements of another type may be added. Harden the "standard"-to-LLVM conversion against memrefs with non-builtin types. See https://llvm.discourse.group/t/rfc-memref-of-custom-types/3558. Depends On D103826 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D103827	2021-06-08 11:11:30 +02:00
Alex Zinenko	3c70a82e28	[mlir] fix integer type mismatch in alloc conversion to LLVM Some places in the alloc-like op conversion use the converted index type whereas other places use the pointer-sized integer type, which may not be the same. Consistently use the converted index type, similarly to other address calculations. Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D103826	2021-06-08 11:11:28 +02:00
Valentin Clement	fb5b590b5e	[mlir][openacc] Add conversion for if operand to scf.if for standalone data operation This patch convert the if condition on standalone data operation such as acc.update, acc.enter_data and acc.exit_data to a scf.if with the operation in the if region. It removes the operation when the if condition is constant and false. It removes the the condition if it is contant and true. Conversion to scf.if is done in order to use the translation to LLVM IR dialect out of the box. Not sure this is the best approach or we should perform this during the translation from OpenACC to LLVM IR dialect. Any thoughts welcome. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D103325	2021-06-07 12:10:03 -04:00
Valentin Clement	cfcdebaf32	[mlir][openacc] Conversion of data operands in acc.parallel to LLVM IR dialect Convert data operands from the acc.parallel operation using the same conversion pattern than D102170. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D103337	2021-06-07 11:22:20 -04:00
Rob Suderman	d86ef4364f	[mlir][tosa] Update tosa.rescale for i48 input type i48 integers require slightly tweaked behavior, specifically supporting zero point offsetting with slightly higher bitdepth. Updated results lowering appropriately. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D102659	2021-06-04 16:36:48 -07:00
Matthias Springer	700b64dc54	[mlir] Mark VectorToSCF patterns as recursive Differential Revision: https://reviews.llvm.org/D103599	2021-06-04 23:40:57 +09:00
Valentin Clement	fcb1547229	[mlir][openacc] Conversion of data operands in acc.data to LLVM IR dialect Convert data operands from the acc.data operation using the same conversion pattern than D102170. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D103332	2021-06-04 10:26:22 -04:00
Christian Sigg	fd3f2518a4	[mlir] Catch nonconvertible types in async conversion Reviewed By: ezhulenev, ftynse Differential Revision: https://reviews.llvm.org/D103592	2021-06-04 13:53:41 +02:00
MaheshRavishankar	cfa9ae9940	[mlir][SPIRV] Add lowering for math.log1p operation to SPIR-V dialect. Differential Revision: https://reviews.llvm.org/D103635	2021-06-03 16:27:19 -07:00
Alexander Belyaev	485c21be8a	[mlir] Split linalg reshape ops into expand/collapse. Differential Revision: https://reviews.llvm.org/D103548	2021-06-03 11:40:22 +02:00
Adrian Kuegel	942be7cb4d	[mlir] Add DivOp lowering from Complex dialect to Standard/Math dialect. Differential Revision: https://reviews.llvm.org/D103507	2021-06-02 11:16:00 +02:00
Matthias Springer	bd20756d2c	[mlir] Support tensor types in unrolled VectorToSCF Differential Revision: https://reviews.llvm.org/D102668	2021-06-02 10:44:04 +09:00
Matthias Springer	558e740170	[mlir] Support tensor types in non-unrolled VectorToSCF Support for tensor types in the unrolled version will follow in a separate commit. Add a new pass option to activate lowering of transfer ops with tensor types (default: deactivated). Differential Revision: https://reviews.llvm.org/D102666	2021-06-02 10:37:58 +09:00
Rob Suderman	422c7036d5	[mlir] Updated depthwise conv to support kernel dilation Depthwise convolution should support kernel dilation and non-dilation should not be a special case. Updated op definition to include a dilation attribute. This also adds a tosa.depthwise_conv2d lowering to linalg to support the new linalg behavior. Differential Revision: https://reviews.llvm.org/D103219	2021-06-01 13:25:19 -07:00
Tres Popp	1ebf7ce950	[mlir] Use interfaces in MathToLibm Previously, this assumed use of ModuleOp and FuncOp. There is no need to restrict this, and using interfaces allows these patterns to be used during dialect conversion to LLVM. Some assertions were removed due to inconsistent implementation of FunctionLikeOps. Differential Revision: https://reviews.llvm.org/D103447	2021-06-01 13:56:32 +02:00
Tres Popp	2290a80b4d	[mlir][NFC] Remove illegal TanhOp in LLVMConversionTarget No tests fail and this seems to be technical debt from when the math dialect was created. This should not be there as it prevents users from configuring their converion target freely and results in unexpected behavior on seemingly unrelated ops. Differential Revision: https://reviews.llvm.org/D103388	2021-05-31 10:40:09 +02:00
Butygin	bb542f2a76	[mlir] StandardToLLVM: option to disable AllocOp lowering Differential Revision: https://reviews.llvm.org/D103237	2021-05-30 17:50:25 +03:00
Eugene Zhulenev	d8c84d2a4e	[mlir] Async: Add error propagation support to async groups Depends On D103109 If any of the tokens/values added to the `!async.group` switches to the error state, than the group itself switches to the error state. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D103203	2021-05-27 09:35:11 -07:00
Eugene Zhulenev	39957aa424	[mlir] Add error state and error propagation to async runtime values Depends On D103102 Not yet implemented: 1. Error handling after synchronous await 2. Error handling for async groups Will be addressed in the followup PRs Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D103109	2021-05-27 09:28:47 -07:00
thomasraoux	b44007bec2	[mlir][gpu] Relax restriction on MMA store op to allow chain of mma ops. In order to allow large matmul operations using the MMA ops we need to chain operations this is not possible unless "DOp" and "COp" type have matching layout so remove the "DOp" layout and force accumulator and result type to match. Added a test for the case where the MMA value is accumulated. Differential Revision: https://reviews.llvm.org/D103023	2021-05-27 09:13:51 -07:00
harsh-nod	94d67b51dd	[mlir] Add n-D vector lowering to LLVM for cast ops The casting ops (sitofp, uitofp, fptosi, fptoui) lowering currently does not handle n-D vectors. This patch fixes that. Differential Revision: https://reviews.llvm.org/D103207	2021-05-26 15:26:49 -07:00
Rob Suderman	e5d227e95c	[NFC][MLIR][TOSA] Replaced tosa linalg.indexed_generic lowerings with linalg.index Indexed Generic should be going away in the future. Migrate to linalg.index. Reviewed By: NatashaKnk, nicolasvasilache Differential Revision: https://reviews.llvm.org/D103110	2021-05-25 15:34:28 -07:00
Matthias Springer	f718a53d7e	[mlir] Disallow certain transfer ops in VectorToSCF Disallow transfer ops that change the element type of the transfer. Such transfers could be supported in the future, if needed. Differential Revision: https://reviews.llvm.org/D102746	2021-05-25 21:39:43 +09:00
Matthias Springer	5017b0f88b	[mlir] Check only last dim stride in transfer op lowering Lower a 1D vector transfer op to LLVM if the last dim stride is 1. Also fixes a bug in the original unit stride computation. Differential Revision: https://reviews.llvm.org/D102897	2021-05-25 17:53:24 +09:00
Uday Bondhugula	9c21ddb70a	[MLIR] Make MLIR cmake variable names consistent Fix inconsistent MLIR CMake variable names. Consistently name them as MLIR_ENABLE_<feature>. Eg: MLIR_CUDA_RUNNER_ENABLED -> MLIR_ENABLE_CUDA_RUNNER MLIR follows (or has mostly followed) the convention of naming cmake enabling variables in the from MLIR_ENABLE_... etc. Using a convention here is easy and also important for convenience. A counter pattern was started with variables named MLIR_..._ENABLED. This led to a sequence of related counter patterns: MLIR_CUDA_RUNNER_ENABLED, MLIR_ROCM_RUNNER_ENABLED, etc.. From a naming standpoint, the imperative form is more meaningful. Additional discussion at: https://llvm.discourse.group/t/mlir-cmake-enable-variable-naming-convention/3520 Switch all inconsistent ones to the ENABLE form. Keep the couple of old mappings needed until buildbot config is migrated. Differential Revision: https://reviews.llvm.org/D102976	2021-05-24 08:43:10 +05:30
Butygin	9afbca746b	[mlir] ConvertStandardToLLVM: make AllocLikeOpLowering public It is useful for someone who wants to implement custom AllocOp LLVM lowering Differential Revision: https://reviews.llvm.org/D102932	2021-05-22 12:57:45 +03:00
Navdeep Kumar	eaaf7a6a09	[MLIR][GPU][NVVM] Add conversion of warp synchronous matrix-multiply accumulate GPU ops Add conversion of warp synchronous matrix-multiply accumulate GPU ops Add conversion of warp synchronous matrix-multiply accumulate GPU ops to NVVM ops. The following conversions are added :- 1.) subgroup_mma_load_matrix -> wmma.m16n16k16.load.[a,b,c]..row.stride 2.) subgroup_mma_store_matrix -> wmma.m16n16k16.store.d.[f16,f32].row.stride 3.) subgroup_mma_compute -> wmma.m16n16k16.mma.row.row.[f16,f32].[f16,f32] Reviewed By: bondhugula, ftynse Differential Revision: https://reviews.llvm.org/D95331	2021-05-21 21:20:33 +05:30
Matthias Springer	8fb4897934	[mlir] Disallow tensor types in VectorToSCF Support for tensor types can be added if needed. Differential Revision: https://reviews.llvm.org/D102749	2021-05-21 22:45:14 +09:00
Nicolas Vasilache	8eb18a0f3e	[mlir][Standard] NFC - Drop remaining EDSC usage Drop the remaining EDSC subdirectories and update all uses. Differential Revision: https://reviews.llvm.org/D102911	2021-05-21 10:40:39 +00:00
Adrian Kuegel	fb8b2b86d3	[mlir] Add conversion from Complex to Standard dialect for NotEqualOp. Differential Revision: https://reviews.llvm.org/D102902	2021-05-21 10:46:50 +02:00
Nicolas Vasilache	e84a9b9bb3	[mlir][Affine] NFC - Drop Affine EDSC usage Drop the Affine dialect EDSC subdirectory and update all uses. Differential Revision: https://reviews.llvm.org/D102878	2021-05-20 21:45:45 +00:00
Adrian Kuegel	ac00cb0d2a	[mlir] Add conversion from complex to standard dialect for EqualOp. This adds the straightforward conversion for EqualOp (two complex numbers are equal if both the real and the imaginary part are equal). Differential Revision: https://reviews.llvm.org/D102840	2021-05-20 14:25:56 +02:00
Stephen Neuendorffer	29a50c5864	[MLIR] Update Vector To LLVM conversion to be aware of assume_alignment vector.transfer_read and vector.transfer_write operations are converted to llvm intrinsics with specific alignment information, however there doesn't seem to be a way in llvm to take information from llvm.assume intrinsics and change this alignment information. In any event, due the to the structure of the llvm.assume instrinsic, applying this information at the llvm level is more cumbersome. Instead, let's generate the masked vector load and store instrinsic with the right alignment information from MLIR in the first place. Since we're bothering to do this, lets just emit the proper alignment for loads, stores, scatter, and gather ops too. Differential Revision: https://reviews.llvm.org/D100444	2021-05-19 10:50:48 -07:00
Nicolas Vasilache	6825bfe23e	[mlir][Vector] NFC - Drop vector EDSC usage Drop the vector dialect EDSC subdirectory and update all uses.	2021-05-19 12:44:38 +00:00
Matthias Springer	fb7ec1f187	[mlir] Use VectorTransferPermutationMapLoweringPatterns in VectorToSCF VectorTransferPermutationMapLoweringPatterns can be enabled via a pass option. These additional patterns lower permutation maps to minor identity maps with broadcasting, if possible, allowing for more efficient vector load/stores. The option is deactivated by default. Differential Revision: https://reviews.llvm.org/D102593	2021-05-19 14:46:19 +09:00
River Riddle	2257e4a70e	[mlir] Allow derived rewrite patterns to define a non-virtual `initialize` hook This is a hook that allows for providing custom initialization of the pattern, e.g. if it has bounded recursion, setting the debug name, etc., without needing to define a custom constructor. A non-virtual hook was chosen to avoid polluting the vtable with code that we really just want to be inlined when constructing the pattern. The alternative to this would be to just define a constructor for each pattern, this unfortunately creates a lot of otherwise unnecessary boiler plate for a lot of patterns and a hook provides a much simpler/cleaner interface for the very common case. Differential Revision: https://reviews.llvm.org/D102440	2021-05-18 14:40:32 -07:00
Rob Suderman	a91fb4328f	[mlir][tosa] Cleanup of tosa.rescale lowering to linalg Comment was poorly written. Changed to bail on contradictory information in the double round. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D102651	2021-05-17 17:31:20 -07:00
Rob Suderman	08068ddba7	[mlir][tosa] Fix tosa.avg_pool2d lowering to normalize correctly Initial version of pooling assumed normalization was accross all elements equally. TOSA actually requires the noramalization is perform by how many elements were summed (edges are not artifically dimmer). Updated the lowering to reflect this change with corresponding tests. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D102540	2021-05-17 10:00:43 -07:00
Julian Gross	1fbb484ea4	[WIP][mlir] Resolve memref dependency in canonicalize pass. Splitting the memref dialect lead to an introduction of several dependencies to avoid compilation issues. The canonicalize pass also depends on the memref dialect, but it shouldn't. This patch resolves the dependencies and the unintuitive includes are removed. However, the dependency moves to the constructor of the std dialect. Differential Revision: https://reviews.llvm.org/D102060	2021-05-17 11:33:38 +02:00
Matthias Springer	a088bed4e3	[mlir] VectorToSCF cleanup Group functions/structs in namespaces for better code readability. Depends On D102123 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D102124	2021-05-14 11:04:37 +09:00
Matthias Springer	2ca887de6e	[mlir] VectorToSCF target rank is a pass option Make "target rank" a pass option of VectorToSCF. Depends On D102101 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D102123	2021-05-14 10:30:43 +09:00
Valentin Clement	8fdfead71a	[mlir][openacc][NFC] add anonymous namespace around LegalizeDataOpForLLVMTranslation class Add missing anonymous namespace around LegalizeDataOpForLLVMTranslation class . Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D102380	2021-05-13 20:27:59 -04:00
Rob Suderman	f97d970a49	[mlir][tosa] Add lowering to tosa.abs for integer cases Integer case requires decomposing to simple LLVM operatons. Differential Revision: https://reviews.llvm.org/D101809	2021-05-13 13:55:17 -07:00
natashaknk	0831793ed9	[mlir][tosa] Add tosa.div integer lowering to linalg.generic. Lowering div elementwise op to the linalg dialect. Since tosa only supports integer division, that is the only version that is currently implemented. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D102430	2021-05-13 13:16:00 -07:00
Matthias Springer	0f24163870	[mlir] Replace vector-to-scf with progressive-vector-to-scf Depends On D102388 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D102101	2021-05-13 23:27:31 +09:00
Matthias Springer	d020dd2b21	[mlir] Migrate vector-to-loops.mlir to ProgressiveVectorToSCF Create a copy of vector-to-loops.mlir and adapt the test for ProgressiveVectorToSCF. Fix a small bug in getExtractOp() triggered by this test. Differential Revision: https://reviews.llvm.org/D102388	2021-05-13 22:48:20 +09:00
Matthias Springer	bf068e1077	[mlir] Do not use pass labels in unrolled ProgressiveVectorToSCF Do not rely on pass labels to detect if the pattern was already applied in the past (which allows for more some extra optimizations to avoid extra InsertOps and ExtractOps). Instead, check if these optimizations can be applied on-the-fly. This also fixes a bug, where vector.insert and vector.extract ops sometimes disappeared in the middle of the pass because they get folded away, but the next application of the pattern expected them to be there. Differential Revision: https://reviews.llvm.org/D102206	2021-05-13 22:01:08 +09:00
Rob Suderman	3f8aafd790	[mlir][tosa] Fix tosa.cast semantics to perform rounding/clipping Rounding to integers requires rounding (for floating points) and clipping to the min/max values of the destination range. Added this behavior and updated tests appropriately. Reviewed By: sjarus, silvas Differential Revision: https://reviews.llvm.org/D102375	2021-05-12 21:53:53 -07:00
Matthias Springer	2a51e9ff2e	[mlir] Support memref layout maps in vector transfer ops Differential Revision: https://reviews.llvm.org/D102042	2021-05-13 13:22:21 +09:00
Matthias Springer	9b77be5583	[mlir] Unrolled progressive-vector-to-scf. Instead of an SCF for loop, these pattern generate fully unrolled loops with no temporary buffer allocations. Differential Revision: https://reviews.llvm.org/D101981	2021-05-13 13:08:48 +09:00
Matthias Springer	864adf399e	[mlir] Allow empty position in vector.insert and vector.extract Such ops are no-ops and are folded to their respective `source`/`vector` operand. Differential Revision: https://reviews.llvm.org/D101879	2021-05-13 12:54:18 +09:00
Matthias Springer	c52cbe63e4	[mlir] Fix masked vector transfer ops with broadcasts Broadcast dimensions of a vector transfer op have no corresponding dimension in the mask vector. E.g., a 2-D TransferReadOp, where one dimension is a broadcast, can have a 1-D `mask` attribute. This commit also adds a few additional transfer op integration tests for various combinations of broadcasts, masking, dim transposes, etc. Differential Revision: https://reviews.llvm.org/D101745	2021-05-13 12:46:03 +09:00
Matthias Springer	6555e53ab0	Revert "[mlir] Fix masked vector transfer ops with broadcasts" This reverts commit `c9087788f7`. Accidentally pushed old version of the commit.	2021-05-13 11:55:00 +09:00
Matthias Springer	c9087788f7	[mlir] Fix masked vector transfer ops with broadcasts Broadcast dimensions of a vector transfer op have no corresponding dimension in the mask vector. E.g., a 2-D TransferReadOp, where one dimension is a broadcast, can have a 1-D `mask` attribute. This commit also adds a few additional transfer op integration tests for various combinations of broadcasts, masking, dim transposes, etc. Differential Revision: https://reviews.llvm.org/D101745	2021-05-13 11:37:36 +09:00
Suraj Sudhir	4b01435230	[mlir][tosa] Remove tosa.identityn operator Removes the identityn operator from TOSA MLIR definition. Removes TosaToLinAlg mappings Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D102329	2021-05-12 12:46:22 -07:00
Valentin Clement	6110b667b0	[mlir][openacc] Conversion of data operand to LLVM IR dialect Add a conversion pass to convert higher-level type before translation. This conversion extract meangingful information and pack it into a struct that the translation (D101504) will be able to understand. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D102170	2021-05-12 11:34:15 -04:00
Tobias Gysi	0fb364a97e	[mlir][linalg] Remove IndexedGenericOp support from LinalgToStandard... after introducing the IndexedGenericOp to GenericOp canonicalization (https://reviews.llvm.org/D101612). Differential Revision: https://reviews.llvm.org/D102236	2021-05-12 11:56:07 +00:00
Dumitru Potop	9a0ea5994b	[mlir] Support alignment in LLVM dialect GlobalOp First step in adding alignment as an attribute to MLIR global definitions. Alignment can be specified for global objects in LLVM IR. It can also be specified as a named attribute in the LLVMIR dialect of MLIR. However, this attribute has no standing and is discarded during translation from MLIR to LLVM IR. This patch does two things: First, it adds the attribute to the syntax of the llvm.mlir.global operation, and by doing this it also adds accessors and verifications. The syntax is "align=XX" (with XX being an integer), placed right after the value of the operation. Second, it allows transforming this operation to and from LLVM IR. It is checked whether the value is an integer power of 2. Reviewed By: ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D101492	2021-05-12 09:07:20 +02:00
Rob Suderman	764ad3b3fa	[mlir][tosa] Tosa elementwise broadcasting had some minor bugs Updated tests to include broadcast of left and right. Includes bypass if in-type and out-type match shape (no broadcasting). Differential Revision: https://reviews.llvm.org/D102276	2021-05-11 13:58:06 -07:00
River Riddle	53b946aa63	[mlir] Refactor the representation of function-like argument/result attributes. The current design uses a unique entry for each argument/result attribute, with the name of the entry being something like "arg0". This provides for a somewhat sparse design, but ends up being much more expensive (from a runtime perspective) in-practice. The design requires building a string every time we lookup the dictionary for a specific arg/result, and also requires N attribute lookups when collecting all of the arg/result attribute dictionaries. This revision restructures the design to instead have an ArrayAttr that contains all of the attribute dictionaries for arguments and another for results. This design reduces the number of attribute name lookups to 1, and allows for O(1) lookup for individual element dictionaries. The major downside is that we can end up with larger memory usage, as the ArrayAttr contains an entry for each element even if that element has no attributes. If the memory usage becomes too problematic, we can experiment with a more sparse structure that still provides a lot of the wins in this revision. This dropped the compilation time of a somewhat large TensorFlow model from ~650 seconds to ~400 seconds. Differential Revision: https://reviews.llvm.org/D102035	2021-05-07 19:32:31 -07:00
thomasraoux	565ee6afc7	[mlir][spirv] add support lowering of extract_slice to scalar type Differential Revision: https://reviews.llvm.org/D102041	2021-05-07 07:52:02 -07:00
Rob Suderman	7abb56c78b	[mlir][tosa] Add tosa.depthwise lowering to existing linalg.depthwise_conv Implements support for undialated depthwise convolution using the existing depthwise convolution operation. Once convolutions migrate to yaml defined versions we can rewrite for cleaner implementation. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D101579	2021-05-05 13:30:05 -07:00
Sergei Grechanik	d80b04ab00	[mlir][Affine][Vector] Support vectorizing reduction loops This patch adds support for vectorizing loops with 'iter_args' implementing known reductions along the vector dimension. Comparing to the non-vector-dimension case, two additional things are done during vectorization of such loops: - The resulting vector returned from the loop is reduced to a scalar using `vector.reduce`. - In some cases a mask is applied to the vector yielded at the end of the loop to prevent garbage values from being written to the accumulator. Vectorization of reduction loops is disabled by default. To enable it, a map from loops to array of reduction descriptors should be explicitly passed to `vectorizeAffineLoops`, or `vectorize-reductions=true` should be passed to the SuperVectorize pass. Current limitations: - Loops with a non-unit step size are not supported. - n-D vectorization with n > 1 is not supported. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D100694	2021-05-05 09:03:59 -07:00
Rob Suderman	1f7adf8cb1	[mlir][tosa] Fix tosa.concat by inserting linalg.fill after linalg.init All linalg.init operations must be fed into a linalg operation before subtensor. The inserted linalg.fill guarantees it executes correctly. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D101848	2021-05-04 14:26:28 -07:00
Adrian Kuegel	93537fabce	[mlir] Add lowering from math.expm1 to LLVM. Differential Revision: https://reviews.llvm.org/D96776	2021-05-04 14:22:10 +02:00
natashaknk	07ce5c99d7	[mlir][tosa] Add lowerings for tosa.equal and tosa.arithmetic_right_shift Lowerings equal and arithmetic_right_shift for elementwise ops to linalg dialect using linalg.generic Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D101804	2021-05-03 18:26:49 -07:00
thomasraoux	d51275cbc0	[mlir][spirv] Add support to convert std.splat op Differential Revision: https://reviews.llvm.org/D101511	2021-05-03 10:57:40 -07:00
Benjamin Kramer	96a7900eb0	[mlir] Fix multidimensional lowering from std.select to llvm.select The converter assumed that all operands have the same type, that's not true for select. Differential Revision: https://reviews.llvm.org/D101767	2021-05-03 19:30:49 +02:00
thomasraoux	be8e2801a4	[mlir][vector][NFC] split TransposeOp lowerning out of contractLowering Move TransposeOp lowering in its own populate function as in some cases it is better to keep it during ContractOp lowering to better canonicalize it rather than emiting scalar insert/extract. Differential Revision: https://reviews.llvm.org/D101647	2021-05-03 10:23:45 -07:00
Benjamin Kramer	cdeb4a8a64	[mlir] Allow lowering cmpi/cmpf with multidimensional vectors to LLVM Differential Revision: https://reviews.llvm.org/D101535	2021-05-03 11:30:21 +02:00
Rob Suderman	be01b091af	[mlir][tosa] Remove constant-0 dim expr values from TOSA lowerings Constant-0 dim expr values should be avoided for linalg as it can prevent fusion. This includes adding support for rank-0 reshapes. Differential Revision: https://reviews.llvm.org/D101418	2021-04-29 15:06:03 -07:00
Benjamin Kramer	b389c80963	[mlir] Fix lowering of multi-dimensional vector log1p to LLVM This was using the untransformed operand, leading to invalid IR. Differential Revision: https://reviews.llvm.org/D101531	2021-04-29 21:53:52 +02:00
Alex Zinenko	6841e6afba	[mlir] support max/min lower/upper bounds in affine.parallel This enables to express more complex parallel loops in the affine framework, for example, in cases of tiling by sizes not dividing loop trip counts perfectly or inner wavefront parallelism, among others. One can't use affine.max/min and supply values to the nested loop bounds since the results of such affine.max/min operations aren't valid symbols. Making them valid symbols isn't an option since they would introduce selection trees into memref subscript arithmetic as an unintended and undesired consequence. Also add support for converting such loops to SCF. Drop some API that isn't used in the core repo from AffineParallelOp since its semantics becomes ambiguous in presence of max/min bounds. Loop normalization is currently unavailable for such loops. Depends On D101171 Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D101172	2021-04-29 13:16:25 +02:00
Frederik Gossen	eb56fa97de	[MLIR][Shape] Fix `shape.broadcast` to standard lowering Differential Revision: https://reviews.llvm.org/D101456	2021-04-29 10:09:15 +02:00
Adrian Kuegel	2ea7fb7b1c	[MLIR] Add ComplexToStandard conversion pass. So far, only a conversion for complex::AbsOp is done, but more will be added. Differential Revision: https://reviews.llvm.org/D101442	2021-04-28 14:17:46 +02:00
Rob Suderman	cc1ae54ebc	[tosa][mlir] Fix FullyConnected to correctly order dimensions MatMul and FullyConnected have transposed dimensions for the weights. Also, removed uneeded tensor reshape for bias. Differential Revision: https://reviews.llvm.org/D101220	2021-04-27 17:26:04 -07:00
Rob Suderman	8f190b13ba	[mlir][tosa] Add tosa.negate lowerings for quantized cases Quantized negation can be performed using higher bits operations. Minimal bits are picked to perform the operation. Differential Revision: https://reviews.llvm.org/D101225	2021-04-27 17:16:39 -07:00
natashaknk	6f720d5eca	[mlir][tosa] Add tosa.gather lowering to linalg.indexed_generic Lowering gather operation to linalg dialect. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D101200	2021-04-23 22:42:56 -07:00
Rob Suderman	97c571abbc	[mlir][tosa] Add tosa.resize lowering to linalg generic Includes tests and implementation for both integer and floating point values. Both nearest neighbor and bilinear interpolation is included. Differential Revision: https://reviews.llvm.org/D101009	2021-04-23 12:43:02 -07:00
Matthias Springer	64f7fb5dfc	[mlir] Support masked N-D vector transfer ops in ProgressiveVectorToSCF. Mask vectors are handled similar to data vectors in N-D TransferWriteOp. They are copied into a temporary memory buffer, which can be indexed into with non-constant values. Differential Revision: https://reviews.llvm.org/D101136	2021-04-23 18:23:51 +09:00
Matthias Springer	545f98efc7	[mlir] Support masked 1D vector transfer ops in ProgressiveVectorToSCF Support for masked N-D vector transfer ops will be added in a subsequent commit. Differential Revision: https://reviews.llvm.org/D101132	2021-04-23 18:08:50 +09:00
Matthias Springer	a819e73393	[mlir] Support broadcast dimensions in ProgressiveVectorToSCF This commit adds support for broadcast dimensions in permutation maps of vector transfer ops. Also fixes a bug in VectorToSCF that generated incorrect in-bounds checks for broadcast dimensions. Differential Revision: https://reviews.llvm.org/D101019	2021-04-23 18:01:32 +09:00
Matthias Springer	f6a3e92e0a	[mlir] Use SCF for loops in ProgressiveVectorToSCF Use SCF for loops instead of Affine for loops. Differential Revision: https://reviews.llvm.org/D101013	2021-04-23 17:54:12 +09:00
Matthias Springer	ab154176bf	[mlir] Support dimension permutations in ProgressiveVectorToSCF This commit adds support for dimension permutations in permutation maps of vector transfer ops. Differential Revision: https://reviews.llvm.org/D101007	2021-04-23 17:46:35 +09:00
Matthias Springer	afaf36b69e	[mlir] Handle strided 1D vector transfer ops in ProgressiveVectorToSCF Strided 1D vector transfer ops are 1D transfers operating on a memref dimension different from the last one. Such transfer ops do not accesses contiguous memory blocks (vectors), but access memory in a strided fashion. In the absence of a mask, strided 1D vector transfer ops can also be lowered using matrix.column.major.* LLVM instructions (in a later commit). Subsequent commits will extend the pass to handle the remaining missing permutation maps (broadcasts, transposes, etc.). Differential Revision: https://reviews.llvm.org/D100946	2021-04-23 17:19:22 +09:00
Rob Suderman	648dfdfc24	[mlir][tosa] Add tosa.avg_pool2d lowering Added the float lowerings for avg pool with corresponding tests. Differential Revision: https://reviews.llvm.org/D100793	2021-04-21 19:07:52 -07:00
Nico Weber	56f987fafe	[mlir] yet more iwyu fixes after `ba7a92c01e`	2021-04-21 10:54:44 -04:00
thomasraoux	b2e72cd38d	[mlir][spirv] Support conversion of extract op from vector<1xT> type Differential Revision: https://reviews.llvm.org/D100814	2021-04-20 09:11:41 -07:00
Hanhan Wang	7b7df8e85e	[mlir][StandardToSPIRV] Add support for lowering std.xor on bool to SPIR-V std.xor ops on bool are lowered to spv.LogicalNotEqual. For Boolean values, xor and not-equal are the same thing. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D100817	2021-04-20 07:35:20 -07:00
Matthias Springer	7cc8106f67	[mlir] Progressively lower vector to SCF Add a new ProgressiveVectorToSCF pass that lowers vector transfer ops to SCF by gradually unpacking one dimension at time. Unpacking stops at 1D, but can be configured to stop earlier, should the HW support (N>1)-d vectors. The current implementation cannot handle permutation maps, masks, tensor types and unrolling yet. These will be added in subsequent commits. Once features are on par with VectorToSCF, this implementation will replace VectorToSCF. Differential Revision: https://reviews.llvm.org/D100622	2021-04-20 18:49:36 +09:00
Tres Popp	34810e1b9c	[mlir] Add patterns to lower Math operations to LLVM based libm calls. Some Math operations do not have an equivalent in LLVM. In these cases, allow a low priority fallback of calling the libm functions. This is to give functionality and is not a performant option. Differential Revision: https://reviews.llvm.org/D100367	2021-04-20 11:38:55 +02:00
Tobias Gysi	27ad213680	[mlir][linalg] enable library call rewrites for linalg operations with index semantics. The patch enables the library call lowering for linalg operations that contain index operations. Differential Revision: https://reviews.llvm.org/D100537	2021-04-19 12:50:59 +00:00
Javier Setoain	b739bada9d	[mlir][ArmSVE] Cleanup dialect registration ArmSVE dialect is behind the recent changes in how the Vector dialect interacts with backend vector dialects and the MLIR -> LLVM IR translation module. This patch cleans up ArmSVE initialization within Vector and removes the need for an LLVMArmSVE dialect. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D100171	2021-04-16 15:56:51 +02:00
River Riddle	4efb7754e0	[mlir][NFC] Add a using directive for llvm::SetVector Differential Revision: https://reviews.llvm.org/D100436	2021-04-15 16:09:34 -07:00
Hanhan Wang	d9b03ef2e8	[mlir][StandardToSPIRV] Add support for lowering math.powf to SPIR-V. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D100403	2021-04-13 22:36:47 -07:00
Rob Suderman	7e1fb9a0d2	[mlir][tosa] Add conv2d lowering to linalg.conv2d operator for FP Handles lowering conv2d to linalg's convolution operator. This implementation only supports floating point values but handles all strides, dilations, and padding values. Differential Revision: https://reviews.llvm.org/D100061	2021-04-13 13:26:02 -07:00
Lei Zhang	2eb98d89ac	[mlir][spirv] Allow bitwidth emulation on runtime arrays Runtime arrays are converted from memrefs with unknown dimensions. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D100335	2021-04-12 17:04:18 -04:00
Lei Zhang	0deeaaca39	[mlir] Move memref.subview patterns to MemRef/Transforms/ These patterns have been used as a prerequisite step for lowering to SPIR-V. But they don't involve SPIR-V dialect ops; they are pure memref/vector op transformations. Given now we have a dedicated MemRef dialect, moving them to Memref/Transforms/, which is a more suitable place to host them, to allow used by others. This commit just moves code around and renames patterns/passes accordingly. CMakeLists.txt for existing MemRef libraries are also improved along the way. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D100326	2021-04-12 16:38:22 -04:00
Emilio Cota	8508a63b88	[mlir] Rename AVX512 dialect to X86Vector We will soon be adding non-AVX512 operations to MLIR, such as AVX's rsqrt. In https://reviews.llvm.org/D99818 several possibilities were discussed, namely to (1) add non-AVX512 ops to the AVX512 dialect, (2) add more dialects (e.g. AVX dialect for AVX rsqrt), and (3) expand the scope of the AVX512 to include these SIMD x86 ops, thereby renaming the dialect to something more accurate such as X86Vector. Consensus was reached on option (3), which this patch implements. Reviewed By: aartbik, ftynse, nicolasvasilache Differential Revision: https://reviews.llvm.org/D100119	2021-04-12 19:20:04 +02:00
Tobias Gysi	93f9922d65	[mlir][linalg] adding operation to access the iteration index of enclosing linalg ops. The `linalg.index` operation provides access to the iteration indexes of immediately enclosing linalg operations. It takes a dimension `dim` attribute and returns the iteration index in the given dimension. Having `linalg.index` allows us to unify `linalg.generic` and `linalg.indexed_generic` and also enables index access in named operations. Differential Revision: https://reviews.llvm.org/D100292	2021-04-12 13:37:17 +00:00
Rob Suderman	ceeb5b0f87	[mlir][tosa] Add tosa.max_pool2d lowering to linalg int max pooling additions Lowerings tosa.max_pool2d to linalg equivalent operations. Includes adding max pooling operations for linalg, with corresponding tests. Differential Revision: https://reviews.llvm.org/D99824	2021-04-08 18:17:16 -07:00
Hanhan Wang	c361435845	[mlir][StandardToSPIRV] Handle i1 case for lowering memref.load/store op This patch unconditionally converts i1 types to i8 types on memrefs. If the extensions or capabilities are not met, they will be converted to i32. Hence the logic in IntLoadPattern and IntStorePattern are also updated. Also added the implementation of SPIRVTypeConverter::getOptions(). Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D99724	2021-04-08 12:15:25 -07:00
Lei Zhang	5299843c31	[mlir][spirv] Add control for non-32-bit scalar type emulation Non-32-bit scalar types require special hardware support that may not exist on all GPUs. This is reflected in SPIR-V as that non-32-bit scalar types require special capabilities or extensions. Previously when there is a non-32-bit type and no native support, we unconditionally emulate it with 32-bit ones. This isn't good given that it can have implications over ABI and data layout consistency. This commit introduces an option to control whether to use 32-bit types to emulate. Differential Revision: https://reviews.llvm.org/D100059	2021-04-08 08:19:47 -04:00
Tobias Gysi	b614ada0e8	[mlir] add support for index type in vectors. The patch enables the use of index type in vectors. It is a prerequisite to support vectorization for indexed Linalg operations. This refactoring became possible due to the newly introduced data layout infrastructure. The data layout of a module defines the bitwidth of the index type needed to verify bitcasts and similar vector operations. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D99948	2021-04-08 08:17:13 +00:00
Matthias Springer	65a3f28939	[mlir] Add "mask" operand to vector.transfer_read/write. Also factors out out-of-bounds mask generation from vector.transfer_read/write into a new MaterializeTransferMask pattern. Differential Revision: https://reviews.llvm.org/D100001	2021-04-07 21:33:13 +09:00
Rob Suderman	0312b25df0	[mlir][tosa] Add tosa.table lowering to linalg.generic Table op lowering to linalg.generic for both i8 (behaves like a gather) and a pair of gathers with a quantized interpolation. Differential Revision: https://reviews.llvm.org/D99756	2021-04-06 13:57:18 -07:00
Alex Zinenko	7dc7790ec5	[mlir] Fix support for lowering non-32-bit affine reductions. The existing implementation was always creating 32-bit constants for floating-point and integer reductions regardless of the actual type, which resulted in invalid IR being generated for any types other than f32 and i32 when lowering affine.parallel to SCF. Use the actual type instead. Reviewed By: chelini Differential Revision: https://reviews.llvm.org/D99942	2021-04-06 14:00:15 +02:00
Rob Suderman	eb1b55c652	[mlir][tosa] Add tosa.reduce_any and tosa.reduce_all linalg lowering Added lowerings for Tosa's reduce boolean operations. This includes a fix to maintain the output rank of reduce operations. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D99228	2021-04-02 14:32:18 -07:00
Lei Zhang	6dd07fa513	[mlir][spirv] Add utilities for push constant value This commit add utility functions for creating push constant storage variable and loading values from it. Along the way, performs some clean up: * Deleted `setABIAttrs`, which is just a 4-liner function with one user. * Moved `SPIRVConverstionTarget` into `mlir` namespace, to be consistent with `SPIRVTypeConverter` and `LLVMConversionTarget`. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D99725	2021-04-02 07:51:07 -04:00
natashaknk	a879a1b034	[mlir][tosa] Add tosa.reciprocal and tosa.sigmoid lowerings Lowering reciprocal and sigmoid elementwise operations to linalg dialect. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D99676	2021-03-31 14:21:03 -07:00
Matthias Springer	95f8135043	[mlir] Change vector.transfer_read/write "masked" attribute to "in_bounds". This is in preparation for adding a new "mask" operand. The existing "masked" attribute was used to specify dimensions that may be out-of-bounds. Such transfers can be lowered to masked load/stores. The new "in_bounds" attribute is used to specify dimensions that are guaranteed to be within bounds. (Semantics is inverted.) Differential Revision: https://reviews.llvm.org/D99639	2021-03-31 18:04:22 +09:00
Mehdi Amini	973ddb7d6e	Define a `NoTerminator` traits that allows operations with a single block region to not provide a terminator In particular for Graph Regions, the terminator needs is just a historical artifact of the generalization of MLIR from CFG region. Operations like Module don't need a terminator, and before Module migrated to be an operation with region there wasn't any needed. To validate the feature, the ModuleOp is migrated to use this trait and the ModuleTerminator operation is deleted. This patch is likely to break clients, if you're in this case: - you may iterate on a ModuleOp with `getBody()->without_terminator()`, the solution is simple: just remove the ->without_terminator! - you created a builder with `Builder::atBlockTerminator(module_body)`, just use `Builder::atBlockEnd(module_body)` instead. - you were handling ModuleTerminator: it isn't needed anymore. - for generic code, a `Block::mayNotHaveTerminator()` may be used. Differential Revision: https://reviews.llvm.org/D98468	2021-03-25 03:59:03 +00:00
Rob Suderman	f5ba3eea67	[mlir][tosa] Add tosa.bitwise_not lowering to constant and xor Lowering of bitwise_not to linalg dialect using a xor operation with a constant of all-bits-one. Differential Revision: https://reviews.llvm.org/D99221	2021-03-24 17:27:27 -07:00
Alex Zinenko	b3386a734e	[mlir] introduce data layout entry for index type Index type is an integer type of target-specific bitwidth present in many MLIR operations (loops, memory accesses). Converting values of this type to fixed-size integers has always been problematic. Introduce a data layout entry to specify the bitwidth of `index` in a given layout scope, defaulting to 64 bits, which is a commonly used assumption, e.g., in constants. Port builtin-to-LLVM type conversion to use this data layout entry when converting `index` type and untie it from pointer size. This is particularly relevant for GPU targets. Keep a possibility to forcibly override the index type in lowerings. Depends On D98525 Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D98937	2021-03-24 15:13:42 +01:00
Vladislav Vinogradov	18a2f479bf	[mlir][NFC] Replace `getMemorySpaceAsInt` with `getMemorySpace` where possible Use new `MemRefType::getMemorySpace` method with generic Attribute in cases, where there is no specific logic around the memory space. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D99154	2021-03-24 13:23:59 +03:00
Rob Suderman	28e6420744	[mlir][tosa] Add tosa.argmax to linalg lowering Tosa's argmax lowering is representable as a linalg.indexed_generic operation. Include the lowering to this type for both integer and floating point types. Differential Revision: https://reviews.llvm.org/D99137	2021-03-23 16:06:55 -07:00
Rob Suderman	4157a079af	[mlir][tosa] Add tosa.pad to linalg.pad operation Lowers from tosa's pad op to the linalg equivalent for floating, integer, and quantized values. Differential Revision: https://reviews.llvm.org/D98990	2021-03-23 14:15:48 -07:00
River Riddle	76f3c2f3f3	[mlir][Pattern] Add better support for using interfaces/traits to match root operations in rewrite patterns To match an interface or trait, users currently have to use the `MatchAny` tag. This tag can be quite problematic for compile time for things like the canonicalizer, as the `MatchAny` patterns may get applied to every operation. This revision adds better support by bucketing interface/trait patterns based on which registered operations have them registered. This means that moving forward we will only attempt to match these patterns to operations that have this interface registered. Two simplify defining patterns that match traits and interfaces, two new utility classes have been added: OpTraitRewritePattern and OpInterfaceRewritePattern. Differential Revision: https://reviews.llvm.org/D98986	2021-03-23 14:05:33 -07:00
Rob Suderman	2d72b675d5	[mlir][tosa] Add tosa.tile to linalg.generic lowering Tiling operations are generic operations with modified indexing. Updated to to linalg lowerings to perform this lowering. Differential Revision: https://reviews.llvm.org/D99113	2021-03-23 13:13:54 -07:00
natashaknk	e20911b5c0	[mlir][tosa] Add tosa.matmul and tosa.fully_connected lowering Adds lowerings for matmul and fully_connected. Only supports 2D tensors for inputs and weights, and 1D tensors for bias. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D99211	2021-03-23 13:09:53 -07:00
Chris Lattner	79d7f618af	Rename FrozenRewritePatternList -> FrozenRewritePatternSet; NFC. This nicely aligns the naming with RewritePatternSet. This type isn't as widely used, but we keep a using declaration in to help with downstream consumption of this change. Differential Revision: https://reviews.llvm.org/D99131	2021-03-22 17:40:45 -07:00
Chris Lattner	dc4e913be9	[PatternMatch] Big mechanical rename OwningRewritePatternList -> RewritePatternSet and insert -> add. NFC This doesn't change APIs, this just cleans up the many in-tree uses of these names to use the new preferred names. We'll keep the old names around for a couple weeks to help transitions. Differential Revision: https://reviews.llvm.org/D99127	2021-03-22 17:20:50 -07:00
Rob Suderman	d7c44a5c78	[mlir][tosa] Fix tosa.mul to use tosa.apply_scale Multiply-shift requires wider compute types or CPU specific code to avoid premature truncation, apply_shift fixes this issue Also, Tosa's mul op supports different input / output types. Added path that sign-extends input values to int-32 values before multiplying. Differential Revision: https://reviews.llvm.org/D99011	2021-03-22 11:01:35 -07:00
Chris Lattner	1d909c9a35	Remove the extraneous MLIRContext argument from populateWithGenerated. NFC.	2021-03-21 10:38:35 -07:00
Chris Lattner	3a506b31a3	Change OwningRewritePatternList to carry an MLIRContext with it. This updates the codebase to pass the context when creating an instance of OwningRewritePatternList, and starts removing extraneous MLIRContext parameters. There are many many more to be removed. Differential Revision: https://reviews.llvm.org/D99028	2021-03-21 10:06:31 -07:00
Rob Suderman	e990fa2170	[mlir][tosa] Add tosa.reverse lowering to linalg.generic Reverse lowers to a linalg.generic op by reversing the read order in the index map. Differential Revision: https://reviews.llvm.org/D98997	2021-03-19 21:46:47 -07:00
Rob Suderman	47286fc530	[mlir][tosa] Add tosa.cast to linalg lowering Handles lowering from the tosa CastOp to the equivalent linalg lowering. It includes support for interchange between bool, int, and floating point. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D98828	2021-03-19 11:48:37 -07:00
Rob Suderman	1b7498120d	[mlir][tosa] Add tosa.logical_* to linalg lowerings Adds lowerings for logical_* boolean operations. Each of these ops only operate on booleans allowing simple lowerings. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D98910	2021-03-19 11:30:42 -07:00
Christian Sigg	a5f9cda173	[mlir] Rename gpu-to-llvm pass implementation file Also remove populate patterns function and binary annotation name option. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98930	2021-03-19 13:58:13 +01:00
Christian Sigg	74ffe8dc59	[mlir] Remove ConvertKernelFuncToBlob All users have been converted to gpu::SerializeToBlobPass. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98928	2021-03-19 09:33:47 +01:00
Rob Suderman	286a9d467e	[mlir][tosa] Add lowering for tosa.rescale to linalg.generic This adds a tosa.apply_scale operation that handles the scaling operation common to quantized operatons. This scalar operation is lowered in TosaToStandard. We use a separate ApplyScale factorization as this is a replicable pattern within TOSA. ApplyScale can be reused within pool/convolution/mul/matmul for their quantized variants. Tests are added to both tosa-to-standard and tosa-to-linalg-on-tensors that verify each pass is correct. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D98753	2021-03-18 16:14:05 -07:00
Rob Suderman	5627564fe0	[mlir][tosa] Add tosa.concat to subtensor inserts lowering Includes lowering for tosa.concat with indice computation with subtensor insert operations. Includes tests along two different indices. Differential Revision: https://reviews.llvm.org/D98813	2021-03-18 15:59:07 -07:00
thomasraoux	44f24f3996	[mlir] Fix build failure due to `1a572f4`	2021-03-18 14:58:32 -07:00
thomasraoux	1a572f4509	[mlir] Add vector op support to cuda-runner including vector.print Differential Revision: https://reviews.llvm.org/D97346	2021-03-18 13:03:08 -07:00
Rob Suderman	f4bb076a44	[mlir][tosa] Add tosa.slice to std.subtensor lowering Lowering to subtensor is added for tosa.slice operator. Differential Revision: https://reviews.llvm.org/D98825	2021-03-17 17:28:18 -07:00
Vladislav Vinogradov	fee9054232	[mlir][ODS] Support specialized Attribute class for Enums Add a feature to `EnumAttr` definition to generate specialized Attribute class for the particular enumeration. This class will inherit `StringAttr` or `IntegerAttr` and will override `classof` and `getValue` methods. With this class the enumeration predicate can be checked with simple RTTI calls (`isa`, `dyn_cast`) and it will return the typed enumeration directly instead of raw string/integer. Based on the following discussion: https://llvm.discourse.group/t/rfc-add-enum-attribute-decorator-class/2252 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97836	2021-03-17 16:44:24 +03:00
Stephan Herhut	5837fdc4cc	[mlir][llvm] Pass struct results as parameter in c wrapper Returning structs directly in LLVM does not necessarily align with the C ABI of the platform. This might happen to work on Linux but for small structs this breaks on Windows. With this change, the wrappers work platform independently. Differential Revision: https://reviews.llvm.org/D98725	2021-03-17 12:58:52 +01:00
Gaurav Shukla	8e3075c2b0	[MLIR] Fix lowering of Affine IfOp in the presence of yield values. This commit fixes the lowering of `Affine.IfOp` to `SCF.IfOp` in the presence of yield values. These changes have been made as a part of `-lower-affine` pass. Differential Revision: https://reviews.llvm.org/D98760	2021-03-17 16:33:32 +05:30
River Riddle	1f13963ec1	[mlir][pdl] Cast the OperationPosition to Position to fix MSVC miscompile If we don't cast, MSVC picks an overload that hasn't been defined yet(not sure why) and miscompiles.	2021-03-16 16:11:14 -07:00
Eugene Zhulenev	74f6138bd9	[mlir] Add lowering from math::Log1p to LLVM [mlir] Add lowering from math::Log1p to LLVM Reviewed By: cota Differential Revision: https://reviews.llvm.org/D98662	2021-03-16 15:59:09 -07:00
River Riddle	3a833a0e0e	[mlir][PDL] Add support for variadic operands and results in the PDL Interpreter This revision extends the PDL Interpreter dialect to add support for variadic operands and results, with ranges of these values represented via the recently added !pdl.range type. To support this extension, three new operations have been added that closely match the single variant: * pdl_interp.check_types : Compare a range of types with a known range. * pdl_interp.create_types : Create a constant range of types. * pdl_interp.get_operands : Get a range of operands from an operation. * pdl_interp.get_results : Get a range of results from an operation. * pdl_interp.switch_types : Switch on a range of types. This revision handles adding support in the interpreter dialect and the conversion from PDL to PDLInterp. Support for variadic operands and results in the bytecode will be added in a followup revision. Differential Revision: https://reviews.llvm.org/D95722	2021-03-16 13:20:19 -07:00
River Riddle	02c4c0d5b2	[mlir][pdl] Remove CreateNativeOp in favor of a more general ApplyNativeRewriteOp. This has a numerous amount of benefits, given the overly clunky nature of CreateNativeOp: * Users can now call into arbitrary rewrite functions from inside of PDL, allowing for more natural interleaving of PDL/C++ and enabling for more of the pattern to be in PDL. * Removes the need for an additional set of C++ functions/registry/etc. The new ApplyNativeRewriteOp will use the same PDLRewriteFunction as the existing RewriteOp. This reduces the API surface area exposed to users. This revision also introduces a new PDLResultList class. This class is used to provide results of native rewrite functions back to PDL. We introduce a new class instead of using a SmallVector to simplify the work necessary for variadics, given that ranges will require some changes to the structure of PDLValue. Differential Revision: https://reviews.llvm.org/D95720	2021-03-16 13:20:18 -07:00
River Riddle	242762c9a3	[mlir][pdl] Restructure how results are represented. Up until now, results have been represented as additional results to a pdl.operation. This is fairly clunky, as it mismatches the representation of the rest of the IR constructs(e.g. pdl.operand) and also isn't a viable representation for operations returned by pdl.create_native. This representation also creates much more difficult problems when factoring in support for variadic result groups, optional results, etc. To resolve some of these problems, and simplify adding support for variable length results, this revision extracts the representation for results out of pdl.operation in the form of a new `pdl.result` operation. This operation returns the result of an operation at a given index, e.g.: ``` %root = pdl.operation ... %result = pdl.result 0 of %root ``` Differential Revision: https://reviews.llvm.org/D95719	2021-03-16 13:20:18 -07:00
Aart Bik	6ad7b97e20	[mlir][amx] Add Intel AMX dialect (architectural-specific vector dialect) The Intel Advanced Matrix Extensions (AMX) provides a tile matrix multiply unit (TMUL), a tile control register (TILECFG), and eight tile registers TMM0 through TMM7 (TILEDATA). This new MLIR dialect provides a bridge between MLIR concepts like vectors and memrefs and the lower level LLVM IR details of AMX. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D98470	2021-03-15 17:59:05 -07:00
Julian Gross	e2310704d8	[MLIR] Create memref dialect and move dialect-specific ops from std. Create the memref dialect and move dialect-specific ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp AssumeAlignmentOp -> MemRef_AssumeAlignmentOp DeallocOp -> MemRef_DeallocOp DimOp -> MemRef_DimOp MemRefCastOp -> MemRef_CastOp MemRefReinterpretCastOp -> MemRef_ReinterpretCastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp LoadOp -> MemRef_LoadOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp SubViewOp -> MemRef_SubViewOp TransposeOp -> MemRef_TransposeOp TensorLoadOp -> MemRef_TensorLoadOp TensorStoreOp -> MemRef_TensorStoreOp TensorToMemRefOp -> MemRef_BufferCastOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D98041	2021-03-15 11:14:09 +01:00
Christian Sigg	2224221fb3	[mlir] Add NVVM to CUBIN conversion to mlir-opt If MLIR_CUDA_RUNNER_ENABLED, register a 'gpu-to-cubin' conversion pass to mlir-opt. The next step is to switch CUDA integration tests from mlir-cuda-runner to mlir-opt + mlir-cpu-runner and remove mlir-cuda-runner. Depends On D98279 Reviewed By: herhut, rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D98203	2021-03-11 10:07:11 +01:00
Alex Zinenko	a776942ba1	[mlir] squash LLVM_AVX512 dialect into AVX512 The dialect separation was introduced to demarkate ops operating in different type systems. This is no longer the case after the LLVM dialect has migrated to using built-in vector types, so the original reason for separation is no longer valid. Squash the two dialects into one. The code size decrease isn't quite large: the ops originally in LLVM_AVX512 are preserved because they match LLVM IR intrinsics specialized for vector element bitwidth. However, it is still conceptually beneficial to have only one dialect. I originally considered to use Tablegen multiclasses to define both the type-polymorphic op and its two intrinsic-related instantiations, but decided against it given both the complexity of the required Tablegen input and its dissimilarity with the rest of ODS-defined ops, both potentially resulting in very poor maintainability. Depends On D98327 Reviewed By: nicolasvasilache, springerm Differential Revision: https://reviews.llvm.org/D98328	2021-03-10 13:07:26 +01:00
Christian Sigg	4d295cf5b5	[mlir] Add base class for GpuKernelToBlobPass Instead of configuring kernel-to-cubin/rocdl lowering through callbacks, introduce a base class that target-specific passes can derive from. Put the base class in GPU/Transforms, according to the discussion in D98203. The mlir-cuda-runner will go away shortly, and the mlir-rocdl-runner as well at some point. I therefore kept the existing code path working and will remove it in a separate step. Depends On D98168 Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D98279	2021-03-10 12:14:43 +01:00
Christian Sigg	840ff84d33	[mlir] Default for gpu-binary-annotation option. Provide default for gpuBinaryAnnotation so that we don't need to specify it in tests. The annotation likely only needs to be target specific if we want to lower to e.g. both CUDA and ROCDL. Reviewed By: herhut, bondhugula Differential Revision: https://reviews.llvm.org/D98168	2021-03-09 21:01:50 +01:00
Mehdi Amini	038f2a337d	Move LLVM::FMFAttr definition to TableGen (NFC) This is using the new Attribute storage generation support in TableGen to define the LLVM FastMathFlags. Differential Revision: https://reviews.llvm.org/D98007	2021-03-09 05:29:54 +00:00
Rob Suderman	cb3542e1ca	[MLIR][TOSA] Added lowerings for Reduce operations to Linalg Lowerings for min, max, prod, and sum reduction operations on int and float values. This includes reduction tests for both cases. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D97893	2021-03-08 10:57:19 -08:00
Benjamin Kramer	42c195f0ec	[mlir][Shape] Allow shape.split_at to return extent tensors and lower it to std.subtensor split_at can return an error if the split index is out of bounds. If the user knows that the index can never be out of bounds it's safe to use extent tensors. This has a straight-forward lowering to std.subtensor. Differential Revision: https://reviews.llvm.org/D98177	2021-03-08 16:48:05 +01:00
KareemErgawy-TomTom	3fb384d50e	[MLIR][SPIRV] Rename `spv.selection` to `spv.mlir.selection`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from spv.camelCase to spv.CamelCase everywhere. For ops that don't have a SPIR-V spec counterpart, we use spv.mlir.snake_case. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D98014	2021-03-06 16:05:31 +01:00
Lei Zhang	bb6f5c8314	[mlir][spirv] Convert tensor.extract for very small tensors Normally tensors will be stored in buffers before converting to SPIR-V, given that is how a large amount of data is sent to the GPU. However, SPIR-V supports converting from tensors directly too. This is for the cases where the tensor just contains a small amount of elements and it makes sense to directly inline them as a small data array in the shader. To handle this, internally the conversion might create new local variables. SPIR-V consumers in GPU drivers may or may not optimize that away. So this has implications over register pressure. Therefore, a threshold is used to control when the patterns should kick in. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D98052	2021-03-06 08:03:36 -05:00
Matthias Springer	acce0ea70c	[mlir][AVX512] Add mask.compress to AVX512 dialect. Adds mask.compress to the AVX512 dialect and defines a lowering to the LLVM dialect. Differential Revision: https://reviews.llvm.org/D97611	2021-03-06 10:02:48 +09:00
Alex Zinenko	6410ee0d09	[mlir] Squash LLVM_ArmNeon dialect into ArmNeon The two dialects are largely redundant. The former was introduced as a mirror of the latter operating on LLVM dialect types. This is no longer necessary since the LLVM dialect operates on built-in types. Combine the two dialects. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D98060	2021-03-05 23:33:32 +01:00
KareemErgawy-TomTom	29812a6195	[MLIR][SPIRV] Rename `spv.loop` to `spv.mlir.loop`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from spv.camelCase to spv.CamelCase everywhere. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97918	2021-03-05 15:44:30 -05:00
KareemErgawy-TomTom	c74eb466d2	[MLIR][SPIRV] Rename `spv.globalVariable` to `spv.GlobalVariable`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from spv.camelCase to spv.CamelCase everywhere. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97919	2021-03-04 16:24:59 -05:00
KareemErgawy-TomTom	5abdca47b3	[MLIR][SPIRV] Rename `spv.constant` to `spv.Constant`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from `spv.camelCase` to `spv.CamelCase` everywhere. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97917	2021-03-04 16:15:56 -05:00
Alex Zinenko	19db802e7b	[mlir] make implementations of translation to LLVM IR interfaces private There is no need for the interface implementations to be exposed, opaque registration functions are sufficient for all users, similarly to passes. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97852	2021-03-04 09:16:32 +01:00
River Riddle	e07c968a6d	[mlir][pdl][NFC] Rename InputOp to OperandOp This better matches the actual IR concept that is being modeled, and is consistent with how the rest of PDL is structured. Differential Revision: https://reviews.llvm.org/D95718	2021-03-03 15:48:00 -08:00
River Riddle	3dfa86149e	[mlir][IR] Refactor the internal implementation of Value The current implementation of Value involves a pointer int pair with several different kinds of owners, i.e. BlockArgumentImpl, Operation , TrailingOpResult. This design arose from the desire to save memory overhead for operations that have a very small number of results (generally 0-2). There are, unfortunately, many problematic aspects of the current implementation that make Values difficult to work with or just inefficient. Operation result types are stored as a separate array on the Operation. This is very inefficient for many reasons: we use TupleType for multiple results, which can lead to huge amounts of memory usage if multi-result operations change types frequently(they do). It also means that simple methods like Value::getType/Value::setType now require complex logic to get to the desired type. Value only has one pointer bit free, severely limiting the ability to use it in things like PointerUnion/PointerIntPair. Given that we store the kind of a Value along with the "owner" pointer, we only leave one bit free for users of Value. This creates situations where we end up nesting PointerUnions to be able to use Value in one. As noted above, most of the methods in Value need to branch on at least 3 different cases which is both inefficient, possibly error prone, and verbose. The current storage of results also creates problems for utilities like ValueRange/TypeRange, which want to efficiently store base pointers to ranges (of which Operation isn't really useful as one). This revision greatly simplifies the implementation of Value by the introduction of a new ValueImpl class. This class contains all of the state shared between all of the various derived value classes; i.e. the use list, the type, and the kind. This shared implementation class provides several large benefits: * Most of the methods on value are now branchless, and often one-liners. * The "kind" of the value is now stored in ValueImpl instead of Value This frees up all of Value's pointer bits, allowing for users to take full advantage of PointerUnion/PointerIntPair/etc. It also allows for storing more operation results as "inline", 6 now instead of 2, freeing up 1 word per new inline result. * Operation result types are now stored in the result, instead of a side array This drops the size of zero-result operations by 1 word. It also removes the memory crushing use of TupleType for operations results (which could lead up to hundreds of megabytes of "dead" TupleTypes in the context). This also allowed restructured ValueRange, making it simpler and one word smaller. This revision does come with two conceptual downsides: * Operation::getResultTypes no longer returns an ArrayRef<Type> This conceptually makes some usages slower, as the iterator increment is slightly more complex. * OpResult::getOwner is slightly more expensive, as it now requires a little bit of arithmetic From profiling, neither of the conceptual downsides have resulted in any perceivable hit to performance. Given the advantages of the new design, most compiles are slightly faster. Differential Revision: https://reviews.llvm.org/D97804	2021-03-03 14:33:37 -08:00
Benjamin Kramer	73cb58dc48	[mlir][Shape] Lower cstr_eq to shape_eq + assert Differential Revision: https://reviews.llvm.org/D97860	2021-03-03 17:22:28 +01:00
Benjamin Kramer	24acadef8a	[mlir][Shape] Make shape_eq nary This gets rid of a dubious shape_eq %a, %a fold, that folds shape_eq even if %a is not an Attribute. Differential Revision: https://reviews.llvm.org/D97728	2021-03-03 16:26:40 +01:00
Vladislav Vinogradov	37eca08e5b	[mlir][NFC] Rename `MemRefType::getMemorySpace` to `getMemorySpaceAsInt` Just a pure method renaming. It is a preparation step for replacing "memory space as raw integer" with more generic "memory space as attribute", which will be done in separate commit. The `MemRefType::getMemorySpace` method will return `Attribute` and become the main API, while `getMemorySpaceAsInt` will be declared as deprecated and will be replaced in all in-tree dialects (also in separate commits). Reviewed By: mehdi_amini, rriddle Differential Revision: https://reviews.llvm.org/D97476	2021-03-02 11:08:54 +03:00
Stella Stamenova	801067f4c0	[mlir][lldb] Fix several gcc warnings in mlir and lldb These warnings are raised when compiling with gcc due to either having too few or too many commas, or in the case of lldb, the possibility of a nullptr. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97586	2021-03-01 13:48:22 -08:00
Rob Suderman	087bc20fe4	[MLIR][TOSA] Lower tosa.transpose to linalg.generic Lowers the transpose operation to a generic linalg op when permutations is a constant value. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D97508	2021-03-01 11:09:49 -08:00
Rob Suderman	16abacaea9	[MLIR][TOSA] Resubmit Tosa to Standard/SCF Lowerings (const, if, while)" Includes a lowering for tosa.const, tosa.if, and tosa.while to Standard/SCF dialects. TosaToStandard is used for constant lowerings and TosaToSCF handles the if/while ops. Resubmission of https://reviews.llvm.org/D97518 with ASAN fixes. Differential Revision: https://reviews.llvm.org/D97529	2021-02-26 17:44:12 -08:00
Rob Suderman	f685c9ac86	[MLIR][TOSA] Lower tosa.identity and tosa.identitiyn to linalg Both identity ops can be loweried by replacing their results with their inputs. We keep this as a linalg lowering as other backends may choose to create copies. Differential Revision: https://reviews.llvm.org/D97517	2021-02-26 15:45:07 -08:00
Aart Bik	df5ccf5a94	[mlir][vector] add higher dimensional support to gather/scatter Similar to mask-load/store and compress/expand, the gather and scatter operation now allow for higher dimension uses. Note that to support the mixed-type index, the new syntax is: vector.gather %base [%i,%j] [%kvector] .... The first client of this generalization is the sparse compiler, which needs to define scatter and gathers on dense operands of higher dimensions too. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D97422	2021-02-26 14:20:19 -08:00
Rob Suderman	caccddc52a	[MLIR][TOSA] Lower tosa.reshape to linalg.reshape Lowering from the tosa.reshape op to linalg.reshape. For same-rank or non-collapsed/expanded cases two linalg.reshapes are inserted. Differential Revision: https://reviews.llvm.org/D97439	2021-02-26 12:57:57 -08:00
Benjamin Kramer	4941fef9c4	[mlir] Silence some deprecation warnings after `dffc487b07`	2021-02-26 15:15:56 +01:00
Marius Brehler	56774bdda5	[mlir] Replace deprecated 'getAttrs' 'getAttrs' has been explicitly marked deprecated. This patch refactors to use Operation::getAttrs(). Reviewed By: csigg Differential Revision: https://reviews.llvm.org/D97546	2021-02-26 14:52:40 +01:00
Christian Sigg	dffc487b07	[mlir] Mark OpState::removeAttr() deprecated. Fix call sites. The method will be removed 2 weeks later. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97530	2021-02-26 12:04:41 +01:00
Rob Suderman	c47aa3c8de	Revert [MLIR][TOSA] Added Tosa to Standard/SCF Lowerings (const, if, while) This reverts commit `a813e9be5b`. Results in an ASAN failure due to bypassing rewriter. Differential Revision: https://reviews.llvm.org/D97518	2021-02-25 18:05:16 -08:00
Rob Suderman	a813e9be5b	[MLIR][TOSA] Added Tosa to Standard/SCF Lowerings (const, if, while) Includes a lowering for tosa.const, tosa.if, and tosa.while to Standard/SCF dialects. TosaToStandard is used for constant lowerings and TosaToSCF handles the if/while ops. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D97352	2021-02-25 14:35:21 -08:00
Christian Sigg	8c074cb0b7	[mlir] Mark OpState::getAttrs() deprecated. Fix call sites. The method will be removed 2 weeks later. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97464	2021-02-25 20:54:42 +01:00
Christian Sigg	f03826f896	Pass GPU events instead of streams across async regions. Lower !gpu.async.tokens returned from async.execute regions to events instead of streams. Make !gpu.async.token returned from !async.execute single-use. This allows creating one event per use and destroying them without leaking or ref-counting. Technically we only need this for stream/event-based lowering. I kept the code separate from the rest of the gpu-async-region pass so that we can make this optional or move to a separate pass as needed. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D96965	2021-02-25 13:18:18 +01:00
Lei Zhang	5f8a80882b	[mlir] Add constBuilderCall to TypeAttr to simplify builders Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97344	2021-02-24 13:04:03 -05:00
Christian Sigg	eb8d6af5e4	[mlir] Specify cuda-runner pass pipeline as command line options. The cuda-runner registers two pass pipelines for nested passes, so that we don't have to use verbose textual pass pipeline specification. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D97091	2021-02-24 14:36:52 +01:00
River Riddle	ddd556f10e	[mlir][pdl] Fix bug when ordering predicates We should be ordering predicates with higher primary/secondary sums first, but we are currently ordering them last. This allows for predicates more frequently encountered to be checked first. Differential Revision: https://reviews.llvm.org/D95715	2021-02-22 19:02:48 -08:00
Andrew Pritchard	08c681f645	Perform memory accesses in the same addrspace as the corresponding memref. It's not necessarily the case on all architectures that all memory is addressable in addrspace 0, so casting the pointer to addrspace 0 is liable to cause problems. Reviewed By: aartbik, ftynse, nicolasvasilache Differential Revision: https://reviews.llvm.org/D96380	2021-02-18 12:36:16 -08:00
natashaknk	25b4a6a7f0	[MLIR][TOSA] Add lowering from TOSA to Linalg for math-based and elementwise ops This patch adds lowering to Linalg for the following TOSA ops: negate, rsqrt, mul, select, clamp and reluN and includes support for signless integer and floating point types Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D96924	2021-02-18 12:10:10 -08:00
Alexander Belyaev	a89035d750	Revert "[MLIR] Create memref dialect and move several dialect-specific ops from std." This commit introduced a cyclic dependency: Memref dialect depends on Standard because it used ConstantIndexOp. Std depends on the MemRef dialect in its EDSC/Intrinsics.h Working on a fix. This reverts commit `8aa6c3765b`.	2021-02-18 12:49:52 +01:00
Julian Gross	8aa6c3765b	[MLIR] Create memref dialect and move several dialect-specific ops from std. Create the memref dialect and move several dialect-specific ops without dependencies to other ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp DeallocOp -> MemRef_DeallocOp MemRefCastOp -> MemRef_CastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp TransposeOp -> MemRef_TransposeOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D96425	2021-02-18 11:29:39 +01:00
Eugene Zhulenev	519f5917b4	[mlir] Add fma operation to std dialect Will remove `vector.fma` operation in the followup CLs. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96801	2021-02-17 10:06:01 -08:00

... 3 4 5 6 7 ...

1384 Commits