Needed to switch to extract to support tosa.reverse with dynamic shapes.
Reviewed By: NatashaKnk
Differential Revision: https://reviews.llvm.org/D108744
Includes the quantized version of average pool lowering to linalg dialect.
This includes a lit test for the transform. It is not 100% correct, as the
multiplier/shift should be computed in i64; however, the rounding difference
is negligible.
Reviewed By: NatashaKnk
Differential Revision: https://reviews.llvm.org/D108676
The lowering to table was incorrect, as it did not apply a 128 offset before
extracting the value from the table. Fixed this, and corrected the tensor
length of the input table.
Reviewed By: NatashaKnk
Differential Revision: https://reviews.llvm.org/D108436
When padding quantized operations, the padding needs to equal the zero point
of the input value. Corrected the pass to change the padding value if quantized.
Reviewed By: NatashaKnk
Differential Revision: https://reviews.llvm.org/D108440
Presently, the lowering of nested scf.parallel loops to OpenMP creates one omp.parallel region, with two (nested) OpenMP worksharing loops on the inside. When lowered to LLVM and executed, this results in incorrect results. The reason for this is as follows:
An OpenMP parallel region results in the code being run with however many threads are available to OpenMP. Within a parallel region, a worksharing loop divides the total number of requested iterations by the available number of threads and distributes them accordingly. For a single ws loop in a parallel region, this works as intended.
Now consider nested ws loops as follows:
omp.parallel {
  A: omp.ws %i = 0...10 {
    B: omp.ws %j = 0...10 {
      code(%i, %j)
    }
  }
}
Suppose we ran this on two threads. The first workshare loop would decide to execute iterations 0, 1, 2, 3, 4 on thread 0, and iterations 5, 6, 7, 8, 9 on thread 1. The second workshare loop would decide the same for its iterations. This means thread 0 would execute i in [0, 5) and j in [0, 5), while thread 1 would execute i in [5, 10) and j in [5, 10). The iterations with i in [5, 10), j in [0, 5) and with i in [0, 5), j in [5, 10) never get executed, which is clearly wrong.
This permits two options for a remedy:
1) Change the semantics of omp.wsloop to be distinct from that of the OpenMP runtime call or, equivalently, #pragma omp for. This could then allow some lowering transformation to remedy the aforementioned issue. I don't think this is desirable from an abstraction standpoint.
2) When lowering an scf.parallel always surround the wsloop with a new parallel region (thereby causing the innermost wsloop to use the number of threads available only to it).
This PR implements the latter change.
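In the same pseudo-notation as above, the lowering after this change looks schematically like:
omp.parallel {
  omp.ws %i = 0...10 {
    omp.parallel {
      omp.ws %j = 0...10 {
        code(%i, %j)
      }
    }
  }
}
Each outer iteration now runs the inner wsloop inside its own parallel region, so all ten j iterations are distributed among that region's threads and every (i, j) pair executes.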
Reviewed By: jdoerfert
Differential Revision: https://reviews.llvm.org/D108426
Tosa rescale can contain uint8 types. Added support for these types
using an unrealized conversion cast. Ideally we would use a bitcast;
however, bitcast does not support unsigned integers.
Differential Revision: https://reviews.llvm.org/D108427
It's possible for the clamp to have invalid min/max values in its range. To fix
this, we validate the range of the min/max and clamp to a valid range.
Reviewed By: NatashaKnk
Differential Revision: https://reviews.llvm.org/D108256
LLVM considers a global variable marked as external to be defined within the module if it is initialized (including to undef). Other external globals are considered to be defined externally and imported into the current translation unit. The lowering of MLIR global ops did not properly propagate undefined initializers, resulting in a global that is expected to be defined within the current TU not being defined.
Differential Revision: https://reviews.llvm.org/D108252
The existing linalg.conv2d is not well optimized for performance. Changed to a
version that is better aligned for optimization, and included the corresponding
transposes needed to use this optimized version.
This also splits the conv and depthwise conv into separate implementations
to avoid overly complex lowerings.
Reviewed By: antiagainst
Differential Revision: https://reviews.llvm.org/D107504
The conversion is a straightforward one-to-one mapping with optional unrolling
for nD vectors, similarly to other cast operations.
Depends On D107889
Reviewed By: cota, akuegel
Differential Revision: https://reviews.llvm.org/D107891
Some folding cases are trivial to fold away, specifically no-op cases where
an operation's input and output are the same. Canonicalizing these away
removes unneeded operations.
The current version includes tensor cast operations to resolve shape
discrepancies that occur when an operation's result type differs from the
input type. These are resolved during a tosa shape propagation pass.
Reviewed By: NatashaKnk
Differential Revision: https://reviews.llvm.org/D107321
Dilation only requires increasing the padding on the left/right side of the
input, and including dilation in the convolution. This implementation still
lacks support for strided convolutions.
Reviewed By: NatashaKnk
Differential Revision: https://reviews.llvm.org/D107680
These ops were not ported to the nD vector conversion when it was introduced
and nobody needed them so far.
Reviewed By: gysit
Differential Revision: https://reviews.llvm.org/D107750
If the source value to load is bool, and we have native storage
capability support for the source bitwidth, we still cannot directly
rewrite uses; we need to perform casting to bool first.
Reviewed By: hanchung
Differential Revision: https://reviews.llvm.org/D107119
If the source value to store is bool, and we have native storage
capability support for the target bitwidth, we still cannot directly
store; we need to perform casting to match the target memref
element's bitwidth.
Reviewed By: hanchung
Differential Revision: https://reviews.llvm.org/D107114
Historically the builtin dialect has had an empty namespace. This has unfortunately created a very awkward situation, where many utilities either have to special case the empty namespace, or just don't work at all right now. This revision adds a namespace to the builtin dialect, and starts to cleanup some of the utilities to no longer handle empty namespaces. For now, the assembly form of builtin operations does not require the `builtin.` prefix. (This should likely be re-evaluated though)
Differential Revision: https://reviews.llvm.org/D105149
The verifier of the llvm.call operation was not checking for mismatches between
the number of operation results and the number of results in the signature of
the callee. Furthermore, it was possible to construct an llvm.call operation
producing an SSA value of !llvm.void type, which should not exist. Add the
verification and treat !llvm.void result type as absence of call results.
Update the GPU conversions to LLVM that were mistakenly assuming that it was
fine for llvm.call to produce values of !llvm.void type and ensure these calls
do not produce results.
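A minimal sketch of the enforced behavior (function name hypothetical):
```
llvm.func @foo()
// A call to a void function now produces no SSA results;
// %r = llvm.call @foo() : () -> !llvm.void is rejected by the verifier.
llvm.call @foo() : () -> ()
```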
Reviewed By: mehdi_amini
Differential Revision: https://reviews.llvm.org/D106937
Includes a version of a quantized conv2D operation with a lowering from TOSA
to linalg and a corresponding test. We keep the quantized and non-quantized
variants as separate named ops to avoid the additional operations for
non-quantized convolutions.
Differential Revision: https://reviews.llvm.org/D106407
Type conversion and argument materialization are context-free: there is no available information on which op / branch is currently being converted.
As a consequence, the bare ptr convention cannot be handled as an argument materialization: it would apply irrespective of the parent op.
This doesn't typecheck in the case of non-funcOps, and we would see cases where a memref descriptor is inserted in place of the pointer in another memref descriptor.
For now, the proper behavior is to revert to a specific BarePtrFunc implementation and drop the blanket argument materialization logic.
This reverts the relevant piece of the conversion to LLVM to what it was before https://reviews.llvm.org/D105880, and adds a relevant test and documentation to prevent the mistake by whoever attempts this again in the future.
Reviewed By: arpith-jacob
Differential Revision: https://reviews.llvm.org/D106495
The unstrided transposed conv can be represented as a regular convolution.
Lower to this variant to handle the basic case. This includes transitioning from
the TC-defined convolution operation to a YAML-defined one.
Reviewed By: NatashaKnk
Differential Revision: https://reviews.llvm.org/D106389
Added the named op variants for quantized matmul and quantized batch matmul
with the necessary lowerings/tests from tosa's matmul/fully connected ops.
Current version does not use the contraction op interface as its verifiers
are not compatible with scalar operations.
Differential Revision: https://reviews.llvm.org/D105063
This deletes all the pooling ops in LinalgNamedStructuredOpsSpec.tc. All the
uses are replaced with the yaml pooling ops.
Reviewed By: gysit, rsuderman
Differential Revision: https://reviews.llvm.org/D106181
This simplifies the vector to LLVM lowering. Previously, both vector.load/store and vector.transfer_read/write lowered directly to LLVM. With this commit, there is a single path to LLVM vector load/store instructions and vector.transfer_read/write ops must first be lowered to vector.load/store ops.
* Remove vector.transfer_read/write to LLVM lowering.
* Allow non-unit memref strides on all but the most minor dimension for vector.load/store ops.
* Add maxTransferRank option to populateVectorTransferLoweringPatterns.
* vector.transfer_reads with changing element type can no longer be lowered to LLVM. (This functionality is needed only for SPIRV.)
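A minimal sketch of the two-step path (shapes and names hypothetical):
```
%c0 = constant 0 : index
%pad = constant 0.0 : f32
// Before: a minor-dim transfer read...
%v = vector.transfer_read %mem[%c0], %pad : memref<16xf32>, vector<8xf32>
// ...is first rewritten to a vector.load, which is what lowers to LLVM:
%w = vector.load %mem[%c0] : memref<16xf32>, vector<8xf32>
```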
Differential Revision: https://reviews.llvm.org/D106118
The dialect-specific cast between builtin (ex-standard) types and LLVM
dialect types was introduced a long time before built-in support for
unrealized_conversion_cast. It has a similar purpose, but is restricted
to compatible builtin and LLVM dialect types, which may hamper
progressive lowering and composition with types from other dialects.
Replace llvm.mlir.cast with unrealized_conversion_cast, and drop the
operation that became unnecessary.
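A minimal before/after sketch (types illustrative):
```
// Before: the dialect-specific cast, limited to builtin <-> LLVM types.
%1 = llvm.mlir.cast %0 : index to i64
// After: the generic builtin cast, applicable to any pair of types.
%2 = unrealized_conversion_cast %0 : index to i64
```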
Also make unrealized_conversion_cast legal by default in
LLVMConversionTarget, as the majority of conversions using it are partial
conversions that actually want the casts to persist in the IR. The
standard-to-llvm conversion, which is still expected to run last, cleans
up the remaining casts.
Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D105880
After the Math has been split out of the Standard dialect, the
conversion to the LLVM dialect remained as a huge monolithic pass.
This is undesirable for the same complexity management reasons as having
a huge Standard dialect itself, and is even more confusing given the
existence of a separate dialect. Extract the conversion of the Math
dialect operations to LLVM into a separate library and a separate
conversion pass.
Reviewed By: silvas
Differential Revision: https://reviews.llvm.org/D105702
Use a modeling similar to SCF ParallelOp to support arbitrary parallel
reductions. The two main differences are: (1) reductions are named and declared
beforehand similarly to functions using a special op that provides the neutral
element, the reduction code and optionally the atomic reduction code; (2)
reductions go through memory instead because this is closer to the OpenMP
semantics.
See https://llvm.discourse.group/t/rfc-openmp-reduction-support/3367.
Reviewed By: kiranchandramohan
Differential Revision: https://reviews.llvm.org/D105358
After the MemRef has been split out of the Standard dialect, the
conversion to the LLVM dialect remained as a huge monolithic pass.
This is undesirable for the same complexity management reasons as having
a huge Standard dialect itself, and is even more confusing given the
existence of a separate dialect. Extract the conversion of the MemRef
dialect operations to LLVM into a separate library and a separate
conversion pass.
Reviewed By: herhut, silvas
Differential Revision: https://reviews.llvm.org/D105625
Simplify the vector unrolling pattern to be more aligned with the rest of the
patterns and closer to vector distribution.
The new implementation uses ExtractStridedSlice/InsertStridedSlice
instead of the Tuple ops. After this change, the Tuple-based ops no longer
have any uses, so they can be removed.
This allows removing a significant amount of dead code and will allow
extending the unrolling code going forward.
Differential Revision: https://reviews.llvm.org/D105381
Added InferReturnTypeComponents for NAry operations, reshape, and reverse.
With the additional tosa-infer-shapes pass, we can infer/propagate shapes
across a set of TOSA operations. The current version does not modify the
FuncOp type; instead, it inserts an unrealized conversion cast prior to any
new non-matching returns.
Differential Revision: https://reviews.llvm.org/D105312
* Split memref.dim into two operations: memref.dim and tensor.dim (see the sketch after this list). Both ops have the same builder interface and op argument names, so that they can be used with templates in patterns that apply to both tensors and memrefs (e.g., some patterns in Linalg).
* Add constant materializer to TensorDialect (needed for folding in affine.apply etc.).
* Remove some MemRefDialect dependencies, make some explicit.
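A minimal sketch of the split ops (shapes hypothetical):
```
%c0 = constant 0 : index
// The same dimension query, now spelled per dialect:
%dt = tensor.dim %t, %c0 : tensor<?x4xf32>
%dm = memref.dim %m, %c0 : memref<?x4xf32>
```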
Differential Revision: https://reviews.llvm.org/D105165
Specify the `!async.group` size (the number of tokens that will be added to it) at construction time. The `async.await_all` operation can potentially race with `async.execute` operations that keep updating the group; for this reason, it is required to know upfront how many tokens will be added to the group.
Reviewed By: ftynse, herhut
Differential Revision: https://reviews.llvm.org/D104780
The patch changes the pretty printed FillOp operand order from output, value to value, output. The change is a follow up to https://reviews.llvm.org/D104121 that passes the fill value using a scalar input instead of the former capture semantics.
Differential Revision: https://reviews.llvm.org/D104356
Correct a documentation typo, and delete a duplicate test in
`pdl-to-pdl-interp-rewriter.mlir`.
Reviewed By: pr4tgpt, bondhugula, rriddle
Differential Revision: https://reviews.llvm.org/D104688
The main goal of this commit is to remove the dependency of Standard dialect on the Tensor dialect.
* Rename SubTensorOp -> tensor.extract_slice, SubTensorInsertOp -> tensor.insert_slice (examples after this list).
* Some helper functions are (already) duplicated between the Tensor dialect and the MemRef dialect. To keep this commit smaller, this will be cleaned up in a separate commit.
* Additional dialect dependencies: Shape --> Tensor, Tensor --> Standard
* Remove dialect dependencies: Standard --> Tensor
* Move canonicalization test cases to correct dialect (Tensor/MemRef).
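A minimal sketch of the renamed ops (offsets/sizes/strides hypothetical):
```
// Extract a 2x2 slice at offset [0, 0] with unit strides...
%s = tensor.extract_slice %t[0, 0] [2, 2] [1, 1] : tensor<4x4xf32> to tensor<2x2xf32>
// ...and insert it back at the same position.
%r = tensor.insert_slice %s into %t[0, 0] [2, 2] [1, 1] : tensor<2x2xf32> into tensor<4x4xf32>
```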
Note: This is a fixed version of https://reviews.llvm.org/D104499, which was reverted due to a missing update to two CMakeLists.txt files.
Differential Revision: https://reviews.llvm.org/D104676
The main goal of this commit is to remove the dependency of Standard dialect on the Tensor dialect.
* Rename ops: SubTensorOp --> ExtractTensorOp, SubTensorInsertOp --> InsertTensorOp
* Some helper functions are (already) duplicated between the Tensor dialect and the MemRef dialect. To keep this commit smaller, this will be cleaned up in a separate commit.
* Additional dialect dependencies: Shape --> Tensor, Tensor --> Standard
* Remove dialect dependencies: Standard --> Tensor
* Move canonicalization test cases to correct dialect (Tensor/MemRef).
Differential Revision: https://reviews.llvm.org/D104499
The index cast operation accepts vector types. Implement its lowering in this patch.
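For example (shapes hypothetical), the lowering now handles casts such as:
```
// Element-wise index cast on a vector operand.
%0 = index_cast %v : vector<4xindex> to vector<4xi32>
```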
Reviewed By: ftynse
Differential Revision: https://reviews.llvm.org/D104280
## Introduction
This proposal describes the new op to be added to the `std` dialect (and
later moved to the `memref` dialect), called `alloca_scope`.
## Motivation
Alloca operations are easy to misuse, especially if one relies on them while
doing rewriting/conversion passes. For example, consider two independent
dialects: one defines an op that wants to allocate on-stack, and another
defines a construct that corresponds to some form of looping:
```
dialect1.looping_op {
  %x = dialect2.stack_allocating_op
}
```
Since the dialects might not know about each other, they are going to define
lowerings to std/scf/etc. independently:
```
scf.for … {
  %x_temp = std.alloca …
  … // do some domain-specific work using %x_temp buffer
  … // and store the result into %result
  %x = %result
}
```
Later on, the scf ops and `std.alloca` are going to be lowered to llvm using a
combination of `llvm.alloca` and unstructured control flow.
At this point, the use of `%x_temp` is bound either to be optimized by
llvm (for example using mem2reg) or, in the worst case, to perform an
independent stack allocation on each iteration of the loop. While the llvm
optimizations are likely to succeed, they are not guaranteed to do so, and they
provide opportunities for surprising issues with unexpected stack usage.
## Proposal
We propose a new operation that defines a finer-grained allocation scope for
alloca-allocated memory, called `alloca_scope`:
```
alloca_scope {
  %x_temp = alloca …
  ...
}
```
Here the lifetime of `%x_temp` is going to be bound to the narrow annotated
region within `alloca_scope`. Moreover, one can also return values out of the
alloca_scope with an accompanying `alloca_scope.return` op (that behaves
similarly to `scf.yield`):
```
%result = alloca_scope {
  %x_temp = alloca …
  …
  alloca_scope.return %myvalue
}
```
Under the hood, the `alloca_scope` is going to be lowered to a combination of
`llvm.intr.stacksave` and `llvm.intr.stackrestore` that are going to be invoked
automatically as control flow enters and leaves the body of the `alloca_scope`.
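Schematically, the body of a scope is bracketed like this (the exact assembly of the intrinsics is illustrative, not authoritative):
```
%sp = llvm.intr.stacksave : !llvm.ptr<i8>
… // allocas performed inside the scope live here
llvm.intr.stackrestore %sp : !llvm.ptr<i8>
```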
The key value of the new op is to allow deterministic, guaranteed stack use
through an explicit annotation in the code that is finer-grained than the
function-level scope of the `AutomaticAllocationScope` interface. `alloca_scope`
can be inserted at arbitrary locations and doesn't require non-trivial
transformations such as outlining.
## Which dialect
Before the memref dialect is split out, `alloca_scope` can temporarily reside
in the `std` dialect, and later on be moved to `memref` together with the rest
of the memory-related operations.
## Implementation
An implementation of the op is available [here](https://reviews.llvm.org/D97768).
Original commits:
* Add initial scaffolding for alloca_scope op
* Add alloca_scope.return op
* Add no region arguments and variadic results
* Add op descriptions
* Add failing test case
* Add another failing test
* Initial implementation of lowering for std.alloca_scope
* Fix backticks
* Fix getSuccessorRegions implementation
Reviewed By: ftynse
Differential Revision: https://reviews.llvm.org/D97768
This is the first step to convert vector ops to MMA operations in order to
target GPU tensor core ops. This currently only supports simple cases;
transpose and element-wise operations will be added later.
Differential Revision: https://reviews.llvm.org/D102962
This allows creating a matrix with all elements set to a given value. This is
needed to be able to implement a simple dot op.
Differential Revision: https://reviews.llvm.org/D103870
tosa.matmul is a batched matmul; update the lowering to linalg along
with the tests.
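For reference, a minimal sketch of the batched op in generic syntax (shapes hypothetical):
```
// Leading batch dimension of 1; lowers to a batched linalg matmul.
%0 = "tosa.matmul"(%a, %b) : (tensor<1x3x4xf32>, tensor<1x4x5xf32>) -> tensor<1x3x5xf32>
```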
Reviewed By: sjarus
Differential Revision: https://reviews.llvm.org/D103937
This allows us to remove the `spv.mlir.endmodule` op and
all the code associated with it.
Along the way, tightened the APIs for `spv.module` a bit
by removing some aliases. Now we use `getRegion` to get
the only region, and `getBody` to get the region's only
block.
Reviewed By: mravishankar, hanchung
Differential Revision: https://reviews.llvm.org/D103265
Now that memref supports arbitrary element types, add support for memref of
memref and make sure it is properly converted to the LLVM dialect. The type
support itself avoids adding the interface to the memref type itself similarly
to other built-in types. This allows the shape, and therefore byte size, of the
memref descriptor to remain a lowering aspect that is easier to customize and
evolve as opposed to sanctifying it in the data layout specification for the
memref type itself.
Factor out the code previously in a testing pass to live in a dedicated data
layout analysis and use that analysis in the conversion to compute the
allocation size for memref of memref. Other conversions will be ported
separately.
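For instance (shape hypothetical), the following is now a valid allocation:
```
// A rank-1 memref whose elements are themselves memref descriptors.
%m = memref.alloc() : memref<4xmemref<?xf32>>
```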
Depends On D103827
Reviewed By: rriddle
Differential Revision: https://reviews.llvm.org/D103828
Historically, MemRef only supported a restricted list of element types that
were known to be storable in memory. This is unnecessarily restrictive given
the open nature of MLIR's type system. Allow types to opt into being used as
MemRef elements by implementing a type interface. For now, the interface is
merely a declaration with no methods. Later, methods to query, e.g., the type
size or whether a type can alias elements of another type may be added.
Harden the "standard"-to-LLVM conversion against memrefs with non-builtin
types.
See https://llvm.discourse.group/t/rfc-memref-of-custom-types/3558.
Depends On D103826
Reviewed By: rriddle
Differential Revision: https://reviews.llvm.org/D103827
Some places in the alloc-like op conversion use the converted index type
whereas other places use the pointer-sized integer type, which may not be the
same. Consistently use the converted index type, similarly to other address
calculations.
Reviewed By: pifon2a
Differential Revision: https://reviews.llvm.org/D103826
This patch converts the if condition on standalone data operations such as acc.update,
acc.enter_data, and acc.exit_data to an scf.if with the operation in the if region.
It removes the operation when the if condition is constant and false, and removes
the condition when it is constant and true.
Conversion to scf.if is done in order to use the translation to LLVM IR dialect out of the box.
Not sure whether this is the best approach or whether we should perform this during the
translation from OpenACC to the LLVM IR dialect. Any thoughts welcome.
Reviewed By: ftynse
Differential Revision: https://reviews.llvm.org/D103325
Convert data operands of the acc.parallel operation using the same conversion pattern as D102170.
Reviewed By: ftynse
Differential Revision: https://reviews.llvm.org/D103337
Convert data operands of the acc.data operation using the same conversion pattern as D102170.
Reviewed By: ftynse
Differential Revision: https://reviews.llvm.org/D103332
Support for tensor types in the unrolled version will follow in a separate commit.
Add a new pass option to activate lowering of transfer ops with tensor types (default: deactivated).
Differential Revision: https://reviews.llvm.org/D102666
Depthwise convolution should support kernel dilation, and non-dilation should
not be a special case. Updated the op definition to include a dilation attribute.
This also adds a tosa.depthwise_conv2d lowering to linalg to support the new
linalg behavior.
Differential Revision: https://reviews.llvm.org/D103219
Depends On D103102
Not yet implemented:
1. Error handling after synchronous await
2. Error handling for async groups
Will be addressed in follow-up PRs.
Reviewed By: mehdi_amini
Differential Revision: https://reviews.llvm.org/D103109
In order to allow large matmul operations using the MMA ops, we need to chain
operations; this is not possible unless the "DOp" and "COp" types have matching
layouts. Therefore, remove the "DOp" layout and force the accumulator and
result types to match.
Added a test for the case where the MMA value is accumulated.
Differential Revision: https://reviews.llvm.org/D103023
The lowering of the casting ops (sitofp, uitofp, fptosi, fptoui) currently does
not handle n-D vectors. This patch fixes that.
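For example (shapes hypothetical), a cast such as the following is now handled by unrolling to 1-D vectors:
```
%0 = sitofp %arg : vector<2x4xi32> to vector<2x4xf32>
```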
Differential Revision: https://reviews.llvm.org/D103207
Indexed Generic should be going away in the future. Migrate to linalg.index.
Reviewed By: NatashaKnk, nicolasvasilache
Differential Revision: https://reviews.llvm.org/D103110
This provides a sizable compile-time improvement by seeding
the worklist in an order that leads to fewer iterations of the
worklist.
This patch only changes the behavior of the Canonicalize pass
itself; it does not affect other passes that use the
GreedyPatternRewrite driver.
Differential Revision: https://reviews.llvm.org/D103053
This adds the straightforward conversion for EqualOp
(two complex numbers are equal if both the real and the imaginary part are equal).
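A minimal sketch of the converted op (operand names hypothetical):
```
// True iff both the real and imaginary parts compare equal.
%eq = complex.eq %lhs, %rhs : complex<f32>
```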
Differential Revision: https://reviews.llvm.org/D102840
vector.transfer_read and vector.transfer_write operations are converted
to llvm intrinsics with specific alignment information; however, there
doesn't seem to be a way in llvm to take information from llvm.assume
intrinsics and change this alignment information. In any
event, due to the structure of the llvm.assume intrinsic, applying
this information at the llvm level is more cumbersome. Instead, let's
generate the masked vector load and store intrinsics with the right
alignment information from MLIR in the first place. Since
we're bothering to do this, let's just emit the proper alignment for
loads, stores, scatter, and gather ops too.
Differential Revision: https://reviews.llvm.org/D100444
The initial version of pooling assumed normalization was across all elements
equally. TOSA actually requires that the normalization be performed by how
many elements were summed (edges are not artificially dimmer). Updated
the lowering to reflect this change, with corresponding tests.
Reviewed By: NatashaKnk
Differential Revision: https://reviews.llvm.org/D102540
Lowering div elementwise op to the linalg dialect. Since tosa only supports integer division, that is the only version that is currently implemented.
Reviewed By: rsuderman
Differential Revision: https://reviews.llvm.org/D102430
Create a copy of vector-to-loops.mlir and adapt the test for
ProgressiveVectorToSCF. Fix a small bug in getExtractOp() triggered by
this test.
Differential Revision: https://reviews.llvm.org/D102388
Converting to integers requires rounding (for floating-point inputs) and clipping
to the min/max values of the destination range. Added this behavior and
updated tests appropriately.
Reviewed By: sjarus, silvas
Differential Revision: https://reviews.llvm.org/D102375
Instead of an SCF for loop, these patterns generate fully unrolled loops with no temporary buffer allocations.
Differential Revision: https://reviews.llvm.org/D101981
Add a conversion pass to convert higher-level types before translation.
This conversion extracts meaningful information and packs it into a struct that
the translation (D101504) will be able to understand.
Reviewed By: ftynse
Differential Revision: https://reviews.llvm.org/D102170
Updated tests to include broadcast of the left and right operands. Includes a
bypass if the in-type and out-type match in shape (no broadcasting).
Differential Revision: https://reviews.llvm.org/D102276