llvm-project

Commit Graph

Author	SHA1	Message	Date
River Riddle	5a7b919409	[mlir][NFC] Rename StandardToLLVM to FuncToLLVM The current StandardToLLVM conversion patterns only really handle the Func dialect. The pass itself adds patterns for Arithmetic/CFToLLVM, but those should be/will be split out in a followup. This commit focuses solely on being an NFC rename. Aside from the directory change, the pattern and pass creation API have been renamed: * populateStdToLLVMFuncOpConversionPattern -> populateFuncToLLVMFuncOpConversionPattern * populateStdToLLVMConversionPatterns -> populateFuncToLLVMConversionPatterns * createLowerToLLVMPass -> createConvertFuncToLLVMPass Differential Revision: https://reviews.llvm.org/D120778	2022-03-07 11:25:23 -08:00
Diego Caballero	917d95fc8a	[mlir][Vector] Improve default lowering of vector transpose operations The default lowering of vector transpose operations generates a large sequence of scalar extract/insert operations, one pair for each scalar element in the input vector. In other words, the vector transpose is scalarized. However, there are transpose patterns where one or more adjacent high-order dimensions are not transposed (for example, in the transpose pattern [1, 0, 2, 3], dimensions 2 and 3 are not transposed). This patch improves the lowering of those cases by not scalarizing them and extracting/inserting a full n-D vector, where 'n' is the number of adjacent high-order dimensions not being transposed. By doing so, we prevent the scalarization of the code and generate a more performant vector version. Paradoxically, this patch shouldn't improve the performance of transpose operations if we are using LLVM. The LLVM pipeline is able to optimize away some of the extract/insert operations and the SLP vectorizer is converting the scalar operations back to their vector form. However, scalarizing a vector version of the code in MLIR and relying on the SLP vectorizer to reconstruct the vector code again is highly undesirable for several reasons. Reviewed By: nicolasvasilache, ThomasRaoux Differential Revision: https://reviews.llvm.org/D120601	2022-03-07 17:56:02 +00:00
Uday Bondhugula	9b740c035c	Update normalizeAffineFor to canonicalize maps/operands before using them Update normalizeAffineFor utility to canonicalize maps and operands before using them. Differential Revision: https://reviews.llvm.org/D121086	2022-03-07 18:49:50 +05:30
William S. Moses	62f84c73d2	[MLIR][SCF] Allow combining subsequent if statements that yield & negated condition This patch extends the existing if combining canonicalization to also handle the case where a value returned by the first if is used within the body of the second if. This patch also extends if combining to support if's whose conditions are logical negations of each other. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120924	2022-03-04 12:07:47 -05:00
William S. Moses	1d1791572c	[MLIR][MemRef] Ensure alloca_scope is inlined with no allocating ops Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120841	2022-03-04 11:58:59 -05:00
William S. Moses	4a94a33ca6	[MLIR][LLVM] Fold extractvalue to ignore insertvalue at distinct index We can simplify an extractvalue of an insertvalue to extract out of the base of the insertvalue, if the insert and extract are at distinct and non-prefix'd indices Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120915	2022-03-04 11:03:34 -05:00
Uday Bondhugula	5a99b776eb	[MLIR] Extend isLoopMemoryParallel to account for locally allocated memrefs Extend isLoopMemoryParallel check to include locally allocated memrefs. This strengthens and also speeds up the dependence check used by the utility by excluding locally allocated memrefs where appropriate. Additional memref dialect ops can be supported exhaustively via proper interfaces. Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D120617	2022-03-04 09:16:28 +05:30
Matthias Springer	16cbe883b5	[mlir][linalg][bufferize] Migrate --linalg-bufferize to BufferizableOpInterface-based bufferization This commit deletes the old dialect conversion-based bufferization patterns, which are now obsolete. Differential Revision: https://reviews.llvm.org/D120883	2022-03-03 20:12:37 +09:00
William S. Moses	2af81c6978	[MLIR][Arith] Canonicalize cmpi of extui/extsi Canonicalize cmpi(eq, ext a, ext b) and cmpi(ne, ext a, ext b) Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120620	2022-03-02 12:30:03 -05:00
William S. Moses	db31da279f	[MLIR][Arith] Add constant folder for left shift Add constant folder for left shift Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120661	2022-03-02 12:00:23 -05:00
Alex Zinenko	f64170aa1d	[mlir] Data layout for integer and float types Add support for integer and float types into the data layout subsystem with default logic similar to LLVM IR. Given the flexibility of the subsystem, the logic can be easily overwritten by operations if necessary. This provides the connection necessary, e.g., for the GPU target where alignment requirements for integers and floats differ from those provided by default (although still compatible with the LLVM IR model). Previously, it was impossible to use non-default alignment requirements for integer and float types, which could lead to incorrect address and size calculations when targeting GPUs. Depends On D120737 Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D120739	2022-03-02 14:56:49 +01:00
Shraiysh Vaishay	d2f0fe23d2	[mlir][OpenMP] Added assemblyFormat for atomic and critical operations This patch adds assemblyFormat for `omp.critical.declare`, `omp.atomic.read`, `omp.atomic.write`, `omp.atomic.update` and `omp.atomic.capture`. Also removing those clauses from `parseClauses` that aren't needed anymore, thanks to the new assemblyFormats. Reviewed By: NimishMishra, rriddle Differential Revision: https://reviews.llvm.org/D120248	2022-03-02 11:22:09 +05:30
River Riddle	2f5715dc78	[mlir][NFC] Rename the old Standard dialect test directory to Func The remnants of Standard were renamed to Func, but the test directory remained named as Standard. In addition to fixing the name, this commit also moves the tests for operations not in the Func dialect to the proper parent dialect test directory.	2022-03-01 13:48:34 -08:00
River Riddle	23aa5a7446	[mlir] Rename the Standard dialect to the Func dialect The last remaining operations in the standard dialect all revolve around FuncOp/function related constructs. This patch simply handles the initial renaming (which by itself is already huge), but there are a large number of cleanups unlocked/necessary afterwards: * Removing a bunch of unnecessary dependencies on Func * Cleaning up the From/ToStandard conversion passes * Preparing for the move of FuncOp to the Func dialect See the discussion at https://discourse.llvm.org/t/standard-dialect-the-final-chapter/6061 Differential Revision: https://reviews.llvm.org/D120624	2022-03-01 12:10:04 -08:00
William S. Moses	78fb4f9d5d	[SCF][MemRef] Enable SCF.Parallel Lowering to use Scope Op As discussed in https://reviews.llvm.org/D119743 scf.parallel would continuously stack allocate since the alloca op was placed in the wsloop rather than the omp.parallel. This PR is the second stage of the fix for that problem. Specifically, we now introduce an alloca scope around the inlined body of the scf.parallel and enable a canonicalization to hoist the allocations to the surrounding allocation scope (e.g. omp.parallel). Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120423	2022-03-01 13:25:09 -05:00
Alex Zinenko	5c73db24df	[mlir] disallow side-effecting ops in llvm.mlir.global The llvm.mlir.global operation accepts a region as initializer. This region corresponds to an LLVM IR constant expression and therefore should not accept operations with side effects. Add a corresponding verifier. Reviewed By: wsmoses, bondhugula Differential Revision: https://reviews.llvm.org/D120632	2022-03-01 14:16:09 +01:00
gysit	24357fec8d	[mlir][OpDSL] Add arithmetic function attributes. The revision extends OpDSL with unary and binary function attributes. A function attribute makes the operations used in the body of a structured operation configurable. For example, a pooling operation may take an aggregation function attribute that specifies if the op shall implement a min or a max pooling. The goal of this revision is to define fewer, more flexible operations. We may thus for example define an elementwise op: ``` linalg.elem(lhs, rhs, outs=[out], op=BinaryFn.mul) ``` If the op argument is not set, the default operation is used. Depends On D120109 Reviewed By: nicolasvasilache, aartbik Differential Revision: https://reviews.llvm.org/D120110	2022-03-01 07:45:47 +00:00
Okwan Kwon	4c901bf447	[mlir] Match Arithmetic::ConstantOp and Tensor::ExtractSliceOp. Add a pattern matcher for ExtractSliceOp when its source is a constant. The matching heuristics can be governed by the control function since generating a new constant is not always beneficial. Differential Revision: https://reviews.llvm.org/D119605	2022-02-28 23:09:03 +00:00
Lei Zhang	96bc2233c4	[mlir][linalg] Enhance FoldInsertPadIntoFill to support op chain If we have a chain of `tensor.insert_slice` ops inserting some `tensor.pad` op into a `linalg.fill` and ranges do not overlap, we can also elide the `tensor.pad` later. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D120446	2022-02-28 16:51:17 -05:00
Lei Zhang	5d47332783	[mlir][linalg] Fold tensor.pad when inserting into linalg.fill Fold tensor.insert_slice(tensor.pad(<input>), linalg.fill) into tensor.insert_slice(<input>, linalg.fill) if the padding value and the filling value are the same. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D120410	2022-02-28 16:42:32 -05:00
Okwan Kwon	4f5eb53e68	Revert "[mlir] Fold Arithmetic::ConstantOp and Tensor::ExtractSliceOp." This reverts commit `3104994104`.	2022-02-28 19:14:05 +00:00
Okwan Kwon	3104994104	[mlir] Fold Arithmetic::ConstantOp and Tensor::ExtractSliceOp. Fold ExtractSliceOp when the source is a constant.	2022-02-28 17:47:29 +00:00
gysit	11d144c576	[mlir][linalg] Check the iterator types are valid. Improve the LinalgOp verification to ensure the iterator types are known. Previously, unknown iterator types have been ignored without warning, which can lead to confusing bugs. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D120649	2022-02-28 11:25:40 +00:00
Alexander Belyaev	1a829d2d06	[mlir] Purge linalg.tiled_loop. Differential Revision: https://reviews.llvm.org/D119415	2022-02-28 09:05:18 +01:00
Hanhan Wang	748bf4bb28	[mlir][Linalg] Add support for tileFuseAndDistribute on tensors. This extends TileAndFuse to handle distribution on tensors. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D120441	2022-02-25 11:51:11 -08:00
Diego Caballero	875bbce9f7	[mlir][Vector] Prevent AVX2 lowering for non-f32 transpose ops The AVX2 lowering for transpose operations is only applicable to f32 vector types. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D120427	2022-02-25 19:27:32 +00:00
Diego Caballero	d7e0a0846b	[mlir][Vector] Generalize AVX2 transpose lowering to n-D vectors The existing AVX2 lowering patterns for the transpose op only trigger if the input vector is 2-D. This patch extends the patterns to trigger for n-D vectors which are effectively 2-D vectors (e.g., vector<1x4x1x8x1xf32>). The main constraint for the generalized AVX2 patterns to be applicable to these vectors is that the dimensions that are greater than one must be transposed. Otherwise, the existing patterns are not applicable. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D119505	2022-02-25 19:27:32 +00:00
Chia-hung Duan	9445b39673	[mlir] Support verification order (2/3) This change gives explicit order of verifier execution and adds `hasRegionVerifier` and `verifyWithRegions` to increase the granularity of verifier classification. The order is as follows: 1. InternalOpTrait will be verified first; they can be run independently. 2. `verifyInvariants`, which is constructed by ODS; it verifies the type, attributes, etc. 3. Other Traits/Interfaces that have marked their verifier as `verifyTrait` or `verifyWithRegions=0`. 4. Custom verifier which is defined in the op and has marked `hasVerifier=1`. If an operation has regions, then it may have the second phase: 5. Traits/Interfaces that have marked their verifier as `verifyRegionTrait` or `verifyWithRegions=1`. This implies the verifier needs to access the operations in its regions. 6. Custom verifier which is defined in the op and has marked `hasRegionVerifier=1`. Note that the second phase will be run after the operations in the region are verified. Based on the verification order, you will be able to avoid verifying duplicate things. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D116789	2022-02-25 19:04:56 +00:00
gysit	51fdd802c7	[mlir][OpDSL] Add type function attributes. Previously, OpDSL operations used hardcoded type conversion operations (cast or cast_unsigned). Supporting signed and unsigned casts thus meant implementing two different operations. Type function attributes allow us to define a single operation that has a cast type function attribute which at operation instantiation time may be set to cast or cast_unsigned. We may, for example, define a matmul operation with a cast argument: ``` @linalg_structured_op def matmul(A=TensorDef(T1, S.M, S.K), B=TensorDef(T2, S.K, S.N), C=TensorDef(U, S.M, S.N, output=True), cast=TypeFnAttrDef(default=TypeFn.cast)): C[D.m, D.n] += cast(U, A[D.m, D.k]) * cast(U, B[D.k, D.n]) ``` When instantiating the operation the attribute may be set to the desired cast function: ``` linalg.matmul(lhs, rhs, outs=[out], cast=TypeFn.cast_unsigned) ``` The revision introduces an enum in the Linalg dialect that maps one-by-one to the type functions defined by OpDSL. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D119718	2022-02-25 08:25:23 +00:00
Thomas Raoux	b1357fe618	[mlir][memref] Add transformation to do loop multi-buffering This transformation is useful to break dependency between consecutive loop iterations by increasing the size of a temporary buffer. This is usually combined with heavy software pipelining. Differential Revision: https://reviews.llvm.org/D119406	2022-02-24 09:41:21 -08:00
Marius Brehler	1fa1251116	[mlir][emitc] Add a variable op This adds a variable op, emitted as C/C++ local variable, which can be used if the `emitc.constant` op is not sufficient. As an example, the canonicalization pass would transform ```mlir %0 = "emitc.constant"() {value = 0 : i32} : () -> i32 %1 = "emitc.constant"() {value = 0 : i32} : () -> i32 %2 = emitc.apply "&"(%0) : (i32) -> !emitc.ptr<i32> %3 = emitc.apply "&"(%1) : (i32) -> !emitc.ptr<i32> emitc.call "write"(%2, %3) : (!emitc.ptr<i32>, !emitc.ptr<i32>) -> () ``` into ```mlir %0 = "emitc.constant"() {value = 0 : i32} : () -> i32 %1 = emitc.apply "&"(%0) : (i32) -> !emitc.ptr<i32> %2 = emitc.apply "&"(%0) : (i32) -> !emitc.ptr<i32> emitc.call "write"(%1, %2) : (!emitc.ptr<i32>, !emitc.ptr<i32>) -> () ``` resulting in pointer aliasing, as %1 and %2 point to the same address. In such a case, the `emitc.variable` operation can be used instead. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D120098	2022-02-24 15:25:21 +00:00
Benjamin Kramer	92cf9f1481	[mlir][linalg] Cast back to the original type after making linalg.generic outputs more static This codepath was entirely untested. Differential Revision: https://reviews.llvm.org/D120473	2022-02-24 13:35:54 +01:00
Javier Setoain	cd0d21b47b	[mlir][LLVM] Allow scalable vectors in ShuffleVectorOp The current implementation of ShuffleVectorOp assumes all vectors are fixed-length. LLVM IR allows shufflevector operations on scalable vectors, and the current translation between LLVM Dialect and LLVM IR does the right thing when the shuffle mask is all zeroes. This is required to do a splat operation on a scalable vector, but it doesn't make sense for scalable vectors outside of that operation, i.e.: with non-all zero masks. Differential Revision: https://reviews.llvm.org/D118371	2022-02-24 11:24:34 +00:00
Matthias Springer	25bc684603	[mlir][linalg][bufferize] Always bufferize in-place with "out" operands by default In D115022, we introduced an optimization where OpResults of a `linalg.generic` may bufferize in-place with an "in" OpOperand if the corresponding "out" OpOperand is not used in the computation. This optimization can lead to unexpected behavior if the newly chosen OpOperand is in the same alias set as another OpOperand (that is used in the computation). In that case, the newly chosen OpOperand must bufferize out-of-place. This can be confusing to users, as always choosing the "out" OpOperand (regardless of whether it is used) would be expected when having the notion of "destination-passing style" in mind. With this change, we go back to always bufferizing in-place with "out" OpOperands by default, but letting users override the behavior with a bufferization option. Differential Revision: https://reviews.llvm.org/D120182	2022-02-24 19:58:05 +09:00
William S. Moses	1b2a1f8473	[MLIR][Arith] Canonicalize cmpf(int to fp) to cmpi Given a cmpf of either uitofp or sitofp and a constant, attempt to canonicalize it to a cmpi. This PR rewrites equivalent code within LLVM to now apply to MLIR arith. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D117257	2022-02-23 14:09:20 -05:00
Eugene Zhulenev	beff16f7bd	[mlir] Async: update condition for dispatching block-aligned compute function + compare block size with the unrollable inner dimension + reduce nesting in the code and simplify IR building a bit Reviewed By: cota Differential Revision: https://reviews.llvm.org/D120075	2022-02-23 10:29:55 -08:00
Okwan Kwon	f79f430d4b	Fold Tensor.extract_slice into a constant splat. Fold tensor.extract_slice into arith.constant when the source is a constant splat and the result type is statically shaped.	2022-02-22 21:39:57 +00:00
Matthias Springer	d2dacde5d8	[mlir][bufferize][NFC] Rename `comprehensive-function-bufferize` to `one-shot-bufferize` The related functionality is moved over to the bufferization dialect. Test cases are cleaned up a bit. Differential Revision: https://reviews.llvm.org/D120191	2022-02-22 17:19:20 +09:00
Prateek Gupta	1a2bb03eda	[MLIR][LINALG] Add canonicalization pattern in `linalg.generic` op for static shape inference. This commit adds canonicalization pattern in `linalg.generic` op for static shape inference. If any of the inputs or outputs has a static shape or is cast from a tensor of static shape, then shapes of all the inputs and outputs can be inferred by using the affine map of the static shape input/output. Signed-Off-By: Prateek Gupta <prateek@nod-labs.com> Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D118929	2022-02-21 07:51:13 +00:00
Shraiysh Vaishay	c1e4e01945	[mlir][OpenMP] Added assemblyFormat for SectionsOp This patch adds assemblyFormat for omp.sections operation. Some existing functions have been altered to fit the custom directive in assemblyFormat. This has required modifying their call sites too, but those will be removed in later patches, when other operations get their assemblyFormat. Not all operations were changed in one patch, for ease of review. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D120176	2022-02-21 13:01:49 +05:30
Matthias Springer	4ec00fb3ea	[mlir][bufferize] Add a way for ops to fail the analysis Add `BufferizableOpInterface::verifyAnalysis`. Ops can implement this method to check for expected invariants and limitations. The purpose of this change is to introduce a modular way of checking assertions such as `assertScfForAliasingProperties`. Differential Revision: https://reviews.llvm.org/D120189	2022-02-20 05:51:18 +09:00
Shraiysh Vaishay	39151717db	[mlir][OpenMP] Added assemblyFormat for ParallelOp This patch adds assemblyFormat for omp.parallel operation. Some existing functions have been altered to fit the custom directive in assemblyFormat. This has required modifying their call sites too, but those will be removed in later patches, when other operations get their assemblyFormat. Not all operations were changed in one patch, for ease of review. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D120157	2022-02-19 10:28:58 +05:30
Shraiysh Vaishay	60210f9acb	[mlir][OpenMP] Added assemblyformat for TargetOp This patch removes custom parser/printer for `omp.target` and adds assemblyformat. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D120138	2022-02-19 01:22:59 +05:30
Shraiysh Vaishay	5ee500acbb	[mlir][OpenMP] Remove clauses that are not being handled This patch removes the following clauses from OpenMP Dialect: - private - firstprivate - lastprivate - shared - default - copyin - copyprivate The privatization clauses are being handled in the flang frontend. The data copying clauses are not being handled anywhere for now. Once we have a better picture of how to handle these clauses in OpenMP Dialect, we can add these. For the time being, removing unneeded clauses. For detailed discussion about this refer to "Privatisation in OpenMP dialect" (https://discourse.llvm.org/t/rfc-privatisation-in-openmp-dialect/3526) Reviewed By: kiranchandramohan, clementval Differential Revision: https://reviews.llvm.org/D120029	2022-02-19 01:13:05 +05:30
Benjamin Kramer	d558540fae	[mlir][Vector] Add return type inference for multi_reduction This subsumes the builder and verifier.	2022-02-18 13:00:42 +01:00
Benjamin Kramer	b47be47ac2	[mlir][Vector] Switch ExtractOp to the declarative assembly format This is a bit awkward since ExtractOp allows both `f32` and `vector<1xf32>` results for a scalar extraction. Allow both, but make inference return the scalar to make this as NFC as possible.	2022-02-18 11:45:59 +01:00
Matthias Springer	4086b3be44	[mlir][bufferize][NFC] Remove obsolete tensor bufferization patterns from Linalg/Bufferize.cpp Differential Revision: https://reviews.llvm.org/D119824	2022-02-18 19:39:44 +09:00
Matthias Springer	fa7c8cb4d0	[mlir][bufferize] Support memrefs with non-standard layout in `finalizing-bufferize` Differential Revision: https://reviews.llvm.org/D119935	2022-02-18 19:34:04 +09:00
Stephan Herhut	a43f7d6d76	[mlir][tensor] Extend reshape utils. This change changes the handling of trailing dimensions with unknown extent. Users of the getReassociationIndicesForReshape helper should see benefits when transforming reshape-like operations into expand/collapse pairs if the higher-rank type has trailing unknown dimensions. The motivating example is a reshape from tensor<16x1x?xi32> to tensor<16xi32> that can be modeled as collapsing the three dimensions. Differential Revision: https://reviews.llvm.org/D119730	2022-02-18 09:57:39 +01:00
Tres Popp	f5efe28070	[mlir] Propagate NaNs in PolynomialApproximation Previously, NaNs would be dropped in favor of bounded values which was strictly incorrect. Now the min/max operations propagate this information. Not all uses of min/max need this, but the given change will help protect future additions, and this prevents the need for an additional cmpf and select operation to handle NaNs. Differential Revision: https://reviews.llvm.org/D120020	2022-02-18 09:25:36 +01:00
Benjamin Kramer	f0dd818be3	[mlir][Vector] Switch ShuffleOp to the declarative assembly format This also requires implementing return type deduction.	2022-02-18 01:46:58 +01:00
Krzysztof Drewniak	84718d37db	[MLIR][GPU] Add gpu.set_default_device op This op is added to allow MLIR code running on multi-GPU systems to select the GPU they want to execute operations on when no GPU is otherwise specified. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D119883	2022-02-17 21:30:09 +00:00
Benjamin Kramer	3773d04a13	[mlir][memref] Switch ViewOp to the declarative assembly format	2022-02-17 21:34:15 +01:00
Lei Zhang	c9b36807be	[mlir][spirv] Add a pass to unify aliased resource variables In SPIR-V, resources are represented as global variables that are bound to certain descriptor. SPIR-V requires those global variables to be declared as aliased if multiple ones are bound to the same slot. Such aliased decorations can cause issues for transcompilers like SPIRV-Cross when converting to source shading languages like MSL. So this commit adds a pass to perform analysis of aliased resources and see if we can unify them into one. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D119872	2022-02-17 09:08:58 -05:00
Benjamin Kramer	d955ca4937	[BufferDeallocation] Don't assume successor operands are unique This would create a double free when a memref is passed twice to the same op. This wasn't a problem at the time the pass was written but is common since the introduction of scf.while. There's a latent non-determinism that's triggered by the test, but this change is messy enough as-is so I'll leave that for later. Differential Revision: https://reviews.llvm.org/D120044	2022-02-17 14:16:32 +01:00
Ivan Butygin	d271fc04d5	[mlir][gpu] Split ops sinking from gpu-kernel-outlining pass into separate pass Previously `gpu-kernel-outlining` pass was also doing index computation sinking into gpu.launch before actual outlining. Split ops sinking from `gpu-kernel-outlining` pass into separate pass, so users can use their own sinking pass before outlining. To achieve the old behavior, users will need to call both passes: `-gpu-launch-sink-index-computations -gpu-kernel-outlining`. Differential Revision: https://reviews.llvm.org/D119932	2022-02-17 10:34:20 +03:00
Aart Bik	34381a76c1	[mlir][sparse] avoid some codeup in sparsification transformation A very small refactoring, but a big impact on tests that expect an exact order. This revision fixes the tests, but also makes them less brittle for similar minor changes in the future! Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D119992	2022-02-16 17:39:04 -08:00
Eugene Zhulenev	b171583ae7	[mlir] Async: create async.group inside the scf.if branch Reviewed By: cota Differential Revision: https://reviews.llvm.org/D119959	2022-02-16 14:47:04 -08:00
Lei Zhang	e027c00821	[mlir][tensor] Add a pattern to split tensor.pad ops This commit adds a pattern to wrap a tensor.pad op with an scf.if op to separate the cases where we don't need padding (all pad sizes are actually zeros) and where we indeed need padding. This pattern is meant to handle padding inside tiled loops. Under such cases the padding sizes typically depend on the loop induction variables. Splitting them would allow treating perfect tiles and edge tiles separately. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D117018	2022-02-16 13:43:57 -05:00
Mahesh Ravishankar	2c58cde003	[mlir][Linalg] Add pattern for folding reshape by collapsing. Fusion of `linalg.generic` with `tensor.expand_shape/tensor.collapse_shape` currently handles fusion with reshape by expanding the dimensionality of the `linalg.generic` operation. This helps fuse elementwise operations better since they are fused at the highest dimensionality while keeping all indexing maps involved projected permutations. The intent of these is to push the reshape to the boundaries of functions. The presence of named ops (or other ops across which the reshape cannot be propagated) stops the propagation to the edges of the function. At this stage, the converse patterns that fold the reshapes with generic ops by collapsing the dimensions of the generic op can push the reshape towards edges. In particular it helps the case where reshapes exist in between named ops and generic ops. `linalg.named_op` -> `tensor.expand_shape` -> `linalg.generic` Pushing the reshape down will help fusion of `linalg.named_op` -> `linalg.generic` using tile + fuse transformations. This pattern is intended to replace the following patterns 1) FoldReshapeByLinearization : These patterns create indexing maps that are not projected permutations that affect future transformations. They are only useful for folding unit-dimensions. 2) PushReshapeByExpansion : This pattern has the same functionality but has some restrictions a) It tries to avoid creating new reshapes that limits its applicability. The pattern added here can achieve the same functionality through use of the `controlFn` that allows clients of the pattern freedom to make this decision. b) It does not work for ops with indexing semantics. These patterns will be deprecated in a future patch. Differential Revision: https://reviews.llvm.org/D119365	2022-02-16 03:15:20 +00:00
Thomas Raoux	0736bbd7e2	[mlir][scf] Add callback to annotate ops during pipelining This allows users to register a callback that can annotate operations during software pipelining. Users can then annotate ops to record which part of the pipeline they correspond to. Differential Revision: https://reviews.llvm.org/D119866	2022-02-15 12:48:01 -08:00
Javier Setoain	71705f531f	[mlir][Arith] Disallow casting between scalable and fixed-length vectors Casting between scalable vectors and fixed-length vectors doesn't make sense. If one of the operands is scalable, the other has to be scalable to be able to guarantee they have the same shape at runtime. Differential Revision: https://reviews.llvm.org/D119568	2022-02-15 17:34:42 +00:00
Adrian Kuegel	b122cbebec	[mlir][Math] Fix NaN handling in Exp approximation Differential Revision: https://reviews.llvm.org/D119832	2022-02-15 15:17:56 +01:00
Shraiysh Vaishay	166713f987	[mlir][OpenMP] Change omp.atomic.update to have generic updates This patch changes the syntax of omp.atomic.update to allow the other dialects to modify the variable with appropriate operations in the region. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D119522	2022-02-15 17:58:13 +05:30
Matthias Springer	73e880fbf1	[mlir][bufferize] Add vector-bufferize pass and remove obsolete patterns from Linalg Bufferize Differential Revision: https://reviews.llvm.org/D119444	2022-02-15 21:25:14 +09:00
Adrian Kuegel	87de451bc5	[mlir][Math] Fix NaN handling in ExpM1 approximation. Differential Revision: https://reviews.llvm.org/D119822	2022-02-15 12:10:12 +01:00
Matthias Springer	e6f691615e	[mlir][bufferize] Support tensor.expand_shape and tensor.collapse_shape Differential Revision: https://reviews.llvm.org/D112512	2022-02-15 19:53:49 +09:00
Akshay Baviskar	f1efac7f08	Add verifier for gpu.alloc op Add verifier for gpu.alloc op to verify if the dimension operand counts and symbol operand counts are same as their memref counterparts. Differential Revision: https://reviews.llvm.org/D117427	2022-02-15 15:57:58 +05:30
Ivan Butygin	32389d0c2e	[mlir][spirv] Add OpenCL fma op and lowering Also, it seems Khronos has changed html spec format so small adjustment to script was needed. Base op parsing is also probably broken. Differential Revision: https://reviews.llvm.org/D119678	2022-02-15 11:28:20 +03:00
Marius Brehler	88b9d1a49a	[mlir][emitc] Add a pointer type Adds a pointer type to EmitC. The emission of pointers is so far only possible by using the `emitc.opaque` type Co-authored-by: Simon Camphausen <simon.camphausen@iml.fraunhofer.de> Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D119337	2022-02-14 16:42:21 +00:00
gysit	d50571ab07	[mlir][OpDSL] Add default value to index attributes. Index attributes had no default value, which means the attribute values had to be set on the operation. This revision adds a default parameter to `IndexAttrDef`. After the change, every index attribute has to define a default value. For example, we may define the following strides attribute: ``` ``` When using the operation the default stride is used if the strides attribute is not set. The mechanism is implemented using `DefaultValuedAttr`. Additionally, the revision uses the naming index attribute instead of attribute more consistently, which is a preparation for follow up revisions that will introduce function attributes. Depends On D119125 Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D119126	2022-02-14 12:14:12 +00:00
Ivan Butygin	cd0d095c07	[mlir][tensor] Check ops generated by InsertSliceOpCastFolder are valid Fixes https://github.com/llvm/llvm-project/issues/53099 Differential Revision: https://reviews.llvm.org/D119663	2022-02-13 21:37:31 +03:00
gysit	a3655de2c8	[mlir][OpDSL] Add support for basic rank polymorphism. Previously, OpDSL did not support rank polymorphism, which required a separate implementation of linalg.fill. This revision extends OpDSL to support rank polymorphism for a limited class of operations that access only scalars and tensors of rank zero. At operation instantiation time, it scales these scalar computations to multi-dimensional pointwise computations by replacing the empty indexing maps with identity index maps. The revision does not change the DSL itself, instead it adapts the Python emitter and the YAML generator to generate different indexing maps and iterators depending on the rank of the first output. Additionally, the revision introduces a `linalg.fill_tensor` operation that in a future revision shall replace the current handwritten `linalg.fill` operation. `linalg.fill_tensor` is thus only temporarily available and will be renamed to `linalg.fill`. Reviewed By: nicolasvasilache, stellaraccident Differential Revision: https://reviews.llvm.org/D119003	2022-02-11 08:27:49 +00:00
Thomas Raoux	5ab04bc068	[mlir][gpu] Add device side async copy operations Add new operations to the gpu dialect to represent device side asynchronous copies. This also adds the lowering of those operations to nvvm dialect. Those ops are meant to be low level and map directly to llvm dialects like nvvm or rocdl. We can further add higher level of abstraction by building on top of those operations. This has been discussed here: https://discourse.llvm.org/t/modeling-gpu-async-copy-ampere-feature/4924 Differential Revision: https://reviews.llvm.org/D119191	2022-02-10 17:25:59 -08:00
Nirvedh	ad9b5a4b8e	[mlir][vector] Add pattern to drop lead unit dim for Contraction Op If the result operand has a unit leading dim it is removed from all operands. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D119206	2022-02-10 09:51:07 -08:00
Lei Zhang	06a0385142	[mlir][linalg] Fold tensor.pad(linalg.fill) with the same value Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D119160	2022-02-10 08:39:35 -05:00
Matthias Springer	fe0bf7d469	[mlir][vector][NFC] Use CombiningKindAttr instead of StringAttr This makes the op consistent with other ops in vector dialect. Differential Revision: https://reviews.llvm.org/D119343	2022-02-10 19:13:29 +09:00
Tres Popp	34ff99a0b7	Revert "[MLIR] Fix fold-memref-subview-ops for affine.load/store" This reverts commit `ac6cb41303`. This code has a stack-use-after-scope error that can be seen with asan.	2022-02-10 10:46:59 +01:00
Uday Bondhugula	ac6cb41303	[MLIR] Fix fold-memref-subview-ops for affine.load/store Fix fold-memref-subview-ops for affine.load/store. We need to expand out the affine apply on its operands. Differential Revision: https://reviews.llvm.org/D119402	2022-02-10 13:55:38 +05:30
Matthias Springer	22a1973dbe	[mlir][linalg][bufferize] Print results of FuncOp read/write analysis Print more information with test-analysis-only. Differential Revision: https://reviews.llvm.org/D119118	2022-02-09 20:52:38 +09:00
Jacques Pienaar	bbddd19ec7	[mlir][math] Expand coverage of atan2 expansion Reuse the higher precision F32 approximation for the F16 one (by expanding and truncating). This is partly RFC as I'm not sure what the expectations are here (e.g., these are only for F32 and should not be expanded, that reusing higher-precision ones for lower precision is undesirable due to increased compute cost and only approximations per exact type is preferred, or this is appropriate [at least as fallback] but we need to see how to make it more generic across all the patterns here). Differential Revision: https://reviews.llvm.org/D118968	2022-02-08 15:00:39 -08:00
harsh	4a876b13fb	Add case to handle 0-D vectors in FlattenContiguousRowMajorTransferWritePattern and FlattenContiguousRowMajorTransferReadPattern. For 0-D as well as 1-D vectors, both these patterns should return a failure as there is no need to collapse the shape of the source. Currently, only 1-D vectors were handled. This patch handles the 0-D case as well. Reviewed By: Benoit, ThomasRaoux Differential Revision: https://reviews.llvm.org/D119202	2022-02-08 20:00:12 +00:00
Mahesh Ravishankar	2abd7f13bc	[mlir][Linalg] NFC: Combine elementwise fusion test passes. There are a few different test passes that check elementwise fusion in Linalg. Consolidate them to a single pass controlled by different pass options (in keeping with how `TestLinalgTransforms` exists).	2022-02-08 18:08:37 +00:00
Tres Popp	64b918852c	Remove restriction on static dimensions in Shape method mlir::shape::ToExtentTensorOp::areCastCompatible didn't allow the input to have a static dimension, but that is allowed.	2022-02-08 11:20:01 +01:00
River Riddle	2418cd92c0	[mlir] Update uses of `parser`/`printer` ODS op field to `hasCustomAssemblyFormat` The parser/printer fields are deprecated and in the process of being removed.	2022-02-07 19:03:58 -08:00
Mahesh Ravishankar	7568f7101f	Revert "[mlir][Linalg] NFC: Combine elementwise fusion test passes." This reverts commit `d730336411`.	2022-02-07 22:51:29 +00:00
Mahesh Ravishankar	d730336411	[mlir][Linalg] NFC: Combine elementwise fusion test passes. There are a few different test passes that check elementwise fusion in Linalg. Consolidate them to a single pass controlled by different pass options (in keeping with how `TestLinalgTransforms` exists).	2022-02-07 22:46:57 +00:00
Sergei Grechanik	bb39ad43ce	[mlir][spirv] Fix verification of nested array constants Fix the verification function of spirv::ConstantOp to allow nesting array attributes. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D118939	2022-02-07 13:48:53 -08:00
Matthias Springer	9aa74347d5	[mlir][SCF] Further simplify affine maps during `for-loop-canonicalization` * Implement `FlatAffineConstraints::getConstantBound(EQ)`. * Inject a simpler constraint for loops that have at most 1 iteration. * Taking into account constant EQ bounds of FlatAffineConstraint dims/symbols during canonicalization of the resulting affine map in `canonicalizeMinMaxOp`. Differential Revision: https://reviews.llvm.org/D119153	2022-02-08 02:40:08 +09:00
Benjamin Kramer	6635c12ada	[mlir] Use SmallBitVector instead of SmallDenseSet for AffineMap::compressSymbols This is both more efficient and more ergonomic to use, as inverting a bit vector is trivial while inverting a set is annoying. Sadly this leaks into a bunch of APIs downstream, so adapt them as well. This would be NFC, but there is an ordering dependency in MemRefOps's computeMemRefRankReductionMask. This is now deterministic, previously it was dependent on SmallDenseSet's unspecified iteration order. Differential Revision: https://reviews.llvm.org/D119076	2022-02-07 00:21:44 +01:00
River Riddle	ace01605e0	[mlir] Split out a new ControlFlow dialect from Standard This dialect is intended to model lower level/branch based control-flow constructs. The initial set of operations are: AssertOp, BranchOp, CondBranchOp, SwitchOp; all split out from the current standard dialect. See https://discourse.llvm.org/t/standard-dialect-the-final-chapter/6061 Differential Revision: https://reviews.llvm.org/D118966	2022-02-06 14:51:16 -08:00
Eugene Zhulenev	edca177cbe	[mlir] Add canonicalizer to remove redundant shape.cstr_broadcastable ops Depends On D119025 Reviewed By: frgossen Differential Revision: https://reviews.llvm.org/D119043	2022-02-06 14:46:42 -08:00
Eugene Zhulenev	981f0a14f1	[mlir] Add canonicalizer to merge shape.assuming_all ops Depends On D119021 Reviewed By: frgossen Differential Revision: https://reviews.llvm.org/D119025	2022-02-04 15:27:37 -08:00
Lei Zhang	9dd4c2dcb6	[mlir][vector] Add constant folder for vector.shuffle ops Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D119032	2022-02-04 16:59:32 -05:00
gysit	b5ea288d13	[mlir][linalg] Let tile and fuse fail for tile sizes zero. Adapt `tileConsumerAndFuseProducers` to return failure if the generated tile loop nest is empty since all tile sizes are zero. Additionally, fix `LinalgTileAndFuseTensorOpsPattern` to return success if the pattern applied successfully. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D118878	2022-02-04 19:19:21 +00:00
Thomas Raoux	c3c1c5c695	[mlir][scf] Fix bug in pipelining prologue emission Induction variable calculation was ignoring scf.for step value. Fix it to get the correct induction variable value in the prologue. Differential Revision: https://reviews.llvm.org/D118932	2022-02-03 13:12:50 -08:00
Abhishek Varma	59b23c4aec	[MLIR][SCF] Remove loop invariant arguments of scf.while -- This commit adds a canonicalization pattern on scf.while to remove the loop invariant arguments. -- An argument is considered loop invariant if the iteration argument value is the same as the corresponding one being yielded (at the same position) in both the before/after block of scf.while. -- For the arguments removed, their use within scf.while and their corresponding scf.while's result are replaced with their corresponding initial value. Signed-off-by: Abhishek Varma <abhishek.varma@polymagelabs.com> Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D116923	2022-02-03 17:13:25 +01:00
River Riddle	8e123ca65f	[mlir:Standard] Remove support for creating a `unit` ConstantOp This is completely unused upstream, and does not really have well defined semantics on what this is supposed to do/how this fits into the ecosystem. Given that, as part of splitting up the standard dialect it's best to just remove this behavior, instead of try to awkwardly fit it somewhere upstream. Downstream users are encouraged to define their own operations that clearly can define the semantics of this. This also uncovered several lingering uses of ConstantOp that weren't updated to use arith::ConstantOp, and worked during conversions because the constant was removed/converted into something else before verification. See https://llvm.discourse.group/t/standard-dialect-the-final-chapter/ for more discussion. Differential Revision: https://reviews.llvm.org/D118654	2022-02-02 14:45:12 -08:00
River Riddle	dec8af701f	[mlir] Move SelectOp from Standard to Arithmetic This is part of splitting up the standard dialect. See https://llvm.discourse.group/t/standard-dialect-the-final-chapter/ for discussion. Differential Revision: https://reviews.llvm.org/D118648	2022-02-02 14:45:12 -08:00
River Riddle	6a8ba3186e	[mlir] Split std.splat into tensor.splat and vector.splat This is part of the larger effort to split the standard dialect. This will also allow for pruning some additional dependencies on Standard (done in a followup). Differential Revision: https://reviews.llvm.org/D118202	2022-02-02 14:45:12 -08:00
River Riddle	ef72cf4413	[mlir][NFC] Update OpenACC/OpenMP operations to use `hasVerifier` instead of `verifier` The verifier field is deprecated, and slated for removal. Differential Revision: https://reviews.llvm.org/D118825	2022-02-02 13:34:30 -08:00
Nicolas Vasilache	3c3810e72e	[mlir][vector] Avoid hoisting alloca'ed temporary buffers across AutomaticAllocationScope This revision avoids incorrect hoisting of alloca'd buffers across an AutomaticAllocationScope boundary. In the more general case, we will probably need a ParallelScope-like interface. Differential Revision: https://reviews.llvm.org/D118768	2022-02-02 06:00:42 -05:00
gysit	dc82547b17	[mlir][vector] Make write permutation lowering work with tensors. Use type inference when building the TransferWriteOp in the TransferWritePermutationLowering. Previously, the result type has been set to Type() which triggers an assertion if the pattern is used with tensors instead of memrefs. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D118758	2022-02-02 09:21:10 +00:00
Mahesh Ravishankar	a2361eb281	Avoid doing tile + fuse if tile sizes are zero. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D118576	2022-02-01 18:34:06 +00:00
Alexander Belyaev	ebc8153786	Revert "Revert "[mlir] Purge `linalg.copy` and use `memref.copy` instead."" This reverts commit `25bf6a2a9b`.	2022-02-01 18:21:21 +01:00
Christian Sigg	9b078f8fd2	[MLIR][arith] Mark addf/mulf as commutative Following the discussion in D118318, mark `arith.addf/mulf` commutative. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D118600	2022-02-01 08:33:48 +01:00
bakhtiyar	149311b405	[async] Get the number of worker threads from the runtime. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D117751	2022-01-31 12:06:01 -08:00
Christian Sigg	f278cf9cbc	[MLIR][arith] More float op folders Fold `arith.fadd %x, -0.0 -> %x` and similarly for `fsub`, `fmul`, `fdiv`. Fold `arith.fmin %x, %x -> %x`, `arith.fmin %x, +inf -> %x` and similarly for `fmax`. Reviewed By: pifon2a, mehdi_amini, bondhugula Differential Revision: https://reviews.llvm.org/D118244	2022-01-31 19:31:48 +01:00
Alexander Belyaev	25bf6a2a9b	Revert "[mlir] Purge `linalg.copy` and use `memref.copy` instead." This reverts commit `016956b680`. Reverting it to fix NVidia build without being in a hurry.	2022-01-31 18:51:39 +01:00
Alexander Belyaev	016956b680	[mlir] Purge `linalg.copy` and use `memref.copy` instead. Differential Revision: https://reviews.llvm.org/D118028	2022-01-31 18:25:56 +01:00
Uday Bondhugula	f8a2cd67b9	Support affine.load/store ops in fold-memref-subview-ops pass Support affine.load/store ops in fold-memref-subview ops pass. The existing pass just "inlines" the subview operation on load/stores by inserting affine.apply ops in front of the memref load/store ops: this is by design always consistent with the semantics on affine.load/store ops and the same would work even more naturally/intuitively with the latter. Differential Revision: https://reviews.llvm.org/D118565	2022-01-31 10:10:49 +05:30
Uday Bondhugula	92ccb8cc50	[MLIR][NFC] Update SCF pass cmd line names to prefix scf Update SCF pass cmd line names to prefix `scf`. This is consistent with guidelines/convention on how to name dialect passes. This also avoids ambiguity on the context given the multiple `for` operations in the tree. NFC. Differential Revision: https://reviews.llvm.org/D118564	2022-01-31 07:09:30 +05:30
Matthias Springer	6700a26d5f	[mlir][linalg][bufferize] Fix insertion point InitTensorElimination There was a bug where some of the OpOperands needed in the replacement op were not in scope. It does not matter where the replacement op is inserted. Any insertion point is OK as long as there are no dominance errors. In the worst case, the newly inserted op will bufferize out-of-place. This is no worse than not eliminating the InitTensorOp at all. Differential Revision: https://reviews.llvm.org/D117685	2022-01-30 22:25:39 +09:00
Matthias Springer	ab47418df6	[mlir][bufferize] Merge tensor-constant-bufferize into arith-bufferize The bufferization of arith.constant ops is also switched over to BufferizableOpInterface-based bufferization. The old implementation is deleted. Both implementations utilize GlobalCreator, now renamed to just `getGlobalFor`. GlobalCreator no longer maintains a set of all created allocations to avoid duplicate allocations of the same constant. Instead, `getGlobalFor` scans the module to see if there is already a global allocation with the same constant value. For compatibility reasons, it is still possible to create a pass that bufferizes only `arith.constant`. This pass (createConstantBufferizePass) could be deleted once all users were switched over to One-Shot bufferization. Differential Revision: https://reviews.llvm.org/D118483	2022-01-30 21:37:48 +09:00
harsh	80e0bf1af1	Add vector.scan op This patch adds the vector.scan op which computes the scan for a given n-d vector. It requires specifying the operator, the identity element and whether the scan is inclusive or exclusive. TEST: Added test in ops.mlir Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D117171	2022-01-28 20:07:57 +00:00
Frederik Gossen	2c7b0685e1	Fix tensor.extract for complex elements	2022-01-28 04:33:15 +01:00
Mogball	1e3a02162d	[mlir][scf] Update IfOp to have getInvocationBounds This allows `scf.if` to be used by Control-Flow sink. Depends on D115088 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D115089	2022-01-27 23:15:53 +00:00
Matthias Springer	075e3fdda1	[mlir][bufferize] Move arith BufferizableOpInterface impl to arith dialect Also switch the implementation of `-arith-bufferize` to BufferizableOpInterface. Differential Revision: https://reviews.llvm.org/D118325	2022-01-28 01:40:22 +09:00
Matthias Springer	b2f5004259	Revert "[mlir][bufferize] Insert memref.cast ops during finalizing pass" This reverts commit `1043107ce5`. This commit caused a breakage in `finalizing-bufferize.mlir`.	2022-01-27 20:48:58 +09:00
Matthias Springer	dbd1bbced9	[mlir][linalg][bufferize] Support arith.index_cast bufferization This is in preparation of switching `-tensor-constant-bufferize` and `-arith-bufferize` to BufferizableOpInterface-based implementations. Differential Revision: https://reviews.llvm.org/D118324	2022-01-27 19:50:31 +09:00
Matthias Springer	daf18108ec	[mlir][tensor] Replace tensor-bufferize with BufferizableOpInterface impl This commit switches the `tensor-bufferize` pass over to BufferizableOpInterface-based bufferization. Differential Revision: https://reviews.llvm.org/D118246	2022-01-27 19:30:45 +09:00
Matthias Springer	1043107ce5	[mlir][bufferize] Insert memref.cast ops during finalizing pass The pass can currently not handle to_memref(to_tensor(x)) folding where a cast is necessary. This is required with the new unified bufferization. There is already a canonicalization pattern that handles such foldings and it should be used during this pass. Differential Revision: https://reviews.llvm.org/D117988	2022-01-27 19:06:53 +09:00
Uday Bondhugula	fa5c5230d9	[MLIR] NFC. Rename pass cmd-line to prefix affine Prefix "affine-" to affine transform passes that were missing it -- to avoid ambiguity and for uniformity. Only two passes needed this. Move misplaced affine coalescing test case file. NFC. Differential Revision: https://reviews.llvm.org/D118314	2022-01-27 13:01:39 +05:30
River Riddle	7d0426dd95	[mlir] Move ComposeSubView+ExpandOps from Standard to MemRef These transformations already operate on memref operations (as part of splitting up the standard dialect). Now that the operations have moved, it's time for these transformations to move as well. Differential Revision: https://reviews.llvm.org/D118285	2022-01-26 23:11:02 -08:00
River Riddle	632a4f8829	[mlir] Move std.generic_atomic_rmw to the memref dialect This is part of splitting up the standard dialect. The move makes sense anyways, given that the memref dialect already holds memref.atomic_rmw which is the non-region sibling operation of std.generic_atomic_rmw (the relationship is even more clear given they have nearly the same description % how they represent the inner computation). Differential Revision: https://reviews.llvm.org/D118209	2022-01-26 11:52:01 -08:00
River Riddle	480cd4cb85	[mlir] Move the complex support of std.constant to a new complex.constant operation This is part of splitting up the standard dialect. Differential Revision: https://reviews.llvm.org/D118182	2022-01-26 11:52:00 -08:00
River Riddle	b88a4d72d9	[mlir:GPU] Replace reference to LLVMFuncOp with FuncOpInterface The GPU dialect currently contains an explicit reference to LLVMFuncOp during verification to handle the situation where the kernel has already been converted. This commit changes that reference to instead use FunctionOpInterface, which has two main benefits: * It allows for removing an otherwise unnecessary dependency on the LLVM dialect * It removes hardcoded assumptions about the lowering path and use of the GPU dialect Differential Revision: https://reviews.llvm.org/D118172	2022-01-26 11:52:00 -08:00
Matthias Springer	268524238e	[mlir][bufferization] Add an option to use memref types without layout maps This is for compatibility with existing bufferization passes. Also clean up memref type generation a bit. Differential Revision: https://reviews.llvm.org/D118243	2022-01-27 00:03:34 +09:00
Nicolas Vasilache	9b6c2ea302	[mlir][Linalg] Add GenericOp self-copy on buffers folding Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D118116	2022-01-26 05:56:31 -05:00
Alexander Batashev	e9b4239fef	[mlir][openmp] Custom syntax for `omp.target` operation Add a custom parser and printer for `omp.target` operation. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D117539	2022-01-26 10:26:19 +00:00
Rob Suderman	7c984be21a	[mlir] Propagate arith.index_cast past tensor.extract If we are extracting it is more useful to push the index_cast past the extraction. This increases the chance the tensor.extract can be evaluated at compile time. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D118204	2022-01-25 22:16:07 -08:00
Rob Suderman	d81a3c51e7	[mlir] Fold tensor.reshape operations into tensor.from_elements. There is not much of a benefit to reshape a from element vs reloading it. Updated to propagate shape manipulations into the output type of tensor.from_elements. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D118201	2022-01-25 15:54:57 -08:00
MaheshRavishankar	ea1ac183f4	[mlir][Linalg] Fix incorrect fusion with reshape ops by linearization. Fusion of reshape ops by linearization incorrectly inverted the indexing map before linearizing dimensions. This leads to incorrect indexing maps used in the fused operation. Differential Revision: https://reviews.llvm.org/D117908	2022-01-25 11:42:58 -08:00
MaheshRavishankar	e5a315f57a	[mlir][Linalg] Disallow ops with index semantics in `PushExpandingReshape`. This pattern is not written to handle operations with `linalg.index` operations in its body, i.e. operations that have index semantics. Differential Revision: https://reviews.llvm.org/D117856	2022-01-25 10:37:30 -08:00
Matthias Springer	d581c94d6b	[mlir][linalg][bufferize] Support tensor.from_elements This is mostly a copy of the existing tensor.from_elements bufferization. Once TensorInterfaceImpl.cpp is moved to the tensor dialect, the existing rewrite pattern can be deleted. Differential Revision: https://reviews.llvm.org/D117775	2022-01-25 22:19:59 +09:00
Matthias Springer	71bbb78b8f	[mlir][linalg][bufferize] Support tensor.generate This is mostly a copy of the existing tensor.generate bufferization. Once TensorInterfaceImpl.cpp is moved to the tensor dialect, the existing rewrite pattern can be deleted. Differential Revision: https://reviews.llvm.org/D117770	2022-01-25 22:19:22 +09:00
Shraiysh Vaishay	320dc8c4df	[mlir][OpenMP] Added omp.atomic.capture operation This patch supports the atomic construct (capture) following section 2.17.7 of OpenMP 5.0 standard. Also added tests for the same. Reviewed By: peixin, kiranchandramohan Differential Revision: https://reviews.llvm.org/D115851	2022-01-25 12:25:54 +05:30
gysit	e494278cee	[mlir][linalg] Add transpose support to hoist padding. Add a transpose option to hoist padding to transpose the padded tensor before storing it into the packed tensor. The early transpose improves the memory access patterns of the actual compute kernel. The patch introduces a transpose right after the hoisted pad tensor and a second transpose inside the compute loop. The second transpose can either be fused into the compute operation or will canonicalize away when lowering to vector instructions. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D117893	2022-01-24 16:33:05 +00:00
Matthias Springer	c30d2893a4	[mlir][bufferize] Change insertion point for ToTensorOps Both insertion points are valid. This is to make BufferizableOpInterface-based bufferization compatible with existing partial bufferization test cases. (So fewer changes are necessary to unit tests.) Differential Revision: https://reviews.llvm.org/D117986	2022-01-25 00:43:04 +09:00
Matthias Springer	fc08d1c294	[mlir][tensor][bufferize] Support tensor.rank in BufferizableOpInterfaceImpl This is the only op that is not supported via BufferizableOpInterfaceImpl bufferization. Once this op is supported we can switch `tensor-bufferize` over to the new unified bufferization. Differential Revision: https://reviews.llvm.org/D117985	2022-01-25 00:31:20 +09:00
Alexander Belyaev	4041354b4c	[mlir] Add SingleBlockImplicitTerminator<"tensor::YieldOp"> to PadOp.	2022-01-22 11:46:27 +01:00
not-jenni	08574ce4d6	[mlir][tosa] Add clamp + clamp as single clamp canonicalization When 2 clamp ops are in a row, they can be canonicalized into a single clamp that uses the most constrained range Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D117934	2022-01-21 16:24:43 -08:00
Aart Bik	efa15f4178	[mlir][sparse] add ability for sparse tensor output Rationale: Although file I/O is a bit alien to MLIR itself, we provide two convenient ways for sparse tensor I/O. The input part was already there (behind the swiss army knife sparse_tensor.new). Now we have a sparse_tensor.out to write out data. As before, the ops are kept vague and may change in the future. For now this allows us to compare TACO vs MLIR very easily. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D117850	2022-01-21 15:43:29 -08:00
Rob Suderman	2f9f9afa4e	[mlir] Add polynomial approximation for atan and atan2 Implement a Taylor series approximation for atan and add an atan2 lowering that uses atan's approximation. This includes tests for edge cases and tests for each quadrant. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D115682	2022-01-21 12:22:58 -08:00
Alexander Belyaev	fd0c6f5391	[mlir] Move linalg::PadTensorOp to tensor::PadOp. RFC: https://llvm.discourse.group/t/rfc-move-linalg-padtensorop-to-tensor-padop/5785 Differential Revision: https://reviews.llvm.org/D117892	2022-01-21 20:02:39 +01:00
MaheshRavishankar	a99e06aa86	[mlir][Linalg] Avoid generating illegal operations during elementwise fusion. In some cases, fusion can produce illegal operations if after fusion the range of some of the loops cannot be computed from shapes of its operands. Check for this case and abort the fusion if this happens. Differential Revision: https://reviews.llvm.org/D117602	2022-01-20 23:43:50 -08:00
Mehdi Amini	26167cae45	Print the `// ----` separator between modules when using -split-input-file with mlir-opt This allows piping sequences of `mlir-opt -split-input-file \| mlir-opt -split-input-file`. Depends On D117750 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D117756	2022-01-21 05:16:02 +00:00
Mogball	e99835ffed	[mlir][pdl] Make `pdl` the default dialect when parsing/printing PDLDialect being a somewhat user-facing dialect and whose ops contain exclusively other PDL ops in their regions can take advantage of `OpAsmOpInterface` to provide nicer IR. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D117828	2022-01-20 20:22:53 +00:00
Mogball	7c471b56f2	[mlir][pdl] OperationOp should not be side-effect free Unbound OperationOp in the matcher (i.e. one with no uses) is already disallowed by the verifier. However, an OperationOp in the rewriter is not side-effect free -- it's creating an op! Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D117825	2022-01-20 20:22:01 +00:00
Sergei Grechanik	5abf116322	[mlir][vector] Allow values outside of [0; dim-size] in create_mask This commit explicitly states that negative values and values exceeding vector dimensions are allowed in vector.create_mask (but not in vector.constant_mask). These values are now truncated when canonicalizing vector.create_mask to vector.constant_mask. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D116069	2022-01-20 09:34:42 -08:00

2301 Commits