llvm-project

Commit Graph

Author	SHA1	Message	Date
Nicolas Vasilache	3c3810e72e	[mlir][vector] Avoid hoisting alloca'ed temporary buffers across AutomaticAllocationScope This revision avoids incorrect hoisting of alloca'd buffers across an AutomaticAllocationScope boundary. In the more general case, we will probably need a ParallelScope-like interface. Differential Revision: https://reviews.llvm.org/D118768	2022-02-02 06:00:42 -05:00
gysit	dc82547b17	[mlir][vector] Make write permutation lowering work with tensors. Use type inference when building the TransferWriteOp in the TransferWritePermutationLowering. Previously, the result type has been set to Type() which triggers an assertion if the pattern is used with tensors instead of memrefs. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D118758	2022-02-02 09:21:10 +00:00
Mahesh Ravishankar	a2361eb281	Avoid doing tile + fuse if tile sizes are zero. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D118576	2022-02-01 18:34:06 +00:00
Alexander Belyaev	ebc8153786	Revert "Revert "[mlir] Purge `linalg.copy` and use `memref.copy` instead."" This reverts commit `25bf6a2a9b`.	2022-02-01 18:21:21 +01:00
Christian Sigg	9b078f8fd2	[MLIR][arith] Mark addf/mulf as commutative Following the discussion in D118318, mark `arith.addf/mulf` commutative. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D118600	2022-02-01 08:33:48 +01:00
bakhtiyar	149311b405	[async] Get the number of worker threads from the runtime. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D117751	2022-01-31 12:06:01 -08:00
Christian Sigg	f278cf9cbc	[MLIR][arith] More float op folders Fold `arith.fadd %x, -0.0 -> %x` and similarly for `fsub`, `fmul`, `fdiv`. Fold `arith.fmin %x, %x -> %x`, `arith.fmin %x, +inf -> %x` and similarly for `fmax`. Reviewed By: pifon2a, mehdi_amini, bondhugula Differential Revision: https://reviews.llvm.org/D118244	2022-01-31 19:31:48 +01:00
Alexander Belyaev	25bf6a2a9b	Revert "[mlir] Purge `linalg.copy` and use `memref.copy` instead." This reverts commit `016956b680`. Reverting it to fix NVidia build without being in a hurry.	2022-01-31 18:51:39 +01:00
Alexander Belyaev	016956b680	[mlir] Purge `linalg.copy` and use `memref.copy` instead. Differential Revision: https://reviews.llvm.org/D118028	2022-01-31 18:25:56 +01:00
Uday Bondhugula	f8a2cd67b9	Support affine.load/store ops in fold-memref-subview-ops pass Support affine.load/store ops in fold-memref-subview ops pass. The existing pass just "inlines" the subview operation on load/stores by inserting affine.apply ops in front of the memref load/store ops: this is by design always consistent with the semantics on affine.load/store ops and the same would work even more naturally/intuitively with the latter. Differential Revision: https://reviews.llvm.org/D118565	2022-01-31 10:10:49 +05:30
Uday Bondhugula	92ccb8cc50	[MLIR][NFC] Update SCF pass cmd line names to prefix scf Update SCF pass cmd line names to prefix `scf`. This is consistent with guidelines/convention on how to name dialect passes. This also avoids ambiguity on the context given the multiple `for` operations in the tree. NFC. Differential Revision: https://reviews.llvm.org/D118564	2022-01-31 07:09:30 +05:30
Matthias Springer	6700a26d5f	[mlir][linalg][bufferize] Fix insertion point InitTensorElimination There was a bug where some of the OpOperands needed in the replacement op were not in scope. It does not matter where the replacement op is inserted. Any insertion point is OK as long as there are no dominance errors. In the worst case, the newly inserted op will bufferize out-of-place. This is no worse than not eliminating the InitTensorOp at all. Differential Revision: https://reviews.llvm.org/D117685	2022-01-30 22:25:39 +09:00
Matthias Springer	ab47418df6	[mlir][bufferize] Merge tensor-constant-bufferize into arith-bufferize The bufferization of arith.constant ops is also switched over to BufferizableOpInterface-based bufferization. The old implementation is deleted. Both implementations utilize GlobalCreator, now renamed to just `getGlobalFor`. GlobalCreator no longer maintains a set of all created allocations to avoid duplicate allocations of the same constant. Instead, `getGlobalFor` scans the module to see if there is already a global allocation with the same constant value. For compatibility reasons, it is still possible to create a pass that bufferizes only `arith.constant`. This pass (createConstantBufferizePass) could be deleted once all users were switched over to One-Shot bufferization. Differential Revision: https://reviews.llvm.org/D118483	2022-01-30 21:37:48 +09:00
harsh	80e0bf1af1	Add vector.scan op This patch adds the vector.scan op which computes the scan for a given n-d vector. It requires specifying the operator, the identity element and whether the scan is inclusive or exclusive. TEST: Added test in ops.mlir Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D117171	2022-01-28 20:07:57 +00:00
Frederik Gossen	2c7b0685e1	Fix tensor.extract for complex elements	2022-01-28 04:33:15 +01:00
Mogball	1e3a02162d	[mlir][scf] Update IfOp to have getInvocationBounds This allows `scf.if` to be used by Control-Flow sink. Depends on D115088 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D115089	2022-01-27 23:15:53 +00:00
Matthias Springer	075e3fdda1	[mlir][bufferize] Move arith BufferizableOpInterface impl to arith dialect Also switch the implementation of `-arith-bufferize` to BufferizableOpInterface. Differential Revision: https://reviews.llvm.org/D118325	2022-01-28 01:40:22 +09:00
Matthias Springer	b2f5004259	Revert "[mlir][bufferize] Insert memref.cast ops during finalizing pass" This reverts commit `1043107ce5`. This commit caused a breakage in `finalizing-bufferize.mlir`.	2022-01-27 20:48:58 +09:00
Matthias Springer	dbd1bbced9	[mlir][linalg][bufferize] Support arith.index_cast bufferization This is in preparation of switching `-tensor-constant-bufferize` and `-arith-bufferize` to BufferizableOpInterface-based implementations. Differential Revision: https://reviews.llvm.org/D118324	2022-01-27 19:50:31 +09:00
Matthias Springer	daf18108ec	[mlir][tensor] Replace tensor-bufferize with BufferizableOpInterface impl This commit switches the `tensor-bufferize` pass over to BufferizableOpInterface-based bufferization. Differential Revision: https://reviews.llvm.org/D118246	2022-01-27 19:30:45 +09:00
Matthias Springer	1043107ce5	[mlir][bufferize] Insert memref.cast ops during finalizing pass The pass can currently not handle to_memref(to_tensor(x)) folding where a cast is necessary. This is required with the new unified bufferization. There is already a canonicalization pattern that handles such foldings and it should be used during this pass. Differential Revision: https://reviews.llvm.org/D117988	2022-01-27 19:06:53 +09:00
Uday Bondhugula	fa5c5230d9	[MLIR] NFC. Rename pass cmd-line to prefix affine Prefix "affine-" to affine transform passes that were missing it -- to avoid ambiguity and for uniformity. There were only two needed this. Move mispaced affine coalescing test case file. NFC. Differential Revision: https://reviews.llvm.org/D118314	2022-01-27 13:01:39 +05:30
River Riddle	7d0426dd95	[mlir] Move ComposeSubView+ExpandOps from Standard to MemRef These transformations already operate on memref operations (as part of splitting up the standard dialect). Now that the operations have moved, it's time for these transformations to move as well. Differential Revision: https://reviews.llvm.org/D118285	2022-01-26 23:11:02 -08:00
River Riddle	632a4f8829	[mlir] Move std.generic_atomic_rmw to the memref dialect This is part of splitting up the standard dialect. The move makes sense anyways, given that the memref dialect already holds memref.atomic_rmw which is the non-region sibling operation of std.generic_atomic_rmw (the relationship is even more clear given they have nearly the same description % how they represent the inner computation). Differential Revision: https://reviews.llvm.org/D118209	2022-01-26 11:52:01 -08:00
River Riddle	480cd4cb85	[mlir] Move the complex support of std.constant to a new complex.constant operation This is part of splitting up the standard dialect. Differential Revision: https://reviews.llvm.org/D118182	2022-01-26 11:52:00 -08:00
River Riddle	b88a4d72d9	[mlir:GPU] Replace reference to LLVMFuncOp with FuncOpInterface The GPU dialect currently contains an explicit reference to LLVMFuncOp during verification to handle the situation where the kernel has already been converted. This commit changes that reference to instead use FunctionOpInterface, which has two main benefits: * It allows for removing an otherwise unnecessary dependency on the LLVM dialect * It removes hardcoded assumptions about the lowering path and use of the GPU dialect Differential Revision: https://reviews.llvm.org/D118172	2022-01-26 11:52:00 -08:00
Matthias Springer	268524238e	[mlir][bufferization] Add an option to use memref types without layout maps This is for compatibility with existing bufferization passes. Also clean up memref type generation a bit. Differential Revision: https://reviews.llvm.org/D118243	2022-01-27 00:03:34 +09:00
Nicolas Vasilache	9b6c2ea302	[mlir][Linalg] Add GenericOp self-copy on buffers folding Reviewed By: pifon2a Differential Revision: https://reviews.llvm.org/D118116	2022-01-26 05:56:31 -05:00
Alexander Batashev	e9b4239fef	[mlir][openmp] Custom syntax for `omp.target` operation Add a custom parser and printer for `omp.target` operation. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D117539	2022-01-26 10:26:19 +00:00
Rob Suderman	7c984be21a	[mlir] Propagate arith.index_cast past tensor.extract If we are extracting it is more useful to push the index_cast past the extraction. This increases the chance the tensor.extract can evaluated at compile time. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D118204	2022-01-25 22:16:07 -08:00
Rob Suderman	d81a3c51e7	[mlir] Fold tensor.reshape operations into tensor.from_elements. There is not much of a benefit to reshape a from element vs reloading it. Updated to progagate shape manipulations into the output type of tensor.from_elements. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D118201	2022-01-25 15:54:57 -08:00
MaheshRavishankar	ea1ac183f4	[mlir][Linalg] Fix incorrect fusion with reshape ops by linearization. Fusion of reshape ops by linearization incorrectly inverted the indexing map before linearizing dimensions. This leads to incorrect indexing maps used in the fused operation. Differential Revision: https://reviews.llvm.org/D117908	2022-01-25 11:42:58 -08:00
MaheshRavishankar	e5a315f57a	[mlir][Linalg] Disallow ops with index semantics in `PushExpandingReshape`. This pattern is not written to handle operations with `linalg.index` operations in its body, i.e. operations that have index semantics. Differential Revision: https://reviews.llvm.org/D117856	2022-01-25 10:37:30 -08:00
Matthias Springer	d581c94d6b	[mlir][linalg][bufferize] Support tensor.from_elements This is mostly a copy of the existing tensor.from_elements bufferization. Once TensorInterfaceImpl.cpp is moved to the tensor dialect, the existing rewrite pattern can be deleted. Differential Revision: https://reviews.llvm.org/D117775	2022-01-25 22:19:59 +09:00
Matthias Springer	71bbb78b8f	[mlir][linalg][bufferize] Support tensor.generate This is mostly a copy of the existing tensor.generate bufferization. Once TensorInterfaceImpl.cpp is moved to the tensor dialect, the existing rewrite pattern can be deleted. Differential Revision: https://reviews.llvm.org/D117770	2022-01-25 22:19:22 +09:00
Shraiysh Vaishay	320dc8c4df	[mlir][OpenMP] Added omp.atomic.capture operation This patch supports the atomic construct (capture) following section 2.17.7 of OpenMP 5.0 standard. Also added tests for the same. Reviewed By: peixin, kiranchandramohan Differential Revision: https://reviews.llvm.org/D115851	2022-01-25 12:25:54 +05:30
gysit	e494278cee	[mlir][linalg] Add transpose support to hoist padding. Add a transpose option to hoist padding to transpose the padded tensor before storing it into the packed tensor. The early transpose improves the memory access patterns of the actual compute kernel. The patch introduces a transpose right after the hoisted pad tensor and a second transpose inside the compute loop. The second transpose can either be fused into the compute operation or will canonicalize away when lowering to vector instructions. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D117893	2022-01-24 16:33:05 +00:00
Matthias Springer	c30d2893a4	[mlir][bufferize] Change insertion point for ToTensorOps Both insertion points are valid. This is to make BufferizableOpInteface-based bufferization compatible with existing partial bufferization test cases. (So less changes are necessary to unit tests.) Differential Revision: https://reviews.llvm.org/D117986	2022-01-25 00:43:04 +09:00
Matthias Springer	fc08d1c294	[mlir][tensor][bufferize] Support tensor.rank in BufferizableOpInterfaceImpl This is the only op that is not supported via BufferizableOpInterfaceImpl bufferization. Once this op is supported we can switch `tensor-bufferize` over to the new unified bufferization. Differential Revision: https://reviews.llvm.org/D117985	2022-01-25 00:31:20 +09:00
Alexander Belyaev	4041354b4c	[mlir] Add SingleBlockImplicitTerminator<"tensor::YieldOp"> to PadOp.	2022-01-22 11:46:27 +01:00
not-jenni	08574ce4d6	[mlir][tosa] Add clamp + clamp as single clamp canonicalization When 2 clamp ops are in a row, they can be canonicalized into a single clamp that uses the most constrained range Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D117934	2022-01-21 16:24:43 -08:00
Aart Bik	efa15f4178	[mlir][sparse] add ability for sparse tensor output Rationale: Although file I/O is a bit alien to MLIR itself, we provide two convenient ways for sparse tensor I/O. The input part was already there (behind the swiss army knife sparse_tensor.new). Now we have a sparse_tensor.out to write out data. As before, the ops are kept vague and may change in the future. For now this allows us to compare TACO vs MLIR very easily. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D117850	2022-01-21 15:43:29 -08:00
Rob Suderman	2f9f9afa4e	[mlir] Add polynomial approximation for atan and atan2 Implement a taylor series approximation for atan and add an atan2 lowering that uses atan's appromation. This includes tests for edge cases and tests for each quadrant. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D115682	2022-01-21 12:22:58 -08:00
Alexander Belyaev	fd0c6f5391	[mlir] Move linalg::PadTensorOp to tensor::PadOp. RFC: https://llvm.discourse.group/t/rfc-move-linalg-padtensorop-to-tensor-padop/5785 Differential Revision: https://reviews.llvm.org/D117892	2022-01-21 20:02:39 +01:00
MaheshRavishankar	a99e06aa86	[mlir][Linalg] Avoid generating illegal operations during elementwise fusion. In some cases, fusion can produce illegal operations if after fusion the range of some of the loops cannot be computed from shapes of its operands. Check for this case and abort the fusion if this happens. Differential Revision: https://reviews.llvm.org/D117602	2022-01-20 23:43:50 -08:00
Mehdi Amini	26167cae45	Print the `// ----` separator between modules when using -split-input-file with mlir-opt This allows to pipe sequences of `mlir-opt -split-input-file \| mlir-opt -split-input-file`. Depends On D117750 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D117756	2022-01-21 05:16:02 +00:00
Mogball	e99835ffed	[mlir][pdl] Make `pdl` the default dialect when parsing/printing PDLDialect being a somewhat user-facing dialect and whose ops contain exclusively other PDL ops in their regions can take advantage of `OpAsmOpInterface` to provide nicer IR. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D117828	2022-01-20 20:22:53 +00:00
Mogball	7c471b56f2	[mlir][pdl] OperationOp should not be side-effect free Unbound OperationOp in the matcher (i.e. one with no uses) is already disallowed by the verifier. However, an OperationOp in the rewriter is not side-effect free -- it's creating an op! Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D117825	2022-01-20 20:22:01 +00:00
Sergei Grechanik	5abf116322	[mlir][vector] Allow values outside of [0; dim-size] in create_mask This commits explicitly states that negative values and values exceeding vector dimensions are allowed in vector.create_mask (but not in vector.constant_mask). These values are now truncated when canonicalizing vector.create_mask to vector.constant_mask. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D116069	2022-01-20 09:34:42 -08:00
Stephan Herhut	6d45284618	[mlir][memref] Add better support for identity layouts in memref.collapse_shape canonicalizer When computing the new type of a collapse_shape operation, we need to at least take into account whether the type has an identity layout, in which case we can easily support dynamic strides. Otherwise, the canonicalizer creates invalid IR. Longer term, both the verifier and the canoncializer need to be extended to support the general case. Differential Revision: https://reviews.llvm.org/D117772	2022-01-20 15:31:43 +01:00

1 2 3 4 5 ...

2100 Commits