llvm-project

Commit Graph

Author	SHA1	Message	Date
Matthias Springer	90ebc489de	Add vp2intersect to AVX512 dialect. Adds vp2intersect to the AVX512 dialect and defines a lowering to the LLVM dialect. Author: Matthias Springer <springerm@google.com> Differential Revision: https://reviews.llvm.org/D95301	2021-01-26 07:32:26 +00:00
Eugene Zhulenev	d37b5393e8	[mlir:Async] Use LLVM coro operations in async.coro lowering Instead of using llvm.call operations to call LLVM coro intrinsics use Coro operations from the LLVM dialect. (This was reviewed as a part of https://reviews.llvm.org/D94923 but was lost in arc land from local branch) Differential Revision: https://reviews.llvm.org/D95405	2021-01-25 16:42:11 -08:00
Eugene Zhulenev	9c53b8e52e	[mlir:Async] Add intermediate async.coro and async.runtime operations to simplify Async to LLVM lowering [NFC] No new functionality, mostly a cleanup and one more abstraction level between Async and LLVM IR. Instead of lowering from Async to LLVM coroutines and Async Runtime API in one shot, do it progressively via async.coro and async.runtime operations. 1. Lower from async to async.runtime/coro (e.g. async.execute to function with coro setup and runtime calls) 2. Lower from async.runtime/coro to LLVM intrinsics and runtime API calls Intermediate coro/runtime operations will allow to run transformations on a higher level IR and do not try to match IR based on the LLVM::CallOp properties. Although async.coro is very close to LLVM coroutines, it is not exactly the same API, instead it is optimized for usability in async lowering, and misses a lot of details that are present in @llvm.coro intrinsic. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D94923	2021-01-25 14:04:33 -08:00
Alex Zinenko	f5c7c031e2	[mlir] Add C API for IntegerSet Depends On D95357 Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D95368	2021-01-25 20:16:22 +01:00
Diego Caballero	c8fc5c0385	[mlir][Affine] Add support for multi-store producer fusion This patch adds support for producer-consumer fusion scenarios with multiple producer stores to the AffineLoopFusion pass. The patch introduces some changes to the producer-consumer algorithm, including: * For a given consumer loop, producer-consumer fusion iterates over its producer candidates until a fixed point is reached. * Producer candidates are gathered beforehand for each iteration of the consumer loop and visited in reverse program order (not strictly guaranteed) to maximize the number of loops fused per iteration. In general, these changes were needed to simplify the multi-store producer support and remove some of the workarounds that were introduced in the past to support more fusion cases under the single-store producer limitation. This patch also preserves the existing functionality of AffineLoopFusion with one minor change in behavior. Producer-consumer fusion didn't fuse scenarios with escaping memrefs and multiple outgoing edges (from a single store). Multi-store producer scenarios will usually (always?) have multiple outgoing edges so we couldn't fuse any with escaping memrefs, which would greatly limit the applicability of this new feature. Therefore, the patch enables fusion for these scenarios. Please, see modified tests for specific details. Reviewed By: andydavis1, bondhugula Differential Revision: https://reviews.llvm.org/D92876	2021-01-25 20:31:17 +02:00
Alex Zinenko	1e739552ee	[mlir] Use more C99 comments in C API header files These were left over from the original reformatting commit. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D95357	2021-01-25 19:23:06 +01:00
Nicolas Vasilache	05d5125d8a	[mlir] Generalize OpFoldResult usage in ops with offsets, sizes and operands. This revision starts evolving the APIs to manipulate ops with offsets, sizes and operands towards a ValueOrAttr abstraction that is already used in folding under the name OpFoldResult. The objective, in the future, is to allow such manipulations all the way to the level of ODS to avoid all the genuflexions involved in distinguishing between values and attributes for generic constant foldings. Once this evolution is accepted, the next step will be a mechanical OpFoldResult -> ValueOrAttr. Differential Revision: https://reviews.llvm.org/D95310	2021-01-25 14:17:03 +00:00
Nicolas Vasilache	dbf9bedf40	[mlir][Linalg] Add a hoistPaddingOnTensors transformation This transformation anchors on a padding op whose result is only used as an input to a Linalg op and pulls it out of a given number of loops. The result is a packing of padded tailes of ops that is amortized just before the outermost loop from which the pad operation is hoisted. Differential revision: https://reviews.llvm.org/D95243	2021-01-25 12:41:18 +00:00
Benjamin Kramer	6367306a1b	[mlir] Perfectly forward ImplicitLocOpBuilder ctors to OpBuilder This is both cleaner and less prone to creating a mess out of overload resolution.	2021-01-25 11:48:58 +01:00
Nicolas Vasilache	3747eb9c85	[mlir][Linalg] Add a padding option to Linalg tiling This revision allows the base Linalg tiling pattern to optionally require padding to a constant bounding shape. When requested, a simple analysis is performed, similar to buffer promotion. A temporary `linalg.simple_pad` op is added to model padding for the purpose of connecting the dots. This will be replaced by a more fleshed out `linalg.pad_tensor` op when it is available. In the meantime, this temporary op serves the purpose of exhibiting the necessary properties required from a more fleshed out pad op, to compose with transformations properly. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D95149	2021-01-25 09:17:30 +00:00
Stella Laurenzo	52586c46b0	[mlir][CAPI] Add result type inference to the CAPI. * Adds a flag to MlirOperationState to enable result type inference using the InferTypeOpInterface. * I chose this level of implementation for a couple of reasons: a) In the creation flow is naturally where generated and custom builder code will be invoking such a thing b) it is a bit more efficient to share the data structure and unpacking vs having a standalone entry-point c) we can always decide to expose more of these interfaces with first-class APIs, but that doesn't preclude that we will always want to use this one in this way (and less API surface area for common things is better for API stability and evolution). * I struggled to find an appropriate way to test it since we don't link the test dialect into anything CAPI accessible at present. I opted instead for one of the simplest ops I found in a regular dialect which implements the interface. * This does not do any trait-based type selection. That will be left to generated tablegen wrappers. Differential Revision: https://reviews.llvm.org/D95283	2021-01-23 14:30:51 -08:00
MaheshRavishankar	6e8ef3b76a	[mlir][Linalg] Make Fill operation work on tensors. Depends on D95109	2021-01-22 14:39:27 -08:00
MaheshRavishankar	430d43e010	[mlir][Linalg] Disable fusion of tensor_reshape op by expansion when unit-dims are involved Fusion of generic/indexed_generic operations with tensor_reshape by expansion when the latter just adds/removes unit-dimensions is disabled since it just adds unit-trip count loops. Differential Revision: https://reviews.llvm.org/D94626	2021-01-22 12:55:25 -08:00
MaheshRavishankar	01defcc8d7	[mlir][Linalg] Extend tile+fuse to work on Linalg operation on tensors. Differential Revision: https://reviews.llvm.org/D93086	2021-01-22 11:33:35 -08:00
MaheshRavishankar	bce318f58d	[mlir][Linalg] NFC: Refactor LinalgDependenceGraphElem to allow representing dependence from producer result to consumer. With Linalg on tensors the dependence between operations can be from the result of the producer to the consumer. This change just does a NFC refactoring of the LinalgDependenceGraphElem to allow representing both OpResult and OpOperand*. Differential Revision: https://reviews.llvm.org/D95208	2021-01-22 11:19:59 -08:00
Lei Zhang	e27197f360	[mlir][spirv] Define spv.IsNan/spv.IsInf and add lowerings spv.Ordered/spv.Unordered are meant for OpenCL Kernel capability. For Vulkan Shader capability, we should use spv.IsNan to check whether a number is NaN. Add a new pattern for converting `std.cmpf ord\|uno` to spv.IsNan and bumped the pattern converting to spv.Ordered/spv.Unordered to a higher benefit. The SPIR-V target environment will properly select between these two patterns. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D95237	2021-01-22 13:09:33 -05:00
Lei Zhang	167fb9b4b4	[mlir][spirv] Fix script for availability autogen and refresh ops Previously we only autogen the availability for ops that are direct instantiating `SPV_Op` and expected other subclasses of `SPV_Op` to define aggregated availability for all ops. This is quite error prone and we can miss capabilities for certain ops. Also it's arguable to have multiple levels of subclasses and try to deduplicate too much: having the availability directly in the op can be quite explicit and clear. A few extra lines of declarative code is fine. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D95236	2021-01-22 13:07:36 -05:00
Eugene Zhulenev	cc77a2c768	[mlir] Add coro intrinsics operations to LLVM dialect This PR only has coro intrinsics needed for the Async to LLVM lowering. Will add other intrinsics as needed in the followup PRs. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D95143	2021-01-22 10:01:45 -08:00
Hanhan Wang	1b535df1cc	[mlir][StandardOps] Fix typos in the td file. - Fix arguments name for subview and subtensor. - Fix a typo in a comment of subtensor's method. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D95211	2021-01-22 09:03:16 -08:00
Arjun P	14056dfb4d	[MLIR] Add support for extracting an integer sample point (if one exists) from an unbounded FlatAffineConstraints. With this, we have complete support for finding integer sample points in FlatAffineConstraints. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D95047	2021-01-22 22:28:38 +05:30
Hanhan Wang	16d4bbef30	[mlir][Linalg] Introduce linalg.pad_tensor op. `linalg.pad_tensor` is an operation that pads the `source` tensor with given `low` and `high` padding config. Example 1: ```mlir %pad_value = ... : f32 %1 = linalg.pad_tensor %0 low[1, 2] high[2, 3] { ^bb0(%arg0 : index, %arg1 : index): linalg.yield %pad_value : f32 } : tensor<?x?xf32> to tensor<?x?xf32> ``` Example 2: ```mlir %pad_value = ... : f32 %1 = linalg.pad_tensor %arg0 low[2, %arg1, 3, 3] high[3, 3, %arg1, 2] { ^bb0(%arg2: index, %arg3: index, %arg4: index, %arg5: index): linalg.yield %pad_value : f32 } : tensor<1x2x2x?xf32> to tensor<6x?x?x?xf32> ``` Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D93704	2021-01-21 22:09:28 -08:00
Jacques Pienaar	aee622fa20	[mlir] Enable passing crash reproducer stream factory method Add factory to create streams for logging the reproducer. Allows for more general logging (beyond file) and logging the configuration/module separately (logged in order, configuration before module). Also enable querying filename of ToolOutputFile. Differential Revision: https://reviews.llvm.org/D94868	2021-01-21 20:03:15 -08:00
mikeurbach	0a7a1ac73d	[mlir] Support FuncOpSignatureConversion for more FunctionLike ops. This extracts the implementation of getType, setType, and getBody from FunctionSupport.h into the mlir::impl namespace and defines them generically in FunctionSupport.cpp. This allows them to be used elsewhere for any FunctionLike ops that use FunctionType for their type signature. Using the new helpers, FuncOpSignatureConversion is generalized to work with all such FunctionLike ops. Convenience helpers are added to configure the pattern for a given concrete FunctionLike op type. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D95021	2021-01-21 18:35:09 -07:00
Christian Sigg	8827e07aaf	Remove deprecated methods from OpState. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D95123	2021-01-21 21:29:08 +01:00
MaheshRavishankar	615167c9f7	[mlir]][SPIRV] Define OrderedOp and UnorderedOp and add lowerings from Standard. Define OrderedOp and UnorderedOp instructions in SPIR-V and convert cmpf operations with `ord` and `uno` tag to these instructions respectively. Differential Revision: https://reviews.llvm.org/D95098	2021-01-21 07:56:44 -08:00
MaheshRavishankar	4234292ecf	[mlir][SPIRV] Rename OpSpecConstantOperation -> OpSpecConstantOp The SPIR-V spec uses OpSpecConstantOp. Using an inconsistent name makes the dialect generation scripts fail. Update to use the right operation name, and fix the auto generation scripts as well. Differential Revision: https://reviews.llvm.org/D95097	2021-01-21 07:56:43 -08:00
Alexander Belyaev	fc58bfd02f	[mlir] Remove complex ops from Standard dialect. `complex` dialect should be used instead. https://llvm.discourse.group/t/rfc-split-the-complex-dialect-from-std/2496/2 Differential Revision: https://reviews.llvm.org/D95077	2021-01-21 10:34:26 +01:00
mfehr	8a7ff7301a	[mlir] Make MLIRContext::getOrLoadDialect(StringRef, TypeID, ...) public Having this function in a public scope is helpful to register dialects that are defined at runtime, and thus that need a runtime-defined TypeID. Also, a similar function in DialectRegistry, insert(TypeID, StringRef, ...), has a public scope. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D95091	2021-01-21 00:29:58 +00:00
River Riddle	c78219f644	[mlir] Add a new builtin `unrealized_conversion_cast` operation An `unrealized_conversion_cast` operation represents an unrealized conversion from one set of types to another, that is used to enable the inter-mixing of different type systems. This operation should not be attributed any special representational or execution semantics, and is generally only intended to be used to satisfy the temporary intermixing of type systems during the conversion of one type system to another. This operation was discussed in the following RFC(and ODM): https://llvm.discourse.group/t/open-meeting-1-14-dialect-conversion-and-type-conversion-the-question-of-cast-operations/ Differential Revision: https://reviews.llvm.org/D94832	2021-01-20 16:28:18 -08:00
River Riddle	6ccf2d62b4	[mlir] Add an interface for Cast-Like operations A cast-like operation is one that converts from a set of input types to a set of output types. The arity of the inputs may be from 0-N, whereas the arity of the outputs may be anything from 1-N. Cast-like operations are removable in cases where they produce a "no-op", i.e when the input types and output types match 1-1. Differential Revision: https://reviews.llvm.org/D94831	2021-01-20 16:28:17 -08:00
Diego Caballero	735a07f047	Revert "[mlir][Affine] Add support for multi-store producer fusion" This reverts commit `7dd198852b`. ASAN issue.	2021-01-21 00:37:23 +02:00
Nicolas Vasilache	555a395f2c	[mlir] NFC - Fix unused variable in non-debug mode	2021-01-20 22:20:38 +00:00
Nicolas Vasilache	866cb26039	[mlir] Fix SubTensorInsertOp semantics Like SubView, SubTensor/SubTensorInsertOp are allowed to have rank-reducing/expanding semantics. In the case of SubTensorInsertOp , the rank of offsets/sizes/strides should be the rank of the destination tensor. Also, add a builder flavor for SubTensorOp to return a rank-reduced tensor. Differential Revision: https://reviews.llvm.org/D95076	2021-01-20 20:16:01 +00:00
Nicolas Vasilache	c075572646	[mlir][Linalg] NFC - Expose getSmallestBoundingIndex as an utility function	2021-01-20 19:53:09 +00:00
Nicolas Vasilache	f5d8eb085a	[mlir][Linalg] NFC - getAssumedNonShapedOperands now returns OperandRange Also adds a isInput interface method.	2021-01-20 19:23:26 +00:00
Frederik Gossen	cc4244d55f	[MLIR][Standard] Add log1p operation to std Differential Revision: https://reviews.llvm.org/D95041	2021-01-20 18:56:20 +01:00
Diego Caballero	7dd198852b	[mlir][Affine] Add support for multi-store producer fusion This patch adds support for producer-consumer fusion scenarios with multiple producer stores to the AffineLoopFusion pass. The patch introduces some changes to the producer-consumer algorithm, including: * For a given consumer loop, producer-consumer fusion iterates over its producer candidates until a fixed point is reached. * Producer candidates are gathered beforehand for each iteration of the consumer loop and visited in reverse program order (not strictly guaranteed) to maximize the number of loops fused per iteration. In general, these changes were needed to simplify the multi-store producer support and remove some of the workarounds that were introduced in the past to support more fusion cases under the single-store producer limitation. This patch also preserves the existing functionality of AffineLoopFusion with one minor change in behavior. Producer-consumer fusion didn't fuse scenarios with escaping memrefs and multiple outgoing edges (from a single store). Multi-store producer scenarios will usually (always?) have multiple outgoing edges so we couldn't fuse any with escaping memrefs, which would greatly limit the applicability of this new feature. Therefore, the patch enables fusion for these scenarios. Please, see modified tests for specific details. Reviewed By: andydavis1, bondhugula Differential Revision: https://reviews.llvm.org/D92876	2021-01-20 19:03:07 +02:00
Christian Sigg	cf50f4f764	[mlir] Link mlir_runner_utils statically into cuda/rocm-runtime-wrappers. The runtime-wrappers depend on LLVMSupport, pulling in static initialization code (e.g. command line arguments). Dynamically loading multiple such libraries results in ODR violoations. So far this has not been an issue, but in D94421, I would like to load both the async-runtime and the cuda-runtime-wrappers as part of a cuda-runner integration test. When doing this, code that asserts that an option category is only registered once fails (note that I've only experienced this in Google's bazel where the async-runtime depends on LLVMSupport, but a similar issue would happen in cmake if more than one runtime-wrapper starts to depend on LLVMSupport). The underlying issue is that we have a mix of static and dynamic linking. If all dependencies were loaded as shared objects (i.e. if LLVMSupport was linked dynamically to the runtime wrappers), each dependency would only get loaded once. However, linking dependencies dynamically would require special attention to paths (one could dynamically load the dependencies first given explicit paths). The simpler approach seems to be to link all dependencies statically into a single shared object. This change basically applies the same logic that we have in the c_runner_utils: we have a shared object target that can be loaded dynamically, and we have a static library target that can be linked to other runtime-wrapper shared object targets. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D94399	2021-01-20 12:10:16 +01:00
Aart Bik	b5c542d64b	[mlir][sparse] add narrower choices for pointers/indices Use cases with 16- or even 8-bit pointer/index structures have been identified. Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D95015	2021-01-19 20:20:38 -08:00
Jackson Fellows	1bf2b1665b	Implement constant folding for DivFOp Add a constant folder for DivFOp. Analogous to existing folders for AddFOp, SubFOp, and MulFOp. Matches the behavior of existing LLVM constant folding (`999f5da6b3/llvm/lib/IR/ConstantFold.cpp (L1432)`). Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D94939	2021-01-19 23:08:06 +00:00
Sean Silva	be7352c00d	[mlir][splitting std] move 2 more ops to `tensor` - DynamicTensorFromElementsOp - TensorFromElements Differential Revision: https://reviews.llvm.org/D94994	2021-01-19 13:49:25 -08:00
KareemErgawy-TomTom	27820496a7	[MLIR][SPIRV] Add `SignedOp` trait. This commit adds a new trait that can be attached to ops that have signed semantics. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D94896	2021-01-19 17:40:40 +01:00
Lei Zhang	3a56a96664	[mlir][spirv] Define spv.GLSL.Fma and add lowerings Also changes some rewriter.create + rewriter.replaceOp calls into rewriter.replaceOpWithNewOp calls. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D94965	2021-01-19 09:14:21 -05:00
Nicolas Vasilache	93a873dfc9	[mlir][Affine] Revisit and simplify composeAffineMapAndOperands. In prehistorical times, AffineApplyOp was allowed to produce multiple values. This allowed the creation of intricate SSA use-def chains. AffineApplyNormalizer was originally introduced as a means of reusing the AffineMap::compose method to write SSA use-def chains. Unfortunately, symbols that were produced by an AffineApplyOp needed to be promoted to dims and reordered for the mathematical composition to be valid. Since then, single result AffineApplyOp became the law of the land but the original assumptions were not revisited. This revision revisits these assumptions and retires AffineApplyNormalizer. Differential Revision: https://reviews.llvm.org/D94920	2021-01-19 13:52:07 +00:00
Alexander Belyaev	11f4c58c15	[mlir] Add `complex.abs`, `complex.div` and `complex.mul` to ComplexOps. Differential Revision: https://reviews.llvm.org/D94911	2021-01-19 12:09:59 +01:00
Arjun P	9f32f1d6fb	[MLIR] Support checking if two FlatAffineConstraints are equal This patch adds support for checking if two PresburgerSets are equal. In particular, one can check if two FlatAffineConstraints are equal by constructing PrebsurgerSets from them and comparing these. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D94915	2021-01-18 21:46:01 +05:30
Vladislav Vinogradov	aca240b4f6	[mlir] Fix cross-compilation (Linalg ODS gen) Use cross-compilation approach for `mlir-linalg-ods-gen` application similar to TblGen tools. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D94598	2021-01-18 11:57:55 +01:00
Aart Bik	d8fc27301d	[mlir][sparse] improved sparse runtime support library Added the ability to read (an extended version of) the FROSTT file format, so that we can now read in sparse tensors of arbitrary rank. Generalized the API to deal with more than two dimensions. Also added the ability to sort the indices of sparse tensors lexicographically. This is an important step towards supporting auto gen of initialization code, since sparse storage formats are easier to initialize if the indices are sorted. Since most external formats don't enforce such properties, it is convenient to have this ability in our runtime support library. Lastly, the re-entrant problem of the original implementation is fixed by passing an opaque object around (rather than having a single static variable, ugh!). Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D94852	2021-01-16 12:16:10 -08:00
Thomas Raoux	3afbfb4145	[mlir][NFC] Move helper substWithMin into Affine utils This allow using this helper outside of the linalg canonicalization. Differential Revision: https://reviews.llvm.org/D94826	2021-01-15 17:13:56 -08:00
Alexander Belyaev	d0cb0d30a4	[mlir] Add Complex dialect. Differential Revision: https://reviews.llvm.org/D94764	2021-01-15 19:58:10 +01:00

1 2 3 4 5 ...

3784 Commits