llvm-project

Commit Graph

Author	SHA1	Message	Date
Tres Popp	2790cbedd0	Revert "[mlir][Linalg] Replace SimplePad with PadTensor in hoist-padding" This reverts commit `d9b953d84b`. This commit resulted in build bot failures and the author is away from a computer, so I am reverting on their behalf until they have a chance to look into this.	2021-02-01 09:43:55 +01:00
Christian Sigg	a4b7d52f3a	[mlir] Fix missing null termination in cuLinkAddData argument. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D95679	2021-02-01 09:32:50 +01:00
Hanhan Wang	d9b953d84b	[mlir][Linalg] Replace SimplePad with PadTensor in hoist-padding This is the last revision to migrate using SimplePadOp to PadTensorOp, and the SimplePadOp is removed in the patch. Update a bit in SliceAnalysis because the PadTensorOp takes a region different from SimplePadOp. This is not covered by LinalgOp because it is not a structured op. Also, remove a duplicated comment from cpp file, which is already described in a header file. And update the pseudo-mlir in the comment. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D95671	2021-02-01 00:02:37 -08:00
Jacques Pienaar	2eb5f34542	Fix omitted kw in type alias printer * Fixing missing `type` keyword in alias print * Add test for large tuple type alias & rerun output to verify printed form can be parsed (which caught the above).	2021-01-31 14:06:58 -08:00
Matthias Springer	5ec59f021c	[mlir][AVX512] Fix result type of vp2intersect The result values of vp2intersect are vectors of bits, i.e., vector<8xi1> or vector<16xi8> (instead of i8 or i16). Differential Revision: https://reviews.llvm.org/D95678	2021-01-31 12:03:46 +09:00
Jacques Pienaar	4d9336923e	Use type alias for large tuples Tuples can occupy quite a lot of space, instead of printing out tuple type everywhere, just use the type alias if larger (arbitrarily chose a bound for now). Differential Revision: https://reviews.llvm.org/D95707	2021-01-29 17:42:23 -08:00
karimnosseir	0af2527536	Update ElementsAttr::isValidIndex to handle ElementsAttr with a scalar. Scalar will have rank 0. Update ElementsAttr::isValidIndex to handle ElementsAttr with a scalar. Scalar will have rank 0. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D95663	2021-01-29 16:56:00 -08:00
Alexander Belyaev	8d7cbcf582	[mlir] Preserve lexicographic order after loop collapsing. Currently, for a scf.parallel (i,j,k) after the loop collapsing to 1D is done, the IVs would be traversed as for an scf.parallel(k,j,i). Differential Revision: https://reviews.llvm.org/D95693	2021-01-29 21:32:36 +01:00
Christopher Tetreault	d3e8b9fdc0	Revert "[CMake] Actually require python 3.6 or greater" There are builders that do not have python 3.6. Revert until this situation can be rectified This reverts commit `0703b0753c`.	2021-01-29 12:03:32 -08:00
Christopher Tetreault	0703b0753c	[CMake] Actually require python 3.6 or greater Previously, CMake would find any version of Python3. However, the project claims to require 3.6 or greater, and 3.6 features are being used. Reviewed By: yln Differential Revision: https://reviews.llvm.org/D95635	2021-01-29 11:47:21 -08:00
Jordan Rupprecht	010b176cde	[mlir][docs] Fix typo: even -> event	2021-01-29 09:16:35 -08:00
Christian Sigg	27924b1263	[mlir] Remove mlir_c_runner_utils_static. The library is not actually static when BUILD_SHARED_LIBS is on, and tests need to explicitly load it already. Also, the shared objects it was linked to did not use any symbols from it and it was therefore never linked to it. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D95612	2021-01-29 15:04:48 +01:00
Tres Popp	0c5e4a25ee	[mlir] Prevent segfault in Tensor canonicalization This segfault could occur from out of bounds accesses when simplifying tensor.extract with a constant index and a tensor created by tensor.from_elements. This IR is not necesarilly invalid as it might conditionally be never executed. Differential Revision: https://reviews.llvm.org/D95535	2021-01-29 10:57:58 +01:00
Mehdi Amini	e9dc94291e	Introduce a new DialectIdentifier structure, extending Identifier with a Dialect information This class is looking up a dialect prefix on the identifier on initialization and keeping a pointer to the Dialect when found. The NamedAttribute key is now a DialectIdentifier. Reviewed By: rriddle, jpienaar Differential Revision: https://reviews.llvm.org/D95418	2021-01-29 00:05:36 +00:00
Richard Smith	dfe26d5f44	[mlir][Linalg] Fix SFINAE check to actually check the value. No internal functionality change intended, but this fixes out-of-tree uses.	2021-01-28 15:15:46 -08:00
MaheshRavishankar	98835e3d98	[mlir][Linalg] Enable TileAndFusePattern to work with tensors. Differential Revision: https://reviews.llvm.org/D94531	2021-01-28 14:13:01 -08:00
Hanhan Wang	2c7cc5fd20	Revert "[mlir][Linalg] Replace SimplePad with PadTensor in hoist-padding" This reverts commit `1e790b745d`. Differential Revision: https://reviews.llvm.org/D95636	2021-01-28 11:25:02 -08:00
Hanhan Wang	1e790b745d	[mlir][Linalg] Replace SimplePad with PadTensor in hoist-padding This is the last revision to migrate using SimplePadOp to PadTensorOp, and the SimplePadOp is removed in the patch. Update a bit in SliceAnalysis because the PadTensorOp takes a region different from SimplePadOp. This is not covered by LinalgOp because it is not a structured op. Also, remove a duplicated comment from cpp file, which is already described in a header file. And update the pseudo-mlir in the comment. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D95615	2021-01-28 11:09:57 -08:00
Jacques Pienaar	acaf85f700	Add convenience function for checking arrays of shapes compatible. Expand existing one to handle the common case for verifying compatible is existing and inferred. This considers arrays equivalent if they they have the same size and pairwise compatible elements.	2021-01-28 10:47:08 -08:00
Aart Bik	8af0ccf5a4	[sparse][mlir] give all sparse kernels an explicit "output" tensor Rationale: Providing an output tensor, even if one is not used as input to the kernel provides the right pattern for using lingalg sparse kernels (in contrast with reusing a tensor just to provide the shape). This prepares proper bufferization that will follow. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D95587	2021-01-28 10:41:17 -08:00
Alex Zinenko	d6be277347	[mlir] turn complex-to-llvm into a partial conversion It is no longer necessary to also convert other "standard" ops along with the complex dialect: the element types are now built-in integers or floating point types, and the top-level cast between complex and struct is automatically inserted and removed in progressive lowering. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D95625	2021-01-28 19:14:01 +01:00
Christian Sigg	51457cd506	[mlir] NFC: split --shared-libs option into multiple lines.	2021-01-28 18:54:05 +01:00
Nicolas Vasilache	0f2901201e	[mlir] Fix test by adapting to C util functions moving to libmlir_c_runner_utils	2021-01-28 17:35:51 +00:00
Aart Bik	6640b9aa8a	[mlir][sparse] use typenames for opaque pointers Makes intent more readable Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D95592	2021-01-28 09:23:11 -08:00
Nicolas Vasilache	9cbef8c905	[mlir] Fix integration tests	2021-01-28 16:54:50 +00:00
Christian Sigg	5bdc771fc9	[mlir] Make cuda/rocm-runtime-wrappers not depend on LLVMSupport. Depending on the headers only is fine, but we do not want to use any symbols from LLVMSupport. If we do, static registration of cl options is linked in as well, and loading multiple such libraries in the cuda/rocm-runner fails because the same cl options are registered multiple times. The cuda/rocm-runners also depend on LLVMSupport, so one could think that already loading a single such library would fail. It does not because the map of cl options is not shared between the runner and the loaded libraries (but it is shared across all loaded libraries, presumably because it has external linkage, in contrast to the static registration which has internal linkage). This change is a preparation step for dynamically loading the mlir_async_runtime.so and cuda-runtime-wrappers.so in the same test. The async runtime depends on LLVMSupport in a more fundamental way (llvm::ThreadPool), and as explained above there can only be one. This change also switches to add_mlir_library to make it consistent with the other runner_utils libraries. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D95613	2021-01-28 17:25:01 +01:00
Hanhan Wang	469096d18e	[mlir][Linalg] Fix tests in tile-and-pad The check match in D95555 was wrong, this patch fixes it. Differential Revision: https://reviews.llvm.org/D95618	2021-01-28 07:59:33 -08:00
Nicolas Vasilache	303ef609a3	[mlir] Fix gcc-8 build	2021-01-28 15:58:49 +00:00
Hanhan Wang	c818fa6729	[mlir][Linalg] Replace SimplePad with PadTensor in tile-and-pad This revision creates a build method of PadTensorOp which can be mapped to SimplePad op. The verifier is updated to accept a static custom result type, which has the same semantic as SimplePadOp. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D95555	2021-01-28 06:50:26 -08:00
Nicolas Vasilache	7e6fe5c48a	[mlir] Fix subview verifier. The subview verifier in the rank-reduced case is plainly skipping verification when the resulting type is a memref with empty affine map. This is generally incorrect. Instead, form the actual expected rank-reduced MemRefType that takes into account the projections of 1's dimensions. Then, check the canonicalized expected rank-reduced type against the canonicalized candidate type. Differential Revision: https://reviews.llvm.org/D95316	2021-01-28 13:55:39 +00:00
Nicolas Vasilache	8900acc796	[mlir][Linalg] Reenable test that was mistakenly disabled	2021-01-28 13:25:59 +00:00
Nicolas Vasilache	299cc5da6d	[mlir][Linalg] Further improve codegen strategy and add a linalg.matmul_i8_i8_i32 This revision adds a layer of SFINAE to the composable codegen strategy so it does not have to require statically defined ops but instead can also be used with OpInterfaces, Operation* and an op name string. A linalg.matmul_i8_i8_i32 is added to the .tc spec to demonstrate how all this works end to end. Differential Revision: https://reviews.llvm.org/D95600	2021-01-28 13:02:42 +00:00
Nicolas Vasilache	d0c9fb1b8e	[mlir][Linalg] Improve codegen strategy This revision improves the usage of the codegen strategy by adding a few flags that make it easier to control for the CLI. Usage of ModuleOp is replaced by FuncOp as this created issues in multi-threaded mode. A simple benchmarking capability is added for linalg.matmul as well as linalg.matmul_column_major. This latter op is also added to linalg. Now obsolete linalg integration tests that also take too long are deleted. Correctness checks are still missing at this point. Differential revision: https://reviews.llvm.org/D95531	2021-01-28 10:59:16 +00:00
KareemErgawy-TomTom	279e7ea63b	[MLIR][LinAlg][Docs] Add missing example code and other small fixes. Fixes a few small issues in the docs. It seems one of the examples was missing the expected MLIR output due to a copy-paste typo. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D95599	2021-01-28 11:49:36 +01:00
River Riddle	02bc4c95f0	[mlir][PassManager] Only reinitialize the pass manager if the context registry changes This prevents needless reinitialization for clients that want to reuse a pass manager multiple times. A new `getRegisryHash` function is exposed by the context to give a rough indicator of when the context registry has changed. Differential Revision: https://reviews.llvm.org/D95493	2021-01-27 17:41:51 -08:00
Tres Popp	bc8d8e69a6	[mlir] Fold shape.eq %a, %a to true Differential Revision: https://reviews.llvm.org/D95430	2021-01-27 16:22:15 +01:00
Eugene Zhulenev	f63f28ed54	[mlir:async] Fix deadlock in async runtime await-and-execute functions `emplace???` functions running concurrently can set the ready flag and then pending awaiter will never be executed Differential Revision: https://reviews.llvm.org/D95517	2021-01-27 05:08:53 -08:00
Nicolas Vasilache	5133673df4	[mlir] Extend semantic of OffsetSizeAndStrideOpInterface. OffsetSizeAndStrideOpInterface now have the ability to specify only a leading subset of offset, sizes, strides operands/attributes. The size of that leading subset must be limited by the corresponding entry in `getArrayAttrMaxRanks` to avoid overflows. Missing trailing dimensions are assumed to span the whole range (i.e. [0 .. dim)). This brings more natural semantics to slice-like op on top of subview and is a simplifies to removing all uses of SliceOp in dependent projects. Differential revision: https://reviews.llvm.org/D95441	2021-01-27 09:02:35 +00:00
MaheshRavishankar	7c15e0f64c	[mlir][Linalg] Add canonicalization for init_tensor -> subtensor op. Differential Revision: https://reviews.llvm.org/D95305	2021-01-26 23:22:28 -08:00
Eric Schweitz	1d6df1fcf0	[mlir] sret and byval now require a type argument when constructed. Fixes the LLVM code gen bugs and adds the missing tests. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D95378	2021-01-26 10:47:19 -08:00
Christian Sigg	8262cd8a0e	[mlir] Set CUDA/ROCm context before creating resources. The current context is thread-local state, and in preparation of GPU async execution (on multiple threads) we need to set the context before calling API that create resources. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D94495	2021-01-26 19:07:06 +01:00
Alex Zinenko	b208e5bcd0	[mlir] Add Python bindings for IntegerSet This follows up on the introduction of C API for the same object and is similar to AffineExpr and AffineMap. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D95437	2021-01-26 17:32:51 +01:00
Alexander Belyaev	80966447a2	[mlir][nfc] Move `getInnermostParallelLoops` to SCF/Transforms/Utils.h.	2021-01-26 17:00:15 +01:00
Alex Zinenko	91bd1156f3	[mlir] drop unused statics	2021-01-26 13:30:45 +01:00
Eugene Zhulenev	25f80e16d1	[mlir] Async: add a separate pass to lower from async to async.coro and async.runtime Depends On D95000 Move async.execute outlining and async -> async.runtime lowering into the separate Async transformation pass Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D95311	2021-01-26 03:33:20 -08:00
Eugene Zhulenev	2f7baffdc1	[mlir:async] Use ODS to define async types Depends On D94923 Migrate Async dialect to ODS `TypeDef` Reviewed By: ftynse, rriddle Differential Revision: https://reviews.llvm.org/D95000	2021-01-26 02:37:50 -08:00
Matthias Springer	90ebc489de	Add vp2intersect to AVX512 dialect. Adds vp2intersect to the AVX512 dialect and defines a lowering to the LLVM dialect. Author: Matthias Springer <springerm@google.com> Differential Revision: https://reviews.llvm.org/D95301	2021-01-26 07:32:26 +00:00
zhanghb97	a2914e0c15	[mlir][Python] Fix comments of 'getCapsule' and 'createFromCapsule' The `getCapsule` and `createFromCapsule` comments incorrectly state the `PyMlirContext` and `MlirContext` in `PyLocation`, `PyAttribute`, and `PyType` classes. Differential Revision: https://reviews.llvm.org/D95413	2021-01-26 12:53:21 +08:00
Eugene Zhulenev	d37b5393e8	[mlir:Async] Use LLVM coro operations in async.coro lowering Instead of using llvm.call operations to call LLVM coro intrinsics use Coro operations from the LLVM dialect. (This was reviewed as a part of https://reviews.llvm.org/D94923 but was lost in arc land from local branch) Differential Revision: https://reviews.llvm.org/D95405	2021-01-25 16:42:11 -08:00
Eugene Zhulenev	9c53b8e52e	[mlir:Async] Add intermediate async.coro and async.runtime operations to simplify Async to LLVM lowering [NFC] No new functionality, mostly a cleanup and one more abstraction level between Async and LLVM IR. Instead of lowering from Async to LLVM coroutines and Async Runtime API in one shot, do it progressively via async.coro and async.runtime operations. 1. Lower from async to async.runtime/coro (e.g. async.execute to function with coro setup and runtime calls) 2. Lower from async.runtime/coro to LLVM intrinsics and runtime API calls Intermediate coro/runtime operations will allow to run transformations on a higher level IR and do not try to match IR based on the LLVM::CallOp properties. Although async.coro is very close to LLVM coroutines, it is not exactly the same API, instead it is optimized for usability in async lowering, and misses a lot of details that are present in @llvm.coro intrinsic. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D94923	2021-01-25 14:04:33 -08:00

1 2 3 4 5 ...

6595 Commits