llvm-project

Commit Graph

Author	SHA1	Message	Date
Gus Smith	22911585bb	[mlir][sparse] Add Matricized Tensor Times Khatri-Rao Product (MTTKRP) integration test See this documentation from taco: http://tensor-compiler.org/docs/data_analytics/index.html Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D104417	2021-06-17 16:53:12 +00:00
Gus Smith	f9a6d47c36	Add sparse matrix multiplication integration test Adds an integration test for the SPMM (sparse matrix multiplication) kernel, which multiplies a sparse matrix by a dense matrix, resulting in a dense matrix. This is just a simple modification on the existing matrix-vector multiplication kernel. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D104334	2021-06-16 13:20:20 -07:00
Aart Bik	ec8910c4ad	[mlir][sparse] integration test for all-dense annotated "sparse" output Reviewed By: gussmith23 Differential Revision: https://reviews.llvm.org/D104277	2021-06-15 15:44:11 -07:00
thomasraoux	750799b7bc	[mlir][NFC] Don't outline kernel in MMA integration tests This matches better how other gpu integration tests are done. Differential Revision: https://reviews.llvm.org/D103099	2021-05-27 09:43:54 -07:00
thomasraoux	b44007bec2	[mlir][gpu] Relax restriction on MMA store op to allow chain of mma ops. In order to allow large matmul operations using the MMA ops, we need to chain operations. This is not possible unless the "DOp" and "COp" types have matching layouts, so remove the "DOp" layout and force the accumulator and result types to match. Added a test for the case where the MMA value is accumulated. Differential Revision: https://reviews.llvm.org/D103023	2021-05-27 09:13:51 -07:00
Aart Bik	ca446e58c8	[sparse][mlir] simplify sparse runtime support library Removed some of the older raw "MLIRized" versions that are no longer needed now that the sparse runtime support library can focus on the proper sparse tensor types rather than the opaque pointer approach of the past. This avoids legacy... Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D102960	2021-05-25 09:39:14 -07:00
Matthias Springer	5017b0f88b	[mlir] Check only last dim stride in transfer op lowering Lower a 1D vector transfer op to LLVM if the last dim stride is 1. Also fixes a bug in the original unit stride computation. Differential Revision: https://reviews.llvm.org/D102897	2021-05-25 17:53:24 +09:00
thomasraoux	dae9038611	[mlir] Lower sm version for TensorCore integration tests Those tests only require sm70; this allows running those integration tests on more hardware. Differential Revision: https://reviews.llvm.org/D103049	2021-05-24 14:45:24 -07:00
Navdeep Kumar	e552fa28da	[MLIR][GPU] Add CUDA Tensor core WMMA test Add a test case to test the complete execution of WMMA ops on a Nvidia GPU with tensor cores. These tests are enabled under MLIR_RUN_CUDA_TENSOR_CORE_TESTS. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D95334	2021-05-22 16:19:36 +05:30
Aart Bik	c194b49c9c	[mlir][sparse] add full dimension ordering support This revision completes the "dimension ordering" feature of sparse tensor types that enables the programmer to define a preferred order on dimension access (other than the default left-to-right order). This enables e.g. selection of column-major over row-major storage for sparse matrices, but generalized to any rank, as in: dimOrdering = affine_map<(i,j,k,l,m,n,o,p) -> (p,o,j,k,i,l,m,n)> Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102856	2021-05-21 12:35:13 -07:00
Matthias Springer	fb7ec1f187	[mlir] Use VectorTransferPermutationMapLoweringPatterns in VectorToSCF VectorTransferPermutationMapLoweringPatterns can be enabled via a pass option. These additional patterns lower permutation maps to minor identity maps with broadcasting, if possible, allowing for more efficient vector load/stores. The option is deactivated by default. Differential Revision: https://reviews.llvm.org/D102593	2021-05-19 14:46:19 +09:00
Nicolas Vasilache	f8dbd61074	[mlir][Linalg] Drop spuriously long matmul_column_major benchmark	2021-05-18 10:07:19 +00:00
Aart Bik	5879da496c	[mlir][sparse] replace experimental flag with inplace attribute The experimental flag for "inplace" bufferization in the sparse compiler can be replaced with the new inplace attribute. This gives a uniform way of expressing the more efficient way of bufferization. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102538	2021-05-17 11:43:44 -07:00
Matthias Springer	6774e5a995	[mlir] Fix in_bounds attr handling in TransferReadPermutationLowering The in_bounds attribute should also be transposed. Differential Revision: https://reviews.llvm.org/D102572	2021-05-17 15:28:16 +09:00
Matthias Springer	0f24163870	[mlir] Replace vector-to-scf with progressive-vector-to-scf Depends On D102388 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D102101	2021-05-13 23:27:31 +09:00
Matthias Springer	bf068e1077	[mlir] Do not use pass labels in unrolled ProgressiveVectorToSCF Do not rely on pass labels to detect if the pattern was already applied in the past (which allows for some extra optimizations to avoid extra InsertOps and ExtractOps). Instead, check if these optimizations can be applied on-the-fly. This also fixes a bug, where vector.insert and vector.extract ops sometimes disappeared in the middle of the pass because they get folded away, but the next application of the pattern expected them to be there. Differential Revision: https://reviews.llvm.org/D102206	2021-05-13 22:01:08 +09:00
Matthias Springer	9b77be5583	[mlir] Unrolled progressive-vector-to-scf. Instead of an SCF for loop, these patterns generate fully unrolled loops with no temporary buffer allocations. Differential Revision: https://reviews.llvm.org/D101981	2021-05-13 13:08:48 +09:00
Matthias Springer	c52cbe63e4	[mlir] Fix masked vector transfer ops with broadcasts Broadcast dimensions of a vector transfer op have no corresponding dimension in the mask vector. E.g., a 2-D TransferReadOp, where one dimension is a broadcast, can have a 1-D `mask` attribute. This commit also adds a few additional transfer op integration tests for various combinations of broadcasts, masking, dim transposes, etc. Differential Revision: https://reviews.llvm.org/D101745	2021-05-13 12:46:03 +09:00
Matthias Springer	6555e53ab0	Revert "[mlir] Fix masked vector transfer ops with broadcasts" This reverts commit `c9087788f7`. Accidentally pushed old version of the commit.	2021-05-13 11:55:00 +09:00
Matthias Springer	c9087788f7	[mlir] Fix masked vector transfer ops with broadcasts Broadcast dimensions of a vector transfer op have no corresponding dimension in the mask vector. E.g., a 2-D TransferReadOp, where one dimension is a broadcast, can have a 1-D `mask` attribute. This commit also adds a few additional transfer op integration tests for various combinations of broadcasts, masking, dim transposes, etc. Differential Revision: https://reviews.llvm.org/D101745	2021-05-13 11:37:36 +09:00
Aart Bik	96a23911f6	[mlir][sparse] complete migration to sparse tensor type A very elaborate, but also very fun revision because all puzzle pieces are finally "falling in place". 1. replaces linalg annotations + flags with proper sparse tensor types 2. add rigorous verification on sparse tensor type and sparse primitives 3. removes glue and clutter on opaque pointers in favor of sparse tensor types 4. migrates all tests to use sparse tensor types NOTE: next CL will remove all obsoleted sparse code in Linalg Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102095	2021-05-10 12:55:22 -07:00
Aart Bik	a2c9d4bb04	[mlir][sparse] Introduce proper sparsification passes This revision migrates more code from Linalg into the new permanent home of SparseTensor. It replaces the test passes with proper compiler passes. NOTE: the actual removal of the last glue and clutter in Linalg will follow Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D101811	2021-05-04 17:10:09 -07:00
Aart Bik	319072f4e3	[mlir][sparse] migrate sparse operations into new sparse tensor dialect This is the very first step toward removing the glue and clutter from linalg and replace it with proper sparse tensor types. This revision migrates the LinalgSparseOps into SparseTensorOps of a sparse tensor dialect. This also provides a new home for sparse tensor related transformation. NOTE: the actual replacement with sparse tensor types (and removal of linalg glue/clutter) will follow but I am trying to keep the amount of changes per revision manageable. Differential Revision: https://reviews.llvm.org/D101573	2021-04-29 15:52:35 -07:00
Mehdi Amini	086e0f05bf	Revert "[mlir][sparse] migrate sparse operations into new sparse tensor dialect" This reverts commit `a6d92a9711`. The build with -DBUILD_SHARED_LIBS=ON is broken.	2021-04-29 20:59:41 +00:00
Aart Bik	a6d92a9711	[mlir][sparse] migrate sparse operations into new sparse tensor dialect This is the very first step toward removing the glue and clutter from linalg and replace it with proper sparse tensor types. This revision migrates the LinalgSparseOps into SparseTensorOps of a sparse tensor dialect. This also provides a new home for sparse tensor related transformation. NOTE: the actual replacement with sparse tensor types (and removal of linalg glue/clutter) will follow but I am trying to keep the amount of changes per revision manageable. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D101488	2021-04-29 12:09:10 -07:00
Matthias Springer	64f7fb5dfc	[mlir] Support masked N-D vector transfer ops in ProgressiveVectorToSCF. Mask vectors are handled similar to data vectors in N-D TransferWriteOp. They are copied into a temporary memory buffer, which can be indexed into with non-constant values. Differential Revision: https://reviews.llvm.org/D101136	2021-04-23 18:23:51 +09:00
Matthias Springer	545f98efc7	[mlir] Support masked 1D vector transfer ops in ProgressiveVectorToSCF Support for masked N-D vector transfer ops will be added in a subsequent commit. Differential Revision: https://reviews.llvm.org/D101132	2021-04-23 18:08:50 +09:00
Matthias Springer	a819e73393	[mlir] Support broadcast dimensions in ProgressiveVectorToSCF This commit adds support for broadcast dimensions in permutation maps of vector transfer ops. Also fixes a bug in VectorToSCF that generated incorrect in-bounds checks for broadcast dimensions. Differential Revision: https://reviews.llvm.org/D101019	2021-04-23 18:01:32 +09:00
Matthias Springer	ab154176bf	[mlir] Support dimension permutations in ProgressiveVectorToSCF This commit adds support for dimension permutations in permutation maps of vector transfer ops. Differential Revision: https://reviews.llvm.org/D101007	2021-04-23 17:46:35 +09:00
Matthias Springer	afaf36b69e	[mlir] Handle strided 1D vector transfer ops in ProgressiveVectorToSCF Strided 1D vector transfer ops are 1D transfers operating on a memref dimension different from the last one. Such transfer ops do not access contiguous memory blocks (vectors), but access memory in a strided fashion. In the absence of a mask, strided 1D vector transfer ops can also be lowered using matrix.column.major.* LLVM instructions (in a later commit). Subsequent commits will extend the pass to handle the remaining missing permutation maps (broadcasts, transposes, etc.). Differential Revision: https://reviews.llvm.org/D100946	2021-04-23 17:19:22 +09:00
Matthias Springer	7cc8106f67	[mlir] Progressively lower vector to SCF Add a new ProgressiveVectorToSCF pass that lowers vector transfer ops to SCF by gradually unpacking one dimension at time. Unpacking stops at 1D, but can be configured to stop earlier, should the HW support (N>1)-d vectors. The current implementation cannot handle permutation maps, masks, tensor types and unrolling yet. These will be added in subsequent commits. Once features are on par with VectorToSCF, this implementation will replace VectorToSCF. Differential Revision: https://reviews.llvm.org/D100622	2021-04-20 18:49:36 +09:00
Aart Bik	916f3e16bd	[mlir][vector][avx] add AVX dot product to X86Vector dialect with lowering In the long run, we want to unify the dot product codegen solutions between all target architectures, but this intrinsic enables experimenting with AVX specific implementations in the meantime. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D100593	2021-04-15 15:01:39 -07:00
Emilio Cota	cf20286bcc	[mlir] Use default lli JIT in Integration tests Now that `9b8e7a9d` ("[lli] Honor the --entry-function flag in orc and orc-lazy modes") fixed https://llvm.org/PR49906. Reviewed By: mehdi_amini, aartbik Differential Revision: https://reviews.llvm.org/D100407	2021-04-14 12:55:00 -07:00
Matthias Springer	3f4c1e13bc	[mlir] Fix return values of AMX tests Differential Revision: https://reviews.llvm.org/D100422	2021-04-14 09:40:49 +09:00
Eugene Zhulenev	8a316b00d6	[mlir] Convert async dialect passes from function passes to op agnostic passes Differential Revision: https://reviews.llvm.org/D100401	2021-04-13 11:46:00 -07:00
Emilio Cota	0b63e3222b	[mlir] X86Vector: Add AVX Rsqrt Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D99818	2021-04-13 08:43:48 -07:00
Eugene Zhulenev	a6628e596e	[mlir] Async: add automatic reference counting at async.runtime operations level Depends On D95311 Previous automatic-ref-counting pass worked with high level async operations (e.g. async.execute), however async values reference counting is a runtime implementation detail. New pass mostly relies on the save liveness analysis to place drop_ref operations, and does better verification of CFG with different liveIn sets in block successors. This is almost NFC change. No new reference counting ideas, just a cleanup of the previous version. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D95390	2021-04-12 18:54:55 -07:00
Emilio Cota	1310a19af0	[mlir] Use MCJIT to fix integration tests Since `c42c67ad` ('Re-apply "[lli] Make -jit-kind=orc the default JIT engine"'), ORC is the default JIT. Unfortunately, ORC seems to ignore the --entry-function flag, which breaks all tests that use the flag, namely the AMX and X86Vector integration tests. This has been reported in PR#49906 (https://bugs.llvm.org/show_bug.cgi?id=49906). Work around this by explicitly selecting MCJIT. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D100344	2021-04-12 18:25:33 -07:00
Emilio Cota	8508a63b88	[mlir] Rename AVX512 dialect to X86Vector We will soon be adding non-AVX512 operations to MLIR, such as AVX's rsqrt. In https://reviews.llvm.org/D99818 several possibilities were discussed, namely to (1) add non-AVX512 ops to the AVX512 dialect, (2) add more dialects (e.g. AVX dialect for AVX rsqrt), and (3) expand the scope of the AVX512 to include these SIMD x86 ops, thereby renaming the dialect to something more accurate such as X86Vector. Consensus was reached on option (3), which this patch implements. Reviewed By: aartbik, ftynse, nicolasvasilache Differential Revision: https://reviews.llvm.org/D100119	2021-04-12 19:20:04 +02:00
Tobias Gysi	b614ada0e8	[mlir] add support for index type in vectors. The patch enables the use of index type in vectors. It is a prerequisite to support vectorization for indexed Linalg operations. This refactoring became possible due to the newly introduced data layout infrastructure. The data layout of a module defines the bitwidth of the index type needed to verify bitcasts and similar vector operations. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D99948	2021-04-08 08:17:13 +00:00
Aart Bik	3acf49829c	[mlir][sparse] support integral types i32,i16,i8 for numerical values Some sparse matrices operate on integral values (in contrast with the common f32 and f64 values). This CL expands the compiler and runtime support to deal with several common type combinations. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D99999	2021-04-07 10:01:37 -07:00
Matthias Springer	65a3f28939	[mlir] Add "mask" operand to vector.transfer_read/write. Also factors out out-of-bounds mask generation from vector.transfer_read/write into a new MaterializeTransferMask pattern. Differential Revision: https://reviews.llvm.org/D100001	2021-04-07 21:33:13 +09:00
Aart Bik	a46f59a747	[mlir][sparse] move new test to right directory structure Differential Revision: https://reviews.llvm.org/D99899	2021-04-05 14:41:18 -07:00
Aart Bik	a0c5b7e3b5	[mlir][sparse] support for very narrow index and pointer types Rationale: Small indices and values, when allowed by the required range of the input tensors, can reduce the memory footprint of sparse tensors even more. Note, however, that we must be careful zero extending the values (since sparse tensors never use negatives for indexing), but LLVM treats the index type as signed in most memory operations (like the scatter and gather). This CL dots all the i's in this regard. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D99777	2021-04-01 18:21:27 -07:00
Matthias Springer	95f8135043	[mlir] Change vector.transfer_read/write "masked" attribute to "in_bounds". This is in preparation for adding a new "mask" operand. The existing "masked" attribute was used to specify dimensions that may be out-of-bounds. Such transfers can be lowered to masked load/stores. The new "in_bounds" attribute is used to specify dimensions that are guaranteed to be within bounds. (Semantics is inverted.) Differential Revision: https://reviews.llvm.org/D99639	2021-03-31 18:04:22 +09:00
Mehdi Amini	cdb6eb7e83	Update syntax for amx.tile_muli to use two Unit attr to mark the zext case This makes the annotation tied to the operand and the use of a keyword more explicit/readable on what it means. Differential Revision: https://reviews.llvm.org/D99001	2021-03-20 04:12:24 +00:00
thomasraoux	3587728ed5	[mlir] Fix cuda integration test failure	2021-03-19 10:33:55 -07:00
Christian Sigg	a825fb2c07	[mlir] Remove mlir-rocm-runner This change combines for ROCm what was done for CUDA in D97463, D98203, D98360, and D98396. I did not try to compile SerializeToHsaco.cpp or test mlir/test/Integration/GPU/ROCM because I don't have an AMD card. I fixed the things that had obvious bit-rot though. Reviewed By: whchung Differential Revision: https://reviews.llvm.org/D98447	2021-03-19 00:24:10 -07:00
thomasraoux	1a572f4509	[mlir] Add vector op support to cuda-runner including vector.print Differential Revision: https://reviews.llvm.org/D97346	2021-03-18 13:03:08 -07:00
Aart Bik	9705cafc0f	[mlir][amx] regression test for tile-muli (all zero/sign-extension combinations) Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98742	2021-03-17 10:04:04 -07:00
Aart Bik	b388bbd3f9	[mlir][amx] blocked tilezero integration test This adds a new integration test. However, it also adapts to a recent memref.XXX change for existing tests Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98680	2021-03-16 08:49:31 -07:00
Aart Bik	6ad7b97e20	[mlir][amx] Add Intel AMX dialect (architectural-specific vector dialect) The Intel Advanced Matrix Extensions (AMX) provides a tile matrix multiply unit (TMUL), a tile control register (TILECFG), and eight tile registers TMM0 through TMM7 (TILEDATA). This new MLIR dialect provides a bridge between MLIR concepts like vectors and memrefs and the lower level LLVM IR details of AMX. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D98470	2021-03-15 17:59:05 -07:00
Alex Zinenko	7aa6f3aa0c	[mlir] fix integration tests post `e2310704d8` The commit in question moved some ops across dialects but did not update some of the target-specific integration tests that use these ops, presumably because the corresponding target hardware was not available. Fix these tests.	2021-03-15 14:41:27 +01:00
Julian Gross	e2310704d8	[MLIR] Create memref dialect and move dialect-specific ops from std. Create the memref dialect and move dialect-specific ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp AssumeAlignmentOp -> MemRef_AssumeAlignmentOp DeallocOp -> MemRef_DeallocOp DimOp -> MemRef_DimOp MemRefCastOp -> MemRef_CastOp MemRefReinterpretCastOp -> MemRef_ReinterpretCastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp LoadOp -> MemRef_LoadOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp SubViewOp -> MemRef_SubViewOp TransposeOp -> MemRef_TransposeOp TensorLoadOp -> MemRef_TensorLoadOp TensorStoreOp -> MemRef_TensorStoreOp TensorToMemRefOp -> MemRef_BufferCastOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D98041	2021-03-15 11:14:09 +01:00
Matthias Springer	581672be04	[mlir][AVX512] Add while loop-based sparse vector-vector dot product variants. Differential Revision: https://reviews.llvm.org/D98480	2021-03-15 16:59:10 +09:00
Christian Sigg	1ef544d4a9	[mlir] Remove mlir-cuda-runner Change CUDA integration tests to use mlir-opt + mlir-cpu-runner instead. Depends On D98203 Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D98396	2021-03-12 14:06:43 +01:00
Christian Sigg	2224221fb3	[mlir] Add NVVM to CUBIN conversion to mlir-opt If MLIR_CUDA_RUNNER_ENABLED, register a 'gpu-to-cubin' conversion pass to mlir-opt. The next step is to switch CUDA integration tests from mlir-cuda-runner to mlir-opt + mlir-cpu-runner and remove mlir-cuda-runner. Depends On D98279 Reviewed By: herhut, rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D98203	2021-03-11 10:07:11 +01:00
Matthias Springer	c40e0d7609	[mlir][AVX512] Implement sparse vector dot product integration test. This test operates on two hardware-vector-sized vectors and utilizes vp2intersect and mask.compress. PHAB_REVIEW=D98099	2021-03-11 13:00:17 +09:00
Matthias Springer	acce0ea70c	[mlir][AVX512] Add mask.compress to AVX512 dialect. Adds mask.compress to the AVX512 dialect and defines a lowering to the LLVM dialect. Differential Revision: https://reviews.llvm.org/D97611	2021-03-06 10:02:48 +09:00
Christian Sigg	9d7be77bf9	[mlir] Move cuda tests Move test inputs to test/Integration directory. Move runtime wrappers to ExecutionEngine. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97463	2021-03-03 13:16:51 +01:00
Aart Bik	df5ccf5a94	[mlir][vector] add higher dimensional support to gather/scatter Similar to mask-load/store and compress/expand, the gather and scatter operation now allow for higher dimension uses. Note that to support the mixed-type index, the new syntax is: vector.gather %base [%i,%j] [%kvector] .... The first client of this generalization is the sparse compiler, which needs to define scatter and gathers on dense operands of higher dimensions too. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D97422	2021-02-26 14:20:19 -08:00
Mehdi Amini	99b0032ce0	Move the MLIR integration tests as a subdirectory of test (NFC) This does not change the behavior directly: the tests only run when `-DMLIR_INCLUDE_INTEGRATION_TESTS=ON` is configured. However, running `ninja check-mlir` will now run all the tests within a single lit invocation. The previous behavior would wait for all the integration tests to complete before starting to run the first regular test. The test results were also reported separately. This change unifies all of this and allows concurrent execution of the integration tests with regular non-regression and unit-tests. Differential Revision: https://reviews.llvm.org/D97241	2021-02-23 05:55:47 +00:00

112 Commits