llvm-project/mlir
Navdeep Kumar 875eb523c1 [MLIR][GPU][NVVM] Add warp synchronous matrix-multiply accumulate ops
Add warp synchronous matrix-multiply accumulate ops in GPU and NVVM
dialect. Add following three ops to GPU dialect :-
  1.) subgroup_mma_load_matrix
  2.) subgroup_mma_store_matrix
  3.) subgroup_mma_compute
Add following three ops to NVVM dialect :-
  1.) wmma.m16n16k16.load.[a,b,c].[f16,f32].row.stride
  2.) wmma.m16n16k16.store.d.[f16,f32].row.stride
  3.) wmma.m16n16k16.mma.row.row.[f16,f32].[f16,f32]

Reviewed By: bondhugula, ftynse, ThomasRaoux

Differential Revision: https://reviews.llvm.org/D95330
2021-05-06 12:06:25 +05:30
..
cmake/modules Move MLIR python sources to mlir/python. 2021-05-03 18:36:48 +00:00
docs Move MLIR python sources to mlir/python. 2021-05-03 18:36:48 +00:00
examples Change add_mlir_doc CMake macro to take the tablegen command as last argument to allow extra flags 2021-04-15 02:59:40 +00:00
include [MLIR][GPU][NVVM] Add warp synchronous matrix-multiply accumulate ops 2021-05-06 12:06:25 +05:30
lib [MLIR][GPU][NVVM] Add warp synchronous matrix-multiply accumulate ops 2021-05-06 12:06:25 +05:30
python Move MLIR python sources to mlir/python. 2021-05-03 18:36:48 +00:00
test [MLIR][GPU][NVVM] Add warp synchronous matrix-multiply accumulate ops 2021-05-06 12:06:25 +05:30
tools [mlir][sparse] Introduce proper sparsification passes 2021-05-04 17:10:09 -07:00
unittests [mlir] Pass AnalysisManager as optional parameter to analysis ctor, so it can request any other analysis as dependency 2021-04-20 19:18:36 +03:00
utils [mlir] Add a vscode language extension for MLIR 2021-04-21 14:44:37 -07:00
.clang-format
.clang-tidy Fix MLIR clang-tidy: when tweaking it does not inherit from the parent 2020-03-07 17:44:21 +00:00
CMakeLists.txt Move MLIR python sources to mlir/python. 2021-05-03 18:36:48 +00:00
LICENSE.TXT
README.md

README.md

Multi-Level Intermediate Representation

See https://mlir.llvm.org/ for more information.