llvm-project

Navdeep Kumar 875eb523c1	2021-05-06 12:06:25 +05:30
[MLIR][GPU][NVVM] Add warp synchronous matrix-multiply accumulate ops

Add warp synchronous matrix-multiply accumulate ops in GPU and NVVM dialect.

Add following three ops to GPU dialect :-
  1.) subgroup_mma_load_matrix
  2.) subgroup_mma_store_matrix
  3.) subgroup_mma_compute

Add following three ops to NVVM dialect :-
  1.) wmma.m16n16k16.load.[a,b,c].[f16,f32].row.stride
  2.) wmma.m16n16k16.store.d.[f16,f32].row.stride
  3.) wmma.m16n16k16.mma.row.row.[f16,f32].[f16,f32]

Reviewed By: bondhugula, ftynse, ThomasRaoux
Differential Revision: https://reviews.llvm.org/D95330
..
amx.mlir	[mlir][amx] Add Intel AMX dialect (architectural-specific vector dialect)	2021-03-15 17:59:05 -07:00
arm-neon.mlir	Add arm_neon.sdot operation	2021-03-17 08:24:58 -07:00
arm-sve.mlir	[mlir][ArmSVE] Add masked arithmetic operations	2021-05-05 17:41:58 +01:00
import.ll	[MLIR] Add and propagate section attribute for LLVM_GlobalOp	2021-04-28 04:15:49 +00:00
llvmir-debug.mlir	…
llvmir-intrinsics.mlir	…
llvmir-invalid.mlir	…
llvmir-types.mlir	…
llvmir.mlir	[MLIR] Add and propagate section attribute for LLVM_GlobalOp	2021-04-28 04:15:49 +00:00
nvvmir.mlir	[MLIR][GPU][NVVM] Add warp synchronous matrix-multiply accumulate ops	2021-05-06 12:06:25 +05:30
openmp-llvm.mlir	…
rocdl.mlir	…
vector-to-llvm-ir.mlir	…
x86vector.mlir	[mlir] X86Vector: Add AVX Rsqrt	2021-04-13 08:43:48 -07:00