llvm-project

Krzysztof Drewniak c57b2a0635 [MLIR][GPU] Make max flat work group size for ROCDL kernels configurable	2021-12-14 20:12:23 +00:00

While the default value for the amdgpu-flat-work-group-size attribute, "1, 256", matches the defaults from Clang, some users of the ROCDL dialect, namely Tensorflow, use larger workgroups, such as 1024. Therefore, instead of hardcoding this value, we add a rocdl.max_flat_work_group_size attribute that can be set on GPU kernels to override the default value.

Reviewed By: whchung

Differential Revision: https://reviews.llvm.org/D115741
Cpp	[mlir] Convert NamedAttribute to be a class	2021-11-18 05:39:29 +00:00
LLVMIR	[MLIR][GPU] Make max flat work group size for ROCDL kernels configurable	2021-12-14 20:12:23 +00:00
SPIRV	[mlir][spirv] Use ScopedPrinter in deserialization debugging	2021-12-13 10:51:56 -05:00
CMakeLists.txt	[mlir] Add Cpp emitter	2021-09-02 13:51:05 +00:00