llvm-project

Commit Graph

Author	SHA1	Message	Date
Tim Shen	b762bbd4c8	[MLIR] change NVVM.mma.sync to the most useful variant. Summary: the .row.col variant turns out to be the popular one, contrary to what I thought as .row.row. Since .row.col is so prevailing (as I inspect cuDNN's behavior), I'm going to remove the .row.row support here, which makes the patch a little bit easier. Reviewers: ftynse Subscribers: jholewinski, bixia, sanjoy.google, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74655	2020-02-18 17:57:04 -08:00
Christian Sigg	9b85582682	Automated rollback of commit `f68ac464d8` PiperOrigin-RevId: 285162061	2019-12-12 03:48:38 -08:00
Christian Sigg	f68ac464d8	Switch from shfl.bfly to shfl.down. Both work for the current use case, but the latter allows implementing prefix sums and is a little easier to understand for partial warps. PiperOrigin-RevId: 285145287	2019-12-12 01:28:01 -08:00
MLIR Team	1f43d0d000	[NVVM] Add mma.sync operation. PiperOrigin-RevId: 278440547	2019-11-04 12:36:37 -08:00
Christian Sigg	c3e56cd12c	Get active source lane predicate from shuffle instruction. nvvm.shfl.sync.bfly optionally returns a predicate whether source lane was active. Support for this was added to clang in https://reviews.llvm.org/D68892. Add an optional 'pred' unit attribute to the instruction to return this predicate. Specify this attribute in the partial warp reduction so we don't need to manually compute the predicate. PiperOrigin-RevId: 275616564	2019-10-19 01:53:25 -07:00
Alex Zinenko	5e7959a353	Use llvm.func to define functions with wrapped LLVM IR function type This function-like operation allows one to define functions that have wrapped LLVM IR function type, in particular variadic functions. The operation was added in parallel to the existing lowering flow, this commit only switches the flow to use it. Using a custom function type makes the LLVM IR dialect type system more consistent and avoids complex conversion rules for functions that previously had to use the built-in function type instead of a wrapped LLVM IR dialect type and perform conversions during the analysis. PiperOrigin-RevId: 273910855	2019-10-10 01:34:06 -07:00
MLIR Team	5e65dafbfa	Add warpsize and laneid intrinsics to NVVM dialect. PiperOrigin-RevId: 268041263	2019-09-09 11:38:03 -07:00
MLIR Team	696fcb7520	Add 3 additional intrinsic ops to NVVM dialect, in preparation to implement block-wide reduce. PiperOrigin-RevId: 265720077	2019-08-27 10:56:18 -07:00
Alex Zinenko	f35d0c8570	NVVM target: emit nvvm.annotations for kernel functions PTX backend in LLVM expects additional module-level metadata `!nvvm.annotations` that lists functions that can be used as GPU kernels. Generate this metadata based on the `gpu.kernel` attribute attached to functions. This attribute is added automatically by the kernel outlining pass in the GPU dialect lowering flow. PiperOrigin-RevId: 254957345	2019-06-25 09:19:27 -07:00
Stephan Herhut	5d7231d812	Add transformation of the NVVM dialect to an LLVM module. Only handles the generation of intrinsics out of NVVM index ops for now. -- PiperOrigin-RevId: 245933152	2019-05-06 08:22:14 -07:00

10 Commits