# 'gpu' Dialect

Note: this dialect is more likely to change than others in the near future; use
with caution.

This dialect provides middle-level abstractions for launching GPU kernels
following a programming model similar to that of CUDA or OpenCL. It provides
abstractions for kernel invocations (and may eventually provide those for device
management) that are not present at the lower level (e.g., as LLVM IR intrinsics
for GPUs). Its goal is to abstract away device- and driver-specific
manipulations to launch a GPU kernel and provide a simple path towards GPU
execution from MLIR. It may be targeted, for example, by DSLs using MLIR. The
dialect uses `gpu` as its canonical prefix.

## Memory attribution

Memory buffers are defined at the function level, either in `gpu.launch` or in
`gpu.func` ops. This encoding makes it clear where the memory belongs and makes
the lifetime of the memory visible. The memory is only accessible while the
kernel is launched or the function is being invoked. This restriction is
stricter than what actual GPU implementations impose; declaring static memory
at the function level is only a convenience. It is always possible to pass
pointers to the workgroup memory into other functions, provided they expect the
correct memory space.

The buffers are considered live throughout the execution of the GPU function
body. The absence of memory attribution syntax means that the function does not
require special buffers.

Rationale: although the underlying models declare memory buffers at the module
level, we chose to do it at the function level to provide some structuring for
the lifetime of those buffers. This avoids the incentive to use the buffers for
communicating between different kernels or launches of the same kernel, which
should be done through function arguments instead. We chose not to use an
`alloca`-style approach, which would require more complex lifetime analysis,
in keeping with the principles of MLIR that promote structure and representing
analysis results in the IR.
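
As a rough illustration of the attribution syntax described in this section,
the sketch below declares one kernel with a workgroup buffer and a private
buffer, and one kernel with no attribution at all. The names `@kernels`,
`@with_buffers`, `@no_buffers`, `%wg` and `%priv` are hypothetical. The numeric
memory spaces `3` (workgroup) and `5` (private) are an assumption about the
convention in use; the exact spelling of memory spaces may differ between MLIR
versions.

```mlir
gpu.module @kernels {
  // Kernel with memory attribution: %wg is a buffer shared by the workgroup,
  // %priv is a buffer private to each work item. Both are live for the whole
  // function body. Memory spaces 3 and 5 are assumed, not normative.
  gpu.func @with_buffers(%arg0: index)
      workgroup(%wg: memref<32xf32, 3>)
      private(%priv: memref<1xf32, 5>)
      kernel {
    gpu.return
  }

  // No attribution syntax: this kernel requires no special buffers.
  gpu.func @no_buffers(%arg0: index) kernel {
    gpu.return
  }
}
```

Because the buffers appear in the function signature, their lifetime is visible
directly in the IR and no separate lifetime analysis is needed to find them.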

## Operations

[include "Dialects/GPUOps.md"]