forked from OSchip/llvm-project
11d12670da
Originally, we were attaching attributes containing CUBIN blobs to the kernel function called by `gpu.launch_func`. This kernel is now contained in a nested module that is used as a compilation unit. Attach compiled CUBIN blobs to the module rather than to the function since we were compiling the module. This also avoids duplication of the attribute on multiple kernels within the same module.

PiperOrigin-RevId: 273497303

| Name |
|---|
| .. |
| GPUCommon |
| GPUToCUDA |
| GPUToNVVM |
| GPUToROCDL |
| GPUToSPIRV |
| LoopToStandard |
| LoopsToGPU |
| StandardToLLVM |
| StandardToSPIRV |
| VectorToLLVM |
| CMakeLists.txt |