llvm-project/clang/test/CodeGenCUDA
Michael Liao c7b683c126 [PGO][CUDA][HIP] Skip generating profile on the device stub and wrong-side functions.
- Skip generating profile data on `__global__` function in the host
  compilation. It's a host-side stub function only and don't have
  profile instrumentation generated on the real function body. The extra
  profile data results in the malformed instrumentation profile data.
- Skip generating region mapping on functions in the wrong-side, i.e.,
  + For the device compilation, skip host-only functions; and,
  + For the host compilation, skip device-only functions (including
    `__global__` functions.)
- As the device-side profiling is not ready yet, only host-side profile
  code generation is checked.

Differential Revision: https://reviews.llvm.org/D85276
2020-08-10 11:01:46 -04:00
..
Inputs [HIP] Support new kernel launching API 2019-09-24 19:16:40 +00:00
address-spaces.cu
alias.cu
amdgpu-hip-implicit-kernarg.cu [AMDGPU] Increased the number of implicit argument bytes for both OpenCL and HIP (CLANG). 2019-07-10 15:10:08 +00:00
amdgpu-kernel-arg-pointer-type.cu [hip] Refine `clang/test/CodeGenCUDA/amdgpu-kernel-arg-pointer-type.cu` 2020-06-25 23:57:08 -04:00
amdgpu-kernel-attrs.cu [HIP] Change default --gpu-max-threads-per-block value to 1024 2020-06-03 11:09:22 -04:00
amdgpu-visibility.cu [HIP] Fix visibility for 'extern' device variables. 2019-11-05 14:19:32 -05:00
amdgpu-workgroup-size.cu [AMDGPU] Add __builtin_amdgcn_workgroup_size_x/y/z 2020-03-28 01:03:20 -04:00
builtins-amdgcn.cu Fix how cc1 command line options are mapped into FP options. 2020-06-01 22:00:30 -04:00
constexpr-variables.cu [CUDA][HIP] Support accessing static device variable in host code for -fno-gpu-rdc 2020-08-05 07:57:38 -04:00
convergent.cu
cuda-builtin-vars.cu
debug-info-address-class.cu [DEBUG_INFO][NVPTX] Generate correct data about variable address class. 2019-02-05 19:45:57 +00:00
debug-info-template.cu [CUDA][HIP][DebugInfo] Skip reference device function 2019-03-06 21:16:27 +00:00
deferred-diag.cu Speed up deferred diagnostic emitter 2020-04-06 13:07:43 -04:00
dependent-libs.cu Fix failure of lit test dependent-libs.cu 2019-05-29 01:34:44 +00:00
device-init-fun.cu [HIP] Add option -fgpu-allow-device-init 2019-10-22 16:06:20 -04:00
device-stub.cu [cuda][hip] Fix `RegisterVar` function prototype. 2020-04-03 12:57:09 -04:00
device-var-init.cu [CUDA][HIP] Skip setting `externally_initialized` for static device variables. 2019-05-29 17:23:27 +00:00
device-vtable.cu
filter-decl.cu
flush-denormals.cu clang/AMDGPU: Stop setting old denormal subtarget features 2020-04-02 17:17:12 -04:00
fp-contract.cu
function-overload.cu
kernel-amdgcn.cu [HIP] Change default --gpu-max-threads-per-block value to 1024 2020-06-03 11:09:22 -04:00
kernel-args-alignment.cu LLVM IR: Generate new-style byval-with-Type from Clang 2019-06-05 21:12:14 +00:00
kernel-args.cu clang: Use byref for aggregate kernel arguments 2020-08-06 15:52:26 -04:00
kernel-call.cu [HIP] Support new kernel launching API 2019-09-24 19:16:40 +00:00
kernel-dbg-info.cu Fix debug info for NoDebug attr 2020-05-21 09:02:56 -04:00
kernel-stub-name.cu [HIP] Fix device stub name 2020-03-09 16:40:05 -04:00
lambda.cu [CUDA][HIP] Let lambda be host device by default 2020-07-08 13:10:26 -04:00
launch-bounds.cu
library-builtin.cu Fix how cc1 command line options are mapped into FP options. 2020-06-01 22:00:30 -04:00
link-device-bitcode.cu
llvm-used.cu
ms-linker-options.cu [CUDA][HIP] Disable emitting llvm.linker.options in device compilation 2019-11-04 23:21:39 -05:00
norecurse.cu [OpenCL][CUDA][HIP][SYCL] Add norecurse 2020-02-16 20:41:00 -05:00
nothrow.cu
openmp-target.cu [CUDA][HIP] Fix host/device check with -fopenmp 2019-10-09 23:54:10 +00:00
printf-aggregate.cu
printf.cu
profile-coverage-mapping.cu [PGO][CUDA][HIP] Skip generating profile on the device stub and wrong-side functions. 2020-08-10 11:01:46 -04:00
propagate-metadata.cu Assume ieee behavior without denormal-fp-math attribute 2020-03-07 12:10:56 -05:00
ptx-kernels.cu
static-device-var-no-rdc.cu [CUDA][HIP] Support accessing static device variable in host code for -fno-gpu-rdc 2020-08-05 07:57:38 -04:00
surface.cu [cuda][hip] Add CUDA builtin surface/texture reference support. 2020-03-27 17:18:49 -04:00
texture.cu [cuda][hip] Add CUDA builtin surface/texture reference support. 2020-03-27 17:18:49 -04:00
types.cu Do not copy long double and 128-bit fp format from aux target for AMDGPU 2019-01-31 21:57:51 +00:00
unnamed-types.cu [HIP] Fix device stub name 2020-03-09 16:40:05 -04:00
usual-deallocators.cu IR: print value numbers for unnamed function arguments 2019-08-03 14:28:34 +00:00