llvm-project/clang/test/CodeGenCUDA
Steffen Larsen f226e28a88 [Clang][NVPTX] Add NVPTX intrinsics and builtins for CUDA PTX redux.sync instructions
Adds NVPTX builtins and intrinsics for the CUDA PTX `redux.sync` instructions
for `sm_80` architecture or newer.

PTX ISA description of `redux.sync`:
https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#parallel-synchronization-and-communication-instructions-redux-sync

Authored-by: Steffen Larsen <steffen.larsen@codeplay.com>

Differential Revision: https://reviews.llvm.org/D100124
2021-05-17 09:46:59 -07:00
..
Inputs [HIP] Emit kernel symbol 2021-03-01 16:31:40 -05:00
address-spaces.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
alias.cu
amdgpu-atomic-ops.cu Let clang atomic builtins fetch add/sub support floating point types 2021-04-06 15:44:00 -04:00
amdgpu-func-attrs.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
amdgpu-hip-implicit-kernarg.cu
amdgpu-kernel-arg-pointer-type.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
amdgpu-kernel-attrs.cu Relands "[HIP] Change default --gpu-max-threads-per-block value to 1024" 2021-02-12 10:53:59 -05:00
amdgpu-visibility.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
amdgpu-workgroup-size.cu [AMDGPU] Add __builtin_amdgcn_workgroup_size_x/y/z 2020-03-28 01:03:20 -04:00
builtin-mangled-name.cu [CUDA][HIP] add __builtin_get_device_side_mangled_name 2021-03-25 15:25:29 -04:00
builtins-amdgcn.cu [amdgpu] Add `llvm.amdgcn.endpgm` support. 2020-11-05 19:06:50 -05:00
constexpr-variables.cu [CUDA][HIP] Support accessing static device variable in host code for -fno-gpu-rdc 2020-08-05 07:57:38 -04:00
convergent.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
correctly-rounded-div.cu [HIP] Fix device lib selection 2021-05-01 20:31:11 -04:00
cuda-builtin-vars.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
cxx-call-kernel.cpp Fix test cxx-call-kernel.cpp 2021-03-01 17:10:53 -05:00
debug-info-address-class.cu
debug-info-template.cu
deferred-diag.cu Speed up deferred diagnostic emitter 2020-04-06 13:07:43 -04:00
dependent-libs.cu
device-fun-linkage.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
device-init-fun.cu [HIP] Add option -fgpu-allow-device-init 2019-10-22 16:06:20 -04:00
device-stub.cu [CUDA][HIP] Fix device template variables 2021-05-12 11:13:29 -04:00
device-use-host-var.cu [CUDA][HIP] Allow non-ODR use of host var in device 2021-04-19 14:45:24 -04:00
device-var-init.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
device-var-linkage.cu [CUDA][HIP] Support accessing static device variable in host code for -fgpu-rdc 2021-02-24 18:23:45 -05:00
device-vtable.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
dft-func-attr-skip-intrinsic.hip [clang][codegen] Skip adding default function attributes on intrinsics. 2020-09-16 14:10:05 -04:00
filter-decl.cu
flush-denormals.cu [CUDA][HIP] rename -fcuda-flush-denormals-to-zero 2021-04-05 00:13:51 -04:00
fp-contract.cu [HIP] Fix regressions due to fp contract change 2020-11-24 08:10:06 -05:00
function-overload.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
host-used-device-var.cu [CUDA][HIP] Fix device template variables 2021-05-12 11:13:29 -04:00
kernel-amdgcn.cu Relands "[HIP] Change default --gpu-max-threads-per-block value to 1024" 2021-02-12 10:53:59 -05:00
kernel-args-alignment.cu
kernel-args.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
kernel-call.cu [HIP] Support new kernel launching API 2019-09-24 19:16:40 +00:00
kernel-dbg-info.cu [HIP] Emit kernel symbol 2021-03-01 16:31:40 -05:00
kernel-stub-name.cu [HIP, test] Fix use of undef FileCheck var 2021-04-04 19:30:49 +01:00
lambda-reference-var.cu [test] Make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 13:59:44 -08:00
lambda.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
launch-bounds.cu
library-builtin.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
link-device-bitcode.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
llvm-used.cu Change some addUsedGlobal to addUsedOrCompilerUsedGlobal 2021-02-26 10:42:07 -08:00
managed-var.cu [CUDA][HIP] Support accessing static device variable in host code for -fgpu-rdc 2021-02-24 18:23:45 -05:00
ms-linker-options.cu Revert "[hip][cuda] Enable extended lambda support on Windows." 2021-02-04 07:10:46 -05:00
norecurse.cu [OpenCL][CUDA][HIP][SYCL] Add norecurse 2020-02-16 20:41:00 -05:00
nothrow.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
openmp-target.cu [CUDA][HIP] Fix host/device check with -fopenmp 2019-10-09 23:54:10 +00:00
printf-aggregate.cu
printf.cu
profile-coverage-mapping.cu [PGO][CUDA][HIP] Skip generating profile on the device stub and wrong-side functions. 2020-08-10 11:01:46 -04:00
propagate-metadata.cu [clang][cli] Remove -f[no-]trapping-math from -cc1 command line 2021-01-12 10:00:23 +01:00
ptx-kernels.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
redux-builtins.cu [Clang][NVPTX] Add NVPTX intrinsics and builtins for CUDA PTX redux.sync instructions 2021-05-17 09:46:59 -07:00
static-device-var-no-rdc.cu [CUDA][HIP] Fix device template variables 2021-05-12 11:13:29 -04:00
static-device-var-rdc.cu [CUDA][HIP] Fix device template variables 2021-05-12 11:13:29 -04:00
surface.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
texture.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
types.cu [test] Add {{.*}} to make ELF tests immune to dso_local/dso_preemptable/(none) differences 2020-12-31 00:27:11 -08:00
unnamed-types.cu [HIP] Emit kernel symbol 2021-03-01 16:31:40 -05:00
unused-global-var.cu [AMDGPU] Add GlobalDCE before internalization pass 2021-04-17 11:25:25 -04:00
usual-deallocators.cu [CUDA] Normalize handling of defauled dtor. 2021-01-21 10:48:07 -08:00