2016-08-23 02:50:34 +08:00
|
|
|
// Tests the bindings generated for a CUDA offloading target for different
|
|
|
|
// combinations of:
|
|
|
|
// - Number of gpu architectures;
|
|
|
|
// - Host/device-only compilation;
|
|
|
|
// - User-requested final phase - binary or assembly.
|
|
|
|
// It parallels cuda-phases.cu test, but verifies whether output file is temporary or not.
|
|
|
|
|
|
|
|
// It's hard to check whether file name is temporary in a portable
|
|
|
|
// way. Instead we check whether we've generated a permanent name on
|
|
|
|
// device side, which appends '-device-cuda-<triple>' suffix.
|
|
|
|
|
|
|
|
// REQUIRES: clang-driver
|
|
|
|
// REQUIRES: powerpc-registered-target
|
|
|
|
// REQUIRES: nvptx-registered-target
|
|
|
|
|
|
|
|
//
|
|
|
|
// Test single gpu architecture with complete compilation.
|
|
|
|
// No intermediary device files should have "-device-cuda..." in the name.
|
|
|
|
//
|
|
|
|
// RUN: %clang -target powerpc64le-ibm-linux-gnu -ccc-print-bindings --cuda-gpu-arch=sm_30 %s 2>&1 \
|
|
|
|
// RUN: | FileCheck -check-prefix=BIN %s
|
|
|
|
// BIN: # "nvptx64-nvidia-cuda" - "clang",{{.*}} output:
|
|
|
|
// BIN-NOT: cuda-bindings-device-cuda-nvptx64
|
|
|
|
// BIN: # "nvptx64-nvidia-cuda" - "NVPTX::Assembler",{{.*}} output:
|
|
|
|
// BIN-NOT: cuda-bindings-device-cuda-nvptx64
|
|
|
|
// BIN: # "nvptx64-nvidia-cuda" - "NVPTX::Linker",{{.*}} output:
|
|
|
|
// BIN-NOT: cuda-bindings-device-cuda-nvptx64
|
|
|
|
// BIN: # "powerpc64le-ibm-linux-gnu" - "clang",{{.*}} output:
|
|
|
|
// BIN-NOT: cuda-bindings-device-cuda-nvptx64
|
|
|
|
// BIN: # "powerpc64le-ibm-linux-gnu" - "GNU::Linker", inputs:{{.*}}, output: "a.out"
|
|
|
|
|
|
|
|
//
|
|
|
|
// Test single gpu architecture up to the assemble phase.
|
|
|
|
//
|
|
|
|
// RUN: %clang -target powerpc64le-ibm-linux-gnu -ccc-print-bindings --cuda-gpu-arch=sm_30 %s -S 2>&1 \
|
|
|
|
// RUN: | FileCheck -check-prefix=ASM %s
|
[Driver][OpenMP] Add support to create jobs for unbundling actions.
Summary:
This patch adds the support to create jobs for the `OffloadBundlingAction` which will invoke the `clang-offload-bundler` tool to unbundle input files.
Unlike other actions, unbundling actions have multiple outputs. Therefore, this patch adds the required changes to have a variant of `Tool::ConstructJob` with multiple outputs.
The way the naming of the results is implemented is also slightly modified so that the same action can use a different offloading prefix for each use by the different offloading actions.
With this patch, it is possible to compile a functional OpenMP binary with offloading support, even with separate compilation.
Reviewers: echristo, tra, jlebar, ABataev, hfinkel
Subscribers: mkuron, whchung, mehdi_amini, cfe-commits, Hahnfeld, andreybokhanko, arpith-jacob, carlo.bertolli, caomhin
Differential Revision: https://reviews.llvm.org/D21857
llvm-svn: 285326
2016-10-28 02:14:55 +08:00
|
|
|
// ASM-DAG: # "nvptx64-nvidia-cuda" - "clang",{{.*}} output: "cuda-bindings-cuda-nvptx64-nvidia-cuda-sm_30.s"
|
[CUDA][OpenMP] Add a generic offload action builder
Summary:
This patch proposes a new class to generate and record action dependences related with offloading. The builder provides three main functionalities:
- Add device dependences to host actions.
- Add host dependence to device actions.
- Register device top-level actions.
The constructor of the builder detect the programming models that should be supported, and generates a specialized builder for each. If a new programming model is to be added in the future, only a new specialized builder has to be implemented.
When the specialized builder is generated, it produces programming-model-specific diagnostics.
A CUDA specialized builder is proposed in the patch that mostly consists of the partition of the current `buildCudaAction` by the three different functionalities.
Reviewers: tra, echristo, ABataev, jlebar, hfinkel
Subscribers: Hahnfeld, whchung, guansong, jlebar, mehdi_amini, andreybokhanko, tcramer, mkuron, cfe-commits, arpith-jacob, carlo.bertolli, caomhin
Differential Revision: https://reviews.llvm.org/D18172
llvm-svn: 282865
2016-09-30 23:34:19 +08:00
|
|
|
// ASM-DAG: # "powerpc64le-ibm-linux-gnu" - "clang",{{.*}} output: "cuda-bindings.s"
|
2016-08-23 02:50:34 +08:00
|
|
|
|
|
|
|
//
|
|
|
|
// Test two gpu architectures with complete compilation.
|
|
|
|
//
|
|
|
|
// RUN: %clang -target powerpc64le-ibm-linux-gnu -ccc-print-bindings --cuda-gpu-arch=sm_30 --cuda-gpu-arch=sm_35 %s 2>&1 \
|
|
|
|
// RUN: | FileCheck -check-prefix=BIN2 %s
|
|
|
|
// BIN2: # "nvptx64-nvidia-cuda" - "clang",{{.*}} output:
|
|
|
|
// BIN2-NOT: cuda-bindings-device-cuda-nvptx64
|
|
|
|
// BIN2: # "nvptx64-nvidia-cuda" - "NVPTX::Assembler",{{.*}} output:
|
|
|
|
// BIN2-NOT: cuda-bindings-device-cuda-nvptx64
|
|
|
|
// BIN2: # "nvptx64-nvidia-cuda" - "clang",{{.*}} output:
|
|
|
|
// BIN2-NOT: cuda-bindings-device-cuda-nvptx64
|
|
|
|
// BIN2: # "nvptx64-nvidia-cuda" - "NVPTX::Assembler",{{.*}} output:
|
|
|
|
// BIN2-NOT: cuda-bindings-device-cuda-nvptx64
|
|
|
|
// BIN2: # "nvptx64-nvidia-cuda" - "NVPTX::Linker",{{.*}} output:
|
|
|
|
// BIN2-NOT: cuda-bindings-device-cuda-nvptx64
|
|
|
|
// BIN2: # "powerpc64le-ibm-linux-gnu" - "clang",{{.*}} output:
|
|
|
|
// BIN2-NOT: cuda-bindings-device-cuda-nvptx64
|
|
|
|
// BIN2: # "powerpc64le-ibm-linux-gnu" - "GNU::Linker", inputs:{{.*}}, output: "a.out"
|
|
|
|
|
|
|
|
//
|
|
|
|
// Test two gpu architectures up to the assemble phase.
|
|
|
|
//
|
|
|
|
// RUN: %clang -target powerpc64le-ibm-linux-gnu -ccc-print-bindings \
|
|
|
|
// RUN: --cuda-gpu-arch=sm_30 --cuda-gpu-arch=sm_35 %s -S 2>&1 \
|
|
|
|
// RUN: | FileCheck -check-prefix=ASM2 %s
|
[Driver][OpenMP] Add support to create jobs for unbundling actions.
Summary:
This patch adds the support to create jobs for the `OffloadBundlingAction` which will invoke the `clang-offload-bundler` tool to unbundle input files.
Unlike other actions, unbundling actions have multiple outputs. Therefore, this patch adds the required changes to have a variant of `Tool::ConstructJob` with multiple outputs.
The way the naming of the results is implemented is also slightly modified so that the same action can use a different offloading prefix for each use by the different offloading actions.
With this patch, it is possible to compile a functional OpenMP binary with offloading support, even with separate compilation.
Reviewers: echristo, tra, jlebar, ABataev, hfinkel
Subscribers: mkuron, whchung, mehdi_amini, cfe-commits, Hahnfeld, andreybokhanko, arpith-jacob, carlo.bertolli, caomhin
Differential Revision: https://reviews.llvm.org/D21857
llvm-svn: 285326
2016-10-28 02:14:55 +08:00
|
|
|
// ASM2-DAG: # "nvptx64-nvidia-cuda" - "clang",{{.*}} output: "cuda-bindings-cuda-nvptx64-nvidia-cuda-sm_30.s"
|
|
|
|
// ASM2-DAG: # "nvptx64-nvidia-cuda" - "clang",{{.*}} output: "cuda-bindings-cuda-nvptx64-nvidia-cuda-sm_35.s"
|
[CUDA][OpenMP] Add a generic offload action builder
Summary:
This patch proposes a new class to generate and record action dependences related with offloading. The builder provides three main functionalities:
- Add device dependences to host actions.
- Add host dependence to device actions.
- Register device top-level actions.
The constructor of the builder detect the programming models that should be supported, and generates a specialized builder for each. If a new programming model is to be added in the future, only a new specialized builder has to be implemented.
When the specialized builder is generated, it produces programming-model-specific diagnostics.
A CUDA specialized builder is proposed in the patch that mostly consists of the partition of the current `buildCudaAction` by the three different functionalities.
Reviewers: tra, echristo, ABataev, jlebar, hfinkel
Subscribers: Hahnfeld, whchung, guansong, jlebar, mehdi_amini, andreybokhanko, tcramer, mkuron, cfe-commits, arpith-jacob, carlo.bertolli, caomhin
Differential Revision: https://reviews.llvm.org/D18172
llvm-svn: 282865
2016-09-30 23:34:19 +08:00
|
|
|
// ASM2-DAG: # "powerpc64le-ibm-linux-gnu" - "clang",{{.*}} output: "cuda-bindings.s"
|
2016-08-23 02:50:34 +08:00
|
|
|
|
|
|
|
//
|
|
|
|
// Test one or more gpu architecture with complete compilation in host-only
|
|
|
|
// compilation mode.
|
|
|
|
//
|
|
|
|
// RUN: %clang -target powerpc64le-ibm-linux-gnu -ccc-print-bindings \
|
|
|
|
// RUN: --cuda-gpu-arch=sm_30 %s --cuda-host-only 2>&1 \
|
|
|
|
// RUN: | FileCheck -check-prefix=HBIN %s
|
|
|
|
// RUN: %clang -target powerpc64le-ibm-linux-gnu -ccc-print-bindings \
|
|
|
|
// RUN: --cuda-gpu-arch=sm_30 --cuda-gpu-arch=sm_35 %s --cuda-host-only 2>&1 \
|
|
|
|
// RUN: | FileCheck -check-prefix=HBIN %s
|
|
|
|
// HBIN: # "powerpc64le-ibm-linux-gnu" - "clang",{{.*}} output:
|
|
|
|
// HBIN-NOT: cuda-bindings-device-cuda-nvptx64
|
|
|
|
// HBIN: # "powerpc64le-ibm-linux-gnu" - "GNU::Linker", inputs:{{.*}}, output: "a.out"
|
|
|
|
|
|
|
|
//
|
|
|
|
// Test one or more gpu architecture up to the assemble phase in host-only
|
|
|
|
// compilation mode.
|
|
|
|
//
|
|
|
|
// RUN: %clang -target powerpc64le-ibm-linux-gnu -ccc-print-bindings \
|
|
|
|
// RUN: --cuda-gpu-arch=sm_30 %s --cuda-host-only -S 2>&1 \
|
|
|
|
// RUN: | FileCheck -check-prefix=HASM %s
|
|
|
|
// RUN: %clang -target powerpc64le-ibm-linux-gnu -ccc-print-bindings \
|
|
|
|
// RUN: --cuda-gpu-arch=sm_30 --cuda-gpu-arch=sm_35 %s --cuda-host-only -S 2>&1 \
|
|
|
|
// RUN: | FileCheck -check-prefix=HASM %s
|
|
|
|
// HASM: # "powerpc64le-ibm-linux-gnu" - "clang",{{.*}} output: "cuda-bindings.s"
|
|
|
|
|
|
|
|
//
|
|
|
|
// Test single gpu architecture with complete compilation in device-only
|
|
|
|
// compilation mode.
|
|
|
|
//
|
|
|
|
// RUN: %clang -target powerpc64le-ibm-linux-gnu -ccc-print-bindings \
|
|
|
|
// RUN: --cuda-gpu-arch=sm_30 %s --cuda-device-only 2>&1 \
|
|
|
|
// RUN: | FileCheck -check-prefix=DBIN %s
|
|
|
|
// DBIN: # "nvptx64-nvidia-cuda" - "clang",{{.*}} output:
|
|
|
|
// DBIN-NOT: cuda-bindings-device-cuda-nvptx64
|
[Driver][OpenMP] Add support to create jobs for unbundling actions.
Summary:
This patch adds the support to create jobs for the `OffloadBundlingAction` which will invoke the `clang-offload-bundler` tool to unbundle input files.
Unlike other actions, unbundling actions have multiple outputs. Therefore, this patch adds the required changes to have a variant of `Tool::ConstructJob` with multiple outputs.
The way the naming of the results is implemented is also slightly modified so that the same action can use a different offloading prefix for each use by the different offloading actions.
With this patch, it is possible to compile a functional OpenMP binary with offloading support, even with separate compilation.
Reviewers: echristo, tra, jlebar, ABataev, hfinkel
Subscribers: mkuron, whchung, mehdi_amini, cfe-commits, Hahnfeld, andreybokhanko, arpith-jacob, carlo.bertolli, caomhin
Differential Revision: https://reviews.llvm.org/D21857
llvm-svn: 285326
2016-10-28 02:14:55 +08:00
|
|
|
// DBIN: # "nvptx64-nvidia-cuda" - "NVPTX::Assembler",{{.*}} output: "cuda-bindings-cuda-nvptx64-nvidia-cuda-sm_30.o"
|
2016-08-23 02:50:34 +08:00
|
|
|
|
|
|
|
//
|
|
|
|
// Test single gpu architecture up to the assemble phase in device-only
|
|
|
|
// compilation mode.
|
|
|
|
//
|
|
|
|
// RUN: %clang -target powerpc64le-ibm-linux-gnu -ccc-print-bindings \
|
|
|
|
// RUN: --cuda-gpu-arch=sm_30 %s --cuda-device-only -S 2>&1 \
|
|
|
|
// RUN: | FileCheck -check-prefix=DASM %s
|
[Driver][OpenMP] Add support to create jobs for unbundling actions.
Summary:
This patch adds the support to create jobs for the `OffloadBundlingAction` which will invoke the `clang-offload-bundler` tool to unbundle input files.
Unlike other actions, unbundling actions have multiple outputs. Therefore, this patch adds the required changes to have a variant of `Tool::ConstructJob` with multiple outputs.
The way the naming of the results is implemented is also slightly modified so that the same action can use a different offloading prefix for each use by the different offloading actions.
With this patch, it is possible to compile a functional OpenMP binary with offloading support, even with separate compilation.
Reviewers: echristo, tra, jlebar, ABataev, hfinkel
Subscribers: mkuron, whchung, mehdi_amini, cfe-commits, Hahnfeld, andreybokhanko, arpith-jacob, carlo.bertolli, caomhin
Differential Revision: https://reviews.llvm.org/D21857
llvm-svn: 285326
2016-10-28 02:14:55 +08:00
|
|
|
// DASM: # "nvptx64-nvidia-cuda" - "clang",{{.*}} output: "cuda-bindings-cuda-nvptx64-nvidia-cuda-sm_30.s"
|
2016-08-23 02:50:34 +08:00
|
|
|
|
|
|
|
//
|
|
|
|
// Test two gpu architectures with complete compilation in device-only
|
|
|
|
// compilation mode.
|
|
|
|
//
|
|
|
|
// RUN: %clang -target powerpc64le-ibm-linux-gnu -ccc-print-bindings \
|
|
|
|
// RUN: --cuda-gpu-arch=sm_30 --cuda-gpu-arch=sm_35 %s --cuda-device-only 2>&1 \
|
|
|
|
// RUN: | FileCheck -check-prefix=DBIN2 %s
|
|
|
|
// DBIN2: # "nvptx64-nvidia-cuda" - "clang",{{.*}} output:
|
|
|
|
// DBIN2-NOT: cuda-bindings-device-cuda-nvptx64
|
[Driver][OpenMP] Add support to create jobs for unbundling actions.
Summary:
This patch adds the support to create jobs for the `OffloadBundlingAction` which will invoke the `clang-offload-bundler` tool to unbundle input files.
Unlike other actions, unbundling actions have multiple outputs. Therefore, this patch adds the required changes to have a variant of `Tool::ConstructJob` with multiple outputs.
The way the naming of the results is implemented is also slightly modified so that the same action can use a different offloading prefix for each use by the different offloading actions.
With this patch, it is possible to compile a functional OpenMP binary with offloading support, even with separate compilation.
Reviewers: echristo, tra, jlebar, ABataev, hfinkel
Subscribers: mkuron, whchung, mehdi_amini, cfe-commits, Hahnfeld, andreybokhanko, arpith-jacob, carlo.bertolli, caomhin
Differential Revision: https://reviews.llvm.org/D21857
llvm-svn: 285326
2016-10-28 02:14:55 +08:00
|
|
|
// DBIN2: # "nvptx64-nvidia-cuda" - "NVPTX::Assembler",{{.*}} output: "cuda-bindings-cuda-nvptx64-nvidia-cuda-sm_30.o"
|
2016-08-23 02:50:34 +08:00
|
|
|
// DBIN2: # "nvptx64-nvidia-cuda" - "clang",{{.*}} output:
|
|
|
|
// DBIN2-NOT: cuda-bindings-device-cuda-nvptx64
|
[Driver][OpenMP] Add support to create jobs for unbundling actions.
Summary:
This patch adds the support to create jobs for the `OffloadBundlingAction` which will invoke the `clang-offload-bundler` tool to unbundle input files.
Unlike other actions, unbundling actions have multiple outputs. Therefore, this patch adds the required changes to have a variant of `Tool::ConstructJob` with multiple outputs.
The way the naming of the results is implemented is also slightly modified so that the same action can use a different offloading prefix for each use by the different offloading actions.
With this patch, it is possible to compile a functional OpenMP binary with offloading support, even with separate compilation.
Reviewers: echristo, tra, jlebar, ABataev, hfinkel
Subscribers: mkuron, whchung, mehdi_amini, cfe-commits, Hahnfeld, andreybokhanko, arpith-jacob, carlo.bertolli, caomhin
Differential Revision: https://reviews.llvm.org/D21857
llvm-svn: 285326
2016-10-28 02:14:55 +08:00
|
|
|
// DBIN2: # "nvptx64-nvidia-cuda" - "NVPTX::Assembler",{{.*}} output: "cuda-bindings-cuda-nvptx64-nvidia-cuda-sm_35.o"
|
2016-08-23 02:50:34 +08:00
|
|
|
|
|
|
|
//
|
|
|
|
// Test two gpu architectures up to the assemble phase in device-only
|
|
|
|
// compilation mode.
|
|
|
|
//
|
|
|
|
// RUN: %clang -target powerpc64le-ibm-linux-gnu -ccc-print-bindings \
|
|
|
|
// RUN: --cuda-gpu-arch=sm_30 --cuda-gpu-arch=sm_35 %s --cuda-device-only -S 2>&1 \
|
|
|
|
// RUN: | FileCheck -check-prefix=DASM2 %s
|
[Driver][OpenMP] Add support to create jobs for unbundling actions.
Summary:
This patch adds the support to create jobs for the `OffloadBundlingAction` which will invoke the `clang-offload-bundler` tool to unbundle input files.
Unlike other actions, unbundling actions have multiple outputs. Therefore, this patch adds the required changes to have a variant of `Tool::ConstructJob` with multiple outputs.
The way the naming of the results is implemented is also slightly modified so that the same action can use a different offloading prefix for each use by the different offloading actions.
With this patch, it is possible to compile a functional OpenMP binary with offloading support, even with separate compilation.
Reviewers: echristo, tra, jlebar, ABataev, hfinkel
Subscribers: mkuron, whchung, mehdi_amini, cfe-commits, Hahnfeld, andreybokhanko, arpith-jacob, carlo.bertolli, caomhin
Differential Revision: https://reviews.llvm.org/D21857
llvm-svn: 285326
2016-10-28 02:14:55 +08:00
|
|
|
// DASM2: # "nvptx64-nvidia-cuda" - "clang",{{.*}} output: "cuda-bindings-cuda-nvptx64-nvidia-cuda-sm_30.s"
|
|
|
|
// DASM2: # "nvptx64-nvidia-cuda" - "clang",{{.*}} output: "cuda-bindings-cuda-nvptx64-nvidia-cuda-sm_35.s"
|