llvm-project

Go to file

Samuel Antao d06239d359 [CUDA][OpenMP] Create generic offload action Summary: This patch replaces the CUDA specific action by a generic offload action. The offload action may have multiple dependences classier in “host” and “device”. The way this generic offloading action is used is very similar to what is done today by the CUDA implementation: it is used to set a specific toolchain and architecture to its dependences during the generation of jobs. This patch also proposes propagating the offloading information through the action graph so that that information can be easily retrieved at any time during the generation of commands. This allows e.g. the "clang tool” to evaluate whether CUDA should be supported for the device or host and ptas to easily retrieve the target architecture. This is an example of how the action graphs would look like (compilation of a single CUDA file with two GPU architectures) ``` 0: input, "cudatests.cu", cuda, (host-cuda) 1: preprocessor, {0}, cuda-cpp-output, (host-cuda) 2: compiler, {1}, ir, (host-cuda) 3: input, "cudatests.cu", cuda, (device-cuda, sm_35) 4: preprocessor, {3}, cuda-cpp-output, (device-cuda, sm_35) 5: compiler, {4}, ir, (device-cuda, sm_35) 6: backend, {5}, assembler, (device-cuda, sm_35) 7: assembler, {6}, object, (device-cuda, sm_35) 8: offload, "device-cuda (nvptx64-nvidia-cuda:sm_35)" {7}, object 9: offload, "device-cuda (nvptx64-nvidia-cuda:sm_35)" {6}, assembler 10: input, "cudatests.cu", cuda, (device-cuda, sm_37) 11: preprocessor, {10}, cuda-cpp-output, (device-cuda, sm_37) 12: compiler, {11}, ir, (device-cuda, sm_37) 13: backend, {12}, assembler, (device-cuda, sm_37) 14: assembler, {13}, object, (device-cuda, sm_37) 15: offload, "device-cuda (nvptx64-nvidia-cuda:sm_37)" {14}, object 16: offload, "device-cuda (nvptx64-nvidia-cuda:sm_37)" {13}, assembler 17: linker, {8, 9, 15, 16}, cuda-fatbin, (device-cuda) 18: offload, "host-cuda (powerpc64le-unknown-linux-gnu)" {2}, "device-cuda (nvptx64-nvidia-cuda)" {17}, ir 19: backend, {18}, assembler 20: assembler, {19}, object 21: input, "cuda", object 22: input, "cudart", object 23: linker, {20, 21, 22}, image ``` The changes in this patch pass the existent regression tests (keeps the existent functionality) and resulting binaries execute correctly in a Power8+K40 machine. Reviewers: echristo, hfinkel, jlebar, ABataev, tra Subscribers: guansong, andreybokhanko, tcramer, mkuron, cfe-commits, arpith-jacob, carlo.bertolli, caomhin Differential Revision: https://reviews.llvm.org/D18171 llvm-svn: 275645		2016-07-15 23:13:27 +00:00
clang	[CUDA][OpenMP] Create generic offload action	2016-07-15 23:13:27 +00:00
clang-tools-extra	[clang-rename] fix testset	2016-07-15 12:22:38 +00:00
compiler-rt	[asan] trying to fix the windows build	2016-07-15 23:13:03 +00:00
debuginfo-tests	New round of fixes for "Always compile debuginfo-tests for the host triple"	2014-10-18 23:47:59 +00:00
libclc	configure: Remove device specific defines	2016-06-17 20:30:50 +00:00
libcxx	Constuct a sentry object in istream::readsome, and handle failures appropriately. Fixes PR#28217.	2016-07-13 16:58:48 +00:00
libcxxabi	libc++abi: add a top level option for using CompilerRT	2016-07-15 00:49:42 +00:00
libunwind	[PATCH] [libunwind][ehabi] Use early returns where possible.	2016-07-08 12:13:31 +00:00
lld	ELF: Include filenames in error messages.	2016-07-15 20:38:28 +00:00
lldb	Fixes for standalone build:	2016-07-15 22:46:15 +00:00
llgo	Upgrade all the .arcconfigs to https.	2016-07-14 13:15:37 +00:00
llvm	Reapply "Mips: Avoid implicit iterator conversions, NFC"	2016-07-15 23:09:47 +00:00
openmp	http://reviews.llvm.org/D22134 : Implementation of OpenMP 4.5 nonmonotonic schedule modifier	2016-07-11 10:44:57 +00:00
parallel-libs	Change mailing list names in README	2016-06-24 22:50:09 +00:00
polly	GPGPU: Format statements scheduled on the host ourselves	2016-07-15 17:12:41 +00:00