llvm-project

Commit Graph

Author	SHA1	Message	Date
Sergey Dmitriev	36bfdb7096	[Clang][Driver] Disable llvm passes for the first host OpenMP offload compilation Summary: With OpenMP offloading host compilation is done in two phases to capture host IR that is passed to all device compilations as input. But it turns out that we currently run entire LLVM optimization pipeline on host IR on both compilations which may have unpredictable effects on the resulting code. This patch fixes this problem by disabling LLVM passes on the first compilation, so the host IR that is passed to device compilations will be captured right after front end. Reviewers: ABataev, jdoerfert, hfinkel Reviewed By: ABataev Subscribers: guansong, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73721	2020-01-30 10:16:41 -08:00
Douglas Yung	0bb06f6f66	Slightly relax restriction on exact order arguments must appear. llvm-svn: 374627	2019-10-12 02:22:36 +00:00
Sergey Dmitriev	a0d83768f1	[Clang][OpenMP Offload] Add new tool for wrapping offload device binaries This patch removes the remaining part of the OpenMP offload linker scripts which was used for inserting device binaries into the output linked binary. Device binaries are now inserted into the host binary with the help of a wrapper bit-code file which contains device binaries as data. The wrapper bit-code file is dynamically created by the clang driver with the help of a new tool, clang-offload-wrapper, which takes device binaries as input and produces a bit-code file with the required contents. The wrapper bit-code is then compiled to an object, and the resulting object is added to the host link by the clang driver. This is the second part of the patch for eliminating OpenMP linker script (please see https://reviews.llvm.org/D64943). Differential Revision: https://reviews.llvm.org/D68166 llvm-svn: 374219	2019-10-09 20:42:58 +00:00
Sergey Dmitriev	4b343fd84c	[Clang][OpenMP Offload] Create start/end symbols for the offloading entry table with a help of a linker Linker automatically provides __start_<section name> and __stop_<section name> symbols to satisfy unresolved references if <section name> is representable as a C identifier (see https://sourceware.org/binutils/docs/ld/Input-Section-Example.html for details). These symbols indicate the start address and end address of the output section respectively. Therefore, renaming OpenMP offload entries section name from ".omp.offloading_entries" to "omp_offloading_entries" to use this feature. This is the first part of the patch for eliminating OpenMP linker script (please see https://reviews.llvm.org/D64943). Differential Revision: https://reviews.llvm.org/D68070 llvm-svn: 373118	2019-09-27 20:00:51 +00:00
Reid Kleckner	549ed544c3	[Driver] Move the "-o OUT -x TYPE SRC.c" flags to the end of -cc1 New -cc1 arguments, such as -faddrsig, have started appearing after the input name. I personally find it convenient for the input to be the last argument to the compile command line, since I often need to edit it when running crash reproduction scripts. Differential Revision: https://reviews.llvm.org/D62270 llvm-svn: 361530	2019-05-23 18:35:43 +00:00
Douglas Yung	48d680dd56	Further relax restriction in tests to include where "-E" and "-S" must appear. Also updated a few instances of "-emit-llvm-bc" and "-emit-obj" that were missed in the previous change. llvm-svn: 354063	2019-02-14 21:37:19 +00:00
Douglas Yung	607a1b2234	Relax restriction in tests to where "-emit-llvm-bc" and "-emit-obj" must appear. The CHECK lines as structured were requiring them to appear only in a certain position while all that is really needed is to check that they are present. llvm-svn: 354001	2019-02-14 01:11:32 +00:00
Alexey Bataev	a5178f5369	[DRIVER][OFFLOAD] Do not invoke unbundler on unsupported file types. clang-offload-bundler should not be invoked with the unbundling action when the input file type does not match the action type. For example, .so files should be unbundled during linking phase and should be linked only with the host code. llvm-svn: 343335	2018-09-28 16:17:59 +00:00
Alexey Bataev	3dfc993437	Revert "[DRIVER][OFFLOAD] Do not invoke unbundler on unsupported file types." It reverts commit r342991 + several other commits intended to fix the tests. Still have some failed tests, need to investigate it. llvm-svn: 343002	2018-09-25 18:31:56 +00:00
Alexey Bataev	a55471138d	[OPENMP] Fix the test, NFC. Fixed test to pacify buildbot. llvm-svn: 342996	2018-09-25 17:58:08 +00:00
Alexey Bataev	99de44bc54	[OPENMP] Fix failed test, NFC. llvm-svn: 342995	2018-09-25 17:47:53 +00:00
Alexey Bataev	464ab241e7	[DRIVER][OFFLOAD] Do not invoke unbundler on unsupported file types. clang-offload-bundler should not be invoked with the unbundling action when the input file type does not match the action type. For example, .so files should be unbundled during linking phase and should be linked only with the host code. llvm-svn: 342991	2018-09-25 17:09:17 +00:00
Petr Hosek	7b27454477	[ADT] Normalize empty triple components LLVM triple normalization is handling "unknown" and empty components differently; for example given "x86_64-unknown-linux-gnu" and "x86_64-linux-gnu" which should be equivalent, triple normalization returns "x86_64-unknown-linux-gnu" and "x86_64--linux-gnu". autoconf's config.sub returns "x86_64-unknown-linux-gnu" for both "x86_64-linux-gnu" and "x86_64-unknown-linux-gnu". This changes the triple normalization to behave the same way, replacing empty triple components with "unknown". This addresses PR37129. Differential Revision: https://reviews.llvm.org/D50219 llvm-svn: 339294	2018-08-08 22:23:57 +00:00
Jonas Hahnfeld	cbbcd7fc1b	[test] Pass in fixed triple for openmp-offload.c This should fix the test on other architectures. Related to: https://reviews.llvm.org/D38372 llvm-svn: 314904	2017-10-04 13:54:09 +00:00
Jonas Hahnfeld	bbf56fb621	[OpenMP] Fix passing of -m arguments correctly The recent fix in D38258 was wrong: getAuxTriple() only returns non-null values for the CUDA toolchain. That is why the now added test for PPC and X86 failed. Differential Revision: https://reviews.llvm.org/D38372 llvm-svn: 314902	2017-10-04 13:32:59 +00:00
Jonas Hahnfeld	757e61fa4f	[OpenMP] Fix passing of -m arguments to device toolchain AuxTriple is not set if host and device share a toolchain. Also, removing an argument modifies the DAL which needs to be returned for future use. (Move tests back to offload-openmp.c as they are not related to GPUs.) Differential Revision: https://reviews.llvm.org/D38258 llvm-svn: 314329	2017-09-27 18:12:34 +00:00
Jonas Hahnfeld	21b60edb05	Fix memory leak in ToolChain::TranslateOpenMPTargetArgs rL310433 introduced a code path where DAL is not returned and must be freed. This change allows to run openmp-offload.c when Clang is built with ASan. llvm-svn: 310817	2017-08-14 07:44:05 +00:00
Alex Shlyapnikov	a9f6a52925	Disabling openmp-offload.c on linux until it is stabilized on all local configurations. Differential revision: https://reviews.llvm.org/D29660 llvm-svn: 310772	2017-08-11 23:10:39 +00:00
Gheorghe-Teodor Bercea	0499212ebf	[OpenMP] Move failing flag tests to disabled GPU offloading test file. This should prevent further errors with the sanitizer. Diff: D29660 llvm-svn: 310765	2017-08-11 21:17:50 +00:00
Gheorghe-Teodor Bercea	9c52574886	[OpenMP] Enable previously successful offloading tests. Create a separate test file to contain all tests for OpenMP offloading to GPUs. Make libdevice checking more robust by accounting for the case in which no libdevice is found. These changes are in connection with diff: D29660 llvm-svn: 310718	2017-08-11 15:46:22 +00:00
Alex Shlyapnikov	5f1ac1444b	Disabling openmp-offload.c on linux until it is stabilized on all local configurations. llvm-svn: 310640	2017-08-10 17:55:01 +00:00
Gheorghe-Teodor Bercea	14528c60ba	[OpenMP] Delete tests in openmp-offload.c which cause failures until a better way to perform these tests is figured out. Change connected to diff: D29654 llvm-svn: 310625	2017-08-10 16:56:59 +00:00
Alex Lorenz	994f231792	Revert r310489 and follow-up commits r310505, r310519, r310537 and r310549 Commit r310489 caused 'openmp-offload.c' test failures on Darwin and other platforms: http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental_check/39230/testReport/junit/Clang/Driver/openmp_offload_c/ The follow-up commits tried to fix the test, but the test is still failing. llvm-svn: 310580	2017-08-10 10:34:46 +00:00
Gheorghe-Teodor Bercea	57dbc55c88	[OpenMP] Remove offending test. Diff: D29660 llvm-svn: 310537	2017-08-09 23:47:41 +00:00
Gheorghe-Teodor Bercea	94ab42b49c	[OpenMP] Fix failing test for D29660. Non-functional change. llvm-svn: 310519	2017-08-09 20:52:58 +00:00
Gheorghe-Teodor Bercea	9d652e9294	[OpenMP] Make the PTX version tests general enough to work on all toolchains. Add explicit test for Darwin and PowerPC. Clean-up tests. Non-functional change. Original diff: D29660 llvm-svn: 310505	2017-08-09 18:25:52 +00:00
Gheorghe-Teodor Bercea	6b26dcb6d6	[OpenMP] Add flag for overwriting default PTX version for OpenMP targets Summary: This flag "--fopenmp-ptx=" enables the overwriting of the default PTX version used for GPU offloaded OpenMP target regions: "+ptx42". Reviewers: arpith-jacob, caomhin, carlo.bertolli, ABataev, Hahnfeld, jlebar, hfinkel, tstellar Reviewed By: ABataev Subscribers: rengolin, cfe-commits Differential Revision: https://reviews.llvm.org/D29660 llvm-svn: 310489	2017-08-09 15:56:54 +00:00
Gheorghe-Teodor Bercea	0846582878	[OpenMP] Add flag for disabling the default generation of relocatable OpenMP target code for NVIDIA GPUs. Summary: Previously we have added the "-c" flag which gets passed to PTXAS by default to generate relocatable OpenMP target code by default. This set of flags exposes control over this behaviour. Reviewers: arpith-jacob, caomhin, carlo.bertolli, ABataev, Hahnfeld, jlebar, hfinkel, tstellar Reviewed By: ABataev Subscribers: Hahnfeld, rengolin, cfe-commits Differential Revision: https://reviews.llvm.org/D29659 llvm-svn: 310484	2017-08-09 15:27:39 +00:00
Gheorghe-Teodor Bercea	b9d117233f	[OpenMP] Make OpenMP generated code for the NVIDIA device relocatable by default Original Diff: D29642 This patch was previously reverted due to an error with patch D29654 that this depends on. llvm-svn: 310479	2017-08-09 14:59:35 +00:00
Gheorghe-Teodor Bercea	5289843597	[OpenMP] Fix bug regarding cubin integration into host binary when a BindArchAction is used. This is not a functional change. Original Diff: D29654 llvm-svn: 310433	2017-08-09 01:02:19 +00:00
Gheorghe-Teodor Bercea	ee914104d9	Non-functional change. Fix test for D29654. llvm-svn: 310368	2017-08-08 15:13:07 +00:00
Gheorghe-Teodor Bercea	2c92693280	[OpenMP] OpenMP device offloading code generation produces a cubin file which is then integrated in the host binary using the host linker. Diff: D29654 llvm-svn: 310362	2017-08-08 14:33:05 +00:00
Alex Lorenz	7e9c478cda	Revert r310291, r310300 and r310332 because of test failure on Darwin The commit r310291 introduced the failure. r310332 was a test fix commit and r310300 was a followup commit. I reverted these two to avoid merge conflicts when reverting. The 'openmp-offload.c' test is failing on Darwin because the following run lines: // RUN: touch %t1.o // RUN: touch %t2.o // RUN: %clang -### -no-canonical-prefixes -fopenmp=libomp -fopenmp-targets=nvptx64-nvidia-cuda -save-temps -no-canonical-prefixes %t1.o %t2.o 2>&1 \ // RUN: | FileCheck -check-prefix=CHK-TWOCUBIN %s trigger the following assertion: Driver.cpp:3418: assert(CachedResults.find(ActionTC) != CachedResults.end() && "Result does not exist??"); llvm-svn: 310345	2017-08-08 11:20:17 +00:00
Reid Kleckner	908ac3916d	Fix openmp-offload.c test on Windows llvm-svn: 310332	2017-08-08 01:36:16 +00:00
Gheorghe-Teodor Bercea	ceb422236a	[OpenMP] Make OpenMP generated code for the NVIDIA device relocatable by default Summary: When device offloading is enabled and the device is an NVIDIA GPU, OpenMP target regions must be compiled with relocation enabled by passing the "-c" flag to the PTXAS invocation. Reviewers: arpith-jacob, caomhin, carlo.bertolli, ABataev, Hahnfeld, jlebar, hfinkel, tstellar Reviewed By: Hahnfeld Subscribers: Hahnfeld, rengolin, mkuron, cfe-commits Differential Revision: https://reviews.llvm.org/D29642 llvm-svn: 310300	2017-08-07 20:31:51 +00:00
Gheorghe-Teodor Bercea	4cdba82ee0	[OpenMP] Integrate OpenMP target region cubin into host binary Summary: OpenMP device offloading code generation produces a cubin file which is then integrated in the host binary using the host linker. Reviewers: arpith-jacob, caomhin, carlo.bertolli, ABataev, Hahnfeld, jlebar, rnk, hfinkel, tstellar Reviewed By: hfinkel Subscribers: sfantao, rnk, rengolin, cfe-commits Differential Revision: https://reviews.llvm.org/D29654 llvm-svn: 310291	2017-08-07 20:01:48 +00:00
Benjamin Kramer	4504fe2449	Add some missing -no-canonical-prefixes. llvm-svn: 310278	2017-08-07 18:31:01 +00:00
Gheorghe-Teodor Bercea	47e0cf378c	[OpenMP] Add flag for specifying the target device architecture for OpenMP device offloading Summary: OpenMP has the ability to offload target regions to devices which may have different architectures. A new -fopenmp-target-arch flag is introduced to specify the device architecture. In this patch I use the new flag to specify the compute capability of the underlying NVIDIA architecture for the OpenMP offloading CUDA tool chain. Only a host-offloading test is provided since full device offloading capability will only be available when D29654 (https://reviews.llvm.org/D29654) lands. Reviewers: hfinkel, Hahnfeld, carlo.bertolli, caomhin, ABataev Reviewed By: hfinkel Subscribers: guansong, cfe-commits Tags: #openmp Differential Revision: https://reviews.llvm.org/D34784 llvm-svn: 310263	2017-08-07 15:39:11 +00:00
Eric Christopher	b11a3222e8	Add -no-canonical-prefixes to the test line so that we can handle different build modes. llvm-svn: 306790	2017-06-30 06:03:47 +00:00
Reid Kleckner	b640ab92c2	Fix openmp-offload.c test on Windows llvm-svn: 306751	2017-06-29 22:31:16 +00:00
Gheorghe-Teodor Bercea	7916da2347	[OpenMP] Fix test for revision D29645. NFC llvm-svn: 306724	2017-06-29 18:49:16 +00:00
Gheorghe-Teodor Bercea	3addb7d850	[OpenMP] Pass -fopenmp-is-device to preprocessing and machine specific code generation stages Summary: The preprocessing and code generation and optimization stages of the compiler are also passed the "-fopenmp-is-device" flag. This is used to trigger machine specific preprocessing and code generation when performing device offloading to an NVIDIA GPU via OpenMP directives. Reviewers: arpith-jacob, caomhin, carlo.bertolli, Hahnfeld, hfinkel, tstellar Reviewed By: Hahnfeld Subscribers: Hahnfeld, rengolin Differential Revision: https://reviews.llvm.org/D29645 llvm-svn: 306691	2017-06-29 15:59:19 +00:00
Gheorghe-Teodor Bercea	59d7b77b16	[OpenMP] Add support for auxiliary triple specification Summary: Device offloading requires the specification of an additional flag containing the triple of the //other// architecture the code is being compiled on if such an architecture exists. If compiling for the host, the auxiliary triple flag will contain the triple describing the device and vice versa. Reviewers: arpith-jacob, sfantao, caomhin, carlo.bertolli, ABataev, Hahnfeld, jlebar, hfinkel, tstellar Reviewed By: Hahnfeld Subscribers: rengolin, cfe-commits Differential Revision: https://reviews.llvm.org/D29339 llvm-svn: 306689	2017-06-29 15:49:03 +00:00
Alexey Bataev	9780faaf8f	[OpenMP][Driver] Put target binary for each offload target into a separate section, by Sergey Dmitriev Linker script that is generated by the clang driver for creating fat binary puts target binaries for all offload targets into a single ELF section .omp_offloading. This is not convenient because it greatly complicates operations with the final fat binary once it is linked. For example extracting target binary for a particular target from such fat executable would not be an easy task if you have more than one offload target. Attached patch changes clang driver to put target binary for each offload target into a separate ELF section .omp_offloading.<target triple>. Differential Revision: https://reviews.llvm.org/D33254 llvm-svn: 304229	2017-05-30 18:57:51 +00:00
Rafael Espindola	ee908d21f8	Fix this test when we have clang-offload-bundler.exe. llvm-svn: 285561	2016-10-31 11:47:37 +00:00
NAKAMURA Takumi	720033ad19	clang/test/Driver/openmp-offload.c: Relax expressions if "ld.exe" exists, like mingw. llvm-svn: 285511	2016-10-30 02:58:48 +00:00
Samuel Antao	5e6b413a26	Define extra variable in OpenMP offloading driver tests. llvm-svn: 285408	2016-10-28 15:42:38 +00:00
Samuel Antao	cf5bac8766	Change OpenMP offload driver tests so that it doesn't use the full file path during tests. This was causing failures on windows bots. llvm-svn: 285404	2016-10-28 15:11:50 +00:00
Benjamin Kramer	0137bcb958	[openmp] Remove test assumption that canonical binary name contains "clang" Patch by Sam McCall! Differential Revision: https://reviews.llvm.org/D26067 llvm-svn: 285388	2016-10-28 09:20:02 +00:00
Samuel Antao	1b0ebed572	Use -fopenmp=libomp in all OpenMP offloading tests. This will make sure the right features are being tested even for machines that default to libgomp. llvm-svn: 285327	2016-10-27 18:29:57 +00:00
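
Commit 7b27454477 above describes replacing empty triple components with "unknown" during normalization. The rule as stated in that message might be sketched roughly as follows. This is only an illustration in Python, not LLVM's implementation: the function name `normalize_empty_components` is hypothetical, and the sketch handles only components that are present but empty (as in "x86_64--linux-gnu"). It does not attempt the component recognition needed to expand "x86_64-linux-gnu".

```python
def normalize_empty_components(triple: str) -> str:
    """Replace empty dash-separated components of a triple with "unknown".

    Hypothetical helper for illustration only; real triple normalization
    also reorders and recognizes components, which is not modeled here.
    """
    parts = triple.split("-")
    # An empty string between two dashes marks a component left blank.
    return "-".join(part if part else "unknown" for part in parts)


if __name__ == "__main__":
    for t in ("x86_64--linux-gnu", "x86_64-unknown-linux-gnu"):
        print(t, "->", normalize_empty_components(t))
```

Under this rule, the two spellings from the commit message, "x86_64--linux-gnu" and "x86_64-unknown-linux-gnu", map to the same string, which is the equivalence the commit aims for.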

58 Commits