llvm-project

Commit Graph

Author	SHA1	Message	Date
Alexey Bataev	2ba67045e3	[OPENMP] Generalize capturing of clauses expressions. The handling and capturing of the non-constant expressions of some of the capturable clauses in combined directives is generalized. llvm-svn: 319227	2017-11-28 21:11:44 +00:00
Alexey Bataev	0b49f9e489	[OPENMP] Codegen for `distribute parallel for simd` directive. Initial codegen for `#pragma omp distribute parallel for simd` directive and its clauses. llvm-svn: 319079	2017-11-27 19:38:58 +00:00
Alexey Bataev	10a5431239	[OPENMP] Improve handling of cancel directives in target-based constructs, NFC. Improved handling of cancel\|cancellation point directives inside target-based for directives. llvm-svn: 319046	2017-11-27 16:54:08 +00:00
Alexey Bataev	16e798873e	[OPENMP] Add support for cancel constructs in `target teams distribute parallel for`. Add support for cancel/cancellation point directives inside `target teams distribute parallel for` directives. llvm-svn: 318881	2017-11-22 21:12:03 +00:00
Alexey Bataev	dcb4b8fbc1	[OPENMP] Add support for cancel constructs in [teams] distribute parallel for directives. Added codegen/sema support for cancel constructs in [teams] distribute parallel for directives. llvm-svn: 318872	2017-11-22 20:19:50 +00:00
Alexey Bataev	438388c2ad	[OPENMP] Added missed checks for for [simd] based directives. Added missed checks/analysis for safelen/simdlen clauses + linear clause in for [simd] based directives. llvm-svn: 318860	2017-11-22 18:34:02 +00:00
Alexey Bataev	b45d43c397	[OPENMP] Do not mark captured variables as artificial in debug info. Captured variables should not be marked as artificial parameters in outlined functions in debug info. llvm-svn: 318843	2017-11-22 16:02:03 +00:00
Jonas Hahnfeld	891c7fb19d	[OpenMP] Adjust arguments of nvptx runtime functions In the future the compiler will analyze whether the OpenMP runtime needs to be (fully) initialized and avoid that overhead if possible. The functions already take an argument to transfer that information to the runtime, so pass in the default value 1. (This is needed for binary compatibility with libomptarget-nvptx currently being upstreamed.) Differential Revision: https://reviews.llvm.org/D40354 llvm-svn: 318836	2017-11-22 14:46:49 +00:00
Alexey Bataev	f9fc42e50b	[OPENMP] Codegen for `target teams` directive. Added codegen of the clauses for `target teams` directive. llvm-svn: 318834	2017-11-22 14:25:55 +00:00
Richard Trieu	430c96b67a	[OpenMP] Fix tests after r318789 Update use of __tgt_target that had some 32bit types updated to 64bit. llvm-svn: 318811	2017-11-21 22:53:19 +00:00
George Rokos	63bc9d6f66	[Clang][OpenMP] New clang/libomptarget map interface: new function signatures, clang-side This clang patch changes the __tgt_* API function signatures in preparation for the new map interface. Changes are: Device IDs 32bits --> 64bits, Flags 32bits --> 64bits Differential revision: https://reviews.llvm.org/D40281 llvm-svn: 318789	2017-11-21 18:25:12 +00:00
Jonas Hahnfeld	cfd162d8e5	Fix test/OpenMP/nvptx_data_sharing.cpp This was an oversight that stayed in the test from development. llvm-svn: 318779	2017-11-21 16:49:11 +00:00
Gheorghe-Teodor Bercea	eb89b1d46f	[OpenMP] Add implicit data sharing support when offloading to NVIDIA GPUs using OpenMP device offloading Summary: This patch is part of the development effort to add support in the current OpenMP GPU offloading implementation for implicitly sharing variables between a target region executed by the team master thread and the worker threads within that team. This patch is the first of three required for successfully performing the implicit sharing of master thread variables with the worker threads within a team. The remaining two patches are: - Patch D38978 to the LLVM NVPTX backend which ensures the lowering of shared variables to an device memory which allows the sharing of references; - Patch (coming soon) is a patch to libomptarget runtime library which ensures that a list of references to shared variables is properly maintained. A simple code snippet which illustrates an implicit data sharing situation is as follows: ``` #pragma omp target { // master thread only int v; #pragma omp parallel { // worker threads // use v } } ``` Variable v is implicitly shared from the team master thread which executes the code in between the target and parallel directives. The worker threads must operate on the latest version of v, including any updates performed by the master. The code generated in this patch relies on the LLVM NVPTX patch (mentioned above) which prevents v from being lowered in the thread local memory of the master thread thus making the reference to this variable un-shareable with the workers. This ensures that the code generated by this patch is correct. Since the parallel region is outlined the passing of arguments to the outlined regions must preserve the original order of arguments. The runtime therefore maintains a list of references to shared variables thus ensuring their passing in the correct order. The passing of arguments to the outlined parallel function is performed in a separate function which the data sharing infrastructure constructs in this patch. The function is inlined when optimizations are enabled. Reviewers: hfinkel, carlo.bertolli, arpith-jacob, Hahnfeld, ABataev, caomhin Reviewed By: ABataev Subscribers: cfe-commits, jholewinski Differential Revision: https://reviews.llvm.org/D38976 llvm-svn: 318773	2017-11-21 15:54:54 +00:00
Carlo Bertolli	62fae15600	[OpenMP] Initial implementation of code generation for pragma 'teams distribute parallel for' on host https://reviews.llvm.org/D40187 This patch implements code gen for 'teams distribute parallel for' on the host, including all its clauses and related regression tests. llvm-svn: 318692	2017-11-20 20:46:39 +00:00
Jonas Hahnfeld	87d4426988	[OpenMP] Show error if VLAs are not supported Some target devices (e.g. Nvidia GPUs) don't support dynamic stack allocation and hence no VLAs. Print errors with description instead of failing in the backend or generating code that doesn't work. This patch handles explicit uses of VLAs (local variable in target or declare target region) or implicitly generated (private) VLAs for reductions on VLAs or on array sections with non-constant size. Differential Revision: https://reviews.llvm.org/D39505 llvm-svn: 318601	2017-11-18 21:00:46 +00:00
Alexey Bataev	f836537516	[OPENMP] Codegen for `target simd` construct. Added codegen support for `target simd` directive. llvm-svn: 318536	2017-11-17 17:57:25 +00:00
Alexey Bataev	2139ed638b	[OPENMP] Add support for cancelling inside target parallel for directive. Added missed support for cancelling of target parallel for construct. llvm-svn: 318434	2017-11-16 18:20:21 +00:00
Alexey Bataev	817d7f36e9	[OPENMP] Fix DSA analysis for threadprivates after deserialization. If threadprivate vaible is deserialized, it is not marked as threadprivate in DSAStack. llvm-svn: 318194	2017-11-14 21:01:01 +00:00
Carlo Bertolli	8760acb8e3	[NFC] Pacify bbot for OpenMP 'teams distribute parallel for' llvm-svn: 317898	2017-11-10 16:49:09 +00:00
Carlo Bertolli	3808ff743e	[OpenMP] Parse+Sema for copyin clause of 'teams distribute parallel for' https://reviews.llvm.org/D39902 Simply leverage existing implementation and verify correct functioning with two regression tests. llvm-svn: 317893	2017-11-10 16:05:00 +00:00
Alexey Bataev	77aed73c2d	[OpenMP] diagnose assign to firstprivate const, patch by Joel E. Denny Summary: [OpenMP] diagnose assign to firstprivate const Clang does not diagnose assignments to const variables declared firstprivate. Furthermore, codegen is broken such that, at run time, such assignments simply have no effect. For example, the following prints 0 not 1: int main() { const int i = 0; #pragma omp parallel firstprivate(i) { i=1; printf("%d\n", i); } return 0; } This commit makes these assignments a compile error, which is consistent with other OpenMP compilers I've tried (pgcc 17.4-0, gcc 6.3.0). Reviewers: ABataev Reviewed By: ABataev Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D39859 llvm-svn: 317891	2017-11-10 15:39:50 +00:00
Alexey Bataev	5d7edca316	[OPENMP] Codegen for `#pragma omp target parallel for simd`. Added codegen for `#pragma omp target parallel for simd` and clauses. llvm-svn: 317813	2017-11-09 17:32:15 +00:00
Alexey Bataev	9a5e64f56a	[OPENMP] Treat '#pragma omp target parallel for simd' as simd directive. `#pragma omp target parallel for simd` mistakenly was not treated as a simd directive, fixed this problem. llvm-svn: 317811	2017-11-09 17:01:35 +00:00
Alexey Bataev	fb0ebecf0e	[OPENMP] Codegen for `#pragma omp target parallel for`. llvm-svn: 317719	2017-11-08 20:16:14 +00:00
Alexey Bataev	5d2c9a46fc	[OPENMP] Fix PR35152: Do not use getInvokeDest() function for EH checks. The compiler may crash under some conditions if the getInvokeDest() is used, but later it is not used. Fixed this problem in OpenMP. llvm-svn: 317227	2017-11-02 18:55:05 +00:00
Alexey Bataev	0e1b45897e	[OPENMP] Fix PR35156: Get correct thread id with windows exceptions. If the thread id is requested in windows mode within funclets, we may generate incorrect function call that could lead to broken codegen. llvm-svn: 317208	2017-11-02 14:25:34 +00:00
Jonas Hahnfeld	96087f3825	[OpenMP] Extend "Avoid VLAs for reduction" optimization to VLAs as base We can generate constant sized arrays whenever the array section has constant length, even if the base expression itself is a VLA. Differential Revision: https://reviews.llvm.org/D39504 llvm-svn: 317207	2017-11-02 13:30:42 +00:00
Alexey Bataev	d070a585f6	[OPENMP] Improve debug info for taskgroup implicitly generated expressions. llvm-svn: 316585	2017-10-25 15:54:04 +00:00
Alexey Bataev	1b48c5e56b	[OPENMP] Fix PR35013: Fix passing VLAs captures to outlined functions. Fixed passing of VLAs and variably-modified types to outlined functions. Synchronized passing with the types codegen. llvm-svn: 316488	2017-10-24 19:52:31 +00:00
Jonas Hahnfeld	4525c82428	[OpenMP] Avoid VLAs for some reductions on array sections In some cases the compiler can deduce the length of an array section as constants. With this information, VLAs can be avoided in place of a constant sized array or even a scalar value if the length is 1. Example: int a[4], b[2]; pragma omp parallel reduction(+: a[1:2], b[1:1]) { } For chained array sections, this optimization is restricted to cases where all array sections except the last have a constant length 1. This trivially guarantees that there are no holes in the memory region that needs to be privatized. Example: int c[3][4]; pragma omp parallel reduction(+: c[1:1][1:2]) { } This relands commit r316229 that I reverted in r316235 because it failed on some bots. During investigation I found that this was because Clang and GCC evaluate the two arguments to emplace_back() in ReductionCodeGen::emitSharedLValue() in a different order, hence leading to a different order of generated instructions in the final LLVM IR. Fix this by passing in the arguments from temporary variables that are evaluated in a defined order. Differential Revision: https://reviews.llvm.org/D39136 llvm-svn: 316362	2017-10-23 19:01:35 +00:00
Jonas Hahnfeld	c95a6985bd	Revert "[OpenMP] Avoid VLAs for some reductions on array sections" This breaks at least two buildbots: http://lab.llvm.org:8011/builders/clang-cmake-x86_64-avx2-linux/builds/1175 http://lab.llvm.org:8011/builders/clang-atom-d525-fedora-rel/builds/10478 This reverts commit r316229 during local investigation. llvm-svn: 316235	2017-10-20 20:16:17 +00:00
Jonas Hahnfeld	b6229be460	[OpenMP] Avoid VLAs for some reductions on array sections In some cases the compiler can deduce the length of an array section as constants. With this information, VLAs can be avoided in place of a constant sized array or even a scalar value if the length is 1. Example: int a[4], b[2]; pragma omp parallel reduction(+: a[1:2], b[1:1]) { } For chained array sections, this optimization is restricted to cases where all array sections except the last have a constant length 1. This trivially guarantees that there are no holes in the memory region that needs to be privatized. Example: int c[3][4]; pragma omp parallel reduction(+: c[1:1][1:2]) { } Differential Revision: https://reviews.llvm.org/D39136 llvm-svn: 316229	2017-10-20 19:40:40 +00:00
Alexey Bataev	7ba57afd6e	[OPENMP] Fix capturing of boolean variables in debug mode. If the variables is boolean and we generating inner function with real types, the codegen may crash because of not loading boolean value from memory. llvm-svn: 316011	2017-10-17 16:47:34 +00:00
Jonas Hahnfeld	23604a8399	[OpenMP] Implement omp_is_initial_device() as builtin This allows to return the static value that we know at compile time. Differential Revision: https://reviews.llvm.org/D38968 llvm-svn: 316001	2017-10-17 14:28:14 +00:00
Alexey Bataev	a7b19157ba	[OPENMP] Fix PR34927: Emit initializer for reduction array with declare reduction. If the reduction is an array or an array section and reduction operation is declare reduction without initializer, it may lead to crash. llvm-svn: 315611	2017-10-12 20:03:39 +00:00
Alexey Bataev	7b0f1f09a9	[OPENMP] Fix PR34926: Fix handling of the array sections passed as function params. Codegen could crash if the array section base expression is the function parameter. llvm-svn: 315586	2017-10-12 15:18:41 +00:00
Alexey Bataev	311a928359	[OPENMP] Fix PR34925: Fix getting thread_id lvalue for inlined regions in C. If we try to get the lvalue for thread_id variables in inlined regions, we did not use the correct version of function. Fixed this bug by adding overrided version of the function getThreadIDVariableLValue for inlined regions. llvm-svn: 315578	2017-10-12 13:51:32 +00:00
Alexey Bataev	e213f3e61a	[OPENMP] Fix PR34916: Crash on mixing taskloop\|tasks directives. If both taskloop and task directives are used at the same time in one program, we may ran into the situation when the particular type for task directive is reused for taskloop directives. Patch fixes this problem. llvm-svn: 315464	2017-10-11 15:29:40 +00:00
Alexey Bataev	2a0c4f57dd	[OPENMP] Add default codegen\|tests for 'target parallel for[ simd]' constructs. Added default codegen for 'target parallel for' construct + tests for default codegen of 'target parallel for[ simd]' constructs. llvm-svn: 315315	2017-10-10 14:14:43 +00:00
Alexey Bataev	afe5057b59	[OPENMP] Do not capture local static variables. Previously we may erroneously try to capture locally declared static variables, which will lead to crash for target-based constructs. Patch fixes this problem. llvm-svn: 315076	2017-10-06 17:00:28 +00:00
Alexey Bataev	cab496d3a9	[OPENMP] Capture references to global variables. In C++11 variable to global variables are considered as constant expressions and these variables are not captured in the outlined regions. Patch allows capturing of such variables in the OpenMP regions. llvm-svn: 315074	2017-10-06 16:17:25 +00:00
Alexey Bataev	2fd0cb2ae7	[OPENMP] Fix mapping\|privatization of implicitly captured variables. If the `defaultmap(tofrom:scalar)` clause is specified, the scalars must be mapped with 'tofrom' modifiers, otherwise they must be captured as firstprivates. llvm-svn: 314995	2017-10-05 17:51:39 +00:00
Carlo Bertolli	ba1487ba69	[OpenMP] Initial implementation of teams distribute code generation https://reviews.llvm.org/D38371 This patch implements codegen for the combined 'teams distribute" OpenMP pragma and adds regression tests for all its clauses. llvm-svn: 314905	2017-10-04 14:12:09 +00:00
Alexey Bataev	502ec49793	[OPENMP] Allow use of declare target directive inside struct declaration. Patch allows using of the `#pragma omp declare target`\| `#pragma omp end declare target` directives inside the structures if we need to mark as declare target only some static members. llvm-svn: 314833	2017-10-03 20:00:00 +00:00
Alexey Bataev	931e19bf51	[OPENMP] Capture argument of `device` clause for target-based directives. The argument of the `device` clause in target-based executable directives must be captured to support codegen for the `target` directives with the `depend` clauses. llvm-svn: 314686	2017-10-02 16:32:39 +00:00
Alexey Bataev	5dadb79fcd	[OPENMP] Fix test, NFC. llvm-svn: 314673	2017-10-02 14:35:31 +00:00
Alexey Bataev	2a007e05a0	[OPENMP] Simplify codegen for non-offloading code. Simplified and generalized codegen for non-offloading part that works if offloading is failed or condition of the `if` clause is `false`. llvm-svn: 314670	2017-10-02 14:20:58 +00:00
Gheorghe-Teodor Bercea	965c7e9c6e	[OpenMP] Add an additional test for D34888 Summary: Test for checking if the mapping is performed correctly. This is a test initially included in Patch https://reviews.llvm.org/D29905 Reviewers: Hahnfeld, carlo.bertolli, caomhin, ABataev Reviewed By: Hahnfeld Subscribers: tra, cfe-commits Differential Revision: https://reviews.llvm.org/D38040 llvm-svn: 314303	2017-09-27 14:31:08 +00:00
Gheorghe-Teodor Bercea	db6decd456	Revert commmit 314228. llvm-svn: 314240	2017-09-26 19:58:23 +00:00
Gheorghe-Teodor Bercea	b379ba6a62	[OpenMP] Add an additional test for D34888 Summary: Test for checking if the mapping is performed correctly. This is a test initially included in Patch https://reviews.llvm.org/D29905 Reviewers: Hahnfeld, carlo.bertolli, caomhin Reviewed By: Hahnfeld Subscribers: tra, cfe-commits Differential Revision: https://reviews.llvm.org/D38040 llvm-svn: 314228	2017-09-26 18:12:12 +00:00
Alexey Bataev	5ec3893b3a	[OPENMP] Fix handling of implicit mapping of array sections. llvm-svn: 314220	2017-09-26 16:19:04 +00:00
Gheorghe-Teodor Bercea	1f29745315	Revert commit 314210. llvm-svn: 314215	2017-09-26 15:24:34 +00:00
Gheorghe-Teodor Bercea	9db6e861ac	[OpenMP] Add an additional test for D34888 Summary: Test for checking if the mapping is performed correctly. This is a test initially included in Patch https://reviews.llvm.org/D29905 Reviewers: Hahnfeld, carlo.bertolli, caomhin Reviewed By: Hahnfeld Subscribers: tra, cfe-commits Differential Revision: https://reviews.llvm.org/D38040 llvm-svn: 314210	2017-09-26 14:56:50 +00:00
Alexey Bataev	f47c4b4184	[OPENMP] Generate implicit map\|firstprivate clauses for target-based directives. If the variable is used in the target-based region but is not found in any private\|mapping clause, then generate implicit firstprivate\|map clauses for these implicitly mapped variables. llvm-svn: 314205	2017-09-26 13:47:31 +00:00
Alexey Bataev	b7f18c3297	[OPENMP] Handle re-declaration of captured variables in CodeGen. If the captured variable has re-declaration we may end up with the situation where the captured variable is the re-declaration while the referenced variable is the canonical declaration (or vice versa). In this case we may generate wrong code. Patch fixes this situation. llvm-svn: 313995	2017-09-22 16:56:13 +00:00
Wei Mi	b086289787	[Atomic][X8664] set max atomic inline width according to the target This is to fix PR31620. MaxAtomicInlineWidth is set to 128 for x86_64. However for target without cx16 support, 128 atomic operation will generate __sync_* libcalls. The patch set MaxAtomicInlineWidth to 64 if the target doesn't support cx16. Differential Revision: https://reviews.llvm.org/D38046 llvm-svn: 313992	2017-09-22 16:30:00 +00:00
Alexey Bataev	0dce2ea758	[OPENMP] Use canonical declarations for redeclarations checks. If the captured variable has some redeclarations we may run into the situation where the redeclaration is used instead of the canonical declaration and we may consider this variable as one not captured before. llvm-svn: 313880	2017-09-21 14:06:59 +00:00
Alexey Bataev	e85de8fcf9	[OPENMP] Support for re-declarations when checking captured variables. Need to check for variables re-declarations when checking that the variable was already captured in the captured region. llvm-svn: 313805	2017-09-20 20:11:31 +00:00
Rachel Craik	1cf49e4d12	[OpenMP] fix seg-faults printing diagnostics with invalid ordered(n) values When the value specified for n in ordered(n) is larger than the number of loops a segmentation fault can occur in one of two ways when attempting to print out a diagnostic for an associated depend(sink : vec): 1) The iteration vector vec contains less than n items 2) The iteration vector vec contains a variable that is not a loop control variable This patch addresses both of these issues. Differential Revision: https://reviews.llvm.org/D38049 llvm-svn: 313675	2017-09-19 21:04:23 +00:00
Alexey Bataev	36f2c4df12	[OPENMP] Fix types for the target specific parameters in debug mode. Used incorrect types for target specific parameters in debug mode, should use original pointers rather than the pointee types. llvm-svn: 313186	2017-09-13 20:20:59 +00:00
Alexey Bataev	ed94bd9223	[OPENMP] Allow all classes as mappable types. According to upcoming OpenMP 5.0 all classes/structs are now considered as mappable, even polymorphic and with static members. llvm-svn: 313141	2017-09-13 11:12:35 +00:00
Alexey Bataev	f43f714213	[OPENMP] Fix for PR33922: New ident_t flags for __kmpc_for_static_fini(). Added special flags for calls of __kmpc_for_static_fini(), like previous ly for __kmpc_for_static_init(). Added flag OMP_IDENT_WORK_DISTRIBUTE for distribute cnstruct, OMP_IDENT_WORK_SECTIONS for sections-based constructs and OMP_IDENT_WORK_LOOP for loop-based constructs in location flags. llvm-svn: 312642	2017-09-06 16:17:35 +00:00
Alexey Bataev	070f43aee7	[OPENMP] Fix for PR34445: Reduction initializer segfaults at runtime in move constructor. Previously user-defined reduction initializer was considered as an assignment expression, not as initializer. Fixed this by treating the initializer expression as an initializer. llvm-svn: 312638	2017-09-06 14:49:58 +00:00
Alexey Bataev	760e118634	[OPENMP] Fix the test, NFC. llvm-svn: 312296	2017-08-31 23:34:33 +00:00
Alexey Bataev	5372fb8cc1	[OPENMP] Fix for PR34398: assert with random access iterator if the step>1. If the loop is a loot with random access iterators and the iteration construct is represented it += n, then the compiler crashed because of reusing of the same MaterializedTemporaryExpr around N. Patch fixes it by using the expression as written, without any special kind of wrappings. llvm-svn: 312292	2017-08-31 23:06:52 +00:00
Alexey Bataev	61498fb88f	[OPENMP] Capture global variables in all target executable regions. Capturing of the global variables occurs only in target regions. Patch fixes it and allows capturing of globals in all target executable directives. llvm-svn: 312024	2017-08-29 19:30:57 +00:00
Alexey Bataev	fdc2035373	[OPENMP] Fix for PR34321: ustom OpenMP reduction in C++ template causes SEGFAULT at compile time Compiler crashed when tried to rebuild non-template expression in dependent context. llvm-svn: 311777	2017-08-25 15:43:55 +00:00
Reid Kleckner	6d353348e5	Parse and print DIExpressions inline to ease IR and MIR testing Summary: Most DIExpressions are empty or very simple. When they are complex, they tend to be unique, so checking them inline is reasonable. This also avoids the need for CodeGen passes to append to the llvm.dbg.mir named md node. See also PR22780, for making DIExpression not be an MDNode. Reviewers: aprantl, dexonsmith, dblaikie Subscribers: qcolombet, javed.absar, eraman, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D37075 llvm-svn: 311594	2017-08-23 20:31:27 +00:00
Alexey Bataev	6a71f364f1	[OPENMP] Fix for PR34014: OpenMP 4.5: Target construct in static method of class fails to map class static variable. If the global variable is captured and it has several redeclarations, sometimes it may lead to a compiler crash. Patch fixes this by working only with canonical declarations. llvm-svn: 311479	2017-08-22 17:54:52 +00:00
Alexey Bataev	8c3edfef6b	[OPENMP] Fix for PR28581: OpenMP linear clause - wrong results. If worksharing construct has at least one linear item, an implicit synchronization point must be emitted to avoid possible conflict with the loading/storing values to the original variables. Added implicit barrier if the linear item is found before actual start of the worksharing construct. llvm-svn: 311013	2017-08-16 15:58:46 +00:00
Alexey Bataev	aee18557f7	[OPRNMP] Fix for PR33445: ICE: OpenMP target containing ordered for. If exceptions are enabled, there may be a problem with the codegen of the finalization functions from OpenMP runtime. It happens because of the problem with the getting of thread identifier value. Patch tries to fix it by using the result of the call of function __kmpc_global_thread_num() rather than loading of value of outlined function parameter. llvm-svn: 311007	2017-08-16 14:01:00 +00:00
Alexey Bataev	07ed94a7c7	[OPENMP] Fix compiler crash on argument translate for NVPTX. When translating arguments for NVPTX target it is not taken into account that function may have variable number of arguments. Patch fixes this problem. llvm-svn: 310920	2017-08-15 14:34:04 +00:00
Alexey Bataev	0f87dbee4e	[OPENMP] Fix for PR33922: New ident_t flags for __kmpc_for_static_init(). OpenMP 5.0 will include OpenMP Tools interface that requires distinguishing different worksharing constructs. Since the same entry point (__kmp_for_static_init(ident_t *loc, kmp_int32 global_tid,........)) is called in case static loop/sections/distribute it is suggested using 'flags' field of the ident_t structure to pass the type of the construct. llvm-svn: 310865	2017-08-14 17:56:13 +00:00
Alexey Bataev	6e01dc1b84	[OPENMP][DEBUG] Fix for PR33676: Debug info for OpenMP region is broken. After some changes in clang/LLVM debug info for task-based regions was not generated at all. Patch fixes this problem. llvm-svn: 310850	2017-08-14 16:03:47 +00:00
Alexey Bataev	3c595a6b2c	[OPENMP] Generalization of calls of the outlined functions. General improvement of the outlined functions calls. llvm-svn: 310840	2017-08-14 15:01:03 +00:00
Alexey Bataev	e754b18f5e	[OPENMP] Emit non-debug version of outlined functions with original name. If the host code is compiled with the debug info, while the target without, there is a problem that the compiler is unable to find the debug wrapper. Patch fixes this problem by emitting special name for the debug version of the code. llvm-svn: 310511	2017-08-09 19:38:53 +00:00
Tim Northover	0241637c0e	Sema: disable implicit conversion from _Complex to real types in C++. Converting a _Complex type to a real one simply discards the imaginary part. This can easily lead to loss of information so for safety (and GCC compatibility) this patch disallows that when the conversion would be implicit. The one exception is bool, which actually compares both real and imaginary parts and so is safe. llvm-svn: 310427	2017-08-08 23:18:05 +00:00
Alexey Bataev	3b8d5586ec	[OPENMP][DEBUG] Set proper address space info if required by target. Arguments, passed to the outlined function, must have correct address space info for proper Debug info support. Patch sets global address space for arguments that are mapped and passed by reference. Also, cuda-gdb does not handle reference types correctly, so reference arguments are represented as pointers. llvm-svn: 310387	2017-08-08 18:04:06 +00:00
Alexey Bataev	4aa19052f3	Revert "[OPENMP][DEBUG] Set proper address space info if required by target." This reverts commit r310377. llvm-svn: 310379	2017-08-08 16:45:36 +00:00
Alexey Bataev	5a497136be	[OPENMP][DEBUG] Set proper address space info if required by target. Arguments, passed to the outlined function, must have correct address space info for proper Debug info support. Patch sets global address space for arguments that are mapped and passed by reference. Also, cuda-gdb does not handle reference types correctly, so reference arguments are represented as pointers. llvm-svn: 310377	2017-08-08 16:29:11 +00:00
Alexey Bataev	6a824b9a45	Revert "[OPENMP][DEBUG] Set proper address space info if required by target." This reverts commit r310360. llvm-svn: 310364	2017-08-08 14:44:43 +00:00
Alexey Bataev	59b81e51d3	[OPENMP][DEBUG] Set proper address space info if required by target. Arguments, passed to the outlined function, must have correct address space info for proper Debug info support. Patch sets global address space for arguments that are mapped and passed by reference. Also, cuda-gdb does not handle reference types correctly, so reference arguments are represented as pointers. llvm-svn: 310360	2017-08-08 14:25:14 +00:00
Gheorghe-Teodor Bercea	ef5e106fc1	[OpenMP] Error when trying to offload to an unsupported architecture Summary: Throw an error when offloading is unsupported for a particular target architecture. Reviewers: sfantao, caomhin, carlo.bertolli, ABataev, Hahnfeld Reviewed By: ABataev Subscribers: cfe-commits, rengolin Differential Revision: https://reviews.llvm.org/D32035 llvm-svn: 310307	2017-08-07 21:11:10 +00:00
Gheorghe-Teodor Bercea	e3b0a19d4f	[OpenMP] Prevent emission of exception handling code when using OpenMP to offload to NVIDIA devices. Summary: For the OpenMP toolchain which offloads to NVIDIA GPUs make sure that no exception handling code is emitted. Reviewers: arpith-jacob, sfantao, caomhin, carlo.bertolli, ABataev, Hahnfeld, hfinkel, tstellar Reviewed By: ABataev, Hahnfeld Subscribers: rengolin, Hahnfeld, cfe-commits Differential Revision: https://reviews.llvm.org/D29904 llvm-svn: 310306	2017-08-07 20:57:59 +00:00
Alexey Bataev	d90ec748a8	Revert "[OPENMP][DEBUG] Set proper address space info if required by target." This reverts commit r310104. llvm-svn: 310135	2017-08-04 21:27:11 +00:00
Alexey Bataev	be83fad57e	[OPENMP][DEBUG] Set proper address space info if required by target. Arguments, passed to the outlined function, must have correct address space info for proper Debug info support. Patch sets global address space for arguments that are mapped and passed by reference. Also, cuda-gdb does not handle reference types correctly, so reference arguments are represented as pointers. llvm-svn: 310104	2017-08-04 19:46:10 +00:00
Alexey Bataev	2c7eee5b84	[OPENMP] Unify generation of outlined function calls. llvm-svn: 310098	2017-08-04 19:10:54 +00:00
Alex Lorenz	45b4014711	Recommit r308327 3rd time: Add a warning for missing '#pragma pack (pop)' and suspicious uses of '#pragma pack' in included files The second recommit (r309106) was reverted because the "non-default #pragma pack value chages the alignment of struct or union members in the included file" warning proved to be too aggressive for external projects like Chromium (https://bugs.chromium.org/p/chromium/issues/detail?id=749197). This recommit makes the problematic warning a non-default one, and gives it the -Wpragma-pack-suspicious-include warning option. The first recommit (r308441) caused a "non-default #pragma pack value might change the alignment of struct or union members in the included file" warning in LLVM itself. This recommit tweaks the added warning to avoid warnings for #includes that don't have any records that are affected by the non-default alignment. This tweak avoids the previously emitted warning in LLVM. Original message: This commit adds a new -Wpragma-pack warning. It warns in the following cases: - When a translation unit is missing terminating #pragma pack (pop) directives. - When entering an included file if the current alignment value as determined by '#pragma pack' directives is different from the default alignment value. - When leaving an included file that changed the state of the current alignment value. rdar://10184173 Differential Revision: https://reviews.llvm.org/D35484 llvm-svn: 309386	2017-07-28 14:41:21 +00:00
Alexey Bataev	463a9fea1e	[OPENMP] Allow all lvalues in 'depend' clause. According to upcoming OpenMP 5.0 all addressable lvalue expressions are allowed in deoend clause. llvm-svn: 309309	2017-07-27 19:15:30 +00:00
Erich Keane	0026ed2f9c	Fix double destruction of objects when OpenMP construct is canceled When an omp for loop is canceled the constructed objects are being destructed twice. It looks like the desired code is: { Obj o; If (cancelled) branch-through-cleanups to cancel.exit. } [cleanups] cancel.exit: __kmpc_for_static_fini br cancel.cont (*) cancel.cont: __kmpc_barrier return The problem seems to be the branch to cancel.cont is currently also going through the cleanups calling them again. This change just does a direct branch instead. Patch By: michael.p.rice@intel.com Differential Revision: https://reviews.llvm.org/D35854 llvm-svn: 309288	2017-07-27 16:28:20 +00:00
Alexey Bataev	88202be1f0	[OPENMP] Codegen for 'in_reduction' clause. Added codegen for task-based directive with in_reduction clause. ``` <body> ``` The next code is emitted: ``` void td; ... td = call i8 @__kmpc_task_reduction_init(); ... <type> priv = (<type> )call i8* @__kmpc_task_reduction_get_th_data(i32 GTID, i8* td, i8* <orig>) ``` llvm-svn: 309270	2017-07-27 13:20:36 +00:00
Hans Wennborg	b4ece98a34	Revert r309106 "Recommit r308327 2nd time: Add a warning for missing" The warning fires on non-suspicious code in Chromium. Reverting until a solution is figured out. > Recommit r308327 2nd time: Add a warning for missing > '#pragma pack (pop)' and suspicious uses of '#pragma pack' in included files > > The first recommit (r308441) caused a "non-default #pragma pack value might > change the alignment of struct or union members in the included file" warning > in LLVM itself. This recommit tweaks the added warning to avoid warnings for > #includes that don't have any records that are affected by the non-default > alignment. This tweak avoids the previously emitted warning in LLVM. > > Original message: > > This commit adds a new -Wpragma-pack warning. It warns in the following cases: > > - When a translation unit is missing terminating #pragma pack (pop) directives. > - When entering an included file if the current alignment value as determined > by '#pragma pack' directives is different from the default alignment value. > - When leaving an included file that changed the state of the current alignment > value. > > rdar://10184173 > > Differential Revision: https://reviews.llvm.org/D35484 llvm-svn: 309186	2017-07-26 21:29:24 +00:00
Alex Lorenz	5d48424a30	Recommit r308327 2nd time: Add a warning for missing '#pragma pack (pop)' and suspicious uses of '#pragma pack' in included files The first recommit (r308441) caused a "non-default #pragma pack value might change the alignment of struct or union members in the included file" warning in LLVM itself. This recommit tweaks the added warning to avoid warnings for #includes that don't have any records that are affected by the non-default alignment. This tweak avoids the previously emitted warning in LLVM. Original message: This commit adds a new -Wpragma-pack warning. It warns in the following cases: - When a translation unit is missing terminating #pragma pack (pop) directives. - When entering an included file if the current alignment value as determined by '#pragma pack' directives is different from the default alignment value. - When leaving an included file that changed the state of the current alignment value. rdar://10184173 Differential Revision: https://reviews.llvm.org/D35484 llvm-svn: 309106	2017-07-26 12:20:57 +00:00
Alexey Bataev	3b1b8951b9	[OPENMP] Codegen for 'task_reduction' clause. Added codegen for taskgroup directive with task_reduction clause. ``` <body> ``` The next code is emitted: ``` %struct.kmp_task_red_input_t red_init[n]; void td; call void @__kmpc_taskgroup(%ident_t id, i32 gtid) ... red_init[i].shar = &<item>; red_init[i].size = sizeof(<item>); red_init[i].init = (void)initializer_function; red_init[i].fini = (void)destructor_function; red_init[i].comb = (void)combiner_function; red_init[i].flags = flags; ... td = call i8* @__kmpc_task_reduction_init(i32 gtid, i32 n, i8* (void)red_init); call void @__kmpc_end_taskgroup(%ident_t id, i32 gtid) void initializer_function(i8 priv) { (<type>)priv = <red_init>; ret void; } void destructor_function(i8* priv) { (<type>)priv->~(); ret void; } void combiner_function(i8 inout, i8* in) { (<type>)inout = (<type>)inout <red_id> (<type>)in; ret void; } ``` llvm-svn: 308979	2017-07-25 15:53:26 +00:00
Alexey Bataev	f189cb79d4	[OPENMP] Fix the restrictions for in_reduction clause. This patch allows to use in_reduction clause even if the innermost directive is not taskgroup. llvm-svn: 308883	2017-07-24 14:52:13 +00:00
Alexey Bataev	030f8b4c99	[OPENMP] Add ast-print tests for in_reduction clause, NFC. llvm-svn: 308771	2017-07-21 18:55:46 +00:00
Alexey Bataev	fa312f33f8	[OPENMP] Initial support for 'in_reduction' clause. Parsing/sema analysis for 'in_reduction' clause for task-based directives. llvm-svn: 308768	2017-07-21 18:48:21 +00:00
Alexey Bataev	effbdf1604	[OPENMP] Simplify analysis of data-sharing attributes. llvm-svn: 308759	2017-07-21 17:24:30 +00:00
Alexey Bataev	4d4624c20c	[OPENMP] Fix DSA processing for member declaration. If the member declaration is captured in the OMPCapturedExprDecl, we may loose data-sharing attribute info for this declaration. Patch fixes this bug. llvm-svn: 308629	2017-07-20 16:47:47 +00:00
Hans Wennborg	f365d423a0	Revert r308441 "Recommit r308327: Add a warning for missing '#pragma pack (pop)' and suspicious uses of '#pragma pack' in included files" This seems to have broken the sanitizer-x86_64-linux buildbot. Reverting until it's fixed, especially since this landed just before the 5.0 branch. > This commit adds a new -Wpragma-pack warning. It warns in the following cases: > > - When a translation unit is missing terminating #pragma pack (pop) directives. > - When entering an included file if the current alignment value as determined > by '#pragma pack' directives is different from the default alignment value. > - When leaving an included file that changed the state of the current alignment > value. > > rdar://10184173 > > Differential Revision: https://reviews.llvm.org/D35484 llvm-svn: 308455	2017-07-19 12:31:01 +00:00
Alex Lorenz	287f684c18	Recommit r308327: Add a warning for missing '#pragma pack (pop)' and suspicious uses of '#pragma pack' in included files This commit adds a new -Wpragma-pack warning. It warns in the following cases: - When a translation unit is missing terminating #pragma pack (pop) directives. - When entering an included file if the current alignment value as determined by '#pragma pack' directives is different from the default alignment value. - When leaving an included file that changed the state of the current alignment value. rdar://10184173 Differential Revision: https://reviews.llvm.org/D35484 llvm-svn: 308441	2017-07-19 11:30:41 +00:00
Alexey Bataev	169d96a203	[OPENMP] Initial support for 'task_reduction' clause. Parsing/sema analysis of the 'task_reduction' clause. llvm-svn: 308352	2017-07-18 20:17:46 +00:00
Alex Lorenz	aa61922157	Revert r308327 I forgot to test clang-tools-extra which is now failing. llvm-svn: 308328	2017-07-18 17:36:42 +00:00
Alex Lorenz	ad273341a4	Add a warning for missing '#pragma pack (pop)' and suspicious uses of '#pragma pack' in included files This commit adds a new -Wpragma-pack warning. It warns in the following cases: - When a translation unit is missing terminating #pragma pack (pop) directives. - When entering an included file if the current alignment value as determined by '#pragma pack' directives is different from the default alignment value. - When leaving an included file that changed the state of the current alignment value. rdar://10184173 Differential Revision: https://reviews.llvm.org/D35484 llvm-svn: 308327	2017-07-18 17:23:51 +00:00
Alexey Bataev	6e7f49fc4a	[OPENMP] Pacify windows buildbots, NFC. llvm-svn: 308243	2017-07-18 00:42:35 +00:00
Mandeep Singh Grang	a16b27cfda	[clang] Remove redundant check-prefix=CHECK from tests. NFC. Reviewers: t.p.northover, mstorsjo, rsmith, mcrosier Reviewed By: mstorsjo, mcrosier Subscribers: mcrosier, javed.absar, cfe-commits Differential Revision: https://reviews.llvm.org/D35465 llvm-svn: 308192	2017-07-17 17:31:44 +00:00
Alexey Bataev	ec8c9f8eb9	[OPENMP] Fix reduction combiner test llvm-svn: 308183	2017-07-17 14:53:02 +00:00
Alexey Bataev	55c2a961e0	[OPENMP] Further fixes of the reduction codegen tests llvm-svn: 308182	2017-07-17 14:47:59 +00:00
Alexey Bataev	7766e51a61	[OPENMP] Further test fixes. llvm-svn: 308178	2017-07-17 14:22:34 +00:00
Alexey Bataev	a2a814f925	[OPENMP] Rework tests to pacify buildbots. llvm-svn: 308176	2017-07-17 14:06:41 +00:00
Alexey Bataev	be5a8b42cd	[OPENMP] Codegen for reduction clauses in 'taskloop' directives. Adds codegen for taskloop-based directives. llvm-svn: 308174	2017-07-17 13:30:36 +00:00
Alexey Bataev	c9dcaaae69	[OPENMP] Fix reduction tests, NFC. llvm-svn: 307916	2017-07-13 15:15:25 +00:00
Alexey Bataev	faede1d0d8	[OPENMP] Fix reduction tests, NFC. llvm-svn: 307915	2017-07-13 15:09:05 +00:00
Alexey Bataev	280f308b71	[OPENMP] Further reduction test fix, NFC. llvm-svn: 307914	2017-07-13 15:02:27 +00:00
Alexey Bataev	80621121e2	[OPENMP] Fix reduction tests, NFC. llvm-svn: 307912	2017-07-13 14:54:42 +00:00
Alexey Bataev	3ef6fd5637	[OPENMP] Fix reduction tests, NFC. llvm-svn: 307911	2017-07-13 14:29:19 +00:00
Alexey Bataev	5c40bec5eb	[OPENMP] Generalization of codegen for reduction clauses. Reworked codegen for reduction clauses for future support of reductions in task-based directives. llvm-svn: 307910	2017-07-13 13:36:14 +00:00
Alexey Bataev	3344603f7b	[OPENMP] Emit implicit taskgroup block around taskloop directives. If taskloop directive has no associated nogroup clause, it must emitted inside implicit taskgroup block. Runtime supports it, but we need to generate implicit taskgroup block explicitly to support future reductions codegen. llvm-svn: 307822	2017-07-12 18:09:32 +00:00
Alexey Bataev	bcd0ae0617	[OPENMP] Add restriction for reduction clause in taskloop directives. Added checks for the reduction clauses in the taskloop directives: 1. Only addressable items must be used in reduction clauses. 2. Reduction clauses cannot be used with nogroup clauses. llvm-svn: 307693	2017-07-11 19:16:44 +00:00
Jonas Hahnfeld	f7c4d7b0b1	[OpenMP] Fix mapping of scalars for combined directives Combined directives like 'target parallel' have two captured statements. Sema has to check the right one from the right direction. Previously, Sema::IsOpenMPCapturedByRef would return false for mapped scalars on combined directives. This results in a wrong signature of the outlined function which triggers an assertion: void llvm::CallInst::init(llvm::FunctionType , llvm::Value , ArrayRef<llvm::Value *>, ArrayRef<OperandBundleDef>, const llvm::Twine &): Assertion `(i >= FTy->getNumParams() \|\| FTy->getParamType(i) == Args[i]->getType()) && "Calling a function with a bad signature!"' failed. Fixes PR30975 (and PR31985). New function was taken from clang-ykt. Differential Revision: https://reviews.llvm.org/D34888 llvm-svn: 306956	2017-07-01 10:40:50 +00:00
Alexey Bataev	7b6097ce73	[OPENMP] Initial support for taskloop reductions. Add sema/parsupping ort for taskloop [simd] reductions. llvm-svn: 306825	2017-06-30 13:50:13 +00:00
Alexey Bataev	1fdfdf7155	[OPENMP][DEBUG] Generate second function with correct arg types. Currently, if the some of the parameters are captured by value, this argument is converted to uintptr_t type and thus we loosing the debug info about real type of the argument (captured variable): ``` void @.outlined_function.(uintptr %par); ... %a = alloca i32 %a.casted = alloca uintptr %cast = bitcast uintptr* %a.casted to i32* %a.val = load i32, i32 %a store i32 %a.val, i32 %cast %a.casted.val = load uintptr, uintptr* %a.casted call void @.outlined_function.(uintptr %a.casted.val) ... ``` To resolve this problem, in debug mode a speciall external wrapper function is generated, that calls the outlined function with the correct parameters types: ``` void @.wrapper.(uintptr %par) { %a = alloca i32 %cast = bitcast i32* %a to uintptr* store uintptr %par, uintptr %cast %a.val = load i32, i32 %a call void @.outlined_function.(i32 %a) ret void } void @.outlined_function.(i32 %par); ... %a = alloca i32 %a.casted = alloca uintptr %cast = bitcast uintptr* %a.casted to i32* %a.val = load i32, i32 %a store i32 %a.val, i32 %cast %a.casted.val = load uintptr, uintptr* %a.casted call void @.wrapper.(uintptr %a.casted.val) ... ``` llvm-svn: 306697	2017-06-29 16:43:05 +00:00
Diana Picus	bec724cbb0	Revert "Revert r301742 which made ExprConstant checking apply to all full-exprs." This reverts commit r305239 because it broke the buildbots (the diag-flags.cpp test is failing). llvm-svn: 305287	2017-06-13 12:50:06 +00:00
Nick Lewycky	54992386f3	Revert r301742 which made ExprConstant checking apply to all full-exprs. This patch also exposed pre-existing bugs in clang, see PR32864 and PR33140#c3 . llvm-svn: 305239	2017-06-12 21:59:18 +00:00
Alexey Bataev	95b64a9826	[OPENMP] Allow 'use_device_ptr' clause in 'target data' alone. According to OpenMP 5.0 at least one 'map' or 'use_device_ptr' clause must be specified for 'target data' construct. Patch adds support for this feature. llvm-svn: 304216	2017-05-30 16:00:04 +00:00
George Rokos	29d0f00340	[OpenMP] Create COMDAT group for OpenMP offload registration code to avoid multiple copies Thanks to Sergey Dmitriev for submitting the patch. Differential Revision: https://reviews.llvm.org/D33509 llvm-svn: 304056	2017-05-27 03:03:13 +00:00
Alexey Bataev	979966fcd8	[OPENMP] Allow value of thread local variables in target regions. If the variable is marked as TLS variable and target device does not support TLS, the error is emitted for the variable even if it is not used in target regions. Patch fixes this and allows to use the values of the TLS variables in target regions. llvm-svn: 303768	2017-05-24 16:00:02 +00:00
Alexey Bataev	2c84541a21	[OPENMP] Check DSA for variables captured by value. Currently clang checks for default data sharing attributes only for variables captured in OpenMP regions by reference. Patch adds checks for variables captured by value. llvm-svn: 303077	2017-05-15 16:26:15 +00:00
Reid Kleckner	f1deb837ee	Fix bugs checking va_start in lambdas and erroneous contexts Summary: First, getCurFunction looks through blocks and lambdas, which is wrong. Inside a lambda, va_start should refer to the lambda call operator prototype. This fixes PR32737. Second, we shouldn't use any of the getCur* methods, because they look through contexts that we don't want to look through (EnumDecl, CapturedStmtDecl). We can use CurContext directly as the calling context. Finally, this code assumed that CallExprs would never appear outside of code contexts (block, function, obj-c method), which is wrong. Struct member initializers are an easy way to create and parse exprs in a non-code context. Reviewers: rsmith Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D32761 llvm-svn: 302188	2017-05-04 19:51:05 +00:00
Carlo Bertolli	d8844b9d43	[OpenMP] Extended parse for 'always' map modifier https://reviews.llvm.org/D32807 This patch allows the map modifier 'always' to be separated by the map type (to, from, tofrom) only by a whitespace, rather than strictly by a comma as in current trunk. llvm-svn: 302031	2017-05-03 15:28:48 +00:00
Nick Lewycky	e7d6fbdfb7	Remove Sema::CheckForIntOverflow, and instead check all full-expressions. CheckForIntOverflow used to implement a whitelist of top-level expressions to send to the constant expression evaluator, which handled many more expressions than the CheckForIntOverflow whitelist did. llvm-svn: 301742	2017-04-29 09:33:46 +00:00
Alexey Bataev	b435a5ff5e	[OPENMP] Fix failing test. llvm-svn: 301417	2017-04-26 15:30:36 +00:00
Alexey Bataev	4b46539ef3	[OPENMP] Fix handling of OpenMP code during template instantiation. If some function template is instantiated during handling of OpenMP code, currently it may cause crash of compiler because of trying of capturing variables in non-capturing function scopes. Patch fixes this bug. llvm-svn: 301416	2017-04-26 15:06:24 +00:00
Carlo Bertolli	356822fe7b	Minor fix for distribute_parallel_for_num_threads_codegen on AARCH64 llvm-svn: 301348	2017-04-25 18:59:37 +00:00
Carlo Bertolli	b0ff0a69c3	Recommit of [OpenMP] Initial implementation of code generation for pragma 'distribute parallel for' on host https://reviews.llvm.org/D29508 This patch makes the following additions: It abstracts away loop bound generation code from procedures associated with pragma 'for' and loops in general, in such a way that the same procedures can be used for 'distribute parallel for' without the need for a full re-implementation. It implements code generation for 'distribute parallel for' and adds regression tests. It includes tests for clauses. It is important to notice that most of the clauses are implemented as part of existing procedures. For instance, firstprivate is already implemented for 'distribute' and 'for' as separate pragmas. As the implementation of 'distribute parallel for' is based on the same procedures, then we automatically obtain implementation for such clauses without the need to add new code. However, this requires regression tests that verify correctness of produced code. llvm-svn: 301340	2017-04-25 17:52:12 +00:00
Carlo Bertolli	f09daae75d	Revert r301223 llvm-svn: 301233	2017-04-24 19:50:35 +00:00
Carlo Bertolli	4287d65c10	[OpenMP] Initial implementation of code generation for pragma 'distribute parallel for' on host https://reviews.llvm.org/D29508 This patch makes the following additions: 1. It abstracts away loop bound generation code from procedures associated with pragma 'for' and loops in general, in such a way that the same procedures can be used for 'distribute parallel for' without the need for a full re-implementation. 2. It implements code generation for 'distribute parallel for' and adds regression tests. It includes tests for clauses. It is important to notice that most of the clauses are implemented as part of existing procedures. For instance, firstprivate is already implemented for 'distribute' and 'for' as separate pragmas. As the implementation of 'distribute parallel for' is based on the same procedures, then we automatically obtain implementation for such clauses without the need to add new code. However, this requires regression tests that verify correctness of produced code. Looking forward to comments. llvm-svn: 301223	2017-04-24 19:26:11 +00:00
Carlo Bertolli	ffafe10fac	[OpenMP] Prepare sema to support combined constructs with omp distribute and omp for https://reviews.llvm.org/D32237 This patch prepares sema with additional fields to support all those composite and combined constructs of OpenMP that include pragma 'distribute' and 'for', such as 'distribute parallel for'. It also extends the regression tests for 'distribute parallel for' and adds a new one. llvm-svn: 300802	2017-04-20 00:39:39 +00:00
Alexey Bataev	f7ce166220	[OPENMP] Fix for PR32333: Crash in call of outlined Function. If the type of the captured variable is a pointer(s) to variably modified type, this type was not processed correctly. Need to drill into the type, find the innermost variably modified array type and convert it to canonical parameter type. llvm-svn: 299868	2017-04-10 19:16:45 +00:00
Jonas Hahnfeld	bf5061b18c	[test] Unbreak OpenMP/linking.c with arch-specific libdir After rL296927, -rpath gets added after linking the OpenMP runtime. That's why -lgcc does not immediately follow -lomp or -lgomp. llvm-svn: 297264	2017-03-08 09:07:33 +00:00
Jonas Hahnfeld	64a9e3c530	[OpenMP] Generate better diagnostics for cancel and cancellation point checkNestingOfRegions uses CancelRegion to determine whether cancel and cancellation point are valid in the given nesting. This leads to unuseful diagnostics if CancelRegion is invalid. The given test case has produced: region cannot be closely nested inside 'parallel' region As a solution, introduce checkCancelRegion and call it first to get the expected error: one of 'for', 'parallel', 'sections' or 'taskgroup' is expected Differential Revision: https://reviews.llvm.org/D30135 llvm-svn: 295808	2017-02-22 06:49:10 +00:00
Jonas Hahnfeld	b07931f01d	[OpenMP] Fix cancellation point in task with no cancel With tasks, the cancel may happen in another task. This has a different region info which means that we can't find it here. Differential Revision: https://reviews.llvm.org/D30091 llvm-svn: 295474	2017-02-17 18:32:58 +00:00
Jonas Hahnfeld	20fce72f1b	[OpenMP] Remove barriers at cancel and cancellation point This resolves a deadlock with the cancel directive when there is no explicit cancellation point. In that case, the implicit barrier acts as cancellation point. After removing the barrier after cancel, the now unmatched barrier for the explicit cancellation point has to go as well. This has probably worked before rL255992: With the calls for the explicit barrier, it was sure that all threads passed a barrier before exiting. Reported by Simon Convent and Joachim Protze! Differential Revision: https://reviews.llvm.org/D30088 llvm-svn: 295473	2017-02-17 18:32:51 +00:00
Arpith Chacko Jacob	fc711b1f47	[OpenMP] Teams reduction on the NVPTX device. This patch implements codegen for the reduction clause on any teams construct for elementary data types. It builds on parallel reductions on the GPU. Subsequently, the team master writes to a unique location in a global memory scratchpad. The last team to do so loads and reduces this array to calculate the final result. This patch emits two helper functions that are used by the OpenMP runtime on the GPU to perform reductions across teams. Patch by Tian Jin in collaboration with Arpith Jacob Reviewers: ABataev Differential Revision: https://reviews.llvm.org/D29879 llvm-svn: 295335	2017-02-16 16:48:49 +00:00
Arpith Chacko Jacob	101e8fb1f3	[OpenMP] Parallel reduction on the NVPTX device. This patch implements codegen for the reduction clause on any parallel construct for elementary data types. An efficient implementation requires hierarchical reduction within a warp and a threadblock. It is complicated by the fact that variables declared in the stack of a CUDA thread cannot be shared with other threads. The patch creates a struct to hold reduction variables and a number of helper functions. The OpenMP runtime on the GPU implements reduction algorithms that uses these helper functions to perform reductions within a team. Variables are shared between CUDA threads using shuffle intrinsics. An implementation of reductions on the NVPTX device is substantially different to that of CPUs. However, this patch is written so that there are minimal changes to the rest of OpenMP codegen. The implemented design allows the compiler and runtime to be decoupled, i.e., the runtime does not need to know of the reduction operation(s), the type of the reduction variable(s), or the number of reductions. The design also allows reuse of host codegen, with appropriate specialization for the NVPTX device. While the patch does introduce a number of abstractions, the expected use case calls for inlining of the GPU OpenMP runtime. After inlining and optimizations in LLVM, these abstractions are unwound and performance of OpenMP reductions is comparable to CUDA-canonical code. Patch by Tian Jin in collaboration with Arpith Jacob Reviewers: ABataev Differential Revision: https://reviews.llvm.org/D29758 llvm-svn: 295333	2017-02-16 16:20:16 +00:00
Arpith Chacko Jacob	bd6344c0be	Revert r295319 while investigating buildbot failure. llvm-svn: 295323	2017-02-16 14:25:35 +00:00
Arpith Chacko Jacob	8e170fc857	[OpenMP] Parallel reduction on the NVPTX device. This patch implements codegen for the reduction clause on any parallel construct for elementary data types. An efficient implementation requires hierarchical reduction within a warp and a threadblock. It is complicated by the fact that variables declared in the stack of a CUDA thread cannot be shared with other threads. The patch creates a struct to hold reduction variables and a number of helper functions. The OpenMP runtime on the GPU implements reduction algorithms that uses these helper functions to perform reductions within a team. Variables are shared between CUDA threads using shuffle intrinsics. An implementation of reductions on the NVPTX device is substantially different to that of CPUs. However, this patch is written so that there are minimal changes to the rest of OpenMP codegen. The implemented design allows the compiler and runtime to be decoupled, i.e., the runtime does not need to know of the reduction operation(s), the type of the reduction variable(s), or the number of reductions. The design also allows reuse of host codegen, with appropriate specialization for the NVPTX device. While the patch does introduce a number of abstractions, the expected use case calls for inlining of the GPU OpenMP runtime. After inlining and optimizations in LLVM, these abstractions are unwound and performance of OpenMP reductions is comparable to CUDA-canonical code. Patch by Tian Jin in collaboration with Arpith Jacob Reviewers: ABataev Differential Revision: https://reviews.llvm.org/D29758 llvm-svn: 295319	2017-02-16 14:03:36 +00:00
Reid Kleckner	9de921470d	[CodeGen] Treat auto-generated __dso_handle symbol as HiddenVisibility Fixes https://bugs.llvm.org/show_bug.cgi?id=31932 Based on a patch by Roland McGrath Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D29843 llvm-svn: 294978	2017-02-13 18:49:21 +00:00
Charles Li	4f80074629	[Lit Test] Make tests C++11 compatible - Parse OpenMP Differential Revision: https://reviews.llvm.org/D29725 llvm-svn: 294504	2017-02-08 19:46:15 +00:00
Carlo Bertolli	d2192d1b63	[OpenMP] Remove fixme comment in regression test and related unnecessary statement https://reviews.llvm.org/D29501 It looks like I forgot to remove a FIXME comment with the associated statement. The test does not need it and it gives the wrong impression of being an incomplete test. llvm-svn: 294195	2017-02-06 16:03:41 +00:00

1 2 3 4 5 ...

886 Commits