llvm-project

Commit Graph

Author	SHA1	Message	Date
Samuel Antao	f8b5012dfb	[OpenMP] Add TLS-based implementation for threadprivate directive. llvm-svn: 242080	2015-07-13 22:54:53 +00:00
Benjamin Kramer	642f173ae9	Switch users of the 'for (StmtRange range = stmt->children(); range; ++range)‘ pattern to range for loops. The pattern was born out of the lack of range-based for loops in C++98 and is somewhat obscure. No functionality change intended. llvm-svn: 241300	2015-07-02 21:03:14 +00:00
Alexey Bataev	80909878ad	[OPENMP 4.0] Initial support for 'omp cancel' construct. Implemented parsing/sema analysis + (de)serialization. llvm-svn: 241253	2015-07-02 11:25:17 +00:00
Alexey Bataev	6d4ed05830	[OPENMP 4.0] Initial support for 'omp cancellation point' construct. Add parsing and sema analysis for 'omp cancellation point' directive. llvm-svn: 241145	2015-07-01 06:57:41 +00:00
Alexey Bataev	1c2cfbc3ea	[OPENMP] Initial support for 'depend' clause (4.0). Parsing and sema analysis (without support for array sections in arguments) for 'depend' clause (used in 'task' directive, OpenMP 4.0). llvm-svn: 240409	2015-06-23 14:25:19 +00:00
NAKAMURA Takumi	0332edac98	Fix a warning. [-Wsign-compare] FIXME: Should "Level" be unsigned? llvm-svn: 240391	2015-06-23 10:01:20 +00:00
Alexey Bataev	aac108a324	[OPENMP] Do not emit references to original variables in 'private' clause. Currently if the variable is captured in captured region, capture record for this region stores reference to this variable for future use. But we don't need to provide the reference to the original variable if it was explicitly marked as private in the 'private' clause of the OpenMP construct, this variable is replaced by private copy. Differential Revision: http://reviews.llvm.org/D9550 llvm-svn: 240377	2015-06-23 04:51:00 +00:00
Alexander Kornienko	ab9db51042	Revert r240270 ("Fixed/added namespace ending comments using clang-tidy"). llvm-svn: 240353	2015-06-22 23:07:51 +00:00
Alexander Kornienko	3d9d929e42	Fixed/added namespace ending comments using clang-tidy. NFC The patch is generated using this command: $ tools/extra/clang-tidy/tool/run-clang-tidy.py -fix \ -checks=-,llvm-namespace-comment -header-filter='llvm/.\|clang/.*' \ work/llvm/tools/clang To reduce churn, not touching namespaces spanning less than 10 lines. llvm-svn: 240270	2015-06-22 09:47:44 +00:00
Alexey Bataev	c30dd2daf9	[OPENMP] Support for '#pragma omp taskgroup' directive. Added parsing, sema analysis and codegen for '#pragma omp taskgroup' directive (OpenMP 4.0). The code for directive is generated the following way: #pragma omp taskgroup <body> void __kmpc_taskgroup(<loc>, thread_id); <body> void __kmpc_end_taskgroup(<loc>, thread_id); llvm-svn: 240011	2015-06-18 12:14:09 +00:00
Alexey Bataev	3b5b5c492e	[OPENMP] Add support for 'omp parallel for' directive. Codegen for this directive is a combined codegen for 'omp parallel' region with 'omp for simd' region inside. Clauses are supported. llvm-svn: 240006	2015-06-18 10:10:12 +00:00
Alexey Bataev	58e5bdb091	[OPENMP] Add support for 'omp for simd' directive. Added codegen for combined 'omp for simd' directives, that is a combination of 'omp for' directive followed by 'omp simd' directive. Includes support for all clauses. llvm-svn: 239990	2015-06-18 04:45:29 +00:00
Alexey Bataev	ae05c29ab5	[OPENMP] Remove last iteration separation for loop-based constructs. Previously the last iteration for simd loop-based OpenMP constructs were generated as a separate code. This feature is not required and codegen is simplified. llvm-svn: 239810	2015-06-16 11:59:36 +00:00
Alexey Bataev	3ae88e2124	[OPENMP] Prepare codegen for privates in tasks for non-capturing of privates in CapturedStmt. Reworked codegen for privates in tasks: call @kmpc_omp_task_alloc(); ... call @kmpc_omp_task(task_proxy); void map_privates(.privates_rec. privs, type1 * priv1_ref, ..., typen *privn_ref) { priv1_ref = &privs->private1; ... privn_ref = &privs->privaten; ret void } i32 task_entry(i32 ThreadId, i32 PartId, void privs, void (void, ...) map_privates, shareds captures) { type1 priv1; ... typen privn; call map_privates(privs, priv1, ..., privn); <Task body with priv1, .., privn instead of the captured variables>. ret i32 } i32 task_proxy(i32 ThreadId, kmp_task_t_with_privates *tt) { call task_entry(ThreadId, tt->task_data.PartId, &tt->privates, map_privates, tt->task_data.shareds); } llvm-svn: 238010	2015-05-22 08:56:35 +00:00
Alexey Bataev	5129d3a4f5	[OPENMP] Fixed codegen for parameters privatization. For parameters we shall take a derived type of parameters, not the original one. llvm-svn: 237882	2015-05-21 09:47:46 +00:00
Alexey Bataev	16dc7b68c4	Fix for aggregate copying of variable length arrays. Patch fixes codegen for aggregate copying of VLAs. Currently method CodeGenFunction::EmitAggregateCopy() does not support copying of VLAs. Patch checks if the size of the type is 0, then checks if the type is actually a variable-length array. Then it calculates total length for this array and calculates total size of the array in bytes: <total number of elements in array> * aligned_sizeof(ElementType) (if copy assignment is requested). If simple copying is requested, size is calculated like: <total number of elements in array> * aligned_sizeof(ElementType) - aligned_sizeof(ElementType) + sizeof(ElementType). memcpy() is used with this calculated size of the VLA. Differential Revision: http://reviews.llvm.org/D9851 llvm-svn: 237768	2015-05-20 03:46:04 +00:00
Alexey Bataev	ccb59ec9b5	[OPENMP] Prohibit VLAs in 'private/firstprivate' clauses of 'task' directive. Currently runtime does not allow to support variably modified types for 'private' and 'firstprivate' clauses in 'task' directives. llvm-svn: 237674	2015-05-19 08:44:56 +00:00
Alexey Bataev	7a3e5853df	[OPENMP] Prohibit variably modified types in 'copyprivate' clause. Runtime does not allow to work with VLAs in copyprivate clause. llvm-svn: 237672	2015-05-19 08:19:24 +00:00
Alexey Bataev	f120c0d6f2	[OPENMP] Fixed analysis of function arguments and their data sharing attributes. Added proper analysis for types of function arguments. llvm-svn: 237670	2015-05-19 07:46:42 +00:00
Alexey Bataev	0c024df9d1	[OPENMP] Allow using of threadprivate variables as loop-control variables in lop based directives. llvm-svn: 237102	2015-05-12 09:02:07 +00:00
Alexey Bataev	040d540940	[OPENMP] Fixed support for 'schedule' clause with non-constant chunk size. 'schedule' clause for combined directives requires additional processing. Special helper variable is generated, that is captured in the outlined parallel region for 'parallel for' region. This captured variable is used to store chunk expression from the 'schedule' clause in this 'parallel for' region. llvm-svn: 237100	2015-05-12 08:35:28 +00:00
Alexey Bataev	39f915b8f4	[OPENMP] Code cleanup for capturing of variables in OpenMP regions. llvm-svn: 236821	2015-05-08 10:41:21 +00:00
Alexey Bataev	69a4779965	[OPENMP] Fixed codegen for 'reduction' clause. Fixed codegen for reduction operations min, max, && and \|\|. Codegen for them is quite similar and I was confused by this similarity. Also added a call to kmpc_end_reduce() in atomic part of reduction codegen (call to kmpc_end_reduce_nowait() is not required). Differential Revision: http://reviews.llvm.org/D9513 llvm-svn: 236689	2015-05-07 03:54:03 +00:00
Alexey Bataev	f2453a01fb	[OPENMP] Fixed messages about predetermined DSA for loop control variables. llvm-svn: 236574	2015-05-06 07:25:08 +00:00
Alexey Bataev	1a8b3f1a4e	[OPENMP] Fix for http://llvm.org/PR23387 : clang fails to compile magick/attribute.c Allow to use variables with 'register' storage class as loop control variables in OpenMP loop based constructs. llvm-svn: 236571	2015-05-06 06:34:55 +00:00
Alexey Bataev	9c82103743	[OPENMP] Allow to use global variables as lcv in loop-based directives. For proper codegen we need to capture variable in the OpenMP region. In loop-based directives loop control variables are private by default and they must be captured in this region. There was a problem with capturing of globals, used as lcv, as they was not marked as private by default. Differential Revision: http://reviews.llvm.org/D9336 llvm-svn: 236201	2015-04-30 04:23:23 +00:00
Alexey Bataev	c925aa3ab8	[OPENMP] Simplified iteration over clauses, NFC. llvm-svn: 235838	2015-04-27 08:00:32 +00:00
Alexey Bataev	5e018f9e29	[OPENMP] Codegen for 'atomic capture'. Adds codegen for 'atomic capture' constructs with the following forms of expressions/statements: v = x binop= expr; v = x++; v = ++x; v = x--; v = --x; v = x = x binop expr; v = x = expr binop x; {v = x; x = binop= expr;} {v = x; x++;} {v = x; ++x;} {v = x; x--;} {v = x; --x;} {x = x binop expr; v = x;} {x binop= expr; v = x;} {x++; v = x;} {++x; v = x;} {x--; v = x;} {--x; v = x;} {x = x binop expr; v = x;} {x = expr binop x; v = x;} {v = x; x = expr;} If x and expr are integer and binop is associative or x is a LHS in a RHS of the assignment expression, and atomics are allowed for type of x on the target platform atomicrmw instruction is emitted. Otherwise compare-and-swap sequence is emitted. Update of 'v' is not required to be be atomic with respect to the read or write of the 'x'. bb: ... atomic load <x> cont: <expected> = phi [ <x>, label %bb ], [ <new_failed>, %cont ] <desired> = <expected> binop <expr> <res> = cmpxchg atomic &<x>, desired, expected <new_failed> = <res>.field1; br <res>field2, label %exit, label %cont exit: atomic store <old/new x>, <v> ... Differential Revision: http://reviews.llvm.org/D9049 llvm-svn: 235573	2015-04-23 06:35:10 +00:00
Alexey Bataev	50a6458870	[OPENMP] Codegen for 'private' clause in 'for' directive. This patch generates helper variables which used as a private copies of the corresponding original variables inside an OpenMP 'for' directive. These generated variables are initialized by default (with the default constructor, if any). In OpenMP region references to original variables are replaced by the references to these private helper variables. Differential Revision: http://reviews.llvm.org/D9106 llvm-svn: 235503	2015-04-22 12:24:45 +00:00
Alexey Bataev	62dbb979c0	[OPENMP] Fix use of unsigned counters in loops with zero trip count. Patch fixes bugs in codegen for loops with unsigned counters and zero trip count. Previously preconditions for all loops were built using logic (Upper - Lower) > 0. But if the loop is a loop with zero trip count, then Upper - Lower is < 0 only for signed integer, for unsigned we're running into an underflow situation. In this patch we're using original Lower<Upper condition to check that loop body can be executed at least once. Also this allows to skip code generation for loops, if it is known that preconditions for the loop are always false. Differential Revision: http://reviews.llvm.org/D9103 llvm-svn: 235500	2015-04-22 11:59:37 +00:00
Alexey Bataev	6ddfe1a6af	[OPENMP] Fix for checking of data-sharing attributes for canonical var decls only. Currently checks for active data-sharing attributes for variables are performed for found var decls. Instead these checks must be performed for canonical decls of these variables to avoid possible troubles with with the differently qualified re-declarations of the same variable, for example: namespace A { int x; } namespace B { using A::x; } Both A::x and B::x actually reference the same object A::x and this fact must be taken into account during data-sharing attributes analysis. llvm-svn: 235096	2015-04-16 13:49:42 +00:00
Alexey Bataev	f56f98c925	[OPENMP] Codegen for 'copyin' clause in 'parallel' directive. Emits the following code for the clause at the beginning of the outlined function for implicit threads: if (<not a master thread>) { ... <thread local copy of var> = <master thread local copy of var>; ... } <sync point>; Checking for a non-master thread is performed by comparing of the address of the thread local variable with the address of the master's variable. Master thread always uses original variables, so you always know the address of the variable in the master thread. Differential Revision: http://reviews.llvm.org/D9026 llvm-svn: 235075	2015-04-16 05:39:01 +00:00
Alexey Bataev	38e8953352	[OPENMP] Codegen for 'lastprivate' clause in 'for' directive. #pragma omp for lastprivate(<var>) for (i = a; i < b; ++b) <BODY>; This construct is translated into something like: <last_iter> = alloca i32 <lastprivate_var> = alloca <type> <last_iter> = 0 ; No initializer for simple variables or a default constructor is called for objects. ; For arrays perform element by element initialization by the call of the default constructor. ... OMP_FOR_START(...,<last_iter>, ..); sets <last_iter> to 1 if this is the last iteration. <BODY> ... OMP_FOR_END if (<last_iter> != 0) { <var> = <lastprivate_var> ; Update original variable with the lastprivate value. } call __kmpc_cancel_barrier() ; an implicit barrier to avoid possible data race. Differential Revision: http://reviews.llvm.org/D8658 llvm-svn: 235074	2015-04-16 04:54:05 +00:00
Alexey Bataev	69c62a9bdb	[OPENMP] Codegen for 'firstprivate' clause in 'for' directive. Adds proper codegen for 'firstprivate' clause in for directive. Initially codegen for 'firstprivate' clause was implemented for 'parallel' directive only. Also this patch emits sync point only after initialization of firstprivate variables, not all private variables. This sync point is not required for privates, lastprivates etc., only for initialization of firstprivate variables. Differential Revision: http://reviews.llvm.org/D8660 llvm-svn: 234978	2015-04-15 04:52:20 +00:00
Alexey Bataev	420d45b2dd	[OPENMP] Fixed codegen for arrays in 'copyprivate' clause. Fixed a bug with codegen of variables with array types specified in 'copyprivate' clause of 'single' directive. Differential Revision: http://reviews.llvm.org/D8914 llvm-svn: 234856	2015-04-14 05:11:24 +00:00
Alexey Bataev	68adb7da1a	[OPENMP] Initial codegen for 'parallel sections' directive. Emits code for outlined 'parallel' directive with the implicitly inlined 'sections' directive: ... call __kmpc_fork_call(..., outlined_function, ...); ... define internal void outlined_function(...) { <code for implicit sections directive>; } Differential Revision: http://reviews.llvm.org/D8997 llvm-svn: 234849	2015-04-14 03:29:22 +00:00
Alexey Bataev	794ba0dcb7	[OPENMP] Codegen for 'reduction' clause in 'parallel' directive. Emit a code for reduction clause. Next code should be emitted for reductions: static kmp_critical_name lock = { 0 }; void reduce_func(void lhs[<n>], void rhs[<n>]) { ... (Type<i> )lhs[i] = RedOp<i>((Type<i> )lhs[i], (Type<i> )rhs[i]); ... } ... void RedList[<n>] = {&<RHSExprs>[0], ..., &<RHSExprs>[<n> - 1]}; switch (__kmpc_reduce{_nowait}(<loc>, <gtid>, <n>, sizeof(RedList), RedList, reduce_func, &<lock>)) { case 1: ... <LHSExprs>[i] = RedOp<i>(<LHSExprs>[i], <RHSExprs>[i]); ... __kmpc_end_reduce{_nowait}(<loc>, <gtid>, &<lock>); break; case 2: ... Atomic(<LHSExprs>[i] = RedOp<i>(<LHSExprs>[i], *<RHSExprs>[i])); ... break; default: ; } Reduction variables are a kind of a private variables, they have private copies, but initial values are chosen in accordance with the reduction operation. Differential Revision: http://reviews.llvm.org/D8915 llvm-svn: 234583	2015-04-10 10:43:45 +00:00
Alexey Bataev	7c2ed44905	[OPENMP] Allow redeclaration of variables as threadprivate. No need to emit an error message if the variable is redeclared as threadprivate. llvm-svn: 234402	2015-04-08 12:45:41 +00:00
Alexey Bataev	8bf6b3eaf7	[OPENMP] Fix crash on private variables not used in OpenMP region in templates. llvm-svn: 233913	2015-04-02 13:07:08 +00:00
Alexey Bataev	a8d4a54346	[OPENMP] Fix crash on private variables not used in OpenMP region. llvm-svn: 233902	2015-04-02 07:48:16 +00:00
Alexey Bataev	b78ca83d3b	[OPENMP] Sema analysis for 'atomic capture' construct. Added sema checks for forms of expressions/statements allowed under control of 'atomic capture' directive + generation of helper objects for future codegen. llvm-svn: 233785	2015-04-01 03:33:17 +00:00
Alexey Bataev	b4505a7229	[OPENMP] Codegen for 'atomic update' construct. Adds atomic update codegen for the following forms of expressions: x binop= expr; x++; ++x; x--; --x; x = x binop expr; x = expr binop x; If x and expr are integer and binop is associative or x is a LHS in a RHS of the assignment expression, and atomics are allowed for type of x on the target platform atomicrmw instruction is emitted. Otherwise compare-and-swap sequence is emitted: bb: ... atomic load <x> cont: <expected> = phi [ <x>, label %bb ], [ <new_failed>, %cont ] <desired> = <expected> binop <expr> <res> = cmpxchg atomic &<x>, desired, expected <new_failed> = <res>.field1; br <res>field2, label %exit, label %cont exit: ... Differential Revision: http://reviews.llvm.org/D8536 llvm-svn: 233513	2015-03-30 05:20:59 +00:00
Alexey Bataev	a63048e4fd	[OPENMP] Codegen for 'copyprivate' clause ('single' directive). If there is at least one 'copyprivate' clause is associated with the single directive, the following code is generated: ``` i32 did_it = 0; \\ for 'copyprivate' clause if(__kmpc_single(ident_t , gtid)) { SingleOpGen(); __kmpc_end_single(ident_t , gtid); did_it = 1; \\ for 'copyprivate' clause } <copyprivate_list>[0] = &var0; ... <copyprivate_list>[n] = &varn; call __kmpc_copyprivate(ident_t , gtid, <copyprivate_list_size>, <copyprivate_list>, <copy_func>, did_it); ... void<copy_func>(void LHSArg, void RHSArg) { Dst = (void [n])(LHSArg); Src = (void * [n])(RHSArg); Dst[0] = Src[0]; ... Dst[n] = Src[n]; } ``` All list items from all 'copyprivate' clauses are gathered into single <copyprivate list> (<copyprivate_list_size> is a size in bytes of this list) and <copy_func> is used to propagate values of private or threadprivate variables from the 'single' region to other implicit threads from outer 'parallel' region. Differential Revision: http://reviews.llvm.org/D8410 llvm-svn: 232932	2015-03-23 06:18:07 +00:00
Alexander Musman	3276a27b5c	[OPENMP] CodeGen of the 'linear' clause for the 'omp simd' directive. The linear variable is privatized (similar to 'private') and its value on current iteration is calculated, similar to the loop counter variables. Differential revision: http://reviews.llvm.org/D8375 llvm-svn: 232890	2015-03-21 10:12:56 +00:00
Alexey Bataev	1d160b1945	[OPENMP] Additional sema analysis for 'omp atomic[ update]'. Adds additional semantic analysis + generation of helper expressions for proper codegen. llvm-svn: 232164	2015-03-13 12:27:31 +00:00
Alexey Bataev	10fec57e5a	[OPENMP] Fix for ExprWithCleanups in 'omp atomic' constructs. This patch allows using of ExprWithCleanups expressions and other complex expressions in 'omp atomic' construct Differential Revision: http://reviews.llvm.org/D8200 llvm-svn: 231905	2015-03-11 04:48:56 +00:00
Alexey Bataev	62b63b197d	[OPENMP] Initial codegen for 'omp task' directive. The task region is emmitted in several steps: Emit a call to kmp_task_t __kmpc_omp_task_alloc(ident_t , kmp_int32 gtid, kmp_int32 flags, size_t sizeof_kmp_task_t, size_t sizeof_shareds, kmp_routine_entry_t task_entry). Here task_entry is a pointer to the function: kmp_int32 .omp_task_entry.(kmp_int32 gtid, kmp_task_t tt) { TaskFunction(gtid, tt->part_id, tt->shareds); return 0; } Copy a list of shared variables to field shareds of the resulting structure kmp_task_t returned by the previous call (if any). Copy a pointer to destructions function to field destructions of the resulting structure kmp_task_t. Emit a call to kmp_int32 __kmpc_omp_task(ident_t , kmp_int32 gtid, kmp_task_t new_task), where new_task is a resulting structure from previous items. Differential Revision: http://reviews.llvm.org/D7560 llvm-svn: 231762	2015-03-10 07:28:44 +00:00
Alexey Bataev	b832926176	[OPENMP] Codegen for "#pragma omp atomic write" For global reg lvalue - use regular store through global register. For simple lvalue - use simple atomic store. For bitfields, vector element, extended vector elements - the original value of the whole storage (for vector elements) or of some aligned value (for bitfields) is atomically read, the part of this value for the given lvalue is modified and then use atomic compare-and-exchange operation to try to atomically write modified value (if it was not modified). Also, changes in this patch fix the bug for '#pragma omp atomic read' applied to extended vector elements. Differential Revision: http://reviews.llvm.org/D7369 llvm-svn: 230736	2015-02-27 06:33:30 +00:00
Alexey Bataev	42971a3342	[OPENMP] Fixed DSA processing for predetermined shared variables. This patch allows to use predetermined shared variables in private clauses in parallel or tasks regions. llvm-svn: 226549	2015-01-20 07:03:46 +00:00
Alexey Bataev	3255bf3aac	[OPENMP] Disable copyprivate an nowait clauses in 'single' directive. The copyprivate clause must not be used with the nowait clause in single directive. llvm-svn: 226429	2015-01-19 05:20:46 +00:00

1 2 3 4

167 Commits