llvm-project

Commit Graph

Author	SHA1	Message	Date
Samuel Antao	6d0042642a	Re-apply r272900 - [OpenMP] Cast captures by copy when passed to fork call so that they are compatible to what the runtime library expects. An issue in one of the regression tests was fixed for 32-bit hosts. llvm-svn: 272931	2016-06-16 18:39:34 +00:00
Samuel Antao	b1f9501242	Revert r272900 - [OpenMP] Cast captures by copy when passed to fork call so that they are compatible to what the runtime library expects. Was causing trouble in one of the regression tests for a 32-bit address space. llvm-svn: 272908	2016-06-16 16:06:22 +00:00
Samuel Antao	4951617980	[OpenMP] Cast captures by copy when passed to fork call so that they are compatible to what the runtime library expects. Summary: This patch fixes an issue detected when firstprivate variables are passed to an OpenMP outlined function vararg list. Currently they are not compatible with what the runtime library expects causing malfunction in some targets. This patch fixes the issue by moving the casting logic already in place for offloading to the common code that creates the outline function and arguments and updates the regression tests accordingly. Reviewers: hfinkel, arpith-jacob, carlo.bertolli, kkwli0, ABataev Subscribers: cfe-commits, caomhin Differential Revision: http://reviews.llvm.org/D21150 llvm-svn: 272900	2016-06-16 15:09:31 +00:00
Alexey Bataev	ad537bb8a0	[OPENMP 4.5] Fixed codegen for 'priority' and destructors in task-based directives. 'kmp_task_t' record type added a new field for 'priority' clause and changed the representation of pointer to destructors for privates used within loop-based directives. Old representation: typedef struct kmp_task { /* GEH: Shouldn't this be aligned somehow? / void shareds; /*< pointer to block of pointers to shared vars / kmp_routine_entry_t routine; /*< pointer to routine to call for executing task / kmp_int32 part_id; /*< part id for the task / kmp_routine_entry_t destructors; /* pointer to function to invoke deconstructors of firstprivate C++ objects / / private vars / } kmp_task_t; New representation: typedef struct kmp_task { / GEH: Shouldn't this be aligned somehow? / void shareds; /*< pointer to block of pointers to shared vars / kmp_routine_entry_t routine; /*< pointer to routine to call for executing task / kmp_int32 part_id; /*< part id for the task / kmp_cmplrdata_t data1; /* Two known optional additions: destructors and priority / kmp_cmplrdata_t data2; / Process destructors first, priority second / / future data / / private vars */ } kmp_task_t; Also excessive initialization of 'destructors' fields to 'null' was removed from codegen if it is known that no destructors shal be used. Currently a special bit is used in 'kmp_tasking_flags_t' bitfields ('destructors_thunk' bitfield). llvm-svn: 271201	2016-05-30 09:06:50 +00:00
Samuel Antao	8d2d730f2a	[OpenMP] Codegen for target update directive. Summary: This patch implements the code generation for the `target update` directive. The implemntation relies on the logic already in place for target data standalone directives, i.e. target enter/exit data. Reviewers: hfinkel, carlo.bertolli, arpith-jacob, kkwli0, ABataev Subscribers: caomhin, cfe-commits Differential Revision: http://reviews.llvm.org/D20650 llvm-svn: 270886	2016-05-26 18:30:22 +00:00
Samuel Antao	ec172c6da0	[OpenMP] Parsing and sema support for the from clause Summary: The patch contains the parsing and sema support for the `from` clause. Patch based on the original post by Kelvin Li. Reviewers: hfinkel, carlo.bertolli, kkwli0, arpith-jacob, ABataev Subscribers: caomhin, cfe-commits Differential Revision: http://reviews.llvm.org/D18488 llvm-svn: 270882	2016-05-26 17:49:04 +00:00
Samuel Antao	661c0904e1	[OpenMP] Parsing and sema support for the to clause Summary: The patch contains the parsing and sema support for the `to` clause. Patch based on the original post by Kelvin Li. Reviewers: carlo.bertolli, hfinkel, kkwli0, arpith-jacob, ABataev Subscribers: caomhin, cfe-commits Differential Revision: http://reviews.llvm.org/D18597 llvm-svn: 270880	2016-05-26 17:39:58 +00:00
Samuel Antao	686c70c3dc	[OpenMP] Parsing and sema support for target update directive Summary: This patch is to add parsing and sema support for `target update` directive. Support for the `to` and `from` clauses will be added by a different patch. This patch also adds support for other clauses that are already implemented upstream and apply to `target update`, e.g. `device` and `if`. This patch is based on the original post by Kelvin Li. Reviewers: hfinkel, carlo.bertolli, kkwli0, arpith-jacob, ABataev Subscribers: caomhin, cfe-commits Differential Revision: http://reviews.llvm.org/D15944 llvm-svn: 270878	2016-05-26 17:30:50 +00:00
Hal Finkel	c07e19b2c1	Add a loop's debug location to its llvm.loop metadata Getting accurate locations for loops is important, because those locations are used by the frontend to generate optimization remarks. Currently, optimization remarks for loops often appear on the wrong line, often the first line of the loop body instead of the loop itself. This is confusing because that line might itself be another loop, or might be somewhere else completely if the body was an inlined function call. This happens because of the way we find the loop's starting location. First, we look for a preheader, and if we find one, and its terminator has a debug location, then we use that. Otherwise, we look for a location on an instruction in the loop header. The fallback heuristic is not bad, but will almost always find the beginning of the body, and not the loop statement itself. The preheader location search often fails because there's often not a preheader, and even when there is a preheader, depending on how it was formed, it sometimes carries the location of some preceeding code. I don't see any good theoretical way to fix this problem. On the other hand, this seems like a straightforward solution: Put the debug location in the loop's llvm.loop metadata. When emitting debug information, this commit causes us to add the debug location as an operand to each loop's llvm.loop metadata. Thus, we now generate this metadata for all loops (not just loops with optimization hints) when we're otherwise generating debug information. The remark test case changes depend on the companion LLVM commit r270771. llvm-svn: 270772	2016-05-25 21:53:24 +00:00
Alexey Bataev	8b42706a6e	[OPENMP 4.5] Codegen for dacross loop synchronization constructs. OpenMP 4.5 adds support for doacross loop synchronization. Patch implements codegen for this construct. llvm-svn: 270690	2016-05-25 12:36:08 +00:00
Alexey Bataev	9afe57541e	[OPENMP] Fixed codegen for firstprivate vars in standalone worksharing directives. If firstprivate variable is is captured by value in outlined region and then used as firstprivate variable in inner worksharing directive, the copy for this firstprivate variable was not created. Fixed this bug. llvm-svn: 270536	2016-05-24 07:40:12 +00:00
Alexey Bataev	7ace49dff1	[OPENMP] Pass scalar firstprivate vars by value. For better performance and to unify code with offloading part we pass scalar firstprivate values by value, instead of by reference. It will remove some extra copying operations. llvm-svn: 269751	2016-05-17 08:55:33 +00:00
Alexey Bataev	1e1e286a6b	[OPENMP 4.5] Initial codegen for 'priority' clause in task-based directives. OpenMP 4.5 supports clause 'priority' in task-based directives. Patch adds initial codegen support for this clause in codegen. llvm-svn: 269050	2016-05-10 12:21:02 +00:00
Alexey Bataev	9ebd742748	[OPENMP 4.5] Add codegen support in runtime for '[non]monotonic' schedule modifiers. Runtime library expects some additional data in schedule argument for loop-based directives, that have additional schedule modifiers 'monotonic\|nonmonotonic'. llvm-svn: 269035	2016-05-10 09:57:36 +00:00
Alexey Bataev	f93095a003	[OPENMP 4.5] Codegen for 'lastprivate' clauses in 'taskloop' directives. OpenMP 4.5 adds taskloop/taskloop simd directives. These directives allow to use lastprivate clause. Patch adds codegen for this clause. llvm-svn: 268618	2016-05-05 08:46:22 +00:00
Alexey Bataev	1e73ef3882	[OPENMP 4.5] Initial codegen for 'taskloop simd' directive. OpenMP 4.5 defines 'taskloop simd' directive, which is combined directive for 'taskloop' and 'simd' directives. Patch adds initial codegen support for this directive and its 2 basic clauses 'safelen' and 'simdlen'. llvm-svn: 267872	2016-04-28 12:14:51 +00:00
Alexey Bataev	24b5baed27	[OPENMP] Simplified interface for codegen of tasks, NFC. Reduced number of arguments in member functions of runtime support library for task-based directives. llvm-svn: 267863	2016-04-28 09:23:51 +00:00
Alexey Bataev	2b19a6fe53	[OPENMP 4.5] Codegen for 'grainsize/num_tasks' clauses of 'taskloop' directive. OpenMP 4.5 defines 'taskloop' directive and 2 additional clauses 'grainsize' and 'num_tasks' for this directive. Patch adds codegen for these clauses. These clauses are generated as arguments of the '__kmpc_taskloop' libcall and are encoded the following way: void __kmpc_taskloop(ident_t loc, int gtid, kmp_task_t task, int if_val, kmp_uint64 lb, kmp_uint64 ub, kmp_int64 st, int nogroup, int sched, kmp_uint64 grainsize, void *task_dup); If 'grainsize' is specified, 'sched' argument must be set to '1' and 'grainsize' argument must be set to the value of the 'grainsize' clause. If 'num_tasks' is specified, 'sched' argument must be set to '2' and 'grainsize' argument must be set to the value of the 'num_tasks' clause. It is possible because these 2 clauses are mutually exclusive and can't be used at the same time on the same directive. If none of these clauses is specified, 'sched' argument must be set to '0'. llvm-svn: 267862	2016-04-28 09:15:06 +00:00
Samuel Antao	8dd6628743	[OpenMP] Code generation for target exit data directive Summary: This patch adds support for the target exit data directive code generation. Given that, apart from the employed runtime call, target exit data requires the same code generation pattern as target enter data, the OpenMP codegen entry point was renamed and reused for both. Reviewers: hfinkel, carlo.bertolli, arpith-jacob, kkwli0, ABataev Subscribers: cfe-commits, fraggamuffin, caomhin Differential Revision: http://reviews.llvm.org/D17369 llvm-svn: 267814	2016-04-27 23:14:30 +00:00
Samuel Antao	bd0ae2e14c	[OpenMP] Code generation for target enter data directive Summary: This patch adds support for the target enter data directive code generation. Reviewers: hfinkel, carlo.bertolli, arpith-jacob, kkwli0, ABataev Subscribers: cfe-commits, fraggamuffin, caomhin Differential Revision: http://reviews.llvm.org/D17368 llvm-svn: 267812	2016-04-27 23:07:29 +00:00
Samuel Antao	df158d5567	[OpenMP] Code generation for target data directive Summary: This patch adds support for the target data directive code generation. Part of the already existent functionality related with data maps is moved to a new function so that it could be reused. Reviewers: hfinkel, carlo.bertolli, arpith-jacob, kkwli0, ABataev Subscribers: cfe-commits, fraggamuffin, caomhin Differential Revision: http://reviews.llvm.org/D17367 llvm-svn: 267811	2016-04-27 22:58:19 +00:00
Alexey Bataev	8fbae8cf09	[OPENMP] Fix crash on initialization of classes with no init clause in declare reductions. If reduction clause is applied to instance of class with user-defined reduction operation without initialization clause, it may cause a crash. Patch fixes this issue. llvm-svn: 267695	2016-04-27 11:38:05 +00:00
Alexey Bataev	4ba78a46ff	[OPENMP] Fix for codegen of captured variables in inlined directives. Currently there is a problem with codegen of inlined directives inside lambdas, it may cause a crash during codegen because of incorrect capturing of variables. Patch fixes this problem. llvm-svn: 267677	2016-04-27 07:56:03 +00:00
Alexey Bataev	7292c29bb5	[OPENMP 4.5] Codegen for 'taskloop' directive. The taskloop construct specifies that the iterations of one or more associated loops will be executed in parallel using OpenMP tasks. The iterations are distributed across tasks created by the construct and scheduled to be executed. The next code will be generated for the taskloop directive: #pragma omp taskloop num_tasks(N) lastprivate(j) for( i=0; i<NGRAINSTRIDE-1; i+=STRIDE ) { int th = omp_get_thread_num(); #pragma omp atomic counter++; #pragma omp atomic th_counter[th]++; j = i; } Generated code: task = __kmpc_omp_task_alloc(NULL,gtid,1,sizeof(struct task),sizeof(struct shar),&task_entry); psh = task->shareds; psh->pth_counter = &th_counter; psh->pcounter = &counter; psh->pj = &j; task->lb = 0; task->ub = NGRAINSTRIDE-2; task->st = STRIDE; __kmpc_taskloop( NULL, // location gtid, // gtid task, // task structure 1, // if clause value &task->lb, // lower bound &task->ub, // upper bound STRIDE, // loop increment 0, // 1 if nogroup specified 2, // schedule type: 0-none, 1-grainsize, 2-num_tasks N, // schedule value (ignored for type 0) (void*)&__task_dup_entry // tasks duplication routine ); llvm-svn: 267395	2016-04-25 12:22:29 +00:00
Alexey Bataev	feddd64bff	[OPENMP] Fix for PR27463: Privatizing struct fields with array type causes code generation failure. The codegen part of firstprivate clause for member decls used type of original variable without skipping reference type from OMPCapturedExprDecl. Patch fixes this problem. llvm-svn: 267125	2016-04-22 09:05:03 +00:00
Alexey Bataev	5dff95c04d	[OPENMP] Fix for LCV in simd directives in explicit clauses. If loop control variable for simd-based directives is explicitly marked as linear/lastprivate in clauses, codegen for such construct would crash. Patch fixes this problem. llvm-svn: 267101	2016-04-22 03:56:56 +00:00
Alexey Bataev	48591dd98c	[OPENMP] Codegen for untied tasks. If the untied clause is present on a task construct, any thread in the team can resume the task region after a suspension. Patch adds proper codegen for untied tasks. llvm-svn: 266853	2016-04-20 04:01:36 +00:00
Alexey Bataev	995e861ba6	Revert "[OPENMP] Codegen for untied tasks." This reverts commit r266754. llvm-svn: 266755	2016-04-19 16:36:01 +00:00
Alexey Bataev	823acfacdf	[OPENMP] Codegen for untied tasks. If the untied clause is present on a task construct, any thread in the team can resume the task region after a suspension. Patch adds proper codegen for untied tasks. llvm-svn: 266754	2016-04-19 16:27:55 +00:00
Alexey Bataev	bec9572213	Revert "[OPENMP] Codegen for untied tasks." This reverts commit 266722. llvm-svn: 266724	2016-04-19 09:27:38 +00:00
Alexey Bataev	26b2577f6b	[OPENMP] Codegen for untied tasks. If the untied clause is present on a task construct, any thread in the team can resume the task region after a suspension. Patch adds proper codegen for untied tasks. llvm-svn: 266722	2016-04-19 09:10:27 +00:00
Alexey Bataev	e48a5fc56d	[OPENMP 4.0] Support for 'uniform' clause in 'declare simd' directive. OpenMP 4.0 defines clause 'uniform' in 'declare simd' directive: 'uniform' '(' <argument-list> ')' The uniform clause declares one or more arguments to have an invariant value for all concurrent invocations of the function in the execution of a single SIMD loop. The special this pointer can be used as if was one of the arguments to the function in any of the linear, aligned, or uniform clauses. llvm-svn: 266041	2016-04-12 05:28:34 +00:00
JF Bastien	92f4ef1017	NFC: make AtomicOrdering an enum class Summary: See LLVM change D18775 for details, this change depends on it. Reviewers: jyknight, reames Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D18776 llvm-svn: 265569	2016-04-06 17:26:42 +00:00
Carlo Bertolli	c687225b43	[OPENMP] Codegen for teams directive for NVPTX This patch implements the teams directive for the NVPTX backend. It is different from the host code generation path as it: Does not call kmpc_fork_teams. All necessary teams and threads are started upon touching the target region, when launching a CUDA kernel, and their execution is coordinated through sequential and parallel regions within the target region. Does not call kmpc_push_num_teams even if a num_teams of thread_limit clause is present. Setting the number of teams and the thread limit is implemented by the nvptx-related runtime. Please note that I am now passing a Clang Expr * to emitPushNumTeams instead of the originally chosen llvm::Value * type. The reason for that is that I want to avoid emitting expressions for num_teams and thread_limit if they are not needed in the target region. http://reviews.llvm.org/D17963 llvm-svn: 265304	2016-04-04 15:55:02 +00:00
Alexey Bataev	5a3af13d93	[OPENMP] Remove extra code transformation. For better support of some specific GNU extensions some extra transformation of AST nodes were introduced. These transformations are very hard to handle. The code is improved in handling of these extensions by using captured expressions construct. llvm-svn: 264709	2016-03-29 08:58:54 +00:00
Alexey Bataev	14fa1c6b60	[OPENMP] Allow runtime insert its own code inside OpenMP regions. Solution unifies interface of RegionCodeGenTy type to allow insert runtime-specific code before/after main codegen action defined in CGStmtOpenMP.cpp file. Runtime should not define its own RegionCodeGenTy for general OpenMP directives, but must be allowed to insert its own (required) code to support target specific codegen. llvm-svn: 264700	2016-03-29 05:34:15 +00:00
Alexey Bataev	f539faa733	Revert "[OPENMP] Allow runtime insert its own code inside OpenMP regions." Reverting because of failed tests. llvm-svn: 264577	2016-03-28 12:58:34 +00:00
Alexey Bataev	424be92831	[OPENMP] Allow runtime insert its own code inside OpenMP regions. Solution unifies interface of RegionCodeGenTy type to allow insert runtime-specific code before/after main codegen action defined in CGStmtOpenMP.cpp file. Runtime should not define its own RegionCodeGenTy for general OpenMP directives, but must be allowed to insert its own (required) code to support target specific codegen. llvm-svn: 264576	2016-03-28 12:52:58 +00:00
Alexey Bataev	f662b5943c	Revert "[OPENMP] Allow runtime insert its own code inside OpenMP regions." This reverts commit 3ee791165100607178073f14531a0dc90c622b36. llvm-svn: 264570	2016-03-28 10:12:03 +00:00
Alexey Bataev	b8c425c4f7	[OPENMP] Allow runtime insert its own code inside OpenMP regions. Solution unifies interface of RegionCodeGenTy type to allow insert runtime-specific code before/after main codegen action defined in CGStmtOpenMP.cpp file. Runtime should not define its own RegionCodeGenTy for general OpenMP directives, but must be allowed to insert its own (required) code to support target specific codegen. llvm-svn: 264569	2016-03-28 09:53:43 +00:00
Alexey Bataev	a839dddf92	[OPENMP 4.0] Use 'declare reduction' constructs in 'reduction' clauses. OpenMP 4.0 allows to define custom reduction operations using '#pragma omp declare reduction' construct. Patch allows to use this custom defined reduction operations in 'reduction' clauses. llvm-svn: 263701	2016-03-17 10:19:46 +00:00
John McCall	c56a8b3284	Preserve ExtParameterInfos into CGFunctionInfo. As part of this, make the function-arrangement interfaces a little simpler and more semantic. NFC. llvm-svn: 263191	2016-03-11 04:30:31 +00:00
Alexey Bataev	ef549a8955	[OPENMP 4.5] Codegen for data members in 'linear' clause OpenMP 4.5 allows privatization of non-static data members in OpenMP constructs. Patch adds proper codegen support for data members in 'linear' clause llvm-svn: 263003	2016-03-09 09:49:09 +00:00
Alexey Bataev	78849fb464	[OPENMP 4.5] Codegen for data members in 'linear' clause. OpenMP 4.5 allows to use data members in private clauses. Patch adds codegen support for 'linear' clause. llvm-svn: 263002	2016-03-09 09:49:00 +00:00
Carlo Bertolli	0ff587d4a4	[OPENMP] Codegen for distribute directive: fix bug in ordering of parameters. llvm-svn: 262833	2016-03-07 16:19:13 +00:00
Carlo Bertolli	fc35ad2bbc	Reapply r262741 [OPENMP] Codegen for distribute directive This patch provide basic implementation of codegen for teams directive, excluding all clauses except dist_schedule. It also fixes parts of AST reader/writer to enable correct pre-compiled header handling. http://reviews.llvm.org/D17170 llvm-svn: 262832	2016-03-07 16:04:49 +00:00
Samuel Antao	bf4d18d3d2	Revert r262741 - [OPENMP] Codegen for distribute directive Was causing a failure in one of the buildbot slaves. llvm-svn: 262744	2016-03-04 21:02:14 +00:00
Carlo Bertolli	4a56e3831d	[OPENMP] Codegen for distribute directive This patch provide basic implementation of codegen for teams directive, excluding all clauses except dist_schedule. It also fixes parts of AST reader/writer to enable correct pre-compiled header handling. http://reviews.llvm.org/D17170 llvm-svn: 262741	2016-03-04 20:24:58 +00:00
Carlo Bertolli	6ad7b5aff2	[OPENMP] firstprivate and private clauses of teams, host codegeneration Add code generation support for firstprivate and private clauses of teams on the host. Add extensive regression tests including lambda functions and vla testing. http://reviews.llvm.org/D17582 llvm-svn: 262663	2016-03-03 22:09:40 +00:00
Carlo Bertolli	430d8ecc55	Add code generation for teams directive inside target region llvm-svn: 262652	2016-03-03 20:34:23 +00:00
Samuel Antao	b68e2db8f9	[OpenMP] Code generation for teams - kernel launching Summary: This patch implements the launching of a target region in the presence of a nested teams region, i.e calls tgt_target_teams with the required arguments gathered from the enclosed teams directive. The actual codegen of the region enclosed by the teams construct will be contributed in a separate patch. Reviewers: hfinkel, arpith-jacob, kkwli0, carlo.bertolli, ABataev Subscribers: cfe-commits, caomhin, fraggamuffin Differential Revision: http://reviews.llvm.org/D17019 llvm-svn: 262625	2016-03-03 16:20:23 +00:00
Alexey Bataev	2bbf7217ea	[OPENMP 4.5] Initial support for data members in 'linear' clause. OpenMP 4.5 allows to privatize data members of current class in member functions. Patch adds initial support for privatization of data members in 'linear' clause, no codegen support. llvm-svn: 262578	2016-03-03 03:52:24 +00:00
Alexey Bataev	61205070c4	[OPENMP 4.5] Codegen for data members in 'reduction' clause. OpenMP 4.5 allows to privatize non-static data members of current class in non-static member functions. Patch supports codegen for non-static data members in 'reduction' clauses. llvm-svn: 262460	2016-03-02 04:57:40 +00:00
Alexey Bataev	005248ac8a	[OPENMP 4.5] Codegen for member decls in 'lastprivate' clause. OpenMP 4.5 allows to privatize non-static member decls in non-static member functions. Patch captures such decls by reference in general (for bitfields, by value) and then operates with this capture. For bitfields, at the end of codegen for lastprivates original bitfield is updated with the value of captured copy. llvm-svn: 261824	2016-02-25 05:25:57 +00:00
Richard Trieu	cc3949d99a	Remove use of builtin comma operator. Cleanup for upcoming Clang warning -Wcomma. No functionality change intended. llvm-svn: 261271	2016-02-18 22:34:54 +00:00
Alexey Bataev	8ffcc949b1	[OPENMP] Fix codegen for lastprivate loop counters. Patch fixes bug with codegen for lastprivate loop counters. Also it may improve performance for lastprivates calculations in some cases. llvm-svn: 261209	2016-02-18 13:48:15 +00:00
Alexey Bataev	417089fc7e	[OPENMP 4.5] Codegen support for data members in 'firstprivate' clause. Added codegen for captured data members in non-static member functions. llvm-svn: 261089	2016-02-17 13:19:37 +00:00
Alexey Bataev	3392d76081	[OPENMP] Improved handling of pseudo-captured expressions in OpenMP. Expressions inside 'schedule'\|'dist_schedule' clause must be captured in combined directives to avoid possible crash during codegen. Patch improves handling of such constructs llvm-svn: 260954	2016-02-16 11:18:12 +00:00
Alexey Bataev	cd8b6a2cf1	[OPENMP] Remove extra sync barriers for 'firstprivate' clause. Sync barrier will be emitted after generation of firstprivate variables only if one of the firstprivate vars is used in lastprivate clause. llvm-svn: 260877	2016-02-15 08:07:17 +00:00
Alexey Bataev	4244be25bd	[OPENMP] Rename OMPCapturedFieldDecl to OMPCapturedExprDecl, NFC. OMPCapturedExprDecl allows caopturing not only of fielddecls, but also other expressions. It also allows to simplify codegen for several clauses. llvm-svn: 260492	2016-02-11 05:35:55 +00:00
Alexey Bataev	31300ed0a5	[OPENMP 4.0] Fixed support of array sections/array subscripts. Codegen for array sections/array subscripts worked only for expressions with arrays as base. Patch fixes codegen for bases with pointer/reference types. llvm-svn: 259776	2016-02-04 11:27:03 +00:00
Arpith Chacko Jacob	05bebb578a	[OpenMP] Parsing + sema for target parallel for directive. Summary: This patch adds parsing + sema for the target parallel for directive along with testcases. Reviewers: ABataev Differential Revision: http://reviews.llvm.org/D16759 llvm-svn: 259654	2016-02-03 15:46:42 +00:00
Arpith Chacko Jacob	e955b3d3fe	[OpenMP] Parsing + sema for target parallel directive. Summary: This patch adds parsing + sema for the target parallel directive and its clauses along with testcases. Reviewers: ABataev Differential Revision: http://reviews.llvm.org/D16553 Rebased to current trunk and updated test cases. llvm-svn: 258832	2016-01-26 18:48:41 +00:00
Arpith Chacko Jacob	3cf89040b0	[OpenMP] Parsing + sema for defaultmap clause. Summary: This patch adds parsing + sema for the defaultmap clause associated with the target directive (among others). Reviewers: ABataev Differential Revision: http://reviews.llvm.org/D16527 llvm-svn: 258817	2016-01-26 16:37:23 +00:00
Alexey Bataev	1189bd0205	[OPENMP 4.5] Allow arrays in 'reduction' clause. OpenMP 4.5, alogn with array sections, allows to use variables of array type in reductions. llvm-svn: 258804	2016-01-26 12:20:39 +00:00
Alexey Bataev	3015bcc62a	[OPENMP] Generalize codegen for 'sections'-based directive. If 'sections' directive has only one sub-section, the code for 'single'-based directive was emitted. Removed this codegen, because it causes crashes in different cases. llvm-svn: 258495	2016-01-22 08:56:50 +00:00
Alexey Bataev	8524d15954	[OPENMP] Fix crash on reduction for complex variables. reworked codegen for reduction operation for complex types to avoid crash llvm-svn: 258394	2016-01-21 12:35:58 +00:00
Alexey Bataev	9619f04c0e	[OPENMP 4.0] Fix for codegen of 'cancel' directive within 'sections' directive. Allow to emit code for 'cancel' directive within 'sections' directive with single sub-section. llvm-svn: 258307	2016-01-20 12:29:47 +00:00
Samuel Antao	7259076032	[OpenMP] Parsing + sema for "target exit data" directive. Patch by Arpith Jacob. Thanks! llvm-svn: 258177	2016-01-19 20:04:50 +00:00
Samuel Antao	df67fc468e	[OpenMP] Parsing + sema for "target enter data" directive. Patch by Arpith Jacob. Thanks! llvm-svn: 258165	2016-01-19 19:15:56 +00:00
Carlo Bertolli	b4adf55e0f	Add OpenMP dist_schedule clause to distribute directive and related regression tests. llvm-svn: 257917	2016-01-15 18:50:31 +00:00
Samuel Antao	ee8fb302f5	[OpenMP] Reapply rL256842: [OpenMP] Offloading descriptor registration and device codegen. This patch attempts to fix the regressions identified when the patch was committed initially. Thanks to Michael Liao for identifying the fix in the offloading metadata generation related with side effects in evaluation of function arguments. llvm-svn: 256933	2016-01-06 13:42:12 +00:00
Samuel Antao	7d5de9a1ee	[OpenMP] Revert rL256842: [OpenMP] Offloading descriptor registration and device codegen. It was causing two regression, so I'm reverting until the cause is found. llvm-svn: 256858	2016-01-05 19:16:13 +00:00
Samuel Antao	4d5f0bbea1	[OpenMP] Offloading descriptor registration and device codegen. Summary: In order to offloading work properly two things need to be in place: - a descriptor with all the offloading information (device entry functions, and global variable) has to be created by the host and registered in the OpenMP offloading runtime library. - all the device functions need to be emitted for the device and a convention has to be in place so that the runtime library can easily map the host ID of an entry point with the actual function in the device. This patch adds support for these two things. However, only entry functions are being registered given that 'declare target' directive is not yet implemented. About offloading descriptor: The details of the descriptor are explained with more detail in http://goo.gl/L1rnKJ. Basically the descriptor will have fields that specify the number of devices, the pointers to where the device images begin and end (that will be defined by the linker), and also pointers to a the begin and end of table whose entries contain information about a specific entry point. Each entry has the type: ``` struct __tgt_offload_entry{ void addr; char name; int64_t size; }; ``` and will be implemented in a pre determined (ELF) section `.omp_offloading.entries` with 1-byte alignment, so that when all the objects are linked, the table is in that section with no padding in between entries (will be like a C array). The code generation ensures that all `__tgt_offload_entry` entries are emitted in the same order for both host and device so that the runtime can have the corresponding entries in both host and device in same index of the table, and efficiently implement the mapping. The resulting descriptor is registered/unregistered with the runtime library using the calls `__tgt_register_lib` and `__tgt_unregister_lib`. The registration is implemented in a high priority global initializer so that the registration happens always before any initializer (that can potentially include target regions) is run. The driver flag -omptargets= was created to specify a comma separated list of devices the user wants to support so that the new functionality can be exercised. Each device is specified with its triple. About target codegen: The target codegen is pretty much straightforward as it reuses completely the logic of the host version for the same target region. The tricky part is to identify the meaningful target regions in the device side. Unlike other programming models, like CUDA, there are no already outlined functions with attributes that mark what should be emitted or not. So, the information on what to emit is passed in the form of metadata in host bc file. This requires a new option to pass the host bc to the device frontend. Then everything is similar to what happens in CUDA: the global declarations emission is intercepted to check to see if it is an "interesting" declaration. The difference is that instead of checking an attribute, the metadata information in checked. Right now, there is only a form of metadata to pass information about the device entry points (target regions). A class `OffloadEntriesInfoManagerTy` was created to manage all the information and queries related with the metadata. The metadata looks like this: ``` !omp_offload.info = !{!0, !1, !2, !3, !4, !5, !6} !0 = !{i32 0, i32 52, i32 77426347, !"_ZN2S12r1Ei", i32 479, i32 13, i32 4} !1 = !{i32 0, i32 52, i32 77426347, !"_ZL7fstatici", i32 461, i32 11, i32 5} !2 = !{i32 0, i32 52, i32 77426347, !"_Z9ftemplateIiET_i", i32 444, i32 11, i32 6} !3 = !{i32 0, i32 52, i32 77426347, !"_Z3fooi", i32 99, i32 11, i32 0} !4 = !{i32 0, i32 52, i32 77426347, !"_Z3fooi", i32 272, i32 11, i32 3} !5 = !{i32 0, i32 52, i32 77426347, !"_Z3fooi", i32 127, i32 11, i32 1} !6 = !{i32 0, i32 52, i32 77426347, !"_Z3fooi", i32 159, i32 11, i32 2} ``` The fields in each metadata entry are (in sequence): Entry 1) an ID of the type of metadata - right now only zero is used meaning "OpenMP target region". Entry 2) a unique ID of the device where the input source file that contain the target region lives. Entry 3) a unique ID of the file where the input source file that contain the target region lives. Entry 4) a mangled name of the function that encloses the target region. Entries 5) and 6) line and column number where the target region was found. Entry 7) is the order the entry was emitted. Entry 2) and 3) are required to distinguish files that have the same function name. Entry 4) is required to distinguish different instances of the same declaration (usually templated ones) Entries 5) and 6) are required to distinguish the particular target region in body of the function (it is possible that a given target region is not an entry point - if clause can evaluate always to zero - and therefore we need to identify the "interesting" target regions. ) This patch replaces http://reviews.llvm.org/D12306. Reviewers: ABataev, hfinkel, tra, rjmccall, sfantao Subscribers: FBrygidyn, piotr.rak, Hahnfeld, cfe-commits Differential Revision: http://reviews.llvm.org/D12614 llvm-svn: 256842	2016-01-05 16:23:04 +00:00
Alexey Bataev	a6f2a14b94	[OPENMP 4.5] Codegen for 'schedule' clause with monotonic/nonmonotonic modifiers. OpenMP 4.5 adds support for monotonic/nonmonotonic modifiers in 'schedule' clause. Add codegen for these modifiers. llvm-svn: 256666	2015-12-31 06:52:34 +00:00
Alexey Bataev	6f531ec0a2	[OPENMP] Remove explicit call for implicit barrier #pragma omp parallel needs an implicit barrier that is currently done by an explicit call to __kmpc_barrier. However, the runtime already ensures a barrier in __kmpc_fork_call which currently leads to two barriers per region per thread. Differential Revision: http://reviews.llvm.org/D15561 llvm-svn: 255992	2015-12-18 10:24:53 +00:00
Alexey Bataev	8ef3141127	[OPENMP] Fix for http://llvm.org/PR25878 : Error compiling an OpenMP program OpenMP codegen tried to emit the code for its constructs even if it was detected as a dead-code. Added checks to ensure that the code is emitted if the code is not dead. llvm-svn: 255990	2015-12-18 07:58:25 +00:00
Alexey Bataev	fc57d1601d	[OPENMP 4.5] Codegen for 'hint' clause of 'critical' directive OpenMP 4.5 defines 'hint' clause for 'critical' directive. Patch adds codegen for this clause. llvm-svn: 255639	2015-12-15 10:55:09 +00:00
Alexey Bataev	28c75417b2	[OPENMP 4.5] Parsing/sema for 'hint' clause of 'critical' directive. OpenMP 4.5 adds 'hint' clause to critical directive. Patch adds parsing/semantic analysis for this clause. llvm-svn: 255625	2015-12-15 08:19:24 +00:00
Carlo Bertolli	6200a3d0f3	Add parse and sema of OpenMP distribute directive with all clauses except dist_schedule llvm-svn: 255498	2015-12-14 14:51:25 +00:00
Alexey Bataev	33c56402d8	[OPENMP] Fix debug info for 'atomic' construct. Debug info for statement under 'atomic' construct must point exactly to that statement, not the directive itself. llvm-svn: 255487	2015-12-14 09:26:19 +00:00
NAKAMURA Takumi	2d5c6ddf74	Revert r255001, "Add parse and sema for OpenMP distribute directive and all its clauses excluding dist_schedule." It causes memory leak. Some tests in test/OpenMP would fail. llvm-svn: 255094	2015-12-09 04:35:57 +00:00
Alexey Bataev	382967a2e4	[OPENMP 4.5] Parsing/sema for 'num_tasks' clause. OpenMP 4.5 adds directives 'taskloop' and 'taskloop simd'. These directives support clause 'num_tasks'. Patch adds parsing/semantic analysis for this clause. llvm-svn: 255008	2015-12-08 12:06:20 +00:00
Carlo Bertolli	b9bfa75b28	Add parse and sema for OpenMP distribute directive and all its clauses excluding dist_schedule. llvm-svn: 255001	2015-12-08 04:21:03 +00:00
Alexey Bataev	1fd4aed26b	[OPENMP 4.5] parsing/sema support for 'grainsize' clause. OpenMP 4.5 adds 'taksloop' and 'taskloop simd' directives, which have 'grainsize' clause. Patch adds parsing/sema analysis of this clause. llvm-svn: 254903	2015-12-07 12:52:51 +00:00
Alexey Bataev	b825de17b7	[OPENMP 4.5] parsing/sema support for 'nogroup' clause. OpenMP 4.5 adds 'taskloop' and 'taskloop simd' directives. These directives have new 'nogroup' clause. Patch adds basic parsing/sema support for this clause. llvm-svn: 254899	2015-12-07 10:51:44 +00:00
Serge Pavlov	3a5614599a	[PGO] Instrument only base constructors and destructors. Constructors and destructors may be represented by several functions in IR. Only base structors correspond to source code, others are small pieces of code and eventually call the base variant. In this case instrumentation of non-base structors has little sense, this fix remove it. Now profile data of a declaration corresponds to exactly one function in IR, it agrees with the current logic of the profile data loading. This change fixes PR24996. Differential Revision: http://reviews.llvm.org/D15158 llvm-svn: 254876	2015-12-06 14:32:39 +00:00
Alexey Bataev	0a6ed84a0d	[OPENMP 4.5] Parsing/sema support for 'omp taskloop simd' directive. OpenMP 4.5 adds directive 'taskloop simd'. Patch adds parsing/sema analysis for 'taskloop simd' directive and its clauses. llvm-svn: 254597	2015-12-03 09:40:15 +00:00
Samuel Antao	4af1b7b693	[OpenMP] Update target directive codegen to use 4.5 implicit data mappings. Summary: This patch implements the 4.5 specification for the implicit data maps. OpenMP 4.5 specification changes the default way data is captured into a target region. All the non-aggregate kinds are passed by value by default. This required activating the capturing by value during SEMA for the target region. All the non-aggregate values that can be encoded in the size of a pointer are properly casted and forwarded to the runtime library. On top of fixing the previous weird behavior for mapping pointers in nested data regions (an explicit map was always required), this also improves performance as the number of allocations/transactions to the device per non-aggregate map are reduced from two to only one - instead of passing a reference and the value, only the value passed. Explicit maps will be added later on once firstprivate, private, and map clauses' SEMA and parsing are available. Reviewers: hfinkel, rjmccall, ABataev Subscribers: cfe-commits, carlo.bertolli Differential Revision: http://reviews.llvm.org/D14940 llvm-svn: 254521	2015-12-02 17:44:43 +00:00
Alexey Bataev	a056935a2f	[OPENMP 4.5] Parsing/sema analysis for 'priority' clause. OpenMP 4.5 defines new clause 'priority' for 'task', 'taskloop' and 'taskloop simd' directives. Added parsing and sema analysis for 'priority' clause in 'task' and 'taskloop' directives. llvm-svn: 254398	2015-12-01 10:17:31 +00:00
Alexey Bataev	49f6e78d71	[OPENMP 4.5] Parsing/sema analysis for 'taskloop' directive. Adds initial parsing and semantic analysis for 'taskloop' directive. llvm-svn: 254367	2015-12-01 04:18:41 +00:00
Kelvin Li	a15fb1a110	[OpenMP] Parsing and sema support for thread_limit clause. http://reviews.llvm.org/D15029 llvm-svn: 254207	2015-11-27 18:47:36 +00:00
Kelvin Li	099bb8c65d	[OpenMP] Parsing and sema support for num_teams clause http://reviews.llvm.org/D14802 llvm-svn: 254019	2015-11-24 20:50:12 +00:00
Kelvin Li	0bff7afab5	[OpenMP] Parsing and sema support for map clause http://reviews.llvm.org/D14134 llvm-svn: 253849	2015-11-23 05:32:03 +00:00
NAKAMURA Takumi	811a09ec5b	CGStmtOpenMP.cpp: Prune redundant \param. [-Wdocumentation] llvm-svn: 249698	2015-10-08 16:41:42 +00:00
Alexey Bataev	f24e7b1f60	[OPENMP 4.1] Codegen for array sections/subscripts in 'reduction' clause. OpenMP 4.1 adds support for array sections/subscripts in 'reduction' clause. Patch adds codegen for this feature. llvm-svn: 249672	2015-10-08 09:10:53 +00:00
Samuel Antao	bed3c46632	[OpenMP] Target directive host codegen. This patch implements the outlining for offloading functions for code annotated with the OpenMP target directive. It uses a temporary naming of the outlined functions that will have to be updated later on once target side codegen and registration of offloading libraries is implemented - the naming needs to be made unique in the produced library. llvm-svn: 249148	2015-10-02 16:14:20 +00:00
Alexey Bataev	5f600d6a49	[OPENMP 4.1] Codegen for ‘simd’ clause in ‘ordered’ directive. Description. If the simd clause is specified, the ordered regions encountered by any thread will use only a single SIMD lane to execute the ordered regions in the order of the loop iterations. Restrictions. An ordered construct with the simd clause is the only OpenMP construct that can appear in the simd region. An ordered directive with ‘simd’ clause is generated as an outlined function and corresponding function call to prevent this part of code from vectorization later in backend. llvm-svn: 248772	2015-09-29 03:48:57 +00:00
Alexey Bataev	d14d1e6f25	[OPENMP 4.1] Add 'simd' clause for 'ordered' directive. Parsing and sema analysis for 'simd' clause in 'ordered' directive. Description If the simd clause is specified, the ordered regions encountered by any thread will use only a single SIMD lane to execute the ordered regions in the order of the loop iterations. Restrictions An ordered construct with the simd clause is the only OpenMP construct that can appear in the simd region llvm-svn: 248696	2015-09-28 06:39:35 +00:00
Alexey Bataev	346265e3bc	[OPENMP 4.1] Add 'threads' clause for '#pragma omp ordered'. OpenMP 4.1 extends format of '#pragma omp ordered'. It adds 3 additional clauses: 'threads', 'simd' and 'depend'. If no clause is specified, the ordered construct behaves as if the threads clause had been specified. If the threads clause is specified, the threads in the team executing the loop region execute ordered regions sequentially in the order of the loop iterations. The loop region to which an ordered region without any clause or with a threads clause binds must have an ordered clause without the parameter specified on the corresponding loop directive. llvm-svn: 248569	2015-09-25 10:37:12 +00:00
Alexey Bataev	87933c7ced	[OPENMP 4.0] Add 'if' clause for 'cancel' directive. Add parsing, sema analysis and codegen for 'if' clause in 'cancel' directive. llvm-svn: 247976	2015-09-18 08:07:34 +00:00
Alexey Bataev	25e5b44654	[OPENMP] Emit __kmpc_cancel_barrier() and code for 'cancellation point' only if 'cancel' is found. Patch improves codegen for OpenMP constructs. If the OpenMP region does not have internal 'cancel' construct, a call to 'void __kmpc_barrier()' runtime function is generated for all implicit/explicit barriers. If the region has inner 'cancel' directive, then ``` if (__kmpc_cancel_barrier()) exit from outer construct; ``` code is generated. Also, the code for 'canellation point' directive is not generated if parent directive does not have 'cancel' directive. llvm-svn: 247681	2015-09-15 12:52:43 +00:00
Alexey Bataev	c71a4099cf	[OPENMP] Preserve alignment of the original variables for the captured references. Patch makes codegen to preserve alignment of the shared variables captured and used in OpenMP regions. llvm-svn: 247401	2015-09-11 10:29:41 +00:00
Alexey Bataev	2377fe95c6	[OPENMP] Outlined function for parallel and other regions with list of captured variables. Currently all variables used in OpenMP regions are captured into a record and passed to outlined functions in this record. It may result in some poor performance because of too complex analysis later in optimization passes. Patch makes to emit outlined functions for parallel-based regions with a list of captured variables. It reduces code for 2*n GEPs, stores and loads at least. Codegen for task-based regions remains unchanged because runtime requires that all captured variables are passed in captured record. llvm-svn: 247251	2015-09-10 08:12:02 +00:00
John McCall	7f416cc426	Compute and preserve alignment more faithfully in IR-generation. Introduce an Address type to bundle a pointer value with an alignment. Introduce APIs on CGBuilderTy to work with Address values. Change core APIs on CGF/CGM to traffic in Address where appropriate. Require alignments to be non-zero. Update a ton of code to compute and propagate alignment information. As part of this, I've promoted CGBuiltin's EmitPointerWithAlignment helper function to CGF and made use of it in a number of places in the expression emitter. The end result is that we should now be significantly more correct when performing operations on objects that are locally known to be under-aligned. Since alignment is not reliably tracked in the type system, there are inherent limits to this, but at least we are no longer confused by standard operations like derived-to-base conversions and array-to-pointer decay. I've also fixed a large number of bugs where we were applying the complete-object alignment to a pointer instead of the non-virtual alignment, although most of these were hidden by the very conservative approach we took with member alignment. Also, because IRGen now reliably asserts on zero alignments, we should no longer be subject to an absurd but frustrating recurring bug where an incomplete type would report a zero alignment and then we'd naively do a alignmentAtOffset on it and emit code using an alignment equal to the largest power-of-two factor of the offset. We should also now be emitting much more aggressive alignment attributes in the presence of over-alignment. In particular, field access now uses alignmentAtOffset instead of min. Several times in this patch, I had to change the existing code-generation pattern in order to more effectively use the Address APIs. For the most part, this seems to be a strict improvement, like doing pointer arithmetic with GEPs instead of ptrtoint. That said, I've tried very hard to not change semantics, but it is likely that I've failed in a few places, for which I apologize. ABIArgInfo now always carries the assumed alignment of indirect and indirect byval arguments. In order to cut down on what was already a dauntingly large patch, I changed the code to never set align attributes in the IR on non-byval indirect arguments. That is, we still generate code which assumes that indirect arguments have the given alignment, but we don't express this information to the backend except where it's semantically required (i.e. on byvals). This is likely a minor regression for those targets that did provide this information, but it'll be trivial to add it back in a later patch. I partially punted on applying this work to CGBuiltin. Please do not add more uses of the CreateDefaultAligned{Load,Store} APIs; they will be going away eventually. llvm-svn: 246985	2015-09-08 08:05:57 +00:00
Alexey Bataev	caacd53dde	[OPENMP] Fix for http://llvm.org/PR24674 : assertion failed and and abort trap Fix processing of shared variables with reference types in OpenMP constructs. Previously, if the variable was not marked in one of the private clauses, the reference to this variable was emitted incorrectly and caused an assertion later. llvm-svn: 246846	2015-09-04 11:26:21 +00:00
Alexey Bataev	7371aa365c	[OPENMP 4.1] Codegen for extended format of 'if' clause. Fixed codegen for extended format of 'if' clauses with special 'directive-name-modifier' + ast-print tests for extended format of 'if' clause. llvm-svn: 246748	2015-09-03 08:45:56 +00:00
Benjamin Kramer	fc600dc2ec	[OpenMP] Make the filetered clause iterator a real iterator and type safe. This replaces the filtered generic iterator with a type-specfic one based on dyn_cast instead of comparing the kind enum. This allows us to use range-based for loops and eliminates casts. No functionality change intended. llvm-svn: 246384	2015-08-30 15:12:28 +00:00
Alexey Bataev	45bfad51d8	[OPENMP 4.1] Add codegen for 'simdlen' clause. Add emission of metadata for simd loops in presence of 'simdlen' clause. If 'simdlen' clause is provided without 'safelen' clause, the vectorizer width for the loop is set to value of 'simdlen' clause + all read/write ops in loop are marked with '!llvm.mem.parallel_loop_access' metadata. If 'simdlen' clause is provided along with 'safelen' clause, the vectorizer width for the loop is set to value of 'simdlen' clause + all read/write ops in loop are not marked with '!llvm.mem.parallel_loop_access' metadata. If 'safelen' clause is provided without 'simdlen' clause, the vectorizer width for the loop is set to value of 'safelen' clause + all read/write ops in loop are not marked with '!llvm.mem.parallel_loop_access' metadata. llvm-svn: 245697	2015-08-21 12:19:04 +00:00
Alexey Bataev	66b15b505f	[OPENMP 4.1] Initial support for 'simdlen' clause. Add parsing/sema analysis for 'simdlen' clause in simd directives. Also add check that if both 'safelen' and 'simdlen' clauses are specified, the value of 'simdlen' parameter is less than the value of 'safelen' parameter. llvm-svn: 245692	2015-08-21 11:14:16 +00:00
Alexey Bataev	bd9fec1eaa	[OPENMP 4.1] Allow variables with reference types in private clauses. OpenMP 4.1 allows to use variables with reference types in all private clauses (private, firstprivate, lastprivate, linear etc.). Patch allows to use such variables and fixes codegen for linear variables with reference types. llvm-svn: 245268	2015-08-18 06:47:21 +00:00
Alexey Bataev	b08f89ffc1	[OPENMP] Fix for http://llvm.org/PR24371 : Assert failure compiling blender 2.75. blender uses statements expression in condition of the loop under control of the '#pragma omp parallel for'. This condition is used several times in different expressions required for codegen of the loop directive. If there are some variables defined in statement expression, it fires an assert during codegen because of redefinition of the same variables. We have to rebuild several expression to be sure that all variables are unique. llvm-svn: 245041	2015-08-14 12:25:37 +00:00
Michael Wong	b5c1698994	This patch fixes the assert in emitting captured code in the target data construct. This is on behalf of Kelvin Li. http://reviews.llvm.org/D11475 llvm-svn: 244569	2015-08-11 04:52:01 +00:00
Filipe Cabecinhas	7af183d841	Propagate SourceLocations through to get a Loc on float_cast_overflow Summary: float_cast_overflow is the only UBSan check without a source location attached. This patch propagates SourceLocations where necessary to get them to the EmitCheck() call. Reviewers: rsmith, ABataev, rjmccall Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D11757 llvm-svn: 244568	2015-08-11 04:19:28 +00:00
Michael Wong	e710d5459e	This patch commits OpenMP 4 target device clauses This is committed on behalf of Kelvin Li http://reviews.llvm.org/D11469?id=31227 llvm-svn: 244325	2015-08-07 16:16:36 +00:00
Alexey Bataev	a889917493	[OPENMP 4.1] Allow references in init expression for loop-based constructs. OpenMP 4.1 allows to use variables with reference types in private clauses and, therefore, in init expressions of the cannonical loop forms. llvm-svn: 244209	2015-08-06 12:30:57 +00:00
Samuel Antao	9c75cfe976	[OpenMP] Add capture for threadprivate variables used in copyin clause if TLS is enabled in OpenMP code generation. llvm-svn: 243277	2015-07-27 16:38:06 +00:00
Michael Wong	65f367fcbb	Commit for http://reviews.llvm.org/D10765 for OpenMP 4 target data directive parsing and sema. This commit is on behalf of Kelvin Li. llvm-svn: 242785	2015-07-21 13:44:28 +00:00
Tyler Nowicki	da46d0ea8c	Make the variable names match the name of the metadata they control. Rename Vectorizer to Vectorize and VectorizeUnroll to InterleaveCount. llvm-svn: 242241	2015-07-14 23:03:09 +00:00
Alexey Bataev	7d5d33ea33	[OPENMP 4.0] Codegen for 'omp cancel' directive. Add the next codegen for 'omp cancel' directive: if (__kmpc_cancel()) { __kmpc_cancel_barrier(); <exit construct>; } llvm-svn: 241429	2015-07-06 05:50:32 +00:00
Alexey Bataev	81c7ea0ec3	[OPENMP 4.0] Fixed codegen for 'cancellation point' construct. Generate the next code for 'cancellation point': if (__kmpc_cancellationpoint()) { __kmpc_cancel_barrier(); <exit construct>; } llvm-svn: 241336	2015-07-03 09:56:58 +00:00
Benjamin Kramer	642f173ae9	Switch users of the 'for (StmtRange range = stmt->children(); range; ++range)‘ pattern to range for loops. The pattern was born out of the lack of range-based for loops in C++98 and is somewhat obscure. No functionality change intended. llvm-svn: 241300	2015-07-02 21:03:14 +00:00
Alexey Bataev	80909878ad	[OPENMP 4.0] Initial support for 'omp cancel' construct. Implemented parsing/sema analysis + (de)serialization. llvm-svn: 241253	2015-07-02 11:25:17 +00:00
Alexey Bataev	0f34da12e4	[OPENMP 4.0] Codegen for 'cancellation point' directive. The next code is generated for this construct: ``` if (__kmpc_cancellationpoint(ident_t *loc, kmp_int32 global_tid, kmp_int32 cncl_kind) != 0) <exit from outer innermost construct>; ``` llvm-svn: 241239	2015-07-02 04:17:07 +00:00
Alexey Bataev	0039651304	[OPENMP] Introduced type trait "__builtin_omp_required_simd_align" for default simd alignment. Adds type trait "__builtin_omp_required_simd_align" after discussions here http://reviews.llvm.org/D9894 Differential Revision: http://reviews.llvm.org/D10597 llvm-svn: 241237	2015-07-02 03:40:19 +00:00
Alexey Bataev	6d4ed05830	[OPENMP 4.0] Initial support for 'omp cancellation point' construct. Add parsing and sema analysis for 'omp cancellation point' directive. llvm-svn: 241145	2015-07-01 06:57:41 +00:00
Alexey Bataev	1d2353d4f3	[OPENMP] Codegen for 'depend' clause (OpenMP 4.0). If task directive has associated 'depend' clause then function kmp_int32 __kmpc_omp_task_with_deps ( ident_t loc_ref, kmp_int32 gtid, kmp_task_t new_task, kmp_int32 ndeps, kmp_depend_info_t dep_list,kmp_int32 ndeps_noalias, kmp_depend_info_t noalias_dep_list) must be called instead of __kmpc_omp_task(). If this directive has associated 'if' clause then also before a call of kmpc_omp_task_begin_if0() a function void __kmpc_omp_wait_deps ( ident_t loc_ref, kmp_int32 gtid, kmp_int32 ndeps, kmp_depend_info_t dep_list, kmp_int32 ndeps_noalias, kmp_depend_info_t *noalias_dep_list) must be called. Array sections are not supported yet. llvm-svn: 240532	2015-06-24 11:01:36 +00:00
Alexey Bataev	1c2cfbc3ea	[OPENMP] Initial support for 'depend' clause (4.0). Parsing and sema analysis (without support for array sections in arguments) for 'depend' clause (used in 'task' directive, OpenMP 4.0). llvm-svn: 240409	2015-06-23 14:25:19 +00:00
Alexey Bataev	7f210c6dab	[OPENMP] Codegen for 'proc_bind' clause (4.0). Adds emission of the code for 'proc_bind(master\|close\|spread)' clause: call void @__kmpc_push_proc_bind(<loc>, i32 thread_id, i32 4\|3\|2) llvm-svn: 240018	2015-06-18 13:40:03 +00:00
Alexey Bataev	c30dd2daf9	[OPENMP] Support for '#pragma omp taskgroup' directive. Added parsing, sema analysis and codegen for '#pragma omp taskgroup' directive (OpenMP 4.0). The code for directive is generated the following way: #pragma omp taskgroup <body> void __kmpc_taskgroup(<loc>, thread_id); <body> void __kmpc_end_taskgroup(<loc>, thread_id); llvm-svn: 240011	2015-06-18 12:14:09 +00:00
Alexey Bataev	3b5b5c492e	[OPENMP] Add support for 'omp parallel for' directive. Codegen for this directive is a combined codegen for 'omp parallel' region with 'omp for simd' region inside. Clauses are supported. llvm-svn: 240006	2015-06-18 10:10:12 +00:00
Alexey Bataev	58e5bdb091	[OPENMP] Add support for 'omp for simd' directive. Added codegen for combined 'omp for simd' directives, that is a combination of 'omp for' directive followed by 'omp simd' directive. Includes support for all clauses. llvm-svn: 239990	2015-06-18 04:45:29 +00:00
Alexey Bataev	cbdcbb7690	[OPENMP] Code reformatting for omp simd codegen, NFC. llvm-svn: 239889	2015-06-17 07:45:51 +00:00
Alexey Bataev	89e7e8eb0e	[OPENMP] Supported reduction clause in omp simd construct. The following code is generated for reduction clause within 'omp simd' loop construct: #pragma omp simd reduction(op:var) for (...) <body> alloca priv_var priv_var = <initial reduction value>; <loop_start>: <body> // references to original 'var' are replaced by 'priv_var' <loop_end>: var op= priv_var; llvm-svn: 239881	2015-06-17 06:21:39 +00:00
Alexey Bataev	fc087ecc05	[OPENMP] Support lastprivate clause in omp simd directive. Added codegen for lastprivate clauses within simd loop-based directives. llvm-svn: 239813	2015-06-16 13:14:42 +00:00
Alexey Bataev	ae05c29ab5	[OPENMP] Remove last iteration separation for loop-based constructs. Previously the last iteration for simd loop-based OpenMP constructs were generated as a separate code. This feature is not required and codegen is simplified. llvm-svn: 239810	2015-06-16 11:59:36 +00:00
Alexey Bataev	6e8248fdad	[OPENMP] Fox for http://llvm.org/PR23663 : OpenMP crash Destroy RuntimeCleanupScope before generation of termination instruction in parallel loop precondition. llvm-svn: 239524	2015-06-11 10:53:56 +00:00
Alexey Bataev	3ae88e2124	[OPENMP] Prepare codegen for privates in tasks for non-capturing of privates in CapturedStmt. Reworked codegen for privates in tasks: call @kmpc_omp_task_alloc(); ... call @kmpc_omp_task(task_proxy); void map_privates(.privates_rec. privs, type1 * priv1_ref, ..., typen *privn_ref) { priv1_ref = &privs->private1; ... privn_ref = &privs->privaten; ret void } i32 task_entry(i32 ThreadId, i32 PartId, void privs, void (void, ...) map_privates, shareds captures) { type1 priv1; ... typen privn; call map_privates(privs, priv1, ..., privn); <Task body with priv1, .., privn instead of the captured variables>. ret i32 } i32 task_proxy(i32 ThreadId, kmp_task_t_with_privates *tt) { call task_entry(ThreadId, tt->task_data.PartId, &tt->privates, map_privates, tt->task_data.shareds); } llvm-svn: 238010	2015-05-22 08:56:35 +00:00
Alexey Bataev	5129d3a4f5	[OPENMP] Fixed codegen for parameters privatization. For parameters we shall take a derived type of parameters, not the original one. llvm-svn: 237882	2015-05-21 09:47:46 +00:00
Alexey Bataev	7a228ff439	[OPENMP] Fixed codegen for lastprivate LCV in worksharing constructs. If loop control variable in a worksharing construct is marked as lastprivate, we should copy last calculated value of private counter back to original variable. llvm-svn: 237879	2015-05-21 07:59:51 +00:00
Alexey Bataev	d7589ffe1d	[OPENMP] Fix codegen for ordered loop directives. loops with ordered clause must be generated the same way as dynamic loops, but with static scheduleing. llvm-svn: 237788	2015-05-20 13:12:48 +00:00
Alexey Bataev	1d9c15cf18	[OPENMP] Fixed codegen for copying/initialization of array variables/parameters. This modification generates proper copyin/initialization sequences for array variables/parameters. Before they were considered as pointers, not arrays. llvm-svn: 237691	2015-05-19 12:31:28 +00:00
Alexey Bataev	d130fd17f1	[OPENMP] Fixed codegen for firstprivate variables, also marked as lastprivate. In some rare cases shared copies of lastprivate/firstprivate variables were not updated after the loop directive. llvm-svn: 237243	2015-05-13 10:23:02 +00:00
Alexey Bataev	040d540940	[OPENMP] Fixed support for 'schedule' clause with non-constant chunk size. 'schedule' clause for combined directives requires additional processing. Special helper variable is generated, that is captured in the outlined parallel region for 'parallel for' region. This captured variable is used to store chunk expression from the 'schedule' clause in this 'parallel for' region. llvm-svn: 237100	2015-05-12 08:35:28 +00:00
Alexey Bataev	9d541a72e8	[OPENMP] Fixed atomic construct with non-integer expressions. Do not emit 'atomicrmw' instruction for simple atomic constructs with non-integer expressions. llvm-svn: 236828	2015-05-08 11:47:16 +00:00
Alexey Bataev	39f915b8f4	[OPENMP] Code cleanup for capturing of variables in OpenMP regions. llvm-svn: 236821	2015-05-08 10:41:21 +00:00
Alexey Bataev	53223c986c	[OPENMP] Generate !llvm.mem.loop_parallel_access metadata for loops with dynamic/guided scheduling. Inner bodies of OpenMP worksharing loop-based constructs with dynamic or guided scheduling are allowed to be marked with !llvm.mem.parallel_loop_access metadata for better optimization. Worksharing constructs with static scheduling cannot be marked this way (according to OpenMP standard "A data dependence between the same logical iterations in two such loops is guaranteed"). Constructs with auto and runtime scheduling are also not marked because automatically chosen scheduling may be static also. Differential Revision: http://reviews.llvm.org/D9518 llvm-svn: 236693	2015-05-07 04:25:17 +00:00
Alexey Bataev	9e03404d8d	[OPENMP] Codegen for 'firstprivate' clause in 'task' directive. For tasks codegen for private/firstprivate variables are different rather than for other directives. 1. Build an internal structure of privates for each private variable: struct .kmp_privates_t. { Ty1 var1; ... Tyn varn; }; 2. Add a new field to kmp_task_t type with list of privates. struct kmp_task_t { void * shareds; kmp_routine_entry_t routine; kmp_int32 part_id; kmp_routine_entry_t destructors; .kmp_privates_t. privates; }; 3. Create a function with destructors calls for all privates after end of task region. kmp_int32 .omp_task_destructor.(kmp_int32 gtid, kmp_task_t tt) { ~Destructor(&tt->privates.var1); ... ~Destructor(&tt->privates.varn); return 0; } 4. Perform initialization of all firstprivate fields (by simple copying for POD data, copy constructor calls for classes) + provide address of a destructor function after kmpc_omp_task_alloc() and before kmpc_omp_task() calls. kmp_task_t new_task = __kmpc_omp_task_alloc(ident_t , kmp_int32 gtid, kmp_int32 flags, size_t sizeof_kmp_task_t, size_t sizeof_shareds, kmp_routine_entry_t task_entry); CopyConstructor(new_task->privates.var1, new_task->shareds.var1_ref); new_task->shareds.var1_ref = &new_task->privates.var1; ... CopyConstructor(new_task->privates.varn, new_task->shareds.varn_ref); new_task->shareds.varn_ref = &new_task->privates.varn; new_task->destructors = .omp_task_destructor.; kmp_int32 __kmpc_omp_task(ident_t , kmp_int32 gtid, kmp_task_t new_task) Differential Revision: http://reviews.llvm.org/D9370 llvm-svn: 236479	2015-05-05 04:05:12 +00:00
Benjamin Kramer	439ee9d7bc	Make helper functions static. NFC. llvm-svn: 236315	2015-05-01 13:59:53 +00:00
Alexey Bataev	36c1eb95e0	[OPENMP] Codegen for 'private' clause in 'task' directive. For tasks codegen for private/firstprivate variables are different rather than for other directives. 1. Build an internal structure of privates for each private variable: struct .kmp_privates_t. { Ty1 var1; ... Tyn varn; }; 2. Add a new field to kmp_task_t type with list of privates. struct kmp_task_t { void * shareds; kmp_routine_entry_t routine; kmp_int32 part_id; kmp_routine_entry_t destructors; .kmp_privates_t. privates; }; 3. Create a function with destructors calls for all privates after end of task region. kmp_int32 .omp_task_destructor.(kmp_int32 gtid, kmp_task_t tt) { ~Destructor(&tt->privates.var1); ... ~Destructor(&tt->privates.varn); return 0; } 4. Perform default initialization of all private fields (no initialization for POD data, default constructor calls for classes) + provide address of a destructor function after kmpc_omp_task_alloc() and before kmpc_omp_task() calls. kmp_task_t new_task = __kmpc_omp_task_alloc(ident_t , kmp_int32 gtid, kmp_int32 flags, size_t sizeof_kmp_task_t, size_t sizeof_shareds, kmp_routine_entry_t task_entry); DefaultConstructor(new_task->privates.var1); new_task->shareds.var1_ref = &new_task->privates.var1; ... DefaultConstructor(new_task->privates.varn); new_task->shareds.varn_ref = &new_task->privates.varn; new_task->destructors = .omp_task_destructor.; kmp_int32 __kmpc_omp_task(ident_t , kmp_int32 gtid, kmp_task_t new_task) Differential Revision: http://reviews.llvm.org/D9322 llvm-svn: 236207	2015-04-30 06:51:57 +00:00
Alexey Bataev	6111469a4a	[OPENMP] Fix crash on loop control vars explicitly marked as private. It is allowed to mark loop control vars as private in 'private' or 'lastprivate' clause, so no need to assert here. llvm-svn: 235985	2015-04-28 13:20:05 +00:00
Alexey Bataev	c925aa3ab8	[OPENMP] Simplified iteration over clauses, NFC. llvm-svn: 235838	2015-04-27 08:00:32 +00:00
Alexey Bataev	8b8e202a33	[OPENMP] Codegen for 'taskwait' directive. Emit the following code for 'taskwait' directive within tied task: call i32 @__kmpc_omp_taskwait(<loc>, i32 <thread_id>); Differential Revision: http://reviews.llvm.org/D9245 llvm-svn: 235836	2015-04-27 05:22:09 +00:00
Alexey Bataev	a89adf22db	[OPENMP] Codegen for 'reduction' clause in 'sections' directive. Emit a code for reduction clause. Next code should be emitted for reductions: static kmp_critical_name lock = { 0 }; void reduce_func(void lhs[<n>], void rhs[<n>]) { (Type0)lhs[0] = ReductionOperation0((Type0)lhs[0], (Type0)rhs[0]); ... (Type<n>-1)lhs[<n>-1] = ReductionOperation<n>-1((Type<n>-1)lhs[<n>-1], (Type<n>-1)rhs[<n>-1]); } ... void RedList[<n>] = {&<RHSExprs>[0], ..., &<RHSExprs>[<n>-1]}; switch (__kmpc_reduce{_nowait}(<loc>, <gtid>, <n>, sizeof(RedList), RedList, reduce_func, &<lock>)) { case 1: <LHSExprs>[0] = ReductionOperation0(<LHSExprs>[0], <RHSExprs>[0]); ... <LHSExprs>[<n>-1] = ReductionOperation<n>-1(<LHSExprs>[<n>-1], <RHSExprs>[<n>-1]); __kmpc_end_reduce{_nowait}(<loc>, <gtid>, &<lock>); break; case 2: Atomic(<LHSExprs>[0] = ReductionOperation0(<LHSExprs>[0], <RHSExprs>[0])); ... Atomic(<LHSExprs>[<n>-1] = ReductionOperation<n>-1(<LHSExprs>[<n>-1], *<RHSExprs>[<n>-1])); break; default:; } Reduction variables are a kind of a private variables, they have private copies, but initial values are chosen in accordance with the reduction operation. If sections directive has only single section, then original shared variables are used instead with barrier at the end of the directive. Differential Revision: http://reviews.llvm.org/D9242 llvm-svn: 235835	2015-04-27 05:04:13 +00:00
Alexey Bataev	9efc03b6f7	[OPENMP] Codegen for 'lastprivate' clause in 'sections' directive. #pragma omp sections lastprivate(<var>) <BODY>; This construct is translated into something like: <last_iter> = alloca i32 <init for lastprivates>; <last_iter> = 0 ; No initializer for simple variables or a default constructor is called for objects. ; For arrays perform element by element initialization by the call of the default constructor. ... OMP_FOR_START(...,<last_iter>, ..); sets <last_iter> to 1 if this is the last iteration. <BODY> ... OMP_FOR_END if (<last_iter> != 0) { <final copy for lastprivate>; Update original variable with the lastprivate value. } call __kmpc_cancel_barrier() ; an implicit barrier to avoid possible data race. If there is only one section, there is no special code generation, original shared variables are used + barrier is emitted at the end of the directive. Differential Revision: http://reviews.llvm.org/D9240 llvm-svn: 235834	2015-04-27 04:34:03 +00:00
Alexey Bataev	7387083d95	[OPENMP] Codegen for 'private' clause in 'sections' directive. If there are 2 or more sections in a 'section' directive the following code is generated: <default init for privates> @__kmpc_for_static_init_4(); <BODY for sections directive> @__kmpc_for_static_fini() If there is only one section, the following code is generated: if (@__kmpc_single()) { <default init for privates> @__kmpc_end_single(); } Differential Revision: http://reviews.llvm.org/D9239 llvm-svn: 235833	2015-04-27 04:12:12 +00:00
Alexey Bataev	59c654aa43	[OPENMP] Codegen for 'private' clause in 'single' directive. Emit the following code for 'single' directive with 'private' clause: if (@__kmpc_single()) { <default init for privates> @__kmpc_end_single(); } Differential Revision: http://reviews.llvm.org/D9238 llvm-svn: 235832	2015-04-27 03:48:52 +00:00
Alexey Bataev	5521d78532	[OPENMP] Codegen for 'firstprivate' clause in 'single' directive. Emit the following code for 'single' directive with 'firtstprivate' clause: if (@__kmpc_single()) { <init for firstprivates> @__kmpc_end_single(); } @__kmpc_cancel_barrier(); // To avoid data race in firstprivate init Differential Revision: http://reviews.llvm.org/D9223 llvm-svn: 235694	2015-04-24 04:21:15 +00:00
Alexey Bataev	8b72566eec	[OPENMP] Do not emit implicit barrier for single directive with 'copyprivate' clause(s). Runtime function for 'copyprivate' directive generates implicit barriers, so no need to emit it. Differential Revision: http://reviews.llvm.org/D9215 llvm-svn: 235692	2015-04-24 04:00:39 +00:00
Alexey Bataev	2cb9b95adf	[OPENMP] Codegen for 'firstprivate' clause in 'sections' directive. If there are 2 or more sections in a 'section' directive the following code is generated: <init for firstprivates> @__kmpc_cancel_barrier();// To avoid data race in firstprivate init @__kmpc_for_static_init_4(); <BODY for sections directive> @__kmpc_for_static_fini() If there is only one section, the following code is generated: if (@__kmpc_single()) { <init for firstprivates> @__kmpc_end_single(); } @__kmpc_cancel_barrier(); // To avoid data race in firstprivate init Differential Revision: http://reviews.llvm.org/D9214 llvm-svn: 235691	2015-04-24 03:37:03 +00:00
Justin Bogner	66242d6c5e	InstrProf: Stop using RegionCounter outside of CodeGenPGO (NFC) The RegionCounter type does a lot of legwork, but most of it is only meaningful within the implementation of CodeGenPGO. The uses elsewhere in CodeGen generally just want to increment or read counters, so do that directly. llvm-svn: 235664	2015-04-23 23:06:47 +00:00
Alexey Bataev	5e018f9e29	[OPENMP] Codegen for 'atomic capture'. Adds codegen for 'atomic capture' constructs with the following forms of expressions/statements: v = x binop= expr; v = x++; v = ++x; v = x--; v = --x; v = x = x binop expr; v = x = expr binop x; {v = x; x = binop= expr;} {v = x; x++;} {v = x; ++x;} {v = x; x--;} {v = x; --x;} {x = x binop expr; v = x;} {x binop= expr; v = x;} {x++; v = x;} {++x; v = x;} {x--; v = x;} {--x; v = x;} {x = x binop expr; v = x;} {x = expr binop x; v = x;} {v = x; x = expr;} If x and expr are integer and binop is associative or x is a LHS in a RHS of the assignment expression, and atomics are allowed for type of x on the target platform atomicrmw instruction is emitted. Otherwise compare-and-swap sequence is emitted. Update of 'v' is not required to be be atomic with respect to the read or write of the 'x'. bb: ... atomic load <x> cont: <expected> = phi [ <x>, label %bb ], [ <new_failed>, %cont ] <desired> = <expected> binop <expr> <res> = cmpxchg atomic &<x>, desired, expected <new_failed> = <res>.field1; br <res>field2, label %exit, label %cont exit: atomic store <old/new x>, <v> ... Differential Revision: http://reviews.llvm.org/D9049 llvm-svn: 235573	2015-04-23 06:35:10 +00:00
Alexey Bataev	1d67713b44	[OPENMP] Codegen for 'if' clause in 'task' directive. If condition evaluates to true, the code executes task by calling @__kmpc_omp_task() runtime function. If condition evaluates to false, the code executes serial version of the code by executing the following code: call void @__kmpc_omp_task_begin_if0(<loc>, <threadid>, <task_t_ptr, returned by @__kmpc_omp_task_alloc()>); proxy_task_entry(<gtid>, <task_t_ptr, returned by @__kmpc_omp_task_alloc()>); call void @__kmpc_omp_task_complete_if0(<loc>, <threadid>, <task_t_ptr, returned by @__kmpc_omp_task_alloc()>); Also it checks if the condition is constant and if it is constant it evaluates its value and then generates either parallel version of the code (if the condition evaluates to true), or the serial version of the code (if the condition evaluates to false). Differential Revision: http://reviews.llvm.org/D9143 llvm-svn: 235507	2015-04-22 13:57:31 +00:00
Alexey Bataev	7ebe5fddac	[OPENMP] Codegen for 'reduction' clause in 'for' directive. Emit a code for reduction clause. Next code should be emitted for reductions: static kmp_critical_name lock = { 0 }; void reduce_func(void lhs[<n>], void rhs[<n>]) { (Type0)lhs[0] = ReductionOperation0((Type0)lhs[0], (Type0)rhs[0]); ... (Type<n>-1)lhs[<n>-1] = ReductionOperation<n>-1((Type<n>-1)lhs[<n>-1], (Type<n>-1)rhs[<n>-1]); } ... void RedList[<n>] = {&<RHSExprs>[0], ..., &<RHSExprs>[<n>-1]}; switch (__kmpc_reduce{_nowait}(<loc>, <gtid>, <n>, sizeof(RedList), RedList, reduce_func, &<lock>)) { case 1: <LHSExprs>[0] = ReductionOperation0(<LHSExprs>[0], <RHSExprs>[0]); ... <LHSExprs>[<n>-1] = ReductionOperation<n>-1(<LHSExprs>[<n>-1], <RHSExprs>[<n>-1]); __kmpc_end_reduce{_nowait}(<loc>, <gtid>, &<lock>); break; case 2: Atomic(<LHSExprs>[0] = ReductionOperation0(<LHSExprs>[0], <RHSExprs>[0])); ... Atomic(<LHSExprs>[<n>-1] = ReductionOperation<n>-1(<LHSExprs>[<n>-1], *<RHSExprs>[<n>-1])); break; default:; } Reduction variables are a kind of a private variables, they have private copies, but initial values are chosen in accordance with the reduction operation. Differential Revision: http://reviews.llvm.org/D9139 llvm-svn: 235506	2015-04-22 13:43:03 +00:00
Alexey Bataev	50a6458870	[OPENMP] Codegen for 'private' clause in 'for' directive. This patch generates helper variables which used as a private copies of the corresponding original variables inside an OpenMP 'for' directive. These generated variables are initialized by default (with the default constructor, if any). In OpenMP region references to original variables are replaced by the references to these private helper variables. Differential Revision: http://reviews.llvm.org/D9106 llvm-svn: 235503	2015-04-22 12:24:45 +00:00
Alexey Bataev	62dbb979c0	[OPENMP] Fix use of unsigned counters in loops with zero trip count. Patch fixes bugs in codegen for loops with unsigned counters and zero trip count. Previously preconditions for all loops were built using logic (Upper - Lower) > 0. But if the loop is a loop with zero trip count, then Upper - Lower is < 0 only for signed integer, for unsigned we're running into an underflow situation. In this patch we're using original Lower<Upper condition to check that loop body can be executed at least once. Also this allows to skip code generation for loops, if it is known that preconditions for the loop are always false. Differential Revision: http://reviews.llvm.org/D9103 llvm-svn: 235500	2015-04-22 11:59:37 +00:00
Alexey Bataev	98eb6e3d41	[OPENMP] Codegen for 'ordered' directive. Add codegen for 'ordered' directive: __kmpc_ordered(ident_t , gtid); <associated statement>; __kmpc_end_ordered(ident_t , gtid); Also for 'for' directives with the dynamic scheduling and an 'ordered' clause added a call to '__kmpc_dispatch_fini_(4\|8)[u]()' function after increment expression for loop control variable: while(__kmpc_dispatch_next(&LB, &UB)) { idx = LB; while (idx <= UB) { BODY; ++idx; __kmpc_dispatch_fini_(4\|8)[u](); // For ordered loops only. } // inner loop } Differential Revision: http://reviews.llvm.org/D9070 llvm-svn: 235496	2015-04-22 11:15:40 +00:00
Benjamin Kramer	5df7c1a4eb	Make helper function static. NFC. llvm-svn: 235253	2015-04-18 10:00:10 +00:00
Alexey Bataev	f56f98c925	[OPENMP] Codegen for 'copyin' clause in 'parallel' directive. Emits the following code for the clause at the beginning of the outlined function for implicit threads: if (<not a master thread>) { ... <thread local copy of var> = <master thread local copy of var>; ... } <sync point>; Checking for a non-master thread is performed by comparing of the address of the thread local variable with the address of the master's variable. Master thread always uses original variables, so you always know the address of the variable in the master thread. Differential Revision: http://reviews.llvm.org/D9026 llvm-svn: 235075	2015-04-16 05:39:01 +00:00
Alexey Bataev	38e8953352	[OPENMP] Codegen for 'lastprivate' clause in 'for' directive. #pragma omp for lastprivate(<var>) for (i = a; i < b; ++b) <BODY>; This construct is translated into something like: <last_iter> = alloca i32 <lastprivate_var> = alloca <type> <last_iter> = 0 ; No initializer for simple variables or a default constructor is called for objects. ; For arrays perform element by element initialization by the call of the default constructor. ... OMP_FOR_START(...,<last_iter>, ..); sets <last_iter> to 1 if this is the last iteration. <BODY> ... OMP_FOR_END if (<last_iter> != 0) { <var> = <lastprivate_var> ; Update original variable with the lastprivate value. } call __kmpc_cancel_barrier() ; an implicit barrier to avoid possible data race. Differential Revision: http://reviews.llvm.org/D8658 llvm-svn: 235074	2015-04-16 04:54:05 +00:00
Alexey Bataev	69c62a9bdb	[OPENMP] Codegen for 'firstprivate' clause in 'for' directive. Adds proper codegen for 'firstprivate' clause in for directive. Initially codegen for 'firstprivate' clause was implemented for 'parallel' directive only. Also this patch emits sync point only after initialization of firstprivate variables, not all private variables. This sync point is not required for privates, lastprivates etc., only for initialization of firstprivate variables. Differential Revision: http://reviews.llvm.org/D8660 llvm-svn: 234978	2015-04-15 04:52:20 +00:00
Alexey Bataev	420d45b2dd	[OPENMP] Fixed codegen for arrays in 'copyprivate' clause. Fixed a bug with codegen of variables with array types specified in 'copyprivate' clause of 'single' directive. Differential Revision: http://reviews.llvm.org/D8914 llvm-svn: 234856	2015-04-14 05:11:24 +00:00
Alexey Bataev	68adb7da1a	[OPENMP] Initial codegen for 'parallel sections' directive. Emits code for outlined 'parallel' directive with the implicitly inlined 'sections' directive: ... call __kmpc_fork_call(..., outlined_function, ...); ... define internal void outlined_function(...) { <code for implicit sections directive>; } Differential Revision: http://reviews.llvm.org/D8997 llvm-svn: 234849	2015-04-14 03:29:22 +00:00
Alexey Bataev	671605e85b	[OPENMP] Initial codegen for 'parallel for' directive. Allows generation of combined 'parallel for' directive that represents 'parallel' region with internal implicit 'for' worksharing region. Differential Revision: http://reviews.llvm.org/D8631 llvm-svn: 234722	2015-04-13 05:28:11 +00:00
Alexey Bataev	794ba0dcb7	[OPENMP] Codegen for 'reduction' clause in 'parallel' directive. Emit a code for reduction clause. Next code should be emitted for reductions: static kmp_critical_name lock = { 0 }; void reduce_func(void lhs[<n>], void rhs[<n>]) { ... (Type<i> )lhs[i] = RedOp<i>((Type<i> )lhs[i], (Type<i> )rhs[i]); ... } ... void RedList[<n>] = {&<RHSExprs>[0], ..., &<RHSExprs>[<n> - 1]}; switch (__kmpc_reduce{_nowait}(<loc>, <gtid>, <n>, sizeof(RedList), RedList, reduce_func, &<lock>)) { case 1: ... <LHSExprs>[i] = RedOp<i>(<LHSExprs>[i], <RHSExprs>[i]); ... __kmpc_end_reduce{_nowait}(<loc>, <gtid>, &<lock>); break; case 2: ... Atomic(<LHSExprs>[i] = RedOp<i>(<LHSExprs>[i], *<RHSExprs>[i])); ... break; default: ; } Reduction variables are a kind of a private variables, they have private copies, but initial values are chosen in accordance with the reduction operation. Differential Revision: http://reviews.llvm.org/D8915 llvm-svn: 234583	2015-04-10 10:43:45 +00:00
Alexey Bataev	6f1ffc069b	[OPENMP] Refactoring of codegen for OpenMP directives. Refactored API of OpenMPRuntime for compatibility with combined directives. Differential Revision: http://reviews.llvm.org/D8859 llvm-svn: 234564	2015-04-10 04:50:10 +00:00
Alexey Bataev	b4505a7229	[OPENMP] Codegen for 'atomic update' construct. Adds atomic update codegen for the following forms of expressions: x binop= expr; x++; ++x; x--; --x; x = x binop expr; x = expr binop x; If x and expr are integer and binop is associative or x is a LHS in a RHS of the assignment expression, and atomics are allowed for type of x on the target platform atomicrmw instruction is emitted. Otherwise compare-and-swap sequence is emitted: bb: ... atomic load <x> cont: <expected> = phi [ <x>, label %bb ], [ <new_failed>, %cont ] <desired> = <expected> binop <expr> <res> = cmpxchg atomic &<x>, desired, expected <new_failed> = <res>.field1; br <res>field2, label %exit, label %cont exit: ... Differential Revision: http://reviews.llvm.org/D8536 llvm-svn: 233513	2015-03-30 05:20:59 +00:00
Alexey Bataev	f268568447	[OPENMP] Improved codegen for implicit/explicit 'barrier' constructs. Replace boolean IsExplicit parameter of OpenMPRuntime::emitBarrierCall() method by OpenMPDirectiveKind Kind for better compatibility with the runtime library. Also add processing of 'nowait' clause on worksharing directives. Differential Revision: http://reviews.llvm.org/D8659 llvm-svn: 233511	2015-03-30 04:30:22 +00:00
Alexey Bataev	a63048e4fd	[OPENMP] Codegen for 'copyprivate' clause ('single' directive). If there is at least one 'copyprivate' clause is associated with the single directive, the following code is generated: ``` i32 did_it = 0; \\ for 'copyprivate' clause if(__kmpc_single(ident_t , gtid)) { SingleOpGen(); __kmpc_end_single(ident_t , gtid); did_it = 1; \\ for 'copyprivate' clause } <copyprivate_list>[0] = &var0; ... <copyprivate_list>[n] = &varn; call __kmpc_copyprivate(ident_t , gtid, <copyprivate_list_size>, <copyprivate_list>, <copy_func>, did_it); ... void<copy_func>(void LHSArg, void RHSArg) { Dst = (void [n])(LHSArg); Src = (void * [n])(RHSArg); Dst[0] = Src[0]; ... Dst[n] = Src[n]; } ``` All list items from all 'copyprivate' clauses are gathered into single <copyprivate list> (<copyprivate_list_size> is a size in bytes of this list) and <copy_func> is used to propagate values of private or threadprivate variables from the 'single' region to other implicit threads from outer 'parallel' region. Differential Revision: http://reviews.llvm.org/D8410 llvm-svn: 232932	2015-03-23 06:18:07 +00:00
Alexander Musman	3276a27b5c	[OPENMP] CodeGen of the 'linear' clause for the 'omp simd' directive. The linear variable is privatized (similar to 'private') and its value on current iteration is calculated, similar to the loop counter variables. Differential revision: http://reviews.llvm.org/D8375 llvm-svn: 232890	2015-03-21 10:12:56 +00:00
Alexander Musman	7931b98735	[OPENMP] Enable codegen of the ‘private’ clause for ‘omp simd’ directive llvm-svn: 232353	2015-03-16 07:14:41 +00:00
Alexander Musman	92bdaabf97	[OPENMP] CodeGen - 'omp for' with dynamic schedule kinds. Differential Revision: http://reviews.llvm.org/D7138 llvm-svn: 232036	2015-03-12 13:37:50 +00:00
Alexey Bataev	2df54a07bf	[OPENMP] Initial codegen for 'omp sections' and 'omp section' directives. If only one section is found in the sections region, it is emitted just like single region. Otherwise it is emitted as a static non-chunked loop. #pragma omp sections { #pragma omp section {1} ... #pragma omp section {n} } is translated to something like i32 <iter_var> i32 <last_iter> = 0 i32 <lower_bound> = 0 i32 <upper_bound> = n-1 i32 <stride> = 1 call void @__kmpc_for_static_init_4(<loc>, i32 <gtid>, i32 34/static non-chunked/, i32* <last_iter>, i32* <lower_bound>, i32* <upper_bound>, i32* <stride>, i32 1/increment always 1/, i32 1/chunk always 1/) <upper_bound> = min(<upper_bound>, n-1) <iter_var> = <lb> check: br <iter_var> <= <upper_bound>, label cont, label exit continue: switch (IV) { case 0: {1}; break; ... case <NumSection> - 1: {n}; break; } ++<iter_var> br label check exit: call void @__kmpc_for_static_fini(<loc>, i32 <gtid>) Differential Revision: http://reviews.llvm.org/D8244 llvm-svn: 232021	2015-03-12 08:53:29 +00:00
Alexey Bataev	10fec57e5a	[OPENMP] Fix for ExprWithCleanups in 'omp atomic' constructs. This patch allows using of ExprWithCleanups expressions and other complex expressions in 'omp atomic' construct Differential Revision: http://reviews.llvm.org/D8200 llvm-svn: 231905	2015-03-11 04:48:56 +00:00
Alexey Bataev	62b63b197d	[OPENMP] Initial codegen for 'omp task' directive. The task region is emmitted in several steps: Emit a call to kmp_task_t __kmpc_omp_task_alloc(ident_t , kmp_int32 gtid, kmp_int32 flags, size_t sizeof_kmp_task_t, size_t sizeof_shareds, kmp_routine_entry_t task_entry). Here task_entry is a pointer to the function: kmp_int32 .omp_task_entry.(kmp_int32 gtid, kmp_task_t tt) { TaskFunction(gtid, tt->part_id, tt->shareds); return 0; } Copy a list of shared variables to field shareds of the resulting structure kmp_task_t returned by the previous call (if any). Copy a pointer to destructions function to field destructions of the resulting structure kmp_task_t. Emit a call to kmp_int32 __kmpc_omp_task(ident_t , kmp_int32 gtid, kmp_task_t new_task), where new_task is a resulting structure from previous items. Differential Revision: http://reviews.llvm.org/D7560 llvm-svn: 231762	2015-03-10 07:28:44 +00:00
Alexey Bataev	36bf011e83	[OPENMP] Improved code for generating debug info + generation of all OpenMP regions in termination scope Patch adds proper generation of debug info for all OpenMP regions. Also, all OpenMP regions are generated in a termination scope, because standard does not allow to throw exceptions out of structured blocks, associated with the OpenMP regions Differential Revision: http://reviews.llvm.org/D7935 llvm-svn: 231757	2015-03-10 05:15:26 +00:00
Rafael Espindola	eb26ddf559	Revert "[OPENMP] Improved code for generating debug info + generation of all OpenMP regions in termination scope Patch adds proper generation of debug info for all OpenMP regions. Also, all OpenMP regions are generated in a termination scope, because standard does not allow to throw exceptions out of structured blocks, associated with the OpenMP regions Differential Revision: http://reviews.llvm.org/D7935 " This reverts commit r231752. It was failing to link with cmake: lib64/libclangCodeGen.a(CGOpenMPRuntime.cpp.o):/home/espindola/llvm/llvm/tools/clang/lib/CodeGen/CGOpenMPRuntime.cpp:function clang::CodeGen::InlinedOpenMPRegionRAII::~InlinedOpenMPRegionRAII(): error: undefined reference to 'clang::CodeGen::EHScopeStack::popTerminate()' clang-3.7: error: linker command failed with exit code 1 (use -v to see invocation) llvm-svn: 231754	2015-03-10 04:40:21 +00:00
Alexey Bataev	7ab2cc178f	[OPENMP] Improved code for generating debug info + generation of all OpenMP regions in termination scope Patch adds proper generation of debug info for all OpenMP regions. Also, all OpenMP regions are generated in a termination scope, because standard does not allow to throw exceptions out of structured blocks, associated with the OpenMP regions Differential Revision: http://reviews.llvm.org/D7935 llvm-svn: 231752	2015-03-10 04:22:11 +00:00
Alexey Bataev	b832926176	[OPENMP] Codegen for "#pragma omp atomic write" For global reg lvalue - use regular store through global register. For simple lvalue - use simple atomic store. For bitfields, vector element, extended vector elements - the original value of the whole storage (for vector elements) or of some aligned value (for bitfields) is atomically read, the part of this value for the given lvalue is modified and then use atomic compare-and-exchange operation to try to atomically write modified value (if it was not modified). Also, changes in this patch fix the bug for '#pragma omp atomic read' applied to extended vector elements. Differential Revision: http://reviews.llvm.org/D7369 llvm-svn: 230736	2015-02-27 06:33:30 +00:00
Alexey Bataev	8cbe0a6b62	[OPENMP] Fixed codegen for directives without function outlining. Fixed crash on codegen for directives like 'omp for', 'omp single' etc. inside of the 'omp parallel', 'omp task' etc. regions. llvm-svn: 230621	2015-02-26 10:27:34 +00:00
Alexey Bataev	3eff5f46d7	[OPENMP] Rename methods of OpenMPRuntime class. NFC. llvm-svn: 230470	2015-02-25 08:32:46 +00:00
David Majnemer	a5b195a1dc	Revert "Revert r229082 for a bit, it caused PR22577." This reverts commit r229123. It was a red herring, the bug was present without r229082. llvm-svn: 229205	2015-02-14 01:35:12 +00:00
Nico Weber	7ce96b853d	Revert r229082 for a bit, it caused PR22577. llvm-svn: 229123	2015-02-13 16:27:00 +00:00
David Majnemer	abc482effc	MS ABI: Implement /volatile:ms The /volatile:ms semantics turn volatile loads and stores into atomic acquire and release operations. This distinction is important because volatile memory operations do not form a happens-before relationship with non-atomic memory. This means that a volatile store is not sufficient for implementing a mutex unlock routine. Differential Revision: http://reviews.llvm.org/D7580 llvm-svn: 229082	2015-02-13 07:55:47 +00:00
Alexey Bataev	6956e2e683	[OPENMP] Initial codegen for 'single' directive. This patch emits the following code for the single directive: #pragma omp single <body> <----> if(__kmpc_single(...)) { <body> __kmpc_end_single(...); } Differential Revision: http://reviews.llvm.org/D7045 llvm-svn: 228275	2015-02-05 06:35:41 +00:00
Alexey Bataev	9f797f32e2	[OPENMP] Codegen for 'taskyield' directive For 'taskyield' directive emit call to kmp_int32 __kmpc_omp_taskyield(ident_t *, kmp_int32 global_tid, int end_part); runtime function call with end_part arg set to 0 (it is ignored). Differential Revision: http://reviews.llvm.org/D7047 llvm-svn: 228272	2015-02-05 05:57:51 +00:00
Adrian Prantl	95b24e9b59	Address review feedback for r228003. - use named constructors - get rid of MarkAsPrologue llvm-svn: 228021	2015-02-03 20:00:54 +00:00
Adrian Prantl	39428e74a0	Merge ArtificialLocation into ApplyDebugLocation and make a clear distinction between the different use-cases. With the previous default behavior we would occasionally emit empty debug locations in situations where they actually were strictly required (= on invoke insns). We now have a choice between defaulting to an empty location or an artificial location. Specifically, this fixes a bug caused by a missing debug location when emitting C++ EH cleanup blocks from within an artificial function, such as an ObjC destroy helper function. rdar://problem/19670595 llvm-svn: 228003	2015-02-03 18:40:42 +00:00
Alexander Musman	df7a8e2bc8	Support ‘omp for’ with static chunked schedule kind. Differential Revision: http://reviews.llvm.org/D7006 llvm-svn: 226795	2015-01-22 08:49:35 +00:00
Alexey Bataev	b57056f483	[OPENMP] CodeGen for "omp atomic read [seq_cst]" directive. "omp atomic read [seq_cst]" accepts expressions "v=x;". In this patch we perform an atomic load of "x" (using builtin atomic loading instructions or a call to "atomic_load()" for simple lvalues and "kmpc_atomic_start();load <x>;kmpc_atomic_end();" for other lvalues), convert the result of loading to type of "v" (using EmitScalarConversion() for simple types and EmitComplexToScalarConversion() for conversions from complex to scalar) and then store the result in "v".) Differential Revision: http://reviews.llvm.org/D6431 llvm-svn: 226788	2015-01-22 06:17:56 +00:00

... 2 3 4 5 6 ...

409 Commits