llvm-project

Commit Graph

Author	SHA1	Message	Date
Alexey Bataev	2f5ed34279	Fix for PR30639: CGDebugInfo Null dereference with OpenMP array access, by Erich Keane OpenMP creates a variable array type with a null size-expr. Debug info generation failed due to this. This patch corrects the OpenMP implementation, updates the tests, and adds a new one for this condition. Differential Revision: https://reviews.llvm.org/D25373 llvm-svn: 284110	2016-10-13 09:52:46 +00:00
Diana Picus	8b44bbc077	Revert "[OpenMP] Sema and parsing for 'teams distribute simd' pragma" This reverts commit r279003 as it breaks some of our buildbots (e.g. clang-cmake-aarch64-quick, clang-x86_64-linux-selfhost-modules). The error is in OpenMP/teams_distribute_simd_ast_print.cpp: clang: /home/buildslave/buildslave/clang-cmake-aarch64-quick/llvm/include/llvm/ADT/DenseMap.h:527: bool llvm::DenseMapBase<DerivedT, KeyT, ValueT, KeyInfoT, BucketT>::LookupBucketFor(const LookupKeyT&, const BucketT&) const [with LookupKeyT = clang::Stmt; DerivedT = llvm::DenseMap<clang::Stmt, long unsigned int>; KeyT = clang::Stmt; ValueT = long unsigned int; KeyInfoT = llvm::DenseMapInfo<clang::Stmt>; BucketT = llvm::detail::DenseMapPair<clang::Stmt, long unsigned int>]: Assertion `!KeyInfoT::isEqual(Val, EmptyKey) && !KeyInfoT::isEqual(Val, TombstoneKey) && "Empty/Tombstone value shouldn't be inserted into map!"' failed. llvm-svn: 279045	2016-08-18 09:25:07 +00:00
Kelvin Li	0e3bde8216	[OpenMP] Sema and parsing for 'teams distribute simd' pragma This patch implements sema and parsing for the 'teams distribute simd' pragma. This patch was originated by Carlo Bertolli. Differential Revision: https://reviews.llvm.org/D23528 llvm-svn: 279003	2016-08-17 23:13:03 +00:00
Kelvin Li	0253287633	[OpenMP] Sema and parsing for 'teams distribute' pragma This patch is to implement sema and parsing for 'teams distribute' pragma. Differential Revision: https://reviews.llvm.org/D23189 llvm-svn: 277818	2016-08-05 14:37:37 +00:00
Samuel Antao	cc10b85789	[OpenMP] Codegen for use_device_ptr clause. Summary: This patch adds support for the use_device_ptr clause. It includes changes in SEMA that could not be tested without codegen, namely, the use of the first private logic and mappable expressions support. Reviewers: hfinkel, carlo.bertolli, arpith-jacob, kkwli0, ABataev Subscribers: caomhin, cfe-commits Differential Revision: https://reviews.llvm.org/D22691 llvm-svn: 276977	2016-07-28 14:23:26 +00:00
Samuel Antao	403ffd409f	[OpenMP] Add support for mapping array sections through pointer references. Summary: This patch fixes a bug in the map of array sections whose base is a reference to a pointer. The existing mapping support was not prepared to deal with it, causing the compiler to crash. Mapping a reference to a pointer enjoys the same characteristics of a regular pointer, i.e., it is passed by value. Therefore, the reference has to be materialized in the target region. Reviewers: hfinkel, carlo.bertolli, kkwli0, ABataev Subscribers: caomhin, cfe-commits Differential Revision: https://reviews.llvm.org/D22690 llvm-svn: 276933	2016-07-27 22:49:49 +00:00
Kelvin Li	986330c190	[OpenMP] Sema and parsing for 'target simd' pragma This patch is to implement sema and parsing for 'target simd' pragma. Differential Revision: https://reviews.llvm.org/D22479 llvm-svn: 276203	2016-07-20 22:57:10 +00:00
Alexey Bataev	5140e748b5	[OPENMP] Improved processing of 'priority' clause, NFC. Removed some old comments + improved handling of 'priority' clause value during codegen after comments from Richard Smith. llvm-svn: 275945	2016-07-19 04:21:09 +00:00
Kelvin Li	a579b9196c	[OpenMP] Sema and parsing for 'target parallel for simd' pragma This patch is to implement sema and parsing for 'target parallel for simd' pragma. Differential Revision: http://reviews.llvm.org/D22096 llvm-svn: 275365	2016-07-14 02:54:56 +00:00
Carlo Bertolli	70594e9282	[OpenMP] Initial implementation of parse+sema for OpenMP clause 'is_device_ptr' of target http://reviews.llvm.org/D22070 llvm-svn: 275282	2016-07-13 17:16:49 +00:00
Carlo Bertolli	2404b17192	[OpenMP] Initial implementation of parse+sema for clause use_device_ptr of 'target data' http://reviews.llvm.org/D21904 This patch is similar to the implementation of 'private' clause: it adds a list of private pointers to be used within the target data region to store the device pointers returned by the runtime. Please refer to the following document for a full description of what the runtime will return in this case (pages 10 and 11): https://github.com/clang-omp/OffloadingDesign I am happy to answer any question related to the runtime interface to help reviewing this patch. llvm-svn: 275271	2016-07-13 15:37:16 +00:00
Kelvin Li	787f3fcc6b	[OpenMP] Sema and parsing for 'distribute simd' pragma Summary: This patch is an implementation of sema and parsing for the OpenMP composite pragma 'distribute simd'. Differential Revision: http://reviews.llvm.org/D22007 llvm-svn: 274604	2016-07-06 04:45:38 +00:00
Kelvin Li	4a39add05e	[OpenMP] Sema and parse for 'distribute parallel for simd' Summary: This patch is an implementation of sema and parsing for the OpenMP composite pragma 'distribute parallel for simd'. Differential Revision: http://reviews.llvm.org/D21977 llvm-svn: 274530	2016-07-05 05:00:15 +00:00
Carlo Bertolli	9925f15661	Resubmission of http://reviews.llvm.org/D21564 after fixes. [OpenMP] Initial implementation of parse and sema for composite pragma 'distribute parallel for' This patch is an initial implementation for #distribute parallel for. The main differences that affect other pragmas are: The implementation of 'distribute parallel for' requires blocking of the associated loop, where blocks are "distributed" to different teams and iterations within each block are scheduled to parallel threads within each team. To implement blocking, sema creates two additional worksharing directive fields that are used to pass the team assigned block lower and upper bounds through the outlined function resulting from 'parallel'. In this way, scheduling for 'for' to threads can use those bounds. As a consequence of blocking, the stride of 'distribute' is not 1 but it is equal to the blocking size. This is returned by the runtime and sema prepares a DistIncrExpr variable to hold that value. As a consequence of blocking, the global upper bound (EnsureUpperBound) expression of the 'for' is not the original loop upper bound (e.g. in for(i = 0 ; i < N; i++) this is 'N') but it is the team-assigned block upper bound. Sema creates a new expression holding the calculation of the actual upper bound for 'for' as UB = min(UB, PrevUB), where UB is the loop upper bound, and PrevUB is the team-assigned block upper bound. llvm-svn: 273884	2016-06-27 14:55:37 +00:00
Carlo Bertolli	b8503d5399	Revert r273705 [OpenMP] Initial implementation of parse and sema for composite pragma 'distribute parallel for' llvm-svn: 273709	2016-06-24 19:20:02 +00:00
Carlo Bertolli	e77d6e0e4d	[OpenMP] Initial implementation of parse and sema for composite pragma 'distribute parallel for' http://reviews.llvm.org/D21564 This patch is an initial implementation for #distribute parallel for. The main differences that affect other pragmas are: The implementation of 'distribute parallel for' requires blocking of the associated loop, where blocks are "distributed" to different teams and iterations within each block are scheduled to parallel threads within each team. To implement blocking, sema creates two additional worksharing directive fields that are used to pass the team assigned block lower and upper bounds through the outlined function resulting from 'parallel'. In this way, scheduling for 'for' to threads can use those bounds. As a consequence of blocking, the stride of 'distribute' is not 1 but it is equal to the blocking size. This is returned by the runtime and sema prepares a DistIncrExpr variable to hold that value. As a consequence of blocking, the global upper bound (EnsureUpperBound) expression of the 'for' is not the original loop upper bound (e.g. in for(i = 0 ; i < N; i++) this is 'N') but it is the team-assigned block upper bound. Sema creates a new expression holding the calculation of the actual upper bound for 'for' as UB = min(UB, PrevUB), where UB is the loop upper bound, and PrevUB is the team-assigned block upper bound. llvm-svn: 273705	2016-06-24 18:53:35 +00:00
Samuel Antao	6d0042642a	Re-apply r272900 - [OpenMP] Cast captures by copy when passed to fork call so that they are compatible to what the runtime library expects. An issue in one of the regression tests was fixed for 32-bit hosts. llvm-svn: 272931	2016-06-16 18:39:34 +00:00
Samuel Antao	b1f9501242	Revert r272900 - [OpenMP] Cast captures by copy when passed to fork call so that they are compatible to what the runtime library expects. Was causing trouble in one of the regression tests for a 32-bit address space. llvm-svn: 272908	2016-06-16 16:06:22 +00:00
Samuel Antao	4951617980	[OpenMP] Cast captures by copy when passed to fork call so that they are compatible to what the runtime library expects. Summary: This patch fixes an issue detected when firstprivate variables are passed to an OpenMP outlined function vararg list. Currently they are not compatible with what the runtime library expects causing malfunction in some targets. This patch fixes the issue by moving the casting logic already in place for offloading to the common code that creates the outline function and arguments and updates the regression tests accordingly. Reviewers: hfinkel, arpith-jacob, carlo.bertolli, kkwli0, ABataev Subscribers: cfe-commits, caomhin Differential Revision: http://reviews.llvm.org/D21150 llvm-svn: 272900	2016-06-16 15:09:31 +00:00
Alexey Bataev	ad537bb8a0	[OPENMP 4.5] Fixed codegen for 'priority' and destructors in task-based directives. 'kmp_task_t' record type added a new field for 'priority' clause and changed the representation of pointer to destructors for privates used within loop-based directives. Old representation: typedef struct kmp_task { /* GEH: Shouldn't this be aligned somehow? */ void *shareds; /**< pointer to block of pointers to shared vars */ kmp_routine_entry_t routine; /**< pointer to routine to call for executing task */ kmp_int32 part_id; /**< part id for the task */ kmp_routine_entry_t destructors; /* pointer to function to invoke deconstructors of firstprivate C++ objects */ /* private vars */ } kmp_task_t; New representation: typedef struct kmp_task { /* GEH: Shouldn't this be aligned somehow? */ void *shareds; /**< pointer to block of pointers to shared vars */ kmp_routine_entry_t routine; /**< pointer to routine to call for executing task */ kmp_int32 part_id; /**< part id for the task */ kmp_cmplrdata_t data1; /* Two known optional additions: destructors and priority */ kmp_cmplrdata_t data2; /* Process destructors first, priority second */ /* future data */ /* private vars */ } kmp_task_t; Also excessive initialization of 'destructors' fields to 'null' was removed from codegen if it is known that no destructors shall be used. Currently a special bit is used in 'kmp_tasking_flags_t' bitfields ('destructors_thunk' bitfield). llvm-svn: 271201	2016-05-30 09:06:50 +00:00
Samuel Antao	8d2d730f2a	[OpenMP] Codegen for target update directive. Summary: This patch implements the code generation for the `target update` directive. The implementation relies on the logic already in place for target data standalone directives, i.e. target enter/exit data. Reviewers: hfinkel, carlo.bertolli, arpith-jacob, kkwli0, ABataev Subscribers: caomhin, cfe-commits Differential Revision: http://reviews.llvm.org/D20650 llvm-svn: 270886	2016-05-26 18:30:22 +00:00
Samuel Antao	ec172c6da0	[OpenMP] Parsing and sema support for the from clause Summary: The patch contains the parsing and sema support for the `from` clause. Patch based on the original post by Kelvin Li. Reviewers: hfinkel, carlo.bertolli, kkwli0, arpith-jacob, ABataev Subscribers: caomhin, cfe-commits Differential Revision: http://reviews.llvm.org/D18488 llvm-svn: 270882	2016-05-26 17:49:04 +00:00
Samuel Antao	661c0904e1	[OpenMP] Parsing and sema support for the to clause Summary: The patch contains the parsing and sema support for the `to` clause. Patch based on the original post by Kelvin Li. Reviewers: carlo.bertolli, hfinkel, kkwli0, arpith-jacob, ABataev Subscribers: caomhin, cfe-commits Differential Revision: http://reviews.llvm.org/D18597 llvm-svn: 270880	2016-05-26 17:39:58 +00:00
Samuel Antao	686c70c3dc	[OpenMP] Parsing and sema support for target update directive Summary: This patch is to add parsing and sema support for `target update` directive. Support for the `to` and `from` clauses will be added by a different patch. This patch also adds support for other clauses that are already implemented upstream and apply to `target update`, e.g. `device` and `if`. This patch is based on the original post by Kelvin Li. Reviewers: hfinkel, carlo.bertolli, kkwli0, arpith-jacob, ABataev Subscribers: caomhin, cfe-commits Differential Revision: http://reviews.llvm.org/D15944 llvm-svn: 270878	2016-05-26 17:30:50 +00:00
Hal Finkel	c07e19b2c1	Add a loop's debug location to its llvm.loop metadata Getting accurate locations for loops is important, because those locations are used by the frontend to generate optimization remarks. Currently, optimization remarks for loops often appear on the wrong line, often the first line of the loop body instead of the loop itself. This is confusing because that line might itself be another loop, or might be somewhere else completely if the body was an inlined function call. This happens because of the way we find the loop's starting location. First, we look for a preheader, and if we find one, and its terminator has a debug location, then we use that. Otherwise, we look for a location on an instruction in the loop header. The fallback heuristic is not bad, but will almost always find the beginning of the body, and not the loop statement itself. The preheader location search often fails because there's often not a preheader, and even when there is a preheader, depending on how it was formed, it sometimes carries the location of some preceding code. I don't see any good theoretical way to fix this problem. On the other hand, this seems like a straightforward solution: Put the debug location in the loop's llvm.loop metadata. When emitting debug information, this commit causes us to add the debug location as an operand to each loop's llvm.loop metadata. Thus, we now generate this metadata for all loops (not just loops with optimization hints) when we're otherwise generating debug information. The remark test case changes depend on the companion LLVM commit r270771. llvm-svn: 270772	2016-05-25 21:53:24 +00:00
Alexey Bataev	8b42706a6e	[OPENMP 4.5] Codegen for doacross loop synchronization constructs. OpenMP 4.5 adds support for doacross loop synchronization. Patch implements codegen for this construct. llvm-svn: 270690	2016-05-25 12:36:08 +00:00
Alexey Bataev	9afe57541e	[OPENMP] Fixed codegen for firstprivate vars in standalone worksharing directives. If a firstprivate variable is captured by value in an outlined region and then used as a firstprivate variable in an inner worksharing directive, the copy for this firstprivate variable was not created. Fixed this bug. llvm-svn: 270536	2016-05-24 07:40:12 +00:00
Alexey Bataev	7ace49dff1	[OPENMP] Pass scalar firstprivate vars by value. For better performance and to unify code with offloading part we pass scalar firstprivate values by value, instead of by reference. It will remove some extra copying operations. llvm-svn: 269751	2016-05-17 08:55:33 +00:00
Alexey Bataev	1e1e286a6b	[OPENMP 4.5] Initial codegen for 'priority' clause in task-based directives. OpenMP 4.5 supports clause 'priority' in task-based directives. Patch adds initial codegen support for this clause in codegen. llvm-svn: 269050	2016-05-10 12:21:02 +00:00
Alexey Bataev	9ebd742748	[OPENMP 4.5] Add codegen support in runtime for '[non]monotonic' schedule modifiers. Runtime library expects some additional data in schedule argument for loop-based directives, that have additional schedule modifiers 'monotonic|nonmonotonic'. llvm-svn: 269035	2016-05-10 09:57:36 +00:00
Alexey Bataev	f93095a003	[OPENMP 4.5] Codegen for 'lastprivate' clauses in 'taskloop' directives. OpenMP 4.5 adds taskloop/taskloop simd directives. These directives allow to use lastprivate clause. Patch adds codegen for this clause. llvm-svn: 268618	2016-05-05 08:46:22 +00:00
Alexey Bataev	1e73ef3882	[OPENMP 4.5] Initial codegen for 'taskloop simd' directive. OpenMP 4.5 defines 'taskloop simd' directive, which is combined directive for 'taskloop' and 'simd' directives. Patch adds initial codegen support for this directive and its 2 basic clauses 'safelen' and 'simdlen'. llvm-svn: 267872	2016-04-28 12:14:51 +00:00
Alexey Bataev	24b5baed27	[OPENMP] Simplified interface for codegen of tasks, NFC. Reduced number of arguments in member functions of runtime support library for task-based directives. llvm-svn: 267863	2016-04-28 09:23:51 +00:00
Alexey Bataev	2b19a6fe53	[OPENMP 4.5] Codegen for 'grainsize/num_tasks' clauses of 'taskloop' directive. OpenMP 4.5 defines 'taskloop' directive and 2 additional clauses 'grainsize' and 'num_tasks' for this directive. Patch adds codegen for these clauses. These clauses are generated as arguments of the '__kmpc_taskloop' libcall and are encoded the following way: void __kmpc_taskloop(ident_t *loc, int gtid, kmp_task_t *task, int if_val, kmp_uint64 *lb, kmp_uint64 *ub, kmp_int64 st, int nogroup, int sched, kmp_uint64 grainsize, void *task_dup); If 'grainsize' is specified, 'sched' argument must be set to '1' and 'grainsize' argument must be set to the value of the 'grainsize' clause. If 'num_tasks' is specified, 'sched' argument must be set to '2' and 'grainsize' argument must be set to the value of the 'num_tasks' clause. It is possible because these 2 clauses are mutually exclusive and can't be used at the same time on the same directive. If none of these clauses is specified, 'sched' argument must be set to '0'. llvm-svn: 267862	2016-04-28 09:15:06 +00:00
Samuel Antao	8dd6628743	[OpenMP] Code generation for target exit data directive Summary: This patch adds support for the target exit data directive code generation. Given that, apart from the employed runtime call, target exit data requires the same code generation pattern as target enter data, the OpenMP codegen entry point was renamed and reused for both. Reviewers: hfinkel, carlo.bertolli, arpith-jacob, kkwli0, ABataev Subscribers: cfe-commits, fraggamuffin, caomhin Differential Revision: http://reviews.llvm.org/D17369 llvm-svn: 267814	2016-04-27 23:14:30 +00:00
Samuel Antao	bd0ae2e14c	[OpenMP] Code generation for target enter data directive Summary: This patch adds support for the target enter data directive code generation. Reviewers: hfinkel, carlo.bertolli, arpith-jacob, kkwli0, ABataev Subscribers: cfe-commits, fraggamuffin, caomhin Differential Revision: http://reviews.llvm.org/D17368 llvm-svn: 267812	2016-04-27 23:07:29 +00:00
Samuel Antao	df158d5567	[OpenMP] Code generation for target data directive Summary: This patch adds support for the target data directive code generation. Part of the already existent functionality related with data maps is moved to a new function so that it could be reused. Reviewers: hfinkel, carlo.bertolli, arpith-jacob, kkwli0, ABataev Subscribers: cfe-commits, fraggamuffin, caomhin Differential Revision: http://reviews.llvm.org/D17367 llvm-svn: 267811	2016-04-27 22:58:19 +00:00
Alexey Bataev	8fbae8cf09	[OPENMP] Fix crash on initialization of classes with no init clause in declare reductions. If reduction clause is applied to instance of class with user-defined reduction operation without initialization clause, it may cause a crash. Patch fixes this issue. llvm-svn: 267695	2016-04-27 11:38:05 +00:00
Alexey Bataev	4ba78a46ff	[OPENMP] Fix for codegen of captured variables in inlined directives. Currently there is a problem with codegen of inlined directives inside lambdas, it may cause a crash during codegen because of incorrect capturing of variables. Patch fixes this problem. llvm-svn: 267677	2016-04-27 07:56:03 +00:00
Alexey Bataev	7292c29bb5	[OPENMP 4.5] Codegen for 'taskloop' directive. The taskloop construct specifies that the iterations of one or more associated loops will be executed in parallel using OpenMP tasks. The iterations are distributed across tasks created by the construct and scheduled to be executed. The following code will be generated for the taskloop directive: #pragma omp taskloop num_tasks(N) lastprivate(j) for( i=0; i<N*GRAIN*STRIDE-1; i+=STRIDE ) { int th = omp_get_thread_num(); #pragma omp atomic counter++; #pragma omp atomic th_counter[th]++; j = i; } Generated code: task = __kmpc_omp_task_alloc(NULL,gtid,1,sizeof(struct task),sizeof(struct shar),&task_entry); psh = task->shareds; psh->pth_counter = &th_counter; psh->pcounter = &counter; psh->pj = &j; task->lb = 0; task->ub = N*GRAIN*STRIDE-2; task->st = STRIDE; __kmpc_taskloop( NULL, // location gtid, // gtid task, // task structure 1, // if clause value &task->lb, // lower bound &task->ub, // upper bound STRIDE, // loop increment 0, // 1 if nogroup specified 2, // schedule type: 0-none, 1-grainsize, 2-num_tasks N, // schedule value (ignored for type 0) (void*)&__task_dup_entry // tasks duplication routine ); llvm-svn: 267395	2016-04-25 12:22:29 +00:00
Alexey Bataev	feddd64bff	[OPENMP] Fix for PR27463: Privatizing struct fields with array type causes code generation failure. The codegen part of firstprivate clause for member decls used type of original variable without skipping reference type from OMPCapturedExprDecl. Patch fixes this problem. llvm-svn: 267125	2016-04-22 09:05:03 +00:00
Alexey Bataev	5dff95c04d	[OPENMP] Fix for LCV in simd directives in explicit clauses. If loop control variable for simd-based directives is explicitly marked as linear/lastprivate in clauses, codegen for such construct would crash. Patch fixes this problem. llvm-svn: 267101	2016-04-22 03:56:56 +00:00
Alexey Bataev	48591dd98c	[OPENMP] Codegen for untied tasks. If the untied clause is present on a task construct, any thread in the team can resume the task region after a suspension. Patch adds proper codegen for untied tasks. llvm-svn: 266853	2016-04-20 04:01:36 +00:00
Alexey Bataev	995e861ba6	Revert "[OPENMP] Codegen for untied tasks." This reverts commit r266754. llvm-svn: 266755	2016-04-19 16:36:01 +00:00
Alexey Bataev	823acfacdf	[OPENMP] Codegen for untied tasks. If the untied clause is present on a task construct, any thread in the team can resume the task region after a suspension. Patch adds proper codegen for untied tasks. llvm-svn: 266754	2016-04-19 16:27:55 +00:00
Alexey Bataev	bec9572213	Revert "[OPENMP] Codegen for untied tasks." This reverts commit 266722. llvm-svn: 266724	2016-04-19 09:27:38 +00:00
Alexey Bataev	26b2577f6b	[OPENMP] Codegen for untied tasks. If the untied clause is present on a task construct, any thread in the team can resume the task region after a suspension. Patch adds proper codegen for untied tasks. llvm-svn: 266722	2016-04-19 09:10:27 +00:00
Alexey Bataev	e48a5fc56d	[OPENMP 4.0] Support for 'uniform' clause in 'declare simd' directive. OpenMP 4.0 defines clause 'uniform' in 'declare simd' directive: 'uniform' '(' <argument-list> ')' The uniform clause declares one or more arguments to have an invariant value for all concurrent invocations of the function in the execution of a single SIMD loop. The special this pointer can be used as if it were one of the arguments to the function in any of the linear, aligned, or uniform clauses. llvm-svn: 266041	2016-04-12 05:28:34 +00:00
JF Bastien	92f4ef1017	NFC: make AtomicOrdering an enum class Summary: See LLVM change D18775 for details, this change depends on it. Reviewers: jyknight, reames Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D18776 llvm-svn: 265569	2016-04-06 17:26:42 +00:00
Carlo Bertolli	c687225b43	[OPENMP] Codegen for teams directive for NVPTX This patch implements the teams directive for the NVPTX backend. It is different from the host code generation path as it: Does not call kmpc_fork_teams. All necessary teams and threads are started upon touching the target region, when launching a CUDA kernel, and their execution is coordinated through sequential and parallel regions within the target region. Does not call kmpc_push_num_teams even if a num_teams or thread_limit clause is present. Setting the number of teams and the thread limit is implemented by the nvptx-related runtime. Please note that I am now passing a Clang Expr * to emitPushNumTeams instead of the originally chosen llvm::Value * type. The reason for that is that I want to avoid emitting expressions for num_teams and thread_limit if they are not needed in the target region. http://reviews.llvm.org/D17963 llvm-svn: 265304	2016-04-04 15:55:02 +00:00
Alexey Bataev	5a3af13d93	[OPENMP] Remove extra code transformation. For better support of some specific GNU extensions some extra transformation of AST nodes were introduced. These transformations are very hard to handle. The code is improved in handling of these extensions by using captured expressions construct. llvm-svn: 264709	2016-03-29 08:58:54 +00:00
Alexey Bataev	14fa1c6b60	[OPENMP] Allow runtime insert its own code inside OpenMP regions. Solution unifies interface of RegionCodeGenTy type to allow insert runtime-specific code before/after main codegen action defined in CGStmtOpenMP.cpp file. Runtime should not define its own RegionCodeGenTy for general OpenMP directives, but must be allowed to insert its own (required) code to support target specific codegen. llvm-svn: 264700	2016-03-29 05:34:15 +00:00
Alexey Bataev	f539faa733	Revert "[OPENMP] Allow runtime insert its own code inside OpenMP regions." Reverting because of failed tests. llvm-svn: 264577	2016-03-28 12:58:34 +00:00
Alexey Bataev	424be92831	[OPENMP] Allow runtime insert its own code inside OpenMP regions. Solution unifies interface of RegionCodeGenTy type to allow insert runtime-specific code before/after main codegen action defined in CGStmtOpenMP.cpp file. Runtime should not define its own RegionCodeGenTy for general OpenMP directives, but must be allowed to insert its own (required) code to support target specific codegen. llvm-svn: 264576	2016-03-28 12:52:58 +00:00
Alexey Bataev	f662b5943c	Revert "[OPENMP] Allow runtime insert its own code inside OpenMP regions." This reverts commit 3ee791165100607178073f14531a0dc90c622b36. llvm-svn: 264570	2016-03-28 10:12:03 +00:00
Alexey Bataev	b8c425c4f7	[OPENMP] Allow runtime insert its own code inside OpenMP regions. Solution unifies interface of RegionCodeGenTy type to allow insert runtime-specific code before/after main codegen action defined in CGStmtOpenMP.cpp file. Runtime should not define its own RegionCodeGenTy for general OpenMP directives, but must be allowed to insert its own (required) code to support target specific codegen. llvm-svn: 264569	2016-03-28 09:53:43 +00:00
Alexey Bataev	a839dddf92	[OPENMP 4.0] Use 'declare reduction' constructs in 'reduction' clauses. OpenMP 4.0 allows to define custom reduction operations using '#pragma omp declare reduction' construct. Patch allows to use this custom defined reduction operations in 'reduction' clauses. llvm-svn: 263701	2016-03-17 10:19:46 +00:00
John McCall	c56a8b3284	Preserve ExtParameterInfos into CGFunctionInfo. As part of this, make the function-arrangement interfaces a little simpler and more semantic. NFC. llvm-svn: 263191	2016-03-11 04:30:31 +00:00
Alexey Bataev	ef549a8955	[OPENMP 4.5] Codegen for data members in 'linear' clause OpenMP 4.5 allows privatization of non-static data members in OpenMP constructs. Patch adds proper codegen support for data members in 'linear' clause llvm-svn: 263003	2016-03-09 09:49:09 +00:00
Alexey Bataev	78849fb464	[OPENMP 4.5] Codegen for data members in 'linear' clause. OpenMP 4.5 allows to use data members in private clauses. Patch adds codegen support for 'linear' clause. llvm-svn: 263002	2016-03-09 09:49:00 +00:00
Carlo Bertolli	0ff587d4a4	[OPENMP] Codegen for distribute directive: fix bug in ordering of parameters. llvm-svn: 262833	2016-03-07 16:19:13 +00:00
Carlo Bertolli	fc35ad2bbc	Reapply r262741 [OPENMP] Codegen for distribute directive This patch provide basic implementation of codegen for teams directive, excluding all clauses except dist_schedule. It also fixes parts of AST reader/writer to enable correct pre-compiled header handling. http://reviews.llvm.org/D17170 llvm-svn: 262832	2016-03-07 16:04:49 +00:00
Samuel Antao	bf4d18d3d2	Revert r262741 - [OPENMP] Codegen for distribute directive Was causing a failure in one of the buildbot slaves. llvm-svn: 262744	2016-03-04 21:02:14 +00:00
Carlo Bertolli	4a56e3831d	[OPENMP] Codegen for distribute directive This patch provide basic implementation of codegen for teams directive, excluding all clauses except dist_schedule. It also fixes parts of AST reader/writer to enable correct pre-compiled header handling. http://reviews.llvm.org/D17170 llvm-svn: 262741	2016-03-04 20:24:58 +00:00
Carlo Bertolli	6ad7b5aff2	[OPENMP] firstprivate and private clauses of teams, host codegeneration Add code generation support for firstprivate and private clauses of teams on the host. Add extensive regression tests including lambda functions and vla testing. http://reviews.llvm.org/D17582 llvm-svn: 262663	2016-03-03 22:09:40 +00:00
Carlo Bertolli	430d8ecc55	Add code generation for teams directive inside target region llvm-svn: 262652	2016-03-03 20:34:23 +00:00
Samuel Antao	b68e2db8f9	[OpenMP] Code generation for teams - kernel launching Summary: This patch implements the launching of a target region in the presence of a nested teams region, i.e calls tgt_target_teams with the required arguments gathered from the enclosed teams directive. The actual codegen of the region enclosed by the teams construct will be contributed in a separate patch. Reviewers: hfinkel, arpith-jacob, kkwli0, carlo.bertolli, ABataev Subscribers: cfe-commits, caomhin, fraggamuffin Differential Revision: http://reviews.llvm.org/D17019 llvm-svn: 262625	2016-03-03 16:20:23 +00:00
Alexey Bataev	2bbf7217ea	[OPENMP 4.5] Initial support for data members in 'linear' clause. OpenMP 4.5 allows to privatize data members of current class in member functions. Patch adds initial support for privatization of data members in 'linear' clause, no codegen support. llvm-svn: 262578	2016-03-03 03:52:24 +00:00
Alexey Bataev	61205070c4	[OPENMP 4.5] Codegen for data members in 'reduction' clause. OpenMP 4.5 allows to privatize non-static data members of current class in non-static member functions. Patch supports codegen for non-static data members in 'reduction' clauses. llvm-svn: 262460	2016-03-02 04:57:40 +00:00
Alexey Bataev	005248ac8a	[OPENMP 4.5] Codegen for member decls in 'lastprivate' clause. OpenMP 4.5 allows to privatize non-static member decls in non-static member functions. Patch captures such decls by reference in general (for bitfields, by value) and then operates with this capture. For bitfields, at the end of codegen for lastprivates original bitfield is updated with the value of captured copy. llvm-svn: 261824	2016-02-25 05:25:57 +00:00
Richard Trieu	cc3949d99a	Remove use of builtin comma operator. Cleanup for upcoming Clang warning -Wcomma. No functionality change intended. llvm-svn: 261271	2016-02-18 22:34:54 +00:00
Alexey Bataev	8ffcc949b1	[OPENMP] Fix codegen for lastprivate loop counters. Patch fixes bug with codegen for lastprivate loop counters. Also it may improve performance for lastprivates calculations in some cases. llvm-svn: 261209	2016-02-18 13:48:15 +00:00
Alexey Bataev	417089fc7e	[OPENMP 4.5] Codegen support for data members in 'firstprivate' clause. Added codegen for captured data members in non-static member functions. llvm-svn: 261089	2016-02-17 13:19:37 +00:00
Alexey Bataev	3392d76081	[OPENMP] Improved handling of pseudo-captured expressions in OpenMP. Expressions inside 'schedule'|'dist_schedule' clause must be captured in combined directives to avoid possible crash during codegen. Patch improves handling of such constructs. llvm-svn: 260954	2016-02-16 11:18:12 +00:00
Alexey Bataev	cd8b6a2cf1	[OPENMP] Remove extra sync barriers for 'firstprivate' clause. Sync barrier will be emitted after generation of firstprivate variables only if one of the firstprivate vars is used in lastprivate clause. llvm-svn: 260877	2016-02-15 08:07:17 +00:00
Alexey Bataev	4244be25bd	[OPENMP] Rename OMPCapturedFieldDecl to OMPCapturedExprDecl, NFC. OMPCapturedExprDecl allows capturing not only of field decls, but also other expressions. It also allows to simplify codegen for several clauses. llvm-svn: 260492	2016-02-11 05:35:55 +00:00
Alexey Bataev	31300ed0a5	[OPENMP 4.0] Fixed support of array sections/array subscripts. Codegen for array sections/array subscripts worked only for expressions with arrays as base. Patch fixes codegen for bases with pointer/reference types. llvm-svn: 259776	2016-02-04 11:27:03 +00:00
Arpith Chacko Jacob	05bebb578a	[OpenMP] Parsing + sema for target parallel for directive. Summary: This patch adds parsing + sema for the target parallel for directive along with testcases. Reviewers: ABataev Differential Revision: http://reviews.llvm.org/D16759 llvm-svn: 259654	2016-02-03 15:46:42 +00:00
Arpith Chacko Jacob	e955b3d3fe	[OpenMP] Parsing + sema for target parallel directive. Summary: This patch adds parsing + sema for the target parallel directive and its clauses along with testcases. Reviewers: ABataev Differential Revision: http://reviews.llvm.org/D16553 Rebased to current trunk and updated test cases. llvm-svn: 258832	2016-01-26 18:48:41 +00:00
Arpith Chacko Jacob	3cf89040b0	[OpenMP] Parsing + sema for defaultmap clause. Summary: This patch adds parsing + sema for the defaultmap clause associated with the target directive (among others). Reviewers: ABataev Differential Revision: http://reviews.llvm.org/D16527 llvm-svn: 258817	2016-01-26 16:37:23 +00:00
Alexey Bataev	1189bd0205	[OPENMP 4.5] Allow arrays in 'reduction' clause. OpenMP 4.5, along with array sections, allows to use variables of array type in reductions. llvm-svn: 258804	2016-01-26 12:20:39 +00:00
Alexey Bataev	3015bcc62a	[OPENMP] Generalize codegen for 'sections'-based directive. If 'sections' directive has only one sub-section, the code for 'single'-based directive was emitted. Removed this codegen, because it causes crashes in different cases. llvm-svn: 258495	2016-01-22 08:56:50 +00:00
Alexey Bataev	8524d15954	[OPENMP] Fix crash on reduction for complex variables. Reworked codegen for reduction operation for complex types to avoid crash. llvm-svn: 258394	2016-01-21 12:35:58 +00:00
Alexey Bataev	9619f04c0e	[OPENMP 4.0] Fix for codegen of 'cancel' directive within 'sections' directive. Allow to emit code for 'cancel' directive within 'sections' directive with single sub-section. llvm-svn: 258307	2016-01-20 12:29:47 +00:00
Samuel Antao	7259076032	[OpenMP] Parsing + sema for "target exit data" directive. Patch by Arpith Jacob. Thanks! llvm-svn: 258177	2016-01-19 20:04:50 +00:00
Samuel Antao	df67fc468e	[OpenMP] Parsing + sema for "target enter data" directive. Patch by Arpith Jacob. Thanks! llvm-svn: 258165	2016-01-19 19:15:56 +00:00
Carlo Bertolli	b4adf55e0f	Add OpenMP dist_schedule clause to distribute directive and related regression tests. llvm-svn: 257917	2016-01-15 18:50:31 +00:00
Samuel Antao	ee8fb302f5	[OpenMP] Reapply rL256842: [OpenMP] Offloading descriptor registration and device codegen. This patch attempts to fix the regressions identified when the patch was committed initially. Thanks to Michael Liao for identifying the fix in the offloading metadata generation related with side effects in evaluation of function arguments. llvm-svn: 256933	2016-01-06 13:42:12 +00:00
Samuel Antao	7d5de9a1ee	[OpenMP] Revert rL256842: [OpenMP] Offloading descriptor registration and device codegen. It was causing two regressions, so I'm reverting until the cause is found. llvm-svn: 256858	2016-01-05 19:16:13 +00:00
Samuel Antao	4d5f0bbea1	[OpenMP] Offloading descriptor registration and device codegen. Summary: In order for offloading to work properly two things need to be in place: - a descriptor with all the offloading information (device entry functions, and global variables) has to be created by the host and registered in the OpenMP offloading runtime library. - all the device functions need to be emitted for the device and a convention has to be in place so that the runtime library can easily map the host ID of an entry point with the actual function in the device. This patch adds support for these two things. However, only entry functions are being registered given that 'declare target' directive is not yet implemented. About offloading descriptor: The details of the descriptor are explained in more detail in http://goo.gl/L1rnKJ. Basically the descriptor will have fields that specify the number of devices, the pointers to where the device images begin and end (that will be defined by the linker), and also pointers to the begin and end of a table whose entries contain information about a specific entry point. Each entry has the type: ``` struct __tgt_offload_entry{ void *addr; char *name; int64_t size; }; ``` and will be implemented in a pre determined (ELF) section `.omp_offloading.entries` with 1-byte alignment, so that when all the objects are linked, the table is in that section with no padding in between entries (will be like a C array). The code generation ensures that all `__tgt_offload_entry` entries are emitted in the same order for both host and device so that the runtime can have the corresponding entries in both host and device in same index of the table, and efficiently implement the mapping. The resulting descriptor is registered/unregistered with the runtime library using the calls `__tgt_register_lib` and `__tgt_unregister_lib`. 
The registration is implemented in a high priority global initializer so that the registration happens always before any initializer (that can potentially include target regions) is run. The driver flag -omptargets= was created to specify a comma-separated list of devices the user wants to support so that the new functionality can be exercised. Each device is specified with its triple. About target codegen: The target codegen is pretty much straightforward as it reuses completely the logic of the host version for the same target region. The tricky part is to identify the meaningful target regions in the device side. Unlike other programming models, like CUDA, there are no already outlined functions with attributes that mark what should be emitted or not. So, the information on what to emit is passed in the form of metadata in host bc file. This requires a new option to pass the host bc to the device frontend. Then everything is similar to what happens in CUDA: the global declarations emission is intercepted to check to see if it is an "interesting" declaration. The difference is that instead of checking an attribute, the metadata information is checked. Right now, there is only a form of metadata to pass information about the device entry points (target regions). A class `OffloadEntriesInfoManagerTy` was created to manage all the information and queries related with the metadata. 
The metadata looks like this: ``` !omp_offload.info = !{!0, !1, !2, !3, !4, !5, !6} !0 = !{i32 0, i32 52, i32 77426347, !"_ZN2S12r1Ei", i32 479, i32 13, i32 4} !1 = !{i32 0, i32 52, i32 77426347, !"_ZL7fstatici", i32 461, i32 11, i32 5} !2 = !{i32 0, i32 52, i32 77426347, !"_Z9ftemplateIiET_i", i32 444, i32 11, i32 6} !3 = !{i32 0, i32 52, i32 77426347, !"_Z3fooi", i32 99, i32 11, i32 0} !4 = !{i32 0, i32 52, i32 77426347, !"_Z3fooi", i32 272, i32 11, i32 3} !5 = !{i32 0, i32 52, i32 77426347, !"_Z3fooi", i32 127, i32 11, i32 1} !6 = !{i32 0, i32 52, i32 77426347, !"_Z3fooi", i32 159, i32 11, i32 2} ``` The fields in each metadata entry are (in sequence): Entry 1) an ID of the type of metadata - right now only zero is used meaning "OpenMP target region". Entry 2) a unique ID of the device where the input source file that contain the target region lives. Entry 3) a unique ID of the file where the input source file that contain the target region lives. Entry 4) a mangled name of the function that encloses the target region. Entries 5) and 6) line and column number where the target region was found. Entry 7) is the order the entry was emitted. Entry 2) and 3) are required to distinguish files that have the same function name. Entry 4) is required to distinguish different instances of the same declaration (usually templated ones) Entries 5) and 6) are required to distinguish the particular target region in body of the function (it is possible that a given target region is not an entry point - if clause can evaluate always to zero - and therefore we need to identify the "interesting" target regions. ) This patch replaces http://reviews.llvm.org/D12306. Reviewers: ABataev, hfinkel, tra, rjmccall, sfantao Subscribers: FBrygidyn, piotr.rak, Hahnfeld, cfe-commits Differential Revision: http://reviews.llvm.org/D12614 llvm-svn: 256842	2016-01-05 16:23:04 +00:00
Alexey Bataev	a6f2a14b94	[OPENMP 4.5] Codegen for 'schedule' clause with monotonic/nonmonotonic modifiers. OpenMP 4.5 adds support for monotonic/nonmonotonic modifiers in 'schedule' clause. Add codegen for these modifiers. llvm-svn: 256666	2015-12-31 06:52:34 +00:00
Alexey Bataev	6f531ec0a2	[OPENMP] Remove explicit call for implicit barrier #pragma omp parallel needs an implicit barrier that is currently done by an explicit call to __kmpc_barrier. However, the runtime already ensures a barrier in __kmpc_fork_call which currently leads to two barriers per region per thread. Differential Revision: http://reviews.llvm.org/D15561 llvm-svn: 255992	2015-12-18 10:24:53 +00:00
Alexey Bataev	8ef3141127	[OPENMP] Fix for http://llvm.org/PR25878 : Error compiling an OpenMP program OpenMP codegen tried to emit the code for its constructs even if it was detected as a dead-code. Added checks to ensure that the code is emitted if the code is not dead. llvm-svn: 255990	2015-12-18 07:58:25 +00:00
Alexey Bataev	fc57d1601d	[OPENMP 4.5] Codegen for 'hint' clause of 'critical' directive OpenMP 4.5 defines 'hint' clause for 'critical' directive. Patch adds codegen for this clause. llvm-svn: 255639	2015-12-15 10:55:09 +00:00
Alexey Bataev	28c75417b2	[OPENMP 4.5] Parsing/sema for 'hint' clause of 'critical' directive. OpenMP 4.5 adds 'hint' clause to critical directive. Patch adds parsing/semantic analysis for this clause. llvm-svn: 255625	2015-12-15 08:19:24 +00:00
Carlo Bertolli	6200a3d0f3	Add parse and sema of OpenMP distribute directive with all clauses except dist_schedule llvm-svn: 255498	2015-12-14 14:51:25 +00:00
Alexey Bataev	33c56402d8	[OPENMP] Fix debug info for 'atomic' construct. Debug info for statement under 'atomic' construct must point exactly to that statement, not the directive itself. llvm-svn: 255487	2015-12-14 09:26:19 +00:00
NAKAMURA Takumi	2d5c6ddf74	Revert r255001, "Add parse and sema for OpenMP distribute directive and all its clauses excluding dist_schedule." It causes memory leak. Some tests in test/OpenMP would fail. llvm-svn: 255094	2015-12-09 04:35:57 +00:00
Alexey Bataev	382967a2e4	[OPENMP 4.5] Parsing/sema for 'num_tasks' clause. OpenMP 4.5 adds directives 'taskloop' and 'taskloop simd'. These directives support clause 'num_tasks'. Patch adds parsing/semantic analysis for this clause. llvm-svn: 255008	2015-12-08 12:06:20 +00:00
Carlo Bertolli	b9bfa75b28	Add parse and sema for OpenMP distribute directive and all its clauses excluding dist_schedule. llvm-svn: 255001	2015-12-08 04:21:03 +00:00
Alexey Bataev	1fd4aed26b	[OPENMP 4.5] parsing/sema support for 'grainsize' clause. OpenMP 4.5 adds 'taskloop' and 'taskloop simd' directives, which have 'grainsize' clause. Patch adds parsing/sema analysis of this clause. llvm-svn: 254903	2015-12-07 12:52:51 +00:00
Alexey Bataev	b825de17b7	[OPENMP 4.5] parsing/sema support for 'nogroup' clause. OpenMP 4.5 adds 'taskloop' and 'taskloop simd' directives. These directives have new 'nogroup' clause. Patch adds basic parsing/sema support for this clause. llvm-svn: 254899	2015-12-07 10:51:44 +00:00
Serge Pavlov	3a5614599a	[PGO] Instrument only base constructors and destructors. Constructors and destructors may be represented by several functions in IR. Only base structors correspond to source code, others are small pieces of code and eventually call the base variant. In this case instrumentation of non-base structors makes little sense; this fix removes it. Now profile data of a declaration corresponds to exactly one function in IR, which agrees with the current logic of the profile data loading. This change fixes PR24996. Differential Revision: http://reviews.llvm.org/D15158 llvm-svn: 254876	2015-12-06 14:32:39 +00:00
Alexey Bataev	0a6ed84a0d	[OPENMP 4.5] Parsing/sema support for 'omp taskloop simd' directive. OpenMP 4.5 adds directive 'taskloop simd'. Patch adds parsing/sema analysis for 'taskloop simd' directive and its clauses. llvm-svn: 254597	2015-12-03 09:40:15 +00:00
Samuel Antao	4af1b7b693	[OpenMP] Update target directive codegen to use 4.5 implicit data mappings. Summary: This patch implements the 4.5 specification for the implicit data maps. OpenMP 4.5 specification changes the default way data is captured into a target region. All the non-aggregate kinds are passed by value by default. This required activating the capturing by value during SEMA for the target region. All the non-aggregate values that can be encoded in the size of a pointer are properly cast and forwarded to the runtime library. On top of fixing the previous weird behavior for mapping pointers in nested data regions (an explicit map was always required), this also improves performance as the number of allocations/transactions to the device per non-aggregate map is reduced from two to only one - instead of passing a reference and the value, only the value is passed. Explicit maps will be added later on once firstprivate, private, and map clauses' SEMA and parsing are available. Reviewers: hfinkel, rjmccall, ABataev Subscribers: cfe-commits, carlo.bertolli Differential Revision: http://reviews.llvm.org/D14940 llvm-svn: 254521	2015-12-02 17:44:43 +00:00
Alexey Bataev	a056935a2f	[OPENMP 4.5] Parsing/sema analysis for 'priority' clause. OpenMP 4.5 defines new clause 'priority' for 'task', 'taskloop' and 'taskloop simd' directives. Added parsing and sema analysis for 'priority' clause in 'task' and 'taskloop' directives. llvm-svn: 254398	2015-12-01 10:17:31 +00:00
Alexey Bataev	49f6e78d71	[OPENMP 4.5] Parsing/sema analysis for 'taskloop' directive. Adds initial parsing and semantic analysis for 'taskloop' directive. llvm-svn: 254367	2015-12-01 04:18:41 +00:00
Kelvin Li	a15fb1a110	[OpenMP] Parsing and sema support for thread_limit clause. http://reviews.llvm.org/D15029 llvm-svn: 254207	2015-11-27 18:47:36 +00:00
Kelvin Li	099bb8c65d	[OpenMP] Parsing and sema support for num_teams clause http://reviews.llvm.org/D14802 llvm-svn: 254019	2015-11-24 20:50:12 +00:00
Kelvin Li	0bff7afab5	[OpenMP] Parsing and sema support for map clause http://reviews.llvm.org/D14134 llvm-svn: 253849	2015-11-23 05:32:03 +00:00
NAKAMURA Takumi	811a09ec5b	CGStmtOpenMP.cpp: Prune redundant \param. [-Wdocumentation] llvm-svn: 249698	2015-10-08 16:41:42 +00:00
Alexey Bataev	f24e7b1f60	[OPENMP 4.1] Codegen for array sections/subscripts in 'reduction' clause. OpenMP 4.1 adds support for array sections/subscripts in 'reduction' clause. Patch adds codegen for this feature. llvm-svn: 249672	2015-10-08 09:10:53 +00:00
Samuel Antao	bed3c46632	[OpenMP] Target directive host codegen. This patch implements the outlining for offloading functions for code annotated with the OpenMP target directive. It uses a temporary naming of the outlined functions that will have to be updated later on once target side codegen and registration of offloading libraries is implemented - the naming needs to be made unique in the produced library. llvm-svn: 249148	2015-10-02 16:14:20 +00:00
Alexey Bataev	5f600d6a49	[OPENMP 4.1] Codegen for ‘simd’ clause in ‘ordered’ directive. Description. If the simd clause is specified, the ordered regions encountered by any thread will use only a single SIMD lane to execute the ordered regions in the order of the loop iterations. Restrictions. An ordered construct with the simd clause is the only OpenMP construct that can appear in the simd region. An ordered directive with ‘simd’ clause is generated as an outlined function and corresponding function call to prevent this part of code from vectorization later in backend. llvm-svn: 248772	2015-09-29 03:48:57 +00:00
Alexey Bataev	d14d1e6f25	[OPENMP 4.1] Add 'simd' clause for 'ordered' directive. Parsing and sema analysis for 'simd' clause in 'ordered' directive. Description If the simd clause is specified, the ordered regions encountered by any thread will use only a single SIMD lane to execute the ordered regions in the order of the loop iterations. Restrictions An ordered construct with the simd clause is the only OpenMP construct that can appear in the simd region llvm-svn: 248696	2015-09-28 06:39:35 +00:00
Alexey Bataev	346265e3bc	[OPENMP 4.1] Add 'threads' clause for '#pragma omp ordered'. OpenMP 4.1 extends format of '#pragma omp ordered'. It adds 3 additional clauses: 'threads', 'simd' and 'depend'. If no clause is specified, the ordered construct behaves as if the threads clause had been specified. If the threads clause is specified, the threads in the team executing the loop region execute ordered regions sequentially in the order of the loop iterations. The loop region to which an ordered region without any clause or with a threads clause binds must have an ordered clause without the parameter specified on the corresponding loop directive. llvm-svn: 248569	2015-09-25 10:37:12 +00:00
Alexey Bataev	87933c7ced	[OPENMP 4.0] Add 'if' clause for 'cancel' directive. Add parsing, sema analysis and codegen for 'if' clause in 'cancel' directive. llvm-svn: 247976	2015-09-18 08:07:34 +00:00
Alexey Bataev	25e5b44654	[OPENMP] Emit __kmpc_cancel_barrier() and code for 'cancellation point' only if 'cancel' is found. Patch improves codegen for OpenMP constructs. If the OpenMP region does not have internal 'cancel' construct, a call to 'void __kmpc_barrier()' runtime function is generated for all implicit/explicit barriers. If the region has inner 'cancel' directive, then ``` if (__kmpc_cancel_barrier()) exit from outer construct; ``` code is generated. Also, the code for 'cancellation point' directive is not generated if parent directive does not have 'cancel' directive. llvm-svn: 247681	2015-09-15 12:52:43 +00:00
Alexey Bataev	c71a4099cf	[OPENMP] Preserve alignment of the original variables for the captured references. Patch makes codegen to preserve alignment of the shared variables captured and used in OpenMP regions. llvm-svn: 247401	2015-09-11 10:29:41 +00:00
Alexey Bataev	2377fe95c6	[OPENMP] Outlined function for parallel and other regions with list of captured variables. Currently all variables used in OpenMP regions are captured into a record and passed to outlined functions in this record. It may result in some poor performance because of too complex analysis later in optimization passes. Patch makes to emit outlined functions for parallel-based regions with a list of captured variables. It reduces code for 2*n GEPs, stores and loads at least. Codegen for task-based regions remains unchanged because runtime requires that all captured variables are passed in captured record. llvm-svn: 247251	2015-09-10 08:12:02 +00:00
John McCall	7f416cc426	Compute and preserve alignment more faithfully in IR-generation. Introduce an Address type to bundle a pointer value with an alignment. Introduce APIs on CGBuilderTy to work with Address values. Change core APIs on CGF/CGM to traffic in Address where appropriate. Require alignments to be non-zero. Update a ton of code to compute and propagate alignment information. As part of this, I've promoted CGBuiltin's EmitPointerWithAlignment helper function to CGF and made use of it in a number of places in the expression emitter. The end result is that we should now be significantly more correct when performing operations on objects that are locally known to be under-aligned. Since alignment is not reliably tracked in the type system, there are inherent limits to this, but at least we are no longer confused by standard operations like derived-to-base conversions and array-to-pointer decay. I've also fixed a large number of bugs where we were applying the complete-object alignment to a pointer instead of the non-virtual alignment, although most of these were hidden by the very conservative approach we took with member alignment. Also, because IRGen now reliably asserts on zero alignments, we should no longer be subject to an absurd but frustrating recurring bug where an incomplete type would report a zero alignment and then we'd naively do a alignmentAtOffset on it and emit code using an alignment equal to the largest power-of-two factor of the offset. We should also now be emitting much more aggressive alignment attributes in the presence of over-alignment. In particular, field access now uses alignmentAtOffset instead of min. Several times in this patch, I had to change the existing code-generation pattern in order to more effectively use the Address APIs. For the most part, this seems to be a strict improvement, like doing pointer arithmetic with GEPs instead of ptrtoint. 
That said, I've tried very hard to not change semantics, but it is likely that I've failed in a few places, for which I apologize. ABIArgInfo now always carries the assumed alignment of indirect and indirect byval arguments. In order to cut down on what was already a dauntingly large patch, I changed the code to never set align attributes in the IR on non-byval indirect arguments. That is, we still generate code which assumes that indirect arguments have the given alignment, but we don't express this information to the backend except where it's semantically required (i.e. on byvals). This is likely a minor regression for those targets that did provide this information, but it'll be trivial to add it back in a later patch. I partially punted on applying this work to CGBuiltin. Please do not add more uses of the CreateDefaultAligned{Load,Store} APIs; they will be going away eventually. llvm-svn: 246985	2015-09-08 08:05:57 +00:00
Alexey Bataev	caacd53dde	[OPENMP] Fix for http://llvm.org/PR24674 : assertion failed and abort trap Fix processing of shared variables with reference types in OpenMP constructs. Previously, if the variable was not marked in one of the private clauses, the reference to this variable was emitted incorrectly and caused an assertion later. llvm-svn: 246846	2015-09-04 11:26:21 +00:00
Alexey Bataev	7371aa365c	[OPENMP 4.1] Codegen for extended format of 'if' clause. Fixed codegen for extended format of 'if' clauses with special 'directive-name-modifier' + ast-print tests for extended format of 'if' clause. llvm-svn: 246748	2015-09-03 08:45:56 +00:00
Benjamin Kramer	fc600dc2ec	[OpenMP] Make the filtered clause iterator a real iterator and type safe. This replaces the filtered generic iterator with a type-specific one based on dyn_cast instead of comparing the kind enum. This allows us to use range-based for loops and eliminates casts. No functionality change intended. llvm-svn: 246384	2015-08-30 15:12:28 +00:00
Alexey Bataev	45bfad51d8	[OPENMP 4.1] Add codegen for 'simdlen' clause. Add emission of metadata for simd loops in presence of 'simdlen' clause. If 'simdlen' clause is provided without 'safelen' clause, the vectorizer width for the loop is set to value of 'simdlen' clause + all read/write ops in loop are marked with '!llvm.mem.parallel_loop_access' metadata. If 'simdlen' clause is provided along with 'safelen' clause, the vectorizer width for the loop is set to value of 'simdlen' clause + all read/write ops in loop are not marked with '!llvm.mem.parallel_loop_access' metadata. If 'safelen' clause is provided without 'simdlen' clause, the vectorizer width for the loop is set to value of 'safelen' clause + all read/write ops in loop are not marked with '!llvm.mem.parallel_loop_access' metadata. llvm-svn: 245697	2015-08-21 12:19:04 +00:00
Alexey Bataev	66b15b505f	[OPENMP 4.1] Initial support for 'simdlen' clause. Add parsing/sema analysis for 'simdlen' clause in simd directives. Also add check that if both 'safelen' and 'simdlen' clauses are specified, the value of 'simdlen' parameter is less than the value of 'safelen' parameter. llvm-svn: 245692	2015-08-21 11:14:16 +00:00
Alexey Bataev	bd9fec1eaa	[OPENMP 4.1] Allow variables with reference types in private clauses. OpenMP 4.1 allows to use variables with reference types in all private clauses (private, firstprivate, lastprivate, linear etc.). Patch allows to use such variables and fixes codegen for linear variables with reference types. llvm-svn: 245268	2015-08-18 06:47:21 +00:00
Alexey Bataev	b08f89ffc1	[OPENMP] Fix for http://llvm.org/PR24371 : Assert failure compiling blender 2.75. blender uses a statement expression in the condition of the loop under control of the '#pragma omp parallel for'. This condition is used several times in different expressions required for codegen of the loop directive. If there are some variables defined in the statement expression, it fires an assert during codegen because of redefinition of the same variables. We have to rebuild several expressions to be sure that all variables are unique. llvm-svn: 245041	2015-08-14 12:25:37 +00:00
Michael Wong	b5c1698994	This patch fixes the assert in emitting captured code in the target data construct. This is on behalf of Kelvin Li. http://reviews.llvm.org/D11475 llvm-svn: 244569	2015-08-11 04:52:01 +00:00
Filipe Cabecinhas	7af183d841	Propagate SourceLocations through to get a Loc on float_cast_overflow Summary: float_cast_overflow is the only UBSan check without a source location attached. This patch propagates SourceLocations where necessary to get them to the EmitCheck() call. Reviewers: rsmith, ABataev, rjmccall Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D11757 llvm-svn: 244568	2015-08-11 04:19:28 +00:00
Michael Wong	e710d5459e	This patch commits OpenMP 4 target device clauses This is committed on behalf of Kelvin Li http://reviews.llvm.org/D11469?id=31227 llvm-svn: 244325	2015-08-07 16:16:36 +00:00
Alexey Bataev	a889917493	[OPENMP 4.1] Allow references in init expression for loop-based constructs. OpenMP 4.1 allows to use variables with reference types in private clauses and, therefore, in init expressions of the canonical loop forms. llvm-svn: 244209	2015-08-06 12:30:57 +00:00
Samuel Antao	9c75cfe976	[OpenMP] Add capture for threadprivate variables used in copyin clause if TLS is enabled in OpenMP code generation. llvm-svn: 243277	2015-07-27 16:38:06 +00:00
Michael Wong	65f367fcbb	Commit for http://reviews.llvm.org/D10765 for OpenMP 4 target data directive parsing and sema. This commit is on behalf of Kelvin Li. llvm-svn: 242785	2015-07-21 13:44:28 +00:00
Tyler Nowicki	da46d0ea8c	Make the variable names match the name of the metadata they control. Rename Vectorizer to Vectorize and VectorizeUnroll to InterleaveCount. llvm-svn: 242241	2015-07-14 23:03:09 +00:00
Alexey Bataev	7d5d33ea33	[OPENMP 4.0] Codegen for 'omp cancel' directive. Add the next codegen for 'omp cancel' directive: if (__kmpc_cancel()) { __kmpc_cancel_barrier(); <exit construct>; } llvm-svn: 241429	2015-07-06 05:50:32 +00:00
Alexey Bataev	81c7ea0ec3	[OPENMP 4.0] Fixed codegen for 'cancellation point' construct. Generate the next code for 'cancellation point': if (__kmpc_cancellationpoint()) { __kmpc_cancel_barrier(); <exit construct>; } llvm-svn: 241336	2015-07-03 09:56:58 +00:00
Benjamin Kramer	642f173ae9	Switch users of the 'for (StmtRange range = stmt->children(); range; ++range)‘ pattern to range for loops. The pattern was born out of the lack of range-based for loops in C++98 and is somewhat obscure. No functionality change intended. llvm-svn: 241300	2015-07-02 21:03:14 +00:00
Alexey Bataev	80909878ad	[OPENMP 4.0] Initial support for 'omp cancel' construct. Implemented parsing/sema analysis + (de)serialization. llvm-svn: 241253	2015-07-02 11:25:17 +00:00
Alexey Bataev	0f34da12e4	[OPENMP 4.0] Codegen for 'cancellation point' directive. The next code is generated for this construct: ``` if (__kmpc_cancellationpoint(ident_t *loc, kmp_int32 global_tid, kmp_int32 cncl_kind) != 0) <exit from outer innermost construct>; ``` llvm-svn: 241239	2015-07-02 04:17:07 +00:00
Alexey Bataev	0039651304	[OPENMP] Introduced type trait "__builtin_omp_required_simd_align" for default simd alignment. Adds type trait "__builtin_omp_required_simd_align" after discussions here http://reviews.llvm.org/D9894 Differential Revision: http://reviews.llvm.org/D10597 llvm-svn: 241237	2015-07-02 03:40:19 +00:00
Alexey Bataev	6d4ed05830	[OPENMP 4.0] Initial support for 'omp cancellation point' construct. Add parsing and sema analysis for 'omp cancellation point' directive. llvm-svn: 241145	2015-07-01 06:57:41 +00:00
Alexey Bataev	1d2353d4f3	[OPENMP] Codegen for 'depend' clause (OpenMP 4.0). If task directive has associated 'depend' clause then function kmp_int32 __kmpc_omp_task_with_deps ( ident_t *loc_ref, kmp_int32 gtid, kmp_task_t *new_task, kmp_int32 ndeps, kmp_depend_info_t *dep_list, kmp_int32 ndeps_noalias, kmp_depend_info_t *noalias_dep_list) must be called instead of __kmpc_omp_task(). If this directive has associated 'if' clause then also before a call of __kmpc_omp_task_begin_if0() a function void __kmpc_omp_wait_deps ( ident_t *loc_ref, kmp_int32 gtid, kmp_int32 ndeps, kmp_depend_info_t *dep_list, kmp_int32 ndeps_noalias, kmp_depend_info_t *noalias_dep_list) must be called. Array sections are not supported yet. llvm-svn: 240532	2015-06-24 11:01:36 +00:00
Alexey Bataev	1c2cfbc3ea	[OPENMP] Initial support for 'depend' clause (4.0). Parsing and sema analysis (without support for array sections in arguments) for 'depend' clause (used in 'task' directive, OpenMP 4.0). llvm-svn: 240409	2015-06-23 14:25:19 +00:00
Alexey Bataev	7f210c6dab	[OPENMP] Codegen for 'proc_bind' clause (4.0). Adds emission of the code for 'proc_bind(master|close|spread)' clause: call void @__kmpc_push_proc_bind(<loc>, i32 thread_id, i32 4|3|2) llvm-svn: 240018	2015-06-18 13:40:03 +00:00
Alexey Bataev	c30dd2daf9	[OPENMP] Support for '#pragma omp taskgroup' directive. Added parsing, sema analysis and codegen for '#pragma omp taskgroup' directive (OpenMP 4.0). The code for directive is generated the following way: #pragma omp taskgroup <body> void __kmpc_taskgroup(<loc>, thread_id); <body> void __kmpc_end_taskgroup(<loc>, thread_id); llvm-svn: 240011	2015-06-18 12:14:09 +00:00
Alexey Bataev	3b5b5c492e	[OPENMP] Add support for 'omp parallel for' directive. Codegen for this directive is a combined codegen for 'omp parallel' region with 'omp for simd' region inside. Clauses are supported. llvm-svn: 240006	2015-06-18 10:10:12 +00:00
Alexey Bataev	58e5bdb091	[OPENMP] Add support for 'omp for simd' directive. Added codegen for combined 'omp for simd' directives, that is a combination of 'omp for' directive followed by 'omp simd' directive. Includes support for all clauses. llvm-svn: 239990	2015-06-18 04:45:29 +00:00
Alexey Bataev	cbdcbb7690	[OPENMP] Code reformatting for omp simd codegen, NFC. llvm-svn: 239889	2015-06-17 07:45:51 +00:00
Alexey Bataev	89e7e8eb0e	[OPENMP] Supported reduction clause in omp simd construct. The following code is generated for reduction clause within 'omp simd' loop construct: #pragma omp simd reduction(op:var) for (...) <body> alloca priv_var priv_var = <initial reduction value>; <loop_start>: <body> // references to original 'var' are replaced by 'priv_var' <loop_end>: var op= priv_var; llvm-svn: 239881	2015-06-17 06:21:39 +00:00
Alexey Bataev	fc087ecc05	[OPENMP] Support lastprivate clause in omp simd directive. Added codegen for lastprivate clauses within simd loop-based directives. llvm-svn: 239813	2015-06-16 13:14:42 +00:00
Alexey Bataev	ae05c29ab5	[OPENMP] Remove last iteration separation for loop-based constructs. Previously the last iteration for simd loop-based OpenMP constructs was generated as separate code. This feature is not required, so codegen is simplified. llvm-svn: 239810	2015-06-16 11:59:36 +00:00
Alexey Bataev	6e8248fdad	[OPENMP] Fix for http://llvm.org/PR23663 : OpenMP crash Destroy RuntimeCleanupScope before generation of termination instruction in parallel loop precondition. llvm-svn: 239524	2015-06-11 10:53:56 +00:00
Alexey Bataev	3ae88e2124	[OPENMP] Prepare codegen for privates in tasks for non-capturing of privates in CapturedStmt. Reworked codegen for privates in tasks: call @kmpc_omp_task_alloc(); ... call @kmpc_omp_task(task_proxy); void map_privates(.privates_rec. privs, type1 * priv1_ref, ..., typen *privn_ref) { priv1_ref = &privs->private1; ... privn_ref = &privs->privaten; ret void } i32 task_entry(i32 ThreadId, i32 PartId, void privs, void (void, ...) map_privates, shareds captures) { type1 priv1; ... typen privn; call map_privates(privs, priv1, ..., privn); <Task body with priv1, .., privn instead of the captured variables>. ret i32 } i32 task_proxy(i32 ThreadId, kmp_task_t_with_privates *tt) { call task_entry(ThreadId, tt->task_data.PartId, &tt->privates, map_privates, tt->task_data.shareds); } llvm-svn: 238010	2015-05-22 08:56:35 +00:00
Alexey Bataev	5129d3a4f5	[OPENMP] Fixed codegen for parameters privatization. For parameters we shall take a derived type of parameters, not the original one. llvm-svn: 237882	2015-05-21 09:47:46 +00:00
Alexey Bataev	7a228ff439	[OPENMP] Fixed codegen for lastprivate LCV in worksharing constructs. If loop control variable in a worksharing construct is marked as lastprivate, we should copy last calculated value of private counter back to original variable. llvm-svn: 237879	2015-05-21 07:59:51 +00:00
Alexey Bataev	d7589ffe1d	[OPENMP] Fix codegen for ordered loop directives. Loops with an ordered clause must be generated the same way as dynamic loops, but with static scheduling. llvm-svn: 237788	2015-05-20 13:12:48 +00:00
Alexey Bataev	1d9c15cf18	[OPENMP] Fixed codegen for copying/initialization of array variables/parameters. This modification generates proper copyin/initialization sequences for array variables/parameters. Before they were considered as pointers, not arrays. llvm-svn: 237691	2015-05-19 12:31:28 +00:00
Alexey Bataev	d130fd17f1	[OPENMP] Fixed codegen for firstprivate variables, also marked as lastprivate. In some rare cases shared copies of lastprivate/firstprivate variables were not updated after the loop directive. llvm-svn: 237243	2015-05-13 10:23:02 +00:00
Alexey Bataev	040d540940	[OPENMP] Fixed support for 'schedule' clause with non-constant chunk size. 'schedule' clause for combined directives requires additional processing. Special helper variable is generated, that is captured in the outlined parallel region for 'parallel for' region. This captured variable is used to store chunk expression from the 'schedule' clause in this 'parallel for' region. llvm-svn: 237100	2015-05-12 08:35:28 +00:00
Alexey Bataev	9d541a72e8	[OPENMP] Fixed atomic construct with non-integer expressions. Do not emit 'atomicrmw' instruction for simple atomic constructs with non-integer expressions. llvm-svn: 236828	2015-05-08 11:47:16 +00:00
Alexey Bataev	39f915b8f4	[OPENMP] Code cleanup for capturing of variables in OpenMP regions. llvm-svn: 236821	2015-05-08 10:41:21 +00:00
Alexey Bataev	53223c986c	[OPENMP] Generate !llvm.mem.parallel_loop_access metadata for loops with dynamic/guided scheduling. Inner bodies of OpenMP worksharing loop-based constructs with dynamic or guided scheduling are allowed to be marked with !llvm.mem.parallel_loop_access metadata for better optimization. Worksharing constructs with static scheduling cannot be marked this way (according to OpenMP standard "A data dependence between the same logical iterations in two such loops is guaranteed"). Constructs with auto and runtime scheduling are also not marked because automatically chosen scheduling may also be static. Differential Revision: http://reviews.llvm.org/D9518 llvm-svn: 236693	2015-05-07 04:25:17 +00:00
Alexey Bataev	9e03404d8d	[OPENMP] Codegen for 'firstprivate' clause in 'task' directive. For tasks codegen for private/firstprivate variables are different rather than for other directives. 1. Build an internal structure of privates for each private variable: struct .kmp_privates_t. { Ty1 var1; ... Tyn varn; }; 2. Add a new field to kmp_task_t type with list of privates. struct kmp_task_t { void * shareds; kmp_routine_entry_t routine; kmp_int32 part_id; kmp_routine_entry_t destructors; .kmp_privates_t. privates; }; 3. Create a function with destructors calls for all privates after end of task region. kmp_int32 .omp_task_destructor.(kmp_int32 gtid, kmp_task_t tt) { ~Destructor(&tt->privates.var1); ... ~Destructor(&tt->privates.varn); return 0; } 4. Perform initialization of all firstprivate fields (by simple copying for POD data, copy constructor calls for classes) + provide address of a destructor function after kmpc_omp_task_alloc() and before kmpc_omp_task() calls. kmp_task_t new_task = __kmpc_omp_task_alloc(ident_t , kmp_int32 gtid, kmp_int32 flags, size_t sizeof_kmp_task_t, size_t sizeof_shareds, kmp_routine_entry_t task_entry); CopyConstructor(new_task->privates.var1, new_task->shareds.var1_ref); new_task->shareds.var1_ref = &new_task->privates.var1; ... CopyConstructor(new_task->privates.varn, new_task->shareds.varn_ref); new_task->shareds.varn_ref = &new_task->privates.varn; new_task->destructors = .omp_task_destructor.; kmp_int32 __kmpc_omp_task(ident_t , kmp_int32 gtid, kmp_task_t new_task) Differential Revision: http://reviews.llvm.org/D9370 llvm-svn: 236479	2015-05-05 04:05:12 +00:00
Benjamin Kramer	439ee9d7bc	Make helper functions static. NFC. llvm-svn: 236315	2015-05-01 13:59:53 +00:00
Alexey Bataev	36c1eb95e0	[OPENMP] Codegen for 'private' clause in 'task' directive. For tasks codegen for private/firstprivate variables are different rather than for other directives. 1. Build an internal structure of privates for each private variable: struct .kmp_privates_t. { Ty1 var1; ... Tyn varn; }; 2. Add a new field to kmp_task_t type with list of privates. struct kmp_task_t { void * shareds; kmp_routine_entry_t routine; kmp_int32 part_id; kmp_routine_entry_t destructors; .kmp_privates_t. privates; }; 3. Create a function with destructors calls for all privates after end of task region. kmp_int32 .omp_task_destructor.(kmp_int32 gtid, kmp_task_t tt) { ~Destructor(&tt->privates.var1); ... ~Destructor(&tt->privates.varn); return 0; } 4. Perform default initialization of all private fields (no initialization for POD data, default constructor calls for classes) + provide address of a destructor function after kmpc_omp_task_alloc() and before kmpc_omp_task() calls. kmp_task_t new_task = __kmpc_omp_task_alloc(ident_t , kmp_int32 gtid, kmp_int32 flags, size_t sizeof_kmp_task_t, size_t sizeof_shareds, kmp_routine_entry_t task_entry); DefaultConstructor(new_task->privates.var1); new_task->shareds.var1_ref = &new_task->privates.var1; ... DefaultConstructor(new_task->privates.varn); new_task->shareds.varn_ref = &new_task->privates.varn; new_task->destructors = .omp_task_destructor.; kmp_int32 __kmpc_omp_task(ident_t , kmp_int32 gtid, kmp_task_t new_task) Differential Revision: http://reviews.llvm.org/D9322 llvm-svn: 236207	2015-04-30 06:51:57 +00:00
Alexey Bataev	6111469a4a	[OPENMP] Fix crash on loop control vars explicitly marked as private. It is allowed to mark loop control vars as private in 'private' or 'lastprivate' clause, so no need to assert here. llvm-svn: 235985	2015-04-28 13:20:05 +00:00
Alexey Bataev	c925aa3ab8	[OPENMP] Simplified iteration over clauses, NFC. llvm-svn: 235838	2015-04-27 08:00:32 +00:00
Alexey Bataev	8b8e202a33	[OPENMP] Codegen for 'taskwait' directive. Emit the following code for 'taskwait' directive within tied task: call i32 @__kmpc_omp_taskwait(<loc>, i32 <thread_id>); Differential Revision: http://reviews.llvm.org/D9245 llvm-svn: 235836	2015-04-27 05:22:09 +00:00
Alexey Bataev	a89adf22db	[OPENMP] Codegen for 'reduction' clause in 'sections' directive. Emit a code for reduction clause. Next code should be emitted for reductions: static kmp_critical_name lock = { 0 }; void reduce_func(void lhs[<n>], void rhs[<n>]) { (Type0)lhs[0] = ReductionOperation0((Type0)lhs[0], (Type0)rhs[0]); ... (Type<n>-1)lhs[<n>-1] = ReductionOperation<n>-1((Type<n>-1)lhs[<n>-1], (Type<n>-1)rhs[<n>-1]); } ... void RedList[<n>] = {&<RHSExprs>[0], ..., &<RHSExprs>[<n>-1]}; switch (__kmpc_reduce{_nowait}(<loc>, <gtid>, <n>, sizeof(RedList), RedList, reduce_func, &<lock>)) { case 1: <LHSExprs>[0] = ReductionOperation0(<LHSExprs>[0], <RHSExprs>[0]); ... <LHSExprs>[<n>-1] = ReductionOperation<n>-1(<LHSExprs>[<n>-1], <RHSExprs>[<n>-1]); __kmpc_end_reduce{_nowait}(<loc>, <gtid>, &<lock>); break; case 2: Atomic(<LHSExprs>[0] = ReductionOperation0(<LHSExprs>[0], <RHSExprs>[0])); ... Atomic(<LHSExprs>[<n>-1] = ReductionOperation<n>-1(<LHSExprs>[<n>-1], *<RHSExprs>[<n>-1])); break; default:; } Reduction variables are a kind of a private variables, they have private copies, but initial values are chosen in accordance with the reduction operation. If sections directive has only single section, then original shared variables are used instead with barrier at the end of the directive. Differential Revision: http://reviews.llvm.org/D9242 llvm-svn: 235835	2015-04-27 05:04:13 +00:00
Alexey Bataev	9efc03b6f7	[OPENMP] Codegen for 'lastprivate' clause in 'sections' directive. #pragma omp sections lastprivate(<var>) <BODY>; This construct is translated into something like: <last_iter> = alloca i32 <init for lastprivates>; <last_iter> = 0 ; No initializer for simple variables or a default constructor is called for objects. ; For arrays perform element by element initialization by the call of the default constructor. ... OMP_FOR_START(...,<last_iter>, ..); sets <last_iter> to 1 if this is the last iteration. <BODY> ... OMP_FOR_END if (<last_iter> != 0) { <final copy for lastprivate>; Update original variable with the lastprivate value. } call __kmpc_cancel_barrier() ; an implicit barrier to avoid possible data race. If there is only one section, there is no special code generation, original shared variables are used + barrier is emitted at the end of the directive. Differential Revision: http://reviews.llvm.org/D9240 llvm-svn: 235834	2015-04-27 04:34:03 +00:00
Alexey Bataev	7387083d95	[OPENMP] Codegen for 'private' clause in 'sections' directive. If there are 2 or more sections in a 'section' directive the following code is generated: <default init for privates> @__kmpc_for_static_init_4(); <BODY for sections directive> @__kmpc_for_static_fini() If there is only one section, the following code is generated: if (@__kmpc_single()) { <default init for privates> @__kmpc_end_single(); } Differential Revision: http://reviews.llvm.org/D9239 llvm-svn: 235833	2015-04-27 04:12:12 +00:00
Alexey Bataev	59c654aa43	[OPENMP] Codegen for 'private' clause in 'single' directive. Emit the following code for 'single' directive with 'private' clause: if (@__kmpc_single()) { <default init for privates> @__kmpc_end_single(); } Differential Revision: http://reviews.llvm.org/D9238 llvm-svn: 235832	2015-04-27 03:48:52 +00:00
Alexey Bataev	5521d78532	[OPENMP] Codegen for 'firstprivate' clause in 'single' directive. Emit the following code for 'single' directive with 'firstprivate' clause: if (@__kmpc_single()) { <init for firstprivates> @__kmpc_end_single(); } @__kmpc_cancel_barrier(); // To avoid data race in firstprivate init Differential Revision: http://reviews.llvm.org/D9223 llvm-svn: 235694	2015-04-24 04:21:15 +00:00
Alexey Bataev	8b72566eec	[OPENMP] Do not emit implicit barrier for single directive with 'copyprivate' clause(s). Runtime function for 'copyprivate' directive generates implicit barriers, so no need to emit it. Differential Revision: http://reviews.llvm.org/D9215 llvm-svn: 235692	2015-04-24 04:00:39 +00:00
Alexey Bataev	2cb9b95adf	[OPENMP] Codegen for 'firstprivate' clause in 'sections' directive. If there are 2 or more sections in a 'section' directive the following code is generated: <init for firstprivates> @__kmpc_cancel_barrier();// To avoid data race in firstprivate init @__kmpc_for_static_init_4(); <BODY for sections directive> @__kmpc_for_static_fini() If there is only one section, the following code is generated: if (@__kmpc_single()) { <init for firstprivates> @__kmpc_end_single(); } @__kmpc_cancel_barrier(); // To avoid data race in firstprivate init Differential Revision: http://reviews.llvm.org/D9214 llvm-svn: 235691	2015-04-24 03:37:03 +00:00
Justin Bogner	66242d6c5e	InstrProf: Stop using RegionCounter outside of CodeGenPGO (NFC) The RegionCounter type does a lot of legwork, but most of it is only meaningful within the implementation of CodeGenPGO. The uses elsewhere in CodeGen generally just want to increment or read counters, so do that directly. llvm-svn: 235664	2015-04-23 23:06:47 +00:00
Alexey Bataev	5e018f9e29	[OPENMP] Codegen for 'atomic capture'. Adds codegen for 'atomic capture' constructs with the following forms of expressions/statements: v = x binop= expr; v = x++; v = ++x; v = x--; v = --x; v = x = x binop expr; v = x = expr binop x; {v = x; x binop= expr;} {v = x; x++;} {v = x; ++x;} {v = x; x--;} {v = x; --x;} {x = x binop expr; v = x;} {x binop= expr; v = x;} {x++; v = x;} {++x; v = x;} {x--; v = x;} {--x; v = x;} {x = x binop expr; v = x;} {x = expr binop x; v = x;} {v = x; x = expr;} If x and expr are integer and binop is associative or x is a LHS in a RHS of the assignment expression, and atomics are allowed for type of x on the target platform atomicrmw instruction is emitted. Otherwise compare-and-swap sequence is emitted. Update of 'v' is not required to be atomic with respect to the read or write of the 'x'. bb: ... atomic load <x> cont: <expected> = phi [ <x>, label %bb ], [ <new_failed>, %cont ] <desired> = <expected> binop <expr> <res> = cmpxchg atomic &<x>, desired, expected <new_failed> = <res>.field1; br <res>.field2, label %exit, label %cont exit: atomic store <old/new x>, <v> ... Differential Revision: http://reviews.llvm.org/D9049 llvm-svn: 235573	2015-04-23 06:35:10 +00:00
Alexey Bataev	1d67713b44	[OPENMP] Codegen for 'if' clause in 'task' directive. If condition evaluates to true, the code executes task by calling @__kmpc_omp_task() runtime function. If condition evaluates to false, the code executes serial version of the code by executing the following code: call void @__kmpc_omp_task_begin_if0(<loc>, <threadid>, <task_t_ptr, returned by @__kmpc_omp_task_alloc()>); proxy_task_entry(<gtid>, <task_t_ptr, returned by @__kmpc_omp_task_alloc()>); call void @__kmpc_omp_task_complete_if0(<loc>, <threadid>, <task_t_ptr, returned by @__kmpc_omp_task_alloc()>); Also it checks if the condition is constant and if it is constant it evaluates its value and then generates either parallel version of the code (if the condition evaluates to true), or the serial version of the code (if the condition evaluates to false). Differential Revision: http://reviews.llvm.org/D9143 llvm-svn: 235507	2015-04-22 13:57:31 +00:00
Alexey Bataev	7ebe5fddac	[OPENMP] Codegen for 'reduction' clause in 'for' directive. Emit a code for reduction clause. Next code should be emitted for reductions: static kmp_critical_name lock = { 0 }; void reduce_func(void lhs[<n>], void rhs[<n>]) { (Type0)lhs[0] = ReductionOperation0((Type0)lhs[0], (Type0)rhs[0]); ... (Type<n>-1)lhs[<n>-1] = ReductionOperation<n>-1((Type<n>-1)lhs[<n>-1], (Type<n>-1)rhs[<n>-1]); } ... void RedList[<n>] = {&<RHSExprs>[0], ..., &<RHSExprs>[<n>-1]}; switch (__kmpc_reduce{_nowait}(<loc>, <gtid>, <n>, sizeof(RedList), RedList, reduce_func, &<lock>)) { case 1: <LHSExprs>[0] = ReductionOperation0(<LHSExprs>[0], <RHSExprs>[0]); ... <LHSExprs>[<n>-1] = ReductionOperation<n>-1(<LHSExprs>[<n>-1], <RHSExprs>[<n>-1]); __kmpc_end_reduce{_nowait}(<loc>, <gtid>, &<lock>); break; case 2: Atomic(<LHSExprs>[0] = ReductionOperation0(<LHSExprs>[0], <RHSExprs>[0])); ... Atomic(<LHSExprs>[<n>-1] = ReductionOperation<n>-1(<LHSExprs>[<n>-1], *<RHSExprs>[<n>-1])); break; default:; } Reduction variables are a kind of a private variables, they have private copies, but initial values are chosen in accordance with the reduction operation. Differential Revision: http://reviews.llvm.org/D9139 llvm-svn: 235506	2015-04-22 13:43:03 +00:00
Alexey Bataev	50a6458870	[OPENMP] Codegen for 'private' clause in 'for' directive. This patch generates helper variables which are used as private copies of the corresponding original variables inside an OpenMP 'for' directive. These generated variables are initialized by default (with the default constructor, if any). In OpenMP region references to original variables are replaced by the references to these private helper variables. Differential Revision: http://reviews.llvm.org/D9106 llvm-svn: 235503	2015-04-22 12:24:45 +00:00
Alexey Bataev	62dbb979c0	[OPENMP] Fix use of unsigned counters in loops with zero trip count. Patch fixes bugs in codegen for loops with unsigned counters and zero trip count. Previously preconditions for all loops were built using logic (Upper - Lower) > 0. But if the loop is a loop with zero trip count, then Upper - Lower is < 0 only for signed integer, for unsigned we're running into an underflow situation. In this patch we're using original Lower<Upper condition to check that loop body can be executed at least once. Also this allows to skip code generation for loops, if it is known that preconditions for the loop are always false. Differential Revision: http://reviews.llvm.org/D9103 llvm-svn: 235500	2015-04-22 11:59:37 +00:00
Alexey Bataev	98eb6e3d41	[OPENMP] Codegen for 'ordered' directive. Add codegen for 'ordered' directive: __kmpc_ordered(ident_t , gtid); <associated statement>; __kmpc_end_ordered(ident_t , gtid); Also for 'for' directives with the dynamic scheduling and an 'ordered' clause added a call to '__kmpc_dispatch_fini_(4|8)[u]()' function after increment expression for loop control variable: while(__kmpc_dispatch_next(&LB, &UB)) { idx = LB; while (idx <= UB) { BODY; ++idx; __kmpc_dispatch_fini_(4|8)[u](); // For ordered loops only. } // inner loop } Differential Revision: http://reviews.llvm.org/D9070 llvm-svn: 235496	2015-04-22 11:15:40 +00:00
Benjamin Kramer	5df7c1a4eb	Make helper function static. NFC. llvm-svn: 235253	2015-04-18 10:00:10 +00:00
Alexey Bataev	f56f98c925	[OPENMP] Codegen for 'copyin' clause in 'parallel' directive. Emits the following code for the clause at the beginning of the outlined function for implicit threads: if (<not a master thread>) { ... <thread local copy of var> = <master thread local copy of var>; ... } <sync point>; Checking for a non-master thread is performed by comparing of the address of the thread local variable with the address of the master's variable. Master thread always uses original variables, so you always know the address of the variable in the master thread. Differential Revision: http://reviews.llvm.org/D9026 llvm-svn: 235075	2015-04-16 05:39:01 +00:00
Alexey Bataev	38e8953352	[OPENMP] Codegen for 'lastprivate' clause in 'for' directive. #pragma omp for lastprivate(<var>) for (i = a; i < b; ++i) <BODY>; This construct is translated into something like: <last_iter> = alloca i32 <lastprivate_var> = alloca <type> <last_iter> = 0 ; No initializer for simple variables or a default constructor is called for objects. ; For arrays perform element by element initialization by the call of the default constructor. ... OMP_FOR_START(...,<last_iter>, ..); sets <last_iter> to 1 if this is the last iteration. <BODY> ... OMP_FOR_END if (<last_iter> != 0) { <var> = <lastprivate_var> ; Update original variable with the lastprivate value. } call __kmpc_cancel_barrier() ; an implicit barrier to avoid possible data race. Differential Revision: http://reviews.llvm.org/D8658 llvm-svn: 235074	2015-04-16 04:54:05 +00:00
Alexey Bataev	69c62a9bdb	[OPENMP] Codegen for 'firstprivate' clause in 'for' directive. Adds proper codegen for 'firstprivate' clause in for directive. Initially codegen for 'firstprivate' clause was implemented for 'parallel' directive only. Also this patch emits sync point only after initialization of firstprivate variables, not all private variables. This sync point is not required for privates, lastprivates etc., only for initialization of firstprivate variables. Differential Revision: http://reviews.llvm.org/D8660 llvm-svn: 234978	2015-04-15 04:52:20 +00:00
Alexey Bataev	420d45b2dd	[OPENMP] Fixed codegen for arrays in 'copyprivate' clause. Fixed a bug with codegen of variables with array types specified in 'copyprivate' clause of 'single' directive. Differential Revision: http://reviews.llvm.org/D8914 llvm-svn: 234856	2015-04-14 05:11:24 +00:00
Alexey Bataev	68adb7da1a	[OPENMP] Initial codegen for 'parallel sections' directive. Emits code for outlined 'parallel' directive with the implicitly inlined 'sections' directive: ... call __kmpc_fork_call(..., outlined_function, ...); ... define internal void outlined_function(...) { <code for implicit sections directive>; } Differential Revision: http://reviews.llvm.org/D8997 llvm-svn: 234849	2015-04-14 03:29:22 +00:00
Alexey Bataev	671605e85b	[OPENMP] Initial codegen for 'parallel for' directive. Allows generation of combined 'parallel for' directive that represents 'parallel' region with internal implicit 'for' worksharing region. Differential Revision: http://reviews.llvm.org/D8631 llvm-svn: 234722	2015-04-13 05:28:11 +00:00
Alexey Bataev	794ba0dcb7	[OPENMP] Codegen for 'reduction' clause in 'parallel' directive. Emit a code for reduction clause. Next code should be emitted for reductions: static kmp_critical_name lock = { 0 }; void reduce_func(void lhs[<n>], void rhs[<n>]) { ... (Type<i> )lhs[i] = RedOp<i>((Type<i> )lhs[i], (Type<i> )rhs[i]); ... } ... void RedList[<n>] = {&<RHSExprs>[0], ..., &<RHSExprs>[<n> - 1]}; switch (__kmpc_reduce{_nowait}(<loc>, <gtid>, <n>, sizeof(RedList), RedList, reduce_func, &<lock>)) { case 1: ... <LHSExprs>[i] = RedOp<i>(<LHSExprs>[i], <RHSExprs>[i]); ... __kmpc_end_reduce{_nowait}(<loc>, <gtid>, &<lock>); break; case 2: ... Atomic(<LHSExprs>[i] = RedOp<i>(<LHSExprs>[i], *<RHSExprs>[i])); ... break; default: ; } Reduction variables are a kind of a private variables, they have private copies, but initial values are chosen in accordance with the reduction operation. Differential Revision: http://reviews.llvm.org/D8915 llvm-svn: 234583	2015-04-10 10:43:45 +00:00
Alexey Bataev	6f1ffc069b	[OPENMP] Refactoring of codegen for OpenMP directives. Refactored API of OpenMPRuntime for compatibility with combined directives. Differential Revision: http://reviews.llvm.org/D8859 llvm-svn: 234564	2015-04-10 04:50:10 +00:00
Alexey Bataev	b4505a7229	[OPENMP] Codegen for 'atomic update' construct. Adds atomic update codegen for the following forms of expressions: x binop= expr; x++; ++x; x--; --x; x = x binop expr; x = expr binop x; If x and expr are integer and binop is associative or x is a LHS in a RHS of the assignment expression, and atomics are allowed for type of x on the target platform atomicrmw instruction is emitted. Otherwise compare-and-swap sequence is emitted: bb: ... atomic load <x> cont: <expected> = phi [ <x>, label %bb ], [ <new_failed>, %cont ] <desired> = <expected> binop <expr> <res> = cmpxchg atomic &<x>, desired, expected <new_failed> = <res>.field1; br <res>.field2, label %exit, label %cont exit: ... Differential Revision: http://reviews.llvm.org/D8536 llvm-svn: 233513	2015-03-30 05:20:59 +00:00
Alexey Bataev	f268568447	[OPENMP] Improved codegen for implicit/explicit 'barrier' constructs. Replace boolean IsExplicit parameter of OpenMPRuntime::emitBarrierCall() method by OpenMPDirectiveKind Kind for better compatibility with the runtime library. Also add processing of 'nowait' clause on worksharing directives. Differential Revision: http://reviews.llvm.org/D8659 llvm-svn: 233511	2015-03-30 04:30:22 +00:00
Alexey Bataev	a63048e4fd	[OPENMP] Codegen for 'copyprivate' clause ('single' directive). If there is at least one 'copyprivate' clause is associated with the single directive, the following code is generated: ``` i32 did_it = 0; \\ for 'copyprivate' clause if(__kmpc_single(ident_t , gtid)) { SingleOpGen(); __kmpc_end_single(ident_t , gtid); did_it = 1; \\ for 'copyprivate' clause } <copyprivate_list>[0] = &var0; ... <copyprivate_list>[n] = &varn; call __kmpc_copyprivate(ident_t , gtid, <copyprivate_list_size>, <copyprivate_list>, <copy_func>, did_it); ... void<copy_func>(void LHSArg, void RHSArg) { Dst = (void [n])(LHSArg); Src = (void * [n])(RHSArg); Dst[0] = Src[0]; ... Dst[n] = Src[n]; } ``` All list items from all 'copyprivate' clauses are gathered into single <copyprivate list> (<copyprivate_list_size> is a size in bytes of this list) and <copy_func> is used to propagate values of private or threadprivate variables from the 'single' region to other implicit threads from outer 'parallel' region. Differential Revision: http://reviews.llvm.org/D8410 llvm-svn: 232932	2015-03-23 06:18:07 +00:00
Alexander Musman	3276a27b5c	[OPENMP] CodeGen of the 'linear' clause for the 'omp simd' directive. The linear variable is privatized (similar to 'private') and its value on current iteration is calculated, similar to the loop counter variables. Differential revision: http://reviews.llvm.org/D8375 llvm-svn: 232890	2015-03-21 10:12:56 +00:00
Alexander Musman	7931b98735	[OPENMP] Enable codegen of the ‘private’ clause for ‘omp simd’ directive llvm-svn: 232353	2015-03-16 07:14:41 +00:00
Alexander Musman	92bdaabf97	[OPENMP] CodeGen - 'omp for' with dynamic schedule kinds. Differential Revision: http://reviews.llvm.org/D7138 llvm-svn: 232036	2015-03-12 13:37:50 +00:00
Alexey Bataev	2df54a07bf	[OPENMP] Initial codegen for 'omp sections' and 'omp section' directives. If only one section is found in the sections region, it is emitted just like single region. Otherwise it is emitted as a static non-chunked loop. #pragma omp sections { #pragma omp section {1} ... #pragma omp section {n} } is translated to something like i32 <iter_var> i32 <last_iter> = 0 i32 <lower_bound> = 0 i32 <upper_bound> = n-1 i32 <stride> = 1 call void @__kmpc_for_static_init_4(<loc>, i32 <gtid>, i32 34/static non-chunked/, i32* <last_iter>, i32* <lower_bound>, i32* <upper_bound>, i32* <stride>, i32 1/increment always 1/, i32 1/chunk always 1/) <upper_bound> = min(<upper_bound>, n-1) <iter_var> = <lb> check: br <iter_var> <= <upper_bound>, label cont, label exit continue: switch (IV) { case 0: {1}; break; ... case <NumSection> - 1: {n}; break; } ++<iter_var> br label check exit: call void @__kmpc_for_static_fini(<loc>, i32 <gtid>) Differential Revision: http://reviews.llvm.org/D8244 llvm-svn: 232021	2015-03-12 08:53:29 +00:00
Alexey Bataev	10fec57e5a	[OPENMP] Fix for ExprWithCleanups in 'omp atomic' constructs. This patch allows using of ExprWithCleanups expressions and other complex expressions in 'omp atomic' construct Differential Revision: http://reviews.llvm.org/D8200 llvm-svn: 231905	2015-03-11 04:48:56 +00:00
Alexey Bataev	62b63b197d	[OPENMP] Initial codegen for 'omp task' directive. The task region is emitted in several steps: Emit a call to kmp_task_t __kmpc_omp_task_alloc(ident_t , kmp_int32 gtid, kmp_int32 flags, size_t sizeof_kmp_task_t, size_t sizeof_shareds, kmp_routine_entry_t task_entry). Here task_entry is a pointer to the function: kmp_int32 .omp_task_entry.(kmp_int32 gtid, kmp_task_t tt) { TaskFunction(gtid, tt->part_id, tt->shareds); return 0; } Copy a list of shared variables to field shareds of the resulting structure kmp_task_t returned by the previous call (if any). Copy a pointer to destructions function to field destructions of the resulting structure kmp_task_t. Emit a call to kmp_int32 __kmpc_omp_task(ident_t , kmp_int32 gtid, kmp_task_t new_task), where new_task is a resulting structure from previous items. Differential Revision: http://reviews.llvm.org/D7560 llvm-svn: 231762	2015-03-10 07:28:44 +00:00
Alexey Bataev	36bf011e83	[OPENMP] Improved code for generating debug info + generation of all OpenMP regions in termination scope Patch adds proper generation of debug info for all OpenMP regions. Also, all OpenMP regions are generated in a termination scope, because standard does not allow to throw exceptions out of structured blocks, associated with the OpenMP regions Differential Revision: http://reviews.llvm.org/D7935 llvm-svn: 231757	2015-03-10 05:15:26 +00:00
Rafael Espindola	eb26ddf559	Revert "[OPENMP] Improved code for generating debug info + generation of all OpenMP regions in termination scope Patch adds proper generation of debug info for all OpenMP regions. Also, all OpenMP regions are generated in a termination scope, because standard does not allow to throw exceptions out of structured blocks, associated with the OpenMP regions Differential Revision: http://reviews.llvm.org/D7935 " This reverts commit r231752. It was failing to link with cmake: lib64/libclangCodeGen.a(CGOpenMPRuntime.cpp.o):/home/espindola/llvm/llvm/tools/clang/lib/CodeGen/CGOpenMPRuntime.cpp:function clang::CodeGen::InlinedOpenMPRegionRAII::~InlinedOpenMPRegionRAII(): error: undefined reference to 'clang::CodeGen::EHScopeStack::popTerminate()' clang-3.7: error: linker command failed with exit code 1 (use -v to see invocation) llvm-svn: 231754	2015-03-10 04:40:21 +00:00
Alexey Bataev	7ab2cc178f	[OPENMP] Improved code for generating debug info + generation of all OpenMP regions in termination scope Patch adds proper generation of debug info for all OpenMP regions. Also, all OpenMP regions are generated in a termination scope, because standard does not allow to throw exceptions out of structured blocks, associated with the OpenMP regions Differential Revision: http://reviews.llvm.org/D7935 llvm-svn: 231752	2015-03-10 04:22:11 +00:00
Alexey Bataev	b832926176	[OPENMP] Codegen for "#pragma omp atomic write" For global reg lvalue - use regular store through global register. For simple lvalue - use simple atomic store. For bitfields, vector element, extended vector elements - the original value of the whole storage (for vector elements) or of some aligned value (for bitfields) is atomically read, the part of this value for the given lvalue is modified and then use atomic compare-and-exchange operation to try to atomically write modified value (if it was not modified). Also, changes in this patch fix the bug for '#pragma omp atomic read' applied to extended vector elements. Differential Revision: http://reviews.llvm.org/D7369 llvm-svn: 230736	2015-02-27 06:33:30 +00:00
Alexey Bataev	8cbe0a6b62	[OPENMP] Fixed codegen for directives without function outlining. Fixed crash on codegen for directives like 'omp for', 'omp single' etc. inside of the 'omp parallel', 'omp task' etc. regions. llvm-svn: 230621	2015-02-26 10:27:34 +00:00
Alexey Bataev	3eff5f46d7	[OPENMP] Rename methods of OpenMPRuntime class. NFC. llvm-svn: 230470	2015-02-25 08:32:46 +00:00
David Majnemer	a5b195a1dc	Revert "Revert r229082 for a bit, it caused PR22577." This reverts commit r229123. It was a red herring, the bug was present without r229082. llvm-svn: 229205	2015-02-14 01:35:12 +00:00
Nico Weber	7ce96b853d	Revert r229082 for a bit, it caused PR22577. llvm-svn: 229123	2015-02-13 16:27:00 +00:00
David Majnemer	abc482effc	MS ABI: Implement /volatile:ms The /volatile:ms semantics turn volatile loads and stores into atomic acquire and release operations. This distinction is important because volatile memory operations do not form a happens-before relationship with non-atomic memory. This means that a volatile store is not sufficient for implementing a mutex unlock routine. Differential Revision: http://reviews.llvm.org/D7580 llvm-svn: 229082	2015-02-13 07:55:47 +00:00
Alexey Bataev	6956e2e683	[OPENMP] Initial codegen for 'single' directive. This patch emits the following code for the single directive: #pragma omp single <body> <----> if(__kmpc_single(...)) { <body> __kmpc_end_single(...); } Differential Revision: http://reviews.llvm.org/D7045 llvm-svn: 228275	2015-02-05 06:35:41 +00:00
Alexey Bataev	9f797f32e2	[OPENMP] Codegen for 'taskyield' directive For 'taskyield' directive emit call to kmp_int32 __kmpc_omp_taskyield(ident_t *, kmp_int32 global_tid, int end_part); runtime function call with end_part arg set to 0 (it is ignored). Differential Revision: http://reviews.llvm.org/D7047 llvm-svn: 228272	2015-02-05 05:57:51 +00:00
Adrian Prantl	95b24e9b59	Address review feedback for r228003. - use named constructors - get rid of MarkAsPrologue llvm-svn: 228021	2015-02-03 20:00:54 +00:00
Adrian Prantl	39428e74a0	Merge ArtificialLocation into ApplyDebugLocation and make a clear distinction between the different use-cases. With the previous default behavior we would occasionally emit empty debug locations in situations where they actually were strictly required (= on invoke insns). We now have a choice between defaulting to an empty location or an artificial location. Specifically, this fixes a bug caused by a missing debug location when emitting C++ EH cleanup blocks from within an artificial function, such as an ObjC destroy helper function. rdar://problem/19670595 llvm-svn: 228003	2015-02-03 18:40:42 +00:00
Alexander Musman	df7a8e2bc8	Support ‘omp for’ with static chunked schedule kind. Differential Revision: http://reviews.llvm.org/D7006 llvm-svn: 226795	2015-01-22 08:49:35 +00:00
Alexey Bataev	b57056f483	[OPENMP] CodeGen for "omp atomic read [seq_cst]" directive. "omp atomic read [seq_cst]" accepts expressions "v=x;". In this patch we perform an atomic load of "x" (using builtin atomic loading instructions or a call to "atomic_load()" for simple lvalues and "kmpc_atomic_start();load <x>;kmpc_atomic_end();" for other lvalues), convert the result of loading to type of "v" (using EmitScalarConversion() for simple types and EmitComplexToScalarConversion() for conversions from complex to scalar) and then store the result in "v".) Differential Revision: http://reviews.llvm.org/D6431 llvm-svn: 226788	2015-01-22 06:17:56 +00:00
Alexey Bataev	02e1ec6966	Revert commit revision 226786 Need to add initialization of AtomicInfo::EvaluationKind field llvm-svn: 226787	2015-01-22 06:09:48 +00:00
Alexey Bataev	92efdce503	[OPENMP] CodeGen for "omp atomic read [seq_cst]" directive. "omp atomic read [seq_cst]" accepts expressions "v=x;". In this patch we perform an atomic load of "x" (using builtin atomic loading instructions or a call to "atomic_load()" for simple lvalues and "kmpc_atomic_start();load <x>;kmpc_atomic_end();" for other lvalues), convert the result of loading to type of "v" (using EmitScalarConversion() for simple types and EmitComplexToScalarConversion() for conversions from complex to scalar) and then store the result in "v".) Differential Revision: http://reviews.llvm.org/D6431 llvm-svn: 226786	2015-01-22 05:44:37 +00:00
Alexey Bataev	137421c8a9	Revert commit r226784. Accidentally modified file SemaType.cpp must be restored to its original state. llvm-svn: 226785	2015-01-22 05:35:53 +00:00
Alexey Bataev	13c7c4930c	[OPENMP] CodeGen for "omp atomic read [seq_cst]" directive. "omp atomic read [seq_cst]" accepts expressions "v=x;". In this patch we perform an atomic load of "x" (using builtin atomic loading instructions or a call to "atomic_load()" for simple lvalues and "kmpc_atomic_start();load <x>;kmpc_atomic_end();" for other lvalues), convert the result of loading to type of "v" (using EmitScalarConversion() for simple types and EmitComplexToScalarConversion() for conversions from complex to scalar) and then store the result in "v". Differential Revision: http://reviews.llvm.org/D6431 llvm-svn: 226784	2015-01-22 05:29:28 +00:00
Chandler Carruth	0d9593ddec	[cleanup] Re-sort all #include lines with llvm/utils/sort_includes.py Sorry for the noise, I managed to miss a bunch of recent regressions of include orderings here. This should actually sort all the includes for Clang. Again, no functionality changed, this is just a mechanical cleanup that I try to run periodically to keep the #include lines as regular as possible across the project. llvm-svn: 225979	2015-01-14 11:29:14 +00:00
David Blaikie	66e4197f07	Reapply r225000 (reverted in r225555): DebugInfo: Generalize debug info location handling (and follow-up commits). Several pieces of code were relying on implicit debug location setting which usually lead to incorrect line information anyway. So I've fixed those (in r225955 and r225845) separately which should pave the way for this commit to be cleanly reapplied. The reason these implicit dependencies resulted in crashes with this patch is that the debug location would no longer implicitly leak from one place to another, but be set back to invalid. Once a call with no/invalid location was emitted, if that call was ever inlined it could produce invalid debugloc chains and assert during LLVM's codegen. There may be further cases of such bugs in this patch - they're hard to flush out with regression testing, so I'll keep an eye out for reports and investigate/fix them ASAP if they come up. Original commit message: Reapply "DebugInfo: Generalize debug info location handling" Originally committed in r224385 and reverted in r224441 due to concerns this change might've introduced a crash. Turns out this change fixes the crash introduced by one of my earlier more specific location handling changes (those specific fixes are reverted by this patch, in favor of the more general solution). Recommitted in r224941 and reverted in r224970 after it caused a crash when building compiler-rt. Looks to be due to this change zeroing out the debug location when emitting default arguments (which were meant to inherit their outer expression's location) thus creating call instructions without locations - these create problems for inlining and must not be created. That is fixed and tested in this version of the change. Original commit message: This is a more scalable (fixed in mostly one place, rather than many places that will need constant improvement/maintenance) solution to several commits I've made recently to increase source fidelity for subexpressions. 
This resetting had to be done at the DebugLoc level (not the SourceLocation level) to preserve scoping information (if the resetting was done with CGDebugInfo::EmitLocation, it would've caused the tail end of an expression's codegen to end up in a potentially different scope than the start, even though it was at the same source location). The drawback to this is that it might leave CGDebugInfo out of sync. Ideally CGDebugInfo shouldn't have a duplicate sense of the current SourceLocation, but for now it seems it does... - I don't think I'm going to tackle removing that just now. I expect this'll probably cause some more buildbot fallout & I'll investigate that as it comes up. Also these sort of improvements might be starting to show a weakness/bug in LLVM's line table handling: we don't correctly emit is_stmt for statements, we just put it on every line table entry. This means one statement split over multiple lines appears as multiple 'statements' and two statements on one line (without column info) are treated as one statement. I don't think we have any IR representation of statements that would help us distinguish these cases and identify the beginning of each statement - so that might be something we need to add (possibly to the lexical scope chain - a scope for each statement). This does cause some problems for GDB and possibly other DWARF consumers. llvm-svn: 225956	2015-01-14 07:38:27 +00:00
David Blaikie	f353d3ecd0	Revert "DebugInfo: Generalize debug info location handling" and related commits This reverts commit r225000, r225021, r225083, r225086, r225090. The root change (r225000) still has several issues where it's caused calls to be emitted without debug locations. This causes assertion failures if/when those calls are inlined. I'll work up some test cases and fixes before recommitting this. llvm-svn: 225555	2015-01-09 23:00:28 +00:00
David Blaikie	84fe79cfc3	Reapply "DebugInfo: Generalize debug info location handling" Originally committed in r224385 and reverted in r224441 due to concerns this change might've introduced a crash. Turns out this change fixes the crash introduced by one of my earlier more specific location handling changes (those specific fixes are reverted by this patch, in favor of the more general solution). Recommitted in r224941 and reverted in r224970 after it caused a crash when building compiler-rt. Looks to be due to this change zeroing out the debug location when emitting default arguments (which were meant to inherit their outer expression's location) thus creating call instructions without locations - these create problems for inlining and must not be created. That is fixed and tested in this version of the change. Original commit message: This is a more scalable (fixed in mostly one place, rather than many places that will need constant improvement/maintenance) solution to several commits I've made recently to increase source fidelity for subexpressions. This resetting had to be done at the DebugLoc level (not the SourceLocation level) to preserve scoping information (if the resetting was done with CGDebugInfo::EmitLocation, it would've caused the tail end of an expression's codegen to end up in a potentially different scope than the start, even though it was at the same source location). The drawback to this is that it might leave CGDebugInfo out of sync. Ideally CGDebugInfo shouldn't have a duplicate sense of the current SourceLocation, but for now it seems it does... - I don't think I'm going to tackle removing that just now. I expect this'll probably cause some more buildbot fallout & I'll investigate that as it comes up. Also these sort of improvements might be starting to show a weakness/bug in LLVM's line table handling: we don't correctly emit is_stmt for statements, we just put it on every line table entry. 
This means one statement split over multiple lines appears as multiple 'statements' and two statements on one line (without column info) are treated as one statement. I don't think we have any IR representation of statements that would help us distinguish these cases and identify the beginning of each statement - so that might be something we need to add (possibly to the lexical scope chain - a scope for each statement). This does cause some problems for GDB and possibly other DWARF consumers. llvm-svn: 225000	2014-12-30 19:39:33 +00:00
David Blaikie	608a24501c	Revert "DebugInfo: Generalize debug info location handling" Asserting when building compiler-rt when using a GCC host compiler. Reverting while I investigate. This reverts commit r224941. llvm-svn: 224970	2014-12-29 23:49:00 +00:00
David Blaikie	3945d1bd99	Reapply "DebugInfo: Generalize debug info location handling" Originally committed in r224385 and reverted in r224441 due to concerns this change might've introduced a crash. Turns out this change fixes the crash introduced by one of my earlier more specific location handling changes (those specific fixes are reverted by this patch, in favor of the more general solution). Original commit message: This is a more scalable (fixed in mostly one place, rather than many places that will need constant improvement/maintenance) solution to several commits I've made recently to increase source fidelity for subexpressions. This resetting had to be done at the DebugLoc level (not the SourceLocation level) to preserve scoping information (if the resetting was done with CGDebugInfo::EmitLocation, it would've caused the tail end of an expression's codegen to end up in a potentially different scope than the start, even though it was at the same source location). The drawback to this is that it might leave CGDebugInfo out of sync. Ideally CGDebugInfo shouldn't have a duplicate sense of the current SourceLocation, but for now it seems it does... - I don't think I'm going to tackle removing that just now. I expect this'll probably cause some more buildbot fallout & I'll investigate that as it comes up. Also these sort of improvements might be starting to show a weakness/bug in LLVM's line table handling: we don't correctly emit is_stmt for statements, we just put it on every line table entry. This means one statement split over multiple lines appears as multiple 'statements' and two statements on one line (without column info) are treated as one statement. I don't think we have any IR representation of statements that would help us distinguish these cases and identify the beginning of each statement - so that might be something we need to add (possibly to the lexical scope chain - a scope for each statement). 
This does cause some problems for GDB and possibly other DWARF consumers. llvm-svn: 224941	2014-12-29 18:18:45 +00:00
David Blaikie	06b2c54db9	Revert "DebugInfo: Generalize debug info location handling" Fails an ASan bootstrap - I'll try to reproduce locally & sort that out before recommitting. This reverts commit r224385. llvm-svn: 224441	2014-12-17 18:02:04 +00:00
David Blaikie	bf22a4eaee	DebugInfo: Generalize debug info location handling This is a more scalable (fixed in mostly one place, rather than many places that will need constant improvement/maintenance) solution to several commits I've made recently to increase source fidelity for subexpressions. This resetting had to be done at the DebugLoc level (not the SourceLocation level) to preserve scoping information (if the resetting was done with CGDebugInfo::EmitLocation, it would've caused the tail end of an expression's codegen to end up in a potentially different scope than the start, even though it was at the same source location). The drawback to this is that it might leave CGDebugInfo out of sync. Ideally CGDebugInfo shouldn't have a duplicate sense of the current SourceLocation, but for now it seems it does... - I don't think I'm going to tackle removing that just now. I expect this'll probably cause some more buildbot fallout & I'll investigate that as it comes up. Also these sort of improvements might be starting to show a weakness/bug in LLVM's line table handling: we don't correctly emit is_stmt for statements, we just put it on every line table entry. This means one statement split over multiple lines appears as multiple 'statements' and two statements on one line (without column info) are treated as one statement. I don't think we have any IR representation of statements that would help us distinguish these cases and identify the beginning of each statement - so that might be something we need to add (possibly to the lexical scope chain - a scope for each statement). This does cause some problems for GDB and possibly other DWARF consumers. llvm-svn: 224385	2014-12-16 22:49:17 +00:00
Alexey Bataev	f841bd9fcd	[OPENMP] Bugfix for processing of global variables in OpenMP regions. Currently, if global variable is marked as a private OpenMP variable, the compiler crashes in debug version or generates incorrect code in release version. It happens because in the OpenMP region the original global variable is used instead of the generated private copy. It happens because currently globals variables are not captured in the OpenMP region. This patch adds capturing of global variables iff private copy of the global variable must be used in the OpenMP region. Differential Revision: http://reviews.llvm.org/D6259 llvm-svn: 224323	2014-12-16 07:00:22 +00:00
Alexander Musman	c638868bdf	First patch with codegen of the 'omp for' directive. It implements the simplest case, which is used when no chunk_size is specified in the schedule(static) or no 'schedule' clause is specified - the iteration space is divided by the library into chunks that are approximately equal in size, and at most one chunk is distributed to each thread. In this case, we do not need an outer loop in each thread - each thread requests once which iterations range it should handle (using __kmpc_for_static_init runtime call) and then runs the inner loop on this range. Differential Revision: http://reviews.llvm.org/D5865 llvm-svn: 224233	2014-12-15 07:07:06 +00:00
Alexey Bataev	8f7c1b0b9d	[OPENMP] Codegen for 'omp barrier' directive. Adds generation of call to "i32 kmpc_cancel_barrier(ident_t *, i32)" libcall for explicitly specified barriers (OMP_IDENT_BARRIER_EXPL flag is added to "flags" field of "ident_t" structure). Also this patch replaces all calls to "kmpc_barrier" function by calls of "__kmpc_cancel_barrier" function which provides additional functionality for OpenMP 4.0. Also, library specific enum OpenMPLocationFlags moved to private section of CGOpenMPRuntime class to make it more independent from library implementation. Differential Revision: http://reviews.llvm.org/D6447 llvm-svn: 223444	2014-12-05 04:09:23 +00:00
Alexey Bataev	8d690650e8	[OPENMP] Codegen for 'omp master' directive Patch adds 2 library functions to OpenMPRuntime class - int32 kmpc_master(ident_t *, int32 gtid) and void kmpc_end_master(ident_t *, int32 gtid); For 'omp master' directive the next code is generated: if (__kmpc_master(loc, gtid)) { <Associated structured block>; __kmpc_end_master(loc, gtid); } Differential Revision: http://reviews.llvm.org/D6473 llvm-svn: 223342	2014-12-04 07:23:53 +00:00
Alexey Bataev	75ddfabed7	[OPENMP] Formatting and code improvement for codegen of 'omp critical' directive. No functional changes, only code improvements. llvm-svn: 223010	2014-12-01 11:32:38 +00:00
Alexey Bataev	cc37cc1db2	[OPENMP] Codegen for "omp flush" directive. For each "omp flush" directive a call to "void kmpc_flush(ident_t *, ...)" function is generated. Directive "omp flush" may have an associated list of variables to flush, but currently runtime function ignores them. So the patch generates just "call kmpc_flush(ident_t *<loc>, i32 0)". Differential Revision: http://reviews.llvm.org/D6292 llvm-svn: 222409	2014-11-20 04:34:54 +00:00
Alexey Bataev	03b340a3a5	[OPENMP] Codegen for 'private' clause in 'parallel' directive. This patch generates some helper variables which are used as private copies of the corresponding original variables inside an OpenMP 'parallel' directive. These generated variables are initialized by default (with the default constructor, if any). In outlined function references to original variables are replaced by the references to these private helper variables. At the end of the initialization of the private variables an implicit barrier is set by calling __kmpc_barrier(...) runtime function to be sure that all threads were initialized using original values of the variables. Differential Revision: http://reviews.llvm.org/D4752 llvm-svn: 220262	2014-10-21 03:16:40 +00:00
Alexey Bataev	b205978100	[OPENMP] Codegen for 'num_threads' clause in 'parallel' directive. This patch generates call to "kmpc_push_num_threads(ident_t *loc, kmp_int32 global_tid, kmp_int32 num_threads);" library function before calling "kmpc_fork_call" each time there is an associated "num_threads" clause in the "omp parallel" directive. Differential Revision: http://reviews.llvm.org/D5145 llvm-svn: 219599	2014-10-13 08:23:51 +00:00
Alexey Bataev	d74d060d6d	[OPENMP] Codegen for 'if' clause in 'parallel' directive. Adds codegen for 'if' clause. Currently only for 'if' clause used with the 'parallel' directive. If condition evaluates to true, the code executes parallel version of the code by calling __kmpc_fork_call(loc, 1, microtask, captured_struct/context/), where loc - debug location, 1 - number of additional parameters after "microtask" argument, microtask - is outlined function for the code associated with the 'parallel' directive, captured_struct - list of variables captured in this outlined function. If condition evaluates to false, the code executes serial version of the code by executing the following code: global_thread_id.addr = alloca i32 store i32 global_thread_id, global_thread_id.addr zero.addr = alloca i32 store i32 0, zero.addr kmpc_serialized_parallel(loc, global_thread_id); microtask(global_thread_id.addr, zero.addr, captured_struct/context/); kmpc_end_serialized_parallel(loc, global_thread_id); Where loc - debug location, global_thread_id - global thread id, returned by __kmpc_global_thread_num() call or passed as a first parameter in microtask() call, global_thread_id.addr - address of the variable, where stored global_thread_id value, zero.addr - implicit bound thread id (should be set to 0 for serial call), microtask() and captured_struct are the same as in parallel call. Also this patch checks if the condition is constant and if it is constant it evaluates its value and then generates either parallel version of the code (if the condition evaluates to true), or the serial version of the code (if the condition evaluates to false). Differential Revision: http://reviews.llvm.org/D4716 llvm-svn: 219597	2014-10-13 06:02:40 +00:00
Alexey Bataev	1809571c76	Code reformatting and improvement for OpenMP. Moved CGOpenMPRegionInfo from CGOpenMPRuntime.h to CGOpenMPRuntime.cpp file and reworked the code for this change. Also added processing of ThreadID variable passed as an argument in outlined functions in parallel and task directives. llvm-svn: 219490	2014-10-10 12:19:54 +00:00
Alexey Bataev	435ad7ba5e	Code improvements in OpenMP CodeGen. This patch makes class OMPPrivateScope a common class for all private variables. Reworked processing of firstprivate variables (now it is based on OMPPrivateScope too). llvm-svn: 219486	2014-10-10 09:48:26 +00:00
Alexey Bataev	13314bf526	[OPENMP] 'omp teams' directive basic support. Includes parsing and semantic analysis for 'omp teams' directive support from OpenMP 4.0. Adds additional analysis to 'omp target' directive with 'omp teams' directive. llvm-svn: 219385	2014-10-09 04:18:56 +00:00
Alexey Bataev	4a5bb772c3	[OPENMP] Codegen for 'firstprivate' clause. This patch generates some helper variables that are used as private copies of the corresponding original variables inside an OpenMP 'parallel' directive. These generated variables are initialized by copy using values of the original variables (with the copy constructor, if any). For arrays, initializer is generated for single element and in the codegen procedure this initial value is automatically propagated between all elements of the private copy. In outlined function, references to original variables are replaced by the references to these private helper variables. At the end of the initialization of the private variables an implicit barrier is generated by calling __kmpc_barrier(...) runtime function to be sure that all threads were initialized using original values of the variables. Differential Revision: http://reviews.llvm.org/D5140 llvm-svn: 219306	2014-10-08 14:01:46 +00:00
Alexey Bataev	8068b643c4	Revert commit r219297. Still troubles with OpenMP/parallel_firstprivate_codegen.cpp (now in ARM buildbots). llvm-svn: 219298	2014-10-08 12:00:22 +00:00
Alexey Bataev	3854f63aaf	[OPENMP] Codegen for 'firstprivate' clause. This patch generates some helper variables that are used as private copies of the corresponding original variables inside an OpenMP 'parallel' directive. These generated variables are initialized by copy using values of the original variables (with the copy constructor, if any). For arrays, initializer is generated for single element and in the codegen procedure this initial value is automatically propagated between all elements of the private copy. In outlined function, references to original variables are replaced by the references to these private helper variables. At the end of the initialization of the private variables an implicit barrier is generated by calling __kmpc_barrier(...) runtime function to be sure that all threads were initialized using original values of the variables. Differential Revision: http://reviews.llvm.org/D5140 llvm-svn: 219297	2014-10-08 11:35:04 +00:00
Alexey Bataev	bdef50e1ad	Revert back r219295. To fix issues with test OpenMP/parallel_firstprivate_codegen.cpp llvm-svn: 219296	2014-10-08 11:12:35 +00:00
Alexey Bataev	e7a5517a58	[OPENMP] Codegen for 'firstprivate' clause. This patch generates some helper variables that are used as private copies of the corresponding original variables inside an OpenMP 'parallel' directive. These generated variables are initialized by copy using values of the original variables (with the copy constructor, if any). For arrays, initializer is generated for single element and in the codegen procedure this initial value is automatically propagated between all elements of the private copy. In outlined function, references to original variables are replaced by the references to these private helper variables. At the end of the initialization of the private variables an implicit barrier is generated by calling __kmpc_barrier(...) runtime function to be sure that all threads were initialized using original values of the variables. Differential Revision: http://reviews.llvm.org/D5140 llvm-svn: 219295	2014-10-08 10:42:55 +00:00
Renato Golin	9804fa5d48	Revert "[OPENMP] 'omp teams' directive basic support. Includes parsing and semantic analysis for 'omp teams' directive support from OpenMP 4.0. Adds additional analysis to 'omp target' directive with 'omp teams' directive." This reverts commit r219197 because it broke ARM self-hosting buildbots with segmentation fault errors in many tests. llvm-svn: 219289	2014-10-08 09:06:45 +00:00
Alexey Bataev	941bbec6f4	[OPENMP] 'omp teams' directive basic support. Includes parsing and semantic analysis for 'omp teams' directive support from OpenMP 4.0. Adds additional analysis to 'omp target' directive with 'omp teams' directive. llvm-svn: 219197	2014-10-07 10:13:33 +00:00
Alexander Musman	d196ef2124	[OPENMP] Small refactoring of EmitOMPSimdLoop helper routine. No functional changes intended. Renamed EmitOMPSimdLoop to EmitOMPInnerLoop, I plan to re-use it to emit inner loop in the future patches for CodeGen of the worksharing loop directives (omp for, omp for simd). llvm-svn: 219195	2014-10-07 08:57:09 +00:00
Alexander Musman	a5f070aec0	[OPENMP] Loop collapsing and codegen for 'omp simd' directive. This patch implements collapsing of the loops (in particular, in presence of clause 'collapse'). It calculates number of iterations N and expressions necessary to calculate the nested loops counters values based on new iteration variable (that goes from 0 to N-1) in Sema. It also adds Codegen for 'omp simd', which uses (and tests) this feature. Differential Revision: http://reviews.llvm.org/D5184 llvm-svn: 218743	2014-10-01 06:03:56 +00:00
Alexander Musman	09184fedc0	[OPENMP] Codegen of the ‘aligned’ clause for the ‘omp simd’ directive. Differential Revision: http://reviews.llvm.org/D5499 llvm-svn: 218660	2014-09-30 05:29:28 +00:00
Alexander Musman	e4e893bb36	[OPENMP] Parsing/Sema of directive omp parallel for simd llvm-svn: 218299	2014-09-23 09:33:00 +00:00
Alexey Bataev	3a3bf0bbe3	[OPENMP] Codegen for 'omp critical' directive. This patch adds codegen for constructs: #pragma omp critical [name] <body> It generates global variable ".gomp_critical_user_[name].var" of type int32[8]. Then it generates library call "kmpc_critical(loc, gtid, .gomp_critical_user_[name].var)", code for <body> statement and final call "kmpc_end_critical(loc, gtid, .gomp_critical_user_[name].var)". Differential Revision: http://reviews.llvm.org/D5202 llvm-svn: 218239	2014-09-22 10:01:53 +00:00
Alexey Bataev	0bd520b767	[OPENMP] Initial parsing/sema analysis of 'target' directive. llvm-svn: 218110	2014-09-19 08:19:49 +00:00
Alexander Musman	f82886e502	Parsing/Sema of directive omp for simd llvm-svn: 218029	2014-09-18 05:12:34 +00:00
Alexey Bataev	0162e459ef	[OPENMP] Initial parsing and sema analysis for 'atomic' directive. llvm-svn: 213639	2014-07-22 10:10:35 +00:00
Alexey Bataev	9fb6e647e7	[OPENMP] Initial parsing and sema analysis for 'ordered' directive. llvm-svn: 213616	2014-07-22 06:45:04 +00:00
Alexey Bataev	6125da9258	[OPENMP] Initial parsing and sema analysis for 'flush' directive. llvm-svn: 213512	2014-07-21 11:26:11 +00:00
Alexander Musman	d9ed09f7a5	[OPENMP] Parsing/Sema of the OpenMP directive 'critical'. llvm-svn: 213510	2014-07-21 09:42:05 +00:00
Alexey Bataev	2df347ad96	[OPENMP] Initial parsing and sema analysis for 'taskwait' directive. llvm-svn: 213363	2014-07-18 10:17:07 +00:00
Alexey Bataev	4d1dfeabc9	[OPENMP] Initial parsing and sema analysis for 'barrier' directive. llvm-svn: 213360	2014-07-18 09:11:51 +00:00
Alexey Bataev	68446b7253	[OPENMP] Initial parsing and sema analysis of 'taskyield' directive. llvm-svn: 213355	2014-07-18 07:47:19 +00:00
Alexander Musman	80c2289a03	[OPENMP] Parsing/Sema analysis of directive 'master' llvm-svn: 213237	2014-07-17 08:54:58 +00:00
Alexey Bataev	9c2e8ee72f	[OPENMP] Parsing and sema analysis for 'omp task' directive. llvm-svn: 212804	2014-07-11 11:25:16 +00:00
Alexey Bataev	84d0b3efee	[OPENMP] Parsing and sema analysis for 'omp parallel sections' directive. llvm-svn: 212516	2014-07-08 08:12:03 +00:00
Alexey Bataev	4acb859fbd	[OPENMP] Added initial support for 'omp parallel for'. llvm-svn: 212453	2014-07-07 13:01:15 +00:00
Alexey Bataev	aca7fcf276	Using of variable length arrays in captured statements and OpenMP constructs. Differential Revision: http://reviews.llvm.org/D4067 llvm-svn: 212010	2014-06-30 02:55:54 +00:00
Alexey Bataev	d1e40fbfe1	[OPENMP] Initial parsing and sema analysis for 'single' directive. llvm-svn: 211774	2014-06-26 12:05:45 +00:00
Alexey Bataev	1e0498a92d	[OPENMP] Initial parsing and sema analysis for 'section' directive. llvm-svn: 211767	2014-06-26 08:21:58 +00:00
Alexey Bataev	d3f8dd2d15	[OPENMP] Initial support for 'sections' directive. llvm-svn: 211685	2014-06-25 11:44:49 +00:00
Alexey Bataev	23b6942634	[OPENMP] Reformatting and code improvement. llvm-svn: 211147	2014-06-18 07:08:49 +00:00
Alexey Bataev	f29276edb7	[OPENMP] Initial support for '#pragma omp for' (fixed incompatibility with MSVC). llvm-svn: 211140	2014-06-18 04:14:57 +00:00
Rafael Espindola	a566efbec9	Revert "[OPENMP] Initial support for '#pragma omp for'." This reverts commit r211096. Looks like it broke the msvc build: SemaOpenMP.cpp(140) : error C4519: default template arguments are only allowed on a class template llvm-svn: 211113	2014-06-17 17:20:53 +00:00
Alexey Bataev	c77dd5257a	[OPENMP] Initial support for '#pragma omp for'. llvm-svn: 211096	2014-06-17 11:49:22 +00:00
Alexander Musman	515ad8c490	This patch adds a helper class (CGLoopInfo) for marking memory instructions with llvm.mem.parallel_loop_access metadata. It also adds a simple initial version of codegen for pragma omp simd (it will change in the future to support all the clauses). Differential revision: http://reviews.llvm.org/D3644 llvm-svn: 209411	2014-05-22 08:54:05 +00:00
Alexey Bataev	9959db5fa9	[OPENMP] Initial codegen for '#pragma omp parallel' llvm-svn: 208077	2014-05-06 10:08:46 +00:00

525 Commits