llvm-project

Commit Graph

Author	SHA1	Message	Date
Daniel Neilson	6e938effaa	Change memcpy/memove/memset to have dest and source alignment attributes (Step 1). Summary: Upstream LLVM is changing the the prototypes of the @llvm.memcpy/memmove/memset intrinsics. This change updates the Clang tests for this change. The @llvm.memcpy/memmove/memset intrinsics currently have an explicit argument which is required to be a constant integer. It represents the alignment of the dest (and source), and so must be the minimum of the actual alignment of the two. This change removes the alignment argument in favour of placing the alignment attribute on the source and destination pointers of the memory intrinsic call. For example, code which used to read: call void @llvm.memcpy.p0i8.p0i8.i32(i8* %dest, i8* %src, i32 100, i32 4, i1 false) will now read call void @llvm.memcpy.p0i8.p0i8.i32(i8* align 4 %dest, i8* align 4 %src, i32 100, i1 false) At this time the source and destination alignments must be the same (Step 1). Step 2 of the change, to be landed shortly, will relax that contraint and allow the source and destination to have different alignments. llvm-svn: 322964	2018-01-19 17:12:54 +00:00
Alexey Bataev	a8a9153a37	[OPENMP] Support for -fopenmp-simd option with compilation of simd loops only. Added support for -fopenmp-simd option that allows compilation of simd-based constructs without emission of OpenMP runtime calls. llvm-svn: 321560	2017-12-29 18:07:07 +00:00
Jonas Hahnfeld	273d261b8f	Fix PR35542: Correct adjusting of private reduction variable The adjustment is calculated with CreatePtrDiff() which returns the difference in (base) elements. This is passed to CreateGEP() so make sure that the GEP base has the correct pointer type: It needs to be a pointer to the base type, not a pointer to a constant sized array. Differential Revision: https://reviews.llvm.org/D40911 llvm-svn: 319931	2017-12-06 19:15:28 +00:00
Alexey Bataev	c2e88a8a6b	[OPENMP] Fix PR35486: crash when collapsing loops with dependent iteration spaces. Though it is incorrect from point of view of OpenMP standard to have dependent iteration space in OpenMP loops, compiler should not crash. Patch fixes this problem. llvm-svn: 319700	2017-12-04 21:30:42 +00:00
Jonas Hahnfeld	96087f3825	[OpenMP] Extend "Avoid VLAs for reduction" optimization to VLAs as base We can generate constant sized arrays whenever the array section has constant length, even if the base expression itself is a VLA. Differential Revision: https://reviews.llvm.org/D39504 llvm-svn: 317207	2017-11-02 13:30:42 +00:00
Alexey Bataev	1b48c5e56b	[OPENMP] Fix PR35013: Fix passing VLAs captures to outlined functions. Fixed passing of VLAs and variably-modified types to outlined functions. Synchronized passing with the types codegen. llvm-svn: 316488	2017-10-24 19:52:31 +00:00
Jonas Hahnfeld	4525c82428	[OpenMP] Avoid VLAs for some reductions on array sections In some cases the compiler can deduce the length of an array section as constants. With this information, VLAs can be avoided in place of a constant sized array or even a scalar value if the length is 1. Example: int a[4], b[2]; pragma omp parallel reduction(+: a[1:2], b[1:1]) { } For chained array sections, this optimization is restricted to cases where all array sections except the last have a constant length 1. This trivially guarantees that there are no holes in the memory region that needs to be privatized. Example: int c[3][4]; pragma omp parallel reduction(+: c[1:1][1:2]) { } This relands commit r316229 that I reverted in r316235 because it failed on some bots. During investigation I found that this was because Clang and GCC evaluate the two arguments to emplace_back() in ReductionCodeGen::emitSharedLValue() in a different order, hence leading to a different order of generated instructions in the final LLVM IR. Fix this by passing in the arguments from temporary variables that are evaluated in a defined order. Differential Revision: https://reviews.llvm.org/D39136 llvm-svn: 316362	2017-10-23 19:01:35 +00:00
Jonas Hahnfeld	c95a6985bd	Revert "[OpenMP] Avoid VLAs for some reductions on array sections" This breaks at least two buildbots: http://lab.llvm.org:8011/builders/clang-cmake-x86_64-avx2-linux/builds/1175 http://lab.llvm.org:8011/builders/clang-atom-d525-fedora-rel/builds/10478 This reverts commit r316229 during local investigation. llvm-svn: 316235	2017-10-20 20:16:17 +00:00
Jonas Hahnfeld	b6229be460	[OpenMP] Avoid VLAs for some reductions on array sections In some cases the compiler can deduce the length of an array section as constants. With this information, VLAs can be avoided in place of a constant sized array or even a scalar value if the length is 1. Example: int a[4], b[2]; pragma omp parallel reduction(+: a[1:2], b[1:1]) { } For chained array sections, this optimization is restricted to cases where all array sections except the last have a constant length 1. This trivially guarantees that there are no holes in the memory region that needs to be privatized. Example: int c[3][4]; pragma omp parallel reduction(+: c[1:1][1:2]) { } Differential Revision: https://reviews.llvm.org/D39136 llvm-svn: 316229	2017-10-20 19:40:40 +00:00
Alexey Bataev	be5a8b42cd	[OPENMP] Codegen for reduction clauses in 'taskloop' directives. Adds codegen for taskloop-based directives. llvm-svn: 308174	2017-07-17 13:30:36 +00:00
Alexey Bataev	c9dcaaae69	[OPENMP] Fix reduction tests, NFC. llvm-svn: 307916	2017-07-13 15:15:25 +00:00
Alexey Bataev	faede1d0d8	[OPENMP] Fix reduction tests, NFC. llvm-svn: 307915	2017-07-13 15:09:05 +00:00
Alexey Bataev	280f308b71	[OPENMP] Further reduction test fix, NFC. llvm-svn: 307914	2017-07-13 15:02:27 +00:00
Alexey Bataev	80621121e2	[OPENMP] Fix reduction tests, NFC. llvm-svn: 307912	2017-07-13 14:54:42 +00:00
Alexey Bataev	3ef6fd5637	[OPENMP] Fix reduction tests, NFC. llvm-svn: 307911	2017-07-13 14:29:19 +00:00
Alexey Bataev	5c40bec5eb	[OPENMP] Generalization of codegen for reduction clauses. Reworked codegen for reduction clauses for future support of reductions in task-based directives. llvm-svn: 307910	2017-07-13 13:36:14 +00:00
Reid Kleckner	7a2037151c	Remove unnecessary x86 backend requirements from OpenMP tests Clang can generate LLVM IR for x86 without a registered x86 backend. llvm-svn: 284836	2016-10-21 16:09:20 +00:00
Alexey Bataev	2f5ed34279	Fix for PR30639: CGDebugInfo Null dereference with OpenMP array access, by Erich Keane OpenMP creates a variable array type with a a null size-expr. The Debug generation failed to due to this. This patch corrects the openmp implementation, updates the tests, and adds a new one for this condition. Differential Revision: https://reviews.llvm.org/D25373 llvm-svn: 284110	2016-10-13 09:52:46 +00:00
Alexey Bataev	4ba78a46ff	[OPENMP] Fix for codegen of captured variables in inlined directives. Currently there is a problem with codegen of inlined directives inside lambdas, it may cause a crash during codegen because of incorrect capturing of variables. Patch fixes this problem. llvm-svn: 267677	2016-04-27 07:56:03 +00:00
Alexey Bataev	31300ed0a5	[OPENMP 4.0] Fixed support of array sections/array subscripts. Codegen for array sections/array subscripts worked only for expressions with arrays as base. Patch fixes codegen for bases with pointer/reference types. llvm-svn: 259776	2016-02-04 11:27:03 +00:00
Alexey Bataev	1189bd0205	[OPENMP 4.5] Allow arrays in 'reduction' clause. OpenMP 4.5, alogn with array sections, allows to use variables of array type in reductions. llvm-svn: 258804	2016-01-26 12:20:39 +00:00
Alexey Bataev	6f531ec0a2	[OPENMP] Remove explicit call for implicit barrier #pragma omp parallel needs an implicit barrier that is currently done by an explicit call to __kmpc_barrier. However, the runtime already ensures a barrier in __kmpc_fork_call which currently leads to two barriers per region per thread. Differential Revision: http://reviews.llvm.org/D15561 llvm-svn: 255992	2015-12-18 10:24:53 +00:00
Pete Cooper	3b39e88ae0	Revert "Change memcpy/memset/memmove to have dest and source alignments." This reverts commit r253512. This likely broke the bots in: http://lab.llvm.org:8011/builders/clang-ppc64-elf-linux2/builds/20202 http://bb.pgr.jp/builders/clang-3stage-i686-linux/builds/3787 llvm-svn: 253542	2015-11-19 05:55:59 +00:00
Pete Cooper	7bfd5cb7be	Change memcpy/memset/memmove to have dest and source alignments. This is a follow on from a similar LLVM commit: r253511. Note, this was reviewed (and more details are in) http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151109/312083.html These intrinsics currently have an explicit alignment argument which is required to be a constant integer. It represents the alignment of the source and dest, and so must be the minimum of those. This change allows source and dest to each have their own alignments by using the alignment attribute on their arguments. The alignment argument itself is removed. The only code change to clang is hidden in CGBuilder.h which now passes both dest and source alignment to IRBuilder, instead of taking the minimum of dest and source alignments. Reviewed by Hal Finkel. llvm-svn: 253512	2015-11-18 22:18:45 +00:00
Alexey Bataev	f24e7b1f60	[OPENMP 4.1] Codegen for array sections/subscripts in 'reduction' clause. OpenMP 4.1 adds support for array sections/subscripts in 'reduction' clause. Patch adds codegen for this feature. llvm-svn: 249672	2015-10-08 09:10:53 +00:00
Alexey Bataev	25e5b44654	[OPENMP] Emit __kmpc_cancel_barrier() and code for 'cancellation point' only if 'cancel' is found. Patch improves codegen for OpenMP constructs. If the OpenMP region does not have internal 'cancel' construct, a call to 'void __kmpc_barrier()' runtime function is generated for all implicit/explicit barriers. If the region has inner 'cancel' directive, then ``` if (__kmpc_cancel_barrier()) exit from outer construct; ``` code is generated. Also, the code for 'canellation point' directive is not generated if parent directive does not have 'cancel' directive. llvm-svn: 247681	2015-09-15 12:52:43 +00:00
Alexey Bataev	2377fe95c6	[OPENMP] Outlined function for parallel and other regions with list of captured variables. Currently all variables used in OpenMP regions are captured into a record and passed to outlined functions in this record. It may result in some poor performance because of too complex analysis later in optimization passes. Patch makes to emit outlined functions for parallel-based regions with a list of captured variables. It reduces code for 2*n GEPs, stores and loads at least. Codegen for task-based regions remains unchanged because runtime requires that all captured variables are passed in captured record. llvm-svn: 247251	2015-09-10 08:12:02 +00:00
John McCall	7f416cc426	Compute and preserve alignment more faithfully in IR-generation. Introduce an Address type to bundle a pointer value with an alignment. Introduce APIs on CGBuilderTy to work with Address values. Change core APIs on CGF/CGM to traffic in Address where appropriate. Require alignments to be non-zero. Update a ton of code to compute and propagate alignment information. As part of this, I've promoted CGBuiltin's EmitPointerWithAlignment helper function to CGF and made use of it in a number of places in the expression emitter. The end result is that we should now be significantly more correct when performing operations on objects that are locally known to be under-aligned. Since alignment is not reliably tracked in the type system, there are inherent limits to this, but at least we are no longer confused by standard operations like derived-to-base conversions and array-to-pointer decay. I've also fixed a large number of bugs where we were applying the complete-object alignment to a pointer instead of the non-virtual alignment, although most of these were hidden by the very conservative approach we took with member alignment. Also, because IRGen now reliably asserts on zero alignments, we should no longer be subject to an absurd but frustrating recurring bug where an incomplete type would report a zero alignment and then we'd naively do a alignmentAtOffset on it and emit code using an alignment equal to the largest power-of-two factor of the offset. We should also now be emitting much more aggressive alignment attributes in the presence of over-alignment. In particular, field access now uses alignmentAtOffset instead of min. Several times in this patch, I had to change the existing code-generation pattern in order to more effectively use the Address APIs. For the most part, this seems to be a strict improvement, like doing pointer arithmetic with GEPs instead of ptrtoint. That said, I've tried very hard to not change semantics, but it is likely that I've failed in a few places, for which I apologize. ABIArgInfo now always carries the assumed alignment of indirect and indirect byval arguments. In order to cut down on what was already a dauntingly large patch, I changed the code to never set align attributes in the IR on non-byval indirect arguments. That is, we still generate code which assumes that indirect arguments have the given alignment, but we don't express this information to the backend except where it's semantically required (i.e. on byvals). This is likely a minor regression for those targets that did provide this information, but it'll be trivial to add it back in a later patch. I partially punted on applying this work to CGBuiltin. Please do not add more uses of the CreateDefaultAligned{Load,Store} APIs; they will be going away eventually. llvm-svn: 246985	2015-09-08 08:05:57 +00:00
Alexey Bataev	caacd53dde	[OPENMP] Fix for http://llvm.org/PR24674 : assertion failed and and abort trap Fix processing of shared variables with reference types in OpenMP constructs. Previously, if the variable was not marked in one of the private clauses, the reference to this variable was emitted incorrectly and caused an assertion later. llvm-svn: 246846	2015-09-04 11:26:21 +00:00
Alexey Bataev	b44fdfc7b2	[OPENMP] Drop type qualifiers from private variables. If the variable is marked as private in OpenMP construct, the reference to this variable should not keep type qualifiers for the original variable. Private copy is not volatile or constant, so we can use unqualified type for private copy. llvm-svn: 242133	2015-07-14 10:32:29 +00:00
Adrian Prantl	bc068586ac	Revert "Revert r241620 and follow-up commits" and move the initialization of the llvm targets from clang/CodeGen into ClangCheck.cpp and CIndex.cpp. llvm-svn: 241653	2015-07-08 01:00:30 +00:00
Adrian Prantl	142ec39739	Revert r241620 and follow-up commits while investigating linux buildbot failures. llvm-svn: 241642	2015-07-07 23:19:46 +00:00
Adrian Prantl	726855a8da	Update testcases that use precompiled headers to require a target after r241620. llvm-svn: 241623	2015-07-07 20:31:18 +00:00
Alexey Bataev	db39021cee	[OPENMP] -fopenmp enables OpenMP support (fix for http://llvm.org/PR23492 ) -fopenmp turns on OpenMP support and links libiomp5 as OpenMP library. Also there is -fopenmp={libiomp5\|libgomp} option that allows to override effect of -fopenmp and link libgomp library (if -fopenmp=libgomp is specified). Differential Revision: http://reviews.llvm.org/D9736 llvm-svn: 237769	2015-05-20 04:24:19 +00:00
Alexey Bataev	f0ab553fea	[OPENMP] Fixed bug in atomic update/capture/write constructs. Fixed a bug with codegen for destination atomic l-value with padding and junk in this padding bytes. llvm-svn: 237422	2015-05-15 08:36:34 +00:00
Alexey Bataev	69a4779965	[OPENMP] Fixed codegen for 'reduction' clause. Fixed codegen for reduction operations min, max, && and \|\|. Codegen for them is quite similar and I was confused by this similarity. Also added a call to kmpc_end_reduce() in atomic part of reduction codegen (call to kmpc_end_reduce_nowait() is not required). Differential Revision: http://reviews.llvm.org/D9513 llvm-svn: 236689	2015-05-07 03:54:03 +00:00
Alexey Bataev	19fa2c3b66	[OPENMP] Fix crash on reductions codegen for short circuit reduction operations. llvm-svn: 236084	2015-04-29 05:21:03 +00:00
Alexey Bataev	7ebe5fddac	[OPENMP] Codegen for 'reduction' clause in 'for' directive. Emit a code for reduction clause. Next code should be emitted for reductions: static kmp_critical_name lock = { 0 }; void reduce_func(void lhs[<n>], void rhs[<n>]) { (Type0)lhs[0] = ReductionOperation0((Type0)lhs[0], (Type0)rhs[0]); ... (Type<n>-1)lhs[<n>-1] = ReductionOperation<n>-1((Type<n>-1)lhs[<n>-1], (Type<n>-1)rhs[<n>-1]); } ... void RedList[<n>] = {&<RHSExprs>[0], ..., &<RHSExprs>[<n>-1]}; switch (__kmpc_reduce{_nowait}(<loc>, <gtid>, <n>, sizeof(RedList), RedList, reduce_func, &<lock>)) { case 1: <LHSExprs>[0] = ReductionOperation0(<LHSExprs>[0], <RHSExprs>[0]); ... <LHSExprs>[<n>-1] = ReductionOperation<n>-1(<LHSExprs>[<n>-1], <RHSExprs>[<n>-1]); __kmpc_end_reduce{_nowait}(<loc>, <gtid>, &<lock>); break; case 2: Atomic(<LHSExprs>[0] = ReductionOperation0(<LHSExprs>[0], <RHSExprs>[0])); ... Atomic(<LHSExprs>[<n>-1] = ReductionOperation<n>-1(<LHSExprs>[<n>-1], *<RHSExprs>[<n>-1])); break; default:; } Reduction variables are a kind of a private variables, they have private copies, but initial values are chosen in accordance with the reduction operation. Differential Revision: http://reviews.llvm.org/D9139 llvm-svn: 235506	2015-04-22 13:43:03 +00:00

38 Commits