llvm-project

Commit Graph

Author	SHA1	Message	Date
Fady Ghanim	80e15b4574	[Clang][OpenMP][OMPBuilder] Moving OMP allocation and cache creation code to OMPBuilderCBHelpers Summary: Modified the OMPBuilderCBHelpers in the following ways: - Moved location of class definition and deleted all constructors - Moved OpenMP-specific address allocation of local variables - Moved threadprivate variable creation for the current thread Reviewers: jdoerfert Subscribers: yaxunl, guansong, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D79676	2020-06-28 19:04:20 -04:00
Xiangling Liao	22337bfe7d	[AIX][Frontend] Static init implementation for AIX considering no priority 1. Provides no piroirity supoort && disables three priority related attributes: init_priority, ctor attr, dtor attr; 2. '-qunique' in XL compiler equivalent behavior of emitting sinit and sterm functions name using getUniqueModuleId() util function in LLVM (currently no support for InternalLinkage and WeakODRLinkage symbols); 3. Add testcases to emit IR sample with __sinit80000000, __dtor, and __sterm80000000; 4. Temporarily side-steps the need to implement the functionality of llvm.global_ctors and llvm.global_dtors arrays. The uses of that functionality in this patch (with respect to the name of the functions involved) are not representative of how the functionality will be used once implemented. Differential Revision: https://reviews.llvm.org/D74166	2020-06-19 08:27:07 -04:00
Sander de Smalen	ad828e3f4d	[SveEmitter] Add builtins for struct loads/stores (ld2/ld3/etc) The struct store intrinsics in LLVM IR take the individual parts as arguments, so this patch uses the intrinsics used for `svget` to break the tuples into individual parts. Reviewers: c-rhodes, efriedma, ctetreau, david-arm Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D81466	2020-06-19 10:35:42 +01:00
Lucas Prates	ada4c9dc4a	[ARM][Clang] Removing lowering of half-precision FP arguments and returns from Clang's CodeGen Summary: On the process of moving the argument lowering handling for half-precision floating point arguments and returns to the backend, this patch removes the code that was responsible for handling the coercion of those arguments in Clang's Codegen. Reviewers: rjmccall, chill, ostannard, dnsampaio Reviewed By: ostannard Subscribers: stuij, kristof.beyls, dmgreen, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D81451	2020-06-18 13:17:07 +01:00
Sander de Smalen	1d7b4a7e5e	[SveEmitter] Add builtins for tuple creation (svcreate2/svcreate3/etc) The svcreate builtins allow constructing a tuple from individual vectors, e.g. svint32x2_t svcreate2(svint32_t v2, svint32_t v2)` Reviewers: c-rhodes, david-arm, efriedma Reviewed By: c-rhodes, efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D81463	2020-06-18 10:07:09 +01:00
Tyker	51e4aa87e0	attempt to fix failing buildbots after `3bab88b7ba` Prevent IR-gen from emitting consteval declarations Summary: with this patch instead of emitting calls to consteval function. the IR-gen will emit a store of the already computed result.	2020-06-15 12:58:37 +02:00
Kirill Bobyrev	550c4562d1	Revert "Prevent IR-gen from emitting consteval declarations" This reverts commit `3bab88b7ba`. This patch causes test failures: http://lab.llvm.org:8011/builders/clang-cmake-armv7-quick/builds/17260	2020-06-15 12:14:15 +02:00
Tyker	3bab88b7ba	Prevent IR-gen from emitting consteval declarations Summary: with this patch instead of emitting calls to consteval function. the IR-gen will emit a store of the already computed result. Reviewers: rsmith Reviewed By: rsmith Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76420	2020-06-15 10:47:14 +02:00
Akira Hatanaka	c9a52de002	[CodeGen] Simplify the way lifetime of block captures is extended Rather than pushing inactive cleanups for the block captures at the entry of a full expression and activating them during the creation of the block literal, just call pushLifetimeExtendedDestroy to ensure the cleanups are popped at the end of the scope enclosing the block expression. rdar://problem/63996471 Differential Revision: https://reviews.llvm.org/D81624	2020-06-11 16:06:22 -07:00
John McCall	7fac1acc61	Set the LLVM FP optimization flags conservatively. Functions can have local pragmas that override the global settings. We set the flags eagerly based on global settings, but if we emit an expression under the influence of a pragma, we clear the appropriate flags from the function. In order to avoid doing a ton of redundant work whenever we emit an FP expression, configure the IRBuilder to default to global settings, and only reconfigure it when we see an FP expression that's not using the global settings. Patch by Michele Scandale! https://reviews.llvm.org/D80462	2020-06-11 18:16:41 -04:00
Alexey Bataev	90b54fa045	[OPENMP50]Codegen for use_device_addr clauses. Summary: Added codegen for use_device_addr clause. The components of the list items are mapped as a kind of RETURN components and then the returned base address is used instead of the real address of the base declaration used in the use_device_addr expressions. Reviewers: jdoerfert Subscribers: yaxunl, guansong, sstefan1, cfe-commits, caomhin Tags: #clang Differential Revision: https://reviews.llvm.org/D80730	2020-06-11 09:54:51 -04:00
Saiyedul Islam	675cefbf60	[AMDGPU] Introduce Clang builtins to be mapped to AMDGCN atomic inc/dec intrinsics Summary: __builtin_amdgcn_atomic_inc32(int Ptr, int Val, unsigned MemoryOrdering, const char SyncScope) __builtin_amdgcn_atomic_inc64(int64_t Ptr, int64_t Val, unsigned MemoryOrdering, const char SyncScope) __builtin_amdgcn_atomic_dec32(int Ptr, int Val, unsigned MemoryOrdering, const char SyncScope) __builtin_amdgcn_atomic_dec64(int64_t Ptr, int64_t Val, unsigned MemoryOrdering, const char SyncScope) First and second arguments gets transparently passed to the amdgcn atomic inc/dec intrinsic. Fifth argument of the intrinsic is set as true if the first argument of the builtin is a volatile pointer. The third argument of this builtin is one of the memory-ordering specifiers ATOMIC_ACQUIRE, ATOMIC_RELEASE, ATOMIC_ACQ_REL, or ATOMIC_SEQ_CST following C++11 memory model semantics. This is mapped to corresponding LLVM atomic memory ordering for the atomic inc/dec instruction using CLANG atomic C ABI. The fourth argument is an AMDGPU-specific synchronization scope defined as string. Reviewers: arsenm, sameerds, JonChesterfield, jdoerfert Reviewed By: arsenm, sameerds Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, jfb, kerbowa, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D80804	2020-06-09 17:02:58 +00:00
Alexey Bataev	cb9191c042	[OPENMP]Improve code readability, NFC. Reuse existing function instead of code duplication and use better type.	2020-06-09 08:50:36 -04:00
Alexey Bataev	4e3d4622b1	Fix undefined behaviour when trying to deref nullptr.	2020-06-04 17:52:06 -04:00
Alexey Bataev	bd1c03d7b7	[OPENMP50]Codegen for inscan reductions in worksharing directives. Summary: Implemented codegen for reduction clauses with inscan modifiers in worksharing constructs. Emits the code for the directive with inscan reductions. The code is the following: ``` size num_iters = <num_iters>; <type> buffer[num_iters]; for (i: 0..<num_iters>) { <input phase>; buffer[i] = red; } for (int k = 0; k != ceil(log2(num_iters)); ++k) for (size cnt = last_iter; cnt >= pow(2, k); --k) buffer[i] op= buffer[i-pow(2,k)]; for (0..<num_iters>) { red = InclusiveScan ? buffer[i] : buffer[i-1]; <scan phase>; } ``` Reviewers: jdoerfert Subscribers: yaxunl, guansong, arphaman, cfe-commits, caomhin Tags: #clang Differential Revision: https://reviews.llvm.org/D79948	2020-06-04 16:29:33 -04:00
John McCall	8a8d703be0	Fix how cc1 command line options are mapped into FP options. Canonicalize on storing FP options in LangOptions instead of redundantly in CodeGenOptions. Incorporate -ffast-math directly into the values of those LangOptions rather than considering it separately when building FPOptions. Build IR attributes from those options rather than a mix of sources. We should really simplify the driver/cc1 interaction here and have the driver pass down options that cc1 directly honors. That can happen in a follow-up, though. Patch by Michele Scandale! https://reviews.llvm.org/D80315	2020-06-01 22:00:30 -04:00
Florian Hahn	8f3f88d2f5	[Matrix] Implement matrix index expressions ([][]). This patch implements matrix index expressions (matrix[RowIdx][ColumnIdx]). It does so by introducing a new MatrixSubscriptExpr(Base, RowIdx, ColumnIdx). MatrixSubscriptExprs are built in 2 steps in ActOnMatrixSubscriptExpr. First, if the base of a subscript is of matrix type, we create a incomplete MatrixSubscriptExpr(base, idx, nullptr). Second, if the base is an incomplete MatrixSubscriptExpr, we create a complete MatrixSubscriptExpr(base->getBase(), base->getRowIdx(), idx) Similar to vector elements, it is not possible to take the address of a MatrixSubscriptExpr. For CodeGen, a new MatrixElt type is added to LValue, which is very similar to VectorElt. The only difference is that we may need to cast the type of the base from an array to a vector type when accessing it. Reviewers: rjmccall, anemet, Bigcheese, rsmith, martong Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D76791	2020-06-01 20:08:49 +01:00
Zequan Wu	e36076ee3a	[clang] Add nomerge function attribute to clang Differential Revision: https://reviews.llvm.org/D79121	2020-05-21 17:07:39 -07:00
Zequan Wu	b0a0f01bc1	Revert "Add nomerge function attribute to clang" This reverts commit `307e853954`.	2020-05-21 16:13:18 -07:00
Zequan Wu	307e853954	Add nomerge function attribute to clang	2020-05-21 15:28:27 -07:00
Eli Friedman	62f3ef2b53	[CGCall] Annotate references with "align" attribute. If we're going to assume references are dereferenceable, we should also assume they're aligned: otherwise, we can't actually dereference them. See also D80072. Differential Revision: https://reviews.llvm.org/D80166	2020-05-19 20:21:30 -07:00
Sander de Smalen	d6936be2ef	[SveEmitter] Add builtins for svdup and svindex Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D79357	2020-05-12 11:02:32 +01:00
Sander de Smalen	4cad97595f	[SveEmitter] Add builtins for svmovlb and svmovlt These builtins are expanded in CGBuiltin to use intrinsics for (signed/unsigned) shift left long top/bottom. Reviewers: efriedma, SjoerdMeijer Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D79579	2020-05-11 09:41:58 +01:00
Sander de Smalen	3cb8b4c193	[SveEmitter] Add builtins for SVE2 Polynomial arithmetic This patch adds builtins for: - sveorbt - sveortb - svpmul - svpmullb, svpmullb_pair - svpmullt, svpmullt_pair The svpmullb and svpmullt builtins are expressed using the svpmullb_pair and svpmullt_pair LLVM IR intrinsics, respectively. Reviewers: SjoerdMeijer, efriedma, rengolin Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D79480	2020-05-07 11:53:04 +01:00
Michael Liao	276c8dde0b	[clang][codegen] Refactor argument loading in function prolog. NFC. Summary: - Skip copying function arguments and unnecessary casting by using them directly. Reviewers: rjmccall, kerbowa, yaxunl Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D79394	2020-05-05 15:31:51 -04:00
Sander de Smalen	aed6bd6f42	Reland D78750: [SveEmitter] Add builtins for svdupq and svdupq_lane Edit: Changed a few CHECK lines into CHECK-DAG lines. This reverts commit `90f3f62cb0`.	2020-05-05 10:42:11 +01:00
Sander de Smalen	90f3f62cb0	Revert "[SveEmitter] Add builtins for svdupq and svdupq_lane" It seems this patch broke some buildbots, so reverting until I have had a chance to investigate. This reverts commit `6b90a6887d`.	2020-05-04 21:31:55 +01:00
Sander de Smalen	6b90a6887d	[SveEmitter] Add builtins for svdupq and svdupq_lane * svdupq builtins that duplicate scalars to every quadword of a vector are defined using builtins for svld1rq (load and replicate quadword). * svdupq builtins that duplicate boolean values to fill a predicate vector are defined using `svcmpne`. Reviewers: SjoerdMeijer, efriedma, ctetreau Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D78750	2020-05-04 20:38:47 +01:00
Melanie Blower	f5360d4bb3	Reapply "Add support for #pragma float_control" with buildbot fixes Add support for #pragma float_control Reviewers: rjmccall, erichkeane, sepavloff Differential Revision: https://reviews.llvm.org/D72841 This reverts commit `fce82c0ed3`.	2020-05-04 05:51:25 -07:00
Melanie Blower	fce82c0ed3	Revert "Reapply "Add support for #pragma float_control" with improvements to" This reverts commit `69aacaf699`.	2020-05-01 10:31:09 -07:00
Melanie Blower	69aacaf699	Reapply "Add support for #pragma float_control" with improvements to test cases Add support for #pragma float_control Reviewers: rjmccall, erichkeane, sepavloff Differential Revision: https://reviews.llvm.org/D72841 This reverts commit `85dc033cac`, and makes corrections to the test cases that failed on buildbots.	2020-05-01 10:03:30 -07:00
Melanie Blower	85dc033cac	Revert "Add support for #pragma float_control" This reverts commit `4f1e9a17e9`. due to fail on buildbot, sorry for the noise	2020-05-01 06:36:58 -07:00
Melanie Blower	4f1e9a17e9	Add support for #pragma float_control Reviewers: rjmccall, erichkeane, sepavloff Differential Revision: https://reviews.llvm.org/D72841	2020-05-01 06:14:24 -07:00
Sander de Smalen	42a56bf63f	[SveEmitter] Add builtins for gather prefetches Patch by Andrzej Warzynski Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D78677	2020-04-29 11:52:49 +01:00
Christopher Tetreault	ef3678cfee	[SVE] Update EmitSVEPredicateCast to take a ScalableVectorType Summary: Removes usage of VectorType::getNumElements identified by test located at CodeGen/aarch64-sve-intrinsics/acle_sve_abs.c. Since the type is an SVE predicate vector, it makes sense to specialize the code for scalable vectors only. Reviewers: rengolin, efriedma Reviewed By: efriedma Subscribers: tschuett, kristof.beyls, rkruppe, psnobl, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D78958	2020-04-28 11:22:20 -07:00
Momchil Velikov	102b4105e3	[CMSE] Clear padding bits of struct/unions/fp16 passed by value When passing a value of a struct/union type from secure to non-secure state (that is returning from a CMSE entry function or passing an argument to CMSE-non-secure call), there is a potential sensitive information leak via the padding bits in the structure. It is not possible in the general case to ensure those bits are cleared by using Standard C/C++. This patch makes the compiler emit code to clear such padding bits. Since type information is lost in LLVM IR, the code generation is done by Clang. For each interesting record type, we build a bitmask, in which all the bits, corresponding to user declared members, are set. Values of record types are returned by coercing them to an integer. After the coercion, the coerced value is masked (with bitwise AND) and then returned by the function. In a similar manner, values of record types are passed as arguments by coercing them to an array of integers, and the coerced values themselves are masked. For union types, we effectively clear only bits, which aren't part of any member, since we don't know which is the currently active one. The compiler will issue a warning, whenever a union is passed to non-secure state. Values of half-precision floating-point types are passed in the least significant bits of a 32-bit register (GPR or FPR) with the most significant bits unspecified. Since this is also a potential leak of sensitive information, this patch also clears those unspecified bits. Differential Revision: https://reviews.llvm.org/D76369	2020-04-28 17:05:58 +01:00
Christopher Tetreault	da8918f27e	[SVE][NFC] Use ScalableVectorType in CGBuiltin Summary: * Upgrade some usages of VectorType to use ScalableVectorType Reviewers: efriedma, david-arm, fpetrogalli, kmclaughlin Reviewed By: efriedma Subscribers: tschuett, rkruppe, psnobl, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D78842	2020-04-27 16:29:45 -07:00
Sander de Smalen	823e2a670a	[SveEmitter] Add builtins for contiguous prefetches This patch also adds the enum `sv_prfop` for the prefetch operation specifier and checks to ensure the passed enum values are valid. Reviewers: SjoerdMeijer, efriedma, ctetreau Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D78674	2020-04-24 11:35:59 +01:00
Sander de Smalen	2d1baf606a	[SveEmitter] Add builtins for svwhilerw/svwhilewr This also adds the IsOverloadWhileRW flag which tells CGBuiltin to use the result predicate type and the first pointer type as the overloaded types for the LLVM IR intrinsic. Reviewers: SjoerdMeijer, efriedma Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D78238	2020-04-22 21:49:18 +01:00
Sander de Smalen	662cbaf647	[SveEmitter] Add IsOverloadNone flag and builtins for svpfalse and svcnt[bhwd]_pat Add the IsOverloadNone flag to tell CGBuiltin that it does not have an overloaded type. This is used for e.g. svpfalse which does not take any arguments and always returns a svbool_t. This patch also adds builtins for svcntb_pat, svcnth_pat, svcntw_pat and svcntd_pat, as those don't require custom codegen. Reviewers: SjoerdMeijer, efriedma, rovka Reviewed By: efriedma Tags: #clang Differential Revision: https://reviews.llvm.org/D77596	2020-04-22 16:42:08 +01:00
Sander de Smalen	41d52662d5	[SveEmitter] Add support for _n form builtins The ACLE has builtins that take a scalar value that is to be expanded into a vector by the operation. While the ISA may have an instruction that takes an immediate or a scalar to represent this, the LLVM IR intrinsic may not, so Clang will have to splat the scalar value. This patch also adds the _n forms for svabd, svadd, svdiv, svdivr, svmax, svmin, svmul, svmulh, svub and svsubr. Reviewers: SjoerdMeijer, efriedma, rovka Reviewed By: SjoerdMeijer Tags: #clang Differential Revision: https://reviews.llvm.org/D77594	2020-04-22 14:23:54 +01:00
Andrzej Warzynski	72f565899d	[SveEmitter] Implement builtins for gathers/scatters This patch adds builtins for: * regular, first-faulting and non-temporal gather loads * regular and non-temporal scatter stores Differential Revision: https://reviews.llvm.org/D77735	2020-04-22 13:21:39 +01:00
Sander de Smalen	c8a5b30bac	[SveEmitter] Add range checks for immediates and predicate patterns. Summary: This patch adds a mechanism to easily add range checks for a builtin's immediate operands. This patch is tested with the qdech intrinsic, which takes both an enum for the predicate pattern, as well as an immediate for the multiplier. Reviewers: efriedma, SjoerdMeijer, rovka Reviewed By: efriedma, SjoerdMeijer Subscribers: mgorny, tschuett, mgrang, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76678	2020-04-14 16:49:32 +01:00
Sander de Smalen	17a68c61a9	[SveEmitter] Implement builtins for contiguous loads/stores This adds builtins for all contiguous loads/stores, including non-temporal, first-faulting and non-faulting. Reviewers: efriedma, SjoerdMeijer Reviewed By: SjoerdMeijer Tags: #clang Differential Revision: https://reviews.llvm.org/D76238	2020-04-14 15:24:57 +01:00
Lucas Prates	f56550cf7f	[ARM] Enabling range checks on Neon intrinsics' lane arguments Summary: Range checks were not properly performed in the lane arguments of Neon intrinsics implemented based on splat operations. Calls to those intrinsics where translated to `__builtin__shufflevector` calls directly by the pre-processor through the arm_neon.h macros, missing the chance for the proper range checks. This patch enables the range check by introducing an auxiliary splat instruction in arm_neon.td, delaying the translation to shufflevector calls to CGBuiltin.cpp in clang after the checks were performed. Reviewers: jmolloy, t.p.northover, rsmith, olista01, ostannard Reviewed By: ostannard Subscribers: ostannard, dnsampaio, danielkiss, kristof.beyls, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D74619	2020-03-19 12:07:23 +00:00
Lucas Prates	7bf23563f4	Revert "[ARM] Setting missing isLaneQ attribute on Neon Intrisics definitions" This reverts commit `62ab15ffa3`. Multiple commits were unintentionally squashed into this one. Reverting so each of them can be pushed properly.	2020-03-19 12:01:13 +00:00
Lucas Prates	62ab15ffa3	[ARM] Setting missing isLaneQ attribute on Neon Intrisics definitions Summary: Some of the `*_laneq` intrinsics defined in arm_neon.td were missing the setting of the `isLaneQ` attribute. This patch sets the attribute on the related definitions, as they will be required to properly perform range checks on their lane arguments. Reviewers: jmolloy, t.p.northover, rsmith, olista01, dnsampaio Reviewed By: dnsampaio Subscribers: dnsampaio, kristof.beyls, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D74616	2020-03-19 11:52:41 +00:00
Sander de Smalen	c5b81466c2	Reland D75470 [SVE] Auto-generate builtins and header for svld1. Reworked the patch to avoid sharing a header (SVETypeFlags.h) between include/clang/Basic and utils/TableGen/SveEmitter.cpp. Now the patch generates the enum/flags which is included in TargetBuiltins.h. Also renamed one of the SveEmitter options to be in line with MVE. Summary: This is a first patch in a series for the SveEmitter to generate the arm_sve.h header file and builtins. I've tried my best to strip down this patch as best as I could, but there are still a few changes that are not necessarily exercised by the load intrinsics in this patch, mostly around the SVEType class which has some common logic to represent types from a type and prototype string. I thought it didn't make much sense to remove that from this patch and split it up.	2020-03-18 11:16:28 +00:00
Sander de Smalen	6ce537ccfc	Revert "[SVE] Auto-generate builtins and header for svld1." This reverts commit `8b409eabaf`. Reverting this patch for now because it breaks some buildbots.	2020-03-16 15:22:15 +00:00
Sander de Smalen	8b409eabaf	[SVE] Auto-generate builtins and header for svld1. This is a first patch in a series for the SveEmitter to generate the arm_sve.h header file and builtins. I've tried my best to strip down this patch as best as I could, but there are still a few changes that are not necessarily exercised by the load intrinsics in this patch, mostly around the SVEType class which has some common logic to represent types from a type and prototype string. I thought it didn't make much sense to remove that from this patch and split it up. Reviewers: efriedma, rovka, SjoerdMeijer, rsandifo-arm, rengolin Reviewed By: SjoerdMeijer Tags: #clang Differential Revision: https://reviews.llvm.org/D75470	2020-03-16 10:52:37 +00:00

1 2 3 4 5 ...

1450 Commits