llvm-project

Commit Graph

Author	SHA1	Message	Date
Sven van Haastregt	6690b7d3ac	[OpenCL] Ensure atomic_init is guarded with extension The named and generic address space overloads for atomic_init added by `50f8abb9f4` ("[OpenCL] Add OpenCL 3.0 atomics to -fdeclare-opencl-builtins", 2022-02-11) were not guarded by the corresponding extensions.	2022-02-16 15:12:23 +00:00
Chuanqi Xu	d30ca5e2e2	[C++20] [Coroutines] Implement return value optimization for get_return_object This patch tries to implement RVO for coroutine's return object got from get_return_object. From [dcl.fct.def.coroutine]/p7 we could know that the return value of get_return_object is either a reference or a prvalue. So it makes sense to do copy elision for the return value. The return object should be constructed directly into the storage where they would otherwise be copied/moved to. Test Plan: folly, check-all Reviewed By: junparser Differential revision: https://reviews.llvm.org/D117087	2022-02-16 13:38:00 +08:00
Nico Weber	125abb61f7	Revert "Add support for floating-point option `ffp-eval-method` and for" This reverts commit `4bafe65c2b`. Breaks at least Misc/warning-flags.c, see comments on https://reviews.llvm.org/D109239	2022-02-15 22:02:25 -05:00
Zahira Ammarguellat	4bafe65c2b	Add support for floating-point option `ffp-eval-method` and for `pragma clang fp eval_method`.	2022-02-15 13:59:27 -08:00
Yaxun (Sam) Liu	73b22935a7	[CUDA][HIP] Do not promote constexpr var with non-constant initializer constexpr var may be initialized with address of non-const variable. In this case the initializer is not constant in device compilation. This has been handled for const vars but not for constexpr vars. This patch makes handling of const var and constexpr var consistent. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D119615 Fixes: https://github.com/llvm/llvm-project/issues/53780	2022-02-15 15:15:55 -05:00
Amy Kwan	5dc0a1657b	[PowerPC] Fix __builtin_pdepd and __builtin_pextd to be 64-bit and P10 only. The `__builtin_pdepd` and `__builtin_pextd` are P10 builtins that are meant to be used under 64-bit only. For instance, when the builtins are compiled under 32-bit mode: ``` $ cat t.c unsigned long long foo(unsigned long long a, unsigned long long b) { return __builtin_pextd(a,b); } $ clang -c t.c -mcpu=pwr10 -m32 ExpandIntegerResult #0: t31: i64 = llvm.ppc.pextd TargetConstant:i32<6928>, t28, t29 fatal error: error in backend: Do not know how to expand the result of this operator! ``` This patch adds sema checking for these builtins to compile under 64-bit mode only and on P10. The builtins will emit a diagnostic when they are compiled on non-P10 compilations and on 32-bit mode. Differential Revision: https://reviews.llvm.org/D118753	2022-02-15 12:30:50 -06:00
Aaron Ballman	a766545402	Update the diagnostic behavior of [[noreturn]] in C2x Post-commit review feedback suggested dropping the deprecated diagnostic for the 'noreturn' macro (the diagnostic from the header file suffices and the macro diagnostic could be confusing) and to only issue the deprecated diagnostic for [[_Noreturn]] when the attribute identifier is either directly written or not from a system macro. Amends the commit made in `5029dce492`.	2022-02-14 14:04:32 -05:00
Arthur O'Dwyer	3c8d2aa87c	[clang] Don't emit redundant warnings for 'return;' when the function declaration's return type is already invalid for some reason. This is relevant to https://github.com/llvm/llvm-project/issues/49188 because another way that the declaration's return type could become invalid is that it might be `C auto` where `C<void>` is false. Differential Revision: https://reviews.llvm.org/D119094	2022-02-14 11:28:32 -05:00
Aaron Ballman	5029dce492	Implement WG14 N2764 the [[noreturn]] attribute This adds support for http://www.open-std.org/jtc1/sc22/wg14/www/docs/n2764.pdf, which was adopted at the Feb 2022 WG14 meeting. That paper adds [[noreturn]] and [[_Noreturn]] to the list of supported attributes in C2x. These attributes have the same semantics as the [[noreturn]] attribute in C++. The [[_Noreturn]] attribute was added as a deprecated feature so that translation units which include <stdnoreturn.h> do not get an error on use of [[noreturn]] because the macro expands to _Noreturn. Users can use -Wno-deprecated-attributes to silence the diagnostic. Use of <stdnotreturn.h> or the noreturn macro were both deprecated. Users can define the _CLANG_DISABLE_CRT_DEPRECATION_WARNINGS macro to suppress the deprecation diagnostics coming from the header file.	2022-02-14 09:38:26 -05:00
Aaron Ballman	76032b0e3f	Check for the overloadable attribute in all the appropriate syntactic locations When forming the function type from a declarator, we look for an overloadable attribute before issuing a diagnostic in C about a function signature containing only .... When the attribute is present, we allow such a declaration for compatibility with the overloading rules in C++. However, we were not looking for the attribute in all of the places it is legal to write it on a declarator and so we only accepted the signature in some forms and incorrectly rejected the signature in others. We now check for the attribute preceding the declarator instead of only being applied to the declarator directly.	2022-02-14 08:54:21 -05:00
Simon Pilgrim	4b43209683	[clang][sema] Use castAs<> instead of getAs<> to avoid dereference of nullptr The pointers are referenced immediately, so assert the cast is correct instead of returning nullptr	2022-02-13 13:14:05 +00:00
Simon Pilgrim	fbe0ca576d	[clang][sema] checkNoThrow - use cast<> instead of dyn_cast<> to avoid dereference of nullptr The pointer is referenced immediately, so assert the cast is correct instead of returning nullptr	2022-02-12 11:18:49 +00:00
Simon Pilgrim	f0f4452ed4	[clang][sema] Sema::CheckFreeArguments - use cast<> instead of dyn_cast<> to avoid dereference of nullptr The pointer is referenced immediately, so assert the cast is correct instead of returning nullptr	2022-02-12 11:17:02 +00:00
Simon Pilgrim	5e27487fc9	[clang][sema] TryStaticCast - use castAs<> instead of getAs<> to avoid dereference of nullptr The pointer is referenced immediately, so assert the cast is correct instead of returning nullptr	2022-02-12 11:14:19 +00:00
Simon Pilgrim	1e64590041	[clang][sema] - remove CodeCompleter nullptr checks All paths have already dereferenced the CodeCompleter pointer in the ResultBuilder constructor	2022-02-11 15:09:44 +00:00
Simon Pilgrim	5fb180704c	[clang][sema] ActOnExplicitInstantiation - remove Prev nullptr check All paths have already dereferenced the Prev pointer	2022-02-11 15:09:44 +00:00
Dávid Bolvanský	b2936caf77	[clang] inheritance fix for nomerge attribute Discussed here: https://reviews.llvm.org/D119061#3310822 Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D119451	2022-02-11 15:51:08 +01:00
Anton Zabaznov	bfb1a33bec	[OpenCL] Adjust diagnostic for subgroup support. OpenCL C 3.0 __opencl_c_subgroups feature is slightly different then other equivalent features and extensions (fp64 and 3d image writes): OpenCL C 3.0 device can support the extension but not the feature. cl_khr_subgroups requires subgroup independent forward progress. This patch adjusts the check which is used when translating language builtins to check either the extension or feature is supported. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D118999	2022-02-11 17:40:24 +03:00
Sven van Haastregt	50f8abb9f4	[OpenCL] Add OpenCL 3.0 atomics to -fdeclare-opencl-builtins Add the atomic overloads for the `global` and `local` address spaces, which are new in OpenCL 3.0. Ensure the preexisting `generic` overloads are guarded by the generic address space feature macro. Ensure a subset of the atomic builtins are guarded by the `__opencl_c_atomic_order_seq_cst` and `__opencl_c_atomic_scope_device` feature macros, and enable those macros for SPIR/SPIR-V targets in `opencl-c-base.h`. Also guard the `cl_ext_float_atomics` builtins with the atomic order and scope feature macros. Differential Revision: https://reviews.llvm.org/D119420	2022-02-11 10:14:14 +00:00
Sven van Haastregt	8d37043520	[OpenCL] Refactor cl_ext_float_atomics declarations; NFC Reduce the amount of repetition in the declarations by leveraging more TableGen constructs. This is in preparation for adding the OpenCL 3.0 atomics feature optionality.	2022-02-10 09:43:32 +00:00
Sven van Haastregt	fe690587be	[OpenCL] Fix atomic_fetch_add/sub with half type An error in the tablegen description affects the declarations provided by `-fdeclare-opencl-builtins` for `atomic_fetch_add` and `atomic_fetch_sub`. The atomic argument should be an atomic_half, not an atomic_float.	2022-02-09 10:47:45 +00:00
Bill Wendling	deaf22bc0e	[X86] Implement -fzero-call-used-regs option The "-fzero-call-used-regs" option tells the compiler to zero out certain registers before the function returns. It's also available as a function attribute: zero_call_used_regs. The two upper categories are: - "used": Zero out used registers. - "all": Zero out all registers, whether used or not. The individual options are: - "skip": Don't zero out any registers. This is the default. - "used": Zero out all used registers. - "used-arg": Zero out used registers that are used for arguments. - "used-gpr": Zero out used registers that are GPRs. - "used-gpr-arg": Zero out used GPRs that are used as arguments. - "all": Zero out all registers. - "all-arg": Zero out all registers used for arguments. - "all-gpr": Zero out all GPRs. - "all-gpr-arg": Zero out all GPRs used for arguments. This is used to help mitigate Return-Oriented Programming exploits. Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D110869	2022-02-08 17:42:54 -08:00
Sylvestre Ledru	f2c2e924e7	Fix a typo (occured => occurred) Reported: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1005195	2022-02-08 21:35:26 +01:00
James Y Knight	9545976ff1	Revert "[Clang] Propagate guaranteed alignment for malloc and others" The above change assumed that malloc (and friends) would always allocate memory to getNewAlign(), even for allocations which have a smaller size. This is not actually required by spec (a 1-byte allocation may validly have 1-byte alignment). Some real-world malloc implementations do not provide this guarantee, and thus this optimization is breaking programs. Fixes #53540 This reverts commit `c2297544c0`. Differential Revision: https://reviews.llvm.org/D118804	2022-02-08 14:34:44 -05:00
Steffen Larsen	ead1690d31	Allow parameter pack expansions and initializer lists in annotate attribute These changes make the Clang parser recognize expression parameter pack expansion and initializer lists in attribute arguments. Because expression parameter pack expansion requires additional handling while creating and instantiating templates, the support for them must be explicitly supported through the AcceptsExprPack flag. Handling expression pack expansions may require a delay to when the arguments of an attribute are correctly populated. To this end, attributes that are set to accept these - through setting the AcceptsExprPack flag - will automatically have an additional variadic expression argument member named DelayedArgs. This member is not exposed the same way other arguments are but is set through the new CreateWithDelayedArgs creator function generated for applicable attributes. To illustrate how to implement support for expression pack expansion support, clang::annotate is made to support pack expansions. This is done by making handleAnnotationAttr delay setting the actual attribute arguments until after template instantiation if it was unable to populate the arguments due to dependencies in the parsed expressions.	2022-02-08 13:38:07 -05:00
Corentin Jabot	c151225096	[C++2b] Implement multidimentional subscript operator Implement P2128R6 in C++23 mode. Unlike GCC's implementation, this doesn't try to recover when a user meant to use a comma expression. Because the syntax changes meaning in C++23, the patch is NOT implemented as an extension. Instead, declaring an array with not exactly 1 parameter is an error in older languages modes. There is an off-by-default extension warning in C++23 mode. Unlike the standard, we supports default arguments; Ie, we assume, based on conversations in WG21, that the proposed resolution to CWG2507 will be accepted. We allow arrays OpenMP sections and C++23 multidimensional array to coexist: [a , b] multi dimensional array [a : b] open mp section [a, b: c] // error The rest of the patch is relatively straight forward: we take care to support an arbitrary number of arguments everywhere.	2022-02-08 12:10:47 -05:00
David Pagan	0a7cc078ac	Enable inoutset dependency-type in depend clause. Done in manner similar to mutexinoutset (see https://reviews.llvm.org/D57576) Runtime support already exists in LLVM OpenMP runtime (see https://reviews.llvm.org/D97085). The value used to identify an inoutset dependency type in the LLVM OpenMP runtime is 8. Some tests updated due to change in dependency type error messages that now include new dependency type. Also updated test/OpenMP/task_codegen.cpp to verify we emit the right code.	2022-02-08 08:35:36 -05:00
Simon Pilgrim	c00db97159	[Clang] Add elementwise saturated add/sub builtins This patch implements `__builtin_elementwise_add_sat` and `__builtin_elementwise_sub_sat` builtins. These map to the add/sub saturated math intrinsics described here: https://llvm.org/docs/LangRef.html#saturation-arithmetic-intrinsics With this in place we should then be able to replace the x86 SSE adds/subs intrinsics with these generic variants - it looks like other targets should be able to use these as well (arm/aarch64/webassembly all have similar examples in cgbuiltin). Differential Revision: https://reviews.llvm.org/D117898	2022-02-08 11:22:01 +00:00
Chuanqi Xu	3504937dfb	[C++20] [Modules] Don't create multiple global module fragment Since the serialization code would recognize modules by names and the name of all global module fragment is <global>, so that the serialization code would complain for the same module. This patch fixes this by using a unique global module fragment in Sema. Before this patch, the compiler would fail on an assertion complaining the duplicated modules. Reviewed By: urnathan, rsmith Differential Revision: https://reviews.llvm.org/D115610	2022-02-08 11:52:09 +08:00
Chuanqi Xu	e39ba04617	[C++20] [Coroutines] Warning for always_inline coroutine See the discussion in https://reviews.llvm.org/D100282. The coroutine marked always inline might not be inlined properly in current compiler support. Since the coroutine would be splitted into pieces. And the call to resume() and destroy() functions might be indirect call. Also the ramp function wouldn't get inlined under O0 due to pipeline ordering problems. It might be different to what users expects to. Emit a warning to tell it. This is what GCC does too: https://godbolt.org/z/7eajb1Gf8 Reviewed By: Quuxplusone Differential Revision: https://reviews.llvm.org/D115867	2022-02-08 11:43:42 +08:00
Mariya Podchishchaeva	1cee960898	[SYCL] Disallow explicit casts between mismatching address spaces Reviewed By: bader Differential Revision: https://reviews.llvm.org/D118935	2022-02-07 11:57:30 +03:00
Dávid Bolvanský	7119f76c47	[clang] added allocsize attribute to allocation functions	2022-02-05 14:26:35 +01:00
Dávid Bolvanský	74b1c4c367	[clang] added alloc allign attr to memalign	2022-02-05 11:46:56 +01:00
Shilei Tian	b8ec430de7	[Clang][Sema][OpenMP] Fix uninitialized variable Op This can fix the case atomic_messages	2022-02-04 15:00:43 -05:00
Devin Jeanpierre	56d46b36fc	[clang] roll-forward "[clang] Mark `trivial_abi` types as "trivially relocatable"". This reverts commit `852afed5e0`. Changes since D114732: On PS4, we reverse the expectation that classes whose constructor is deleted are not trivially relocatable. Because, at the moment, only classes which are passed in registers are trivially relocatable, and PS4 allows passing in registers if the copy constructor is deleted, the original assertions were broken on PS4. (This is kinda similar to DR1734.) Reviewed By: gribozavr2 Differential Revision: https://reviews.llvm.org/D119017	2022-02-04 20:17:34 +01:00
Benjamin Kramer	012c811fed	[Clang][Sema][OpenMP] Remove unused variables. NFC.	2022-02-04 19:27:59 +01:00
Shilei Tian	b35be6fe98	[Clang][Sema][OpenMP] Sema support for `atomic compare` This patch adds the Sema support for `atomic compare`. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D116637	2022-02-04 12:30:56 -05:00
Saiyedul Islam	ae9c074064	[OpenMP][Clang] Allow ancestor device modifier only with reverse offloading OpenMP Spec 5.0 [2.12.5, Restrictions]: If a device clause in which the ancestor device-modifier appears is present on the target construct, then a requires directive with the reverse_offload clause must be specified. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D118887	2022-02-04 12:10:14 +00:00
Sven van Haastregt	31fa3a4d44	[OpenCL] Move OpenCL 2.0 atomics into multiclass; NFC This is in preparation for adding the OpenCL 3.0 builtins with named address space arguments.	2022-02-04 10:17:48 +00:00
Dmitri Gribenko	852afed5e0	Revert "[clang] Mark `trivial_abi` types as "trivially relocatable"." This reverts commit `19aa2db023`. It breaks a PS4 buildbot.	2022-02-03 22:31:44 +01:00
Sven van Haastregt	d97a4dfea6	[OpenCL] Move most _explicit atomics into multiclass; NFC This will simplify future conditionalization for OpenCL 3.0 optionality of atomic features. The only set of atomic functions not using the multiclass is atomic_compare_exchange_strong/weak, as these don't fit the common pattern due to having 2 MemoryOrder arguments.	2022-02-03 11:09:41 +00:00
Devin Jeanpierre	19aa2db023	[clang] Mark `trivial_abi` types as "trivially relocatable". This change enables library code to skip paired move-construction and destruction for `trivial_abi` types, as if they were trivially-movable and trivially-destructible. This offers an extension to the performance fix offered by `trivial_abi`: rather than only offering trivial-type-like performance for pass-by-value, it also offers it for library code that moves values but not as arguments. For example, if we use `memcpy` for trivially relocatable types inside of vector reallocation, and mark `unique_ptr` as `trivial_abi` (via `_LIBCPP_ABI_ENABLE_UNIQUE_PTR_TRIVIAL_ABI` / `_LIBCPP_ABI_UNSTABLE` / etc.), this would speed up `vector<unique_ptr>::push_back` by 40% on my benchmarks. (Though note that in this case, the compiler could have done this anyway, but happens not to due to the inlining horizon.) If accepted, I intend to follow up with exactly such changes to library code, including and especially `std::vector`, making them use a trivial relocation operation on trivially relocatable types. D50119 and P1144: This change is very similar to D50119, which was rejected from Clang. (That change was an implementation of P1144, which is not yet part of the C++ standard.) The intent of this change, rather than trying to pick a winning proposal for trivial relocation operations, is to extend the behavior of `trivial_abi` in a way that could be made compatible with any such proposal. If P1144 or any similar proposal were accepted, then `trivial_abi`, `__is_trivially_relocatable`, and everything else in this change would be redefined in terms of that. Safety: It's worth pointing out, specifically, that `trivial_abi` already implies trivial relocatability in a narrow sense: a `trivial_abi` type, when passed by value, has its constructor run in one location, and its destructor run in another, after the type has been trivially relocated (through registers). Trivial relocatability optimizations could change the number of paired constructor/destructor calls, but this seems unlikely to matter for `trivial_abi` types. Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D114732	2022-02-02 17:42:20 -08:00
Zahira Ammarguellat	e692654a4d	The methods visited for a special class must have an identifier.	2022-02-02 13:12:33 -08:00
Arthur O'Dwyer	c0185ffaec	[clang] Don't typo-fix an expression in a SFINAE context. If this is a SFINAE context, then continuing to look up names (in particular, to treat a non-function as a function, and then do ADL) might too-eagerly complete a type that it's not safe to complete right now. We should just say "okay, that's a substitution failure" and not do any more work than absolutely required. Fixes #52970. Differential Revision: https://reviews.llvm.org/D117603	2022-02-01 15:17:28 -05:00
Arthur O'Dwyer	f6ce456707	[clang] Correctly(?) handle placeholder types in ExprRequirements. Bug #52905 was originally papered over in a different way, but I believe this is the actually proper fix, or at least closer to it. We need to detect placeholder types as close to the front-end as possible, and cause them to fail constraints, rather than letting them persist into later stages. Fixes #52905. Fixes #52909. Fixes #53075. Differential Revision: https://reviews.llvm.org/D118552	2022-02-01 15:16:17 -05:00
David Blaikie	275c56226d	Disable -Wmissing-prototypes for internal linkage functions that aren't explicitly marked "static" Some functions can end up non-externally visible despite not being declared "static" or in an unnamed namespace in C++ - such as by having parameters that are of non-external types. Such functions aren't mistakenly intended to be defining some function that needs a declaration. They could be maybe more legible (except for the `operator new` example) with an explicit static, but that's a stylistic thing outside what should be addressed by a warning.	2022-01-31 17:34:51 -08:00
Michael Kruse	8a9e4f245b	[Clang][OpenMPIRBuilder] Fix off-by-one error when dividing by stepsize. When the stepsize does not evenly divide the range's end, round-up to ensure that that last multiple of the stepsize before the reaching the upper boud is reached. For instance, the trip count of for (int i = 0; i < 7; i+=5) is two (i=0 and i=5), not (7-0)/5 == 1. Reviewed By: peixin Differential Revision: https://reviews.llvm.org/D118542	2022-01-31 17:28:52 -06:00
Sven van Haastregt	8e6099291d	[OpenCL] Make generic addrspace optional for -fdeclare-opencl-builtins Currently, -fdeclare-opencl-builtins always adds the generic address space overloads of e.g. the vload builtin functions in OpenCL 3.0 mode, even when the generic address space feature is disabled. Guard the generic address space overloads by the `__opencl_c_generic_address_space` feature instead of by OpenCL version. Guard the private, global, and local overloads using the internal `__opencl_c_named_address_space_builtins` feature. Differential Revision: https://reviews.llvm.org/D107769	2022-01-31 10:21:05 +00:00
serge-sans-paille	b8290ffa9f	Fix -Wreserved-identifier in presence of system macro Do not warn on reserved identifiers resulting from expansion of system macros. Also properly test -Wreserved-identifier wrt. system headers. Should fix #49592 Differential Revision: https://reviews.llvm.org/D118532	2022-01-31 11:16:28 +01:00
Kazu Hirata	fdd0e745c3	[clang] Remove redundant string initialization (NFC) Identified with readability-redundant-string-init.	2022-01-30 12:32:55 -08:00

1 2 3 4 5 ...

18212 Commits