llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	508b3f437d	Attempt to fix sphinx 'Malformed table' warning.	2022-02-08 11:48:37 +00:00
Simon Pilgrim	c00db97159	[Clang] Add elementwise saturated add/sub builtins This patch implements `__builtin_elementwise_add_sat` and `__builtin_elementwise_sub_sat` builtins. These map to the add/sub saturated math intrinsics described here: https://llvm.org/docs/LangRef.html#saturation-arithmetic-intrinsics With this in place we should then be able to replace the x86 SSE adds/subs intrinsics with these generic variants - it looks like other targets should be able to use these as well (arm/aarch64/webassembly all have similar examples in cgbuiltin). Differential Revision: https://reviews.llvm.org/D117898	2022-02-08 11:22:01 +00:00
Devin Jeanpierre	56d46b36fc	[clang] roll-forward "[clang] Mark `trivial_abi` types as "trivially relocatable"". This reverts commit `852afed5e0`. Changes since D114732: On PS4, we reverse the expectation that classes whose constructor is deleted are not trivially relocatable. Because, at the moment, only classes which are passed in registers are trivially relocatable, and PS4 allows passing in registers if the copy constructor is deleted, the original assertions were broken on PS4. (This is kinda similar to DR1734.) Reviewed By: gribozavr2 Differential Revision: https://reviews.llvm.org/D119017	2022-02-04 20:17:34 +01:00
Dmitri Gribenko	852afed5e0	Revert "[clang] Mark `trivial_abi` types as "trivially relocatable"." This reverts commit `19aa2db023`. It breaks a PS4 buildbot.	2022-02-03 22:31:44 +01:00
Devin Jeanpierre	19aa2db023	[clang] Mark `trivial_abi` types as "trivially relocatable". This change enables library code to skip paired move-construction and destruction for `trivial_abi` types, as if they were trivially-movable and trivially-destructible. This offers an extension to the performance fix offered by `trivial_abi`: rather than only offering trivial-type-like performance for pass-by-value, it also offers it for library code that moves values but not as arguments. For example, if we use `memcpy` for trivially relocatable types inside of vector reallocation, and mark `unique_ptr` as `trivial_abi` (via `_LIBCPP_ABI_ENABLE_UNIQUE_PTR_TRIVIAL_ABI` / `_LIBCPP_ABI_UNSTABLE` / etc.), this would speed up `vector<unique_ptr>::push_back` by 40% on my benchmarks. (Though note that in this case, the compiler could have done this anyway, but happens not to due to the inlining horizon.) If accepted, I intend to follow up with exactly such changes to library code, including and especially `std::vector`, making them use a trivial relocation operation on trivially relocatable types. D50119 and P1144: This change is very similar to D50119, which was rejected from Clang. (That change was an implementation of P1144, which is not yet part of the C++ standard.) The intent of this change, rather than trying to pick a winning proposal for trivial relocation operations, is to extend the behavior of `trivial_abi` in a way that could be made compatible with any such proposal. If P1144 or any similar proposal were accepted, then `trivial_abi`, `__is_trivially_relocatable`, and everything else in this change would be redefined in terms of that. Safety: It's worth pointing out, specifically, that `trivial_abi` already implies trivial relocatability in a narrow sense: a `trivial_abi` type, when passed by value, has its constructor run in one location, and its destructor run in another, after the type has been trivially relocated (through registers). Trivial relocatability optimizations could change the number of paired constructor/destructor calls, but this seems unlikely to matter for `trivial_abi` types. Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D114732	2022-02-02 17:42:20 -08:00
serge-sans-paille	6cacd420a1	Document several clang-supported builtins Namely __builtin_alloca __builtin_alloca_with_align __builtin_call_with_static_chain __builtin_expect __builtin_expect_with_probablity __builtin_prefetch Differential Revision: https://reviews.llvm.org/D117296	2022-01-14 22:03:56 +01:00
Nick Desaulniers	5c562f62a4	[clang] number labels in asm goto strings after tied inputs I noticed that the following case would compile in Clang but not GCC: void x(void) { void p = &&foo; asm goto ("# %0\n\t# %l1":"+r"(p):::foo); foo:; return p; } Changing the output template above from %l2 would compile in GCC but not Clang. This demonstrates that when using tied outputs (say via the "+r" output constraint), the hidden inputs occur or are numbered BEFORE the labels, at least with GCC. In fact, GCC does denote this in its documentation: https://gcc.gnu.org/onlinedocs/gcc-11.2.0/gcc/Extended-Asm.html#Goto-Labels > Output operand with constraint modifier ‘+’ is counted as two operands > because it is considered as one output and one input operand. For the sake of compatibility, I think it's worthwhile to just make this change. It's better to use symbolic names for compatibility (especially now between released version of Clang that support asm goto with outputs). ie. %l1 from the above would be %l[foo]. The GCC docs also make this recommendation. Also, I cleaned up some cruft in GCCAsmStmt::getNamedOperand. AFAICT, NumPlusOperands was no longer used, though I couldn't find which commit didn't clean that up correctly. Link: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98096 Link: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103640 Link: https://gcc.gnu.org/onlinedocs/gcc-11.2.0/gcc/Extended-Asm.html#Goto-Labels Reviewed By: void Differential Revision: https://reviews.llvm.org/D115471	2022-01-11 12:09:24 -08:00
serge-sans-paille	491984c4e6	Document __builtin_trap and __builtin_debugtrap Differential Revision: https://reviews.llvm.org/D116598	2022-01-05 03:10:17 -05:00
Sami Tolvanen	ec2e26eaf6	[Clang] Add __builtin_function_start Control-Flow Integrity (CFI) replaces references to address-taken functions with pointers to the CFI jump table. This is a problem for low-level code, such as operating system kernels, which may need the address of an actual function body without the jump table indirection. This change adds the __builtin_function_start() builtin, which accepts an argument that can be constant-evaluated to a function, and returns the address of the function body. Link: https://github.com/ClangBuiltLinux/linux/issues/1353 Depends on D108478 Reviewed By: pcc, rjmccall Differential Revision: https://reviews.llvm.org/D108479	2021-12-20 12:55:33 -08:00
Chuanqi Xu	352e36e10d	[Coroutines] Remove unused coroutine builtin/intrinsics llvm.coro.param (NFC-ish) I found that the coroutine intrinsic llvm.coro.param in documentation (https://llvm.org/docs/Coroutines.html#id101) didn't get used actually since there isn't lowering codes in LLVM. I also checked the implementation of libstdc++ and libc++. Both of them didn't use llvm.coro.param. So I am pretty sure that the llvm.coro.param intrinsic is unused. I think it would be better t to remove it to avoid possible misleading understandings. Note: according to [class.copy.elision]/p1.3, this optimization is allowed by the C++ language specification. Let's make it someday. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D115222	2021-12-09 14:40:25 +08:00
Aaron Ballman	6c75ab5f66	Introduce _BitInt, deprecate _ExtInt WG14 adopted the _ExtInt feature from Clang for C23, but renamed the type to be _BitInt. This patch does the vast majority of the work to rename _ExtInt to _BitInt, which accounts for most of its size. The new type is exposed in older C modes and all C++ modes as a conforming extension. However, there are functional changes worth calling out: * Deprecates _ExtInt with a fix-it to help users migrate to _BitInt. * Updates the mangling for the type. * Updates the documentation and adds a release note to warn users what is going on. * Adds new diagnostics for use of _BitInt to call out when it's used as a Clang extension or as a pre-C23 compatibility concern. * Adds new tests for the new diagnostic behaviors. I want to call out the ABI break specifically. We do not believe that this break will cause a significant imposition for early adopters of the feature, and so this is being done as a full break. If it turns out there are critical uses where recompilation is not an option for some reason, we can consider using ABI tags to ease the transition.	2021-12-06 12:52:01 -05:00
Zahira Ammarguellat	fd759d42c9	Revert "The _Float16 type is supported on x86 systems with SSE2 enabled." This reverts commit `6623c02d70`. The change seems to be breaking build of compiler-rt on Debian.	2021-11-23 08:00:57 -05:00
Zahira Ammarguellat	6623c02d70	The _Float16 type is supported on x86 systems with SSE2 enabled. Operations are emulated by software emulation and “float” instructions. This patch is allowing the support of _Float16 type without the use of -max512fp16 flag. The final goal being, perform _Float16 emulation for all arithmetic expressions.	2021-11-19 08:59:50 -05:00
Aaron Ballman	3874277f41	Improve docs & test for #pragma clang attribute's any clause; NFC There was some confusion during the discussion of a patch as to whether `any` can be used to blast an attribute with no subject list onto basically everything in a program by not specifying a subrule. This patch adds documentation and tests to make it clear that this situation is not supported and will be diagnosed.	2021-11-17 08:24:26 -05:00
Shao-Ce SUN	0c660256eb	[NFC] Trim trailing whitespace in *.rst	2021-11-15 09:17:08 +08:00
Nathan Sidwell	da979f6cf8	[clang] Fix restructured markup	2021-11-09 18:41:22 -05:00
Nathan Sidwell	ae5c52b933	[clang] [docs] Fix markup code-block needs a blank line Differential Revision: https://reviews.llvm.org/D113425	2021-11-09 07:40:27 -08:00
Nathan Sidwell	bf6986d99e	[clang] GCC directive extension extension: Hash NNN lines Some time back I extended GCC's '# NNN' line marker semantics. Specifically popping to a blank filename will restore the filename to that of the popped-to include. Restore to line 5 of including file (escaped BOL #'s to avoid git eliding them): \# 5 "" 2 Added documentation for this line control extension. This was useful in developing modules tests, but turned out to also be useful with machine-generated source code. Specifically, a generated include file that itself includes fragments from elsewhere. The ability to pop to the generated include file -- with its full path prefix -- is useful for diagnostic & debug purposes. For instance something like: // Machine generated -- DO NOT EDIT Type Var = { \# 7 "encoded.dsl" 1 // push to snippet-container {snippet, of, code} \# 6 " 2 // Restore to machined-generated source , }; // user-code ... \#include "dsl.h" ... That pop to "" will restore the filename to '..includepath../dsl.h', which is better than restoring to plain "dsl.h". Differential Revision: https://reviews.llvm.org/D113425	2021-11-09 07:31:03 -08:00
Chuanqi Xu	ec117158a3	[Coroutines] [Frontend] Lookup in std namespace first Now in libcxx and clang, all the coroutine components are defined in std::experimental namespace. And now the coroutine TS is merged into C++20. So in the working draft like N4892, we could find the coroutine components is defined in std namespace instead of std::experimental namespace. And the coroutine support in clang seems to be relatively stable. So I think it may be suitable to move the coroutine component into the experiment namespace now. This patch would make clang lookup coroutine_traits in std namespace first. For the compatibility consideration, clang would lookup in std::experimental namespace if it can't find definitions in std namespace. So the existing codes wouldn't be break after update compiler. And in case the compiler found std::coroutine_traits and std::experimental::coroutine_traits at the same time, it would emit an error for it. The support for looking up std::experimental::coroutine_traits would be removed in Clang16. Reviewed By: lxfind, Quuxplusone Differential Revision: https://reviews.llvm.org/D108696	2021-11-04 11:53:47 +08:00
Kai Luo	6ea2431d3f	[clang][compiler-rt][atomics] Add `__c11_atomic_fetch_nand` builtin and support `__atomic_fetch_nand` libcall Add `__c11_atomic_fetch_nand` builtin to language extensions and support `__atomic_fetch_nand` libcall in compiler-rt. Reviewed By: theraven Differential Revision: https://reviews.llvm.org/D112400	2021-10-28 02:18:43 +00:00
Zahira Ammarguellat	e84c5419e2	Fix indentation and pragma name.	2021-10-26 13:58:43 -04:00
Florian Hahn	025988ded6	Specify Clang vector builtins. This patch specifies a set of vector builtins for Clang, as discussed on cfe-dev: https://lists.llvm.org/pipermail/cfe-dev/2021-September/068999.html https://lists.llvm.org/pipermail/cfe-dev/2021-October/069070.html Reviewed By: scanon Differential Revision: https://reviews.llvm.org/D111529	2021-10-26 15:34:31 +01:00
Nathan Sidwell	b8ff780f20	[clang][NFC] Correct doc markup Spotted when implementing an extension.	2021-10-13 04:20:15 -07:00
Erich Keane	9324cc2ca9	Change __builtin_sycl_unique_stable_name to just use an Itanium mangling After significant problems in our downstream with the previous implementation, the SYCL standard has opted to make using macros/etc to change kernel-naming-lambdas in any way UB (even passively). As a result, we are able to just emit the itanium mangling. However, this DOES require a little work in the CXXABI, as the microsoft and itanium mangler use different numbering schemes for lambdas. This patch adds a pair of mangling contexts that use the normal 'itanium' mangling strategy to fill in the "DeviceManglingNumber" used previously by CUDA. Differential Revision: https://reviews.llvm.org/D110281	2021-09-28 06:41:03 -07:00
Chris Bieneman	18cf5b220d	Fixing docs build I always forget that new line...	2021-09-27 14:16:28 -05:00
Chris Bieneman	1e48ef2035	Implement #pragma clang final extension This patch adds a new preprocessor extension ``#pragma clang final`` which enables warning on undefinition and re-definition of macros. The intent of this warning is to extend beyond ``-Wmacro-redefined`` to warn against any and all alterations to macros that are marked `final`. This warning is part of the ``-Wpedantic-macros`` diagnostics group. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D108567	2021-09-27 14:11:16 -05:00
Louis Dionne	79f8b5f0d0	Revert "[Coroutines] [Clang] Look up coroutine component in std namespace first" This reverts commit `2fbd254aa4`, which broke the libc++ CI. I'm reverting to get things stable again until we've figured out a way forward. Differential Revision: https://reviews.llvm.org/D108696	2021-09-03 16:01:09 -04:00
Chuanqi Xu	2fbd254aa4	[Coroutines] [Clang] Look up coroutine component in std namespace first Summary: Now in libcxx and clang, all the coroutine components are defined in std::experimental namespace. And now the coroutine TS is merged into C++20. So in the working draft like N4892, we could find the coroutine components is defined in std namespace instead of std::experimental namespace. And the coroutine support in clang seems to be relatively stable. So I think it may be suitable to move the coroutine component into the experiment namespace now. But move the coroutine component into the std namespace may be an break change. So I planned to split this change into two patch. One in clang and other in libcxx. This patch would make clang lookup coroutine_traits in std namespace first. For the compatibility consideration, clang would lookup in std::experimental namespace if it can't find definitions in std namespace and emit a warning in this case. So the existing codes wouldn't be break after update compiler. Test Plan: check-clang, check-libcxx Reviewed By: lxfind Differential Revision: https://reviews.llvm.org/D108696	2021-09-03 10:22:55 +08:00
Zahira Ammarguellat	cec7c2b32e	Revert "[CLANG][PATCH][FPEnv] Add support for option -ffp-eval-method and extend #pragma float_control similarly" The intent of this patch is to add support of -fp-model=[source\|double\|extended] to allow the compiler to use a wider type for intermediate floating point calculations. As a side effect to that, the value of FLT_EVAL_METHOD is changed according to the pragma float_control. Unfortunately some issue was uncovered with this change in preprocessing. See details in https://reviews.llvm.org/D93769 . We are therefore reverting this patch until we find a way to reconcile the value of FLT_EVAL_METHOD, the pragma and the -E flow. This reverts commit `66ddac22e2`.	2021-09-01 04:48:50 -07:00
Justas Janickas	9dc92bba6c	[OpenCL][NFC] Fix code example in __remove_address_space documentation.	2021-08-25 21:24:32 +01:00
Chris Bieneman	43de869d77	Implement #pragma clang restrict_expansion This patch adds `#pragma clang restrict_expansion ` to enable flagging macros as unsafe for header use. This is to allow macros that may have ABI implications to be avoided in headers that have ABI stability promises. Using macros in headers (particularly public headers) can cause a variety of issues relating to ABI and modules. This new pragma logs warnings when using annotated macros outside the main source file. This warning is added under a new diagnostics group -Wpedantic-macros Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D107095	2021-08-23 09:46:38 -07:00
Wang, Pengfei	6f7f5b54c8	[X86] AVX512FP16 instructions enabling 1/6 1. Enable FP16 type support and basic declarations used by following patches. 2. Enable new instructions VMOVW and VMOVSH. Ref.: https://software.intel.com/content/www/us/en/develop/download/intel-avx512-fp16-architecture-specification.html Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D105263	2021-08-10 12:46:01 +08:00
Justas Janickas	a5a2f05dcc	[C++4OpenCL] Introduces __remove_address_space utility This change provides a way to conveniently declare types that have address space qualifiers removed. Since OpenCL adds address spaces implicitly even when they are not specified in source, it is useful to allow deriving address space unqualified types. Fixes llvm.org/PR45326 Differential Revision: https://reviews.llvm.org/D106785	2021-08-06 10:40:22 +01:00
Chris Bieneman	f8819c109e	Fixing broken docs build Need an empty line after the code-block directive.	2021-07-29 12:45:56 -05:00
Chris Bieneman	26c695b789	Support macro deprecation #pragma clang deprecated This patch adds `#pragma clang deprecated` to enable deprecation of preprocessor macros. The macro must be defined before `#pragma clang deprecated`. When deprecating a macro a custom message may be optionally provided. Warnings are emitted at the use site of a deprecated macro, and can be controlled via the `-Wdeprecated` warning group. This patch takes some rough inspiration and a few lines of code from https://reviews.llvm.org/D67935. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D106732	2021-07-29 12:40:53 -05:00
Melanie Blower	66ddac22e2	[CLANG][PATCH][FPEnv] Add support for option -ffp-eval-method and extend #pragma float_control similarly The Intel compiler ICC supports the option "-fp-model=(source\|double\|extended)" which causes the compiler to use a wider type for intermediate floating point calculations. Also supported is a way to embed this effect in the source program with #pragma float_control(source\|double\|extended). This patch extends pragma float_control syntax, and also adds support for a new floating point option "-ffp-eval-method=(source\|double\|extended)". source: intermediate results use source precision double: intermediate results use double precision extended: intermediate results use extended precision Reviewed By: Aaron Ballman Differential Revision: https://reviews.llvm.org/D93769	2021-07-28 10:50:32 -04:00
Aaron Ballman	b0ef3d8f66	Allow #pragma float_control(push\|pop) within a language linkage specification Currently, we prohibit this pragma from appearing within a language linkage specification, but this is useful functionality that is supported by MSVC (which is where we inherited this feature from). This patch allows you to use the pragma within an extern "C" {} (etc) block.	2021-07-28 07:37:56 -04:00
Melanie Blower	d48ad358b1	Revert "[CLANG][PATCH][FPEnv] Add support for option -ffp-eval-method and extend #pragma float_control similarly" This reverts commit `ce8024e8ff`. There are a couple buildbot problems	2021-07-20 16:40:55 -04:00
Melanie Blower	ce8024e8ff	[CLANG][PATCH][FPEnv] Add support for option -ffp-eval-method and extend #pragma float_control similarly The Intel compiler ICC supports the option "-fp-model=(source\|double\|extended)" which causes the compiler to use a wider type for intermediate floating point calculations. Also supported is a way to embed this effect in the source program with #pragma float_control(source\|double\|extended). This patch extends pragma float_control syntax, and also adds support for a new floating point option "-ffp-eval-method=(source\|double\|extended)". source: intermediate results use source precision double: intermediate results use double precision extended: intermediate results use extended precision Reviewed By: Aaron Ballman Differential Revision: https://reviews.llvm.org/D93769	2021-07-20 16:02:09 -04:00
Yaxun (Sam) Liu	8fe058dbe4	[clang] Document llvm options controlling pragma unroll Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D105035	2021-07-12 16:42:50 -04:00
Saurabh Jha	c8f3f46c69	[Docs] Minor fixes with language extension docs There were some issues in the patch https://reviews.llvm.org/D104198. I also forgot to address one comment. This patch addresses these. Reviewed By: xgupta Differential Revision: https://reviews.llvm.org/D104971	2021-06-26 10:07:33 +01:00
Saurabh Jha	cd256c8bcc	Add documentation for compound assignment and type conversion of matrix types	2021-06-24 15:50:58 +01:00
Louis Dionne	97d234935f	[clang][Parse] Add parsing support for C++ attributes on using-declarations This is a re-application of `dc67299` which was reverted in `f63adf5b` because it broke the build. The issue should now be fixed. Attribution note: The original author of this patch is Erik Pilkington. I'm only trying to land it after rebasing. Differential Revision: https://reviews.llvm.org/D91630	2021-06-01 08:47:50 -04:00
Nico Weber	f63adf5b67	Revert "[clang][Parse] Add parsing support for C++ attributes on using-declarations" This reverts commit `dc672999a9`. Breaks check-clang everywhere, see https://reviews.llvm.org/D91630	2021-05-28 14:49:18 -04:00
Erik Pilkington	dc672999a9	[clang][Parse] Add parsing support for C++ attributes on using-declarations Differential Revision: https://reviews.llvm.org/D91630	2021-05-28 12:00:33 -04:00
Erich Keane	eba69b59d1	Reimplement __builtin_unique_stable_name- The original version of this was reverted, and @rjmcall provided some advice to architect a new solution. This is that solution. This implements a builtin to provide a unique name that is stable across compilations of this TU for the purposes of implementing the library component of the unnamed kernel feature of SYCL. It does this by running the Itanium mangler with a few modifications. Because it is somewhat common to wrap non-kernel-related lambdas in macros that aren't present on the device (such as for logging), this uniquely generates an ID for all lambdas involved in the naming of a kernel. It uses the lambda-mangling number to do this, except replaces this with its own number (starting at 10000 for readabililty reasons) for lambdas used to name a kernel. Additionally, this implements itself as constexpr with a slight catch: if a name would be invalidated by the use of this lambda in a later kernel invocation, it is diagnosed as an error (see the Sema tests). Differential Revision: https://reviews.llvm.org/D103112	2021-05-27 07:12:20 -07:00
Anastasia Stulova	237c6924bd	[OpenCL] Add clang extension for bit-fields. Allow use of bit-fields as a clang extension in OpenCL. The extension can be enabled using pragma directives. This fixes PR45339! Differential Revision: https://reviews.llvm.org/D101843	2021-05-24 12:42:17 +01:00
Ole Strohm	7d20f709ea	[OpenCL] [NFC] Fixed underline being too short in rst	2021-05-11 09:45:28 +01:00
Anastasia Stulova	e994e74bca	[OpenCL] Add clang extension for non-portable kernel parameters. Added __cl_clang_non_portable_kernel_param_types extension that allows using non-portable types as kernel parameters. This allows bypassing the portability guarantees from the restrictions specified in C++ for OpenCL v1.0 s2.4. Currently this only disables the restrictions related to the data layout. The programmer should ensure the compiler generates the same layout for host and device or otherwise the argument should only be accessed on the device side. This extension could be extended to other case (e.g. permitting size_t) if desired in the future. Patch by olestrohm (Ole Strohm)! https://reviews.llvm.org/D101168	2021-05-05 14:58:23 +01:00
Anastasia Stulova	8fb0d6df11	[OpenCL][Docs] Describe extension for legacy atomics with generic addr space. This extension is primarily targeting SPIR-V compilations flow as the IR translation is the same between 1.x and 2.x atomics. Differential Revision: https://reviews.llvm.org/D101089	2021-04-29 14:02:34 +01:00

1 2 3 4 5 ...

293 Commits