llvm-project

Commit Graph

Author	SHA1	Message	Date
Nico Weber	3109ce51d4	clang-cl: Expose -f[no-]delete-null-pointer-checks as clang-cl flag	2020-11-11 09:19:02 -05:00
Elvina Yakubova	624bced7ee	[OpenCL] Make Clang recognize -cl-std=1.0 as a value argument This patch makes Clang recognize -cl-std=1.0 as a value argument; previously, only -std=cl1.0 could be used. Fixes https://bugs.llvm.org/show_bug.cgi?id=47981 Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D91237	2020-11-11 17:01:57 +03:00
Roland McGrath	cf36142d34	[clang] Add missing header guard in <cpuid.h> This header has long lacked a standard multiple inclusion guard like other headers have, for no apparent reason. The GCC header of the same name likewise lacks one up through release 10.1, but trunk GCC (release 11, and perhaps future 10.x) has fixed it (see https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96238). Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D91226	2020-11-10 19:34:25 -08:00
Akira Hatanaka	d9258a21f0	Fix the data layout mangling specification for 'arm64-pc-win32-macho' rdar://problem/70410504	2020-11-10 18:52:12 -08:00
Richard Smith	c6d86b6b45	Properly collect template arguments from a class-scope function template specialization. Fixes a crash-on-valid if further template parameters are introduced within the specialization (by a generic lambda).	2020-11-10 15:55:19 -08:00
Akira Hatanaka	874b0a0b9d	[CodeGen] Mark calls to objc_autorelease as tail This enables a method sending an autorelease message to an object and returning the object in MRR to avoid adding the object to an autorelease pool if a call to objc_retainAutoreleasedReturnValue in the caller function accepts the hand off of the retain count. rdar://problem/50678052 Differential Revision: https://reviews.llvm.org/D91111	2020-11-10 13:48:25 -08:00
Xun Li	19f0770923	[Coroutine][Sema] Cleanup temporaries as early as possible The original bug was discovered in T75057860. Clang front-end emits an AST that looks like this for a co_await expression: |- ExprWithCleanups |- -CoawaitExpr |- -MaterializeTemporaryExpr ... Awaiter ... |- -CXXMemberCallExpr ... .await_ready ... |- -CallExpr ... __builtin_coro_resume ... |- -CXXMemberCallExpr ... .await_resume ... ExprWithCleanups is responsible for cleaning up (including calling dtors) the temporaries generated in the wrapping expression. In the above structure, the __builtin_coro_resume part corresponds to the code for the suspend case in the co_await with symmetric transfer; its pseudocode looks like this: __builtin_coro_resume( awaiter.await_suspend( from_address( __builtin_coro_frame())).address()); One of the temporaries that's generated as part of this code is the coroutine handle returned from awaiter.await_suspend() call. The call returns a handle which is a prvalue (since it's a returned value on the fly). In order to call the address() method on it, it needs to be converted into an xvalue. Hence a materialized temp is created to hold it. This temp will need to be cleaned up eventually. Now, since all cleanups happen at the end of the entire co_await expression, which is after the <coro.suspend> suspension point, the compiler will think that such a temp needs to live across suspensions, and needs to be put on the coroutine frame, even though it's only used temporarily just to call the address() method. Such a phenomenon not only unnecessarily increases the frame size, but can lead to ASAN failures if the coroutine was already destroyed as part of the await_suspend() call. This is because if the coroutine was already destroyed, the frame no longer exists, and one cannot store anything into it. But if the temporary object is considered to need to live on the frame, it will be stored into the frame after await_suspend() returns.
A fix attempt was done in https://reviews.llvm.org/D87470. Unfortunately it is incorrect. The reason is that cleanups in Clang work linearly rather than in a nested fashion. There is one current state indicating whether it needs cleanup, and an ExprWithCleanups resets that state. This means that an ExprWithCleanups must be capable of cleaning up all temporaries created in the wrapping expression, otherwise there will be dangling temporaries cleaned up at the wrong place. I eventually found a workaround (https://reviews.llvm.org/D89066) that doesn't break any existing tests while fixing the issue. But it targets the final co_await only. If we ever have a co_await that's not on the final awaiter and the frame gets destroyed after suspend, we are in trouble. Hence we need a proper fix. This patch is the proper fix. It does the following things to fully resolve the issue: 1. The AST has to be generated in the order according to their nesting relationship. We should not generate AST out of order because then the code generator would incorrectly track the state of temporaries and when a cleanup is needed. So the code in buildCoawaitCalls is reorganized so that we will be generating the AST for each co_await member call in order along with their child AST. 2. await_ready() call is wrapped with an ExprWithCleanups so that temporaries in it get cleaned up as early as possible to avoid living across suspension. 3. await_suspend() call is wrapped with an ExprWithCleanups if it's not a symmetric transfer. In the case of a symmetric transfer, in order to maintain the musttail call contract, the ExprWithCleanups is wrapped before the resume call. 4. In the end, we mark again that it needs a cleanup, so that the entire CoawaitExpr will be wrapped with an ExprWithCleanups which will clean up the Awaiter object associated with the await expression. Differential Revision: https://reviews.llvm.org/D90990	2020-11-10 13:27:42 -08:00
Yang Fan	703038b35a	[Sema] Fix volatile check when testing if a return object can be implicitly moved In C++11 standard, to become implicitly movable, the expression in return statement should be a non-volatile automatic object. CWG1579 changed the rule to require that the expression only needs to be an automatic object. C++14 standard and C++17 standard kept this rule unchanged. C++20 standard changed the rule back to require the expression be a non-volatile automatic object. This should be a typo in standards, and VD should be non-volatile. Differential Revision: https://reviews.llvm.org/D88295	2020-11-10 15:11:07 -05:00
Alexandre Rames	58c586e701	Allow searching for prebuilt implicit modules. This reverts commit `c67656b994`, and addresses the build issue.	2020-11-10 10:14:13 -08:00
Richard Smith	b637148ecb	[c++20] For P0732R2 / P1907R1: Basic code generation and name mangling support for non-type template parameters of class type and template parameter objects. The Itanium side of this follows the approach I proposed in https://github.com/itanium-cxx-abi/cxx-abi/issues/47 on 2020-09-06. The MSVC side of this was determined empirically by observing MSVC's output. Differential Revision: https://reviews.llvm.org/D89998	2020-11-09 22:10:27 -08:00
Qiu Chaofan	979a4d268a	[PowerPC] [Clang] Port SSE4.1-compatible insert intrinsics This patch adds three intrinsics compatible to x86's SSE 4.1 on PowerPC target, with tests: - _mm_insert_epi8 - _mm_insert_epi32 - _mm_insert_epi64 The intrinsics implementation is contributed by Paul Clarke. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D89242	2020-11-10 10:52:13 +08:00
Fangrui Song	e625f9c5d1	-fbasic-block-sections=list=: Suppress output if failed to open the file Reviewed By: tmsriram Differential Revision: https://reviews.llvm.org/D90815	2020-11-09 09:26:37 -08:00
Atmn Patel	fd3cad7a60	[clang] Fix ForStmt mustprogress handling D86841 had an error where for statements with no conditional were required to make progress. This is not true, this patch removes that line, and adds regression tests. Differential Revision: https://reviews.llvm.org/D91075	2020-11-09 11:38:06 -05:00
Tyker	d093401a26	[NFC] Remove string parameter of annotation attribute from AST children. This simplifies using annotation attributes when using clang as a library	2020-11-09 16:39:59 +01:00
Lucas Prates	c2c2cc1360	[ARM][AArch64] Adding Neoverse V1 CPU support Add support for the Neoverse V1 CPU to the ARM and AArch64 backends. This is based on patches from Mark Murray and Victor Campos. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D90765	2020-11-09 13:15:40 +00:00
Hubert Tong	09fc7796e5	[NFC][tests] Replace use of GNUisms in usage of diff ... the POSIX options suffice. This maintains compatibility with the system `diff` on platforms like AIX.	2020-11-08 12:07:51 -05:00
Arthur Eubanks	226e179f74	Revert "[NewPM] Provide method to run all pipeline callbacks, used for -O0" This reverts commit `ae38540042`. As well as some follow-up test fixes. The original change causes new-pass-manager.ll to fail when polly is enabled.	2020-11-08 00:32:35 -08:00
Melanie Blower	c511963d5a	[clang] Fix length threshold for MicrosoftMangle md5 hash Reviewers: rnk, dblaikie Differential Revision: https://reviews.llvm.org/D90714	2020-11-07 07:40:24 -08:00
Melanie Blower	b0de3f6787	[clang] Improve Microsoft mangling lit test with dblaikie's suggestions	2020-11-07 07:32:34 -08:00
Fangrui Song	d2da05de7c	[test] Fix Other/new-pass-manager.ll & clang/test/Misc/loop-opt-setup.c	2020-11-06 21:55:11 -08:00
Atmn Patel	d3e75d31e3	Revert "[CodeGen] Fixes sanitizer test" This reverts commit `b1878b4641`. This does fix the test but it means that `ac73b73c16` is not implemented correctly. Reverting for now, and will be reverting the commit that causes this to fail.	2020-11-07 00:32:12 -05:00
Atmn Patel	b1878b4641	[CodeGen] Fixes sanitizer test By turning the loop into an infinite one, the loop can't be deleted anymore so the test will continue to pass.	2020-11-06 23:53:38 -05:00
Atmn Patel	569abb530e	[LoopDeletion] Fixes failing test The commit `0b17c6e447` occasionally causes this test to fail, this fixes it.	2020-11-06 22:45:28 -05:00
Atmn Patel	0b17c6e447	[LoopDeletion] Allows deletion of possibly infinite side-effect free loops From C11 and C++11 onwards, a forward-progress requirement has been introduced for both languages. In the case of C, loops with non-constant conditionals that do not have any observable side-effects (as defined by 6.8.5p6) can be assumed by the implementation to terminate, and in the case of C++, this assumption extends to all functions. The clang frontend will emit the `mustprogress` function attribute for C++ functions (D86233, D85393, D86841) and emit the loop metadata `llvm.loop.mustprogress` for every loop in C11 or later that has a non-constant conditional. This patch modifies LoopDeletion so that only loops with the `llvm.loop.mustprogress` metadata or loops contained in functions that are required to make progress (`mustprogress` or `willreturn`) are checked for observable side-effects. If these loops do not have an observable side-effect, then we delete them. Loops without observable side-effects that do not satisfy the above conditions will not be deleted. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D86844	2020-11-06 22:06:58 -05:00
cchen	0cab91140f	[OpenMP5.0] map item can be non-contiguous for target update In order not to modify the `tgt_target_data_update` information but still be able to pass the extra information for non-contiguous map item (offset, count, and stride for each dimension), this patch overloads `arg` when the maptype is set as `OMP_MAP_DESCRIPTOR`. The original `arg` is for passing the pointer information; the overloaded `arg`, however, is an array of descriptor_dim: struct descriptor_dim { int64_t offset; int64_t count; int64_t stride; }; and the array size is the same as dimension size. In addition, since we have count and stride information in descriptor_dim, we can replace/overload the `arg_size` parameter by using dimension size. For supporting `stride` in array section, we use a dummy dimension in descriptor to store the unit size. The formula for computing the stride in dimension D_n: `unit size * (D_0 * D_1 ... * D_n-1) * D_n.stride`. Demonstrate how it works: ``` double arr[3][4][5]; D0: { offset = 0, count = 1, stride = 8 } // offset, count, dimension size are always 0, 1, 1 for this extra dimension, stride is the unit size D1: { offset = 0, count = 2, stride = 8 * 1 * 2 = 16 } // stride = unit size * (product of dimension size of D0) * D1.stride = 8 * 1 * 2 = 16 D2: { offset = 2, count = 2, stride = 8 * (1 * 5) * 1 = 40 } // stride = unit size * (product of dimension size of D0, D1) * D2.stride = 8 * 5 * 1 = 40 D3: { offset = 0, count = 2, stride = 8 * (1 * 5 * 4) * 2 = 320 } // stride = unit size * (product of dimension size of D0, D1, D2) * D3.stride = 8 * 20 * 2 = 320 // X here means we need to offload this data, therefore, runtime will transfer // data from offset 80, 96, 120, 136, 400, 416, 440, 456 // Runtime patch: https://reviews.llvm.org/D82245 // OOOOO OOOOO OOOOO // OOOOO OOOOO OOOOO // XOXOO OOOOO XOXOO // XOXOO OOOOO XOXOO ``` Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D84192	2020-11-06 21:04:37 -06:00
Elvina Yakubova	c9ca3a3c66	[AArch64] Add driver tests for HiSilicon's TSV110	2020-11-07 01:51:37 +03:00
Kevin P. Neal	2069403cdf	[FPEnv] Use strictfp metadata in casting nodes The strictfp metadata was added to the casting AST nodes in D85960, but we aren't using that metadata yet. This patch adds that support. In order to avoid lots of ad-hoc passing around of the strictfp bits I updated the IRBuilder when moving from a function that has the Expr* to a function that lacks it. I believe we should switch to this pattern to keep the strictfp support from being overly invasive. For the purpose of testing that we're picking up the right metadata, I also made my tests use a pragma to make the AST's strictfp metadata not match the global strictfp metadata. This exposes issues that we need to deal with in subsequent patches, and I believe this is the right method for most all of our clang strictfp tests. Differential Revision: https://reviews.llvm.org/D88913	2020-11-06 11:56:12 -05:00
David Spickett	aecd52b97b	[Clang][AArch64] Remove unused prefix in constrained rounding test This test was added in `7f38812d5b` and all the other tests make use of the COMMONIR check. So I think this was left in by mistake for this particular test. Reviewed By: kpn Differential Revision: https://reviews.llvm.org/D90921	2020-11-06 14:13:46 +00:00
Fangrui Song	247c5b5d69	[test] Properly test -Werror-implicit-function-declaration and -Wvec-elem-size Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D90874	2020-11-05 20:08:23 -08:00
Stella Stamenova	c67656b994	Revert "Allow searching for prebuilt implicit modules." This reverts commit `71e108cd86`. This change caused a build failure on Windows: http://lab.llvm.org:8011/#/builders/83/builds/570	2020-11-05 17:16:14 -08:00
Stanislav Mekhanoshin	4fcdfc4398	[AMDGPU] Simplify amdgpu-macros.cl test. NFC. Differential Revision: https://reviews.llvm.org/D90886	2020-11-05 16:29:16 -08:00
Michael Liao	23c6d1501d	[amdgpu] Add `llvm.amdgcn.endpgm` support. - `llvm.amdgcn.endpgm` is added to enable "abort" support. Differential Revision: https://reviews.llvm.org/D90809	2020-11-05 19:06:50 -05:00
Saleem Abdulrasool	e55157874c	APINotes: repair the Windows builders Disable the test on Windows, which should've been obvious as being needed. The differences in diff implementations and line-endings make this test difficult to execute on Windows.	2020-11-05 21:25:52 +00:00
Alexandre Rames	71e108cd86	Allow searching for prebuilt implicit modules. The behavior is controlled by the `-fprebuilt-implicit-modules` option, and allows searching for implicit modules in the prebuilt module cache paths. The current command-line options for prebuilt modules do not make it easy to maintain and use multiple versions of modules. Both the producer and users of prebuilt modules are required to know the relationships between compilation options and module file paths. Using a particular version of a prebuilt module requires passing a particular option on the command line (e.g. `-fmodule-file=[<name>=]<file>` or `-fprebuilt-module-path=<directory>`). However the compiler already knows how to distinguish and automatically locate implicit modules. Hence this proposal to introduce the `-fprebuilt-implicit-modules` option. When set, it enables searching for implicit modules in the prebuilt module paths (specified via `-fprebuilt-module-path`). To not modify existing behavior, this search takes place after the standard search for prebuilt modules. If not Here is a workflow illustrating how both the producer and consumer of prebuilt modules would need to know what versions of prebuilt modules are available and where they are located. clang -cc1 -x c modulemap -fmodules -emit-module -fmodule-name=foo -fmodules-cache-path=prebuilt_modules_v1 <config 1 options> clang -cc1 -x c modulemap -fmodules -emit-module -fmodule-name=foo -fmodules-cache-path=prebuilt_modules_v2 <config 2 options> clang -cc1 -x c modulemap -fmodules -emit-module -fmodule-name=foo -fmodules-cache-path=prebuilt_modules_v3 <config 3 options> clang -cc1 -x c use.c -fmodules -fmodule-map-file=modulemap -fprebuilt-module-path=prebuilt_modules_v1 <config 1 options> clang -cc1 -x c use.c -fmodules -fmodule-map-file=modulemap <non-prebuilt config options> With prebuilt implicit modules, the producer can generate prebuilt modules as usual, all in the same output directory.
The same mechanisms as for implicit modules take care of incorporating hashes in the path to distinguish between module versions. Note that we do not specify the output module filename with `-o`, so implicit modules are generated in the cache path `prebuilt_modules`. clang -cc1 -x c modulemap -fmodules -emit-module -fmodule-name=foo -fmodules-cache-path=prebuilt_modules <config 1 options> clang -cc1 -x c modulemap -fmodules -emit-module -fmodule-name=foo -fmodules-cache-path=prebuilt_modules <config 2 options> clang -cc1 -x c modulemap -fmodules -emit-module -fmodule-name=foo -fmodules-cache-path=prebuilt_modules <config 3 options> The user can now simply enable prebuilt implicit modules and point to the prebuilt modules cache. No need to "parse" command-line options to decide what prebuilt modules (paths) to use. clang -cc1 -x c use.c -fmodules -fmodule-map-file=modulemap -fprebuilt-module-path=prebuilt_modules -fprebuilt-implicit-modules <config 1 options> clang -cc1 -x c use.c -fmodules -fmodule-map-file=modulemap -fprebuilt-module-path=prebuilt_modules -fprebuilt-implicit-modules <non-prebuilt config options> This is particularly useful, for example, in a use case where compilation is expensive, and the configurations expected to be used are predictable, but not controlled by the producer of prebuilt modules. Modules for the set of predictable configurations can be prebuilt, and using them does not require "parsing" the configuration (command-line options). Reviewed By: Bigcheese Differential Revision: https://reviews.llvm.org/D68997	2020-11-05 13:10:53 -08:00
Jan Ole Hüser	d2e7dca5ca	[CodeGen] Fix Bug 47499: __unaligned extension inconsistent behaviour with C and C++ In C++, the keyword __unaligned (a Microsoft extension) had no effect on pointers. The reason for the difference between C and C++: in C, the method getAsCXXRecordDecl() returns nullptr, which guarantees that hasUnaligned() is called. In C++, it is not guaranteed that hasUnaligned() is called and evaluated. Here are some links: The Bug: https://bugs.llvm.org/show_bug.cgi?id=47499 Thread on the cfe-dev mailing list: http://lists.llvm.org/pipermail/cfe-dev/2020-September/066783.html Diff that introduced the check hasUnaligned() in getNaturalTypeAlignment(): https://reviews.llvm.org/D30166 Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D90630	2020-11-05 12:57:17 -08:00
Albion Fung	1af037f643	[PowerPC] Correct cpsgn's behaviour on PowerPC to match that of the ABI This patch fixes the reversed behaviour exhibited by cpsgn on PPC. It now matches the ABI. Differential Revision: https://reviews.llvm.org/D84962	2020-11-05 15:35:14 -05:00
Saleem Abdulrasool	82f86ae01a	APINotes: add APINotesYAMLCompiler This adds the skeleton of the YAML Compiler for APINotes. This change only adds the YAML IO model for the API Notes along with a new testing tool `apinotes-test` which can be used to verify that we can round-trip the YAML content properly. It provides the basis for the future work which will add a binary serialization and deserialization format to the data model. This is based on the code contributed by Apple at https://github.com/llvm/llvm-project-staging/tree/staging/swift/apinotes. Differential Revision: https://reviews.llvm.org/D88859 Reviewed By: Gabor Marton	2020-11-05 18:55:13 +00:00
Fangrui Song	c6a384df1f	[Sema] Special case -Werror-implicit-function-declaration and reject other -Werror- This is the only -Werror- form warning option GCC supports (gcc/c-family/c.opt). Fortunately no other form is used anywhere.	2020-11-05 10:25:30 -08:00
Erich Keane	6b104ea4b4	Implement Lambda Conversion Operators for All CCs for MSVC. As described here: https://devblogs.microsoft.com/oldnewthing/20150220-00/?p=44623 In order to allow Lambdas to be used with traditional Win32 APIs, they emit a conversion function for (what Raymond Chen claims is all) a number of the calling conventions. Through experimentation, we discovered that the list isn't quite 'all'. This patch implements this by taking the list of conversions that MSVC emits (across 'all' architectures, I don't see any CCs on ARM), then emits them if they are supported by the current target. However, we also add 3 other options (which may be duplicates): free-function, member-function, and operator() calling conventions. We do this because we have an extension where we generate both free and member for these cases so that people specifying a calling convention on the lambda will have the expected behavior when specifying one of those two. MSVC doesn't seem to permit specifying calling-convention on lambdas, but we do, so we need to make sure those are emitted as well. We do this so that clang-only conventions are supported if the user specifies them. Differential Revision: https://reviews.llvm.org/D90634	2020-11-05 07:25:44 -08:00
Sven van Haastregt	8ac9bcc746	[OpenCL] Support vec_step in C++ for OpenCL mode Enable the vec_step builtin in C++ for OpenCL mode for compatibility with OpenCL C. Differential Revision: https://reviews.llvm.org/D90766	2020-11-05 12:02:59 +00:00
Arthur Eubanks	5fd3193c88	[test] Add 'REQUIRES: bpf-registered-target' to bpf-O0.c	2020-11-04 23:19:14 -08:00
Arthur Eubanks	ae38540042	[NewPM] Provide method to run all pipeline callbacks, used for -O0 Some targets may add required passes via TargetMachine::registerPassBuilderCallbacks(). We need to run those even under -O0. As an example, BPFTargetMachine adds BPFAbstractMemberAccessPass, a required pass. This also allows us to clean up BackendUtil.cpp (and out-of-tree Rust usage of the NPM) by allowing us to share added passes like coroutines and sanitizers between -O0 and other optimization levels. Tests are a continuation of those added in https://reviews.llvm.org/D89083. In order to prevent TargetMachines from adding unnecessary optimization passes at -O0, TargetMachine::registerPassBuilderCallbacks() will be changed to take an OptimizationLevel, but that will be done separately. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D89158	2020-11-04 22:27:16 -08:00
Atmn Patel	ac73b73c16	[clang] Add mustprogress and llvm.loop.mustprogress attribute deduction Since C++11, the C++ standard has a forward progress guarantee [intro.progress], so all such functions must have the `mustprogress` requirement. In addition, from C11 and onwards, loops without a non-zero constant conditional or no conditional are also required to make progress (C11 6.8.5p6). This patch implements these attribute deductions so they can be used by the optimization passes. Differential Revision: https://reviews.llvm.org/D86841	2020-11-04 22:03:14 -05:00
Baptiste Saleil	f976ba6139	[PowerPC] Add Sema checks for MMA types The use of the new types introduced for PowerPC MMA instructions needs to be restricted. We add a PowerPC function checking that the given type is valid in a context in which we don't allow MMA types. This function is called from various places in Sema where we want to prevent the use of these types. Differential Revision: https://reviews.llvm.org/D82035	2020-11-04 17:01:47 -06:00
cchen	d0d43b58b1	[OpenMP] target nested `use_device_ptr() if()` and is_device_ptr trigger asserts Clang now asserts for the below case: ``` void clang::CodeGen::CGOpenMPRuntime::createOffloadEntriesAndInfoMetadata(): Assertion `std::get<0>(E) && "All ordered entries must exist!"' failed. ``` Clang hits the assert because in `emitTargetDataCalls`, both `BeginThenGen` and `BeginElseGen` call `registerTargetRegionEntryInfo` and try to register the Entry in OffloadEntriesTargetRegion with the same key. If the expression in the if clause is changed to any constant expression, the assert disappears. (https://godbolt.org/z/TW7haj) The assert itself is to prevent the user from accessing elements out of bounds inside `OrderedEntries` in `createOffloadEntriesAndInfoMetadata`. In this patch, I add a check in `registerTargetRegionEntryInfo` to avoid registering the target region more than once. A test case that triggers the assert: https://godbolt.org/z/4cnGW8 Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D90704	2020-11-04 12:36:57 -06:00
Qiu Chaofan	7faf62a80b	[Clang] Add more fp128 math library function builtins Since glibc has supported math library functions conforming to IEEE 128-bit floating point types on some platforms (like ppc64le), we can fix clang's math builtins missing this type. Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D90593	2020-11-04 17:58:42 +08:00
Richard Smith	09b54e2799	When re-checking an already-substituted template argument, don't lose the reference-ness of the parameter's type.	2020-11-03 14:09:54 -08:00
Baptiste Saleil	daa127d77e	[PowerPC] Add MMA builtin decoding and definitions Add MMA builtin decoding. These builtins use the new PowerPC-specific types __vector_pair and __vector_quad. So to avoid pervasive changes, we use custom type descriptors and custom decoding for these builtins. We also use custom code generation to expand builtin calls with pointers to simpler intrinsic calls with non-pointer types. Differential Revision: https://reviews.llvm.org/D81748	2020-11-03 15:08:46 -06:00
Ben Dunbobbin	7ad6010f58	Fix - [Clang] Add the ability to map DLL storage class to visibility `415f7ee883` had a silly typo introduced when I inlined some code into a loop from its own function. Original commit message: For PlayStation we offer source code compatibility with Microsoft's dllimport/export annotations; however, our file format is based on ELF. To support this we translate from DLL storage class to ELF visibility at the end of codegen in Clang. Other toolchains have used similar strategies (e.g. see the documentation for this ARM toolchain: https://developer.arm.com/documentation/dui0530/i/migrating-from-rvct-v3-1-to-rvct-v4-0/changes-to-symbol-visibility-between-rvct-v3-1-and-rvct-v4-0) This patch adds the ability to perform this translation. Options are provided to support customizing the mapping behaviour. Differential Revision: https://reviews.llvm.org/D89970	2020-11-03 19:13:54 +00:00
Artem Belevich	be86b6773b	[CUDA] Allow local static variables with target attributes. While CUDA documentation claims that such variables are not allowed[1], NVCC has been accepting them since CUDA-10.0[2] and some headers in CUDA-11 rely on this working. 1. https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#static-variables-function 2. https://godbolt.org/z/zsodzc Differential Revision: https://reviews.llvm.org/D88345	2020-11-03 10:30:38 -08:00
Tim Renouf	89d41f3a2b	[AMDGPU] Add gfx1033 target Differential Revision: https://reviews.llvm.org/D90447 Change-Id: If2650fc7f31bbdd49c76e74a9ca8e3734d769761	2020-11-03 16:27:48 +00:00
Tim Renouf	ee3e642627	[AMDGPU] Add gfx90c target This differentiates the Ryzen 4000/4300/4500/4700 series APUs that were previously included in gfx909. Differential Revision: https://reviews.llvm.org/D90419 Change-Id: Ia901a7157eb2f73ccd9f25dbacec38427312377d	2020-11-03 16:27:43 +00:00
Yaxun (Sam) Liu	abd8cd9199	[CUDA][HIP] Fix linkage for -fgpu-rdc Currently, for explicit template function instantiation in CUDA/HIP device compilation, clang emits the instantiated kernel with external linkage and the instantiated device function with internal linkage. This is fine for -fno-gpu-rdc since there is only one TU. However, for -fgpu-rdc this causes duplicate symbols for kernels if the same instantiation happens in multiple TUs, or missing symbols if a device function calls an explicitly instantiated template function in a different TU. To make explicit template function instantiation work for -fgpu-rdc we need to follow the C++ linkage paradigm, i.e. use weak_odr linkage. Differential Revision: https://reviews.llvm.org/D90311	2020-11-03 08:07:19 -05:00
Martin Storsjö	d3bd06f5c7	[clang] Fix the fsanitize.c testcase after `eaae6fdf67`. NFC. After that commit, the vptr sanitizer is enabled for mingw targets.	2020-11-03 10:21:29 +02:00
Martin Storsjö	eaae6fdf67	[clang] [MinGW] Allow using the vptr sanitizer Differential Revision: https://reviews.llvm.org/D90572	2020-11-03 09:59:09 +02:00
Stephan Bergmann	7a5184ed95	[scan-build] Fix clang++ pathname again `e00629f777` "[scan-build] Fix clang++ pathname" had removed the -MAJOR.MINOR suffix, but since presumably LLVM 7 the suffix is only -MAJOR, so ClangCXX (i.e., the CLANG_CXX environment variable passed to clang/tools/scan-build/libexec/ccc-analyzer) now contained a non-existing /path/to/clang-12++ (which apparently went largely unnoticed as clang/tools/scan-build/libexec/ccc-analyzer falls back to just 'clang++' if the executable denoted by CLANG_CXX does not exist). For the new clang/test/Analysis/scan-build/cxx-name.test to be effective, %scan-build must now take care to pass the clang executable's resolved pathname (i.e., ending in .../clang-MAJOR rather than just .../clang) to --use-analyzer. Differential Revision: https://reviews.llvm.org/D89481	2020-11-03 08:17:17 +01:00
Serge Pavlov	ee63acc37e	Put back the test pragma-fp-exc.cpp This test was removed in `5963e028e7` because it failed on cores where support of constrained intrinsics was limited. Now this test is enabled only on x86.	2020-11-03 13:18:40 +07:00
Alex Lorenz	701456b523	[darwin] add support for __isPlatformVersionAtLeast check for if (@available) The __isPlatformVersionAtLeast routine is an implementation of `if (@available)` check that uses the _availability_version_check API on Darwin that's supported on macOS 10.15, iOS 13, tvOS 13 and watchOS 6. Differential Revision: https://reviews.llvm.org/D90367	2020-11-02 16:28:09 -08:00
Ben Dunbobbin	ae9231ca2a	Reland - [Clang] Add the ability to map DLL storage class to visibility `415f7ee883` had LIT test failures on any build where the clang executable was not called "clang". I have adjusted the LIT CHECKs to remove the binary name to fix this. Original commit message: For PlayStation we offer source code compatibility with Microsoft's dllimport/export annotations; however, our file format is based on ELF. To support this we translate from DLL storage class to ELF visibility at the end of codegen in Clang. Other toolchains have used similar strategies (e.g. see the documentation for this ARM toolchain: https://developer.arm.com/documentation/dui0530/i/migrating-from-rvct-v3-1-to-rvct-v4-0/changes-to-symbol-visibility-between-rvct-v3-1-and-rvct-v4-0) This patch adds the ability to perform this translation. Options are provided to support customizing the mapping behaviour. Differential Revision: https://reviews.llvm.org/D89970	2020-11-02 23:24:49 +00:00
Artem Belevich	0a3ebb4d8d	Revert "[CUDA] Allow local static variables with target attributes." This reverts commit `f38a9e5117` Which triggered assertions.	2020-11-02 15:09:07 -08:00
Artem Belevich	f38a9e5117	[CUDA] Allow local static variables with target attributes. While CUDA documentation claims that such variables are not allowed[1], NVCC has been accepting them since CUDA-10.0[2] and some headers in CUDA-11 rely on this working. 1. https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#static-variables-function 2. https://godbolt.org/z/zsodzc Differential Revision: https://reviews.llvm.org/D88345	2020-11-02 14:37:13 -08:00
Christopher Di Bella	ba18bc4925	[Sema] adds -Wfree-nonheap-object member var checks Checks to make sure that stdlib's (std::)free is being appropriately used for member variables. Differential Revision: https://reviews.llvm.org/D90269	2020-11-02 11:03:28 -08:00
Ben Dunbobbin	5024d3aa18	Revert "[Clang] Add the ability to map DLL storage class to visibility" This reverts commit `415f7ee883`. The added tests were failing on the build bots!	2020-11-02 17:33:54 +00:00
Ben Dunbobbin	415f7ee883	[Clang] Add the ability to map DLL storage class to visibility For PlayStation we offer source code compatibility with Microsoft's dllimport/export annotations; however, our file format is based on ELF. To support this we translate from DLL storage class to ELF visibility at the end of codegen in Clang. Other toolchains have used similar strategies (e.g. see the documentation for this ARM toolchain: https://developer.arm.com/documentation/dui0530/i/migrating-from-rvct-v3-1-to-rvct-v4-0/changes-to-symbol-visibility-between-rvct-v3-1-and-rvct-v4-0) This patch adds the ability to perform this translation. Options are provided to support customizing the mapping behaviour. Differential Revision: https://reviews.llvm.org/D89970	2020-11-02 17:08:23 +00:00
Kirstóf Umann	22e7182002	[analyzer][ReturnPtrRangeChecker] Fix a false positive on end() iterator ReturnPtrRange checker emits a report if a function returns a pointer which points out of the buffer. However, end() iterator of containers is always such a pointer, so this always results in a false positive report. This false positive case is now eliminated. This patch resolves these tickets: https://bugs.llvm.org/show_bug.cgi?id=20929 https://bugs.llvm.org/show_bug.cgi?id=25226 https://bugs.llvm.org/show_bug.cgi?id=27701 Patch by Tibor Brunner! Differential Revision: https://reviews.llvm.org/D83678	2020-11-02 16:41:17 +01:00
Ben Dunbobbin	ff2e24a741	[PS4] Support dllimport/export attributes For PS4 development we support dllimport/export annotations in source code. This patch enables the dllimport/export attributes on PS4 by adding a new function to query the triple for whether dllimport/export are used and using that function to decide whether these attributes are supported. This replaces the current method of checking if the target is Windows. This means we can drop the use of "TargetArch" in the .td file (which is an improvement as dllimport/export support isn't really a function of the architecture). I have included a simple codegen test to show that the attributes are accepted and have an effect on codegen for PS4. I have also enabled the DLLExportStaticLocal and DLLImportStaticLocal attributes, which we support downstream. However, I am unable to write a test for these attributes until other patches for PS4 dllimport/export handling land upstream. Whilst writing this patch I noticed that, as these attributes are internal, they do not need to be target specific (when these attributes are added internally in Clang the target specific checks have already been run); however, I think leaving them target specific is fine because it isn't harmful and they "really are" target specific even if that has no functional impact. Differential Revision: https://reviews.llvm.org/D90442	2020-11-02 14:25:34 +00:00
Teresa Johnson	95824be18f	[MemProf] Fix test failure on windows Fix failure in new test from 0949f96dc6521be80ebb8ebc1e1c506165c22aac: Don't match exact file path separator. Should fix: http://lab.llvm.org:8011/#/builders/119/builds/437/steps/9/logs/FAIL__Clang__memory-profile-filename_c	2020-11-01 19:06:50 -08:00
Teresa Johnson	0949f96dc6	[MemProf] Pass down memory profile name with optional path from clang Similar to -fprofile-generate=, add -fmemory-profile= which takes a directory path. This is passed down to LLVM via a new module flag metadata. LLVM in turn provides this name to the runtime via the new __memprof_profile_filename variable. Additionally, always pass a default filename (in $cwd if a directory name is not specified via the = form of the option). This is also consistent with the behavior of the PGO instrumentation. Since the memory profiles will generally be fairly large, it doesn't make sense to dump them to stderr. Also, importantly, the memory profiles will eventually be dumped in a compact binary format, which is another reason why it does not make sense to send these to stderr by default. Change the existing memprof tests to specify log_path=stderr when that was being relied on. Depends on D89086. Differential Revision: https://reviews.llvm.org/D89087	2020-11-01 17:38:23 -08:00
Fangrui Song	96289ce633	[test] Fix unused check prefixes in test/AST	2020-10-31 21:46:45 -07:00
Fangrui Song	1a51bde1b6	[test] Clean up test/Frontend/gnu-mcount.c and fix unused check prefixes	2020-10-31 21:33:46 -07:00
Mark de Wever	b231396122	[Sema] Diagnose annotating `if constexpr` with a likelihood attribute Adds a diagnostic when the user annotates an `if constexpr` with a likelihood attribute. The `if constexpr` statement is evaluated at compile time so the attribute has no effect. Annotating the accompanied `else` with a likelihood attribute has the same effect as annotating a generic statement. Since the attribute there is most likely not intended, a diagnostic will be issued. Since the attributes can't conflict, the "conflict" won't be diagnosed for an `if constexpr`. Differential Revision: https://reviews.llvm.org/D90336	2020-10-31 17:51:36 +01:00
Mark de Wever	b46fddf75f	[CodeGen] Implement [[likely]] and [[unlikely]] for while and for loop. The attribute has no effect on a do statement since the path of execution will always include its substatement. It adds a diagnostic when the attribute is used on an infinite while loop since the codegen omits the branch here. Since the likelihood attributes have no effect on a do statement no diagnostic will be issued for do [[unlikely]] {...} while(0); Differential Revision: https://reviews.llvm.org/D89899	2020-10-31 17:51:29 +01:00
Serge Pavlov	5963e028e7	Temporarily remove test CodeGen/pragma-fp-exc This test fails on buildbots where CPU architecture does not fully support constrained intrinsics.	2020-10-31 19:48:44 +07:00
Serge Pavlov	6021cbea4d	Add option 'exceptions' to pragma clang fp Pragma 'clang fp' is extended to support a new option, 'exceptions'. It allows to specify floating point exception behavior more flexibly. Differential Revision: https://reviews.llvm.org/D89849	2020-10-31 17:36:12 +07:00
Arthur Eubanks	5c31b8b94f	Revert "Use uint64_t for branch weights instead of uint32_t" This reverts commit `10f2a0d662`. More uint64_t overflows.	2020-10-31 00:25:32 -07:00
Fangrui Song	e2a1639c73	[test] Fix unused check prefixes in test/Driver Note, the deprecated AArch64 -msign-return-address= does not accept b-key. So delete the incorrect tests.	2020-10-31 00:14:59 -07:00
Liu, Chen3	756f597841	[X86] Support Intel avxvnni This patch mainly made the following changes: 1. Support AVX-VNNI instructions; 2. Introduce ExplicitVEXPrefix flag so that vpdpbusd/vpdpbusds/vpdpwssd/vpdpwssds instructions only use VEX encoding when the user explicitly adds the {vex} prefix. Differential Revision: https://reviews.llvm.org/D89105	2020-10-31 12:39:51 +08:00
Richard Smith	dd8297b066	PR42513: Fix handling of function definitions lazily instantiated from friends. When determining whether a function has a template instantiation pattern, look for other declarations of that function that were instantiated from a friend function definition, rather than assuming that checking for member specialization information on whichever declaration name lookup found will be sufficient.	2020-10-30 18:35:12 -07:00
Thomas Lively	a787e09779	[WebAssembly] Prototype i64x2.bitmask As proposed in https://github.com/WebAssembly/simd/pull/368. Differential Revision: https://reviews.llvm.org/D90514	2020-10-30 17:23:30 -07:00
Thomas Lively	0a512a555a	[WebAssembly] Prototype i64x2.eq As proposed in https://github.com/WebAssembly/simd/pull/381. Since it is still in the prototyping phase, it is only accessible via a target builtin function and a target intrinsic. Depends on D90504. Differential Revision: https://reviews.llvm.org/D90508	2020-10-30 16:38:15 -07:00
Thomas Lively	1cb0b56607	[WebAssembly] Prototype i64x2.widen_{low,high}_i32x4_{s,u} As proposed in https://github.com/WebAssembly/simd/pull/290. As usual, these instructions are available only via builtin functions and intrinsics while they are in the prototyping stage. Differential Revision: https://reviews.llvm.org/D90504	2020-10-30 15:44:04 -07:00
Keith Smiley	bbf02e18f5	[clang][NFC] Remove unused FileCheck prefix This is to enable --allow-unused-duplicates=false. This prefix appears to be outdated and intentionally unused. Differential Revision: https://reviews.llvm.org/D90430	2020-10-30 13:32:14 -07:00
Richard Smith	2177e4555a	PR47861: Expand dangling reference warning to look through copy construction, and to assume that assignment operators return *this.	2020-10-30 10:19:50 -07:00
Arthur Eubanks	10f2a0d662	Use uint64_t for branch weights instead of uint32_t CallInst::updateProfWeight() creates branch_weights with i64 instead of i32. To be more consistent everywhere and remove lots of casts from uint64_t to uint32_t, use i64 for branch_weights. Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D88609	2020-10-30 10:03:46 -07:00
Simon Pilgrim	973317cc5e	[CodeGen][X86] Remove unused check-prefix in constrained fma tests	2020-10-30 16:23:08 +00:00
Simon Pilgrim	365f46efeb	[CodeGen][X86] Remove unused check-prefix in movdir tests	2020-10-30 16:23:08 +00:00
Simon Pilgrim	c44846f537	[CodeGen][X86] Cleanup + fix unused check-prefixes in bmi tests	2020-10-30 16:13:54 +00:00
Simon Pilgrim	fe3d765ac7	[CodeGen][X86] Tidyup CHECKs on bitscan tests	2020-10-30 16:13:52 +00:00
Simon Pilgrim	5cdd470504	[CodeGen][X86] Remove unused check-prefix in bitscan tests	2020-10-30 16:13:50 +00:00
Simon Pilgrim	0ff9d8c8ba	[CodeGen][X86] Remove unused check-prefix in bswap tests	2020-10-30 16:13:49 +00:00
Simon Pilgrim	d7389f05ee	[CodeGen][X86] Cleanup + remove unused check-prefixes in avx union tests	2020-10-30 16:13:47 +00:00
Simon Pilgrim	bbe055dd73	[CodeGen][X86] Remove unused check-prefix in amx inline asm tests	2020-10-30 16:13:45 +00:00
Cullen Rhodes	58d3f0ea49	[clang][aarch64] Address various fixed-length SVE vector operations This patch adds tests and support for operations on SVE vectors created by the 'arm_sve_vector_bits' attribute, described by the Arm C Language Extensions (ACLE, version 00bet6, section 3.7.3.3) for SVE [1]. This covers the following: * VLSTs support the same forms of element-wise initialization as GNU vectors. * VLSTs support the same built-in C and C++ operators as GNU vectors. * Conditional and binary expressions containing GNU and SVE vectors (fixed or sizeless) are invalid since the ambiguity around the result type affects the ABI. No functional changes were required to support vector initialization and operators. The functional changes are to address unsupported conditional and binary expressions. [1] https://developer.arm.com/documentation/100987/latest Reviewed By: fpetrogalli Differential Revision: https://reviews.llvm.org/D88233	2020-10-30 15:10:54 +00:00
Melanie Blower	71bf9f07d5	[clang] add fexperimental-strict-floating-point to test cases that fail on arm and aarch not sure this will work due to commit rG13bfd89c4962	2020-10-30 07:30:06 -07:00
Erich Keane	ec809e4cfe	PR47372: Fix Lambda invoker calling conventions As mentioned in the defect, the lambda static invoker does not follow the calling convention of the lambda itself, which seems wrong. This patch ensures that the calling convention of operator() is passed onto the invoker and conversion-operator type. This is accomplished by extracting the calling-convention determination code out into a separate function in order to better reflect the 'thiscall' work, as well as somewhat better support the future implementation of https://devblogs.microsoft.com/oldnewthing/20150220-00/?p=44623 For any target (basically just win32) that has a different free and static function calling convention, this generates BOTH alternatives. This required some work to get the Windows mangler to work correctly for this, as well as some tie-breaking for the unary operators. Differential Revision: https://reviews.llvm.org/D89559	2020-10-30 06:39:55 -07:00
David Sherwood	cea69fa4dc	[SVE] Add fatal error for unnamed SVE variadic arguments We don't currently support passing unnamed variadic SVE arguments so I've added a fatal error if we hit such cases to prevent any silent ABI issues in future. Differential Revision: https://reviews.llvm.org/D90230	2020-10-30 13:35:47 +00:00
Melanie Blower	13bfd89c49	[clang][FPEnv] Diagnose Strict FP pragmas if target does not support StrictFP Reviewers: sepavloff, kpn, aaron.ballman Differential Revision: https://reviews.llvm.org/D90316	2020-10-30 06:11:25 -07:00
Liu, Chen3	00090a2b82	Support complex target features combinations This patch mainly does two things: 1. Adds support for parentheses, making the combination of target features more diverse; 2. Makes the priority of ',' higher than that of '\|' by default, which requires some changes to the PTX builtin functions. Differential Revision: https://reviews.llvm.org/D89184	2020-10-30 10:32:53 +08:00
Aaron Puchert	bbed8cfe80	Thread safety analysis: Consider static class members as inaccessible This fixes the issue pointed out in D84604#2363134. For now we exclude static members completely, we'll take them into account later.	2020-10-30 00:35:14 +01:00
Thomas Lively	be6f50798e	[WebAssembly] Implement SIMD signselect instructions As proposed in https://github.com/WebAssembly/simd/pull/124, using the opcodes adopted by V8 in https://chromium-review.googlesource.com/c/v8/v8/+/2486235/2/src/wasm/wasm-opcodes.h. Uses new builtin functions and a new target intrinsic exclusively to ensure that the new instructions are only emitted when a user explicitly opts in to using them since they are still in the prototyping and evaluation phase. Differential Revision: https://reviews.llvm.org/D90357	2020-10-29 11:06:20 -07:00
Mircea Trofin	13aee94bc7	[ThinLTO] Fix empty .llvmcmd sections When passing -lto-embed-bitcode=post-merge-pre-opt, we were getting empty .llvmcmd sections. It turns out that is because the CodeGenOptions::CmdArgs field was only populated when clang saw -fembed-bitcode={all\|marker}. This patch always populates the CodeGenOptions::CmdArgs. The overhead of carrying it through in memory in all cases is likely negligible in the grand scheme of things, and it keeps the using code simple. Differential Revision: https://reviews.llvm.org/D90366	2020-10-29 09:57:42 -07:00
Jon Chesterfield	dee7704829	[AMDGPU] Add __builtin_amdgcn_grid_size Similar to D76772, loads the data from the dispatch pointer. Marked invariant. Patch also updates the openmp devicertl to use this builtin. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D90251	2020-10-29 16:25:13 +00:00
Kazushi (Jam) Marukawa	b5ac3721c8	[VE] Change to use integrated assembly by default We've implemented an integrated assembler. Now, we change to use the integrated assembler by default. Update a regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90396	2020-10-30 00:16:04 +09:00
Serge Pavlov	08bb5d9196	[FPEnv] Tests for rounding properties of constant evaluation These are moved from D88498. Differential Revision: https://reviews.llvm.org/D90026	2020-10-29 13:53:13 +07:00
Ben Shi	5be50d79c0	[NFC][clang][AVR] Add more devices Reviewed By: dylanmckay Differential Revision: https://reviews.llvm.org/D88352	2020-10-29 11:49:21 +08:00
Mircea Trofin	735ab4be35	[ThinLTO] Fix .llvmcmd emission llvm::EmbedBitcodeInModule needs (what used to be called) EmbedMarker set, in order to emit .llvmcmd. EmbedMarker is really about embedding the command line, so renamed the parameter accordingly, too. This was not caught at test because the check-prefix was incorrect, but FileCheck does not report that when multiple prefixes are provided. A separate patch will address that. Differential Revision: https://reviews.llvm.org/D90278	2020-10-28 17:45:30 -07:00
Derek Schuff	77973f8dee	[WebAssembly] Add support for DWARF type units Since Wasm comdat sections work similarly to ELF, we can use that mechanism to eliminate duplicate dwarf type information in the same way. Differential Revision: https://reviews.llvm.org/D88603	2020-10-28 17:41:22 -07:00
Amy Huang	7669f3c0f6	Recommit "[CodeView] Emit static data members as S_CONSTANTs." We used to only emit static const data members in CodeView as S_CONSTANTS when they were used; this patch makes it so they are always emitted. This changes CodeViewDebug.cpp to find the static const members from the class debug info instead of creating DIGlobalVariables in the IR whenever a static const data member is used. Bug: https://bugs.llvm.org/show_bug.cgi?id=47580 Differential Revision: https://reviews.llvm.org/D89072 This reverts commit `504615353f`.	2020-10-28 16:35:59 -07:00
Christopher Di Bella	425a83a5f0	[Sema] adds basic -Wfree-nonheap-object functionality Checks to make sure that stdlib's (std::)free is being appropriately used. Presently checks for the following misuses: - free(&stack_object) - free(stack_array) Differential Revision: https://reviews.llvm.org/D89988	2020-10-28 16:18:23 -07:00
Aaron Puchert	5dbccc6c89	Better source location for -Wignored-qualifiers on trailing return types We collect the source location of a trailing return type in the parser, improving the location for regular functions and providing a location for lambdas, where previously there was none. Fixes PR47732. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D90129	2020-10-28 23:32:57 +01:00
Richard Smith	09abecef7b	PR48002: Fix injection of elaborated-type-specifiers within local classes into the enclosing block scope. We weren't properly detecting whether the name would be injected into a block scope in the case where it was lexically declared in a local class.	2020-10-28 14:29:45 -07:00
Shilei Tian	0661328d7e	[Clang][OpenMP] Added the support for target data nowait Previously we added support for target nowait, but target data nowait has not been supported yet. In this patch, target data nowait will also be wrapped into a task. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D90099	2020-10-28 15:53:30 -04:00
Baptiste Saleil	40dd4d5233	[Clang][PowerPC] Add __vector_pair and __vector_quad types Define the __vector_pair and __vector_quad types that are used to manipulate the new accumulator registers introduced by MMA on PowerPC. Because these two types are specific to PowerPC, they are defined in a separate new file so it will be easier to add other PowerPC specific types if we need to in the future. Differential Revision: https://reviews.llvm.org/D81508	2020-10-28 13:19:20 -05:00
Thomas Lively	5b464f2aa5	[WebAssembly] Fix incorrectly named target builtin Rename __builtin_wasm_q15mulr_saturate_s_i8x16 to __builtin_wasm_q15mulr_saturate_s_i16x8, fixing the implied lane interpretation of the result.	2020-10-28 10:22:43 -07:00
Thomas Lively	31e944556f	[WebAssembly] Prototype extending multiplication SIMD instructions As proposed in https://github.com/WebAssembly/simd/pull/376. This commit implements new builtin functions and intrinsics for these instructions, but does not yet add them to wasm_simd128.h because they have not yet been merged to the proposal. These are the first instructions with opcodes greater than 0xff, so this commit updates the MC layer and disassembler to handle that correctly. Differential Revision: https://reviews.llvm.org/D90253	2020-10-28 09:38:59 -07:00
JonChesterfield	5d02ca49a2	[libomptarget][nvptx] Undef, weak shared variables Shared variables on nvptx, and LDS on amdgcn, are uninitialized at the start of kernel execution. Therefore create the variables with undef instead of zeros, motivated in part by the amdgcn back end rejecting LDS+initializer. Common is zero initialized, which seems incompatible with shared. Thus change them to weak, following the direction of https://reviews.llvm.org/rG7b3eabdcd215 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D90248	2020-10-28 14:25:36 +00:00
Benjamin Kramer	207cf71fa9	Revert "[OpenMP] Add Passing in Original Declaration Names To Mapper API" This reverts commit `d981c7b758` and `a87d7b3d44`. Test fails under msan.	2020-10-28 13:58:14 +01:00
Derek Schuff	44eea0b1a7	Revert "[WebAssembly] Add support for DWARF type units" This reverts commit `bcb8a119df`.	2020-10-27 17:57:32 -07:00
Wei Wang	c4868700c5	[clang] Pass-through remarks options to linker Summary: Propagate driver commandline remarks options to linker when LTO is enabled. This gives novice user a convenient way to collect and filter remarks throughout a typical toolchain invocation with sample profile and LTO using single switch from the clang driver. A typical use of this option from clang command-line: * Using -Rpass* options to print remarks to screen: clang -fuse-ld=lld -flto=thin -fprofile-sample-use=foo_sample.txt -Rpass=inline -Rpass-missed=inline -Rpass-analysis=inline -fdiagnostics-show-hotness -fdiagnostics-hotness-threshold=100 -o foo foo.cpp Remarks will be dumped to screen from both pre-lto and lto compilation. * Using serialized remarks options clang -fuse-ld=lld -flto=thin -fprofile-sample-use=foo_sample.txt -fsave-optimization-record -fdiagnostics-show-hotness -fdiagnostics-hotness-threshold=100 -o foo foo.cpp This will produce multiple yaml files containing optimization remarks: 1. foo.opt.yaml : remarks from pre-lto 2. foo.opt.ld.yaml.thin.1.yaml: remark during lto Differential Revision: https://reviews.llvm.org/D85810	2020-10-27 17:23:32 -07:00
Derek Schuff	bcb8a119df	[WebAssembly] Add support for DWARF type units Since Wasm comdat sections work similarly to ELF, we can use that mechanism to eliminate duplicate dwarf type information in the same way. Differential Revision: https://reviews.llvm.org/D88603	2020-10-27 17:13:41 -07:00
Joseph Huber	a87d7b3d44	[OpenMP] Add Passing in Original Declaration Names To Mapper API Summary: This patch adds support for passing in the original declaration name in the source file to the libomptarget runtime. This will allow the runtime to provide more intelligent debugging messages. This patch takes the original expression parsed from the OpenMP map / update clause and provides a textual representation if it was explicitly mapped, otherwise it takes the name of the variable declaration as a fallback. The information is passed to the runtime in a global array of strings that matches the existing ident_t source location strings using ";name;filename;column;row;;". See clang/test/OpenMP/target_map_names.cpp for an example of the generated output for a given map clause. Reviewers: jdoervert Differential Revision: https://reviews.llvm.org/D89802	2020-10-27 16:09:19 -04:00
Amy Huang	504615353f	Revert "[CodeView] Emit static data members as S_CONSTANTs." Seems like there's an assert in here that we shouldn't be running into. This reverts commit `515973222e`.	2020-10-27 11:29:58 -07:00
Tony	5984097823	[AMDGPU] Add missing support for targets - Add missing tests. Differential Revision: https://reviews.llvm.org/D90212	2020-10-27 15:36:31 +00:00
Nico Weber	2a4e704c92	Revert "Use uint64_t for branch weights instead of uint32_t" This reverts commit `e5766f25c6`. Makes clang assert when building Chromium, see https://crbug.com/1142813 for a repro.	2020-10-27 09:26:21 -04:00
Zahira Ammarguellat	e562a40871	Fix for PR47544. Clang is crashing after generating the right diagnostic for a re-declaration of a friend method. https://reviews.llvm.org/D88112	2020-10-27 05:57:39 -07:00
Haojian Wu	2c2dc7c392	[clang][RecoveryExpr] Add tests for ObjectiveC. to demonstrate it works for some cases. Differential Revision: https://reviews.llvm.org/D90140	2020-10-27 09:42:19 +01:00
Shilei Tian	d38788b357	[Clang][OpenMP] Avoid unnecessary privatization of mapper array when there is no user defined mapper In the current implementation, if an outer task is required, the mapper array is privatized whether or not there is a mapper. In fact, when there is no mapper, the mapper array only contains a number of nullptr entries. In libomptarget, the mapper array is used as `if (mappers_array && mappers_array[i])`, which means we can directly set the mapper array to nullptr if there is no mapper. This avoids unnecessary data copies. In this patch, the data privatization will not be emitted if the mapper array is nullptr. When the task body is emitted, the nullptr will be used directly. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D90101	2020-10-27 00:02:32 -04:00
Arthur Eubanks	e5766f25c6	Use uint64_t for branch weights instead of uint32_t CallInst::updateProfWeight() creates branch_weights with i64 instead of i32. To be more consistent everywhere and remove lots of casts from uint64_t to uint32_t, use i64 for branch_weights. Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D88609	2020-10-26 20:24:04 -07:00
Chandler Carruth	aaf7ffd4e1	Teach `-fsanitize=fuzzer` to respect `-static` and `-static-libstdc++` when adding C++ standard libraries. Summary: Makes linking the sanitizers follow the same logic as the rest of the driver with respect to the static linking strategy for the C++ standard library. Subscribers: mcrosier, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D80488	2020-10-27 01:36:54 +00:00
Richard Smith	a5c7b46862	Fix checking for C++98 ICEs in C++11-and-later mode to not consider use of a reference to be acceptable.	2020-10-26 16:59:48 -07:00
Amy Huang	515973222e	[CodeView] Emit static data members as S_CONSTANTs. We used to only emit static const data members in CodeView as S_CONSTANTS when they were used; this patch makes it so they are always emitted. I changed CodeViewDebug.cpp to find the static const members from the class debug info instead of creating DIGlobalVariables in the IR whenever a static const data member is used. Bug: https://bugs.llvm.org/show_bug.cgi?id=47580 Differential Revision: https://reviews.llvm.org/D89072	2020-10-26 15:30:35 -07:00
Kiran Chandramohan	c551ba0e90	Run test only if X86 target is available This fixes failures in AArch64 buildbots by running the clang/test/CodeGen/X86/att-inline-asm-prefix.c only when the X86 target is available.	2020-10-26 21:28:59 +00:00
Sriraman Tallam	ad1b9daa4b	Prepend "__uniq" to symbol names hash with -funique-internal-linkage-names. Prepend the module name hash with a fixed string ".__uniq." which helps tools that consume sampled profiles and attribute it to functions to understand that this symbol belongs to a unique internal linkage type symbol. Symbols with suffixes can result from various optimizations in the compiler. Function Multiversioning, function splitting, parameter constant propagation, unique internal linkage names. External tools like sampled profile aggregators combine profiles from multiple runs of a binary. They use various heuristics with symbols that have suffixes to try and attribute the profile to the right function instance. For instance multi-versioned symbols like foo.avx, foo.sse4.2, etc even though different should be attributed to the same source function if a single function is versioned, using attribute target_clones (supported in GCC but yet to land in LLVM). Similarly, functions that are split (split part having a .cold suffix) could have profiles for both the original and split symbols but would be aggregated and attributed to the original function that was split. Unique internal linkage functions however have different source instances and the aggregator must not put them together but attribute it to the appropriate function instance. To be sure that we are dealing with a symbol of a unique internal linkage function, we would like to prepend the hash with a known string ".__uniq." which these tools can check to understand the suffix type. Differential Revision: https://reviews.llvm.org/D89617	2020-10-26 14:24:28 -07:00
Xiangling Liao	357715ce97	[NFC] Remove max_align.c LIT testcase Since we fixed the definition of `SuitableAlign`[https://reviews.llvm.org/D88659], `max_align_t` and `__BIGGEST_ALIGNMENT__` are not necessarily the same always. The original testcase was added here: https://reviews.llvm.org/D59048 Differential Revision: https://reviews.llvm.org/D90187	2020-10-26 17:14:30 -04:00
Xiangling Liao	3d4aebbb9d	[AIX] Also error on -G for link-only step Error on -G on AIX for all modes(preprocess, assemble, compile, link). Differential Revision: https://reviews.llvm.org/D90063	2020-10-26 16:51:28 -04:00
Zequan Wu	e56e7bd469	Revert "Revert "Ensure that checkInitIsICE is called exactly once for every variable"" This reverts commit `a2ac64dd90`.	2020-10-26 12:08:57 -07:00
Zequan Wu	a2ac64dd90	Revert "Ensure that checkInitIsICE is called exactly once for every variable" This is causing `Assertion Result && "Could not evaluate expression"' failed` at https://bugs.chromium.org/p/chromium/issues/detail?id=1142009 This reverts commit `76c0092665`.	2020-10-26 11:59:55 -07:00
Nick Desaulniers	c8f84bd094	[Clang][CodeGen] fix failed assertion Ensure we can emit symbol aliases via function attribute even when function signatures contain incomplete types. Via bugreport: https://reviews.llvm.org/D66492#2350947 Reviewed By: erichkeane Differential Revision: https://reviews.llvm.org/D90073	2020-10-26 11:37:55 -07:00
Haojian Wu	efa9aaad70	[clang] Suppress "follow-up" diagnostics on recovery call expressions. Typo correction can transform the AST, and the transformed AST is only marginally useful for diagnostic purposes, so follow-up diagnostics usually do more harm than good (they easily cause confusion). Given the following code: ``` void abcc(); void test() { if (abc()); // diagnostic 1 (for the typo-correction): the typo is corrected to `abcc()`, so the code is treated as `if (abcc())` from the AST perspective; // diagnostic 2 (for the type mismatch): we perform type analysis on `if` and discover that the type does not match } ``` The secondary diagnostic "convertible to bool" is likely bogus to users. The idea is to use RecoveryExpr (clang's dependent mechanism) to preserve the recovery behavior but suppress all follow-up diagnostics. Differential Revision: https://reviews.llvm.org/D89946	2020-10-26 12:40:00 +01:00
Tyker	d3205bbca3	[Annotation] Allows annotation to carry some additional constant arguments. This allows using annotations in many more contexts than is currently possible, especially when combined with templates or constexpr. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D88645	2020-10-26 10:50:05 +01:00
Liu, Chen3	180548c5c7	[X86] VEX/EVEX prefix doesn't work for inline assembly. Currently, we lose the encoding information when using inline assembly: the encoding for the inline assembly stays at the default even if we add the vex/evex prefix. Differential Revision: https://reviews.llvm.org/D90009	2020-10-26 08:37:45 +08:00
Aaron Puchert	5250a03a99	Thread safety analysis: Consider global variables in scope Instead of just mutex members we also consider mutex globals. Unsurprisingly they are always in scope. Now the paper [1] says that > The scope of a class member is assumed to be its enclosing class, > while the scope of a global variable is the translation unit in > which it is defined. But I don't think we should limit this to TUs where a definition is available - a declaration is enough to acquire the mutex, and if a mutex is really limited in scope to a translation unit, it should probably be only declared there. The previous attempt in `9dcc82f34e` was causing false positives because I wrongly assumed that LiteralPtrs were always globals, which they are not. This should be fixed now. [1] https://static.googleusercontent.com/media/research.google.com/en/us/pubs/archive/42958.pdf Fixes PR46354. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D84604	2020-10-25 19:32:26 +01:00
Melanie Blower	576d436c82	Correct LIT test failure detected on buildbot after mibintc committed rG2e204e23911b: [clang] Enable support for #pragma STDC FENV_ACCESS D87528	2020-10-25 08:10:34 -07:00
Melanie Blower	2e204e2391	[clang] Enable support for #pragma STDC FENV_ACCESS Reviewers: rjmccall, rsmith, sepavloff Differential Revision: https://reviews.llvm.org/D87528	2020-10-25 06:46:25 -07:00
Richard Smith	f81f09ba89	[c++20] For P0732R2: Support string literal operator templates.	2020-10-25 00:34:15 -07:00
Richard Smith	7b3515880c	For P0732R2, P1907R1: ensure that template parameter objects don't refer to disallowed objects or have non-constant destruction.	2020-10-24 22:11:43 -07:00
Benjamin Kramer	39a0d6889d	[X86] Add a stub for Intel's alderlake. No scheduling, no autodetection.	2020-10-24 19:01:22 +02:00
Benjamin Kramer	bd2cf96c09	[X86] Add a stub for znver3 based on the little public information there is in AMD's manuals No scheduling, no autodetection. Just enough so -march=znver3 works.	2020-10-24 19:01:22 +02:00
Caroline Concatto	4c5906cffd	[Flang][Driver] Add infrastructure for basic frontend actions and file I/O This patch introduces the dependencies required to read and manage input files provided by the command line option. It also adds the infrastructure to create and write to output files. The output is sent to either stdout or a file (specified with the `-o` flag). Separately, in order to be able to test the code for file I/O, it adds infrastructure to create frontend actions. As a basic testable example, it adds the `InputOutputTest` FrontendAction. The sole purpose of this action is to read a file from the command line and print it either to stdout or the output file. This action is run by using the `-test-io` flag also introduced in this patch (available for `flang-new` and `flang-new -fc1`). With this patch: ``` flang-new -test-io input-file.f90 ``` will read input-file.f90 and print it in the output file. The `InputOutputTest` frontend action has been introduced primarily to facilitate testing. It is hidden from users (i.e. it's only displayed with `--help-hidden`). Currently Clang doesn’t have an equivalent action. `-test-io` is used to trigger the InputOutputTest action in the Flang frontend driver. This patch makes sure that “flang-new” forwards it to “flang-new -fc1" by creating a preprocessor job. However, in Flang.cpp, `-test-io` is passed to “flang-new -fc1” without `-E`. This way we make sure that the preprocessor is _not_ run in the frontend driver. This is the desired behaviour: `-test-io` should only read the input file and print it to the output stream. co-authored-by: Andrzej Warzynski <andrzej.warzynski@arm.com> Differential Revision: https://reviews.llvm.org/D87989	2020-10-24 14:58:32 +01:00
Richard Smith	ccca93b5a2	Don't allow structured binding declarations to decompose a lambda-expression's captures. The built-in structured binding rules for classes require that all fields can be accessed by name, and the fields introduced for lambda captures are unnamed, so decomposing a capturing lambda is ill-formed.	2020-10-23 16:28:25 -07:00
Akira Hatanaka	71e1a56de1	[CodeGen] Emit destructor calls to destruct non-trivial C struct temporaries created by conditional and assignment operators rdar://problem/64989559 Differential Revision: https://reviews.llvm.org/D83448	2020-10-23 14:46:17 -07:00
Richard Smith	cb9b9842d3	PR47954 / DR2126: permit temporary objects that are lifetime-extended by variables that are usable in constant expressions to themselves be usable in constant expressions.	2020-10-23 14:29:18 -07:00
Nick Desaulniers	b7926ce6d7	[IR] add fn attr for no_stack_protector; prevent inlining on mismatch It's currently ambiguous in IR whether the source language explicitly did not want a stack protector (in C, via function attribute no_stack_protector) or doesn't care for any given function. It's common for code that manipulates the stack via inline assembly or that has to set up its own stack canary (such as the Linux kernel) to want to avoid stack protectors in certain functions. In this case, we've been bitten by numerous bugs where a callee with a stack protector is inlined into an __attribute__((__no_stack_protector__)) caller, which generally breaks the caller's assumptions about not having a stack protector. LTO exacerbates the issue. While developers can avoid this by putting all no_stack_protector functions in one translation unit together and compiling those with -fno-stack-protector, it's generally not as ergonomic as a function attribute, and still doesn't work for LTO. See also: https://lore.kernel.org/linux-pm/20200915172658.1432732-1-rkir@google.com/ https://lore.kernel.org/lkml/20200918201436.2932360-30-samitolvanen@google.com/T/#u Typically, when inlining a callee into a caller, the caller will be upgraded in its level of stack protection (see adjustCallerSSPLevel()). By adding an explicit attribute in the IR when the function attribute is used in the source language, we can now identify such cases and prevent inlining. Block inlining when the callee and caller differ in the case that one contains `nossp` when the other has `ssp`, `sspstrong`, or `sspreq`. Fixes pr/47479. Reviewed By: void Differential Revision: https://reviews.llvm.org/D87956	2020-10-23 11:55:39 -07:00
Xiangling Liao	05bef88eb3	[AIX] Let alloca return 16 bytes alignment On AIX, to support vector types, which should always be 16 bytes aligned, we set alloca to return 16 bytes aligned memory space. Differential Revision: https://reviews.llvm.org/D89910	2020-10-23 14:41:32 -04:00
Artem Belevich	e7fe125b77	[CUDA] Extract CUDA version from cuda.h if version.txt is not found If the CUDA version cannot be determined based on the version.txt file, attempt to find the CUDA_VERSION macro in cuda.h. This is a follow-up to D89752. Differential Revision: https://reviews.llvm.org/D89832	2020-10-23 10:03:30 -07:00
Artem Belevich	65d206484c	[CUDA] Improve clang's ability to detect recent CUDA versions. CUDA-11.1 does not carry version.txt which causes clang to assume that it's CUDA-7.0, which used to be the only CUDA version w/o version.txt. In order to tell CUDA-7.0 apart from the new versions, clang now probes for the presence of libdevice.10.bc which is not present in the old CUDA versions. This should keep Clang working for CUDA-11.1. PR47332: https://bugs.llvm.org/show_bug.cgi?id=47332 Differential Revision: https://reviews.llvm.org/D89752	2020-10-23 10:03:29 -07:00
Richard Smith	af189c8ab1	Fix constant evaluation of zero-initialization of a union whose first FieldDecl is an unnamed bitfield. Unnamed bitfields aren't non-static data members, so such a bitfield isn't actually the first non-static data member.	2020-10-22 17:03:59 -07:00
Xiangling Liao	0ba9843397	[AIX] Emit error for -G option on AIX 1. Emit error for -G driver option on AIX 2. Adjust cmake file to use -Wl,-G instead of -G On AIX, legacy XL compiler uses -G to produce a shared object enabled for use with the run-time linker, which has different meanings from what it is used for in Clang. And in Clang, other targets do not have -G map to another functionality in their legacy compiler. So this error is more important when we are on AIX. Differential Revision: https://reviews.llvm.org/D89897	2020-10-22 16:16:39 -04:00
Venkataramanan Kumar	57cdc52c4d	Initial support for vectorization using Libmvec (GLIBC vector math library) Differential Revision: https://reviews.llvm.org/D88154	2020-10-22 16:01:39 -04:00
Jonathan Crowther	9bc02e892f	[SystemZ][z/OS] Set short-enums as the default for z/OS This patch sets short-enums to be the default for z/OS. Reviewed By: abhina.sreeskantharajan Differential Revision: https://reviews.llvm.org/D89801	2020-10-22 14:15:58 -04:00
Marco Antognini	a779a16993	[OpenCL] Remove unused extensions Many non-language extensions are defined but also unused. This patch removes them with their tests as they do not require compiler support. The cl_khr_select_fprounding_mode extension is also removed because it has been deprecated since OpenCL 1.1 and Clang doesn't have any specific support for it. The cl_khr_context_abort extension is only referred to in "The OpenCL Specification", version 1.2 and 2.0, in Table 4.3, but no specification is provided in "The OpenCL Extension Specification" for these versions. Because it is both unused in Clang and lacks specification, this extension is removed. The following extensions are platform extensions that bring new OpenCL APIs but do not impact the kernel language nor require compiler support. They are therefore removed. - cl_khr_gl_sharing, introduced in OpenCL 1.0 - cl_khr_icd, introduced in OpenCL 1.2 - cl_khr_gl_event, introduced in OpenCL 1.1 Note: this extension adds a new API to create cl_event but it also specifies that these can only be used by clEnqueueAcquireGLObjects. Hence, they cannot be used on the device side and the extension does not impact the kernel language. - cl_khr_d3d10_sharing, introduced in OpenCL 1.1 - cl_khr_d3d11_sharing, introduced in OpenCL 1.2 - cl_khr_dx9_media_sharing, introduced in OpenCL 1.2 - cl_khr_image2d_from_buffer, introduced in OpenCL 1.2 - cl_khr_initialize_memory, introduced in OpenCL 1.2 - cl_khr_gl_depth_images, introduced in OpenCL 1.2 Note: this extension is related to cl_khr_depth_images but only the latter adds new features to the kernel language. - cl_khr_spir, introduced in OpenCL 1.2 - cl_khr_egl_event, introduced in OpenCL 1.2 Note: this extension adds a new API to create cl_event but it also specifies that these can only be used by clEnqueueAcquire* API functions. Hence, they cannot be used on the device side and the extension does not impact the kernel language. 
- cl_khr_egl_image, introduced in OpenCL 1.2 - cl_khr_terminate_context, introduced in OpenCL 1.2 The minimum required OpenCL version used in OpenCLExtensions.def for these extensions is not always correct. Removing them addresses that issue. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D89372	2020-10-22 17:01:31 +01:00
Tianqing Wang	be39a6fe6f	[X86] Add User Interrupts(UINTR) instructions For more details about these instructions, please refer to the latest ISE document: https://software.intel.com/en-us/download/intel-architecture-instruction-set-extensions-programming-reference. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D89301	2020-10-22 17:33:07 +08:00
Alex Lorenz	de1016ce5c	[driver][arm64] Set target CPU to A12 for compiler invocations that target Apple Silicon macOS machines Differential Revision: https://reviews.llvm.org/D82699	2020-10-21 23:35:27 -07:00
Richard Smith	5e2c9a05b7	Fix test failure on Windows.	2020-10-21 20:02:37 -07:00
Xiang1 Zhang	7c3fea7721	[X86] Support customizing stack protector guard Reviewed By: nickdesaulniers, MaskRay Differential Revision: https://reviews.llvm.org/D88631	2020-10-22 10:08:14 +08:00
Richard Smith	8156074352	Ensure that the "value" of an unnamed bit-field isn't taken into account when determining the identity of a class NTTP.	2020-10-21 18:51:55 -07:00
Richard Smith	e97e9851b2	[c++20] For P0732R2: permit class template argument deduction for non-type template parameters.	2020-10-21 15:03:22 -07:00
Richard Smith	caf30e7f03	[c++20] For P0732R2: Give class NTTPs the proper type when examined with 'decltype'. This requires that we track enough information to determine the original type of the parameter in a substituted non-type template parameter, to distinguish the reference-to-class case from the class case.	2020-10-21 14:15:54 -07:00
Joseph Huber	cd4a4ae97a	[OpenMP] Fixing OpenMP/driver.c failing on 32-bit hosts The changes made in D88594 caused the test OpenMP/driver.c to fail on a 32-bit host because it was offloading to a 64-bit architecture by default. The offloading test was moved to a new file and a feature was added to the lit config to check for a 64-bit host. Reviewed By: daltenty Differential Revision: https://reviews.llvm.org/D89904	2020-10-21 17:01:36 -04:00
Sriraman Tallam	eef2e67d23	Simple fix to basic-block-sections to replace emit-obj with emit-llvm emit-obj is unnecessary here and further wasn't redirected to /dev/null.	2020-10-21 13:52:33 -07:00
Richard Smith	0c417d4bef	Add more test coverage for APValue serialization / deserialization and fix a few exposed bugs.	2020-10-21 13:21:41 -07:00
Richard Smith	ba4768c966	[c++20] For P0732R2 / P1907R1: Basic frontend support for class types as non-type template parameters. Create a unique TemplateParamObjectDecl instance for each such value, representing the globally unique template parameter object to which the template parameter refers. No IR generation support yet; that will follow in a separate patch.	2020-10-21 13:21:41 -07:00
Tyker	cf34dd0c4e	[clang] Improve Serialization/Importing/Dumping of APValues Changes: - Initializer expressions of constexpr variables are now wrapped in a ConstantExpr. This is mainly used for testing purposes. The old caching system has not yet been removed. - Add all the missing Serialization and Importing for APValue. - Improve dumping of APValue when ASTContext isn't available. - Clean up leftovers from the last patch. - Add tests for Import and serialization. Differential Revision: https://reviews.llvm.org/D63640	2020-10-21 19:03:13 +02:00
John Brawn	ba60de5250	Use -### in arm-float-abi.c test This is needed to prevent the test from failing when llvm is configured so that the arm target is not present, which is the case for some buildbots.	2020-10-21 17:40:02 +01:00
Michael Liao	1bcec29afb	Only run when `arm` is registered. NFC.	2020-10-21 09:30:07 -04:00
David Zarzycki	87f6de72bc	[clang testing] Fix a read-only source build system failure	2020-10-21 08:08:03 -04:00
Florian Hahn	c50f0d239d	[Clang] Update newpm pipeline test in clang after D87322. This fixes a test failure because a LLVM pipeline test file in clang/ did not get updated in `88241ffb56`.	2020-10-21 12:59:50 +01:00
John Brawn	0c66606230	[Driver] Incorporate -mfloat-abi in the computed triple on ARM LLVM assumes that when it creates a call to a C library function it can use the C calling convention. On ARM the effective calling convention is determined from the target triple, however using -mfloat-abi=hard on ARM means that calls to (and definitions of) C library functions use the arm_aapcs_vfpcc calling convention which can result in a mismatch. Fix this by incorporating -mfloat-abi into the target triple, similar to how -mbig-endian and -march/-mcpu are. This only works for EABI targets and not Android or iOS, but there the float abi is fixed so instead give an error. Fixes PR45524 Differential Revision: https://reviews.llvm.org/D89573	2020-10-21 11:19:38 +01:00
Jonas Paulsson	42a82862b6	Reapply "[clang] Improve handling of physical registers in inline assembly operands." Earlyclobbers are now excepted from this change (original commit: `c78da03`). Review: Ulrich Weigand, Nick Desaulniers Differential Revision: https://reviews.llvm.org/D87279	2020-10-21 10:53:40 +02:00
Fangrui Song	829b9f6606	[test] Fix -fbasic-block-sections= test on Windows after D89500	2020-10-20 18:31:28 -07:00
Richard Smith	15e772e8dc	Don't instantiate lambda closure types in default member initializers when instantiating the enclosing class. We'll build new lambda closure types if and when we instantiate the default member initializer, and instantiating the closure type by itself can go wrong in cases where we fully-instantiate nested classes (in explicit instantiations of the enclosing class and when the enclosing class is a local class) -- we will instantiate the 'operator()' as a regular function rather than as a lambda call operator, so it doesn't get to use its captures, has the wrong 'this' type, etc.	2020-10-20 17:37:07 -07:00
Richard Smith	6781fee085	Don't permit array bound constant folding in OpenCL. Non-standards-driven "do the best you can" constant-folding of array bounds is permitted solely as a GNU compatibility feature. We should not be doing it in any language mode that is attempting to be conforming. From https://reviews.llvm.org/D20090 it appears the intent here was to permit `__constant int` globals to be used in array bounds, but the change in that patch only added half of the functionality necessary to support that in the constant evaluator. This patch adds the other half of the functionality and turns off constant folding for array bounds in OpenCL. I couldn't find any spec justification for accepting the kinds of cases that D20090 accepts, so a reference to where in the OpenCL specification this is permitted would be useful. Note that this change also affects the code generation in one test: because after 'const int n = 0' we now treat 'n' as a constant expression with value 0, it's now a null pointer, so '(local int *)n' forms a null pointer rather than a zero pointer. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D89520	2020-10-20 16:52:28 -07:00
Sriraman Tallam	f88785460e	Improve file doesn't exist error with -fbasic-block-sections= With -fbasic-block-sections=, let the front-end handle the case where the file doesn't exist. The driver only checks if the option syntax is right. Differential Revision: https://reviews.llvm.org/D89500	2020-10-20 16:41:56 -07:00
Peter Collingbourne	c5acd3490b	Driver: Add integer sanitizers to trapping group automatically. In D86000 we added a new sanitizer to the integer group without adding it to the trapping group. This broke usage of -fsanitize=integer -fsanitize-trap=integer or -fsanitize=integer -fsanitize-minimal-runtime. I think we can reasonably expect any new integer sanitizers to be compatible with trapping and the minimal runtime, so add them to the trapping group automatically. Also add a test to ensure that any future additions of sanitizers to the integer group will most likely result in test failures which would lead to updates to the minimal runtime if necessary. For this particular sanitizer no updates are required because it uses the existing shift_out_of_bounds callback function. Differential Revision: https://reviews.llvm.org/D89766	2020-10-20 13:45:39 -07:00
Akira Hatanaka	b78045c2ce	Add a C++ test case for https://reviews.llvm.org/D86854 The test case was part of https://reviews.llvm.org/D82999, which was abandoned after https://reviews.llvm.org/D86854 fixed the bug.	2020-10-20 07:34:38 -07:00
sstefan1	fbfb1c7909	[IR] Make nosync, nofree and willreturn default for intrinsics. D70365 allows us to make attributes default. This is a follow up to actually make nosync, nofree and willreturn default. The approach we chose, for now, is to opt-in to default attributes to avoid introducing problems to target specific intrinsics. Intrinsics with default attributes can be created using `DefaultAttrsIntrinsic` class.	2020-10-20 11:57:19 +02:00
Richard Smith	08c8d5bc51	Properly track whether a variable is constant-initialized. This fixes miscomputation of __builtin_is_constant_evaluated in the initializer of a variable that's not usable in constant expressions, but is readable when constant-folding. If evaluation of a constant initializer fails, we throw away the evaluated result instead of keeping it as a non-constant-initializer value for the variable, because it might not be a correct value. To avoid regressions for initializers that are foldable but not formally constant initializers, we now try constant-evaluating some globals in C++ twice: once to check for a constant initializer (in a mode where is_constant_evaluated returns true) and again to determine the runtime value if the initializer is not a constant initializer.	2020-10-19 23:59:11 -07:00
Fangrui Song	2484e9159c	[Driver] Clean up -gz & --compress-debug-sections * Make cc1 and cc1as --compress-debug-sections an alias for --compress-debug-sections=zlib * Make -gz an alias for -gz=zlib The new behavior is consistent with GCC when binutils>=2.26 is detected: -gz is translated to --compress-debug-sections=zlib instead of --compress-debug-sections.	2020-10-19 23:06:33 -07:00
Fangrui Song	545c687c4b	[gcov] Unify driver and CC1 option names for -ftest-coverage & -fprofile-arcs No need to use -femit-coverage-notes and -femit-coverage-data.	2020-10-19 22:19:00 -07:00
Fangrui Song	0ab222e7d7	[gcov] Delete CC1 option -test-coverage The name is unfortunate because it is similar to the driver option -ftest-coverage. It turns out aside from one occurrence in a test, this option is not used.	2020-10-19 21:48:51 -07:00
Richard Smith	76c0092665	Ensure that checkInitIsICE is called exactly once for every variable for which it matters. This is a step towards separating checking for a constant initializer (in which std::is_constant_evaluated returns true) and any other evaluation of a variable initializer (in which it returns false).	2020-10-19 19:04:04 -07:00
Douglas Yung	774ab60125	Add option to use older clang ABI behavior when passing certain union types as function arguments Recently commit D78699 (commit `26cfb6e562`), fixed clang's behavior with respect to passing a union type through a register to correctly follow the ABI. However, this is an ABI breaking change with earlier versions of the clang compiler, so we should add an -fclang-abi-compat option to address this. Additionally, the PS4 ABI requires the older behavior, so that is added as well. This change adds a Ver11 value to the ClangABI enum that when it is set (or the target is the PS4 triple), we skip the ABI fix introduced in D78699. Differential Revision: https://reviews.llvm.org/D89747	2020-10-19 18:17:34 -07:00
Yaxun (Sam) Liu	52bcd691cb	Recommit "[CUDA][HIP] Defer overloading resolution diagnostics for host device functions" This recommits `7f1f89ec8d` and `40df06cdaf` with bug fixes for memory sanitizer failure and Tensile build failure.	2020-10-19 17:48:04 -04:00
Martin Storsjö	5eece137bc	[clang] Automatically link against oldnames just as linking against libcmt Differential Revision: https://reviews.llvm.org/D89702	2020-10-20 00:07:00 +03:00
Joseph Huber	24df30efda	[OpenMP] Fixing OpenMP/driver.c failing on 32-bit hosts The changes made in D88594 caused the test OpenMP/driver.c to fail on a 32-bit host because it was offloading to a 64-bit architecture by default. The offloading test was moved to a new file and a feature was added to the lit config to check for a 64-bit host. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D89696	2020-10-19 13:41:53 -04:00
Tony	89d71970cb	[AMDGPU] Extend hip-toolchain-features.hip test - Extend hip-toolchain-features.hip to also check that the lld attributes are passed correctly. - Add check for cumode attributes. Differential Revision: https://reviews.llvm.org/D89636	2020-10-19 17:11:08 +00:00
Hans Wennborg	0628bea513	Revert "[PM/CC1] Add -f[no-]split-cold-code CC1 option to toggle splitting" This broke Chromium's PGO build, it seems because hot-cold-splitting got turned on unintentionally. See comment on the code review for repro etc. > This patch adds -f[no-]split-cold-code CC1 options to clang. This allows > the splitting pass to be toggled on/off. The current method of passing > `-mllvm -hot-cold-split=true` to clang isn't ideal as it may not compose > correctly (say, with `-O0` or `-Oz`). > > To implement the -fsplit-cold-code option, an attribute is applied to > functions to indicate that they may be considered for splitting. This > removes some complexity from the old/new PM pipeline builders, and > behaves as expected when LTO is enabled. > > Co-authored by: Saleem Abdulrasool <compnerd@compnerd.org> > Differential Revision: https://reviews.llvm.org/D57265 > Reviewed By: Aditya Kumar, Vedant Kumar > Reviewers: Teresa Johnson, Aditya Kumar, Fedor Sergeev, Philip Pfaffe, Vedant Kumar This reverts commit `273c299d5d`.	2020-10-19 12:31:14 +02:00
Haojian Wu	1e32df2f91	[clang-rename] Fix rename on variable templates. This patch adds support for renaming variable templates. Differential Revision: https://reviews.llvm.org/D89300	2020-10-19 09:44:59 +02:00
Haojian Wu	45a15dc682	[clang-rename] Fix rename on function template specializations. Previously, we failed to rename occurrences of explicit function template specializations. Differential Revision: https://reviews.llvm.org/D89221	2020-10-19 09:32:17 +02:00
Richard Smith	094e9f4779	PR47893: Synthesis of a comparison operator from an 'operator<=>' inherits the SFINAEness of its enclosing context.	2020-10-18 14:15:12 -07:00
Richard Smith	79cb179b14	PR47870: Properly mangle placeholders for deduced class template specializations that have no deduced type.	2020-10-18 13:57:41 -07:00
Hubert Tong	126094485a	[PowerPC][AIX] Make `__vector [un]signed long` an error The semantics associated with `__vector [un]signed long` are neither consistently specified nor consistently implemented. The IBM XL compilers on AIX traditionally treated these as deprecated aliases for the corresponding `__vector int` type in both 32-bit and 64-bit modes. The newer, Clang-based, IBM XL compilers on AIX make usage of the previously deprecated types an error. This is also consistent with IBM XL C/C++ for Linux on Power (on little endian distributions). In line with the above, this patch upgrades (on AIX) the deprecation of `__vector long` to become removal. Reviewed By: ZarkoCA Differential Revision: https://reviews.llvm.org/D89443	2020-10-18 12:39:16 -04:00
Mark de Wever	2bcda6bb28	[Sema, CodeGen] Implement [[likely]] and [[unlikely]] in SwitchStmt This implements the likelihood attribute for the switch statement. Based on the discussion in D85091 and D86559 it only handles the attribute when placed on the case labels or the default labels. It also marks the likelihood attribute as feature complete. There are more QoI patches in the pipeline. Differential Revision: https://reviews.llvm.org/D89210	2020-10-18 13:48:42 +02:00
Richard Smith	d4aac67859	Make the check for whether we should memset(0) an aggregate initialization a little smarter. Look through casts that preserve zero-ness when determining if an initializer is zero, so that we can handle cases like an {0} initializer whose corresponding field is a type other than 'int'.	2020-10-16 16:48:22 -07:00
Albion Fung	d30155feaa	[PowerPC] Implementation of 128-bit Binary Vector Rotate builtins This patch implements 128-bit Binary Vector Rotate builtins for PowerPC10. Differential Revision: https://reviews.llvm.org/D86819	2020-10-16 18:03:22 -04:00
Richard Smith	552c6c2328	PR44406: Follow behavior of array bound constant folding in more recent versions of GCC. Old GCC used to aggressively fold VLAs to constant-bound arrays at block scope in GNU mode. That's non-conforming, and more modern versions of GCC only do this at file scope. Update Clang to do the same. Also promote the warning for this from off-by-default to on-by-default in all cases; more recent versions of GCC likewise warn on this by default. This is still slightly more permissive than GCC, as pointed out in PR44406, as we still fold VLAs to constant arrays in structs, but that seems justifiable given that we don't support VLA-in-struct (and don't intend to ever support it), but GCC does. Differential Revision: https://reviews.llvm.org/D89523	2020-10-16 14:34:35 -07:00
Richard Smith	7e801ca0ef	Treat constant contexts as being in the default rounding mode. This addresses a regression where pretty much all C++ compilations using -frounding-math now fail, due to rounding being performed in constexpr function definitions in the standard library. This follows the "manifestly constant evaluated" approach described in https://reviews.llvm.org/D87528#2270676 -- evaluations that are required to succeed at compile time are permitted even in regions with dynamic rounding modes, as are (unfortunately) the evaluation of the initializers of local variables of const integral types. Differential Revision: https://reviews.llvm.org/D89360	2020-10-16 13:26:15 -07:00
Richard Smith	48c70c1664	Extend memset-to-zero optimization to C++11 aggregate functional casts Aggr{...}. We previously missed these cases due to not stepping over the additional AST nodes representing their syntactic form.	2020-10-16 13:21:08 -07:00
Scott Linder	c4d10e7e9b	[AMDGPU][HIP] Switch default DWARF version to 5 Another attempt at this, see D59008 for previous attempt. Reviewed By: kzhuravl, t-tye Differential Revision: https://reviews.llvm.org/D89484	2020-10-16 17:53:27 +00:00
Matt Arsenault	0a7cd99a70	Reapply "OpaquePtr: Add type to sret attribute" This reverts commit `eb9f7c28e5`. Previously this was incorrectly handling linking of the contained type, so this merges the fixes from D88973.	2020-10-16 11:05:02 -04:00
Florian Hahn	51ff04567b	Recommit "[DSE] Switch to MemorySSA-backed DSE by default." After investigation by @asbirlea, the issue that caused the revert appears to be an issue in the original source, rather than a problem with the compiler. This patch enables MemorySSA DSE again. This reverts commit `915310bf14`.	2020-10-16 09:02:53 +01:00
Konstantin Schwarz	6030a07516	Fix hidden-redecls.m test for some environments This test was failing in our CI environment, because Jenkins mounts the workspaces into Docker containers using their full path, i.e. /home/jenkins/workspaces/llvm-build. We've seen permission denied errors because /home/jenkins is mounted with root permissions and the default cache directory under Linux is $HOME/.cache. The fix is to explicitly provide the -fmodules-cache-path, which the other tests already seem to provide. Reviewed By: akyrtzi Differential Revision: https://reviews.llvm.org/D89453	2020-10-16 09:51:13 +02:00
Kito Cheng	cfa7094e49	[RISCV] Add -mtune support - The goal of this patch is to improve option compatibility with RISC-V GCC; a patch for -mcpu support on the GCC side will be sent in the next few days. - -mtune only affects the pipeline model and non-arch/extension-related target features, e.g. instruction fusion; in the td file these are called TuneFeatures, which were introduced by the X86 back-end[1]. - -mtune accepts all valid options for -mcpu plus extra processor aliases, e.g. `generic`, `rocket` and `sifive-7-series`; the purpose is option compatibility with RISC-V GCC. - Processor aliases for -mtune resolve according to the current target arch, rv32 or rv64, e.g. `rocket` resolves to `rocket-rv32` or `rocket-rv64`. - Interaction between -mcpu and -mtune: * -mtune has higher priority than -mcpu for pipeline model and TuneFeatures. [1] https://reviews.llvm.org/D85165 Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D89025	2020-10-16 13:55:08 +08:00
Richard Smith	fc031d29be	Switch the default of VerifyIntegerConstantExpression from constant folding to not constant folding. Constant folding of ICEs is done as a GCC compatibility measure, but new code was picking it up, presumably by accident, due to the bad default. While here, also switch the flag from a bool to an enum to make it more obvious what it means at call sites. This highlighted a couple of places where our behavior is different between C++11 and C++14 due to switching from checking for an ICE to checking for a converted constant expression (where there is no 'fold' codepath).	2020-10-15 16:58:47 -07:00
Vedant Kumar	273c299d5d	[PM/CC1] Add -f[no-]split-cold-code CC1 option to toggle splitting This patch adds -f[no-]split-cold-code CC1 options to clang. This allows the splitting pass to be toggled on/off. The current method of passing `-mllvm -hot-cold-split=true` to clang isn't ideal as it may not compose correctly (say, with `-O0` or `-Oz`). To implement the -fsplit-cold-code option, an attribute is applied to functions to indicate that they may be considered for splitting. This removes some complexity from the old/new PM pipeline builders, and behaves as expected when LTO is enabled. Co-authored by: Saleem Abdulrasool <compnerd@compnerd.org> Differential Revision: https://reviews.llvm.org/D57265 Reviewed By: Aditya Kumar, Vedant Kumar Reviewers: Teresa Johnson, Aditya Kumar, Fedor Sergeev, Philip Pfaffe, Vedant Kumar	2020-10-15 23:13:33 +00:00
Fangrui Song	5a338599fb	[CGBuiltin] Respect asm labels and redefine_extname for builtins with specialized emitting rL131311 added `asm()` support for builtin functions, but `asm()` for builtins with specialized emitting (e.g. memcpy, various math functions) still do not work. This patch makes these functions work for `asm()` and `#pragma redefine_extname`. glibc uses `asm()` to redirect internal libc function calls to hidden aliases. Limitation: such a function is a builtin in clang, but will not be recognized as a libcall in optimization passes because Clang does not annotate the renamed function as a libcall. In GCC -O1 or above, `abs` can be optimized out but we can't. Additionally, we cannot redirect `__builtin_sin` to `real_sin` in the following example: double sin(double x) asm("real_sin"); double f(double d) { return __builtin_sin(d); } --- According to @rsmith, the following three statements cannot be simultaneously true: (1) The frontend function foo has known, builtin semantics X. (2) The symbol foo has known, builtin semantics X. (3) It's not correct to lower a call to the frontend function foo to the symbol foo. People do want (1) (if it is profitable to expand a memcpy, do it). This also means that people do not want to add -fno-builtin-memcpy. People do want (3): that is why they use asm("__GI_memcpy") in the first place. So unfortunately we make a compromise by not refuting (2) (see the limitation above). For most libcalls, there is a small loss because compilers don't synthesize them. For the few glibc cares about, it uses `asm("memcpy = __GI_memcpy");` to make the assembly level redirection. (Changing function names (e.g. `__memcpy`) is a hit to ergonomics which is not acceptable). Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D88712	2020-10-15 15:14:38 -07:00
Reid Kleckner	5fbab4025e	[MS] Apply `inreg` to AArch64 sret parms on instance methods The documentation rules indicate that instance methods should return large, trivially copyable aggregates via X1/X0 and not X8 as is normally done when returning such structs from free functions: https://docs.microsoft.com/en-us/cpp/build/arm64-windows-abi-conventions?view=vs-2019#return-values Fixes PR47836, a bug in the initial implementation of these rules. I tried to simplify the logic a bit as well while I'm here. Differential Revision: https://reviews.llvm.org/D89362	2020-10-15 14:54:42 -07:00
Yaxun (Sam) Liu	e384e94fbe	Revert "[HIP] Change default --gpu-max-threads-per-block value to 1024" This reverts commit `187658b8a6` due to AMDGPU backend issues.	2020-10-15 17:25:55 -04:00
Leonard Chan	79829a4704	Revert "[clang] Add -fc++-abi= flag for specifying which C++ ABI to use" This reverts commits `683b308c07` and `8487bfd4e9`. We will go for a more restricted approach that does not give freedom to everyone to change ABIs on whichever platform. See the discussion on https://reviews.llvm.org/D85802.	2020-10-15 14:24:38 -07:00
Thomas Lively	1992e30c2d	[WebAssembly] Prototype i8x16.popcnt As proposed at https://github.com/WebAssembly/simd/pull/379. Use a target builtin and intrinsic rather than normal codegen patterns to make the instruction opt-in until it is merged to the proposal and stabilized in engines. Differential Revision: https://reviews.llvm.org/D89446	2020-10-15 21:18:22 +00:00
Richard Smith	68f116aa23	PR47864: Fix assertion in pointer-to-member emission if there are multiple declarations of the same base class.	2020-10-15 13:51:51 -07:00
Stanislav Mekhanoshin	d1beb95d12	[AMDGPU] gfx1032 target Differential Revision: https://reviews.llvm.org/D89487	2020-10-15 12:41:18 -07:00
Thomas Lively	3f738d1f5e	Reland "[WebAssembly] v128.load{8,16,32,64}_lane instructions" This reverts commit `7c8385a352` with a typing fix to an instruction selection pattern.	2020-10-15 19:32:34 +00:00
Erik Pilkington	351317167e	[SemaObjC] Fix composite pointer type calculation for `void*` and pointer to lifetime qualified ObjC pointer type Fixes a regression introduced in `9a6f4d451c`. rdar://70101809 Differential revision: https://reviews.llvm.org/D89475	2020-10-15 15:21:01 -04:00
Konstantin Zhuravlyov	67f189e93c	Make sure both cc1 and cc1as process -m[no-]code-object-v3 Differential Revision: https://reviews.llvm.org/D89478	2020-10-15 14:03:26 -04:00
Thomas Lively	7c8385a352	Revert "[WebAssembly] v128.load{8,16,32,64}_lane instructions" This reverts commit `7c6bfd90ab`.	2020-10-15 15:49:36 +00:00
Thomas Lively	7c6bfd90ab	[WebAssembly] v128.load{8,16,32,64}_lane instructions Prototype the newly proposed load_lane instructions, as specified in https://github.com/WebAssembly/simd/pull/350. Since these instructions are not available to origin trial users on Chrome stable, make them opt-in by only selecting them from intrinsics rather than normal ISel patterns. Since we only need rough prototypes to measure performance right now, this commit does not implement all the load and store patterns that would be necessary to make full use of the offset immediate. However, the full suite of offset tests is included to make it easy to track improvements in the future. Since these are the first instructions to have a memarg immediate as well as an additional immediate, the disassembler needed some additional hacks to be able to parse them correctly. Making that code more principled is left as future work. Differential Revision: https://reviews.llvm.org/D89366	2020-10-15 15:33:10 +00:00
Simon Pilgrim	d7fa9030d4	[CodeGen][X86] Emit fshl/fshr ir intrinsics for shiftleft128/shiftright128 ms intrinsics Now that funnel shift handling is pretty good, we can use the intrinsics directly and avoid a lot of zext/trunc issues. https://godbolt.org/z/YqhnnM Differential Revision: https://reviews.llvm.org/D89405	2020-10-15 10:22:41 +01:00
Richard Smith	9dbb0886ea	Perform lvalue conversions on the left of a pseudo-destructor call 'p->~T()'. Previously we failed to convert 'p' from array/function to pointer type, and to represent the load of 'p' in the AST. The latter causes problems for constant evaluation.	2020-10-14 22:09:01 -07:00
Richard Smith	f7f2e4261a	PR47805: Use a single object for a function parameter in the caller and callee in constant evaluation. We previously made a deep copy of function parameters of class type when passing them, resulting in the destructor for the parameter applying to the original argument value, ignoring any modifications made in the function body. This also meant that the 'this' pointer of the function parameter could be observed changing between the caller and the callee. This change completely reimplements how we model function parameters during constant evaluation. We now model them roughly as if they were variables living in the caller, albeit with an artificially reduced scope that covers only the duration of the function call, instead of modeling them as temporaries in the caller that we partially "reparent" into the callee at the point of the call. This brings some minor diagnostic improvements, as well as significantly reduced stack usage during constant evaluation.	2020-10-14 17:43:51 -07:00
Dave Lee	4cb4db11ee	Revert "[ASTImporter] Fix crash caused by unset AttributeSpellingListIndex" This broke the GreenDragon build, due to the following error while running TestImportBuiltinFileID: ``` Ignored/unknown shouldn't get here UNREACHABLE executed at tools/clang/include/clang/Sema/AttrSpellingListIndex.inc:13! ``` See http://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/24213/ This reverts commit `73c6beb2f7`. This reverts https://reviews.llvm.org/D89318	2020-10-14 17:21:56 -07:00
Leonard Chan	8487bfd4e9	[clang][NFC] Change diagnostic to start with lowercase letter	2020-10-14 15:48:29 -07:00
Leonard Chan	683b308c07	[clang] Add -fc++-abi= flag for specifying which C++ ABI to use This implements the flag proposed in RFC http://lists.llvm.org/pipermail/cfe-dev/2020-August/066437.html. The goal is to add a way to override the default target C++ ABI through a compiler flag. This makes it easier to test and transition between different C++ ABIs through compile flags rather than build flags. In this patch: - Store `-fc++-abi=` in a LangOpt. This isn't stored in a CodeGenOpt because there are instances outside of codegen where Clang needs to know what the ABI is (particularly through ASTContext::createCXXABI), and we should be able to override the target default if the flag is provided at that point. - Expose the existing ABIs in TargetCXXABI as values that can be passed through this flag. - Create a .def file for these ABIs to make it easier to check flag values. - Add an error for diagnosing bad ABI flag values. Differential Revision: https://reviews.llvm.org/D85802	2020-10-14 12:31:21 -07:00
Christopher Di Bella	18432bea76	[Driver]: fix compiler-rt path when printing libgcc for baremetal clang --target arm-none-eabi --print-libgcc-file-name --rtlib=compiler-rt used to print `/path/to/lib/clang/version/lib/libclang_rt.builtins-arm.a` but should print `/path/to/lib/clang/version/lib/baremetal/libclang_rt.builtins-arm.a`. Similarly, --target armv7m-none-eabi should print libclang_rt.builtins-armv7m.a This matches the compiler-rt file name used at link time in the baremetal driver. Reviewed By: manojgupta Differential Revision: https://reviews.llvm.org/D89327	2020-10-14 10:29:35 -07:00
Konstantin Zhuravlyov	3fdf3b1539	AMDGPU: Update AMDHSA code object version handling Differential Revision: https://reviews.llvm.org/D89076	2020-10-14 13:04:27 -04:00
Simon Pilgrim	b967b9a711	[CodeGen] Move x86 specific ms intrinsic tests into x86 target subfolder. NFCI.	2020-10-14 17:37:26 +01:00
jasonliu	f85bcc21dd	[AIX] Turn -fdata-sections on by default in Clang Summary: This patch does the following: 1. Make InitTargetOptionsFromCodeGenFlags() accept Triple as a parameter, because some options' default values are triple dependent. 2. DataSections is turned on by default on AIX for llc. 3. Test cases change accordingly because of the default behaviour change. 4. Clang Driver passes in -fdata-sections by default on AIX. Reviewed By: MaskRay, DiggerLin Differential Revision: https://reviews.llvm.org/D88737	2020-10-14 15:58:31 +00:00
Gabor Marton	73c6beb2f7	[ASTImporter] Fix crash caused by unset AttributeSpellingListIndex During the import of attributes we forgot to set the spelling list index. This caused a segfault when we wanted to traverse the AST (e.g. by the dump() method). Differential Revision: https://reviews.llvm.org/D89318	2020-10-14 14:10:08 +02:00
Gabor Marton	dd965711c9	[ASTImporter] Fix crash caused by unimported type of FormatAttr During the import of FormatAttrs we forgot to import the type (e.g. `__scanf__`) of the attribute. This caused a segfault when we wanted to traverse the AST (e.g. by the dump() method). Differential Revision: https://reviews.llvm.org/D89319	2020-10-14 13:54:48 +02:00
Jonas Paulsson	625fa47617	Revert "[clang] Improve handling of physical registers in inline assembly operands." This reverts commit `c78da03778`. Temporarily reverted due to https://bugs.llvm.org/show_bug.cgi?id=47837.	2020-10-14 08:42:51 +02:00
Liu, Chen3	bd05afcb3f	[X86][NFC] Fix RUN line bug in the testcase The testcase added in D78699 doesn't work because of the wrong RUN line in the testcase. Differential Revision: https://reviews.llvm.org/D89361	2020-10-14 12:40:34 +08:00
Richard Smith	69f7c006ff	Revert "PR47805: Use a single object for a function parameter in the caller and" Breaks a clangd unit test. This reverts commit `8f8b9f2cca`.	2020-10-13 19:32:03 -07:00
Richard Smith	8f8b9f2cca	PR47805: Use a single object for a function parameter in the caller and callee in constant evaluation. We previously made a deep copy of function parameters of class type when passing them, resulting in the destructor for the parameter applying to the original argument value, ignoring any modifications made in the function body. This also meant that the 'this' pointer of the function parameter could be observed changing between the caller and the callee. This change completely reimplements how we model function parameters during constant evaluation. We now model them roughly as if they were variables living in the caller, albeit with an artificially reduced scope that covers only the duration of the function call, instead of modeling them as temporaries in the caller that we partially "reparent" into the callee at the point of the call. This brings some minor diagnostic improvements, as well as significantly reduced stack usage during constant evaluation.	2020-10-13 18:50:46 -07:00
Erik Pilkington	498c7fa48a	[SemaObjC] Fix a crash on an invalid ternary with ARC pointers FindCompositeObjCPointerType nulls out the subexpressions on error, so bail out instead of trying to deref them.	2020-10-13 21:20:20 -04:00
Richard Smith	ab870f3030	Revert "PR47805: Use a single object for a function parameter in the caller and" The buildbots are displeased. This reverts commit `8d03a972ce`.	2020-10-13 15:59:00 -07:00
Richard Smith	8d03a972ce	PR47805: Use a single object for a function parameter in the caller and callee in constant evaluation. We previously made a deep copy of function parameters of class type when passing them, resulting in the destructor for the parameter applying to the original argument value, ignoring any modifications made in the function body. This also meant that the 'this' pointer of the function parameter could be observed changing between the caller and the callee. This change completely reimplements how we model function parameters during constant evaluation. We now model them roughly as if they were variables living in the caller, albeit with an artificially reduced scope that covers only the duration of the function call, instead of modeling them as temporaries in the caller that we partially "reparent" into the callee at the point of the call. This brings some minor diagnostic improvements, as well as significantly reduced stack usage during constant evaluation.	2020-10-13 15:45:04 -07:00
Xiangling Liao	4c10d6508f	[AIX] Support two itanium alignment LIT testcases for AIX using regex AIX has a different layout dumping format from other itanium ABIs. For these two cases, use regex to match the AIX format. Differential Revision: https://reviews.llvm.org/D89064	2020-10-13 16:47:01 -04:00
Konstantin Zhuravlyov	e2eaa91451	AMDGPU: Remove -mamdgpu-debugger-abi option It has been unsupported for a few years now. Differential Revision: https://reviews.llvm.org/D89125	2020-10-13 12:20:28 -04:00
Jonas Paulsson	c78da03778	[clang] Improve handling of physical registers in inline assembly operands. Change EmitAsmStmt() to - Not tie physregs with the "+r" constraint, but instead add the hard register as an input constraint. This makes "+r" and "=r":"r" look the same in the output. Background: Macro intensive user code may contain inline assembly statements with multiple operands constrained to the same physreg. Such a case (with the operand constraints "+r" : "r") currently triggers the TwoAddressInstructionPass assertion against any extra use of a tied register. Furthermore, TwoAddress will insert a COPY to that physreg even though isel has already done so (for the non-tied use), which may lead to a second redundant instruction currently. A simple fix for this is to not emit tied physreg uses in the first place for the "+r" constraint, which is what this patch does. - Give an error on multiple outputs to the same physical register. This should be reported and this is also what GCC does. Review: Ulrich Weigand, Aaron Ballman, Jennifer Yu, Craig Topper Differential Revision: https://reviews.llvm.org/D87279	2020-10-13 15:09:52 +02:00
Bevin Hansson	9fa7f48459	[Fixed Point] Add fixed-point to floating point cast types and consteval. Reviewed By: leonardchan Differential Revision: https://reviews.llvm.org/D86631	2020-10-13 13:26:56 +02:00
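
Commit `d7fa9030d4` in the list above lowers the MS `shiftleft128`/`shiftright128` intrinsics to the `fshl`/`fshr` funnel-shift intrinsics. As a rough illustration of why that mapping is natural, the following Python sketch models 64-bit funnel shifts on plain integers. The helper names (`fshl64`, `fshr64`, `shiftleft128`, `shiftright128`) and the operand order (high part first) are assumptions made for this sketch, not code taken from the patch.

```python
MASK64 = (1 << 64) - 1  # keep values within 64 bits


def fshl64(hi, lo, shift):
    """Funnel shift left: concatenate hi:lo, shift left, keep the high 64 bits."""
    s = shift % 64  # the shift amount is taken modulo the bit width
    wide = ((hi & MASK64) << 64) | (lo & MASK64)
    return ((wide << s) >> 64) & MASK64


def fshr64(hi, lo, shift):
    """Funnel shift right: concatenate hi:lo, shift right, keep the low 64 bits."""
    s = shift % 64
    wide = ((hi & MASK64) << 64) | (lo & MASK64)
    return (wide >> s) & MASK64


# Hypothetical models of the MS intrinsics, which take (low, high, shift).
def shiftleft128(low, high, shift):
    return fshl64(high, low, shift)


def shiftright128(low, high, shift):
    return fshr64(high, low, shift)
```

Because the whole operation is a single funnel shift on 64-bit operands, no widening to 128 bits and truncating back is needed in the emitted IR, which is the zext/trunc overhead the commit message mentions avoiding.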

42077 Commits
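
The `-mtune` commit (`cfa7094e49`) in this range describes two rules: processor aliases resolve according to the target arch, and `-mtune` takes priority over `-mcpu` for the pipeline model. A minimal Python sketch of that selection logic, under stated assumptions, might look like the following. The function name `resolve_tune_cpu`, the `TUNE_ALIASES` set, and the CPU name `some-cpu` are hypothetical; only the `rocket` resolution is taken from the commit message, and the real driver logic lives in Clang's C++ sources.

```python
# Aliases whose resolution the commit message spells out. Other aliases
# (e.g. `generic`, `sifive-7-series`) are omitted because their resolved
# names are not stated there.
TUNE_ALIASES = {"rocket"}


def resolve_tune_cpu(mcpu, mtune, xlen):
    """Pick the CPU name used for the pipeline model.

    mcpu, mtune: option values as strings, or None if the flag is absent.
    xlen: 32 or 64, standing in for the rv32/rv64 target arch.
    """
    # -mtune wins over -mcpu when both are given.
    chosen = mtune if mtune is not None else mcpu
    if chosen is None:
        return None
    # Aliases are expanded with the current arch width.
    if chosen in TUNE_ALIASES:
        return f"{chosen}-rv{xlen}"
    return chosen
```

For example, `resolve_tune_cpu("some-cpu", "rocket", 64)` selects `rocket-rv64`, while omitting `-mtune` falls back to the `-mcpu` value.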