llvm-project

Commit Graph

Author	SHA1	Message	Date
Joseph Huber	d564409946	[OpenMP] Change CMake Configuration to Build for Highest CUDA Architecture by Default Summary: This patch changes the CMake files for Clang and Libomptarget to query the system for its supported CUDA architecture. This makes it much easier for the user to build optimal code without needing to set the flags manually. This relies on the now deprecated FindCUDA method in CMake, but full support for architecture detection is only availible in CMake >3.18 Reviewers: jdoerfert ye-luo Subscribers: cfe-commits guansong mgorny openmp-commits sstefan1 yaxunl Tags: #clang #OpenMP Differential Revision: https://reviews.llvm.org/D87946	2020-10-08 12:09:34 -04:00
Geoff Levner	b9225543e8	DeferredDiagnosticsEmitter crashes Patch VisitCXXDeleteExpr() in clang::UsedDeclVisitor to avoid it crashing when the expression's destroyed type is null. According to the comments in CXXDeleteExpr::getDestroyedType(), this can happen when the type to delete is a dependent type. Patch by Geoff Levner. Differential Revision: https://reviews.llvm.org/D88949	2020-10-08 11:42:21 -04:00
diggerlin	92bca12843	[AIX] add new option -mignore-xcoff-visibility SUMMARY: In IBM compiler xlclang , there is an option -fnovisibility which suppresses visibility. For more details see: https://www.ibm.com/support/knowledgecenter/SSGH3R_16.1.0/com.ibm.xlcpp161.aix.doc/compiler_ref/opt_visibility.html. We need to add the option -mignore-xcoff-visibility for compatibility with the IBM AIX OS (as the option is enabled by default in AIX). With this option llvm does not emit any visibility attribute to ASM or XCOFF object file. The option only work on the AIX OS, for other non-AIX OS using the option will report an unsupported options error. In AIX OS: 1.1 the option -mignore-xcoff-visibility is enabled by default , if there is not -fvisibility=* and -mignore-xcoff-visibility explicitly in the clang command . 1.2 if there is -fvisibility=* explicitly but not -mignore-xcoff-visibility explicitly in the clang command. it will generate visibility attributes. 1.3 if there are both -fvisibility=* and -mignore-xcoff-visibility explicitly in the clang command. The option "-mignore-xcoff-visibility" wins , it do not emit the visibility attribute. The option -mignore-xcoff-visibility has no effect on visibility attribute when compile with -emit-llvm option to generated LLVM IR. Reviewer: daltenty,Jason Liu Differential Revision: https://reviews.llvm.org/D87451	2020-10-08 09:34:58 -04:00
Joseph Huber	6668e4cc68	[OpenMP] Add Error Handling for Conflicting Pointer Sizes for Target Offload Summary: This patch adds an error to Clang that detects if OpenMP offloading is used between two architectures with incompatible pointer sizes. This ensures that the data mapping can be done correctly and solves an issue in code generation generating the wrong size pointer. Reviewer: jdoerfert Subscribers: cfe-commits delcypher guansong llvm-commits sstefan1 yaxunl Tags: #OpenMP #Clang Differential Revision: https://reviews.llvm.org/D88594	2020-10-08 08:20:38 -04:00
Serge Pavlov	70bf35070a	[Driver] Add output file to properties of Command Object of class `Command` contains various properties of a command to execute, but output file was missed from them. This change adds this property. It is required for reporting consumed time and memory implemented in D78903 and may be used in other cases too. Differential Revision: https://reviews.llvm.org/D78902	2020-10-08 18:23:39 +07:00
Haojian Wu	a96bcfb196	[AST][RecoveryExpr] Support dependent cast-expr in C for error-recovery. Suppress spurious "typecheck_cond_expect_scalar_operand" diagnostic. See whole context: https://reviews.llvm.org/D85025 Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D84387	2020-10-08 10:00:29 +02:00
Serge Guelton	b4ffc40d62	Update documentation and implementation of stage3 build Have the build work out of the box by forcing an LLD build. That way, we don't require an external LTO-aware linker, as we build one. Also remove reference to the seemingly dead builder. Differential Revision: https://reviews.llvm.org/D88990	2020-10-08 07:55:37 +02:00
Dominic Chen	c102488293	Add test for disabling Dead Virtual Function Elimination Differential Revision: https://reviews.llvm.org/D88349	2020-10-07 19:20:16 -04:00
Simon Pilgrim	42d91438ad	[CodeGen][X86] Cleanup labels on some sse/avx intrinsics tests. NFCI. Add some missing CHECK-LABEL lines. Remove leading '@' so it'll be possible to match against c and c++ builds in a future patch.	2020-10-07 19:33:14 +01:00
Alex Richardson	ff6e4441b9	[clang-format][tests] Fix MacroExpander lexer not parsing C++ keywords While debugging a different clang-format failure, I tried to reuse the MacroExpander lexer, but was surprised to see that it marks all C++ keywords (e.g. const, decltype) as being of type identifier. After stepping through the ::format() code, I noticed that the difference between these two is that the identifier table was not being initialized based on the FormatStyle, so only basic tokens such as tok::semi, tok::plus, etc. were being handled. Reviewed By: klimek Differential Revision: https://reviews.llvm.org/D88952	2020-10-07 17:17:41 +01:00
Alex Richardson	0a3c82e85b	[clang-format][NFC] Store FormatToken::Type as an enum instead of bitfield This improves the debugging experience since LLDB will print the enumerator name instead of a decimal number. This changes TokenType to have uint8_t as the underlying type and moves it after the remaining bitfields to avoid increasing the size of FormatToken. Reviewed By: MyDeveloperDay Differential Revision: https://reviews.llvm.org/D87006	2020-10-07 17:17:40 +01:00
Fanbo Meng	9908ee5670	[SystemZ][z/OS] Add test of zero length bitfield type size larger than target zero length bitfield boundary Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D88963	2020-10-07 11:34:13 -04:00
Haojian Wu	31dc908017	[clang] Use isCompoundAssignmentOp to simplify the code, NFC.	2020-10-07 09:50:43 +02:00
Haojian Wu	334ec6f807	[AST][RecoveryExpr] Support dependent conditional operators in C for error recovery. suppress spurious "typecheck_cond_expect_scalar" diagnostic. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D84322	2020-10-07 09:33:57 +02:00
Johannes Doerfert	5a3f6bfe8a	Reapply "[OpenMP][FIX] Verify compatible types for declare variant calls" D88384 This reapplies D88384 with the minor modification that an assertion was changed to a regular conditional and graceful exit from ASTContext::mergeTypes.	2020-10-07 00:06:51 -05:00
Richard Smith	00d3e6c1b4	[c++17] Implement P0145R3 during constant evaluation. Ensure that we evaluate assignment and compound-assignment right-to-left, and array subscripting left-to-right. Fixes PR47724. This is a re-commit of `ded79be`, reverted in `37c74df`, with a fix and test for the crasher bug previously introduced.	2020-10-06 12:30:26 -07:00
Fanbo Meng	43cd0a98d1	[SystemZ][z/OS] Set default alignment rules for z/OS target Update RUN line to fix lit failure Differential Revision: https://reviews.llvm.org/D88845	2020-10-06 14:21:21 -04:00
Fanbo Meng	c781dc74a8	[SystemZ][z/OS] Set default alignment rules for z/OS target Set the default alignment control variables for z/OS target and add test case for alignment rules on z/OS. Reviewed By: abhina.sreeskantharajan Differential Revision: https://reviews.llvm.org/D88845	2020-10-06 13:16:15 -04:00
Aaron En Ye Shi	8d2a0c115e	[HIP] NFC Add comments to cmath functions Add missing comments to cmath functions. Differential Revision: https://reviews.llvm.org/D88837	2020-10-06 15:26:56 +00:00
Aaron En Ye Shi	aa2b593f14	[HIP] Restructure hip headers to add cmath Separate __clang_hip_math.h header into __clang_hip_cmath.h and __clang_hip_math.h. Improve the math function definition, and add missing definitions or declarations. Add missing overloads. Reviewed By: tra, JonChesterfield Differential Review: https://reviews.llvm.org/D88837	2020-10-06 14:48:53 +00:00
Shivanshu Goyal	66e4f07198	Add ability to turn off -fpch-instantiate-templates in clang-cl A lot of our code building with clang-cl.exe using Clang 11 was failing with the following 2 type of errors: 1. explicit specialization of 'foo' after instantiation 2. no matching function for call to 'bar' Note that we also use -fdelayed-template-parsing in our builds. I tried pretty hard to get a small repro for these failures, but couldn't. So there is some subtle edge case in the -fpch-instantiate-templates feature introduced by this change: https://reviews.llvm.org/D69585 When I tried turning this off using -fno-pch-instantiate-templates, builds would silently fail with the same error without any indication that -fno-pch-instantiate-templates was being ignored by the compiler. Then I realized this "no" option wasn't actually working when I ran Clang under a debugger. Differential revision: https://reviews.llvm.org/D88680	2020-10-06 16:23:23 +02:00
Dmitri Gribenko	37c74dfe72	Revert "[c++17] Implement P0145R3 during constant evaluation." This reverts commit `ded79be635`. It causes a crash (I sent the crash reproducer directly to the author).	2020-10-06 15:49:44 +02:00
Chuyang Chen	8fa45e1fd5	Convert diagnostics about multi-character literals from extension to warning This addresses PR46797.	2020-10-06 08:47:17 -04:00
David Spickett	f0a78bdfdc	[AArch64] Correct parameter type for unsigned Neon scalar shift intrinsics In the following intrinsics the shift amount (parameter 2) should be signed. vqshlb_u8 vqshlh_u16 vqshls_u32 vqshld_u64 vqrshlb_u8 vqrshlh_u16 vqrshls_u32 vqrshld_u64 vshld_u64 vrshld_u64 See https://developer.arm.com/documentation/ihi0073/latest Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D88013	2020-10-06 11:34:58 +01:00
Pushpinder Singh	3a12ff0dac	[OpenMP][RTL] Remove dead code RequiresDataSharing was always 0, resulting dead code in device runtime library. Reviewed By: jdoerfert, JonChesterfield Differential Revision: https://reviews.llvm.org/D88829	2020-10-06 05:43:47 -04:00
Haojian Wu	70d9dc8674	[AST][RecoveryExpr] Support dependent binary operator in C for error recovery. see the whole context in: https://reviews.llvm.org/D85025 Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D84226	2020-10-06 08:53:31 +02:00
Richard Smith	ded79be635	[c++17] Implement P0145R3 during constant evaluation. Ensure that we evaluate assignment and compound-assignment right-to-left, and array subscripting left-to-right. Fixes PR47724.	2020-10-05 19:04:14 -07:00
Richard Smith	ebf6fd633e	Make OpenMP tests less brittle in the face of changes in constant evaluation diagnostics.	2020-10-05 19:04:14 -07:00
Evandro Menezes	a48d480e1f	[RISCV] Fix broken test Fix test for the SiFive E76 core. This patch fixes the issue introduced by the commit `5d6d8a2769`.	2020-10-05 19:28:31 -05:00
Roman Lebedev	e00f189d39	[InstCombine] Revert rL226781 "Teach InstCombine to canonicalize loads which are only ever stored to always use a legal integer type if one is available." (PR47592) (it was introduced in https://lists.llvm.org/pipermail/llvm-dev/2015-January/080956.html) This canonicalization seems dubious. Most importantly, while it does not create `inttoptr` casts by itself, it may cause them to appear later, see e.g. D88788. I think it's pretty obvious that it is an undesirable outcome, by now we've established that seemingly no-op `inttoptr`/`ptrtoint` casts are not no-op, and are no longer eager to look past them. Which e.g. means that given ``` %a = load i32 %b = inttoptr %a %c = inttoptr %a ``` we likely won't be able to tell that `%b` and `%c` is the same thing. As we can see in D88789 / D88788 / D88806 / D75505, we can't really teach SCEV about this (not without the https://bugs.llvm.org/show_bug.cgi?id=47592 at least) And we can't recover the situation post-inlining in instcombine. So it really does look like this fold is actively breaking otherwise-good IR, in a way that is not recoverable. And that means, this fold isn't helpful in exposing the passes that are otherwise unaware of these patterns it produces. Thusly, i propose to simply not perform such a canonicalization. The original motivational RFC does not state what larger problem that canonicalization was trying to solve, so i'm not sure how this plays out in the larger picture. On vanilla llvm test-suite + RawSpeed, this results in increase of asm instructions and final object size by ~+0.05% decreases final count of bitcasts by -4.79% (-28990), ptrtoint casts by -15.41% (-3423), and of inttoptr casts by -25.59% (-6919, sic). Overall, there's -0.04% less IR blocks, -0.39% instructions. See https://bugs.llvm.org/show_bug.cgi?id=47592 Differential Revision: https://reviews.llvm.org/D88789	2020-10-06 00:00:30 +03:00
Evandro Menezes	5d6d8a2769	[RISCV] Add SiFive cores to the CPU option Add the SiFive cores E76 and U74 using the SiFive 7 series microarchitecture. Differential Revision: https://reviews.llvm.org/D88759	2020-10-05 15:50:57 -05:00
Fangrui Song	a2cc883368	[CUDA] Don't call __cudaRegisterVariable on C++17 inline variables D17779: host-side shadow variables of external declarations of device-side global variables have internal linkage and are referenced by `__cuda_register_globals`. nvcc from CUDA 11 does not allow `__device__ inline` or `__device__ constexpr` (C++17 inline variables) but clang has incorrectly supported them for a while: ``` error: A __device__ variable cannot be marked constexpr error: An inline __device__/__constant__/__managed__ variable must have internal linkage when the program is compiled in whole program mode (-rdc=false) ``` If such a variable (which has a comdat group) is discarded (a copy from another translation unit is prevailing and selected), accessing the variable from outside the section group (`__cuda_register_globals`) is a violation of the ELF specification and will be rejected by linkers: > A symbol table entry with STB_LOCAL binding that is defined relative to one of a group's sections, and that is contained in a symbol table section that is not part of the group, must be discarded if the group members are discarded. References to this symbol table entry from outside the group are not allowed. As a workaround, don't register such inline variables for now. (If we register the variables in all TUs, we will keep multiple instances of the shadow and break the C++ semantics for inline variables). We should reject such variables in Sema but our internal users need some time to migrate. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D88786	2020-10-05 12:53:59 -07:00
Saleem Abdulrasool	85d5064000	docs: add documentation describing API Notes API Notes are a feature which allows annotation of headers by an auxiliary file that contains metadata for declarations pertaining to the associated module. This enables adding attributes to declarations without requiring modification of the headers, enabling finer grained control for library headers for consumers without having to modify external headers. Differential Revision: https://reviews.llvm.org/D88446 Reviewed By: Richard Smith, Marcel Hlopko	2020-10-05 18:29:13 +00:00
Andrzej Warzynski	8d51d37e06	[flang] Introduce DiagnosticConsumer classes in libflangFrontend Currently Flang uses TextDiagnostic, TextDiagnosticPrinter & TestDiagnosticBuffer classes from Clang (more specifically, from libclangFrontend). This patch introduces simplified equivalents of these classes in Flang (i.e. it removes the dependency on libclangFrontend). Flang only needs these diagnostics classes for the compiler driver diagnostics. This is unlike in Clang in which similar diagnostic classes are used for e.g. Lexing/Parsing/Sema diagnostics. For this reason, the implementations introduced here are relatively basic. We can extend them in the future if this is required. This patch also enhances how the diagnostics are printed. In particular, this is the diagnostic that you'd get _before_ the changes introduced here (no text formatting): ``` $ bin/flang-new error: no input files ``` This is the diagnostic that you get _after_ the changes introduced here (in terminals that support it, the text is formatted - bold + red): ``` $ bin/flang-new flang-new: error: no input files ``` Tests are updated accordingly and options related to enabling/disabling color diagnostics are flagged as supported by Flang. Reviewed By: sameeranjoshi, CarolineConcatto Differential Revision: https://reviews.llvm.org/D87774	2020-10-05 17:46:44 +01:00
Joseph Huber	1dce692de1	Revert "[OpenMP] Add Error Handling for Conflicting Pointer Sizes for Target Offload" Reverting because detecting architecture size doesn't work on all platforms. This reverts commit `eaf73293cb`.	2020-10-05 12:35:39 -04:00
Joseph Huber	eaf73293cb	[OpenMP] Add Error Handling for Conflicting Pointer Sizes for Target Offload Summary: This patch adds an error to Clang that detects if OpenMP offloading is used between two architectures with incompatible pointer sizes. This ensures that the data mapping can be done correctly and solves an issue in code generation generating the wrong size pointer. This patch adds a new lit substitution, %omp_powerpc_triple that, if the system is 32-bit or 64-bit, sets the powerpc triple accordingly. This was required to fix some OpenMP tests that automatically populated the target architecture. Reviewers: jdoerfert Subscribers: cfe-commits guansong sstefan1 yaxunl delcypher Tags: OpenMP clang LLVM Differential Revision: https://reviews.llvm.org/D88594	2020-10-05 11:02:13 -04:00
Simon Pilgrim	7a932f4f4c	[Parser] ParseMicrosoftAsmStatement - Replace bit '\|' operator with logical '\|\|' operator. (PR47071) Fixes static analysis warning.	2020-10-05 14:23:28 +01:00
Gabor Marton	007dd12d54	[ASTImporter][AST] Fix structural equivalency crash on dependent FieldDecl Differential Revision: https://reviews.llvm.org/D88665	2020-10-05 14:06:09 +02:00
Haojian Wu	7f05fe1aee	[AST][RecoveryExpr] Fix a crash on undeduced type. We should not capture the type if the function return type is undeduced. Reviewed By: adamcz Differential Revision: https://reviews.llvm.org/D87350	2020-10-05 12:52:04 +02:00
Haojian Wu	3423d5c9da	[AST][RecoveryExpr] Popagate the error-bit from a VarDecl's initializer to DeclRefExpr. The error-bit was missing, if a DeclRefExpr (which refers to a VarDecl with a contains-errors initializer). It could cause different violations in clang -- the DeclRefExpr is value-dependent, but not contains-errors, `ABC<DeclRefExpr>` could produce a non-error and non-dependent type in non-template context, which will lead to crashes in constexpr evaluation. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D86048	2020-10-05 10:35:29 +02:00
Yaxun (Sam) Liu	e372c1d762	[HIP] Fix -fgpu-allow-device-init option The option needs to be passed to both host and device compilation. Differential Revision: https://reviews.llvm.org/D88550	2020-10-04 22:13:05 -04:00
Yaxun (Sam) Liu	5b551b79d3	[HIP] Fix default output file for -E By convention the default output file for -E is "-" (stdout). This is expected by tools like ccache, which uses output of -E to determine if a file and its dependence has changed. Currently clang does not use stdout as default output file for -E for HIP, which causes ccache not working. This patch fixes that. Differential Revision: https://reviews.llvm.org/D88730	2020-10-04 22:03:16 -04:00
Yaxun (Sam) Liu	9756a402f2	Recommit "[HIP] Add option --gpu-instrument-lib=" recommit `64f7790e7d` after fixing hip-device-libs.hip.	2020-10-04 21:41:43 -04:00
Yaxun (Sam) Liu	fef0ebbc0b	Revert "[HIP] Add option --gpu-instrument-lib=" This reverts commit `64f7790e7d` due to regression in hip-device-libs.hip.	2020-10-04 21:27:29 -04:00
Yaxun (Sam) Liu	64f7790e7d	[HIP] Add option --gpu-instrument-lib= Add an option --gpu-instrument-lib= to allow users to specify an instrument device library. This is for supporting -finstrument in device code for debugging/profiling tools. Differential Revision: https://reviews.llvm.org/D88557	2020-10-04 21:16:36 -04:00
Yuanfang Chen	2c94d88e07	[NewPM] collapsing nested pass mangers of the same type This is one of the reason for extra invalidations in D84959. In practice, I don't think we have use cases needing this. This simplifies the pipeline a bit and prune corner cases when considering invalidations. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D85676	2020-10-04 15:57:13 -07:00
Craig Topper	a02b449bb1	[X86] Sync AESENC/DEC Key Locker builtins with gcc. For the wide builtins, pass a single input and output pointer to the builtins. Emit the GEPs and input loads from CGBuiltin.	2020-10-04 12:09:41 -07:00
Craig Topper	230c57b0bd	[X86] Synchronize the encodekey builtins with gcc. Don't assume void* is 16 byte aligned. We were taking multiple pointer arguments in the builtin. gcc accepts a single void. The cast from void to _m128i* caused the IR generation to assume the pointer was aligned. Instead make the builtin take a single void, emit i8 GEPs to adjust then cast to <2 x i64>* and perform a store with align of 1.	2020-10-04 12:09:35 -07:00
Craig Topper	28595cbbeb	[X86] Synchronize the loadiwkey builtin operand order with gcc version.	2020-10-04 12:09:29 -07:00
Craig Topper	6c6cd5f8a9	[X86] Consolidate wide Key Locker intrinsics into the same header as the other Key Locker intrinsics.	2020-10-04 12:09:21 -07:00
Roman Lebedev	aaae13d0c2	[NFC][clang][codegen] Autogenerate a few ARM SVE tests that are being affected by an upcoming patch	2020-10-04 19:54:09 +03:00
Esme-Yi	e3475f5b91	[PowerPC] Add builtins for xvtdiv(dp\|sp) and xvtsqrt(dp\|sp). Summary: This patch implements the builtins for xvtdivdp, xvtdivsp, xvtsqrtdp, xvtsqrtsp. The instructions correspond to the following builtins: int vec_test_swdiv(vector double v1, vector double v2); int vec_test_swdivs(vector float v1, vector float v2); int vec_test_swsqrt(vector double v1); int vec_test_swsqrts(vector float v1); This patch depends on D88274, which fixes the bug in copying from CRRC to GPRC/G8RC. Reviewed By: steven.zhang, amyk Differential Revision: https://reviews.llvm.org/D88278	2020-10-04 16:24:20 +00:00
Mark de Wever	1113fbf44c	[CodeGen] Improve likelihood branch weights Bruno De Fraine discovered some issues with D85091. The branch weights generated for `logical not` and `ternary conditional` were wrong. The `logical and` and `logical or` differed from the code generated of `__builtin_predict`. Adjusted the generated code for the likelihood to match `__builtin_predict`. The patch is based on Bruno's suggestions. Differential Revision: https://reviews.llvm.org/D88363	2020-10-04 14:24:27 +02:00
Roman Lebedev	03bd5198b6	[OldPM] Pass manager: run SROA after (simple) loop unrolling I have stumbled into this pretty accidentally, when rewriting some spaghetti-like code into something more structured, which involved using some `std::array<>`s. And to my surprise, the `alloca`s remained, causing about `+160%` perf regression. https://llvm-compile-time-tracker.com/compare.php?from=bb6f4d32aac3eecb51909f4facc625219307ee68&to=d563e66f40f9d4d145cb2050e41cb961e2b37785&stat=instructions suggests that this has geomean compile-time cost of `+0.08%`. Note that D68593 / `cecc0d27ad` already did this chage for NewPM, but left OldPM in a pessimized state. This fixes [[ https://bugs.llvm.org/show_bug.cgi?id=40011 \| PR40011 ]], [[ https://bugs.llvm.org/show_bug.cgi?id=42794 \| PR42794 ]] and probably some other reports. Reviewed By: nikic, xbolva00 Differential Revision: https://reviews.llvm.org/D87972	2020-10-04 11:53:50 +03:00
Nico Weber	ba60dc0aa7	Revert "[Driver] Move detectLibcxxIncludePath to ToolChain" This reverts commit `e25bf25920`. Breaks tests on Windows, see comments on https://reviews.llvm.org/D88452	2020-10-03 14:22:53 -04:00
Nathan Lanza	fcb0ab5933	[clang][NFC] Change a mention of `objc_static_protocol` to `non_runtime`	2020-10-03 14:04:14 -04:00
Mark de Wever	0ce6d6b46e	[Sema] List conversion validate character array. The function `TryListConversion` didn't properly validate the following part of the standard: Otherwise, if the parameter type is a character array [... ] and the initializer list has a single element that is an appropriately-typed string literal (8.5.2 [dcl.init.string]), the implicit conversion sequence is the identity conversion. This caused the following call to `f()` to be ambiguous. void f(int(&&)[1]); void f(unsigned(&&)[1]); void g(unsigned i) { f({i}); } This issue only occurs when the initializer list had one element. Differential Revision: https://reviews.llvm.org/D87561	2020-10-03 14:33:28 +02:00
Evandro Menezes	a0a8f83718	[PATCH] Fix typo (NFC)	2020-10-02 21:19:14 -05:00
Petr Hosek	e25bf25920	[Driver] Move detectLibcxxIncludePath to ToolChain This helper method is useful even outside of Gnu toolchains, so move it to ToolChain so it can be reused in other toolchains such as Fuchsia. Differential Revision: https://reviews.llvm.org/D88452	2020-10-02 18:37:20 -07:00
Petr Hosek	9a48411f35	Revert "[Driver] Move detectLibcxxIncludePath to ToolChain" This reverts commit `a594fd28e3` which is failign on some bots.	2020-10-02 16:59:28 -07:00
Yaxun (Sam) Liu	2cd75f738e	Diagnose invalid target ID for AMDGPU toolchain for assembler AMDGPU toolchain currently only diagnose invalid target ID for OpenCL source compilation. Invalid target ID is not diagnosed for assembler. This patch fixes that. Differential Revision: https://reviews.llvm.org/D88377	2020-10-02 19:38:02 -04:00
Yaxun (Sam) Liu	cbd420c5ed	[CUDA][HIP] Fix bound arch for offload action for fat binary Currently CUDA/HIP toolchain uses "unknown" as bound arch for offload action for fat binary. This causes -mcpu or -march with "unknown" added in HIPToolChain::TranslateArgs or CUDAToolChain::TranslateArgs. This causes issue for https://reviews.llvm.org/D88377 since HIP toolchain needs to check -mcpu in HIPToolChain::TranslateArgs. The bound arch of offload action for fat binary is not really used, therefore set it to CudaArch::UNUSED. Differential Revision: https://reviews.llvm.org/D88524	2020-10-02 19:05:51 -04:00
Richard Smith	8fb2a235b0	Don't reject calls to MinGW's unusual _setjmp declaration. We now recognize this function as a builtin despite it having an unexpected number of parameters; make sure we don't enforce that it has only 1 argument for its 2 parameters.	2020-10-02 15:12:15 -07:00
Yaxun (Sam) Liu	dc6a0b0ec7	[HIP] Align device binary To facilitate faster loading of device binaries and share them among processes, HIP runtime favors their alignment being 4096 bytes. HIP runtime can load unaligned device binaries, however, aligning them at 4096 bytes results in faster loading and less shared memory usage. This patch adds an option -bundle-align to clang-offload-bundler which allows bundles to be aligned at specified alignment. By default it is 1, which is NFC compared to existing format. This patch then aligns embedded fat binary and device binary inside fat binary at 4096 bytes. It has been verified this change does not cause significant overall file size increase for typical HIP applications (less than 1%). Differential Revision: https://reviews.llvm.org/D88734	2020-10-02 18:10:44 -04:00
Nathan Lanza	14f6bfcb52	[clang] Implement objc_non_runtime_protocol to remove protocol metadata Summary: Motivated by the new objc_direct attribute, this change adds a new attribute that remotes metadata from Protocols that the programmer knows isn't going to be used at runtime. We simply have the frontend skip generating any protocol metadata entries (e.g. OBJC_CLASS_NAME, _OBJC_$_PROTOCOL_INSTANCE_METHDOS, _OBJC_PROTOCOL, etc) for a protocol marked with `__attribute__((objc_non_runtime_protocol))`. There are a few APIs used to retrieve a protocol at runtime. `@protocol(SomeProtocol)` will now error out of the requested protocol is marked with attribute. `objc_getProtocol` will return `NULL` which is consistent with the behavior of a non-existing protocol. Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D75574	2020-10-02 17:35:50 -04:00
Petr Hosek	a594fd28e3	[Driver] Move detectLibcxxIncludePath to ToolChain This helper method is useful even outside of Gnu toolchains, so move it to ToolChain so it can be reused in other toolchains such as Fuchsia. Differential Revision: https://reviews.llvm.org/D88452	2020-10-02 14:23:48 -07:00
Evgenii Stepanov	66cf68ed46	[docs] Update ControlFlowIntegrity.rst. Expand the list of targets that support cfi-icall. Add ThinLTO everywhere LTO is mentioned. AFAIK all CFI features are supported with ThinLTO. Differential Revision: https://reviews.llvm.org/D87717	2020-10-02 12:01:05 -07:00
Arthur Eubanks	eb55735073	Reland [AlwaysInliner] Update BFI when inlining Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D88324	2020-10-02 10:46:57 -07:00
Yaxun (Sam) Liu	c87c017a4c	Fix failure in test hip-macros.hip requires amdgpu-registered-target.	2020-10-02 10:33:32 -04:00
Yaxun (Sam) Liu	36501b180a	Emit predefined macro for wavefront size for amdgcn Also fix the issue of multiple -m[no-]wavefrontsize64 options to make the last one wins. Differential Revision: https://reviews.llvm.org/D88370	2020-10-02 10:17:21 -04:00
Sjoerd Meijer	8825fec37e	[AArch64] Add CPU Cortex-R82 This adds support for -mcpu=cortex-r82. Some more information about this core can be found here: https://www.arm.com/products/silicon-ip-cpu/cortex-r/cortex-r82 One note about the system register: that is a bit of a refactoring because of small differences between v8.4-A AArch64 and v8-R AArch64. This is based on patches from Mark Murray and Mikhail Maltsev. Differential Revision: https://reviews.llvm.org/D88660	2020-10-02 12:47:23 +01:00
Hubert Tong	35ecc7fe49	[clang][Sema] Fix PR47676: Handle dependent AltiVec C-style cast Fix premature decision in the presence of type-dependent expression operands on whether AltiVec vector initializations from single expressions are "splat" operations. Verify that the instantiation is able to determine the correct cast semantics for both the scalar type and the vector type case. Note that, because the change only affects the single-expression case (and the target type is an AltiVec-style vector type), the replacement of a parenthesized list with a parenthesized expression does not change the semantics of the program in a program-observable manner. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D88526	2020-10-01 15:57:01 -04:00
Petr Hosek	de47e7122f	[CMake][Fuchsia] Don't set WIN32 API, rely on autodetection We prefer autodetection here to avoid persisting this configuration in the generated __config header which is shared across targets. Differential Revision: https://reviews.llvm.org/D88694	2020-10-01 12:35:52 -07:00
Sanjay Patel	149f5b573c	[APFloat] convert SNaN to QNaN in convert() and raise Invalid signal This is an alternate fix (see D87835) for a bug where a NaN constant gets wrongly transformed into Infinity via truncation. In this patch, we uniformly convert any SNaN to QNaN while raising 'invalid op'. But we don't have a way to directly specify a 32-bit SNaN value in LLVM IR, so those are always encoded/decoded by calling convert from/to 64-bit hex. See D88664 for a clang fix needed to allow this change. Differential Revision: https://reviews.llvm.org/D88238	2020-10-01 14:37:38 -04:00
Haojian Wu	c1b209cc61	[Format] Don't treat compound extension headers (foo.proto.h) as foo.cc main-file header. We receive internal bugs about this false positives after D86597. Differential Revision: https://reviews.llvm.org/D88640.	2020-10-01 19:57:57 +02:00
Sanjay Patel	686eb0d8de	[AST] do not error on APFloat invalidOp in default mode If FP exceptions are ignored, we should not error out of compilation just because APFloat indicated an exception. This is required as a preliminary step for D88238 which changes APFloat behavior for signaling NaN convert() to set the opInvalidOp exception status. Currently, there is no way to trigger this error because convert() never sets opInvalidOp. FP binops that set opInvalidOp also create a NaN, so the path to checkFloatingPointResult() is blocked by a different diagnostic: // [expr.pre]p4: // If during the evaluation of an expression, the result is not // mathematically defined [...], the behavior is undefined. // FIXME: C++ rules require us to not conform to IEEE 754 here. if (LHS.isNaN()) { Info.CCEDiag(E, diag::note_constexpr_float_arithmetic) << LHS.isNaN(); return Info.noteUndefinedBehavior(); } return checkFloatingPointResult(Info, E, St); Differential Revision: https://reviews.llvm.org/D88664	2020-10-01 13:46:45 -04:00
Michael Liao	8c36eaf037	[clang][opencl][codegen] Remove the insertion of `correctly-rounded-divide-sqrt-fp-math` fn-attr. - `-cl-fp32-correctly-rounded-divide-sqrt` is already handled in a per-instruction manner by annotating the accuracy required. There's no need to add that fn-attr. So far, there's no in-tree backend handling that attr and that OpenCL specific option. - In case that out-of-tree backends are broken, this change could be reverted if those backends could not be fixed. Differential Revision: https://reviews.llvm.org/D88424	2020-10-01 11:07:39 -04:00
Eduardo Caldas	5011d43108	Migrate Declarators to use the List API After this change all nodes that have a delimited-list are using the `List` API. Implementation details: Let's look at a declaration with multiple declarators: `int a, b;` To generate a declarator list node we need to have the range of declarators: `a, b`: However, the `ClangAST` actually stores them as separate declarations: `int a ;` `int b;` We solve that by appropriately marking the declarators on each separate declaration in the `ClangAST` and then for the final declarator `int b`, shrinking its range to fit to the already marked declarators. Differential Revision: https://reviews.llvm.org/D88403	2020-10-01 13:56:31 +00:00
Ranjeet Singh	e4f50e587f	[ARM] Add missing target for Arm neon test case. This is a follow-up from https://reviews.llvm.org/D61717. Where Richard described the issue with compiling arm_neon.h under -flax-vector-conversions=none. It looks like the example reproducer does actually work but what was missing was a test entry for that target. Differential Revision: https://reviews.llvm.org/D88546	2020-10-01 00:32:33 +01:00
Akira Hatanaka	21cf2e6c26	Handle unknown OSes in DarwinTargetInfo::getExnObjectAlignment rdar://problem/69727650	2020-09-30 16:05:17 -07:00
Hubert Tong	ae4c400e02	[NFC] Fix spacing in clang/test/Driver/aix-ld.c Fix one line with mismatch in indentation after `afc277b0ed`.	2020-09-30 17:01:32 -04:00
Arthur Eubanks	ce5379f0f0	[NPM] Add target specific hook to add passes for New Pass Manager The patch adds a new TargetMachine member "registerPassBuilderCallbacks" for targets to add passes to the pass pipeline using the New Pass Manager (similar to adjustPassManager for the Legacy Pass Manager). Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D88138	2020-09-30 13:29:43 -07:00
Joseph Huber	1b60f63e4f	Revert "[OpenMP] Replace OpenMP RTL Functions With OMPIRBuilder and OMPKinds.def" Failing tests on Arm due to the tests automatically populating incomatible pointer width architectures. Reverting until the tests are updated. Failing tests: OpenMP/distribute_parallel_for_num_threads_codegen.cpp OpenMP/distribute_parallel_for_if_codegen.cpp OpenMP/distribute_parallel_for_simd_if_codegen.cpp OpenMP/distribute_parallel_for_simd_num_threads_codegen.cpp OpenMP/target_teams_distribute_parallel_for_if_codegen.cpp OpenMP/target_teams_distribute_parallel_for_simd_if_codegen.cpp OpenMP/teams_distribute_parallel_for_if_codegen.cpp OpenMP/teams_distribute_parallel_for_simd_if_codegen.cpp This reverts commit `90eaedda9b`.	2020-09-30 15:12:21 -04:00
Sanjay Patel	81921ebc43	[CodeGen] improve coverage for float (32-bit) type of NAN; NFC Goes with D88238	2020-09-30 15:10:25 -04:00
Joseph Huber	bdc85292fb	Revert "[OpenMP] Add Error Handling for Conflicting Pointer Sizes for Target Offload" Failing tests on Arm due to the tests automatically populating incomatible pointer width architectures. Reverting until the tests are updated. Failing tests: OpenMP/distribute_parallel_for_num_threads_codegen.cpp OpenMP/distribute_parallel_for_if_codegen.cpp OpenMP/distribute_parallel_for_simd_if_codegen.cpp OpenMP/distribute_parallel_for_simd_num_threads_codegen.cpp OpenMP/target_teams_distribute_parallel_for_if_codegen.cpp OpenMP/target_teams_distribute_parallel_for_simd_if_codegen.cpp OpenMP/teams_distribute_parallel_for_if_codegen.cpp OpenMP/teams_distribute_parallel_for_simd_if_codegen.cpp This reverts commit `9d2378b591`.	2020-09-30 15:08:22 -04:00
David Tenty	afc277b0ed	[AIX][Clang][Driver] Link libm in c++ mode since that is the normal behaviour of other compilers on the platform. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D88500	2020-09-30 14:02:17 -04:00
Joseph Huber	90eaedda9b	[OpenMP] Replace OpenMP RTL Functions With OMPIRBuilder and OMPKinds.def Summary: Replace the OpenMP Runtime Library functions used in CGOpenMPRuntimeGPU for OpenMP device code generation with ones in OMPKinds.def and use OMPIRBuilder for generating runtime calls. This allows us to consolidate more OpenMP code generation into the OMPIRBuilder. This patch also invalidates specifying target architectures with conflicting pointer sizes. Reviewers: jdoerfert Subscribers: aaron.ballman cfe-commits guansong llvm-commits sstefan1 yaxunl Tags: #OpenMP #Clang #LLVM Differential Revision: https://reviews.llvm.org/D88430	2020-09-30 14:00:01 -04:00
Joseph Huber	9d2378b591	[OpenMP] Add Error Handling for Conflicting Pointer Sizes for Target Offload Summary: This patch adds an error to Clang that detects if OpenMP offloading is used between two architectures with incompatible pointer sizes. This ensures that the data mapping can be done correctly and solves an issue in code generation generating the wrong size pointer. Reviewer: jdoerfert Subscribers: Tags: #OpenMP #Clang Differential Revision:	2020-09-30 13:58:24 -04:00
Richard Smith	892df30a7f	Fix interaction of `constinit` and `weak`. We previously took a shortcut and said that weak variables never have constant initializers (because those initializers are never correct to use outside the variable). We now say that weak variables can have constant initializers, but are never usable in constant expressions.	2020-09-30 10:49:50 -07:00
Alexandre Rames	700e63293e	[Sema] Support Comma operator for fp16 vectors. The current half vector was enforcing an assert expecting "(LHS is half vector) == (RHS is half vector)" for comma. Reviewed By: ahatanak, fhahn Differential Revision: https://reviews.llvm.org/D88265	2020-09-30 18:23:09 +01:00
Sanjay Patel	187686bea3	[CodeGen] add test for NAN creation; NFC This goes with the APFloat change proposed in D88238. This is copied from the MIPS-specific test in builtin-nan-legacy.c to verify that the normal behavior is correct on other targets without the complication of an inverted quiet bit.	2020-09-30 13:22:12 -04:00
Xiangling Liao	3a7487f903	[FE] Use preferred alignment instead of ABI alignment for complete object when applicable On some targets, preferred alignment is larger than ABI alignment in some cases. For example, on AIX we have special power alignment rules which would cause that. Previously, to support those cases, we added a “PreferredAlignment” field in the `RecordLayout` to store the AIX special alignment values in “PreferredAlignment” as the community suggested. However, that patch alone is not enough. There are places in the Clang where `PreferredAlignment` should have been used instead of ABI-specified alignment. This patch is aimed at fixing those spots. Differential Revision: https://reviews.llvm.org/D86790	2020-09-30 10:48:28 -04:00
Xiangling Liao	944691f0b7	[NFC][FE] Replace TypeSize with StorageUnitSize On some targets like AIX, last bitfield size is not always equal to last bitfield type size. Some bitfield like bool will have the same alignment as [unsigned]. So we'd like to use a more general term `StorageUnit` to replace type in this field. Differential Revision: https://reviews.llvm.org/D88260	2020-09-30 10:32:53 -04:00
Xiang1 Zhang	413577a879	[X86] Support Intel Key Locker Key Locker provides a mechanism to encrypt and decrypt data with an AES key without having access to the raw key value by converting AES keys into “handles”. These handles can be used to perform the same encryption and decryption operations as the original AES keys, but they only work on the current system and only until they are revoked. If software revokes Key Locker handles (e.g., on a reboot), then any previous handles can no longer be used. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D88398	2020-09-30 18:08:45 +08:00
Serge Pavlov	4e4f926e83	Remove test AST/const-fpfeatures-diag.c This test is going to be removed because using dynamic rounding mode in initializers is changing. It also causes build failures in some cases, so remove it now.	2020-09-30 11:07:55 +07:00
Brad Smith	6f01c53f26	Remove further OpenBSD/sparc bits	2020-09-29 22:17:12 -04:00
Yaxun (Sam) Liu	d04775e16b	Add remquo, frexp and modf overload functions to HIP header	2020-09-29 20:57:56 -04:00
Amy Huang	5c4fc581d5	[DebugInfo] Add types from constructor homing to the retained types list. Add class types to the retained types list to make sure they don't get dropped if the constructor is optimized out later. Differential Revision: https://reviews.llvm.org/D88522	2020-09-29 17:00:45 -07:00
John McCall	984744a131	Fix a variety of minor issues with ObjC method mangling: - Fix a memory leak accidentally introduced yesterday by using CodeGen's existing mangling context instead of creating a new context afresh. - Move GNU-runtime ObjC method mangling into the AST mangler; this will eventually be necessary to support direct methods there, but is also just the right architecture. - Make the Apple-runtime method mangling work properly when given an interface declaration, fixing a bug (which had solidified into a test) where mangling a category method from the interface could cause it to be mangled as if the category name was a class name. (Category names are namespaced within their class and have no global meaning.) - Fix a code cross-reference in dsymutil. Based on a patch by Ellis Hoag.	2020-09-29 19:51:53 -04:00
Richard Smith	1c604a9f5f	Recognize setjmp and friends as builtins even if jmp_buf is not declared yet. This happens in glibc's headers. It's important that we recognize these functions so that we can mark them as returns_twice. Differential Revision: https://reviews.llvm.org/D88518	2020-09-29 15:53:17 -07:00
Chris Hamilton	155d2d5300	Revert "[Sema] Address-space sensitive check for unbounded arrays (v2)" This reverts commit `d9ee935679`.	2020-09-29 22:46:14 +02:00
Aaron Ballman	538762fef0	Better diagnostics for anonymous bit-fields with attributes or an initializer. The current C++ grammar allows an anonymous bit-field with an attribute, but this is ambiguous (the attribute in that case could appertain to the type instead of the bit-field). The current thinking in the Core Working Group is that it's better to disallow attributes in that position at the grammar level so that the ambiguity resolves in favor of applying to the type. During discussions about the behavior of the attribute, the Core Working Group also felt it was better to disallow anonymous bit-fields from specifying a default member initializer. This implements both sets of related grammar changes.	2020-09-29 16:32:20 -04:00
Aaron Ballman	15fbae8ac3	Use "default member initializer" instead of "in-class initializer" for diagnostics. This changes some diagnostics to use terminology from the standard rather than invented terminology, which improves consistency with other diagnostics as well. There are no functional changes intended other than wording and naming.	2020-09-29 15:04:23 -04:00
Fangrui Song	3681be876f	Add -fprofile-update={atomic,prefer-atomic,single} GCC 7 introduced -fprofile-update={atomic,prefer-atomic} (prefer-atomic is for best efforts (some targets do not support atomics)) to increment counters atomically, which is exactly what we have done with -fprofile-instr-generate (D50867) and -fprofile-arcs (`b5ef137c11`). This patch adds the option to clang to surface the internal options at driver level. GCC 7 also turned on -fprofile-update=prefer-atomic when -pthread is specified, but it has performance regression (https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89307). So we don't follow suit. Differential Revision: https://reviews.llvm.org/D87737	2020-09-29 10:43:23 -07:00
Alex Lorenz	119274748b	NFC, add a missing stdlib include for the use of abort The FatalErrorHandler.cpp file uses 'abort', but doesn't include 'stdlib.h'. This causes a build error when modules are used in clang.	2020-09-29 08:50:51 -07:00
Chris Hamilton	d9ee935679	[Sema] Address-space sensitive check for unbounded arrays (v2) Check applied to unbounded (incomplete) arrays and pointers to spot cases where the computed address is beyond the largest possible addressable extent of the array, based on the address space in which the array is delcared, or which the pointer refers to. Check helps to avoid cases of nonsense pointer math and array indexing which could lead to linker failures or runtime exceptions. Of particular interest when building for embedded systems with small address spaces. This is version 2 of this patch -- version 1 had some testing issues due to a sign error in existing code. That error is corrected and lit test for this chagne is extended to verify the fix. Originally reviewed/accepted by: aaron.ballman Original revision: https://reviews.llvm.org/D86796 Reviewed By: ebevhan Differential Revision: https://reviews.llvm.org/D88174	2020-09-29 16:14:48 +02:00
Alexey Bader	9263931fcc	[SYCL] Assume SYCL device functions are convergent SYCL device compiler (similar to other SPMD compilers) assumes that functions are convergent by default to avoid invalid transformations. This attribute can be removed if compiler can prove that function does not have convergent operations. Reviewed By: Naghasan Differential Revision: https://reviews.llvm.org/D87282	2020-09-29 15:23:50 +03:00
Tres Popp	eb9f7c28e5	Revert "OpaquePtr: Add type to sret attribute" This reverts commit `55c4ff91bd`. Issues were introduced as discussed in https://reviews.llvm.org/D88241 where this change made previous bugs in the linker and BitCodeWriter visible.	2020-09-29 10:31:04 +02:00
Ellis Hoag	98ef7e29b0	This reduces code duplication between CGObjCMac.cpp and Mangle.cpp for generating the mangled name of an Objective-C method. This has no intended functionality change. https://reviews.llvm.org/D88329	2020-09-29 02:26:51 -04:00
Dmitry Antipov	bc868da0e7	[Driver] Filter out <libdir>/gcc and <libdir>/gcc-cross if they do not exists Differential Revision: https://reviews.llvm.org/D87901	2020-09-29 09:18:50 +03:00
Johannes Doerfert	4fc69ab002	Revert "[OpenMP][FIX] Verify compatible types for declare variant calls" This reverts commit `c942095790`. One of the tests broke, revert to investigate.	2020-09-29 00:37:11 -05:00
Johannes Doerfert	c942095790	[OpenMP][FIX] Verify compatible types for declare variant calls Especially for templates we need to check at some point if the base function matches the specialization we might call instead. Before this lead to the replacement of `std::sqrt(int(2))` calls with one that converts the argument to a `std::complex<int>`, clearly not the desired behavior. Reported as PR47655 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D88384	2020-09-28 23:26:21 -05:00
Yaxun (Sam) Liu	5a3023a91c	[HIP] Return non-zero value for invalid target ID This is part of https://reviews.llvm.org/D60620	2020-09-28 23:07:39 -04:00
Yaxun (Sam) Liu	187658b8a6	Recommit "[HIP] Change default --gpu-max-threads-per-block value to 1024" Recommit `04abbb3a78`	2020-09-28 22:43:17 -04:00
Yaxun (Sam) Liu	10eb3bf2d4	Skip -fPIE for AMDGPU and HIP toolchain AMDGPU toolchain does not support -fPIE, therefore skip it if specified by driver. Differential Revision: https://reviews.llvm.org/D88425	2020-09-28 22:03:18 -04:00
Richard Smith	c375635d05	Ensure that we don't compute linkage for an anonymous class too early if it has a member whose name is the same as a builtin. Fixes a regression from the introduction of BuiltinAttr.	2020-09-28 17:22:40 -07:00
Jan Korous	6fd8c69049	[clang] Update warning-wall.c test Follow-up to 1e86d637eb4f: [clang] Selectively ena/disa-ble format-insufficient-args warning	2020-09-28 17:19:51 -07:00
Zahira Ammarguellat	efd04721c9	BuildVectorType with a dependent (array) type is crashing the compiler - Fix for PR-47542 Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D88150	2020-09-28 17:10:32 -07:00
Yonghong Song	54d9f743c8	BPF: move AbstractMemberAccess and PreserveDIType passes to EP_EarlyAsPossible Move abstractMemberAccess and PreserveDIType passes as early as possible, right after clang code generation. Currently, compiler may transform the above code p1 = llvm.bpf.builtin.preserve.struct.access(base, 0, 0); p2 = llvm.bpf.builtin.preserve.struct.access(p1, 1, 2); a = llvm.bpf.builtin.preserve_field_info(p2, EXIST); if (a) { p1 = llvm.bpf.builtin.preserve.struct.access(base, 0, 0); p2 = llvm.bpf.builtin.preserve.struct.access(p1, 1, 2); bpf_probe_read(buf, buf_size, p2); } to p1 = llvm.bpf.builtin.preserve.struct.access(base, 0, 0); p2 = llvm.bpf.builtin.preserve.struct.access(p1, 1, 2); a = llvm.bpf.builtin.preserve_field_info(p2, EXIST); if (a) { bpf_probe_read(buf, buf_size, p2); } and eventually assembly code looks like reloc_exist = 1; reloc_member_offset = 10; //calculate member offset from base p2 = base + reloc_member_offset; if (reloc_exist) { bpf_probe_read(bpf, buf_size, p2); } if during libbpf relocation resolution, reloc_exist is actually resolved to 0 (not exist), reloc_member_offset relocation cannot be resolved and will be patched with illegal instruction. This will cause verifier failure. This patch attempts to address this issue by do chaining analysis and replace chains with special globals right after clang code gen. This will remove the cse possibility described in the above. The IR typically looks like %6 = load @llvm.sk_buff:0:50$0:0:0:2:0 %7 = bitcast %struct.sk_buff* %2 to i8* %8 = getelementptr i8, i8* %7, %6 for a particular address computation relocation. But this transformation has another consequence, code sinking may happen like below: PHI = <possibly different @preserve__access_globals> %7 = bitcast %struct.sk_buff %2 to i8* %8 = getelementptr i8, i8* %7, %6 For such cases, we will not able to generate relocations since multiple relocations are merged into one. This patch introduced a passthrough builtin to prevent such optimization. Looks like inline assembly has more impact for optimizaiton, e.g., inlining. Using passthrough has less impact on optimizations. A new IR pass is introduced at the beginning of target-dependent IR optimization, which does: - report fatal error if any reloc global in PHI nodes - remove all bpf passthrough builtin functions Changes for existing CORE tests: - for clang tests, add "-Xclang -disable-llvm-passes" flags to avoid builtin->reloc_global transformation so the test is still able to check correctness for clang generated IR. - for llvm CodeGen/BPF tests, add "opt -O2 <ir_file> \| llvm-dis" command before "llc" command since "opt" is needed to call newly-placed builtin->reloc_global transformation. Add target triple in the IR file since "opt" requires it. - Since target triple is added in IR file, if a test may produce different results for different endianness, two tests will be created, one for bpfeb and another for bpfel, e.g., some tests for relocation of lshift/rshift of bitfields. - field-reloc-bitfield-1.ll has different relocations compared to old codes. This is because for the structure in the test, new code returns struct layout alignment 4 while old code is 8. Align 8 is more precise and permits double load. With align 4, the new mechanism uses 4-byte load, so generating different relocations. - test intrinsic-transforms.ll is removed. This is used to test cse on intrinsics so we do not lose metadata. Now metadata is attached to global and not instruction, it won't get lost with cse. Differential Revision: https://reviews.llvm.org/D87153	2020-09-28 16:56:22 -07:00
David Tenty	ee80615b5c	[clang][driver][AIX] Set compiler-rt as default rtlib Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D88182	2020-09-28 19:45:43 -04:00
Jan Korous	1e86d637eb	[clang] Selectively ena/disa-ble format-insufficient-args warning Differential Revision: https://reviews.llvm.org/D87176	2020-09-28 16:24:50 -07:00
Aaron Ballman	e7549dafcd	Fix a think-o with the numerical suffixes in the docs for init_priority.	2020-09-28 16:52:58 -04:00
Craig Topper	288c5776c9	[X86] Use inlineasm flag output for the _bittest* intrinsics. Instead of expliciting emitting a setc in the inline asm instructions, we can use flag output. This allows the backend to use the flag directly if it is needed by a branch. Previously we needed a test instruction to convert the register back to a flag. If the flag can't be used directly, the backend will emit a setcc. Differential Revision: https://reviews.llvm.org/D87888	2020-09-28 13:33:22 -07:00
Baptiste Saleil	0156914275	[PowerPC] Legalize v256i1 and v512i1 and implement load and store of these types This patch legalizes the v256i1 and v512i1 types that will be used for MMA. It implements loads and stores of these types. v256i1 is a pair of VSX registers, so for this type, we load/store the two underlying registers. v512i1 is used for MMA accumulators. So in addition to loading and storing the 4 associated VSX registers, we generate instructions to prime (copy the VSX registers to the accumulator) after loading and unprime (copy the accumulator back to the VSX registers) before storing. This patch also adds the UACC register class that is necessary to implement the loads and stores. This class represents accumulator in their unprimed form and allow the distinction between primed and unprimed accumulators to avoid invalid copies of the VSX registers associated with primed accumulators. Differential Revision: https://reviews.llvm.org/D84968	2020-09-28 14:39:37 -05:00
Paweł Bylica	0c82fa677f	[python][tests] Fix string comparison with "is"	2020-09-28 21:11:50 +02:00
Vedant Kumar	06bc685fa2	[ubsan] nullability-arg: Fix crash on C++ member pointers Extend -fsanitize=nullability-arg to handle call sites which accept C++ member pointers. rdar://62476022 Differential Revision: https://reviews.llvm.org/D88336	2020-09-28 09:41:18 -07:00
Michael Liao	5dbf80cad9	[clang][codegen] Annotate `correctly-rounded-divide-sqrt-fp-math` fn-attr for OpenCL only. - `-cl-fp32-correctly-rounded-divide-sqrt` is an OpenCL-specific option and `correctly-rounded-divide-sqrt-fp-math` should be added for OpenCL at most. Differential revision: https://reviews.llvm.org/D88303	2020-09-28 11:40:32 -04:00
Haojian Wu	bf890dcb0f	[clang] Don't emit "no member" diagnostic if the lookup fails on an invalid record decl. The "no member" diagnostic is likely bogus. Reviewed By: sammccall, #libc Differential Revision: https://reviews.llvm.org/D86765	2020-09-28 15:10:00 +02:00
David Sherwood	bafdd11326	[SVE] Replace / operator in TypeSize/ElementCount with divideCoefficientBy After some recent upstream discussion we decided that it was best to avoid having the / operator for both ElementCount and TypeSize, since this could give the impression that these classes can be used in the same way as basic integer integer types. However, division for scalable types is a bit odd because we are only dividing the minimum quantity by a value, as opposed to something like: (MinSize * Vscale) / SomeValue This is why when performing division it's important the caller first establishes whether the operation makes sense, perhaps by calling isKnownMultipleOf() prior to division. The caller must now explictly call divideCoefficientBy() on the class to perform the operation. Differential Revision: https://reviews.llvm.org/D87700	2020-09-28 08:03:00 +01:00
Richard Smith	df2a1f2aab	Add profiling support for APValues. For C++20 P0732R2; unused so far. Will be used and tested by a follow-on commit.	2020-09-27 20:05:39 -07:00
Richard Smith	9dcd96f728	Canonicalize declaration pointers when forming APValues. References to different declarations of the same entity aren't different values, so shouldn't have different representations. Recommit of `e6393ee813` with fixed handling for weak declarations. We now look for attributes on the most recent declaration when determining whether a declaration is weak. (Second recommit with further fixes for mishandling of weak declarations. Our behavior here is fundamentally unsound -- see PR47663 -- but this approach attempts to not make things worse.)	2020-09-27 19:05:26 -07:00
Aaron Ballman	de55ebe3bb	Typo fix; NFC	2020-09-27 08:30:41 -04:00
Aaron Puchert	485501899d	Fix sphinx warnings in AttributeReference, NFC The previous attempt in `d34c8c70` didn't help (the problem was missing indentation), and another issue was introduced by `a51d51a0`.	2020-09-27 00:52:36 +02:00
Russell Yanofsky	f702a6fa7c	Thread safety analysis: Improve documentation for ASSERT_CAPABILITY Previous description didn't actually state the effect the attribute has on thread safety analysis (causing analysis to assume the capability is held). Previous description was also ambiguous about (or slightly overstated) the noreturn assumption made by thread safety analysis, implying the assumption had to be true about the function's behavior in general, and not just its behavior in places where it's used. Stating the assumption specifically should avoid a perceived need to disable thread safety analysis in places where only asserting that a specific capability is held would be better. Reviewed By: aaronpuchert, vasild Differential Revision: https://reviews.llvm.org/D87629	2020-09-26 22:16:50 +02:00
Florian Hahn	915310bf14	Revert "[DSE] Switch to MemorySSA-backed DSE by default." There appears to be a mis-compile with MemorySSA-backed DSE in combination with llvm.lifetime.end. It currently appears like DSE is doing the right thing and the llvm.lifetime.end markers are incorrect. The reverted patch uncovers the mis-compile. This patch temporarily switches back to the legacy DSE implementation, while we investigate. This reverts commit `9d172c8e9c`.	2020-09-26 18:35:27 +01:00
Serge Pavlov	f91b9c0f98	Run test on particular target only The test `AST/const-fpfeatures-diag.c` requires setting strict FP semantics, so it fails on targets where support of such semantic is limited.	2020-09-26 20:26:34 +07:00
Serge Pavlov	6314f412a8	[FPEnv] Evaluate constant expressions under non-default rounding modes The change implements evaluation of constant floating point expressions under non-default rounding modes. The main objective was to support evaluation of global variable initializers, where constant rounding mode may be specified by `#pragma STDC FENV_ROUND`. Differential Revision: https://reviews.llvm.org/D87822	2020-09-26 17:59:39 +07:00
Dmitry Antipov	2ca0ea15e5	[Driver] Fix formatting as suggested by clang-format (NFC)	2020-09-26 08:52:51 +03:00
Dmitry Antipov	96318f64a7	[Driver] Perform Linux distribution detection only once Differential Revision: https://reviews.llvm.org/D87187	2020-09-26 08:44:08 +03:00
Shilei Tian	ebb1092a28	[Clang][OpenMP] Added support for nowait target in CodeGen via regular task Previously for nowait target, CG emitted a function call to `__tgt_target_nowait`, etc. However, in OpenMP RTL, these functions just directly call the no-nowait version, which means nowait is not working as expected. OpenMP specification says a target is acutally a target task, which is an untied and detachable task. It is natural to go to the direction that generates a task for a nowait target. However, OpenMP task has a problem that it must be within to a parallel region; otherwise the task will be executed immediately. As a result, if we directly wrap to a regular task, the `target nowait` outside of a parallel region is still a synchronous version. In D77609, I added the support for unshackled task in OpenMP RTL. Basically, unshackled task is a task that is not bound to any parallel region. So all nowait target will be tranformed into an unshackled task. In order to distinguish from regular task, a new flag bit is set for unshackled task. This flag will be used by RTL for later process. Since all target tasks are allocated via `__kmpc_omp_target_task_alloc`, and in current `libomptarget`, `__kmpc_omp_target_task_alloc` just calls `__kmpc_omp_task_alloc`. Therefore, we can modify the flag in `__kmpc_omp_target_task_alloc` so that we don't need to modify the FE too much. If users choose to opt out the feature, they just need to use a RTL w/o support of unshackled threads. As a result, in this patch, the `target nowait` region is simply wrapped into a regular task. Later once we have RTL support for unshackled tasks, the wrapped tasks can be executed by unshackled threads w/o changes in the FE. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D78075	2020-09-25 22:10:36 -04:00
Evandro Menezes	a000580a89	[RISCV] Update driver tests Add the RISC-V Bullet core to the driver tests.	2020-09-25 18:36:53 -05:00
Saleem Abdulrasool	58cdbf518b	Sema: add support for `__attribute__((__swift_private__))` This attribute allows declarations to be restricted to the framework itself, enabling Swift to remove the declarations when importing libraries. This is useful in the case that the functions can be implemented in a more natural way for Swift. This is based on the work of the original changes in `8afaf3aad2` Differential Revision: https://reviews.llvm.org/D87720 Reviewed By: Aaron Ballman	2020-09-25 22:33:53 +00:00
Matt Arsenault	55c4ff91bd	OpaquePtr: Add type to sret attribute Make the corresponding change that was made for byval in `b7141207a4`. Like byval, this requires a bulk update of the test IR tests to include the type before this can be mandatory.	2020-09-25 14:07:30 -04:00
Saleem Abdulrasool	76eb163259	Sema: remove unnecessary parameter for SwiftName handling (NFCI) This code never actually did anything in the implementation. `mergeDeclAttribute` is declared as `static`, and referenced exactly once in the file: from `Sema::mergeDeclAttributes`. `Sema::mergeDeclAttributes` sets `LocalAMK` to `AMK_None`. If the attribute is `DeprecatedAttr`, `UnavailableAttr`, or `AvailabilityAttr` then the `LocalAMK` is updated. However, because we are dealing with a `SwiftNameDeclAttr` here, `LocalAMK` remains `AMK_None`. This is then passed to the function which will as a result pass the value of `AMK_None == AMK_Override` aka `false`. Simply propagate the value through and erase the dead codepath. Thanks to Aaron Ballman for flagging the use of the availability merge kind here leading to this simplification! Differential Revision: https://reviews.llvm.org/D88263 Reviewed By: Aaron Ballman	2020-09-25 17:01:06 +00:00
Vedant Kumar	62c372770d	[profile] Add %t LLVM_PROFILE_FILE option to substitute $TMPDIR Add support for expanding the %t filename specifier in LLVM_PROFILE_FILE to the TMPDIR environment variable. This is supported on all platforms. On Darwin, TMPDIR is used to specify a temporary application-specific scratch directory. When testing apps on remote devices, it can be challenging for the host device to determine the correct TMPDIR, so it's helpful to have the runtime do this work. rdar://68524185 Differential Revision: https://reviews.llvm.org/D87332	2020-09-25 09:39:40 -07:00
Aaron Ballman	a51d51a0d4	Fix some of the more egregious 80-col and whitespace issues; NFC	2020-09-25 10:37:38 -04:00
Aaron Ballman	85cea77ecb	Typo fix; NFC	2020-09-25 10:26:29 -04:00
Benjamin Kramer	6a1bca8798	[Analyzer] Fix unused variable warning in Release builds clang/lib/StaticAnalyzer/Core/ExprEngineCXX.cpp:377:19: warning: unused variable 'Init'	2020-09-25 14:09:43 +02:00
Manuel Klimek	e336b74c99	[clang-format] Add a MacroExpander. Summary: The MacroExpander allows to expand simple (non-resursive) macro definitions from a macro identifier token and macro arguments. It annotates the tokens with a newly introduced MacroContext that keeps track of the role a token played in expanding the macro in order to be able to reconstruct the macro expansion from an expanded (formatted) token stream. Made Token explicitly copy-able to enable copying tokens from the parsed macro definition. Reviewers: sammccall Subscribers: mgorny, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D83296	2020-09-25 14:08:13 +02:00
Chris Bowler	f330d9f163	[PPC] [AIX] Implement calling convention IR for C99 complex types on AIX Add AIX calling convention logic to Clang for C99 complex types on AIX Differential Revision: https://reviews.llvm.org/D88130	2020-09-25 07:43:31 -04:00
Adam Balogh	facad21b29	[Analyzer] Fix for `ExprEngine::computeObjectUnderConstruction()` for base and delegating consturctor initializers For /C++/ constructor initializers `ExprEngine:computeUnderConstruction()` asserts that they are all member initializers. This is not neccessarily true when this function is used to get the return value for the construction context thus attempts to fetch return values of base and delegating constructor initializers result in assertions. This small patch fixes this issue. Differential Revision: https://reviews.llvm.org/D85351	2020-09-25 13:28:22 +02:00
Momchil Velikov	a88c722e68	[AArch64] PAC/BTI code generation for LLVM generated functions PAC/BTI-related codegen in the AArch64 backend is controlled by a set of LLVM IR function attributes, added to the function by Clang, based on command-line options and GCC-style function attributes. However, functions, generated in the LLVM middle end (for example, asan.module.ctor or __llvm_gcov_write_out) do not get any attributes and the backend incorrectly does not do any PAC/BTI code generation. This patch record the default state of PAC/BTI codegen in a set of LLVM IR module-level attributes, based on command-line options: * "sign-return-address", with non-zero value means generate code to sign return addresses (PAC-RET), zero value means disable PAC-RET. * "sign-return-address-all", with non-zero value means enable PAC-RET for all functions, zero value means enable PAC-RET only for functions, which spill LR. * "sign-return-address-with-bkey", with non-zero value means use B-key for signing, zero value mean use A-key. This set of attributes are always added for AArch64 targets (as opposed, for example, to interpreting a missing attribute as having a value 0) in order to be able to check for conflicts when combining module attributed during LTO. Module-level attributes are overridden by function level attributes. All the decision making about whether to not to generate PAC and/or BTI code is factored out into AArch64FunctionInfo, there shouldn't be any places left, other than AArch64FunctionInfo, which directly examine PAC/BTI attributes, except AArch64AsmPrinter.cpp, which is/will-be handled by a separate patch. Differential Revision: https://reviews.llvm.org/D85649	2020-09-25 11:47:14 +01:00
Ian Levesque	7db7a35545	Fix uninitialized XRayArg	2020-09-25 00:20:36 -04:00
Chris Bowler	64b8a633a8	[NFC] [PPC] Add PowerPC expected IR tests for C99 complex Adding this test so that I can extend it in a follow on patch with expected IR for AIX when I implement complex handling in AIXABIInfo. Reviewed By: daltenty, ZarkoCA Differential Revision: https://reviews.llvm.org/D88105	2020-09-24 23:28:40 -04:00
Ian Levesque	6f7fbdd285	[xray] Function coverage groups Add the ability to selectively instrument a subset of functions by dividing the functions into N logical groups and then selecting a group to cover. By selecting different groups over time you could cover the entire application incrementally with lower overhead than instrumenting the entire application at once. Differential Revision: https://reviews.llvm.org/D87953	2020-09-24 22:09:53 -04:00
Richard Smith	8c98c88034	PR47176: Don't read from an inactive union member if a friend function has default arguments and an exception specification.	2020-09-24 19:02:27 -07:00
Reid Kleckner	276f68eace	Revert "Add a static_assert confirming that DiagnosticBuilder is small" This reverts commit `a32feed0db`. This assert doesn't hold in 32-bit builds, I didn't do the math right.	2020-09-24 16:39:46 -07:00
Reid Kleckner	a32feed0db	Add a static_assert confirming that DiagnosticBuilder is small	2020-09-24 16:38:41 -07:00
Reid Kleckner	ecfc9b9712	[MS] For unknown ISAs, pass non-trivially copyable arguments indirectly Passing them directly is likely to be non-conforming, since it usually involves copying the bytes of the record. For unknown architectures, we don't know what MSVC does or will do, but we should at least try to conform as well as we can.	2020-09-24 16:29:48 -07:00
Reid Kleckner	b8a50e9207	[MS] Simplify rules for passing C++ records Regardless of the target architecture, we should always use the C rules (RAA_Default) for records that "canBePassedInRegisters". Those are trivially copyable things, and things marked with [[trivial_abi]]. This should be NFC, although it changes where the final decision about x86_32 overaligned records is made. The current x86_32 C rules say that overaligned things are passed indirectly, so there is no functional difference.	2020-09-24 16:29:47 -07:00
Bill Wendling	c9b53b3bf2	Fix regex in test.	2020-09-24 15:21:28 -07:00
Amy Huang	c8df781e54	[DebugInfo] Fix bug in constructor homing with classes with trivial constructors. This changes the code to avoid using constructor homing for aggregate classes and classes with trivial default constructors, instead of trying to loop through the constructors. Differential Revision: https://reviews.llvm.org/D87808	2020-09-24 14:43:48 -07:00
Bill Wendling	f97b68ef4d	Fix testcase.	2020-09-24 14:34:28 -07:00
Artem Belevich	30514f0afa	[CUDA] Added conversion functions to builtin vars. This is needed to compile some headers in CUDA-11 that assume that threadIdx is implicitly convertible to dim3. With NVCC, threadIdx is uint3 and there's dim3(uint3) constructor. Clang uses a special type for the builtin variables, so that path does not work. Instead, this patch adds conversion function to the builtin variable classes. that will allow them to be converted to dim3 and uint3. Differential Revision: https://reviews.llvm.org/D88250	2020-09-24 14:33:04 -07:00
Bill Wendling	34ca5b3392	Remove stale assert. This is triggered during serialization. The test is for modules, but will occur for any serialization effort using asm goto. Reviewed By: nickdesaulniers, jyknight Differential Revision: https://reviews.llvm.org/D88195	2020-09-24 13:59:42 -07:00
Sam McCall	1ad94624f8	[AST] Use data-recursion when building ParentMap, avoid stack overflow. The following crashes on my system before this patch, but not after: void foo(int i) { switch (i) { case 1: case 2: ... 100000 cases ... ; } } clang-query -c="match stmt(hasAncestor(stmt()))" deep.c I'm not sure it's actually a sane testcase to run though, it's pretty slow :-) Differential Revision: https://reviews.llvm.org/D88222	2020-09-24 22:49:44 +02:00
Alexey Bataev	579c42225a	[OPENMP]Fix PR47621: Variable used by task inside a template function is not made firstprivate by default Need to fix a check for the variable if it is declared in the inner OpenMP region to be able to firstprivatize it. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D88240	2020-09-24 16:18:09 -04:00
Erich Keane	f8a92adfa2	Remove dead branch identified by @rsmith on post-commit for D88236	2020-09-24 13:05:15 -07:00
Volodymyr Sapsai	9eba6b20a0	Revert "[Modules] Add stats to measure performance of building and loading modules." This reverts commit `c4bacc3c9b`. Test "LLVM :: ThinLTO/X86/funcimport-stats.ll" is failing. Reverting now and will recommit after making the test not fail with the added stats.	2020-09-24 12:36:06 -07:00
Volodymyr Sapsai	c4bacc3c9b	[Modules] Add stats to measure performance of building and loading modules. Measure amount of high-level or fixed-cost operations performed during building/loading modules and during header search. High-level operations like building a module or processing a .pcm file are motivated by previous issues where clang was re-building modules or re-reading .pcm files unnecessarily. Fixed-cost operations like `stat` calls are tracked because clang cannot change how long each operation takes but it can perform fewer of such operations to improve the compile time. Also tracking such stats over time can help us detect compile-time regressions. Added stats are more stable than the actual measured compilation time, so expect the detected regressions to be less noisy. rdar://problem/55715134 Reviewed By: aprantl, bruno Differential Revision: https://reviews.llvm.org/D86895	2020-09-24 12:23:47 -07:00
Erich Keane	606a734755	[PR47636] Fix tryEmitPrivate to handle non-constantarraytypes As mentioned in the bug report, tryEmitPrivate chokes on the MaterializeTemporaryExpr in the reproducers, since it assumes that if there are elements, than it must be a ConstantArrayType. However, the MaterializeTemporaryExpr (which matches exactly the AST when it is NOT a global/static) has an incomplete array type. This changes the section where the number-of-elements is non-zero to properly handle non-CAT types by just extracting it as an array type (since all we needed was the element type out of it).	2020-09-24 12:09:22 -07:00
Saleem Abdulrasool	d34c8c70aa	Basic: add an extra newline for sphinx (NFC) This should resolve the "Bullet list ends without a blank line" warning.	2020-09-24 18:51:10 +00:00
Alexey Bataev	cde7d90cc7	Revert "[OPENMP]Fix PR47621: Variable used by task inside a template function is not made firstprivate by default" This reverts commit `d1419c9fda` to fix the buffer overflow detected by address sanitiizer.	2020-09-24 14:42:04 -04:00
Reid Kleckner	b62fd436a3	Revert "Recommit [NFC] Refactor DiagnosticBuilder and PartialDiagnostic" This reverts commit `8e780a1653`. DiagnosticBuilder is a value type, created on the stack everywhere. IMO we should not be adding a vtable to it, and making very operator<< use a virtual interface. There are other feasible designs for implementing this. The original review, D84362, was approved by @tra, who is responsible for Clang's CUDA support, but it wasn't reviewed by @rsmith or anyone responsible for clang's diagnostic library.	2020-09-24 11:16:55 -07:00
Reid Kleckner	3453b6928d	Revert "Recommit "[CUDA][HIP] Defer overloading resolution diagnostics for host device functions"" This reverts commit `e39da8ab6a`. This depends on a change that needs additional design review and needs to be reverted.	2020-09-24 11:16:54 -07:00
Alexey Bataev	d1419c9fda	[OPENMP]Fix PR47621: Variable used by task inside a template function is not made firstprivate by default Need to fix a check for the variable if it is declared in the inner OpenMP region to be able to firstprivatize it. Differential Revision: https://reviews.llvm.org/D88240	2020-09-24 13:51:21 -04:00
Alexey Bataev	a9fca98ee4	[OPENMP]PR47606: Do not update the lastprivate item if it was captured by reference as firstprivate data member. No need to make final copy from the firsptrivate/lastprivate copy to the original item if the item is a data memeber. Firstprivate copy creates a copy by reference and the original item gets updated correctly when updating the lastprivate shared variable. Differential Revision: https://reviews.llvm.org/D88179	2020-09-24 13:14:13 -04:00
Saleem Abdulrasool	296d8832a3	Sema: add support for `__attribute__((__swift_newtype__))` Add the `swift_newtype` attribute which allows a type definition to be imported into Swift as a new type. The imported type must be either an enumerated type (enum) or an object type (struct). This is based on the work of the original changes in `8afaf3aad2` Differential Revision: https://reviews.llvm.org/D87652 Reviewed By: Aaron Ballman	2020-09-24 15:17:35 +00:00
Nathan Froyd	31a3c5fb45	[clang] use string tables for static diagnostic descriptions Using a pointer for the description string in StaticDiagInfoRec causes several problems: 1. We don't need to use a whole pointer to represent the string; 2. The use of pointers incurs runtime relocations for those pointers; the relocations take up space on disk and represent runtime overhead; 3. The need to relocate data implies that, on some platforms, the entire array containing StaticDiagInfoRecs cannot be shared between processes. This patch changes the storage scheme for the diagnostic descriptions to avoid these problems. We instead generate (effectively) one large string and then StaticDiagInfoRec conceptually holds offsets into the string. We elected to also move the storage of those offsets into a separate array to further reduce the space required. On x86-64 Linux, this change removes about 120KB of relocations and moves about 60KB from the non-shareable .data.rel.ro section to shareable .rodata. (The array is about 80KB before this, but we eliminated 4 bytes/entry by using offsets rather than pointers.) We actually reap this benefit twice, because these tables show up in both libclang.so and libclang-cpp.so and we get the reduction in both places. Differential Revision: https://reviews.llvm.org/D81865	2020-09-24 10:54:28 -04:00
Yaxun (Sam) Liu	e39da8ab6a	Recommit "[CUDA][HIP] Defer overloading resolution diagnostics for host device functions" This recommits `7f1f89ec8d` and `40df06cdaf` after fixing memory sanitizer failure.	2020-09-24 08:44:37 -04:00
Alexandre Ganea	f5314d15af	[Support] On Unix, let the CrashRecoveryContext return the signal code Before this patch, the CrashRecoveryContext was returning -2 upon a signal, like ExecuteAndWait does. This didn't match the behavior on Windows, where the the exception code was returned. We now return the signal's code, which optionally allows for re-throwing the signal later. Doing so requires all custom handlers to be removed first, through llvm::sys::unregisterHandlers() which we made a public API. This is part of https://reviews.llvm.org/D70378	2020-09-24 08:21:43 -04:00
Jonas Toth	4e53490047	[NFC][Docs] fix clang-docs compilation	2020-09-24 13:13:38 +02:00
Mikhail Maltsev	8cc842a950	[clang][Sema] Use enumerator instead of hard-coded constant Sema::DiagnoseSwiftName uses the constant 12 instead of the corresponding enumerator ExpectedFunctionWithProtoType. This is fragile and will fail if a new value gets added in the middle of the enum. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D88164	2020-09-24 10:24:22 +01:00
Amy Kwan	6b136b19cb	[Power10] Implement custom codegen for the vec_replace_elt and vec_replace_unaligned builtins. This patch implements custom codegen for the vec_replace_elt and vec_replace_unaligned builtins. These builtins map to the @llvm.ppc.altivec.vinsw and @llvm.ppc.altivec.vinsd intrinsics depending on the arguments. The main motivation for doing custom codegen for these intrinsics is because there are float and double versions of the builtin. Normally, the converting the float to an integer would be done via fptoui in the IR. This is incorrect as fptoui truncates the value and we must ensure the value is not truncated. Therefore, we provide custom codegen to utilize bitcast instead as bitcasts do not truncate. Differential Revision: https://reviews.llvm.org/D83500	2020-09-23 22:55:25 -05:00
Craig Topper	d9717d8ee7	[X86] Add a memory clobber to the bittest intrinsic inline asm. Get default clobbers from the target I believe the inline asm emitted here should have a memory clobber since it writes to memory. It was also missing the dirflag clobber that we use by default along with flags and fpsr. To avoid missing defaults in the future, get the default list from the target Differential Revision: https://reviews.llvm.org/D88121	2020-09-23 14:54:39 -07:00
Yaxun (Sam) Liu	8e780a1653	Recommit [NFC] Refactor DiagnosticBuilder and PartialDiagnostic This recommits `829d14ee0a`. The patch was reverted due to a regression in some CUDA app which was thought to be caused by this patch. However, investigation showed that the regression was due to some other issues, therefore recommit this patch.	2020-09-23 16:55:00 -04:00
Amy Kwan	2e7117f847	[PowerPC] Implement the 128-bit vec_[all\|any]_[eq \| ne \| lt \| gt \| le \| ge] builtins in Clang/LLVM This patch implements the vec_[all\|any]_[eq \| ne \| lt \| gt \| le \| ge] builtins for vector signed/unsigned __int128. Differential Revision: https://reviews.llvm.org/D87910	2020-09-23 16:49:40 -04:00
Albion Fung	88cdbeab41	[PowerPC] Implement Vector signed/unsigned __int128 overloads for the comparison builtins This patch implements Vector signed/unsigned __int128 overloads for the comparison builtins. Differential Revision: https://reviews.llvm.org/D87804	2020-09-23 16:49:40 -04:00
Aaron Ballman	af1d3e6559	Allow init_priority values <= 100 and > 65535 within system headers. This also adds some bare-bones documentation for the attribute rather than leaving it undocumented.	2020-09-23 15:26:50 -04:00
Stanislav Mekhanoshin	59691dc874	[AMDGPU] Make ds fp atomics overloadable Differential Revision: https://reviews.llvm.org/D87947	2020-09-23 11:39:50 -07:00
Sriraman Tallam	7d0bbe4090	Re-apply https://reviews.llvm.org/D87921 , was reverted to triage a PPC bot failure. D87921 was reverted in commit `b89059a313` as it was causing an unknown llvm PPC bot failure. Reapplying the patch after confirming that this is not responsible. Build bot failure: https://reviews.llvm.org/D87921#2286644 which caused the revert. The wrong placement of add pass with optimizations led to -funique-internal-linkage-names being disabled. Fixed the placement of the MPM.addpass for UniqueInternalLinkageNames to make it work correctly with -O2 and new pass manager. Updated the tests to explicitly check O0 and O1. Differential Revision: https://reviews.llvm.org/D87921	2020-09-23 10:28:40 -07:00
Dmitry Antipov	d882ca7f1f	[Driver] Check whether Gentoo-specific configuration directory exists Check whether /etc/env.d/gcc exists before trying to read from any file from there. This saves a few OS calls on a non-Gentoo system. Differential Revision: https://reviews.llvm.org/D87143	2020-09-23 20:25:23 +03:00
Mircea Trofin	271928792e	Add REQUIRES to embed-bitcode-noopt.ll	2020-09-23 10:13:09 -07:00
Mircea Trofin	437358be71	[clang]Test ensuring -fembed-bitcode passed to cc1 captures pre-opt bitcode. This is important to not regress because it allows us to capture pre-optimization bitcode and options, and replay the full optimization pipeline. Differential Revision: https://reviews.llvm.org/D88114	2020-09-23 09:35:28 -07:00
Aaron Ballman	819ff6b945	Improve dynamic AST matching diagnostics for conversion errors Currently, when marshaling a dynamic AST matchers, we check for the type and value validity of matcher arguments at the same time for some matchers. For instance, when marshaling hasAttr("foo"), the argument is first type checked to ensure it's a string and then checked to see if that string can locate an attribute with that name. Similar happens for other enumeration conversions like cast kinds or unary operator kinds. If the type is correct but the value cannot be looked up, we make a best-effort attempt to find a nearby name that the user might have meant, but if one cannot be found, we throw our hands up and claim the types don't match. This has an unfortunate behavior that when the user enters something of the correct type but a best guess cannot be located, you get confusing error messages like: Incorrect type for arg 1. (Expected = string) != (Actual = String). This patch splits the argument check into two parts: if the types don't match, give a type diagnostic. If the type matches but the value cannot be converted, give a best guess diagnostic or a value could not be located diagnostic. This addresses PR47057.	2020-09-23 12:13:36 -04:00
Yaxun (Sam) Liu	e90343ada3	Fix regressioin in test dwp-separate-debug-file.cpp	2020-09-23 11:49:59 -04:00
Yaxun (Sam) Liu	e6d50b4f22	recommit [HIP] Fix -gsplit-dwarf option recommit `e50465ecef` with fix for regression in lldb tests. Two issues: 1. the directory part of original .dwo file was dropped 2. if the stem of the .dwo file contains '.', the last dot and strings after that were removed This recommit fixes those two issues.	2020-09-23 11:20:29 -04:00
YangZhihui	1d1c382ed2	Fix typos in ASTMatchers.h; NFC	2020-09-23 09:09:11 -04:00
Yaxun (Sam) Liu	301e23305d	[CUDA][HIP] Fix static device var used by host code only A static device variable may be accessed in host code through cudaMemCpyFromSymbol etc. Currently clang does not emit the static device variable if it is only referenced by host code, which causes host code to fail at run time. This patch fixes that. Differential Revision: https://reviews.llvm.org/D88115	2020-09-23 08:18:19 -04:00
Gabor Marton	11d2e63ab0	[analyzer][StdLibraryFunctionsChecker] Separate the signature from the summaries The signature should not be part of the summaries as many FIXME comments suggests. By separating the signature, we open up the way to a generic matching implementation which could be used later under the hoods of CallDescriptionMap. Differential Revision: https://reviews.llvm.org/D88100	2020-09-23 10:59:34 +02:00

... 2 3 4 5 6 ...

86205 Commits