llvm-project

Commit Graph

Author	SHA1	Message	Date
Bevin Hansson	9fa7f48459	[Fixed Point] Add fixed-point to floating point cast types and consteval. Reviewed By: leonardchan Differential Revision: https://reviews.llvm.org/D86631	2020-10-13 13:26:56 +02:00
Ties Stuij	208987844f	[ARM] Follow AACPS standard for volatile bit-fields access width This patch resumes the work of D16586. According to the AAPCS, volatile bit-fields should be accessed using containers of the widht of their declarative type. In such case: ``` struct S1 { short a : 1; } ``` should be accessed using load and stores of the width (sizeof(short)), where now the compiler does only load the minimum required width (char in this case). However, as discussed in D16586, that could overwrite non-volatile bit-fields, which conflicted with C and C++ object models by creating data race conditions that are not part of the bit-field, e.g. ``` struct S2 { short a; int b : 16; } ``` Accessing `S2.b` would also access `S2.a`. The AAPCS Release 2020Q2 (https://documentation-service.arm.com/static/5efb7fbedbdee951c1ccf186?token=) section 8.1 Data Types, page 36, "Volatile bit-fields - preserving number and width of container accesses" has been updated to avoid conflict with the C++ Memory Model. Now it reads in the note: ``` This ABI does not place any restrictions on the access widths of bit-fields where the container overlaps with a non-bit-field member or where the container overlaps with any zero length bit-field placed between two other bit-fields. This is because the C/C++ memory model defines these as being separate memory locations, which can be accessed by two threads simultaneously. For this reason, compilers must be permitted to use a narrower memory access width (including splitting the access into multiple instructions) to avoid writing to a different memory location. For example, in struct S { int a:24; char b; }; a write to a must not also write to the location occupied by b, this requires at least two memory accesses in all current Arm architectures. In the same way, in struct S { int a:24; int:0; int b:8; };, writes to a or b must not overwrite each other. ``` I've updated the patch D16586 to follow such behavior by verifying that we only change volatile bit-field access when: - it won't overlap with any other non-bit-field member - we only access memory inside the bounds of the record - avoid overlapping zero-length bit-fields. Regarding the number of memory accesses, that should be preserved, that will be implemented by D67399. Reviewed By: ostannard Differential Revision: https://reviews.llvm.org/D72932	2020-10-13 10:31:48 +01:00
Simon Pilgrim	6c23cbc560	[X86] Convert integer _mm_reduce_* intrinsics to emit llvm.reduction intrinsics (PR47506) Emit the equivalent integer reduction intrinsics in IR instead of expanding to shuffle+arithmetic sequences. The fadd/fmul reductions might be trickier as they assume a similar bisection reduction while the generic intrinsics assume a sequential reduction (intel docs are ambiguous on the correct approach) - I'm not sure if we want to always tag them with reassoc? Anyway, that issue can wait until a separate fp patch along with the fmin/fmax reductions. Differential Revision: https://reviews.llvm.org/D87604	2020-10-13 09:28:39 +01:00
Richard Smith	913f600566	Canonicalize declaration pointers when forming APValues. References to different declarations of the same entity aren't different values, so shouldn't have different representations. Recommit of `e6393ee813`, most recently reverted in `9a33f027ac` due to a bug caused by ObjCInterfaceDecls not propagating availability attributes along their redeclaration chains; that bug was fixed in `e2d4174e9c`.	2020-10-12 19:32:57 -07:00
Richard Smith	e2d4174e9c	Ensure that InheritedAttrs are properly inherited along a redeclaration chain for ObjCInterfaceDecls. Only one such declaration can actually have attributes (the definition, if any), but generally we assume that we can look for InheritedAttrs on the most recent declaration.	2020-10-12 19:22:31 -07:00
Wang, Pengfei	412cdcf2ed	[X86] Add HRESET instruction. For more details about these instructions, please refer to the latest ISE document: https://software.intel.com/en-us/download/intel-architecture-instruction-set-extensions-programming-reference. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D89102	2020-10-13 08:47:26 +08:00
Bruno Cardoso Lopes	cffb0dd54d	[SemaTemplate] Stop passing insertion position around during VarTemplate instantiation They can get stale at use time because of updates from other recursive specializations. Instead, rely on the existence of previous declarations to add the specialization. Differential Revision: https://reviews.llvm.org/D87853	2020-10-12 16:48:50 -07:00
Xun Li	d80ecdf27f	[Coroutine] Rename coro-semmetric-transfer.cpp and possibly fix test failure Some tests start to fail after https://reviews.llvm.org/D89066. It's because the size of pointers are different on different targets. Limit the target in the command so there is no confusion. Also noticed I had typo in the test name. Adding disable-llvm-passes option to make the test more stable as well. Differential Revision: https://reviews.llvm.org/D89269	2020-10-12 15:29:07 -07:00
Duncan P. N. Exon Smith	69feac12d0	Lex: Avoid MemoryBuffer* key in ExcludedPreprocessorDirectiveSkipMapping, NFC This is a prep patch for changing SourceManager to return `Optional<MemoryBufferRef>` instead of `MemoryBuffer`. With that change the address of the MemoryBuffer will be gone, so instead use the start of the buffer as the key for this map. No functionality change intended, as it's expected that the pointer identity matches between the buffers and the buffer data. Radar-Id: rdar://70139990 Differential Revision: https://reviews.llvm.org/D89136	2020-10-12 17:39:01 -04:00
Erich Keane	ac73cafac0	Ensure TreeTransform considers ParmVarDecls as transformed Decls See PR47804: TreeTransform uses TransformedLocalDecls as a map of declarations that have been transformed already. When doing a "TransformDecl", which happens in the cases of updating a DeclRefExpr's target, the default implementation simply returns the already transformed declaration. However, this was not including ParmVarDecls. SO, any use of TreeTransform that didn't re-implement TransformDecl would NOT properly update the target of a DeclRefExpr, resulting in odd behavior. In the case of Typo-recovery, the result was that a lambda that used its own parameter would cause an error, since it thought that the ParmVarDecl referenced was a different lambda. Additionally, this caused a problem in the AST (a declrefexpr into another scope) such that a future instantiation would cause an assertion. This patch ensures that the ParmVarDecl transforming process records into TransformedLocalDecls so that the DeclRefExpr is ALSO updated.	2020-10-12 14:38:04 -07:00
Duncan P. N. Exon Smith	d07b290e4b	DependencyScanning: pull factory function into MinimizedVFS, NFC Avoid need for getBufferPtr API, simplifying another patch. No functionality change.	2020-10-12 17:25:10 -04:00
Mateusz Mikuła	3b1d018c0d	[MinGW][clang-shlib] Build only when LLVM_LINK_LLVM_DYLIB is enabled Otherwise it's easy to hit 2^16 DLL exports limit. Differential Revision: https://reviews.llvm.org/D89225	2020-10-12 23:28:23 +03:00
Arthur Eubanks	9a33f027ac	Revert "Canonicalize declaration pointers when forming APValues." This reverts commit `9dcd96f728`. See https://crbug.com/1134762.	2020-10-12 12:37:24 -07:00
Xun Li	dce8f2bb25	[Coroutine][Sema] Only tighten the suspend call temp lifetime for final awaiter In https://reviews.llvm.org/D87470 I added the change to tighten the lifetime of the expression awaiter.await_suspend().address. Howver it was incorrect. ExprWithCleanups will call the dtor and end the lifetime for all the temps created in the current full expr. When this is called on a normal await call, we don't want to do that. We only want to do this for the call on the final_awaiter, to avoid writing into the frame after the frame is destroyed. This change fixes it, by checking IsImplicit. Differential Revision: https://reviews.llvm.org/D89066	2020-10-12 12:00:20 -07:00
Fangrui Song	012dd42e02	[X86] Support -march=x86-64-v[234] PR47686. These micro-architecture levels are defined in the x86-64 psABI: https://gitlab.com/x86-psABIs/x86-64-ABI/-/commit/77566eb03bc6a326811cb7e9 GCC 11 will support these levels. Note, -mtune=x86-64-v[234] are invalid and __builtin_cpu_is cannot be used on them. Reviewed By: craig.topper, RKSimon Differential Revision: https://reviews.llvm.org/D89197	2020-10-12 10:29:46 -07:00
Mark de Wever	551caec4a8	Make likelihood lit test less brittle Jeremy Morse discovered an issue with the lit test introduced in D88363. The test gives different results for Sony's `-O1`. The test needs to run at `-O1` otherwise the likelihood attribute will be ignored. Instead of running all `-O1` passes it only runs the lower-expect pass which is needed to lower `__builtin_expect`. Differential Revision: https://reviews.llvm.org/D89204	2020-10-12 18:58:21 +02:00
Haojian Wu	8852d30b1c	[AST][RecoveryExpr] Don't perform early typo correction in C. The dependent mechanism for C error-recovery is mostly finished, this is the only place we have missed. Differential Revision: https://reviews.llvm.org/D89045	2020-10-12 11:24:45 +02:00
Haojian Wu	bb406f36dc	[AST][RecoveryExpr] Build dependent callexpr in C for error-recovery. See whole context: https://reviews.llvm.org/D85025 Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D84304	2020-10-12 11:15:01 +02:00
Haojian Wu	702529d899	[clang] Fix returning the underlying VarDecl as top-level decl for VarTemplateDecl. Given the following VarTemplateDecl AST, ``` VarTemplateDecl col:26 X \|-TemplateTypeParmDecl typename depth 0 index 0 `-VarDecl X 'bool' cinit `-CXXBoolLiteralExpr 'bool' true ``` previously, we returned the VarDecl as the top-level decl, which was not correct, the top-level decl should be VarTemplateDecl. Differential Revision: https://reviews.llvm.org/D89098	2020-10-12 10:46:18 +02:00
John McCall	cec49a5836	Revert "[SYCL] Implement __builtin_unique_stable_name." This reverts commit `b5a034e771`. This feature was added without following the proper process.	2020-10-12 01:10:09 -04:00
Jonas Devlieghere	ba2dff0159	Revert "PR47792: Include the type of a pointer or reference non-type template" This reverts commit `849c60541b` because it results in a stage 2 build failure: llvm-project/clang/include/clang/AST/ExternalASTSource.h:409:20: error: definition with same mangled name '_ZN5clang25LazyGenerationalUpdatePtrIPKNS_4DeclEPS1_XadL_ZNS_17ExternalASTSource19CompleteRedeclChainES3_EEE9makeValueERKNS_10ASTContextES4_' as another definition static ValueType makeValue(const ASTContext &Ctx, T Value);	2020-10-11 20:16:46 -07:00
Richard Smith	849c60541b	PR47792: Include the type of a pointer or reference non-type template parameter in its notion of template argument identity. We already did this for all the other kinds of non-type template argument. We're still missing the type from the mangling, so we continue to be able to see collisions at link time; that's an open ABI issue.	2020-10-11 15:59:49 -07:00
Roman Lebedev	544a6aa267	[InstCombine] combineLoadToOperationType(): don't fold int<->ptr cast into load And another step towards transforms not introducing inttoptr and/or ptrtoint casts that weren't there already. As we've been establishing (see D88788/D88789), if there is a int<->ptr cast, it basically must stay as-is, we can't do much with it. I've looked, and the most source of new such casts being introduces, as far as i can tell, is this transform, which, ironically, tries to reduce count of casts.. On vanilla llvm test-suite + RawSpeed, @ `-O3`, this results in -33.58% less `IntToPtr`s (19014 -> 12629) and +76.20% more `PtrToInt`s (18589 -> 32753), which is an increase of +20.69% in total. However just on RawSpeed, where i know there are basically none `IntToPtr` in the original source code, this results in -99.27% less `IntToPtr`s (2724 -> 20) and +82.92% more `PtrToInt`s (4513 -> 8255). which is again an increase of 14.34% in total. To me this does seem like the step in the right direction, we end up with strictly less `IntToPtr`, but strictly more `PtrToInt`, which seems like a reasonable trade-off. See https://reviews.llvm.org/D88860 / https://reviews.llvm.org/D88995 for some more discussion on the subject. (Eventually, `CastInst::isNoopCast()`/`CastInst::isEliminableCastPair` should be taught about this, yes) Reviewed By: nlopes, nikic Differential Revision: https://reviews.llvm.org/D88979	2020-10-11 20:24:28 +03:00
Fangrui Song	cbe4d973ed	[X86] Define __LAHF_SAHF__ if feature 'sahf' is set or 32-bit mode GCC 11 will define this macro. In LLVM, the feature flag only applies to 64-bit mode and we always define the macro in 32-bit mode. This is different from GCC -m32 in which -mno-sahf can suppress the macro. The discrepancy can unlikely cause trouble. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D89198	2020-10-11 09:46:00 -07:00
Zinovy Nis	32d565b461	[clang-tidy] Fix crash in readability-function-cognitive-complexity on weak refs Fix for https://bugs.llvm.org/show_bug.cgi?id=47779 Differential Revision: https://reviews.llvm.org/D89194	2020-10-11 18:52:38 +03:00
Tim Renouf	666ef0db20	[AMDGPU] Add gfx602, gfx705, gfx805 targets At AMD, in an internal audit of our code, we found some corner cases where we were not quite differentiating targets enough for some old hardware. This commit is part of fixing that by adding three new targets: * The "Oland" and "Hainan" variants of gfx601 are now split out into gfx602. LLPC (in the GPUOpen driver) and other front-ends could use that to avoid using the shaderZExport workaround on gfx602. * One variant of gfx703 is now split out into gfx705. LLPC and other front-ends could use that to avoid using the shaderSpiCsRegAllocFragmentation workaround on gfx705. * The "TongaPro" variant of gfx802 is now split out into gfx805. TongaPro has a faster 64-bit shift than its former friends in gfx802, and a subtarget feature could be set up for that to take advantage of it. This commit does not make that change; it just adds the target. V2: Add clang changes. Put TargetParser list in order. V3: AMDGCNGPUs table in TargetParser.cpp needs to be in GPUKind order, so fix the GPUKind order. Differential Revision: https://reviews.llvm.org/D88916 Change-Id: Ia901a7157eb2f73ccd9f25dbacec38427312377d	2020-10-10 17:22:22 +01:00
Thomas Lively	d8f58bf53a	[WebAssembly] Prototype i16x8.q15mulr_sat_s This saturating, rounding, Q-format multiplication instruction is proposed in https://github.com/WebAssembly/simd/pull/365. Differential Revision: https://reviews.llvm.org/D88968	2020-10-09 21:17:53 +00:00
Saleem Abdulrasool	5d74c43511	DirectoryWatcher: add an implementation for Windows This implements the directory watcher on Windows. It does the most naive thing for simplicity. ReadDirectoryChangesW is used to monitor the changes. However, in order to support interruption, we must use overlapped IO, which allows us to use the blocking, synchronous mechanism. We create a thread to post the notification to the consumer to allow the monitoring to continue. The two threads communicate via a locked queue. Differential Revision: https://reviews.llvm.org/D88666 Reviewed By: Adrian McCarthy	2020-10-09 20:55:57 +00:00
Scott Linder	40cef5a00e	[clang] Add a test for CGDebugInfo treatment of blocks There doesn't seem to be a direct test of this, and I'm planning to make future changes which will affect it. I'm not particularly familiar with the blocks extension, so suggestions for better tests are welcome. Differential Revision: https://reviews.llvm.org/D88754	2020-10-09 19:03:21 +00:00
Louis Dionne	a3a2431608	[clang] Don't look into <sysroot> for C++ headers if they are found alongside the toolchain Currently, Clang looks for libc++ headers alongside the installation directory of Clang, and it also adds a search path for headers in the -isysroot. This is problematic if headers are found in both the toolchain and in the sysroot, since #include_next will end up finding the libc++ headers in the sysroot instead of the intended system headers. This patch changes the logic such that if the toolchain contains libc++ headers, no C++ header paths are added in the sysroot. However, if the toolchain does not contain libc++ headers, the sysroot is searched as usual. This should not be a breaking change, since any code that previously relied on some libc++ headers being found in the sysroot suffered from the #include_next issue described above, which renders any libc++ header basically useless. Differential Revision: https://reviews.llvm.org/D89001	2020-10-09 12:41:41 -04:00
Anastasia Stulova	71d3b7ec7b	[OpenCL] Add new compilation mode for OpenCL 3.0. Extended -cl-std/std flag with CL3.0 and added predefined version macros. Patch by Anton Zabaznov (azabaznov)! Tags: #clang Differential Revision: https://reviews.llvm.org/D88300	2020-10-09 15:28:38 +01:00
Fanbo Meng	d91234b21c	[SystemZ][z/OS] Update target specific __attribute__((aligned)) value for test z/OS defaults to 16 bytes for __attribute__((aligned)), modify the test to differentiate between z/OS and Linux on s390x. Reviewed By: abhina.sreeskantharajan Differential Revision: https://reviews.llvm.org/D89127	2020-10-09 10:14:44 -04:00
Stefan Pintilie	0741a2c9ca	[Clang][unittests][NFC] Break up test in Callbacks.cpp The Callbacks.cpp test was taking a long time to compile on some build bots causing timeouts. This patch splits up that test into five separate cpp files and a header file. Reviewed By: gribozavr2 Differential Revision: https://reviews.llvm.org/D88886	2020-10-09 08:53:50 -05:00
Konstantin Zhuravlyov	b581c5a42f	Regenerate ClangCommandLineReference.rst Differential Revision: https://reviews.llvm.org/D89091	2020-10-09 08:29:53 -04:00
Ben Dunbobbin	bb148ad426	[windows-itanium] make dllimport/export handling closer to MS behavior Differential Revision: https://reviews.llvm.org/D86828	2020-10-09 13:24:07 +01:00
Ben Dunbobbin	a9f1bb92bf	Revert "first upstream review" Pushed by accident :( This reverts commit `4eb627ed96`.	2020-10-09 13:22:46 +01:00
Ben Dunbobbin	4eb627ed96	first upstream review	2020-10-09 13:21:03 +01:00
Jonas Toth	e517e5cfec	[clang] improve accuracy of ExprMutAnalyzer This patch extracts the ExprMutAnalyzer changes from https://reviews.llvm.org/D54943 into its own revision for simpler review and more atomic changes. The analysis results are improved. Nested expressions (e.g. conditional operators) are now detected properly. Some edge cases, especially template induced imprecisions are improved upon. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D88088	2020-10-09 13:45:32 +02:00
Nithin Vadukkumchery Rajendrakumar	0b4fe8086f	[Analyzer] Fix for dereferece of smart pointer after branching on unknown inner pointer Summary: Enabling warning after dereferece of smart pointer after branching on unknown inner pointer. Reviewers: NoQ, Szelethus, vsavchenko, xazax.hun Reviewed By: NoQ Subscribers: martong, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D87043	2020-10-09 13:42:25 +02:00
Dávid Bolvanský	caf28b0a12	[Diagnostics] Diagnose -Wsizeof-array-div for array of pointers Differential Revision: https://reviews.llvm.org/D87990	2020-10-09 12:56:06 +02:00
Bevin Hansson	9c26eb8b91	Refactor fixed point conversion test. Differential Revision: https://reviews.llvm.org/D88648	2020-10-09 10:27:42 +02:00
Liu, Chen3	26cfb6e562	[X86] Passing union type through register For example: union M256 { double d; __m256 m; }; extern void foo1(union M256 A); union M256 m1; void test() { foo1(m1); } clang will pass m1 through stack which does not follow the ABI. Differential Revision: https://reviews.llvm.org/D78699	2020-10-09 11:24:29 +08:00
Ben Barham	fbb499ef25	[AST] Fix crashes caused by redeclarations in hidden prototypes ObjCContainerDecl.getMethod returns a nullptr by default when the container is a hidden prototype. Callsites where the method is being looked up on the redeclaration's own container should skip this check since they (rightly) expect a valid method to be found. Resolves rdar://69444243 Reviewed By: akyrtzi Differential Revision: https://reviews.llvm.org/D89024	2020-10-08 19:48:36 -07:00
Alexandre Ganea	66face6aa0	Re-land [DebugInfo] Add debug location to stubs generated by CGDeclCXX and mark them as artificial Previously, when clang was compiled with -DLLVM_ENABLE_ASSERTIONS=ON, the added tests were displaying: inlinable function call in a function with debug info must have a !dbg location call void @"??1?$c@UB@@@@QEAA@XZ"(%struct.c* @"?f@?1??d@@YAPEAU?$c@UB@@@@XZ@4U2@A") fatal error: error in backend: Broken module found, compilation aborted! Stack dump: 0. Program arguments: <f:\svn\buildninja\bin\clang -cc1 -emit-llvm debug-info-no-location.cpp> -gcodeview -debug-info-kind=limited 1. <eof> parser at end of file 2. Per-function optimization Fixes PR43012 Differential Revision: https://reviews.llvm.org/D66328	2020-10-08 20:49:17 -04:00
Richard Smith	d1751d14a6	PR47175: Ensure type-dependent function-style casts have dependent types. Previously, a type-dependent cast to a deduced class template specialization type would end up with a non-dependent class template specialization type, leading to confusion downstream.	2020-10-08 17:00:22 -07:00
Arthur Eubanks	afff74e5c2	[HWAsan][NewPM] Handle hwasan like other sanitizers Move it as an EP callback (-O[123]) or in addSanitizersAtO0. This makes it not run in ThinLTO pre-link (like the other sanitizers), so don't check LTO runs in hwasan-new-pm.c. Changing its position also seems to change the generated IR. I think we just need to make sure the pass runs. Reviewed By: leonardchan Differential Revision: https://reviews.llvm.org/D88936	2020-10-08 14:43:21 -07:00
Simon Pilgrim	8568113101	Fix Wparentheses warning. NFCI. Wrap the containErrors() calls together - assert we have any containErrors cases in the conditional operator.	2020-10-08 20:02:19 +01:00
Joseph Huber	3cc1f1fc1d	[OpenMP] Replace OpenMP RTL Functions With OMPIRBuilder and OMPKinds.def Summary: Replace the OpenMP Runtime Library functions used in CGOpenMPRuntimeGPU for OpenMP device code generation with ones in OMPKinds.def and use OMPIRBuilder for generating runtime calls. This allows us to consolidate more OpenMP code generation into the OMPIRBuilder. Future additions to the GPU runtime functions should now go in OMPKinds.def Reviewers: jdoerfert Subscribers: aaron.ballman cfe-commits guansong llvm-commits sstefan1 yaxunl Tags: #OpenMP #LLVM #clang Differential Revision: https://reviews.llvm.org/D88430	2020-10-08 14:00:22 -04:00
David Green	a15bd0bfc2	[AIX] Add REQUIRES for powerpc test. NFC	2020-10-08 18:40:09 +01:00
Leonard Chan	64c0792946	[clang][feature] Add cxx_abi_relative_vtable feature This will be enabled if relative vtables is enabled. Differential revision: https://reviews.llvm.org/D85924	2020-10-08 10:30:54 -07:00
Joseph Huber	d564409946	[OpenMP] Change CMake Configuration to Build for Highest CUDA Architecture by Default Summary: This patch changes the CMake files for Clang and Libomptarget to query the system for its supported CUDA architecture. This makes it much easier for the user to build optimal code without needing to set the flags manually. This relies on the now deprecated FindCUDA method in CMake, but full support for architecture detection is only availible in CMake >3.18 Reviewers: jdoerfert ye-luo Subscribers: cfe-commits guansong mgorny openmp-commits sstefan1 yaxunl Tags: #clang #OpenMP Differential Revision: https://reviews.llvm.org/D87946	2020-10-08 12:09:34 -04:00
Geoff Levner	b9225543e8	DeferredDiagnosticsEmitter crashes Patch VisitCXXDeleteExpr() in clang::UsedDeclVisitor to avoid it crashing when the expression's destroyed type is null. According to the comments in CXXDeleteExpr::getDestroyedType(), this can happen when the type to delete is a dependent type. Patch by Geoff Levner. Differential Revision: https://reviews.llvm.org/D88949	2020-10-08 11:42:21 -04:00
diggerlin	92bca12843	[AIX] add new option -mignore-xcoff-visibility SUMMARY: In IBM compiler xlclang , there is an option -fnovisibility which suppresses visibility. For more details see: https://www.ibm.com/support/knowledgecenter/SSGH3R_16.1.0/com.ibm.xlcpp161.aix.doc/compiler_ref/opt_visibility.html. We need to add the option -mignore-xcoff-visibility for compatibility with the IBM AIX OS (as the option is enabled by default in AIX). With this option llvm does not emit any visibility attribute to ASM or XCOFF object file. The option only work on the AIX OS, for other non-AIX OS using the option will report an unsupported options error. In AIX OS: 1.1 the option -mignore-xcoff-visibility is enabled by default , if there is not -fvisibility=* and -mignore-xcoff-visibility explicitly in the clang command . 1.2 if there is -fvisibility=* explicitly but not -mignore-xcoff-visibility explicitly in the clang command. it will generate visibility attributes. 1.3 if there are both -fvisibility=* and -mignore-xcoff-visibility explicitly in the clang command. The option "-mignore-xcoff-visibility" wins , it do not emit the visibility attribute. The option -mignore-xcoff-visibility has no effect on visibility attribute when compile with -emit-llvm option to generated LLVM IR. Reviewer: daltenty,Jason Liu Differential Revision: https://reviews.llvm.org/D87451	2020-10-08 09:34:58 -04:00
Joseph Huber	6668e4cc68	[OpenMP] Add Error Handling for Conflicting Pointer Sizes for Target Offload Summary: This patch adds an error to Clang that detects if OpenMP offloading is used between two architectures with incompatible pointer sizes. This ensures that the data mapping can be done correctly and solves an issue in code generation generating the wrong size pointer. Reviewer: jdoerfert Subscribers: cfe-commits delcypher guansong llvm-commits sstefan1 yaxunl Tags: #OpenMP #Clang Differential Revision: https://reviews.llvm.org/D88594	2020-10-08 08:20:38 -04:00
Serge Pavlov	70bf35070a	[Driver] Add output file to properties of Command Object of class `Command` contains various properties of a command to execute, but output file was missed from them. This change adds this property. It is required for reporting consumed time and memory implemented in D78903 and may be used in other cases too. Differential Revision: https://reviews.llvm.org/D78902	2020-10-08 18:23:39 +07:00
Haojian Wu	a96bcfb196	[AST][RecoveryExpr] Support dependent cast-expr in C for error-recovery. Suppress spurious "typecheck_cond_expect_scalar_operand" diagnostic. See whole context: https://reviews.llvm.org/D85025 Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D84387	2020-10-08 10:00:29 +02:00
Serge Guelton	b4ffc40d62	Update documentation and implementation of stage3 build Have the build work out of the box by forcing an LLD build. That way, we don't require an external LTO-aware linker, as we build one. Also remove reference to the seemingly dead builder. Differential Revision: https://reviews.llvm.org/D88990	2020-10-08 07:55:37 +02:00
Dominic Chen	c102488293	Add test for disabling Dead Virtual Function Elimination Differential Revision: https://reviews.llvm.org/D88349	2020-10-07 19:20:16 -04:00
Simon Pilgrim	42d91438ad	[CodeGen][X86] Cleanup labels on some sse/avx intrinsics tests. NFCI. Add some missing CHECK-LABEL lines. Remove leading '@' so it'll be possible to match against c and c++ builds in a future patch.	2020-10-07 19:33:14 +01:00
Alex Richardson	ff6e4441b9	[clang-format][tests] Fix MacroExpander lexer not parsing C++ keywords While debugging a different clang-format failure, I tried to reuse the MacroExpander lexer, but was surprised to see that it marks all C++ keywords (e.g. const, decltype) as being of type identifier. After stepping through the ::format() code, I noticed that the difference between these two is that the identifier table was not being initialized based on the FormatStyle, so only basic tokens such as tok::semi, tok::plus, etc. were being handled. Reviewed By: klimek Differential Revision: https://reviews.llvm.org/D88952	2020-10-07 17:17:41 +01:00
Alex Richardson	0a3c82e85b	[clang-format][NFC] Store FormatToken::Type as an enum instead of bitfield This improves the debugging experience since LLDB will print the enumerator name instead of a decimal number. This changes TokenType to have uint8_t as the underlying type and moves it after the remaining bitfields to avoid increasing the size of FormatToken. Reviewed By: MyDeveloperDay Differential Revision: https://reviews.llvm.org/D87006	2020-10-07 17:17:40 +01:00
Fanbo Meng	9908ee5670	[SystemZ][z/OS] Add test of zero length bitfield type size larger than target zero length bitfield boundary Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D88963	2020-10-07 11:34:13 -04:00
Haojian Wu	31dc908017	[clang] Use isCompoundAssignmentOp to simplify the code, NFC.	2020-10-07 09:50:43 +02:00
Haojian Wu	334ec6f807	[AST][RecoveryExpr] Support dependent conditional operators in C for error recovery. suppress spurious "typecheck_cond_expect_scalar" diagnostic. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D84322	2020-10-07 09:33:57 +02:00
Johannes Doerfert	5a3f6bfe8a	Reapply "[OpenMP][FIX] Verify compatible types for declare variant calls" D88384 This reapplies D88384 with the minor modification that an assertion was changed to a regular conditional and graceful exit from ASTContext::mergeTypes.	2020-10-07 00:06:51 -05:00
Richard Smith	00d3e6c1b4	[c++17] Implement P0145R3 during constant evaluation. Ensure that we evaluate assignment and compound-assignment right-to-left, and array subscripting left-to-right. Fixes PR47724. This is a re-commit of `ded79be`, reverted in `37c74df`, with a fix and test for the crasher bug previously introduced.	2020-10-06 12:30:26 -07:00
Fanbo Meng	43cd0a98d1	[SystemZ][z/OS] Set default alignment rules for z/OS target Update RUN line to fix lit failure Differential Revision: https://reviews.llvm.org/D88845	2020-10-06 14:21:21 -04:00
Fanbo Meng	c781dc74a8	[SystemZ][z/OS] Set default alignment rules for z/OS target Set the default alignment control variables for z/OS target and add test case for alignment rules on z/OS. Reviewed By: abhina.sreeskantharajan Differential Revision: https://reviews.llvm.org/D88845	2020-10-06 13:16:15 -04:00
Aaron En Ye Shi	8d2a0c115e	[HIP] NFC Add comments to cmath functions Add missing comments to cmath functions. Differential Revision: https://reviews.llvm.org/D88837	2020-10-06 15:26:56 +00:00
Aaron En Ye Shi	aa2b593f14	[HIP] Restructure hip headers to add cmath Separate __clang_hip_math.h header into __clang_hip_cmath.h and __clang_hip_math.h. Improve the math function definition, and add missing definitions or declarations. Add missing overloads. Reviewed By: tra, JonChesterfield Differential Review: https://reviews.llvm.org/D88837	2020-10-06 14:48:53 +00:00
Shivanshu Goyal	66e4f07198	Add ability to turn off -fpch-instantiate-templates in clang-cl A lot of our code building with clang-cl.exe using Clang 11 was failing with the following 2 type of errors: 1. explicit specialization of 'foo' after instantiation 2. no matching function for call to 'bar' Note that we also use -fdelayed-template-parsing in our builds. I tried pretty hard to get a small repro for these failures, but couldn't. So there is some subtle edge case in the -fpch-instantiate-templates feature introduced by this change: https://reviews.llvm.org/D69585 When I tried turning this off using -fno-pch-instantiate-templates, builds would silently fail with the same error without any indication that -fno-pch-instantiate-templates was being ignored by the compiler. Then I realized this "no" option wasn't actually working when I ran Clang under a debugger. Differential revision: https://reviews.llvm.org/D88680	2020-10-06 16:23:23 +02:00
Dmitri Gribenko	37c74dfe72	Revert "[c++17] Implement P0145R3 during constant evaluation." This reverts commit `ded79be635`. It causes a crash (I sent the crash reproducer directly to the author).	2020-10-06 15:49:44 +02:00
Chuyang Chen	8fa45e1fd5	Convert diagnostics about multi-character literals from extension to warning This addresses PR46797.	2020-10-06 08:47:17 -04:00
David Spickett	f0a78bdfdc	[AArch64] Correct parameter type for unsigned Neon scalar shift intrinsics In the following intrinsics the shift amount (parameter 2) should be signed. vqshlb_u8 vqshlh_u16 vqshls_u32 vqshld_u64 vqrshlb_u8 vqrshlh_u16 vqrshls_u32 vqrshld_u64 vshld_u64 vrshld_u64 See https://developer.arm.com/documentation/ihi0073/latest Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D88013	2020-10-06 11:34:58 +01:00
Pushpinder Singh	3a12ff0dac	[OpenMP][RTL] Remove dead code RequiresDataSharing was always 0, resulting dead code in device runtime library. Reviewed By: jdoerfert, JonChesterfield Differential Revision: https://reviews.llvm.org/D88829	2020-10-06 05:43:47 -04:00
Haojian Wu	70d9dc8674	[AST][RecoveryExpr] Support dependent binary operator in C for error recovery. see the whole context in: https://reviews.llvm.org/D85025 Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D84226	2020-10-06 08:53:31 +02:00
Richard Smith	ded79be635	[c++17] Implement P0145R3 during constant evaluation. Ensure that we evaluate assignment and compound-assignment right-to-left, and array subscripting left-to-right. Fixes PR47724.	2020-10-05 19:04:14 -07:00
Richard Smith	ebf6fd633e	Make OpenMP tests less brittle in the face of changes in constant evaluation diagnostics.	2020-10-05 19:04:14 -07:00
Evandro Menezes	a48d480e1f	[RISCV] Fix broken test Fix test for the SiFive E76 core. This patch fixes the issue introduced by the commit `5d6d8a2769`.	2020-10-05 19:28:31 -05:00
Roman Lebedev	e00f189d39	[InstCombine] Revert rL226781 "Teach InstCombine to canonicalize loads which are only ever stored to always use a legal integer type if one is available." (PR47592) (it was introduced in https://lists.llvm.org/pipermail/llvm-dev/2015-January/080956.html) This canonicalization seems dubious. Most importantly, while it does not create `inttoptr` casts by itself, it may cause them to appear later, see e.g. D88788. I think it's pretty obvious that it is an undesirable outcome, by now we've established that seemingly no-op `inttoptr`/`ptrtoint` casts are not no-op, and are no longer eager to look past them. Which e.g. means that given ``` %a = load i32 %b = inttoptr %a %c = inttoptr %a ``` we likely won't be able to tell that `%b` and `%c` is the same thing. As we can see in D88789 / D88788 / D88806 / D75505, we can't really teach SCEV about this (not without the https://bugs.llvm.org/show_bug.cgi?id=47592 at least) And we can't recover the situation post-inlining in instcombine. So it really does look like this fold is actively breaking otherwise-good IR, in a way that is not recoverable. And that means, this fold isn't helpful in exposing the passes that are otherwise unaware of these patterns it produces. Thusly, i propose to simply not perform such a canonicalization. The original motivational RFC does not state what larger problem that canonicalization was trying to solve, so i'm not sure how this plays out in the larger picture. On vanilla llvm test-suite + RawSpeed, this results in increase of asm instructions and final object size by ~+0.05% decreases final count of bitcasts by -4.79% (-28990), ptrtoint casts by -15.41% (-3423), and of inttoptr casts by -25.59% (-6919, sic). Overall, there's -0.04% less IR blocks, -0.39% instructions. See https://bugs.llvm.org/show_bug.cgi?id=47592 Differential Revision: https://reviews.llvm.org/D88789	2020-10-06 00:00:30 +03:00
Evandro Menezes	5d6d8a2769	[RISCV] Add SiFive cores to the CPU option Add the SiFive cores E76 and U74 using the SiFive 7 series microarchitecture. Differential Revision: https://reviews.llvm.org/D88759	2020-10-05 15:50:57 -05:00
Fangrui Song	a2cc883368	[CUDA] Don't call __cudaRegisterVariable on C++17 inline variables D17779: host-side shadow variables of external declarations of device-side global variables have internal linkage and are referenced by `__cuda_register_globals`. nvcc from CUDA 11 does not allow `__device__ inline` or `__device__ constexpr` (C++17 inline variables) but clang has incorrectly supported them for a while: ``` error: A __device__ variable cannot be marked constexpr error: An inline __device__/__constant__/__managed__ variable must have internal linkage when the program is compiled in whole program mode (-rdc=false) ``` If such a variable (which has a comdat group) is discarded (a copy from another translation unit is prevailing and selected), accessing the variable from outside the section group (`__cuda_register_globals`) is a violation of the ELF specification and will be rejected by linkers: > A symbol table entry with STB_LOCAL binding that is defined relative to one of a group's sections, and that is contained in a symbol table section that is not part of the group, must be discarded if the group members are discarded. References to this symbol table entry from outside the group are not allowed. As a workaround, don't register such inline variables for now. (If we register the variables in all TUs, we will keep multiple instances of the shadow and break the C++ semantics for inline variables). We should reject such variables in Sema but our internal users need some time to migrate. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D88786	2020-10-05 12:53:59 -07:00
Saleem Abdulrasool	85d5064000	docs: add documentation describing API Notes API Notes are a feature which allows annotation of headers by an auxiliary file that contains metadata for declarations pertaining to the associated module. This enables adding attributes to declarations without requiring modification of the headers, enabling finer grained control for library headers for consumers without having to modify external headers. Differential Revision: https://reviews.llvm.org/D88446 Reviewed By: Richard Smith, Marcel Hlopko	2020-10-05 18:29:13 +00:00
Andrzej Warzynski	8d51d37e06	[flang] Introduce DiagnosticConsumer classes in libflangFrontend Currently Flang uses TextDiagnostic, TextDiagnosticPrinter & TestDiagnosticBuffer classes from Clang (more specifically, from libclangFrontend). This patch introduces simplified equivalents of these classes in Flang (i.e. it removes the dependency on libclangFrontend). Flang only needs these diagnostics classes for the compiler driver diagnostics. This is unlike in Clang in which similar diagnostic classes are used for e.g. Lexing/Parsing/Sema diagnostics. For this reason, the implementations introduced here are relatively basic. We can extend them in the future if this is required. This patch also enhances how the diagnostics are printed. In particular, this is the diagnostic that you'd get _before_ the changes introduced here (no text formatting): ``` $ bin/flang-new error: no input files ``` This is the diagnostic that you get _after_ the changes introduced here (in terminals that support it, the text is formatted - bold + red): ``` $ bin/flang-new flang-new: error: no input files ``` Tests are updated accordingly and options related to enabling/disabling color diagnostics are flagged as supported by Flang. Reviewed By: sameeranjoshi, CarolineConcatto Differential Revision: https://reviews.llvm.org/D87774	2020-10-05 17:46:44 +01:00
Joseph Huber	1dce692de1	Revert "[OpenMP] Add Error Handling for Conflicting Pointer Sizes for Target Offload" Reverting because detecting architecture size doesn't work on all platforms. This reverts commit `eaf73293cb`.	2020-10-05 12:35:39 -04:00
Joseph Huber	eaf73293cb	[OpenMP] Add Error Handling for Conflicting Pointer Sizes for Target Offload Summary: This patch adds an error to Clang that detects if OpenMP offloading is used between two architectures with incompatible pointer sizes. This ensures that the data mapping can be done correctly and solves an issue in code generation generating the wrong size pointer. This patch adds a new lit substitution, %omp_powerpc_triple that, if the system is 32-bit or 64-bit, sets the powerpc triple accordingly. This was required to fix some OpenMP tests that automatically populated the target architecture. Reviewers: jdoerfert Subscribers: cfe-commits guansong sstefan1 yaxunl delcypher Tags: OpenMP clang LLVM Differential Revision: https://reviews.llvm.org/D88594	2020-10-05 11:02:13 -04:00
Simon Pilgrim	7a932f4f4c	[Parser] ParseMicrosoftAsmStatement - Replace bit '\|' operator with logical '\|\|' operator. (PR47071) Fixes static analysis warning.	2020-10-05 14:23:28 +01:00
Gabor Marton	007dd12d54	[ASTImporter][AST] Fix structural equivalency crash on dependent FieldDecl Differential Revision: https://reviews.llvm.org/D88665	2020-10-05 14:06:09 +02:00
Haojian Wu	7f05fe1aee	[AST][RecoveryExpr] Fix a crash on undeduced type. We should not capture the type if the function return type is undeduced. Reviewed By: adamcz Differential Revision: https://reviews.llvm.org/D87350	2020-10-05 12:52:04 +02:00
Haojian Wu	3423d5c9da	[AST][RecoveryExpr] Popagate the error-bit from a VarDecl's initializer to DeclRefExpr. The error-bit was missing, if a DeclRefExpr (which refers to a VarDecl with a contains-errors initializer). It could cause different violations in clang -- the DeclRefExpr is value-dependent, but not contains-errors, `ABC<DeclRefExpr>` could produce a non-error and non-dependent type in non-template context, which will lead to crashes in constexpr evaluation. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D86048	2020-10-05 10:35:29 +02:00
Yaxun (Sam) Liu	e372c1d762	[HIP] Fix -fgpu-allow-device-init option The option needs to be passed to both host and device compilation. Differential Revision: https://reviews.llvm.org/D88550	2020-10-04 22:13:05 -04:00
Yaxun (Sam) Liu	5b551b79d3	[HIP] Fix default output file for -E By convention the default output file for -E is "-" (stdout). This is expected by tools like ccache, which uses output of -E to determine if a file and its dependence has changed. Currently clang does not use stdout as default output file for -E for HIP, which causes ccache not working. This patch fixes that. Differential Revision: https://reviews.llvm.org/D88730	2020-10-04 22:03:16 -04:00
Yaxun (Sam) Liu	9756a402f2	Recommit "[HIP] Add option --gpu-instrument-lib=" recommit `64f7790e7d` after fixing hip-device-libs.hip.	2020-10-04 21:41:43 -04:00
Yaxun (Sam) Liu	fef0ebbc0b	Revert "[HIP] Add option --gpu-instrument-lib=" This reverts commit `64f7790e7d` due to regression in hip-device-libs.hip.	2020-10-04 21:27:29 -04:00
Yaxun (Sam) Liu	64f7790e7d	[HIP] Add option --gpu-instrument-lib= Add an option --gpu-instrument-lib= to allow users to specify an instrument device library. This is for supporting -finstrument in device code for debugging/profiling tools. Differential Revision: https://reviews.llvm.org/D88557	2020-10-04 21:16:36 -04:00
Yuanfang Chen	2c94d88e07	[NewPM] collapsing nested pass mangers of the same type This is one of the reason for extra invalidations in D84959. In practice, I don't think we have use cases needing this. This simplifies the pipeline a bit and prune corner cases when considering invalidations. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D85676	2020-10-04 15:57:13 -07:00
Craig Topper	a02b449bb1	[X86] Sync AESENC/DEC Key Locker builtins with gcc. For the wide builtins, pass a single input and output pointer to the builtins. Emit the GEPs and input loads from CGBuiltin.	2020-10-04 12:09:41 -07:00
Craig Topper	230c57b0bd	[X86] Synchronize the encodekey builtins with gcc. Don't assume void* is 16 byte aligned. We were taking multiple pointer arguments in the builtin. gcc accepts a single void. The cast from void to _m128i* caused the IR generation to assume the pointer was aligned. Instead make the builtin take a single void, emit i8 GEPs to adjust then cast to <2 x i64>* and perform a store with align of 1.	2020-10-04 12:09:35 -07:00
Craig Topper	28595cbbeb	[X86] Synchronize the loadiwkey builtin operand order with gcc version.	2020-10-04 12:09:29 -07:00
Craig Topper	6c6cd5f8a9	[X86] Consolidate wide Key Locker intrinsics into the same header as the other Key Locker intrinsics.	2020-10-04 12:09:21 -07:00
Roman Lebedev	aaae13d0c2	[NFC][clang][codegen] Autogenerate a few ARM SVE tests that are being affected by an upcoming patch	2020-10-04 19:54:09 +03:00
Esme-Yi	e3475f5b91	[PowerPC] Add builtins for xvtdiv(dp\|sp) and xvtsqrt(dp\|sp). Summary: This patch implements the builtins for xvtdivdp, xvtdivsp, xvtsqrtdp, xvtsqrtsp. The instructions correspond to the following builtins: int vec_test_swdiv(vector double v1, vector double v2); int vec_test_swdivs(vector float v1, vector float v2); int vec_test_swsqrt(vector double v1); int vec_test_swsqrts(vector float v1); This patch depends on D88274, which fixes the bug in copying from CRRC to GPRC/G8RC. Reviewed By: steven.zhang, amyk Differential Revision: https://reviews.llvm.org/D88278	2020-10-04 16:24:20 +00:00
Mark de Wever	1113fbf44c	[CodeGen] Improve likelihood branch weights Bruno De Fraine discovered some issues with D85091. The branch weights generated for `logical not` and `ternary conditional` were wrong. The `logical and` and `logical or` differed from the code generated of `__builtin_predict`. Adjusted the generated code for the likelihood to match `__builtin_predict`. The patch is based on Bruno's suggestions. Differential Revision: https://reviews.llvm.org/D88363	2020-10-04 14:24:27 +02:00
Roman Lebedev	03bd5198b6	[OldPM] Pass manager: run SROA after (simple) loop unrolling I have stumbled into this pretty accidentally, when rewriting some spaghetti-like code into something more structured, which involved using some `std::array<>`s. And to my surprise, the `alloca`s remained, causing about `+160%` perf regression. https://llvm-compile-time-tracker.com/compare.php?from=bb6f4d32aac3eecb51909f4facc625219307ee68&to=d563e66f40f9d4d145cb2050e41cb961e2b37785&stat=instructions suggests that this has geomean compile-time cost of `+0.08%`. Note that D68593 / `cecc0d27ad` already did this chage for NewPM, but left OldPM in a pessimized state. This fixes [[ https://bugs.llvm.org/show_bug.cgi?id=40011 \| PR40011 ]], [[ https://bugs.llvm.org/show_bug.cgi?id=42794 \| PR42794 ]] and probably some other reports. Reviewed By: nikic, xbolva00 Differential Revision: https://reviews.llvm.org/D87972	2020-10-04 11:53:50 +03:00
Nico Weber	ba60dc0aa7	Revert "[Driver] Move detectLibcxxIncludePath to ToolChain" This reverts commit `e25bf25920`. Breaks tests on Windows, see comments on https://reviews.llvm.org/D88452	2020-10-03 14:22:53 -04:00
Nathan Lanza	fcb0ab5933	[clang][NFC] Change a mention of `objc_static_protocol` to `non_runtime`	2020-10-03 14:04:14 -04:00
Mark de Wever	0ce6d6b46e	[Sema] List conversion validate character array. The function `TryListConversion` didn't properly validate the following part of the standard: Otherwise, if the parameter type is a character array [... ] and the initializer list has a single element that is an appropriately-typed string literal (8.5.2 [dcl.init.string]), the implicit conversion sequence is the identity conversion. This caused the following call to `f()` to be ambiguous. void f(int(&&)[1]); void f(unsigned(&&)[1]); void g(unsigned i) { f({i}); } This issue only occurs when the initializer list had one element. Differential Revision: https://reviews.llvm.org/D87561	2020-10-03 14:33:28 +02:00
Evandro Menezes	a0a8f83718	[PATCH] Fix typo (NFC)	2020-10-02 21:19:14 -05:00
Petr Hosek	e25bf25920	[Driver] Move detectLibcxxIncludePath to ToolChain This helper method is useful even outside of Gnu toolchains, so move it to ToolChain so it can be reused in other toolchains such as Fuchsia. Differential Revision: https://reviews.llvm.org/D88452	2020-10-02 18:37:20 -07:00
Petr Hosek	9a48411f35	Revert "[Driver] Move detectLibcxxIncludePath to ToolChain" This reverts commit `a594fd28e3` which is failign on some bots.	2020-10-02 16:59:28 -07:00
Yaxun (Sam) Liu	2cd75f738e	Diagnose invalid target ID for AMDGPU toolchain for assembler AMDGPU toolchain currently only diagnose invalid target ID for OpenCL source compilation. Invalid target ID is not diagnosed for assembler. This patch fixes that. Differential Revision: https://reviews.llvm.org/D88377	2020-10-02 19:38:02 -04:00
Yaxun (Sam) Liu	cbd420c5ed	[CUDA][HIP] Fix bound arch for offload action for fat binary Currently CUDA/HIP toolchain uses "unknown" as bound arch for offload action for fat binary. This causes -mcpu or -march with "unknown" added in HIPToolChain::TranslateArgs or CUDAToolChain::TranslateArgs. This causes issue for https://reviews.llvm.org/D88377 since HIP toolchain needs to check -mcpu in HIPToolChain::TranslateArgs. The bound arch of offload action for fat binary is not really used, therefore set it to CudaArch::UNUSED. Differential Revision: https://reviews.llvm.org/D88524	2020-10-02 19:05:51 -04:00
Richard Smith	8fb2a235b0	Don't reject calls to MinGW's unusual _setjmp declaration. We now recognize this function as a builtin despite it having an unexpected number of parameters; make sure we don't enforce that it has only 1 argument for its 2 parameters.	2020-10-02 15:12:15 -07:00
Yaxun (Sam) Liu	dc6a0b0ec7	[HIP] Align device binary To facilitate faster loading of device binaries and share them among processes, HIP runtime favors their alignment being 4096 bytes. HIP runtime can load unaligned device binaries, however, aligning them at 4096 bytes results in faster loading and less shared memory usage. This patch adds an option -bundle-align to clang-offload-bundler which allows bundles to be aligned at specified alignment. By default it is 1, which is NFC compared to existing format. This patch then aligns embedded fat binary and device binary inside fat binary at 4096 bytes. It has been verified this change does not cause significant overall file size increase for typical HIP applications (less than 1%). Differential Revision: https://reviews.llvm.org/D88734	2020-10-02 18:10:44 -04:00
Nathan Lanza	14f6bfcb52	[clang] Implement objc_non_runtime_protocol to remove protocol metadata Summary: Motivated by the new objc_direct attribute, this change adds a new attribute that remotes metadata from Protocols that the programmer knows isn't going to be used at runtime. We simply have the frontend skip generating any protocol metadata entries (e.g. OBJC_CLASS_NAME, _OBJC_$_PROTOCOL_INSTANCE_METHDOS, _OBJC_PROTOCOL, etc) for a protocol marked with `__attribute__((objc_non_runtime_protocol))`. There are a few APIs used to retrieve a protocol at runtime. `@protocol(SomeProtocol)` will now error out of the requested protocol is marked with attribute. `objc_getProtocol` will return `NULL` which is consistent with the behavior of a non-existing protocol. Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D75574	2020-10-02 17:35:50 -04:00
Petr Hosek	a594fd28e3	[Driver] Move detectLibcxxIncludePath to ToolChain This helper method is useful even outside of Gnu toolchains, so move it to ToolChain so it can be reused in other toolchains such as Fuchsia. Differential Revision: https://reviews.llvm.org/D88452	2020-10-02 14:23:48 -07:00
Evgenii Stepanov	66cf68ed46	[docs] Update ControlFlowIntegrity.rst. Expand the list of targets that support cfi-icall. Add ThinLTO everywhere LTO is mentioned. AFAIK all CFI features are supported with ThinLTO. Differential Revision: https://reviews.llvm.org/D87717	2020-10-02 12:01:05 -07:00
Arthur Eubanks	eb55735073	Reland [AlwaysInliner] Update BFI when inlining Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D88324	2020-10-02 10:46:57 -07:00
Yaxun (Sam) Liu	c87c017a4c	Fix failure in test hip-macros.hip requires amdgpu-registered-target.	2020-10-02 10:33:32 -04:00
Yaxun (Sam) Liu	36501b180a	Emit predefined macro for wavefront size for amdgcn Also fix the issue of multiple -m[no-]wavefrontsize64 options to make the last one wins. Differential Revision: https://reviews.llvm.org/D88370	2020-10-02 10:17:21 -04:00
Sjoerd Meijer	8825fec37e	[AArch64] Add CPU Cortex-R82 This adds support for -mcpu=cortex-r82. Some more information about this core can be found here: https://www.arm.com/products/silicon-ip-cpu/cortex-r/cortex-r82 One note about the system register: that is a bit of a refactoring because of small differences between v8.4-A AArch64 and v8-R AArch64. This is based on patches from Mark Murray and Mikhail Maltsev. Differential Revision: https://reviews.llvm.org/D88660	2020-10-02 12:47:23 +01:00
Hubert Tong	35ecc7fe49	[clang][Sema] Fix PR47676: Handle dependent AltiVec C-style cast Fix premature decision in the presence of type-dependent expression operands on whether AltiVec vector initializations from single expressions are "splat" operations. Verify that the instantiation is able to determine the correct cast semantics for both the scalar type and the vector type case. Note that, because the change only affects the single-expression case (and the target type is an AltiVec-style vector type), the replacement of a parenthesized list with a parenthesized expression does not change the semantics of the program in a program-observable manner. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D88526	2020-10-01 15:57:01 -04:00
Petr Hosek	de47e7122f	[CMake][Fuchsia] Don't set WIN32 API, rely on autodetection We prefer autodetection here to avoid persisting this configuration in the generated __config header which is shared across targets. Differential Revision: https://reviews.llvm.org/D88694	2020-10-01 12:35:52 -07:00
Sanjay Patel	149f5b573c	[APFloat] convert SNaN to QNaN in convert() and raise Invalid signal This is an alternate fix (see D87835) for a bug where a NaN constant gets wrongly transformed into Infinity via truncation. In this patch, we uniformly convert any SNaN to QNaN while raising 'invalid op'. But we don't have a way to directly specify a 32-bit SNaN value in LLVM IR, so those are always encoded/decoded by calling convert from/to 64-bit hex. See D88664 for a clang fix needed to allow this change. Differential Revision: https://reviews.llvm.org/D88238	2020-10-01 14:37:38 -04:00
Haojian Wu	c1b209cc61	[Format] Don't treat compound extension headers (foo.proto.h) as foo.cc main-file header. We receive internal bugs about this false positives after D86597. Differential Revision: https://reviews.llvm.org/D88640.	2020-10-01 19:57:57 +02:00
Sanjay Patel	686eb0d8de	[AST] do not error on APFloat invalidOp in default mode If FP exceptions are ignored, we should not error out of compilation just because APFloat indicated an exception. This is required as a preliminary step for D88238 which changes APFloat behavior for signaling NaN convert() to set the opInvalidOp exception status. Currently, there is no way to trigger this error because convert() never sets opInvalidOp. FP binops that set opInvalidOp also create a NaN, so the path to checkFloatingPointResult() is blocked by a different diagnostic: // [expr.pre]p4: // If during the evaluation of an expression, the result is not // mathematically defined [...], the behavior is undefined. // FIXME: C++ rules require us to not conform to IEEE 754 here. if (LHS.isNaN()) { Info.CCEDiag(E, diag::note_constexpr_float_arithmetic) << LHS.isNaN(); return Info.noteUndefinedBehavior(); } return checkFloatingPointResult(Info, E, St); Differential Revision: https://reviews.llvm.org/D88664	2020-10-01 13:46:45 -04:00
Michael Liao	8c36eaf037	[clang][opencl][codegen] Remove the insertion of `correctly-rounded-divide-sqrt-fp-math` fn-attr. - `-cl-fp32-correctly-rounded-divide-sqrt` is already handled in a per-instruction manner by annotating the accuracy required. There's no need to add that fn-attr. So far, there's no in-tree backend handling that attr and that OpenCL specific option. - In case that out-of-tree backends are broken, this change could be reverted if those backends could not be fixed. Differential Revision: https://reviews.llvm.org/D88424	2020-10-01 11:07:39 -04:00
Eduardo Caldas	5011d43108	Migrate Declarators to use the List API After this change all nodes that have a delimited-list are using the `List` API. Implementation details: Let's look at a declaration with multiple declarators: `int a, b;` To generate a declarator list node we need to have the range of declarators: `a, b`: However, the `ClangAST` actually stores them as separate declarations: `int a ;` `int b;` We solve that by appropriately marking the declarators on each separate declaration in the `ClangAST` and then for the final declarator `int b`, shrinking its range to fit to the already marked declarators. Differential Revision: https://reviews.llvm.org/D88403	2020-10-01 13:56:31 +00:00
Ranjeet Singh	e4f50e587f	[ARM] Add missing target for Arm neon test case. This is a follow-up from https://reviews.llvm.org/D61717. Where Richard described the issue with compiling arm_neon.h under -flax-vector-conversions=none. It looks like the example reproducer does actually work but what was missing was a test entry for that target. Differential Revision: https://reviews.llvm.org/D88546	2020-10-01 00:32:33 +01:00
Akira Hatanaka	21cf2e6c26	Handle unknown OSes in DarwinTargetInfo::getExnObjectAlignment rdar://problem/69727650	2020-09-30 16:05:17 -07:00
Hubert Tong	ae4c400e02	[NFC] Fix spacing in clang/test/Driver/aix-ld.c Fix one line with mismatch in indentation after `afc277b0ed`.	2020-09-30 17:01:32 -04:00
Arthur Eubanks	ce5379f0f0	[NPM] Add target specific hook to add passes for New Pass Manager The patch adds a new TargetMachine member "registerPassBuilderCallbacks" for targets to add passes to the pass pipeline using the New Pass Manager (similar to adjustPassManager for the Legacy Pass Manager). Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D88138	2020-09-30 13:29:43 -07:00
Joseph Huber	1b60f63e4f	Revert "[OpenMP] Replace OpenMP RTL Functions With OMPIRBuilder and OMPKinds.def" Failing tests on Arm due to the tests automatically populating incomatible pointer width architectures. Reverting until the tests are updated. Failing tests: OpenMP/distribute_parallel_for_num_threads_codegen.cpp OpenMP/distribute_parallel_for_if_codegen.cpp OpenMP/distribute_parallel_for_simd_if_codegen.cpp OpenMP/distribute_parallel_for_simd_num_threads_codegen.cpp OpenMP/target_teams_distribute_parallel_for_if_codegen.cpp OpenMP/target_teams_distribute_parallel_for_simd_if_codegen.cpp OpenMP/teams_distribute_parallel_for_if_codegen.cpp OpenMP/teams_distribute_parallel_for_simd_if_codegen.cpp This reverts commit `90eaedda9b`.	2020-09-30 15:12:21 -04:00
Sanjay Patel	81921ebc43	[CodeGen] improve coverage for float (32-bit) type of NAN; NFC Goes with D88238	2020-09-30 15:10:25 -04:00
Joseph Huber	bdc85292fb	Revert "[OpenMP] Add Error Handling for Conflicting Pointer Sizes for Target Offload" Failing tests on Arm due to the tests automatically populating incomatible pointer width architectures. Reverting until the tests are updated. Failing tests: OpenMP/distribute_parallel_for_num_threads_codegen.cpp OpenMP/distribute_parallel_for_if_codegen.cpp OpenMP/distribute_parallel_for_simd_if_codegen.cpp OpenMP/distribute_parallel_for_simd_num_threads_codegen.cpp OpenMP/target_teams_distribute_parallel_for_if_codegen.cpp OpenMP/target_teams_distribute_parallel_for_simd_if_codegen.cpp OpenMP/teams_distribute_parallel_for_if_codegen.cpp OpenMP/teams_distribute_parallel_for_simd_if_codegen.cpp This reverts commit `9d2378b591`.	2020-09-30 15:08:22 -04:00
David Tenty	afc277b0ed	[AIX][Clang][Driver] Link libm in c++ mode since that is the normal behaviour of other compilers on the platform. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D88500	2020-09-30 14:02:17 -04:00
Joseph Huber	90eaedda9b	[OpenMP] Replace OpenMP RTL Functions With OMPIRBuilder and OMPKinds.def Summary: Replace the OpenMP Runtime Library functions used in CGOpenMPRuntimeGPU for OpenMP device code generation with ones in OMPKinds.def and use OMPIRBuilder for generating runtime calls. This allows us to consolidate more OpenMP code generation into the OMPIRBuilder. This patch also invalidates specifying target architectures with conflicting pointer sizes. Reviewers: jdoerfert Subscribers: aaron.ballman cfe-commits guansong llvm-commits sstefan1 yaxunl Tags: #OpenMP #Clang #LLVM Differential Revision: https://reviews.llvm.org/D88430	2020-09-30 14:00:01 -04:00
Joseph Huber	9d2378b591	[OpenMP] Add Error Handling for Conflicting Pointer Sizes for Target Offload Summary: This patch adds an error to Clang that detects if OpenMP offloading is used between two architectures with incompatible pointer sizes. This ensures that the data mapping can be done correctly and solves an issue in code generation generating the wrong size pointer. Reviewer: jdoerfert Subscribers: Tags: #OpenMP #Clang Differential Revision:	2020-09-30 13:58:24 -04:00
Richard Smith	892df30a7f	Fix interaction of `constinit` and `weak`. We previously took a shortcut and said that weak variables never have constant initializers (because those initializers are never correct to use outside the variable). We now say that weak variables can have constant initializers, but are never usable in constant expressions.	2020-09-30 10:49:50 -07:00
Alexandre Rames	700e63293e	[Sema] Support Comma operator for fp16 vectors. The current half vector was enforcing an assert expecting "(LHS is half vector) == (RHS is half vector)" for comma. Reviewed By: ahatanak, fhahn Differential Revision: https://reviews.llvm.org/D88265	2020-09-30 18:23:09 +01:00
Sanjay Patel	187686bea3	[CodeGen] add test for NAN creation; NFC This goes with the APFloat change proposed in D88238. This is copied from the MIPS-specific test in builtin-nan-legacy.c to verify that the normal behavior is correct on other targets without the complication of an inverted quiet bit.	2020-09-30 13:22:12 -04:00
Xiangling Liao	3a7487f903	[FE] Use preferred alignment instead of ABI alignment for complete object when applicable On some targets, preferred alignment is larger than ABI alignment in some cases. For example, on AIX we have special power alignment rules which would cause that. Previously, to support those cases, we added a “PreferredAlignment” field in the `RecordLayout` to store the AIX special alignment values in “PreferredAlignment” as the community suggested. However, that patch alone is not enough. There are places in the Clang where `PreferredAlignment` should have been used instead of ABI-specified alignment. This patch is aimed at fixing those spots. Differential Revision: https://reviews.llvm.org/D86790	2020-09-30 10:48:28 -04:00
Xiangling Liao	944691f0b7	[NFC][FE] Replace TypeSize with StorageUnitSize On some targets like AIX, last bitfield size is not always equal to last bitfield type size. Some bitfield like bool will have the same alignment as [unsigned]. So we'd like to use a more general term `StorageUnit` to replace type in this field. Differential Revision: https://reviews.llvm.org/D88260	2020-09-30 10:32:53 -04:00
Xiang1 Zhang	413577a879	[X86] Support Intel Key Locker Key Locker provides a mechanism to encrypt and decrypt data with an AES key without having access to the raw key value by converting AES keys into “handles”. These handles can be used to perform the same encryption and decryption operations as the original AES keys, but they only work on the current system and only until they are revoked. If software revokes Key Locker handles (e.g., on a reboot), then any previous handles can no longer be used. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D88398	2020-09-30 18:08:45 +08:00
Serge Pavlov	4e4f926e83	Remove test AST/const-fpfeatures-diag.c This test is going to be removed because using dynamic rounding mode in initializers is changing. It also causes build failures in some cases, so remove it now.	2020-09-30 11:07:55 +07:00
Brad Smith	6f01c53f26	Remove further OpenBSD/sparc bits	2020-09-29 22:17:12 -04:00
Yaxun (Sam) Liu	d04775e16b	Add remquo, frexp and modf overload functions to HIP header	2020-09-29 20:57:56 -04:00
Amy Huang	5c4fc581d5	[DebugInfo] Add types from constructor homing to the retained types list. Add class types to the retained types list to make sure they don't get dropped if the constructor is optimized out later. Differential Revision: https://reviews.llvm.org/D88522	2020-09-29 17:00:45 -07:00
John McCall	984744a131	Fix a variety of minor issues with ObjC method mangling: - Fix a memory leak accidentally introduced yesterday by using CodeGen's existing mangling context instead of creating a new context afresh. - Move GNU-runtime ObjC method mangling into the AST mangler; this will eventually be necessary to support direct methods there, but is also just the right architecture. - Make the Apple-runtime method mangling work properly when given an interface declaration, fixing a bug (which had solidified into a test) where mangling a category method from the interface could cause it to be mangled as if the category name was a class name. (Category names are namespaced within their class and have no global meaning.) - Fix a code cross-reference in dsymutil. Based on a patch by Ellis Hoag.	2020-09-29 19:51:53 -04:00
Richard Smith	1c604a9f5f	Recognize setjmp and friends as builtins even if jmp_buf is not declared yet. This happens in glibc's headers. It's important that we recognize these functions so that we can mark them as returns_twice. Differential Revision: https://reviews.llvm.org/D88518	2020-09-29 15:53:17 -07:00
Chris Hamilton	155d2d5300	Revert "[Sema] Address-space sensitive check for unbounded arrays (v2)" This reverts commit `d9ee935679`.	2020-09-29 22:46:14 +02:00
Aaron Ballman	538762fef0	Better diagnostics for anonymous bit-fields with attributes or an initializer. The current C++ grammar allows an anonymous bit-field with an attribute, but this is ambiguous (the attribute in that case could appertain to the type instead of the bit-field). The current thinking in the Core Working Group is that it's better to disallow attributes in that position at the grammar level so that the ambiguity resolves in favor of applying to the type. During discussions about the behavior of the attribute, the Core Working Group also felt it was better to disallow anonymous bit-fields from specifying a default member initializer. This implements both sets of related grammar changes.	2020-09-29 16:32:20 -04:00
Aaron Ballman	15fbae8ac3	Use "default member initializer" instead of "in-class initializer" for diagnostics. This changes some diagnostics to use terminology from the standard rather than invented terminology, which improves consistency with other diagnostics as well. There are no functional changes intended other than wording and naming.	2020-09-29 15:04:23 -04:00
Fangrui Song	3681be876f	Add -fprofile-update={atomic,prefer-atomic,single} GCC 7 introduced -fprofile-update={atomic,prefer-atomic} (prefer-atomic is for best efforts (some targets do not support atomics)) to increment counters atomically, which is exactly what we have done with -fprofile-instr-generate (D50867) and -fprofile-arcs (`b5ef137c11`). This patch adds the option to clang to surface the internal options at driver level. GCC 7 also turned on -fprofile-update=prefer-atomic when -pthread is specified, but it has performance regression (https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89307). So we don't follow suit. Differential Revision: https://reviews.llvm.org/D87737	2020-09-29 10:43:23 -07:00
Alex Lorenz	119274748b	NFC, add a missing stdlib include for the use of abort The FatalErrorHandler.cpp file uses 'abort', but doesn't include 'stdlib.h'. This causes a build error when modules are used in clang.	2020-09-29 08:50:51 -07:00
Chris Hamilton	d9ee935679	[Sema] Address-space sensitive check for unbounded arrays (v2) Check applied to unbounded (incomplete) arrays and pointers to spot cases where the computed address is beyond the largest possible addressable extent of the array, based on the address space in which the array is delcared, or which the pointer refers to. Check helps to avoid cases of nonsense pointer math and array indexing which could lead to linker failures or runtime exceptions. Of particular interest when building for embedded systems with small address spaces. This is version 2 of this patch -- version 1 had some testing issues due to a sign error in existing code. That error is corrected and lit test for this chagne is extended to verify the fix. Originally reviewed/accepted by: aaron.ballman Original revision: https://reviews.llvm.org/D86796 Reviewed By: ebevhan Differential Revision: https://reviews.llvm.org/D88174	2020-09-29 16:14:48 +02:00
Alexey Bader	9263931fcc	[SYCL] Assume SYCL device functions are convergent SYCL device compiler (similar to other SPMD compilers) assumes that functions are convergent by default to avoid invalid transformations. This attribute can be removed if compiler can prove that function does not have convergent operations. Reviewed By: Naghasan Differential Revision: https://reviews.llvm.org/D87282	2020-09-29 15:23:50 +03:00
Tres Popp	eb9f7c28e5	Revert "OpaquePtr: Add type to sret attribute" This reverts commit `55c4ff91bd`. Issues were introduced as discussed in https://reviews.llvm.org/D88241 where this change made previous bugs in the linker and BitCodeWriter visible.	2020-09-29 10:31:04 +02:00
Ellis Hoag	98ef7e29b0	This reduces code duplication between CGObjCMac.cpp and Mangle.cpp for generating the mangled name of an Objective-C method. This has no intended functionality change. https://reviews.llvm.org/D88329	2020-09-29 02:26:51 -04:00
Dmitry Antipov	bc868da0e7	[Driver] Filter out <libdir>/gcc and <libdir>/gcc-cross if they do not exists Differential Revision: https://reviews.llvm.org/D87901	2020-09-29 09:18:50 +03:00
Johannes Doerfert	4fc69ab002	Revert "[OpenMP][FIX] Verify compatible types for declare variant calls" This reverts commit `c942095790`. One of the tests broke, revert to investigate.	2020-09-29 00:37:11 -05:00
Johannes Doerfert	c942095790	[OpenMP][FIX] Verify compatible types for declare variant calls Especially for templates we need to check at some point if the base function matches the specialization we might call instead. Before this lead to the replacement of `std::sqrt(int(2))` calls with one that converts the argument to a `std::complex<int>`, clearly not the desired behavior. Reported as PR47655 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D88384	2020-09-28 23:26:21 -05:00
Yaxun (Sam) Liu	5a3023a91c	[HIP] Return non-zero value for invalid target ID This is part of https://reviews.llvm.org/D60620	2020-09-28 23:07:39 -04:00
Yaxun (Sam) Liu	187658b8a6	Recommit "[HIP] Change default --gpu-max-threads-per-block value to 1024" Recommit `04abbb3a78`	2020-09-28 22:43:17 -04:00
Yaxun (Sam) Liu	10eb3bf2d4	Skip -fPIE for AMDGPU and HIP toolchain AMDGPU toolchain does not support -fPIE, therefore skip it if specified by driver. Differential Revision: https://reviews.llvm.org/D88425	2020-09-28 22:03:18 -04:00
Richard Smith	c375635d05	Ensure that we don't compute linkage for an anonymous class too early if it has a member whose name is the same as a builtin. Fixes a regression from the introduction of BuiltinAttr.	2020-09-28 17:22:40 -07:00
Jan Korous	6fd8c69049	[clang] Update warning-wall.c test Follow-up to 1e86d637eb4f: [clang] Selectively ena/disa-ble format-insufficient-args warning	2020-09-28 17:19:51 -07:00
Zahira Ammarguellat	efd04721c9	BuildVectorType with a dependent (array) type is crashing the compiler - Fix for PR-47542 Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D88150	2020-09-28 17:10:32 -07:00
Yonghong Song	54d9f743c8	BPF: move AbstractMemberAccess and PreserveDIType passes to EP_EarlyAsPossible Move abstractMemberAccess and PreserveDIType passes as early as possible, right after clang code generation. Currently, compiler may transform the above code p1 = llvm.bpf.builtin.preserve.struct.access(base, 0, 0); p2 = llvm.bpf.builtin.preserve.struct.access(p1, 1, 2); a = llvm.bpf.builtin.preserve_field_info(p2, EXIST); if (a) { p1 = llvm.bpf.builtin.preserve.struct.access(base, 0, 0); p2 = llvm.bpf.builtin.preserve.struct.access(p1, 1, 2); bpf_probe_read(buf, buf_size, p2); } to p1 = llvm.bpf.builtin.preserve.struct.access(base, 0, 0); p2 = llvm.bpf.builtin.preserve.struct.access(p1, 1, 2); a = llvm.bpf.builtin.preserve_field_info(p2, EXIST); if (a) { bpf_probe_read(buf, buf_size, p2); } and eventually assembly code looks like reloc_exist = 1; reloc_member_offset = 10; //calculate member offset from base p2 = base + reloc_member_offset; if (reloc_exist) { bpf_probe_read(bpf, buf_size, p2); } if during libbpf relocation resolution, reloc_exist is actually resolved to 0 (not exist), reloc_member_offset relocation cannot be resolved and will be patched with illegal instruction. This will cause verifier failure. This patch attempts to address this issue by do chaining analysis and replace chains with special globals right after clang code gen. This will remove the cse possibility described in the above. The IR typically looks like %6 = load @llvm.sk_buff:0:50$0:0:0:2:0 %7 = bitcast %struct.sk_buff* %2 to i8* %8 = getelementptr i8, i8* %7, %6 for a particular address computation relocation. But this transformation has another consequence, code sinking may happen like below: PHI = <possibly different @preserve__access_globals> %7 = bitcast %struct.sk_buff %2 to i8* %8 = getelementptr i8, i8* %7, %6 For such cases, we will not able to generate relocations since multiple relocations are merged into one. This patch introduced a passthrough builtin to prevent such optimization. Looks like inline assembly has more impact for optimizaiton, e.g., inlining. Using passthrough has less impact on optimizations. A new IR pass is introduced at the beginning of target-dependent IR optimization, which does: - report fatal error if any reloc global in PHI nodes - remove all bpf passthrough builtin functions Changes for existing CORE tests: - for clang tests, add "-Xclang -disable-llvm-passes" flags to avoid builtin->reloc_global transformation so the test is still able to check correctness for clang generated IR. - for llvm CodeGen/BPF tests, add "opt -O2 <ir_file> \| llvm-dis" command before "llc" command since "opt" is needed to call newly-placed builtin->reloc_global transformation. Add target triple in the IR file since "opt" requires it. - Since target triple is added in IR file, if a test may produce different results for different endianness, two tests will be created, one for bpfeb and another for bpfel, e.g., some tests for relocation of lshift/rshift of bitfields. - field-reloc-bitfield-1.ll has different relocations compared to old codes. This is because for the structure in the test, new code returns struct layout alignment 4 while old code is 8. Align 8 is more precise and permits double load. With align 4, the new mechanism uses 4-byte load, so generating different relocations. - test intrinsic-transforms.ll is removed. This is used to test cse on intrinsics so we do not lose metadata. Now metadata is attached to global and not instruction, it won't get lost with cse. Differential Revision: https://reviews.llvm.org/D87153	2020-09-28 16:56:22 -07:00
David Tenty	ee80615b5c	[clang][driver][AIX] Set compiler-rt as default rtlib Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D88182	2020-09-28 19:45:43 -04:00
Jan Korous	1e86d637eb	[clang] Selectively ena/disa-ble format-insufficient-args warning Differential Revision: https://reviews.llvm.org/D87176	2020-09-28 16:24:50 -07:00
Aaron Ballman	e7549dafcd	Fix a think-o with the numerical suffixes in the docs for init_priority.	2020-09-28 16:52:58 -04:00
Craig Topper	288c5776c9	[X86] Use inlineasm flag output for the _bittest* intrinsics. Instead of expliciting emitting a setc in the inline asm instructions, we can use flag output. This allows the backend to use the flag directly if it is needed by a branch. Previously we needed a test instruction to convert the register back to a flag. If the flag can't be used directly, the backend will emit a setcc. Differential Revision: https://reviews.llvm.org/D87888	2020-09-28 13:33:22 -07:00
Baptiste Saleil	0156914275	[PowerPC] Legalize v256i1 and v512i1 and implement load and store of these types This patch legalizes the v256i1 and v512i1 types that will be used for MMA. It implements loads and stores of these types. v256i1 is a pair of VSX registers, so for this type, we load/store the two underlying registers. v512i1 is used for MMA accumulators. So in addition to loading and storing the 4 associated VSX registers, we generate instructions to prime (copy the VSX registers to the accumulator) after loading and unprime (copy the accumulator back to the VSX registers) before storing. This patch also adds the UACC register class that is necessary to implement the loads and stores. This class represents accumulator in their unprimed form and allow the distinction between primed and unprimed accumulators to avoid invalid copies of the VSX registers associated with primed accumulators. Differential Revision: https://reviews.llvm.org/D84968	2020-09-28 14:39:37 -05:00
Paweł Bylica	0c82fa677f	[python][tests] Fix string comparison with "is"	2020-09-28 21:11:50 +02:00
Vedant Kumar	06bc685fa2	[ubsan] nullability-arg: Fix crash on C++ member pointers Extend -fsanitize=nullability-arg to handle call sites which accept C++ member pointers. rdar://62476022 Differential Revision: https://reviews.llvm.org/D88336	2020-09-28 09:41:18 -07:00
Michael Liao	5dbf80cad9	[clang][codegen] Annotate `correctly-rounded-divide-sqrt-fp-math` fn-attr for OpenCL only. - `-cl-fp32-correctly-rounded-divide-sqrt` is an OpenCL-specific option and `correctly-rounded-divide-sqrt-fp-math` should be added for OpenCL at most. Differential revision: https://reviews.llvm.org/D88303	2020-09-28 11:40:32 -04:00
Haojian Wu	bf890dcb0f	[clang] Don't emit "no member" diagnostic if the lookup fails on an invalid record decl. The "no member" diagnostic is likely bogus. Reviewed By: sammccall, #libc Differential Revision: https://reviews.llvm.org/D86765	2020-09-28 15:10:00 +02:00
David Sherwood	bafdd11326	[SVE] Replace / operator in TypeSize/ElementCount with divideCoefficientBy After some recent upstream discussion we decided that it was best to avoid having the / operator for both ElementCount and TypeSize, since this could give the impression that these classes can be used in the same way as basic integer integer types. However, division for scalable types is a bit odd because we are only dividing the minimum quantity by a value, as opposed to something like: (MinSize * Vscale) / SomeValue This is why when performing division it's important the caller first establishes whether the operation makes sense, perhaps by calling isKnownMultipleOf() prior to division. The caller must now explictly call divideCoefficientBy() on the class to perform the operation. Differential Revision: https://reviews.llvm.org/D87700	2020-09-28 08:03:00 +01:00
Richard Smith	df2a1f2aab	Add profiling support for APValues. For C++20 P0732R2; unused so far. Will be used and tested by a follow-on commit.	2020-09-27 20:05:39 -07:00
Richard Smith	9dcd96f728	Canonicalize declaration pointers when forming APValues. References to different declarations of the same entity aren't different values, so shouldn't have different representations. Recommit of `e6393ee813` with fixed handling for weak declarations. We now look for attributes on the most recent declaration when determining whether a declaration is weak. (Second recommit with further fixes for mishandling of weak declarations. Our behavior here is fundamentally unsound -- see PR47663 -- but this approach attempts to not make things worse.)	2020-09-27 19:05:26 -07:00
Aaron Ballman	de55ebe3bb	Typo fix; NFC	2020-09-27 08:30:41 -04:00
Aaron Puchert	485501899d	Fix sphinx warnings in AttributeReference, NFC The previous attempt in `d34c8c70` didn't help (the problem was missing indentation), and another issue was introduced by `a51d51a0`.	2020-09-27 00:52:36 +02:00
Russell Yanofsky	f702a6fa7c	Thread safety analysis: Improve documentation for ASSERT_CAPABILITY Previous description didn't actually state the effect the attribute has on thread safety analysis (causing analysis to assume the capability is held). Previous description was also ambiguous about (or slightly overstated) the noreturn assumption made by thread safety analysis, implying the assumption had to be true about the function's behavior in general, and not just its behavior in places where it's used. Stating the assumption specifically should avoid a perceived need to disable thread safety analysis in places where only asserting that a specific capability is held would be better. Reviewed By: aaronpuchert, vasild Differential Revision: https://reviews.llvm.org/D87629	2020-09-26 22:16:50 +02:00
Florian Hahn	915310bf14	Revert "[DSE] Switch to MemorySSA-backed DSE by default." There appears to be a mis-compile with MemorySSA-backed DSE in combination with llvm.lifetime.end. It currently appears like DSE is doing the right thing and the llvm.lifetime.end markers are incorrect. The reverted patch uncovers the mis-compile. This patch temporarily switches back to the legacy DSE implementation, while we investigate. This reverts commit `9d172c8e9c`.	2020-09-26 18:35:27 +01:00
Serge Pavlov	f91b9c0f98	Run test on particular target only The test `AST/const-fpfeatures-diag.c` requires setting strict FP semantics, so it fails on targets where support of such semantic is limited.	2020-09-26 20:26:34 +07:00
Serge Pavlov	6314f412a8	[FPEnv] Evaluate constant expressions under non-default rounding modes The change implements evaluation of constant floating point expressions under non-default rounding modes. The main objective was to support evaluation of global variable initializers, where constant rounding mode may be specified by `#pragma STDC FENV_ROUND`. Differential Revision: https://reviews.llvm.org/D87822	2020-09-26 17:59:39 +07:00
Dmitry Antipov	2ca0ea15e5	[Driver] Fix formatting as suggested by clang-format (NFC)	2020-09-26 08:52:51 +03:00
Dmitry Antipov	96318f64a7	[Driver] Perform Linux distribution detection only once Differential Revision: https://reviews.llvm.org/D87187	2020-09-26 08:44:08 +03:00
Shilei Tian	ebb1092a28	[Clang][OpenMP] Added support for nowait target in CodeGen via regular task Previously for nowait target, CG emitted a function call to `__tgt_target_nowait`, etc. However, in OpenMP RTL, these functions just directly call the no-nowait version, which means nowait is not working as expected. OpenMP specification says a target is acutally a target task, which is an untied and detachable task. It is natural to go to the direction that generates a task for a nowait target. However, OpenMP task has a problem that it must be within to a parallel region; otherwise the task will be executed immediately. As a result, if we directly wrap to a regular task, the `target nowait` outside of a parallel region is still a synchronous version. In D77609, I added the support for unshackled task in OpenMP RTL. Basically, unshackled task is a task that is not bound to any parallel region. So all nowait target will be tranformed into an unshackled task. In order to distinguish from regular task, a new flag bit is set for unshackled task. This flag will be used by RTL for later process. Since all target tasks are allocated via `__kmpc_omp_target_task_alloc`, and in current `libomptarget`, `__kmpc_omp_target_task_alloc` just calls `__kmpc_omp_task_alloc`. Therefore, we can modify the flag in `__kmpc_omp_target_task_alloc` so that we don't need to modify the FE too much. If users choose to opt out the feature, they just need to use a RTL w/o support of unshackled threads. As a result, in this patch, the `target nowait` region is simply wrapped into a regular task. Later once we have RTL support for unshackled tasks, the wrapped tasks can be executed by unshackled threads w/o changes in the FE. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D78075	2020-09-25 22:10:36 -04:00
Evandro Menezes	a000580a89	[RISCV] Update driver tests Add the RISC-V Bullet core to the driver tests.	2020-09-25 18:36:53 -05:00
Saleem Abdulrasool	58cdbf518b	Sema: add support for `__attribute__((__swift_private__))` This attribute allows declarations to be restricted to the framework itself, enabling Swift to remove the declarations when importing libraries. This is useful in the case that the functions can be implemented in a more natural way for Swift. This is based on the work of the original changes in `8afaf3aad2` Differential Revision: https://reviews.llvm.org/D87720 Reviewed By: Aaron Ballman	2020-09-25 22:33:53 +00:00
Matt Arsenault	55c4ff91bd	OpaquePtr: Add type to sret attribute Make the corresponding change that was made for byval in `b7141207a4`. Like byval, this requires a bulk update of the test IR tests to include the type before this can be mandatory.	2020-09-25 14:07:30 -04:00
Saleem Abdulrasool	76eb163259	Sema: remove unnecessary parameter for SwiftName handling (NFCI) This code never actually did anything in the implementation. `mergeDeclAttribute` is declared as `static`, and referenced exactly once in the file: from `Sema::mergeDeclAttributes`. `Sema::mergeDeclAttributes` sets `LocalAMK` to `AMK_None`. If the attribute is `DeprecatedAttr`, `UnavailableAttr`, or `AvailabilityAttr` then the `LocalAMK` is updated. However, because we are dealing with a `SwiftNameDeclAttr` here, `LocalAMK` remains `AMK_None`. This is then passed to the function which will as a result pass the value of `AMK_None == AMK_Override` aka `false`. Simply propagate the value through and erase the dead codepath. Thanks to Aaron Ballman for flagging the use of the availability merge kind here leading to this simplification! Differential Revision: https://reviews.llvm.org/D88263 Reviewed By: Aaron Ballman	2020-09-25 17:01:06 +00:00
Vedant Kumar	62c372770d	[profile] Add %t LLVM_PROFILE_FILE option to substitute $TMPDIR Add support for expanding the %t filename specifier in LLVM_PROFILE_FILE to the TMPDIR environment variable. This is supported on all platforms. On Darwin, TMPDIR is used to specify a temporary application-specific scratch directory. When testing apps on remote devices, it can be challenging for the host device to determine the correct TMPDIR, so it's helpful to have the runtime do this work. rdar://68524185 Differential Revision: https://reviews.llvm.org/D87332	2020-09-25 09:39:40 -07:00
Aaron Ballman	a51d51a0d4	Fix some of the more egregious 80-col and whitespace issues; NFC	2020-09-25 10:37:38 -04:00
Aaron Ballman	85cea77ecb	Typo fix; NFC	2020-09-25 10:26:29 -04:00
Benjamin Kramer	6a1bca8798	[Analyzer] Fix unused variable warning in Release builds clang/lib/StaticAnalyzer/Core/ExprEngineCXX.cpp:377:19: warning: unused variable 'Init'	2020-09-25 14:09:43 +02:00
Manuel Klimek	e336b74c99	[clang-format] Add a MacroExpander. Summary: The MacroExpander allows to expand simple (non-resursive) macro definitions from a macro identifier token and macro arguments. It annotates the tokens with a newly introduced MacroContext that keeps track of the role a token played in expanding the macro in order to be able to reconstruct the macro expansion from an expanded (formatted) token stream. Made Token explicitly copy-able to enable copying tokens from the parsed macro definition. Reviewers: sammccall Subscribers: mgorny, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D83296	2020-09-25 14:08:13 +02:00
Chris Bowler	f330d9f163	[PPC] [AIX] Implement calling convention IR for C99 complex types on AIX Add AIX calling convention logic to Clang for C99 complex types on AIX Differential Revision: https://reviews.llvm.org/D88130	2020-09-25 07:43:31 -04:00

... 2 3 4 5 6 ...

86255 Commits