llvm-project

Commit Graph

Author	SHA1	Message	Date
Richard Smith	7e801ca0ef	Treat constant contexts as being in the default rounding mode. This addresses a regression where pretty much all C++ compilations using -frounding-math now fail, due to rounding being performed in constexpr function definitions in the standard library. This follows the "manifestly constant evaluated" approach described in https://reviews.llvm.org/D87528#2270676 -- evaluations that are required to succeed at compile time are permitted even in regions with dynamic rounding modes, as are (unfortunately) the evaluation of the initializers of local variables of const integral types. Differential Revision: https://reviews.llvm.org/D89360	2020-10-16 13:26:15 -07:00
Richard Smith	48c70c1664	Extend memset-to-zero optimization to C++11 aggregate functional casts Aggr{...}. We previously missed these cases due to not stepping over the additional AST nodes representing their syntactic form.	2020-10-16 13:21:08 -07:00
Scott Linder	c4d10e7e9b	[AMDGPU][HIP] Switch default DWARF version to 5 Another attempt at this, see D59008 for previous attempt. Reviewed By: kzhuravl, t-tye Differential Revision: https://reviews.llvm.org/D89484	2020-10-16 17:53:27 +00:00
Matt Arsenault	0a7cd99a70	Reapply "OpaquePtr: Add type to sret attribute" This reverts commit `eb9f7c28e5`. Previously this was incorrectly handling linking of the contained type, so this merges the fixes from D88973.	2020-10-16 11:05:02 -04:00
Florian Hahn	51ff04567b	Recommit "[DSE] Switch to MemorySSA-backed DSE by default." After investigation by @asbirlea, the issue that caused the revert appears to be an issue in the original source, rather than a problem with the compiler. This patch enables MemorySSA DSE again. This reverts commit `915310bf14`.	2020-10-16 09:02:53 +01:00
Konstantin Schwarz	6030a07516	Fix hidden-redecls.m test for some environments This test was failing in our CI environment, because Jenkins mounts the workspaces into Docker containers using their full path, i.e. /home/jenkins/workspaces/llvm-build. We've seen permission denied errors because /home/jenkins is mounted with root permissions and the default cache directory under Linux is $HOME/.cache. The fix is to explicitly provide the -fmodules-cache-path, which the other tests already seem to provide. Reviewed By: akyrtzi Differential Revision: https://reviews.llvm.org/D89453	2020-10-16 09:51:13 +02:00
Kito Cheng	cfa7094e49	[RISCV] Add -mtune support - The goal of this patch is improve option compatible with RISCV-V GCC, -mcpu support on GCC side will sent patch in next few days. - -mtune only affect the pipeline model and non-arch/extension related target feature, e.g. instruction fusion; in td file it called TuneFeatures, which is introduced by X86 back-end[1]. - -mtune accept all valid option for -mcpu and extra alias processor option, e.g. `generic`, `rocket` and `sifive-7-series`, the purpose is option compatible with RISCV-V GCC. - Processor alias for -mtune will resolve according the current target arch, rv32 or rv64, e.g. `rocket` will resolve to `rocket-rv32` or `rocket-rv64`. - Interaction between -mcpu and -mtune: * -mtune has higher priority than -mcpu for pipeline model and TuneFeatures. [1] https://reviews.llvm.org/D85165 Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D89025	2020-10-16 13:55:08 +08:00
Richard Smith	fc031d29be	Switch the default of VerifyIntegerConstantExpression from constant folding to not constant folding. Constant folding of ICEs is done as a GCC compatibility measure, but new code was picking it up, presumably by accident, due to the bad default. While here, also switch the flag from a bool to an enum to make it more obvious what it means at call sites. This highlighted a couple of places where our behavior is different between C++11 and C++14 due to switching from checking for an ICE to checking for a converted constant expression (where there is no 'fold' codepath).	2020-10-15 16:58:47 -07:00
Vedant Kumar	273c299d5d	[PM/CC1] Add -f[no-]split-cold-code CC1 option to toggle splitting This patch adds -f[no-]split-cold-code CC1 options to clang. This allows the splitting pass to be toggled on/off. The current method of passing `-mllvm -hot-cold-split=true` to clang isn't ideal as it may not compose correctly (say, with `-O0` or `-Oz`). To implement the -fsplit-cold-code option, an attribute is applied to functions to indicate that they may be considered for splitting. This removes some complexity from the old/new PM pipeline builders, and behaves as expected when LTO is enabled. Co-authored by: Saleem Abdulrasool <compnerd@compnerd.org> Differential Revision: https://reviews.llvm.org/D57265 Reviewed By: Aditya Kumar, Vedant Kumar Reviewers: Teresa Johnson, Aditya Kumar, Fedor Sergeev, Philip Pfaffe, Vedant Kumar	2020-10-15 23:13:33 +00:00
Fangrui Song	5a338599fb	[CGBuiltin] Respect asm labels and redefine_extname for builtins with specialized emitting rL131311 added `asm()` support for builtin functions, but `asm()` for builtins with specialized emitting (e.g. memcpy, various math functions) still do not work. This patch makes these functions work for `asm()` and `#pragma redefine_extname`. glibc uses `asm()` to redirect internal libc function calls to hidden aliases. Limitation: such a function is a builtin in clang, but will not be recognized as a libcall in optimization passes because Clang does not annotate the renamed function as a libcall. In GCC -O1 or above, `abs` can be optimized out but we can't. Additionally, we cannot redirect `__builtin_sin` to `real_sin` in the following example: double sin(double x) asm("real_sin"); double f(double d) { return __builtin_sin(d); } --- According to @rsmith, the following three statements cannot be simultaneously true: (1) The frontend function foo has known, builtin semantics X. (2) The symbol foo has known, builtin semantics X. (3) It's not correct to lower a call to the frontend function foo to the symbol foo. People do want (1) (if it is profitable to expand a memcpy, do it). This also means that people do not want to add -fno-builtin-memcpy. People do want (3): that is why they use asm("__GI_memcpy") in the first place. So unfortunately we make a compromise by not refuting (2) (see the limitation above). For most libcalls, there is a small loss because compilers don't synthesize them. For the few glibc cares about, it uses `asm("memcpy = __GI_memcpy");` to make the assembly level redirection. (Changing function names (e.g. `__memcpy`) is a hit to ergonomics which is not acceptable). Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D88712	2020-10-15 15:14:38 -07:00
Reid Kleckner	5fbab4025e	[MS] Apply `inreg` to AArch64 sret parms on instance methods The documentation rules indicate that instance methods should return large, trivially copyable aggregates via X1/X0 and not X8 as is normally done when returning such structs from free functions: https://docs.microsoft.com/en-us/cpp/build/arm64-windows-abi-conventions?view=vs-2019#return-values Fixes PR47836, a bug in the initial implementation of these rules. I tried to simplify the logic a bit as well while I'm here. Differential Revision: https://reviews.llvm.org/D89362	2020-10-15 14:54:42 -07:00
Yaxun (Sam) Liu	e384e94fbe	Revert "[HIP] Change default --gpu-max-threads-per-block value to 1024" This reverts commit `187658b8a6` due to AMDGPU backend issues.	2020-10-15 17:25:55 -04:00
Leonard Chan	79829a4704	Revert "[clang] Add -fc++-abi= flag for specifying which C++ ABI to use" This reverts commits `683b308c07` and `8487bfd4e9`. We will go for a more restricted approach that does not give freedom to everyone to change ABIs on whichever platform. See the discussion on https://reviews.llvm.org/D85802.	2020-10-15 14:24:38 -07:00
Thomas Lively	1992e30c2d	[WebAssembly] Prototype i8x16.popcnt As proposed at https://github.com/WebAssembly/simd/pull/379. Use a target builtin and intrinsic rather than normal codegen patterns to make the instruction opt-in until it is merged to the proposal and stabilized in engines. Differential Revision: https://reviews.llvm.org/D89446	2020-10-15 21:18:22 +00:00
Richard Smith	68f116aa23	PR47864: Fix assertion in pointer-to-member emission if there are multiple declarations of the same base class.	2020-10-15 13:51:51 -07:00
Stanislav Mekhanoshin	d1beb95d12	[AMDGPU] gfx1032 target Differential Revision: https://reviews.llvm.org/D89487	2020-10-15 12:41:18 -07:00
Thomas Lively	3f738d1f5e	Reland "[WebAssembly] v128.load{8,16,32,64}_lane instructions" This reverts commit `7c8385a352` with a typing fix to an instruction selection pattern.	2020-10-15 19:32:34 +00:00
Erik Pilkington	351317167e	[SemaObjC] Fix composite pointer type calculation for `void*` and pointer to lifetime qualified ObjC pointer type Fixes a regression introduced in `9a6f4d451c`. rdar://70101809 Differential revision: https://reviews.llvm.org/D89475	2020-10-15 15:21:01 -04:00
Konstantin Zhuravlyov	67f189e93c	Make sure both cc1 and cc1as process -m[no-]code-object-v3 Differential Revision: https://reviews.llvm.org/D89478	2020-10-15 14:03:26 -04:00
Thomas Lively	7c8385a352	Revert "[WebAssembly] v128.load{8,16,32,64}_lane instructions" This reverts commit `7c6bfd90ab`.	2020-10-15 15:49:36 +00:00
Thomas Lively	7c6bfd90ab	[WebAssembly] v128.load{8,16,32,64}_lane instructions Prototype the newly proposed load_lane instructions, as specified in https://github.com/WebAssembly/simd/pull/350. Since these instructions are not available to origin trial users on Chrome stable, make them opt-in by only selecting them from intrinsics rather than normal ISel patterns. Since we only need rough prototypes to measure performance right now, this commit does not implement all the load and store patterns that would be necessary to make full use of the offset immediate. However, the full suite of offset tests is included to make it easy to track improvements in the future. Since these are the first instructions to have a memarg immediate as well as an additional immediate, the disassembler needed some additional hacks to be able to parse them correctly. Making that code more principled is left as future work. Differential Revision: https://reviews.llvm.org/D89366	2020-10-15 15:33:10 +00:00
Simon Pilgrim	d7fa9030d4	[CodeGen][X86] Emit fshl/fshr ir intrinsics for shiftleft128/shiftright128 ms intrinsics Now that funnel shift handling is pretty good, we can use the intrinsics directly and avoid a lot of zext/trunc issues. https://godbolt.org/z/YqhnnM Differential Revision: https://reviews.llvm.org/D89405	2020-10-15 10:22:41 +01:00
Richard Smith	9dbb0886ea	Perform lvalue conversions on the left of a pseudo-destructor call 'p->~T()'. Previously we failed to convert 'p' from array/function to pointer type, and to represent the load of 'p' in the AST. The latter causes problems for constant evaluation.	2020-10-14 22:09:01 -07:00
Richard Smith	f7f2e4261a	PR47805: Use a single object for a function parameter in the caller and callee in constant evaluation. We previously made a deep copy of function parameters of class type when passing them, resulting in the destructor for the parameter applying to the original argument value, ignoring any modifications made in the function body. This also meant that the 'this' pointer of the function parameter could be observed changing between the caller and the callee. This change completely reimplements how we model function parameters during constant evaluation. We now model them roughly as if they were variables living in the caller, albeit with an artificially reduced scope that covers only the duration of the function call, instead of modeling them as temporaries in the caller that we partially "reparent" into the callee at the point of the call. This brings some minor diagnostic improvements, as well as significantly reduced stack usage during constant evaluation.	2020-10-14 17:43:51 -07:00
Dave Lee	4cb4db11ee	Revert "[ASTImporter] Fix crash caused by unset AttributeSpellingListIndex" This broke the GreenDragon build, due to the following error while running TestImportBuiltinFileID: ``` Ignored/unknown shouldn't get here UNREACHABLE executed at tools/clang/include/clang/Sema/AttrSpellingListIndex.inc:13! ``` See http://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/24213/ This reverts commit `73c6beb2f7`. This reverts https://reviews.llvm.org/D89318	2020-10-14 17:21:56 -07:00
Leonard Chan	8487bfd4e9	[clang][NFC] Change diagnostic to start with lowercase letter	2020-10-14 15:48:29 -07:00
Leonard Chan	683b308c07	[clang] Add -fc++-abi= flag for specifying which C++ ABI to use This implements the flag proposed in RFC http://lists.llvm.org/pipermail/cfe-dev/2020-August/066437.html. The goal is to add a way to override the default target C++ ABI through a compiler flag. This makes it easier to test and transition between different C++ ABIs through compile flags rather than build flags. In this patch: - Store `-fc++-abi=` in a LangOpt. This isn't stored in a CodeGenOpt because there are instances outside of codegen where Clang needs to know what the ABI is (particularly through ASTContext::createCXXABI), and we should be able to override the target default if the flag is provided at that point. - Expose the existing ABIs in TargetCXXABI as values that can be passed through this flag. - Create a .def file for these ABIs to make it easier to check flag values. - Add an error for diagnosing bad ABI flag values. Differential Revision: https://reviews.llvm.org/D85802	2020-10-14 12:31:21 -07:00
Christopher Di Bella	18432bea76	[Driver]: fix compiler-rt path when printing libgcc for baremetal clang --target arm-none-eabi --print-libgcc-file-name --rtlib=compiler-rt used to print `/path/to/lib/clang/version/lib/libclang_rt.builtins-arm.a` but should print `/path/to/lib/clang/version/lib/baremetal/libclang_rt.builtins-arm.a`. Similarly, --target armv7m-none-eabi should print libclang_rt.builtins-armv7m.a This matches the compiler-rt file name used at link time in the baremetal driver. Reviewed By: manojgupta Differential Revision: https://reviews.llvm.org/D89327	2020-10-14 10:29:35 -07:00
Konstantin Zhuravlyov	3fdf3b1539	AMDGPU: Update AMDHSA code object version handling Differential Revision: https://reviews.llvm.org/D89076	2020-10-14 13:04:27 -04:00
Simon Pilgrim	b967b9a711	[CodeGen] Move x86 specific ms intrinsic tests into x86 target subfolder. NFCI.	2020-10-14 17:37:26 +01:00
jasonliu	f85bcc21dd	[AIX] Turn -fdata-sections on by default in Clang Summary: This patch does the following: 1. Make InitTargetOptionsFromCodeGenFlags() accepts Triple as a parameter, because some options' default value is triple dependant. 2. DataSections is turned on by default on AIX for llc. 3. Test cases change accordingly because of the default behaviour change. 4. Clang Driver passes in -fdata-sections by default on AIX. Reviewed By: MaskRay, DiggerLin Differential Revision: https://reviews.llvm.org/D88737	2020-10-14 15:58:31 +00:00
Gabor Marton	73c6beb2f7	[ASTImporter] Fix crash caused by unset AttributeSpellingListIndex During the import of attributes we forgot to set the spelling list index. This caused a segfault when we wanted to traverse the AST (e.g. by the dump() method). Differential Revision: https://reviews.llvm.org/D89318	2020-10-14 14:10:08 +02:00
Gabor Marton	dd965711c9	[ASTImporter] Fix crash caused by unimported type of FromatAttr During the import of FormatAttrs we forgot to import the type (e.g `__scanf__`) of the attribute. This caused a segfault when we wanted to traverse the AST (e.g. by the dump() method). Differential Revision: https://reviews.llvm.org/D89319	2020-10-14 13:54:48 +02:00
Jonas Paulsson	625fa47617	Revert "[clang] Improve handling of physical registers in inline assembly operands." This reverts commit `c78da03778`. Temporarily reverted due to https://bugs.llvm.org/show_bug.cgi?id=47837.	2020-10-14 08:42:51 +02:00
Liu, Chen3	bd05afcb3f	[X86][NFC] Fix RUN line bug in the testcase Testcase added in D78699 doesn't work because the wrong RUN line in the testcase. Differential Revision: https://reviews.llvm.org/D89361	2020-10-14 12:40:34 +08:00
Richard Smith	69f7c006ff	Revert "PR47805: Use a single object for a function parameter in the caller and" Breaks a clangd unit test. This reverts commit `8f8b9f2cca`.	2020-10-13 19:32:03 -07:00
Richard Smith	8f8b9f2cca	PR47805: Use a single object for a function parameter in the caller and callee in constant evaluation. We previously made a deep copy of function parameters of class type when passing them, resulting in the destructor for the parameter applying to the original argument value, ignoring any modifications made in the function body. This also meant that the 'this' pointer of the function parameter could be observed changing between the caller and the callee. This change completely reimplements how we model function parameters during constant evaluation. We now model them roughly as if they were variables living in the caller, albeit with an artificially reduced scope that covers only the duration of the function call, instead of modeling them as temporaries in the caller that we partially "reparent" into the callee at the point of the call. This brings some minor diagnostic improvements, as well as significantly reduced stack usage during constant evaluation.	2020-10-13 18:50:46 -07:00
Erik Pilkington	498c7fa48a	[SemaObjC] Fix a crash on an invalid ternary with ARC pointers FindCompositeObjCPointerType nulls out the subexpressions on error, so bail out instead of trying to deref them.	2020-10-13 21:20:20 -04:00
Richard Smith	ab870f3030	Revert "PR47805: Use a single object for a function parameter in the caller and" The buildbots are displeased. This reverts commit `8d03a972ce`.	2020-10-13 15:59:00 -07:00
Richard Smith	8d03a972ce	PR47805: Use a single object for a function parameter in the caller and callee in constant evaluation. We previously made a deep copy of function parameters of class type when passing them, resulting in the destructor for the parameter applying to the original argument value, ignoring any modifications made in the function body. This also meant that the 'this' pointer of the function parameter could be observed changing between the caller and the callee. This change completely reimplements how we model function parameters during constant evaluation. We now model them roughly as if they were variables living in the caller, albeit with an artificially reduced scope that covers only the duration of the function call, instead of modeling them as temporaries in the caller that we partially "reparent" into the callee at the point of the call. This brings some minor diagnostic improvements, as well as significantly reduced stack usage during constant evaluation.	2020-10-13 15:45:04 -07:00
Xiangling Liao	4c10d6508f	[AIX] Support two itanium alignment LIT testcases for AIX using regex AIX has different layout dumping format from other itanium ABIs. And for these two cases, use regex to match AIX format. Differential Revision: https://reviews.llvm.org/D89064	2020-10-13 16:47:01 -04:00
Konstantin Zhuravlyov	e2eaa91451	AMDGPU: Remove -mamdgpu-debugger-abi option It has been unsupported for few years now. Differential Revision: https://reviews.llvm.org/D89125	2020-10-13 12:20:28 -04:00
Jonas Paulsson	c78da03778	[clang] Improve handling of physical registers in inline assembly operands. Change EmitAsmStmt() to - Not tie physregs with the "+r" constraint, but instead add the hard register as an input constraint. This makes "+r" and "=r":"r" look the same in the output. Background: Macro intensive user code may contain inline assembly statements with multiple operands constrained to the same physreg. Such a case (with the operand constraints "+r" : "r") currently triggers the TwoAddressInstructionPass assertion against any extra use of a tied register. Furthermore, TwoAddress will insert a COPY to that physreg even though isel has already done so (for the non-tied use), which may lead to a second redundant instruction currently. A simple fix for this is to not emit tied physreg uses in the first place for the "+r" constraint, which is what this patch does. - Give an error on multiple outputs to the same physical register. This should be reported and this is also what GCC does. Review: Ulrich Weigand, Aaron Ballman, Jennifer Yu, Craig Topper Differential Revision: https://reviews.llvm.org/D87279	2020-10-13 15:09:52 +02:00
Bevin Hansson	9fa7f48459	[Fixed Point] Add fixed-point to floating point cast types and consteval. Reviewed By: leonardchan Differential Revision: https://reviews.llvm.org/D86631	2020-10-13 13:26:56 +02:00
Ties Stuij	208987844f	[ARM] Follow AACPS standard for volatile bit-fields access width This patch resumes the work of D16586. According to the AAPCS, volatile bit-fields should be accessed using containers of the widht of their declarative type. In such case: ``` struct S1 { short a : 1; } ``` should be accessed using load and stores of the width (sizeof(short)), where now the compiler does only load the minimum required width (char in this case). However, as discussed in D16586, that could overwrite non-volatile bit-fields, which conflicted with C and C++ object models by creating data race conditions that are not part of the bit-field, e.g. ``` struct S2 { short a; int b : 16; } ``` Accessing `S2.b` would also access `S2.a`. The AAPCS Release 2020Q2 (https://documentation-service.arm.com/static/5efb7fbedbdee951c1ccf186?token=) section 8.1 Data Types, page 36, "Volatile bit-fields - preserving number and width of container accesses" has been updated to avoid conflict with the C++ Memory Model. Now it reads in the note: ``` This ABI does not place any restrictions on the access widths of bit-fields where the container overlaps with a non-bit-field member or where the container overlaps with any zero length bit-field placed between two other bit-fields. This is because the C/C++ memory model defines these as being separate memory locations, which can be accessed by two threads simultaneously. For this reason, compilers must be permitted to use a narrower memory access width (including splitting the access into multiple instructions) to avoid writing to a different memory location. For example, in struct S { int a:24; char b; }; a write to a must not also write to the location occupied by b, this requires at least two memory accesses in all current Arm architectures. In the same way, in struct S { int a:24; int:0; int b:8; };, writes to a or b must not overwrite each other. ``` I've updated the patch D16586 to follow such behavior by verifying that we only change volatile bit-field access when: - it won't overlap with any other non-bit-field member - we only access memory inside the bounds of the record - avoid overlapping zero-length bit-fields. Regarding the number of memory accesses, that should be preserved, that will be implemented by D67399. Reviewed By: ostannard Differential Revision: https://reviews.llvm.org/D72932	2020-10-13 10:31:48 +01:00
Simon Pilgrim	6c23cbc560	[X86] Convert integer _mm_reduce_* intrinsics to emit llvm.reduction intrinsics (PR47506) Emit the equivalent integer reduction intrinsics in IR instead of expanding to shuffle+arithmetic sequences. The fadd/fmul reductions might be trickier as they assume a similar bisection reduction while the generic intrinsics assume a sequential reduction (intel docs are ambiguous on the correct approach) - I'm not sure if we want to always tag them with reassoc? Anyway, that issue can wait until a separate fp patch along with the fmin/fmax reductions. Differential Revision: https://reviews.llvm.org/D87604	2020-10-13 09:28:39 +01:00
Richard Smith	913f600566	Canonicalize declaration pointers when forming APValues. References to different declarations of the same entity aren't different values, so shouldn't have different representations. Recommit of `e6393ee813`, most recently reverted in `9a33f027ac` due to a bug caused by ObjCInterfaceDecls not propagating availability attributes along their redeclaration chains; that bug was fixed in `e2d4174e9c`.	2020-10-12 19:32:57 -07:00
Richard Smith	e2d4174e9c	Ensure that InheritedAttrs are properly inherited along a redeclaration chain for ObjCInterfaceDecls. Only one such declaration can actually have attributes (the definition, if any), but generally we assume that we can look for InheritedAttrs on the most recent declaration.	2020-10-12 19:22:31 -07:00
Wang, Pengfei	412cdcf2ed	[X86] Add HRESET instruction. For more details about these instructions, please refer to the latest ISE document: https://software.intel.com/en-us/download/intel-architecture-instruction-set-extensions-programming-reference. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D89102	2020-10-13 08:47:26 +08:00
Bruno Cardoso Lopes	cffb0dd54d	[SemaTemplate] Stop passing insertion position around during VarTemplate instantiation They can get stale at use time because of updates from other recursive specializations. Instead, rely on the existence of previous declarations to add the specialization. Differential Revision: https://reviews.llvm.org/D87853	2020-10-12 16:48:50 -07:00

1 2 3 4 5 ...

41671 Commits