llvm-project

Commit Graph

Author	SHA1	Message	Date
Shoaib Meenai	610a0e8b53	[ELF] Assert on invalid GOT or PLT relocations Because of https://llvm.org/PR50675, we can end up producing a PLT relocation referencing a symbol that's dropped from the dynamic symbol table, which in turn causes a crash at runtime. We ran into this again recently, resulting in crashes for our users. A subsequent diff will fix that issue, but add an assert to catch it if it happens again. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D123985	2022-04-20 10:46:04 -07:00
Eli Friedman	13fc178173	Force GHashCell to be 8-byte-aligned. Otherwise, with recent versions of libstdc++, clang can't tell that the atomic operations are properly aligned, and generates calls to libatomic. (Actually, because of the use of reinterpret_cast, it wasn't guaranteed to be aligned, but I think it ended up being aligned in practice.) Fixes https://github.com/llvm/llvm-project/issues/54790 , the part where LLVM failed to build. Differential Revision: https://reviews.llvm.org/D123872	2022-04-18 08:46:03 -07:00
Pavel Kosov	a5b7ea0783	[llvm-objdump] Implemented PrintBranchImmAsAddress for MIPS Updated MipsInstPrinter to print absolute hex offsets for branch instructions. It is necessary to make the llvm-objdump output close to the gnu objdump output. This implementation is based on the implementation for RISC-V. OS Laboratory. Huawei Russian Research Institute. Saint-Petersburg Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D123764	2022-04-15 23:48:38 +02:00
Fangrui Song	b483ce1228	[ELF][ARM] Fix unneeded thunk for branches to hidden undefined weak Similar to D123750 for AArch64.	2022-04-14 23:58:13 -07:00
Fangrui Song	02eab52866	[ELF][AArch64] Fix unneeded thunk for branches to hidden undefined weak Similar to D119787 for PPC64. A hidden undefined weak may change its binding to local before some `isUndefinedWeak` code, so some `isUndefinedWeak` code needs to be changed to `isUndefined`. The undefined non-weak case has already been reported as an error, so just using `isUndefined` is fine. The Linux kernel recently added a usage where a branch from 0xffff800008491ee0 references a hidden undefined weak symbol `vfio_group_set_kvm`. It relies on the behavior that a branch to an undefined weak symbol resolves to the next instruction; otherwise it would see spurious relocation out of range errors. Fixes https://github.com/ClangBuiltLinux/linux/issues/1624 Differential Revision: https://reviews.llvm.org/D123750	2022-04-14 11:32:30 -07:00
Jez Ng	2a6669060f	[lld-macho][nfc] De-templatize UnwindInfoSection Follow-on to {D123276}. Now that we work with an internal representation of compact unwind entries, we no longer need to template our UnwindInfoSectionImpl code based on the pointer size of the target architecture. I've still kept the split between `UnwindInfoSectionImpl` and `UnwindInfoSection`. I'd introduced that split in order to do type erasure, but I think it's still useful to have in order to keep `UnwindInfoSection`'s definition in the header file clean. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D123277	2022-04-13 16:19:22 -04:00
Tobias Hieta	837d16fb4c	[NFC] Simplify /noimplib argument logic	2022-04-13 16:40:30 +02:00
Tobias Hieta	2af4385477	[LLD][COFF] Add support for /noimplib Mostly for compatibility reasons with link.exe, this flag makes sure we don't write an implib, not even when /implib is also passed; that's how link.exe works. Differential Revision: https://reviews.llvm.org/D123591	2022-04-13 16:40:29 +02:00
Tobias Hieta	eb4eef9ec4	[LLD][COFF] Add support for /noimplib Mostly for compatibility reasons with link.exe, this flag makes sure we don't write an implib, not even when /implib is also passed; that's how link.exe works. Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D123591	2022-04-13 10:32:44 +02:00
Jez Ng	1cff723ff5	[lld-macho][nfc] Use includeInSymtab for all symtab-skipping logic {D123302} got me looking deeper at `includeInSymtab`. I thought it was a little odd that there were excluded (live) symbols for which `includeInSymtab` was false; we shouldn't have so many different ways to exclude a symbol. As such, this diff makes the `L`-prefixed-symbol exclusion code use `includeInSymtab` too. (Note that as part of our support for `__eh_frame`, we will also be excluding all `__eh_frame` symbols from the symtab in a future diff.) Another thing I noticed is that the `emitStabs` code never has to deal with excluded symbols because `SymtabSection::finalize()` already filters them out. As such, I've updated the comments and asserts from {D123302} to reflect this. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D123433	2022-04-11 15:45:46 -04:00
Nico Weber	75196b99fb	[llvm-lib] Add /WX, warn by default on empty inputs, add opt-out lib.exe by default exits successfully without writing an output file when no inputs are passed. llvm-lib has the same behavior, for compatibility. This behavior interacts poorly with build systems: If a static library target had no inputs, llvm-lib would not produce an output file, causing ninja (or make, or a similar system) to successfully run that step, but then re-run it on the next build. After this patch, llvm-lib emits a warning in this case, that with /WX can be turned into an error. That way, ninja (or make, or...) will mark the initial build as failed. People who don't like the warning can use /ignore:emptyoutput to suppress it. The warning also points out the existing flag /llvmlibempty which forces creation of an empty .lib file (this is an extension to lib.exe). Differential Revision: https://reviews.llvm.org/D123517	2022-04-11 13:15:30 -04:00
Vy Nguyen	1477964413	[lld][macho] Fix test to sort symbol table before dumping Details: The test previously expected a specific order of those symbols, which is not guaranteed (it could change simply due to hashing changes, etc). So we change it to explicitly sort the symbols before checking contents. PR/53026 Differential Revision: https://reviews.llvm.org/D116813	2022-04-11 12:01:04 -04:00
Jez Ng	82dcf30636	[lld-macho] Use fewer indirections in UnwindInfo implementation The previous implementation of UnwindInfoSection materialized all the compact unwind entries & applied their relocations, then parsed the resulting data to generate the final unwind info. This design had some unfortunate consequences: since relocations can only be applied after their referents have had addresses assigned, operations that need to happen before address assignment must contort themselves. (See {D113582} and observe how this diff greatly simplifies it.) Moreover, it made synthesizing new compact unwind entries awkward. Handling PR50956 will require us to do this synthesis, and is the main motivation behind this diff. Previously, instead of generating a new CompactUnwindEntry directly, we would have had to generate a ConcatInputSection with a number of `Reloc`s that would then get "flattened" into a CompactUnwindEntry. This diff introduces an internal representation of `CompactUnwindEntry` (the former `CompactUnwindEntry` has been renamed to `CompactUnwindLayout`). The new CompactUnwindEntry stores references to its personality symbol and LSDA section directly, without the use of `Reloc` structs. In addition to being easier to work with, this diff also allows us to handle unwind info whose personality symbols are located in sections placed after the `__unwind_info`. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D123276	2022-04-08 23:49:07 -04:00
Matt Arsenault	63fe6d7eae	lld/AMDGPU: Fix asserts if no object files are involved in link Fixes issue 47690. The reproduction steps produced a shared object from clang directly, and then fed the shared object back into lld. With no regular object files, this assert was hit. I'm not sure if we need to or should be looking for equivalent fields in shared objects.	2022-04-08 14:18:52 -04:00
Jorge Gorbe Moya	627f55b3ae	Fix format specifier. NFCI. Using a portable format specifier avoids a "format specifies type 'unsigned long long' but the argument has type 'uint64_t' (aka 'unsigned long') [-Werror,-Wformat]" error depending on the exact definition of `uint64_t`.	2022-04-07 15:26:49 -07:00
Zequan Wu	1da67ecefd	[llvm-symbolizer] Fix line offset for inline site. This fixes the issue when the current line offset is actually for next range. Maintain a current code range with current line offset and cache next file/line offset. Update file/line offset after finishing current range. Differential Revision: https://reviews.llvm.org/D123151	2022-04-07 15:17:59 -07:00
Jez Ng	b440c25742	[lld-macho][nfc] Give non-text ConcatOutputSections order-independent finalization This diff is motivated by my work to add proper DWARF unwind support. As detailed in PR50956 functions that need DWARF unwind need to have compact unwind entries synthesized for them. These CU entries encode an offset within `__eh_frame` that points to the corresponding DWARF FDE. In order to encode this offset during `UnwindInfoSectionImpl::finalize()`, we need to first assign values to `InputSection::outSecOff` for each `__eh_frame` subsection. But `__eh_frame` is ordered after `__unwind_info` (according to ld64 at least), which puts us in a bit of a bind: `outSecOff` gets assigned during finalization, but `__eh_frame` is being finalized after `__unwind_info`. But it occurred to me that there's no real need for most ConcatOutputSections to be finalized sequentially. It's only necessary for text-containing ConcatOutputSections that may contain branch relocs which may need thunks. ConcatOutputSections containing other types of data can be finalized in any order. This diff moves the finalization logic for non-text sections into a separate `finalizeContents()` method. This method is called before section address assignment & unwind info finalization takes place. In theory we could call these `finalizeContents()` methods in parallel, but in practice it seems to be faster to do it all on the main thread. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D123279	2022-04-07 18:13:27 -04:00
Fangrui Song	be01af4a0f	[ELF] Fix non-relocatable-non-emit-relocs --gc-sections to discard .L symbols This reverts commit `764cd491b1`, which I incorrectly assumed to be NFC, partly because there was no test coverage for the non-relocatable non-emit-relocs case before 9d6d936243fe343abe89323a27c7241b395af541. The interaction of {,-r,--emit-relocs} {,--discard-locals} {,--gc-sections} is complex, but without -r/--emit-relocs, --gc-sections does need to discard .L symbols like --no-gc-sections. The behavior matches GNU ld.	2022-04-07 14:34:32 -07:00
Fangrui Song	e25c41803f	[ELF][test] Improve discard-locals.s	2022-04-07 14:24:15 -07:00
Nico Weber	2cb3d28b17	[lld/mac] Add some comments and asserts I was wondering if SymtabSection::emitStabs() should check defined->includeInSymtab. Add asserts and comments explaining why that's not necessary. No behavior change. Differential Revision: https://reviews.llvm.org/D123302	2022-04-07 15:43:28 -04:00
Jez Ng	f004ecf6ec	[lld-macho][nfc] Remove indirection when looking up common section members {D118797} means that we can now check the name/segname of a given section directly, instead of having to look those properties up on one of its subsections. This allows us to simplify our code. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D123275	2022-04-07 14:28:52 -04:00
Jez Ng	da6b6b3c82	[lld-macho][nfc] Factor out findSymbolAtOffset Our compact unwind handling code currently has some logic to locate a symbol at a given offset in an InputSection. The EH frame code will need to do something similar, so let's factor out the code. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D123301	2022-04-07 09:13:39 -04:00
Nico Weber	8c1ea1ab81	[lld/mac] Don't emit stabs entries for functions folded during ICF This matches ld64, and makes dsymutil work better with lld's output. Fixes PR54783, see there for details. Reduces time needed to run dsymutil on Chromium Framework from 8m30s (which is already down from 26 min with D123218) to 6m30s and removes many lines of "could not find object file symbol for symbol" from dsymutil output (previously: several MB of those messages, now dsymutil is completely silent). Differential Revision: https://reviews.llvm.org/D123252	2022-04-07 08:09:32 -04:00
Simon Pilgrim	156b94c2d3	Fix "result of 32-bit shift implicitly converted to 64 bits" MSVC warning. NFC.	2022-04-07 11:25:09 +01:00
Nikita Popov	b8f50abd04	[lld] Remove support for legacy pass manager This removes options for performing LTO with the legacy pass manager in LLD. Options that explicitly enable the new pass manager are retained as no-ops. Differential Revision: https://reviews.llvm.org/D123219	2022-04-07 10:17:31 +02:00
Tobias Hieta	0dfa8a019d	[LLD][COFF] Fix TypeServerSource matcher with more than one collision Follow-up from `98bc304e9f` - while that commit fixed when you had two PDBs colliding on the same Guid it didn't fix the case where you had more than two PDBs using the same Guid. This commit fixes that and also tests much more carefully that all the types are correct no matter the order. Reviewed By: aganea, saudi Differential Revision: https://reviews.llvm.org/D123185	2022-04-07 09:33:46 +02:00
Fangrui Song	c29c19cb53	[ELF] Ignore --no-add-needed It is used by a few projects like keepassxc and mumble. Also see https://bugzilla.redhat.com/show_bug.cgi?id=2070813 that Fedora gcc has an (unneeded) gcc12-no-add-needed.patch which adds --no-add-needed, although --[no-]add-needed has been deprecated in GNU ld since 2009. Adding this has low costs and makes several folks happy. This basically restores `8f13bef575`. Fixes https://github.com/llvm/llvm-project/issues/54756	2022-04-06 22:41:27 -07:00
Jez Ng	e4b286211c	[lld-macho][nfc] Rearrange order of statements to clarify data dependencies	2022-04-07 00:00:41 -04:00
Nikita Popov	ed4e6e0398	[cmake] Remove LLVM_ENABLE_NEW_PASS_MANAGER cmake option Or rather, error out if it is set to something other than ON. This removes the ability to enable the legacy pass manager by default, but does not remove the ability to explicitly enable it through various flags like -flegacy-pass-manager or -enable-new-pm=0. I checked, and our test suite definitely doesn't pass with LLVM_ENABLE_NEW_PASS_MANAGER=OFF anymore. Differential Revision: https://reviews.llvm.org/D123126	2022-04-06 09:52:21 +02:00
Martin Storsjö	46776f7556	Fix warnings about variables that are set but only used in debug mode Add void casts to mark the variables used, next to the places where they are used in assert or `LLVM_DEBUG()` expressions. Differential Revision: https://reviews.llvm.org/D123117	2022-04-06 10:01:46 +03:00
Argyrios Kyrtzidis	330268ba34	[Support/Hash functions] Change the `final()` and `result()` of the hashing functions to return an array of bytes Returning `std::array<uint8_t, N>` is better ergonomics for the hashing functions usage, instead of a `StringRef`: * When returning `StringRef`, client code is "jumping through hoops" to do string manipulations instead of dealing with fixed array of bytes directly, which is more natural * Returning `std::array<uint8_t, N>` avoids the need for the hasher classes to keep a field just for the purpose of wrapping it and returning it as a `StringRef` As part of this patch also: * Introduce `TruncatedBLAKE3` which is useful for using BLAKE3 as the hasher type for `HashBuilder` with non-default hash sizes. * Make `MD5Result` inherit from `std::array<uint8_t, 16>` which improves & simplifies its API. Differential Revision: https://reviews.llvm.org/D123100	2022-04-05 21:38:06 -07:00
Mitch Phillips	786c89fed3	[ELF][MTE] Add --android-memtag-* options to synthesize ELF notes This ELF note is aarch64 and Android-specific. It specifies to the dynamic loader that specific work should be scheduled to enable MTE protection of stack and heap regions. Current synthesis of the ".note.android.memtag" ELF note is done in the Android build system. We'd like to move that to the compiler. This patch adds the --memtag-stack, --memtag-heap, and --memtag-mode={async, sync, none} flags to the linker, which synthesises the note for us. Future changes will add -fsanitize=memtag* flags to clang which will pass these through to lld. Depends on D119381. Differential Revision: https://reviews.llvm.org/D119384	2022-04-04 11:17:36 -07:00
Nico Weber	cd52b35ee4	fix comment typos to cycle bots	2022-04-04 08:56:18 -04:00
Fangrui Song	388584d382	[ELF][test] Fix RUN lines in lto/sample-profile.ll Reported at https://github.com/llvm/llvm-project/issues/54679#issuecomment-1086862116	2022-04-03 23:57:31 -07:00
Tobias Hieta	98bc304e9f	[lld][COFF] Fix TypeServerSource lookup on GUID collisions Microsoft shipped a bunch of PDB files with broken/invalid GUIDs, which led lld to use 0xFF as the key for these files in an internal cache. When multiple files have this key it will lead to collisions and confused symbol lookup. Several approaches to fixing this were considered, including making the key the path to the PDB file, but this requires some filesystem operations in order to normalize the file path. Since this only happens with malformed PDB files, and we hadn't seen this before the malformed files were shipped with Visual Studio, we probably shouldn't optimize for this use-case. Instead we now just don't insert files with Guid == 0xFF into the cache map and warn if we get collisions, so similar problems can be found in the future instead of being silent. Discussion about the root issue and the approach to this fix can be found on GitHub: https://github.com/llvm/llvm-project/issues/54487 Reviewed By: aganea Differential Revision: https://reviews.llvm.org/D122372	2022-04-02 10:09:07 +02:00
Nico Weber	663a7fa712	[lld/mac] Tweak a few comments Addresses review feedback I had missed on https://reviews.llvm.org/D122624 No behavior change. Differential Revision: https://reviews.llvm.org/D122904	2022-04-01 19:32:07 -04:00
Arthur Eubanks	79a9fe6c8a	[test] Mark uuid.s as unsupported on Windows For systems using gnuwin32, awk does not exist.	2022-04-01 15:32:51 -07:00
Leonard Grey	a9e325116c	Add output filename to UUID hash Differential Revision: https://reviews.llvm.org/D122843	2022-03-31 18:50:05 -04:00
Roger Kim	34b9729561	[lld-macho][NFC] Encapsulate symbol priority implementation. Just some code clean up. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D122752	2022-03-31 13:47:38 -04:00
Nico Weber	10cda6e36c	[lld/mac] Give range extension thunks for local symbols local visibility When two local symbols (think: file-scope static functions, or functions in unnamed namespaces) with the same name in two different translation units both needed thunks, ld64.lld previously created external thunks for both of them. These thunks ended up with the same name, leading to a duplicate symbol error for the thunk symbols. Instead, give thunks for local symbols local visibility. (Hitting this requires a jump to a local symbol from over 128 MiB away. It's unlikely that a single .o file is 128 MiB large, but with ICF you can end up with a situation where the local symbol is ICF'd with a symbol in a separate translation unit. And that can introduce a large enough jump to require a thunk.) Fixes PR54599. Differential Revision: https://reviews.llvm.org/D122624	2022-03-30 16:45:05 -04:00
Fangrui Song	c0065f1182	[ELF] Default to --no-fortran-common D86142 introduced --fortran-common and defaulted it to true (matching GNU ld but deviating from gold/macOS ld64). The default state was motivated by transparently supporting some FORTRAN 77 programs (Fortran 90 deprecated common blocks). Having thought about it again, I believe we made a mistake in changing the default: * this is a weird and legacy rule, though the breakage is very small * --fortran-common introduced complexity to parallel symbol resolution and will slow it down * --fortran-common is more likely to cause issues when users mix COMMON and STB_GLOBAL definitions (see https://github.com/llvm/llvm-project/issues/48570 and https://maskray.me/blog/2022-02-06-all-about-common-symbols). I have seen several issues in our internal projects and Android. On the other hand, --no-fortran-common is safer since COMMON/STB_GLOBAL have the same semantics related to archive member extraction. Therefore I think we should switch back, not punishing the common usage. A platform wanting --fortran-common can implement ld.lld as a shell script wrapper around `lld -flavor gnu --fortran-common "$@"`. Reviewed By: ikudrin, sfertile Differential Revision: https://reviews.llvm.org/D122450	2022-03-30 09:12:09 -07:00
Fangrui Song	4645311933	[ELF] --emit-relocs: adjust offsets of .rel[a].eh_frame relocations Two code paths may reach the EHFrame case in SectionBase::getOffset: * .eh_frame reference * relocation copy for --emit-relocs The first may be used by clang_rt.crtbegin.o and GCC crtbeginT.o to get the start address of the output .eh_frame. The relocation has an offset of 0 or (x86-64 PC-relative leaq for clang_rt.crtbegin.o) -4. The current code just returns `offset`, which handles this case well. The second is related to InputSection::copyRelocations on .eh_frame (used by --emit-relocs). .eh_frame pieces may be dropped due to GC/ICF, so we should convert the input offset to the output offset. Use the same way as MergeInputSection with a special case handling outSecOff==-1 for an invalid piece (see eh-frame-marker.s). This exposes an issue in mips64-eh-abs-reloc.s that we don't reliably handle anyway. Just add --no-check-dynamic-relocations to paper over it. Differential Revision: https://reviews.llvm.org/D122459	2022-03-29 09:51:41 -07:00
Fangrui Song	7370a489b1	[ELF] --emit-relocs: fix missing STT_SECTION when the first input section is synthetic addSectionSymbols suppresses the STT_SECTION symbol if the first input section is non-SHF_MERGE synthetic. This is incorrect when the first input section is synthetic while a non-synthetic input section exists: * `.bss : { (COMMON) (.bss) }` (`abc388ed3c` regressed the case because COMMON symbols precede .bss in the absence of a linker script) * Place a synthetic section in another section: `.data : { (.got) (.data) }` For `%t/a1` in the new test emit-relocs-synthetic.s, ld.lld produces incorrect relocations with symbol index 0. ``` 0000000000000000 <_start>: 0: 8b 05 33 00 00 00 movl 51(%rip), %eax # 0x39 <bss> 0000000000000002: R_X86_64_PC32 ABS+0xd 6: 8b 05 1c 00 00 00 movl 28(%rip), %eax # 0x28 <common> 0000000000000008: R_X86_64_PC32 common-0x4 c: 8b 05 06 00 00 00 movl 6(%rip), %eax # 0x18 000000000000000e: R_X86_64_GOTPCRELX ABS+0x4 ``` Fix the issue by checking every input section. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D122463	2022-03-29 08:56:21 -07:00
Fangrui Song	48e251b1d6	Revert D122459 "[ELF] --emit-relocs: adjust offsets of .rel[a].eh_frame relocations" This reverts commit `6faba31e0d`. It may cause "offset is outside the section".	2022-03-28 20:26:21 -07:00
Fangrui Song	6faba31e0d	[ELF] --emit-relocs: adjust offsets of .rel[a].eh_frame relocations .eh_frame pieces may be dropped due to GC/ICF. When --emit-relocs adds relocations against .eh_frame, the offsets need to be adjusted. Use the same way as MergeInputSection with a special case handling outSecOff==-1 for an invalid piece (see eh-frame-marker.s). This exposes an issue in mips64-eh-abs-reloc.s that we don't reliably handle anyway. Just add --no-check-dynamic-relocations to paper over it. Original patch by Ayrton Muñoz Differential Revision: https://reviews.llvm.org/D122459	2022-03-28 16:23:13 -07:00
Fangrui Song	27ef7494b1	[ELF][test] Refactor some .eh_frame tests * Improve eh-frame-merge.s * Delete invalid .eh_frame+5 test in ehframe-relocation.s	2022-03-28 15:55:46 -07:00
Fangrui Song	1db59dc8e2	[ELF] Fix llvm_unreachable failure when COMMON is placed in SHT_PROGBITS output section Fix a regression in aa27bab5a1a17e9c4168a741a6298ecaa92c1ecb: COMMON in an SHT_PROGBITS output section caused llvm_unreachable failure.	2022-03-28 11:05:52 -07:00
Fangrui Song	8565a87fd4	[ELF] Simplify MergeInputSection::getParentOffset. NFC and remove overly verbose comments.	2022-03-28 10:02:35 -07:00
Fangrui Song	c37accf0a2	[Option] Avoid using the default argument for the 3-argument hasFlag. NFC The default argument true is error-prone: I think many would think the default is false.	2022-03-26 00:57:06 -07:00
Sam McCall	57ee624d79	[cmake] Provide CURRENT_TOOLS_DIR centrally, replacing CLANG_TOOLS_DIR CLANG_TOOLS_DIR holds the current bin/ directory, maybe with a %(build_mode) placeholder. It is used to add the just-built binaries to $PATH for lit tests. In most cases it equals LLVM_TOOLS_DIR, which is used for the same purpose. But for a standalone build of clang, CLANG_TOOLS_DIR points at the build tree and LLVM_TOOLS_DIR points at the provided LLVM binaries. Currently CLANG_TOOLS_DIR is set in clang/test/, clang-tools-extra/test/, and other things always built with clang. This is a few cryptic lines of CMake in each place. Meanwhile LLVM_TOOLS_DIR is provided by configure_site_lit_cfg(). This patch moves CLANG_TOOLS_DIR to configure_site_lit_cfg() and renames it: - there's nothing clang-specific about the value - it will also replace LLD_TOOLS_DIR, LLDB_TOOLS_DIR etc (not in this patch) It also defines CURRENT_LIBS_DIR. While I removed the last usage of CLANG_LIBS_DIR in `e4cab4e24d`, there are LLD_LIBS_DIR usages etc that may be live, and I'd like to mechanically update them in a followup patch. Differential Revision: https://reviews.llvm.org/D121763	2022-03-25 20:22:01 +01:00
Fangrui Song	940bd4c771	[ELF] addSectionSymbols: simplify isec->getOutputSection(). NFC	2022-03-24 21:54:20 -07:00
Fangrui Song	d3e5b6f753	[ELF] Implement --build-id={md5,sha1} with truncated BLAKE3 --build-id was introduced as "approximation of true uniqueness across all binaries that might be used by overlapping sets of people". It does not require the some resistance mentioned below. In practice, people just use --build-id=md5 for 16-byte build ID and --build-id=sha1 for 20-byte build ID. BLAKE3 has 256-bit key length, which provides 128-bit security against (second-)preimage, collision, and differentiability attacks. Its portable implementation is fast. It additionally provides Arm Neon/AVX2/AVX-512. Just implement --build-id={md5,sha1} with truncated BLAKE3. Linking clang 14 RelWithDebInfo with --threads=8 on a Skylake CPU: * 1.13x as fast with --build-id=md5 * 1.15x as fast with --build-id=sha1 --threads=4 on Apple m1: * 1.25x as fast with --build-id=md5 * 1.17x as fast with --build-id=sha1 Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D121531	2022-03-24 11:31:39 -07:00
Jakob Koschel	0c86198b27	Reland "[ELF] Enable new passmanager plugin support for LTO" This is the original patch + a check that LLVM_BUILD_EXAMPLES is enabled before adding a dependency on the 'Bye' example pass. Original summary: Add cli options for new passmanager plugin support to lld. Currently it is not possible to load dynamic NewPM plugins with lld. This is an incremental update to D76866. While that patch only added cli options for llvm-lto2, this adds them for lld as well. This is especially useful for running dynamic plugins on the linux kernel with LTO. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D120490	2022-03-24 16:29:18 +01:00
Raphael Isemann	1104d79261	Revert "[ELF] Enable new passmanager plugin support for LTO" This reverts commit `32012eb11b`. Broke CMake configuration.	2022-03-24 09:57:15 +01:00
Jakob Koschel	32012eb11b	[ELF] Enable new passmanager plugin support for LTO Add cli options for new passmanager plugin support to lld. Currently it is not possible to load dynamic NewPM plugins with lld. This is an incremental update to D76866. While that patch only added cli options for llvm-lto2, this adds them for lld as well. This is especially useful for running dynamic plugins on the linux kernel with LTO. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D120490	2022-03-24 08:08:54 +01:00
Roger Kim	f858fba631	[lld][Macho][NFC] Encapsulate priorities map in a priority class `config->priorities` has been used to hold the intermediate state during the construction of the order in which sections should be laid out. This is not a good place to hold this state since the intermediate state is not a "configuration" for LLD. It should be encapsulated in a class for building a mapping from section to priority (which I created in this diff as the `PriorityBuilder` class). The same thing is being done for `config->callGraphProfile`. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D122156	2022-03-23 13:57:26 -04:00
Jacob Lambert	71b162c4bd	[AMDGPU][LLD] Adding support for ABI version 5 option Code object version 5 will use the same EFlags as version 4, so we only need to add an additional case Differential Revision: https://reviews.llvm.org/D122190	2022-03-23 01:22:37 -07:00
Jez Ng	c9c2363048	[lld-macho][nfc] Don't mix file sizes with addresses Update DataInCode's calculation of `endAddr` to use `getSize()` instead of `getFileSize()` -- while in practice they're the same for non-zerofill sections (which code sections are), we still should treat address sizes / offsets as distinct from file sizes / offsets.	2022-03-22 17:52:53 -04:00
Jez Ng	a993d607de	[lld-macho][nfc] Add comment explaining why a cast<> is safe	2022-03-21 07:23:09 -04:00
Jez Ng	1c0234dfcc	[lld-macho][nfc] Have findContainingSubsection take a Section ... instead of an instance of `Subsections`. This simplifies the code slightly since all its callsites have a Section instance anyway.	2022-03-21 07:23:09 -04:00
Sam Clegg	a04a507714	[lld][WebAssembly] Fix crash accessing non-live __tls_base symbol In programs that don't otherwise depend on `__tls_base` it won't be marked as live. However, this symbol is used internally in a couple of places, so we need to mark it as live explicitly in those places. Fixes: #54386 Differential Revision: https://reviews.llvm.org/D121931	2022-03-17 13:59:45 -07:00
henry wong	948d05324a	[LTO][ELF] Require asserts for --stats-file= tests. https://reviews.llvm.org/D121809 causes the build bot failure, add the `REQUIRES: asserts` to fix it. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D121888	2022-03-17 23:57:13 +08:00
wangliushuai	1c04b52b25	[LTO][ELF] Add --stats-file= option. This patch adds a StatsFile option supported by gold to lld, related patch https://reviews.llvm.org/D45531. Reviewed By: tejohnson, MaskRay Differential Revision: https://reviews.llvm.org/D121809	2022-03-17 12:01:39 +08:00
Jez Ng	f5ddcf25d6	[lld-macho] Extend lto-internalize-unnamed-addr.ll * Test the case where a symbol is sometimes linkonce_odr and sometimes weak_odr * Test the visibility of the symbols at the IR level, after the internalize stage of LTO is done. (Previously we only checked the visibility of symbols in the final output binary.) Reviewed By: modimo Differential Revision: https://reviews.llvm.org/D121428	2022-03-16 17:30:31 -04:00
Sam McCall	75acad41bc	Use lit_config.substitute instead of foo % lit_config.params everywhere This mechanically applies the same changes from D121427 everywhere. Differential Revision: https://reviews.llvm.org/D121746	2022-03-16 09:57:41 +01:00
serge-sans-paille	989f1c72e0	Cleanup codegen includes This is a (fixed) recommit of https://reviews.llvm.org/D121169 after: 1061034926 before: 1063332844 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D121681	2022-03-16 08:43:00 +01:00
Fangrui Song	c9dbf407af	[ELF] Move invalid binding diagnostic from initializeSymbols to postParse It is excessive to have a diagnostic for STB_LOCAL. Just reuse the invalid binding diagnostic for STB_LOCAL.	2022-03-16 00:31:29 -07:00
Fangrui Song	bdb98bd979	[ELF] Use endianness-aware read32 to avoid dispatch. NFC	2022-03-15 23:51:11 -07:00
Fangrui Song	385573e07b	[ELF] Inline ARMExidxSyntheticSection::classof. NFC To optimize the only call site `dyn_cast<ARMExidxSyntheticSection>(first)` and decrease code size.	2022-03-15 23:41:30 -07:00
Fangrui Song	1a590232f4	[ELF] Optimize "Strip sections" If SHT_LLVM_SYMPART is unused, don't iterate over inputSections. If neither --strip-debug nor --strip-all is specified, don't iterate over inputSections.	2022-03-15 23:15:43 -07:00
Fangrui Song	7c7702b318	[ELF] Move section assignment from initializeSymbols to postParse https://discourse.llvm.org/t/parallel-input-file-parsing/60164 initializeSymbols currently sets Defined::section and handles non-prevailing COMDAT groups. Move the code to the parallel postParse to reduce work from the single-threaded code path and make parallel section initialization feasible. Postpone reporting duplicate symbol errors so that the messages have the section information. (`Defined::section` is assigned in postParse and another thread may not have the information). * duplicated-synthetic-sym.s: BinaryFile duplicate definition (very rare) now has no section information * comdat-binding: `%t/w.o %t/g.o` leads to an undesired undefined symbol. This is not ideal but we report a diagnostic to inform that this is unsupported. (See release note) * comdat-discarded-lazy.s: %tdef.o is unextracted. The new behavior (discarded section error) makes more sense * i386-comdat.s: switched to a better approach working around .gnu.linkonce.t.__x86.get_pc_thunk.bx in glibc<2.32 for x86-32. Drop the ancient no-longer-relevant workaround for __i686.get_pc_thunk.bx Depends on D120640 Differential Revision: https://reviews.llvm.org/D120626	2022-03-15 19:24:41 -07:00
Fangrui Song	9b61fff0eb	Revert D120626 "[ELF] Move section assignment from initializeSymbols to postParse" This reverts commit `c30e6447c0`. It exposed brittle support for __x86.get_pc_thunk.bx. Need to think a bit how to support __x86.get_pc_thunk.bx.	2022-03-15 19:00:54 -07:00
Fangrui Song	48a02152ab	[ELF][test] Improve i386-linkonce.s Make it behave like the glibc<2.32 .gnu.linkonce usage that we want to work around.	2022-03-15 18:47:52 -07:00
Sam Clegg	4690bf2ed3	[lld][WebAssembly] Take advantage of extended const expressions when available In particular we use these in two places: 1. When building PIC code we no longer need to combine output segments into a single segment that can be initialized at `__memory_base`. Instead each segment can encode its offset from `__memory_base` in its initializer. e.g. ``` (i32.add (global.get __memory_base) (i32.const offset)) ``` 2. When building PIC code we no longer need to relocate internalized global addresses. We can just initialize them with their correct offsets. Differential Revision: https://reviews.llvm.org/D121420	2022-03-15 17:50:05 -07:00
Jez Ng	8ce3750ff6	[lld-macho] Set FinalDefinitionInLinkageUnit on most LTO externs Since Mach-O has a two-level namespace (unlike ELF), we can usually set this property to true. (I believe this setting is only available in the new LTO backend, so I can't really use ld64 / libLTO's behavior as a reference here... I'm just doing what I think is correct.) See {D119294} for the work done to calculate the `interposable` used in this diff. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D119506	2022-03-15 20:25:06 -04:00
Fangrui Song	c1d4c67718	[ELF] Suppress duplicate symbol error for __x86.get_pc_thunk.bx	2022-03-15 17:20:29 -07:00
Sam Clegg	86c90f9bfd	[lld][WebAssembly] Add --unresolved-symbols=import-dynamic This is a new mode for handling unresolved symbols that allows all symbols to be imported in the same way that they would be in the case of `-fpie` or `-shared`, but generating an otherwise fixed/non-relocatable binary. Code linked in this way should still be compiled with `-fPIC` so that data symbols can be resolved via imports. This essentially allows the building of static binaries that have dynamic imports. See: https://github.com/emscripten-core/emscripten/issues/12682 As with other uses of the experimental dynamic linking ABI, this behaviour will produce a warning unless run with `--experimental-pic`. Differential Revision: https://reviews.llvm.org/D91577	2022-03-15 15:10:21 -07:00
Fangrui Song	6be457c14d	[ELF] Work around not-fully-supported .gnu.linkonce.t.__x86.get_pc_thunk.bx	2022-03-15 14:48:29 -07:00
Jez Ng	ceff23c6e3	[lld-macho] -flat_namespace for dylibs should make all externs interposable All references to interposable symbols can be redirected at runtime to point to a different symbol definition (with the same name). For example, if both dylib A and B define symbol _foo, and we load A before B at runtime, then all references to _foo within dylib B will point to the definition in dylib A. ld64 makes all extern symbols interposable when linking with `-flat_namespace`. TODO 1: Support `-interposable` and `-interposable_list`, which should just be a matter of parsing those CLI flags and setting the `Defined::interposable` bit. TODO 2: Set Reloc::FinalDefinitionInLinkageUnit correctly with this info (we are currently not setting it at all, so we're erring on the conservative side, but we should help the LTO backend generate more optimal code.) Reviewed By: modimo, MaskRay Differential Revision: https://reviews.llvm.org/D119294	2022-03-14 22:18:32 -04:00
Jez Ng	7f3ddf8443	[lld-macho][nfc] Allow Defined symbols to be placed in binding sections Previously, we only allowed this for DylibSymbols. However, in order to properly support `-flat_namespace` as well as `-interposable`, we need to allow this for Defined symbols too. Therefore we hoist the `lazyBindOffset` and the `stubsHelperIndex` into the parent Symbol class. The actual change to support interposition under `-flat_namespace` is in {D119294}; the NFC changes here have been split out for easier review. Perf regression isn't stat sig on my 3.2 GHz 16-Core Intel Xeon W linking chromium_framework: base diff difference (95% CI) sys_time 1.227 ± 0.021 1.234 ± 0.031 [ -0.3% .. +1.5%] user_time 3.665 ± 0.036 3.674 ± 0.035 [ -0.2% .. +0.7%] wall_time 4.596 ± 0.055 4.609 ± 0.064 [ -0.3% .. +0.9%] samples 34 47 Max RSS regression is barely stat sig: base diff difference (95% CI) time 1003664356.324 ± 15404053.912 1010380403.613 ± 10578309.455 [ +0.0% .. +1.3%] samples 37 31 Reviewed By: modimo Differential Revision: https://reviews.llvm.org/D121351	2022-03-14 22:18:32 -04:00
Vy Nguyen	0d5e27623a	Reland "[lld-macho] Avoid using bump-alloc in TrieBuilder" This reverts commit `ee7a286cd3`.	2022-03-14 19:33:13 -04:00
Sterling Augustine	ee7a286cd3	Revert "[lld-macho] Avoid using bump-alloc in TrieBuilder" This reverts commit `e049a87f04`. That commit breaks the build with errors of the form: /usr/local/google/home/saugustine/llvm/llvm-project/lld/MachO/ExportTrie.cpp:148:11: error: definition of implicitly declared destructor TrieNode::~TrieNode() {	2022-03-14 15:23:04 -07:00
Vy Nguyen	e049a87f04	[lld-macho] Avoid using bump-alloc in TrieBuilder The code can be used from multiple threads and the allocator is not thread-safe. Fixes PR/54378 Reviewed By: int3, #lld-macho Differential Revision: https://reviews.llvm.org/D121638	2022-03-14 17:22:53 -04:00
Fangrui Song	c30e6447c0	[ELF] Move section assignment from initializeSymbols to postParse https://discourse.llvm.org/t/parallel-input-file-parsing/60164 initializeSymbols currently sets Defined::section and handles non-prevailing COMDAT groups. Move the code to the parallel postParse to reduce work from the single-threaded code path and make parallel section initialization feasible. Postpone reporting duplicate symbol errors so that the messages have the section information. (`Defined::section` is assigned in postParse and another thread may not have the information). * duplicated-synthetic-sym.s: BinaryFile duplicate definition (very rare) now has no section information * comdat-binding: `%t/w.o %t/g.o` leads to an undesired undefined symbol. This is not ideal but we report a diagnostic to inform that this is unsupported. (See release note) * comdat-discarded-lazy.s: %tdef.o is unextracted. The new behavior (discarded section error) makes more sense Depends on D120640 Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D120626	2022-03-14 14:13:41 -07:00
Fangrui Song	c7cf960d85	[ELF] Set the priority of STB_GNU_UNIQUE the same as STB_WEAK In GCC -fgnu-unique output, STB_GNU_UNIQUE symbols are always defined relative to a section in a COMDAT group. Currently `other` cannot be STB_GNU_UNIQUE for valid input, so this patch is NFC. If we switch to the model that ignores COMDAT resolution when performing symbol resolution (D120626), this will fix bogus `relocation refers to a symbol in a discarded section` errors when mixing -fno-gnu-unique objects with -fgnu-unique objects. Differential Revision: https://reviews.llvm.org/D120640	2022-03-14 12:00:15 -07:00
Sam Clegg	9504ab32b7	[WebAssembly] Second phase of implementing the extended const proposal This change continues to lay the groundwork for supporting extended const expressions in the linker. The included test covers object file reading and writing and the YAML representation. Differential Revision: https://reviews.llvm.org/D121349	2022-03-14 08:55:47 -07:00
Nico Weber	17414150cf	[lld-link] Tweak winsysroottest.test to have passing links on happy path Previously, the test checked for an "undefined symbol" error (instead of the "could not open std*.lib" which would happen without the flag). Instead, use /entry: so that the link succeeds. No behavior change, but maybe makes the test a bit easier to understand. Differential Revision: https://reviews.llvm.org/D121553	2022-03-14 10:44:26 -04:00
Fangrui Song	7b8fbb796c	[ELF] Simplify addCopyRelSymbol with invokeELFT. NFC	2022-03-12 14:08:10 -08:00
Petr Hosek	0c0f6cfb7b	[CMake] Rename TARGET_TRIPLE to LLVM_TARGET_TRIPLE This clarifies that this is an LLVM specific variable and avoids potential conflicts with other projects. Differential Revision: https://reviews.llvm.org/D119918	2022-03-11 15:43:01 -08:00
Jez Ng	9b7b21d2f7	[lld-macho] Don't allocate memory in parallelForEach ... since BumpPtrAllocator isn't thread-safe. Reviewed By: #lld-macho, Roger Differential Revision: https://reviews.llvm.org/D121458	2022-03-11 13:32:24 -05:00
Fangrui Song	4a8de2832a	[ELF] Add -z pack-relative-relocs GNU ld 2.38 added -z pack-relative-relocs which is similar to --pack-dyn-relocs=relr but synthesizes the `GLIBC_ABI_DT_RELR` version dependency if a shared object named `libc.so.` has a `GLIBC_2.` version dependency. This is used to implement the (as some glibc folks call) version lockout mechanism. Add this option, because glibc does not want to support --pack-dyn-relocs=relr which does not add `GLIBC_ABI_DT_RELR`. See https://maskray.me/blog/2021-10-31-relative-relocations-and-relr for detail. Close https://github.com/llvm/llvm-project/issues/53775 Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D120701	2022-03-10 19:54:21 -08:00
Jez Ng	fc968bcba4	[lld-macho][nfc] Fix formatting in ld64-vs-lld.rst	2022-03-10 18:33:18 -05:00
Jez Ng	4308f031cd	[lld-macho] Align cstrings less conservatively Previously, we aligned every cstring to 16 bytes as a temporary hack to deal with https://github.com/llvm/llvm-project/issues/50135. However, it was highly wasteful in terms of binary size. To recap, in contrast to ELF, which puts strings that need different alignments into different sections, `clang`'s Mach-O backend puts them all in one section. Strings that need to be aligned have the .p2align directive emitted before them, which simply translates into zero padding in the object file. In other words, we have to infer the alignment of the cstrings from their addresses. We differ slightly from ld64 in how we've chosen to align these cstrings. Both LLD and ld64 preserve the number of trailing zeros in each cstring's address in the input object files. When deduplicating identical cstrings, both linkers pick the cstring whose address has more trailing zeros, and preserve the alignment of that address in the final binary. However, ld64 goes a step further and also preserves the offset of the cstring from the last section-aligned address. I.e. if a cstring is at offset 18 in the input, with a section alignment of 16, then both LLD and ld64 will ensure the final address is 2-byte aligned (since `18 == 16 + 2`). But ld64 will also ensure that the final address is of the form 16 * k + 2 for some k (which implies 2-byte alignment). Note that ld64's heuristic means that a dedup'ed cstring's final address is dependent on the order of the input object files. E.g. if in addition to the cstring at offset 18 above, we have a duplicate one in another file with a `.cstring` section alignment of 2 and an offset of zero, then ld64 will pick the cstring from the object file earlier on the command line (since both have the same number of trailing zeros in their address). So the final cstring may either be at some address `16 * k + 2` or at some address `2 * k`. 
I've opted not to follow this behavior primarily for implementation simplicity, and secondarily to save a few more bytes. It's not clear to me that preserving the section alignment + offset is ever necessary, and there are many cases that are clearly redundant. In particular, if an x86_64 object file contains some strings that are accessed via SIMD instructions, then the .cstring section in the object file will be 16-byte-aligned (since SIMD requires its operand addresses to be 16-byte aligned). However, there will typically also be other cstrings in the same file that aren't used via SIMD and don't need this alignment. They will be emitted at some arbitrary address `A`, but ld64 will treat them as being 16-byte aligned with an offset of `A % 16`. I have verified that the two repros in https://github.com/llvm/llvm-project/issues/50135 work well with the new alignment behavior. Fixes https://github.com/llvm/llvm-project/issues/54036. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D121342	2022-03-10 15:18:15 -05:00
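The two placement strategies described in the cstring alignment commit above can be compared with a small sketch. This is not LLD's or ld64's code: the function names (`inferred_alignment`, `place_lld_style`, `place_ld64_style`) and the `cursor` parameter (the next free output address) are hypothetical, and the sketch assumes power-of-two section alignments and that a cstring's alignment is inferred from the trailing zeros of its input offset, capped at the section alignment.

```python
def inferred_alignment(offset: int, section_align: int) -> int:
    """Alignment in bytes implied by a cstring's input offset.

    Assumes section_align is a power of two. The result is capped at the
    section alignment, which is also what offset 0 yields.
    """
    cap = section_align.bit_length() - 1  # log2 of the section alignment
    if offset == 0:
        return 1 << cap
    trailing_zeros = (offset & -offset).bit_length() - 1
    return 1 << min(trailing_zeros, cap)


def place_lld_style(cursor: int, offset: int, section_align: int) -> int:
    """Round cursor up to the inferred alignment only (hypothetical helper)."""
    align = inferred_alignment(offset, section_align)
    return (cursor + align - 1) & ~(align - 1)


def place_ld64_style(cursor: int, offset: int, section_align: int) -> int:
    """Next address >= cursor of the form section_align * k + (offset % section_align)."""
    remainder = offset % section_align
    return cursor + (remainder - cursor) % section_align


# The example from the commit message: offset 18, section alignment 16.
lld_addr = place_lld_style(5, 18, 16)
ld64_addr = place_ld64_style(5, 18, 16)
```

With a cursor of 5, the first strategy only needs 2-byte alignment, while the second also has to reach an address congruent to 2 modulo 16, which is where the extra padding comes from.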
serge-sans-paille	f06d487dd6	Cleanup includes: WindowsDriver & WindowsManifest Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D121330	2022-03-10 17:19:06 +01:00
Nico Weber	a278250b0f	Revert "Cleanup codegen includes" This reverts commit `7f230feeea`. Breaks CodeGenCUDA/link-device-bitcode.cu in check-clang, and many LLVM tests, see comments on https://reviews.llvm.org/D121169	2022-03-10 07:59:22 -05:00
serge-sans-paille	7f230feeea	Cleanup codegen includes after: 1061034926 before: 1063332844 Differential Revision: https://reviews.llvm.org/D121169	2022-03-10 10:00:30 +01:00
Fangrui Song	72bedf46c7	[ELF] Inline InputSection::getParent. NFC Combined with the previous change, lld executable is ~2K smaller and some code paths using InputSection::getParent are more efficient. The fragmented headers lead to a design limitation that OutputSection has to be incomplete, so we cannot use static_cast.	2022-03-08 11:26:12 -08:00
Fangrui Song	6c814931bc	[ELF] Don't use multiple inheritance for OutputSection. NFC Add an OutputDesc class inheriting from SectionCommand. An OutputDesc wraps an OutputSection. This change allows InputSection::getParent to be inlined. Differential Revision: https://reviews.llvm.org/D120650	2022-03-08 11:23:42 -08:00
Jez Ng	ce2ae38124	[lld-macho] Deduplicate the `__objc_classrefs` section contents ld64 breaks down `__objc_classrefs` on a per-word level and deduplicates them. This greatly reduces the number of bind entries emitted (and therefore the amount of work `dyld` has to do at runtime). For chromium_framework, this change to LLD cuts the number of (non-lazy) binds from 912 to 190, getting us to parity with ld64 in this aspect. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D121053	2022-03-08 08:34:04 -05:00
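As a rough illustration of the per-word deduplication described in the `__objc_classrefs` commit above, the sketch below treats each input section as a list of pointer-sized words, each identified by the class symbol it refers to. The function name `dedup_classrefs` and the input representation are hypothetical and do not reflect LLD's data structures.

```python
def dedup_classrefs(inputs):
    """Assign one output slot per distinct class symbol.

    inputs: one list per object file, holding the class symbol referenced
    by each pointer-sized word of that file's __objc_classrefs section.
    Returns a dict mapping symbol name to output slot index.
    """
    slot_of = {}
    for words in inputs:
        for target in words:
            # Only the first reference to a symbol allocates a slot,
            # so only one bind entry per symbol is needed.
            slot_of.setdefault(target, len(slot_of))
    return slot_of


slots = dedup_classrefs([
    ["_OBJC_CLASS_$_Foo", "_OBJC_CLASS_$_Bar"],
    ["_OBJC_CLASS_$_Bar", "_OBJC_CLASS_$_Baz"],
])
```

Four input words collapse to three output slots here, since both files refer to the same `Bar` class.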
Jez Ng	8ec1033933	[lld-macho] Deduplicate CFStrings during ICF `__cfstring` has embedded addends that foil ICF's hashing / equality checks. (We can ignore embedded addends when doing ICF because the same information gets recorded in our Reloc structs.) Therefore, in order to properly dedup CFStrings, we create a mutable copy of the CFString and zero out the embedded addends before performing any hashing / equality checks. (We did in fact have a partial implementation of CFString deduplication already. However, it only worked when the cstrings they point to are at identical offsets in their object files.) I anticipate this approach can be extended to other similar statically-allocated struct sections in the future. In addition, we previously treated all references with differing addends as unequal. This is not true when the references are to literals: different addends may point to the same literal in the output binary. In particular, `__cfstring` has such references to `__cstring`. I've adjusted ICF's `equalsConstant` logic accordingly, and I've added a few more tests to make sure the addend-comparison code path is adequately covered. Fixes https://github.com/llvm/llvm-project/issues/51281. Reviewed By: #lld-macho, Roger Differential Revision: https://reviews.llvm.org/D120137	2022-03-08 08:34:03 -05:00
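The idea in the CFString commit above of zeroing embedded addends in a mutable copy before hashing can be sketched as follows. The 32-byte record layout, the relocation offsets, and the helper name `normalized` are assumptions made for illustration; a real equality check would also compare the relocation targets, which the commit says are recorded separately in the Reloc structs.

```python
import hashlib
import struct


def normalized(data: bytes, reloc_offsets, width: int = 8) -> bytes:
    """Return a copy of data with the bytes at each relocated field zeroed."""
    buf = bytearray(data)  # mutable copy; the input bytes are left untouched
    for off in reloc_offsets:
        buf[off:off + width] = bytes(width)
    return bytes(buf)


# Hypothetical record: class pointer, flags, string pointer, length.
# The string pointer field holds an embedded addend that differs per file.
record_a = struct.pack("<QQQQ", 0, 0x7C8, 0x10, 5)
record_b = struct.pack("<QQQQ", 0, 0x7C8, 0x48, 5)
relocs = [0, 16]  # assumed offsets of the two relocated pointer fields

hash_a = hashlib.sha256(normalized(record_a, relocs)).hexdigest()
hash_b = hashlib.sha256(normalized(record_b, relocs)).hexdigest()
```

The raw records differ only in the embedded addend, so they hash identically once that field is zeroed.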

15290 Commits