llvm-project

Commit Graph

Author	SHA1	Message	Date
Nico Weber	d5b2921faf	[lld/tests] Stop setting the "asserts" and "debug" features The last use of `REQUIRES: debug` was removed in 2013 in `72c5d3d7c` in favor of `REQUIRES: asserts`. The last use of `REQUIRES: asserts` was removed in 2015 in `251b0e268` when the old COFF linker was removed. lld's test suite currently has no behavior difference with respect to assertions or debug builds (and hasn't had it for 6 years). Let's keep it that way :) Differential Revision: https://reviews.llvm.org/D115941	2022-01-05 13:39:17 -05:00
Fangrui Song	954aaf7c14	[ELF] Demote all lazy symbols. NFC This complements D111365. D111365 did not demote isUsedInRegularObj lazy symbols just to work around a --symbol-ordering-file diagnostic quirk. The quirk was dropped by `00dd2d15a4`, so we can demote all lazy symbols now, not just the isUsedInRegularObj ones.	2022-01-05 10:24:29 -08:00
Nico Weber	085f078307	Revert "Revert D109159 "[amdgpu] Enable selection of `s_cselect_b64`."" This reverts commit `859ebca744`. The change contained many unrelated changes and e.g. restored unit test failures for the old lld port.	2022-01-05 13:10:25 -05:00
David Salinas	859ebca744	Revert D109159 "[amdgpu] Enable selection of `s_cselect_b64`." This reverts commit `640beb38e7`. That commit caused performance degradation in Quicksilver test QS:sGPU and a functional test failure in (rocPRIM rocprim.device_segmented_radix_sort). Reverting until we have a better solution to s_cselect_b64 codegen cleanup Change-Id: Ibf8e397df94001f248fba609f072088a46abae08 Reviewed By: kzhuravl Differential Revision: https://reviews.llvm.org/D115960 Change-Id: Id169459ce4dfffa857d5645a0af50b0063ce1105	2022-01-05 17:57:32 +00:00
Nico Weber	5730d11c2b	[lld-link] Consistently print all /verbose output to stderr lld-link used to consistently print all /verbose output to stdout, and that was an intentional decision: https://reviews.llvm.org/rG4bce7bcc88f3 https://reviews.llvm.org/rGe6e206d4b4814 added message() and log(), and back then `log()` morally was just `if (verbose) message(...)` and message() wrote to stdout. So that change moved most /verbose-induced writes to outs() to log(). Except for the one in printDiscardedMessage(), since the check for `verbose` for that one is in the caller, in Writer::createSections(): if (config->verbose) sc->printDiscardedMessage(); Later, https://reviews.llvm.org/D41033 changed log() to write to stderr. That moved lld-link from writing all its /verbose output to stdout to writing almost all of its /verbose output to stderr -- except for printDiscardedMessage() output. This change moves printDiscardedMessage() to call log() as well, so that all /verbose output once again consistently goes to the same stream. Differential Revision: https://reviews.llvm.org/D116667	2022-01-05 11:52:04 -05:00
Benjamin Kramer	afc14a0d17	Retire llvm::make_reverse_iterator in favor of std::make_reverse_iterator std::make_reverse_iterator is a C++14 feature, gcc has it since GCC 5.1.	2022-01-05 14:07:08 +01:00
Fangrui Song	ddea3bf7d1	[ELF] Remove redundant cast. NFC	2022-01-05 02:07:15 -08:00
Fangrui Song	0940cd18f2	[ELF] --symbol-ordering-file: use getLocalSymbols. NFC	2022-01-05 02:06:31 -08:00
Fangrui Song	00dd2d15a4	[ELF] --symbol-ordering-file: remove weird !lazy condition for "no such symbol" diagnostic The diagnostic is emitted for an unextracted lazy symbol but suppressed for an undefined symbol. Suppressing the diagnostic for unextracted lazy symbol probably makes more sense because (a) an unextracted lazy symbol is quite similar to an undefined symbol and (b) an unextracted lazy symbol is different from "no such symbol".	2022-01-05 02:04:36 -08:00
Fangrui Song	935229f66b	[ELF] Symbol::getVA: assert not called on a lazy symbol The code path is dead after D111365.	2022-01-05 00:46:48 -08:00
Xu Mingjie	b5149f4e66	[LTO] Fix assertion failed when flushing bitcode incrementally for LTO output. In https://reviews.llvm.org/D86905, we introduce an optimization, when lld emits LLVM bitcode, we allow bitcode writer flush data to disk early when buffered data size is above some threshold. But when `--plugin-opt=emit-llvm` and `-o /dev/null` are used, lld will trigger assertion `BytesRead >= 0 && static_cast<size_t>(BytesRead) == BytesFromDisk`. When we write output to /dev/null, BytesRead is zero, but at this program point BytesFromDisk is always non-zero. Reviewed By: stephan.yichao.zhao, MaskRay Differential Revision: https://reviews.llvm.org/D112297	2022-01-04 21:40:23 -08:00
Fangrui Song	292395329c	[lld-link] Remove unneeded lto::InputFile::create after D116434	2022-01-04 19:38:32 -08:00
Luís Ferreira	10e40a4ea3	[lld] Add support for other demanglers other than Itanium LLVM core library supports demangling other mangled symbols other than itanium, such as D and Rust. LLD should use those demanglers in order to output pretty demangled symbols on error messages. Reviewed By: MaskRay, #lld-macho Differential Revision: https://reviews.llvm.org/D116279	2022-01-05 03:25:41 +00:00
Fangrui Song	d496abbe2a	[lld-link] Replace LazyObjFile with lazy ObjFile/BitcodeFile Similar to ELF `3a5fb57393`. * previously when a LazyObjFile was extracted, a new ObjFile/BitcodeFile was created; now the file is reused, just with `lazy` cleared * avoid the confusing transfer of `symbols` from LazyObjFile to the new file * simpler code, smaller executable (5200+ bytes smaller on x86-64) * make eager parsing feasible (for parallel section/symbol table initialization) Reviewed By: aganea, rnk Differential Revision: https://reviews.llvm.org/D116434	2022-01-04 15:11:44 -08:00
Markus Böck	c40049d6d7	[lld][MinGW] Remove `--no-as-needed` from ignored flags In the post commit discussion of https://reviews.llvm.org/D116484 it was concluded that `--no-as-needed` should not be ignored. `--as-needed` stays ignored as it is already the default behaviour on COFF, which cannot be changed.	2022-01-03 23:01:02 +01:00
Kazu Hirata	5e1177302b	[wasm] Use nullptr instead of NULL (NFC) Identified with modernize-use-nullptr.	2022-01-02 10:20:21 -08:00
Markus Böck	1b708b67f6	[lld][MinGW] Ignore `--[no-]as-needed` flags in MinGW driver These flags are specific to ELF, but are still accepted by GNU ld, even if it does not do anything. This patch adds them as ignored options for the sake of compatibility. Part of https://github.com/llvm/llvm-project/issues/52947 Differential Revision: https://reviews.llvm.org/D116484	2022-01-02 12:03:21 +01:00
John Ericson	b3af9fbcc9	Set the path to the shared cmake modules based on the llvm directory It’s still possible to build parts of the main llvm build (lld, clang etc) by symlinking them into llvm/tools. Reviewed By: Ericson2314 Differential Revision: https://reviews.llvm.org/D116472	2022-01-01 17:59:08 +00:00
John Ericson	896537048d	[lld][CMake] Use `GNUInstallDirs` to support custom installation dirs Extracted from D99484. My new plan is to start from the outside and work inward. Reviewed By: stephenneuendorffer Differential Revision: https://reviews.llvm.org/D115568	2021-12-31 18:57:57 +00:00
Fangrui Song	ed67d5a03a	[ELF] Switch cNamedSections to SmallVector. NFC Make it smaller	2021-12-30 16:08:26 -08:00
Fangrui Song	441de75f69	[lld][docs] Update _templates/indexsidebar.html after Bugzilla->GitHub issue migration	2021-12-30 13:34:45 -08:00
Fangrui Song	dabac5feec	[ELF][LTO] Cache symbol table of lazy BitcodeFile Similar to D62188: a BitcodeFile's symbol table may be iterated twice, once in --start-lib (lazy) state, and once in the non-lazy state. This patch makes `parseLazy` save `symbols[i]` so that the non-lazy state does not need to re-insert to the global symbol table. Avoiding a redundant `saver.save` may save memory. `Maximum resident set size (kbytes)` for a large --thinlto-index-only link: * without the patch: 10164000 * with the patch: 10095716 (0.6% decrease) Note: we can remove `saver.save` if `BitcodeCompiler::add` does not transfer the ownership of `f.obj` in `checkError(ltoObj->add(std::move(f.obj), resols));`. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D116390	2021-12-30 12:03:29 -08:00
Fangrui Song	a96fe1bf3b	[ELF][LTO] Call madvise(MADV_DONTNEED) on MemoryBuffer instances @tejohnson noticed that freeing MemoryBuffer instances right before `lto->compile` can save RSS, likely because the memory can be reused by LTO indexing (e.g. ThinLTO import/export lists). For ELFFileBase instances, symbol and section names are backed by MemoryBuffer, so destroying MemoryBuffer would make some infrequent passes (parseSymbolVersion, reportBackrefs) crash and make debugging difficult. For a BitcodeFile, its content is completely unused, but destroying its MemoryBuffer makes the buffer identifier inaccessible and may introduce constraints for future changes. This patch leverages madvise(MADV_DONTNEED) which achieves the major gain without the latent issues. `Maximum resident set size (kbytes): ` for a large --thinlto-index-only link: * current behavior: 10146104KiB * destroy MemoryBuffer instances: 8555240KiB * madvise(MADV_DONTNEED) just bitcodeFiles and lazyBitcodeFiles: 8737372KiB * madvise(MADV_DONTNEED) all MemoryBuffers: 8739796KiB (16% decrease) Depends on D116366 Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D116367	2021-12-30 11:36:58 -08:00
Luís Ferreira	8792cd75d0	Revert "[lld] Add support for other demanglers other than Itanium" This reverts commit `e60d6dfd5a`. clang-ppc64le-rhel buildbot failed (https://lab.llvm.org/buildbot#builders/57/builds/13424): tools/lld/MachO/CMakeFiles/lldMachO.dir/Symbols.cpp.o: In function `lld::demangle(llvm::StringRef, bool)': Symbols.cpp:(.text._ZN3lld8demangleEN4llvm9StringRefEb[_ZN3lld8demangleEN4llvm9StringRefEb]+0x90): undefined reference to `llvm::demangle(std::string const&)'	2021-12-30 18:04:21 +00:00
Luís Ferreira	e60d6dfd5a	[lld] Add support for other demanglers other than Itanium LLVM core library supports demangling other mangled symbols other than itanium, such as D and Rust. LLD should use those demanglers in order to output pretty demangled symbols on error messages. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D116279	2021-12-30 17:52:38 +00:00
Fangrui Song	de92a13fec	[ELF] --gc-sections: Work around SHT_PROGBITS .init_array.N for Rust See https://github.com/rust-lang/rust/issues/92181	2021-12-28 16:40:51 -08:00
Mike Hommey	319181f767	[lld-macho] Fix alignment of TLV data sections References from thread-local variable sections are treated as offsets relative to the start of the thread-local data memory area, which is initialized via copying all the TLV data sections (which are all contiguous). If later data sections require a greater alignment than earlier ones, the offsets of data within those sections won't be guaranteed to be aligned unless we normalize alignments. We therefore use the largest alignment for all TLV data sections. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D116263	2021-12-28 19:03:13 -05:00
Fangrui Song	49f646a9ed	[ELF] Change EhInputSection::pieces to SmallVector The decreased size does not matter that much as one file contributes at most one EhInputSection.	2021-12-27 21:34:38 -08:00
Fangrui Song	cb203f3f92	[ELF] Change InStruct/Partition pointers to unique_ptr and remove associated make<XXX> calls. gnuHash and sysvHash are unchanged, otherwise LinkerScript::discard would destroy the objects which may be referenced by input section descriptions. My x86-64 lld executable is 121+KiB smaller.	2021-12-27 18:15:23 -08:00
Fangrui Song	049cd480a0	[ELF] Use const reference. NFC	2021-12-27 17:05:48 -08:00
Fangrui Song	3c94d5d9d2	[ELF] addOrphanSections: avoid std::function	2021-12-27 15:57:38 -08:00
Fangrui Song	b8a4780032	[ELF] Simplify and optimize SymbolTableSection<ELFT>::writeTo	2021-12-27 15:16:14 -08:00
Fangrui Song	80c14dcc0e	[ELF] Delete stale declaration. NFC	2021-12-27 12:56:38 -08:00
Fangrui Song	e590c9bc73	[ELF] -r: move zero OutputSection::addr code into finalizeAddressDependentContent Ensure addresses are unchanged after finalizeAddressDependentContent.	2021-12-27 12:10:23 -08:00
Fangrui Song	abc388ed3c	[ELF] Move excludeLibs/redirectSymbols/replaceCommonSymbols adjacent Make post-thinlto-index symbol resolution passes closer.	2021-12-27 00:31:55 -08:00
Fangrui Song	66c550f8de	[ELF] Delete unused LazyObjKind	2021-12-27 00:03:53 -08:00
Fangrui Song	b07292f77a	[ELF] Serialize deleteFallThruJmpInsn to fix concurrency issue New deleteFallThruJmpInsn calls `make<JumpInstrMod>` which cannot be called concurrently. Losing parallelism is unfortunate but we can think of a better approach if parallelism here justifies itself.	2021-12-26 23:26:13 -08:00
Fangrui Song	315554e873	[ELF] Unify sizeof(InputSection) limits for _WIN32 and others Windows sizeof(InputSection) seems to match non-Windows now.	2021-12-26 23:02:24 -08:00
Fangrui Song	e90c8c0422	[ELF] Optimize basic block section bytesDropped/jumpInstrMods and make them more space efficient. This decreases sizeof(InputSection) from 176 to 160, and decreases peak memory usage by 0.3% when linking Chrome.	2021-12-26 22:17:30 -08:00
Fangrui Song	64038ef8c3	[ELF] ScriptParser: change std::vector to SmallVector	2021-12-26 20:12:55 -08:00
Fangrui Song	e9262edf0d	[ELF] SymbolTable::symbols: don't filter out PlaceholderKind Placeholders (-y and redirectSymbols removed versioned symbols) are very rare and the check just makes symbol table iteration slower. Most iterations filter out placeholders anyway, so this change just drops the filter behavior. For "Add symbols to symtabs", we need to ensure that redirectSymbols sets isUsedInRegularObj to false when making a symbol placeholder, to avoid an assertion failure in SymbolTableSection<ELFT>::writeTo. My .text is 2KiB smaller. The speed-up linking chrome is 0.x%.	2021-12-26 18:11:45 -08:00
Fangrui Song	7924b3814f	[ELF] Add Symbol::hasVersionSuffix "Process symbol versions" may take 2+% time. "Redirect symbols" may take 0.6% time. This change speeds up the two passes and makes `*sym.getVersionSuffix() == '@'` in the `undefined reference` diagnostic cleaner. Linking chrome (no debug info) and another large program is 1.5% faster. For empty-ver2.s: the behavior now matches GNU ld, though I'd consider the input invalid and the exact behavior does not matter.	2021-12-26 17:25:54 -08:00
Fangrui Song	469144ffa3	[ELF] De-template InputSectionBase::getEnclosingFunction	2021-12-26 15:21:22 -08:00
Fangrui Song	213896bc5a	[ELF] Remove unused InputSection::getOffsetInFile	2021-12-26 15:18:56 -08:00
Fangrui Song	a1c2ee0147	[ELF] LinkerScript/OutputSection: change other std::vector members to SmallVector 11+KiB smaller .text with both libc++ and libstdc++ builds.	2021-12-26 13:53:47 -08:00
Fangrui Song	10316a6f94	[ELF] Change InputSectionDescription members from vector to SmallVector This decreases sizeof(lld::elf::InputSectionDescription) from 264 to 232.	2021-12-26 13:06:54 -08:00
Fangrui Song	bf7f3dd74e	[ELF] Move outSecOff addition from InputSection::writeTo to the caller Simplify the code a bit and improve consistency with SyntheticSection::writeTo.	2021-12-26 12:11:41 -08:00
Fangrui Song	aabe901d57	[ELF] Remove one redundant computeBinding This does resolve the redundancy in includeInDynsym().	2021-12-25 23:59:27 -08:00
Fangrui Song	20b4704da3	[ELF] reportRangeError: mention symbol name for non-STT_SECTION local symbols like non-global symbols	2021-12-25 23:46:47 -08:00
Fangrui Song	2c8ebab32e	[ELF] sortSymTabSymbols: change vector to SmallVector This function may take ~1% time. SmallVector<SymbolTableEntry, 0> is smaller (16 bytes instead of 24) and more efficient.	2021-12-25 23:16:27 -08:00
Fangrui Song	d5e310b154	[ELF][test] Make some TLS tests less sensitive to addresses	2021-12-25 22:05:20 -08:00
Fangrui Song	a00f480fe8	[ELF] scanReloc: remove unused start parameter. NFC This was once used as a workaround for detecting missing PPC64 TLSGD/TLSLD relocations produced by ancient IBM XL C/C++.	2021-12-25 14:34:06 -08:00
Fangrui Song	dd4f5d4ae5	[ELF] De-template handleTlsRelocation. NFC	2021-12-25 14:23:13 -08:00
Fangrui Song	70912420bb	[ELF] Move TLS dynamic relocations to postScanRelocations This temporarily increases sizeof(SymbolUnion), but allows us to move GOT/PLT/etc index members outside Symbol in the future. Then, we can make TLSDESC and TLSGD use different indexes and support mixed TLSDESC and TLSGD (tested by x86-64-tlsdesc-gd-mixed.s). Note: needsTlsGd and needsTlsGdToIe may optionally be combined. Test updates are due to reordered GOT entries.	2021-12-24 22:36:49 -08:00
Fangrui Song	cde37a7e5a	[ELF][test] Add tests for mixed GD-to-IE and IE, mixed TLSDESC and GD Note: mixed TLSDESC and GD currently does not work.	2021-12-24 22:24:15 -08:00
Kazu Hirata	62e48ed10f	Use isa instead of dyn_cast (NFC)	2021-12-24 21:22:27 -08:00
Kazu Hirata	9c0a4227a9	Use Optional::getValueOr (NFC)	2021-12-24 20:57:40 -08:00
Fangrui Song	40fae4d8fc	[ELF] Optimize replaceCommonSymbols This decreases the 0.2% time (no debug info) to nearly zero.	2021-12-24 19:01:51 -08:00
Fangrui Song	745420d3f4	[ELF] Cache global variable `target` in relocate* This avoids repeated loads of the unique_ptr in hot paths.	2021-12-24 17:54:12 -08:00
Fangrui Song	b5a0f0f397	[ELF] Add ELFFileBase::{elfShdrs,numELFShdrs} to avoid duplicate llvm::object::ELFFile::sections() This mainly avoids the `relsOrRelas` cost in `InputSectionBase::relocate`. `llvm::object::ELFFile::sections()` has redundant and expensive checks.	2021-12-24 17:10:38 -08:00
Fangrui Song	5e3403bd22	[ELF] parseLazy: skip local symbols	2021-12-24 13:16:34 -08:00
Fangrui Song	e694180033	[ELF] Optimize --wrap to only check non-local symbols	2021-12-24 12:28:59 -08:00
Fangrui Song	e1b6b5be46	[ELF] Avoid referencing SectionBase::repl after ICF It is fairly easy to forget SectionBase::repl after ICF. Let ICF rewrite a Defined symbol's `section` field to avoid references to SectionBase::repl in subsequent passes. This slightly improves the --icf=none performance due to less indirection (maybe for --icf={safe,all} as well if most symbols are Defined). With this change, there is only one reference to `repl` (--gdb-index D89751). We can undo `f4fb5fd752` (`Move Repl to SectionBase.`) but move `repl` to `InputSection` instead. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D116093	2021-12-24 12:09:48 -08:00
Fangrui Song	0d749e13f7	[ELF] Optimize symbol initialization and resolution Avoid repeated load of global pointer (symtab) / members (sections.size(), firstGlobal) in the hot paths. And remove some unneeded this->	2021-12-23 21:54:32 -08:00
Fangrui Song	1d285f2de0	[ELF] Simplify and optimize ObjFile<ELFT>::parseLazy	2021-12-23 20:23:13 -08:00
Fangrui Song	1abbbc7b24	[ELF] scanVersionScript: remove unused variable	2021-12-23 18:18:25 -08:00
Fangrui Song	a2baf634a1	[ELF] Simplify SymbolTable::insert. NFC	2021-12-23 17:59:25 -08:00
Fangrui Song	417cd2e5c5	[ELF] SymbolTable: change some vector<Symbol *> to SmallVector The generated assembly for Symbol::insert is much shorter (std::vector resize is inefficient) and enables some inlining.	2021-12-23 16:49:38 -08:00
Fangrui Song	464cc4c920	[ELF] Remove stale comment which was duplicated in MarkLive<ELFT>::run Pointed out by thakis	2021-12-23 15:13:46 -08:00
Kristina Bessonova	81378f7e56	Revert "[DwarfDebug] Support emitting function-local declaration for a lexical block" & dependent patches Try to revert D113741 once again. This also reverts `0ac75e82ff` (D114705) as it causes LLDB's lldb-api.lang/cpp/nsimport.TestCppNsImport.py test failure w/o D113741. This reverts commit `f9607d45f3`. Differential Revision: https://reviews.llvm.org/D116225	2021-12-24 00:47:04 +02:00
Fangrui Song	bf45624ba0	[ELF][PPC32] Support .got2 in an output section description I added `PPC32Got2Section` D62464 to support .got2 but did not implement .got2 in another output section. PR52799 has a linker script placing .got2 in .rodata, which causes a null pointer dereference because a MergeSyntheticSection's file is nullptr. Add the support.	2021-12-23 11:32:44 -08:00
Fangrui Song	4374824ccf	[ELF] --gc-sections: combine two iterations over inputSections There is a slight speed-up.	2021-12-23 09:53:08 -08:00
Fangrui Song	33319dde2a	[ELF] LTO: skip expensive usedStartStop initialization if bitcodeFiles.empty() This may cost 1.3+% of total link time.	2021-12-23 01:52:54 -08:00
Fangrui Song	61312fd5aa	[ELF] sortSections: delete unneeded outSecOff assignment Related to D45368 but outSecOff is unneeded because resolveShfLinkOrder uses stable_sort.	2021-12-23 01:24:32 -08:00
Fangrui Song	5d0be553fa	[ELF] Optimize copyLocalSymbols. NFC	2021-12-23 00:59:29 -08:00
Fangrui Song	ad26b0b233	Revert "[ELF] Make Partition/InStruct members unique_ptr and remove associate make<XXX>" This reverts commit `e48b1c8a27`. This reverts commit `d019de23a1`. The changes caused memory leaks (non-final classes cannot use unique_ptr).	2021-12-22 23:55:11 -08:00
Fangrui Song	ba948c5a9c	[ELF] Use SmallVector for some global variables (Files and Sections). NFC My lld executable is 26+KiB smaller.	2021-12-22 22:30:08 -08:00
Fangrui Song	ba6973c89b	[ELF] Change nonnull pointer parameters to references	2021-12-22 22:02:29 -08:00
Fangrui Song	e48b1c8a27	[ELF] Make Partition members unique_ptr and remove associate make<XXX> See D116143 for benefits. My lld executable (x86-64) is 103+KiB smaller.	2021-12-22 21:34:26 -08:00
Fangrui Song	d019de23a1	[ELF] Make InStruct members unique_ptr and remove associate make<XXX> See D116143 for benefits. My lld executable (x86-64) is 24+KiB smaller.	2021-12-22 21:11:26 -08:00
Fangrui Song	5c75cc51b3	[ELF] Change nonnull pointer parameters to references. NFC	2021-12-22 21:09:57 -08:00
Fangrui Song	baa3eb0dd9	[ELF] Change some non-null pointer parameters to references. NFC	2021-12-22 20:51:11 -08:00
Fangrui Song	3a5fb57393	[ELF] Replace LazyObjFile with lazy ObjFile/BitcodeFile The new `lazy` state is the inverse of the previous `LazyObjFile::extracted`. There are many advantages: * previously when a LazyObjFile was extracted, a new ObjFile/BitcodeFile was created; now the file is reused, just with `lazy` cleared * avoid the confusing transfer of `symbols` from LazyObjFile to the new file * the `incompatible file:` diagnostic is unified with `is incompatible with` * simpler code, smaller executable (6200+ bytes smaller on x86-64) * make eager parsing feasible (for parallel section/symbol table initialization)	2021-12-22 17:41:50 -08:00
Fangrui Song	5fc4323eda	[ELF] Change some global pointers to unique_ptr Currently the singleton `config` is assigned by `config = make<Configuration>()` and (if `canExitEarly` is false) destroyed by `lld::freeArena`. `make<Configuration>` allocates a slab with `malloc(4096)`. This both wastes memory and bloats the executable (every type instantiates `BumpPtrAllocator` which costs more than 1KiB code on x86-64). (No need to worry about `clang::no_destroy`. Regular invocations (`canExitEarly` is true) call `_Exit` via llvm::sys::Process::ExitNoCleanup.) Reviewed By: lichray Differential Revision: https://reviews.llvm.org/D116143	2021-12-22 14:36:14 -08:00
Fangrui Song	eb37330ac7	[ELF] Change mipsGotIndex to uint32_t This does not decrease sizeof(InputSection) (important for memory usage) on ELF64 by itself but allows us to add another uint32_t.	2021-12-21 20:19:51 -08:00
Fangrui Song	48161b7490	[ELF] --gc-sections: Work around SHT_PROGBITS .init_array Older Go cmd/link used SHT_PROGBITS for .init_array . Work around the lack of https://golang.org/cl/373734 for a while. It does not generate .fini_array or .preinit_array	2021-12-21 10:44:29 -08:00
Fangrui Song	6683099a0d	[ELF] Optimize RelocationSection<ELFT>::writeTo When linking a 1.2G output (nearly no debug info, 2846621 dynamic relocations) using `--threads=8`, I measured ``` 9.131462 Total ExecuteLinker 1.449913 Total Write output file 1.445784 Total Write sections 0.657152 Write sections {"detail":".rela.dyn"} ``` This change decreases the .rela.dyn time to 0.25, leading to 4% speed up in the total time. * The parallelSort is slow because of expensive r_sym/r_offset computation. Cache the values. * The iteration is slow. Move r_sym/r_addend computation ahead of time and parallelize it. With the change, the new encodeDynamicReloc is cheap (0.05s). So no need to parallelize it. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D115993	2021-12-21 09:43:44 -08:00
Fangrui Song	c2f2bb066b	[ELF] Remove unneeded SectionBase::repl indirection sec->repl equals sec after rL371216.	2021-12-21 00:39:16 -08:00
Esme-Yi	b66328701a	[PowerPC][llvm-objdump] enable --symbolize-operands for PowerPC ELF/XCOFF. Summary: When disassembling, symbolize a branch target operand to print a label instead of a real address. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D114492	2021-12-21 04:17:57 +00:00
Xu Mingjie	cb63ad8d1d	[LTO] Fix incomplete optimization remarks for dead functions when PreOptModuleHook or PostInternalizeModuleHook is defined In `20a895c4be`, we introduce `finalizeOptimizationRemarks()` to make sure we flush the diagnostic remarks file in case the linker doesn't call the global destructors before exiting. In https://reviews.llvm.org/D73597, we add optimization remarks for removed functions for debugging or for detecting dead code. But there is a case, if PreOptModuleHook or PostInternalizeModuleHook is defined (e.g. `--plugin-opt=emit-llvm` is passed to linker), we do not call `finalizeOptimizationRemarks()`, therefore we will get an incomplete optimization remarks file. This patch makes sure we flush the diagnostic remarks file when PreOptModuleHook or PostInternalizeModuleHook is defined. Reviewed By: tejohnson, MaskRay Differential Revision: https://reviews.llvm.org/D115417	2021-12-20 18:16:09 -08:00
Fangrui Song	8825ffdbde	[ELF] --time-trace: Trace "Write sections" writeSections is typically a bottleneck. This was used to track down the following bottlenecks: * Output section .rela.dyn (`9115d75117`) * Output section .debug_str (`3aae04c744`) * posix_fallocate is slow for Linux tmpfs: D115957 Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D115984	2021-12-20 10:51:24 -08:00
Fangrui Song	bee5bc9075	[ELF] #undef PPC to support GCC powerpc32 build GCC's powerpc32 port predefines `PPC` as a macro in GNU C++ mode in some configurations (Linux, FreeBSD, and some others. See `builtin_define_std ("PPC"); ` in gcc/config/rs6000). ``` % powerpc-linux-gnu-g++ -E -dM -xc++ /dev/null -o - | grep -w PPC #define PPC 1 ``` Fixes https://bugs.gentoo.org/829599 Reviewed By: thesamesam Differential Revision: https://reviews.llvm.org/D116017	2021-12-20 10:12:51 -08:00
Fangrui Song	3aae04c744	[ELF] Parallelize MergeNoTailSection::writeTo With this patch, writing .debug_str is significantly faster for a program with 1.5G .debug_str: * .debug_info 1.22s * .debug_str 2.57s decreases to 0.66	2021-12-17 23:30:42 -08:00
Fangrui Song	552d84414d	[ELF] Use SmallVector for many SyntheticSections. NFC This decreases struct sizes and usually decreases the lld executable size (39KiB for my x86-64 executable) (unless in some cases smaller SmallVector leads to more inlining, e.g. StringTableBuilder). For --gdb-index, there may be memory usage saving.	2021-12-17 19:22:16 -08:00
Vy Nguyen	4f90e67e2f	[lld-macho] Handle $ld$hide[$os] symbols. PR/52708 Differential Revision: https://reviews.llvm.org/D115775	2021-12-17 16:40:07 -05:00
Nico Weber	c4b45eeb44	[lld/mac] Don't lose "weak ref" bit when doing LTO Fixes #52778. Probably fixes Chromium crashing on startup on macOS 10.15 (and older) systems when building with LTO, but I haven't verified that yet. Differential Revision: https://reviews.llvm.org/D115949	2021-12-17 15:26:35 -05:00
Nico Weber	a3096ca9b4	[lld/test] List one test dep per line Matches llvm's and clang's /test/CMakeLists.txt, makes it easier to see in diffs which deps get added, and makes it easier to see if a given dependency is present or not. No behavior change.	2021-12-17 09:51:01 -05:00
Fangrui Song	aa27bab5a1	[ELF] InputSection::writeTo: reorder type checks and add LLVM_UNLIKELY	2021-12-16 23:42:50 -08:00
Fangrui Song	054cdb34a2	[ELF] Optimize MergeInputSection::splitNonStrings. NFC	2021-12-16 21:23:00 -08:00
Fangrui Song	4c98d08841	[ELF] Speed up MergeInputSection::split*. NFC	2021-12-16 21:17:02 -08:00
Fangrui Song	bf4fa3036a	[ELF] Use SmallVector for MergeInputSection::pieces. NFC sizeof(pieces) decreases from 24 to 16 on ELF64. One BumpPtrAllocator can store more MergeInputSections. The lld executable becomes smaller.	2021-12-16 21:07:39 -08:00
Fangrui Song	93558e575e	[ELF] Internalize createMergeSynthetic. NFC Only called once. Moving to OutputSections.cpp can make it inlined. finalizeInputSections can be very hot, especially in -O1 links with much debug info.	2021-12-16 20:50:06 -08:00
Daniel Kiss	2b4e6052b3	[lld] Add cet-report and bti-report flags Implement cet-report as supported in binutils. bti-report has the same behaviour for AArch64-BTI. Fixes https://github.com/llvm/llvm-project/issues/44828 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D113901	2021-12-16 16:26:26 +01:00
Fangrui Song	8617996ac1	[ELF] maybeReportUndefined: move sym.isUndefined() check to the caller. NFC Avoid a function call in the majority of cases.	2021-12-16 00:27:19 -08:00
Fangrui Song	101407bfaa	[ELF] parseSymbolVersion: remove unused pos == 0 check	2021-12-15 23:59:55 -08:00
Fangrui Song	60f5614931	[ELF] SharedFile::parse: cache symbols size for a loop. NFC	2021-12-15 22:45:28 -08:00
Fangrui Song	7b265e9791	[ELF] Move -l -L canonical and --library-path --library aliases Everyone uses -l -L instead of the long option counterparts. Make help messages attach to -L -l and (--reproduce) use them for response.txt command line options.	2021-12-15 21:49:53 -08:00
Fangrui Song	159b948e43	[ELF] ObjFile<ELFT>::initializeSymbols: don't call Allocate when firstGlobal==0 Calling `Allocate` with 0 size (when .symtab is absent, e.g. `invalid/mips-invalid-options-descriptor.test`) may return a nullptr, which will crash with -fsanitize=null (the underlying `Allocate` function is LLVM_ATTRIBUTE_RETURNS_NONNULL).	2021-12-15 18:21:48 -08:00
Fangrui Song	b0211de5e3	[ELF] Change Symbol::verdefIndex from uint32_t to uint16_t The SHT_GNU_version index is 16-bit, so the 32-bit value is a waste. Technically non-default version index 0x7fff uses version index 0xffff, but it is impossible in practice. This change decreases sizeof(SymbolUnion) from 80 to 72 on ELF64 platforms. Memory usage decreases by 1% when linking a large executable.	2021-12-15 17:59:30 -08:00
Fangrui Song	50187d2dd5	[ELF] Speed up ObjFile<ELFT>::createInputSection * Group ".note" section name checks * Move shouldMerge check to the caller	2021-12-15 17:15:32 -08:00
Vincent Lee	d17b092fe6	[lld-macho] Make writing map file asynchronous For large applications that write to map files, writing map files can take quite a bit of time. Sorting the biggest contributors to link times, writing map files ranks in at 2nd place, with load input files being the biggest contributor of link times. Avoiding writing map files on the critical path (and having its own thread) saves ~2-3 seconds when linking chromium framework on a 16-Core Intel Xeon W. ``` base diff difference (95% CI) sys_time 1.617 ± 0.034 1.657 ± 0.026 [ +1.5% .. +3.5%] user_time 28.536 ± 0.245 28.609 ± 0.180 [ -0.1% .. +0.7%] wall_time 23.833 ± 0.271 21.684 ± 0.194 [ -9.5% .. -8.5%] samples 31 24 ``` Reviewed By: #lld-macho, oontvoo, int3 Differential Revision: https://reviews.llvm.org/D115416	2021-12-15 16:37:04 -08:00
Fangrui Song	68009b78f2	[ELF] Symbol::replace: remove dead code	2021-12-15 16:08:18 -08:00
Fangrui Song	b5805b7847	[ELF] ObjFile<ELFT>::initializeSymbols: avoid StringRefZ from undefined symbols	2021-12-15 15:30:18 -08:00
Fangrui Song	2bdad16303	[ELF] SymbolTable::insert: keep @@ in the name * Avoid the name truncation quirk in SymbolTable::insert: the truncated name will be replaced by @@ again. * Allow foo and foo@@v1 in different files to be diagnosed as duplicate definition error (GNU ld behavior) * Avoid potential redundant strlen on symbol name due to StringRefZ in ObjFile<ELFT>::initializeSymbols	2021-12-15 15:19:35 -08:00
Fangrui Song	a8d6d2614b	[ELF] Replace make<Defined> with makeDefined. NFC This removes SpecificAlloc<Defined> and makes my lld executable 1.5k smaller. This drops the small memory waste due to the separate BumpPtrAllocator.	2021-12-15 13:15:03 -08:00
Fangrui Song	a596a5fc12	[ELF] ObjFile<ELFT>::initializeSymbols: Simplify this->symbols[i]. NFC	2021-12-15 13:02:38 -08:00
Fangrui Song	509153f1e7	[ELF] ObjFile<ELFT>::initializeSymbols: Batch allocate local symbols and detangle local/global symbol initialization. My x86-64 lld executable is 8k smaller due to the removal of SpecificAlloc<Undefined>.	2021-12-15 12:54:39 -08:00
Fangrui Song	3534d26cc1	[ELF] Slightly speed up -z keep-text-section-prefix	2021-12-15 10:20:11 -08:00
Fangrui Song	7c0881a38f	[ELF] --gc-sections: Change startwith(".jcr") to exact match GNU ld's internal linker script keeps `.jcr`, but not other sections starting with `.jcr`.	2021-12-15 01:27:08 -08:00
Fangrui Song	21dbfd4300	[ELF] --gc-sections: Change startwith(".init") (and ".fini") to exact match GNU ld's internal linker script keeps `.init`, but not other sections starting with `.init`. .fini is similar.	2021-12-15 01:16:26 -08:00
Fangrui Song	7a54ae9c1d	[ELF] Change objectFiles to ELFFileBase * This can sometimes avoid `cast<ObjFile<...>>`. I intentionally do not touch postScanRelocations to wait for its stabilization.	2021-12-15 00:37:10 -08:00
Fangrui Song	3deb82cd07	[ELF] Adjust getOutputSectionName prefix order Sorting the prefixes by decreasing frequency can improve performance. .gcc_except_table is relatively frequent, so move it ahead. .ctors and .dtors mostly disappear and should be the last.	2021-12-15 00:18:58 -08:00
Fangrui Song	5816f1855c	[ELF] Slightly speed up getOutputSectionName. NFC	2021-12-14 23:43:00 -08:00
Fangrui Song	89661a0e89	[ELF] Remove dead code from SymbolTable::find	2021-12-14 22:41:52 -08:00
Fangrui Song	c720b16aa5	[ELF] Use SmallVector for SharedFile and simplify parseVerdefs SHT_GNU_verdef is typically small, so it's unnecessary to reserve the vector. While here, fix a hypothetical issue when SHT_GNU_verdef has non-increasing version indexes, which don't happen with GNU ld, gold, ld.lld's output. My x86-64 lld executable is 256 bytes smaller.	2021-12-14 21:11:45 -08:00
Fangrui Song	1ff1d50d9f	[ELF] Make InputFile smaller sizeof(ObjFile<ELF64LE>) is decreased from 344 to 272 on an ELF64 system. In a large link with 30000 ObjFiles, this may be 2+MiB saving. Change std::vector members to SmallVector, and std::string members to SmallString<0> (these members typically don't benefit from small string optimization). On Linux x86-64 the lld executable is ~6k smaller.	2021-12-14 20:55:32 -08:00
Fangrui Song	cf783be8d7	Reland D114783/D115603 [ELF] Split scanRelocations into scanRelocations/postScanRelocations (Fixed an issue about GOT on a copy relocated alias.) (Fixed an issue about not creating r_addend=0 IRELATIVE for unreferenced non-preemptible ifunc.) The idea is to make scanRelocations mark that some actions are needed (GOT/PLT/etc) and postpone the real work to postScanRelocations. It gives some flexibility: * Make it feasible to support .plt.got (PR32938): we need to know whether GLOB_DAT and JUMP_SLOT are both needed. * Make non-preemptible IFUNC handling slightly cleaner: avoid setting/clearing sym.gotInIgot * -z nocopyrel: report all copy relocation places for one symbol * Make GOT deduplication feasible * Make parallel relocation scanning feasible (if we can avoid all stateful operations and make Symbol attributes atomic), but parallelism may not be the appealing choice Since this patch moves a large chunk of code out of ELFT templates, my x86-64 executable is actually a few hundred bytes smaller. For ppc32-ifunc-nonpreemptible-pic.s: I remove absolute relocation references to non-preemptible ifunc because absolute relocation references are incorrect in -fpie mode. Reviewed By: peter.smith, ikudrin Differential Revision: https://reviews.llvm.org/D114783	2021-12-14 16:28:41 -08:00
Fangrui Song	04cf411c94	[ELF][test] Test unreferenced non-preemptible ifunc Add missing coverage exposed by D114783. There should be no associated IRELATIVE, otherwise (a) glibc ld.so may crash (b) it wastes space (c) unused IPLT causes confusion.	2021-12-14 16:25:50 -08:00
Fangrui Song	ea15b862d7	Revert D114783 [ELF] Split scanRelocations into scanRelocations/postScanRelocations May cause a failure for non-preemptible `bcmp` in a glibc -static link.	2021-12-14 14:33:50 -08:00
Stephan T. Lavavej	8bd106a891	[NFC] Fix typos in release notes. Reviewed By: ldionne, Mordante, MaskRay Differential Revision: https://reviews.llvm.org/D115685	2021-12-14 14:19:42 -08:00
Fangrui Song	6a44013b0e	[ELF] -Map: Print symbols which need canonical PLT entry/copy relocation just once If a copy relocated symbol (say `copy`) is referenced in two .o files, this change removes a duplicated line from the -Map output: ``` 202470 202470 1 1 .bss.rel.ro 202470 202470 1 1 <internal>:(.bss.rel.ro) 202470 202470 1 1 copy removed 202470 202470 1 1 copy ``` Differential Revision: https://reviews.llvm.org/D115697	2021-12-14 10:31:06 -08:00
Fangrui Song	b79686c6dc	[ELF] Remove needsPltAddr in favor of needsCopy needsPltAddr is equivalent to `needsCopy && isFunc`. In many places, it is equivalent to `needsCopy` because the non-STT_FUNC cases are ruled out. Reviewed By: ikudrin, peter.smith Differential Revision: https://reviews.llvm.org/D115603	2021-12-14 09:52:43 -08:00
Fangrui Song	e7a95b0674	Reland [ELF] Split scanRelocations into scanRelocations/postScanRelocations (Fixed an issue about GOT on a copy relocated alias.) The idea is to make scanRelocations mark that some actions are needed (GOT/PLT/etc) and postpone the real work to postScanRelocations. It gives some flexibility: * Make it feasible to support .plt.got (PR32938): we need to know whether GLOB_DAT and JUMP_SLOT are both needed. * Make non-preemptible IFUNC handling slightly cleaner: avoid setting/clearing sym.gotInIgot * -z nocopyrel: report all copy relocation places for one symbol * Make GOT deduplication feasible * Make parallel relocation scanning feasible (if we can avoid all stateful operations and make Symbol attributes atomic), but parallelism may not be the appealing choice Since this patch moves a large chunk of code out of ELFT templates, my x86-64 executable is actually a few hundred bytes smaller. For ppc32-ifunc-nonpreemptible-pic.s: I remove absolute relocation references to non-preemptible ifunc because absolute relocation references are incorrect in -fpie mode. Reviewed By: peter.smith, ikudrin Differential Revision: https://reviews.llvm.org/D114783	2021-12-13 20:11:24 -08:00
Fangrui Song	d1014d9e6d	[ELF] Improve test for copy relocations on aliases	2021-12-13 20:04:24 -08:00
Fangrui Song	0b8b86e30f	Revert "[ELF] Split scanRelocations into scanRelocations/postScanRelocations" This reverts commit `fc33861d48`. `replaceWithDefined` should copy needsGot, otherwise an alias for a copy relocated symbol may not have GOT entry if its needsGot was originally true.	2021-12-13 19:29:53 -08:00
Noah Shutty	fb6b103daa	[lld] Replace Symbolize.h with DIContext.h in lld's COFF lib lld only needs DIContext.h which it gets through Symbolize.h -> SymbolizableModule.h -> DIContext.h. This replaces it with a direct include of DIContext.h to avoid any confusion and pulling in unnecessary headers. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D115659	2021-12-13 22:16:41 +00:00
Fangrui Song	fc33861d48	[ELF] Split scanRelocations into scanRelocations/postScanRelocations The idea is to make scanRelocations mark that some actions are needed (GOT/PLT/etc) and postpone the real work to postScanRelocations. It gives some flexibility: * Make it feasible to support .plt.got (PR32938): we need to know whether GLOB_DAT and JUMP_SLOT are both needed. * Make non-preemptible IFUNC handling slightly cleaner: avoid setting/clearing sym.gotInIgot * -z nocopyrel: report all copy relocation places for one symbol * Make parallel relocation scanning possible (if we can avoid all stateful operations and make Symbol attributes atomic), but parallelism may not be the appealing choice * Make GOT deduplication feasible Since this patch moves a large chunk of code out of ELFT templates, my x86-64 executable is actually a few hundred bytes smaller. For ppc32-ifunc-nonpreemptible-pic.s: I remove absolute relocation references to non-preemptible ifunc because absolute relocation references are incorrect in -fpie mode. Reviewed By: peter.smith, ikudrin Differential Revision: https://reviews.llvm.org/D114783	2021-12-13 09:56:52 -08:00
Fangrui Song	9115d75117	[ELF] Use parallelSort for .rela.dyn An unstable sort suffices. In a large link (11.06s), this decreases .rela.dyn writeTo time from 1.52s to 0.81s, resulting in 6% total time speedup (the benefit will greatly dilute if --pack-dyn-relocs=relr becomes prevailing). Encoding the dynamic relocations then sorting raw Elf_Rel/Elf_Rela doesn't seem to improve much (doing that would require code duplication because of Elf_Rel/Elf_Rela plus unfortunate mips64le), so don't do that.	2021-12-12 20:53:06 -08:00
Fangrui Song	1eaa9b4374	[ELF] initializeSections: move SHT_LLVM_CALL_GRAPH_PROFILE check into SHF_EXCLUDE && !relocatable. NFC Avoid a comparison in the majority of cases.	2021-12-12 20:05:21 -08:00
Fangrui Song	d29766bb48	[ELF] relocateAlloc: remove variables type and expr. NFC	2021-12-12 19:31:30 -08:00
Fangrui Song	4cfff19b88	[ELF] Move adjustSplitStackFunctionPrologues's splitStack check to the caller. NFC Avoid a function call in the majority of cases and make the output smaller.	2021-12-12 19:26:03 -08:00
Fangrui Song	a8024dfc06	[ELF] Avoid mutable addend parameter. NFC	2021-12-12 19:12:01 -08:00
Fangrui Song	af520fba2e	[ELF][test] Remove unused/incorrect .got check line	2021-12-12 10:51:05 -08:00
Jez Ng	098430cd25	[lld-macho][nfc] Simplify LC_DATA_IN_CODE generation 1. After D113241, we have the section address easily accessible and no longer need to iterate across the LC_SEGMENT commands to emit LC_DATA_IN_CODE. 2. There's no need to store a pointer to the data in code entries during the parse step; we can just look it up as part of the output step. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D115556	2021-12-11 01:01:57 -05:00
Jez Ng	40bcbe48e8	[lld-macho][nfc] InputSections don't need to track their total # of callsites ... only whether they have more than zero. This simplifies the code slightly. I've also moved the field into the ConcatInputSection subclass since it doesn't actually get used by the other InputSections. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D115539	2021-12-11 01:01:57 -05:00
Jez Ng	8a1f2d6580	[lld-macho] Include archive name in bitcode files Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D115281	2021-12-07 19:11:23 -05:00
Igor Kudrin	ce25eb12dd	[ELF] Do not report undefined weak references in shared libraries This fixes an issue introduced in D101996. A weak reference in a shared library could be incorrectly reported if there is another library that has a strong reference to the same symbol. Differential Revision: https://reviews.llvm.org/D115041	2021-12-07 10:10:51 +07:00
Chris Davis	e4eb6216c2	Enable pdbpagesize to allow support for PDB file sizes > 4GB Enable the pdbpagesize flag to allow linking of PDB files > 4GB. Also includes a couple small fixes to change to uint64_t to support the larger file sizes. I updated the max file size check in MSFBuilder.cpp to take into account the page size. Differential Revision: https://reviews.llvm.org/D115051	2021-12-06 18:22:08 -05:00
Jez Ng	1b44364714	[lld-macho] Unreferenced weak dylib symbols shouldn't fetch archive symbols We were fetching archive symbols too eagerly, bloating binary size as well as just screwing up binaries that expected to look up certain symbols only at runtime. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D115092	2021-12-05 15:11:44 -05:00
Kristina Bessonova	0ac75e82ff	Reland [DwarfDebug] Move emission of global vars, types and imports to endModule() This patch proposes to move emission of global variables, types, imported entities, etc from DwarfDebug::beginModule() to DwarfDebug::endModule(). Effectively, this changes nothing but the order of debug entities which will be as follows: * subprograms (including related context, local variables/labels, local imported entities; related types can be created as a part of the emission of local entities of an abstract subprogram); * global variables (including related context and types); * retained types and enums; * non-local-scoped imported entities; * basic types; * other types left (as a part of local variables attributes emission). Note that the order of emitted compile units may also be changed as now we emit units that contain subprograms first and then all other non-empty units. The motivation behind this change is the following: (1) DwarfDebug::beginModule() is run at the very beginning of backend's pipeline, from this time IR can be significantly changed by target-specific passes. If it happens for debug metadata of global entities, those changes will not be reflected in the emitted DWARF. (2) imported subprogram names should refer to an abstract subprogram if it exists, but it isn't known in DwarfDebug::beginModule() (it's possible to make some guesses based on location info, but it's not quite reliable); (3) aforementioned entities if they are scoped within a bracketed block (subject of D113741) couldn't be emitted in DwarfDebug::beginModule() (they need parent emitted first). Another problem is if to try to gather some information about local entities and defer their emission (till subprogram's processing or DwarfDebug::endModule()) all the gathered details might be irrelevant / invalid by the time the entities are being emitted (because of (1)). Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D114705	2021-12-05 13:56:45 +02:00
Leonard Grey	134275d994	[Support] Use final filename for Caching buffer identifier Mach-O LLD uses the buffer identifier of the memory buffer backing an object file to generate stabs which are used by `dsymutil` to find the object file for dSYM generation. When using thinLTO, these buffers are provided by the cache which initially saves them to disk as temporary files beginning with "Thin-" but renames them to persistent files beginning with "llvmcache-" before the buffer is provided to the cache user. However, the buffer is created before the file is renamed and is given the temp file's name as an identifier. This causes the generated stabs to point to nonexistent files. This change names the buffer with the eventual persistent filename. I think this is safe because failing to rename the temp file is a fatal error. Differential Revision: https://reviews.llvm.org/D115055	2021-12-04 22:25:49 -05:00
Kristina Bessonova	a961604819	Revert "[DwarfDebug] Support emitting function-local declaration for a lexical block" This reverts commits * `ee691970a9` (D113741), * `79d3132998` (D114705) due to lldb and dexter test failures.	2021-12-04 18:06:57 +02:00
Kristina Bessonova	79d3132998	[DwarfDebug] Move emission of global vars, types and imports to endModule() This patch proposes to move emission of global variables, types, imported entities, etc from DwarfDebug::beginModule() to DwarfDebug::endModule(). Effectively, this changes nothing but the order of debug entities which will be as follows: * subprograms (including related context, local variables/labels, local imported entities; related types can be created as a part of the emission of local entities of an abstract subprogram); * global variables (including related context and types); * retained types and enums; * non-local-scoped imported entities; * basic types; * other types left (as a part of local variables attributes emission). Note that the order of emitted compile units may also be changed as now we emit units that contain subprograms first and then all other non-empty units. The motivation behind this change is the following: (1) DwarfDebug::beginModule() is run at the very beginning of backend's pipeline, from this time IR can be significantly changed by target-specific passes. If it happens for debug metadata of global entities, those changes will not be reflected in the emitted DWARF. (2) imported subprogram names should refer to an abstract subprogram if it exists, but it isn't known in DwarfDebug::beginModule() (it's possible to make some guesses based on location info, but it's not quite reliable); (3) aforementioned entities if they are scoped within a bracketed block (subject of D113741) couldn't be emitted in DwarfDebug::beginModule() (they need parent emitted first). Another problem is if to try to gather some information about local entities and defer their emission (till subprogram's processing or DwarfDebug::endModule()) all the gathered details might be irrelevant / invalid by the time the entities are being emitted (because of (1)). Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D114705	2021-12-04 14:10:01 +02:00
Fangrui Song	9bd6f6f6d5	[ELF][test] Fix typo in aarch64-cortex-a53-843419-recognize.s	2021-12-03 14:38:56 -08:00
George Koehler	885fb9a257	[ELF][PPC32] Make R_PPC32_PLTREL retain .got PLT usage needs the first 12 bytes of the .got section. We need to keep .got and DT_GOT_PPC even if .got/_GLOBAL_OFFSET_TABLE_ are not referenced (large PIC code may only reference .got2), which is the case in OpenBSD's ld.so, leading to a misleading error, "unsupported insecure BSS PLT object". Fix this by adding R_PPC32_PLTREL to the list of hasGotOffRel. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D114982	2021-12-02 15:28:37 -08:00
Fangrui Song	353fe72ca3	[ELF] Hint -z nostart-stop-gc for __start_ undefined references Make users aware of what to do with ld.lld 13.0.0 / GNU ld<2015-10 --gc-sections behavior. Differential Revision: https://reviews.llvm.org/D114830	2021-12-02 11:58:25 -08:00
Keith Smiley	9e3552523e	[lld-macho] Remove old macho darwin lld During the llvm round table it was generally agreed that the newer macho lld implementation is feature complete enough to replace the old implementation entirely. This will reduce confusion for new users who aren't aware of the history. Differential Revision: https://reviews.llvm.org/D114842	2021-12-02 11:04:49 -08:00
Reid Kleckner	8270ff86a1	[ELF] Fix driver.test after `8c3641d0` when cwd is readonly	2021-12-02 10:25:04 -08:00
Sam Clegg	6f5c5cbe5f	[lld][WebAssembly] Fix for debug relocations against undefined function symbols This is very similar to https://reviews.llvm.org/D103557 but applies to symbols which are undefined at link time rather than compile time. We already have code that handles symbols which were defined at link time but dead stripped by `--gc-sections` (See `test/wasm/debug-removed-fn.ll`). In that case the symbols are not live (!isLive()). However, we can also have live symbols (which are referenced by the program) but which are undefined at link time and are imported by the linker. In the test case here the symbol `undef` is used and is not defined in the program, but is imported by the linker due to the `--import-undefined` flag. Fixes: https://github.com/emscripten-core/emscripten/issues/15528 Differential Revision: https://reviews.llvm.org/D114921	2021-12-02 08:36:28 -08:00
Fangrui Song	c5bfffed48	[ELF] Discard input .note.gnu.build-id even with default --build-id=none binutils 2.38 will adopt this behavior https://sourceware.org/bugzilla/show_bug.cgi?id=28639 Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D114910	2021-12-02 09:50:59 +00:00
Igor Kudrin	b0ac68ccb7	[ELF] Prevent internalizing used comdat symbol When a comdat symbol is defined in both bitcode and regular object files, which are contained in the same archive, the linker could lose the flag that the symbol is used in the regular object file and allow LTO to internalize it, which led to "error: undefined symbol". The issue was introduced in D79300. Differential Revision: https://reviews.llvm.org/D114801	2021-12-02 12:10:06 +07:00
Fangrui Song	ad45df91ad	[ELF][PPC64] Remove unneeded PPC64PCRelLongBranchThunk This reverts the PPC64PCRelLongBranchThunk part from D86706. PPC64PCRelLongBranchThunk is the same as PPC64R12SetupStub. Use `__gep_setup_` instead of `__long_branch_pcrel_` for the stub symbol name as it more closely indicates the operation. (Note: GNU ld uses `.long_branch.` and `.plt_branch.`). Reviewed By: NeHuang, nemanjai Differential Revision: https://reviews.llvm.org/D114656	2021-11-30 11:33:17 -08:00
Fangrui Song	8c3641d03e	[ELF] Change -z unknown from error to warning There is a trend of having more optional options (usually security hardening related) like -z cet-report=, -z bti-report=, -z force-bti. If ld.lld 14.0.0 uses a warning, in 15/16/17/... timeframe when people add new options to software, they can worry less about linker errors on ld.lld 14.0.0. In some cases `-z foo` does essential work where a silent ignore can be problematic, but the user has received a warning. From my observation, the doing-essential-work `-z foo` is much fewer than the converse. In addition, the user who cares can use `--fatal-warnings` (Note: GNU ld doesn't upgrade warnings to errors). It is unclear whether we need something like `clang -Wunknown-warning-option`. If we ever run into unfortunate transition like `-z start-stop-gc`, the affected software (e.g. ldc is a compiler which passes linker options to the underlying ld) can blindly add the `-z` option, without worrying it may cause a linker error to LLD 14.0.0. Reviewed By: jrtc27, peter.smith Differential Revision: https://reviews.llvm.org/D114748	2021-11-30 11:06:28 -08:00
Vy Nguyen	74cbd71072	[lld-macho] Mark dylib symbols coming from -weak_framework as weak-ref. PR:52564 Differential Revision: https://reviews.llvm.org/D114397	2021-11-30 09:54:59 -05:00
Fangrui Song	5188f55d32	[ELF] Move ObjFile<ELFT>::{getLocalSymbols,getGlobalSymbols} to non-template ELFFileBase. NFC	2021-11-30 00:50:19 -08:00
Fangrui Song	5047e3a3ba	[ELF] Move GOT/PLT relocation code closer. NFC	2021-11-29 23:10:04 -08:00
Fangrui Song	1ce51a5f35	[ELF] --cref: If -Map is specified, print to the map file PR48282: This behavior matches GNU ld and gold. Reviewed By: markj Differential Revision: https://reviews.llvm.org/D114663	2021-11-29 14:14:53 -08:00
Fangrui Song	4709bacf18	[ELF] Avoid std::stable_partition which may allocate memory. NFC	2021-11-28 21:47:56 -08:00
Fangrui Song	99a2d940dd	[ELF] Speed up/simplify removeUnusedSyntheticSections. NFC Make one change: when the OutputSection is nullptr (due to /DISCARD/ or garbage collected BssSection (replaceCommonSymbols)), discard the SyntheticSection as well.	2021-11-28 21:07:34 -08:00
Fangrui Song	286c11165e	[ELF] Decrease InputSectionBase::entsize to uint32_t While here, change the sh_addralign argument to uint32_t (InputSection ctor's argument and the member are uint32_t); add constexpr.	2021-11-28 19:50:33 -08:00
Fangrui Song	e652f3f04a	[ELF] Simplify some ctx->outSec with sec. NFC	2021-11-28 19:08:27 -08:00
Fangrui Song	89c0f4553e	[ELF] Simplify/remove LinkerScript::switchTo. NFC	2021-11-28 19:05:15 -08:00
Fangrui Song	11291326cd	[ELF] Support --oformat= beside Separate --oformat Both GNU ld's manpage and ours use --oformat= as the canonical form. It's odd that we do not support it...	2021-11-28 18:44:23 -08:00
Fangrui Song	b5f1fa3e5c	[ELF][test] --oformat binary: Check that SIZEOF_HEADERS==0	2021-11-28 18:34:36 -08:00
Fangrui Song	1164c4b375	[ELF] Simplify/remove LinkerScript::output and advance. NFC	2021-11-28 16:58:06 -08:00
Fangrui Song	e80a0b353c	[ELF] Remove unneeded getOutputSectionVA. NFC I attempted to remove it 1 or 2 years ago but kept it just to have a good diagnostic in case the output section is nullptr (should be impossible). It has been long enough that we haven't seen such a case.	2021-11-28 16:17:10 -08:00
Fangrui Song	85e50c1080	[ELF] Inline InputSection::getOffset into callers and remove it. NFC This is an unneeded abstraction which may cause confusion: SectionBase::getOffset has the same name but hard codes -1 as the size of OutputSection.	2021-11-28 16:09:04 -08:00
Fangrui Song	7ea662e2dd	[ELF] Replace one make_unique from r316378 with a stack object. NFC	2021-11-28 15:32:29 -08:00
Fangrui Song	25c7ec4fc6	[ELF] Simplify OutputSection::sectionIndex assignment. NFC And improve comments.	2021-11-28 14:56:29 -08:00
Fangrui Song	d060cc1f98	[ELF] Fix out-of-bounds write in memset(&Out::first, ...) Fix r285764: there is no guarantee that Out::first is placed before other static data members of `struct Out`. After `bufferStart` was introduced, this out-of-bounds write is destined to happen with many compilers. It is likely benign, though. And move `Out::elfHeader->size` assignment beside `Out::elfHeader->sectionIndex`	2021-11-28 14:47:57 -08:00
Fangrui Song	cecc6893a0	[ELF] Simplify assignFileOffsets There is a difference with non-SHF_ALLOC SHT_NOBITS when off%sh_addralign!=0 which doesn't happen/matter in practice.	2021-11-28 13:44:42 -08:00
Fangrui Song	f9a4d9aa03	[ELF] -z separate-*: Use max-page-size instead of common-page-size for text/non-SHF_ALLOC transition and writeTrapInstr For -z separate-code and -z separate-loadable-segments: When RW is present, the RX to RW transition is aligned with max-page-size. When RW is absent, the RX to non-SHF_ALLOC transition should use max-page-size as well.	2021-11-28 12:47:50 -08:00
Fangrui Song	6c1c2313d1	[ELF] Simplify assignFileOffsets. NFC	2021-11-28 11:43:59 -08:00
Ard Biesheuvel	da66263b6e	[ARM] implement support for ALU/LDR PC-relative group relocations Currently, LLD does not support the complete set of ARM group relocations. Given that I intend to start using these in the Linux kernel [0], let's add support for these. This implements the group processing as documented in the ELF psABI. Notably, this means support is dropped for very far symbol references that also carry a small component, where the immediate is rotated in such a way that only part of it wraps to the other end of the 32-bit word. To me, it seems unlikely that this is something anyone could be relying on, but of course I could be wrong. [0] https://lore.kernel.org/r/20211122092816.2865873-8-ardb@kernel.org/ Reviewed By: peter.smith, MaskRay Differential Revision: https://reviews.llvm.org/D114172	2021-11-27 10:26:37 +01:00
Fangrui Song	6fa8f7beb1	[ELF][test] Test that .o definition does not inherit .so STV_PROTECTED Test %t2.so %t.o beside %t.o %t2.so	2021-11-26 15:00:10 -08:00
Fangrui Song	f1ba48d508	[ELF] Simplify Symbol::extract. NFC	2021-11-26 14:10:55 -08:00
Fangrui Song	3b4dd68de5	[ELF][PPC64] Make --power10-stubs/--no-power10-stubs proper aliases for --power10-stubs={auto,no} This allows --power10-stubs= and --[no-]power10-stubs to override each other (they are position dependent in GNU ld). Also improve --help messages and the manpage. Note: GNU ld's default "auto" mode uses heuristics to decide whether Power10 instructions are used. Arguably it is a design mistake of R_PPC64_REL24_NOTOC (acked by the relevant folks on a libc-alpha discussion). We don't implement "auto", so the default --power10-stubs is the same as "yes".	2021-11-26 11:51:45 -08:00
Fangrui Song	09401dfcf1	[ELF] Rename fetch to extract The canonical term is "extract" (GNU ld documentation, Solaris's `-z *extract` options). Avoid inventing a term and match --why-extract. (ld64 prefers "load" but the word is overloaded too much) Mostly NFC, except for --help messages and the header row in --print-archive-stats output.	2021-11-26 10:58:50 -08:00
Fangrui Song	7051aeef7a	[ELF] Rename BaseCommand to SectionCommand. NFC BaseCommand was picked when PHDRS/INSERT/etc were not implemented. Rename it to SectionCommand to match `sectionCommands` and make it clear that the commands are used in SECTIONS (except a special case for SymbolAssignment). Also, improve naming of some BaseCommand variables (base -> cmd).	2021-11-25 20:24:23 -08:00
Fangrui Song	e40e17fcaf	[ELF] Make ExprValue smaller. NFC	2021-11-25 16:55:06 -08:00
Fangrui Song	6188fd4957	[ELF] Rename OutputSection::sectionCommands to commands. NFC This partially reverts r315409: the description applies to LinkerScript, but not to OutputSection. The name "sectionCommands" is used in both LinkerScript::sectionCommands and OutputSection::sectionCommands, which may lead to confusion. "commands" in OutputSection has no ambiguity because there are no other types of commands.	2021-11-25 16:47:07 -08:00
Fangrui Song	ff0d9e6cfa	[ELF] Remove redundant part.dynSymTab creation. NFC	2021-11-25 14:42:22 -08:00
Fangrui Song	5ca54c6686	[ELF] Simplify GnuHashSection::write. NFC	2021-11-25 14:23:25 -08:00
Fangrui Song	55c14d6dbf	[ELF] Simplify DynamicSection content computation. NFC The new code computes the content twice, but avoids the tricky std::function<uint64_t()>. Removed 13KiB code in a Release build.	2021-11-25 14:12:34 -08:00
Fangrui Song	6ca8fde226	[ELF] Emit DF_STATIC_TLS only for -shared This matches GNU ld and saves 2 words for executables.	2021-11-24 23:17:13 -08:00
Fangrui Song	5922dd91f8	[ELF] Rename hasStaticTlsModel to hasTlsIe and remove unneeded atomic.	2021-11-24 21:06:04 -08:00
Fangrui Song	371290dfd4	[ELF] Remove unneeded DF_STATIC_TLS for EM_386 local-exec TLS which is also untested.	2021-11-24 20:43:58 -08:00
Igor Kudrin	8cdf1c1edb	[ELF] Support the "read-only" memory region attribute The attribute 'r' allows (or disallows for the negative case) read-only sections, i.e. ones without the SHF_WRITE flag, to be assigned to the memory region. Before the patch, lld could put a section in the wrong region or fail with "error: no memory region specified for section". Differential Revision: https://reviews.llvm.org/D113771	2021-11-24 12:17:09 +07:00
Fangrui Song	38ed1db7e8	[ELF] Support non-RAX/non-adjacent R_X86_64_GOTPC32_TLSDESC/R_X86_64_TLSDESC_CALL The current TLSDESC optimization code assumes: ``` leaq x@tlsdesc(%rip), %rax call x@tlscall(%rax) # adjacent ``` From https://gitlab.freedesktop.org/mesa/mesa/-/issues/5665 , it seems that the two instructions may not be adjacent in GCC 10's output: ``` leaq x@tlsdesc(%rip), %rax something else call x@tlscall(%rax) ``` This patch supports the case. While here, support non-RAX registers for R_X86_64_GOTPC32_TLSDESC, in case the compiler generates inefficient: ``` leaq x@tlsdesc(%rip), %rcx # or %rdx, %rbx, %rdi, ... movq %rcx, %rax call *x@tlscall(%rax) # GNU ld/gold error for non-RAX ``` Differential Revision: https://reviews.llvm.org/D114416	2021-11-23 10:30:11 -08:00
Martin Storsjö	d703b92296	[LLD] [COFF] Omit section symbols and IMAGE_SYM_CLASS_LABEL from the PE symbol table The section symbols aren't of much practical use when looking at a linked image. This shrinks one observed mingw style unstripped binary by 14%. IMAGE_SYM_CLASS_LABEL is in spirit the same as a temporary assembler label that isn't emitted on the object file level at all. Differential Revision: https://reviews.llvm.org/D113866	2021-11-23 10:17:04 +02:00
Martin Storsjö	7c15da6761	[LLD] [COFF] Interpret the immediate in ARM64 adr/adrp relocations as signed 21 bit This matches how MS link.exe interprets this relocation. Differential Revision: https://reviews.llvm.org/D114347	2021-11-23 10:13:01 +02:00
Shoaib Meenai	2f5d6a0ea5	[MachO] Fix struct size assertion std::vector can have different sizes depending on the STL's debug level, so account for its size separately. (You could argue that we should be accounting for all the other members separately as well, but that would be very unergonomic, and std::vector is the only one that's caused problems so far.)	2021-11-22 15:02:30 -08:00
Fangrui Song	7aafe467d2	[ELF] Simplify a condition with config->copyRelocs. NFC	2021-11-22 13:59:23 -08:00
Vy Nguyen	944071eca2	[lld-macho] Don't replace local personality symbol with LazySymbol Follow-up to D107533, where we replaced local syms with non-local. It doesn't make sense to replace a local symbol with a lazy one. Differential Revision: https://reviews.llvm.org/D110040	2021-11-22 14:09:54 -05:00
Igor Kudrin	a05b694b1e	[ELF][NFC] Do not pass region name to expandMemoryRegion() The name can be easily got on-site. Differential Revision: https://reviews.llvm.org/D114228	2021-11-22 14:19:07 +07:00
Fangrui Song	648157b05a	[ELF] Move getOutputSectionName from Writer.cpp to LinkerScript.cpp. NFC and internalize it.	2021-11-20 22:18:09 -08:00
Fangrui Song	2997441b85	[ELF] Support discarding .got.plt Fix a null pointer dereference when .got.plt is discarded. This also adds a test for discarding `.plt`. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D114180	2021-11-19 10:50:53 -08:00
Nico Weber	bc20bcb39e	[lld/mac] Crash even less on undefined symbols with --icf=all Follow-up to https://reviews.llvm.org/D112643. Even after that change, we were still asserting if two separate functions that are eligible for ICF (same size, same data, same number of relocs, same reloc types, ...) referred to Undefineds. This fixes that oversight. Differential Revision: https://reviews.llvm.org/D114195	2021-11-19 09:23:19 -05:00
Andrew Ng	47eb3f155f	[ELF] Ensure output section is not discarded in addStartEndSymbols() Fixes https://bugs.llvm.org/show_bug.cgi?id=52534. Differential Revision: https://reviews.llvm.org/D114179	2021-11-19 11:45:58 +00:00
Konstantin Schwarz	8c18719bae	[ELF] Expand LMA region if output section alignment introduces padding When aligning the start address of an output section introduces a gap between the current dot pointer and the new aligned address, we were already properly expanding the memory region, if available. D74286 introduced a new behavior to also align the LMA address if an LMA region is specified. However, this did not expand the corresponding LMA region. Now, we also expand the LMA region if it is set. This fixes PR52510. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D114166	2021-11-19 11:27:21 +01:00
Vincent Lee	adfbb5411b	[lld-macho] Add warn flags to enable/disable warnings on -install_name ld64 doesn't warn on builds using `-install_name` if it's a bundle. But, the current warning is nice to have because `install_name` only works with dylib. To prevent an overflow of warnings in build logs and have parity with ld64, create a `--warn-dylib-install-name` and `--warn-no-dylib-install-name` flag that enables this LLD specific warning. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D113534	2021-11-17 16:18:14 -08:00
Greg McGary	9cc489a4b2	[lld-macho][nfc] Factor-out NFC changes from main __eh_frame diff In order to keep signal:noise high for the `__eh_frame` diff, I have teased-out the NFC changes and put them here. Differential Revision: https://reviews.llvm.org/D114017	2021-11-17 15:16:44 -07:00
Shoaib Meenai	01510ac084	[MachO] Move type size asserts to source files. NFC As discussed in https://reviews.llvm.org/D113809#3128636. It's a bit unfortunate to move the asserts away from the structs whose sizes they're checking, but it's a far better developer experience when one of the asserts is violated, because you get a single error instead of every single source file including the header erroring out.	2021-11-16 17:14:16 -08:00
Vy Nguyen	34d15eaced	[lld-macho][nfc] Sanity check on template type Differential Revision: https://reviews.llvm.org/D114044	2021-11-16 20:04:49 -05:00
Shoaib Meenai	93bf271f27	[MachO] Shrink reloc from 32 bytes to 24 bytes The `r_address` field of `relocation_info` is only 4 bytes, so our offset field (which is the `r_address` field adjusted for subsection splitting) also only needs to be 4 bytes. This reduces the structure size from 32 bytes to 24 bytes. Combined with https://reviews.llvm.org/D113813, this is a minor perf improvement for linking an internal app, tested on two machines: ``` smol-relocs baseline difference (95% CI) sys_time 7.367 ± 0.138 7.543 ± 0.157 [ +0.9% .. +3.8%] user_time 21.843 ± 0.351 21.861 ± 0.450 [ -1.3% .. +1.4%] wall_time 20.301 ± 0.307 20.556 ± 0.324 [ +0.1% .. +2.4%] samples 16 16 smol-relocs baseline difference (95% CI) sys_time 2.923 ± 0.050 2.992 ± 0.018 [ +1.4% .. +3.4%] user_time 10.345 ± 0.039 10.448 ± 0.023 [ +0.8% .. +1.2%] wall_time 12.068 ± 0.071 12.229 ± 0.021 [ +1.0% .. +1.7%] samples 15 12 ``` More importantly though, this change by itself reduces our maximum resident set size by 220 MB (2.75%, from 7.85 GB to 7.64 GB) on the first machine. On the second machine, it reduces it by 125 MB (1.94%, from 6.31 GB to 6.19 GB). Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113818	2021-11-16 16:30:34 -08:00
Shoaib Meenai	3195297897	[MachO] Reduce size of Symbol and Defined We can lay out Symbol more optimally to reduce its size from 56 bytes to 48 bytes by eliminating unnecessary padding, and we can lay out Defined such that its bitfield members are placed in the tail padding of Symbol (on ABIs which support this), to reduce it from 96 bytes to 80 bytes (8 bytes from the Symbol reduction, and 8 bytes from the tail padding reuse). This is perf-neutral for an internal app (results from two different machines): ``` smol-syms baseline difference (95% CI) sys_time 7.430 ± 0.202 7.440 ± 0.193 [ -2.6% .. +2.9%] user_time 21.443 ± 0.513 21.206 ± 0.396 [ -3.3% .. +1.1%] wall_time 20.453 ± 0.534 20.222 ± 0.488 [ -3.7% .. +1.5%] samples 9 8 smol-syms baseline difference (95% CI) sys_time 3.011 ± 0.050 3.040 ± 0.052 [ -0.4% .. +2.3%] user_time 10.416 ± 0.075 10.496 ± 0.091 [ +0.1% .. +1.4%] wall_time 12.229 ± 0.144 12.354 ± 0.192 [ -0.1% .. +2.1%] samples 14 13 ``` However, on the first machine, it reduces maximum resident set size by 65.9 MB (0.8%, from 7.92 GB to 7.85 GB). On the second machine, it reduces it by 92 MB (1.4%, from 6.40 GB to 6.31 GB). Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113813	2021-11-16 16:30:33 -08:00
Shoaib Meenai	637a3396b3	[MachO] Fix struct size assertion It was checking for 64-bit builds incorrectly. Unfortunately, ConcatInputSection has grown a bit in the meantime, and I don't see any obvious way to shrink it. Perhaps icfEqClass could use 32-bit hashes instead of 64-bit ones, but xxHash64 is supposed to be much faster than xxHash32 (https://github.com/Cyan4973/xxHash#benchmarks), so that sounds like a loss. (Unrelatedly, we should really look at using XXH3 instead of xxHash64 now.) Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113809	2021-11-16 16:30:31 -08:00
Greg McGary	3a1b3c9afe	[lld-macho][nfc] rename parsed-section types & variables This is an NFC diff that prepares for pruning & relocating `__eh_frame`. Along the way, I made the following changes to ... * clarify usage of `section` vs. `subsection` * remove `map` & `vec` from type names * disambiguate class `Section` from template parameter `SectionHeader`. Differential Revision: https://reviews.llvm.org/D113241	2021-11-16 07:06:41 -07:00
Quinn Pham	1ca00ecfb8	[NFC][lld] Inclusive language: change master file to merged file [NFC] As part of using inclusive language within the llvm project, this patch replaces master with merged in these comments. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D113903	2021-11-15 14:32:09 -06:00
Igor Kudrin	66691de94c	[ELF] Do not try to assign a memory region to a non-allocatable section Non-allocatable sections are not part of the memory image of the program, so there is no need to find memory regions for them either matching properties or handling explicit assignments. The early test and return help to simplify LinkerScript::findMemoryRegion() a bit. Differential Revision: https://reviews.llvm.org/D113768	2021-11-15 15:59:39 +07:00
Shao-Ce SUN	0c660256eb	[NFC] Trim trailing whitespace in *.rst	2021-11-15 09:17:08 +08:00
Keith Smiley	51715fbd96	[lld-macho] Fix warning ``` /Users/ksmiley/dev/llvm-project/lld/MachO/Symbols.cpp:43:27: warning: field 'external' will be initialized after field 'weakDefCanBeHidden' [-Wreorder-ctor] weakDef(isWeakDef), external(isExternal), ^ 1 warning generated. ``` Differential Revision: https://reviews.llvm.org/D113823	2021-11-12 19:36:51 -08:00
Vy Nguyen	9b29dae3ca	[lld-macho] Allow exporting weak_def_can_be_hidden(AKA "autohide") symbols autohide symbols behave similarly to private_extern symbols. However, LD64 allows exporting autohide symbols. LLD currently does not. This patch allows LLD to export them. Differential Revision: https://reviews.llvm.org/D113167	2021-11-12 21:57:30 -05:00
Vy Nguyen	ad932320d8	[lld-macho] Parallelize scanning the symbol tables in export/unexport-ing. (Split from D113167) Benchmarking on one of our large apps which exports a few thousand symbols, this showed an improvement of ~17%. x ./LLD_no_parallel.txt + ./LLD_with_parallel.txt N Min Max Median Avg Stddev x 10 84.01 89.41 88.64 87.693 1.7424061 + 10 71.9 74.29 72.63 72.753 0.77734663 Difference at 95.0% confidence -14.94 +/- 1.26763 -17.0367% +/- 1.44553% (Student's t, pooled s = 1.34912) (wallclock) Differential Revision: https://reviews.llvm.org/D113820	2021-11-12 20:57:24 -05:00
Duncan P. N. Exon Smith	9a2b54af22	lld: const-qualify iterations through VarStreamArray, NFC No functionality change here; just unblocking a patch to LLVM.	2021-11-12 14:29:49 -08:00
Jez Ng	9d0b237c51	[lld-macho] Fix symbol relocs handling for LSDAs Similar to D113702, but for the LSDAs. Clang seems to emit all LSDA relocs as section relocs, but ld -r can turn those relocs into symbol ones. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D113721	2021-11-12 16:02:49 -05:00
Jez Ng	d9b6f7e312	[lld-macho] Teach ICF to dedup functions with identical unwind info Dedup'ing unwind info is tricky because each CUE contains a different function address; if ICF operated naively and compared the entire contents of each CUE, entries with identical unwind info but belonging to different functions would never be considered identical. To work around this problem, we slice away the function address before performing ICF. We rely on `relocateCompactUnwind()` to correctly handle these truncated input sections. Here are the numbers before and after D109944, D109945, and this diff were applied, as tested on my 3.2 GHz 16-Core Intel Xeon W: Without any optimizations: base diff difference (95% CI) sys_time 0.849 ± 0.015 0.896 ± 0.012 [ +4.8% .. +6.2%] user_time 3.357 ± 0.030 3.512 ± 0.023 [ +4.3% .. +5.0%] wall_time 3.944 ± 0.039 4.032 ± 0.031 [ +1.8% .. +2.6%] samples 40 38 With `-dead_strip`: base diff difference (95% CI) sys_time 0.847 ± 0.010 0.896 ± 0.012 [ +5.2% .. +6.5%] user_time 3.377 ± 0.014 3.532 ± 0.015 [ +4.4% .. +4.8%] wall_time 3.962 ± 0.024 4.060 ± 0.030 [ +2.1% .. +2.8%] samples 47 30 With `-dead_strip` and `--icf=all`: base diff difference (95% CI) sys_time 0.935 ± 0.013 0.957 ± 0.018 [ +1.5% .. +3.2%] user_time 3.472 ± 0.022 6.531 ± 0.046 [ +87.6% .. +88.7%] wall_time 4.080 ± 0.040 5.329 ± 0.060 [ +30.0% .. +31.2%] samples 37 30 Unsurprisingly, ICF is now a lot slower, likely due to the much larger number of input sections it needs to process. But the rest of the linker only suffers a mild slowdown. Note that the compact-unwind-bad-reloc.s test was expanded because we now handle the relocation for CUE's function address in a separate code path from the rest of the CUE relocations. The extended test covers both code paths. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D109946	2021-11-12 16:02:49 -05:00
Jez Ng	ad8df21db2	[reland][lld-macho] Fix symbol relocs handling for compact unwind's functionAddress Clang seems to emit all functionAddress relocs as section relocs, but `ld -r` can turn those relocs into symbol ones. It turns out that we weren't handling that case correctly when the symbol was a weak def whose definition did not prevail. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D113702	2021-11-12 15:01:51 -05:00
Keith Smiley	eb6f9f3123	[lld-macho] Fix trailing slash in oso_prefix Previously if you passed `-oso_prefix path/to/foo/` with a trailing slash at the end, using `real_path` would remove that slash, but that slash is necessary to make sure OSO prefix paths end up as valid relative paths instead of starting with `/`. Differential Revision: https://reviews.llvm.org/D113541	2021-11-12 11:29:08 -08:00
Fangrui Song	a05384dc89	[ELF] Make --no-relax disable R_X86_64_GOTPCRELX and R_X86_64_REX_GOTPCRELX GOT optimization This brings back the original version of D81359. I have found several use cases now. * Unlike GNU ld, LLD's relocation processing is one pass. If we decide to optimize (relax) R_X86_64_{,REX_}GOTPCRELX, we will suppress GOT generation and cannot undo the decision later. Optimizing R_X86_64_REX_GOTPCRELX can usually make it easy to hit `relocation R_X86_64_REX_GOTPCRELX out of range` because the distance to GOT is usually shorter. Without --no-relax, the user has to recompile with `-Wa,-mrelax-relocations=no`. * The option would help during my investigation of the root cause of https://git.kernel.org/linus/09e43968db40c33a73e9ddbfd937f46d5c334924 * There is need for relaxation for AArch64 & RISC-V. Implementing this for x86-64 improves consistency with little target-specific cost (two-line X86_64.cpp change). Reviewed By: alexander-shaposhnikov Differential Revision: https://reviews.llvm.org/D113615	2021-11-12 09:47:31 -08:00
Kazu Hirata	835135a8ae	Revert "[lld-macho] Fix symbol relocs handling for compact unwind's functionAddress" This reverts commit `e941fe5061`. The commit in question causes: lld/MachO/InputFiles.cpp:916:13: error: use of undeclared identifier 'it'	2021-11-11 20:29:48 -08:00
Jez Ng	e941fe5061	[lld-macho] Fix symbol relocs handling for compact unwind's functionAddress Clang seems to emit all functionAddress relocs as section relocs, but `ld -r` can turn those relocs into symbol ones. It turns out that we weren't handling that case correctly when the symbol was a weak def whose definition did not prevail. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D113702	2021-11-11 22:53:35 -05:00
Petr Hosek	d56b171ee9	[lld][ELF] Support for R_ARM_THM_JUMP8 This change implements support for R_ARM_THM_JUMP8 relocation in addition to R_ARM_THM_JUMP11 which is already supported by LLD. Differential Revision: https://reviews.llvm.org/D21225	2021-11-11 09:06:52 -08:00
Igor Kudrin	d2dd36bbbe	[ELF] Better resemble GNU ld when placing orphan sections into memory regions An orphan section should be placed in the same memory region as its anchor section if the latter specifies the memory region explicitly. If there is no explicit assignment for the anchor section in the linker script, its memory region is selected by matching attributes, and the same should be done for the orphan section. Before the patch, some scripts that were handled smoothly in GNU ld caused an "error: no memory region specified for section" in lld. Differential Revision: https://reviews.llvm.org/D112925	2021-11-11 15:07:38 +07:00
Jez Ng	a2404f11c7	[lld-macho] Support renaming of LSDA section Previously, our unwind info finalization logic assumed that the LSDA section referenced by `__compact_unwind` was already finalized before `__TEXT,__unwind_info` itself. However, that assumption could be broken by the use of `-rename_section` -- it could be (and is) used to move `__gcc_except_tab` into a different segment later in the file. (__TEXT is always the first non-zerofill segment, so any rename basically guarantees that the section will be ordered after `__unwind_info`.) To handle this case, we compare LSDA relocations instead of their final values in `UnwindInfoSection::finalize()`, and we actually relocate those LSDAs in `UnwindInfoSection::writeTo()`. In order to do this, we need an easy way to track which Symbol a given CUE corresponds to. My solution was to change our `cuPtrVector` into a vector of indices, with each index used for both the symbols vector (`symbolsVec`) as well as the CUE vector (`cuVector`). This change seems perf neutral. Numbers for linking chromium_framework on my 16 core Mac Pro: base diff difference (95% CI) sys_time 1.248 ± 0.025 1.245 ± 0.026 [ -1.3% .. +0.8%] user_time 3.588 ± 0.045 3.587 ± 0.037 [ -0.6% .. +0.5%] wall_time 4.605 ± 0.069 4.595 ± 0.069 [ -1.0% .. +0.5%] samples 42 26 Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D113582	2021-11-10 19:31:54 -05:00
Fangrui Song	51ee08c217	[ELF] Enforce double-dash form for --ignore-{data,function}-pointer-equality --reproduce --thread They are LLD-specific options. We have enforced double-dash forms for other options (reduce collision with short options) but missed them.	2021-11-10 01:17:08 -08:00
Fangrui Song	d71bb6a409	[ELF] Inline isPPC64SmallCodeModelTocReloc which is only called once. NFC	2021-11-09 20:41:05 -08:00
Fangrui Song	bec28ee1ea	[ELF] Move isStaticLinkTimeConstant closer to the only caller processRelocAux. NFC	2021-11-09 20:37:46 -08:00
Fangrui Song	213d1849a4	[ELF] Improve sh_info=0 and sh_info>=num_sections diagnostic for SHT_REL/SHT_RELA PR52408 reported an sh_info=0 instance. I have seen sh_info=0 independently before. sh_info>=num_sections is probably very rare. Just use one diagnostic for the two types of errors. Delete invalid-relocations.test which is covered by invalid/bad-reloc-target.test Differential Revision: https://reviews.llvm.org/D113466	2021-11-09 09:54:12 -08:00
Vy Nguyen	2e1be96df6	Reland "[lld-macho] Fix assertion failure in registerCompactUnwind"" PR/52372 Differential Revision: https://reviews.llvm.org/D112977 New changes: - use llvm-otool instead of `otool` which doesn't exist on non-OSX platforms - add llvm-otool to the set of tools used by test so that the bot will use the <build_dir>/bin/llvm-otool instead of the unqualified `llvm-otool` (which may not exist) - update tests since the latest (TOT) llvm-otool prints a space between two bytes and the old one doesn't.	2021-11-09 11:52:46 -05:00
Vy Nguyen	eb4a517816	Revert "[lld-macho] Fix assertion failure in registerCompactUnwind" broke windows build - reverting to investigate This reverts commit `b2d9258474`.	2021-11-09 10:31:47 -05:00
Vy Nguyen	b2d9258474	[lld-macho] Fix assertion failure in registerCompactUnwind PR/52372 Differential Revision: https://reviews.llvm.org/D112977	2021-11-09 10:08:17 -05:00
Fangrui Song	43bb5f0185	[docs] Remove outdated documentation for the legacy Atom-based LLD The outdated documentation diverges a lot from the current state of COFF/Mach-O/ELF/wasm ports and may just confuse users. It is better rewriting some if useful. Tested with `ninja docs-lld-html` Reviewed By: #lld-macho, lhames, Jez Ng Differential Revision: https://reviews.llvm.org/D113432	2021-11-08 15:20:16 -08:00
Fangrui Song	cebb0a64b4	[ELF][ARM] Improve error message for unknown relocation Like rLLD354040. Before: `error: unrecognized relocation Unknown (254)` Now: `error: unknown relocation (254) against symbol foo`	2021-11-08 12:39:08 -08:00
David Blaikie	78758026e2	Fix lld test after dwarfdump array syntax change	2021-11-05 23:00:29 -07:00
Fangrui Song	26a8ceba3e	[llvm-readobj] Display DT_RELRSZ/DT_RELRENT as " (bytes)" to match RELSZ/RELENT. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D113206	2021-11-05 10:02:49 -07:00
Quinn Pham	c71fbdd87b	[NFC] Inclusive language: Remove instances of master in URLs [NFC] This patch fixes URLs containing "master". Old URLs were either broken or redirecting to the new URL. Reviewed By: #libc, ldionne, mehdi_amini Differential Revision: https://reviews.llvm.org/D113186	2021-11-05 08:48:41 -05:00
Keith Smiley	a7a2959901	[lld-macho] Replace LC_LINKER_OPTION parsing This removes the tablegen based parsing of LC_LINKER_OPTION since it can only actually contain a very small number of potential arguments. In our project with tablegen this took 5 seconds before. This replaces https://reviews.llvm.org/D113075 Differential Revision: https://reviews.llvm.org/D113235	2021-11-04 22:03:40 -07:00
Fangrui Song	005456e5fc	[lld-macho] Fix an assertion failure when -u specifies an undefined section$start symbol This matches ld64. Also improve the test for `-dead_strip`. Reviewed By: #lld-macho, Jez Ng Differential Revision: https://reviews.llvm.org/D113147	2021-11-04 21:28:33 -07:00
Keith Smiley	0bce3e3b84	[lld-macho] Clear resolvedReads cache https://reviews.llvm.org/D113153#3108083 smeenai, int3 Differential Revision: https://reviews.llvm.org/D113198	2021-11-04 18:02:34 -07:00

15048 Commits