llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	bf4fa3036a	[ELF] Use SmallVector for MergeInputSection::pieces. NFC sizeof(pieces) decreases from 24 to 16 on ELF64. One BumpPtrAllocator can store more MergeInputSections. The lld executable becomes smaller.	2021-12-16 21:07:39 -08:00
Fangrui Song	93558e575e	[ELF] Internalize createMergeSynthetic. NFC Only called once. Moving to OutputSections.cpp can make it inlined. finalizeInputSections can be very hot, especially in -O1 links with much debug info.	2021-12-16 20:50:06 -08:00
Daniel Kiss	2b4e6052b3	[lld] Add cet-report and bti-report flags Implement cet-report as supported in binutils. bti-report has the same behaviour for AArch64-BTI. Fixes https://github.com/llvm/llvm-project/issues/44828 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D113901	2021-12-16 16:26:26 +01:00
Fangrui Song	8617996ac1	[ELF] maybeReportUndefined: move sym.isUndefined() check to the caller. NFC Avoid a function call in the majority of cases.	2021-12-16 00:27:19 -08:00
Fangrui Song	101407bfaa	[ELF] parseSymbolVersion: remove unussed pos == 0 check	2021-12-15 23:59:55 -08:00
Fangrui Song	60f5614931	[ELF] SharedFile::parse: cache symbols size for a loop. NFC	2021-12-15 22:45:28 -08:00
Fangrui Song	7b265e9791	[ELF] Move -l -L canonical and --library-path --library aliases Everyone uses -l -L instead of the long option counterparts. Make help messages attach to -L -l and (--reproduce) use them for response.txt command line options.	2021-12-15 21:49:53 -08:00
Fangrui Song	159b948e43	[ELF] ObjFile<ELFT>::initializeSymbols: don't call Allocate when firstGlobal==0 Calling `Allocate` with 0 size (when .symtab is absent, e.g. `invalid/mips-invalid-options-descriptor.test`) may return a nullptr, which will crash with -fsanitize=null (the underlying `Allocate` function is LLVM_ATTRIBUTE_RETURNS_NONNULL).	2021-12-15 18:21:48 -08:00
Fangrui Song	b0211de5e3	[ELF] Change Symbol::verdefIndex from uint32_t to uint16_t The SHT_GNU_version index is 16-bit, so the 32-bit value is a waste. Technically non-default version index 0x7fff uses version index 0xffff, but it is impossible in practice. This change decreases sizeof(SymbolUnion) from 80 to 72 on ELF64 platforms. Memory usage decreases by 1% when linking a large executable.	2021-12-15 17:59:30 -08:00
Fangrui Song	50187d2dd5	[ELF] Speed up ObjFile<ELFT>::createInputSection * Group ".note" section name checks * Move shouldMerge check to the caller	2021-12-15 17:15:32 -08:00
Vincent Lee	d17b092fe6	[lld-macho] Make writing map file asynchronous For large applications that write to map files, writing map files can take quite a bit of time. Sorting the biggest contributors to link times, writing map files ranks in at 2nd place, with load input files being the biggest contributor of link times. Avoiding writing map files on the critical path (and having its own thread) saves ~2-3 seconds when linking chromium framework on a 16-Core Intel Xeon W. ``` base diff difference (95% CI) sys_time 1.617 ± 0.034 1.657 ± 0.026 [ +1.5% .. +3.5%] user_time 28.536 ± 0.245 28.609 ± 0.180 [ -0.1% .. +0.7%] wall_time 23.833 ± 0.271 21.684 ± 0.194 [ -9.5% .. -8.5%] samples 31 24 ``` Reviewed By: #lld-macho, oontvoo, int3 Differential Revision: https://reviews.llvm.org/D115416	2021-12-15 16:37:04 -08:00
Fangrui Song	68009b78f2	[ELF] Symbol::replace: remove dead code	2021-12-15 16:08:18 -08:00
Fangrui Song	b5805b7847	[ELF] ObjFile<ELFT>::initializeSymbols: avoid StringRefZ from undefined symbols	2021-12-15 15:30:18 -08:00
Fangrui Song	2bdad16303	[ELF] SymbolTable::insert: keep @@ in the name * Avoid the name truncation quirk in SymbolTable::insert: the truncated name will be replaced by @@ again. * Allow foo and foo@@v1 in different files to be diagnosed as duplicate definition error (GNU ld behavior) * Avoid potential redundant strlen on symbol name due to StringRefZ in ObjFile<ELFT>::initializeSymbols	2021-12-15 15:19:35 -08:00
Fangrui Song	a8d6d2614b	[ELF] Replace make<Defined> with makeDefined. NFC This removes SpecificAlloc<Defined> and makes my lld executable 1.5k smaller. This drops the small memory waste due to the separate BumpPtrAllocator.	2021-12-15 13:15:03 -08:00
Fangrui Song	a596a5fc12	[ELF] ObjFile<ELFT>::initializeSymbols: Simplify this->symbols[i]. NFC	2021-12-15 13:02:38 -08:00
Fangrui Song	509153f1e7	[ELF] ObjFile<ELFT>::initializeSymbols: Batch allocate local symbols and detangle local/global symbol initialization. My x86-64 lld executable is 8k smaller due to the removal of SpecificAlloc<Undefined>.	2021-12-15 12:54:39 -08:00
Fangrui Song	3534d26cc1	[ELF] Slightly speed up -z keep-text-section-prefix	2021-12-15 10:20:11 -08:00
Fangrui Song	7c0881a38f	[ELF] --gc-sections: Change startwith(".jcr") to exact match GNU ld's internal linker script keeps `.jcr`, but not other sections starting with `.jcr`.	2021-12-15 01:27:08 -08:00
Fangrui Song	21dbfd4300	[ELF] --gc-sections: Change startwith(".init") (and ".fini") to exact match GNU ld's internal linker script keeps `.init`, but not other sections starting with `.init`. .fini is similar.	2021-12-15 01:16:26 -08:00
Fangrui Song	7a54ae9c1d	[ELF] Change objectFiles to ELFFileBase * This can sometimes avoid `cast<ObjFile<...>>`. I intentionally do not touch postScanRelocations to wait for its stabilization.	2021-12-15 00:37:10 -08:00
Fangrui Song	3deb82cd07	[ELF] Adjust getOutputSectionName prefix order Sorting the prefixes by decreasing frequency can improve performance. .gcc_except_table is relatively frequent, so move it ahead. .ctors and .dtors mostly disappear and should be the last.	2021-12-15 00:18:58 -08:00
Fangrui Song	5816f1855c	[ELF] Slightly speed up getOutputSectionName. NFC	2021-12-14 23:43:00 -08:00
Fangrui Song	89661a0e89	[ELF] Remove dead code from SymbolTable::find	2021-12-14 22:41:52 -08:00
Fangrui Song	c720b16aa5	[ELF] Use SmallVector for SharedFile and simplify parseVerdefs SHT_GNU_verdef is typically small, so it's unnecessary to reserve the vector. While here, fix a hypothetical issue when SHT_GNU_verdef has non-increasing version indexes, which don't happen with GNU ld, gold, ld.lld's output. My x86-64 lld executable is 256 bytes smaller.	2021-12-14 21:11:45 -08:00
Fangrui Song	1ff1d50d9f	[ELF] Make InputFile smaller sizeof(ObjFile<ELF64LE>) is decreased from 344 to 272 on an ELF64 system. In a large link with 30000 ObjFiles, this may be 2+MiB saving. Change std::vector members to SmallVector, and std::string members to SmallString<0> (these members typically don't benefit from small string optimization). On Linux x86-64 the lld executable is ~6k smaller.	2021-12-14 20:55:32 -08:00
Fangrui Song	cf783be8d7	Reland D114783/D115603 [ELF] Split scanRelocations into scanRelocations/postScanRelocations (Fixed an issue about GOT on a copy relocated alias.) (Fixed an issue about not creating r_addend=0 IRELATIVE for unreferenced non-preemptible ifunc.) The idea is to make scanRelocations mark some actions are needed (GOT/PLT/etc) and postpone the real work to postScanRelocations. It gives some flexibility: * Make it feasible to support .plt.got (PR32938): we need to know whether GLOB_DAT and JUMP_SLOT are both needed. * Make non-preemptible IFUNC handling slightly cleaner: avoid setting/clearing sym.gotInIgot * -z nocopyrel: report all copy relocation places for one symbol * Make GOT deduplication feasible * Make parallel relocation scanning feasible (if we can avoid all stateful operations and make Symbol attributes atomic), but parallelism may not be the appealing choice Since this patch moves a large chunk of code out of ELFT templates. My x86-64 executable is actually a few hundred bytes smaller. For ppc32-ifunc-nonpreemptible-pic.s: I remove absolute relocation references to non-preemptible ifunc because absolute relocation references are incorrect in -fpie mode. Reviewed By: peter.smith, ikudrin Differential Revision: https://reviews.llvm.org/D114783	2021-12-14 16:28:41 -08:00
Fangrui Song	04cf411c94	[ELF][test] Test unreferenced non-preemptible ifunc Add missing coverage exposed by D114783. There should be no associated IRELATIVE, otherwise (a) glibc ld.so may crash (b) it wastes space (c) unused IPLT causes confusion.	2021-12-14 16:25:50 -08:00
Fangrui Song	ea15b862d7	Revert D114783 [ELF] Split scanRelocations into scanRelocations/postScanRelocations May cause a failure for non-preemptible `bcmp` in a glibc -static link.	2021-12-14 14:33:50 -08:00
Stephan T. Lavavej	8bd106a891	[NFC] Fix typos in release notes. Reviewed By: ldionne, Mordante, MaskRay Differential Revision: https://reviews.llvm.org/D115685	2021-12-14 14:19:42 -08:00
Fangrui Song	6a44013b0e	[ELF] -Map: Print symbols which needs canonical PLT entry/copy relocation just once If a copy related symbol (say `copy`) is referenced in two .o files, this change removes a duplicated line from the -Map output: ``` 202470 202470 1 1 .bss.rel.ro 202470 202470 1 1 <internal>:(.bss.rel.ro) 202470 202470 1 1 copy removed 202470 202470 1 1 copy ``` Differential Revision: https://reviews.llvm.org/D115697	2021-12-14 10:31:06 -08:00
Fangrui Song	b79686c6dc	[ELF] Remove needsPltAddr in favor of needsCopy needsPltAddr is equivalent to `needsCopy && isFunc`. In many places, it is equivalent to `needsCopy` because the non-STT_FUNC cases are ruled out. Reviewed By: ikudrin, peter.smith Differential Revision: https://reviews.llvm.org/D115603	2021-12-14 09:52:43 -08:00
Fangrui Song	e7a95b0674	Reland [ELF] Split scanRelocations into scanRelocations/postScanRelocations (Fixed an issue about GOT on a copy relocated alias.) The idea is to make scanRelocations mark some actions are needed (GOT/PLT/etc) and postpone the real work to postScanRelocations. It gives some flexibility: * Make it feasible to support .plt.got (PR32938): we need to know whether GLOB_DAT and JUMP_SLOT are both needed. * Make non-preemptible IFUNC handling slightly cleaner: avoid setting/clearing sym.gotInIgot * -z nocopyrel: report all copy relocation places for one symbol * Make GOT deduplication feasible * Make parallel relocation scanning feasible (if we can avoid all stateful operations and make Symbol attributes atomic), but parallelism may not be the appealing choice Since this patch moves a large chunk of code out of ELFT templates. My x86-64 executable is actually a few hundred bytes smaller. For ppc32-ifunc-nonpreemptible-pic.s: I remove absolute relocation references to non-preemptible ifunc because absolute relocation references are incorrect in -fpie mode. Reviewed By: peter.smith, ikudrin Differential Revision: https://reviews.llvm.org/D114783	2021-12-13 20:11:24 -08:00
Fangrui Song	d1014d9e6d	[ELF] Improve test for copy relocations on aliases	2021-12-13 20:04:24 -08:00
Fangrui Song	0b8b86e30f	Revert "[ELF] Split scanRelocations into scanRelocations/postScanRelocations" This reverts commit `fc33861d48`. `replaceWithDefined` should copy needsGot, otherwise an alias for a copy relocated symbol may not have GOT entry if its needsGot was originally true.	2021-12-13 19:29:53 -08:00
Noah Shutty	fb6b103daa	[lld] Replace Symbolize.h with DIContext.h in lld's COFF lib lld only needs DIContext.h which it gets through Symbolize.h -> SymbolizableModule.h -> DIContext.h. This replaces it with a direct include of DIContext.h to avoid any confusion and pulling in unnecessary headers. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D115659	2021-12-13 22:16:41 +00:00
Fangrui Song	fc33861d48	[ELF] Split scanRelocations into scanRelocations/postScanRelocations The idea is to make scanRelocations mark some actions are needed (GOT/PLT/etc) and postpone the real work to postScanRelocations. It gives some flexibility: * Make it feasible to support .plt.got (PR32938): we need to know whether GLOB_DAT and JUMP_SLOT are both needed. * Make non-preemptible IFUNC handling slightly cleaner: avoid setting/clearing sym.gotInIgot * -z nocopyrel: report all copy relocation places for one symbol * Make parallel relocation scanning possible (if we can avoid all stateful operations and make Symbol attributes atomic), but parallelism may not be the appealing choice * Make GOT deduplication feasible Since this patch moves a large chunk of code out of ELFT templates. My x86-64 executable is actually a few hundred bytes smaller. For ppc32-ifunc-nonpreemptible-pic.s: I remove absolute relocation references to non-preemptible ifunc because absolute relocation references are incorrect in -fpie mode. Reviewed By: peter.smith, ikudrin Differential Revision: https://reviews.llvm.org/D114783	2021-12-13 09:56:52 -08:00
Fangrui Song	9115d75117	[ELF] Use parallelSort for .rela.dyn An unstable sort suffices. In a large link (11.06s), this decreases .rela.dyn writeTo time from 1.52s to 0.81s, resulting in 6% total time speedup (the benefit will greatly dilute if --pack-dyn-relocs=relr becomes prevailing). Encoding the dynamic relocations then sorting raw Elf_Rel/Elf_Rela doesn't seem to improve much (doing that would require code duplicate because of Elf_Rel/Elf_Rela plus unfortunate mips64le), so don't do that.	2021-12-12 20:53:06 -08:00
Fangrui Song	1eaa9b4374	[ELF] initializeSections: move SHT_LLVM_CALL_GRAPH_PROFILE check into SHF_EXCLUDE && !relocatable. NFC Avoid a comparison in the majority of cases.	2021-12-12 20:05:21 -08:00
Fangrui Song	d29766bb48	[ELF] relocateAlloc: remove variables type and expr. NFC	2021-12-12 19:31:30 -08:00
Fangrui Song	4cfff19b88	[ELF] Move adjustSplitStackFunctionPrologues's splitStack check to the caller. NFC Avoid a function call in the majority of cases and make the output smaller.	2021-12-12 19:26:03 -08:00
Fangrui Song	a8024dfc06	[ELF] Avoid mutable addend parameter. NFC	2021-12-12 19:12:01 -08:00
Fangrui Song	af520fba2e	[ELF][test] Remove unused/incorrect .got check line	2021-12-12 10:51:05 -08:00
Jez Ng	098430cd25	[lld-macho][nfc] Simplify LC_DATA_IN_CODE generation 1. After D113241, we have the section address easily accessible and no longer need to iterate across the LC_SEGMENT commands to emit LC_DATA_IN_CODE. 2. There's no need to store a pointer to the data in code entries during the parse step; we can just look it up as part of the output step. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D115556	2021-12-11 01:01:57 -05:00
Jez Ng	40bcbe48e8	[lld-macho][nfc] InputSections don't need to track their total # of callsites ... only whether they have more than zero. This simplifies the code slightly. I've also moved the field into the ConcatInputSection subclass since it doesn't actually get used by the other InputSections. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D115539	2021-12-11 01:01:57 -05:00
Jez Ng	8a1f2d6580	[lld-macho] Include archive name in bitcode files Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D115281	2021-12-07 19:11:23 -05:00
Igor Kudrin	ce25eb12dd	[ELF] Do not report undefined weak references in shared libraries This fixes an issue introduced in D101996. A weak reference in a shared library could be incorrectly reported if there is another library that has a strong reference to the same symbol. Differential Revision: https://reviews.llvm.org/D115041	2021-12-07 10:10:51 +07:00
Chris Davis	e4eb6216c2	Enable pdbpagesize to allow support for PDB file sizes > 4GB Enable the pdbpagesize flag to allow linking of PDB files > 4GB. Also includes a couple small fixes to change to uint64_t to support the larger file sizes. I updated the max file size check in MSFBuilder.cpp to take into account the page size. Differential Revision: https://reviews.llvm.org/D115051	2021-12-06 18:22:08 -05:00
Jez Ng	1b44364714	[lld-macho] Unreferenced weak dylib symbols shouldn't fetch archive symbols We were fetching archive symbols too eagerly, bloating binary size as well as just screwing up binaries that expected to look up certain symbols only at runtime. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D115092	2021-12-05 15:11:44 -05:00
Kristina Bessonova	0ac75e82ff	Reland [DwarfDebug] Move emission of global vars, types and imports to endModule() This patch proposes to move emission of global variables, types, imported entities, etc from DwarfDebug::beginModule() to DwarfDebug::endModule(). Effectively, this changes nothing but the order of debug entities which will be as follows: * subprograms (including related context, local variables/labels, local imported entities; related types can be created as a part of the emission of local entities of an abstract subprogram); * global variables (including related context and types); * retained types and enums; * non-local-scoped imported entities; * basic types; * other types left (as a part of local variables attributes emission). Note that the order of emitted compile units may also be changed as now we emit units that contain subprograms first and then all other non-empty units. The motivation behind this change is the following: (1) DwarfDebug::beginModule() is run at the very beginning of backend's pipeline, from this time IR can be significantly changed by target-specific passes. If it happens for debug metadata of global entities, those changes will not be reflected in the emitted DWARF. (2) imported subprogram names should refer to an abstract subprogram if it exists, but it isn't known in DwarfDebug::beginModule() (it's possible to make some guesses based on location info, but it's not quite reliable); (3) aforementioned entities if they are scoped within a bracketed block (subject of D113741) couldn't be emitted in DwarfDebug::beginModule() (they need parent emitted first). Another problem is if to try to gather some information about local entities and defer their emission (till subprogram's processing or DwarfDebug::endModule()) all the gathered details might be irrelevant / invalid by the time the entities are being emitted (because of (1)). Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D114705	2021-12-05 13:56:45 +02:00
Leonard Grey	134275d994	[Support] Use final filename for Caching buffer identifier Mach-O LLD uses the buffer identifier of the memory buffer backing an object file to generate stabs which are used by `dsymutil` to find the object file for dSYM generation. When using thinLTO, these buffers are provided by the cache which initially saves them to disk as temporary files beginning with "Thin-" but renames them to persistent files beginning with "llvmcache-" before the buffer is provided to the cache user. However, the buffer is created before the file is renamed and is given the temp file's name as an identifier. This causes the generated stabs to point to nonexistent files. This change names the buffer with the eventual persistent filename. I think this is safe because failing to rename the temp file is a fatal error. Differential Revision: https://reviews.llvm.org/D115055	2021-12-04 22:25:49 -05:00
Kristina Bessonova	a961604819	Revert "[DwarfDebug] Support emitting function-local declaration for a lexical block" This reverts commits * `ee691970a9` (D113741), * `79d3132998` (D114705) due to lldb and dexter test failures.	2021-12-04 18:06:57 +02:00
Kristina Bessonova	79d3132998	[DwarfDebug] Move emission of global vars, types and imports to endModule() This patch proposes to move emission of global variables, types, imported entities, etc from DwarfDebug::beginModule() to DwarfDebug::endModule(). Effectively, this changes nothing but the order of debug entities which will be as follows: * subprograms (including related context, local variables/labels, local imported entities; related types can be created as a part of the emission of local entities of an abstract subprogram); * global variables (including related context and types); * retained types and enums; * non-local-scoped imported entities; * basic types; * other types left (as a part of local variables attributes emission). Note that the order of emitted compile units may also be changed as now we emit units that contain subprograms first and then all other non-empty units. The motivation behind this change is the following: (1) DwarfDebug::beginModule() is run at the very beginning of backend's pipeline, from this time IR can be significantly changed by target-specific passes. If it happens for debug metadata of global entities, those changes will not be reflected in the emitted DWARF. (2) imported subprogram names should refer to an abstract subprogram if it exists, but it isn't known in DwarfDebug::beginModule() (it's possible to make some guesses based on location info, but it's not quite reliable); (3) aforementioned entities if they are scoped within a bracketed block (subject of D113741) couldn't be emitted in DwarfDebug::beginModule() (they need parent emitted first). Another problem is if to try to gather some information about local entities and defer their emission (till subprogram's processing or DwarfDebug::endModule()) all the gathered details might be irrelevant / invalid by the time the entities are being emitted (because of (1)). Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D114705	2021-12-04 14:10:01 +02:00
Fangrui Song	9bd6f6f6d5	[ELF][test] Fix typo in aarch64-cortex-a53-843419-recognize.s	2021-12-03 14:38:56 -08:00
George Koehler	885fb9a257	[ELF][PPC32] Make R_PPC32_PLTREL retain .got PLT usage needs the first 12 bytes of the .got section. We need to keep .got and DT_GOT_PPC even if .got/_GLOBAL_OFFSET_TABLE_ are not referenced (large PIC code may only reference .got2), which is the case in OpenBSD's ld.so, leading to a misleading error, "unsupported insecure BSS PLT object". Fix this by adding R_PPC32_PLTREL to the list of hasGotOffRel. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D114982	2021-12-02 15:28:37 -08:00
Fangrui Song	353fe72ca3	[ELF] Hint -z nostart-stop-gc for __start_ undefined references Make users aware what to do with ld.lld 13.0.0 / GNU ld<2015-10 --gc-sections behavior. Differential Revision: https://reviews.llvm.org/D114830	2021-12-02 11:58:25 -08:00
Keith Smiley	9e3552523e	[lld-macho] Remove old macho darwin lld During the llvm round table it was generally agreed that the newer macho lld implementation is feature complete enough to replace the old implementation entirely. This will reduce confusion for new users who aren't aware of the history. Differential Revision: https://reviews.llvm.org/D114842	2021-12-02 11:04:49 -08:00
Reid Kleckner	8270ff86a1	[ELF] Fix driver.test after `8c3641d0` when cwd is readonly	2021-12-02 10:25:04 -08:00
Sam Clegg	6f5c5cbe5f	[lld][WebAssembly] Fix for debug relocations against undefined function symbols This is very similar to https://reviews.llvm.org/D103557 but applies to symbols which are undefined at link time rather than compile time. We already have code that handles symbols which were defined at link time but dead stripped by `--gc-sections` (See `test/wasm/debug-removed-fn.ll`). In that case the symbols are not live (!isLive()). However, we can also have live symbols (which are references by the program) but which are undefined at link time and are imported by the linker. In the test case here the symbol `undef` is used but is not defined in the program but is imported by the linker due to the `--import-undefined` flag. Fixes: https://github.com/emscripten-core/emscripten/issues/15528 Differential Revision: https://reviews.llvm.org/D114921	2021-12-02 08:36:28 -08:00
Fangrui Song	c5bfffed48	[ELF] Discard input .note.gnu.build-id even with default --build-id=none binutils 2.38 will adopt this behavior https://sourceware.org/bugzilla/show_bug.cgi?id=28639 Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D114910	2021-12-02 09:50:59 +00:00
Igor Kudrin	b0ac68ccb7	[ELF] Prevent internalizing used comdat symbol When a comdat symbol is defined in both bitcode and regular object files, which are contained in the same archive, the linker could lose the flag that the symbol is used in the regular object file and allow LTO to internalize it, which led to "error: undefined symbol". The issue was introduced in D79300. Differential Revision: https://reviews.llvm.org/D114801	2021-12-02 12:10:06 +07:00
Fangrui Song	ad45df91ad	[ELF][PPC64] Remove unneeded PPC64PCRelLongBranchThunk This reverts the PPC64PCRelLongBranchThunk part from D86706. PPC64PCRelLongBranchThunk is the same as PPC64R12SetupStub. Use `__gep_setup_` instead of `__long_branch_pcrel_` for the stub symbol name as it more closely indicates the operation. (Note: GNU ld uses `.long_branch.` and `.plt_branch.`). Reviewed By: NeHuang, nemanjai Differential Revision: https://reviews.llvm.org/D114656	2021-11-30 11:33:17 -08:00
Fangrui Song	8c3641d03e	[ELF] Change -z unknown from error to warning There is a trend of having more optional options (usually security hardening related) like -z cet-report=, -z bti-report=, -z force-bti. If ld.lld 14.0.0 uses a warning, in 15/16/17/... timeframe when people add new options to software, they can worry less about linker errors on ld.lld 14.0.0. In some cases `-z foo` does essential work where a silent ignore can be problematic, but the user has received a warning. From my observation, the doing-essential-work `-z foo` is much fewer than the converse. In addition, the user who cares can use `--fatal-warnings` (Note: GNU ld doesn't upgrade warnings to errors). It is unclear whether we need something like `clang -Wunknown-warning-option`. If we ever run into unfortunate transition like `-z start-stop-gc`, the affected software (e.g. ldc is a compiler which passes linker options to the underlying ld) can blindly add the `-z` option, without worrying it may cause a linker error to LLD 14.0.0. Reviewed By: jrtc27, peter.smith Differential Revision: https://reviews.llvm.org/D114748	2021-11-30 11:06:28 -08:00
Vy Nguyen	74cbd71072	[lld-macho] Mark dylib symbols coming from -weak_framework as weak-ref. PR:52564 Differential Revision: https://reviews.llvm.org/D114397	2021-11-30 09:54:59 -05:00
Fangrui Song	5188f55d32	[ELF] Move ObjFile<ELFT>::{getLocalSymbols,getGlobalSymbols} to non-template ELFFileBase. NFC	2021-11-30 00:50:19 -08:00
Fangrui Song	5047e3a3ba	[ELF] Move GOT/PLT relocation code closer. NFC	2021-11-29 23:10:04 -08:00
Fangrui Song	1ce51a5f35	[ELF] --cref: If -Map is specified, print to the map file PR48282: This behavior matches GNU ld and gold. Reviewed By: markj Differential Revision: https://reviews.llvm.org/D114663	2021-11-29 14:14:53 -08:00
Fangrui Song	4709bacf18	[ELF] Avoid std::stable_partition which may allocate memory. NFC	2021-11-28 21:47:56 -08:00
Fangrui Song	99a2d940dd	[ELF] Speed up/simplify removeUnusedSyntheticSections. NFC Make one change: when the OutputSection is nullptr (due to /DISCARD/ or garbage collected BssSection (replaceCommonSymbols)), discard the SyntheticSection as well.	2021-11-28 21:07:34 -08:00
Fangrui Song	286c11165e	[ELF] Decrease InputSectionBase::entsize to uint32_t While here, change the sh_addralign argument to uint32_t (InputSection ctor's argument and the member are uint32_t); add constexpr.	2021-11-28 19:50:33 -08:00
Fangrui Song	e652f3f04a	[ELF] Simplify some ctx->outSec with sec. NFC	2021-11-28 19:08:27 -08:00
Fangrui Song	89c0f4553e	[ELF] Simplify/remove LinkerScript::switchTo. NFC	2021-11-28 19:05:15 -08:00
Fangrui Song	11291326cd	[ELF] Support --oformat= beside Separate --oformat Both GNU ld's manpage and ours use --oformat= as the canonical form. It's odd that we do not support it...	2021-11-28 18:44:23 -08:00
Fangrui Song	b5f1fa3e5c	[ELF][test] --oformat binary: Check that SIZEOF_HEADERS==0	2021-11-28 18:34:36 -08:00
Fangrui Song	1164c4b375	[ELF] Simplify/remove LinkerScript::output and advance. NFC	2021-11-28 16:58:06 -08:00
Fangrui Song	e80a0b353c	[ELF] Remove unneeded getOutputSectionVA. NFC I attempted to remove it 1 or 2 year ago but kept it just to have a good diagnostic in case the output section is nullptr (should be impossible). It is long enough that we haven't seen such a case.	2021-11-28 16:17:10 -08:00
Fangrui Song	85e50c1080	[ELF] Inline InputSection::getOffset into callers and remove it. NFC This is an unneeded abstraction which may cause confusion: SectionBase::getOffset has the same name but hard codes -1 as the size of OutputSection.	2021-11-28 16:09:04 -08:00
Fangrui Song	7ea662e2dd	[ELF] Replace one make_unique from r316378 with a stack object. NFC	2021-11-28 15:32:29 -08:00
Fangrui Song	25c7ec4fc6	[ELF] Simplify OutputSection::sectionIndex assignment. NFC And improve comments.	2021-11-28 14:56:29 -08:00
Fangrui Song	d060cc1f98	[ELF] Fix out-of-bounds write in memset(&Out::first, ...) Fix r285764: there is no guarantee that Out::first is placed before other static data members of `struct Out`. After `bufferStart` was introduced, this out-of-bounds write is destined in many compilers. It is likely benign, though. And move `Out::elfHeader->size` assignment beside `Out::elfHeader->sectionIndex`	2021-11-28 14:47:57 -08:00
Fangrui Song	cecc6893a0	[ELF] Simplify assignFileOffsets There is a difference with non-SHF_ALLOC SHT_NOBITS when off%sh_addralign!=0 which doesn't happen/matter in practice.	2021-11-28 13:44:42 -08:00
Fangrui Song	f9a4d9aa03	[ELF] -z separate-*: Use max-page-size instead of common-page-size for text/non-SHF_ALLOC transition and writeTrapInstr For -z separate-code and -z separate-loadable-segments: When RW is present, the RX to RW transition is aligned with max-page-size. When RW is absent, the RX to non-SHF_ALLOC transition should use max-page-size as well.	2021-11-28 12:47:50 -08:00
Fangrui Song	6c1c2313d1	[ELF] Simplify assignFileOffsets. NFC	2021-11-28 11:43:59 -08:00
Ard Biesheuvel	da66263b6e	[ARM] implement support for ALU/LDR PC-relative group relocations Currently, LLD does not support the complete set of ARM group relocations. Given that I intend to start using these in the Linux kernel [0], let's add support for these. This implements the group processing as documented in the ELF psABI. Notably, this means support is dropped for very far symbol references that also carry a small component, where the immediate is rotated in such a way that only part of it wraps to the other end of the 32-bit word. To me, it seems unlikely that this is something anyone could be relying on, but of course I could be wrong. [0] https://lore.kernel.org/r/20211122092816.2865873-8-ardb@kernel.org/ Reviewed By: peter.smith, MaskRay Differential Revision: https://reviews.llvm.org/D114172	2021-11-27 10:26:37 +01:00
Fangrui Song	6fa8f7beb1	[ELF][test] Test that .o definition does not inherit .so STV_PROTECTED Test %t2.so %t.o beside %t.o %t2.so	2021-11-26 15:00:10 -08:00
Fangrui Song	f1ba48d508	[ELF] Simplify Symbol::extract. NFC	2021-11-26 14:10:55 -08:00
Fangrui Song	3b4dd68de5	[ELF][PPC64] Make --power10-stubs/--no-power10-stubs proper aliases for --power10-stubs={auto,no} This allows --power10-stubs= and --[no-]power10-stubs to override each other (they are position dependent in GNU ld). Also improve --help messages and the manpage. Note: GNU ld's default "auto" mode uses heuristics to decide whether Power10 instructions are used. Arguably it is a design mistake of R_PPC64_REL24_NOTOC (acked by the relevant folks on a libc-alpha discussion). We don't implement "auto", so the default --power10-stubs is the same as "yes".	2021-11-26 11:51:45 -08:00
Fangrui Song	09401dfcf1	[ELF] Rename fetch to extract The canonical term is "extract" (GNU ld documentation, Solaris's `-z *extract` options). Avoid inventing a term and match --why-extract. (ld64 prefers "load" but the word is overloaded too much) Mostly MFC, except for --help messages and the header row in --print-archive-stats output.	2021-11-26 10:58:50 -08:00
Fangrui Song	7051aeef7a	[ELF] Rename BaseCommand to SectionCommand. NFC BaseCommand was picked when PHDRS/INSERT/etc were not implemented. Rename it to SectionCommand to match `sectionCommands` and make it clear that the commands are used in SECTIONS (except a special case for SymbolAssignment). Also, improve naming of some BaseCommand variables (base -> cmd).	2021-11-25 20:24:23 -08:00
Fangrui Song	e40e17fcaf	[ELF] Make ExprValue smaller. NFC'	2021-11-25 16:55:06 -08:00
Fangrui Song	6188fd4957	[ELF] Rename OutputSection::sectionCommands to commands. NFC This partially reverts r315409: the description applies to LinkerScript, but not to OutputSection. The name "sectionCommands" is used in both LinkerScript::sectionCommands and OutputSection::sectionCommands, which may lead to confusion. "commands" in OutputSection has no ambiguity because there are no other types of commands.	2021-11-25 16:47:07 -08:00
Fangrui Song	ff0d9e6cfa	[ELF] Remove redundant part.dynSymTab creation. NFC	2021-11-25 14:42:22 -08:00
Fangrui Song	5ca54c6686	[ELF] Simplify GnuHashSection::write. NFC	2021-11-25 14:23:25 -08:00
Fangrui Song	55c14d6dbf	[ELF] Simplify DynamicSection content computation. NFC The new code computes the content twice, but avoides the tricky std::function<uint64_t()>. Removed 13KiB code in a Release build.	2021-11-25 14:12:34 -08:00
Fangrui Song	6ca8fde226	[ELF] Emit DF_STATIC_TLS only for -shared This matches GNU ld and saves 2 words for executables.	2021-11-24 23:17:13 -08:00
Fangrui Song	5922dd91f8	[ELF] Rename hasStaticTlsModel to hasTlsIe and remove unneeded atomic.	2021-11-24 21:06:04 -08:00
Fangrui Song	371290dfd4	[ELF] Remove unneeded DF_STATIC_TLS for EM_386 local-exec TLS which is also untested.	2021-11-24 20:43:58 -08:00
Igor Kudrin	8cdf1c1edb	[ELF] Support the "read-only" memory region attribute The attribute 'r' allows (or disallows for the negative case) read-only sections, i.e. ones without the SHF_WRITE flag, to be assigned to the memory region. Before the patch, lld could put a section in the wrong region or fail with "error: no memory region specified for section". Differential Revision: https://reviews.llvm.org/D113771	2021-11-24 12:17:09 +07:00
Fangrui Song	38ed1db7e8	[ELF] Support non-RAX/non-adjacent R_X86_64_GOTPC32_TLSDESC/R_X86_64_TLSDESC_CALL The current TLSDESC optimization code assumes: ``` leaq x@tlsdesc(%rip), %rax call x@tlscall(%rax) # adjacent ``` From https://gitlab.freedesktop.org/mesa/mesa/-/issues/5665 , it seems that the two instructions may not be adjacent in GCC 10's output: ``` leaq x@tlsdesc(%rip), %rax something else call x@tlscall(%rax) ``` This patch supports the case. While here, support non-RAX registers for R_X86_64_GOTPC32_TLSDESC, in case the compiler generates inefficient: ``` leaq x@tlsdesc(%rip), %rcx # or %rdx, %rbx, %rdi, ... movq %rcx, %rax call *x@tlscall(%rax) # GNU ld/gold error for non-RAX ``` Differential Revision: https://reviews.llvm.org/D114416	2021-11-23 10:30:11 -08:00
Martin Storsjö	d703b92296	[LLD] [COFF] Omit section symbols and IMAGE_SYM_CLASS_LABEL from the PE symbol table The section symbols aren't of much practical use when looking at a linked image. This shrinks one observed mingw style unstripped binary by 14%. IMAGE_SYM_CLASS_LABEL is in spirit the same as a temporary assembler label that isn't emitted on the object file level at all. Differential Revision: https://reviews.llvm.org/D113866	2021-11-23 10:17:04 +02:00
Martin Storsjö	7c15da6761	[LLD] [COFF] Interpret the immediate in ARM64 adr/adrp relocations as signed 21 bit This matches how MS link.exe interprets this relocation. Differential Revision: https://reviews.llvm.org/D114347	2021-11-23 10:13:01 +02:00
Shoaib Meenai	2f5d6a0ea5	[MachO] Fix struct size assertion std::vector can have different sizes depending on the STL's debug level, so account for its size separately. (You could argue that we should be accounting for all the other members separately as well, but that would be very unergonomic, and std::vector is the only one that's caused problems so far.)	2021-11-22 15:02:30 -08:00
Fangrui Song	7aafe467d2	[ELF] Simplify a condition with config->copyRelocs. NFC	2021-11-22 13:59:23 -08:00
Vy Nguyen	944071eca2	[lld-macho] Don't replace local personality symbol with LazySymbol Follup-up to D107533, where we replaced local syms with non-local. It doesn't make sense to replace local symbol with lazy. Differential Revision: https://reviews.llvm.org/D110040	2021-11-22 14:09:54 -05:00
Igor Kudrin	a05b694b1e	[ELF][NFC] Do not pass region name to expandMemoryRegion() The name can be easily got on-site. Differential Revision: https://reviews.llvm.org/D114228	2021-11-22 14:19:07 +07:00
Fangrui Song	648157b05a	[ELF] Move getOutputSectionName from Writer.cpp to LinkerScript.cpp. NFC and internalize it.	2021-11-20 22:18:09 -08:00
Fangrui Song	2997441b85	[ELF] Support discarding .got.plt Fix a null pointer dereference when .got.plt is discarded. This also adds a test for discarding `.plt`. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D114180	2021-11-19 10:50:53 -08:00
Nico Weber	bc20bcb39e	[lld/mac] Crash even less on undefined symbols with --icf=all Follow-up to https://reviews.llvm.org/D112643. Even after that change, we were still asserting if two separate functions that are eligible for ICF (same size, same data, same number of relocs, same reloc types, ...) referred to Undefineds. This fixes that oversight. Differential Revision: https://reviews.llvm.org/D114195	2021-11-19 09:23:19 -05:00
Andrew Ng	47eb3f155f	[ELF] Ensure output section is not discarded in addStartEndSymbols() Fixes https://bugs.llvm.org/show_bug.cgi?id=52534. Differential Revision: https://reviews.llvm.org/D114179	2021-11-19 11:45:58 +00:00
Konstantin Schwarz	8c18719bae	[ELF] Expand LMA region if output section alignment introduces padding When aligning the start address of an output section introduces a gap between the current dot pointer and the new aligned address, we were already properly expanding the memory region, if available. D74286 introduced a new behavior to also align the LMA address if an LMA region is specified. However, this did not expand the corresponding LMA region. Now, we also expand the LMA region if it is set. This fixes PR52510. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D114166	2021-11-19 11:27:21 +01:00
Vincent Lee	adfbb5411b	[lld-macho] Add warn flags to enable/disable warnings on -install_name ld64 doesn't warn on builds using `-install_name` if it's a bundle. But, the current warning is nice to have because `install_name` only works with dylib. To prevent an overflow of warnings in build logs and have parity with ld64, create a `--warn-dylib-install-name` and `--warn-no-dylib-install-name` flag that enables this LLD specific warning. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D113534	2021-11-17 16:18:14 -08:00
Greg McGary	9cc489a4b2	[lld-macho][nfc] Factor-out NFC changes from main __eh_frame diff In order to keep signal:noise high for the `__eh_frame` diff, I have teased-out the NFC changes and put them here. Differential Revision: https://reviews.llvm.org/D114017	2021-11-17 15:16:44 -07:00
Shoaib Meenai	01510ac084	[MachO] Move type size asserts to source files. NFC As discussed in https://reviews.llvm.org/D113809#3128636. It's a bit unfortunate to move the asserts away from the structs whose sizes they're checking, but it's a far better developer experience when one of the asserts is violated, because you get a single error instead of every single source file including the header erroring out.	2021-11-16 17:14:16 -08:00
Vy Nguyen	34d15eaced	[lld-macho][nfc] Sanity check on template type Differential Revision: https://reviews.llvm.org/D114044	2021-11-16 20:04:49 -05:00
Shoaib Meenai	93bf271f27	[MachO] Shrink reloc from 32 bytes to 24 bytes The `r_address` field of `relocation_info` is only 4 bytes, so our offset field (which is the `r_address` field adjusted for subsection splitting) also only needs to be 4 bytes. This reduces the structure size from 32 bytes to 24 bytes. Combined with https://reviews.llvm.org/D113813, this is a minor perf improvement for linking an internal app, tested on two machines: ``` smol-relocs baseline difference (95% CI) sys_time 7.367 ± 0.138 7.543 ± 0.157 [ +0.9% .. +3.8%] user_time 21.843 ± 0.351 21.861 ± 0.450 [ -1.3% .. +1.4%] wall_time 20.301 ± 0.307 20.556 ± 0.324 [ +0.1% .. +2.4%] samples 16 16 smol-relocs baseline difference (95% CI) sys_time 2.923 ± 0.050 2.992 ± 0.018 [ +1.4% .. +3.4%] user_time 10.345 ± 0.039 10.448 ± 0.023 [ +0.8% .. +1.2%] wall_time 12.068 ± 0.071 12.229 ± 0.021 [ +1.0% .. +1.7%] samples 15 12 ``` More importantly though, this change by itself reduces our maximum resident set size by 220 MB (2.75%, from 7.85 GB to 7.64 GB) on the first machine. On the second machine, it reduces it by 125 MB (1.94%, from 6.31 GB to 6.19 GB). Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113818	2021-11-16 16:30:34 -08:00
Shoaib Meenai	3195297897	[MachO] Reduce size of Symbol and Defined We can lay out Symbol more optimally to reduce its size from 56 bytes to 48 bytes by eliminating unnecessary padding, and we can lay out Defined such that its bitfield members are placed in the tail padding of Symbol (on ABIs which support this), to reduce it from 96 bytes to 80 bytes (8 bytes from the Symbol reduction, and 8 bytes from the tail padding reuse). This is perf-neutral for an internal app (results from two different machines): ``` smol-syms baseline difference (95% CI) sys_time 7.430 ± 0.202 7.440 ± 0.193 [ -2.6% .. +2.9%] user_time 21.443 ± 0.513 21.206 ± 0.396 [ -3.3% .. +1.1%] wall_time 20.453 ± 0.534 20.222 ± 0.488 [ -3.7% .. +1.5%] samples 9 8 smol-syms baseline difference (95% CI) sys_time 3.011 ± 0.050 3.040 ± 0.052 [ -0.4% .. +2.3%] user_time 10.416 ± 0.075 10.496 ± 0.091 [ +0.1% .. +1.4%] wall_time 12.229 ± 0.144 12.354 ± 0.192 [ -0.1% .. +2.1%] samples 14 13 ``` However, on the first machine, it reduces maximum resident set size by 65.9 MB (0.8%, from 7.92 GB to 7.85 GB). On the second machine, it reduces it by 92 MB (1.4%, from 6.40 GB to 6.31 GB). Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113813	2021-11-16 16:30:33 -08:00
Shoaib Meenai	637a3396b3	[MachO] Fix struct size assertion It was checking for 64-bit builds incorrectly. Unfortunately, ConcatInputSection has grown a bit in the meantime, and I don't see any obvious way to shrink it. Perhaps icfEqClass could use 32-bit hashes instead of 64-bit ones, but xxHash64 is supposed to be much faster than xxHash32 (https://github.com/Cyan4973/xxHash#benchmarks), so that sounds like a loss. (Unrelatedly, we should really look at using XXH3 instead of xxHash64 now.) Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113809	2021-11-16 16:30:31 -08:00
Greg McGary	3a1b3c9afe	[lld-macho][nfc] rename parsed-section types & variables This is an NFC diff that prepares for pruning & relocating `__eh_frame`. Along the way, I made the following changes to ... * clarify usage of `section` vs. `subsection` * remove `map` & `vec` from type names * disambiguate class `Section` from template parameter `SectionHeader`. Differential Revision: https://reviews.llvm.org/D113241	2021-11-16 07:06:41 -07:00
Quinn Pham	1ca00ecfb8	[NFC][lld] Inclusive language: change master file to merged file [NFC] As part of using inclusive language within the llvm project, this patch replaces master with merged in these comments. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D113903	2021-11-15 14:32:09 -06:00
Igor Kudrin	66691de94c	[ELF] Do not try to assign a memory region to a non-allocatable section Non-allocatable sections are not part of the memory image of the program, so there is no need to find memory regions for them either matching properties or handling explicit assignments. The early test and return help to simplify LinkerScript::findMemoryRegion() a bit. Differential Revision: https://reviews.llvm.org/D113768	2021-11-15 15:59:39 +07:00
Shao-Ce SUN	0c660256eb	[NFC] Trim trailing whitespace in *.rst	2021-11-15 09:17:08 +08:00
Keith Smiley	51715fbd96	[lld-macho] Fix warning ``` /Users/ksmiley/dev/llvm-project/lld/MachO/Symbols.cpp:43:27: warning: field 'external' will be initialized after field 'weakDefCanBeHidden' [-Wreorder-ctor] weakDef(isWeakDef), external(isExternal), ^ 1 warning generated. ``` Differential Revision: https://reviews.llvm.org/D113823	2021-11-12 19:36:51 -08:00
Vy Nguyen	9b29dae3ca	[lld-macho] Allow exporting weak_def_can_be_hidden(AKA "autohide") symbols autohide symbols behaves similarly to private_extern symbols. However, LD64 allows exporting autohide symbols. LLD currently does not. This patch allows LLD to export them. Differential Revision: https://reviews.llvm.org/D113167	2021-11-12 21:57:30 -05:00
Vy Nguyen	ad932320d8	[lld-macho] Parallelize scanning the symbol tables in export/unexport-ing. (Split from D113167) Benchmarking on one of our large apps which exports a few thousands symbols, this showed an improvement of ~17%. x ./LLD_no_parallel.txt + ./LLD_with_parallel.txt N Min Max Median Avg Stddev x 10 84.01 89.41 88.64 87.693 1.7424061 + 10 71.9 74.29 72.63 72.753 0.77734663 Difference at 95.0% confidence -14.94 +/- 1.26763 -17.0367% +/- 1.44553% (Student's t, pooled s = 1.34912) (wallclock) Differential Revision: https://reviews.llvm.org/D113820	2021-11-12 20:57:24 -05:00
Duncan P. N. Exon Smith	9a2b54af22	lld: const-qualify iterations through VarStreamArray, NFC No functionality change here; just unblocking a patch to LLVM.	2021-11-12 14:29:49 -08:00
Jez Ng	9d0b237c51	[lld-macho] Fix symbol relocs handling for LSDAs Similar to D113702, but for the LSDAs. Clang seems to emit all LSDA relocs as section relocs, but ld -r can turn those relocs into symbol ones. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D113721	2021-11-12 16:02:49 -05:00
Jez Ng	d9b6f7e312	[lld-macho] Teach ICF to dedup functions with identical unwind info Dedup'ing unwind info is tricky because each CUE contains a different function address, if ICF operated naively and compared the entire contents of each CUE, entries with identical unwind info but belonging to different functions would never be considered identical. To work around this problem, we slice away the function address before performing ICF. We rely on `relocateCompactUnwind()` to correctly handle these truncated input sections. Here are the numbers before and after D109944, D109945, and this diff were applied, as tested on my 3.2 GHz 16-Core Intel Xeon W: Without any optimizations: base diff difference (95% CI) sys_time 0.849 ± 0.015 0.896 ± 0.012 [ +4.8% .. +6.2%] user_time 3.357 ± 0.030 3.512 ± 0.023 [ +4.3% .. +5.0%] wall_time 3.944 ± 0.039 4.032 ± 0.031 [ +1.8% .. +2.6%] samples 40 38 With `-dead_strip`: base diff difference (95% CI) sys_time 0.847 ± 0.010 0.896 ± 0.012 [ +5.2% .. +6.5%] user_time 3.377 ± 0.014 3.532 ± 0.015 [ +4.4% .. +4.8%] wall_time 3.962 ± 0.024 4.060 ± 0.030 [ +2.1% .. +2.8%] samples 47 30 With `-dead_strip` and `--icf=all`: base diff difference (95% CI) sys_time 0.935 ± 0.013 0.957 ± 0.018 [ +1.5% .. +3.2%] user_time 3.472 ± 0.022 6.531 ± 0.046 [ +87.6% .. +88.7%] wall_time 4.080 ± 0.040 5.329 ± 0.060 [ +30.0% .. +31.2%] samples 37 30 Unsurprisingly, ICF is now a lot slower, likely due to the much larger number of input sections it needs to process. But the rest of the linker only suffers a mild slowdown. Note that the compact-unwind-bad-reloc.s test was expanded because we now handle the relocation for CUE's function address in a separate code path from the rest of the CUE relocations. The extended test covers both code paths. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D109946	2021-11-12 16:02:49 -05:00
Jez Ng	ad8df21db2	[reland][lld-macho] Fix symbol relocs handling for compact unwind's functionAddress Clang seems to emit all functionAddress relocs as section relocs, but `ld -r` can turn those relocs into symbol ones. It turns out that we weren't handling that case correctly when the symbol was a weak def whose definition did not prevail. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D113702	2021-11-12 15:01:51 -05:00
Keith Smiley	eb6f9f3123	[lld-macho] Fix trailing slash in oso_prefix Previously if you passed `-oso_prefix path/to/foo/` with a trailing slash at the end, using `real_path` would remove that slash, but that slash is necessary to make sure OSO prefix paths end up as valid relative paths instead of starting with `/`. Differential Revision: https://reviews.llvm.org/D113541	2021-11-12 11:29:08 -08:00
Fangrui Song	a05384dc89	[ELF] Make --no-relax disable R_X86_64_GOTPCRELX and R_X86_64_REX_GOTPCRELX GOT optimization This brings back the original version of D81359. I have found several use cases now. * Unlike GNU ld, LLD's relocation processing is one pass. If we decide to optimize(relax) R_X86_64_{,REX_}GOTPCRELX, we will suppress GOT generation and cannot undo the decision later. Optimizing R_X86_64_REX_GOTPCRELX can usually make it easy to hit `relocation R_X86_64_REX_GOTPCRELX out of range` because the distance to GOT is usually shorter. Without --no-relax, the user has to recompile with `-Wa,-mrelax-relocations=no`. * The option would help during my investigationg of the root cause of https://git.kernel.org/linus/09e43968db40c33a73e9ddbfd937f46d5c334924 * There is need for relaxation for AArch64 & RISC-V. Implementing this for x86-64 improves consistency with little target-specific cost (two-line X86_64.cpp change). Reviewed By: alexander-shaposhnikov Differential Revision: https://reviews.llvm.org/D113615	2021-11-12 09:47:31 -08:00
Kazu Hirata	835135a8ae	Revert "[lld-macho] Fix symbol relocs handling for compact unwind's functionAddress" This reverts commit `e941fe5061`. The commit in question causes: lld/MachO/InputFiles.cpp:916:13: error: use of undeclared identifier 'it'	2021-11-11 20:29:48 -08:00
Jez Ng	e941fe5061	[lld-macho] Fix symbol relocs handling for compact unwind's functionAddress Clang seems to emit all functionAddress relocs as section relocs, but `ld -r` can turn those relocs into symbol ones. It turns out that we weren't handling that case correctly when the symbol was a weak def whose definition did not prevail. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D113702	2021-11-11 22:53:35 -05:00
Petr Hosek	d56b171ee9	[lld][ELF] Support for R_ARM_THM_JUMP8 This change implements support for R_ARM_THM_JUMP8 relocation in addition to R_ARM_THM_JUMP11 which is already supported by LLD. Differential Revision: https://reviews.llvm.org/D21225	2021-11-11 09:06:52 -08:00
Igor Kudrin	d2dd36bbbe	[ELF] Better resemble GNU ld when placing orphan sections into memory regions An orphan section should be placed in the same memory region as its anchor section if the latter specifies the memory region explicitly. If there is no explicit assignment for the anchor section in the linker script, its memory region is selected by matching attributes, and the same should be done for the orphan section. Before the patch, some scripts that were handled smoothly in GNU ld caused an "error: no memory region specified for section" in lld. Differential Revision: https://reviews.llvm.org/D112925	2021-11-11 15:07:38 +07:00
Jez Ng	a2404f11c7	[lld-macho] Support renaming of LSDA section Previously, our unwind info finalization logic assumed that the LSDA section referenced by `__compact_unwind` was already finalized before `__TEXT,__unwind_info` itself. However, that assumption could be broken by the use of `-rename_section` -- it could be (and is) used to move `__gcc_except_tab` it into a different segment later in the file. (__TEXT is always the first non-zerofill segment, so any rename basically guarantees that the section will be ordered after `__unwind_info`.) To handle this case, we compare LSDA relocations instead of their final values in `UnwindInfoSection::finalize()`, and we actually relocate those LSDAs in `UnwindInfoSection::writeTo()`. In order to do this, we need an easy way to track which Symbol a given CUE corresponds to. My solution was to change our `cuPtrVector` into a vector of indices, with each index used for both the symbols vector (`symbolsVec`) as well as the CUE vector (`cuVector`). This change seems perf neutral. Numbers for linking chromium_framework on my 16 core Mac Pro: base diff difference (95% CI) sys_time 1.248 ± 0.025 1.245 ± 0.026 [ -1.3% .. +0.8%] user_time 3.588 ± 0.045 3.587 ± 0.037 [ -0.6% .. +0.5%] wall_time 4.605 ± 0.069 4.595 ± 0.069 [ -1.0% .. +0.5%] samples 42 26 Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D113582	2021-11-10 19:31:54 -05:00
Fangrui Song	51ee08c217	[ELF] Enforce double-dash form for --ignore-{data,function}-pointer-equality --reproduce --thread They are LLD-specific options. We have enforced double-dash forms for other options (reduce collision with short options) but missed them.	2021-11-10 01:17:08 -08:00
Fangrui Song	d71bb6a409	[ELF] Inline isPPC64SmallCodeModelTocReloc which is only called once. NFC	2021-11-09 20:41:05 -08:00
Fangrui Song	bec28ee1ea	[ELF] Move isStaticLinkTimeConstant closer to the only caller processRelocAux. NFC	2021-11-09 20:37:46 -08:00
Fangrui Song	213d1849a4	[ELF] Improve sh_info=0 and sh_info>=num_sections diagnostic for SHT_REL/SHT_RELA PR52408 reported an sh_info=0 instance. I have seen sh_info=0 independently before. sh_info>=num_sections is probably very rare. Just use one diagnostic for the two types of errors. Delete invalid-relocations.test which is covered by invalid/bad-reloc-target.test Differential Revision: https://reviews.llvm.org/D113466	2021-11-09 09:54:12 -08:00
Vy Nguyen	2e1be96df6	Reland "[lld-macho] Fix assertion failure in registerCompactUnwind"" PR/52372 Differential Revision: https://reviews.llvm.org/D112977 New changes: - use llvm-otool instead of `otool` which doesn't in exist on non-OSX platforms - add llvm-otool to the set of tools used by test so that the bot will use the <build_dir>/bin/llvm-otool instead of the unqualified `llvm-otool` (which may not exist) - update tests since the latest (TOT) llvm-otool prints a space between two bytes and the old one doesn't.	2021-11-09 11:52:46 -05:00
Vy Nguyen	eb4a517816	Revert "[lld-macho] Fix assertion failure in registerCompactUnwind" broke windows build - reverting to investigate This reverts commit `b2d9258474`.	2021-11-09 10:31:47 -05:00
Vy Nguyen	b2d9258474	[lld-macho] Fix assertion failure in registerCompactUnwind PR/52372 Differential Revision: https://reviews.llvm.org/D112977	2021-11-09 10:08:17 -05:00
Fangrui Song	43bb5f0185	[docs] Remove outdated documentation for the legacy Atom-based LLD The outdated documentation diverges a lot from the current state of COFF/Mach-O/ELF/wasm ports and may just confuse users. It is better rewriting some if useful. Tested with `ninja docs-lld-html` Reviewed By: #lld-macho, lhames, Jez Ng Differential Revision: https://reviews.llvm.org/D113432	2021-11-08 15:20:16 -08:00
Fangrui Song	cebb0a64b4	[ELF][ARM] Improve error message for unknown relocation Like rLLD354040. Before: `error: unrecognized relocation Unknown (254)` Now: `error: unknown relocation (254) against symbol foo`	2021-11-08 12:39:08 -08:00
David Blaikie	78758026e2	Fix lld test after dwarfdump array syntax change	2021-11-05 23:00:29 -07:00
Fangrui Song	26a8ceba3e	[llvm-readobj] Display DT_RELRSZ/DT_RELRENT as " (bytes)" to match RELSZ/RELENT. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D113206	2021-11-05 10:02:49 -07:00
Quinn Pham	c71fbdd87b	[NFC] Inclusive language: Remove instances of master in URLs [NFC] This patch fixes URLs containing "master". Old URLs were either broken or redirecting to the new URL. Reviewed By: #libc, ldionne, mehdi_amini Differential Revision: https://reviews.llvm.org/D113186	2021-11-05 08:48:41 -05:00
Keith Smiley	a7a2959901	[lld-macho] Replace LC_LINKER_OPTION parsing This removes the tablegen based parsing of LC_LINKER_OPTION since it can only actually contain a very small number of potential arguments. In our project with tablegen this took 5 seconds before. This replaces https://reviews.llvm.org/D113075 Differential Revision: https://reviews.llvm.org/D113235	2021-11-04 22:03:40 -07:00
Fangrui Song	005456e5fc	[lld-macho] Fix an assertion failure when -u specifies an undefined section$start symbol This matches ld64. Also improve the test for `-dead_strip`. Reviewed By: #lld-macho, Jez Ng Differential Revision: https://reviews.llvm.org/D113147	2021-11-04 21:28:33 -07:00
Keith Smiley	0bce3e3b84	[lld-macho] Clear resolvedReads cache https://reviews.llvm.org/D113153#3108083 smeenai, int3 Differential Revision: https://reviews.llvm.org/D113198	2021-11-04 18:02:34 -07:00
Noah Shutty	d788c44f5c	[Support] Improve Caching conformance with Support library behavior This diff makes several amendments to the local file caching mechanism which was migrated from ThinLTO to Support in rGe678c51177102845c93529d457b020f969125373 in response to follow-up discussion on that commit. Patch By: noajshu Differential Revision: https://reviews.llvm.org/D113080	2021-11-04 13:00:44 -07:00
Keith Smiley	e7fdff403e	[lld-macho] Silently ignore the -objc_abi_version This undocumented ld64 flag, based on the most recent ld64 source dump from Xcode 12, only applies to i386. It seems like on all newer architectures this behavior is the default. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113070	2021-11-03 22:16:09 -07:00
Keith Smiley	d49e7244cc	[lld-macho] Cache readFile results In one of our links lld was reading 760k files, but the unique number of files was only 1500. This takes that link from 30 seconds to 8. This seems like a heavy hammer, especially since some things don't need to be cached, like the filelist arguments and the passed static archives (the latter is already cached as a one off), but it seems ld64 does something similar here to short circuit these duplicate reads: `82e429e186/src/ld/InputFiles.cpp (L644-L665)` Of the types of files being read for our iOS app, the biggest problem was constantly re-reading small tbd files: ``` % wc -l /tmp/read.txt 761414 /tmp/read.txt % cat /tmp/read.txt \| sort -u \| wc -l 1503 % cat /tmp/read.txt \| grep "\.a$" \| wc -l 43721 % cat /tmp/read.txt \| grep "\.tbd$" \| wc -l 717656 ``` We could likely hoist this logic up to not cache at this level, but it would be a more invasive change to make sure all callers that needed it cached the results. I could see this being an issue with OOMs, and I'm not a linker expert so maybe there's another way we should solve this problem? Feedback welcome! Reviewed By: int3, #lld-macho Differential Revision: https://reviews.llvm.org/D113153	2021-11-03 22:12:21 -07:00
Keith Smiley	6629ec3ecc	[lld-macho] Implement -arch_errors_fatal By default with ld64, architecture mismatches are just warnings, then this flag can be passed to make these fail. This matches that behavior. Reviewed By: int3, #lld-macho Differential Revision: https://reviews.llvm.org/D113082	2021-11-03 22:01:53 -07:00
Jez Ng	4ae8c83104	[lld-macho][nfc] Remove unnecessary -pie flags in tests D101513 means that we no longer need to specify `-pie` in most of our test RUN commands. Let's clean up the unused flags so as not to confuse future test writers. Reviewed By: #lld-macho, oontvoo, MaskRay Differential Revision: https://reviews.llvm.org/D113114	2021-11-04 00:02:03 -04:00
Keith Smiley	4313c56aa3	[lld-macho] Enable search-paths tests on macOS I'm not sure what the history is here but this test passes on macOS today. It seems like we should unify these tests if they need to run cross platform. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113085	2021-11-03 12:01:36 -07:00
Keith Smiley	63e65de3ff	[lld-macho] Cache discovered framework paths On our large iOS project this took a link from 1 minute 45 seconds to 45 seconds. For reference ld64 does the same link in ~20 seconds. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113063	2021-11-03 11:11:54 -07:00
Keith Smiley	f79e65e61f	[lld-macho] Cache library paths from findLibrary On top of https://reviews.llvm.org/D113063 this took another 10 seconds off our overall link time. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113073	2021-11-03 10:02:23 -07:00
Fangrui Song	c977564fc2	Revert "[ELF] Try appeasing --target=armv7-linux-androideabi24 sanitizer symbolization tests" This reverts commit `5cbec88cbf`. Vitaly said that `2faac77f26` actually works. Sanitizer's armv7-linux-androideabi24 configuration has other issues which haven't been identified yet, but that's unrelated to the empty symbol name issue.	2021-11-03 00:56:09 -07:00
Fangrui Song	5cbec88cbf	[ELF] Try appeasing --target=armv7-linux-androideabi24 sanitizer symbolization tests	2021-11-02 18:57:04 -07:00
Vy Nguyen	37f96cb478	Revert "[lld-macho] Change bitfield types to be identical." This reverts commit `ae31f9fbad`. Reason: bitfields can't be merged across parent/child classes anyway. So this change doesn't help.	2021-11-02 16:57:51 -04:00
Vy Nguyen	ae31f9fbad	[lld-macho] Change bitfield types to be identical. Symbol's subclasses all have an additional bitfield of type uint8_t (RefState enum). For the bitfields in the same block tomerge, they should be of the same type. (clang/gcc will work, but others like MSVC does not) Differential Revision: https://reviews.llvm.org/D113040	2021-11-02 15:48:39 -04:00
Nico Weber	64c1734438	[lld/mac] Write -v output to stderr This matches ld64, and it's conceivable that projects try to read this information off stderr for that reason. --version keeps writing to stdout. Differential Revision: https://reviews.llvm.org/D113020	2021-11-02 13:59:14 -04:00
Vy Nguyen	d7e5393af4	[lld-macho] Remove no_dtrace_dof from un-implemented group. One fewer warning. In practice, lld already "implements" it. (ie., it does not do dtrace-dof processing ever). Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D112934	2021-11-02 12:36:08 -04:00
Vy Nguyen	3f35dd06a5	[lld-macho][nfc][cleanup] Fix a few code style lints and clang-tidy findings - Use .empty() instead of `size() == 0` when possible. - Use const-ref to avoid copying Differential Revision: https://reviews.llvm.org/D112978	2021-11-02 11:26:15 -04:00
Shoaib Meenai	7a4b27609d	[lld] Add test suite mode for running LLD main twice LLD_IN_TEST determines how many times each port's `main` function is run in each LLD process, and setting LLD_IN_TEST=2 (or higher) is useful for checking if we're cleaning up and resetting global state correctly. Add a test suite parameter to enable this easily. There's work in progress to remove global state (e.g. D108850), but this seems useful in the interim. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D112898	2021-11-01 14:26:54 -07:00
Fangrui Song	2f7366c89d	[ELF] Simplify R_DTPREL. NFC	2021-10-31 20:30:00 -07:00
Shoaib Meenai	264d3b6d4e	[MachO] Use error instead of fatal for missing -arch `fatal` should only be used for malformed inputs according to ErrorHandler.h; `error` is more appropriate for missing arguments, accompanied by a check to bail out early in case of the error. Some tests need to be adjusted accordingly. Makes `lld/test/MachO/arch.s` pass with `LLD_IN_TEST=2`. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D112879	2021-10-31 16:31:21 -07:00
Shoaib Meenai	0f6d720f1f	[MachO] Properly reset global state We need to reset global state between runs, similar to the other ports. There's some file-static state which needs to be reset as well and we need to add some new helpers for that. With this change, most LLD Mach-O tests pass with `LLD_IN_TEST=2` (which runs the linker twice on each test). Some tests will be fixed by the remainder of this stack, and the rest are fundamentally incompatible with that mode (e.g. they intentionally throw fatal errors). Fixes PR52070. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D112878	2021-10-31 16:14:29 -07:00
Nico Weber	f964ca896f	[lld/coff] Add parsing for /pdbpagesize: flag It's not used for anything yet, but we now accept `/pdbpagesize:4096` (the default behavior) and we give arguably more useful diagnostics for other values. It's plumbed through to the MSF layer, so just uncommenting out the bit in DriverUtils.cpp that rejects args other than 4096 is enough to try other values. Differential Revision: https://reviews.llvm.org/D112871	2021-10-31 18:36:23 -04:00
Fangrui Song	9f8ffaaa0b	[ELF] Replace "symbol '...' has no type" diagnostic with "relocation ... cannot be used against symbol '...'" The "symbol 'foo' has no type" diagnostic tries to inform that copy relocation/canonical PLT entry cannot be used, but the diagnostic is often incorrect and confusing.	2021-10-31 13:12:26 -07:00
Fangrui Song	164194a5af	[ELF] Untangle R_GOT style TLS IE and processRelocAux. NFC	2021-10-31 12:38:36 -07:00
Fangrui Song	55e69ece72	[ELF] Remove -Wl,-z,notext hint The hint does not pull its weight: * adding -Wl,-z,notext often won't work (relocation types other than `symbolRel`, e.g. `R_AARCH64_LDST32_ABS_LO12_NC`) * for pure (no assembly) C/C++ projects, the "-fPIC" hint is sufficient	2021-10-31 12:10:43 -07:00
Fangrui Song	b76aacef5f	[ELF] Simplify isStaticLinkTimeConstant. NFC	2021-10-31 10:46:42 -07:00
Fangrui Song	3fe4b54915	[ELF] Make getImplicitAddend return 0 for R_ARM_V4BX. NFC Will be useful if we move R_ARM_V4BX handling around.	2021-10-30 23:31:39 -07:00
Fangrui Song	aa1d32f519	[ELF][Mips] Use R_DTPREL for R_MIPS_TLS_DTPREL*	2021-10-30 21:58:43 -07:00
Nico Weber	2d48b19136	[lld/mac] Fix mislink with ICF When comparing relocations against two symbols, ICF's equalsConstant() did not look at the value of the two symbols. With subsections_via_symbols, the value is usually 0 but not always: In particular, it isn't 0 for constants in string and literal sections. Since we ignored the value, comparing two constant string symbols or two literal symbols always compared the 0th's element, so functions in the same TU always compared as equal. This can cause mislinks, and, with -dead_strip, crashes. Fixes PR52349, see that bug for lots of details and examples of mislinks. While here, make the existing assembly in icf-literals.s a bit more realistic (use leaq instead of movq with strings, and use foo(%rip) instead of foo@gotpcrel(%rip)). This has no interesting effect, it just maybe makes the test look a bit less surprising. Differential Revision: https://reviews.llvm.org/D112862	2021-10-30 18:58:59 -04:00
Sam Clegg	182b72aa48	[lld][WebAssembly] Generate TLS relocation code also when linking statically Previously relocations were only generated for PIC output, but relocations for TLS GOT entries are always needed when shared memory is enabled, not just in PIC mode. This means that the `__wasm_apply_global_tls_relocs` is now generated even for statically linked (non-PIC) output. Without this the globals that hold the addresses of TLS symbols are not set correctly. Differential Revision: https://reviews.llvm.org/D112833	2021-10-29 13:26:35 -07:00
Sam Clegg	fad05465c1	[lld][WebAssembly] Handle TLS variables in Symbol::getVA. NFC In the shared memory case we can always assume that TLS addresses are relative to __tls_base. In the non-shared memory case TLS variables are absolute, just like normal data addresses. This simplifies the code in calcNewValue so that TLS relocations no longer need special handling. Differential Revision: https://reviews.llvm.org/D112831	2021-10-29 10:45:30 -07:00
Jez Ng	6c2f26a159	[lld-macho] -all_load and -ObjC should not affect LC_LINKER_OPTION flags In particular, they should not cause archives to be eagerly loaded. This matches ld64's behavior. Fixes PR52246. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D112756	2021-10-29 11:00:28 -04:00
Jez Ng	a271f2410f	[lld-macho][nfc] Canonicalize all pointers to InputSections early on Having to remember to call `canonical()` all over the place is error-prone; let's do it in a centralized location instead. It also appears to improve performance slightly. base diff difference (95% CI) sys_time 0.984 ± 0.009 0.983 ± 0.014 [ -0.8% .. +0.6%] user_time 6.508 ± 0.035 6.475 ± 0.036 [ -0.8% .. -0.2%] wall_time 5.321 ± 0.034 5.300 ± 0.033 [ -0.7% .. -0.1%] samples 36 23 Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D112687	2021-10-29 11:00:28 -04:00
Fangrui Song	3a4b605bc1	[lld-macho] Internalize createFiles. NFC	2021-10-28 22:14:37 -07:00
Fangrui Song	6fcc19afb9	[ELF] Simplify R_TPREL formula after D111365	2021-10-28 21:03:53 -07:00
Fangrui Song	6e04ec801b	[docs] Fix docs-lld-html	2021-10-28 18:44:44 -07:00
Fangrui Song	e39c138f45	[ELF] Implement TLSDESC for x86-32 `-z rela` is also supported. Tested with: ``` cat > ./a.c <<eof #include <assert.h> int foo(); int bar(); int main() { assert(foo() == 2); assert(foo() == 4); assert(bar() == 2); assert(bar() == 4); } eof cat > ./b.c <<eof #include <stdio.h> __thread int tls0; extern __thread int tls1; int foo() { return ++tls0 + ++tls1; } static __thread int tls2, tls3; int bar() { return ++tls2 + ++tls3; } eof echo '__thread int tls1;' > ./c.c sed 's/ /\t/' > ./Makefile <<'eof' .MAKE.MODE = meta curDirOk=true CC := gcc -m32 -g -fpic -mtls-dialect=gnu2 LDFLAGS := -m32 -Wl,-rpath=. all: a0 a1 a2 run: all ./a0 && ./a1 && ./a2 c.so: c.o; ${LINK.c} -shared $> -o $@ bc.so: b.o c.o; ${LINK.c} -shared $> -o $@ b.so: b.o c.so; ${LINK.c} -shared $> -o $@ a0: a.o b.o c.o; ${LINK.c} $> -o $@ a1: a.o b.so; ${LINK.c} $> -o $@ a2: a.o bc.so; ${LINK.c} $> -o $@ eof ``` and glibc `elf/tst-gnu2-tls1`. `/usr/local/bin/ld` points to the freshly built `lld`. `bmake run && bmake CFLAGS=-O1 run` => ok. Differential Revision: https://reviews.llvm.org/D112582	2021-10-28 17:52:03 -07:00
Sam Clegg	1eb79e732c	[lld][WebAssembly] Initialize bss segments using memory.fill Previously we were relying on the dynamic loader to take care of this but it simple and correct for us to do it here instead. Now we initialize bss segments as part of `__wasm_init_memory` at the same time we initialize passive segments. In addition we extent the us of `__wasm_init_memory` outside of shared memory situations. Specifically it is now used to initialize bss segments when the memory is imported. Differential Revision: https://reviews.llvm.org/D112667	2021-10-28 17:15:08 -07:00
Sam Clegg	50bfc45109	[lld][WebAssemlby] Always enable mutable-globals feature in PIC mode This works around an issue where the feature can be forgotten in the case of LTO + object file with no functions. See: https://bugs.llvm.org/show_bug.cgi?id=52339 Differential Revision: https://reviews.llvm.org/D112769	2021-10-28 16:24:54 -07:00
Sam Clegg	28848e9e1b	[lld][WebAssembly] Handle duplicate archive member names in ThinLTO This entire change, including the test case, comes almost verbatim from the ELF driver. Fixes: https://github.com/emscripten-core/emscripten/issues/12763 Differential Revision: https://reviews.llvm.org/D112723	2021-10-28 11:48:04 -07:00
Sam Clegg	4da38c14d0	[lld] Rename addCombinedLTOObjects to match ELF driver. NFC This function was renamed in https://reviews.llvm.org/D62291. The new name seems more accurate and also its good to maintain some consistency between these methods in the different drivers. Differential Revision: https://reviews.llvm.org/D112719	2021-10-28 11:46:19 -07:00
Fangrui Song	2b1e32410c	[ELF] Change common diagnostics to report both object file location and source file location Many diagnostics use `getErrorPlace` or `getErrorLocation` to report a location. In the presence of line table debug information, `getErrorPlace` uses a source file location and ignores the object file location. However, the object file location is sometimes more useful. This patch changes "undefined symbol" and "out of range" diagnostics to report both object/source file locations. Other diagnostics can use similar format if needed. The key idea is to let `InputSectionBase::getLocation` report the object file location and use `getSrcMsg` for source file/line information. `getSrcMsg` doesn't leverage `STT_FILE` information yet, but I think the temporary lack of the functionality is ok. For the ARM "branch and link relocation" diagnostic, I arbitrarily place the source file location at the end of the line. The diagnostic is not very common so its formatting doesn't need to be pretty. Differential Revision: https://reviews.llvm.org/D112518	2021-10-28 09:38:45 -07:00
Sam Clegg	e091a66cb7	[lld][ELF] Update name of function in comment. NFC This function was renamed in https://reviews.llvm.org/D62291.	2021-10-28 07:29:43 -07:00
Vincent Lee	d54360cd32	[lld-macho] Implement -S There are a couple internal builds that require the use of this flag. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D112594	2021-10-27 17:09:57 -07:00
Nico Weber	7f369304df	[lld/mac] Don't crash on undefined symbols with --icf=all ICF runs before relocation processing, but undefined symbol errors are only emitted during relocation processing. So just ignore Undefineds during ICF (instead of crashing) -- lld will emit an error once ICF is done. Fixes PR52330. Differential Revision: https://reviews.llvm.org/D112643	2021-10-27 16:20:10 -04:00
Jez Ng	b7e12ca7aa	[lld-macho] If export_size is zero, export_off must be zero Otherwise tools like codesign_allocate will choke. We were already handling this correctly for the other DYLD_INFO sections. Doing this correctly is a bit subtle: we don't know if export_size will be zero until we have run `ExportSection::finalizeContents()`. However, we must still add the ExportSection to the `__LINKEDIT` segment in order that it gets sorted during `sortSectionsAndSegments()`. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D112589	2021-10-27 14:58:42 -04:00
Nico Weber	6503a68565	[lld/mac] Don't assert when ICFing arm64 code WordLiteralSection dedupes literals by content. WordLiteralInputSection::getOffset() used to read a literal at the passed-in offset and look up this value in the deduping map to find the offset of the deduped value. But it's possible that (e.g.) a 16-byte literal's value is accessed 4 bytes in. To get the offset at that address, we have to get the deduped value at offset 0 and then apply the offset 4 to the result. (See also WordLiteralSection::finalizeContents() which fills in those maps.) Only a problem on arm64 because in x86_64 the offset is part of the instruction instead of a separate ARM64_RELOC_ADDEND relocation. (See bug for more details.) Fixes PR51999. Differential Revision: https://reviews.llvm.org/D112584	2021-10-27 14:02:07 -04:00
Sam Clegg	1aeb4c4a43	[lld][WebAssebmly] Convert tests to use disassembly. NFC Differential Revision: https://reviews.llvm.org/D112590	2021-10-27 10:34:52 -07:00
Fangrui Song	ecc93ed2d7	[ELF] Replace InputBaseSection::{areRelocsRela,firstRelocation,numRelocation} with relSecIdx For `InputSection` `.foo`, its `InputBaseSection::{areRelocsRela,firstRelocation,numRelocation}` basically encode the information of `.rel[a].foo`. However, one uint32_t (the relocation section index) suffices. See the implementation of `relsOrRelas`. This change decreases sizeof(InputSection) from 184 to 176 on 64-bit Linux. The maximum resident set size linking a large application (1.2G output) decreases by 0.39%. Differential Revision: https://reviews.llvm.org/D112513	2021-10-27 09:51:07 -07:00
Fangrui Song	35c3f5610c	[ELF][X86] Write R_X86_64_TLSDESC addends with -z rel Similar to D100544 for AArch64. Reviewed By: arichardson Differential Revision: https://reviews.llvm.org/D112592	2021-10-27 09:35:30 -07:00
Nico Weber	9f90347588	fix comment typos to cycle bots	2021-10-27 09:53:08 -04:00
Jez Ng	1d2a4cd57d	[lld-macho] Fix compact-unwind-bad-reloc.s test Broken by `a9353dbe51`. Now that the functions point to the compact unwind entries, instead of the other way around, we need to perform the "invalid reference" check in a different place. This change was originally part of the stacked diff D109946, but should have been included as part of D109945.	2021-10-26 18:59:12 -04:00
Nuri Amari	a299b24712	Regenerate LC_CODE_SIGNATURE during llvm-objcopy operations Context: This is a second attempt at introducing signature regeneration to llvm-objcopy. In this diff: https://reviews.llvm.org/D109840, a script was introduced to test the validity of a code signature. In this diff: https://reviews.llvm.org/D109803 (now reverted), an effort was made to extract the signature generation behavior out of LLD into a common location for use in llvm-objcopy. In this diff: https://reviews.llvm.org/D109972 it was decided that there was no appropriate common location and that a small amount of duplication to bring signature generation to llvm-objcopy would be better. This diff introduces this duplication. Summary Prior to this change, if a LC_CODE_SIGNATURE load command was included in the binary passed to llvm-objcopy, the command and associated section were simply copied and included verbatim in the new binary. If rest of the binary was modified at all, this results in an invalid Mach-O file. This change regenerates the signature rather than copying it. The code_signature_lc.test test was modified to include the yaml representation of a small signed MachO executable in order to effectively test the signature generation. Reviewed By: alexander-shaposhnikov, #lld-macho Differential Revision: https://reviews.llvm.org/D111164	2021-10-26 14:51:13 -07:00
Jez Ng	a9353dbe51	[lld-macho] Simplify the handling of "no unwind info" functions This diff does away with `addEntriesForFunctionsWithoutUnwindInfo()`, because `addSymbol()` can now determine which functions need those entries. While overhauling UnwindInfoSection, I also parallelized the relocation of the contents of the CUEs. This somewhat offsets the time regression from creating one InputSection per CUE (which was done in D109944). Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D109945	2021-10-26 16:04:16 -04:00
Jez Ng	002eda7056	[lld-macho] Associate compact unwind entries with function symbols Compact unwind entries (CUEs) contain pointers to their respective function symbols. However, during the link process, it's far more useful to have pointers from the function symbol to the CUE than vice versa. This diff adds that pointer in the form of `Defined::compactUnwind`. In particular, when doing dead-stripping, we want to mark CUEs live when their function symbol is live; and when doing ICF, we want to dedup sections iff the symbols in that section have identical CUEs. In both cases, we want to be able to locate the symbols within a given section, as well as locate the CUEs belonging to those symbols. So this diff also adds `InputSection::symbols`. The ultimate goal of this refactor is to have ICF support dedup'ing functions with unwind info, but that will be handled in subsequent diffs. This diff focuses on simplifying `-dead_strip` -- `findFunctionsWithUnwindInfo` is no longer necessary, and `Defined::isLive()` is now a lot simpler. Moreover, UnwindInfoSection no longer has to check for dead CUEs -- we simply avoid adding them in the first place. Additionally, we now support stripping of dead LSDAs, which follows quite naturally since `markLive()` can now reach them via the CUEs. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D109944	2021-10-26 16:04:15 -04:00
Jez Ng	622150ad5f	[lld-macho] Put GOT into `__DATA` segment where appropriate We were previously always emitting the GOT into `__DATA_CONST`, even for target platforms where it should end up in `__DATA`. I stumbled onto this while trying to use the `class-dump` tool -- with the wrong segment names, it fails to locate the ObjC runtime info and therefore fails to dump any classes. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D112500	2021-10-26 11:38:01 -04:00
Vy Nguyen	e5fb79b314	[lld-macho] Make test produce the dead.o and live.o that are used below. Follow up fix to breakages in D112485	2021-10-25 22:10:24 -04:00
Vy Nguyen	46ef187dcc	[lld-macho] Fix incremental build (again) from D112485	2021-10-25 21:51:34 -04:00
Jez Ng	d3ddd569eb	[lld-macho] Fix incremental builds	2021-10-25 20:51:05 -04:00
Fangrui Song	3b42fc8a07	[ELF] Simplify sortSection. NFC	2021-10-25 16:57:46 -07:00
Jez Ng	413e249a47	[lld-macho][nfc] Test that we don't emit undef symbol errors for dead code This is what ld64 does too, so we have parity here (though I think ld64 still removes dead code more effectively than we do...) Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D112485	2021-10-25 19:05:39 -04:00
Fangrui Song	4d9f6caee3	[ELF] Change SharedFile::soName from std::string to StringRef	2021-10-25 15:54:04 -07:00
Fangrui Song	25da870057	[ELF] Remove irrelevant group signature hack working around old gold -r	2021-10-25 15:09:08 -07:00
Fangrui Song	43753f8f9d	[ELF] Remove irrelevant SHT_INIT_ARRAY/SHT_FINI_ARRAY hack The hack is irrelevant for two reasons: * binutils 2.24 is quite old and cannot handle R_X86_64_REX_GOTPCRELX from 2016 onwards anyway * `canMergeToProgbits` allows combining SHT_INIT_ARRAY/SHT_FINI_ARRAY into SHT_PROGBITS	2021-10-25 14:23:05 -07:00
Fangrui Song	6506907a0a	[ELF] Update comments/diagnostics for -defsym and -image-base to use the canonical two-dash form	2021-10-25 14:01:36 -07:00
Fangrui Song	ca8105b76c	[ELF][X86] Support R_X86_64_PLTOFF64 For a function call (using the default `-fplt`), GCC `-mcmodel=large` generates an assembly modifier which leads to an R_X86_64_PLTOFF64 relocation. In real world, http://git.ageinghacker.net/jitter (used by GNU poke) uses `-mcmodel=large`. R_X86_64_PLTOFF64's formula is (if preemptible) `L - GOT + A` or (if non-preemptible) `S - GOT + A` where `GOT` is (confusingly) the address of `.got.plt` Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D112386	2021-10-25 13:05:17 -07:00
Fangrui Song	a14ccaf509	[ELF] Support 128-bit bitmask in oneof(RelExpr) Taken from Chih-Mao Chen's D100835. RelExpr has 64 bits now and needs the extension to support new members (`R_PLT_GOTPLT` for `R_X86_64_PLTOFF64` support). Note: RelExpr needs to have at least a member >=64 to prevent -Wtautological-constant-out-of-range-compare for `if (expr >= 64)`. Reviewed By: arichardson, peter.smith Differential Revision: https://reviews.llvm.org/D112385	2021-10-25 13:05:17 -07:00
Fangrui Song	bf6e259b21	[ELF] Update comments/diagnostics for some long options to use the canonical two-dash form Rewrite some comments as appropriate.	2021-10-25 12:52:06 -07:00
Fangrui Song	4ae1c2c6f1	[ELF] Delete unneeded hack for discarding empty name local symbol This actually improves GNU ld compatibility. Correct assemblers don't create such symbols. Also simplify the code.	2021-10-25 11:55:31 -07:00
Vy Nguyen	7d549acbb6	[lld-macho][nfc] Rename output binary so it doesn't overwrite existing one `%t/basics` already exists - it would be nice to be able to examine it afterward Differential Revision: https://reviews.llvm.org/D112392	2021-10-25 09:55:40 -04:00
Fangrui Song	815a1207bf	[ELF] Remove ignored options that likely nobody uses GNU ld doesn't support `--no-pic-executable`. `-p` has been removed from likely the only use case (Linux kernel) for over 2.5 years: https://git.kernel.org/linus/091bb549f7722723b284f63ac665e2aedcf9dec9 `--no-add-needed` was the pre-binutils-2.23 spelling for `--no-copy-dt-needed-entries`. The legacy alias is irrelevant in 2021.	2021-10-24 18:29:45 -07:00
Kazu Hirata	4bd46501c3	Use llvm::any_of and llvm::none_of (NFC)	2021-10-24 17:35:33 -07:00
Kazu Hirata	4ba9d9c84f	Use StringRef::contains (NFC)	2021-10-23 20:41:46 -07:00
Vy Nguyen	236197e2d0	[lld-macho] Implement -oso_prefix https://bugs.llvm.org/show_bug.cgi?id=50229 Differential Revision: https://reviews.llvm.org/D112291	2021-10-22 16:32:42 -04:00
Jez Ng	77fdc0e56b	[lld-macho] Simplify lc-linker-option.ll and re-enable it on Windows While attempting to simplify it, I discovered a concerning discrepancy between our handling of LC_LINKER_OPTION vs ld64's. In particular, ld64 does not appear to check for `-all_load` nor `-ObjC` when processing those options. Thus, if/when we fix this behavior, no duplicate symbol error will be expected regardless of the use-after-free. As such, I've removed the test logic that tries to induce the duplicate symbol error. We can just rely on ASAN to do the verification. In order to make the test run on Windows, I've removed the symlink logic. Both ld64 and LLD handle this un-symlinked framework just fine. I also capitalized the framework name, since that's the typical convention. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D112195	2021-10-21 11:23:44 -04:00
Igor Kudrin	1302fdc233	[ELF] Avoid adding an orphan section to a less suitable segment If segments are defined in a linker script, placing an orphan section before the found closest-rank section can result in adding it in a previous segment and changing flags of that segment. This happens if the orphan section has a lower sort rank than the found section. To avoid that, the patch forces orphan sections to be moved after the found section if segments are explicitly defined. Differential Revision: https://reviews.llvm.org/D111717	2021-10-21 11:38:39 +07:00
Vy Nguyen	6b715e9c4d	[lld-macho][nfc] Added some notes on deliberate differences btw LD64 vs LLD-MACHO For future references and to help with debugging crashes, this could be useful. Differential Revision: https://reviews.llvm.org/D110464	2021-10-20 22:41:57 -04:00
Jez Ng	9ef55ddc3f	[lld-macho] Temporarily disable lc-linker-option.ll on Windows It's currently using a symlink, which is not supported on Windows.	2021-10-20 20:05:30 -04:00
Nico Weber	1412719066	[lld/mac] Remove else-after-return in ICF code No behavior change.	2021-10-20 14:24:13 -04:00
Kaining Zhong	aab0f2264a	[lld-macho] Fix dangling string reference when adding frameworks In Driver.cpp, addFramework used std::string instance to represent the path of a framework, which will be freed after the function returns. However, this string is stored in loadedArchive, which will be used later to compare with path of newly added frameworks. This caused https://bugs.llvm.org/show_bug.cgi?id=52133. A test is included in this commit to reproduce this bug. Now resolveDylibPath returns a StringRef instance, and it uses StringSaver to save its data, then returns it to functions on the top. This ensures the resolved framework path is still valid after LC_LINKER_OPTION is parsed. Reviewed By: int3, #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D111706	2021-10-20 11:21:40 -04:00
Paulo Matos	6d0c7bc17d	[WebAssembly] Implementation of table.get/set for reftypes in LLVM IR This change implements new DAG nodes TABLE_GET/TABLE_SET, and lowering methods for load and stores of reference types from IR arrays. These global LLVM IR arrays represent tables at the Wasm level. Differential Revision: https://reviews.llvm.org/D111154	2021-10-20 10:31:31 +02:00
Noah Shutty	e678c51177	[Support][ThinLTO] Move ThinLTO caching to LLVM Support library We would like to move ThinLTO’s battle-tested file caching mechanism to the LLVM Support library so that we can use it elsewhere in LLVM. Patch By: noajshu Differential Revision: https://reviews.llvm.org/D111371	2021-10-18 18:57:25 -07:00
Petr Hosek	8e46e34d24	Revert "[Support][ThinLTO] Move ThinLTO caching to LLVM Support library" This reverts commit `92b8cc52bb` since it broke the gold plugin.	2021-10-18 12:24:05 -07:00
Noah Shutty	92b8cc52bb	[Support][ThinLTO] Move ThinLTO caching to LLVM Support library We would like to move ThinLTO’s battle-tested file caching mechanism to the LLVM Support library so that we can use it elsewhere in LLVM. Patch By: noajshu Differential Revision: https://reviews.llvm.org/D111371	2021-10-18 12:08:49 -07:00
Kazu Hirata	8568ca789e	Use llvm::erase_if (NFC)	2021-10-18 09:33:42 -07:00
gbreynoo	f2c144fc18	[LLD][TEST] Add testing for negative addends for R_X86_64_32 and R_X86_64_PC32 relocations This change is derived from a test case we have locally but I could not see an equivalent in LLD's testing. Differential Revision: https://reviews.llvm.org/D111803	2021-10-18 16:38:33 +01:00
Kazu Hirata	10726992fa	Use llvm::erase_value (NFC)	2021-10-16 23:31:21 -07:00
Fangrui Song	f8ee74fc13	[ELF] Require two-dash form for --pack-dyn-relocs LLD specific options can be more rigid. Also add a test.	2021-10-15 15:36:30 -07:00
Sam Clegg	659a08399a	[WebAssembly] Add import info to `dylink` section of shared libraries See https://github.com/WebAssembly/tool-conventions/pull/175 Differential Revision: https://reviews.llvm.org/D111345	2021-10-15 11:49:16 -07:00
Nico Weber	4e572db0c2	[lld/mac] Mark private externs with GOT relocs as LOCAL in indirect symbtab prepareSymbolRelocation() in Writer.cpp adds both symbols that need binding and symbols relocated with a pointer relocation to the got. Pointer relocations are emitted for non-movq GOTPCREL(%rip) loads. (movqs become GOT_LOADs so that the linker knows they can be relaxed to leaqs, while others, such as addq, become just GOT -- a pointer relocation -- since they can't be relaxed in that way). For example, this C file produces a private_extern GOT relocation when compiled with -O2 with clang: extern const char kString[]; const char* g(int a) { return kString + a; } Linkers need to put pointer-relocated symbols into the GOT, but ld64 marks them as LOCAL in the indirect symbol table. This matters, since `strip -x` looks at the indirect symbol table when deciding what to strip. The indirect symtab emitting code was assuming that only symbols that need binding are in the GOT, but pointer relocations where there too. Hence, the code needs to explicitly check if a symbol is a private extern. Fixes https://crbug.com/1242638, which has some more information in comments 14 and 15. With this patch, the output of `nm -U` on Chromium Framework after stripping now contains just two symbols when using lld, just like with ld64. Differential Revision: https://reviews.llvm.org/D111852	2021-10-15 13:24:47 -04:00
Heejin Ahn	9261ee32dc	[WebAssembly] Make EH work with dynamic linking This makes Wasm EH work with dynamic linking. So far we were only able to handle destructors, which do not use any tags or LSDA info. 1. This uses `TargetExternalSymbol` for `GCC_except_tableN` symbols, which points to the address of per-function LSDA info. It is more convenient to use than `MCSymbol` because it can take additional target flags. 2. When lowering `wasm_lsda` intrinsic, if PIC is enabled, make the symbol relative to `__memory_base` and generate the `add` node. If PIC is disabled, continue to use the absolute address. 3. Make tag symbols (`__cpp_exception` and `__c_longjmp`) undefined in the backend, because it is hard to make it work with dynamic linking's loading order. Instead, we make all tag symbols undefined in the LLVM backend and import it from JS. 4. Add support for undefined tags to the linker. Companion patches: - https://github.com/WebAssembly/binaryen/pull/4223 - https://github.com/emscripten-core/emscripten/pull/15266 Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D111388	2021-10-12 23:28:27 -07:00
Nico Weber	f09dce564e	[lld] fix typos to cycle bots	2021-10-12 17:03:39 -04:00
Andrew Ng	649cc160e3	[ELF][test] Add testing for dynamic TLS relocations in .debug_info Differential Revision: https://reviews.llvm.org/D111436	2021-10-12 10:54:52 +01:00
Fangrui Song	71ec1e5015	[ELF] Demote !isUsedInRegularObj lazy symbol I think D79300 has fixed the D51892 (`__i686.get_pc_thunk.bx`) issue, so we can bring back rL330869. D79300 says `would error undefined symbol instead of the more relevant discarded section` but it doesn't reproduce now. This avoids a quirk in `isUndefWeak()`. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D111365	2021-10-11 09:46:31 -07:00
Ben Dunbobbin	aaeba6483f	[LLD] [TEST] Add test case for patching an absolute relocation to a weak undef I noticed that we had this case in our internal testsuite but couldn't find it in LLD's tests. This adds that case. Differential Revision: https://reviews.llvm.org/D110716	2021-10-11 13:14:45 +01:00
Keith Smiley	dfeaa1941b	[lld][test] Remove /usr/local/lib test requirement This field only exists if the directory exists on the machine running the test. It likely exists for most Intel macOS users because of homebrew, but doesn't exist on some of the CI machines. This unfortunately makes this test a bit less strict. Differential Revision: https://reviews.llvm.org/D111361	2021-10-07 15:17:52 -07:00
Keith Smiley	0885afb8b0	[lld][test] Fix darwin REQUIRES (NFC) Some subprojects like compiler-rt define the `darwin` feature in their lit config, but lld does not do that, so we need to use the global system-darwin here instead. This test seems to have drifted from the actual behavior so I also had to add `/usr/local/lib` here to make it pass. Differential Revision: https://reviews.llvm.org/D111268	2021-10-07 12:37:37 -07:00
Heejin Ahn	3ec1760d91	[WebAssembly] Remove WasmTagType This removes `WasmTagType`. `WasmTagType` contained an attribute and a signature index: ``` struct WasmTagType { uint8_t Attribute; uint32_t SigIndex; }; ``` Currently the attribute field is not used and reserved for future use, and always 0. And that this class contains `SigIndex` as its property is a little weird in the place, because the tag type's signature index is not an inherent property of a tag but rather a reference to another section that changes after linking. This makes tag handling in the linker also weird that tag-related methods are taking both `WasmTagType` and `WasmSignature` even though `WasmTagType` contains a signature index. This is because the signature index changes in linking so it doesn't have any info at this point. This instead moves `SigIndex` to `struct WasmTag` itself, as we did for `struct WasmFunction` in D111104. In this CL, in lib/MC and lib/Object, this now treats tag types in the same way as function types. Also in YAML, this removes `struct Tag`, because now it only contains the tag index. Also tags set `SigIndex` in `WasmImport` union, as functions do. I think this makes things simpler and makes tag handling more in line with function handling. These two shares similar properties in that both of them have signatures, but they are kind of nominal so having the same signature doesn't mean they are the same element. Also a drive-by fix: the reserved 'attirubute' part's encoding changed from uleb32 to uint8 a while ago. This was fixed in lib/MC and lib/Object but not in YAML. This doesn't change object files because the field's value is always 0 and its encoding is the same for the both encoding. This is effectively NFC; I didn't mark it as such just because it changed YAML test results. Reviewed By: sbc100, tlively Differential Revision: https://reviews.llvm.org/D111086	2021-10-05 17:11:22 -07:00
Heejin Ahn	9a9ec8e04b	[lld][WebAssembly] Remove redundant check for undefined global (NFC) Also does some refactoring. Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D111101	2021-10-05 15:11:27 -07:00
Sam Clegg	8fe128476e	[lld][WebAssembly] Create optional internal symbols only after LTO object as been added This is important for the cases where new symbols can be introduced during LTO. Specifically this happens for during TLS-lowering where references to `__tls_base` can be introduced. Fixes: https://github.com/emscripten-core/emscripten/issues/12489 Differential Revision: https://reviews.llvm.org/D111171	2021-10-05 13:31:09 -07:00
Andrew Ng	3334b9d70b	[ELF][test] Enhance relative dynamic relocation tests Add checking of the value of the relocation with an addend. Also check all relocation offsets. Differential Revision: https://reviews.llvm.org/D111071	2021-10-05 11:32:22 +01:00
Igor Kudrin	65c284a7be	[ELF][test][NFC] Make a test standard compliant PT_LOAD segments in the program header must be sorted by their virtual addresses, so they should be defined in a similar order as the associated sections. Differential Revision: https://reviews.llvm.org/D111068	2021-10-05 11:40:02 +07:00
Sam Clegg	c0039de295	[Object][WebAssemlby] Report function types (signatures). NFC This simplifies the code in a number of ways and avoids having to track functions and their types separately. Differential Revision: https://reviews.llvm.org/D111104	2021-10-04 17:33:56 -07:00
Nico Weber	f3091831f4	[lld] Use checkError more No behavior change.	2021-10-04 11:46:16 -04:00
Andrew Ng	39f3f7c08f	[ELF][test] Fix several LLD ICF tests A number of the ICF tests were not updated to use --print-icf-sections instead of --verbose and various '-NOT' checks were not updated to the latest output format of --print-icf-sections. Because these are all 'negative' tests, these issues have gone unnoticed. Differential Revision: https://reviews.llvm.org/D110353	2021-10-04 11:10:10 +01:00
Daniel Rodríguez Troitiño	657f02d458	Revert "Extract LC_CODE_SIGNATURE related implementation out of LLD" This reverts commit `cc8229603b`. As discussed in the review of https://reviews.llvm.org/D109972, this was not right approach, so we are reverting to start with a different approach. Differential Revision: https://reviews.llvm.org/D110974	2021-10-01 17:19:50 -07:00
Teresa Johnson	b55a964197	Second attempt to fix Windows failures from test changes Try to address Windows flakes from `d87bdc272b` by adding "\|\| true" as suggested in D110276 so the whole test doesn't fail when Windows thinks it can't remove the binary.	2021-09-29 19:24:35 -07:00
Teresa Johnson	2f1b99ca67	Use rm -f to fix Windows failures from test changes Try to address Windows flakes from `d87bdc272b` by using 'rm -f' instead of just 'rm' as discussed in D110276. For example: http://45.33.8.238/win/46115/step_7.txt	2021-09-29 08:01:22 -07:00
Nico Weber	c19315ef60	[lld/mac] Don't warn on both --icf=all and -no_deduplicate Instead, just make the later flag win, like usual. Implement this by making -no_deduplicate an actual alias for --icf=none at the Options.td level. Differential Revision: https://reviews.llvm.org/D110672	2021-09-29 08:25:21 -04:00
Teresa Johnson	d87bdc272b	Clean up large copies of binaries copied into temp directories in tests In looking at the disk space used by a ninja check-all, I found that a few of the largest files were copies of clang and lld made into temp directories by a couple of tests. These tests were added in D53021 and D74811. Clean up these copies after usage. Differential Revision: https://reviews.llvm.org/D110276	2021-09-28 17:04:09 -07:00
Shoaib Meenai	f9b3c18e74	[CodeGen] Fix wrapping personality symbol on ARM The ARM backend was explicitly setting global binding on the personality symbol. This was added without any comment in `a7ec2dcefd`, which introduced EHABI support (back in 2011). None of the other backends do anything equivalent, as far as I can tell. This causes problems when attempting to wrap the personality symbol. Wrapped symbols are marked as weak inside LTO to inhibit IPO (see https://reviews.llvm.org/D33621). When we wrap the personality symbol, it initially gets weak binding, and then the ARM backend attempts to change the binding to global, which causes an error in MC because of attempting to change the binding of a symbol from non-global to global (the error was added in https://reviews.llvm.org/D90108). Simply drop the ARM backend's explicit global binding setting to fix this. This matches all the other backends, and a large internal application successfully linked and ran with this change, so it shouldn't cause any problems. Test via LLD, since wrapping is required to exhibit the issue. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D110609	2021-09-28 15:01:05 -07:00
Fangrui Song	74a47e54be	[llvm-objdump] Fix -R display and support ET_EXEC * Add a newline before `DYNAMIC RELOCATION RECORDS` (see D101796) * Add the missing `OFFSET TYPE VALUE` line * Align columns Note: llvm-readobj/ELFDumper.cpp `loadDynamicTable` has sophisticated PT_DYNAMIC code which is unavailable in llvm-objdump. Reviewed By: jhenderson, Higuoxing Differential Revision: https://reviews.llvm.org/D110595	2021-09-28 09:58:27 -07:00
Fangrui Song	2bf06d9345	[ELF] Support symbol names with space in linker script expressions Fix PR51961 Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D110490	2021-09-27 09:50:42 -07:00
Fangrui Song	db6a00daa0	[ELF] Remove unneeded binding parameter from addOptionalRegular. NFC __rela_iplt_start uses spurious STB_WEAK, but it doesn't matter because STV_HIDDEN overrides the binding.	2021-09-25 15:47:27 -07:00
Fangrui Song	d23fd8ae89	[ELF] Replace noneRel = R__NONE with static constexpr. NFC All architectures define R__NONE to 0.	2021-09-25 15:16:44 -07:00
Fangrui Song	40cd4db442	[ELF] Default gotBaseSymInGotPlt to false (NFC for most architectures) Most architectures use .got instead of .got.plt, so switching the default can minimize customization. This fixes an issue for SPARC V9 which uses .got . AVR, AMDGPU, and MSP430 don't seem to use _GLOBAL_OFFSET_TABLE_.	2021-09-25 15:06:09 -07:00
Fangrui Song	a892c0e49e	[ELF][test] Improve test coverage	2021-09-25 11:57:54 -07:00
Mike Hommey	08ef24f6ab	Wrap xar/xar.h include in extern "C" block Without such wrapping, linking lld fails with missing symbols because of C++ symbol mangling with older versions of the MacOSX SDK, in which xar.h doesn't have an extern "C" block itself. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D110224	2021-09-23 09:37:30 +02:00
Fangrui Song	19d53d45f2	[ELF][AArch64] Refine and fix the condition when BTI/PAC PLT needs bti c (As I mentioned in https://reviews.llvm.org/D62609#1534158 , the condition for using bti c for executable can be loosened.) In two cases the address of a PLT may escape: * canonical PLT entry for a STT_FUNC * non-preemptible STT_GNU_IFUNC which is converted to STT_FUNC The first case can be detected with `needsPltAddr`. The second case is not straightforward to detect because for the Relocations.cpp created `directSym`, it's difficult to know whether the associated `sym` has exercised the `!needsPlt(expr)` code path. Just use the conservative `isInIplt` condition. A non-preemptible ifunc not referenced by non-GOT-generating non-PLT-generating relocations will have an unneeded `bti c`, but the cost is acceptable. The second case fixes a bug as well: a -shared link may have non-preemptible ifunc. Before the patch we did not emit `bti c` and could be wrong if the PLT address escaped. GNU ld doesn't handle the case: `relocation R_AARCH64_ADR_PREL_PG_HI21 against STT_GNU_IFUNC symbol 'ifunc2' isn't handled by elf64_aarch64_final_link_relocate` (https://sourceware.org/bugzilla/show_bug.cgi?id=28370) For -shared, if BTI is enabled but PAC is disabled, the PLT entry size increases from 16 to 24 because we have to select the PLT scheme early, but the cost is acceptable. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D110217	2021-09-22 11:51:09 -07:00
Hongtao Yu	d9b511d8e8	[CSSPGO] Set PseudoProbeInserter as a default pass. Currenlty PseudoProbeInserter is a pass conditioned on a target switch. It works well with a single clang invocation. It doesn't work so well when the backend is called separately (i.e, through the linker or llc), where user has always to pass -pseudo-probe-for-profiling explictly. I'm making the pass a default pass that requires no command line arg to trigger, but will be actually run depending on whether the CU comes with `llvm.pseudo_probe_desc` metadata. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D110209	2021-09-22 09:09:48 -07:00
Andrew Ng	05b1303421	[ELF][test] Restore important part of ICF alignment test Restore the checking of addresses in ICF test which was testing the behaviour of ICF with regards to different alignments of otherwise identical sections. Also make the test more robust to layout changes. Differential Revision: https://reviews.llvm.org/D110090	2021-09-22 14:15:33 +01:00
Amy Huang	6e994a833e	[lld] Remove timers.ll because inconsistent timers behavior causes the test to fail sometimes See https://reviews.llvm.org/D109904	2021-09-20 09:57:18 -07:00
Fangrui Song	a954bb18b1	[ELF] Add --why-extract= to query why archive members/lazy object files are extracted Similar to D69607 but for archive member extraction unrelated to GC. This patch adds --why-extract=. Prior art: GNU ld -M prints ``` Archive member included to satisfy reference by file (symbol) a.a(a.o) main.o (a) b.a(b.o) (b()) ``` -M is mainly for input section/symbol assignment <-> output section mapping (often huge output) and the information may appear ad-hoc. Apple ld64 ``` __Z1bv forced load of b.a(b.o) _a forced load of a.a(a.o) ``` It doesn't say the reference file. Arm's proprietary linker ``` Selecting member vsnprintf.o(c_wfu.l) to define vsnprintf. ... Loading member vsnprintf.o from c_wfu.l. definition: vsnprintf reference : _printf_a ``` --- --why-extract= gives the user the full data (which is much shorter than GNU ld -Map). It is easy to track a chain of references to one archive member with a one-liner, e.g. ``` % ld.lld main.o a_b.a b_c.a c.a -o /dev/null --why-extract=- \| tee stdout reference extracted symbol main.o a_b.a(a_b.o) a a_b.a(a_b.o) b_c.a(b_c.o) b() b_c.a(b_c.o) c.a(c.o) c() % ruby -ane 'BEGIN{p={}}; p[$F[1]]=[$F[0],$F[2]] if $.>1; END{x="c.a(c.o)"; while y=p[x]; puts "#{y[0]} extracts #{x} to resolve #{y[1]}"; x=y[0] end}' stdout b_c.a(b_c.o) extracts c.a(c.o) to resolve c() a_b.a(a_b.o) extracts b_c.a(b_c.o) to resolve b() main.o extracts a_b.a(a_b.o) to resolve a ``` Archive member extraction happens before --gc-sections, so this may not be a live path under --gc-sections, but I think it is a good approximation in practice. * Specifying a file avoids output interleaving with --verbose. * Required `=` prevents accidental overwrite of an input if the user forgets `=`. (Most of compiler drivers' long options accept `=` but not ` `) Differential Revision: https://reviews.llvm.org/D109572	2021-09-20 09:52:30 -07:00
Fangrui Song	d001ab82e4	[ELF] Don't fall back to .text for e_entry We have the rule to simulate (https://sourceware.org/binutils/docs/ld/Entry-Point.html), but the behavior is questionable (https://sourceware.org/pipermail/binutils/2021-September/117929.html). gold doesn't fall back to .text. The behavior is unlikely relied by projects (there is even a warning for executable links), so let's just delete this fallback path. Reviewed By: jhenderson, peter.smith Differential Revision: https://reviews.llvm.org/D110014	2021-09-20 09:35:12 -07:00
Nico Weber	1b2c36aa5f	[lld/mac] Fix comment typo to cycle bots	2021-09-18 11:15:21 -04:00
Amy Huang	724a1dff8a	[lld] Fix small error in previous commit `6f7483b1ec`.	2021-09-17 17:47:21 -07:00
Amy Huang	6f7483b1ec	Reland "[LLD] Remove global state in lld/COFF" after fixing asan and msan test failures Original commit description: [LLD] Remove global state in lld/COFF This patch removes globals from the lldCOFF library, by moving globals into a context class (COFFLinkingContext) and passing it around wherever it's needed. See https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html for context about removing globals from LLD. I also haven't moved the `driver` or `config` variables yet. Differential Revision: https://reviews.llvm.org/D109634 This reverts commit `a2fd05ada9`. Original commits were `b4fa71eed3` and `e03c7e367a`.	2021-09-17 17:18:42 -07:00
Jez Ng	91ace9f062	[lld-macho] Construct CFString literals by copying the ConcatInputSection ... instead of constructing a new one each time. This allows us to take advantage of {D105305}. I didn't see a substantial difference when linking chromium_framework, but this paves the way for reusing similar logic for splitting compact unwind entries into sections. There are a lot more of those, so the performance impact is significant. Differential Revision: https://reviews.llvm.org/D109895	2021-09-17 19:46:20 -04:00
Vy Nguyen	b428c3e8c1	[lld-macho] Ignore local personality symbols if non-local with the same name exisst, to avoid "too many personalities" error. Sometimes people intentionally re-define a dylib personlity symbol as a local defined symbol as a workaround to a ld -r bug. As a result, we could see "too many personalities" to encode. This patch tries to handle this case by ignoring the local symbols entirely. Differential Revision: https://reviews.llvm.org/D107533	2021-09-17 12:59:42 -04:00
Nuri Amari	aaf00f3f19	Add MachO signature verification test Add a test to ensure that MachO files including a LC_CODE_SIGNATURE load command produced by lld are signed correctly. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D109840	2021-09-16 17:55:32 -07:00
Nuri Amari	cc8229603b	Extract LC_CODE_SIGNATURE related implementation out of LLD Move the functionality in lld that handles writing of the LC_CODE_SIGNATURE load command and associated data section to a central reusable location. This change is in preparation for another change that modifies llvm-objcopy to reproduce the LC_CODE_SIGNATURE load command and corresponding data section to maintain the validity of signed macho object files passed through llvm-objcopy. Reviewed By: #lld-macho, int3, oontvoo Differential Revision: https://reviews.llvm.org/D109803	2021-09-16 17:43:39 -07:00
Fangrui Song	1d08a19a38	[ELF] Clarify --export-dynamic-symbol/--dynamic-list. NFC	2021-09-16 17:13:08 -07:00
Amy Huang	a2fd05ada9	Temporarily revert "[LLD] Remove global state in lld/COFF" and "[lld] Add test to check for timer output" Seems to be causing a number of asan test failures. This reverts commit `b4fa71eed3` and `e03c7e367a`.	2021-09-16 11:58:11 -07:00
Amy Huang	e03c7e367a	[lld] Add test to check for timer output This test checks that timers are working and printing as expected. I also seem to have changed the order of the timers in my globals refactoring patch, so I fixed it here. Differential Revision: https://reviews.llvm.org/D109904	2021-09-16 11:36:46 -07:00
Amy Huang	b4fa71eed3	[LLD] Remove global state in lld/COFF This patch removes globals from the lldCOFF library, by moving globals into a context class (COFFLinkingContext) and passing it around wherever it's needed. See https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html for context about removing globals from LLD. I also haven't moved the `driver` or `config` variables yet. Differential Revision: https://reviews.llvm.org/D109634	2021-09-16 11:00:23 -07:00
Alfonso Gregory	a2c319fdc6	[LLVM][CMake][NFC] Resolve FIXME: Rename LLVM_CMAKE_PATH to LLVM_CMAKE_DIR throughout the project This way, we do not need to set LLVM_CMAKE_PATH to LLVM_CMAKE_DIR when (NOT LLVM_CONFIG_FOUND) Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D107717	2021-09-16 18:29:57 +02:00
Thomas Lively	962acf0a27	[lld][WebAssembly] Use llvm-objdump to test __wasm_init_memory Rather than depending on the hex dump from obj2yaml. Now the test shows the expected function body in a human readable format. Differential Revision: https://reviews.llvm.org/D109730	2021-09-14 18:07:59 -07:00
Nico Weber	ed2f0ad307	[lld/mac] Search .tbd before binary for framework files too This matters for example for the iPhoneSimulator14.0.sdk, which has a System/Library/Frameworks/UIKit.framework/UIKit that has LC_BUILD_VERSION with minos of 14.0, so linking against that file will produce warnings like: .../iPhoneSimulator14.0.sdk/System/Library/Frameworks/UIKit.framework/UIKit has version 14.0.0, which is newer than target minimum of 12.0.0 when targeting x86_64-apple-ios12.0-simulator. That doens't happen when linking against UIKit.tbd instead, obviously. Linking with RC_TRACE_DYLIB_SEARCHING=1 shows that ld64 also searches the tbd file first, and we already get that right for non-framework dylibs. Fixes crbug.com/1249456. Differential Revision: https://reviews.llvm.org/D109768	2021-09-14 15:26:45 -04:00
Sam Clegg	6ee55f9ab5	Fix test failure created by `ef8c9135ef` Followup to https://reviews.llvm.org/D108877 to fix test failure.	2021-09-14 07:35:05 -07:00
Sam Clegg	ef8c9135ef	[WebAssembly] Allow import and export of TLS symbols between DSOs We previously had a limitation that TLS variables could not be exported (and therefore could also not be imported). This change removed that limitation. Differential Revision: https://reviews.llvm.org/D108877	2021-09-14 06:47:37 -07:00
Thomas Lively	b2032f18c9	[lld][WebAssembly] Relax limitations on multithreaded instantiation For multithreaded modules (i.e. modules with a shared memory), lld injects a synthetic Wasm start function that is automatically called during instantiation to initialize memory from passive data segments. Even though the module will be instantiated separately on each thread, memory initialization should happen only once. Furthermore, memory initialization should be finished by the time each thread finishes instantiation. Since multiple threads may be instantiating their modules at the same time, the synthetic function must synchronize them. The current synchronization tries to atomically increment a flag from 0 to 1 in memory then enters one of two cases. First, if the increment was successful, the current thread is responsible for initializing memory. It does so, increments the flag to 2 to signify that memory has been initialized, then notifies all threads waiting on the flag. Otherwise, the thread atomically waits on the flag with an expected value of 1 until memory has been initialized. Either the initializer thread finishes initializing memory (i.e. sets the flag to 2) first and the waiter threads do not end up blocking, or the waiter threads succesfully start waiting before memory is initialized so they will be woken by the initializer thread once it has finished. One complication with this scheme is that there are various contexts on the Web, most notably on the main browser thread, that cannot successfully execute a wait. Executing a wait in these contexts causes a trap, and in this case would cause instantiation to fail. The embedder must therefore ensure that these contexts win the race and become responsible for initializing memory, since that is the only code path that does not execute a wait. Unfortunately, since only one thread can win the race and initialize memory, this scheme makes it impossible to have multiple threads in contexts that cannot wait. For example, it is not currently possible to instantiate the module on both the main browser thread as well as in an AudioWorklet. To loosen this restriction, this commit inserts an extra check so that the wait will not be executed at all when memory has already been initialized, i.e. when the flag value is 2. After this change, the module can be instantiated on threads in non-waiting contexts as long as the embedder can guarantee either that the thread will win the race and initialize memory (as before) or that memory has already been initialized when instantiation begins. Threads in contexts that can wait can continue racing to initialize memory. Fixes (or at least improves) PR51702. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D109722	2021-09-13 15:03:51 -07:00
Sam Clegg	b78c85a44a	[WebAssembly] Convert to new "dylink.0" section format This format is based on sub-sections (like the "linking" and "name" sections) and is therefore easier to extend going forward. spec change: https://github.com/WebAssembly/tool-conventions/pull/170 binaryen change: https://github.com/WebAssembly/binaryen/pull/4141 wabt change: https://github.com/WebAssembly/wabt/pull/1707 emscripten change: https://github.com/emscripten-core/emscripten/pull/15019 Differential Revision: https://reviews.llvm.org/D109595	2021-09-12 05:30:38 -07:00
Sam Clegg	3a7bcba34b	[lld][WebAssembly] Cleanup output of --verbose Remove some unnecessary logging from wasm-ld when running under `--verbose`. Unlike `-debug` this logging is available in release builds. This change makes it little more minimal/readable. Also, avoid compiling the `debugWrite` function in releaase builds where it does nothing. This should remove a lot debug strings from the binary, and avoid having to construct unused debug strings at runtime. Differential Revision: https://reviews.llvm.org/D109583	2021-09-10 11:35:50 -04:00
Fangrui Song	bcc34ab6c8	[lld] Enable ANSI escape code for Windows Buffered diagnostics need ENABLE_VIRTUAL_TERMINAL_PROCESSING after D87272. Do it unconditionally like FileCheck.	2021-09-09 16:51:11 -07:00
Sam Clegg	6355234660	[lld][WebAssembly] Fix crash on un-used __tls_base symbol In the case that TLS is used in the single-threaded program, and therefore effectively lowered away, we still optionally create a `__tls_base` symbols, but the code for setting it was assuming it was always created. Differential Revision: https://reviews.llvm.org/D109518	2021-09-09 12:45:58 -04:00
Fangrui Song	0db402c5b4	[lld] Buffer writes when composing a single diagnostic llvm::errs() is unbuffered. On a POSIX platform, composing a diagnostic string may invoke the ::write syscall multiple times, which can be slow. Buffer writes to a temporary SmallString when composing a single diagnostic to reduce the number of ::write syscalls to one (also easier to read under strace/truss). For an invocation of ld.lld with 62000+ lines of `ld.lld: warning: symbol ordering file: no such symbol: ` warnings (D87121), the buffering decreases the write time from 1s to 0.4s (for /dev/tty) and from 0.4s to 0.1s (for a tmpfs file). This can speed up `relocation R_X86_64_PC32 out of range` diagnostic printing as well with `--noinhibit-exec --no-fatal-warnings`. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D87272	2021-09-09 09:27:14 -07:00
Sam Clegg	44177e5fb2	[WebAssembly] Add explict TLS symbol flag As before we maintain backwards compat with older object files by also infering the TLS flag based on the name of the segment. This change is was split out from https://reviews.llvm.org/D108877. Differential Revision: https://reviews.llvm.org/D109426	2021-09-09 10:03:30 -04:00
Fangrui Song	aa4dfba522	[ELF] Infer EM_HEXAGON in getBitcodeMachineKind	2021-09-07 20:46:37 -07:00
Fangrui Song	abd80ecf6e	[ELF][test] Improve gitBitcodeMachineKind tests	2021-09-07 11:38:43 -07:00
Jez Ng	d9ab62ca3d	[lld-macho] Initialize LTO backend with diagnostic handler Failing to do so results in `std::bad_function_call` being thrown when a pass tries to emit a diagnostic. I've copied the relevant test over from LLD-ELF's test suite. Reviewed By: #lld-macho, thevinster Differential Revision: https://reviews.llvm.org/D109274	2021-09-04 17:40:07 -04:00
David Blaikie	bc066e26c9	DebugInfo: Fix a few bot failures for type dumping fixes	2021-09-03 14:08:58 -07:00
Nico Weber	c15b588852	[lld/mac] Don't assert during thunk insertion if there are undefined symbols We end up calling resolveBranchVA(), which asserts for Undefineds. As fix, just return early in Writer::run() if there are any diagnostics after processing relocations (which is where undefined symbol errors are emitted). This matches what the ELF port does. Differential Revision: https://reviews.llvm.org/D109079	2021-09-03 12:22:41 -04:00
Nico Weber	9d22754389	Fix lld build after `5881dcff7e`	2021-09-02 15:07:10 -04:00
Sid Manning	0d7e5daedc	[lld][Hexagon] Add checks for instructions that can have TLS relocations Several instructions with potential TLS relocations were missing. This issue was found when building the Canadian LLVM toolchain.	2021-09-01 13:15:18 -07:00
Alexandre Ganea	7f0664f193	[LLD][COFF] Clean paths in PDB even when /pdbsourcepath is omitted Differential Revision: https://reviews.llvm.org/D109030	2021-08-31 19:05:10 -04:00
Fangrui Song	f9277caffc	[ELF][test] Fix R_AARCH64_ADR_PREL_PG_HI21 typo Found by redfast00	2021-08-31 13:09:55 -07:00
Nico Weber	86c8f395ae	[lld/mac] Leave more room for thunks in thunk placement code Fixes PR51578 in practice. Currently there's only enough room for a single thunk, which for real-life code isn't enough. The error case only happens when there are many branch statements very close to each other (0 or 1 instructions apart), with the function at the finalization barrier small. There's a FIXME on what to do if we hit this case, but that suggestion sounds complicated to me (see end of PR51578 comment 5 for why). Instead, just leave more room for thunks. Chromium's unit_tests links fine with room for 3 thunks. Leave room for 100, which should fix this for most cases in practice. There's little cost for leaving lots of room: This slop value only determines when we finalize sections, and we insert thunks for forward jumps into unfinalized sections. So leaving room means we'll need a few more thunks, but the thunk jump range is 128 MiB while a single thunk is just 12 bytes. For Chromium's unit_tests: With a slop of 3: thunk calls = 355418, thunks = 10903 With a slop of 100: thunk calls = 355426, thunks = 10904 Chances are 100 is enough for all use cases we'll hit in practice, but even bumping it to 1000 would probably be fine. Differential Revision: https://reviews.llvm.org/D108930	2021-08-30 22:09:05 -04:00
Nico Weber	83df94067d	[lld/mac] Tweak estimateStubsInRangeVA a bit - Move a few variables closer to their uses, remove some completely (no behavior change) - Add some comments - Make maxPotentialThunks include calls to stubs. It's possible that an earlier call to a stub late in the stub table will need a thunk, and that inserted thunk could push a stub earlier in the stub table out of range. This is unlikely to happen, but usually there are way fewer stub calls than non-stub calls, so if we're doing a conservative approximation here we might as well do it correctly. (For chromium's unit_tests target, 134421/242639 stub calls are direct calls without this change, compared to 134408/242639 with this change) No real, meaningful behavior difference. Differential Revision: https://reviews.llvm.org/D108924	2021-08-30 13:56:45 -04:00
Nico Weber	9721197520	[lld/mac] Set branchRange a bit more carefully - Don't subtract thunkSize from branchRange. Most places care about the actual maximal branch range. Subtract thunkSize in the one place that wants to leave room for a thunk. - Set it to 0x800_0000 instead of 0xFF_FFFF - Subtract 4 for the positive branch direction since it's a two's complement 24bit number sign-extended mutiplied by 4, so its range is -0x800_0000..+0x7FF_FFFC - Make boundary checks include the boundary values This doesn't make a huge difference in practice. It's preparation for a "real" fix for PR51578 -- but it also lets the repro in comment 0 in that bug place one more thunk before hitting the TODO. Differential Revision: https://reviews.llvm.org/D108897	2021-08-30 12:36:06 -04:00
Fangrui Song	3726039561	[ELF] Simplify addGotEntry. NFC	2021-08-29 13:40:08 -07:00
Fangrui Song	d3fdc312b2	[ELF] Untangle TLS IE and regular GOT from addGotEntry for non-mips. NFC	2021-08-29 13:21:06 -07:00
Fangrui Song	1861160697	[ELF] Move handleTlsRelocations. NFC Prepare for addGotEntry simplification.	2021-08-29 13:11:35 -07:00
Fangrui Song	204b2902d5	[ELF] Remove unused processRelocAux argument. NFC	2021-08-29 12:07:56 -07:00
Nico Weber	28be02f334	[lld/mac] Don't assert on -dead_strip + arm64 range extension thunks The assert is harmless and thinks worked fine in builds with asserts enabled, but it's still nice to fix the assert. Differential Revision: https://reviews.llvm.org/D108853	2021-08-27 23:27:45 -04:00
Pirama Arumuga Nainar	9632ce14e4	[lld/test/ELF] Test fetch from archive to resolve undefined symbols in shared libs Add missing test coverage uncovered in review of D108006. Differential Revision: https://reviews.llvm.org/D108328	2021-08-27 14:17:32 -07:00
Nico Weber	34ac7a7ac1	[lld/COFF] Ignore /LTCG, /LTCG:, /LTCGOUT:, /ILK: flags We currently complain "could not open /LTCG: no such file or directory", which isn't very useful. We could emit a warning when we see this flag, but just ignoring it seems fine. Final missing part of PR38799. Differential Revision: https://reviews.llvm.org/D108799	2021-08-27 09:13:30 -04:00
Nico Weber	66dc44f703	[lld/COFF] Use P_priv more P_priv does the same as the old QF further down. Standardize on P_priv. No behavior change. Differential Revision: https://reviews.llvm.org/D108798	2021-08-27 08:48:05 -04:00
Jez Ng	c74eb05f21	[lld-macho][nfc] Clean up InputSection constructors	2021-08-26 19:07:48 -04:00
Jez Ng	9b5148d426	[lld-macho] Have -ObjC load archive members before symbol resolution This is what ld64 does. Deviating in behavior here can result in some subtle duplicate symbol errors, as detailed in the objc.s test. Differential Revision: https://reviews.llvm.org/D108781	2021-08-26 18:52:07 -04:00
Jez Ng	9065fe5591	[lld-macho] Refactor archive loading The previous logic was duplicated between symbol-initiated archive loads versus flag-initiated loads (i.e. `-force_load` and `-ObjC`). This resulted in code duplication as well as redundant work -- we would create Archive instances twice whenever we had one of those flags; once in `getArchiveMembers` and again when we constructed the ArchiveFile. This was motivated by an upcoming diff where we load archive members containing ObjC-related symbols before loading those containing ObjC-related sections, as well as before performing symbol resolution. Without this refactor, it would be difficult to do that while avoiding loading the same archive member twice. Differential Revision: https://reviews.llvm.org/D108780	2021-08-26 18:52:07 -04:00
Jez Ng	2179930868	[lld-macho] Fix unwind info personality size This was missed by {D107035}. This fix addresses the following warning: loop variable 'personality' has type 'const uint32_t &' (aka 'const unsigned int &') but is initialized with type 'const unsigned long long' resulting in a copy [-Wrange-loop-analysis] In addition to fixing the size, I also removed the const reference, since there's no performance benefit to avoiding copies of integer-sized values.	2021-08-26 18:52:06 -04:00
Nico Weber	400a1de3ac	[lld/COFF] Improve handling of the /manifestdependency: flag If multiple /manifestdependency: flags are passed, they are naively deduped, but after that each of them should have an effect, instead of just the last one. Also, /manifestdependency: flags are allowed in .drectve sections (from `#pragma comment(linker, ...`). To make the interaction between /manifestdependency: flags enabling manifest by default but /manifest:no overriding this work, add an explict ManifestKind::Default state to represent no explicit /manifest flag being passed. To make /manifestdependency: flags from input file .drectve sections work with /manifest:embed, delay embedded manifest emission until after input files have been read. Differential Revision: https://reviews.llvm.org/D108628	2021-08-25 14:36:32 -04:00
Heejin Ahn	77b921b870	[WebAssembly] Tidy up EH/SjLj options This CL is small, but the description can be a little long because I'm trying to sum up the status quo for Emscripten/Wasm EH/SjLj options. First, this CL adds an option for Wasm SjLj (`-wasm-enable-sjlj`), which handles SjLj using Wasm EH. The implementation for this will be added as a followup CL, but this adds the option first to do error checking. This also adds an option for Wasm EH (`-wasm-enable-eh`), which has been already implemented. Before we used `-exception-model=wasm` as the same meaning as enabling Wasm EH, but after we add Wasm SjLj, it will be possible to use Wasm EH instructions for Wasm SjLj while not enabling EH, so going forward, to use Wasm EH, `opt` and `llc` will need this option. This only affects `opt` and `llc` command lines and does not affect Emscripten user interface. Now we have two modes of EH (Emscripten/Wasm) and also two modes of SjLj (also Emscripten/Wasm). The options corresponding to each of are: - Emscripten EH: `-enable-emscripten-cxx-exceptions` - Emscripten SjLj: `-enable-emscripten-sjlj` - Wasm EH: `-wasm-enable-eh -exception-model=wasm` `-mattr=+exception-handling` - Wasm SjLj: `-wasm-enable-sjlj -exception-model=wasm` `-mattr=+exception-handling` The reason Wasm EH/SjLj's options are a little complicated are `-exception-model` and `-mattr` are common LLVM options ane not under our control. (`-mattr` can be omitted if it is embedded within the bitcode file.) And we have the following rules of the option composition: - Emscripten EH and Wasm EH cannot be turned on at the same itme - Emscripten SjLj and Wasm SjLj cannot be turned on at the same time - Wasm SjLj should be used with Wasm EH Which means we now allow these combinations: - Emscripten EH + Emscripten SjLj: the current default in `emcc` - Wasm EH + Emscripten SjLj: This is allowed, but only as an interim step in which we are testing Wasm EH but not yet have a working implementation of Wasm SjLj. This will error out (D107687) in compile time if `setjmp` is called in a function in which Wasm exception is used. - Wasm EH + Wasm SjLj: This will be the default mode later when using Wasm EH. Currently Wasm SjLj implementation doesn't exist, so it doesn't work. - Emscripten EH + Wasm SjLj will not work. This CL moves these error checking routines to `WebAssemblyPassConfig::addIRPasses`. Not sure if this is an ideal place to do this, but I couldn't find elsewhere. Currently some checking is done within LowerEmscriptenEHSjLj, but these checks only run if LowerEmscriptenEHSjLj runs so it may not run when Wasm EH is used. This moves that to `addIRPasses` and adds some more checks. Currently LowerEmscriptenEHSjLj pass is responsible for Emscripten EH and Emscripten SjLj. Wasm EH transformations are done in multiple places, including WasmEHPrepare, LateEHPrepare, and CFGStackify. But in the followup CL, LowerEmscriptenEHSjLj pass will be also responsible for a part of Wasm SjLj transformation, because WasmSjLj will also be using several Emscripten library functions, and we will be sharing more than half of the transformation to do that between Emscripten SjLj and Wasm SjLj. Currently we have `-enable-emscripten-cxx-exceptions` and `-enable-emscripten-sjlj` but these only work for `llc`, because for `llc` we feed these options to the pass but when we run the pass using `opt` the pass will be created with no options and the default options will be used, which turns both Emscripten EH and Emscripten SjLj on. Now we have one more SjLj option to care for, LowerEmscriptenEHSjLj pass needs a finer way to control these options. This CL removes those default parameters and make LowerEmscriptenEHSjLj pass read directly from command line options specified. So if we only run `opt -wasm-lower-em-ehsjlj`, currently both Emscripten EH and Emscripten SjLj will run, but with this CL, none will run unless we additionally pass `-enable-emscripten-cxx-exceptions` or `-enable-emscripten-sjlj`, or both. This does not affect users; this only affects our `opt` tests because `emcc` will not call either `opt` or `llc`. As a result of this, our existing Emscripten EH/SjLj tests gained one or both of those options in their `RUN` lines. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D107685	2021-08-24 17:54:39 -07:00
Sam Clegg	c468dc1b12	[lld][WebAssembly] Handle weakly defined symbols in shared libraries. In the case of weakly defined symbols in shared libraries we now generate both an import and an export. The dynamic linker can then choose how a winner from among all the shared libraries that define a given symbol. Previously any direct usage of a weakly defined symbol would use the DSO-local definition (For example, even through there would be single address for a weakly defined function, each DSO could end up directly calling its local version). Fixes: https://github.com/emscripten-core/emscripten/issues/13773 Differential Revision: https://reviews.llvm.org/D108413	2021-08-19 19:25:49 -04:00
Sam Clegg	e4888be74e	[WebAssembly] Avoid unused function imports in PIC mode In PIC mode we import function address via `GOT.mem` imports but for direct function calls we still import the first class function. However, if the function is never directly called we can avoid the first class import completely. Differential Revision: https://reviews.llvm.org/D108345	2021-08-18 22:31:04 -04:00
Sam Clegg	12b1dc0467	[WebAssembly][lld] Convert signature-mismatch.ll test to asm. NFC Differential Revision: https://reviews.llvm.org/D108346	2021-08-18 22:17:02 -04:00
Fangrui Song	f74b70ef57	[lld-macho][test] Remove ld64.lld: prefix in a diagnostic The convention is not to check the prefix before `error: `. This gives flexibility if we need to rename ld64.lld to something else, (e.g. a while ago we used ld64.lld.darwinnew).	2021-08-16 19:41:12 -07:00
Fangrui Song	54e76cb17a	[split-file] Default to --no-leading-lines It turns out that the --leading-lines may be a bad default. [[#@LINE+-num]] is rarely used.	2021-08-16 19:23:11 -07:00
Vincent Lee	08d55c5c01	[lld-macho] Refactor parseSections to avoid creating isec on LLVM segments Address post follow up comment in D108016. Avoid creating isec for LLVM segments since we are skipping over it. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D108167	2021-08-16 18:47:50 -07:00
Vincent Lee	15dc93e61c	[lld-macho] Ignore LLVM segments to prevent duplicate syms There was an instance of a third-party archive containing multiple _llvm symbols from different files that clashed with each other producing duplicate symbols. Symbols under the LLVM segment don't seem to be producing any meaningful value, so just ignore them. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D108016	2021-08-16 12:41:03 -07:00
Martin Storsjö	f8340c8c5d	[LLD] [MinGW] Add more options for disabling flags in the executable In `e72403f96d`, we added the flag "--no-dynamicbase" for disabling the dynamicbase flag which we set by default. At the time, ld.bfd didn't have any corresponding option (as ld.bfd defaulted to not setting the flag). Almost at the same time, corresponding options were added to ld.bfd for disabling it (while it was being enabled by default), with a different name, "--disable-dynamicbase". Thus add the "--disable-dynamicbase" option. Make this default one advertised in the help listing, but keep the "--no-dynamicbase" form as an alias. Also improve checking for the last option set if there are multiple ones on the same command line. Also add corresponding disable options for a lot of other flags that we set by default, also added in ld.bfd in the same commit: https://sourceware.org/git/?p=binutils-gdb.git;a=commitdiff;h=514b4e191d5f46de8e142fe216e677a35fa9c4bb Differential Revision: https://reviews.llvm.org/D107930	2021-08-12 13:27:09 +03:00
Reid Kleckner	fb9a075c81	[lld] Add llvm-profdata to lld test deps As of https://reviews.llvm.org/D104431, the test suite runs llvm-profdata, so it must be added to the list of deps.	2021-08-11 11:52:40 -07:00
Yolanda Chen	8fa16cc628	[LTO][lld] Add lto-pgo-warn-mismatch option When enable CSPGO for ThinLTO, there are profile cfg mismatch warnings that will cause lld-link errors (with /WX) due to source changes (e.g. `#if` code runs for profile generation but not for profile use) To disable it we have to use an internal "/mllvm:-no-pgo-warn-mismatch" option. In contrast clang uses option ”-Wno-backend-plugin“ to avoid such warnings and gcc has an explicit "-Wno-coverage-mismatch" option. Add "lto-pgo-warn-mismatch" option to lld COFF/ELF to help turn on/off the profile mismatch warnings explicitly when build with ThinLTO and CSPGO. Differential Revision: https://reviews.llvm.org/D104431	2021-08-11 09:45:55 -07:00
Wang, Pengfei	6c4809825d	Revert "[lld] Add lto-pgo-warn-mismatch option" This reverts commit `0cfb00a1c9`.	2021-08-11 16:25:42 +08:00
Yolanda Chen	0cfb00a1c9	[lld] Add lto-pgo-warn-mismatch option When enable CSPGO for ThinLTO, there are profile cfg mismatch warnings that will cause lld-link errors (with /WX). To disable it we have to use an internal "/mllvm:-no-pgo-warn-mismatch" option. In contrast clang uses option ”-Wno-backend-plugin“ to avoid such warnings and gcc has an explicit "-Wno-coverage-mismatch" option. Add this "lto-pgo-warn-mismatch" option to lld to help turn on/off the profile mismatch warnings explicitly when build with ThinLTO and CSPGO. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D104431	2021-08-11 14:43:26 +08:00
Sam Clegg	56175b2f5c	[lld][WebAssembly] Prefer objdump -d over obj2yaml for tests. NFC Now that we have https://reviews.llvm.org/D105539 we can use objdump -d to actually check for instruction sequences rather than binary blobs. This is just an example of how to do that we should followup with a wider ranging conversion of existing tests. Differential Revision: https://reviews.llvm.org/D106897	2021-08-10 18:17:58 -04:00
Fangrui Song	76093b1739	[InlineAdvisor] Add single quotes around caller/callee names Clang diagnostics refer to identifier names in quotes. This patch makes inline remarks conform to the convention. New behavior: ``` % clang -O2 -Rpass=inline -Rpass-missed=inline -S a.c a.c:4:25: remark: 'foo' inlined into 'bar' with (cost=-30, threshold=337) at callsite bar:0:25; [-Rpass=inline] int bar(int a) { return foo(a); } ^ ``` Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D107791	2021-08-10 11:51:31 -07:00
Ben Dunbobbin	8392e8c007	[LLD][Test] Add thin archives to map file test This adds thin archives to the map file test. I noticed that we had this test-case in our downstream testsuite but it wasn't in the upstream testing. Differential revision: https://reviews.llvm.org/D107555	2021-08-10 10:24:01 +01:00
Pan, Tao	c70fa6da9a	Fix gcc build error after D105519 Same as `3bec7ed59e` Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D107422	2021-08-09 14:32:34 +08:00
Simon Atanasyan	454f69bcc1	[LLD] Add required `ppc` target to the test cases. NFC	2021-08-07 13:29:59 +03:00
Simon Atanasyan	c6ebc651b6	[LLD] Support compressed input sections on big-endian targets This patch enables compressed input sections on big-endian targets by checking the target endianness and selecting an appropriate `Chdr` structure. Fixes PR51369 Differential Revision: https://reviews.llvm.org/D107635	2021-08-07 13:20:13 +03:00
Paul Robinson	34035b1044	2nd Speculative fix for MachO lld test after "Have REQUIRES support the target triple" See: http://45.33.8.238/macm1/15677/step_10.txt Follow-up to `f88ad8d` as it appears the lld invocations both emit an error message; so, try adding 'not' to the RUN lines.	2021-08-06 10:49:36 -07:00
Paul Robinson	f88ad8d00f	Speculative fix for MachO lld test after "Have REQUIRES support the target triple" See: http://45.33.8.238/macm1/15677/step_10.txt This is a test that has `REQUIRES: x86` which means it never ran before; I don't have a MachO environment but based on the FileCheck output it looks like it should be sufficient to remove one CHECK line.	2021-08-06 09:23:45 -07:00
Fangrui Song	72d070b4db	[ELF] Support copy relocation on non-default version symbols Copy relocation on a non-default version symbol is unsupported and can crash at runtime. Fortunately there is a one-line fix which works for most cases: ensure `getSymbolsAt` unconditionally returns `ss`. If two non-default version symbols are defined at the same place and both are copy relocated, our implementation will copy relocated them into different addresses. The pointer inequality is very unlikely an issue. In GNU ld, copy relocating version aliases seems to create more pointer inequality problems than us. ( In glibc, sys_errlist@GLIBC_2.2.5 sys_errlist@GLIBC_2.3 sys_errlist@GLIBC_2.4 are defined at the same place, but it is unlikely they are all copy relocated in one executable. Even if so, the variables are read-only and pointer inequality should not be a problem. ) Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107535	2021-08-05 10:32:14 -07:00
Fangrui Song	00809c8889	[ELF] Apply version script patterns to non-default version symbols Currently version script patterns are ignored for .symver produced non-default version (single @) symbols. This makes such symbols not localizable by `local:`, e.g. ``` .symver foo3_v1,foo3@v1 .globl foo_v1 foo3_v1: ld.lld --version-script=a.ver -shared a.o ``` This patch adds the support: * Move `config->versionDefinitions[VER_NDX_LOCAL].patterns` to `config->versionDefinitions[versionId].localPatterns` * Rename `config->versionDefinitions[versionId].patterns` to `config->versionDefinitions[versionId].nonLocalPatterns` * Allow `findAllByVersion` to find non-default version symbols when `includeNonDefault` is true. (Note: `symtab` keys do not have `@@`) * Make each pattern check both the unversioned `pat.name` and the versioned `${pat.name}@${v.name}` * `localPatterns` can localize `${pat.name}@${v.name}`. `nonLocalPatterns` can prevent localization by assigning `verdefIndex` (before `parseSymbolVersion`). --- If a user notices new `undefined symbol` errors with a version script containing `local: *;`, the issue is likely due to a missing `global:` pattern. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107234	2021-08-04 23:52:56 -07:00
Fangrui Song	a533eb7423	Revert "[ELF] Apply version script patterns to non-default version symbols" This reverts commit `7ed22a6fa9`. buf is not cleared so the commit misses some cases.	2021-08-04 23:52:55 -07:00
Fangrui Song	7a6482216f	[CMake][gn] lldMachO=>lldMachOOld, lldMachO2=>lldMachO Now that D95204 switched default to new Darwin backend, rename some CMake targets to match. Reviewed By: #lld-macho, smeenai, int3 Differential Revision: https://reviews.llvm.org/D107516	2021-08-04 18:52:41 -07:00
Fangrui Song	bd484c9940	[lld] Remove unused LLD_REPOSITORY Remnant after D72803. Distributions who want to customize the string can customize LLD_VERSION_STRING instead. Reviewed By: #lld-macho, mstorsjo, thakis Differential Revision: https://reviews.llvm.org/D107416	2021-08-04 13:04:10 -07:00
Fangrui Song	0a6aad5991	[ELF] Fix typo. NFC	2021-08-04 09:26:29 -07:00
Fangrui Song	66d4430492	[ELF] Combine foo@v1 and foo with the same versionId if both are defined Due to an assembler design flaw (IMO), `.symver foo,foo@v1` produces two symbols `foo` and `foo@v1` if `foo` is defined. * `v1 {};` produces both `foo` and `foo@v1`, but GNU ld only produces `foo@v1` * `v1 { foo; };` produces both `foo@@v1` and `foo@v1`, but GNU ld only produces `foo@v1` * `v2 { foo; };` produces both `foo@@v2` and `foo@v1`, matching GNU ld. (Tested by symver.s) This patch implements the GNU ld behavior by reusing the symbol redirection mechanism in D92259. The new test symver-non-default.s checks the first two cases. Without the patch, the second case will produce `foo@v1` and `foo@@v1` which looks weird and makes foo unnecessarily default versioned. Note: `.symver foo,foo@v1,remove` exists but the unfortunate `foo` will not go away anytime soon. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107235	2021-08-04 09:06:05 -07:00
Fangrui Song	7ed22a6fa9	[ELF] Apply version script patterns to non-default version symbols Currently version script patterns are ignored for .symver produced non-default version (single @) symbols. This makes such symbols not localizable by `local:`, e.g. ``` .symver foo3_v1,foo3@v1 .globl foo_v1 foo3_v1: ld.lld --version-script=a.ver -shared a.o # In a.out, foo3@v1 is incorrectly exported. ``` This patch adds the support: * Move `config->versionDefinitions[VER_NDX_LOCAL].patterns` to `config->versionDefinitions[versionId].localPatterns` * Rename `config->versionDefinitions[versionId].patterns` to `config->versionDefinitions[versionId].nonLocalPatterns` * Allow `findAllByVersion` to find non-default version symbols when `includeNonDefault` is true. (Note: `symtab` keys do not have `@@`) * Make each pattern check both the unversioned `pat.name` and the versioned `${pat.name}@${v.name}` * `localPatterns` can localize `${pat.name}@${v.name}`. `nonLocalPatterns` can prevent localization by assigning `verdefIndex` (before `parseSymbolVersion`). --- If a user notices new `undefined symbol` errors with a version script containing `local: *;`, the issue is likely due to a missing `global:` pattern. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107234	2021-08-04 09:02:11 -07:00
Fangrui Song	9bd29a73d1	[ELF] Make dot in .tbss correct GNU ld doesn't support multiple SHF_TLS SHT_NOBITS output sections (it restores the address after an SHF_TLS SHT_NOBITS section, so consecutive SHF_TLS SHT_NOBITS sections will have conflicting address ranges). That said, `threadBssOffset` implements limited support for consecutive SHF_TLS SHT_NOBITS sections. (SHF_TLS SHT_PROGBITS following a SHF_TLS SHT_NOBITS can still be incorrect.) `.` in an output section description of an SHF_TLS SHT_NOBITS section is incorrect. (https://lists.llvm.org/pipermail/llvm-dev/2021-July/151974.html) This patch saves the end address of the previous tbss section in `ctx->tbssAddr`, changes `dot` in the beginning of `assignOffset` so that `.` evaluation will be correct. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107208	2021-08-04 08:58:50 -07:00

... 5 6 7 8 9 ...

15048 Commits