llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	40fae4d8fc	[ELF] Optimize replaceCommonSymbols This decreases the 0.2% time (no debug info) to nearly no.	2021-12-24 19:01:51 -08:00
Fangrui Song	b5a0f0f397	[ELF] Add ELFFileBase::{elfShdrs,numELFShdrs} to avoid duplicate llvm::object::ELFFile::sections() This mainly avoid `relsOrRelas` cost in `InputSectionBase::relocate`. `llvm::object::ELFFile::sections()` has redundant and expensive checks.	2021-12-24 17:10:38 -08:00
Fangrui Song	5e3403bd22	[ELF] parseLazy: skip local symbols	2021-12-24 13:16:34 -08:00
Fangrui Song	0d749e13f7	[ELF] Optimize symbol initialization and resolution Avoid repeated load of global pointer (symtab) / members (sections.size(), firstGlobal) in the hot paths. And remove some unneeded this->	2021-12-23 21:54:32 -08:00
Fangrui Song	1d285f2de0	[ELF] Simplify and optimize ObjFile<ELFT>::parseLazy	2021-12-23 20:23:13 -08:00
Fangrui Song	ad26b0b233	Revert "[ELF] Make Partition/InStruct members unique_ptr and remove associate make<XXX>" This reverts commit `e48b1c8a27`. This reverts commit `d019de23a1`. The changes caused memory leaks (non-final classes cannot use unique_ptr).	2021-12-22 23:55:11 -08:00
Fangrui Song	ba948c5a9c	[ELF] Use SmallVector for some global variables (Files and Sections). NFC My lld executable is 26+KiB smaller.	2021-12-22 22:30:08 -08:00
Fangrui Song	d019de23a1	[ELF] Make InStruct members unique_ptr and remove associate make<XXX> See D116143 for benefits. My lld executable (x86-64) is 24+KiB smaller.	2021-12-22 21:11:26 -08:00
Fangrui Song	3a5fb57393	[ELF] Replace LazyObjFile with lazy ObjFile/BitcodeFile The new `lazy` state is the inverse of the previous `LazyObjFile::extracted`. There are many advantages: * previously when a LazyObjFile was extracted, a new ObjFile/BitcodeFile was created; now the file is reused, just with `lazy` cleared * avoid the confusing transfer of `symbols` from LazyObjFile to the new file * the `incompatible file:` diagnostic is unified with `is incompatible with` * simpler code, smaller executable (6200+ bytes smaller on x86-64) * make eager parsing feasible (for parallel section/symbol table initialization)	2021-12-22 17:41:50 -08:00
Fangrui Song	60f5614931	[ELF] SharedFile::parse: cache symbols size for a loop. NFC	2021-12-15 22:45:28 -08:00
Fangrui Song	159b948e43	[ELF] ObjFile<ELFT>::initializeSymbols: don't call Allocate when firstGlobal==0 Calling `Allocate` with 0 size (when .symtab is absent, e.g. `invalid/mips-invalid-options-descriptor.test`) may return a nullptr, which will crash with -fsanitize=null (the underlying `Allocate` function is LLVM_ATTRIBUTE_RETURNS_NONNULL).	2021-12-15 18:21:48 -08:00
Fangrui Song	50187d2dd5	[ELF] Speed up ObjFile<ELFT>::createInputSection * Group ".note" section name checks * Move shouldMerge check to the caller	2021-12-15 17:15:32 -08:00
Fangrui Song	b5805b7847	[ELF] ObjFile<ELFT>::initializeSymbols: avoid StringRefZ from undefined symbols	2021-12-15 15:30:18 -08:00
Fangrui Song	2bdad16303	[ELF] SymbolTable::insert: keep @@ in the name * Avoid the name truncation quirk in SymbolTable::insert: the truncated name will be replaced by @@ again. * Allow foo and foo@@v1 in different files to be diagnosed as duplicate definition error (GNU ld behavior) * Avoid potential redundant strlen on symbol name due to StringRefZ in ObjFile<ELFT>::initializeSymbols	2021-12-15 15:19:35 -08:00
Fangrui Song	a596a5fc12	[ELF] ObjFile<ELFT>::initializeSymbols: Simplify this->symbols[i]. NFC	2021-12-15 13:02:38 -08:00
Fangrui Song	509153f1e7	[ELF] ObjFile<ELFT>::initializeSymbols: Batch allocate local symbols and detangle local/global symbol initialization. My x86-64 lld executable is 8k smaller due to the removal of SpecificAlloc<Undefined>.	2021-12-15 12:54:39 -08:00
Fangrui Song	7a54ae9c1d	[ELF] Change objectFiles to ELFFileBase * This can sometimes avoid `cast<ObjFile<...>>`. I intentionally do not touch postScanRelocations to wait for its stabilization.	2021-12-15 00:37:10 -08:00
Fangrui Song	c720b16aa5	[ELF] Use SmallVector for SharedFile and simplify parseVerdefs SHT_GNU_verdef is typically small, so it's unnecessary to reserve the vector. While here, fix a hypothetical issue when SHT_GNU_verdef has non-increasing version indexes, which don't happen with GNU ld, gold, ld.lld's output. My x86-64 lld executable is 256 bytes smaller.	2021-12-14 21:11:45 -08:00
Fangrui Song	1ff1d50d9f	[ELF] Make InputFile smaller sizeof(ObjFile<ELF64LE>) is decreased from 344 to 272 on an ELF64 system. In a large link with 30000 ObjFiles, this may be 2+MiB saving. Change std::vector members to SmallVector, and std::string members to SmallString<0> (these members typically don't benefit from small string optimization). On Linux x86-64 the lld executable is ~6k smaller.	2021-12-14 20:55:32 -08:00
Fangrui Song	1eaa9b4374	[ELF] initializeSections: move SHT_LLVM_CALL_GRAPH_PROFILE check into SHF_EXCLUDE && !relocatable. NFC Avoid a comparison in the majority of cases.	2021-12-12 20:05:21 -08:00
Igor Kudrin	ce25eb12dd	[ELF] Do not report undefined weak references in shared libraries This fixes an issue introduced in D101996. A weak reference in a shared library could be incorrectly reported if there is another library that has a strong reference to the same symbol. Differential Revision: https://reviews.llvm.org/D115041	2021-12-07 10:10:51 +07:00
Fangrui Song	c5bfffed48	[ELF] Discard input .note.gnu.build-id even with default --build-id=none binutils 2.38 will adopt this behavior https://sourceware.org/bugzilla/show_bug.cgi?id=28639 Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D114910	2021-12-02 09:50:59 +00:00
Igor Kudrin	b0ac68ccb7	[ELF] Prevent internalizing used comdat symbol When a comdat symbol is defined in both bitcode and regular object files, which are contained in the same archive, the linker could lose the flag that the symbol is used in the regular object file and allow LTO to internalize it, which led to "error: undefined symbol". The issue was introduced in D79300. Differential Revision: https://reviews.llvm.org/D114801	2021-12-02 12:10:06 +07:00
Fangrui Song	5188f55d32	[ELF] Move ObjFile<ELFT>::{getLocalSymbols,getGlobalSymbols} to non-template ELFFileBase. NFC	2021-11-30 00:50:19 -08:00
Fangrui Song	09401dfcf1	[ELF] Rename fetch to extract The canonical term is "extract" (GNU ld documentation, Solaris's `-z *extract` options). Avoid inventing a term and match --why-extract. (ld64 prefers "load" but the word is overloaded too much) Mostly MFC, except for --help messages and the header row in --print-archive-stats output.	2021-11-26 10:58:50 -08:00
Fangrui Song	7aafe467d2	[ELF] Simplify a condition with config->copyRelocs. NFC	2021-11-22 13:59:23 -08:00
Fangrui Song	213d1849a4	[ELF] Improve sh_info=0 and sh_info>=num_sections diagnostic for SHT_REL/SHT_RELA PR52408 reported an sh_info=0 instance. I have seen sh_info=0 independently before. sh_info>=num_sections is probably very rare. Just use one diagnostic for the two types of errors. Delete invalid-relocations.test which is covered by invalid/bad-reloc-target.test Differential Revision: https://reviews.llvm.org/D113466	2021-11-09 09:54:12 -08:00
Fangrui Song	ecc93ed2d7	[ELF] Replace InputBaseSection::{areRelocsRela,firstRelocation,numRelocation} with relSecIdx For `InputSection` `.foo`, its `InputBaseSection::{areRelocsRela,firstRelocation,numRelocation}` basically encode the information of `.rel[a].foo`. However, one uint32_t (the relocation section index) suffices. See the implementation of `relsOrRelas`. This change decreases sizeof(InputSection) from 184 to 176 on 64-bit Linux. The maximum resident set size linking a large application (1.2G output) decreases by 0.39%. Differential Revision: https://reviews.llvm.org/D112513	2021-10-27 09:51:07 -07:00
Fangrui Song	25da870057	[ELF] Remove irrelevant group signature hack working around old gold -r	2021-10-25 15:09:08 -07:00
Fangrui Song	aa4dfba522	[ELF] Infer EM_HEXAGON in getBitcodeMachineKind	2021-09-07 20:46:37 -07:00
Fangrui Song	db5e078690	[LTO] Add SelectionKind to IRSymtab and use it in ld.lld/LLVMgold In PGO, a C++ external linkage function `foo` has a private counter `__profc_foo` and a private `__profd_foo` in a `comdat nodeduplicate`. A `__attribute__((weak))` function `foo` has a weak hidden counter `__profc_foo` and a private `__profd_foo` in a `comdat nodeduplicate`. In `ld.lld a.o b.o`, say a.o defines an external linkage `foo` and b.o defines a weak `foo`. Currently we treat `comdat nodeduplicate` as `comdat any`, ld.lld will incorrectly consider `b.o:__profc_foo` non-prevailing. In the worst case when `b.o:__profd_foo` is retained and `b.o:__profc_foo` isn't, there will be dangling reference causing an `undefined hidden symbol` error. Add SelectionKind to `Comdat` in IRSymtab and let linkers ignore nodeduplicate comdat. Differential Revision: https://reviews.llvm.org/D106228	2021-07-20 13:22:00 -07:00
Nico Weber	fbb45947b2	[lld/mac] Resolve defined symbols before undefined symbols Ports https://reviews.llvm.org/D95985 to the MachO port. Happens to fix PR51135; see that bug for details. Also makes lld's behavior match ld64 for the included test case. Differential Revision: https://reviews.llvm.org/D106293	2021-07-19 16:37:41 -04:00
Fangrui Song	7de2173c2a	[ELF] --fortran-common: prefer STB_WEAK to COMMON The ELF specification says "The link editor honors the common definition and ignores the weak ones." GNU ld and our Symbol::compare follow this, but the --fortran-common code (D86142) made a mistake on the precedence. Fixes https://bugs.llvm.org/show_bug.cgi?id=51082 Reviewed By: peter.smith, sfertile Differential Revision: https://reviews.llvm.org/D105945	2021-07-14 10:18:30 -07:00
Alexander Yermolovich	24129fbc9a	[LLD] Adding support for RELA for CG Profile. This is a follow up to https://reviews.llvm.org/D104080, and `ca3bdb57fa (diff-e64a48fabe31db213a631fdc5f2acb51bdddf3f16a8fb2928784f4c579229585)`. The implementation of call graph profile was changed from a black box section to relocation approach. This was done to be compatible with post processing tools like strip/objcopy, and llvm equivalent. When they are invoked on object file before the final linking step with this new approach the symbol indices correctness is preserved. The GNU binutils tools change the REL section to RELA section, unlike llvm tools. For example when strip -S is run on the ELF object files, as an intermediate step before linking. To preserve compatibility this patch extends implementation in LLD and ELFDumper to support both REL and RELA sections for call graph profile. Reviewed By: MaskRay, jhenderson Differential Revision: https://reviews.llvm.org/D105217	2021-07-13 13:56:30 -07:00
Fangrui Song	ca3bdb57fa	[MC][ELF] Change SHT_LLVM_CALL_GRAPH_PROFILE relocations from SHT_RELA to SHT_REL ... even on targets preferring RELA. The section is only consumed by ld.lld which can handle REL. Follow-up to D104080 as I explained in the review. There are two advantages: * The D104080 code only handles RELA, so arm/i386/mips32 etc may warn for -fprofile-use=/-fprofile-sample-use= usage. * Decrease object file size for RELA targets While here, change the relocation to relocate weights, instead of 0,1,2,3,.. I failed to catch the issue during review.	2021-06-24 21:35:48 -07:00
Fangrui Song	c4ca39e0f5	[ELF] Fix .rela.llvm.call-graph-profile detection after D104080 A SHT_SYMTAB section's sh_info is the number of local symbols. sh_info may coincide with the section header index of SHT_LLVM_CALL_GRAPH_PROFILE.	2021-06-24 15:21:28 -07:00
Alexander Yermolovich	a224c5199b	[LLD][LLVM] CG Graph profile using relocations Currently when .llvm.call-graph-profile is created by llvm it explicitly encodes the symbol indices. This section is basically a black box for post processing tools. For example, if we run strip -s on the object files the symbol table changes, but indices in that section do not. In non-visible behavior indices point to wrong symbols. The visible behavior indices point outside of Symbol table: "invalid symbol index". This patch changes the format by using R_*_NONE relocations to indicate the from/to symbols. The Frequency (Weight) will still be in the .llvm.call-graph-profile, but symbol information will be in relocation section. In LLD information from both sections is used to reconstruct call graph profile. Relocations themselves will never be applied. With this approach post processing tools that handle relocations correctly work for this section also. Tools can add/remove symbols and as long as they handle relocation sections with this approach information stays correct. Doing a quick experiment with clang-13. The size went up from 107KB to 322KB, aggregate of all the input sections. Size of clang-13 binary is ~118MB. For users of -fprofile-use/-fprofile-sample-use the size of object files will go up slightly, it will not impact final binary size. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D104080	2021-06-24 09:09:33 -07:00
Igor Kudrin	70c23e232e	[LLD] Improve reporting unresolved symbols in shared libraries Currently, when reporting unresolved symbols in shared libraries, if an undefined symbol is firstly seen in a regular object file that shadows the reference for the same symbol in a shared object. As a result, the error for the unresolved symbol in the shared library is not reported. If referencing sections in regular object files are discarded because of '--gc-sections', no reports about such symbols are generated, and the linker finishes successfully, generating an output image that fails on the run. The patch fixes the issue by keeping symbols, which should be checked, for each shared library separately. Differential Revision: https://reviews.llvm.org/D101996	2021-05-11 12:48:29 +07:00
Nico Weber	221388f451	fix comment typo to cycle bots	2021-03-29 14:50:17 -04:00
Abhina Sreeskantharajan	c83cd8feef	[NFC] Reordering parameters in getFile and getFileOrSTDIN In future patches I will be setting the IsText parameter frequently so I will refactor the args to be in the following order. I have removed the FileSize parameter because it is never used. ``` static ErrorOr<std::unique_ptr<MemoryBuffer>> getFile(const Twine &Filename, bool IsText = false, bool RequiresNullTerminator = true, bool IsVolatile = false); static ErrorOr<std::unique_ptr<MemoryBuffer>> getFileOrSTDIN(const Twine &Filename, bool IsText = false, bool RequiresNullTerminator = true); static ErrorOr<std::unique_ptr<MB>> getFileAux(const Twine &Filename, uint64_t MapSize, uint64_t Offset, bool IsText, bool RequiresNullTerminator, bool IsVolatile); static ErrorOr<std::unique_ptr<WritableMemoryBuffer>> getFile(const Twine &Filename, bool IsVolatile = false); ``` Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D99182	2021-03-25 09:47:49 -04:00
Nico Weber	cb4df6eb8d	fix comment typos to cycle bots	2021-02-18 14:25:21 -05:00
Petr Hosek	bfa4235e6e	[lld][ELF] Support for zero flag section groups This change introduces support for zero flag ELF section groups to lld. lld already supports COMDAT sections, which in ELF are a special type of ELF section groups. These are generally useful to enable linker GC where you want a group of sections to always travel together, that is to be either retained or discarded as a whole, but without the COMDAT semantics. Other ELF linkers already support zero flag ELF section groups and this change helps us reach feature parity. Differential Revision: https://reviews.llvm.org/D96636	2021-02-16 14:33:09 -08:00
Fangrui Song	0557b1bdec	[ELF] Resolve defined symbols before undefined symbols When parsing an object file, LLD interleaves undefined symbol resolution (which may recursively fetch other lazy objects) with defined symbol resolution. This may lead to surprising results, e.g. if an object file defines currently undefined symbols and references another lazy symbol, we may interleave defined symbols with the lazy fetch, potentially leading to the defined symbols resolving to different files. As an example, if both `a.a(a.o)` and `a.a(b.o)` define `foo` (not in COMDAT group, or in different COMDAT groups) and `__profd_foo` (in COMDAT group `__profd_foo`). LLD may resolve `foo` to `a.a(a.o)` and `__profd_foo` to `b.a(b.o)`, i.e. different files. ``` parse ArchiveFile a.a entry fetches a.a(a.o) parse ObjectFile a.o define entry define foo reference b b fetches a.a(b.o) parse ObjectFile b.o define prevailing __profd_foo define (ignored) non-prevailing __profd_foo ``` Assuming a set of interconnected symbols are defined all or none in several lazy objects. Arguably making them resolve to the same file is preferable than making them resolve to different files (some are lazy objects). The main argument favoring the new behavior is the stability. The relative order between a defined symbol and an undefined symbol does not change the symbol resolution behavior. Only the relative order between two undefined symbols can affect fetching behaviors. --- The real world case is reduced from a Fuchsia PGO usage: `a.a(a.o)` has a constructor within COMDAT group C5 while `a.a(b.o)` has a constructor within COMDAT group C2. Because they use different group signatures, they are not de-duplicated. It is not entirely whether Clang behavior is entirely conforming. LLD selects the PGO counter section (`__profd_`) from `a.a(b.o)` and the constructor section from `a.a(a.o)`. The `__profd_` is a SHF_LINK_ORDER section linking to its own non-prevailing constructor section, so LLD errors `sh_link points to discarded section`. This patch fixes the error. Differential Revision: https://reviews.llvm.org/D95985	2021-02-11 09:41:46 -08:00
Fangrui Song	7605a9a009	[ELF] Support aarch64_be This patch adds * Big-endian values for `R_AARCH64_{ABS,PREL}{16,32,64}` and `R_AARCH64_PLT32` * aarch64elfb & aarch64linuxb BFD emulations * elf64-bigaarch64 output format (bfdname) Link: https://github.com/ClangBuiltLinux/linux/issues/1288 Differential Revision: https://reviews.llvm.org/D96188	2021-02-08 08:55:29 -08:00
Fangrui Song	5f4d7b2f0a	[ELF] Improve --icf=safe diagnostic The current diagnostic has confused users. The new wording is adapted from one suggested by Ian Lance Taylor. Differential Revision: https://reviews.llvm.org/D95917	2021-02-05 09:37:37 -08:00
Brandon Bergren	275eb8289c	[PowerPC] Support powerpcle target in LLD [4/5] Add support for linking powerpcle code in LLD. Rewrite lld/test/ELF/emulation-ppc.s to use a shared check block and add powerpcle tests. Update tests. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D93917	2021-01-02 12:18:05 -06:00
Sean Fertile	8f91f38148	[LLD] Search archives for symbol defs to override COMMON symbols. This patch changes the archive handling to enable the semantics needed for legacy FORTRAN common blocks and block data. When we have a COMMON definition of a symbol and are including an archive, LLD will now search the members for global/weak defintions to override the COMMON symbol. The previous LLD behavior (where a member would only be included if it satisifed some other needed symbol definition) can be re-enabled with the option '-no-fortran-common'. Differential Revision: https://reviews.llvm.org/D86142	2020-12-07 10:09:19 -05:00
James Henderson	439341b9bf	[lld][ELF] Add additional time trace categories I noticed when running a large link with the --time-trace option that there were several areas which were missing any specific time trace categories (aside from the generic link/ExecuteLinker categories). This patch adds new categories to fill most of the "gaps", or to provide more detail than was previously provided. Reviewed by: MaskRay, grimar, russell.gallop Differential Revision: https://reviews.llvm.org/D90686	2020-11-10 10:28:46 +00:00
Konstantin Zhuravlyov	f218652a36	LLD/AMDGPU: Infer os abi based on input llvm bitcode Differential Revision: https://reviews.llvm.org/D89042	2020-10-13 12:20:28 -04:00
Christian Iversen	a9cefc3dee	[ELF] Fix broken bitstream linking with lld when e_machine > 255 In ELF/InputFiles.cpp, getBitcodeMachineKind() is limited to uint8_t return type. This works as long as EM_xxx is < 256, which is true for common architectures, but not for some newly assigned or unofficial EM_* values. The corresponding ELF field (e_machine) can hold uint16_t. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D89185	2020-10-11 14:19:25 -07:00

1 2 3 4 5 ...

691 Commits