llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	6c814931bc	[ELF] Don't use multiple inheritance for OutputSection. NFC Add an OutputDesc class inheriting from SectionCommand. An OutputDesc wraps an OutputSection. This change allows InputSection::getParent to be inlined. Differential Revision: https://reviews.llvm.org/D120650	2022-03-08 11:23:42 -08:00
Fangrui Song	8ca46bba23	[ELF] Move isUsedInRegularObj assignment from ctor to call sites. NFC This removes the tricky `isUsedInRegularObj(!file || file->kind() == InputFile::ObjKind)` and the copy from `Symbol::mergeProperties`.	2022-02-23 21:32:50 -08:00
Fangrui Song	b01430a04f	[ELF] Don't rely on Symbols.h's transitive inclusion of InputFiles.h. NFC	2022-02-23 19:18:24 -08:00
Fangrui Song	fc0aa8424c	[ELF] Check COMMON symbols for PROVIDE and don't redefine COMMON symbols edata/end/etext In GNU ld, the definition precedence is: regular symbol assignment > relocatable object definition > `PROVIDE` symbol assignment. GNU ld's internal linker scripts define the non-reserved (by C and C++) edata/end/etext with `PROVIDE` so the relocatable object definition takes precedence. This makes sense because `int end;` is valid. We currently redefine such symbols if they are COMMON, but not if they are regular definitions, so `int end;` with -fcommon is essentially a UB in ld.lld. Fix this (also improve consistency and match GNU ld) by using the `isDefined` code path for `isCommon`. In GNU ld, reserved identifiers like `__ehdr_start` do not use `PROVIDE`, while we treat them all as `PROVIDE`, this seems fine. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D120389	2022-02-23 10:15:42 -08:00
Fangrui Song	27bb799095	[ELF] Clean up headers. NFC	2022-02-07 21:53:34 -08:00
Fangrui Song	fbf2f66400	[ELF] Update flag propagation rule to ignore discarded output sections See the updated insert-before.test for the effects: many synthetic sections are SHF_ALLOC|SHF_WRITE. If they are discarded, we don't want to propagate their flags to subsequent output section descriptions. `getFirstInputSection(sec) == nullptr` can technically be merged into `isDiscardable` but I'd like to postpone that as not sharing code may give more refactoring opportunity. Depends on D118529. Reviewed By: peter.smith, bluca Differential Revision: https://reviews.llvm.org/D118530	2022-02-01 10:19:30 -08:00
Fangrui Song	a0318711c8	[ELF] Rename adjustSectionsBeforeSorting to adjustOutputSections and make it affect INSERT commands adjustSectionsBeforeSorting updates some output section attributes (alignment/flags) and removes discardable empty sections. When it is called, INSERT commands have not been processed. Therefore the flags propagation rule may not affect output sections defined in an INSERT command properly. Fix this by moving processInsertCommands before adjustSectionsBeforeSorting. adjustSectionsBeforeSorting is somewhat misnamed. The order between it and sortInputSections does not matter. With the pass shuffle, the name of adjustSectionsBeforeSorting becomes wrong. Therefore rename it. The new name is not set in stone. The function mixes several tasks and the code may be refactored in a way that we may give them more meaningful names. With this patch, I think the behavior of attribute propagation becomes more reasonable. In particular, in the absence of non-INSERT SECTIONS, inserting a section after a SHF_ALLOC one will give us a SHF_ALLOC section, not a non-SHF_ALLOC one (see linkerscript/insert-after.test). Reviewed By: peter.smith, bluca Differential Revision: https://reviews.llvm.org/D118529	2022-02-01 10:16:12 -08:00
Alexandre Ganea	83d59e05b2	Re-land [LLD] Remove global state in lldCommon Move all variables at file-scope or function-static-scope into a hosting structure (lld::CommonLinkerContext) that lives at lldMain()-scope. Drivers will inherit from this structure and add their own global state, in the same way as for the existing COFFLinkerContext. See discussion in https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html The previous land `f860fe3622` caused issues in https://lab.llvm.org/buildbot/#/builders/123/builds/8383, fixed by `22ee510dac`. Differential Revision: https://reviews.llvm.org/D108850	2022-01-20 14:53:26 -05:00
Fangrui Song	ac0986f880	[ELF] Change std::vector<InputSectionBase *> to SmallVector There is no remaining std::vector<InputSectionBase> now. My x86-64 lld executable is 2KiB smaller.	2022-01-17 10:25:07 -08:00
Fangrui Song	769057a5d0	[ELF] Change some DenseMap<StringRef, > to DenseMap<CachedHashStringRef, >. NFC	2022-01-16 21:19:01 -08:00
Alexandre Ganea	e6b153947d	Revert [LLD] Remove global state in lldCommon It seems to be causing issues on https://lab.llvm.org/buildbot/#/builders/123/builds/8383	2022-01-16 11:03:06 -05:00
Alexandre Ganea	f860fe3622	[LLD] Remove global state in lldCommon Move all variables at file-scope or function-static-scope into a hosting structure (lld::CommonLinkerContext) that lives at lldMain()-scope. Drivers will inherit from this structure and add their own global state, in the same way as for the existing COFFLinkerContext. See discussion in https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html Differential Revision: https://reviews.llvm.org/D108850	2022-01-16 08:57:57 -05:00
Fangrui Song	43d927984c	[ELF] Refactor how .gnu.hash and .hash are discarded Switch to the D114180 approach which is simpler and allows gnuHashTab/hashTab to switch to unique_ptr.	2022-01-12 12:47:07 -08:00
Fangrui Song	bf9c8636f2	[ELF] Support discarding .relr.dyn `db08df0570` does not work because part.relrDyn is a unique_ptr and `reset` destroys the object which may still be referenced. This commit uses the D114180 approach. Also improve the test to check that there is no R_X86_64_RELATIVE.	2022-01-12 11:55:22 -08:00
Fangrui Song	db08df0570	[ELF] Support discarding .relr.dyn to prepare for D116838, otherwise for linkerscript/discard-section-err.s, there will be a null pointer dereference in `part.relrDyn->getParent()->size` in `finalizeSynthetic(part.relrDyn.get())`.	2022-01-12 10:38:59 -08:00
Fangrui Song	cb203f3f92	[ELF] Change InStruct/Partition pointers to unique_ptr and remove associated make<XXX> calls. gnuHash and sysvHash are unchanged, otherwise LinkerScript::discard would destroy the objects which may be referenced by input section descriptions. My x86-64 lld executable is 121+KiB smaller.	2021-12-27 18:15:23 -08:00
Fangrui Song	3c94d5d9d2	[ELF] addOrphanSections: avoid std::function	2021-12-27 15:57:38 -08:00
Fangrui Song	a1c2ee0147	[ELF] LinkerScript/OutputSection: change other std::vector members to SmallVector 11+KiB smaller .text with both libc++ and libstdc++ builds.	2021-12-26 13:53:47 -08:00
Fangrui Song	10316a6f94	[ELF] Change InputSectionDescription members from vector to SmallVector This decreases sizeof(lld::elf::InputSectionDescription) from 264 to 232.	2021-12-26 13:06:54 -08:00
Kazu Hirata	9c0a4227a9	Use Optional::getValueOr (NFC)	2021-12-24 20:57:40 -08:00
Fangrui Song	ad26b0b233	Revert "[ELF] Make Partition/InStruct members unique_ptr and remove associate make<XXX>" This reverts commit `e48b1c8a27`. This reverts commit `d019de23a1`. The changes caused memory leaks (non-final classes cannot use unique_ptr).	2021-12-22 23:55:11 -08:00
Fangrui Song	e48b1c8a27	[ELF] Make Partition members unique_ptr and remove associate make<XXX> See D116143 for benefits. My lld executable (x86-64) is 103+KiB smaller.	2021-12-22 21:34:26 -08:00
Fangrui Song	d019de23a1	[ELF] Make InStruct members unique_ptr and remove associate make<XXX> See D116143 for benefits. My lld executable (x86-64) is 24+KiB smaller.	2021-12-22 21:11:26 -08:00
Fangrui Song	baa3eb0dd9	[ELF] Change some non-null pointer parameters to references. NFC	2021-12-22 20:51:11 -08:00
Fangrui Song	5fc4323eda	[ELF] Change some global pointers to unique_ptr Currently the singleton `config` is assigned by `config = make<Configuration>()` and (if `canExitEarly` is false) destroyed by `lld::freeArena`. `make<Configuration>` allocates a slab with `malloc(4096)`. This both wastes memory and bloats the executable (every type instantiates `BumpPtrAllocator` which costs more than 1KiB code on x86-64). (No need to worry about `clang::no_destroy`. Regular invocations (`canExitEarly` is true) call `_Exit` via llvm::sys::Process::ExitNoCleanup.) Reviewed By: lichray Differential Revision: https://reviews.llvm.org/D116143	2021-12-22 14:36:14 -08:00
Fangrui Song	3534d26cc1	[ELF] Slightly speed up -z keep-text-section-prefix	2021-12-15 10:20:11 -08:00
Fangrui Song	3deb82cd07	[ELF] Adjust getOutputSectionName prefix order Sorting the prefixes by decreasing frequency can improve performance. .gcc_except_table is relatively frequent, so move it ahead. .ctors and .dtors mostly disappear and should be the last.	2021-12-15 00:18:58 -08:00
Fangrui Song	5816f1855c	[ELF] Slightly speed up getOutputSectionName. NFC	2021-12-14 23:43:00 -08:00
Fangrui Song	e652f3f04a	[ELF] Simplify some ctx->outSec with sec. NFC	2021-11-28 19:08:27 -08:00
Fangrui Song	89c0f4553e	[ELF] Simplify/remove LinkerScript::switchTo. NFC	2021-11-28 19:05:15 -08:00
Fangrui Song	1164c4b375	[ELF] Simplify/remove LinkerScript::output and advance. NFC	2021-11-28 16:58:06 -08:00
Fangrui Song	e80a0b353c	[ELF] Remove unneeded getOutputSectionVA. NFC I attempted to remove it 1 or 2 year ago but kept it just to have a good diagnostic in case the output section is nullptr (should be impossible). It is long enough that we haven't seen such a case.	2021-11-28 16:17:10 -08:00
Fangrui Song	7ea662e2dd	[ELF] Replace one make_unique from r316378 with a stack object. NFC	2021-11-28 15:32:29 -08:00
Fangrui Song	7051aeef7a	[ELF] Rename BaseCommand to SectionCommand. NFC BaseCommand was picked when PHDRS/INSERT/etc were not implemented. Rename it to SectionCommand to match `sectionCommands` and make it clear that the commands are used in SECTIONS (except a special case for SymbolAssignment). Also, improve naming of some BaseCommand variables (base -> cmd).	2021-11-25 20:24:23 -08:00
Fangrui Song	6188fd4957	[ELF] Rename OutputSection::sectionCommands to commands. NFC This partially reverts r315409: the description applies to LinkerScript, but not to OutputSection. The name "sectionCommands" is used in both LinkerScript::sectionCommands and OutputSection::sectionCommands, which may lead to confusion. "commands" in OutputSection has no ambiguity because there are no other types of commands.	2021-11-25 16:47:07 -08:00
Igor Kudrin	8cdf1c1edb	[ELF] Support the "read-only" memory region attribute The attribute 'r' allows (or disallows for the negative case) read-only sections, i.e. ones without the SHF_WRITE flag, to be assigned to the memory region. Before the patch, lld could put a section in the wrong region or fail with "error: no memory region specified for section". Differential Revision: https://reviews.llvm.org/D113771	2021-11-24 12:17:09 +07:00
Igor Kudrin	a05b694b1e	[ELF][NFC] Do not pass region name to expandMemoryRegion() The name can be easily got on-site. Differential Revision: https://reviews.llvm.org/D114228	2021-11-22 14:19:07 +07:00
Fangrui Song	648157b05a	[ELF] Move getOutputSectionName from Writer.cpp to LinkerScript.cpp. NFC and internalize it.	2021-11-20 22:18:09 -08:00
Andrew Ng	47eb3f155f	[ELF] Ensure output section is not discarded in addStartEndSymbols() Fixes https://bugs.llvm.org/show_bug.cgi?id=52534. Differential Revision: https://reviews.llvm.org/D114179	2021-11-19 11:45:58 +00:00
Konstantin Schwarz	8c18719bae	[ELF] Expand LMA region if output section alignment introduces padding When aligning the start address of an output section introduces a gap between the current dot pointer and the new aligned address, we were already properly expanding the memory region, if available. D74286 introduced a new behavior to also align the LMA address if an LMA region is specified. However, this did not expand the corresponding LMA region. Now, we also expand the LMA region if it is set. This fixes PR52510. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D114166	2021-11-19 11:27:21 +01:00
Igor Kudrin	66691de94c	[ELF] Do not try to assign a memory region to a non-allocatable section Non-allocatable sections are not part of the memory image of the program, so there is no need to find memory regions for them either matching properties or handling explicit assignments. The early test and return help to simplify LinkerScript::findMemoryRegion() a bit. Differential Revision: https://reviews.llvm.org/D113768	2021-11-15 15:59:39 +07:00
Igor Kudrin	d2dd36bbbe	[ELF] Better resemble GNU ld when placing orphan sections into memory regions An orphan section should be placed in the same memory region as its anchor section if the latter specifies the memory region explicitly. If there is no explicit assignment for the anchor section in the linker script, its memory region is selected by matching attributes, and the same should be done for the orphan section. Before the patch, some scripts that were handled smoothly in GNU ld caused an "error: no memory region specified for section" in lld. Differential Revision: https://reviews.llvm.org/D112925	2021-11-11 15:07:38 +07:00
Fangrui Song	bf6e259b21	[ELF] Update comments/diagnostics for some long options to use the canonical two-dash form Rewrite some comments as appropriate.	2021-10-25 12:52:06 -07:00
Fangrui Song	9bd29a73d1	[ELF] Make dot in .tbss correct GNU ld doesn't support multiple SHF_TLS SHT_NOBITS output sections (it restores the address after an SHF_TLS SHT_NOBITS section, so consecutive SHF_TLS SHT_NOBITS sections will have conflicting address ranges). That said, `threadBssOffset` implements limited support for consecutive SHF_TLS SHT_NOBITS sections. (SHF_TLS SHT_PROGBITS following a SHF_TLS SHT_NOBITS can still be incorrect.) `.` in an output section description of an SHF_TLS SHT_NOBITS section is incorrect. (https://lists.llvm.org/pipermail/llvm-dev/2021-July/151974.html) This patch saves the end address of the previous tbss section in `ctx->tbssAddr`, changes `dot` in the beginning of `assignOffset` so that `.` evaluation will be correct. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107208	2021-08-04 08:58:50 -07:00
Fangrui Song	03051f7ac8	[ELF] Preserve section order within an INSERT AFTER command For ``` SECTIONS { text.0 : {} text.1 : {} text.2 : {} } INSERT AFTER .data; ``` the current order is `.data text.2 text.1 text.0`. It makes more sense to preserve the specified order and thus improve compatibility with GNU ld. For ``` SECTIONS { text.0 : {} } INSERT AFTER .data; SECTIONS { text.3 : {} } INSERT AFTER .data; ``` GNU ld somehow collects sections with `INSERT AFTER .data` together (IMO inconsistent) but I think it makes more sense to execute the commands in order and get `.data text.3 text.0` instead. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D105158	2021-06-30 11:35:50 -07:00
Fangrui Song	899fdf548e	[ELF] Add OVERWRITE_SECTIONS command This implements https://sourceware.org/bugzilla/show_bug.cgi?id=26404 An `OVERWRITE_SECTIONS` command is a `SECTIONS` variant which contains several output section descriptions. The output sections do not have to specify an order. Similar to `INSERT [BEFORE|AFTER]`, `LinkerScript::hasSectionsCommand` is not set, so the built-in rules (see `docs/ELF/linker_script.rst`) still apply. `OVERWRITE_SECTIONS` can be more convenient than `INSERT` because it does not need an anchor section. The initial syntax is intentionally narrow to facilitate backward compatible extensions in the future. Symbol assignments cannot be used. This feature is versatile. To list a few usage: * Use `section : { KEEP(...) }` to retain input sections under GC * Define encapsulation symbols (start/end) for an output section * Use `section : ALIGN(...) : { ... }` to overalign an output section (similar to ld64 `-sectalign`) When an output section is specified by both `OVERWRITE_SECTIONS` and `INSERT`, `INSERT` is processed after overwrite sections. To make this work, this patch changes `InsertCommand` to use name based matching instead of pointer based matching. (This may cause a difference when `INSERT` moves one output section more than once. Such duplicate commands should not be used in practice (seems that in GNU ld the output sections may just disappear).) A linker script can be used without -T/--script. The traditional `SECTIONS` commands are concatenated, so a wrong rule can be more noticeable from the section order. This feature if misused can be less noticeable, just like `INSERT`. Differential Revision: https://reviews.llvm.org/D103303	2021-06-13 12:41:11 -07:00
Nico Weber	1e89f08f59	fix typo to cycle bots	2021-04-14 14:52:53 -04:00
Georgii Rymar	3f5dc57fd1	[LLD][ELF] - Don't keep empty output sections which have explicit program headers. This reverts a side effect introduced in the code cleanup patch D43571: LLD started to emit empty output sections that are explicitly assigned to a segment. This patch fixes the issue by removing the !sec.phdrs.empty() special case from isDiscardable. As compensation, we add an early phdrs propagation step (see the inline comment). This is similar to one that we do in adjustSectionsAfterSorting. Differential revision: https://reviews.llvm.org/D92301	2020-12-02 11:19:21 +03:00
Fangrui Song	048b16f7fb	[ELF] Check --orphan-handling=place (default value) early The function took 1% (161MiB clang) to 1.7% (an 4.9GiB executable) time.	2020-11-29 12:36:27 -08:00
Fangrui Song	73d01a80ce	[ELF] Sort by input order within an input section description According to https://sourceware.org/binutils/docs/ld/Input-Section-Basics.html#Input-Section-Basics for `(.a .b)`, the order should match the input order: for `ld 1.o 2.o`, sections from 1.o precede sections from 2.o * within a file, `.a` and `.b` appear in the section header table order This patch implements the behavior. The interaction with `SORT` and --sort-section is: Matched sections are ordered by radix sort with the keys being `(SORT, --sort-section, input order)`, where `SORT` (if present) is most significant. > Note, multiple `SORT` within an input section description has undocumented and > confusing behaviors in GNU ld: > https://sourceware.org/pipermail/binutils/2020-November/114083.html > Therefore multiple `SORT` is not the focus for this patch but > this patch still strives to have an explainable behavior. As an example, we partition `SORT(a.) b.* c.* SORT(d.)`, into `SORT(a.) | b.* c.* | SORT(d.)` and perform sorting within groups. Sections matched by patterns between two `SORT` are sorted by input order. If --sort-alignment is given, they are sorted by --sort-alignment, breaking tie by input order. This patch also allows a section to be matched by multiple patterns, previously duplicated sections could occupy more space in the output and had erroneous zero bytes. The patch is in preparation for support for `(SORT_BY_INIT_PRIORITY(.init_array. .ctors.)) (.init_array .ctors)`, which will allow LLD to mix .ctors/.init_array like GNU ld (gold's --ctors-in-init-array) PR44698 and PR48096 Reviewed By: grimar, psmith Differential Revision: https://reviews.llvm.org/D91127	2020-11-12 08:53:11 -08:00

895 Commits