llvm-project

Commit Graph

Author	SHA1	Message	Date
Igor Kudrin	65c284a7be	[ELF][test][NFC] Make a test standard compliant PT_LOAD segments in the program header must be sorted by their virtual addresses, so they should be defined in a similar order as the associated sections. Differential Revision: https://reviews.llvm.org/D111068	2021-10-05 11:40:02 +07:00
Fangrui Song	2bf06d9345	[ELF] Support symbol names with space in linker script expressions Fix PR51961 Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D110490	2021-09-27 09:50:42 -07:00
Fangrui Song	a892c0e49e	[ELF][test] Improve test coverage	2021-09-25 11:57:54 -07:00
Fangrui Song	54e76cb17a	[split-file] Default to --no-leading-lines It turns out that the --leading-lines may be a bad default. [[#@LINE+-num]] is rarely used.	2021-08-16 19:23:11 -07:00
Fangrui Song	9bd29a73d1	[ELF] Make dot in .tbss correct GNU ld doesn't support multiple SHF_TLS SHT_NOBITS output sections (it restores the address after an SHF_TLS SHT_NOBITS section, so consecutive SHF_TLS SHT_NOBITS sections will have conflicting address ranges). That said, `threadBssOffset` implements limited support for consecutive SHF_TLS SHT_NOBITS sections. (SHF_TLS SHT_PROGBITS following a SHF_TLS SHT_NOBITS can still be incorrect.) `.` in an output section description of an SHF_TLS SHT_NOBITS section is incorrect. (https://lists.llvm.org/pipermail/llvm-dev/2021-July/151974.html) This patch saves the end address of the previous tbss section in `ctx->tbssAddr`, changes `dot` in the beginning of `assignOffset` so that `.` evaluation will be correct. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107208	2021-08-04 08:58:50 -07:00
Jessica Clarke	cfaa5bf4ce	[ELF] Align the first section of a PT_TLS even if its type is SHT_NOBITS This is somewhat of a repeat of D66658 but for sections in PT_TLS segments. Although such sections don't need to be aligned such that address and offset are congruent modulo the page size, they do need to be congruent modulo the segment alignment, otherwise the whole PT_TLS will be unaligned. We therefore use the normal calculation to determine the section's address within the PT_LOAD rather than bailing out early due to being SHT_NOBITS. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D106987	2021-07-29 15:14:00 +01:00
Jessica Clarke	b96bb7899f	[ELF] Add two new tests showing broken .tbss alignment if first in PT_TLS This is a similar problem to D66658, where we are too aggressive in not aligning NOBITS sections, and the tests are based on the ones added for that fix. If a .tbss section is first in a PT_TLS segment (i.e. there is no .tdata section) then, although it doesn't need to be aligned such that address and offset are congruent modulo the page size, they do need to be congruent modulo the segment alignment, otherwise the whole PT_TLS will be unaligned. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D106986	2021-07-29 15:13:52 +01:00
Fangrui Song	e7a7ad134f	[ELF] Support quoted symbols in symbol assignments glibc/elf/tst-absolute-zero-lib.lds uses `"absolute" = 0;`	2021-07-25 16:26:37 -07:00
Fangrui Song	3c9d86f951	[ELF][test] Avoid llvm-readelf/llvm-readobj one-dash long options	2021-07-16 10:02:47 -07:00
Fangrui Song	03051f7ac8	[ELF] Preserve section order within an INSERT AFTER command For ``` SECTIONS { text.0 : {} text.1 : {} text.2 : {} } INSERT AFTER .data; ``` the current order is `.data text.2 text.1 text.0`. It makes more sense to preserve the specified order and thus improve compatibility with GNU ld. For ``` SECTIONS { text.0 : {} } INSERT AFTER .data; SECTIONS { text.3 : {} } INSERT AFTER .data; ``` GNU ld somehow collects sections with `INSERT AFTER .data` together (IMO inconsistent) but I think it makes more sense to execute the commands in order and get `.data text.3 text.0` instead. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D105158	2021-06-30 11:35:50 -07:00
Fangrui Song	2508733e1b	[ELF] --sysroot: change sysrooted script to not fall back for an absolute path Modify the D13209 logic: for a script inside the sysroot, if an absolute path does not exist, report an error instead of falling back to the path without the sysroot prefix. This matches GNU ld, which makes sense to me: we don't want to find an arbitrary file in the host. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D104894	2021-06-25 12:52:39 -07:00
Konstantin Schwarz	5d621ed85d	[ELF] Consider that NOLOAD sections should be placed in a PT_LOAD segment During PHDR creation, the case where an output section does not require a PT_LOAD header but still occupies memory in the current VMA region was not handled. If such an output section interleaves two output sections that have the same VMA and LMA regions set, we would previously re-use the existing PT_LOAD header for the second output section. However, since the memory region is not contiguous, we need to start a new PT_LOAD segment. This fixes https://bugs.llvm.org/show_bug.cgi?id=50558 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D103815	2021-06-16 12:36:45 +02:00
Fangrui Song	899fdf548e	[ELF] Add OVERWRITE_SECTIONS command This implements https://sourceware.org/bugzilla/show_bug.cgi?id=26404 An `OVERWRITE_SECTIONS` command is a `SECTIONS` variant which contains several output section descriptions. The output sections do not have specify an order. Similar to `INSERT [BEFORE\|AFTER]`, `LinkerScript::hasSectionsCommand` is not set, so the built-in rules (see `docs/ELF/linker_script.rst`) still apply. `OVERWRITE_SECTIONS` can be more convenient than `INSERT` because it does not need an anchor section. The initial syntax is intentionally narrow to facilitate backward compatible extensions in the future. Symbol assignments cannot be used. This feature is versatile. To list a few usage: * Use `section : { KEEP(...) }` to retain input sections under GC * Define encapsulation symbols (start/end) for an output section * Use `section : ALIGN(...) : { ... }` to overalign an output section (similar to ld64 `-sectalign`) When an output section is specified by both `OVERWRITE_SECTIONS` and `INSERT`, `INSERT` is processed after overwrite sections. To make this work, this patch changes `InsertCommand` to use name based matching instead of pointer based matching. (This may cause a difference when `INSERT` moves one output section more than once. Such duplicate commands should not be used in practice (seems that in GNU ld the output sections may just disappear).) A linker script can be used without -T/--script. The traditional `SECTIONS` commands are concatenated, so a wrong rule can be more noticeable from the section order. This feature if misused can be less noticeable, just like `INSERT`. Differential Revision: https://reviews.llvm.org/D103303	2021-06-13 12:41:11 -07:00
Fangrui Song	6d2d3bd0a6	[ELF] Default to -z start-stop-gc with a glibc "__libc_" special case Change the default to facilitate GC for metadata section usage, so that they don't need SHF_LINK_ORDER or SHF_GROUP just to drop the unhelpful rule (if they want to be unconditionally retained, use SHF_GNU_RETAIN (`__attribute__((retain))`) or linker script `KEEP`). The dropped SHF_GROUP special case makes the behavior of -z start-stop-gc and -z nostart-stop-gc closer to GNU ld>=2.37 (https://sourceware.org/PR27451). However, we default to -z start-stop-gc (which actually matches more closely to GNU ld before 2015-10 https://sourceware.org/PR19167), which is different from modern GNU ld (which has the unhelpful rule to work around glibc). As a compensation, we special case `__libc_` sections as a workaround for glibc<2.34 (https://sourceware.org/PR27492). Since -z start-stop-gc as the default actually matches the traditional GNU ld behavior, there isn't much to be aware of. There was a systemd usage which has been fixed by https://github.com/systemd/systemd/pull/19144	2021-04-16 12:18:46 -07:00
Fangrui Song	e4f385d894	[ELF] Support . and $ in symbol names in expressions GNU ld supports `.` and `$` in symbol names while LLD doesn't support them in `readPrimary` expressions. Using `.` can result in such an error: ``` https://github.com/ClangBuiltLinux/linux/issues/1318 ld.lld: error: ./arch/powerpc/kernel/vmlinux.lds:255: malformed number: .TOC. >>> __toc_ptr = (DEFINED (.TOC.) ? .TOC. : ADDR (.got)) + 0x8000; ``` Allow `.` (ppc64 special symbol `.TOC.`) and `$` (RISC-V special symbol `__global_pointer$`). Change `diag[3-5].test` to use an invalid character `^`. Note: GNU ld allows `~` in non-leading positions of a symbol name. `~` is not used in practice, conflicts with the unary operator, and can cause some parsing difficulty, so this patch does not add it. Differential Revision: https://reviews.llvm.org/D98306	2021-03-11 09:34:36 -08:00
Fangrui Song	962b29d716	ELFObjectWriter: Don't sort non-local symbols As we don't sort local symbols, don't sort non-local symbols. This makes non-local symbols appear in their register order, which matches GNU as. The register order is nice in that you can write tests with interleaved CHECK prefixes, e.g. ``` // CHECK: something about foo .globl foo foo: // CHECK: something about bar .globl bar bar: ``` With the lexicographical order, the user needs to place lexicographical smallest symbol first or keep CHECK prefixes in one place.	2021-02-13 10:32:27 -08:00
Fangrui Song	1f69355802	[test] Make ELF tests amenable to the order of non-local symbols	2021-02-12 21:00:42 -08:00
Bob Haarman	8e0b179315	[ELF] report section sizes when output file too large Fixes PR48523. When the linker errors with "output file too large", one question that comes to mind is how the section sizes differ from what they were previously. Unfortunately, this information is lost when the linker exits without writing the output file. This change makes it so that the error message includes the sizes of the largest sections. Reviewed By: MaskRay, grimar, jhenderson Differential Revision: https://reviews.llvm.org/D94560	2021-01-21 19:47:03 +00:00
Fangrui Song	16cb7910f5	[ELF] --emit-relocs: fix a crash if .rela.dyn is an empty output section Fix PR48357: If .rela.dyn appears as an output section description, its type may be SHT_RELA (due to the empty synthetic .rela.plt) while there is no input section. The empty .rela.dyn may be retained due to a reference in a linker script. Don't crash. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D93367	2020-12-16 08:59:38 -08:00
Georgii Rymar	3f5dc57fd1	[LLD][ELF] - Don't keep empty output sections which have explicit program headers. This reverts a side effect introduced in the code cleanup patch D43571: LLD started to emit empty output sections that are explicitly assigned to a segment. This patch fixes the issue by removing the !sec.phdrs.empty() special case from isDiscardable. As compensation, we add an early phdrs propagation step (see the inline comment). This is similar to one that we do in adjustSectionsAfterSorting. Differential revision: https://reviews.llvm.org/D92301	2020-12-02 11:19:21 +03:00
Fangrui Song	40a42f9f3f	[ELF] Make SORT_INIT_PRIORITY support .ctors.N Input sections `.ctors/.ctors.N` may go to either the output section `.init_array` or the output section `.ctors`: * output `.ctors`: currently we sort them by name. This patch changes to sort by priority from high to low. If N in `.ctors.N` is in the form of %05u, there is no semantic difference. Actually GCC and Clang do use %05u. (In the test `ctors_dtors_priority.s` and Gold's test `gold/testsuite/script_test_14.s`, we can see %03u, but they are not really produced by compilers.) * output `.init_array`: users can provide an input section description `SORT_BY_INIT_PRIORITY(.init_array.* .ctors.)` to mix `.init_array.` and `.ctors.`. This can make .init_array.N and .ctors.(65535-N) interchangeable. With this change, users can mix `.ctors.N` and `.init_array.N` in `.init_array` (PR44698 and PR48096) with linker scripts. As an example: ``` SECTIONS { .init_array : { (SORT_BY_INIT_PRIORITY(.init_array.* .ctors.)) (.init_array EXCLUDE_FILE (crtbegin.o crtbegin?.o crtend.o crtend?.o ) .ctors) } } INSERT AFTER .fini_array; SECTIONS { .fini_array : { (SORT_BY_INIT_PRIORITY(.fini_array. .dtors.)) (.fini_array EXCLUDE_FILE (crtbegin.o crtbegin?.o crtend.o crtend?.o ) .dtors) } } INSERT BEFORE .init_array; ``` Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D91187	2020-11-12 08:56:12 -08:00
Fangrui Song	73d01a80ce	[ELF] Sort by input order within an input section description According to https://sourceware.org/binutils/docs/ld/Input-Section-Basics.html#Input-Section-Basics for `(.a .b)`, the order should match the input order: for `ld 1.o 2.o`, sections from 1.o precede sections from 2.o * within a file, `.a` and `.b` appear in the section header table order This patch implements the behavior. The interaction with `SORT` and --sort-section is: Matched sections are ordered by radix sort with the keys being `(SORT, --sort-section, input order)`, where `SORT` (if present) is most significant. > Note, multiple `SORT` within an input section description has undocumented and > confusing behaviors in GNU ld: > https://sourceware.org/pipermail/binutils/2020-November/114083.html > Therefore multiple `SORT` is not the focus for this patch but > this patch still strives to have an explainable behavior. As an example, we partition `SORT(a.) b.* c.* SORT(d.)`, into `SORT(a.) \| b.* c.* \| SORT(d.)` and perform sorting within groups. Sections matched by patterns between two `SORT` are sorted by input order. If --sort-alignment is given, they are sorted by --sort-alignment, breaking tie by input order. This patch also allows a section to be matched by multiple patterns, previously duplicated sections could occupy more space in the output and had erroneous zero bytes. The patch is in preparation for support for `(SORT_BY_INIT_PRIORITY(.init_array. .ctors.)) (.init_array .ctors)`, which will allow LLD to mix .ctors/.init_array like GNU ld (gold's --ctors-in-init-array) PR44698 and PR48096 Reviewed By: grimar, psmith Differential Revision: https://reviews.llvm.org/D91127	2020-11-12 08:53:11 -08:00
Fangrui Song	2a9aed0e8b	[ELF] Support multiple SORT in an input section description The second `SORT` in `(SORT(...) SORT(...))` is incorrectly parsed as a file pattern. Fix the bug by stopping at `SORT` in `readInputSectionsList`. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D91180	2020-11-12 08:46:53 -08:00
Georgii Rymar	41726f8d5b	[llvm-readobj] - Print "Unknown" when a program header is unknown. Currently, when a program header type is unknown, we dont print anything: ``` ProgramHeader { Type: (0x60000000) ``` With this patch the output will be: ``` ProgramHeader { Type: Unknown (0x60000000) ``` It was discussed in D85526 and consistent with what we print for '--sections' already, e.g.: ``` Section { Name: .sec Type: Unknown (0x7FFFFFFF) } ``` Differential revision: https://reviews.llvm.org/D86213	2020-08-25 13:05:17 +03:00
Fangrui Song	9670029b6b	[ELF] Keep st_type for symbol assignment PR46970: for `alias = aliasee`, the alias can be used in relocation processing and on ARM st_type does affect Thumb interworking. It is thus desirable for the alias to get the same st_type. Note that the st_size field should not be inherited because some tools use st_size=0 as a heuristic to detect aliases. Retaining st_size can thwart such heuristics and cause aliases to be preferred over the original symbols. Differential Revision: https://reviews.llvm.org/D86263	2020-08-20 16:05:27 -07:00
Fangrui Song	ec29538af2	[ELF] Assign file offsets of non-SHF_ALLOC after SHF_ALLOC and set sh_addr=0 to non-SHF_ALLOC * GNU ld places non-SHF_ALLOC sections after SHF_ALLOC sections. This has the advantage that the file offsets of a non-SHF_ALLOC cannot be contained in a PT_LOAD. This patch matches the behavior. * For non-SHF_ALLOC non-orphan sections, GNU ld may assign non-zero sh_addr and treat them similar to SHT_NOBITS (not advance location counter). This is an alternative approach to what we have done in D85100. By placing non-SHF_ALLOC sections at the end, we can drop special cases in createSection and findOrphanPos added by D85100. Different from GNU ld, we set sh_addr to 0 for non-SHF_ALLOC sections. 0 arguably is better because non-SHF_ALLOC sections don't appear in the memory image. ELF spec says: > sh_addr - If the section will appear in the memory image of a process, this > member gives the address at which the section's first byte should > reside. Otherwise, the member contains 0. D85100 appeared to take a detour. If we take a combined view on D85100 and this patch, the overall complexity slightly increases (one more 3-line loop) and compatibility with GNU ld improves. The behavior we don't want to match is the special treatment of .symtab .shstrtab .strtab: they can be matched in LLD but not in GNU ld. Reviewed By: jhenderson, psmith Differential Revision: https://reviews.llvm.org/D85867	2020-08-18 09:03:01 -07:00
Fangrui Song	e8a11c0558	[ELF] Allow mixed SHF_LINK_ORDER & non-SHF_LINK_ORDER sections and sort within InputSectionDescription LLD currently does not allow non-contiguous SHF_LINK_ORDER components in an output section. This makes it infeasible to add SHF_LINK_ORDER to an existing metadata section if backward compatibility with older object files are concerned. We did not allow mixed components (like GNU ld) and D77007 relaxed to allow non-contiguous SHF_LINK_ORDER components. This patch allows arbitrary mix, with sorting performed within an InputSectionDescription. For example, `.rodata : {(.rodata.foo) (.rodata.bar)}`, has two InputSectionDescription's. If there is at least one SHF_LINK_ORDER and at least one non-SHF_LINK_ORDER in .rodata.foo, they are ordered within `(.rodata.foo)`: we arbitrarily place SHF_LINK_ORDER components before non-SHF_LINK_ORDER components (like Solaris ld). `(.rodata.bar)` is ordered similarly, but the two InputSectionDescription's don't interact. It can be argued that this is more reasonable than the previous behavior where written order was not respected. It would be nice if the two different semantics (ordering requirement & garbage collection) were not overloaded on one section flag, however, it is probably difficult to obtain a generic flag at this point (https://groups.google.com/forum/#!topic/generic-abi/hgx_m1aXqUo "SHF_LINK_ORDER's original semantics make upgrade difficult"). (Actually, without the GC semantics, SHF_LINK_ORDER would still have the sh_link!=0 & sh_link=0 issue. It is just that people find the GC semantics more useful and tend to use the feature more often.) GNU ld feature request: https://sourceware.org/bugzilla/show_bug.cgi?id=16833 Differential Revision: https://reviews.llvm.org/D84001	2020-08-17 11:29:05 -07:00
Fangrui Song	a6db64ef4a	[ELF] Allow sections after a non-SHF_ALLOC section to be covered by PT_LOAD GNU ld allows sections after a non-SHF_ALLOC section to be covered by PT_LOAD (PR37607) and assigns addresses to non-SHF_ALLOC output sections (similar to SHF_ALLOC NOBITS sections. The location counter is not advanced). This patch tries to fix PR37607 (remove a special case in `Writer<ELFT>::createPhdrs`). To make the created PT_LOAD meaningful, we cannot reset dot to 0 for a middle non-SHF_ALLOC output section. This results in removal of two special cases in LinkerScript::assignOffsets. Non-SHF_ALLOC non-orphan sections can have non-zero addresses like in GNU ld. The zero address rule for non-SHF_ALLOC sections is weakened to apply to orphan only. This results in a special case in createSection and findOrphanPos, respectively. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D85100	2020-08-06 08:27:15 -07:00
Muhammad Omair Javaid	d9e191cb17	Revert "[ELF] Allow sections after a non-SHF_ALLOC section to be covered by PT_LOAD" This reverts commit `030ddc0a0b`. This breaks http://lab.llvm.org:8011/builders/lldb-arm-ubuntu and http://lab.llvm.org:8011/builders/lldb-aarch64-ubuntu Differential Revision: https://reviews.llvm.org/D85100	2020-08-06 16:30:05 +05:00
Fangrui Song	030ddc0a0b	[ELF] Allow sections after a non-SHF_ALLOC section to be covered by PT_LOAD GNU ld allows sections after a non-SHF_ALLOC section to be covered by PT_LOAD (PR37607) and assigns addresses to non-SHF_ALLOC output sections (similar to SHF_ALLOC NOBITS sections. The location counter is not advanced). This patch tries to fix PR37607 (remove a special case in `Writer<ELFT>::createPhdrs`). To make the created PT_LOAD meaningful, we cannot reset dot to 0 for a middle non-SHF_ALLOC output section. This results in removal of two special cases in LinkerScript::assignOffsets. Non-SHF_ALLOC non-orphan sections can have non-zero addresses like in GNU ld. The zero address rule for non-SHF_ALLOC sections is weakened to apply to orphan only. This results in a special case in createSection and findOrphanPos, respectively. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D85100	2020-08-05 09:30:23 -07:00
Fangrui Song	bcea3a7a28	Add test utility 'split-file' See https://lists.llvm.org/pipermail/llvm-dev/2020-July/143373.html "[llvm-dev] Multiple documents in one test file" for some discussions. This patch has explored several alternatives. The current semantics are similar to what @dblaikie proposed. `split-file filename output` splits the input file into multiple parts separated by regex `^(.\|//)--- filename` and write each part to the file `output/filename` (`filename` can include path separators). Use case A (organizing input of different formats (e.g. linker script+assembly) in one file). ``` # RUN: split-file %s %t # RUN: llvm-mc %t/asm -o %t.o # RUN: ld.lld -T %t/lds %t.o -o %t This is sometimes better than the %S/Inputs/ approach because the user can see the auxiliary files immediately and don't have to open another file. # asm ... # lds ... ``` Use case B (for utilities which don't have built-in input splitting feature): ``` // RUN: split-file %s %t // RUN: llc < %t/1.ll \| FileCheck %s --check-prefix=CASE1 // RUN: llc < %t/2.ll \| FileCheck %s --check-prefix=CASE2 Combing tests prudently can improve readability. For example, when testing parsing errors if the recovery mechanism isn't possible, grouping the tests in one file can more readily see test coverage/strategy. //--- 1.ll ... //--- 2.ll ... ``` Since this is a new utility, there is no git history concerns for UpperCase variable names. I use lowerCase variable names like mlir/lld. Reviewed By: jhenderson, lattner Differential Revision: https://reviews.llvm.org/D83834	2020-08-03 20:42:09 -07:00
Fangrui Song	dd405f1a53	Revert D83834 "Add test utility 'extract'" This reverts commit `d054c7ee2e`. There are discussions about the utility name, its functionality and user interface. Revert before we reach consensus.	2020-07-28 13:26:33 -07:00
Hafiz Abid Qadeer	1f166edeb4	[lld][linkerscript] Fix handling of DEFINED. Current implementation did not check that symbols is actually defined. Only checked for presence. GNU ld documentation says, "Return 1 if symbol is in the linker global symbol table and is defined before the statement using DEFINED in the script, otherwise return 0." https://sourceware.org/binutils/docs/ld/Builtin-Functions.html#Builtin-Functions Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D83758	2020-07-28 21:18:01 +01:00
Isaac Richter	fa1145a8d2	[lld][ELF] Add LOG2CEIL builtin ldscript function This patch adds support for the LOG2CEIL builtin function in linker scripts: https://sourceware.org/binutils/docs/ld/Builtin-Functions.html#index-LOG2CEIL_0028exp_0029 As documented for LD, and to keep compatibility, LOG2CEIL(0) returns 0 (not -inf). The test vectors are somewhat arbitrary. We check minimum values (0-4); middle values (2^32, and 2^32+1); and the maximum value (2^64-1). The checks for LOG2CEIL explicitly use full 64-bit values (16 hex digits). This is needed to properly verify that -inf and other interesting results aren't returned. (For some reason, all other tests in operators.test use only 14 digits.) Differential revision: https://reviews.llvm.org/D84054	2020-07-27 12:16:43 +03:00
Georgii Rymar	ae4279bd3e	[LLD][ELF] - Linkerscript: report location for the "unclosed comment in a linker script" error. Currently we print "error: unclosed comment in a linker script", which doesn't provide information about the real error location. Fixes https://bugs.llvm.org/show_bug.cgi?id=46793. Differential revision: https://reviews.llvm.org/D84300	2020-07-24 11:38:26 +03:00
Fangrui Song	d054c7ee2e	Add test utility 'extract' See https://lists.llvm.org/pipermail/llvm-dev/2020-July/143373.html "[llvm-dev] Multiple documents in one test file" for some discussions. `extract part filename` splits the input file into multiple parts separated by regex `^(.\|//)--- ` and extract the specified part to stdout or the output file (if specified). Use case A (organizing input of different formats (e.g. linker script+assembly) in one file). ``` // RUN: extract lds %s -o %t.lds // RUN: extract asm %s -o %t.s // RUN: llvm-mc %t.s -o %t.o // RUN: ld.lld -T %t.lds %t.o -o %t This is sometimes better than the %S/Inputs/ approach because the user can see the auxiliary files immediately and don't have to open another file. ``` Use case B (for utilities which don't have built-in input splitting feature): ``` // RUN: extract case1 %s \| llc \| FileCheck %s --check-prefix=CASE1 // RUN: extract case2 %s \| llc \| FileCheck %s --check-prefix=CASE2 Combing tests prudently can improve readability. This is sometimes better than having multiple test files. ``` Since this is a new utility, there is no git history concerns for UpperCase variable names. I use lowerCase variable names like mlir/lld. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D83834	2020-07-23 19:15:35 -07:00
Fangrui Song	8ffb2097cc	[ELF] Refine LMA offset propagation rule in D76995 If neither AT(lma) nor AT>lma_region is specified, D76995 keeps `lmaOffset` (LMA - VMA) if the previous section is in the default LMA region. This patch additionally checks that the two sections are in the same memory region. Add a test case derived from https://bugs.llvm.org/show_bug.cgi?id=45313 .mdata : AT(0xfb01000) { (.data); } > TCM // It is odd to make .bss inherit lmaOffset, because the two sections // are in different memory regions. .bss : { (.bss) } > DDR With this patch, section VMA/LMA match GNU ld. Note, GNU ld supports out-of-order (w.r.t sh_offset) sections and places .text and .bss in the same PT_LOAD. We don't have that behavior. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D81986	2020-06-19 09:11:33 -07:00
Fangrui Song	c49f83b6e9	[ELF] Don't advance sh_offset for an empty section whose PT_LOAD is removed (due to p_memsz=0) removeEmptyPTLoad() removes empty (p_memsz=0) PT_LOAD segments. In assignFileOffsets(), setFileOffset() unnecessarily advances file offsets for containing empty sections. This is exposed by arm Linux kernel's multi_v5_defconfig (see https://bugs.llvm.org/show_bug.cgi?id=45632) ``` ld.lld (max-page-size=65536): [34] .init.data PROGBITS c0c24000 c34000 0128ac 00 WA 0 0 4096 [35] .text_itcm PROGBITS fffe0000 c50000 000000 00 WA 0 0 1 [36] .data_dtcm PROGBITS fffe8000 c58000 000000 00 WA 0 0 1 [37] .data PROGBITS c0c38000 c58000 0647a0 00 WA 0 0 32 arm-linux-gnueabi-ld (max-page-size=65536): [23] .init.data PROGBITS c0c12000 c22000 0128ac 00 WA 0 0 4096 [24] .text_itcm PROGBITS fffe0000 ca2558 000000 00 W 0 0 1 [25] .data_dtcm PROGBITS fffe8000 ca2558 000000 00 W 0 0 1 [26] .data PROGBITS c0c26000 c36000 0647a0 00 WA 0 0 32 ``` This patch clears OutputSection::ptLoad if ptLoad is removed by removeEmptyPTLoad(). Conceptually this removes "dangling" references. Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D79254	2020-05-04 08:07:34 -07:00
Thomas Preud'homme	d735c7048c	[test] Fix lld's ELF/linkerscript/thunk-gen-mips.s Summary: Lld test ELF/linkerscript/thunk-gen-mips.s was accidentally disabled due to the use of wrong FileCheck directives. As a result the test seems to have bitrotted as it fails to pass if fixing the directive. To ease updates to the test in case of change of the __start address the checks have been changed to use numeric variables to express all the addresses based on the __start address. Reviewed By: atanasyan Differential Revision: https://reviews.llvm.org/D79270	2020-05-02 22:49:23 +01:00
Fangrui Song	5c86b08a6f	[ELF][test] Improve tests Prepare for the upcomong change that removes unneeded sh_offset advancement for empty sections whose PT_LOAD are removed.	2020-05-01 11:27:51 -07:00
Thomas Preud'homme	9ecddde321	[test] Fix ELF/linkerscript/input-archive.s w/ @ in path Lld test ELF/linkerscript/input-archive.s fails when path contain a @ because is not accepted in unquoted token in linker scripts which leads to the path being broken in 2 around the @. This commit quotes the path used in the linker script created by this and similar testcases allowing the test to pass even in the presence of an @ sign in the path. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D79103	2020-04-30 20:14:22 +01:00
Fangrui Song	c384ca3c6a	[ELF] For relative paths in INPUT() and GROUP(), search the directory of the current linker script before searching other paths For a relative path in INPUT() or GROUP(), this patch changes the search order by adding the directory of the current linker script. The new search order (consistent with GNU ld >= 2.35 regarding the new test `test/ELF/input-relative.s`): 1. the directory of the current linker script (GNU ld from Binutils 2.35 onwards; https://sourceware.org/bugzilla/show_bug.cgi?id=25806) 2. the current working directory 3. library paths (-L) This behavior makes it convenient to replace a .so or .a with a linker script with additional input. For example, glibc ``` % cat /usr/lib/x86_64-linux-gnu/libm.a /* GNU ld script */ OUTPUT_FORMAT(elf64-x86-64) GROUP ( /usr/lib/x86_64-linux-gnu/libm-2.29.a /usr/lib/x86_64-linux-gnu/libmvec.a ) ``` could be simplified as `GROUP(libm-2.29.a libmvec.a)`. Another example is to make libc++.a a linker script: ``` INPUT(libc++.a.1 libc++abi.a) ``` Note, -l is not affected. Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D77779	2020-04-22 12:34:20 -07:00
Kazuaki Ishizaki	7c5fcb3591	[lld] NFC: fix trivial typos in comments Differential Revision: https://reviews.llvm.org/D72339	2020-04-02 01:21:36 +09:00
Fangrui Song	bb4a36ea28	[ELF] Propagate LMA offset to sections with neither AT() nor AT> Fixes https://bugs.llvm.org/show_bug.cgi?id=45313 Also fixes linkerscript/{at4.s,overlay.test} LMA address issues exposed by `011b785505`. Related: D74297 This patch improves emulation of GNU ld's heuristics on the difference between the LMA and the VMA: https://sourceware.org/binutils/docs/ld/Output-Section-LMA.html#Output-Section-LMA New test linkerscript/lma-offset.s (based on at4.s) demonstrates some behaviors. Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D76995	2020-04-01 08:19:06 -07:00
Fangrui Song	51475e4023	[ELF][test] Add linkerscript/linkorder-linked-to.s Delete relocatable-linkorder.s which is covered.	2020-03-30 15:17:29 -07:00
Fangrui Song	673e81eee4	[ELF] Allow SHF_LINK_ORDER and non-SHF_LINK_ORDER to be mixed Currently, `error: incompatible section flags for .rodata` is reported when we mix SHF_LINK_ORDER and non-SHF_LINK_ORDER sections in an output section. This is overconstrained. This patch allows mixed flags with the requirement that SHF_LINK_ORDER sections must be contiguous. Mixing flags is used by Linux aarch64 (https://github.com/ClangBuiltLinux/linux/issues/953) .init.data : { ... KEEP(*(__patchable_function_entries)) ... } When the integrated assembler is enabled, clang's -fpatchable-function-entry=N[,M] implementation sets the SHF_LINK_ORDER flag (D72215) to fix a number of garbage collection issues. Strictly speaking, the ELF specification does not require contiguous SHF_LINK_ORDER sections but for many current uses of SHF_LINK_ORDER like .ARM.exidx/__patchable_function_entries there has been a requirement for the sections to be contiguous on top of the requirements of the ELF specification. This patch also imposes one restriction: SHF_LINK_ORDER sections cannot be separated by a symbol assignment or a BYTE command. Not allowing BYTE is a natural extension that a non-SHF_LINK_ORDER cannot be a separator. Symbol assignments can delimiter the contents of SHF_LINK_ORDER sections. Allowing SHF_LINK_ORDER sections across symbol assignments (especially __start_/__stop_) can make things hard to explain. The restriction should not be a problem for practical use cases. Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D77007	2020-03-30 10:03:55 -07:00
Fangrui Song	2d19270efc	[ELF][test] Improve linkerscript/linkorder.s	2020-03-30 09:34:29 -07:00
Matt Schulte	fdc41aa22c	[lld][ELF] Mark empty NOLOAD output sections SHT_NOBITS instead of SHT_PROGBITS This fixes PR# 45336. Output sections described in a linker script as NOLOAD with no input sections would be marked as SHT_PROGBITS. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D76981	2020-03-28 10:07:58 -07:00
James Henderson	3ff3c6986b	[lld][ELF] Fix error message The error previously talked about a "section header" but was actually referring to a program header. Reviewed by: grimar, MaskRay Differential Revision: https://reviews.llvm.org/D76846	2020-03-26 15:30:24 +00:00
Fangrui Song	9e33c09647	[ELF] Keep orphan section names (.rodata.foo .text.foo) unchanged if !hasSectionsCommand This behavior matches GNU ld and seems reasonable. ``` // If a SECTIONS command is not specified .text.* -> .text .rodata.* -> .rodata .init_array.* -> .init_array ``` A proposed Linux feature CONFIG_FG_KASLR may depend on the GNU ld behavior. Reword a comment about -z keep-text-section-prefix and a comment about CommonSection (deleted by rL286234). Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D75225	2020-03-23 10:30:06 -07:00

1 2 3 4 5 ...

630 Commits