llvm-project

Commit Graph

Author	SHA1	Message	Date
Georgii Rymar	41726f8d5b	[llvm-readobj] - Print "Unknown" when a program header is unknown. Currently, when a program header type is unknown, we don't print anything: ``` ProgramHeader { Type: (0x60000000) ``` With this patch the output will be: ``` ProgramHeader { Type: Unknown (0x60000000) ``` It was discussed in D85526 and is consistent with what we print for '--sections' already, e.g.: ``` Section { Name: .sec Type: Unknown (0x7FFFFFFF) } ``` Differential revision: https://reviews.llvm.org/D86213	2020-08-25 13:05:17 +03:00
Fangrui Song	9670029b6b	[ELF] Keep st_type for symbol assignment PR46970: for `alias = aliasee`, the alias can be used in relocation processing and on ARM st_type does affect Thumb interworking. It is thus desirable for the alias to get the same st_type. Note that the st_size field should not be inherited because some tools use st_size=0 as a heuristic to detect aliases. Retaining st_size can thwart such heuristics and cause aliases to be preferred over the original symbols. Differential Revision: https://reviews.llvm.org/D86263	2020-08-20 16:05:27 -07:00
Fangrui Song	ec29538af2	[ELF] Assign file offsets of non-SHF_ALLOC after SHF_ALLOC and set sh_addr=0 to non-SHF_ALLOC * GNU ld places non-SHF_ALLOC sections after SHF_ALLOC sections. This has the advantage that the file offsets of a non-SHF_ALLOC cannot be contained in a PT_LOAD. This patch matches the behavior. * For non-SHF_ALLOC non-orphan sections, GNU ld may assign non-zero sh_addr and treat them similar to SHT_NOBITS (not advance location counter). This is an alternative approach to what we have done in D85100. By placing non-SHF_ALLOC sections at the end, we can drop special cases in createSection and findOrphanPos added by D85100. Different from GNU ld, we set sh_addr to 0 for non-SHF_ALLOC sections. 0 arguably is better because non-SHF_ALLOC sections don't appear in the memory image. ELF spec says: > sh_addr - If the section will appear in the memory image of a process, this > member gives the address at which the section's first byte should > reside. Otherwise, the member contains 0. D85100 appeared to take a detour. If we take a combined view on D85100 and this patch, the overall complexity slightly increases (one more 3-line loop) and compatibility with GNU ld improves. The behavior we don't want to match is the special treatment of .symtab .shstrtab .strtab: they can be matched in LLD but not in GNU ld. Reviewed By: jhenderson, psmith Differential Revision: https://reviews.llvm.org/D85867	2020-08-18 09:03:01 -07:00
Fangrui Song	e8a11c0558	[ELF] Allow mixed SHF_LINK_ORDER & non-SHF_LINK_ORDER sections and sort within InputSectionDescription LLD currently does not allow non-contiguous SHF_LINK_ORDER components in an output section. This makes it infeasible to add SHF_LINK_ORDER to an existing metadata section if backward compatibility with older object files is a concern. We did not allow mixed components (like GNU ld) and D77007 relaxed to allow non-contiguous SHF_LINK_ORDER components. This patch allows arbitrary mix, with sorting performed within an InputSectionDescription. For example, `.rodata : {*(.rodata.foo) *(.rodata.bar)}`, has two InputSectionDescription's. If there is at least one SHF_LINK_ORDER and at least one non-SHF_LINK_ORDER in .rodata.foo, they are ordered within `*(.rodata.foo)`: we arbitrarily place SHF_LINK_ORDER components before non-SHF_LINK_ORDER components (like Solaris ld). `*(.rodata.bar)` is ordered similarly, but the two InputSectionDescription's don't interact. It can be argued that this is more reasonable than the previous behavior where written order was not respected. It would be nice if the two different semantics (ordering requirement & garbage collection) were not overloaded on one section flag, however, it is probably difficult to obtain a generic flag at this point (https://groups.google.com/forum/#!topic/generic-abi/hgx_m1aXqUo "SHF_LINK_ORDER's original semantics make upgrade difficult"). (Actually, without the GC semantics, SHF_LINK_ORDER would still have the sh_link!=0 & sh_link=0 issue. It is just that people find the GC semantics more useful and tend to use the feature more often.) GNU ld feature request: https://sourceware.org/bugzilla/show_bug.cgi?id=16833 Differential Revision: https://reviews.llvm.org/D84001	2020-08-17 11:29:05 -07:00
Fangrui Song	a6db64ef4a	[ELF] Allow sections after a non-SHF_ALLOC section to be covered by PT_LOAD GNU ld allows sections after a non-SHF_ALLOC section to be covered by PT_LOAD (PR37607) and assigns addresses to non-SHF_ALLOC output sections (similar to SHF_ALLOC NOBITS sections. The location counter is not advanced). This patch tries to fix PR37607 (remove a special case in `Writer<ELFT>::createPhdrs`). To make the created PT_LOAD meaningful, we cannot reset dot to 0 for a middle non-SHF_ALLOC output section. This results in removal of two special cases in LinkerScript::assignOffsets. Non-SHF_ALLOC non-orphan sections can have non-zero addresses like in GNU ld. The zero address rule for non-SHF_ALLOC sections is weakened to apply to orphan only. This results in a special case in createSection and findOrphanPos, respectively. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D85100	2020-08-06 08:27:15 -07:00
Muhammad Omair Javaid	d9e191cb17	Revert "[ELF] Allow sections after a non-SHF_ALLOC section to be covered by PT_LOAD" This reverts commit `030ddc0a0b`. This breaks http://lab.llvm.org:8011/builders/lldb-arm-ubuntu and http://lab.llvm.org:8011/builders/lldb-aarch64-ubuntu Differential Revision: https://reviews.llvm.org/D85100	2020-08-06 16:30:05 +05:00
Fangrui Song	030ddc0a0b	[ELF] Allow sections after a non-SHF_ALLOC section to be covered by PT_LOAD GNU ld allows sections after a non-SHF_ALLOC section to be covered by PT_LOAD (PR37607) and assigns addresses to non-SHF_ALLOC output sections (similar to SHF_ALLOC NOBITS sections. The location counter is not advanced). This patch tries to fix PR37607 (remove a special case in `Writer<ELFT>::createPhdrs`). To make the created PT_LOAD meaningful, we cannot reset dot to 0 for a middle non-SHF_ALLOC output section. This results in removal of two special cases in LinkerScript::assignOffsets. Non-SHF_ALLOC non-orphan sections can have non-zero addresses like in GNU ld. The zero address rule for non-SHF_ALLOC sections is weakened to apply to orphan only. This results in a special case in createSection and findOrphanPos, respectively. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D85100	2020-08-05 09:30:23 -07:00
Fangrui Song	bcea3a7a28	Add test utility 'split-file' See https://lists.llvm.org/pipermail/llvm-dev/2020-July/143373.html "[llvm-dev] Multiple documents in one test file" for some discussions. This patch has explored several alternatives. The current semantics are similar to what @dblaikie proposed. `split-file filename output` splits the input file into multiple parts separated by regex `^(.|//)--- filename` and writes each part to the file `output/filename` (`filename` can include path separators). Use case A (organizing input of different formats (e.g. linker script+assembly) in one file). ``` # RUN: split-file %s %t # RUN: llvm-mc %t/asm -o %t.o # RUN: ld.lld -T %t/lds %t.o -o %t This is sometimes better than the %S/Inputs/ approach because the user can see the auxiliary files immediately and doesn't have to open another file. # asm ... # lds ... ``` Use case B (for utilities which don't have built-in input splitting feature): ``` // RUN: split-file %s %t // RUN: llc < %t/1.ll | FileCheck %s --check-prefix=CASE1 // RUN: llc < %t/2.ll | FileCheck %s --check-prefix=CASE2 Combining tests prudently can improve readability. For example, when testing parsing errors if the recovery mechanism isn't possible, grouping the tests in one file can more readily see test coverage/strategy. //--- 1.ll ... //--- 2.ll ... ``` Since this is a new utility, there are no git history concerns for UpperCase variable names. I use lowerCase variable names like mlir/lld. Reviewed By: jhenderson, lattner Differential Revision: https://reviews.llvm.org/D83834	2020-08-03 20:42:09 -07:00
Fangrui Song	dd405f1a53	Revert D83834 "Add test utility 'extract'" This reverts commit `d054c7ee2e`. There are discussions about the utility name, its functionality and user interface. Revert before we reach consensus.	2020-07-28 13:26:33 -07:00
Hafiz Abid Qadeer	1f166edeb4	[lld][linkerscript] Fix handling of DEFINED. The current implementation did not check that the symbol is actually defined; it only checked for presence. GNU ld documentation says, "Return 1 if symbol is in the linker global symbol table and is defined before the statement using DEFINED in the script, otherwise return 0." https://sourceware.org/binutils/docs/ld/Builtin-Functions.html#Builtin-Functions Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D83758	2020-07-28 21:18:01 +01:00
Isaac Richter	fa1145a8d2	[lld][ELF] Add LOG2CEIL builtin ldscript function This patch adds support for the LOG2CEIL builtin function in linker scripts: https://sourceware.org/binutils/docs/ld/Builtin-Functions.html#index-LOG2CEIL_0028exp_0029 As documented for LD, and to keep compatibility, LOG2CEIL(0) returns 0 (not -inf). The test vectors are somewhat arbitrary. We check minimum values (0-4); middle values (2^32, and 2^32+1); and the maximum value (2^64-1). The checks for LOG2CEIL explicitly use full 64-bit values (16 hex digits). This is needed to properly verify that -inf and other interesting results aren't returned. (For some reason, all other tests in operators.test use only 14 digits.) Differential revision: https://reviews.llvm.org/D84054	2020-07-27 12:16:43 +03:00
Georgii Rymar	ae4279bd3e	[LLD][ELF] - Linkerscript: report location for the "unclosed comment in a linker script" error. Currently we print "error: unclosed comment in a linker script", which doesn't provide information about the real error location. Fixes https://bugs.llvm.org/show_bug.cgi?id=46793. Differential revision: https://reviews.llvm.org/D84300	2020-07-24 11:38:26 +03:00
Fangrui Song	d054c7ee2e	Add test utility 'extract' See https://lists.llvm.org/pipermail/llvm-dev/2020-July/143373.html "[llvm-dev] Multiple documents in one test file" for some discussions. `extract part filename` splits the input file into multiple parts separated by regex `^(.|//)--- ` and extracts the specified part to stdout or the output file (if specified). Use case A (organizing input of different formats (e.g. linker script+assembly) in one file). ``` // RUN: extract lds %s -o %t.lds // RUN: extract asm %s -o %t.s // RUN: llvm-mc %t.s -o %t.o // RUN: ld.lld -T %t.lds %t.o -o %t This is sometimes better than the %S/Inputs/ approach because the user can see the auxiliary files immediately and doesn't have to open another file. ``` Use case B (for utilities which don't have built-in input splitting feature): ``` // RUN: extract case1 %s | llc | FileCheck %s --check-prefix=CASE1 // RUN: extract case2 %s | llc | FileCheck %s --check-prefix=CASE2 Combining tests prudently can improve readability. This is sometimes better than having multiple test files. ``` Since this is a new utility, there are no git history concerns for UpperCase variable names. I use lowerCase variable names like mlir/lld. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D83834	2020-07-23 19:15:35 -07:00
Fangrui Song	8ffb2097cc	[ELF] Refine LMA offset propagation rule in D76995 If neither AT(lma) nor AT>lma_region is specified, D76995 keeps `lmaOffset` (LMA - VMA) if the previous section is in the default LMA region. This patch additionally checks that the two sections are in the same memory region. Add a test case derived from https://bugs.llvm.org/show_bug.cgi?id=45313 .mdata : AT(0xfb01000) { *(.data); } > TCM // It is odd to make .bss inherit lmaOffset, because the two sections // are in different memory regions. .bss : { *(.bss) } > DDR With this patch, section VMA/LMA match GNU ld. Note, GNU ld supports out-of-order (w.r.t sh_offset) sections and places .text and .bss in the same PT_LOAD. We don't have that behavior. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D81986	2020-06-19 09:11:33 -07:00
Fangrui Song	c49f83b6e9	[ELF] Don't advance sh_offset for an empty section whose PT_LOAD is removed (due to p_memsz=0) removeEmptyPTLoad() removes empty (p_memsz=0) PT_LOAD segments. In assignFileOffsets(), setFileOffset() unnecessarily advances file offsets for containing empty sections. This is exposed by arm Linux kernel's multi_v5_defconfig (see https://bugs.llvm.org/show_bug.cgi?id=45632) ``` ld.lld (max-page-size=65536): [34] .init.data PROGBITS c0c24000 c34000 0128ac 00 WA 0 0 4096 [35] .text_itcm PROGBITS fffe0000 c50000 000000 00 WA 0 0 1 [36] .data_dtcm PROGBITS fffe8000 c58000 000000 00 WA 0 0 1 [37] .data PROGBITS c0c38000 c58000 0647a0 00 WA 0 0 32 arm-linux-gnueabi-ld (max-page-size=65536): [23] .init.data PROGBITS c0c12000 c22000 0128ac 00 WA 0 0 4096 [24] .text_itcm PROGBITS fffe0000 ca2558 000000 00 W 0 0 1 [25] .data_dtcm PROGBITS fffe8000 ca2558 000000 00 W 0 0 1 [26] .data PROGBITS c0c26000 c36000 0647a0 00 WA 0 0 32 ``` This patch clears OutputSection::ptLoad if ptLoad is removed by removeEmptyPTLoad(). Conceptually this removes "dangling" references. Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D79254	2020-05-04 08:07:34 -07:00
Thomas Preud'homme	d735c7048c	[test] Fix lld's ELF/linkerscript/thunk-gen-mips.s Summary: Lld test ELF/linkerscript/thunk-gen-mips.s was accidentally disabled due to the use of wrong FileCheck directives. As a result the test seems to have bitrotted as it fails to pass if fixing the directive. To ease updates to the test in case of change of the __start address the checks have been changed to use numeric variables to express all the addresses based on the __start address. Reviewed By: atanasyan Differential Revision: https://reviews.llvm.org/D79270	2020-05-02 22:49:23 +01:00
Fangrui Song	5c86b08a6f	[ELF][test] Improve tests Prepare for the upcoming change that removes unneeded sh_offset advancement for empty sections whose PT_LOAD are removed.	2020-05-01 11:27:51 -07:00
Thomas Preud'homme	9ecddde321	[test] Fix ELF/linkerscript/input-archive.s w/ @ in path Lld test ELF/linkerscript/input-archive.s fails when the path contains an @ because @ is not accepted in an unquoted token in linker scripts, which leads to the path being broken in two around the @. This commit quotes the path used in the linker script created by this and similar testcases, allowing the test to pass even in the presence of an @ sign in the path. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D79103	2020-04-30 20:14:22 +01:00
Fangrui Song	c384ca3c6a	[ELF] For relative paths in INPUT() and GROUP(), search the directory of the current linker script before searching other paths For a relative path in INPUT() or GROUP(), this patch changes the search order by adding the directory of the current linker script. The new search order (consistent with GNU ld >= 2.35 regarding the new test `test/ELF/input-relative.s`): 1. the directory of the current linker script (GNU ld from Binutils 2.35 onwards; https://sourceware.org/bugzilla/show_bug.cgi?id=25806) 2. the current working directory 3. library paths (-L) This behavior makes it convenient to replace a .so or .a with a linker script with additional input. For example, glibc ``` % cat /usr/lib/x86_64-linux-gnu/libm.a /* GNU ld script */ OUTPUT_FORMAT(elf64-x86-64) GROUP ( /usr/lib/x86_64-linux-gnu/libm-2.29.a /usr/lib/x86_64-linux-gnu/libmvec.a ) ``` could be simplified as `GROUP(libm-2.29.a libmvec.a)`. Another example is to make libc++.a a linker script: ``` INPUT(libc++.a.1 libc++abi.a) ``` Note, -l is not affected. Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D77779	2020-04-22 12:34:20 -07:00
Kazuaki Ishizaki	7c5fcb3591	[lld] NFC: fix trivial typos in comments Differential Revision: https://reviews.llvm.org/D72339	2020-04-02 01:21:36 +09:00
Fangrui Song	bb4a36ea28	[ELF] Propagate LMA offset to sections with neither AT() nor AT> Fixes https://bugs.llvm.org/show_bug.cgi?id=45313 Also fixes linkerscript/{at4.s,overlay.test} LMA address issues exposed by `011b785505`. Related: D74297 This patch improves emulation of GNU ld's heuristics on the difference between the LMA and the VMA: https://sourceware.org/binutils/docs/ld/Output-Section-LMA.html#Output-Section-LMA New test linkerscript/lma-offset.s (based on at4.s) demonstrates some behaviors. Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D76995	2020-04-01 08:19:06 -07:00
Fangrui Song	51475e4023	[ELF][test] Add linkerscript/linkorder-linked-to.s Delete relocatable-linkorder.s which is covered.	2020-03-30 15:17:29 -07:00
Fangrui Song	673e81eee4	[ELF] Allow SHF_LINK_ORDER and non-SHF_LINK_ORDER to be mixed Currently, `error: incompatible section flags for .rodata` is reported when we mix SHF_LINK_ORDER and non-SHF_LINK_ORDER sections in an output section. This is overconstrained. This patch allows mixed flags with the requirement that SHF_LINK_ORDER sections must be contiguous. Mixing flags is used by Linux aarch64 (https://github.com/ClangBuiltLinux/linux/issues/953) .init.data : { ... KEEP(*(__patchable_function_entries)) ... } When the integrated assembler is enabled, clang's -fpatchable-function-entry=N[,M] implementation sets the SHF_LINK_ORDER flag (D72215) to fix a number of garbage collection issues. Strictly speaking, the ELF specification does not require contiguous SHF_LINK_ORDER sections but for many current uses of SHF_LINK_ORDER like .ARM.exidx/__patchable_function_entries there has been a requirement for the sections to be contiguous on top of the requirements of the ELF specification. This patch also imposes one restriction: SHF_LINK_ORDER sections cannot be separated by a symbol assignment or a BYTE command. Not allowing BYTE is a natural extension that a non-SHF_LINK_ORDER cannot be a separator. Symbol assignments can delimiter the contents of SHF_LINK_ORDER sections. Allowing SHF_LINK_ORDER sections across symbol assignments (especially __start_/__stop_) can make things hard to explain. The restriction should not be a problem for practical use cases. Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D77007	2020-03-30 10:03:55 -07:00
Fangrui Song	2d19270efc	[ELF][test] Improve linkerscript/linkorder.s	2020-03-30 09:34:29 -07:00
Matt Schulte	fdc41aa22c	[lld][ELF] Mark empty NOLOAD output sections SHT_NOBITS instead of SHT_PROGBITS This fixes PR# 45336. Output sections described in a linker script as NOLOAD with no input sections would be marked as SHT_PROGBITS. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D76981	2020-03-28 10:07:58 -07:00
James Henderson	3ff3c6986b	[lld][ELF] Fix error message The error previously talked about a "section header" but was actually referring to a program header. Reviewed by: grimar, MaskRay Differential Revision: https://reviews.llvm.org/D76846	2020-03-26 15:30:24 +00:00
Fangrui Song	9e33c09647	[ELF] Keep orphan section names (.rodata.foo .text.foo) unchanged if !hasSectionsCommand This behavior matches GNU ld and seems reasonable. ``` // If a SECTIONS command is not specified .text.* -> .text .rodata.* -> .rodata .init_array.* -> .init_array ``` A proposed Linux feature CONFIG_FG_KASLR may depend on the GNU ld behavior. Reword a comment about -z keep-text-section-prefix and a comment about CommonSection (deleted by rL286234). Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D75225	2020-03-23 10:30:06 -07:00
Fangrui Song	011b785505	[ELF] Create readonly PT_LOAD in the presence of a SECTIONS command This essentially drops the change by r288021 (discussed with Georgii Rymar and Peter Smith and noted down in the release note of lld 10). GNU ld>=2.31 enables -z separate-code by default for Linux x86. By default (in the absence of a PHDRS command) a readonly PT_LOAD is created, which is different from its traditional behavior. Not emulating GNU ld's traditional behavior is good for us because it improves code consistency (we create a readonly PT_LOAD in the absence of a SECTIONS command). Users can add --no-rosegment to restore the previous behavior (combined readonly and read-executable sections in a single RX PT_LOAD).	2020-03-19 19:11:11 -07:00
Fangrui Song	09ac859c13	[ELF][test] Make tests less address sensitive and delete redundant tests	2020-03-19 18:04:47 -07:00
Georgii Rymar	bb7d2b1780	[LLD][ELF] - Disambiguate "=fillexp" with a primary expression to allow =0x90 /DISCARD/ Fixes https://bugs.llvm.org/show_bug.cgi?id=44903 It is about the following case: ``` SECTIONS { .foo : { *(.foo) } =0x90909090 /DISCARD/ : { *(.bar) } } ``` Here while parsing the fill expression we treated the "/" of "/DISCARD/" as operator. With this change, suggested by Fangrui Song, we do not allow expressions with operators (e.g. "0x1100 + 0x22") that are not wrapped into round brackets. It should not be an issue for users, but helps to resolve parsing ambiguity. Differential revision: https://reviews.llvm.org/D74687	2020-03-19 12:49:25 +03:00
Fangrui Song	2bd1ca28c1	[ELF][test] Delete unneeded --no-threads	2020-03-17 12:58:24 -07:00
Fangrui Song	536ba6373f	[Object] Change ELFObjectFile<ELFT>::getFileFormatName() to use BFD names Follow-up for D74433 What the function returns are almost standard BFD names, except that "ELF" is in uppercase instead of lowercase. This patch changes "ELF" to "elf" and changes ARM/AArch64 to use their BFD names. MIPS and PPC64 have endianness differences as well, but this patch does not intend to address them. Advantages: * llvm-objdump: the "file format " line matches GNU objdump on ARM/AArch64 objects * "file format " line can be extracted and fed into llvm-objcopy -O literally. (https://github.com/ClangBuiltLinux/linux/issues/779 has such a use case) Affected tools: llvm-readobj, llvm-objdump, llvm-dwarfdump, MCJIT (internal implementation detail, not exposed) Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D76046	2020-03-16 07:42:04 -07:00
Fangrui Song	f0374e7db2	[test] lld/test/: change llvm-objdump single-dash long options to double-dash options	2020-03-15 17:48:36 -07:00
Fangrui Song	eb4b5a36a6	[ELF] Move --print-map(-M)/--cref before checkSections() and openFile() -M output can be useful when diagnosing an "error: output file too large" problem (emitted in openFile()). I just ran into such a situation where I had to debug an erroneous Linux kernel linker script. It tried to create a file larger than INT64_MAX bytes. This patch could have helped https://bugs.llvm.org/show_bug.cgi?id=44715 as well. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D75966	2020-03-12 08:00:18 -07:00
Fangrui Song	fbf41b5267	[ELF] Simplify sh_addr computation and warn if sh_addr is not a multiple of sh_addralign See `docs/ELF/linker_script.rst` for the new computation for sh_addr and sh_addralign. `ALIGN(section_align)` now means: "increase alignment to section_align" (like yet another input section requirement). The "start of section .foo changes from 0x11 to 0x20" warning no longer makes sense. Change it to warn if sh_addr%sh_addralign!=0. To decrease the alignment from the default max_input_align, use `.output ALIGN(8) : {}` instead of `.output : ALIGN(8) {}` See linkerscript/section-address-align.test as an example. When both an output section address and ALIGN are set (can be seen as an "undefined behavior" https://sourceware.org/ml/binutils/2020-03/msg00115.html), lld may align more than GNU ld, but it makes a linker script working with GNU ld hard to break with lld. This patch can be considered as restoring part of the behavior before D74736. Differential Revision: https://reviews.llvm.org/D75724	2020-03-11 09:35:42 -07:00
Fangrui Song	92b5b980d2	[ELF] Postpone evaluation of ORIGIN/LENGTH in a MEMORY command ``` createFiles(args) readDefsym readerLinkerScript(mb) ... readMemory readMemoryAssignment("ORIGIN", "org", "o") // eagerly evaluated target = getTarget(); link(args) writeResult<ELFT>() ... finalizeSections() script->processSymbolAssignments() addSymbol(cmd) // with this patch, evaluated here ``` readMemoryAssignment eagerly evaluates ORIGIN/LENGTH and returns an uint64_t. This patch postpones the evaluation to make --defsym and symbol assignments * `CONSTANT(COMMONPAGESIZE)` (requires a non-null `lld::elf::target`) work. If the expression somehow requires interaction with memory regions, the circular dependency may cause the expression to evaluate to a strange value. See the new test added to memory-err.s Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D75763	2020-03-09 08:31:41 -07:00
Fangrui Song	7b8341b257	[ELF][test] Improve MEMORY tests	2020-03-06 14:01:20 -08:00
Fangrui Song	71e2ca6e32	[llvm-objdump] -d: print `00000000 <foo>:` instead of `00000000 foo:` The new behavior matches GNU objdump. A pair of angle brackets makes tests slightly easier. `.foo:` is not unique and thus cannot be used in a `CHECK-LABEL:` directive. Without `-LABEL`, the CHECK line can match the `Disassembly of section` line and causes the next `CHECK-NEXT:` to fail. ``` Disassembly of section .foo: 0000000000001634 .foo: ``` Bdragon: <> has metalinguistic connotation. it just "feels right" Reviewed By: rupprecht Differential Revision: https://reviews.llvm.org/D75713	2020-03-05 18:05:28 -08:00
Fangrui Song	f9a0056016	[llvm-objdump] --syms: make flags closer to GNU objdump This fixes several issues. The behavior changes are: A SHN_COMMON symbol does not have the 'g' flag. An undefined symbol does not have 'g' or 'l' flag. A STB_GLOBAL SymbolRef::ST_Unknown symbol has the 'g' flag. A STB_LOCAL SymbolRef::ST_Unknown symbol has the 'l' flag. Reviewed By: rupprecht Differential Revision: https://reviews.llvm.org/D75659	2020-03-05 09:59:53 -08:00
Fangrui Song	9e1319df7e	[llvm-readelf] Make --all output order closer to GNU readelf https://bugs.llvm.org/show_bug.cgi?id=43403 The new order makes it easy to compare the two tools' --all. Reviewed By: grimar, rupprecht Differential Revision: https://reviews.llvm.org/D75592	2020-03-04 12:22:12 -08:00
Fangrui Song	c72d60d42f	[llvm-objdump] --syms: print st_size as "%016" PRIx64 instead of "%08" PRIx64 for 64-bit objects This is GNU objdump's behavior and it is reasonable to match. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D75588	2020-03-04 12:09:27 -08:00
Fangrui Song	1044ee827a	[ELF][test] Improve llvm-objdump -t tests In many cases, llvm-nm or llvm-readelf is more suitable.	2020-03-04 10:48:36 -08:00
Fangrui Song	b305b8a256	[ELF][test] Rename SHF_LINK_ORDER related "metadata" to "linkorder" Test cleanups.	2020-02-26 17:36:59 -08:00
Fangrui Song	37c7f0d945	[ELF] --orphan-handling=: don't warn/error for input SHT_REL[A] retained by --emit-relocs They are purposefully skipped by input section descriptions (rL295324). Similarly, --orphan-handling= should not warn/error for them. This behavior matches GNU ld. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D75151	2020-02-26 10:32:54 -08:00
Fangrui Song	423194098b	[ELF] --orphan-handling=: don't warn/error for unused synthesized sections This makes --orphan-handling= less noisy. This change also improves our compatibility with GNU ld. GNU ld special cases .symtab, .strtab and .shstrtab . We need output section descriptions for .symtab, .strtab and .shstrtab to suppress: <internal>:(.symtab) is being placed in '.symtab' <internal>:(.shstrtab) is being placed in '.shstrtab' <internal>:(.strtab) is being placed in '.strtab' With --strip-all, .symtab and .strtab can be omitted (note, --strip-all is not compatible with --emit-relocs). Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D75149	2020-02-26 08:56:12 -08:00
Fangrui Song	93331a17e8	[ELF] Support archive:file syntax in input section descriptions Fixes https://bugs.llvm.org/show_bug.cgi?id=44450 https://sourceware.org/binutils/docs/ld/Input-Section-Basics.html#Input-Section-Basics The following two rules are not implemented. * `archive:` matches every file in the archive. * `:file` matches a file not in an archive. Reviewed By: grimar, ruiu Differential Revision: https://reviews.llvm.org/D75100	2020-02-25 07:57:43 -08:00
Fangrui Song	de0dda54d3	[ELF] Warn changed output section address When the output section address (addrExpr) is specified, GNU ld warns if sh_addr is different. This patch implements the warning. Note, LinkerScript::assignAddresses can be called more than once. We need to record the changed section addresses, and only report the warnings after the addresses are finalized. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D74741	2020-02-21 08:13:29 -08:00
Fangrui Song	6ed8e20143	[ELF] Ignore the maximum of input section alignments for two cases Follow-up for D74286. Notations: * alignExpr: the computed ALIGN value * max_input_align: the maximum of input section alignments This patch changes the following two cases to match GNU ld: * When ALIGN is present, GNU ld sets output sh_addr to alignExpr, while lld use max(alignExpr, max_input_align) * When addrExpr is specified but alignExpr is not, GNU ld sets output sh_addr to addrExpr, while lld uses `advance(0, max_input_align)` Note, sh_addralign is still set to max(alignExpr, max_input_align). lma-align.test is enhanced a bit to check we don't overalign sh_addr. fixSectionAlignments() sets addrExpr but not alignExpr for the `!hasSectionsCommand` case. This patch sets alignExpr as well so that max_input_align will be respected. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D74736	2020-02-21 08:12:00 -08:00
Fangrui Song	d6d640ebae	[ELF][test] Fix section sh_type and sh_flags A future MC change may add a warning/error when a .section directive specifies incorrect sh_flags/sh_type.	2020-02-19 22:01:42 -08:00
Fangrui Song	81cebfd008	[ELF][test] Change -o %t to -o /dev/null if the output is not needed	2020-02-12 21:54:50 -08:00
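
Note on fa1145a8d2 (LOG2CEIL): the message above says that LOG2CEIL(0) returns 0 rather than -inf and that the tests cover 0-4, 2^32, 2^32+1 and 2^64-1. The following is a minimal Python sketch of those semantics, assuming LOG2CEIL(x) is the ceiling of the base-2 logarithm for x >= 1. It is not lld's C++ implementation; the function name `log2ceil` and the 64-bit range check are hypothetical and exist only for illustration.

```python
def log2ceil(value: int) -> int:
    """Ceiling of log2(value) for an unsigned 64-bit value; 0 maps to 0.

    Hypothetical helper that only illustrates the semantics described in
    the commit message; it is not taken from lld.
    """
    if not 0 <= value < 2**64:
        raise ValueError("value must fit in an unsigned 64-bit integer")
    # Treat 0 like 1 so the result is 0 instead of "-inf".
    value = max(value, 1)
    # For value >= 1, ceil(log2(value)) equals the bit length of value - 1.
    return (value - 1).bit_length()


if __name__ == "__main__":
    # The input values named in the commit message.
    for x in (0, 1, 2, 3, 4, 2**32, 2**32 + 1, 2**64 - 1):
        print(f"LOG2CEIL({x:#x}) = {log2ceil(x)}")
```

Integer bit arithmetic is used instead of floating-point `math.log2` so that values near 2^64 are not affected by rounding.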


607 Commits