llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	d054c7ee2e	Add test utility 'extract' See https://lists.llvm.org/pipermail/llvm-dev/2020-July/143373.html "[llvm-dev] Multiple documents in one test file" for some discussions. `extract part filename` splits the input file into multiple parts separated by regex `^(.|//)--- ` and extracts the specified part to stdout or the output file (if specified). Use case A (organizing input of different formats (e.g. linker script+assembly) in one file). ``` // RUN: extract lds %s -o %t.lds // RUN: extract asm %s -o %t.s // RUN: llvm-mc %t.s -o %t.o // RUN: ld.lld -T %t.lds %t.o -o %t ``` This is sometimes better than the %S/Inputs/ approach because the user can see the auxiliary files immediately and doesn't have to open another file. Use case B (for utilities which don't have a built-in input splitting feature): ``` // RUN: extract case1 %s | llc | FileCheck %s --check-prefix=CASE1 // RUN: extract case2 %s | llc | FileCheck %s --check-prefix=CASE2 ``` Combining tests prudently can improve readability. This is sometimes better than having multiple test files. Since this is a new utility, there are no git history concerns for UpperCase variable names. I use lowerCase variable names like mlir/lld. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D83834	2020-07-23 19:15:35 -07:00
Petr Hosek	38c71b7c85	Revert "[CMake] Simplify CMake handling for zlib" This reverts commit `1d09ecf361` since it breaks sanitizer bots.	2020-07-23 15:12:42 -07:00
Petr Hosek	1d09ecf361	[CMake] Simplify CMake handling for zlib Rather than handling zlib handling manually, use find_package from CMake to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is set to YES, which requires the distributor to explicitly select whether zlib is enabled or not. This simplifies the CMake handling and usage in the rest of the tooling. This is a reland of `abb0075` with all followup changes and fixes that should address issues that were reported in PR44780. Differential Revision: https://reviews.llvm.org/D79219	2020-07-23 14:47:25 -07:00
Fangrui Song	4e80c768c2	[ELF] Support -r --gc-sections -r --gc-sections is usually not useful because it just makes intermediate output smaller. https://bugs.llvm.org/show_bug.cgi?id=46700#c7 mentions a use case: validating the absence of undefined symbols earlier than in the final link. After D84129 (SHT_GROUP support in -r links), we can support -r --gc-sections without extra code. So let's allow it. Reviewed By: grimar, jhenderson Differential Revision: https://reviews.llvm.org/D84131	2020-07-23 08:16:01 -07:00
Fangrui Song	86ab98b001	[ELF] -r: rewrite SHT_GROUP content if some members are combined or discarded * If two group members are combined, we should leave just one index in the SHT_GROUP content. * If a group member is discarded (/DISCARD/ or upcoming -r --gc-sections combination), we should drop its index in the SHT_GROUP content. LLD currently crashes (`getOutputSection()` is null). Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D84129	2020-07-21 08:49:45 -07:00
Victor Huang	91cce1a2bc	[PowerPC] Implement R_PPC64_REL24_NOTOC local calls, callee requires a TOC The PC Relative code now allows for calls that are marked with the relocation R_PPC64_REL24_NOTOC. This indicates that the caller does not have a valid TOC pointer in R2 and does not require R2 to be restored after the call. This patch is added to support local calls to callees that require a TOC Reviewed By: sfertile, MaskRay, nemanjai, stefanp Differential Revision: https://reviews.llvm.org/D83504	2020-07-20 17:46:49 +00:00
Sylvain Audi	3a108ab256	[LLD][COFF] Skip computation of the undefined symbols references that are not shown The "undefined symbol" error message from lld-link displays up to 3 references to that symbol, and the number of extra references not shown. This patch removes the computation of the strings for those extra references. It fixes a freeze of lld-link we accidentally encountered when activating asan on a large project, without linking with the asan library. In that case, __asan_report_load8 was referenced more than 2 million times, causing the computation of that many display strings, of which only 3 were used. Differential Revision: https://reviews.llvm.org/D83510	2020-07-20 13:45:16 -04:00
Hans Wennborg	8a197e0b16	Require shell for lld/test/ELF/arm-exidx-range.s The test fails in 32-bit Windows builds for unclear reasons: ld.lld: error: failed to open C:\src\llvm_package_1100-rc1\build32_stage0\tools\lld\test\ELF\Output\arm-exidx-range.s.tmp: The parameter is incorrect.	2020-07-20 17:49:10 +02:00
Georgii Rymar	ef377d3f2d	[LLD][ELF] - Update the lld/test/ELF/mips-merge-abiflags.s after llvm-readelf change. The warning message was updated (https://reviews.llvm.org/rG256aea816da45bd3bc52317cbc89e704095a8991)	2020-07-20 12:17:49 +03:00
Fangrui Song	7099a4b56b	[ELF][test] Update test after DW_EH_PE_sdata4/DW_EH_PE_sdata8 change	2020-07-18 09:07:50 -07:00
Georgii Rymar	2e10b7a39b	[LLD][ELF] - Update test after yaml2obj change. NFC. Names of YAML keys changed in rG7a587ca93200c49e47fe205ce037895c81c5a542	2020-07-15 13:00:48 +03:00
Martin Storsjö	92f7bd3a39	[LLD] [MinGW] Implement the --file-alignment and --section-alignment options Differential Revision: https://reviews.llvm.org/D83720	2020-07-15 08:34:08 +03:00
Petr Hosek	bcd27d9d73	Revert "[CMake] Simplify CMake handling for zlib" This reverts commit `8c1a79dc12` because it fails when zlib isn't installed.	2020-07-14 19:56:10 -07:00
Petr Hosek	8c1a79dc12	[CMake] Simplify CMake handling for zlib Rather than handling zlib handling manually, use find_package from CMake to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is set to YES, which requires the distributor to explicitly select whether zlib is enabled or not. This simplifies the CMake handling and usage in the rest of the tooling. This is a reland of `abb0075` with all followup changes and fixes that should address issues that were reported in PR44780. Differential Revision: https://reviews.llvm.org/D79219	2020-07-14 19:30:08 -07:00
Igor Kudrin	c4fc26b4c0	[ELF] Do not leave undefined symbols (specified by -init and -fini) if they are defined in non-fetched archive members After D69985, symbols for "-init" and "-fini" were unconditionally marked as used even if they were just lazy symbols seen when scanning archives. That resulted in exposing them in the symbol table of an output file, as Undefined, which added unwanted dependencies. The patch fixes the issue by checking the kind of the symbols before the marking. Differential Revision: https://reviews.llvm.org/D83549	2020-07-14 16:35:17 +07:00
Georgii Rymar	fbb6c9df28	[LLD][ELF] - Fix the test after llvm-readelf output change. An issue for llvm-readelf was fixed in https://reviews.llvm.org/rG84a170178c4431b7536c83ff0e5ce80774d08df6 Now it produces no output for this test.	2020-07-13 15:18:30 +03:00
Hans Wennborg	e73d0b5719	[COFF] Error on unexpected .pdata size Previously, lld would crash if the .pdata size was not an even multiple of the expected .pdata entry size. This makes it error gracefully instead. (We hit this in Chromium due to an assembler problem: https://crbug.com/1101577) Differential revision: https://reviews.llvm.org/D83479	2020-07-13 13:38:39 +02:00
Georgii Rymar	af16a45683	[LLD][ELF] - Allow relocation sections to appear before their target sections. It allows handling cases when we have SHT_REL[A] sections before target sections in objects. This fixes https://bugs.llvm.org/show_bug.cgi?id=46632 which says: "Normally it is not what compilers would emit. We have to support it, because some custom tools might want to use this feature, which is not restricted by ELF gABI" Differential revision: https://reviews.llvm.org/D83469	2020-07-13 13:59:54 +03:00
Ayke van Laethem	69e60c9dc7	[LLD][ELF][AVR] Implement the missing relocation types Implements the missing relocation types for AVR target. The results have been cross-checked with binutils. Original patch by LemonBoy. Some changes by me. Differential Revision: https://reviews.llvm.org/D78741	2020-07-12 18:18:54 +02:00
Alexandre Ganea	b71499ac9e	Revert "Re-land [CodeView] Add full repro to LF_BUILDINFO record" This reverts commit `add59ecb34` and `41d2813a5f`.	2020-07-10 19:46:16 -04:00
Alexandre Ganea	add59ecb34	Re-land [CodeView] Add full repro to LF_BUILDINFO record This patch adds some missing information to the LF_BUILDINFO which allows for rebuilding an .OBJ without any external dependency but the .OBJ itself (other than the compiler executable). Some tools need this information to reproduce a build without any knowledge of the build system. The LF_BUILDINFO therefore stores a full path to the compiler, the PWD (which is the CWD at program startup), a relative or absolute path to the TU, and the full CC1 command line. The command line needs to be freestanding (not depend on any environment variable). In the same way, MSVC doesn't store the provided command-line, but an expanded version (somehow their equivalent of CC1) which is also freestanding. For more information see PR36198 and D43002. Differential Revision: https://reviews.llvm.org/D80833	2020-07-10 13:59:28 -04:00
Victor Huang	118366dcb6	[PowerPC] Implement R_PPC64_REL24_NOTOC calls, callee also has no TOC The PC Relative code allows for calls that are marked with the relocation R_PPC64_REL24_NOTOC. This indicates that the caller does not have a valid TOC pointer in R2 and does not require R2 to be restored after the call. This patch is added to support local calls to callees that also do not have a TOC. Reviewed By: sfertile, MaskRay, stefanp Differential Revision: https://reviews.llvm.org/D82816	2020-07-10 07:23:32 -05:00
Stefan Pintilie	6f7727db47	[PowerPC] Fix test case from `beb52b12cb` Forgot to add the REQUIRES ppc line to the test.	2020-07-09 12:39:24 -05:00
Stefan Pintilie	beb52b12cb	[PowerPC] Support PCRelative Callees for R_PPC64_REL24 Relocation The R_PPC64_REL24 is used in function calls when the caller requires a valid TOC pointer. If the callee shares the same TOC or does not clobber the TOC pointer then a direct call can be made. If the callee does not share the TOC a thunk must be added to save the TOC pointer for the caller. Up until PC Relative was introduced all local calls on medium and large code models were assumed to share a TOC. This is no longer the case because if the caller requires a TOC and the callee is PC Relative then the callee can clobber the TOC even if it is in the same DSO. This patch is to add support for a TOC caller calling a PC Relative callee that clobbers the TOC. Reviewed By: sfertile, MaskRay Differential Revision: https://reviews.llvm.org/D82950	2020-07-09 09:50:19 -05:00
Igor Kudrin	ca4d8da0c3	[DebugInfo] Add more checks to parsing .debug_pub* sections. The patch adds checking for various potential issues in parsing name lookup tables and reporting them as recoverable errors, similarly as we do for other tables. Differential Revision: https://reviews.llvm.org/D83050	2020-07-09 19:15:31 +07:00
Igor Kudrin	68f5a8b204	[DebugInfo] Do not hang when parsing a malformed .debug_pub* section. The parsing method did not check reading errors and might easily fall into an infinite loop on an invalid input because of that. Differential Revision: https://reviews.llvm.org/D83049	2020-07-09 19:15:11 +07:00
Fangrui Song	4ce56b8122	[ELF] Add -z dead-reloc-in-nonalloc=<section_glob>=<value> ... to customize the tombstone value we use for an absolute relocation referencing a discarded symbol. This can be used as a workaround when some debug processing tool has trouble with current -1 tombstone value (https://bugs.chromium.org/p/chromium/issues/detail?id=1102223#c11 ) For example, to get the current built-in rules (not considering the .debug_line special case for ICF): ``` -z dead-reloc-in-nonalloc='.debug_*=0xffffffffffffffff' -z dead-reloc-in-nonalloc=.debug_loc=0xfffffffffffffffe -z dead-reloc-in-nonalloc=.debug_ranges=0xfffffffffffffffe ``` To get GNU ld (as of binutils 2.35)'s behavior: ``` -z dead-reloc-in-nonalloc='*=0' -z dead-reloc-in-nonalloc=.debug_ranges=1 ``` This option has other use cases. For example, if we want to check whether a non-SHF_ALLOC section has dead relocations. With this patch, we can run a regular LLD and run another with a special -z dead-reloc-in-nonalloc=, then compare their output. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D83264	2020-07-08 10:15:16 -07:00
Fangrui Song	09b81a72ac	[ELF] Ignore --no-relax for RISC-V In GNU ld, --no-relax can disable x86-64 GOTPCRELX relaxation. It is not useful, so we don't implement it. For RISC-V, --no-relax disables linker relaxations which have larger impact. Linux kernel specifies --no-relax when CONFIG_DYNAMIC_FTRACE is specified (since http://git.kernel.org/linus/a1d2a6b4cee858a2f27eebce731fbf1dfd72cb4e ). LLD has not implemented the relaxations, so this option is a no-op. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D81359	2020-07-07 09:48:13 -07:00
Wouter van Oortmerssen	4d135b0446	[WebAssembly] 64-bit memory limits	2020-07-06 12:40:45 -07:00
Fangrui Song	c1a5f73a4a	[ELF][ARM] Represent R_ARM_LDO32 as R_DTPREL instead of R_ABS Follow-up to D82899. Note, we need to disable R_DTPREL relaxation because ARM psABI does not define TLS relaxation. Reviewed By: grimar, psmith Differential Revision: https://reviews.llvm.org/D83138	2020-07-06 09:47:53 -07:00
Fangrui Song	6fa1343bb3	[ELF] Resolve R_DTPREL in .debug_* referencing discarded symbols to -1 The location of a TLS variable is encoded as a DW_OP_const4u/DW_OP_const8u followed by a DW_OP_push_tls_address (or DW_OP_GNU_push_tls_address https://sourceware.org/bugzilla/show_bug.cgi?id=11616 ). This change follows up to D81784 and makes relocations types generalized as R_DTPREL (e.g. R_X86_64_DTPOFF{32,64}, R_PPC64_DTPREL64) use -1 as the tombstone value as well. This works for both TLS Variant I and Variant II architectures. * arm: .long tls(tlsldo) # not working currently (R_ARM_TLS_LDO32 is R_ABS) * mips64: .dtpreldword tls+32768 * ppc64: .quad tls@DTPREL+0x8000 * riscv: neither GCC nor clang has implemented DW_AT_location. It is likely .long/.quad tls@dtprel+0x800 * x86-32: .long tls@DTPOFF * x86-64: .long tls@DTPOFF; .quad tls@DTPOFF tls has a non-negative st_value, so such relocations (st_value+addend) never resolve to -1 in a normal (not discarded) case. ``` // clang -fuse-ld=lld -g -ffunction-sections a.c -Wl,--gc-sections // foo and tls will be discarded by --gc-sections. // DW_AT_location [DW_FORM_exprloc] (DW_OP_const8u 0xffffffffffffffff, DW_OP_GNU_push_tls_address) thread_local int tls; int foo() { return ++tls; } int main() {} ``` Also, drop logic added in D26201 intended to address PR30793. It added a test (gc-debuginfo-tls.s) using a non-SHF_ALLOC section and a local symbol, which does not reflect the intended scenario: a relocation in a SHF_ALLOC section referencing a discarded non-local symbol. For such a non .debug_* section, just emit an error. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D82899	2020-07-03 09:50:30 -07:00
Ben Dunbobbin	f42f599d37	[LLD][ELF][Windows] Allow LLD to overwrite existing output files that are in use On Windows co-operative programs can be expected to open LLD's output in FILE_SHARE_DELETE mode. This allows us to delete the file (by moving it to a temporary filename and then deleting it) so that we can link another output file that overwrites the existing file, even if the current file is in use. A similar strategy is documented here: https://boostgsoc13.github.io/boost.afio/doc/html/afio/FAQ/deleting_open_files.html Differential Revision: https://reviews.llvm.org/D82567	2020-07-03 13:08:56 +01:00
Jez Ng	53eb7fda51	[lld-macho] Support binding dysyms to any section Previously, we only supported binding dysyms to the GOT. This diff adds support for binding them to any arbitrary section. C++ programs appear to use this, I believe for vtables and type_info. This diff also makes our bind opcode encoding a bit smarter -- we now encode just the differences between bindings, which will make things more compact. I was initially concerned about the performance overhead of iterating over these relocations, but it turns out that the number of such relocations is small. A quick analysis of my llvm-project build directory showed that < 1.3% out of ~7M relocations are RELOC_UNSIGNED bindings to symbols (including both dynamic and static symbols). Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D83103	2020-07-02 21:21:01 -07:00
Nico Weber	425fb21e03	ld64.lld: Make janky support for tbd files actually work sometimes Also fix a bug in the test input that made the test miss this issue.	2020-07-02 15:31:21 -04:00
James Henderson	5012ddd5af	[ELF][test] Add some additional .eh_frame/.eh_frame_hdr testing This patch adds a few extra cases to the existing testing for eh_frame and eh_frame_hdr behaviour in LLD. They all come from a private testsuite we are trying to migrate to lit. Reviewed by: grimar, MaskRay Differential Revision: https://reviews.llvm.org/D82852	2020-07-02 15:51:33 +01:00
Craig Topper	0aad82943a	[X86] Enable multibyte NOPs in 64-bit mode for padding/alignment. The default CPU used by llvm-mc doesn't have the NOPL feature, but if we know we're compiling in 64-bit mode we should be able to use nopl.	2020-07-01 23:59:01 -07:00
Nico Weber	ba5087f130	ld64.lld: Add janky support for tbd files With this, a simple hello world links against libSystem.tbd and the old ld64.lld linker kind of works again with newer SDKs. The motivation here is to have an arm64 cross linker that's good enough to be able to run simple configure link checks on non-mac systems for generating config.h files. Once -flavor darwinnew can link arm64, we'll switch to that.	2020-07-02 00:05:52 -04:00
Jez Ng	7996a1ef70	[lld-macho] Make sure ZeroFill sections are at the end of their segments Summary: ld64 does this, and references an internal rdar:// number as an explanation. No idea what that rdar issue is, but in practice, it seems that not putting a BSS section at the end can cause subsequent sections in the same segment to be overwritten with zeroes. Reviewers: #lld-macho Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81888	2020-07-01 19:39:29 -07:00
Douglas Yung	6f13299c02	Revert "Update lto.ll test after `3367e9da` enabled multibyte NOPs in 64-bit mode." This reverts commit `79f6a814ab`. Didn't notice that someone had reverted the commit that caused the problem.	2020-07-01 16:34:57 -07:00
Douglas Yung	79f6a814ab	Update lto.ll test after `3367e9da` enabled multibyte NOPs in 64-bit mode. This should fix the PS4 linux build bot: http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-ubuntu-fast/builds/70409	2020-07-01 16:25:25 -07:00
Fangrui Song	e6ad78fe05	[ELF] Don't resolve a relocation in .debug_line referencing an ICF folded symbol to the tombstone value After D81784, we resolve a relocation in .debug_* referencing an ICF folded section symbol to a tombstone value. Doing this for .debug_line has a problem (https://reviews.llvm.org/D81784#2116925 ): .debug_line may describe folded lines as having addresses UINT64_MAX or some wraparound small addresses. ``` int foo(int x) { return x; // line 2 } int bar(int x) { return x; // line 6 } ``` ``` Address Line Column File ISA Discriminator Flags ------------------ ------ ------ ------ --- ------------- ------------- 0x00000000002016c0 1 0 1 0 0 is_stmt 0x00000000002016c7 2 9 1 0 0 is_stmt prologue_end 0x00000000002016ca 2 2 1 0 0 0x00000000002016cc 2 2 1 0 0 end_sequence // UINT64_MAX and wraparound small addresses 0xffffffffffffffff 5 0 1 0 0 is_stmt 0x0000000000000006 6 9 1 0 0 is_stmt prologue_end 0x0000000000000009 6 2 1 0 0 0x000000000000000b 6 2 1 0 0 end_sequence 0x00000000002016d0 9 0 1 0 0 is_stmt 0x00000000002016df 10 6 1 0 0 is_stmt prologue_end 0x00000000002016e6 11 11 1 0 0 is_stmt ... ``` These entries can confuse debuggers: gdb before 2020-07-01 (binutils-gdb a8caed5d7faa639a1e6769eba551d15d8ddd9510 "Recognize -1 as a tombstone value in .debug_line") (can't continue due to a breakpoint in an invalid region of memory): ``` Warning: Cannot insert breakpoint 1. Cannot access memory at address 0x6 ``` lldb (breakpoint has no effect): ``` (lldb) b 6 Breakpoint 1: no locations (pending). WARNING: Unable to resolve breakpoint to any actual locations. ``` This patch special cases .debug_line to not use the tombstone value, restoring the previous behavior: .debug_line will have entries with the same addresses (ICF) but different line numbers. A breakpoint on line 2 or 6 will trigger on both functions. Reviewed By: dblaikie, jhenderson Differential Revision: https://reviews.llvm.org/D82828	2020-07-01 13:38:16 -07:00
Fangrui Song	f5e0ae240f	[ELF][test] Fix ppc32-gnu-ifunc.s and remove XFAIL: *	2020-06-30 22:05:51 -07:00
Sam Clegg	73e575a88e	[lld][WebAssembly] Give better error message on bad archive member Include the archive name as well as the member name when an error is encountered parsing bitcode archives. Differential Revision: https://reviews.llvm.org/D82884	2020-06-30 21:36:36 -07:00
Fangrui Song	d94526bb5f	[ELF] --warn-backrefs: check that D79300 fixed an issue due to `mb = {}` D79300 forgot to change `getBuffer().empty()` in LazyObjFile::parse to `fetched`. This caused incorrect iterating after the current LazyObjFile was fetched. This issue is benign and can just cause loss of "undefined symbols" and "backward reference" diagnostics. Before D79300 `mb = {}` caused --warn-backrefs-exclude to be useless for a fetched LazyObjFile. Add two test cases.	2020-06-26 20:31:47 -07:00
Fangrui Song	4542c18ef2	[ELF] -r: don't parse @ (symbol versioning) for .symver inline asm in bitcode Fixes PR46420 Similar to D43307 for non-LTO. Module-level inline assembly can use .symver to create a symbol with `@` in the name. For relocatable output, @ should be retained in the symbol name. `@ver` should not be parsed and dropped. Reviewed By: grimar, psmith Differential Revision: https://reviews.llvm.org/D82433	2020-06-24 08:22:22 -07:00
Stefan Pintilie	8131ef5d63	[LLD][PowerPC] Add support for R_PPC64_GOT_PCREL34 Add support for the 34bit relocation R_PPC64_GOT_PCREL34 for PC Relative in LLD. Reviewers: sfertile, MaskRay Differential Revision: https://reviews.llvm.org/D81948	2020-06-24 07:40:35 -05:00
Leonard Chan	723b5a1785	[lld][ELF][AArch64] Handle R_AARCH64_PLT32 relocation This is the followup to D77647 which implements handling for the new R_AARCH64_PLT32 relocation type in lld. This relocation would benefit the PIC-friendly vtables feature described in D72959. Differential Revision: https://reviews.llvm.org/D81184	2020-06-23 16:10:07 -07:00
Petr Hosek	fffd05d525	[ELF] Add -z start-stop-visibility= to set __start_/__stop_ symbol visibility This matches the equivalent flag implemented in GNU linkers, see https://sourceware.org/pipermail/binutils/2020-June/111685.html for the associated discussion. Differential Revision: https://reviews.llvm.org/D55682	2020-06-23 15:59:59 -07:00
Stefan Pintilie	3a55a2a97f	[LLD][PowerPC] Add support for R_PPC64_PCREL34 Add support for the 34bit relocation R_PPC64_PCREL34 for PC Relative in LLD.	2020-06-23 14:59:19 -05:00
Fangrui Song	e618ccbf43	[ELF] Resolve relocations in .debug_* referencing (discarded symbols or ICF folded section symbols) to tombstone values See D59553, https://lists.llvm.org/pipermail/llvm-dev/2020-May/141885.html and https://sourceware.org/pipermail/binutils/2020-May/111357.html for extensive discussions on a tombstone value. See http://www.dwarfstd.org/ShowIssue.php?issue=200609.1 (Reserve an address value for "not present") for a DWARF enhancement proposal. We resolve such relocations to a tombstone value to indicate that the address is invalid. This solves several problems (the normal behavior is to resolve the relocation to the addend): * For an empty function in a collected section, a pair of (0,0) can terminate .debug_loc and .debug_ranges (as of binutils 2.34, GNU ld resolves such a relocation to 1 to avoid the .debug_ranges issue) * If DW_AT_high_pc is sufficiently large, the address range can collide with a regular code range of low address (https://bugs.llvm.org/show_bug.cgi?id=41124 ) * If a text section is folded into another by ICF, we may leave entries in multiple CUs claiming ownership of the same range of code, which can confuse consumers. * Debug information associated with COMDAT sections can have problems similar to ICF, but is more complex - thus not addressed by this patch. For pre-DWARF-v5 .debug_loc and .debug_ranges, a pair of 0 can terminate entries (invalidating subsequent ranges). -1 is a reserved value with special meaning (base address selection entry) which can't be used either. Use -2 instead. For all other .debug_*, use UINT32_MAX for 32-bit targets and UINT64_MAX for 64-bit targets. In the code, we intentionally use `uint64_t tombstone = UINT64_MAX` for 32-bit targets as well: this matches SignExtend64 as used in `relocateAlloc`. (Actually UINT32_MAX does not work for R_386_32) Note 0, we only special case `target->symbolicRel` (R_X86_64_64, R_AARCH64_ABS64, R_PPC64_ADDR64), not short-range absolute relocations (e.g. R_X86_64_32). Only forms like DW_FORM_addr need to be special cased. They can hold an arbitrary address (must be 64-bit on a 64-bit target). (In theory, producers can make use of small code model to emit 32-bit relocations. This doesn't seem to be leveraged.) Note 1, we have to ignore the addend, because we don't want to resolve DW_AT_low_pc (which may have a non-zero addend) to -1+addend (wrap around to a low address): __attribute__((section(".text.x"))) void f1() { } __attribute__((section(".text.x"))) void f2() { } // DW_AT_low_pc has a non-zero addend Note 2, if the prevailing copy does not have debugging information while a non-prevailing copy has (partial debug build), we don't do extra work to attach debugging information to the prevailing definition. (clang has a lot of debug info optimizations that are on-by-default that assume the whole program is built with debug info). clang -c -ffunction-sections a.cc # prevailing copy has no debug info clang -c -ffunction-sections -g b.cc Reviewed By: dblaikie, avl, jhenderson Differential Revision: https://reviews.llvm.org/D81784	2020-06-23 11:48:46 -07:00
Sam Clegg	79aad89d8d	[WebAssembly] Add support for externalref to MC and wasm-ld This allows code for handling externref values to be processed by the assembler and linker. Differential Revision: https://reviews.llvm.org/D81977	2020-06-22 15:57:24 -07:00
Peter Collingbourne	bd7defeb94	llvm-nm: Implement --special-syms. Differential Revision: https://reviews.llvm.org/D82251	2020-06-22 13:05:47 -07:00
Jez Ng	3646ee503d	[lld-macho] Refactor segment/section creation, sorting, and merging Summary: There were a few issues with the previous setup: 1. The section sorting comparator used a declarative map of section names to determine the correct order, but it turns out we need to match on more than just names -- in particular, an upcoming diff will sort based on whether the S_ZERO_FILL flag is set. This diff changes the sorter to a more imperative but flexible form. 2. We were sorting OutputSections stored in a MapVector, which left the MapVector in an inconsistent state -- the wrong keys map to the wrong values! In practice, we weren't doing key lookups (only container iteration) after the sort, so this was fine, but it was still a dubious state of affairs. This diff copies the OutputSections to a vector before sorting them. 3. We were adding unneeded OutputSections to OutputSegments and then filtering them out later, which meant that we had to remember whether an OutputSegment was in a pre- or post-filtered state. This diff only adds the sections to the segments if they are needed. In addition to those major changes, two minor ones worth noting: 1. I renamed all OutputSection variable names to `osec`, to parallel `isec`. Previously we were using some inconsistent combination of `osec`, `os`, and `section`. 2. I added a check (and a test) for InputSections with names that clashed with those of our synthetic OutputSections. Reviewers: #lld-macho Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81887	2020-06-21 17:13:59 -07:00
Fangrui Song	8ffb2097cc	[ELF] Refine LMA offset propagation rule in D76995 If neither AT(lma) nor AT>lma_region is specified, D76995 keeps `lmaOffset` (LMA - VMA) if the previous section is in the default LMA region. This patch additionally checks that the two sections are in the same memory region. Add a test case derived from https://bugs.llvm.org/show_bug.cgi?id=45313 .mdata : AT(0xfb01000) { *(.data); } > TCM // It is odd to make .bss inherit lmaOffset, because the two sections // are in different memory regions. .bss : { *(.bss) } > DDR With this patch, section VMA/LMA match GNU ld. Note, GNU ld supports out-of-order (w.r.t sh_offset) sections and places .text and .bss in the same PT_LOAD. We don't have that behavior. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D81986	2020-06-19 09:11:33 -07:00
Fangrui Song	c4d13f72a6	[ELF] Refactor ObjFile<ELFT>::initializeSymbols to enforce the invariant: InputFile::symbols has non-null entries Fixes PR46348. ObjFile<ELFT>::initializeSymbols contains two symbol iteration loops: ``` for each symbol if non-inheriting && non-local fill in this->symbols[i] for each symbol if local fill in this->symbols[i] else symbol resolution ``` Symbol resolution can trigger a duplicate symbol error which will call InputSectionBase::getObjMsg to iterate over InputFile::symbols. If a non-local symbol appears after the non-local symbol being resolved (violating ELF spec), its `this->symbols[i]` entry has not been filled in, and InputSectionBase::getObjMsg will crash due to `dyn_cast<Defined>(nullptr)`. To fix the bug, reorganize the two loops to ensure this->symbols is complete before symbol resolution. This enforces the invariant: InputFile::symbols has no null entry when InputFile::getSymbols() is called. ``` for each symbol if non-inheriting fill in this->symbols[i] for each symbol starting from firstGlobal if non-local symbol resolution ``` Additionally, move the (non-local symbol in local part of .symtab) diagnostic from Writer<ELFT>::copyLocalSymbols() to initializeSymbols(). Reviewed By: grimar, jhenderson Differential Revision: https://reviews.llvm.org/D81988	2020-06-19 09:05:37 -07:00
Alexandre Ganea	2ae0df5be7	[CodeView] Revert `8374bf4363` and `403f953792` This reverts: `8374bf4363` [CodeView] Fix generated command-line expansion in LF_BUILDINFO. Fix the 'pdb' entry which was previously a null reference, now an empty string. `403f953792` [CodeView] Add full repro to LF_BUILDINFO record This is causing the lld/test/COFF/pdb-relative-source-lines.test to fail: http://lab.llvm.org:8011/builders/lld-x86_64-win/builds/1096/steps/test-check-all/logs/FAIL%3A%20lld%3A%3Apdb-relative-source-lines.test And clang/test/CodeGen/debug-info-codeview-buildinfo.c fails as well: http://lab.llvm.org:8011/builders/clang-s390x-linux/builds/33346/steps/ninja%20check%201/logs/FAIL%3A%20Clang%3A%3Adebug-info-codeview-buildinfo.c	2020-06-18 16:18:46 -04:00
Sam Clegg	21e14cce23	[lld][WebAssembly] Allow ctors functions that return values Some projects use the constructor attribute on functions that also return values. In this case we just ignore them. The error was reported in the libgpg-error project that marks gpg_err_init with the `__constructor__` attribute. Differential Revision: https://reviews.llvm.org/D81962	2020-06-18 13:11:40 -07:00
Alexandre Ganea	403f953792	[CodeView] Add full repro to LF_BUILDINFO record This patch adds some missing information to the LF_BUILDINFO which allows for rebuilding an .OBJ without any external dependency but the .OBJ itself (other than the compiler executable). Some tools need this information to reproduce a build without any knowledge of the build system. The LF_BUILDINFO therefore stores a full path to the compiler, the PWD (which is the CWD at program startup), a relative or absolute path to the TU, and the full CC1 command line. The command line needs to be freestanding (not depend on any environment variable). In the same way, MSVC doesn't store the provided command-line, but an expanded version (somehow their equivalent of CC1) which is also freestanding. For more information see PR36198 and D43002. Differential Revision: https://reviews.llvm.org/D80833	2020-06-18 09:17:15 -04:00
Greg McGary	f7453a97ce	[lld-macho] Add missing search-paths-darwin.test Summary: Forgot to `git add` it when patching D80677	2020-06-17 20:51:26 -07:00
Greg McGary	d50f44a2f7	[lld-macho] Handle framework search path, alongside library search path Summary: Add front-end support for `lld::macho::Configuration::frameworkSearchPath`. Depends on D80582. Reviewers: ruiu, pcc, MaskRay, smeenai, int3, Ktwu, alexshap, christylee Reviewed By: int3 Subscribers: ormris, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80677	2020-06-17 20:41:28 -07:00
Jez Ng	525c7d8cda	[lld-macho] Handle alignment correctly when merging InputSections Summary: Previously, we weren't updating isecAddr when aligning InputSections, resulting in truncated sections under the right conditions. Reviewers: #lld-macho, compnerd Reviewed By: #lld-macho, compnerd Subscribers: smeenai, compnerd, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81298	2020-06-17 20:41:28 -07:00
Jez Ng	74871cdad7	[lld-macho] Ensure __bss sections we output have file offset of zero Summary: llvm-mc emits `__bss` sections with an offset of zero, but we weren't expecting that in our input, so we were copying non-zero data from the start of the file and putting it in `__bss`, with obviously undesirable runtime results. (It appears that the kernel will copy those nonzero bytes as long as the offset is nonzero, regardless of whether S_ZERO_FILL is set.) I debated on whether to make a special ZeroFillSection -- separate from a regular InputSection -- but it seemed like too much work for now. But I'm happy to refactor if anyone feels strongly about having it as a separate class. Depends on D80857. Reviewers: ruiu, pcc, MaskRay, smeenai, alexshap, gkm, Ktwu, christylee Reviewed By: smeenai Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80859	2020-06-17 20:41:28 -07:00
Jez Ng	a12e7d406d	[lld-macho] Handle GOT relocations of non-dylib symbols Summary: Turns out this case is actually really common -- it happens whenever there's a reference to an `extern` variable that ends up statically linked. Depends on D80856. Reviewers: ruiu, pcc, MaskRay, smeenai, alexshap, gkm, Ktwu, christylee Reviewed By: smeenai Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80857	2020-06-17 20:41:28 -07:00
Jez Ng	c3d98ea89f	[lld-macho] Support X86_64_RELOC_GOT Summary: As far as I can tell, it's identical to _GOT_LOAD. llvm-mc has the following comment explaining why _GOT exists: ``` // x86_64 distinguishes movq foo@GOTPCREL so that the linker can // rewrite the movq to an leaq at link time if the symbol ends up in // the same linkage unit. ``` Depends on D80855. Reviewers: ruiu, pcc, MaskRay, smeenai, alexshap, gkm, Ktwu, christylee Reviewed By: MaskRay, smeenai Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80856	2020-06-17 20:41:28 -07:00
Jez Ng	fcde378dcb	[lld-macho] Support non-pcrel section relocs Summary: Depends on D80854. Reviewers: ruiu, pcc, MaskRay, smeenai, alexshap, gkm, Ktwu, christylee Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80855	2020-06-17 20:41:28 -07:00
Jez Ng	2f4cfa3c7a	[lld-macho] Avoid explicit -arch in tests by defaulting to x86-64 Summary: As mentioned in https://reviews.llvm.org/D81326#2093931, I'm not sure it makes sense to use the default target triple to determine -arch. Long-term we should probably detect it from the input object files, but in the meantime it would be nice not to have to add it to all our tests by using a convenient default. Reviewers: #lld-macho Subscribers: arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81983	2020-06-17 20:41:27 -07:00
Fangrui Song	49279ca160	[ELF] Improve --export-dynamic-symbol performance by checking whether wildcard is really used A hasWildcard pattern iterates over symVector, which can be slow when there are many --export-dynamic-symbol. In optimistic cases, most patterns don't use a wildcard character. hasWildcard: false can avoid a symbol table iteration. While here, add two tests using `[` and `?`, respectively.	2020-06-17 17:12:10 -07:00
Fangrui Song	a649802c59	[lld-macho][test] Fix dylink-lazy.s after `0a5228a02c`	2020-06-16 17:57:59 -07:00
Saleem Abdulrasool	0a5228a02c	test: add library search path to repair the test Update the test to no longer rely on the testing workaround in the linker and use the external libSystem stub.	2020-06-16 17:06:21 -07:00
Saleem Abdulrasool	73312976ad	lld: remove old test support path This removes the stub library that lld injected to satisfy the dependency on the libSystem. Now with TBD support, we can provide the stub library to permit the tests to function properly as they would on a real system. Reviewed By: smeenai Differential Revision: https://reviews.llvm.org/D81418	2020-06-16 15:57:58 -07:00
Greg McGary	7df80e3f23	[lld-macho] Specify the complete set of command-line options for ld64 This is a complete Options.td compiled from ld(1) dated 2018-03-07 and cross checked with ld64 source code version 512.4 dated 2018-03-18. This is the first in a series of diffs for argument handling. Follow-ups will include switch cases for all the new instances of `OPT_foo`, and parsing/validation of arguments attached to options, e.g., more code akin to `OPT_platform_version` and associated `parsePlatformVersion()`. Reviewed By: smeenai Differential Revision: https://reviews.llvm.org/D80582	2020-06-15 12:50:20 -07:00
Wouter van Oortmerssen	d9e0bbd17b	[WebAssembly] Adding 64-bit versions of all load & store ops. Context: https://github.com/WebAssembly/memory64/blob/master/proposals/memory64/Overview.md This is just a first step, adding the new instruction variants while keeping the existing 32-bit functionality working. Some of the basic load/store tests have new wasm64 versions that show that the basics of the target are working. Further features need implementation, but these will be added in followups to keep things reviewable. Differential Revision: https://reviews.llvm.org/D80769	2020-06-15 08:31:56 -07:00
Kirill Bobyrev	9baba7cf66	Revert "[lld-macho] No need to explicitly specify -arch in tests" This reverts commit `51c5baacf3` and also `337fb8c767` - "[lld-macho] Set REQUIRES: x86 on more tests". These patches cause test crashes: http://lab.llvm.org:8011/builders/ppc64le-lld-multistage-test/builds/10054	2020-06-15 12:27:30 +02:00
Jez Ng	337fb8c767	[lld-macho] Set REQUIRES: x86 on more tests Summary: Fixes the build break caused by D81802.	2020-06-14 19:05:12 -07:00
Jez Ng	53c796b948	[lld-macho] Properly handle & validate relocation r_length Summary: We should be reading / writing our addends / relocated addresses based on r_length, and not just based on the type of the relocation. But since only some r_length values are valid for a given reloc type, I've also added some validation. ld64 has code to allow for r_length = 0 in X86_64_RELOC_BRANCH relocs, but I'm not sure how to create such a relocation... Reviewed By: smeenai Differential Revision: https://reviews.llvm.org/D80854	2020-06-14 16:35:23 -07:00
Jez Ng	51c5baacf3	[lld-macho] No need to explicitly specify -arch in tests Summary: After {D81326} landed, some tests started failing if they did not have `-arch` specified. I think one of the reasons this happened was that we were taking a reference to a temporary value that was freed too early. Fixing that got the error to go away on my local Linux machine. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D81802	2020-06-14 16:35:21 -07:00
Pavel Labath	fccaa89e23	[llvm/DWARFDebugLine] Fix a typo in one warning message	2020-06-11 13:04:52 +02:00
Hongtao Yu	2638aafe12	[LLD][ThinLTO] Add --thinlto-single-module to allow compiling partial modules. This change introduces an LLD switch --thinlto-single-module to allow compiling only a part of the input modules. This specifically enables: 1. Fast investigating/debugging modules of interest without spending time on compiling unrelated modules. 2. Compiler debug dump with -mllvm -debug-only= for specific modules. It will be useful for large applications which have 1K+ input modules for thinLTO. The switch can be combined with `--lto-obj-path=` or `--lto-emit-asm` to obtain intermediate object files or assembly files. So far the module name matching is implemented as a fuzzy name lookup where the modules with name containing the switch value are compiled. E.g., Command: ld.lld main.o thin.a --thinlto-single-module=thin.a --lto-obj-path=single.o log: [ThinLTO] Selecting thin.a(thin1.o at 168) to compile [ThinLTO] Selecting thin.a(thin2.o at 228) to compile Command: ld.lld main.o thin.a --thinlto-single-module=thin1.o --lto-obj-path=single.o log: [ThinLTO] Selecting thin.a(thin1.o at 168) to compile Differential Revision: https://reviews.llvm.org/D80406	2020-06-10 15:32:30 -07:00
Pavel Labath	de522035b9	[lld] Fix test/ELF/undef.s for `6f55b5a1` There's been a slight change in wording for the warning message about debug_line problems.	2020-06-10 16:36:10 +02:00
gbreynoo	be44b79257	[lld][test] Expand testing for dynamic-list and export-dynamic - Expanded testing for --dynamic-list and --export-dynamic - Fixed invalid-dynamic-list.test Differential Revision: https://reviews.llvm.org/D80311	2020-06-10 14:06:30 +01:00
Fangrui Song	b114e134bd	[ELF] Fix --thinlto-index-only regression after D79300 After D79300, we don't rewrite InputFile::mb to an empty buffer. In thinLTOCreateEmptyIndexFiles(), we should check LazyObjFile::fetched as well as checking whether mb is a bitcode, otherwise we would overwrite (path + .thinlto.bc) with an empty index.	2020-06-09 23:10:30 -07:00
Fangrui Song	ba890da287	[ELF] Demote lazy symbols relative to a discarded section to Undefined Fixes PR45594. In `ObjFile<ELFT>::initializeSymbols()`, for a defined symbol relative to a discarded section (due to section group rules), it may have been inserted as a lazy symbol. We need to demote it to an Undefined so that the `discarded section` error can be reported in a later pass. Add `LazyObjFile::fetched` (if true) and `ArchiveFile::parsed` (if false) to represent that there is an ongoing lazy symbol fetch and we should replace the current lazy symbol with an Undefined, instead of calling `Symbol::resolve` (`Symbol::resolve` should be called if the lazy symbol was added by an unrelated archive/lazy object). As a side result, one small issue in start-lib-comdat.s is now fixed. The hack motivating D51892 will be unsupported: if `.gnu.linkonce.t.__i686.get_pc_thunk.bx` in an archive is referenced by another section, this will likely be reported as an error unless the function is also defined in a regular object file. (Bringing back rL330869 would error `undefined symbol` instead of the more relevant `discarded section`.) Note, glibc i386's crti.o still works (PR31215), because `.gnu.linkonce.t.__x86.get_pc_thunk.bx` is in crti.o (one of the first regular object files in a linker command line). Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D79300	2020-06-09 11:27:34 -07:00
Saleem Abdulrasool	5fda192fed	test: repair lld builder Add the explicit `-arch x86_64` to the invocation.	2020-06-08 20:13:54 -07:00
Saleem Abdulrasool	6fe27b5fed	lld: initial pass at supporting TBD Add support to lld to use Text Based API stubs for linking. This support is incomplete: it does not filter out platforms. It also does not account for architecture specific API handling and potentially does not correctly handle trees of re-exports with inlined libraries being treated as direct children of the top level library.	2020-06-08 18:15:40 -07:00
Saleem Abdulrasool	fcdf7578aa	lld: improve the `-arch` handling for MachO Use the default target triple configured by the user to determine the default architecture for `ld64.lld`. Stash the architecture in the configuration as when linking against TBDs, we will need to filter out the symbols based upon the architecture. Treat the Haswell slice as it is equivalent to `x86_64` but with the extra Haswell extensions (e.g. AVX2, FMA3, BMI1, etc). This will make it easier to add new architectures in the future. This change also changes the failure mode where an invalid `-arch` parameter will result in the linker exiting without further processing.	2020-06-08 11:04:19 -07:00
Fangrui Song	ac6abc99e2	[ELF] Don't cause assertion failure if --dynamic-list or --version-script takes an empty file Fixes PR46184 Report line 1 of the last memory buffer.	2020-06-05 15:59:54 -07:00
Saleem Abdulrasool	e78431354b	lld: use modern library search ordering This merges the static and shared library and behaves as if `-search_paths_first` was specified which is also the default behaviour on ld64 (and now lld). Unify the paths, and use `llvm::sys::path` to deal with the path to be truly agnostic to the host.	2020-06-05 12:12:26 -07:00
Fangrui Song	7bee6e30fe	[ELF] Handle -u before input files If both a.a and b.so define foo ``` ld.bfd -u foo a.a b.so # foo is defined ld.bfd a.a b.so -u foo # foo is defined ld.bfd -u foo b.so a.a # foo is undefined (provided at runtime by b.so) ld.bfd b.so a.a -u foo # foo is undefined (provided at runtime by b.so) ``` In all cases we make foo undefined in the output. I tend to think the GNU ld behavior makes more sense. * In their model, they have to treat -u as a fake object file with an undefined symbol before all input files, otherwise the first archive would not be fetched. * Following their behavior allows us to drop a --warn-backrefs special case. Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D81052	2020-06-05 08:44:38 -07:00
Saleem Abdulrasool	116e38fd8b	lld: add basic static library search This is a very basic static library search addition. This is the pre-Xcode4 behaviour of searching all paths for the shared version before searching for the static version of the library. This behaviour is supposed to be inverted with `-search_paths_first` being the default. This adds the library search with the intention of providing the setup to merge the paths into one path and making it controllable by `OPT_search_paths_first`.	2020-06-03 23:32:05 +00:00
Fangrui Song	3eb4bf13ba	[ELF] Append " [--no-allow-shlib-undefined]" to the corresponding diagnostics --no-allow-shlib-undefined (enabled by default when linking an executable) rejects unresolved references in shared objects. Users may be confused by the common diagnostics of unresolved symbols in object files (LLD: "undefined symbol: foo"; GNU ld/gold: "undefined reference to") Learn from GCC/clang " [-Wfoo]": append the option name to the diagnostics. Users can find relevant information by searching "--no-allow-shlib-undefined". It should also be obvious to them that the positive form --allow-shlib-undefined can suppress the error. Also downgrade the error to a warning if --noinhibit-exec is used (compatible with GNU ld and gold). Reviewed By: grimar, psmith Differential Revision: https://reviews.llvm.org/D81028	2020-06-03 07:59:37 -07:00
Jez Ng	f04d1c3b90	[lld-macho] Move all tests for erroneous inputs under invalid/ For consistency. The no-id-dylib test was originally referencing the Inputs/ folder via a relative path. Instead of updating that path, I decided to make the test self-contained. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D80217	2020-06-02 13:19:38 -07:00
Jez Ng	a04c133564	[lld-macho] Set __PAGEZERO size to 4GB That's what ld64 uses for 64-bit targets. I figured it's best to make this change sooner rather than later since a bunch of our tests are relying on hardcoded addresses that depend on this value. Reviewed By: smeenai Differential Revision: https://reviews.llvm.org/D80177	2020-06-02 13:19:38 -07:00
Jez Ng	df2a5778c3	[lld-macho] Error on encountering undefined symbols ... instead of silently emitting a reference to the zero address. Reviewed By: smeenai Differential Revision: https://reviews.llvm.org/D80169	2020-06-02 13:19:38 -07:00
Jez Ng	6f6d91867d	[lld-macho] Add some relocation validation logic I considered making a `Target::validate()` method, but I wasn't sure how I felt about the overhead of doing yet another switch-dispatch on the relocation type, so I put the validation in `relocateOne` instead... might be a bit of a micro-optimization, but `relocateOne` does assume certain things about the relocations it gets, and this error handling makes that explicit, so it's not a totally unreasonable code organization. Reviewed By: smeenai Differential Revision: https://reviews.llvm.org/D80049	2020-06-02 13:19:38 -07:00
Igor Kudrin	4933ab2ccb	[DebugInfo] Report the format of call frame information entries [6/10] Differential Revision: https://reviews.llvm.org/D80523	2020-06-02 17:55:30 +07:00
Simon Atanasyan	b00f0d4238	[mips] Support 64-bit relative relocations MIPS 64-bit ABI does not provide a special PC-relative relocation like R_MIPS_PC32 in the 32-bit case. But we can use a "chain of relocation" defined by N64 ABIs. In that case one relocation record might contain up to three relocations which are applied sequentially. The width of the final relocation mask applied to the result of relocation depends on the last relocation in the chain. In case of 64-bit PC-relative relocation we need the following chain: `R_MIPS_PC32 \| R_MIPS_64`. The first relocation calculates an offset, but does not truncate the result. The second relocation just applies the calculated result as a 64-bit value. The 64-bit PC-relative relocation might be useful in generation of `.eh_frame` sections to avoid passing `-Wl,-z,notext` flags to the linker. Differential Revision: https://reviews.llvm.org/D80390	2020-06-02 11:44:11 +03:00
Fangrui Song	a6ae333a0c	[ELF] --wrap: don't error `undefined reference to __real_foo` (--no-allow-shlib-undefined) if foo is a wrapped definition This is a regression after D51283. Also, export `foo` if `__real_foo` is referenced by a shared object.	2020-06-01 23:00:51 -07:00
Fangrui Song	751f18e7d4	[ELF] Refine --export-dynamic-symbol semantics to be compatible with GNU ld 2.35 GNU ld from binutils 2.35 onwards will likely support --export-dynamic-symbol but with different semantics. https://sourceware.org/pipermail/binutils/2020-May/111302.html Differences: 1. -export-dynamic-symbol is not supported 2. --export-dynamic-symbol takes a glob argument 3. --export-dynamic-symbol can suppress binding the references to the definition within the shared object if (-Bsymbolic or -Bsymbolic-functions) 4. --export-dynamic-symbol does not imply -u I don't think the first three points can affect any user. For the fourth point, not implying -u can lead to some archive members unfetched. Add -u foo to restore the previous behavior. Exact semantics: * -no-pie or -pie: matched non-local defined symbols will be added to the dynamic symbol table. * -shared: matched non-local STV_DEFAULT symbols will not be bound to definitions within the shared object even if they would otherwise be due to -Bsymbolic, -Bsymbolic-functions, or --dynamic-list. Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D80487	2020-06-01 11:30:03 -07:00
Fangrui Song	ee9a251caf	[ELF] Set DF_1_PIE for -pie DF_1_PIE originated from Solaris (https://docs.oracle.com/cd/E36784_01/html/E36857/chapter6-42444.html ). GNU ld since https://sourceware.org/git/?p=binutils-gdb.git;a=commit;h=5fe2850dd96483f176858fd75c098313d5b20bc2 sets the flag on non-Solaris platforms. It can help distinguish PIE from ET_DYN. eu-classify from elfutils uses this to recognize PIE (https://sourceware.org/git/?p=elfutils.git;a=commit;h=3f489b5c7c78df6d52f8982f79c36e9a220e8951 ) glibc uses this flag to reject dlopen'ing a PIE (https://sourceware.org/bugzilla/show_bug.cgi?id=24323 ) Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D80872	2020-06-01 10:19:41 -07:00
Fangrui Song	881c5eef98	[ELF] Add -z rel and -z rela LLD supports both REL and RELA for static relocations, but emits either of REL and RELA for dynamic relocations. The relocation entry format is specified by each psABI. musl ld.so supports both REL and RELA. For such ld.so implementations, REL (.rel.dyn .rel.plt) has size benefits even if the psABI chooses RELA: sizeof(Elf64_Rel)=16 < sizeof(Elf64_Rela)=24. * COPY, GLOB_DAT and J[U]MP_SLOT always have 0 addend. A ld.so implementation does not need to read the implicit addend. REL is strictly better. * A RELATIVE has a non-zero addend. Such relocations can be packed compactly with the RELR relocation entry format, which is out of scope of this patch. * For other dynamic relocation types (e.g. symbolic relocation R_X86_64_64), a ld.so implementation needs to read the implicit addend. REL may have minor performance impact, because reading implicit addends forces random access reads instead of being able to blast out a bunch of writes while chasing the relocation array. This patch adds -z rel and -z rela to change the relocation entry format for dynamic relocations. I have tested that a -z rel produced x86-64 executable works with musl ld.so -z rela may be useful for debugging purposes on processors whose psABIs specify REL as the canonical format: addends can be easily read by a tool. Reviewed By: grimar, mcgrathr Differential Revision: https://reviews.llvm.org/D80496	2020-05-29 14:22:03 -07:00
Sam Clegg	fd1c894a4a	[lld][WebAssembly] Convert some lld tests to assembly When we originally wrote these tests we didn't have a stable and fleshed out assembly format. Now that we do, we should prefer that over llvm ir for lld tests to avoid including more parts of llvm than necessary in order to run the test. This change converts just 30 out of about 130 test files. More to come when I have some more time. Differential Revision: https://reviews.llvm.org/D80361	2020-05-28 16:52:01 -07:00
Hiroshi Yamauchi	a7fa35a629	[ThinLTO] Compute the basic block count across modules. Summary: Count the per-module number of basic blocks when the module summary is computed and sum them up during Thin LTO indexing. This is used to estimate the working set size under the partial sample PGO. This is split off of D79831. Reviewers: davidxl, espindola Subscribers: emaste, inglorion, hiraditya, MaskRay, steven_wu, dexonsmith, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80403	2020-05-28 10:33:05 -07:00
Fangrui Song	1224e619d9	[ELF][test] Fix wrap-no-real.s after D51283 Give %t3.so a DT_SONAME so that the DT_NEEDED entry in a dependent executable has a fixed length.	2020-05-27 17:43:33 -07:00
Rui Ueyama	54d2896852	[ELF] --wrap: Drop __real_ symbol from the symbol table In D34993, we discussed and concluded that we should drop `__real_` symbol from the symbol table, but I did the opposite in D50569. This patch is to drop `__real_` symbol. MaskRay's note: omitting `__real_` is important if it is undefined: otherwise a subsequent link may error due to the undefined `__real_` in .dynsym Differential Revision: https://reviews.llvm.org/D51283	2020-05-27 16:58:00 -07:00
Fangrui Song	b8a3c618d6	[ELF] Allow misaligned SHT_GNU_verneed Bazel created interface shared objects (.ifso) may be misaligned. We use llvm::support::detail::packed_endian_specific_integral under the hood which allows reading of misaligned values, so there is not a need to diagnose (in LLD we don't intend to support sophisticated parsing for SHT_GNU_*).	2020-05-26 11:18:19 -07:00
Fangrui Song	bae7cf6746	[ELF][PPC64] Synthesize _savegpr[01]_{14..31} and _restgpr[01]_{14..31} In the 64-bit ELF V2 ABI Specification: Power Architecture, 2.3.3.1. GPR Save and Restore Functions defines some special functions which may be referenced by GCC produced assembly (LLVM does not reference them). With GCC -Os, when the number of call-saved registers exceeds a certain threshold, GCC generates `_savegpr0_*` and `_restgpr0_*` calls and expects the linker to define them. See https://sourceware.org/pipermail/binutils/2002-February/017444.html and https://sourceware.org/pipermail/binutils/2004-August/036765.html . This is weird because libgcc.a would be the natural place. However, the linker generation approach has the advantage that the linker can generate multiple copies to avoid long branch thunks. We don't consider the advantage significant enough to complicate our trunk implementation, so we take a simple approach. Check whether `_savegpr0_{14..31}` are used * If yes, define needed symbols and add an InputSection with the code sequence. `_savegpr1_*`, `_restgpr0_*` and `_restgpr1_*` are similar. Reviewed By: sfertile Differential Revision: https://reviews.llvm.org/D79977	2020-05-26 09:35:41 -07:00
Kang Zhang	86e3abc9e6	[PowerPC] Add some InstAlias definitions Summary: This patch adds the InstAlias definitions for the instructions below. ADDI ADDIS ADDI8 ADDIS8 RLWINM8 ISEL ISEL8 OR OR_rec ORI ORI8 XORI8 CNTLZW8 CNTLZW8_rec TEND TSR RFEBB NOR NOR_rec MTCRF SUBF SUBF_rec SUBFC SUBFC_rec RLDICL_32_64 TW Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D77559	2020-05-24 14:05:28 +00:00
Tobias Hieta	f794808bb9	[LLD/MinGW]: Expose --thinlto-cache-dir Differential Revision: https://reviews.llvm.org/D80438	2020-05-24 12:30:56 +03:00
Fangrui Song	e32f04cdc9	[ELF] Parse SHT_GNU_verneed and respect versioned undefined symbols in shared objects An undefined symbol in a shared object can be versioned, like `f@v1`. We currently insert `f` as an Undefined into the symbol table, but we should insert `f@v1` instead. The string `v1` is inferred from SHT_GNU_versym and SHT_GNU_verneed. This patch implements the functionality. Failing to do this can cause two issues: * If a versioned symbol referenced by a shared object is defined in the executable, we will fail to export it. * If a versioned symbol referenced by a shared object in another object file, --no-allow-shlib-undefined may spuriously report an "undefined reference to " error. See https://bugs.llvm.org/show_bug.cgi?id=44842 (Linking -lfftw3 -lm on Arch Linux can cause `undefined reference to __log_finite`) Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D80059	2020-05-23 09:55:48 -07:00
Thomas Lively	d851fce4cb	[lld][WebAssembly] Do not emit initialization for .bss segments Summary: This patch fixes a bug where initialization code for .bss segments was emitted in the memory initialization function even though the .bss segments were discounted in the datacount section and omitted in the data section. This was producing invalid binaries due to out-of-bounds segment indices on the memory.init and data.drop instructions that were trying to operate on the nonexistent .bss segments. Reviewers: sbc100 Subscribers: dschuff, jgravelle-google, aheejin, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80354	2020-05-21 11:33:25 -07:00
Jez Ng	ce0d8beebc	[lld-macho][re-land] Support X86_64_RELOC_UNSIGNED This reverts commit `db8559eee4`.	2020-05-19 12:31:55 -07:00
Jez Ng	4eb6f4854e	[lld-macho][re-land] Support .subsections_via_symbols Summary: This diff restores and builds upon @pcc and @ruiu's initial work on subsections. The .subsections_via_symbols directive indicates we can split each section along symbol boundaries, unless those symbols have been marked with `.alt_entry`. We exercise this functionality in our tests by using order files that rearrange those symbols. Depends on D79668. Reviewers: ruiu, pcc, MaskRay, smeenai, alexshap, gkm, Ktwu, christylee Reviewed By: smeenai Subscribers: thakis, llvm-commits, pcc, ruiu Tags: #llvm Differential Revision: https://reviews.llvm.org/D79926	2020-05-19 12:31:54 -07:00
Fangrui Song	6467649974	[ELF] Make --trace-symbol track preempted shared definitions Note, we still name a preempted SharedSymbol "shared definition", instead of "reference" as printed by GNU ld. This difference should not matter. ``` // GNU ld ld.bfd: t: definition of f@v1 ld.bfd: t.so: reference to f@v1 ``` Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D80143	2020-05-19 08:56:35 -07:00
Jez Ng	70fbbcdd34	Revert "[lld-macho] Support .subsections_via_symbols" Due to build breakage mentioned in https://reviews.llvm.org/D79926. This reverts commit `e270b2f172`.	2020-05-19 08:30:02 -07:00
Jez Ng	db8559eee4	Revert "[lld-macho] Support X86_64_RELOC_UNSIGNED" This reverts commit `1f820e3559`.	2020-05-19 08:30:02 -07:00
Jez Ng	1f820e3559	[lld-macho] Support X86_64_RELOC_UNSIGNED Note that it's only used for non-pc-relative contexts. Reviewed By: MaskRay, smeenai Differential Revision: https://reviews.llvm.org/D80048	2020-05-19 07:46:57 -07:00
Jez Ng	e270b2f172	[lld-macho] Support .subsections_via_symbols This diff restores and builds upon @pcc and @ruiu's initial work on subsections. The .subsections_via_symbols directive indicates we can split each section along symbol boundaries, unless those symbols have been marked with `.alt_entry`. We exercise this functionality in our tests by using order files that rearrange those symbols. Reviewed By: smeenai Differential Revision: https://reviews.llvm.org/D79926	2020-05-19 07:46:57 -07:00
Jez Ng	55e9eb416e	[lld-macho] Support -order_file The order file indicates how input sections should be sorted within each output section, based on the symbols contained within those sections. This diff sets the stage for implementing and testing `.subsections_via_symbols`, where we will break up InputSections by each symbol and sort them more granularly. Reviewed By: smeenai Differential Revision: https://reviews.llvm.org/D79668	2020-05-19 07:46:57 -07:00
Greg McGary	1aadd6ce61	[lld] Remove unused lld/test/Driver/Inputs/**/libtest.a Under `lld/test/Driver/Inputs/`, all instances of `libtest.a` are unreferenced. FYI, all of these are empty archives, and the files contain only a magic number. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D80182	2020-05-19 07:19:10 -07:00
Hongtao Yu	90af55d8a9	[LLD][ELF] Use offset in thin archives to disambiguate thinLTO members This is fixing a thinLTO module collision issue for thin archives. The problem is that we always use a zero offset to name members in a thin archive and that causes the following build error: ld.lld: error: Expected at most one ThinLTO module per bitcode file which happens to a thin archive that has two members with the same object file name (whose paths will be ignored by thinLTO driver) The fix here is to use real member offset instead as is done for non-thin archives. Differential Revision: https://reviews.llvm.org/D79880	2020-05-15 12:02:08 -07:00
Fangrui Song	e36223c85c	[ELF] Enforce two dashes for Flag options not supported by GNU ld (i.e. no compatibility burden) Announced on https://lists.llvm.org/pipermail/llvm-dev/2020-May/141416.html Similar to D79371, but for `multiclass B` (convenience helper for defining --foo and --no-foo) Some changed options are also used by gold, but I haven't seen their one-dash use cases outside of lld's testsuite.	2020-05-15 11:07:25 -07:00
Georgii Rymar	969c63a2ec	[LLD][ELF] - Fix section-alignment.test after yaml2obj change. yaml2obj changed in rG7ccae2cece72. Previous version of the yaml produced a different error and a 500mb binary.	2020-05-15 12:00:37 +03:00
Kellie Medlin	2b920ae78c	[lld] Add archive file support to Mach-O backend With this change, basic archive files can be linked together. Input section discovery has been refactored into a function since archive files lazily resolve their symbols / the object files containing those symbols. Reviewed By: int3, smeenai Differential Revision: https://reviews.llvm.org/D78342	2020-05-14 12:58:35 -07:00
Reid Kleckner	f5a79713b6	[LLD] Fix precomp-link.test I had a stale directory at %t, so the test passed locally, but not remotely. Skip the %t directory altogether.	2020-05-14 10:32:50 -07:00
Reid Kleckner	54a335a2f6	[COFF] Move type merging to TpiSource::mergeDebugT virtual method This paves the way to doing more things in parallel, and allows us to order type sources in dependency order. PDBs and PCH objects have to be loaded before object files which use them. This is a rebase of the unapplied remaining changes in https://reviews.llvm.org/D59226. I found it very challenging to rebase this across the LLD variable name style change. I recall there was a tool for that, but I didn't take the time to use it. Reviewers: aganea, akhuang Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79672	2020-05-14 09:47:00 -07:00
Martin Storsjö	b0969be4df	[LLD] [COFF] Try to fix test errors from `7f0e6c31c2` on windows Just skip trying to match for the path separator explicitly (instead of making it match either a forward or backwards slash), simplifying the test a little.	2020-05-14 13:47:34 +03:00
Martin Storsjö	7f0e6c31c2	[LLD] [COFF] Add options for disabling auto import and runtime pseudo relocs Allow disabling either the full auto import feature, or just forbidding the cases that require runtime fixups. As long as all auto imported variables are referenced from separate .refptr$<name> sections, we can alias them on top of the IAT entries and don't actually need any runtime fixups via pseudo relocations. LLVM generates references to variables in .refptr stubs, if it isn't known that the variable for sure is defined in the same object module. Runtime pseudo relocs are needed if the addresses of auto imported variables are used in constant initializers though. Fixing up runtime pseudo relocations requires the use of VirtualProtect (which is disallowed in WinStore/UWP apps) or VirtualProtectFromApp. To allow any risk of ambiguity, allow rejecting cases that would require this at the linker stage. This adds support for the --disable-runtime-pseudo-reloc and --disable-auto-import options in the MinGW driver (matching GNU ld.bfd) with corresponding lld private options in the COFF driver. Differential Revision: https://reviews.llvm.org/D78923	2020-05-14 13:05:14 +03:00
Sam Clegg	064e9907ba	[lld][WebAssembly] Fix for --relocatable and signature mismatches This is a followup to https://reviews.llvm.org/D78779. When signatures mismatch we create set of variant symbols. Some of the fields in these symbols were not be initialized correct. Specifically we were seeing isUsedInRegularObj not being set correctly, leading to the symbol not getting included in the symbol table and a crash writing relections in --reloctable mode. There is larger refactor due here, but this is a minimal change the fixes the bug at hand. Differential Revision: https://reviews.llvm.org/D79756	2020-05-13 10:27:09 -07:00
Nico Weber	759bae956a	[lld-macho] Ignore -platform_version and -syslibroot flags. clang passes these flags; this makes it easier to try `clang -v` output with `ld -flavor darwinnew`. Differential Revision: https://reviews.llvm.org/D79797	2020-05-12 19:17:01 -04:00
Jez Ng	87b6fd3e02	[lld-macho] Add support for creating and reading reexported dylibs This unblocks the linking of real programs, since many core system functions are only available as sub-libraries of libSystem. Differential Revision: https://reviews.llvm.org/D79228	2020-05-12 07:52:03 -07:00
Jez Ng	c8c39185f3	[lld-macho] Re-add dylink-lazy test This reverts commit eb81de2de4003e3045fdf743e093f77e37aee9bf; the test commands just needed to be run under llvm-lit.	2020-05-12 07:52:03 -07:00
Eric Christopher	eb81de2de4	Temporarily Revert "[lld-macho] Re-add dylink-lazy test" as it appears to be still failing. This reverts commit `723c46e645`.	2020-05-11 19:47:21 -07:00
Jez Ng	723c46e645	[lld-macho] Re-add dylink-lazy test The initial attempt didn't work on Windows; apparently Powershell has a different syntax for running commands sequentially and concatenating their outputs. So I've created two temporary files instead. Differential Revision: https://reviews.llvm.org/D79697	2020-05-11 12:39:21 -07:00
Peter Smith	0ae7990b60	[ELF][ARM] Support /DISCARD/ of subset of .ARM.exidx sections Both the .ARM.exidx and .eh_frame sections have a custom SyntheticSection that acts as a container for the InputSections. The InputSections are added to the SyntheticSection prior to /DISCARD/ which limits the affect a /DISCARD/ can have to the whole SyntheticSection. In the majority of cases this is sufficient as it is not common to discard subsets of the InputSections. The Linux kernel has one of these scripts which has something like: /DISCARD/ : { *(.ARM.exidx.exit.text) *(.ARM.extab.exit.text) ... } The .ARM.exidx.exit.text are not discarded because the InputSection has been transferred to the Synthetic Section. The *(.ARM.extab.exit.text) sections have not so they are discarded. When we come to write out the .ARM.exidx sections the dangling references from .ARM.exidx.exit.text to .ARM.extab.exit.text currently cause relocation out of range errors, but could as easily cause a fatal error message if we check for dangling references at relocation time. This patch attempts to respect the /DISCARD/ command by running it on the .ARM.exidx InputSections stored in the SyntheticSection. The .eh_frame is in theory affected by this problem, but I don't think that there is a dangling reference problem that can happen with these sections. Fixes remaining part of pr44824 Differential Revision: https://reviews.llvm.org/D79687	2020-05-11 14:27:13 +01:00
Martin Storsjö	ed0a57f753	[LLD] [COFF] Fix def file exporting of symbols containing periods This fixes an accidental breakage of exporting symbols using def files, when the symbol name contains a period, since commit `0ca06f7950`, mixing up a symbol name containing a period with the case of exporting a symbol as a forward to another dll. Differential Revision: https://reviews.llvm.org/D79619	2020-05-10 23:30:14 +03:00
Jez Ng	0dcd7c34f1	[lld-macho] Delete dylink-lazy test for now Summary: It's breaking Windows builds	2020-05-10 02:22:33 -07:00
Jez Ng	8a5e45aa90	[lld-macho] Try to unbreak dylink-lazy test on Windows	2020-05-10 00:23:32 -07:00
Jez Ng	198b0c57df	[lld-macho] Support pc-relative section relocations Summary: So far we've only supported symbol relocations. Reviewers: ruiu, pcc, MaskRay, smeenai, alexshap, gkm, Ktwu, christylee Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79211	2020-05-09 20:56:23 -07:00
Jez Ng	7bbdbacd00	[lld-macho] Use export trie instead of symtab when linking against dylibs Summary: This allows us to link against stripped dylibs. Moreover, it's simply more correct: The symbol table includes symbols that the dylib uses but doesn't export. This temporarily regresses our ability to do lazy symbol binding because dyld_stub_binder isn't in libSystem's export trie. Rather, it is in one of the sub-libraries libSystem re-exports. (This doesn't affect our tests since we are mocking out dyld_stub_binder there.) A follow-up diff will address this by adding support for sub-libraries. Depends on D79114. Reviewers: ruiu, pcc, MaskRay, smeenai, alexshap, gkm, Ktwu, christylee Subscribers: mgorny, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79226	2020-05-09 20:56:22 -07:00
Jez Ng	5d3feefa0d	[lld-macho] Dylib symbols should always replace undefined symbols Summary: Otherwise we get undefined symbol errors depending on the order of arguments on the command line. Depends on D78270. Reviewers: ruiu, pcc, MaskRay, smeenai, alexshap, gkm, Ktwu, christylee Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79114	2020-05-09 20:56:22 -07:00
Jez Ng	b3e2fc931d	[lld-macho] Support calls to functions in dylibs Summary: This diff implements lazy symbol binding -- very similar to the PLT mechanism in ELF. ELF's .plt section is broken up into two sections in Mach-O: StubsSection and StubHelperSection. Calls to functions in dylibs will end up calling into StubsSection, which contains indirect jumps to addresses stored in the LazyPointerSection (the counterpart to ELF's .plt.got). Initially, the LazyPointerSection contains addresses that point into one of the entry points in the middle of the StubHelperSection. The code in StubHelperSection will push on the stack an offset into the LazyBindingSection. The push is followed by a jump to the beginning of the StubHelperSection (similar to PLT0), which then calls into dyld_stub_binder. dyld_stub_binder is a non-lazily bound symbol, so this call looks it up in the GOT. The stub binder will look up the bind opcodes in the LazyBindingSection at the given offset. The bind opcodes will tell the binder to update the address in the LazyPointerSection to point to the symbol, so that subsequent calls don't have to redo the symbol resolution. The binder will then jump to the resolved symbol. Depends on D78269. Reviewers: ruiu, pcc, MaskRay, smeenai, alexshap, gkm, Ktwu, christylee Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78270	2020-05-09 20:56:22 -07:00
Jez Ng	db157d2733	[lld-macho] Follow-up to D77893 Summary: 1. Don't have isHidden() depend on isNeeded(). Whether a section is hidden is orthogonal from whether it is needed: hidden sections will never have a header regardless of whether they have a body. (I know we override this method with return false for synthetic sections, but regardless I think it's confusing to write it this way for non-synthetic sections.) 2. Don't call writeTo() on unneeded sections. D78270 assumes that this is true when implementing the stub helper section. 3. Filter out the unneeded sections early on to avoid having to deal with them in multiple places. 4. Remove assumption in test that the referenced file has no other symbols. (We should create separate input files for future tests to avoid such issues.) Reviewers: ruiu, pcc, MaskRay, smeenai, alexshap, gkm, Ktwu, christylee Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79460	2020-05-09 20:56:22 -07:00
Thomas Lively	a1ae9566ea	[WebAssembly] Disallow 'shared-mem' rather than 'atomics' Summary: The WebAssembly backend automatically lowers atomic operations and TLS to nonatomic operations and non-TLS data when either are present and the atomics or bulk-memory features are not present, respectively. The resulting object is no longer thread-safe, so the linker has to be told not to allow it to be linked into a module with shared memory. This was previously done by disallowing the 'atomics' feature, which prevented any objct with its atomic operations or TLS removed from being linked with any object containing atomics or TLS, and therefore preventing it from being linked into a module with shared memory since shared memory requires atomics. However, as of https://github.com/WebAssembly/threads/issues/144, the validation rules are relaxed to allow atomic operations to validate with unshared memories, which makes it perfectly safe to link an object with stripped atomics and TLS with another object that still contains TLS and atomics as long as the resulting module has an unshared memory. To allow this kind of link, this patch disallows a pseudo-feature 'shared-mem' rather than 'atomics' to communicate to the linker that the object is not thread-safe. This means that the 'atomics' feature is available to accurately reflect whether or not an object has atomics enabled. As a drive-by tweak, this change also requires that bulk-memory be enabled in addition to atomics in order to use shared memory. This is because initializing shared memories requires bulk-memory operations. Reviewers: aheejin, sbc100 Subscribers: dschuff, jgravelle-google, hiraditya, sunfish, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79542	2020-05-08 13:52:39 -07:00
Wei Mi	538208f6c0	[lld] Add a new output section ".text.unknown" for funtions with unknown hotness For sampleFDO, because the optimized build uses profile generated from previous release, often we couldn't tell a function without profile was truely cold or just newly created so we had to treat them conservatively and put them in .text section instead of .text.unlikely. The result was when we persue the best performance by locking .text.hot and .text in memory, we wasted a lot of memory to keep cold functions inside. This problem has been largely solved for regular sampleFDO using profile-symbol-list (https://reviews.llvm.org/D66374), but for the case when we use partial profile, we still waste a lot of memory because of it. In https://reviews.llvm.org/D62540, we propose to save functions with unknown hotness information in a special section called ".text.unknown", so that compiler will treat those functions as luck-warm, but runtime can choose not to mlock the special section in memory or use other strategy to save memory. That will solve most of the memory problem even if we use a partial profile. The patch adds the support in lld for the special section. Differential Revision: https://reviews.llvm.org/D79590	2020-05-08 11:14:48 -07:00
Fangrui Song	e20a215992	[ELF] Add convenience TableGen classes to enforce two dashes for options not supported by GNU ld Announced on https://lists.llvm.org/pipermail/llvm-dev/2020-May/141416.html For many options, we have to support either one or two dash to be compatible with GNU ld. For newer and lld specific options, we can enforce strict double dashes. Affected options: * --thinlto-* * --lto-* * --shuffle-sections= This patch does not change `-plugin-opt=` because clang driver passes `-plugin-opt=` and I don't intend to cause churn. In 2000, GNU ld tried something similar with --omagic https://sourceware.org/git/?p=binutils-gdb.git;a=commit;h=e4897a3288f37d5f69e8acd256a6e83e607fe8d8 Reviewed By: tejohnson, psmith Differential Revision: https://reviews.llvm.org/D79371	2020-05-08 07:37:06 -07:00
Reid Kleckner	d71c3c425c	[COFF] Dump string table size for COFF file headers I couldn't find this info in any other dumper, so it might as well be here.	2020-05-06 15:48:36 -07:00
Sam Clegg	f03b6e785b	[lld][WebAssembly] Honor --allow-undefined for data symbols too This was originally the way this worked before before https://reviews.llvm.org/D60882. In retrospect it seems inconsistent that `--allow-undefined` doesn't work for all symbols. See: https://groups.google.com/g/emscripten-discuss/c/HSRgQiIq1gI/m/Kt9oFWHiAwAJ I'm also planning a followup change which implement the full `--unresolved-symbols=..` flags supported by ELF linkers (both ld and ld.lld) since it seems more standard. Differential Revision: https://reviews.llvm.org/D79247	2020-05-06 12:39:29 -07:00
Alexandre Ganea	6adc45d3fd	[LLD][COFF] Move debug info for thread-local variables into PDB global stream Before this patch, the debug record S_GTHREAD32 which represents global thread_local symbols, was emitted by LLD into the respective module stream. This makes Visual Studio unable to display thread_local symbols in the debugger. After this patch, S_GTHREAD32 is moved into the globals stream. This matches MSVC behavior. Differential Revision: https://reviews.llvm.org/D79005	2020-05-06 15:23:58 -04:00
Sid Manning	0e6536fd97	[Hexagon] Add R_HEX_GD_PLT_B22/32_PCREL relocations Extended versions of GD_PLT_B22_PCREL. These surface when -mlong-calls is used. Differential Revision: https://reviews.llvm.org/D79191	2020-05-05 11:47:51 -05:00
Peter Smith	48aebfc908	[ELF][ARM] Do not create .ARM.exidx sections for out of range inputs A linker will create .ARM.exidx sections for InputSections that don't have them. This can cause a relocation out of range error If the InputSection happens to be extremely far away from the other sections. This is often the case for the vector table on older ARM CPUs as the only two places that the table can be placed is 0 or 0xffff0000. We fix this by removing InputSections that need a linker generated .ARM.exidx section if that would cause an error. Differential Revision: https://reviews.llvm.org/D79289	2020-05-05 09:59:45 +01:00

6189 Commits