llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	76093b1739	[InlineAdvisor] Add single quotes around caller/callee names Clang diagnostics refer to identifier names in quotes. This patch makes inline remarks conform to the convention. New behavior: ``` % clang -O2 -Rpass=inline -Rpass-missed=inline -S a.c a.c:4:25: remark: 'foo' inlined into 'bar' with (cost=-30, threshold=337) at callsite bar:0:25; [-Rpass=inline] int bar(int a) { return foo(a); } ^ ``` Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D107791	2021-08-10 11:51:31 -07:00
Ben Dunbobbin	8392e8c007	[LLD][Test] Add thin archives to map file test This adds thin archives to the map file test. I noticed that we had this test-case in our downstream testsuite but it wasn't in the upstream testing. Differential revision: https://reviews.llvm.org/D107555	2021-08-10 10:24:01 +01:00
Simon Atanasyan	454f69bcc1	[LLD] Add required `ppc` target to the test cases. NFC	2021-08-07 13:29:59 +03:00
Simon Atanasyan	c6ebc651b6	[LLD] Support compressed input sections on big-endian targets This patch enables compressed input sections on big-endian targets by checking the target endianness and selecting an appropriate `Chdr` structure. Fixes PR51369 Differential Revision: https://reviews.llvm.org/D107635	2021-08-07 13:20:13 +03:00
Paul Robinson	34035b1044	2nd Speculative fix for MachO lld test after "Have REQUIRES support the target triple" See: http://45.33.8.238/macm1/15677/step_10.txt Follow-up to `f88ad8d` as it appears the lld invocations both emit an error message; so, try adding 'not' to the RUN lines.	2021-08-06 10:49:36 -07:00
Paul Robinson	f88ad8d00f	Speculative fix for MachO lld test after "Have REQUIRES support the target triple" See: http://45.33.8.238/macm1/15677/step_10.txt This is a test that has `REQUIRES: x86` which means it never ran before; I don't have a MachO environment but based on the FileCheck output it looks like it should be sufficient to remove one CHECK line.	2021-08-06 09:23:45 -07:00
Fangrui Song	72d070b4db	[ELF] Support copy relocation on non-default version symbols Copy relocation on a non-default version symbol is unsupported and can crash at runtime. Fortunately there is a one-line fix which works for most cases: ensure `getSymbolsAt` unconditionally returns `ss`. If two non-default version symbols are defined at the same place and both are copy relocated, our implementation will copy relocated them into different addresses. The pointer inequality is very unlikely an issue. In GNU ld, copy relocating version aliases seems to create more pointer inequality problems than us. ( In glibc, sys_errlist@GLIBC_2.2.5 sys_errlist@GLIBC_2.3 sys_errlist@GLIBC_2.4 are defined at the same place, but it is unlikely they are all copy relocated in one executable. Even if so, the variables are read-only and pointer inequality should not be a problem. ) Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107535	2021-08-05 10:32:14 -07:00
Fangrui Song	00809c8889	[ELF] Apply version script patterns to non-default version symbols Currently version script patterns are ignored for .symver produced non-default version (single @) symbols. This makes such symbols not localizable by `local:`, e.g. ``` .symver foo3_v1,foo3@v1 .globl foo_v1 foo3_v1: ld.lld --version-script=a.ver -shared a.o ``` This patch adds the support: * Move `config->versionDefinitions[VER_NDX_LOCAL].patterns` to `config->versionDefinitions[versionId].localPatterns` * Rename `config->versionDefinitions[versionId].patterns` to `config->versionDefinitions[versionId].nonLocalPatterns` * Allow `findAllByVersion` to find non-default version symbols when `includeNonDefault` is true. (Note: `symtab` keys do not have `@@`) * Make each pattern check both the unversioned `pat.name` and the versioned `${pat.name}@${v.name}` * `localPatterns` can localize `${pat.name}@${v.name}`. `nonLocalPatterns` can prevent localization by assigning `verdefIndex` (before `parseSymbolVersion`). --- If a user notices new `undefined symbol` errors with a version script containing `local: *;`, the issue is likely due to a missing `global:` pattern. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107234	2021-08-04 23:52:56 -07:00
Fangrui Song	a533eb7423	Revert "[ELF] Apply version script patterns to non-default version symbols" This reverts commit `7ed22a6fa9`. buf is not cleared so the commit misses some cases.	2021-08-04 23:52:55 -07:00
Fangrui Song	66d4430492	[ELF] Combine foo@v1 and foo with the same versionId if both are defined Due to an assembler design flaw (IMO), `.symver foo,foo@v1` produces two symbols `foo` and `foo@v1` if `foo` is defined. * `v1 {};` produces both `foo` and `foo@v1`, but GNU ld only produces `foo@v1` * `v1 { foo; };` produces both `foo@@v1` and `foo@v1`, but GNU ld only produces `foo@v1` * `v2 { foo; };` produces both `foo@@v2` and `foo@v1`, matching GNU ld. (Tested by symver.s) This patch implements the GNU ld behavior by reusing the symbol redirection mechanism in D92259. The new test symver-non-default.s checks the first two cases. Without the patch, the second case will produce `foo@v1` and `foo@@v1` which looks weird and makes foo unnecessarily default versioned. Note: `.symver foo,foo@v1,remove` exists but the unfortunate `foo` will not go away anytime soon. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107235	2021-08-04 09:06:05 -07:00
Fangrui Song	7ed22a6fa9	[ELF] Apply version script patterns to non-default version symbols Currently version script patterns are ignored for .symver produced non-default version (single @) symbols. This makes such symbols not localizable by `local:`, e.g. ``` .symver foo3_v1,foo3@v1 .globl foo_v1 foo3_v1: ld.lld --version-script=a.ver -shared a.o # In a.out, foo3@v1 is incorrectly exported. ``` This patch adds the support: * Move `config->versionDefinitions[VER_NDX_LOCAL].patterns` to `config->versionDefinitions[versionId].localPatterns` * Rename `config->versionDefinitions[versionId].patterns` to `config->versionDefinitions[versionId].nonLocalPatterns` * Allow `findAllByVersion` to find non-default version symbols when `includeNonDefault` is true. (Note: `symtab` keys do not have `@@`) * Make each pattern check both the unversioned `pat.name` and the versioned `${pat.name}@${v.name}` * `localPatterns` can localize `${pat.name}@${v.name}`. `nonLocalPatterns` can prevent localization by assigning `verdefIndex` (before `parseSymbolVersion`). --- If a user notices new `undefined symbol` errors with a version script containing `local: *;`, the issue is likely due to a missing `global:` pattern. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107234	2021-08-04 09:02:11 -07:00
Fangrui Song	9bd29a73d1	[ELF] Make dot in .tbss correct GNU ld doesn't support multiple SHF_TLS SHT_NOBITS output sections (it restores the address after an SHF_TLS SHT_NOBITS section, so consecutive SHF_TLS SHT_NOBITS sections will have conflicting address ranges). That said, `threadBssOffset` implements limited support for consecutive SHF_TLS SHT_NOBITS sections. (SHF_TLS SHT_PROGBITS following a SHF_TLS SHT_NOBITS can still be incorrect.) `.` in an output section description of an SHF_TLS SHT_NOBITS section is incorrect. (https://lists.llvm.org/pipermail/llvm-dev/2021-July/151974.html) This patch saves the end address of the previous tbss section in `ctx->tbssAddr`, changes `dot` in the beginning of `assignOffset` so that `.` evaluation will be correct. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107208	2021-08-04 08:58:50 -07:00
Fangrui Song	44361e5b90	[ELF] Add --export-dynamic-symbol-list This is available in GNU ld 2.35 and can be seen as a shortcut for multiple --export-dynamic-symbol, or a --dynamic-list variant without the symbolic intention. In the long term, this option probably should be preferred over --dynamic-list. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107317	2021-08-03 09:01:03 -07:00
Martin Storsjö	b7fb5b54a9	[LLD] [MinGW] Support both "--opt value" and "--opt=value" for more options This does the same fix as D107237 but for a couple more options, converting all remaining cases of such options to accept both forms, for consistency. This fixes building e.g. openldap, which uses --image-base=<value>. Differential Revision: https://reviews.llvm.org/D107253	2021-08-03 10:55:44 +03:00
Mateusz Mikuła	05b025edf4	[LLD][MinGW] Accept joined format for --stack Postgresql uses `--stack=` in its Makefile. Downstream issue: https://github.com/msys2/MINGW-packages/pull/9167 Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D107237	2021-08-01 23:27:00 +03:00
Fangrui Song	52f35c9f14	[ELF][test] Improve .symver & --version-script tests And delete redundant tests.	2021-07-31 18:57:19 -07:00
Fangrui Song	b06426da76	[ELF] Add -Bsymbolic-non-weak-functions This option is a subset of -Bsymbolic-functions. It applies to STB_GLOBAL STT_FUNC definitions. The address of a vague linkage function (STB_WEAK STT_FUNC, e.g. an inline function, a template instantiation) seen by a -Bsymbolic-functions linked shared object may be different from the address seen from outside the shared object. Such cases are uncommon. (ELF/Mach-O programs may use `-fvisibility-inlines-hidden` to break such pointer equality. On Windows, correct dllexport and dllimport are needed to make pointer equality work. Windows link.exe enables /OPT:ICF by default so different inline functions may have the same address.) ``` // a.cc -> a.o -> a.so (-Bsymbolic-functions) inline void f() {} void g() { return (void )&f; } // b.cc -> b.o -> exe // The address is different! inline void f() {} ``` -Bsymbolic-non-weak-functions is a safer (C++ conforming) subset of -Bsymbolic-functions, which can make such programs work. Implementations usually emit a vague linkage definition in a COMDAT group. We could detect the group (with more code) but I feel that we should just check STB_WEAK for simplicity. A weak definition will thus serve as an escape hatch for rare cases when users want interposition on definitions. GNU ld feature request: https://sourceware.org/bugzilla/show_bug.cgi?id=27871 Longer write-up: https://maskray.me/blog/2021-05-16-elf-interposition-and-bsymbolic If Linux distributions migrate to protected non-vague-linkage external linkage functions by default, the linker option can still be handy because it allows rapid experiment without recompilation. Protected function addresses currently have deep issues in GNU ld. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D102570	2021-07-29 14:46:53 -07:00
Jez Ng	a26bb9cc05	[lld-macho][nfc] Simplify common-symbol-coalescing test	2021-07-29 11:07:50 -04:00
Jez Ng	e49374f9e0	[lld-macho] Support common symbols in bitcode (but differently from ld64) ld64 seems to handle common symbols in bitcode rather bizarrely. They follow entirely different precedence rules from their non-bitcode counterparts. I initially tried to emulate ld64 in D106597, but I'm not sure the extra complexity is worth it, especially given that common symbols are not, well, very common. This diff accords common bitcode symbols the same precedence as regular common symbols, just as we treat all other pairs of bitcode and non-bitcode symbol types. The tests document ld64's behavior in detail, just in case we want to revisit this. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D107027	2021-07-29 11:07:50 -04:00
Jessica Clarke	cfaa5bf4ce	[ELF] Align the first section of a PT_TLS even if its type is SHT_NOBITS This is somewhat of a repeat of D66658 but for sections in PT_TLS segments. Although such sections don't need to be aligned such that address and offset are congruent modulo the page size, they do need to be congruent modulo the segment alignment, otherwise the whole PT_TLS will be unaligned. We therefore use the normal calculation to determine the section's address within the PT_LOAD rather than bailing out early due to being SHT_NOBITS. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D106987	2021-07-29 15:14:00 +01:00
Jessica Clarke	b96bb7899f	[ELF] Add two new tests showing broken .tbss alignment if first in PT_TLS This is a similar problem to D66658, where we are too aggressive in not aligning NOBITS sections, and the tests are based on the ones added for that fix. If a .tbss section is first in a PT_TLS segment (i.e. there is no .tdata section) then, although it doesn't need to be aligned such that address and offset are congruent modulo the page size, they do need to be congruent modulo the segment alignment, otherwise the whole PT_TLS will be unaligned. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D106986	2021-07-29 15:13:52 +01:00
Jez Ng	dc9ee39251	[lld-macho] Downgrade "cannot export hidden symbol" to warning This matches ld64's behavior, and makes it easier to fit LLD into existing build systems. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D107011	2021-07-28 18:46:26 -04:00
Fangrui Song	660b753e28	[ELF][test] Convert --start-address= and --stop-address= values to hexadecimal so that readers can connect them with the hexadecimal addresses in the output.	2021-07-28 12:55:09 -07:00
Fangrui Song	f17e7df04a	[ELF][test] Delete unneeded --triple=thumb* from llvm-objdump RUN lines	2021-07-28 12:47:12 -07:00
Fangrui Song	b00c8ab1b9	Revert "[ELF] --gc-sections: allow GC on reserved sections in a group" clang may place dynamic initializations for explicitly specialized class template static data members in comdat. Such in-comdat SHT_INIT_ARRAY was an abuse but we have to work around it for a while.	2021-07-27 16:34:32 -07:00
Amilendra Kodithuwakku	b9cf1769de	[lld][ELF] remove empty SyntheticSections from inputSections Change removeUnusedSyntheticSections() to actually remove empty SyntheticSections in inputSections. In addition to doing what removeUnusedSyntheticSections() was meant to do, this will also make the shuffle-sections tests, which shuffles inputSections, less sensitive to empty Synthetic Sections that will not appear in the final image. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D106427 Change-Id: I589eaf596472161a4395fb658aea0fad73318088	2021-07-27 23:29:02 +01:00
Nico Weber	e26356a00e	[lld/mac] Fix application-extension.s failure after `8e8701abca` The test accidentally tested something else that makes lld fail with a different (correct-looking) error that wasn't the one the test tries to test for. (The test case before this change makes ld64 hang in an infinite loop.)	2021-07-27 14:39:43 -04:00
Nico Weber	8e8701abca	[lld/mac] When loading reexports, look for basename in -F / -L first Matches ld64 (cf Options::findIndirectDylib()), and fixes PR51218. Differential Revision: https://reviews.llvm.org/D106842	2021-07-27 14:28:52 -04:00
Derek Schuff	cf54424a46	[lld][WebAssembly] Do not remove name section with --strip-debug Leave the name section in the output when using the --strip-debug flag. This treats it more like ELF symbol tables, as the name section has similar uses at runtime (e.g. wasm engines understand it and it can be used for symbolization at runtime). Fixes https://github.com/emscripten-core/emscripten/issues/14623 Differential Revision: https://reviews.llvm.org/D106728	2021-07-26 11:06:52 -07:00
Fangrui Song	c0da287c30	[yaml2obj][MachO] Rename PayloadString to Content The new name is conciser and matches yaml2obj ELF & DWARF. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D106759	2021-07-26 09:04:51 -07:00
Fangrui Song	e7a7ad134f	[ELF] Support quoted symbols in symbol assignments glibc/elf/tst-absolute-zero-lib.lds uses `"absolute" = 0;`	2021-07-25 16:26:37 -07:00
Nico Weber	75e7d1320c	[lld/mac] Make comment style uniform in start-end.s test	2021-07-25 18:37:49 -04:00
Nico Weber	80caa1eb4a	[lld/mac] Add support for segment$start$ and segment$end$ symbols These symbols are somewhat interesting in that they create non-existing segments, which as far as I know is the only way to create segments that don't contain any sections. Final part of part of PR50760. Like D106629, but for segments instead of sections. I'm not aware of anything that needs this in practice. Differential Revision: https://reviews.llvm.org/D106767	2021-07-25 18:25:13 -04:00
Nico Weber	afdeb432f0	[lld/mac] Move output segment rename logic into OutputSegment Fixes the output segment name if both -rename_section and -rename_segment are used and the post-section-rename segment name is the same as the pre-segment-rename segment name to match ld64's behavior. The motivation is that segment$start$ can create section-less segments, and this makes a corner case in the interaction between segment$start and -rename_segment in the upcoming segment$start patch. Differential Revision: https://reviews.llvm.org/D106766	2021-07-25 18:20:09 -04:00
Nico Weber	6bf7d2d9c9	[lld/mac] Reland: Add tests for the interaction between -rename_section and -rename_segment No behavior change. Differential Revision: https://reviews.llvm.org/D106765	2021-07-25 18:16:33 -04:00
Nico Weber	14bb6e4d70	Revert "[lld/mac] Add tests for the interaction between -rename_section and -rename_segment" This reverts commit `a6eb34624d`. The test fails, I screwed something up.	2021-07-25 18:11:36 -04:00
Nico Weber	a6eb34624d	[lld/mac] Add tests for the interaction between -rename_section and -rename_segment No behavior change. Differential Revision: https://reviews.llvm.org/D106765	2021-07-25 18:03:25 -04:00
Ayke van Laethem	13ca0c87ed	[lld][WebAssembly] Align __heap_base __heap_base was not aligned. In practice, it will often be aligned simply because it follows the stack, but when the stack is placed at the beginning (with the --stack-first option), the __heap_base might be unaligned. It could even be byte-aligned. At least wasi-libc appears to expect that __heap_base is aligned: `659ff41456/dlmalloc/src/malloc.c (L5224)` While WebAssembly itself does not appear to require any alignment for memory accesses, it is sometimes required when sharing a pointer externally. For example, WASI might expect alignment up to 8: https://github.com/WebAssembly/WASI/blob/main/phases/snapshot/docs.md#-timestamp-u64 This issue got introduced with the addition of the --stack-first flag: https://reviews.llvm.org/D46141 I suspect the lack of alignment wasn't intentional here. Differential Revision: https://reviews.llvm.org/D106499	2021-07-24 14:03:26 +02:00
Nico Weber	92c085e7c4	[lld/mac] Fix comment typo in new start-end.s test	2021-07-23 18:14:38 -04:00
Nico Weber	04e8d0b62d	[lld/mac] Implement support for section$start and section$ end symbols With this, libclang_rt.profile_osx.a can be linked, that is coverage and PGO-instrumented builds should now work with lld. section$start and section$end symbols can create non-existing sections. They're also undefined symbols that are only magic if there isn't a regular symbol with their name, which means the need to be handled in treatUndefined() instead of just looping over all existing sections and adding start and end symbols like the ELF port does. To represent the actual symbols, this uses absolute symbols that get their value updated once an output section is layed out. segment$start and segment$end are still missing for now, but they produce a nicer error message after this patch. Main part of PR50760. Differential Revision: https://reviews.llvm.org/D106629	2021-07-23 16:01:09 -04:00
Jez Ng	d9a639901f	[lld-macho][nfc] Add test for resolution of bitcode symbols We lacked a test for bitcode symbol precedence. We assumed that they followed the same rules as their regular symbol counterparts, but never had a test to verify that we were matching ld64's behavior. It turns out that we were largely correct, though we deviate from ld64 when there are bitcode and non-bitcode symbols of the same name. The test added in this diff both verifies our behavior and documents the differences. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D106596	2021-07-23 11:49:00 -04:00
Jez Ng	cafed6f292	[lld-macho][nfc] Fix test to reflect that symbol attributes don't matter within an archive We had a comment that claimed that defined symbols had priority over common symbols if they occurred in the same archive. In fact, they appear to have equal precedence. Our implementation already does this, so I'm just updating the test comment. Also added a few other test comments along the way for readability. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D106595	2021-07-23 11:49:00 -04:00
Jez Ng	8eac5dcb36	[lld-macho] Reorganize + extend ICF test I found icf.s a bit hard to work with as it was not possible to extend any of the functions `_a` ... `_k` to test new relocation / referent types without modifying every single one of them. Additionally, their one-letter names were not descriptive (though the comments helped). I've renamed all the functions to reflect the feature they are testing, and shrunk them so that they contain just enough to test that one feature. I've also added tests for non-zero addends (via the `_abs1a_ref_with_addend` and `_defined_ref_with_addend_1` functions). Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D106211	2021-07-23 11:49:00 -04:00
Nico Weber	2c508cf583	[lld/mac] Don't crash on absolute symbols in order files Absolute symbols have a nullptr isec. buildInputSectionPriorities() would defer isec, causing crashes. Ordering absolute symbols doesn't make sense, so just ignore them. This seems to match ld64. Differential Revision: https://reviews.llvm.org/D106628	2021-07-23 11:33:23 -04:00
Nico Weber	687181caba	[lld/mac] Add missing REQUIRES line to new test	2021-07-23 10:40:22 -04:00
Leonard Grey	5acc6d4572	[lld-macho] Disambiguate bitcode files with the same name by archive name/offset in archive Ported from COFF/ELF; test is adapted from test/COFF/thinlto-archivecollision.ll LTO expects every bitcode file to have a unique name. If given multiple bitcode files with the same name, it errors with "Expected at most one ThinLTO module per bitcode file". This change incorporates the archive name, to disambiguate members with the same name in different archives and the offset in archive to disambiguate members with the same name in the same archive. Differential Revision: https://reviews.llvm.org/D106179	2021-07-22 22:50:25 -04:00
Fangrui Song	120b18767c	[ELF] --gc-sections: allow GC on reserved sections in a group This generalizes D70146 (SHT_NOTE) to more reserved sections and makes our rules more consistent. Now SHF_GROUP is more similar to SHF_LINK_ORDER. For SHT_INIT_ARRAY/SHT_FINI_ARRAY, the rule will be closer to PE/COFF link.exe. Previously sanitizers use llvm.global_ctors to make module_ctor a GC root, which is considered an abuse. https://groups.google.com/g/generic-abi/c/TpleUEkNoQI We can squeak through on compatibility issues because compilers otherwise don't use SHF_GROUP special sections.	2021-07-22 17:09:23 -07:00
Fangrui Song	54bc2d812e	[ELF][test] Add a test about GCable SHF_LINK_ORDER SHT_INIT_ARRAY	2021-07-22 17:04:54 -07:00
Nico Weber	2d6fb62ef2	[lld/mac] Handle symbols from -U in treatUndefinedSymbol() In ld64, `-U section$start$FOO$bar` handles `section$start$FOO$bar` as a regular `section$start` symbol, that is section$start processing happens before -U processing. Likely, nobody uses that in practice so it doesn't seem very important to be compatible with this, but it also moves the -U handling code next to the `-undefined dynamic_lookup` handling code, which is nice because they do the same thing. And, in fact, this did identify a bug in a corner case in the intersection of `-undefined dynamic_lookup` and dead-stripping (fix for that in D106565). Vaguely related to PR50760. No interesting behavior change. Differential Revision: https://reviews.llvm.org/D106566	2021-07-22 19:43:57 -04:00
Nico Weber	5ae39d4f97	[lld/mac] Fix bug in interaction of -dead_strip and -undefined dynamic_lookup We lost the `used` bit on the Undefined when we replaced it with a DylibSymbol in treatUndefined(). Differential Revision: https://reviews.llvm.org/D106565	2021-07-22 19:30:46 -04:00
Nick Fitzgerald	1d445a6e76	Reland: "[WebAssembly] Deduplicate imports of the same module name, field name, and type" When two symbols import the same thing, only one import should be emitted in the Wasm file. Fixes https://bugs.llvm.org/show_bug.cgi?id=50938 Reverted in: `16aac493e5`. Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D105519	2021-07-22 14:16:05 -07:00
Martin Storsjö	9dbc4b09af	[LLD] [COFF] Make -export-all-symbols work as intended for EXEs If some symbols are marked with dllexport, we still want to export all symbols if -export-all-symbols is specified. Previously, this only worked as it should for DLL output, not for EXE. This should fix downstream bug https://github.com/msys2/MINGW-packages/issues/9163. Differential Revision: https://reviews.llvm.org/D106245	2021-07-22 23:34:03 +03:00
Douglas Yung	4e52a04833	Change requires line from arm to aarch64 since the test uses arm64_32 which is AArch64.	2021-07-21 12:51:53 -07:00
Fangrui Song	c53a5eebb1	[ELF][test] Add -DAG The guid of a local linkage variable has the module path encoded, so the order between a local linkage variable and a non-local linkage variable isn't guaranteed.	2021-07-20 15:27:52 -07:00
Vincent Lee	33ab995617	Recommit "[lld-macho] Use DO_BIND_ADD_ADDR_IMM_SCALED for bind opcodes" Implement pass 3 of bind opcodes from ld64 (which supports both 32-bit and 64-bit). Pass 3 implementation condenses BIND_OPCODE_DO_BIND_ADD_ADDR_ULEB opcode to BIND_OPCODE_DO_BIND_ADD_ADDR_IMM_SCALED. This change is already behind an O2 flag so it shouldn't impact current performance. I verified ld64's output with x86_64 LLD and they were both emitting the same optimized bind opcodes (although in a slightly different order). Tested with arm64_32 LLD and compared that with x86 LLD that the order of the bind opcodes are the same (offset values are different which should be expected). Reviewed By: int3, #lld-macho, MaskRay Differential Revision: https://reviews.llvm.org/D106128	2021-07-20 13:45:24 -07:00
Fangrui Song	db5e078690	[LTO] Add SelectionKind to IRSymtab and use it in ld.lld/LLVMgold In PGO, a C++ external linkage function `foo` has a private counter `__profc_foo` and a private `__profd_foo` in a `comdat nodeduplicate`. A `__attribute__((weak))` function `foo` has a weak hidden counter `__profc_foo` and a private `__profd_foo` in a `comdat nodeduplicate`. In `ld.lld a.o b.o`, say a.o defines an external linkage `foo` and b.o defines a weak `foo`. Currently we treat `comdat nodeduplicate` as `comdat any`, ld.lld will incorrectly consider `b.o:__profc_foo` non-prevailing. In the worst case when `b.o:__profd_foo` is retained and `b.o:__profc_foo` isn't, there will be dangling reference causing an `undefined hidden symbol` error. Add SelectionKind to `Comdat` in IRSymtab and let linkers ignore nodeduplicate comdat. Differential Revision: https://reviews.llvm.org/D106228	2021-07-20 13:22:00 -07:00
Sam Clegg	d51f74acdf	[lld][WebAssembly] Error on import of TLS symbols in shared libraries In https://reviews.llvm.org/D102044 we made exporting a TLS symbol into an error, but we also want to error on import. See https://github.com/emscripten-core/emscripten/issues/14461 Differential Revision: https://reviews.llvm.org/D106385	2021-07-20 12:36:03 -07:00
Fangrui Song	88e2268a34	Revert D106128 "[lld-macho] Use DO_BIND_ADD_ADDR_IMM_SCALED for bind opcodes" This reverts commit `321b2bef09`. `for (BindIR *p = &opcodes[0]; p->opcode != BIND_OPCODE_DONE; ++p) {` has a heap-buffer-overflow with test/MachO/bind-opcodes.	2021-07-19 18:13:52 -07:00
Fangrui Song	16aac493e5	Revert D105519 "[WebAssembly] Deduplicate imports of the same module name, field name, and type" and its followup This reverts commit `4ae575b999` and `9b965b37c7`. There is an use-of-uninitialized-value bug in the `else` branch in ImportSection::addImport.	2021-07-19 17:09:01 -07:00
Vincent Lee	321b2bef09	[lld-macho] Use DO_BIND_ADD_ADDR_IMM_SCALED for bind opcodes Implement pass 3 of bind opcodes from ld64 (which supports both 32-bit and 64-bit). Pass 3 implementation condenses BIND_OPCODE_DO_BIND_ADD_ADDR_ULEB opcode to BIND_OPCODE_DO_BIND_ADD_ADDR_IMM_SCALED. This change is already behind an O2 flag so it shouldn't impact current performance. I verified ld64's output with x86_64 LLD and they were both emitting the same optimized bind opcodes (although in a slightly different order). Tested with arm64_32 LLD and compared that with x86 LLD that the order of the bind opcodes are the same (offset values are different which should be expected). Reviewed By: int3, #lld-macho Differential Revision: https://reviews.llvm.org/D106128	2021-07-19 16:18:33 -07:00
Derek Schuff	ad1f5457d2	[WebAssembly] Generate R_WASM_FUNCTION_OFFSET relocs in debuginfo sections Debug info sections need R_WASM_FUNCTION_OFFSET_I32 relocs (with FK_Data_4 fixup kinds) to refer to functions (instead of R_WASM_TABLE_INDEX as is used in data sections). Usually this is done in a convoluted way, with unnamed temp data symbols which target the start of the function, in which case WasmObjectWriter::recordRelocation converts it to use the section symbol instead. However in some cases the function can actually be undefined; in this case the dwarf generator uses the function symbol (a named undefined function symbol) instead. In that case the section-symbol transform doesn't work and we need to generate the correct reloc type a different way. In this change WebAssemblyWasmObjectWriter::getRelocType takes the fixup section type into account to choose the correct reloc type. Fixes PR50408 Differential Revision: https://reviews.llvm.org/D103557	2021-07-19 14:02:33 -07:00
Nick Fitzgerald	4ae575b999	[WebAssembly] Deduplicate imports of the same module name, field name, and type When two symbols import the same thing, only one import should be emitted in the Wasm file. Fixes https://bugs.llvm.org/show_bug.cgi?id=50938 Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D105519	2021-07-19 13:59:02 -07:00
Leonard Grey	6ef37b640d	[lld/mac] Add test for --lto-O This belongs to `fe08e9c487`, I (thakis) forgot to `git add` it back then. Differential Revision: https://reviews.llvm.org/D105223	2021-07-19 16:45:33 -04:00
Nico Weber	fbb45947b2	[lld/mac] Resolve defined symbols before undefined symbols Ports https://reviews.llvm.org/D95985 to the MachO port. Happens to fix PR51135; see that bug for details. Also makes lld's behavior match ld64 for the included test case. Differential Revision: https://reviews.llvm.org/D106293	2021-07-19 16:37:41 -04:00
Nico Weber	bcbb3066ce	[lld/mac] Change load command order to be more like ld64 No meaningful behavior change. Makes diffing `otool -l` output a bit easier. Differential Revision: https://reviews.llvm.org/D106219	2021-07-19 15:04:32 -04:00
Jez Ng	428a7c1b38	[lld-macho] Have ICF operate on all sections at once ICF previously operated only within a given OutputSection. We would merge all CFStrings first, then merge all regular code sections in a second phase. This worked fine since CFStrings would never reference regular `__text` sections. However, I would like to expand ICF to merge functions that reference unwind info. Unwind info references the LSDA section, which can in turn reference the `__text` section, so we cannot perform ICF in phases. In order to have ICF operate on InputSections spanning multiple OutputSections, we need a way to distinguish InputSections that are destined for different OutputSections, so that we don't fold across section boundaries. We achieve this by creating OutputSections early, and setting `InputSection::parent` to point to them. This is what LLD-ELF does. (This change should also make it easier to implement the `section$start$` symbols.) This diff also folds InputSections w/o checking their flags, which I think is the right behavior -- if they are destined for the same OutputSection, they will have the same flags in the output (even if their input flags differ). I.e. the `parent` pointer check subsumes the `flags` check. In practice this has nearly no effect (ICF did not become any more effective on chromium_framework). I've also updated ICF.cpp's block comment to better reflect its current status. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D105641	2021-07-17 13:42:51 -04:00
Fangrui Song	fa3231eb18	[COFF][test] Fix llvm-readobj tests	2021-07-16 13:28:46 -07:00
Fangrui Song	8f806d5f52	[test] Avoid llvm-readelf/llvm-readobj one-dash long options	2021-07-16 12:03:07 -07:00
Fangrui Song	3c9d86f951	[ELF][test] Avoid llvm-readelf/llvm-readobj one-dash long options	2021-07-16 10:02:47 -07:00
Vincent Lee	d695d0d6f6	[lld-macho] Optimize bind opcodes with multiple passes In D105866, we used an intermediate container to store a list of opcodes. Here, we use that data structure to help us perform optimization passes that would allow a more efficient encoding of bind opcodes. Currently, the functionality mirrors the optimization pass {1,2} done in ld64 for bind opcodes under optimization gate to prevent slight regressions. Reviewed By: int3, #lld-macho Differential Revision: https://reviews.llvm.org/D105867	2021-07-15 20:52:46 -07:00
Fangrui Song	f8cb78e99a	[ELF] Don't define __rela_iplt_start for -pie/-shared `clang -fuse-ld=lld -static-pie -fpie` produced executable currently crashes and this patch makes it work. See https://sourceware.org/bugzilla/show_bug.cgi?id=27164 and https://sourceware.org/pipermail/libc-alpha/2021-July/128810.html While it seems unreasonable to keep csu/libc-start.c ARCH_APPLY_IREL unclear in static-pie mode and have an unneeded diff -u =(ld.bfd --verbose) =(ld.bfd -pie --verbose) difference, glibc folks don't want to fix their code. I feel sad about that but this patch can remove an iffy condition for lld/ELF as well: `needsInterpSection()`.	2021-07-15 11:31:11 -07:00
Fangrui Song	80f9fd4ce3	[ELF][test] Rework non-preemptible ifunc tests	2021-07-15 11:31:05 -07:00
Fangrui Song	aa3df8ddcd	[test] Avoid llvm-readelf/llvm-readobj one-dash long options and deprecated aliases (e.g. --file-headers)	2021-07-15 10:26:21 -07:00
Wouter van Oortmerssen	4157b6033d	[WebAssembly] Fixed LLD generation of 64-bit __wasm_apply_data_relocs Differential Revision: https://reviews.llvm.org/D105863	2021-07-15 10:02:02 -07:00
Leonard Grey	c931ff72bd	[lld-macho] Add LTO cache support This adds support for the lld-only `--thinlto-cache-policy` option, as well as implementations for ld64's `-cache_path_lto`, `-prune_interval_lto`, `-prune_after_lto`, and `-max_relative_cache_size_lto`. Test is adapted from lld/test/ELF/lto/cache.ll Differential Revision: https://reviews.llvm.org/D105922	2021-07-15 12:56:13 -04:00
Fangrui Song	7299c6f635	[test] Avoid llvm-nm one-dash long options	2021-07-15 09:50:36 -07:00
Fangrui Song	7de2173c2a	[ELF] --fortran-common: prefer STB_WEAK to COMMON The ELF specification says "The link editor honors the common definition and ignores the weak ones." GNU ld and our Symbol::compare follow this, but the --fortran-common code (D86142) made a mistake on the precedence. Fixes https://bugs.llvm.org/show_bug.cgi?id=51082 Reviewed By: peter.smith, sfertile Differential Revision: https://reviews.llvm.org/D105945	2021-07-14 10:18:30 -07:00
Alexander Yermolovich	24129fbc9a	[LLD] Adding support for RELA for CG Profile. This is a follow up to https://reviews.llvm.org/D104080, and `ca3bdb57fa (diff-e64a48fabe31db213a631fdc5f2acb51bdddf3f16a8fb2928784f4c579229585)`. The implementation of call graph profile was changed from a black box section to relocation approach. This was done to be compatible with post processing tools like strip/objcopy, and llvm equivalent. When they are invoked on object file before the final linking step with this new approach the symbol indices correctness is preserved. The GNU binutils tools change the REL section to RELA section, unlike llvm tools. For example when strip -S is run on the ELF object files, as an intermediate step before linking. To preserve compatibility this patch extends implementation in LLD and ELFDumper to support both REL and RELA sections for call graph profile. Reviewed By: MaskRay, jhenderson Differential Revision: https://reviews.llvm.org/D105217	2021-07-13 13:56:30 -07:00
Hafiz Abid Qadeer	fb9c5c3dce	[lld][AMDGPU] Handle R_AMDGPU_REL16 relocation. This patch is a followup patch to https://reviews.llvm.org/D105760 which adds this relocation. This handles the relocation in lld. The s_branch family of instruction does the following: PC = PC + signext(simm * 4) + 4 so we we do the opposite on the target address before writing it in the instruction stream. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D105761	2021-07-13 20:41:11 +01:00
Wouter van Oortmerssen	b568c11b40	[WebAssembly] Fixed LLD generation of 64-bit __wasm_init_memory Differential Revision: https://reviews.llvm.org/D105849	2021-07-12 15:26:11 -07:00
Nico Weber	f21801dab2	[lld/mac] Implement -application_extension Differential Revision: https://reviews.llvm.org/D105818	2021-07-12 13:42:16 -04:00
Nico Weber	396f2e9d6d	[lld/mac] Make tbd files in one test valid No behavior change, but ld64 can't load .tbd files without the trailing `...`, so include them to make it easier to run tests with l64 too.	2021-07-12 11:13:54 -04:00
Jez Ng	0fb299072c	[lld-macho][nfc] Fix YAML input in compact-unwind-sym-relocs.s * Adjust strsize so llvm-objdump doesn't complain about it extending past the end of file * Remove symbol that was referencing a deleted section * Adjust n_sect of the remaining `_main` symbol to point at the right section	2021-07-11 21:36:24 -04:00
Nico Weber	c10947b5f8	[lld/mac] Unbreak objc.s after `6e05c1cd5f`	2021-07-11 13:57:15 -04:00
Nico Weber	6e05c1cd5f	[lld/mac] Always reference dyld_stub_binder when linked with libSystem lld currently only references dyld_stub_binder when it's needed. ld64 always references it when libSystem is linked. Match ld64. The (somewhat lame) motivation is that `nm` on a binary without any export writes a "no symbols" warning to stderr, and this change makes it so that every binary in practice has at least a reference to dyld_stub_binder, which suppresses that. Every "real" output file will reference dyld_stub_binder, so most of the time this shouldn't make much of a difference. And if you really don't want to have this reference for whatever reason, you can stop passing -lSystem, like you have to for ld64 anyways. (After linking any dylib, we dump the exported list of symbols to a txt file with `nm` and only relink downstream deps if that txt file changes. A nicer fix is to make lld optionally write .tbd files with the public interface of a linked dylib and use that instead, but for now the txt files are what we do.) Differential Revision: https://reviews.llvm.org/D105782	2021-07-11 13:37:48 -04:00
Nico Weber	10e28a7484	[lld/mac] Use normal Undefined machinery for dyld_stub_binder lookup This is for aesthetic reasons, I'm not aware of anything that needs this in practice. It does have a few effects: - `-undefined dynamic_lookup` now has an effect for dyld_stub_binder. This matches ld64. - `-U dyld_stub_binder` now works like you'd expect (it doesn't work in ld64). - The error message for a missing dyld_stub_binder symbol now looks like other undefined reference symbols, it changes from symbol dyld_stub_binder not found (normally in libSystem.dylib). Needed to perform lazy binding. to error: undefined symbol: dyld_stub_binder >>> referenced by lazy binding (normally in libSystem.dylib) Also add test coverage for that error message. But in practice, this should have no interesting effects since everything links in dyld_stub_binder via libSystem anyways. Differential Revision: https://reviews.llvm.org/D105781	2021-07-11 12:48:59 -04:00
Jez Ng	d5c0b9c848	[lld-macho][nfc] Expand the compact unwind symbol reloc test Add a bit more detail to the comments, and check that the final binary does indeed have a `__unwind_info` section (D105557 previosly regressed this). Also rename the test to emphasize that we are testing relocations compact unwind, not relocations in general.	2021-07-11 00:35:05 -04:00
Vy Nguyen	3822e3d5b0	[lld-macho] Fix bug in handling unwind info from ld -r Two changess: - Drop assertions that all symbols are in GOT - Set allEntriesAreOmitted correctly Related bug: 50812 Differential Revision: https://reviews.llvm.org/D105364	2021-07-09 22:44:51 -04:00
Wouter van Oortmerssen	9647a6f719	[WebAssembly] Added initial type checker to MC Assembler This to protect against non-sensical instruction sequences being assembled, which would either cause asserts/crashes further down, or a Wasm module being output that doesn't validate. Unlike a validator, this type checker is able to give type-errors as part of the parsing process, which makes the assembler much friendlier to be used by humans writing manual input. Because the MC system is single pass (instructions aren't even stored in MC format, they are directly output) the type checker has to be single pass as well, which means that from now on .globaltype and .functype decls must come before their use. An extra pass is added to Codegen to collect information for this purpose, since AsmPrinter is normally single pass / streaming as well, and would otherwise generate this information on the fly. A `-no-type-check` flag was added to llvm-mc (and any other tools that take asm input) that surpresses type errors, as a quick escape hatch for tests that were not intended to be type correct. This is a first version of the type checker that ignores control flow, i.e. it checks that types are correct along the linear path, but not the branch path. This will still catch most errors. Branch checking could be added in the future. Differential Revision: https://reviews.llvm.org/D104945	2021-07-09 14:07:25 -07:00
Alex Richardson	cc7cb9523e	[ELF][AArch64] Write addends for TLSDESC relocations with -z rel Since D100490 this case is diagnosed for -z rel. This commit implements R_AARCH64_TLSDESC cases for AArch64::getImplicitAddend() and AArch64::relocate(). However, there are probably further relocation types that need to be handled for full support of -z rel. Fixes https://bugs.llvm.org/show_bug.cgi?id=47009 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D100544	2021-07-09 10:41:41 +01:00
Alex Richardson	e564932842	[ELF] Write R_RISCV_IRELATIVE addends with -z rel I found this missing case with the new --check-dynamic-relocation flag while running the lld tests with --apply-dynamic-relocs enabled by default. This is the same as D101452 just for RISC-V Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101454	2021-07-09 10:41:40 +01:00
Alex Richardson	79332fb722	[ELF] Write R_X86_64_IRELATIVE addends with -z rel I found this missing case with the new --check-dynamic-relocation flag while running the lld tests with --apply-dynamic-relocs enabled by default. This also fixes a broken CHECK in lld/test/ELF/x86-64-gotpc-relax.s: The test wasn't using CHECK-NEXT, so it was passing despite the output actually containing relocations. I am not sure when this changed, but I think this behaviour is correct. Found with D101450 + enabling --apply-dynamic-relocs by default. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101452	2021-07-09 10:41:40 +01:00
Alex Richardson	f4b0c9abfb	[ELF] Implement X86_64::getImplicitAddend() This allows checking dynamic relocation addends for -z rel and --apply-dynamic-relocs output. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101451	2021-07-09 10:41:40 +01:00
Alex Richardson	35c5e564e6	[ELF] Check the Elf_Rel addends for dynamic relocations There used to be many cases where addends for Elf_Rel were not emitted in the final object file (mostly when building for MIPS64 since the input .o files use RELA but the output uses REL). These cases have been fixed since, but this patch adds a check to ensure that the written values are correct. It is based on a previous patch that I added to the CHERI fork of LLD since we were using MIPS64 as a baseline. The work has now almost entirely shifted to RISC-V and Arm Morello (which use Elf_Rela), but I thought it would be useful to upstream our local changes anyway. This patch adds a (hidden) command line flag --check-dynamic-relocations that can be used to enable these checks. It is also on by default in assertions builds for targets that handle all dynamic relocations kinds that LLD can emit in Target::getImplicitAddend(). Currently this is enabled for ARM, MIPS, and I386. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101450	2021-07-09 10:41:40 +01:00
Alex Richardson	6d87ca08ae	[ELF] Refactor DynamicReloc to fix incorrect relocation addends This patch changes the DynamicReloc class to store an enum instead of the overloaded useSymVA member to make it easier to understand and fix incorrect addends being written in some corner cases. The change is motivated by a follow-up review that checks the value of implicit Elf_Rel addends written to the output file. This patch fixes an incorrect output when using `-z rela` for i386 files with R_386_GOT32 relocations (not that this really matters since it's an unsupported configuration). Storing the relocation expression kind also addresses an incorrect addend FIXME in ppc64-abs64-dyn.s introduced in D63383. DynamicReloc now also has a special case for the MIPS TLS relocations (DynamicReloc::AgainstSymbolWithTargetVA) since the R_MIPS_TLS_TPREL{32/64} the symbol VA to the GOT for preemptible symbols. I'm not sure if the symbol value actually should be written for R_MIPS_TLS_TPREL32, but this patch does not attempt to change that behaviour. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D100490	2021-07-09 10:41:40 +01:00
Thomas Lively	0fd5e7b2d8	[WebAssembly][lld] Fix segfault on .bss sections in mapfile When memory is declared in the Wasm module, we rely on the implicit zero initialization behavior and do not explicitly output .bss sections. The means that they do not have associated `outputSec` entries, which was causing segfaults in the mapfile support. Fix the issue by guarding against null `outputSec` and falling back to using a zero offset. Differential Revision: https://reviews.llvm.org/D102951	2021-07-07 23:31:48 -07:00
Vy Nguyen	e25a384055	[lld-macho][nfc] Rename test file to be more descriptive (rather than referencing the bug number) Differential Revision: https://reviews.llvm.org/D105559	2021-07-07 13:15:55 -04:00
Nico Weber	8a7b5ebf4d	[lld/mac] Don't crash when dead-stripping removes all unwind info If the input has compact unwind info but all of it is removed after dead stripping, we would crash. Now we don't write any __unwind_info section at all, like ld64. This is a bit awkward to implement because we only know the final state of unwind info after UnwindInfoSectionImpl<Ptr>::finalize(), which is called after sections are added. So add a small amount of bookkeeping to relocateCompactUnwind() instead (which runs earlier) so that we can predict what finalize() will do before it runs. Fixes PR51010. Differential Revision: https://reviews.llvm.org/D105557	2021-07-07 13:05:40 -04:00
Nico Weber	76f734040a	[lld/mac] Give several LTO tests an "lto-" prefix Differential Revision: https://reviews.llvm.org/D105476	2021-07-06 15:23:42 -04:00
Nico Weber	3eb2fc4b50	[lld/mac] Partially implement -export_dynamic This implements the part of -export_dynamic that adds external symbols as dead strip roots even for executables. It does not yet implement the effect -export_dynamic has for LTO. I tried just replacing `config->outputType != MH_EXECUTE` with `(config->outputType != MH_EXECUTE \|\| config->exportDynamic)` in LTO.cpp, but then local symbols make it into the symbol table too, which is too much (and also doesn't match ld64). So punt on this for now until I understand it better. (D91583 may or may not be related too). Differential Revision: https://reviews.llvm.org/D105482	2021-07-06 11:22:18 -04:00
Nico Weber	64be5b7d87	[lld/mac] Implement -arch_multiple This is the other flag clang passes when calling clang with two -arch flags (which means with this, `clang -arch x86_64 -arch arm64 -fuse-ld=lld ...` now no longer prints any warnings \o/). Since clang calls the linker several times in that setup, it's not clear to the user from which invocation the errors are. The flag's help text is Specifies that the linker should augment error and warning messages with the architecture name. In ld64, the only effect of the flag is that undefined symbols are prefaced with Undefined symbols for architecture x86_64: instead of the usual "Undefined symbols:". So for now, let's add this only to undefined symbol errors too. That's probably the most common linker diagnostic. Another idea would be to prefix errors and warnings with "ld64.lld(x86_64):" instead of the usual "ld64.lld:", but I'm not sure if people would misunderstand that as a comment about the arch of ld itself. But open to suggestions on what effect this flag should have :) And we don't have to get it perfect now, we can iterate on it. Differential Revision: https://reviews.llvm.org/D105450	2021-07-06 00:25:18 -04:00
Nico Weber	2c25f39fcc	[lld/mac] Implement -final_output This is one of two flags clang passes to the linker when giving calling clang with multiple -arch flags. I think it'd make sense to also use finalOutput instead of outputFile in CodeSignatureSection() and when replacing @executable_path, but ld64 doesn't do that, so I'll at least put those in separate commits. Differential Revision: https://reviews.llvm.org/D105449	2021-07-05 20:06:26 -04:00
Nico Weber	db64306d99	[lld/mac] Implement -umbrella I think this is an old way for doing what is done with -reexport_library these days, but it's e.g. still used in libunwind's build (the opensource.apple.com one, not the llvm one). Differential Revision: https://reviews.llvm.org/D105448	2021-07-05 20:06:25 -04:00
Jez Ng	718c32175b	[lld-macho] Only emit one BIND_OPCODE_SET_SYMBOL per symbol Size-wise, BIND_OPCODE_SET_SYMBOL_TRAILING_FLAGS_IMM is the most expensive opcode, since it comes with an associated symbol string. We were previously emitting it once per binding, instead of once per symbol. This diff groups all bindings for a given symbol together and ensures we only emit one such opcode per symbol. This matches ld64's behavior. While this is a relatively small win on chromium_framework (-72KiB), for programs that have more dynamic bindings, the difference can be quite large. This change is perf-neutral when linking chromium_framework. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D105075	2021-07-05 20:00:19 -04:00
Jez Ng	4aaf878750	[lld-macho][nfc] Add REQUIRES: x86 to test I didn't realize that llvm-objdump's features were arch-specific. This should fix the non-x86 buildbots.	2021-07-05 03:40:54 -04:00
Jez Ng	bcaf57cae8	[lld-macho] Parse relocations quickly by assuming sorted order clang and gcc both seem to emit relocations in reverse order of address. That means we can match relocations to their containing subsections in `O(relocs + subsections)` rather than the `O(relocs * log(subsections))` that our previous binary search implementation required. Unfortunately, `ld -r` can still emit unsorted relocations, so we have a fallback code path for that (less common) case. Numbers for linking chromium_framework on my 3.2 GHz 16-Core Intel Xeon W: N Min Max Median Avg Stddev x 20 4.04 4.11 4.075 4.0775 0.018027756 + 20 3.95 4.02 3.98 3.985 0.020900768 Difference at 95.0% confidence -0.0925 +/- 0.0124919 -2.26855% +/- 0.306361% (Student's t, pooled s = 0.0195172) Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D105410	2021-07-05 01:13:44 -04:00
Nico Weber	9e24979d73	[lld/mac] Fix function offset on 1st-level unwind table sentinel Two bugs: 1. This tries to take the address of the last symbol plus the length of the last symbol. However, the sorted vector is cuPtrVector, not cuVector. Also, cuPtrVector has tombstone values removed and cuVector doesn't. If there was a stripped value at the end, the "last" element's value was UINT64_MAX, which meant the sentinel value was one less than the length of that "last" dead symbol. 2. We have to subtract in.header->addr. For 64-bit binaries that's (1 << 32) and functionAddress is 32-bit so this is a no-op, but for 32-bit binaries the sentinel's value was too large. I believe this has no effect in practice since the first-level binary search code in libunwind (in UnwindCursor.hpp) does: uint32_t low = 0; uint32_t high = sectionHeader.indexCount(); uint32_t last = high - 1; while (low < high) { uint32_t mid = (low + high) / 2; if ((mid == last) \|\| (topIndex.functionOffset(mid + 1) > targetFunctionOffset)) { low = mid; break; } else { low = mid + 1; } So the address of the last entry in the first-level table isn't really checked -- except for the very end, but the check against `last` means we just run the loop once more than necessary. But it makes `unwinddump` output look less confusing, and it's what it looks was the intention here. (No test since I can't think of a way to make FileCheck check that one number is larger than another.) Differential Revision: https://reviews.llvm.org/D105404	2021-07-04 18:06:20 -04:00
Nico Weber	d2d6da3011	[lld/mac] Don't crash on 32-bit output binaries when dead-stripping Fixes PR50974. Differential Revision: https://reviews.llvm.org/D105399	2021-07-04 18:03:31 -04:00
David Blaikie	bf7f846b68	Fix test so it doesn't try to write to the test directory, only to %t	2021-07-02 14:59:50 -07:00
Vy Nguyen	c7c5a1c9ae	[lld-macho] Ignore debug symbols while preparing relocations. Details: see https://bugs.llvm.org/show_bug.cgi?id=50812 Differential Revision: https://reviews.llvm.org/D105210	2021-07-02 13:51:46 -04:00
Martin Storsjö	ce211c505b	[LLD] [COFF] Fix up missing stdcall decorations in MinGW mode If linking directly against a DLL without an import library, the DLL export symbols might not contain stdcall decorations. If we have an undefined symbol with decoration, and we happen to have a matching undecorated symbol (which either is lazy and can be loaded, or already defined), then alias it against that instead. This matches what's done in reverse, when we have a def file declaring to export a symbol without decoration, but we only have a defined decorated symbol. In that case we do a fuzzy match (SymbolTable::findMangle). This case is more straightforward; if we have a decorated undefined symbol, just strip the decoration and look for the corresponding undecorated symbol name. Add warnings and options for either silencing the warning or disabling the whole feature, corresponding to how ld.bfd does it. (This feature works for any symbol decoration mismatch, not only when linking against a DLL directly; ld.bfd also tolerates it anywhere, and also fixes up mismatches in the other direction, like SymbolTable::findMangle, for any symbol, not only exports. But in practice, at least for lld, it would primarily end up used for linking against DLLs.) Differential Revision: https://reviews.llvm.org/D104532	2021-07-02 09:49:14 +03:00
Martin Storsjö	c09e5e50b1	[LLD] [MinGW] Allow linking to DLLs directly As the COFF linker is capable of linking directly against a DLL now (after D104530, as long as it is running in mingw mode), don't error out here but successfully load libraries specified with "-l" from DLLs if that's what ld.bfd would have matched. Differential Revision: https://reviews.llvm.org/D104531	2021-07-02 09:49:13 +03:00
Martin Storsjö	a9ff1ce1b9	[LLD] [COFF] Support linking directly against DLLs in MinGW mode GNU ld.bfd supports linking directly against DLLs without using an import library, and some projects have picked up on this habit. (There's no one single unsurmountable issue with using import libraries, but this is a regularly surfacing missing feature.) As long as one is linking by name (instead of by ordinal), the DLL export table contains most of the information needed. (One can inspect what section a symbol points at, to see if it's a function or data symbol. The practical implementation of this loops over all sections for each symbol, but as long as they're not very many, that should hopefully be tolerable performance wise.) One exception where the information in the DLL isn't entirely enough is on i386 with stdcall functions; depending on how they're done, the exported function name can be a plain undecorated name, while the import library would contain the full decorated symbol name. This issue is addressed separately in a different patch. This is implemented mimicing the structure of a regular import library, with one InputFile corresponding to the static archive that just adds lazy symbols, which then are fetched when they are needed. When such a symbol is fetched, we synthesize a coff_import_header structure in memory and create a regular ImportFile out of it. The implementation could be even smaller by just creating ImportFiles for every symbol available immediately, but that would have the drawback of actually ending up importing all symbols unless running with GC enabled (and mingw mode defaults to having it disabled for historical reasons). Differential Revision: https://reviews.llvm.org/D104530	2021-07-02 09:49:13 +03:00
Jez Ng	ac2dd06b91	[lld-macho] Deduplicate CFStrings `__cfstring` is a special literal section, so instead of breaking it up at symbol boundaries, we break it up at fixed-width boundaries (since each literal is the same size). Symbols can only occur at one of those boundaries, so this is strictly more powerful than `.subsections_via_symbols`. With that in place, we then run the section through ICF. This change is about perf-neutral when linking chromium_framework. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D105045	2021-07-01 21:22:38 -04:00
Jez Ng	b41b4148e7	[lld-macho] Only enable `__DATA_CONST` for newer platforms Matches ld64. Reviewed By: #lld-macho, alexander-shaposhnikov Differential Revision: https://reviews.llvm.org/D105080	2021-06-30 18:55:48 -04:00
Jez Ng	0d6d35e63b	[lld-macho] -section_rename should work on synthetic sections too Previously, we only applied the renames to ConcatOutputSections. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D105079	2021-06-30 18:55:48 -04:00
Fangrui Song	03051f7ac8	[ELF] Preserve section order within an INSERT AFTER command For ``` SECTIONS { text.0 : {} text.1 : {} text.2 : {} } INSERT AFTER .data; ``` the current order is `.data text.2 text.1 text.0`. It makes more sense to preserve the specified order and thus improve compatibility with GNU ld. For ``` SECTIONS { text.0 : {} } INSERT AFTER .data; SECTIONS { text.3 : {} } INSERT AFTER .data; ``` GNU ld somehow collects sections with `INSERT AFTER .data` together (IMO inconsistent) but I think it makes more sense to execute the commands in order and get `.data text.3 text.0` instead. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D105158	2021-06-30 11:35:50 -07:00
Fangrui Song	7b06bfc49e	[ELF] -pie: produce dynamic relocations for absolute relocations referencing undef weak See the comment for my understanding of -no-pie and -shared expectation. -no-pie has freedom on choices. We choose dynamic relocations to be consistent with the handling of GOT-generating relocations. Note: GNU ld has arch-varying behaviors and its x86 -pie has a very complex rule: if there is at least one GOT-generating or PLT-generating relocation and -z dynamic-undefined-weak (enabled by default) is in effect, generate a dynamic relocation. We don't emulate its rule. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D105164	2021-06-30 09:43:28 -07:00
Peter Smith	fc1cb3104b	[LLD][ELF][ARM] Tidy up test to hook up missing filecheck patterns [NFC] A couple of filecheck patterns had not been hooked up with the patterns suffering from some drift. As this test is old and llvm-objdump has improved a lot, take this opportunity to hide the instruction encoding. I've also taken out a lot of the explanatory comments that llvm-objdump improvements make redundant, as these comments oftern don't get updated when addresses change. Differential Revision: https://reviews.llvm.org/D104907	2021-06-30 14:16:40 +01:00
Peter Smith	dd4d3f7406	[LLD][ELF][ARM] Fix case of patched unrelocated BLX There are a couple of problems with the code to patch unrelocated BLX instructions: 1. The calculation of the PC needs to take into account the alignment of the instruction. The Thumb BLX uses alignDown(PC, 4) for the source address. 2. The calculation of the PC bias is hard-coded to 4 which works for Thumb, but when there is a BLX the branch will be in Arm state so it needs an 8 byte PC bias. No asssembler generates an unrelocated BLX instruction so these problems do not affect real world programs. However we should still fix them. Differential Revision: https://reviews.llvm.org/D104905	2021-06-30 14:07:35 +01:00
Igor Kudrin	657e067bb5	[ARMInstPrinter] Print the target address of a branch instruction This follows other patches that changed printing immediate values of branch instructions to target addresses, see D76580 (x86), D76591 (PPC), D77853 (AArch64). As observing immediate values might sometimes be useful, they are printed as comments for branch instructions. // llvm-objdump -d output (before) 000200b4 <_start>: 200b4: ff ff ff fa blx #-4 <thumb> 000200b8 <thumb>: 200b8: ff f7 fc ef blx #-8 <_start> // llvm-objdump -d output (after) 000200b4 <_start>: 200b4: ff ff ff fa blx 0x200b8 <thumb> @ imm = #-4 000200b8 <thumb>: 200b8: ff f7 fc ef blx 0x200b4 <_start> @ imm = #-8 // GNU objdump -d. 000200b4 <_start>: 200b4: faffffff blx 200b8 <thumb> 000200b8 <thumb>: 200b8: f7ff effc blx 200b4 <_start> Differential Revision: https://reviews.llvm.org/D104701	2021-06-30 16:35:28 +07:00
Nico Weber	aed0a08c69	[lld/mac] Make symbol table order deterministic SymtabSection::emitStabs() writes the symbol table in the order of externalSymbols, which has the order of symtab->getSymbols(), which is just the order symbols are added to the symbol table. In practice, symbols in the symbol files of input .o files are sorted, but since that's not guaranteed we sort them in ObjFile::parseSymbols(). To make sure several symbols with the same address keep the order they're in the input file, we have to use stable_sort(). In practice, std::sort() on already-sorted inputs won't change the order of just adjacent elements, and while in theory std::sort() could use a random pivot, in practice the code should be deterministic as it was previously too. But now lld/test/MachO/stabs.s passes with LLVM_ENABLE_EXPENSIVE_CHECKS=ON (the last test that was failing with that set). Fixes a regression from D99972. While here, remove an empty section in stabs.s and move .subsections_via_symbols to the end where it usually is (this part no behavior change). Differential Revision: https://reviews.llvm.org/D105071	2021-06-29 09:29:49 -04:00
Leonard Grey	a8a6e5b094	[lld-macho] Preserve alignment for non-deduplicated cstrings Fixes PR50637. Downstream bug: https://crbug.com/1218958 Currently, we split __cstring along symbol boundaries with .subsections_via_symbols when not deduplicating, and along null bytes when deduplicating. This change splits along null bytes unconditionally, and preserves original alignment in the non- deduplicated case. Removing subsections-section-relocs.s because with this change, __cstring is never reordered based on the order file. Differential Revision: https://reviews.llvm.org/D104919	2021-06-28 22:26:43 -04:00
Nico Weber	f1969b74a7	[lld/mac] Fix nondeterminism in output section ordering The two different thread_local_regular sections (__thread_data and more_thread_data) had nondeterminstic ordering for two reasons: 1. https://reviews.llvm.org/D102972 changed concatOutputSections from MapVector to DenseMap, so when we iterate it to make output segments, we would add the two sections to the __DATA output segment in nondeterministic order. 2. The same change also moved the two stable_sort()s for segments and sections to sort(). Since sections with assigned priority (such as TLV data) have the same priority for all sections, this is incorrect -- we must use stable_sort() so that the initial (input-order-based) order remains. As a side effect, we now (deterministically) put the __common section in front of __bss (while previously we happened to put it after it). (__common and __bss are both zerofill so both have order INT_MAX, but common symbols are added to inputSections before normal sections are collected.) Makes lld/test/MachO/tlv.s and lld/test/MachO/tlv-dylib.s pass with LLVM_ENABLE_EXPENSIVE_CHECKS=ON. Differential Revision: https://reviews.llvm.org/D105054	2021-06-28 18:41:33 -04:00
Jez Ng	74d5f30d83	[lld-macho][nfc] Add absolute-vs-non-absolute symbol test for ICF Make sure we don't wrongly fold two sections that refer to symbols with the same value if they are not both absolute / non-absolute. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D104876	2021-06-28 14:49:40 -04:00
Jez Ng	557e1fa02f	[lld-macho] Extend ICF to literal sections Literal sections can be deduplicated before running ICF. That makes it easy to compare them during ICF: we can tell if two literals are constant-equal by comparing their offsets in their OutputSection. LLD-ELF takes a similar approach. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D104671	2021-06-28 14:49:39 -04:00
David Spickett	6942076096	[lld][MachO] Temporarily require 64 bit build for dead-strip.s This test has always failed on 32 bit armv8 bots: https://lab.llvm.org/buildbot/#/builders/178/builds/42 Due to the output order of some symbols changing. I don't think this is an Arm specific issue so disabling on 32 bit while it's investigated.	2021-06-28 09:37:45 +00:00
Igor Kudrin	d25e572421	[llvm-objdump] Print memory operand addresses as regular comments The patch reuses the common code to print memory operand addresses as instruction comments. This helps to align the comments and enables using target-specific comment markers when `evaluateMemoryOperandAddress()` is implemented for them. Differential Revision: https://reviews.llvm.org/D104861	2021-06-28 14:25:22 +07:00
Igor Kudrin	e7fffa6f03	[llvm-objdump] Prefix memory operand addresses with '0x' This helps to avoid ambiguity when the address contains only digits 0..9. Differential Revision: https://reviews.llvm.org/D104909	2021-06-28 14:25:21 +07:00
Nico Weber	0f24ffcdfa	[lld/mac] Don't fold UNWIND_X86_64_MODE_STACK_IND unwind entries libunwind uses unwind info to find the function address belonging to the current instruction pointer. libunwind/src/CompactUnwinder.hpp's step functions read functionStart for UNWIND_X86_64_MODE_STACK_IND (and for nothing else), so these encodings need a dedicated entry per function, so that the runtime can get the stacksize off the `subq` instrunction in the function's prologue. This matches ld64. (CompactUnwinder.hpp from https://opensource.apple.com/source/libunwind/ also reads functionStart in a few more cases if `SUPPORT_OLD_BINARIES` is set, but it defaults to 0, and ld64 seems to not worry about these additional cases.) Related upstream bug: https://crbug.com/1220175 Differential Revision: https://reviews.llvm.org/D104978	2021-06-27 06:49:32 -04:00
Jan Kratochvil	a7afaf9019	Fix lld testsuite after llvm-dwarfdump now errors on invalid DWARF D104271 broke buildbots for lld/test/ELF/non-abs-reloc.s .	2021-06-27 12:26:11 +02:00
Fangrui Song	2508733e1b	[ELF] --sysroot: change sysrooted script to not fall back for an absolute path Modify the D13209 logic: for a script inside the sysroot, if an absolute path does not exist, report an error instead of falling back to the path without the sysroot prefix. This matches GNU ld, which makes sense to me: we don't want to find an arbitrary file in the host. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D104894	2021-06-25 12:52:39 -07:00
Martin Storsjö	d07f43641f	[LLD] [COFF] Fix handling of LTO comdats with nontrivial selection types after `728cc0075e` Commit `728cc0075e` made comdat symbols from LTO objects be treated as any regular comdat symbol. This works great for symbols that actually are IMAGE_COMDAT_SELECT_ANY, but if the symbols have a less trivial selection type that require comparing either the section chunk size or contents, we can't check that before actually doing the LTO compilation. Therefore bring back one aspect of handling from before; that comdat resolution with a leader from an LTO symbol is essentially skipped, like it was before `728cc0075e`. Differential Revision: https://reviews.llvm.org/D104605	2021-06-25 09:39:56 +03:00
Greg McGary	8a8558ae27	[lld-macho] add tests for ICF, plus cleanups Add tests for pending TODOs, plus some global cleanups: * No fold: func has personality/LSDA * Fold: reference to absolute symbol with different name but identical value * No fold: reloc references to absolute symbols with different values * No fold: N_ALT_ENTRY symbols Differential Revision: https://reviews.llvm.org/D104721	2021-06-23 20:44:25 -07:00
Nico Weber	dbbc8d8333	[lld/mac] Don't crash on absolute symbols in unwind info generation Fixes a regression from `d6565a2dbc` and PR50820.	2021-06-23 14:25:34 -04:00
Martin Storsjö	f1a18fb699	[LLD] [MinGW] Silence the printouts in one test. NFC. This particular linker invocation is only run to check that we accept options, but we don't inspect the generated command line. As all other commands in the file have their output piped to FileCheck, the lit test doesn't print any other output; therefore silence this one for consistency as well.	2021-06-23 10:44:01 +03:00
Martin Storsjö	fdf54f5c50	[LLD] [MinGW] Print the lld-link command to stderr This is consistent with how clang prints its internal commands with -### and -v. When linking with -verbose, we get log messages from the actual linking written to stderr. By printing the command to the same stream, we make sure they appear in a sensible chronological order. Differential Revision: https://reviews.llvm.org/D104527	2021-06-23 10:21:42 +03:00
Reid Kleckner	5bcbc7ee52	Add regression test for maybeMangle issue This was crbug.com/1222724, which caused D104529 to be reverted. The new test fails when D104529 is reapplied locally.	2021-06-22 12:55:25 -07:00
Nico Weber	d6565a2dbc	[lld/mac] Add explicit "no unwind info" entries for functions without unwind info Fixes PR50529. With this, lld-linked Chromium base_unittests passes on arm macs. Surprisingly, no measurable impact on link time. Differential Revision: https://reviews.llvm.org/D104681	2021-06-22 06:12:42 -04:00
Nico Weber	e6cb55d5ce	[lld/mac] Test zerofill sections after __thread_bss Real zerofill sections go after __thread_bss, since zerofill sections must all be at the end of their segment and __thread_bss must be right after __thread_data. Works fine already, but wasn't tested as far as I can tell. Also tweak comment about zerofill sections a bit. No behavior change. Differential Revision: https://reviews.llvm.org/D104609	2021-06-20 20:44:29 -04:00
Fangrui Song	89e66a3ab3	[ELF] Delete --no-cref which does not exist in GNU ld Also delete the single dash form which does not appear to be used.	2021-06-20 14:28:56 -07:00
Fangrui Song	cd6b1b2b86	[ELF][test] Add missing tests for --no-export-dynamic & --no-warn-backrefs	2021-06-20 14:20:14 -07:00
Martin Storsjö	1c8bb625b7	[LLD] [MinGW] Print errors/warnings in lld-link with a "ld.lld" prefix Pass the original argv[0] to the coff linker, as the coff linker uses the basename of argv[0] as the log prefix. This makes error messages to be printed with a "ld.lld:" prefix instead of "lld-link:". The current "lld-link:" prefix can be confusing to users, as they're invoking the MinGW linker (and might not even have a lld-link executable). Keep the first argument as lld-link when printing the command line, to make it an actually reproducible standalone command. Differential Revision: https://reviews.llvm.org/D104526	2021-06-19 22:32:37 +03:00
Nico Weber	c931e12b1d	[lld/mac] Make sure __thread_ptrs is in front of __thread_bss The exact location doesn't matter, but it should be in front of __thread_bss. We put it right in front of __thread_data which is where ld64 seems to put it as well. Fixes PR50769. (As mentioned on the bug, there is probably a more structural fix too, see comment 5. If we don't address this, it's likely we'll run into this again with other synthetic sections. But for now, let's fix the immediate breakage.) Differential Revision: https://reviews.llvm.org/D104596	2021-06-19 12:56:43 -04:00
Nico Weber	17271ece0d	[lld/mac] Give __DATA,__thread_ptrs type S_THREAD_LOCAL_VARIABLE_POINTERS ...instead of S_NON_LAZY_SYMBOL_POINTERS. This matches ld64. Part of PR50769. While here, also remove an old TODO that was done in D87178. Differential Revision: https://reviews.llvm.org/D104594	2021-06-19 12:56:42 -04:00
Jez Ng	4507f64165	[re-land][lld-macho] Avoid force-loading the same archive twice This reverts commit `c9b241efd6`, which was a backout diff to fix the buildbots. The real culprit of the crash is `1d31fb8d12`, which is being reverted. Differential Revision: https://reviews.llvm.org/D104353	2021-06-18 22:43:50 -04:00
Nico Weber	c9b241efd6	Revert "[lld-macho] Avoid force-loading the same archive twice" This reverts commit `24706cd73c`. Test seems to fail flakily. See comments on https://reviews.llvm.org/D104353 for a hypothesis for why.	2021-06-18 20:25:27 -04:00
Jez Ng	4c49f9ceaf	[lld-macho] Handle non-extern symbols marked as private extern Previously, we asserted that such a case was invalid, but in fact `ld -r` can emit such symbols if the input contained a (true) private extern, or if it contained a symbol started with "L". Non-extern symbols marked as private extern are essentially equivalent to regular TU-scoped symbols, so no new functionality is needed. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104502	2021-06-18 16:36:14 -04:00
Nico Weber	f7366890c2	[lld/mac] Support -data_in_code_info, -function_starts flags These are on by default, but there's also an explicit flag for them. Differential Revision: https://reviews.llvm.org/D104543	2021-06-18 13:01:42 -04:00
Greg McGary	8120c9e379	Rename option -icf MODE to --icf=MODE The `icf` command-line option is not present in ld64, so it should use the LLD option syntax, which begins with double dashes and separates primary option from any suboption with the equal sign. Differential Revision: https://reviews.llvm.org/D104548	2021-06-18 09:52:15 -07:00
Heejin Ahn	1d891d44f3	[WebAssembly] Rename event to tag We recently decided to change 'event' to 'tag', and 'event section' to 'tag section', out of the rationale that the section contains a generalized tag that references a type, which may be used for something other than exceptions, and the name 'event' can be confusing in the web context. See - https://github.com/WebAssembly/exception-handling/issues/159#issuecomment-857910130 - https://github.com/WebAssembly/exception-handling/pull/161 Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D104423	2021-06-17 20:34:19 -07:00
Sam Clegg	d01e673a9f	[lld][WebAssembly] Fix crash calling weakly undefined function in PIC code Differential Revision: https://reviews.llvm.org/D104495	2021-06-17 16:49:02 -07:00
Sam Clegg	758633f922	[lld][WebAssembly] Add new `--import-undefined` option This change revisits https://reviews.llvm.org/D79248 which originally added support for the --unresolved-symbols flag. At the time I thought it would make sense to add a third option to this flag called `import-functions` but it turns out (as was suspects by on the reviewers IIRC) that this option can be authoganal. Instead I've added a new option called `--import-undefined` that only operates on symbols that can be imported (for example, function symbols can always be imported as opposed to data symbols we can only be imported when compiling with PIC). This option gives us the full expresivitiy that emscripten needs to be able allow reporting of undefined data symbols as well as the option to disable that. This change does remove the `--unresolved-symbols=import-functions` option, which is been in the codebase now for about a year but I would be extremely surprised if anyone was using it. Differential Revision: https://reviews.llvm.org/D103290	2021-06-17 11:44:21 -07:00
Vy Nguyen	366df11a35	[lld-macho] Rework mergeFlag to behave closer to what ld64 does. Details: I've been getting a few weird errors similar to the following from our internal tests: ``` ld64.lld.darwinnew: error: Cannot merge section __eh_frame (type=0x0) into __eh_frame (type=0xB): inconsistent types ld64.lld.darwinnew: error: Cannot merge section __eh_frame (flags=0x0) into __eh_frame (flags=0x6800000B): strict flags differ ld64.lld.darwinnew: error: Cannot merge section __eh_frame (type=0x0) into __eh_frame (type=0xB): inconsistent types ld64.lld.darwinnew: error: Cannot merge section __eh_frame (flags=0x0) into __eh_frame (flags=0x6800000B): strict flags differ ``` Differential Revision: https://reviews.llvm.org/D103971	2021-06-17 14:22:58 -04:00
Greg McGary	f27e4548fc	[lld-macho] Implement ICF ICF = Identical C(ode\|OMDAT) Folding This is the LLD ELF/COFF algorithm, adapted for MachO. So far, only `-icf all` is supported. In order to support `-icf safe`, we will need to port address-significance tables (`.addrsig` directives) to MachO, which will come in later diffs. `check-{llvm,clang,lld}` have 0 regressions for `lld -icf all` vs. baseline ld64. We only run ICF on `__TEXT,__text` for reasons explained in the block comment in `ConcatOutputSection.cpp`. Here is the perf impact for linking `chromium_framekwork` on a Mac Pro (16-core Xeon W) for the non-ICF case vs. pre-ICF: ``` N Min Max Median Avg Stddev x 20 4.27 4.44 4.34 4.349 0.043029977 + 20 4.37 4.46 4.405 4.4115 0.025188761 Difference at 95.0% confidence 0.0625 +/- 0.0225658 1.43711% +/- 0.518873% (Student's t, pooled s = 0.0352566) ``` Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D103292	2021-06-17 10:07:44 -07:00
Jez Ng	24706cd73c	[lld-macho] Avoid force-loading the same archive twice We need to dedup archive loads (similar to what we do for dylib loads). I noticed this issue after building some Swift stuff that used `-force_load_swift_libs`, as it caused some Swift archives to be loaded many times. Reviewed By: #lld-macho, thakis, MaskRay Differential Revision: https://reviews.llvm.org/D104353	2021-06-17 11:13:54 -04:00
Igor Kudrin	5355b8c631	[ELF] Restore arm-branch.s test After D77330, the comments are inconsistent with the disassembled code. As the value of `far` has been changed, a thunk to reach it is now generated, and target addresses of branch instructions are different from what was initially expected. The patch fixes that and makes the test closer to what it was originally. Differential Revision: https://reviews.llvm.org/D104286	2021-06-17 17:08:13 +07:00
Jez Ng	560636e549	[lld-macho] Put DATA_IN_CODE immediately after FUNCTION_STARTS codesign checks for this. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104354	2021-06-16 15:23:07 -04:00
Jez Ng	eeac6b2bec	[lld-macho] Handle multiple LC_LINKER_OPTIONs We previously only parsed the first one. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104352	2021-06-16 15:23:06 -04:00
Jez Ng	d52d1b93c3	[lld-macho] Downgrade version mismatch to warning It's a warning in ld64. While having LLD be stricter would be nice, it makes it harder for it to be a drop-in replacement into existing builds. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104333	2021-06-16 11:06:26 -04:00
Nico Weber	b579938d40	[lld/mac] Add support for -no_data_in_code_info flag Differential Revision: https://reviews.llvm.org/D104345	2021-06-16 06:40:42 -04:00
Konstantin Schwarz	5d621ed85d	[ELF] Consider that NOLOAD sections should be placed in a PT_LOAD segment During PHDR creation, the case where an output section does not require a PT_LOAD header but still occupies memory in the current VMA region was not handled. If such an output section interleaves two output sections that have the same VMA and LMA regions set, we would previously re-use the existing PT_LOAD header for the second output section. However, since the memory region is not contiguous, we need to start a new PT_LOAD segment. This fixes https://bugs.llvm.org/show_bug.cgi?id=50558 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D103815	2021-06-16 12:36:45 +02:00
Alexander Shaposhnikov	928394d109	[lld][MachO] Add support for LC_DATA_IN_CODE Add first bits for emitting LC_DATA_IN_CODE. Test plan: make check-lld-macho Differential revision: https://reviews.llvm.org/D103006	2021-06-14 19:21:59 -07:00
Fangrui Song	899fdf548e	[ELF] Add OVERWRITE_SECTIONS command This implements https://sourceware.org/bugzilla/show_bug.cgi?id=26404 An `OVERWRITE_SECTIONS` command is a `SECTIONS` variant which contains several output section descriptions. The output sections do not have specify an order. Similar to `INSERT [BEFORE\|AFTER]`, `LinkerScript::hasSectionsCommand` is not set, so the built-in rules (see `docs/ELF/linker_script.rst`) still apply. `OVERWRITE_SECTIONS` can be more convenient than `INSERT` because it does not need an anchor section. The initial syntax is intentionally narrow to facilitate backward compatible extensions in the future. Symbol assignments cannot be used. This feature is versatile. To list a few usage: * Use `section : { KEEP(...) }` to retain input sections under GC * Define encapsulation symbols (start/end) for an output section * Use `section : ALIGN(...) : { ... }` to overalign an output section (similar to ld64 `-sectalign`) When an output section is specified by both `OVERWRITE_SECTIONS` and `INSERT`, `INSERT` is processed after overwrite sections. To make this work, this patch changes `InsertCommand` to use name based matching instead of pointer based matching. (This may cause a difference when `INSERT` moves one output section more than once. Such duplicate commands should not be used in practice (seems that in GNU ld the output sections may just disappear).) A linker script can be used without -T/--script. The traditional `SECTIONS` commands are concatenated, so a wrong rule can be more noticeable from the section order. This feature if misused can be less noticeable, just like `INSERT`. Differential Revision: https://reviews.llvm.org/D103303	2021-06-13 12:41:11 -07:00
Alexander Shaposhnikov	b9095f5e1a	[lld][MachO] Fix function starts section Sort the addresses stored in FunctionStarts section. Previously we were encoding potentially large numbers (due to unsigned overflow). Test plan: make check-all Differential revision: https://reviews.llvm.org/D103662	2021-06-11 17:47:28 -07:00
Jez Ng	464d3dc3d1	[lld-macho] Have dead-stripping work with literal sections Literal sections are not atomically live or dead. Rather, liveness is tracked for each individual literal they contain. CStrings have their liveness tracked via a `live` bit in StringPiece, and fixed-width literals have theirs tracked via a BitVector. The live-marking code now needs to track the offset within each section that is to be marked live, in order to identify the literal at that particular offset. Numbers for linking chromium_framework on my 3.2 GHz 16-Core Intel Xeon W with both `-dead_strip` and `--deduplicate-literals`, with and without this diff applied: ``` N Min Max Median Avg Stddev x 20 4.32 4.44 4.375 4.372 0.03105174 + 20 4.3 4.39 4.36 4.3595 0.023277502 No difference proven at 95.0% confidence ``` This gives us size savings of about 0.4%. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D103979	2021-06-11 19:50:09 -04:00
Jez Ng	5d88f2dd94	[lld-macho] Deduplicate fixed-width literals Conceptually, the implementation is pretty straightforward: we put each literal value into a hashtable, and then write out the keys of that hashtable at the end. In contrast with ELF, the Mach-O format does not support variable-length literals that aren't strings. Its literals are either 4, 8, or 16 bytes in length. LLD-ELF dedups its literals via sorting + uniq'ing, but since we don't need to worry about overly-long values, we should be able to do a faster job by just hashing. That said, the implementation right now is far from optimal, because we add to those hashtables serially. To parallelize this, we'll need a basic concurrent hashtable (only needs to support concurrent writes w/o interleave reads), which shouldn't be to hard to implement, but I'd like to punt on it for now. Numbers for linking chromium_framework on my 3.2 GHz 16-Core Intel Xeon W: N Min Max Median Avg Stddev x 20 4.27 4.39 4.315 4.3225 0.033225703 + 20 4.36 4.82 4.44 4.4845 0.13152846 Difference at 95.0% confidence 0.162 +/- 0.0613971 3.74783% +/- 1.42041% (Student's t, pooled s = 0.0959262) This corresponds to binary size savings of 2MB out of 335MB, or 0.6%. It's not a great tradeoff as-is, but as mentioned our implementation can be signficantly optimized, and literal dedup will unlock more opportunities for ICF to identify identical structures that reference the same literals. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D103113	2021-06-11 19:50:08 -04:00
Nico Weber	54418c5a35	[lld/mac] Make binaries written by lld strippable Be less clever when writing the indirect symbols in LC_DYSYMTAB: lld used to make point __stubs and __la_symbol_ptr point at the same bytes in the indirect symbol table in the __LINKEDIT segment. That confused strip, so write the same bytes twice and make __stubs and __la_symbol_ptr point at one copy each, so that they don't share data. This unconfuses strip, and seems to be what ld64 does too, so hopefully tools are generally more used to this. This makes the output binaries a bit larger, but not much: 4 bytes for roughly each called function from a dylib and each weak function. Chromium Framewoork grows by 6536 bytes, clang-format by a few hundred. With this, `strip -x Chromium\ Framework` works (244 MB before stripping to 171 MB after stripping, compared to 236 MB=>164 MB with ld64). Running strip without `-x` produces the same error message now for lld-linked Chromium Framework as for when using ld64 as a linker. `strip clang-format` also works now but didn't previously. Fixes PR50657. Differential Revision: https://reviews.llvm.org/D104081	2021-06-11 00:18:03 -04:00
Fangrui Song	c03b6305d8	[ELF][RISCV] Resolve branch relocations referencing undefined weak to current location if not using PLT In a -no-pie link we optimize R_PLT_PC to R_PC. Currently we resolve a branch relocation to the link-time zero address. However such a choice tends to cause relocation overflow possibility for RISC architectures. * aarch64: GNU ld: rewrite the instruction to a NOP; ld.lld: branch to the next instruction * mips: GNU ld: branch to the start of the text segment (?); ld.lld: branch to zero * ppc32: GNU ld: rewrite the instruction to a NOP; ld.lld: branch to the current instruction * ppc64: GNU ld: rewrite the instruction to a NOP; ld.lld: branch to the current instruction * riscv: GNU ld: branch to the absolute zero address (with instruction rewriting) * i386/x86_64: GNU ld/ld.lld: branch to the link-time zero address I think that resolving to the same location is a good choice. The instruction, if triggered, is clearly an undefined behavior. Resolving to the same location can cause an infinite loop (making the user aware of the issue) while ensuring no overflow. Reviewed By: jrtc27 Differential Revision: https://reviews.llvm.org/D103001	2021-06-10 13:25:16 -07:00
Nico Weber	e87c095af3	[lld/mac] Print dylib search details with --print-dylib-search or RC_TRACE_DYLIB_SEARCHING For debugging dylib loading, it's useful to have some insight into what the linker is doing. ld64 has the undocumented RC_TRACE_DYLIB_SEARCHING env var for this printing dylib search candidates. This adds a flag --print-dylib-search to make lld print the seame information. It's useful for users, but also for writing tests. The output is formatted slightly differently than ld64, but we still support RC_TRACE_DYLIB_SEARCHING to offer at least a compatible way to trigger this. ld64 has both `-print_statistics` and `-trace_symbol_output` to enable diagnostics output. I went with "print" since that seems like a more straightforward name. Differential Revision: https://reviews.llvm.org/D103985	2021-06-09 22:08:20 -04:00
Nico Weber	bbe6f51b72	[lld/mac] Make framework symlinks in tests more realistic In a framework Foo.framework, Foo.framework/Foo is usually a relative symbolic link to Foo.framework/Versions/Current/Foo, and Foo.framework/Versions/Current is usually a relative symbolic link to A. Our tests used absolute symbolic links. Now they use relative symbolic links. No behavior change, just makes the tests more representative of the real world. (implicit-dylib.s omits the "Current" folder too, but I'm not changing that here.) Differential Revision: https://reviews.llvm.org/D103998	2021-06-09 20:39:39 -04:00
Nico Weber	0e399eb527	[lld/mac] When handling @loader_path, use realpath() of symlinks This is important for Frameworks, which are usually symlinks. ld64 gets this right for @rpath that's replaced with @loader_path, but not for bare @loader_path -- ld64's code calls realpath() in that case too, but ignores the result. ld64 somehow manages to find libbar1.dylib in the test without the explicit `-rpath` in Foo1. I don't understand why or how. But this change is a step forward and fixes an immediate problem I'm having, so let's start with this :) Differential Revision: https://reviews.llvm.org/D103990	2021-06-09 20:36:07 -04:00
Fangrui Song	928a197d26	[ELF] Add a GRP_COMDAT test with a local signature symbol See https://groups.google.com/g/generic-abi/c/2X6mR-s2zoc Test that a local signature symbol does not suppress COMDAT deduplication.	2021-06-08 09:22:30 -07:00
Jez Ng	447dfbe005	[lld-macho] Implement -force_load_swift_libs It causes libraries whose names start with "swift" to be force-loaded. Note that unlike the more general `-force_load`, this flag only applies to libraries specified via LC_LINKER_OPTIONS, and not those passed on the command-line. This is what ld64 does. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D103709	2021-06-07 23:48:35 -04:00
Jez Ng	04259cde15	[lld-macho] Implement cstring deduplication Our implementation draws heavily from LLD-ELF's, which in turn delegates its string deduplication to llvm-mc's StringTableBuilder. The messiness of this diff is largely due to the fact that we've previously assumed that all InputSections get concatenated together to form the output. This is no longer true with CStringInputSections, which split their contents into StringPieces. StringPieces are much more lightweight than InputSections, which is important as we create a lot of them. They may also overlap in the output, which makes it possible for strings to be tail-merged. In fact, the initial version of this diff implemented tail merging, but I've dropped it for reasons I'll explain later. Alignment Issues Mergeable cstring literals are found under the `__TEXT,__cstring` section. In contrast to ELF, which puts strings that need different alignments into different sections, clang's Mach-O backend puts them all in one section. Strings that need to be aligned have the `.p2align` directive emitted before them, which simply translates into zero padding in the object file. I think ld64 extracts the desired per-string alignment from this data by preserving each string's offset from the last section-aligned address. I'm not entirely certain since it doesn't seem consistent about doing this; but perhaps this can be chalked up to cases where ld64 has to deduplicate strings with different offset/alignment combos -- it seems to pick one of their alignments to preserve. This doesn't seem correct in general; we can in fact can induce ld64 to produce a crashing binary just by linking in an additional object file that only contains cstrings and no code. See PR50563 for details. Moreover, this scheme seems rather inefficient: since unaligned and aligned strings are all put in the same section, which has a single alignment value, it doesn't seem possible to tell whether a given string doesn't have any alignment requirements. Preserving offset+alignments for strings that don't need it is wasteful. In practice, the crashes seen so far seem to stem from x86_64 SIMD operations on cstrings. X86_64 requires SIMD accesses to be 16-byte-aligned. So for now, I'm thinking of just aligning all strings to 16 bytes on x86_64. This is indeed wasteful, but implementation-wise it's simpler than preserving per-string alignment+offsets. It also avoids the aforementioned crash after deduplication of differently-aligned strings. Finally, the overhead is not huge: using 16-byte alignment (vs no alignment) is only a 0.5% size overhead when linking chromium_framework. With these alignment requirements, it doesn't make sense to attempt tail merging -- most strings will not be eligible since their overlaps aren't likely to start at a 16-byte boundary. Tail-merging (with alignment) for chromium_framework only improves size by 0.3%. It's worth noting that LLD-ELF only does tail merging at `-O2`. By default (at `-O1`), it just deduplicates w/o tail merging. @thakis has also mentioned that they saw it regress compressed size in some cases and therefore turned it off. `ld64` does not seem to do tail merging at all. Performance Numbers CString deduplication reduces chromium_framework from 250MB to 242MB, or about a 3.2% reduction. Numbers for linking chromium_framework on my 3.2 GHz 16-Core Intel Xeon W: N Min Max Median Avg Stddev x 20 3.91 4.03 3.935 3.95 0.034641016 + 20 3.99 4.14 4.015 4.0365 0.0492336 Difference at 95.0% confidence 0.0865 +/- 0.027245 2.18987% +/- 0.689746% (Student's t, pooled s = 0.0425673) As expected, cstring merging incurs some non-trivial overhead. When passing `--no-literal-merge`, it seems that performance is the same, i.e. the refactoring in this diff didn't cost us. N Min Max Median Avg Stddev x 20 3.91 4.03 3.935 3.95 0.034641016 + 20 3.89 4.02 3.935 3.9435 0.043197831 No difference proven at 95.0% confidence Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D102964	2021-06-07 23:48:35 -04:00
Nico Weber	17c43c4045	[lld/mac] Add reexports after reexporter to inputFiles When a library "host"'s reexports change their installName with `$ld$os10.11$install_name$host`, we used to write a load command for "host" but write the version numbers of the reexport instead of "host". This fixes that. I first thought that the rule is to take the version numbers from the library that originally had that install name (implemented in D103819), but that's not what ld64 seems to be doing: It takes the version number from the first dylib with that install name it loads, and it loads the reexporting library before the reexports. We already did most of that, we just added reexports before the reexporter. After this change, we add the reexporter before the reexports. Addresses https://bugs.llvm.org/show_bug.cgi?id=49800#c11 part 1. (ld64 seems to add reexports after processing _all_ files on the command line, while we add them right after the reexporter. For the common case of reexport + $ld$ symbol changing back to the exporter name, this doesn't make a difference, but you can construct a case where it does. I expect this to not make a difference in practice though.) Differential Revision: https://reviews.llvm.org/D103821	2021-06-07 17:04:03 -04:00
Nico Weber	422544414b	[lld/mac] Add a test for -reexport_library + -dead_strip_dylibs Our behavior here already matched ld64, now we have a test for it. (ld64 even strips the library here if you also pass -needed_library bar.dylib. That seems wrong to me, and lld honors needed_library in that case.) Differential Revision: https://reviews.llvm.org/D103812	2021-06-07 13:44:58 -04:00
Nico Weber	c5ffe97988	[lld/mac] Implement support for searching dylibs with @rpath/ in install name Also adjust a few comments, and move the DylibFile comment talking about umbrella next to the parameter again. Differential Revision: https://reviews.llvm.org/D103783	2021-06-07 06:22:52 -04:00
Nico Weber	52489021cf	[lld/mac] Implement support for searching dylibs with @loader_path/ in install name Differential Revision: https://reviews.llvm.org/D103779	2021-06-06 20:19:50 -04:00
Nico Weber	a48bd587f7	[lld/mac] Implement support for searching dylibs with @executable_path/ in install name Differential Revision: https://reviews.llvm.org/D103775	2021-06-06 20:01:50 -04:00
Alexander Shaposhnikov	5e49ee8794	[lld][MachO] Add support for $ld$install_name symbols This diff adds support for $ld$install_name symbols. Test plan: make check-lld-macho Differential revision: https://reviews.llvm.org/D103746	2021-06-05 12:58:59 -07:00
Alexander Shaposhnikov	cf29a92b90	[lld][MachO] Fix typo in special-symbol-ld-previous.s Fix typo in the test special-symbol-ld-previous.s. NFC.	2021-06-05 01:27:42 -07:00
Alexander Shaposhnikov	1309c181a8	[lld][MachO] Add first bits to support special symbols This diff adds first bits to support special symbols $ld$previous* in LLD. $ld$* symbols modify properties/behavior of the library (e.g. its install name, compatibility version or hide/add symbols) for specific target versions. Test plan: make check-lld-macho Differential revision: https://reviews.llvm.org/D103505	2021-06-04 23:32:26 -07:00
Nico Weber	1aae55ddea	[lld/mac] Add test coverage for --reproduce + -flat_namespace Works fine already, now it has a test too. Differential Revision: https://reviews.llvm.org/D103643	2021-06-03 21:00:35 -04:00
Jez Ng	6881f29a36	[lld-macho] Parse re-exports of nested TAPI documents D103423 neglected to call `parseReexports()` for nested TBD documents, leading to symbol resolution failures when trying to look up a symbol nested more than one level deep in a TBD file. This fixes the regression and adds a test. It also appears that `umbrella` wasn't being set properly when calling `parseLoadCommands` -- it's supposed to resolve to `this` if `nullptr` is passed. I didn't write a failing test case for this but I've made `umbrella` a member so the previous behavior should be preserved. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D103586	2021-06-03 12:02:30 -04:00
Martin Storsjö	728cc0075e	[LLD] [COFF] Fix autoexport from LTO objects with comdat symbols Make sure that comdat symbols also have a non-null dummy SectionChunk associated. This requires moving around an existing FIXME regarding comdats in LTO. Differential Revision: https://reviews.llvm.org/D103012	2021-06-03 15:14:49 +03:00
Nico Weber	5ecfdb5123	[lld/mac] try to fix tests after `a5645513db` My linux system doesn't like the `grep` for some reason, but FileCheck seems to work.	2021-06-02 11:33:11 -04:00
Nico Weber	a5645513db	[lld/mac] Implement -dead_strip Also adds support for live_support sections, no_dead_strip sections, .no_dead_strip symbols. Chromium Framework 345MB unstripped -> 250MB stripped (vs 290MB unstripped -> 236M stripped with ld64). Doing dead stripping is a bit faster than not, because so much less data needs to be processed: % ministat lld_* x lld_nostrip.txt + lld_strip.txt N Min Max Median Avg Stddev x 10 3.929414 4.07692 4.0269079 4.0089678 0.044214794 + 10 3.8129408 3.9025559 3.8670411 3.8642573 0.024779651 Difference at 95.0% confidence -0.144711 +/- 0.0336749 -3.60967% +/- 0.839989% (Student's t, pooled s = 0.0358398) This interacts with many parts of the linker. I tried to add test coverage for all added `isLive()` checks, so that some test will fail if any of them is removed. I checked that the test expectations for the most part match ld64's behavior (except for live-support-iterations.s, see the comment in the test). Interacts with: - debug info - export tries - import opcodes - flags like -exported_symbol(s_list) - -U / dynamic_lookup - mod_init_funcs, mod_term_funcs - weak symbol handling - unwind info - stubs - map files - -sectcreate - undefined, dylib, common, defined (both absolute and normal) symbols It's possible it interacts with more features I didn't think of, of course. I also did some manual testing: - check-llvm check-clang check-lld work with lld with this patch as host linker and -dead_strip enabled - Chromium still starts - Chromium's base_unittests still pass, including unwind tests Implemenation-wise, this is InputSection-based, so it'll work for object files with .subsections_via_symbols (which includes all object files generated by clang). I first based this on the COFF implementation, but later realized that things are more similar to ELF. I think it'd be good to refactor MarkLive.cpp to look more like the ELF part at some point, but I'd like to get a working state checked in first. Mechanical parts: - Rename canOmitFromOutput to wasCoalesced (no behavior change) since it really is for weak coalesced symbols - Add noDeadStrip to Defined, corresponding to N_NO_DEAD_STRIP (`.no_dead_strip` in asm) Fixes PR49276. Differential Revision: https://reviews.llvm.org/D103324	2021-06-02 11:09:26 -04:00
Nico Weber	66a1ecd2cf	[lld/mac] Implement -needed_framework, -needed_library, -needed-l These allow overriding dead_strip_dylibs. Differential Revision: https://reviews.llvm.org/D103499	2021-06-02 11:06:42 -04:00
Nico Weber	e14fd7d879	[lld/mac] Don't strip explicit dylib also mentioned in LC_LINKER_OPTION Noticed by Jez in D103499. Differential Revision: https://reviews.llvm.org/D103521	2021-06-02 10:59:56 -04:00
Nico Weber	78ce89bb1e	[lld/mac] Implement -reexport_framework, -reexport_library, -reexport-l These are slightly easier-to-use versions of -sub_library and -sub_umbrella. Differential Revision: https://reviews.llvm.org/D103497	2021-06-02 06:37:34 -04:00
Nico Weber	222a88a243	[lld/mac] Make -t work correctly with -flat_namespace We used to not print dylibs referenced by other dylibs in `-t` mode. This affected reexports, and with `-flat_namespace` also just dylibs loaded by dylibs. Now we print them. Fixes PR49514. Differential Revision: https://reviews.llvm.org/D103428	2021-06-01 19:23:39 -04:00
Nico Weber	aeae3e0ba9	[lld/mac] Emit only one LC_LOAD_DYLIB per dylib In some cases, we end up with several distinct DylibFiles that have the same install name. Only emit a single LC_LOAD_DYLIB in those cases. This happens in 3 cases I know of: 1. Some tbd files are symlinks. libpthread.tbd is a symlink against libSystem.tbd for example, so `-lSystem -lpthread` loads libSystem.tbd twice. We could (and maybe should) cache loaded dylibs by realpath() to catch this. 2. Some tbd files are copies of each other. For example, CFNetwork.framework/CFNetwork.tbd and CFNetwork.framework/Versions/A/CFNetwork.tbd are two distinct copies of the same file. The former is found by `-framework CFNetwork` and the latter by the reexport in CoreServices.tbd. We could conceivably catch this by making `-framework` search look in `Versions/Current` instead of in the root, and/or by using a content hash to cache tbd files, but that's starting to sound complicated. 3. Magic $ld$ symbol processing can change the install name of a dylib based on the target platform_version. Here, two truly distinct dylibs can have the same install name. So we need this code to deal with (3) anyways. Might as well use it for 1 and 2, at least for now :) With this (and D103430), clang-format links in the same dylibs when linked with lld and ld64. Differential Revision: https://reviews.llvm.org/D103488	2021-06-01 18:15:35 -04:00
Sam Clegg	c1a59fa550	[lld][WebAssemlby] Fix for string merging of -dwarf-5 sections We were mistakenly treating `.debug_str_offsets` as a string mergable section when it is not (it contains integers not strings). This is an indication that we really should find a way to store flags for custom sections. Fixes: https://bugs.llvm.org/show_bug.cgi?id=48828 Fixes: https://bugs.chromium.org/p/chromium/issues/detail?id=1172217 Differential Revision: https://reviews.llvm.org/D103486	2021-06-01 14:33:56 -07:00
Nico Weber	2c1903412b	[lld/mac] Implement removal of unused dylibs This omits load commands for unreferenced dylibs if: - the dylib was loaded implicitly, - it is marked MH_DEAD_STRIPPABLE_DYLIB - or -dead_strip_dylibs is passed This matches ld64. Currently, the "is dylib referenced" state is computed before dead code stripping and is not updated after dead code stripping. This too matches ld64. We should do better here. With this, clang-format linked with lld (like with ld64) no longer has libobjc.A.dylib in `otool -L` output. (It was implicitly loaded as a reexport of CoreFoundation.framework, but it's not needed.) Differential Revision: https://reviews.llvm.org/D103430	2021-06-01 16:06:30 -04:00
Nico Weber	0b39f055d8	[lld/mac] Don't write mtimes to N_OSO entries if ZERO_AR_DATE is set. This is important for build determinism. This matches ld64. Differential Revision: https://reviews.llvm.org/D103446	2021-06-01 15:29:38 -04:00
Nico Weber	c4053cd14e	[lld/mac] Don't crash on -order_file with assembly inputs on arm64 .s files with `-g` generate __debug_aranges on darwin/arm64 for some reason, and those lead to `nullptr` symbols. Don't crash on that. Fixes PR50517. Differential Revision: https://reviews.llvm.org/D103350	2021-05-28 21:00:46 -04:00
Fangrui Song	2644399ce7	[lld-macho][test] Simplify --allow-empty with count 0	2021-05-28 15:15:59 -07:00
Reid Kleckner	109aac9212	[PDB] Enable parallel ghash type merging by default Ghashing is probably going to be faster in most cases, even without precomputed ghashes in object files. Here is my table of results linking clang.pdb: ------------------------------- \| threads \| GHASH \| NOGHASH \| ------------------------------- \| j1 \| 51.031s \| 25.141s \| \| j2 \| 31.079s \| 22.109s \| \| j4 \| 18.609s \| 23.156s \| \| j8 \| 11.938s \| 21.984s \| \| j28 \| 8.375s \| 18.391s \| ------------------------------- This shows that ghashing is faster if at least four cores are available. This may make the linker slower if most cores are busy in the middle of a build, but in that case, the linker probably isn't on the critical path of the build. Incremental build performance is arguably more important than highly contended batch build link performance. The -time output indicates that ghash computation is the dominant factor: Input File Reading: 924 ms ( 1.8%) GC: 689 ms ( 1.3%) ICF: 527 ms ( 1.0%) Code Layout: 414 ms ( 0.8%) Commit Output File: 24 ms ( 0.0%) PDB Emission (Cumulative): 49938 ms ( 94.8%) Add Objects: 46783 ms ( 88.8%) Global Type Hashing: 38983 ms ( 74.0%) GHash Type Merging: 5640 ms ( 10.7%) Symbol Merging: 2154 ms ( 4.1%) Publics Stream Layout: 188 ms ( 0.4%) TPI Stream Layout: 18 ms ( 0.0%) Commit to Disk: 2818 ms ( 5.4%) -------------------------------------------------- Total Link Time: 52669 ms (100.0%) We can speed that up with a faster content hash (not SHA1). Differential Revision: https://reviews.llvm.org/D102888	2021-05-27 14:19:36 -07:00
Jez Ng	fcab06bd85	[lld-macho][nfc] Sort OutputSections based on explicit order of command-line inputs This diff paves the way for {D102964} which adds a new kind of InputSection. We previously maintained section ordering implicitly: we created InputSections as we parsed each file in command-line order, and passed on this ordering when we created OutputSections and OutputSegments by iterating over these InputSections. The implicitness of the ordering made it difficult to refactor the code to e.g. handle a new type of InputSection. As such, I've codified the ordering explicitly via `inputOrder` fields. This also allows us to use `sort` instead of `stable_sort`. Benchmarking chromium_framework on my 3.2 GHz 16-Core Intel Xeon W: N Min Max Median Avg Stddev x 20 4.23 4.35 4.27 4.274 0.030157481 + 20 4.24 4.38 4.27 4.2815 0.033759989 No difference proven at 95.0% confidence Reviewed By: #lld-macho, alexshap Differential Revision: https://reviews.llvm.org/D102972	2021-05-25 14:58:29 -04:00
Nathan Lanza	2f65166056	[lld:elf] Weaken the requirement for a computed binding to be STB_LOCAL Given the following scenario: ``` // Cat.cpp struct Animal { virtual void makeNoise() const = 0; }; struct Cat : Animal { void makeNoise() const override; }; extern "C" int puts(char const ); void Cat::makeNoise() const { puts("Meow"); } void doThingWithCat(Animal a) { static_cast<Cat >(a)->makeNoise(); } // CatUser.cpp struct Animal { virtual void makeNoise() const = 0; }; struct Cat : Animal { void makeNoise() const override; }; void doThingWithCat(Animal a); void useDoThingWithCat() { Cat d = new Cat; doThingWithCat(d); } // cat.ver { global: _Z17useDoThingWithCatv; local: ; }; $ clang++ Cat.cpp CatUser.cpp -fpic -flto=thin -fwhole-program-vtables -shared -O3 -fuse-ld=lld -Wl,--lto-whole-program-visibility -Wl,--version-script,cat.ver ``` We cannot devirtualize `Cat::makeNoise`. The issue is complex: Due to `-fsplit-lto-unit` and usage of type metadata, we place the Cat vtable declaration into module 0 and the Cat vtable definition with type metadata into module 1, causing duplicate entries (Undefined followed by Defined) in the `lto::InputFile::symbols()` output. In `BitcodeFile::parse`, after processing the `Undefined` then the `Defined`, the final state is `Defined`. In `BitcodeCompiler::add`, for the first symbol, `computeBinding` returns `STB_LOCAL`, then we reset it to `Undefined` because it is prevailing (`versionId` is `preserved`). For the second symbol, because the state is now `Undefined`, `computeBinding` returns `STB_GLOBAL`, causing `ExportDynamic` to be true and suppressing devirtualization. In D77280, the `computeBinding` change used a stricter `isDefined()` condition to make weak``Lazy` symbol work. This patch relaxes the condition to weaker `!isLazy()` to keep it working while making the devirtualization work as well. Differential Revision: https://reviews.llvm.org/D98686	2021-05-24 23:32:21 -04:00
serge-sans-paille	4ab3041acb	Revert "[NFC] remove explicit default value for strboolattr attribute in tests" This reverts commit `bda6e5bee0`. See https://lab.llvm.org/buildbot/#/builders/109/builds/15424 for instance	2021-05-24 19:43:40 +02:00
serge-sans-paille	bda6e5bee0	[NFC] remove explicit default value for strboolattr attribute in tests Since `d6de1e1a71`, no attributes is quivalent to setting attribute to false. This is a preliminary commit for https://reviews.llvm.org/D99080	2021-05-24 19:31:04 +02:00
Fangrui Song	0f298ec6cc	[ELF][test] Avoid local signature symbols for section groups to match reality If we support local signature symbols (PR43094), these tests would fail. When the support is added, new tests (local signature symbol specific) should be developed.	2021-05-22 17:48:45 -07:00
Sam Clegg	8544b40b6e	[lld][WebAssembly] Fix for PIC output + TLS + non-shared-memory Prior to this change build with `-shared/-pie` and using TLS (but without -shared-memory) would hit this assert: "Currenly only a single data segment is supported in PIC mode" This is because we were not including TLS data when merging data segments. However, when we build without shared-memory (i.e. without threads) we effectively lower away TLS into a normal active data segment.. so we were ending up with two active data segments: the merged data, and the lowered TLS data. To fix this problem we can instead avoid combining data segments at all when running in shared memory mode (because in this case all segment initialization is passive). And then in non-shared memory mode we know that TLS has been lowered and therefore we can can and should combine all segments. So with this new behavior we have two different modes: 1. With shared memory / mutli-threaded: Never combine data segments since it is not necessary. (All data segments as passive already). 2. Wihout shared memory / single-threaded: Combine all data segments since we treat TLS as normal data. (We end up with a single active data segment). Differential Revision: https://reviews.llvm.org/D102937	2021-05-21 15:16:47 -07:00
Axel Y. Rivera	4fb131b497	[LLD][COFF] PR49068: Include the IMAGE_REL_BASED_HIGHLOW relocation base type when the machine is 64 bits and the relocation type is ADDR32 The COFF driver produces an ABSOLUTE relocation base for an ADDR32 relocation type and the system is 64 bits (machine=AMD64). The relocation information won't be added in the output and could produce an incorrect address access during run-time. This change set checks if the relocation type is IMAGE_REL_AMD64_ADDR32 and if so, adds the relocated symbol as IMAGE_REL_BASED_HIGHLOW base. Differential Revision: https://reviews.llvm.org/D96619	2021-05-21 23:45:55 +03:00
Reid Kleckner	e73203a561	[PDB] Check the type server guid when ghashing Previously we simply didn't check this. Prereq to make the test suite pass with ghash enabled by default. Differential Revision: https://reviews.llvm.org/D102885	2021-05-20 16:36:12 -07:00
Martin Storsjö	33b71ec9c6	[LLD] [COFF] Fix automatic export of symbols from LTO objects Differential Revision: https://reviews.llvm.org/D101569	2021-05-21 00:36:58 +03:00
Wouter van Oortmerssen	3a293cbf13	[WebAssembly] Fix PIC/GOT codegen for wasm64 __table_base is know 64-bit, since in LLVM it represents a function pointer offset __table_base32 is a copy in wasm32 for use in elem init expr, since no truncation may be used there. New reloc R_WASM_TABLE_INDEX_REL_SLEB64 added Differential Revision: https://reviews.llvm.org/D101784	2021-05-20 09:59:31 -07:00
Sam Clegg	356b85edd7	[lld][WebAssembly] Fix for string tail merging and -r/--relocatable Ensure that both SyntheticMergedChunk and all MergeInfoChunks that it comprises are assigned the correct output section. Without this we would crash when outputting relocations in --relocatable mode. Fixes: https://github.com/emscripten-core/emscripten/issues/14220 Differential Revision: https://reviews.llvm.org/D102806	2021-05-19 15:25:58 -07:00
Reid Kleckner	12dd8df38b	[PDB] Do not record PGO or coverage public symbols These symbols are long, and they tend to cause the PDB file size to overflow. They are generally not necessary when debugging problems in user code. This change reduces the size of chrome.dll.pdb with coverage from 6,937,108,480 bytes to 4,690,210,816 bytes. Differential Revision: https://reviews.llvm.org/D102719	2021-05-19 12:41:31 -07:00
Mariusz Ceier	9383e9c1e6	Fix lld macho standalone build by including llvm/Config/llvm-config.h instead of llvm/Config/config.h lld/MachO/Driver.cpp and lld/MachO/SyntheticSections.cpp include llvm/Config/config.h which doesn't exist when building standalone lld. This patch replaces llvm/Config/config.h include with llvm/Config/llvm-config.h just like it is in lld/ELF/Driver.cpp and HAVE_LIBXAR with LLVM_HAVE_LIXAR and moves LLVM_HAVE_LIBXAR from config.h to llvm-config.h Also it adds LLVM_HAVE_LIBXAR to LLVMConfig.cmake and links liblldMachO2.so with XAR_LIB if LLVM_HAVE_LIBXAR is set. Differential Revision: https://reviews.llvm.org/D102084	2021-05-19 11:15:07 -04:00
Sam Clegg	876d49baad	[lld][WebAssembly] Convert test to assembly. NFC. Differential Revision: https://reviews.llvm.org/D102704	2021-05-18 12:31:13 -07:00
Sam Clegg	45b7cf9955	[lld][WebAssembly] Enable string tail merging in debug sections This is a followup to https://reviews.llvm.org/D97657 which applied string tail merging to data segments. Fixes: https://bugs.llvm.org/show_bug.cgi?id=48828 Differential Revision: https://reviews.llvm.org/D102436	2021-05-18 12:25:39 -07:00
Nico Weber	b4ead2c37b	[lld/mac] Correctly set nextdefsym In LC_DYSYMTAB, private externs were still emitted as exported symbols instead of as locals. Fixes PR50373. See bug for details. Differential Revision: https://reviews.llvm.org/D102662	2021-05-18 13:53:55 -04:00
Markus Böck	65271ffe84	[lld][MinGW] Introduce aliases for -Bdynamic and -Bstatic Besides -Bdynamic and -Bstatic, ld documents additional aliases for both of these options. Instead of -Bstatic, one may write -dn, -non_shared or -static. Instead of -Bdynamic one may write -dy or -call_shared. Source: https://sourceware.org/binutils/docs-2.36/ld/Options.html This patch adds those aliases to the MinGW driver of lld for the sake of ld compatibility. Encountered this case while compiling a static Qt 6.1 distribution and got build failures as -static was passed directly to the linker, instead of through the compiler driver. Differential Revision: https://reviews.llvm.org/D102637	2021-05-17 22:13:26 +02:00
Nico Weber	4a12248ee2	[lld/mac] Honor REFERENCED_DYAMICALLY, set it on __mh_execute_header Has the effect that `__mh_execute_header` stays in the symbol table of outputs even after running `strip` on the output. I don't know if that's important for anything -- my motivation for the patch is just is to make the output more similar to ld64. (Corresponds to symbolTableInAndNeverStrip in ld64.) Differential Revision: https://reviews.llvm.org/D102619	2021-05-17 14:22:12 -04:00
Mateusz Mikuła	84306ef9c4	[LLD][MinGW] Add --fatal-warnings and --no-fatal-warnings flags Differential Revision: https://reviews.llvm.org/D102514	2021-05-17 10:40:31 +03:00
Harald van Dijk	d62413452f	[lld][X86] Restore gotEntrySize. D62727 removed GotEntrySize and GotPltEntrySize with a comment that they are always equal to wordsize(), but that is not entirely true: X32 has a word size of 4, but needs 8-byte GOT entries. This restores gotEntrySize for both, adjusted for current naming conventions, but defaults it to config->wordsize to keep things simple for architectures other than x86_64. This partially reverts D62727. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D102509	2021-05-17 00:13:00 +01:00
Fangrui Song	5741dc87a5	[test] Improve x86-64-plt.s	2021-05-14 10:38:40 -07:00
Fangrui Song	4adf7a7604	[ELF] Add -Bno-symbolic This option will be available in GNU ld 2.27 (https://sourceware.org/bugzilla/show_bug.cgi?id=27834). This option can cancel previously specified -Bsymbolic and -Bsymbolic-functions. This is useful for excluding some links when the default uses -Bsymbolic-functions. Reviewed By: jhenderson, peter.smith Differential Revision: https://reviews.llvm.org/D102383	2021-05-14 09:40:32 -07:00
Fangrui Song	da9b6d0656	[ELF][test] Improve -Bsymbolic & -Bsymbolic-functions test Previously there was no test checking that -Bsymbolic-functions only applies to STT_FUNC symbols. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D102461	2021-05-14 09:33:43 -07:00
Sam Clegg	cd01430ff1	[lld][WebAssembly] Allow data symbols to extend past end of segment This fixes a bug with string merging with string symbols that contain NULLs, as is the case in the `merge-string.s` test. The bug only showed when we run with `--relocatable` and then try read the resulting object back in. In this case we would end up with string symbols that extend past the end of the segment in which they live. The problem comes from the fact that sections which are flagged as string mergable assume that all strings are NULL terminated. The merging algorithm will drop trailing chars that follow a NULL since they are essentially unreachable. However, the "size" attribute (in the symbol table) of such a truncated symbol is not updated resulting a symbol size that can overlap the end of the segment. I verified that this can happen in ELF too given the right conditions and the its harmless enough. In practice Strings that contain embedded null should not be part of a mergable section. Differential Revision: https://reviews.llvm.org/D102281	2021-05-12 13:43:37 -07:00
Sam Clegg	3041b16f73	[WebAssembly] Add TLS data segment flag: WASM_SEG_FLAG_TLS Previously the linker was relying solely on the name of the segment to imply TLS. Differential Revision: https://reviews.llvm.org/D102202	2021-05-12 13:31:02 -07:00
Martin Storsjö	7e0768329c	[LLD] [COFF] Fix including the personality function for DWARF EH when linking with --gc-sections Since `c579a5b1d9` we don't traverse .eh_frame when doing GC. But the exception handling personality function needs to be included, and is only referenced from within .eh_frame. Differential Revision: https://reviews.llvm.org/D102138	2021-05-12 22:23:01 +03:00
Greg McGary	93c8559baf	[lld-macho] Implement branch-range-extension thunks Extend the range of calls beyond an architecture's limited branch range by first calling a thunk, which loads the far address into a scratch register (x16 on ARM64) and branches through it. Other ports (COFF, ELF) use multiple passes with successively-refined guesses regarding the expansion of text-space imposed by thunk-space overhead. This MachO algorithm places thunks during MergedOutputSection::finalize() in a single pass using exact thunk-space overheads. Thunks are kept in a separate vector to avoid the overhead of inserting into the `inputs` vector of `MergedOutputSection`. FIXME: * arm64-stubs.s test is broken * add thunk tests * Handle thunks to DylibSymbol in MergedOutputSection::finalize() Differential Revision: https://reviews.llvm.org/D100818	2021-05-12 09:44:58 -07:00
Sam Clegg	19cedd3cd3	[lld][WebAssembly] Fix for string merging + negative addends Don't include the relocation addend when calculating the virtual address of a symbol. Instead just pass the symbol's offset and add the addend afterwards. Without this fix we hit the `offset is outside the section` error in MergeInputSegment::getSegmentPiece. This fixes a real world error we were are seeing in emscripten. Differential Revision: https://reviews.llvm.org/D102271	2021-05-11 17:47:57 -07:00
Sam Clegg	b49a798e71	[lld][WebAssembly] Remove relocation target verification We have this extra step in wasm-ld that doesn't exist in other lld backend which verifies the existing contents of the relocation targets. This was originally intended as an extra form of double checking and an aid to compiler developers. However it has always been somewhat controversial and there have been suggestions in the past the we simply remove it. My motivation for removing it now is that its causing me a headache when trying to fix an issue with negative addends. In the case of negative addends that final result can be wrapped/negative but this checking code would require significant modification to be able to deal with that case. For example with some test cases I'm looking at I'm seeing error like this: ``` wasm-ld: warning: /usr/local/google/home/sbc/dev/wasm/llvm-build/tools/lld/test/wasm/Output/merge-string.s.tmp.o:(.rodata_relocs): unexpected existing value for R_WASM_MEMORY_ADDR_I32: existing=FFFFFFFA expected=FFFFFFFFFFFFFFFA ``` Rather than try to refactor `calcExpectedValue` to somehow return two different types of results (32 and 64-bit) depending on the relocation type, I think we can just remove this code. Differential Revision: https://reviews.llvm.org/D102265	2021-05-11 12:05:14 -07:00
Sam Clegg	b2f227c6c8	[lld][WebAssembly] Convert test to assembly. NFC. Differential Revision: https://reviews.llvm.org/D102264	2021-05-11 11:37:53 -07:00
Nico Weber	9ab49ae55d	[lld/mac] Implement -sectalign clang sometimes passes this flag along (see D68351), so we should implement it. Differential Revision: https://reviews.llvm.org/D102247	2021-05-11 13:31:32 -04:00
Igor Kudrin	70c23e232e	[LLD] Improve reporting unresolved symbols in shared libraries Currently, when reporting unresolved symbols in shared libraries, if an undefined symbol is firstly seen in a regular object file that shadows the reference for the same symbol in a shared object. As a result, the error for the unresolved symbol in the shared library is not reported. If referencing sections in regular object files are discarded because of '--gc-sections', no reports about such symbols are generated, and the linker finishes successfully, generating an output image that fails on the run. The patch fixes the issue by keeping symbols, which should be checked, for each shared library separately. Differential Revision: https://reviews.llvm.org/D101996	2021-05-11 12:48:29 +07:00
Sam Clegg	3b8d2be527	Reland: "[lld][WebAssembly] Initial support merging string data" This change was originally landed in: `5000a1b4b9` It was reverted in: `061e071d8c` This change adds support for a new WASM_SEG_FLAG_STRINGS flag in the object format which works in a similar fashion to SHF_STRINGS in the ELF world. Unlike the ELF linker this support is currently limited: - No support for SHF_MERGE (non-string merging) - Always do full tail merging ("lo" can be merged with "hello") - Only support single byte strings (p2align 0) Like the ELF linker merging is only performed at `-O1` and above. This fixes part of https://bugs.llvm.org/show_bug.cgi?id=48828, although crucially it doesn't not currently support debug sections because they are not represented by data segments (they are custom sections) Differential Revision: https://reviews.llvm.org/D97657	2021-05-10 16:03:38 -07:00
Nico Weber	061e071d8c	Revert "[lld][WebAssembly] Initial support merging string data" This reverts commit `5000a1b4b9`. Breaks tests, see https://reviews.llvm.org/D97657#2749151 Easily repros locally with `ninja check-llvm-mc-webassembly`.	2021-05-10 18:28:28 -04:00
Sam Clegg	5000a1b4b9	[lld][WebAssembly] Initial support merging string data This change adds support for a new WASM_SEG_FLAG_STRINGS flag in the object format which works in a similar fashion to SHF_STRINGS in the ELF world. Unlike the ELF linker this support is currently limited: - No support for SHF_MERGE (non-string merging) - Always do full tail merging ("lo" can be merged with "hello") - Only support single byte strings (p2align 0) Like the ELF linker merging is only performed at `-O1` and above. This fixes part of https://bugs.llvm.org/show_bug.cgi?id=48828, although crucially it doesn't not currently support debug sections because they are not represented by data segments (they are custom sections) Differential Revision: https://reviews.llvm.org/D97657	2021-05-10 13:15:12 -07:00
Jez Ng	b1c3c2e4fc	[lld-macho] Fix order file arch filtering We had a hardcoded check and a stale TODO, written back when we only had support for one architecture. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D102154	2021-05-10 15:45:54 -04:00
Jez Ng	2516b0b526	[lld-macho] Treat undefined symbols uniformly In particular, we should apply the `-undefined` behavior to all such symbols, include those that are specified via the command line (i.e. `-e`, `-u`, and `-exported_symbol`). ld64 supports this too. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D102143	2021-05-10 15:45:54 -04:00
Jez Ng	3d5e5066f1	[lld-macho][nfc] Clean up tests * Remove unnecessary `rm -rf %t`s * Have lc-linker-option.ll use the right comment marker	2021-05-10 15:45:54 -04:00
Fangrui Song	1f44fee521	[lld-macho] Improve an external weak def test The rebase table entry is untested. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D102150	2021-05-10 10:35:44 -07:00
Sam Clegg	bda8b84884	[lld][WebAssembly] Disallow exporting of TLS symbols Cross module TLS is currently not supported by our ABI. This change makes explicitly exporting a TLS symbol into an error and prevents implicit exporting (via --export-all). See https://github.com/emscripten-core/emscripten/issues/14120 Differential Revision: https://reviews.llvm.org/D102044	2021-05-10 09:58:44 -07:00
Fangrui Song	7a0231ae59	[llvm-objdump][MachO] Print a newline before lazy bind/bind/weak/exports trie This adds a separator between two pieces of information. Reviewed By: #lld-macho, alexshap Differential Revision: https://reviews.llvm.org/D102114	2021-05-10 09:16:18 -07:00
Jez Ng	75f74f2673	[lld-macho] Add llvm-otool as a test dependency This unbreaks my local build, which is configured to build only parts of LLVM.	2021-05-09 21:12:58 -04:00
Nico Weber	7f673fcaa9	[lld/mac] Fix alignment on subsections On a section with alignment of 16, subsections aligned to 16-byte boundaries should keep their 16-byte alignment. Fixes PR50274. (The same bug could have happened with -order_file previously.) Differential Revision: https://reviews.llvm.org/D102139	2021-05-09 21:00:56 -04:00
Jez Ng	0f8854f7f5	[lld-macho] Don't reference entry symbol for non-executables This would cause us to pull in symbols (and code) that should be unused. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D102137	2021-05-09 20:30:26 -04:00
Greg McGary	4b89629403	[lld-macho][NFC] Purge stale test-output trees prior to split-file Enforce standard practice Differential Revision: https://reviews.llvm.org/D102112	2021-05-08 17:36:30 -07:00
Greg McGary	5be8502271	[lld-macho] Explicitly undefine literal exported symbols Symbols explicitly exported via command-line options `--exported_symbol SYM` and `--exported_symbols_list FILE` must be defined. Before this fix, lazy symbols defined in archives would be left to languish. We now force them to be included in the linked output. Differential Revision: https://reviews.llvm.org/D102100	2021-05-08 11:37:00 -07:00
Arthur Eubanks	34a8a437bf	[NewPM] Hide pass manager debug logging behind -debug-pass-manager-verbose Printing pass manager invocations is fairly verbose and not super useful. This allows us to remove DebugLogging from pass managers and PassBuilder since all logging (aside from analysis managers) goes through instrumentation now. This has the downside of never being able to print the top level pass manager via instrumentation, but that seems like a minor downside. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D101797	2021-05-07 21:51:47 -07:00
Nico Weber	d5a70db193	[lld/mac] Write every weak symbol only once in the output Before this, if an inline function was defined in several input files, lld would write each copy of the inline function the output. With this patch, it only writes one copy. Reduces the size of Chromium Framework from 378MB to 345MB (compared to 290MB linked with ld64, which also does dead-stripping, which we don't do yet), and makes linking it faster: N Min Max Median Avg Stddev x 10 3.9957051 4.3496981 4.1411121 4.156837 0.10092097 + 10 3.908154 4.169318 3.9712729 3.9846753 0.075773012 Difference at 95.0% confidence -0.172162 +/- 0.083847 -4.14165% +/- 2.01709% (Student's t, pooled s = 0.0892373) Implementation-wise, when merging two weak symbols, this sets a "canOmitFromOutput" on the InputSection belonging to the weak symbol not put in the symbol table. We then don't write InputSections that have this set, as long as they are not referenced from other symbols. (This happens e.g. for object files that don't set .subsections_via_symbols or that use .alt_entry.) Some restrictions: - not yet done for bitcode inputs - no "comdat" handling (`kindNoneGroupSubordinate*` in ld64) -- Frame Descriptor Entries (FDEs), Language Specific Data Areas (LSDAs) (that is, catch block unwind information) and Personality Routines associated with weak functions still not stripped. This is wasteful, but harmless. - However, this does strip weaks from __unwind_info (which is needed for correctness and not just for size) - This nopes out on InputSections that are referenced form more than one symbol (eg from .alt_entry) for now Things that work based on symbols Just Work: - map files (change in MapFile.cpp is no-op and not needed; I just found it a bit more explicit) - exports Things that work with inputSections need to explicitly check if an inputSection is written (e.g. unwind info). This patch is useful in itself, but it's also likely also a useful foundation for dead_strip. I used to have a "canoncialRepresentative" pointer on InputSection instead of just the bool, which would be handy for ICF too. But I ended up not needing it for this patch, so I removed that again for now. Differential Revision: https://reviews.llvm.org/D102076	2021-05-07 17:11:40 -04:00
LemonBoy	f876383384	[AsmParser][ARM] Make .thumb_func imply .thumb GNU as documentation states that a `.thumb_func` directive implies `.thumb`, teach the asm parser to switch mode whenever it's encountered. On the other hand the labeled form, exclusive to Apple's toolchain, doesn't switch mode at all. Reviewed By: nickdesaulniers, peter.smith Differential Revision: https://reviews.llvm.org/D101975	2021-05-07 12:13:36 +02:00
Stefan Pintilie	f0adf3a24c	[PowerPC][LLD] Make sure that the correct Thunks are used. This fixes an issue where mixed TOC / NOTOC calls can call the incorrect thunks if a previous thunk already exists. The issue appears when a TOC funciton calls a NOTOC callee and then a different NOTOC function calls the same NOTOC callee. In this case the linker would sometimes incorrectly call the same thunk for both cases. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101837	2021-05-06 12:00:04 -05:00
Jez Ng	9260760235	[lld-macho] Support loading of zippered dylibs ld64 can emit dylibs that support more than one platform (typically macOS and macCatalyst). This diff allows LLD to read in those dylibs. Note that this is a super bare-bones implementation -- in particular, I haven't added support for LLD to emit those multi-platform dylibs, nor have I added a variety of validation checks that ld64 does. Until we have a use-case for emitting zippered dylibs, I think this is good enough. Fixes PR49597. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D101954	2021-05-06 11:19:40 -04:00

... 3 4 5 6 7 ...

7199 Commits