llvm-project

Commit Graph

Author	SHA1	Message	Date
Martin Storsjö	7c15da6761	[LLD] [COFF] Interpret the immediate in ARM64 adr/adrp relocations as signed 21 bit This matches how MS link.exe interprets this relocation. Differential Revision: https://reviews.llvm.org/D114347	2021-11-23 10:13:01 +02:00
Shoaib Meenai	2f5d6a0ea5	[MachO] Fix struct size assertion std::vector can have different sizes depending on the STL's debug level, so account for its size separately. (You could argue that we should be accounting for all the other members separately as well, but that would be very unergonomic, and std::vector is the only one that's caused problems so far.)	2021-11-22 15:02:30 -08:00
Fangrui Song	7aafe467d2	[ELF] Simplify a condition with config->copyRelocs. NFC	2021-11-22 13:59:23 -08:00
Vy Nguyen	944071eca2	[lld-macho] Don't replace local personality symbol with LazySymbol Follow-up to D107533, where we replaced local syms with non-local. It doesn't make sense to replace a local symbol with a lazy one. Differential Revision: https://reviews.llvm.org/D110040	2021-11-22 14:09:54 -05:00
Igor Kudrin	a05b694b1e	[ELF][NFC] Do not pass region name to expandMemoryRegion() The name can be easily got on-site. Differential Revision: https://reviews.llvm.org/D114228	2021-11-22 14:19:07 +07:00
Fangrui Song	648157b05a	[ELF] Move getOutputSectionName from Writer.cpp to LinkerScript.cpp. NFC and internalize it.	2021-11-20 22:18:09 -08:00
Fangrui Song	2997441b85	[ELF] Support discarding .got.plt Fix a null pointer dereference when .got.plt is discarded. This also adds a test for discarding `.plt`. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D114180	2021-11-19 10:50:53 -08:00
Nico Weber	bc20bcb39e	[lld/mac] Crash even less on undefined symbols with --icf=all Follow-up to https://reviews.llvm.org/D112643. Even after that change, we were still asserting if two separate functions that are eligible for ICF (same size, same data, same number of relocs, same reloc types, ...) referred to Undefineds. This fixes that oversight. Differential Revision: https://reviews.llvm.org/D114195	2021-11-19 09:23:19 -05:00
Andrew Ng	47eb3f155f	[ELF] Ensure output section is not discarded in addStartEndSymbols() Fixes https://bugs.llvm.org/show_bug.cgi?id=52534. Differential Revision: https://reviews.llvm.org/D114179	2021-11-19 11:45:58 +00:00
Konstantin Schwarz	8c18719bae	[ELF] Expand LMA region if output section alignment introduces padding When aligning the start address of an output section introduces a gap between the current dot pointer and the new aligned address, we were already properly expanding the memory region, if available. D74286 introduced a new behavior to also align the LMA address if an LMA region is specified. However, this did not expand the corresponding LMA region. Now, we also expand the LMA region if it is set. This fixes PR52510. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D114166	2021-11-19 11:27:21 +01:00
Vincent Lee	adfbb5411b	[lld-macho] Add warn flags to enable/disable warnings on -install_name ld64 doesn't warn on builds using `-install_name` if it's a bundle. But, the current warning is nice to have because `install_name` only works with dylib. To prevent an overflow of warnings in build logs and have parity with ld64, create a `--warn-dylib-install-name` and `--warn-no-dylib-install-name` flag that enables this LLD specific warning. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D113534	2021-11-17 16:18:14 -08:00
Greg McGary	9cc489a4b2	[lld-macho][nfc] Factor-out NFC changes from main __eh_frame diff In order to keep signal:noise high for the `__eh_frame` diff, I have teased-out the NFC changes and put them here. Differential Revision: https://reviews.llvm.org/D114017	2021-11-17 15:16:44 -07:00
Shoaib Meenai	01510ac084	[MachO] Move type size asserts to source files. NFC As discussed in https://reviews.llvm.org/D113809#3128636. It's a bit unfortunate to move the asserts away from the structs whose sizes they're checking, but it's a far better developer experience when one of the asserts is violated, because you get a single error instead of every single source file including the header erroring out.	2021-11-16 17:14:16 -08:00
Vy Nguyen	34d15eaced	[lld-macho][nfc] Sanity check on template type Differential Revision: https://reviews.llvm.org/D114044	2021-11-16 20:04:49 -05:00
Shoaib Meenai	93bf271f27	[MachO] Shrink reloc from 32 bytes to 24 bytes The `r_address` field of `relocation_info` is only 4 bytes, so our offset field (which is the `r_address` field adjusted for subsection splitting) also only needs to be 4 bytes. This reduces the structure size from 32 bytes to 24 bytes. Combined with https://reviews.llvm.org/D113813, this is a minor perf improvement for linking an internal app, tested on two machines: ``` smol-relocs baseline difference (95% CI) sys_time 7.367 ± 0.138 7.543 ± 0.157 [ +0.9% .. +3.8%] user_time 21.843 ± 0.351 21.861 ± 0.450 [ -1.3% .. +1.4%] wall_time 20.301 ± 0.307 20.556 ± 0.324 [ +0.1% .. +2.4%] samples 16 16 smol-relocs baseline difference (95% CI) sys_time 2.923 ± 0.050 2.992 ± 0.018 [ +1.4% .. +3.4%] user_time 10.345 ± 0.039 10.448 ± 0.023 [ +0.8% .. +1.2%] wall_time 12.068 ± 0.071 12.229 ± 0.021 [ +1.0% .. +1.7%] samples 15 12 ``` More importantly though, this change by itself reduces our maximum resident set size by 220 MB (2.75%, from 7.85 GB to 7.64 GB) on the first machine. On the second machine, it reduces it by 125 MB (1.94%, from 6.31 GB to 6.19 GB). Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113818	2021-11-16 16:30:34 -08:00
Shoaib Meenai	3195297897	[MachO] Reduce size of Symbol and Defined We can lay out Symbol more optimally to reduce its size from 56 bytes to 48 bytes by eliminating unnecessary padding, and we can lay out Defined such that its bitfield members are placed in the tail padding of Symbol (on ABIs which support this), to reduce it from 96 bytes to 80 bytes (8 bytes from the Symbol reduction, and 8 bytes from the tail padding reuse). This is perf-neutral for an internal app (results from two different machines): ``` smol-syms baseline difference (95% CI) sys_time 7.430 ± 0.202 7.440 ± 0.193 [ -2.6% .. +2.9%] user_time 21.443 ± 0.513 21.206 ± 0.396 [ -3.3% .. +1.1%] wall_time 20.453 ± 0.534 20.222 ± 0.488 [ -3.7% .. +1.5%] samples 9 8 smol-syms baseline difference (95% CI) sys_time 3.011 ± 0.050 3.040 ± 0.052 [ -0.4% .. +2.3%] user_time 10.416 ± 0.075 10.496 ± 0.091 [ +0.1% .. +1.4%] wall_time 12.229 ± 0.144 12.354 ± 0.192 [ -0.1% .. +2.1%] samples 14 13 ``` However, on the first machine, it reduces maximum resident set size by 65.9 MB (0.8%, from 7.92 GB to 7.85 GB). On the second machine, it reduces it by 92 MB (1.4%, from 6.40 GB to 6.31 GB). Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113813	2021-11-16 16:30:33 -08:00
Shoaib Meenai	637a3396b3	[MachO] Fix struct size assertion It was checking for 64-bit builds incorrectly. Unfortunately, ConcatInputSection has grown a bit in the meantime, and I don't see any obvious way to shrink it. Perhaps icfEqClass could use 32-bit hashes instead of 64-bit ones, but xxHash64 is supposed to be much faster than xxHash32 (https://github.com/Cyan4973/xxHash#benchmarks), so that sounds like a loss. (Unrelatedly, we should really look at using XXH3 instead of xxHash64 now.) Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113809	2021-11-16 16:30:31 -08:00
Greg McGary	3a1b3c9afe	[lld-macho][nfc] rename parsed-section types & variables This is an NFC diff that prepares for pruning & relocating `__eh_frame`. Along the way, I made the following changes to ... * clarify usage of `section` vs. `subsection` * remove `map` & `vec` from type names * disambiguate class `Section` from template parameter `SectionHeader`. Differential Revision: https://reviews.llvm.org/D113241	2021-11-16 07:06:41 -07:00
Quinn Pham	1ca00ecfb8	[NFC][lld] Inclusive language: change master file to merged file [NFC] As part of using inclusive language within the llvm project, this patch replaces master with merged in these comments. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D113903	2021-11-15 14:32:09 -06:00
Igor Kudrin	66691de94c	[ELF] Do not try to assign a memory region to a non-allocatable section Non-allocatable sections are not part of the memory image of the program, so there is no need to find memory regions for them either matching properties or handling explicit assignments. The early test and return help to simplify LinkerScript::findMemoryRegion() a bit. Differential Revision: https://reviews.llvm.org/D113768	2021-11-15 15:59:39 +07:00
Shao-Ce SUN	0c660256eb	[NFC] Trim trailing whitespace in *.rst	2021-11-15 09:17:08 +08:00
Keith Smiley	51715fbd96	[lld-macho] Fix warning ``` /Users/ksmiley/dev/llvm-project/lld/MachO/Symbols.cpp:43:27: warning: field 'external' will be initialized after field 'weakDefCanBeHidden' [-Wreorder-ctor] weakDef(isWeakDef), external(isExternal), ^ 1 warning generated. ``` Differential Revision: https://reviews.llvm.org/D113823	2021-11-12 19:36:51 -08:00
Vy Nguyen	9b29dae3ca	[lld-macho] Allow exporting weak_def_can_be_hidden(AKA "autohide") symbols Autohide symbols behave similarly to private_extern symbols. However, LD64 allows exporting autohide symbols. LLD currently does not. This patch allows LLD to export them. Differential Revision: https://reviews.llvm.org/D113167	2021-11-12 21:57:30 -05:00
Vy Nguyen	ad932320d8	[lld-macho] Parallelize scanning the symbol tables in export/unexport-ing. (Split from D113167) Benchmarking on one of our large apps which exports a few thousand symbols, this showed an improvement of ~17%. x ./LLD_no_parallel.txt + ./LLD_with_parallel.txt N Min Max Median Avg Stddev x 10 84.01 89.41 88.64 87.693 1.7424061 + 10 71.9 74.29 72.63 72.753 0.77734663 Difference at 95.0% confidence -14.94 +/- 1.26763 -17.0367% +/- 1.44553% (Student's t, pooled s = 1.34912) (wallclock) Differential Revision: https://reviews.llvm.org/D113820	2021-11-12 20:57:24 -05:00
Duncan P. N. Exon Smith	9a2b54af22	lld: const-qualify iterations through VarStreamArray, NFC No functionality change here; just unblocking a patch to LLVM.	2021-11-12 14:29:49 -08:00
Jez Ng	9d0b237c51	[lld-macho] Fix symbol relocs handling for LSDAs Similar to D113702, but for the LSDAs. Clang seems to emit all LSDA relocs as section relocs, but ld -r can turn those relocs into symbol ones. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D113721	2021-11-12 16:02:49 -05:00
Jez Ng	d9b6f7e312	[lld-macho] Teach ICF to dedup functions with identical unwind info Dedup'ing unwind info is tricky because each CUE contains a different function address. If ICF operated naively and compared the entire contents of each CUE, entries with identical unwind info but belonging to different functions would never be considered identical. To work around this problem, we slice away the function address before performing ICF. We rely on `relocateCompactUnwind()` to correctly handle these truncated input sections. Here are the numbers before and after D109944, D109945, and this diff were applied, as tested on my 3.2 GHz 16-Core Intel Xeon W: Without any optimizations: base diff difference (95% CI) sys_time 0.849 ± 0.015 0.896 ± 0.012 [ +4.8% .. +6.2%] user_time 3.357 ± 0.030 3.512 ± 0.023 [ +4.3% .. +5.0%] wall_time 3.944 ± 0.039 4.032 ± 0.031 [ +1.8% .. +2.6%] samples 40 38 With `-dead_strip`: base diff difference (95% CI) sys_time 0.847 ± 0.010 0.896 ± 0.012 [ +5.2% .. +6.5%] user_time 3.377 ± 0.014 3.532 ± 0.015 [ +4.4% .. +4.8%] wall_time 3.962 ± 0.024 4.060 ± 0.030 [ +2.1% .. +2.8%] samples 47 30 With `-dead_strip` and `--icf=all`: base diff difference (95% CI) sys_time 0.935 ± 0.013 0.957 ± 0.018 [ +1.5% .. +3.2%] user_time 3.472 ± 0.022 6.531 ± 0.046 [ +87.6% .. +88.7%] wall_time 4.080 ± 0.040 5.329 ± 0.060 [ +30.0% .. +31.2%] samples 37 30 Unsurprisingly, ICF is now a lot slower, likely due to the much larger number of input sections it needs to process. But the rest of the linker only suffers a mild slowdown. Note that the compact-unwind-bad-reloc.s test was expanded because we now handle the relocation for CUE's function address in a separate code path from the rest of the CUE relocations. The extended test covers both code paths. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D109946	2021-11-12 16:02:49 -05:00
Jez Ng	ad8df21db2	[reland][lld-macho] Fix symbol relocs handling for compact unwind's functionAddress Clang seems to emit all functionAddress relocs as section relocs, but `ld -r` can turn those relocs into symbol ones. It turns out that we weren't handling that case correctly when the symbol was a weak def whose definition did not prevail. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D113702	2021-11-12 15:01:51 -05:00
Keith Smiley	eb6f9f3123	[lld-macho] Fix trailing slash in oso_prefix Previously if you passed `-oso_prefix path/to/foo/` with a trailing slash at the end, using `real_path` would remove that slash, but that slash is necessary to make sure OSO prefix paths end up as valid relative paths instead of starting with `/`. Differential Revision: https://reviews.llvm.org/D113541	2021-11-12 11:29:08 -08:00
Fangrui Song	a05384dc89	[ELF] Make --no-relax disable R_X86_64_GOTPCRELX and R_X86_64_REX_GOTPCRELX GOT optimization This brings back the original version of D81359. I have found several use cases now. * Unlike GNU ld, LLD's relocation processing is one pass. If we decide to optimize(relax) R_X86_64_{,REX_}GOTPCRELX, we will suppress GOT generation and cannot undo the decision later. Optimizing R_X86_64_REX_GOTPCRELX can usually make it easy to hit `relocation R_X86_64_REX_GOTPCRELX out of range` because the distance to GOT is usually shorter. Without --no-relax, the user has to recompile with `-Wa,-mrelax-relocations=no`. * The option would help during my investigation of the root cause of https://git.kernel.org/linus/09e43968db40c33a73e9ddbfd937f46d5c334924 * There is need for relaxation for AArch64 & RISC-V. Implementing this for x86-64 improves consistency with little target-specific cost (two-line X86_64.cpp change). Reviewed By: alexander-shaposhnikov Differential Revision: https://reviews.llvm.org/D113615	2021-11-12 09:47:31 -08:00
Kazu Hirata	835135a8ae	Revert "[lld-macho] Fix symbol relocs handling for compact unwind's functionAddress" This reverts commit `e941fe5061`. The commit in question causes: lld/MachO/InputFiles.cpp:916:13: error: use of undeclared identifier 'it'	2021-11-11 20:29:48 -08:00
Jez Ng	e941fe5061	[lld-macho] Fix symbol relocs handling for compact unwind's functionAddress Clang seems to emit all functionAddress relocs as section relocs, but `ld -r` can turn those relocs into symbol ones. It turns out that we weren't handling that case correctly when the symbol was a weak def whose definition did not prevail. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D113702	2021-11-11 22:53:35 -05:00
Petr Hosek	d56b171ee9	[lld][ELF] Support for R_ARM_THM_JUMP8 This change implements support for R_ARM_THM_JUMP8 relocation in addition to R_ARM_THM_JUMP11 which is already supported by LLD. Differential Revision: https://reviews.llvm.org/D21225	2021-11-11 09:06:52 -08:00
Igor Kudrin	d2dd36bbbe	[ELF] Better resemble GNU ld when placing orphan sections into memory regions An orphan section should be placed in the same memory region as its anchor section if the latter specifies the memory region explicitly. If there is no explicit assignment for the anchor section in the linker script, its memory region is selected by matching attributes, and the same should be done for the orphan section. Before the patch, some scripts that were handled smoothly in GNU ld caused an "error: no memory region specified for section" in lld. Differential Revision: https://reviews.llvm.org/D112925	2021-11-11 15:07:38 +07:00
Jez Ng	a2404f11c7	[lld-macho] Support renaming of LSDA section Previously, our unwind info finalization logic assumed that the LSDA section referenced by `__compact_unwind` was already finalized before `__TEXT,__unwind_info` itself. However, that assumption could be broken by the use of `-rename_section` -- it could be (and is) used to move `__gcc_except_tab` into a different segment later in the file. (__TEXT is always the first non-zerofill segment, so any rename basically guarantees that the section will be ordered after `__unwind_info`.) To handle this case, we compare LSDA relocations instead of their final values in `UnwindInfoSection::finalize()`, and we actually relocate those LSDAs in `UnwindInfoSection::writeTo()`. In order to do this, we need an easy way to track which Symbol a given CUE corresponds to. My solution was to change our `cuPtrVector` into a vector of indices, with each index used for both the symbols vector (`symbolsVec`) as well as the CUE vector (`cuVector`). This change seems perf neutral. Numbers for linking chromium_framework on my 16 core Mac Pro: base diff difference (95% CI) sys_time 1.248 ± 0.025 1.245 ± 0.026 [ -1.3% .. +0.8%] user_time 3.588 ± 0.045 3.587 ± 0.037 [ -0.6% .. +0.5%] wall_time 4.605 ± 0.069 4.595 ± 0.069 [ -1.0% .. +0.5%] samples 42 26 Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D113582	2021-11-10 19:31:54 -05:00
Fangrui Song	51ee08c217	[ELF] Enforce double-dash form for --ignore-{data,function}-pointer-equality --reproduce --thread They are LLD-specific options. We have enforced double-dash forms for other options (reduce collision with short options) but missed them.	2021-11-10 01:17:08 -08:00
Fangrui Song	d71bb6a409	[ELF] Inline isPPC64SmallCodeModelTocReloc which is only called once. NFC	2021-11-09 20:41:05 -08:00
Fangrui Song	bec28ee1ea	[ELF] Move isStaticLinkTimeConstant closer to the only caller processRelocAux. NFC	2021-11-09 20:37:46 -08:00
Fangrui Song	213d1849a4	[ELF] Improve sh_info=0 and sh_info>=num_sections diagnostic for SHT_REL/SHT_RELA PR52408 reported an sh_info=0 instance. I have seen sh_info=0 independently before. sh_info>=num_sections is probably very rare. Just use one diagnostic for the two types of errors. Delete invalid-relocations.test which is covered by invalid/bad-reloc-target.test Differential Revision: https://reviews.llvm.org/D113466	2021-11-09 09:54:12 -08:00
Vy Nguyen	2e1be96df6	Reland "[lld-macho] Fix assertion failure in registerCompactUnwind" PR/52372 Differential Revision: https://reviews.llvm.org/D112977 New changes: - use llvm-otool instead of `otool`, which doesn't exist on non-OSX platforms - add llvm-otool to the set of tools used by the tests so that the bot will use <build_dir>/bin/llvm-otool instead of the unqualified `llvm-otool` (which may not exist) - update tests since the latest (TOT) llvm-otool prints a space between two bytes and the old one doesn't.	2021-11-09 11:52:46 -05:00
Vy Nguyen	eb4a517816	Revert "[lld-macho] Fix assertion failure in registerCompactUnwind" broke windows build - reverting to investigate This reverts commit `b2d9258474`.	2021-11-09 10:31:47 -05:00
Vy Nguyen	b2d9258474	[lld-macho] Fix assertion failure in registerCompactUnwind PR/52372 Differential Revision: https://reviews.llvm.org/D112977	2021-11-09 10:08:17 -05:00
Fangrui Song	43bb5f0185	[docs] Remove outdated documentation for the legacy Atom-based LLD The outdated documentation diverges a lot from the current state of COFF/Mach-O/ELF/wasm ports and may just confuse users. It would be better to rewrite parts of it if they are still useful. Tested with `ninja docs-lld-html` Reviewed By: #lld-macho, lhames, Jez Ng Differential Revision: https://reviews.llvm.org/D113432	2021-11-08 15:20:16 -08:00
Fangrui Song	cebb0a64b4	[ELF][ARM] Improve error message for unknown relocation Like rLLD354040. Before: `error: unrecognized relocation Unknown (254)` Now: `error: unknown relocation (254) against symbol foo`	2021-11-08 12:39:08 -08:00
David Blaikie	78758026e2	Fix lld test after dwarfdump array syntax change	2021-11-05 23:00:29 -07:00
Fangrui Song	26a8ceba3e	[llvm-readobj] Display DT_RELRSZ/DT_RELRENT as " (bytes)" to match RELSZ/RELENT. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D113206	2021-11-05 10:02:49 -07:00
Quinn Pham	c71fbdd87b	[NFC] Inclusive language: Remove instances of master in URLs [NFC] This patch fixes URLs containing "master". Old URLs were either broken or redirecting to the new URL. Reviewed By: #libc, ldionne, mehdi_amini Differential Revision: https://reviews.llvm.org/D113186	2021-11-05 08:48:41 -05:00
Keith Smiley	a7a2959901	[lld-macho] Replace LC_LINKER_OPTION parsing This removes the tablegen based parsing of LC_LINKER_OPTION since it can only actually contain a very small number of potential arguments. In our project with tablegen this took 5 seconds before. This replaces https://reviews.llvm.org/D113075 Differential Revision: https://reviews.llvm.org/D113235	2021-11-04 22:03:40 -07:00
Fangrui Song	005456e5fc	[lld-macho] Fix an assertion failure when -u specifies an undefined section$start symbol This matches ld64. Also improve the test for `-dead_strip`. Reviewed By: #lld-macho, Jez Ng Differential Revision: https://reviews.llvm.org/D113147	2021-11-04 21:28:33 -07:00
Keith Smiley	0bce3e3b84	[lld-macho] Clear resolvedReads cache https://reviews.llvm.org/D113153#3108083 smeenai, int3 Differential Revision: https://reviews.llvm.org/D113198	2021-11-04 18:02:34 -07:00
Noah Shutty	d788c44f5c	[Support] Improve Caching conformance with Support library behavior This diff makes several amendments to the local file caching mechanism which was migrated from ThinLTO to Support in rGe678c51177102845c93529d457b020f969125373 in response to follow-up discussion on that commit. Patch By: noajshu Differential Revision: https://reviews.llvm.org/D113080	2021-11-04 13:00:44 -07:00
Keith Smiley	e7fdff403e	[lld-macho] Silently ignore the -objc_abi_version This undocumented ld64 flag, based on the most recent ld64 source dump from Xcode 12, only applies to i386. It seems like on all newer architectures this behavior is the default. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113070	2021-11-03 22:16:09 -07:00
Keith Smiley	d49e7244cc	[lld-macho] Cache readFile results In one of our links lld was reading 760k files, but the unique number of files was only 1500. This takes that link from 30 seconds to 8. This seems like a heavy hammer, especially since some things don't need to be cached, like the filelist arguments and the passed static archives (the latter is already cached as a one off), but it seems ld64 does something similar here to short circuit these duplicate reads: `82e429e186/src/ld/InputFiles.cpp (L644-L665)` Of the types of files being read for our iOS app, the biggest problem was constantly re-reading small tbd files: ``` % wc -l /tmp/read.txt 761414 /tmp/read.txt % cat /tmp/read.txt | sort -u | wc -l 1503 % cat /tmp/read.txt | grep "\.a$" | wc -l 43721 % cat /tmp/read.txt | grep "\.tbd$" | wc -l 717656 ``` We could likely hoist this logic up to not cache at this level, but it would be a more invasive change to make sure all callers that needed it cached the results. I could see this being an issue with OOMs, and I'm not a linker expert so maybe there's another way we should solve this problem? Feedback welcome! Reviewed By: int3, #lld-macho Differential Revision: https://reviews.llvm.org/D113153	2021-11-03 22:12:21 -07:00
Keith Smiley	6629ec3ecc	[lld-macho] Implement -arch_errors_fatal By default with ld64, architecture mismatches are just warnings, then this flag can be passed to make these fail. This matches that behavior. Reviewed By: int3, #lld-macho Differential Revision: https://reviews.llvm.org/D113082	2021-11-03 22:01:53 -07:00
Jez Ng	4ae8c83104	[lld-macho][nfc] Remove unnecessary -pie flags in tests D101513 means that we no longer need to specify `-pie` in most of our test RUN commands. Let's clean up the unused flags so as not to confuse future test writers. Reviewed By: #lld-macho, oontvoo, MaskRay Differential Revision: https://reviews.llvm.org/D113114	2021-11-04 00:02:03 -04:00
Keith Smiley	4313c56aa3	[lld-macho] Enable search-paths tests on macOS I'm not sure what the history is here but this test passes on macOS today. It seems like we should unify these tests if they need to run cross platform. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113085	2021-11-03 12:01:36 -07:00
Keith Smiley	63e65de3ff	[lld-macho] Cache discovered framework paths On our large iOS project this took a link from 1 minute 45 seconds to 45 seconds. For reference ld64 does the same link in ~20 seconds. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113063	2021-11-03 11:11:54 -07:00
Keith Smiley	f79e65e61f	[lld-macho] Cache library paths from findLibrary On top of https://reviews.llvm.org/D113063 this took another 10 seconds off our overall link time. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D113073	2021-11-03 10:02:23 -07:00
Fangrui Song	c977564fc2	Revert "[ELF] Try appeasing --target=armv7-linux-androideabi24 sanitizer symbolization tests" This reverts commit `5cbec88cbf`. Vitaly said that `2faac77f26` actually works. Sanitizer's armv7-linux-androideabi24 configuration has other issues which haven't been identified yet, but that's unrelated to the empty symbol name issue.	2021-11-03 00:56:09 -07:00
Fangrui Song	5cbec88cbf	[ELF] Try appeasing --target=armv7-linux-androideabi24 sanitizer symbolization tests	2021-11-02 18:57:04 -07:00
Vy Nguyen	37f96cb478	Revert "[lld-macho] Change bitfield types to be identical." This reverts commit `ae31f9fbad`. Reason: bitfields can't be merged across parent/child classes anyway. So this change doesn't help.	2021-11-02 16:57:51 -04:00
Vy Nguyen	ae31f9fbad	[lld-macho] Change bitfield types to be identical. Symbol's subclasses all have an additional bitfield of type uint8_t (RefState enum). For the bitfields in the same block to merge, they should be of the same type. (clang/gcc will work, but others like MSVC do not) Differential Revision: https://reviews.llvm.org/D113040	2021-11-02 15:48:39 -04:00
Nico Weber	64c1734438	[lld/mac] Write -v output to stderr This matches ld64, and it's conceivable that projects try to read this information off stderr for that reason. --version keeps writing to stdout. Differential Revision: https://reviews.llvm.org/D113020	2021-11-02 13:59:14 -04:00
Vy Nguyen	d7e5393af4	[lld-macho] Remove no_dtrace_dof from un-implemented group. One fewer warning. In practice, lld already "implements" it (i.e., it never does dtrace-dof processing). Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D112934	2021-11-02 12:36:08 -04:00
Vy Nguyen	3f35dd06a5	[lld-macho][nfc][cleanup] Fix a few code style lints and clang-tidy findings - Use .empty() instead of `size() == 0` when possible. - Use const-ref to avoid copying Differential Revision: https://reviews.llvm.org/D112978	2021-11-02 11:26:15 -04:00
Shoaib Meenai	7a4b27609d	[lld] Add test suite mode for running LLD main twice LLD_IN_TEST determines how many times each port's `main` function is run in each LLD process, and setting LLD_IN_TEST=2 (or higher) is useful for checking if we're cleaning up and resetting global state correctly. Add a test suite parameter to enable this easily. There's work in progress to remove global state (e.g. D108850), but this seems useful in the interim. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D112898	2021-11-01 14:26:54 -07:00
Fangrui Song	2f7366c89d	[ELF] Simplify R_DTPREL. NFC	2021-10-31 20:30:00 -07:00
Shoaib Meenai	264d3b6d4e	[MachO] Use error instead of fatal for missing -arch `fatal` should only be used for malformed inputs according to ErrorHandler.h; `error` is more appropriate for missing arguments, accompanied by a check to bail out early in case of the error. Some tests need to be adjusted accordingly. Makes `lld/test/MachO/arch.s` pass with `LLD_IN_TEST=2`. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D112879	2021-10-31 16:31:21 -07:00
Shoaib Meenai	0f6d720f1f	[MachO] Properly reset global state We need to reset global state between runs, similar to the other ports. There's some file-static state which needs to be reset as well and we need to add some new helpers for that. With this change, most LLD Mach-O tests pass with `LLD_IN_TEST=2` (which runs the linker twice on each test). Some tests will be fixed by the remainder of this stack, and the rest are fundamentally incompatible with that mode (e.g. they intentionally throw fatal errors). Fixes PR52070. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D112878	2021-10-31 16:14:29 -07:00
Nico Weber	f964ca896f	[lld/coff] Add parsing for /pdbpagesize: flag It's not used for anything yet, but we now accept `/pdbpagesize:4096` (the default behavior) and we give arguably more useful diagnostics for other values. It's plumbed through to the MSF layer, so just uncommenting out the bit in DriverUtils.cpp that rejects args other than 4096 is enough to try other values. Differential Revision: https://reviews.llvm.org/D112871	2021-10-31 18:36:23 -04:00
Fangrui Song	9f8ffaaa0b	[ELF] Replace "symbol '...' has no type" diagnostic with "relocation ... cannot be used against symbol '...'" The "symbol 'foo' has no type" diagnostic tries to inform that copy relocation/canonical PLT entry cannot be used, but the diagnostic is often incorrect and confusing.	2021-10-31 13:12:26 -07:00
Fangrui Song	164194a5af	[ELF] Untangle R_GOT style TLS IE and processRelocAux. NFC	2021-10-31 12:38:36 -07:00
Fangrui Song	55e69ece72	[ELF] Remove -Wl,-z,notext hint The hint does not pull its weight: * adding -Wl,-z,notext often won't work (relocation types other than `symbolRel`, e.g. `R_AARCH64_LDST32_ABS_LO12_NC`) * for pure (no assembly) C/C++ projects, the "-fPIC" hint is sufficient	2021-10-31 12:10:43 -07:00
Fangrui Song	b76aacef5f	[ELF] Simplify isStaticLinkTimeConstant. NFC	2021-10-31 10:46:42 -07:00
Fangrui Song	3fe4b54915	[ELF] Make getImplicitAddend return 0 for R_ARM_V4BX. NFC Will be useful if we move R_ARM_V4BX handling around.	2021-10-30 23:31:39 -07:00
Fangrui Song	aa1d32f519	[ELF][Mips] Use R_DTPREL for R_MIPS_TLS_DTPREL*	2021-10-30 21:58:43 -07:00
Nico Weber	2d48b19136	[lld/mac] Fix mislink with ICF When comparing relocations against two symbols, ICF's equalsConstant() did not look at the value of the two symbols. With subsections_via_symbols, the value is usually 0 but not always: In particular, it isn't 0 for constants in string and literal sections. Since we ignored the value, comparing two constant string symbols or two literal symbols always compared the 0th element, so functions in the same TU always compared as equal. This can cause mislinks, and, with -dead_strip, crashes. Fixes PR52349, see that bug for lots of details and examples of mislinks. While here, make the existing assembly in icf-literals.s a bit more realistic (use leaq instead of movq with strings, and use foo(%rip) instead of foo@gotpcrel(%rip)). This has no interesting effect, it just maybe makes the test look a bit less surprising. Differential Revision: https://reviews.llvm.org/D112862	2021-10-30 18:58:59 -04:00
Sam Clegg	182b72aa48	[lld][WebAssembly] Generate TLS relocation code also when linking statically Previously relocations were only generated for PIC output, but relocations for TLS GOT entries are always needed when shared memory is enabled, not just in PIC mode. This means that the `__wasm_apply_global_tls_relocs` is now generated even for statically linked (non-PIC) output. Without this the globals that hold the addresses of TLS symbols are not set correctly. Differential Revision: https://reviews.llvm.org/D112833	2021-10-29 13:26:35 -07:00
Sam Clegg	fad05465c1	[lld][WebAssembly] Handle TLS variables in Symbol::getVA. NFC In the shared memory case we can always assume that TLS addresses are relative to __tls_base. In the non-shared memory case TLS variables are absolute, just like normal data addresses. This simplifies the code in calcNewValue so that TLS relocations no longer need special handling. Differential Revision: https://reviews.llvm.org/D112831	2021-10-29 10:45:30 -07:00
Jez Ng	6c2f26a159	[lld-macho] -all_load and -ObjC should not affect LC_LINKER_OPTION flags In particular, they should not cause archives to be eagerly loaded. This matches ld64's behavior. Fixes PR52246. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D112756	2021-10-29 11:00:28 -04:00
Jez Ng	a271f2410f	[lld-macho][nfc] Canonicalize all pointers to InputSections early on Having to remember to call `canonical()` all over the place is error-prone; let's do it in a centralized location instead. It also appears to improve performance slightly. base diff difference (95% CI) sys_time 0.984 ± 0.009 0.983 ± 0.014 [ -0.8% .. +0.6%] user_time 6.508 ± 0.035 6.475 ± 0.036 [ -0.8% .. -0.2%] wall_time 5.321 ± 0.034 5.300 ± 0.033 [ -0.7% .. -0.1%] samples 36 23 Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D112687	2021-10-29 11:00:28 -04:00
Fangrui Song	3a4b605bc1	[lld-macho] Internalize createFiles. NFC	2021-10-28 22:14:37 -07:00
Fangrui Song	6fcc19afb9	[ELF] Simplify R_TPREL formula after D111365	2021-10-28 21:03:53 -07:00
Fangrui Song	6e04ec801b	[docs] Fix docs-lld-html	2021-10-28 18:44:44 -07:00
Fangrui Song	e39c138f45	[ELF] Implement TLSDESC for x86-32 `-z rela` is also supported. Tested with: ``` cat > ./a.c <<eof #include <assert.h> int foo(); int bar(); int main() { assert(foo() == 2); assert(foo() == 4); assert(bar() == 2); assert(bar() == 4); } eof cat > ./b.c <<eof #include <stdio.h> __thread int tls0; extern __thread int tls1; int foo() { return ++tls0 + ++tls1; } static __thread int tls2, tls3; int bar() { return ++tls2 + ++tls3; } eof echo '__thread int tls1;' > ./c.c sed 's/ /\t/' > ./Makefile <<'eof' .MAKE.MODE = meta curDirOk=true CC := gcc -m32 -g -fpic -mtls-dialect=gnu2 LDFLAGS := -m32 -Wl,-rpath=. all: a0 a1 a2 run: all ./a0 && ./a1 && ./a2 c.so: c.o; ${LINK.c} -shared $> -o $@ bc.so: b.o c.o; ${LINK.c} -shared $> -o $@ b.so: b.o c.so; ${LINK.c} -shared $> -o $@ a0: a.o b.o c.o; ${LINK.c} $> -o $@ a1: a.o b.so; ${LINK.c} $> -o $@ a2: a.o bc.so; ${LINK.c} $> -o $@ eof ``` and glibc `elf/tst-gnu2-tls1`. `/usr/local/bin/ld` points to the freshly built `lld`. `bmake run && bmake CFLAGS=-O1 run` => ok. Differential Revision: https://reviews.llvm.org/D112582	2021-10-28 17:52:03 -07:00
Sam Clegg	1eb79e732c	[lld][WebAssembly] Initialize bss segments using memory.fill Previously we were relying on the dynamic loader to take care of this but it is simple and correct for us to do it here instead. Now we initialize bss segments as part of `__wasm_init_memory` at the same time we initialize passive segments. In addition we extend the use of `__wasm_init_memory` outside of shared memory situations. Specifically it is now used to initialize bss segments when the memory is imported. Differential Revision: https://reviews.llvm.org/D112667	2021-10-28 17:15:08 -07:00
Sam Clegg	50bfc45109	[lld][WebAssembly] Always enable mutable-globals feature in PIC mode This works around an issue where the feature can be forgotten in the case of LTO + object file with no functions. See: https://bugs.llvm.org/show_bug.cgi?id=52339 Differential Revision: https://reviews.llvm.org/D112769	2021-10-28 16:24:54 -07:00
Sam Clegg	28848e9e1b	[lld][WebAssembly] Handle duplicate archive member names in ThinLTO This entire change, including the test case, comes almost verbatim from the ELF driver. Fixes: https://github.com/emscripten-core/emscripten/issues/12763 Differential Revision: https://reviews.llvm.org/D112723	2021-10-28 11:48:04 -07:00
Sam Clegg	4da38c14d0	[lld] Rename addCombinedLTOObjects to match ELF driver. NFC This function was renamed in https://reviews.llvm.org/D62291. The new name seems more accurate and also it's good to maintain some consistency between these methods in the different drivers. Differential Revision: https://reviews.llvm.org/D112719	2021-10-28 11:46:19 -07:00
Fangrui Song	2b1e32410c	[ELF] Change common diagnostics to report both object file location and source file location Many diagnostics use `getErrorPlace` or `getErrorLocation` to report a location. In the presence of line table debug information, `getErrorPlace` uses a source file location and ignores the object file location. However, the object file location is sometimes more useful. This patch changes "undefined symbol" and "out of range" diagnostics to report both object/source file locations. Other diagnostics can use similar format if needed. The key idea is to let `InputSectionBase::getLocation` report the object file location and use `getSrcMsg` for source file/line information. `getSrcMsg` doesn't leverage `STT_FILE` information yet, but I think the temporary lack of the functionality is ok. For the ARM "branch and link relocation" diagnostic, I arbitrarily place the source file location at the end of the line. The diagnostic is not very common so its formatting doesn't need to be pretty. Differential Revision: https://reviews.llvm.org/D112518	2021-10-28 09:38:45 -07:00
Sam Clegg	e091a66cb7	[lld][ELF] Update name of function in comment. NFC This function was renamed in https://reviews.llvm.org/D62291.	2021-10-28 07:29:43 -07:00
Vincent Lee	d54360cd32	[lld-macho] Implement -S There are a couple internal builds that require the use of this flag. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D112594	2021-10-27 17:09:57 -07:00
Nico Weber	7f369304df	[lld/mac] Don't crash on undefined symbols with --icf=all ICF runs before relocation processing, but undefined symbol errors are only emitted during relocation processing. So just ignore Undefineds during ICF (instead of crashing) -- lld will emit an error once ICF is done. Fixes PR52330. Differential Revision: https://reviews.llvm.org/D112643	2021-10-27 16:20:10 -04:00
Jez Ng	b7e12ca7aa	[lld-macho] If export_size is zero, export_off must be zero Otherwise tools like codesign_allocate will choke. We were already handling this correctly for the other DYLD_INFO sections. Doing this correctly is a bit subtle: we don't know if export_size will be zero until we have run `ExportSection::finalizeContents()`. However, we must still add the ExportSection to the `__LINKEDIT` segment in order that it gets sorted during `sortSectionsAndSegments()`. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D112589	2021-10-27 14:58:42 -04:00
Nico Weber	6503a68565	[lld/mac] Don't assert when ICFing arm64 code WordLiteralSection dedupes literals by content. WordLiteralInputSection::getOffset() used to read a literal at the passed-in offset and look up this value in the deduping map to find the offset of the deduped value. But it's possible that (e.g.) a 16-byte literal's value is accessed 4 bytes in. To get the offset at that address, we have to get the deduped value at offset 0 and then apply the offset 4 to the result. (See also WordLiteralSection::finalizeContents() which fills in those maps.) Only a problem on arm64 because in x86_64 the offset is part of the instruction instead of a separate ARM64_RELOC_ADDEND relocation. (See bug for more details.) Fixes PR51999. Differential Revision: https://reviews.llvm.org/D112584	2021-10-27 14:02:07 -04:00
Sam Clegg	1aeb4c4a43	[lld][WebAssembly] Convert tests to use disassembly. NFC Differential Revision: https://reviews.llvm.org/D112590	2021-10-27 10:34:52 -07:00
Fangrui Song	ecc93ed2d7	[ELF] Replace InputBaseSection::{areRelocsRela,firstRelocation,numRelocation} with relSecIdx For `InputSection` `.foo`, its `InputBaseSection::{areRelocsRela,firstRelocation,numRelocation}` basically encode the information of `.rel[a].foo`. However, one uint32_t (the relocation section index) suffices. See the implementation of `relsOrRelas`. This change decreases sizeof(InputSection) from 184 to 176 on 64-bit Linux. The maximum resident set size linking a large application (1.2G output) decreases by 0.39%. Differential Revision: https://reviews.llvm.org/D112513	2021-10-27 09:51:07 -07:00
Fangrui Song	35c3f5610c	[ELF][X86] Write R_X86_64_TLSDESC addends with -z rel Similar to D100544 for AArch64. Reviewed By: arichardson Differential Revision: https://reviews.llvm.org/D112592	2021-10-27 09:35:30 -07:00
Nico Weber	9f90347588	fix comment typos to cycle bots	2021-10-27 09:53:08 -04:00
Jez Ng	1d2a4cd57d	[lld-macho] Fix compact-unwind-bad-reloc.s test Broken by `a9353dbe51`. Now that the functions point to the compact unwind entries, instead of the other way around, we need to perform the "invalid reference" check in a different place. This change was originally part of the stacked diff D109946, but should have been included as part of D109945.	2021-10-26 18:59:12 -04:00
Nuri Amari	a299b24712	Regenerate LC_CODE_SIGNATURE during llvm-objcopy operations Context: This is a second attempt at introducing signature regeneration to llvm-objcopy. In this diff: https://reviews.llvm.org/D109840, a script was introduced to test the validity of a code signature. In this diff: https://reviews.llvm.org/D109803 (now reverted), an effort was made to extract the signature generation behavior out of LLD into a common location for use in llvm-objcopy. In this diff: https://reviews.llvm.org/D109972 it was decided that there was no appropriate common location and that a small amount of duplication to bring signature generation to llvm-objcopy would be better. This diff introduces this duplication. Summary: Prior to this change, if an LC_CODE_SIGNATURE load command was included in the binary passed to llvm-objcopy, the command and associated section were simply copied and included verbatim in the new binary. If the rest of the binary was modified at all, this resulted in an invalid Mach-O file. This change regenerates the signature rather than copying it. The code_signature_lc.test test was modified to include the yaml representation of a small signed MachO executable in order to effectively test the signature generation. Reviewed By: alexander-shaposhnikov, #lld-macho Differential Revision: https://reviews.llvm.org/D111164	2021-10-26 14:51:13 -07:00
Jez Ng	a9353dbe51	[lld-macho] Simplify the handling of "no unwind info" functions This diff does away with `addEntriesForFunctionsWithoutUnwindInfo()`, because `addSymbol()` can now determine which functions need those entries. While overhauling UnwindInfoSection, I also parallelized the relocation of the contents of the CUEs. This somewhat offsets the time regression from creating one InputSection per CUE (which was done in D109944). Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D109945	2021-10-26 16:04:16 -04:00
Jez Ng	002eda7056	[lld-macho] Associate compact unwind entries with function symbols Compact unwind entries (CUEs) contain pointers to their respective function symbols. However, during the link process, it's far more useful to have pointers from the function symbol to the CUE than vice versa. This diff adds that pointer in the form of `Defined::compactUnwind`. In particular, when doing dead-stripping, we want to mark CUEs live when their function symbol is live; and when doing ICF, we want to dedup sections iff the symbols in that section have identical CUEs. In both cases, we want to be able to locate the symbols within a given section, as well as locate the CUEs belonging to those symbols. So this diff also adds `InputSection::symbols`. The ultimate goal of this refactor is to have ICF support dedup'ing functions with unwind info, but that will be handled in subsequent diffs. This diff focuses on simplifying `-dead_strip` -- `findFunctionsWithUnwindInfo` is no longer necessary, and `Defined::isLive()` is now a lot simpler. Moreover, UnwindInfoSection no longer has to check for dead CUEs -- we simply avoid adding them in the first place. Additionally, we now support stripping of dead LSDAs, which follows quite naturally since `markLive()` can now reach them via the CUEs. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D109944	2021-10-26 16:04:15 -04:00
Jez Ng	622150ad5f	[lld-macho] Put GOT into `__DATA` segment where appropriate We were previously always emitting the GOT into `__DATA_CONST`, even for target platforms where it should end up in `__DATA`. I stumbled onto this while trying to use the `class-dump` tool -- with the wrong segment names, it fails to locate the ObjC runtime info and therefore fails to dump any classes. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D112500	2021-10-26 11:38:01 -04:00
Vy Nguyen	e5fb79b314	[lld-macho] Make test produce the dead.o and live.o that are used below. Follow up fix to breakages in D112485	2021-10-25 22:10:24 -04:00
Vy Nguyen	46ef187dcc	[lld-macho] Fix incremental build (again) from D112485	2021-10-25 21:51:34 -04:00
Jez Ng	d3ddd569eb	[lld-macho] Fix incremental builds	2021-10-25 20:51:05 -04:00
Fangrui Song	3b42fc8a07	[ELF] Simplify sortSection. NFC	2021-10-25 16:57:46 -07:00
Jez Ng	413e249a47	[lld-macho][nfc] Test that we don't emit undef symbol errors for dead code This is what ld64 does too, so we have parity here (though I think ld64 still removes dead code more effectively than we do...) Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D112485	2021-10-25 19:05:39 -04:00
Fangrui Song	4d9f6caee3	[ELF] Change SharedFile::soName from std::string to StringRef	2021-10-25 15:54:04 -07:00
Fangrui Song	25da870057	[ELF] Remove irrelevant group signature hack working around old gold -r	2021-10-25 15:09:08 -07:00
Fangrui Song	43753f8f9d	[ELF] Remove irrelevant SHT_INIT_ARRAY/SHT_FINI_ARRAY hack The hack is irrelevant for two reasons: * binutils 2.24 is quite old and cannot handle R_X86_64_REX_GOTPCRELX from 2016 onwards anyway * `canMergeToProgbits` allows combining SHT_INIT_ARRAY/SHT_FINI_ARRAY into SHT_PROGBITS	2021-10-25 14:23:05 -07:00
Fangrui Song	6506907a0a	[ELF] Update comments/diagnostics for -defsym and -image-base to use the canonical two-dash form	2021-10-25 14:01:36 -07:00
Fangrui Song	ca8105b76c	[ELF][X86] Support R_X86_64_PLTOFF64 For a function call (using the default `-fplt`), GCC `-mcmodel=large` generates an assembly modifier which leads to an R_X86_64_PLTOFF64 relocation. In real world, http://git.ageinghacker.net/jitter (used by GNU poke) uses `-mcmodel=large`. R_X86_64_PLTOFF64's formula is (if preemptible) `L - GOT + A` or (if non-preemptible) `S - GOT + A` where `GOT` is (confusingly) the address of `.got.plt` Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D112386	2021-10-25 13:05:17 -07:00
Fangrui Song	a14ccaf509	[ELF] Support 128-bit bitmask in oneof(RelExpr) Taken from Chih-Mao Chen's D100835. RelExpr has 64 bits now and needs the extension to support new members (`R_PLT_GOTPLT` for `R_X86_64_PLTOFF64` support). Note: RelExpr needs to have at least a member >=64 to prevent -Wtautological-constant-out-of-range-compare for `if (expr >= 64)`. Reviewed By: arichardson, peter.smith Differential Revision: https://reviews.llvm.org/D112385	2021-10-25 13:05:17 -07:00
Fangrui Song	bf6e259b21	[ELF] Update comments/diagnostics for some long options to use the canonical two-dash form Rewrite some comments as appropriate.	2021-10-25 12:52:06 -07:00
Fangrui Song	4ae1c2c6f1	[ELF] Delete unneeded hack for discarding empty name local symbol This actually improves GNU ld compatibility. Correct assemblers don't create such symbols. Also simplify the code.	2021-10-25 11:55:31 -07:00
Vy Nguyen	7d549acbb6	[lld-macho][nfc] Rename output binary so it doesn't overwrite existing one `%t/basics` already exists - it would be nice to be able to examine it afterward Differential Revision: https://reviews.llvm.org/D112392	2021-10-25 09:55:40 -04:00
Fangrui Song	815a1207bf	[ELF] Remove ignored options that likely nobody uses GNU ld doesn't support `--no-pic-executable`. `-p` has been removed from likely the only use case (Linux kernel) for over 2.5 years: https://git.kernel.org/linus/091bb549f7722723b284f63ac665e2aedcf9dec9 `--no-add-needed` was the pre-binutils-2.23 spelling for `--no-copy-dt-needed-entries`. The legacy alias is irrelevant in 2021.	2021-10-24 18:29:45 -07:00
Kazu Hirata	4bd46501c3	Use llvm::any_of and llvm::none_of (NFC)	2021-10-24 17:35:33 -07:00
Kazu Hirata	4ba9d9c84f	Use StringRef::contains (NFC)	2021-10-23 20:41:46 -07:00
Vy Nguyen	236197e2d0	[lld-macho] Implement -oso_prefix https://bugs.llvm.org/show_bug.cgi?id=50229 Differential Revision: https://reviews.llvm.org/D112291	2021-10-22 16:32:42 -04:00
Jez Ng	77fdc0e56b	[lld-macho] Simplify lc-linker-option.ll and re-enable it on Windows While attempting to simplify it, I discovered a concerning discrepancy between our handling of LC_LINKER_OPTION vs ld64's. In particular, ld64 does not appear to check for `-all_load` nor `-ObjC` when processing those options. Thus, if/when we fix this behavior, no duplicate symbol error will be expected regardless of the use-after-free. As such, I've removed the test logic that tries to induce the duplicate symbol error. We can just rely on ASAN to do the verification. In order to make the test run on Windows, I've removed the symlink logic. Both ld64 and LLD handle this un-symlinked framework just fine. I also capitalized the framework name, since that's the typical convention. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D112195	2021-10-21 11:23:44 -04:00
Igor Kudrin	1302fdc233	[ELF] Avoid adding an orphan section to a less suitable segment If segments are defined in a linker script, placing an orphan section before the found closest-rank section can result in adding it in a previous segment and changing flags of that segment. This happens if the orphan section has a lower sort rank than the found section. To avoid that, the patch forces orphan sections to be moved after the found section if segments are explicitly defined. Differential Revision: https://reviews.llvm.org/D111717	2021-10-21 11:38:39 +07:00
Vy Nguyen	6b715e9c4d	[lld-macho][nfc] Added some notes on deliberate differences btw LD64 vs LLD-MACHO For future references and to help with debugging crashes, this could be useful. Differential Revision: https://reviews.llvm.org/D110464	2021-10-20 22:41:57 -04:00
Jez Ng	9ef55ddc3f	[lld-macho] Temporarily disable lc-linker-option.ll on Windows It's currently using a symlink, which is not supported on Windows.	2021-10-20 20:05:30 -04:00
Nico Weber	1412719066	[lld/mac] Remove else-after-return in ICF code No behavior change.	2021-10-20 14:24:13 -04:00
Kaining Zhong	aab0f2264a	[lld-macho] Fix dangling string reference when adding frameworks In Driver.cpp, addFramework used std::string instance to represent the path of a framework, which will be freed after the function returns. However, this string is stored in loadedArchive, which will be used later to compare with path of newly added frameworks. This caused https://bugs.llvm.org/show_bug.cgi?id=52133. A test is included in this commit to reproduce this bug. Now resolveDylibPath returns a StringRef instance, and it uses StringSaver to save its data, then returns it to functions on the top. This ensures the resolved framework path is still valid after LC_LINKER_OPTION is parsed. Reviewed By: int3, #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D111706	2021-10-20 11:21:40 -04:00
Paulo Matos	6d0c7bc17d	[WebAssembly] Implementation of table.get/set for reftypes in LLVM IR This change implements new DAG nodes TABLE_GET/TABLE_SET, and lowering methods for load and stores of reference types from IR arrays. These global LLVM IR arrays represent tables at the Wasm level. Differential Revision: https://reviews.llvm.org/D111154	2021-10-20 10:31:31 +02:00
Noah Shutty	e678c51177	[Support][ThinLTO] Move ThinLTO caching to LLVM Support library We would like to move ThinLTO’s battle-tested file caching mechanism to the LLVM Support library so that we can use it elsewhere in LLVM. Patch By: noajshu Differential Revision: https://reviews.llvm.org/D111371	2021-10-18 18:57:25 -07:00
Petr Hosek	8e46e34d24	Revert "[Support][ThinLTO] Move ThinLTO caching to LLVM Support library" This reverts commit `92b8cc52bb` since it broke the gold plugin.	2021-10-18 12:24:05 -07:00
Noah Shutty	92b8cc52bb	[Support][ThinLTO] Move ThinLTO caching to LLVM Support library We would like to move ThinLTO’s battle-tested file caching mechanism to the LLVM Support library so that we can use it elsewhere in LLVM. Patch By: noajshu Differential Revision: https://reviews.llvm.org/D111371	2021-10-18 12:08:49 -07:00
Kazu Hirata	8568ca789e	Use llvm::erase_if (NFC)	2021-10-18 09:33:42 -07:00
gbreynoo	f2c144fc18	[LLD][TEST] Add testing for negative addends for R_X86_64_32 and R_X86_64_PC32 relocations This change is derived from a test case we have locally but I could not see an equivalent in LLD's testing. Differential Revision: https://reviews.llvm.org/D111803	2021-10-18 16:38:33 +01:00
Kazu Hirata	10726992fa	Use llvm::erase_value (NFC)	2021-10-16 23:31:21 -07:00
Fangrui Song	f8ee74fc13	[ELF] Require two-dash form for --pack-dyn-relocs LLD specific options can be more rigid. Also add a test.	2021-10-15 15:36:30 -07:00
Sam Clegg	659a08399a	[WebAssembly] Add import info to `dylink` section of shared libraries See https://github.com/WebAssembly/tool-conventions/pull/175 Differential Revision: https://reviews.llvm.org/D111345	2021-10-15 11:49:16 -07:00
Nico Weber	4e572db0c2	[lld/mac] Mark private externs with GOT relocs as LOCAL in indirect symbtab prepareSymbolRelocation() in Writer.cpp adds both symbols that need binding and symbols relocated with a pointer relocation to the got. Pointer relocations are emitted for non-movq GOTPCREL(%rip) loads. (movqs become GOT_LOADs so that the linker knows they can be relaxed to leaqs, while others, such as addq, become just GOT -- a pointer relocation -- since they can't be relaxed in that way). For example, this C file produces a private_extern GOT relocation when compiled with -O2 with clang: extern const char kString[]; const char* g(int a) { return kString + a; } Linkers need to put pointer-relocated symbols into the GOT, but ld64 marks them as LOCAL in the indirect symbol table. This matters, since `strip -x` looks at the indirect symbol table when deciding what to strip. The indirect symtab emitting code was assuming that only symbols that need binding are in the GOT, but pointer relocations were there too. Hence, the code needs to explicitly check if a symbol is a private extern. Fixes https://crbug.com/1242638, which has some more information in comments 14 and 15. With this patch, the output of `nm -U` on Chromium Framework after stripping now contains just two symbols when using lld, just like with ld64. Differential Revision: https://reviews.llvm.org/D111852	2021-10-15 13:24:47 -04:00
Heejin Ahn	9261ee32dc	[WebAssembly] Make EH work with dynamic linking This makes Wasm EH work with dynamic linking. So far we were only able to handle destructors, which do not use any tags or LSDA info. 1. This uses `TargetExternalSymbol` for `GCC_except_tableN` symbols, which points to the address of per-function LSDA info. It is more convenient to use than `MCSymbol` because it can take additional target flags. 2. When lowering `wasm_lsda` intrinsic, if PIC is enabled, make the symbol relative to `__memory_base` and generate the `add` node. If PIC is disabled, continue to use the absolute address. 3. Make tag symbols (`__cpp_exception` and `__c_longjmp`) undefined in the backend, because it is hard to make it work with dynamic linking's loading order. Instead, we make all tag symbols undefined in the LLVM backend and import it from JS. 4. Add support for undefined tags to the linker. Companion patches: - https://github.com/WebAssembly/binaryen/pull/4223 - https://github.com/emscripten-core/emscripten/pull/15266 Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D111388	2021-10-12 23:28:27 -07:00
Nico Weber	f09dce564e	[lld] fix typos to cycle bots	2021-10-12 17:03:39 -04:00
Andrew Ng	649cc160e3	[ELF][test] Add testing for dynamic TLS relocations in .debug_info Differential Revision: https://reviews.llvm.org/D111436	2021-10-12 10:54:52 +01:00
Fangrui Song	71ec1e5015	[ELF] Demote !isUsedInRegularObj lazy symbol I think D79300 has fixed the D51892 (`__i686.get_pc_thunk.bx`) issue, so we can bring back rL330869. D79300 says `would error undefined symbol instead of the more relevant discarded section` but it doesn't reproduce now. This avoids a quirk in `isUndefWeak()`. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D111365	2021-10-11 09:46:31 -07:00
Ben Dunbobbin	aaeba6483f	[LLD] [TEST] Add test case for patching an absolute relocation to a weak undef I noticed that we had this case in our internal testsuite but couldn't find it in LLD's tests. This adds that case. Differential Revision: https://reviews.llvm.org/D110716	2021-10-11 13:14:45 +01:00
Keith Smiley	dfeaa1941b	[lld][test] Remove /usr/local/lib test requirement This field only exists if the directory exists on the machine running the test. It likely exists for most Intel macOS users because of homebrew, but doesn't exist on some of the CI machines. This unfortunately makes this test a bit less strict. Differential Revision: https://reviews.llvm.org/D111361	2021-10-07 15:17:52 -07:00
Keith Smiley	0885afb8b0	[lld][test] Fix darwin REQUIRES (NFC) Some subprojects like compiler-rt define the `darwin` feature in their lit config, but lld does not do that, so we need to use the global system-darwin here instead. This test seems to have drifted from the actual behavior so I also had to add `/usr/local/lib` here to make it pass. Differential Revision: https://reviews.llvm.org/D111268	2021-10-07 12:37:37 -07:00
Heejin Ahn	3ec1760d91	[WebAssembly] Remove WasmTagType This removes `WasmTagType`. `WasmTagType` contained an attribute and a signature index: ``` struct WasmTagType { uint8_t Attribute; uint32_t SigIndex; }; ``` Currently the attribute field is not used and reserved for future use, and always 0. And that this class contains `SigIndex` as its property is a little weird, because the tag type's signature index is not an inherent property of a tag but rather a reference to another section that changes after linking. This makes tag handling in the linker also weird in that tag-related methods are taking both `WasmTagType` and `WasmSignature` even though `WasmTagType` contains a signature index. This is because the signature index changes in linking so it doesn't have any info at this point. This instead moves `SigIndex` to `struct WasmTag` itself, as we did for `struct WasmFunction` in D111104. In this CL, in lib/MC and lib/Object, this now treats tag types in the same way as function types. Also in YAML, this removes `struct Tag`, because now it only contains the tag index. Also tags set `SigIndex` in `WasmImport` union, as functions do. I think this makes things simpler and makes tag handling more in line with function handling. These two share similar properties in that both of them have signatures, but they are kind of nominal so having the same signature doesn't mean they are the same element. Also a drive-by fix: the reserved 'attribute' part's encoding changed from uleb32 to uint8 a while ago. This was fixed in lib/MC and lib/Object but not in YAML. This doesn't change object files because the field's value is always 0 and its encoding is the same for both encodings. This is effectively NFC; I didn't mark it as such just because it changed YAML test results. Reviewed By: sbc100, tlively Differential Revision: https://reviews.llvm.org/D111086	2021-10-05 17:11:22 -07:00
Heejin Ahn	9a9ec8e04b	[lld][WebAssembly] Remove redundant check for undefined global (NFC) Also does some refactoring. Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D111101	2021-10-05 15:11:27 -07:00
Sam Clegg	8fe128476e	[lld][WebAssembly] Create optional internal symbols only after LTO object has been added This is important for the cases where new symbols can be introduced during LTO. Specifically this happens during TLS-lowering where references to `__tls_base` can be introduced. Fixes: https://github.com/emscripten-core/emscripten/issues/12489 Differential Revision: https://reviews.llvm.org/D111171	2021-10-05 13:31:09 -07:00
Andrew Ng	3334b9d70b	[ELF][test] Enhance relative dynamic relocation tests Add checking of the value of the relocation with an addend. Also check all relocation offsets. Differential Revision: https://reviews.llvm.org/D111071	2021-10-05 11:32:22 +01:00
Igor Kudrin	65c284a7be	[ELF][test][NFC] Make a test standard compliant PT_LOAD segments in the program header must be sorted by their virtual addresses, so they should be defined in a similar order as the associated sections. Differential Revision: https://reviews.llvm.org/D111068	2021-10-05 11:40:02 +07:00
Sam Clegg	c0039de295	[Object][WebAssembly] Report function types (signatures). NFC This simplifies the code in a number of ways and avoids having to track functions and their types separately. Differential Revision: https://reviews.llvm.org/D111104	2021-10-04 17:33:56 -07:00
Nico Weber	f3091831f4	[lld] Use checkError more No behavior change.	2021-10-04 11:46:16 -04:00
Andrew Ng	39f3f7c08f	[ELF][test] Fix several LLD ICF tests A number of the ICF tests were not updated to use --print-icf-sections instead of --verbose and various '-NOT' checks were not updated to the latest output format of --print-icf-sections. Because these are all 'negative' tests, these issues have gone unnoticed. Differential Revision: https://reviews.llvm.org/D110353	2021-10-04 11:10:10 +01:00
Daniel Rodríguez Troitiño	657f02d458	Revert "Extract LC_CODE_SIGNATURE related implementation out of LLD" This reverts commit `cc8229603b`. As discussed in the review of https://reviews.llvm.org/D109972, this was not the right approach, so we are reverting to start with a different approach. Differential Revision: https://reviews.llvm.org/D110974	2021-10-01 17:19:50 -07:00
Teresa Johnson	b55a964197	Second attempt to fix Windows failures from test changes Try to address Windows flakes from `d87bdc272b` by adding "\|\| true" as suggested in D110276 so the whole test doesn't fail when Windows thinks it can't remove the binary.	2021-09-29 19:24:35 -07:00
Teresa Johnson	2f1b99ca67	Use rm -f to fix Windows failures from test changes Try to address Windows flakes from `d87bdc272b` by using 'rm -f' instead of just 'rm' as discussed in D110276. For example: http://45.33.8.238/win/46115/step_7.txt	2021-09-29 08:01:22 -07:00
Nico Weber	c19315ef60	[lld/mac] Don't warn on both --icf=all and -no_deduplicate Instead, just make the later flag win, like usual. Implement this by making -no_deduplicate an actual alias for --icf=none at the Options.td level. Differential Revision: https://reviews.llvm.org/D110672	2021-09-29 08:25:21 -04:00
Teresa Johnson	d87bdc272b	Clean up large copies of binaries copied into temp directories in tests In looking at the disk space used by a ninja check-all, I found that a few of the largest files were copies of clang and lld made into temp directories by a couple of tests. These tests were added in D53021 and D74811. Clean up these copies after usage. Differential Revision: https://reviews.llvm.org/D110276	2021-09-28 17:04:09 -07:00
Shoaib Meenai	f9b3c18e74	[CodeGen] Fix wrapping personality symbol on ARM The ARM backend was explicitly setting global binding on the personality symbol. This was added without any comment in `a7ec2dcefd`, which introduced EHABI support (back in 2011). None of the other backends do anything equivalent, as far as I can tell. This causes problems when attempting to wrap the personality symbol. Wrapped symbols are marked as weak inside LTO to inhibit IPO (see https://reviews.llvm.org/D33621). When we wrap the personality symbol, it initially gets weak binding, and then the ARM backend attempts to change the binding to global, which causes an error in MC because of attempting to change the binding of a symbol from non-global to global (the error was added in https://reviews.llvm.org/D90108). Simply drop the ARM backend's explicit global binding setting to fix this. This matches all the other backends, and a large internal application successfully linked and ran with this change, so it shouldn't cause any problems. Test via LLD, since wrapping is required to exhibit the issue. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D110609	2021-09-28 15:01:05 -07:00
Fangrui Song	74a47e54be	[llvm-objdump] Fix -R display and support ET_EXEC * Add a newline before `DYNAMIC RELOCATION RECORDS` (see D101796) * Add the missing `OFFSET TYPE VALUE` line * Align columns Note: llvm-readobj/ELFDumper.cpp `loadDynamicTable` has sophisticated PT_DYNAMIC code which is unavailable in llvm-objdump. Reviewed By: jhenderson, Higuoxing Differential Revision: https://reviews.llvm.org/D110595	2021-09-28 09:58:27 -07:00
Fangrui Song	2bf06d9345	[ELF] Support symbol names with space in linker script expressions Fix PR51961 Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D110490	2021-09-27 09:50:42 -07:00
Fangrui Song	db6a00daa0	[ELF] Remove unneeded binding parameter from addOptionalRegular. NFC __rela_iplt_start uses spurious STB_WEAK, but it doesn't matter because STV_HIDDEN overrides the binding.	2021-09-25 15:47:27 -07:00
Fangrui Song	d23fd8ae89	[ELF] Replace noneRel = R__NONE with static constexpr. NFC All architectures define R__NONE to 0.	2021-09-25 15:16:44 -07:00
Fangrui Song	40cd4db442	[ELF] Default gotBaseSymInGotPlt to false (NFC for most architectures) Most architectures use .got instead of .got.plt, so switching the default can minimize customization. This fixes an issue for SPARC V9 which uses .got . AVR, AMDGPU, and MSP430 don't seem to use _GLOBAL_OFFSET_TABLE_.	2021-09-25 15:06:09 -07:00
Fangrui Song	a892c0e49e	[ELF][test] Improve test coverage	2021-09-25 11:57:54 -07:00
Mike Hommey	08ef24f6ab	Wrap xar/xar.h include in extern "C" block Without such wrapping, linking lld fails with missing symbols because of C++ symbol mangling with older versions of the MacOSX SDK, in which xar.h doesn't have an extern "C" block itself. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D110224	2021-09-23 09:37:30 +02:00
Fangrui Song	19d53d45f2	[ELF][AArch64] Refine and fix the condition when BTI/PAC PLT needs bti c (As I mentioned in https://reviews.llvm.org/D62609#1534158 , the condition for using bti c for executable can be loosened.) In two cases the address of a PLT may escape: * canonical PLT entry for a STT_FUNC * non-preemptible STT_GNU_IFUNC which is converted to STT_FUNC The first case can be detected with `needsPltAddr`. The second case is not straightforward to detect because for the Relocations.cpp created `directSym`, it's difficult to know whether the associated `sym` has exercised the `!needsPlt(expr)` code path. Just use the conservative `isInIplt` condition. A non-preemptible ifunc not referenced by non-GOT-generating non-PLT-generating relocations will have an unneeded `bti c`, but the cost is acceptable. The second case fixes a bug as well: a -shared link may have non-preemptible ifunc. Before the patch we did not emit `bti c` and could be wrong if the PLT address escaped. GNU ld doesn't handle the case: `relocation R_AARCH64_ADR_PREL_PG_HI21 against STT_GNU_IFUNC symbol 'ifunc2' isn't handled by elf64_aarch64_final_link_relocate` (https://sourceware.org/bugzilla/show_bug.cgi?id=28370) For -shared, if BTI is enabled but PAC is disabled, the PLT entry size increases from 16 to 24 because we have to select the PLT scheme early, but the cost is acceptable. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D110217	2021-09-22 11:51:09 -07:00
Hongtao Yu	d9b511d8e8	[CSSPGO] Set PseudoProbeInserter as a default pass. Currently PseudoProbeInserter is a pass conditioned on a target switch. It works well with a single clang invocation. It doesn't work so well when the backend is called separately (i.e., through the linker or llc), where the user always has to pass -pseudo-probe-for-profiling explicitly. I'm making the pass a default pass that requires no command line arg to trigger, but will be actually run depending on whether the CU comes with `llvm.pseudo_probe_desc` metadata. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D110209	2021-09-22 09:09:48 -07:00
Andrew Ng	05b1303421	[ELF][test] Restore important part of ICF alignment test Restore the checking of addresses in ICF test which was testing the behaviour of ICF with regards to different alignments of otherwise identical sections. Also make the test more robust to layout changes. Differential Revision: https://reviews.llvm.org/D110090	2021-09-22 14:15:33 +01:00
Amy Huang	6e994a833e	[lld] Remove timers.ll because inconsistent timers behavior causes the test to fail sometimes See https://reviews.llvm.org/D109904	2021-09-20 09:57:18 -07:00
Fangrui Song	a954bb18b1	[ELF] Add --why-extract= to query why archive members/lazy object files are extracted Similar to D69607 but for archive member extraction unrelated to GC. This patch adds --why-extract=. Prior art: GNU ld -M prints ``` Archive member included to satisfy reference by file (symbol) a.a(a.o) main.o (a) b.a(b.o) (b()) ``` -M is mainly for input section/symbol assignment <-> output section mapping (often huge output) and the information may appear ad-hoc. Apple ld64 ``` __Z1bv forced load of b.a(b.o) _a forced load of a.a(a.o) ``` It doesn't say the reference file. Arm's proprietary linker ``` Selecting member vsnprintf.o(c_wfu.l) to define vsnprintf. ... Loading member vsnprintf.o from c_wfu.l. definition: vsnprintf reference : _printf_a ``` --- --why-extract= gives the user the full data (which is much shorter than GNU ld -Map). It is easy to track a chain of references to one archive member with a one-liner, e.g. ``` % ld.lld main.o a_b.a b_c.a c.a -o /dev/null --why-extract=- \| tee stdout reference extracted symbol main.o a_b.a(a_b.o) a a_b.a(a_b.o) b_c.a(b_c.o) b() b_c.a(b_c.o) c.a(c.o) c() % ruby -ane 'BEGIN{p={}}; p[$F[1]]=[$F[0],$F[2]] if $.>1; END{x="c.a(c.o)"; while y=p[x]; puts "#{y[0]} extracts #{x} to resolve #{y[1]}"; x=y[0] end}' stdout b_c.a(b_c.o) extracts c.a(c.o) to resolve c() a_b.a(a_b.o) extracts b_c.a(b_c.o) to resolve b() main.o extracts a_b.a(a_b.o) to resolve a ``` Archive member extraction happens before --gc-sections, so this may not be a live path under --gc-sections, but I think it is a good approximation in practice. * Specifying a file avoids output interleaving with --verbose. * Required `=` prevents accidental overwrite of an input if the user forgets `=`. (Most of compiler drivers' long options accept `=` but not ` `) Differential Revision: https://reviews.llvm.org/D109572	2021-09-20 09:52:30 -07:00
Fangrui Song	d001ab82e4	[ELF] Don't fall back to .text for e_entry We have the rule to simulate (https://sourceware.org/binutils/docs/ld/Entry-Point.html), but the behavior is questionable (https://sourceware.org/pipermail/binutils/2021-September/117929.html). gold doesn't fall back to .text. The behavior is unlikely to be relied upon by projects (there is even a warning for executable links), so let's just delete this fallback path. Reviewed By: jhenderson, peter.smith Differential Revision: https://reviews.llvm.org/D110014	2021-09-20 09:35:12 -07:00
Nico Weber	1b2c36aa5f	[lld/mac] Fix comment typo to cycle bots	2021-09-18 11:15:21 -04:00
Amy Huang	724a1dff8a	[lld] Fix small error in previous commit `6f7483b1ec`.	2021-09-17 17:47:21 -07:00
Amy Huang	6f7483b1ec	Reland "[LLD] Remove global state in lld/COFF" after fixing asan and msan test failures Original commit description: [LLD] Remove global state in lld/COFF This patch removes globals from the lldCOFF library, by moving globals into a context class (COFFLinkingContext) and passing it around wherever it's needed. See https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html for context about removing globals from LLD. I also haven't moved the `driver` or `config` variables yet. Differential Revision: https://reviews.llvm.org/D109634 This reverts commit `a2fd05ada9`. Original commits were `b4fa71eed3` and `e03c7e367a`.	2021-09-17 17:18:42 -07:00
Jez Ng	91ace9f062	[lld-macho] Construct CFString literals by copying the ConcatInputSection ... instead of constructing a new one each time. This allows us to take advantage of {D105305}. I didn't see a substantial difference when linking chromium_framework, but this paves the way for reusing similar logic for splitting compact unwind entries into sections. There are a lot more of those, so the performance impact is significant. Differential Revision: https://reviews.llvm.org/D109895	2021-09-17 19:46:20 -04:00
Vy Nguyen	b428c3e8c1	[lld-macho] Ignore local personality symbols if a non-local with the same name exists, to avoid "too many personalities" error. Sometimes people intentionally re-define a dylib personality symbol as a local defined symbol as a workaround to a ld -r bug. As a result, we could see "too many personalities" to encode. This patch tries to handle this case by ignoring the local symbols entirely. Differential Revision: https://reviews.llvm.org/D107533	2021-09-17 12:59:42 -04:00
Nuri Amari	aaf00f3f19	Add MachO signature verification test Add a test to ensure that MachO files including a LC_CODE_SIGNATURE load command produced by lld are signed correctly. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D109840	2021-09-16 17:55:32 -07:00
Nuri Amari	cc8229603b	Extract LC_CODE_SIGNATURE related implementation out of LLD Move the functionality in lld that handles writing of the LC_CODE_SIGNATURE load command and associated data section to a central reusable location. This change is in preparation for another change that modifies llvm-objcopy to reproduce the LC_CODE_SIGNATURE load command and corresponding data section to maintain the validity of signed macho object files passed through llvm-objcopy. Reviewed By: #lld-macho, int3, oontvoo Differential Revision: https://reviews.llvm.org/D109803	2021-09-16 17:43:39 -07:00
Fangrui Song	1d08a19a38	[ELF] Clarify --export-dynamic-symbol/--dynamic-list. NFC	2021-09-16 17:13:08 -07:00
Amy Huang	a2fd05ada9	Temporarily revert "[LLD] Remove global state in lld/COFF" and "[lld] Add test to check for timer output" Seems to be causing a number of asan test failures. This reverts commit `b4fa71eed3` and `e03c7e367a`.	2021-09-16 11:58:11 -07:00
Amy Huang	e03c7e367a	[lld] Add test to check for timer output This test checks that timers are working and printing as expected. I also seem to have changed the order of the timers in my globals refactoring patch, so I fixed it here. Differential Revision: https://reviews.llvm.org/D109904	2021-09-16 11:36:46 -07:00
Amy Huang	b4fa71eed3	[LLD] Remove global state in lld/COFF This patch removes globals from the lldCOFF library, by moving globals into a context class (COFFLinkingContext) and passing it around wherever it's needed. See https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html for context about removing globals from LLD. I also haven't moved the `driver` or `config` variables yet. Differential Revision: https://reviews.llvm.org/D109634	2021-09-16 11:00:23 -07:00
Alfonso Gregory	a2c319fdc6	[LLVM][CMake][NFC] Resolve FIXME: Rename LLVM_CMAKE_PATH to LLVM_CMAKE_DIR throughout the project This way, we do not need to set LLVM_CMAKE_PATH to LLVM_CMAKE_DIR when (NOT LLVM_CONFIG_FOUND) Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D107717	2021-09-16 18:29:57 +02:00
Thomas Lively	962acf0a27	[lld][WebAssembly] Use llvm-objdump to test __wasm_init_memory Rather than depending on the hex dump from obj2yaml. Now the test shows the expected function body in a human readable format. Differential Revision: https://reviews.llvm.org/D109730	2021-09-14 18:07:59 -07:00
Nico Weber	ed2f0ad307	[lld/mac] Search .tbd before binary for framework files too This matters for example for the iPhoneSimulator14.0.sdk, which has a System/Library/Frameworks/UIKit.framework/UIKit that has LC_BUILD_VERSION with minos of 14.0, so linking against that file will produce warnings like: .../iPhoneSimulator14.0.sdk/System/Library/Frameworks/UIKit.framework/UIKit has version 14.0.0, which is newer than target minimum of 12.0.0 when targeting x86_64-apple-ios12.0-simulator. That doesn't happen when linking against UIKit.tbd instead, obviously. Linking with RC_TRACE_DYLIB_SEARCHING=1 shows that ld64 also searches the tbd file first, and we already get that right for non-framework dylibs. Fixes crbug.com/1249456. Differential Revision: https://reviews.llvm.org/D109768	2021-09-14 15:26:45 -04:00
Sam Clegg	6ee55f9ab5	Fix test failure created by `ef8c9135ef` Followup to https://reviews.llvm.org/D108877 to fix test failure.	2021-09-14 07:35:05 -07:00
Sam Clegg	ef8c9135ef	[WebAssembly] Allow import and export of TLS symbols between DSOs We previously had a limitation that TLS variables could not be exported (and therefore could also not be imported). This change removed that limitation. Differential Revision: https://reviews.llvm.org/D108877	2021-09-14 06:47:37 -07:00
Thomas Lively	b2032f18c9	[lld][WebAssembly] Relax limitations on multithreaded instantiation For multithreaded modules (i.e. modules with a shared memory), lld injects a synthetic Wasm start function that is automatically called during instantiation to initialize memory from passive data segments. Even though the module will be instantiated separately on each thread, memory initialization should happen only once. Furthermore, memory initialization should be finished by the time each thread finishes instantiation. Since multiple threads may be instantiating their modules at the same time, the synthetic function must synchronize them. The current synchronization tries to atomically increment a flag from 0 to 1 in memory then enters one of two cases. First, if the increment was successful, the current thread is responsible for initializing memory. It does so, increments the flag to 2 to signify that memory has been initialized, then notifies all threads waiting on the flag. Otherwise, the thread atomically waits on the flag with an expected value of 1 until memory has been initialized. Either the initializer thread finishes initializing memory (i.e. sets the flag to 2) first and the waiter threads do not end up blocking, or the waiter threads successfully start waiting before memory is initialized so they will be woken by the initializer thread once it has finished. One complication with this scheme is that there are various contexts on the Web, most notably on the main browser thread, that cannot successfully execute a wait. Executing a wait in these contexts causes a trap, and in this case would cause instantiation to fail. The embedder must therefore ensure that these contexts win the race and become responsible for initializing memory, since that is the only code path that does not execute a wait. Unfortunately, since only one thread can win the race and initialize memory, this scheme makes it impossible to have multiple threads in contexts that cannot wait.
For example, it is not currently possible to instantiate the module on both the main browser thread as well as in an AudioWorklet. To loosen this restriction, this commit inserts an extra check so that the wait will not be executed at all when memory has already been initialized, i.e. when the flag value is 2. After this change, the module can be instantiated on threads in non-waiting contexts as long as the embedder can guarantee either that the thread will win the race and initialize memory (as before) or that memory has already been initialized when instantiation begins. Threads in contexts that can wait can continue racing to initialize memory. Fixes (or at least improves) PR51702. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D109722	2021-09-13 15:03:51 -07:00
Sam Clegg	b78c85a44a	[WebAssembly] Convert to new "dylink.0" section format This format is based on sub-sections (like the "linking" and "name" sections) and is therefore easier to extend going forward. spec change: https://github.com/WebAssembly/tool-conventions/pull/170 binaryen change: https://github.com/WebAssembly/binaryen/pull/4141 wabt change: https://github.com/WebAssembly/wabt/pull/1707 emscripten change: https://github.com/emscripten-core/emscripten/pull/15019 Differential Revision: https://reviews.llvm.org/D109595	2021-09-12 05:30:38 -07:00
Sam Clegg	3a7bcba34b	[lld][WebAssembly] Cleanup output of --verbose Remove some unnecessary logging from wasm-ld when running under `--verbose`. Unlike `-debug` this logging is available in release builds. This change makes it a little more minimal/readable. Also, avoid compiling the `debugWrite` function in release builds where it does nothing. This should remove a lot of debug strings from the binary, and avoid having to construct unused debug strings at runtime. Differential Revision: https://reviews.llvm.org/D109583	2021-09-10 11:35:50 -04:00
Fangrui Song	bcc34ab6c8	[lld] Enable ANSI escape code for Windows Buffered diagnostics need ENABLE_VIRTUAL_TERMINAL_PROCESSING after D87272. Do it unconditionally like FileCheck.	2021-09-09 16:51:11 -07:00
Sam Clegg	6355234660	[lld][WebAssembly] Fix crash on un-used __tls_base symbol In the case that TLS is used in a single-threaded program, and therefore effectively lowered away, we still optionally create a `__tls_base` symbol, but the code for setting it was assuming it was always created. Differential Revision: https://reviews.llvm.org/D109518	2021-09-09 12:45:58 -04:00
Fangrui Song	0db402c5b4	[lld] Buffer writes when composing a single diagnostic llvm::errs() is unbuffered. On a POSIX platform, composing a diagnostic string may invoke the ::write syscall multiple times, which can be slow. Buffer writes to a temporary SmallString when composing a single diagnostic to reduce the number of ::write syscalls to one (also easier to read under strace/truss). For an invocation of ld.lld with 62000+ lines of `ld.lld: warning: symbol ordering file: no such symbol: ` warnings (D87121), the buffering decreases the write time from 1s to 0.4s (for /dev/tty) and from 0.4s to 0.1s (for a tmpfs file). This can speed up `relocation R_X86_64_PC32 out of range` diagnostic printing as well with `--noinhibit-exec --no-fatal-warnings`. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D87272	2021-09-09 09:27:14 -07:00
Sam Clegg	44177e5fb2	[WebAssembly] Add explicit TLS symbol flag As before we maintain backwards compat with older object files by also inferring the TLS flag based on the name of the segment. This change was split out from https://reviews.llvm.org/D108877. Differential Revision: https://reviews.llvm.org/D109426	2021-09-09 10:03:30 -04:00
Fangrui Song	aa4dfba522	[ELF] Infer EM_HEXAGON in getBitcodeMachineKind	2021-09-07 20:46:37 -07:00
Fangrui Song	abd80ecf6e	[ELF][test] Improve getBitcodeMachineKind tests	2021-09-07 11:38:43 -07:00
Jez Ng	d9ab62ca3d	[lld-macho] Initialize LTO backend with diagnostic handler Failing to do so results in `std::bad_function_call` being thrown when a pass tries to emit a diagnostic. I've copied the relevant test over from LLD-ELF's test suite. Reviewed By: #lld-macho, thevinster Differential Revision: https://reviews.llvm.org/D109274	2021-09-04 17:40:07 -04:00
David Blaikie	bc066e26c9	DebugInfo: Fix a few bot failures for type dumping fixes	2021-09-03 14:08:58 -07:00
Nico Weber	c15b588852	[lld/mac] Don't assert during thunk insertion if there are undefined symbols We end up calling resolveBranchVA(), which asserts for Undefineds. As a fix, just return early in Writer::run() if there are any diagnostics after processing relocations (which is where undefined symbol errors are emitted). This matches what the ELF port does. Differential Revision: https://reviews.llvm.org/D109079	2021-09-03 12:22:41 -04:00
Nico Weber	9d22754389	Fix lld build after `5881dcff7e`	2021-09-02 15:07:10 -04:00
Sid Manning	0d7e5daedc	[lld][Hexagon] Add checks for instructions that can have TLS relocations Several instructions with potential TLS relocations were missing. This issue was found when building the Canadian LLVM toolchain.	2021-09-01 13:15:18 -07:00
Alexandre Ganea	7f0664f193	[LLD][COFF] Clean paths in PDB even when /pdbsourcepath is omitted Differential Revision: https://reviews.llvm.org/D109030	2021-08-31 19:05:10 -04:00
Fangrui Song	f9277caffc	[ELF][test] Fix R_AARCH64_ADR_PREL_PG_HI21 typo Found by redfast00	2021-08-31 13:09:55 -07:00
Nico Weber	86c8f395ae	[lld/mac] Leave more room for thunks in thunk placement code Fixes PR51578 in practice. Currently there's only enough room for a single thunk, which for real-life code isn't enough. The error case only happens when there are many branch statements very close to each other (0 or 1 instructions apart), with the function at the finalization barrier small. There's a FIXME on what to do if we hit this case, but that suggestion sounds complicated to me (see end of PR51578 comment 5 for why). Instead, just leave more room for thunks. Chromium's unit_tests links fine with room for 3 thunks. Leave room for 100, which should fix this for most cases in practice. There's little cost for leaving lots of room: This slop value only determines when we finalize sections, and we insert thunks for forward jumps into unfinalized sections. So leaving room means we'll need a few more thunks, but the thunk jump range is 128 MiB while a single thunk is just 12 bytes. For Chromium's unit_tests: With a slop of 3: thunk calls = 355418, thunks = 10903 With a slop of 100: thunk calls = 355426, thunks = 10904 Chances are 100 is enough for all use cases we'll hit in practice, but even bumping it to 1000 would probably be fine. Differential Revision: https://reviews.llvm.org/D108930	2021-08-30 22:09:05 -04:00
Nico Weber	83df94067d	[lld/mac] Tweak estimateStubsInRangeVA a bit - Move a few variables closer to their uses, remove some completely (no behavior change) - Add some comments - Make maxPotentialThunks include calls to stubs. It's possible that an earlier call to a stub late in the stub table will need a thunk, and that inserted thunk could push a stub earlier in the stub table out of range. This is unlikely to happen, but usually there are way fewer stub calls than non-stub calls, so if we're doing a conservative approximation here we might as well do it correctly. (For chromium's unit_tests target, 134421/242639 stub calls are direct calls without this change, compared to 134408/242639 with this change) No real, meaningful behavior difference. Differential Revision: https://reviews.llvm.org/D108924	2021-08-30 13:56:45 -04:00
Nico Weber	9721197520	[lld/mac] Set branchRange a bit more carefully - Don't subtract thunkSize from branchRange. Most places care about the actual maximal branch range. Subtract thunkSize in the one place that wants to leave room for a thunk. - Set it to 0x800_0000 instead of 0xFF_FFFF - Subtract 4 for the positive branch direction since it's a two's complement 24bit number sign-extended multiplied by 4, so its range is -0x800_0000..+0x7FF_FFFC - Make boundary checks include the boundary values This doesn't make a huge difference in practice. It's preparation for a "real" fix for PR51578 -- but it also lets the repro in comment 0 in that bug place one more thunk before hitting the TODO. Differential Revision: https://reviews.llvm.org/D108897	2021-08-30 12:36:06 -04:00
Fangrui Song	3726039561	[ELF] Simplify addGotEntry. NFC	2021-08-29 13:40:08 -07:00
Fangrui Song	d3fdc312b2	[ELF] Untangle TLS IE and regular GOT from addGotEntry for non-mips. NFC	2021-08-29 13:21:06 -07:00
Fangrui Song	1861160697	[ELF] Move handleTlsRelocations. NFC Prepare for addGotEntry simplification.	2021-08-29 13:11:35 -07:00
Fangrui Song	204b2902d5	[ELF] Remove unused processRelocAux argument. NFC	2021-08-29 12:07:56 -07:00
Nico Weber	28be02f334	[lld/mac] Don't assert on -dead_strip + arm64 range extension thunks The assert is harmless and things worked fine in builds with asserts enabled, but it's still nice to fix the assert. Differential Revision: https://reviews.llvm.org/D108853	2021-08-27 23:27:45 -04:00
Pirama Arumuga Nainar	9632ce14e4	[lld/test/ELF] Test fetch from archive to resolve undefined symbols in shared libs Add missing test coverage uncovered in review of D108006. Differential Revision: https://reviews.llvm.org/D108328	2021-08-27 14:17:32 -07:00
Nico Weber	34ac7a7ac1	[lld/COFF] Ignore /LTCG, /LTCG:, /LTCGOUT:, /ILK: flags We currently complain "could not open /LTCG: no such file or directory", which isn't very useful. We could emit a warning when we see this flag, but just ignoring it seems fine. Final missing part of PR38799. Differential Revision: https://reviews.llvm.org/D108799	2021-08-27 09:13:30 -04:00
Nico Weber	66dc44f703	[lld/COFF] Use P_priv more P_priv does the same as the old QF further down. Standardize on P_priv. No behavior change. Differential Revision: https://reviews.llvm.org/D108798	2021-08-27 08:48:05 -04:00
Jez Ng	c74eb05f21	[lld-macho][nfc] Clean up InputSection constructors	2021-08-26 19:07:48 -04:00
Jez Ng	9b5148d426	[lld-macho] Have -ObjC load archive members before symbol resolution This is what ld64 does. Deviating in behavior here can result in some subtle duplicate symbol errors, as detailed in the objc.s test. Differential Revision: https://reviews.llvm.org/D108781	2021-08-26 18:52:07 -04:00
Jez Ng	9065fe5591	[lld-macho] Refactor archive loading The previous logic was duplicated between symbol-initiated archive loads versus flag-initiated loads (i.e. `-force_load` and `-ObjC`). This resulted in code duplication as well as redundant work -- we would create Archive instances twice whenever we had one of those flags; once in `getArchiveMembers` and again when we constructed the ArchiveFile. This was motivated by an upcoming diff where we load archive members containing ObjC-related symbols before loading those containing ObjC-related sections, as well as before performing symbol resolution. Without this refactor, it would be difficult to do that while avoiding loading the same archive member twice. Differential Revision: https://reviews.llvm.org/D108780	2021-08-26 18:52:07 -04:00
Jez Ng	2179930868	[lld-macho] Fix unwind info personality size This was missed by {D107035}. This fix addresses the following warning: loop variable 'personality' has type 'const uint32_t &' (aka 'const unsigned int &') but is initialized with type 'const unsigned long long' resulting in a copy [-Wrange-loop-analysis] In addition to fixing the size, I also removed the const reference, since there's no performance benefit to avoiding copies of integer-sized values.	2021-08-26 18:52:06 -04:00
Nico Weber	400a1de3ac	[lld/COFF] Improve handling of the /manifestdependency: flag If multiple /manifestdependency: flags are passed, they are naively deduped, but after that each of them should have an effect, instead of just the last one. Also, /manifestdependency: flags are allowed in .drectve sections (from `#pragma comment(linker, ...`). To make the interaction between /manifestdependency: flags enabling manifest by default but /manifest:no overriding this work, add an explicit ManifestKind::Default state to represent no explicit /manifest flag being passed. To make /manifestdependency: flags from input file .drectve sections work with /manifest:embed, delay embedded manifest emission until after input files have been read. Differential Revision: https://reviews.llvm.org/D108628	2021-08-25 14:36:32 -04:00
Heejin Ahn	77b921b870	[WebAssembly] Tidy up EH/SjLj options This CL is small, but the description can be a little long because I'm trying to sum up the status quo for Emscripten/Wasm EH/SjLj options. First, this CL adds an option for Wasm SjLj (`-wasm-enable-sjlj`), which handles SjLj using Wasm EH. The implementation for this will be added as a followup CL, but this adds the option first to do error checking. This also adds an option for Wasm EH (`-wasm-enable-eh`), which has been already implemented. Before we used `-exception-model=wasm` as the same meaning as enabling Wasm EH, but after we add Wasm SjLj, it will be possible to use Wasm EH instructions for Wasm SjLj while not enabling EH, so going forward, to use Wasm EH, `opt` and `llc` will need this option. This only affects `opt` and `llc` command lines and does not affect Emscripten user interface. Now we have two modes of EH (Emscripten/Wasm) and also two modes of SjLj (also Emscripten/Wasm). The options corresponding to each of them are: - Emscripten EH: `-enable-emscripten-cxx-exceptions` - Emscripten SjLj: `-enable-emscripten-sjlj` - Wasm EH: `-wasm-enable-eh -exception-model=wasm` `-mattr=+exception-handling` - Wasm SjLj: `-wasm-enable-sjlj -exception-model=wasm` `-mattr=+exception-handling` The reason Wasm EH/SjLj's options are a little complicated is that `-exception-model` and `-mattr` are common LLVM options and not under our control. (`-mattr` can be omitted if it is embedded within the bitcode file.) And we have the following rules of the option composition: - Emscripten EH and Wasm EH cannot be turned on at the same time - Emscripten SjLj and Wasm SjLj cannot be turned on at the same time - Wasm SjLj should be used with Wasm EH Which means we now allow these combinations: - Emscripten EH + Emscripten SjLj: the current default in `emcc` - Wasm EH + Emscripten SjLj: This is allowed, but only as an interim step in which we are testing Wasm EH but do not yet have a working implementation of Wasm SjLj.
This will error out (D107687) at compile time if `setjmp` is called in a function in which Wasm exception is used. - Wasm EH + Wasm SjLj: This will be the default mode later when using Wasm EH. Currently Wasm SjLj implementation doesn't exist, so it doesn't work. - Emscripten EH + Wasm SjLj will not work. This CL moves these error checking routines to `WebAssemblyPassConfig::addIRPasses`. Not sure if this is an ideal place to do this, but I couldn't find elsewhere. Currently some checking is done within LowerEmscriptenEHSjLj, but these checks only run if LowerEmscriptenEHSjLj runs so it may not run when Wasm EH is used. This moves that to `addIRPasses` and adds some more checks. Currently LowerEmscriptenEHSjLj pass is responsible for Emscripten EH and Emscripten SjLj. Wasm EH transformations are done in multiple places, including WasmEHPrepare, LateEHPrepare, and CFGStackify. But in the followup CL, LowerEmscriptenEHSjLj pass will be also responsible for a part of Wasm SjLj transformation, because WasmSjLj will also be using several Emscripten library functions, and we will be sharing more than half of the transformation to do that between Emscripten SjLj and Wasm SjLj. Currently we have `-enable-emscripten-cxx-exceptions` and `-enable-emscripten-sjlj` but these only work for `llc`, because for `llc` we feed these options to the pass but when we run the pass using `opt` the pass will be created with no options and the default options will be used, which turns both Emscripten EH and Emscripten SjLj on. Now that we have one more SjLj option to care for, LowerEmscriptenEHSjLj pass needs a finer way to control these options. This CL removes those default parameters and makes LowerEmscriptenEHSjLj pass read directly from command line options specified.
So if we only run `opt -wasm-lower-em-ehsjlj`, currently both Emscripten EH and Emscripten SjLj will run, but with this CL, none will run unless we additionally pass `-enable-emscripten-cxx-exceptions` or `-enable-emscripten-sjlj`, or both. This does not affect users; this only affects our `opt` tests because `emcc` will not call either `opt` or `llc`. As a result of this, our existing Emscripten EH/SjLj tests gained one or both of those options in their `RUN` lines. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D107685	2021-08-24 17:54:39 -07:00
Sam Clegg	c468dc1b12	[lld][WebAssembly] Handle weakly defined symbols in shared libraries. In the case of weakly defined symbols in shared libraries we now generate both an import and an export. The dynamic linker can then choose a winner from among all the shared libraries that define a given symbol. Previously any direct usage of a weakly defined symbol would use the DSO-local definition (For example, even though there would be a single address for a weakly defined function, each DSO could end up directly calling its local version). Fixes: https://github.com/emscripten-core/emscripten/issues/13773 Differential Revision: https://reviews.llvm.org/D108413	2021-08-19 19:25:49 -04:00
Sam Clegg	e4888be74e	[WebAssembly] Avoid unused function imports in PIC mode In PIC mode we import function address via `GOT.mem` imports but for direct function calls we still import the first class function. However, if the function is never directly called we can avoid the first class import completely. Differential Revision: https://reviews.llvm.org/D108345	2021-08-18 22:31:04 -04:00
Sam Clegg	12b1dc0467	[WebAssembly][lld] Convert signature-mismatch.ll test to asm. NFC Differential Revision: https://reviews.llvm.org/D108346	2021-08-18 22:17:02 -04:00
Fangrui Song	f74b70ef57	[lld-macho][test] Remove ld64.lld: prefix in a diagnostic The convention is not to check the prefix before `error: `. This gives flexibility if we need to rename ld64.lld to something else, (e.g. a while ago we used ld64.lld.darwinnew).	2021-08-16 19:41:12 -07:00
Fangrui Song	54e76cb17a	[split-file] Default to --no-leading-lines It turns out that the --leading-lines may be a bad default. [[#@LINE+-num]] is rarely used.	2021-08-16 19:23:11 -07:00
Vincent Lee	08d55c5c01	[lld-macho] Refactor parseSections to avoid creating isec on LLVM segments Address post follow up comment in D108016. Avoid creating isec for LLVM segments since we are skipping over it. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D108167	2021-08-16 18:47:50 -07:00
Vincent Lee	15dc93e61c	[lld-macho] Ignore LLVM segments to prevent duplicate syms There was an instance of a third-party archive containing multiple _llvm symbols from different files that clashed with each other producing duplicate symbols. Symbols under the LLVM segment don't seem to be producing any meaningful value, so just ignore them. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D108016	2021-08-16 12:41:03 -07:00
Martin Storsjö	f8340c8c5d	[LLD] [MinGW] Add more options for disabling flags in the executable In `e72403f96d`, we added the flag "--no-dynamicbase" for disabling the dynamicbase flag which we set by default. At the time, ld.bfd didn't have any corresponding option (as ld.bfd defaulted to not setting the flag). Almost at the same time, corresponding options were added to ld.bfd for disabling it (while it was being enabled by default), with a different name, "--disable-dynamicbase". Thus add the "--disable-dynamicbase" option. Make this default one advertised in the help listing, but keep the "--no-dynamicbase" form as an alias. Also improve checking for the last option set if there are multiple ones on the same command line. Also add corresponding disable options for a lot of other flags that we set by default, also added in ld.bfd in the same commit: https://sourceware.org/git/?p=binutils-gdb.git;a=commitdiff;h=514b4e191d5f46de8e142fe216e677a35fa9c4bb Differential Revision: https://reviews.llvm.org/D107930	2021-08-12 13:27:09 +03:00
Reid Kleckner	fb9a075c81	[lld] Add llvm-profdata to lld test deps As of https://reviews.llvm.org/D104431, the test suite runs llvm-profdata, so it must be added to the list of deps.	2021-08-11 11:52:40 -07:00
Yolanda Chen	8fa16cc628	[LTO][lld] Add lto-pgo-warn-mismatch option When enabling CSPGO for ThinLTO, there are profile cfg mismatch warnings that will cause lld-link errors (with /WX) due to source changes (e.g. `#if` code runs for profile generation but not for profile use). To disable them we have to use an internal "/mllvm:-no-pgo-warn-mismatch" option. In contrast clang uses the option "-Wno-backend-plugin" to avoid such warnings and gcc has an explicit "-Wno-coverage-mismatch" option. Add the "lto-pgo-warn-mismatch" option to lld COFF/ELF to help turn the profile mismatch warnings on/off explicitly when building with ThinLTO and CSPGO. Differential Revision: https://reviews.llvm.org/D104431	2021-08-11 09:45:55 -07:00
Wang, Pengfei	6c4809825d	Revert "[lld] Add lto-pgo-warn-mismatch option" This reverts commit `0cfb00a1c9`.	2021-08-11 16:25:42 +08:00
Yolanda Chen	0cfb00a1c9	[lld] Add lto-pgo-warn-mismatch option When enabling CSPGO for ThinLTO, there are profile cfg mismatch warnings that will cause lld-link errors (with /WX). To disable them we have to use an internal "/mllvm:-no-pgo-warn-mismatch" option. In contrast clang uses the option "-Wno-backend-plugin" to avoid such warnings and gcc has an explicit "-Wno-coverage-mismatch" option. Add this "lto-pgo-warn-mismatch" option to lld to help turn the profile mismatch warnings on/off explicitly when building with ThinLTO and CSPGO. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D104431	2021-08-11 14:43:26 +08:00
Sam Clegg	56175b2f5c	[lld][WebAssembly] Prefer objdump -d over obj2yaml for tests. NFC Now that we have https://reviews.llvm.org/D105539 we can use objdump -d to actually check for instruction sequences rather than binary blobs. This is just an example of how to do that; we should follow up with a wider-ranging conversion of existing tests. Differential Revision: https://reviews.llvm.org/D106897	2021-08-10 18:17:58 -04:00
Fangrui Song	76093b1739	[InlineAdvisor] Add single quotes around caller/callee names Clang diagnostics refer to identifier names in quotes. This patch makes inline remarks conform to the convention. New behavior: ``` % clang -O2 -Rpass=inline -Rpass-missed=inline -S a.c a.c:4:25: remark: 'foo' inlined into 'bar' with (cost=-30, threshold=337) at callsite bar:0:25; [-Rpass=inline] int bar(int a) { return foo(a); } ^ ``` Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D107791	2021-08-10 11:51:31 -07:00
Ben Dunbobbin	8392e8c007	[LLD][Test] Add thin archives to map file test This adds thin archives to the map file test. I noticed that we had this test-case in our downstream testsuite but it wasn't in the upstream testing. Differential revision: https://reviews.llvm.org/D107555	2021-08-10 10:24:01 +01:00
Pan, Tao	c70fa6da9a	Fix gcc build error after D105519 Same as `3bec7ed59e` Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D107422	2021-08-09 14:32:34 +08:00
Simon Atanasyan	454f69bcc1	[LLD] Add required `ppc` target to the test cases. NFC	2021-08-07 13:29:59 +03:00
Simon Atanasyan	c6ebc651b6	[LLD] Support compressed input sections on big-endian targets This patch enables compressed input sections on big-endian targets by checking the target endianness and selecting an appropriate `Chdr` structure. Fixes PR51369 Differential Revision: https://reviews.llvm.org/D107635	2021-08-07 13:20:13 +03:00
Paul Robinson	34035b1044	2nd Speculative fix for MachO lld test after "Have REQUIRES support the target triple" See: http://45.33.8.238/macm1/15677/step_10.txt Follow-up to `f88ad8d` as it appears the lld invocations both emit an error message; so, try adding 'not' to the RUN lines.	2021-08-06 10:49:36 -07:00
Paul Robinson	f88ad8d00f	Speculative fix for MachO lld test after "Have REQUIRES support the target triple" See: http://45.33.8.238/macm1/15677/step_10.txt This is a test that has `REQUIRES: x86` which means it never ran before; I don't have a MachO environment but based on the FileCheck output it looks like it should be sufficient to remove one CHECK line.	2021-08-06 09:23:45 -07:00
Fangrui Song	72d070b4db	[ELF] Support copy relocation on non-default version symbols Copy relocation on a non-default version symbol is unsupported and can crash at runtime. Fortunately there is a one-line fix which works for most cases: ensure `getSymbolsAt` unconditionally returns `ss`. If two non-default version symbols are defined at the same place and both are copy relocated, our implementation will copy relocate them into different addresses. The pointer inequality is very unlikely to be an issue. In GNU ld, copy relocating version aliases seems to create more pointer inequality problems than in our implementation. (In glibc, sys_errlist@GLIBC_2.2.5 sys_errlist@GLIBC_2.3 sys_errlist@GLIBC_2.4 are defined at the same place, but it is unlikely they are all copy relocated in one executable. Even if so, the variables are read-only and pointer inequality should not be a problem.) Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107535	2021-08-05 10:32:14 -07:00
Fangrui Song	00809c8889	[ELF] Apply version script patterns to non-default version symbols Currently version script patterns are ignored for .symver produced non-default version (single @) symbols. This makes such symbols not localizable by `local:`, e.g. ``` .symver foo3_v1,foo3@v1 .globl foo_v1 foo3_v1: ld.lld --version-script=a.ver -shared a.o ``` This patch adds the support: * Move `config->versionDefinitions[VER_NDX_LOCAL].patterns` to `config->versionDefinitions[versionId].localPatterns` * Rename `config->versionDefinitions[versionId].patterns` to `config->versionDefinitions[versionId].nonLocalPatterns` * Allow `findAllByVersion` to find non-default version symbols when `includeNonDefault` is true. (Note: `symtab` keys do not have `@@`) * Make each pattern check both the unversioned `pat.name` and the versioned `${pat.name}@${v.name}` * `localPatterns` can localize `${pat.name}@${v.name}`. `nonLocalPatterns` can prevent localization by assigning `verdefIndex` (before `parseSymbolVersion`). --- If a user notices new `undefined symbol` errors with a version script containing `local: *;`, the issue is likely due to a missing `global:` pattern. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107234	2021-08-04 23:52:56 -07:00
Fangrui Song	a533eb7423	Revert "[ELF] Apply version script patterns to non-default version symbols" This reverts commit `7ed22a6fa9`. buf is not cleared so the commit misses some cases.	2021-08-04 23:52:55 -07:00
Fangrui Song	7a6482216f	[CMake][gn] lldMachO=>lldMachOOld, lldMachO2=>lldMachO Now that D95204 switched default to new Darwin backend, rename some CMake targets to match. Reviewed By: #lld-macho, smeenai, int3 Differential Revision: https://reviews.llvm.org/D107516	2021-08-04 18:52:41 -07:00
Fangrui Song	bd484c9940	[lld] Remove unused LLD_REPOSITORY Remnant after D72803. Distributions who want to customize the string can customize LLD_VERSION_STRING instead. Reviewed By: #lld-macho, mstorsjo, thakis Differential Revision: https://reviews.llvm.org/D107416	2021-08-04 13:04:10 -07:00
Fangrui Song	0a6aad5991	[ELF] Fix typo. NFC	2021-08-04 09:26:29 -07:00
Fangrui Song	66d4430492	[ELF] Combine foo@v1 and foo with the same versionId if both are defined Due to an assembler design flaw (IMO), `.symver foo,foo@v1` produces two symbols `foo` and `foo@v1` if `foo` is defined. * `v1 {};` produces both `foo` and `foo@v1`, but GNU ld only produces `foo@v1` * `v1 { foo; };` produces both `foo@@v1` and `foo@v1`, but GNU ld only produces `foo@v1` * `v2 { foo; };` produces both `foo@@v2` and `foo@v1`, matching GNU ld. (Tested by symver.s) This patch implements the GNU ld behavior by reusing the symbol redirection mechanism in D92259. The new test symver-non-default.s checks the first two cases. Without the patch, the second case will produce `foo@v1` and `foo@@v1` which looks weird and makes foo unnecessarily default versioned. Note: `.symver foo,foo@v1,remove` exists but the unfortunate `foo` will not go away anytime soon. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107235	2021-08-04 09:06:05 -07:00
Fangrui Song	7ed22a6fa9	[ELF] Apply version script patterns to non-default version symbols Currently version script patterns are ignored for .symver produced non-default version (single @) symbols. This makes such symbols not localizable by `local:`, e.g. ``` .symver foo3_v1,foo3@v1 .globl foo_v1 foo3_v1: ld.lld --version-script=a.ver -shared a.o # In a.out, foo3@v1 is incorrectly exported. ``` This patch adds the support: * Move `config->versionDefinitions[VER_NDX_LOCAL].patterns` to `config->versionDefinitions[versionId].localPatterns` * Rename `config->versionDefinitions[versionId].patterns` to `config->versionDefinitions[versionId].nonLocalPatterns` * Allow `findAllByVersion` to find non-default version symbols when `includeNonDefault` is true. (Note: `symtab` keys do not have `@@`) * Make each pattern check both the unversioned `pat.name` and the versioned `${pat.name}@${v.name}` * `localPatterns` can localize `${pat.name}@${v.name}`. `nonLocalPatterns` can prevent localization by assigning `verdefIndex` (before `parseSymbolVersion`). --- If a user notices new `undefined symbol` errors with a version script containing `local: *;`, the issue is likely due to a missing `global:` pattern. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107234	2021-08-04 09:02:11 -07:00
Fangrui Song	9bd29a73d1	[ELF] Make dot in .tbss correct GNU ld doesn't support multiple SHF_TLS SHT_NOBITS output sections (it restores the address after an SHF_TLS SHT_NOBITS section, so consecutive SHF_TLS SHT_NOBITS sections will have conflicting address ranges). That said, `threadBssOffset` implements limited support for consecutive SHF_TLS SHT_NOBITS sections. (SHF_TLS SHT_PROGBITS following a SHF_TLS SHT_NOBITS can still be incorrect.) `.` in an output section description of an SHF_TLS SHT_NOBITS section is incorrect. (https://lists.llvm.org/pipermail/llvm-dev/2021-July/151974.html) This patch saves the end address of the previous tbss section in `ctx->tbssAddr`, changes `dot` in the beginning of `assignOffset` so that `.` evaluation will be correct. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107208	2021-08-04 08:58:50 -07:00
Fangrui Song	44361e5b90	[ELF] Add --export-dynamic-symbol-list This is available in GNU ld 2.35 and can be seen as a shortcut for multiple --export-dynamic-symbol, or a --dynamic-list variant without the symbolic intention. In the long term, this option probably should be preferred over --dynamic-list. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107317	2021-08-03 09:01:03 -07:00
Martin Storsjö	b7fb5b54a9	[LLD] [MinGW] Support both "--opt value" and "--opt=value" for more options This does the same fix as D107237 but for a couple more options, converting all remaining cases of such options to accept both forms, for consistency. This fixes building e.g. openldap, which uses --image-base=<value>. Differential Revision: https://reviews.llvm.org/D107253	2021-08-03 10:55:44 +03:00
Mateusz Mikuła	05b025edf4	[LLD][MinGW] Accept joined format for --stack Postgresql uses `--stack=` in its Makefile. Downstream issue: https://github.com/msys2/MINGW-packages/pull/9167 Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D107237	2021-08-01 23:27:00 +03:00
Fangrui Song	52f35c9f14	[ELF][test] Improve .symver & --version-script tests And delete redundant tests.	2021-07-31 18:57:19 -07:00
Fangrui Song	b06426da76	[ELF] Add -Bsymbolic-non-weak-functions This option is a subset of -Bsymbolic-functions. It applies to STB_GLOBAL STT_FUNC definitions. The address of a vague linkage function (STB_WEAK STT_FUNC, e.g. an inline function, a template instantiation) seen by a -Bsymbolic-functions linked shared object may be different from the address seen from outside the shared object. Such cases are uncommon. (ELF/Mach-O programs may use `-fvisibility-inlines-hidden` to break such pointer equality. On Windows, correct dllexport and dllimport are needed to make pointer equality work. Windows link.exe enables /OPT:ICF by default so different inline functions may have the same address.) ``` // a.cc -> a.o -> a.so (-Bsymbolic-functions) inline void f() {} void *g() { return (void *)&f; } // b.cc -> b.o -> exe // The address is different! inline void f() {} ``` -Bsymbolic-non-weak-functions is a safer (C++ conforming) subset of -Bsymbolic-functions, which can make such programs work. Implementations usually emit a vague linkage definition in a COMDAT group. We could detect the group (with more code) but I feel that we should just check STB_WEAK for simplicity. A weak definition will thus serve as an escape hatch for rare cases when users want interposition on definitions. GNU ld feature request: https://sourceware.org/bugzilla/show_bug.cgi?id=27871 Longer write-up: https://maskray.me/blog/2021-05-16-elf-interposition-and-bsymbolic If Linux distributions migrate to protected non-vague-linkage external linkage functions by default, the linker option can still be handy because it allows rapid experiment without recompilation. Protected function addresses currently have deep issues in GNU ld. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D102570	2021-07-29 14:46:53 -07:00
Vy Nguyen	0bd14711ac	[lld-macho] Change personalities entry type to Ptr to avoid overflowing uint32 PR51262 Differential Revision: https://reviews.llvm.org/D107035	2021-07-29 14:26:07 -04:00
Jez Ng	a26bb9cc05	[lld-macho][nfc] Simplify common-symbol-coalescing test	2021-07-29 11:07:50 -04:00
Jez Ng	e49374f9e0	[lld-macho] Support common symbols in bitcode (but differently from ld64) ld64 seems to handle common symbols in bitcode rather bizarrely. They follow entirely different precedence rules from their non-bitcode counterparts. I initially tried to emulate ld64 in D106597, but I'm not sure the extra complexity is worth it, especially given that common symbols are not, well, very common. This diff accords common bitcode symbols the same precedence as regular common symbols, just as we treat all other pairs of bitcode and non-bitcode symbol types. The tests document ld64's behavior in detail, just in case we want to revisit this. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D107027	2021-07-29 11:07:50 -04:00
Jessica Clarke	cfaa5bf4ce	[ELF] Align the first section of a PT_TLS even if its type is SHT_NOBITS This is somewhat of a repeat of D66658 but for sections in PT_TLS segments. Although such sections don't need to be aligned such that address and offset are congruent modulo the page size, they do need to be congruent modulo the segment alignment, otherwise the whole PT_TLS will be unaligned. We therefore use the normal calculation to determine the section's address within the PT_LOAD rather than bailing out early due to being SHT_NOBITS. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D106987	2021-07-29 15:14:00 +01:00
Jessica Clarke	b96bb7899f	[ELF] Add two new tests showing broken .tbss alignment if first in PT_TLS This is a similar problem to D66658, where we are too aggressive in not aligning NOBITS sections, and the tests are based on the ones added for that fix. If a .tbss section is first in a PT_TLS segment (i.e. there is no .tdata section) then, although it doesn't need to be aligned such that address and offset are congruent modulo the page size, they do need to be congruent modulo the segment alignment, otherwise the whole PT_TLS will be unaligned. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D106986	2021-07-29 15:13:52 +01:00
Jez Ng	dc9ee39251	[lld-macho] Downgrade "cannot export hidden symbol" to warning This matches ld64's behavior, and makes it easier to fit LLD into existing build systems. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D107011	2021-07-28 18:46:26 -04:00
Fangrui Song	660b753e28	[ELF][test] Convert --start-address= and --stop-address= values to hexadecimal so that readers can connect them with the hexadecimal addresses in the output.	2021-07-28 12:55:09 -07:00
Fangrui Song	f17e7df04a	[ELF][test] Delete unneeded --triple=thumb* from llvm-objdump RUN lines	2021-07-28 12:47:12 -07:00
Tom Stellard	08c766a731	Bump the trunk major version to 14 and clear the release notes.	2021-07-27 21:58:25 -07:00
Fangrui Song	323b9bf862	[lld] Replace LLVM_ATTRIBUTE_NORETURN with [[noreturn]] [[noreturn]] can be used since 2016 when the minimum compiler requirement was bumped to GCC 4.8/MSVC 2015.	2021-07-27 18:51:17 -07:00
Fangrui Song	b00c8ab1b9	Revert "[ELF] --gc-sections: allow GC on reserved sections in a group" clang may place dynamic initializations for explicitly specialized class template static data members in comdat. Such in-comdat SHT_INIT_ARRAY was an abuse but we have to work around it for a while.	2021-07-27 16:34:32 -07:00
Amilendra Kodithuwakku	b9cf1769de	[lld][ELF] remove empty SyntheticSections from inputSections Change removeUnusedSyntheticSections() to actually remove empty SyntheticSections in inputSections. In addition to doing what removeUnusedSyntheticSections() was meant to do, this will also make the shuffle-sections tests, which shuffles inputSections, less sensitive to empty Synthetic Sections that will not appear in the final image. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D106427 Change-Id: I589eaf596472161a4395fb658aea0fad73318088	2021-07-27 23:29:02 +01:00
Nico Weber	dd57915b1e	[lld/mac] Fix sub-library.s on Windows after `8e8701abca` The endswith() check for the framework name fails when joining with the native path separator. Always use the posix separator as fix.	2021-07-27 15:25:52 -04:00
Nico Weber	e26356a00e	[lld/mac] Fix application-extension.s failure after `8e8701abca` The test accidentally tested something else that makes lld fail with a different (correct-looking) error that wasn't the one the test tries to test for. (The test case before this change makes ld64 hang in an infinite loop.)	2021-07-27 14:39:43 -04:00
Nico Weber	8e8701abca	[lld/mac] When loading reexports, look for basename in -F / -L first Matches ld64 (cf Options::findIndirectDylib()), and fixes PR51218. Differential Revision: https://reviews.llvm.org/D106842	2021-07-27 14:28:52 -04:00
Derek Schuff	cf54424a46	[lld][WebAssembly] Do not remove name section with --strip-debug Leave the name section in the output when using the --strip-debug flag. This treats it more like ELF symbol tables, as the name section has similar uses at runtime (e.g. wasm engines understand it and it can be used for symbolization at runtime). Fixes https://github.com/emscripten-core/emscripten/issues/14623 Differential Revision: https://reviews.llvm.org/D106728	2021-07-26 11:06:52 -07:00
Fangrui Song	c0da287c30	[yaml2obj][MachO] Rename PayloadString to Content The new name is conciser and matches yaml2obj ELF & DWARF. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D106759	2021-07-26 09:04:51 -07:00
Fangrui Song	e7a7ad134f	[ELF] Support quoted symbols in symbol assignments glibc/elf/tst-absolute-zero-lib.lds uses `"absolute" = 0;`	2021-07-25 16:26:37 -07:00
Nico Weber	75e7d1320c	[lld/mac] Make comment style uniform in start-end.s test	2021-07-25 18:37:49 -04:00
Nico Weber	80caa1eb4a	[lld/mac] Add support for segment$start$ and segment$end$ symbols These symbols are somewhat interesting in that they create non-existing segments, which as far as I know is the only way to create segments that don't contain any sections. Final part of PR50760. Like D106629, but for segments instead of sections. I'm not aware of anything that needs this in practice. Differential Revision: https://reviews.llvm.org/D106767	2021-07-25 18:25:13 -04:00
Nico Weber	afdeb432f0	[lld/mac] Move output segment rename logic into OutputSegment Fixes the output segment name if both -rename_section and -rename_segment are used and the post-section-rename segment name is the same as the pre-segment-rename segment name to match ld64's behavior. The motivation is that segment$start$ can create section-less segments, and this makes a corner case in the interaction between segment$start and -rename_segment in the upcoming segment$start patch. Differential Revision: https://reviews.llvm.org/D106766	2021-07-25 18:20:09 -04:00
Nico Weber	6bf7d2d9c9	[lld/mac] Reland: Add tests for the interaction between -rename_section and -rename_segment No behavior change. Differential Revision: https://reviews.llvm.org/D106765	2021-07-25 18:16:33 -04:00
Nico Weber	14bb6e4d70	Revert "[lld/mac] Add tests for the interaction between -rename_section and -rename_segment" This reverts commit `a6eb34624d`. The test fails, I screwed something up.	2021-07-25 18:11:36 -04:00
Nico Weber	a6eb34624d	[lld/mac] Add tests for the interaction between -rename_section and -rename_segment No behavior change. Differential Revision: https://reviews.llvm.org/D106765	2021-07-25 18:03:25 -04:00
Ayke van Laethem	13ca0c87ed	[lld][WebAssembly] Align __heap_base __heap_base was not aligned. In practice, it will often be aligned simply because it follows the stack, but when the stack is placed at the beginning (with the --stack-first option), the __heap_base might be unaligned. It could even be byte-aligned. At least wasi-libc appears to expect that __heap_base is aligned: `659ff41456/dlmalloc/src/malloc.c (L5224)` While WebAssembly itself does not appear to require any alignment for memory accesses, it is sometimes required when sharing a pointer externally. For example, WASI might expect alignment up to 8: https://github.com/WebAssembly/WASI/blob/main/phases/snapshot/docs.md#-timestamp-u64 This issue got introduced with the addition of the --stack-first flag: https://reviews.llvm.org/D46141 I suspect the lack of alignment wasn't intentional here. Differential Revision: https://reviews.llvm.org/D106499	2021-07-24 14:03:26 +02:00
Nico Weber	92c085e7c4	[lld/mac] Fix comment typo in new start-end.s test	2021-07-23 18:14:38 -04:00
Nico Weber	04f5eb407c	[lld/mac] Fix start-stop.s test with expensive checks enabled See e.g. https://lab.llvm.org/buildbot/#/builders/16/builds/14317 Not 100% sure why this fails yet, but this fixes it. Let's get the bots green again first :) Differential Revision: https://reviews.llvm.org/D106711	2021-07-23 17:01:16 -04:00
Nico Weber	04e8d0b62d	[lld/mac] Implement support for section$start and section$end symbols With this, libclang_rt.profile_osx.a can be linked, that is coverage and PGO-instrumented builds should now work with lld. section$start and section$end symbols can create non-existing sections. They're also undefined symbols that are only magic if there isn't a regular symbol with their name, which means they need to be handled in treatUndefined() instead of just looping over all existing sections and adding start and end symbols like the ELF port does. To represent the actual symbols, this uses absolute symbols that get their value updated once an output section is laid out. segment$start and segment$end are still missing for now, but they produce a nicer error message after this patch. Main part of PR50760. Differential Revision: https://reviews.llvm.org/D106629	2021-07-23 16:01:09 -04:00
Jez Ng	d9a639901f	[lld-macho][nfc] Add test for resolution of bitcode symbols We lacked a test for bitcode symbol precedence. We assumed that they followed the same rules as their regular symbol counterparts, but never had a test to verify that we were matching ld64's behavior. It turns out that we were largely correct, though we deviate from ld64 when there are bitcode and non-bitcode symbols of the same name. The test added in this diff both verifies our behavior and documents the differences. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D106596	2021-07-23 11:49:00 -04:00
Jez Ng	cafed6f292	[lld-macho][nfc] Fix test to reflect that symbol attributes don't matter within an archive We had a comment that claimed that defined symbols had priority over common symbols if they occurred in the same archive. In fact, they appear to have equal precedence. Our implementation already does this, so I'm just updating the test comment. Also added a few other test comments along the way for readability. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D106595	2021-07-23 11:49:00 -04:00
Jez Ng	3313b84481	[lld-macho] ICF: Do more work in equalsConstant, less in equalsVariable In particular, relocations to absolute symbols or literal sections can be handled in equalsConstant(), since their output addresses will not change across each iteration of ICF. Offsets and addends can also be dealt with entirely in equalsConstant(), making the code somewhat easier to reason about. Only ConcatInputSections need to be handled in equalsVariable(). LLD-ELF's implementation takes a similar approach. Although this should make ICF do less work, in practice it seems like there is no stat sig difference in time taken when linking chromium_framework. This refactor is motivated by an upcoming diff which improves ICF's handling of addends. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D106212	2021-07-23 11:49:00 -04:00
Jez Ng	8eac5dcb36	[lld-macho] Reorganize + extend ICF test I found icf.s a bit hard to work with as it was not possible to extend any of the functions `_a` ... `_k` to test new relocation / referent types without modifying every single one of them. Additionally, their one-letter names were not descriptive (though the comments helped). I've renamed all the functions to reflect the feature they are testing, and shrunk them so that they contain just enough to test that one feature. I've also added tests for non-zero addends (via the `_abs1a_ref_with_addend` and `_defined_ref_with_addend_1` functions). Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D106211	2021-07-23 11:49:00 -04:00
Nico Weber	9482aa98e5	[lld/mac] Let OutputSegment store its start address segment$start$/segment$end$ symbols allow creating segments without sections, so getting the segment address off the first section won't work there. Storing the address on the segment is arguably a bit simpler too. No behavior change, part of PR50760. Differential Revision: https://reviews.llvm.org/D106665	2021-07-23 11:43:25 -04:00
Nico Weber	2c508cf583	[lld/mac] Don't crash on absolute symbols in order files Absolute symbols have a nullptr isec. buildInputSectionPriorities() would dereference isec, causing crashes. Ordering absolute symbols doesn't make sense, so just ignore them. This seems to match ld64. Differential Revision: https://reviews.llvm.org/D106628	2021-07-23 11:33:23 -04:00
Nico Weber	687181caba	[lld/mac] Add missing REQUIRES line to new test	2021-07-23 10:40:22 -04:00
Leonard Grey	5acc6d4572	[lld-macho] Disambiguate bitcode files with the same name by archive name/offset in archive Ported from COFF/ELF; test is adapted from test/COFF/thinlto-archivecollision.ll LTO expects every bitcode file to have a unique name. If given multiple bitcode files with the same name, it errors with "Expected at most one ThinLTO module per bitcode file". This change incorporates the archive name, to disambiguate members with the same name in different archives and the offset in archive to disambiguate members with the same name in the same archive. Differential Revision: https://reviews.llvm.org/D106179	2021-07-22 22:50:25 -04:00
Nico Weber	393116faad	[lld/mac] Remove "else" after return No behavior change	2021-07-22 21:31:52 -04:00
Fangrui Song	120b18767c	[ELF] --gc-sections: allow GC on reserved sections in a group This generalizes D70146 (SHT_NOTE) to more reserved sections and makes our rules more consistent. Now SHF_GROUP is more similar to SHF_LINK_ORDER. For SHT_INIT_ARRAY/SHT_FINI_ARRAY, the rule will be closer to PE/COFF link.exe. Previously sanitizers use llvm.global_ctors to make module_ctor a GC root, which is considered an abuse. https://groups.google.com/g/generic-abi/c/TpleUEkNoQI We can squeak through on compatibility issues because compilers otherwise don't use SHF_GROUP special sections.	2021-07-22 17:09:23 -07:00
Fangrui Song	54bc2d812e	[ELF][test] Add a test about GCable SHF_LINK_ORDER SHT_INIT_ARRAY	2021-07-22 17:04:54 -07:00
Nico Weber	2d6fb62ef2	[lld/mac] Handle symbols from -U in treatUndefinedSymbol() In ld64, `-U section$start$FOO$bar` handles `section$start$FOO$bar` as a regular `section$start` symbol, that is section$start processing happens before -U processing. Likely, nobody uses that in practice so it doesn't seem very important to be compatible with this, but it also moves the -U handling code next to the `-undefined dynamic_lookup` handling code, which is nice because they do the same thing. And, in fact, this did identify a bug in a corner case in the intersection of `-undefined dynamic_lookup` and dead-stripping (fix for that in D106565). Vaguely related to PR50760. No interesting behavior change. Differential Revision: https://reviews.llvm.org/D106566	2021-07-22 19:43:57 -04:00
Nico Weber	5ae39d4f97	[lld/mac] Fix bug in interaction of -dead_strip and -undefined dynamic_lookup We lost the `used` bit on the Undefined when we replaced it with a DylibSymbol in treatUndefined(). Differential Revision: https://reviews.llvm.org/D106565	2021-07-22 19:30:46 -04:00
Nick Fitzgerald	1d445a6e76	Reland: "[WebAssembly] Deduplicate imports of the same module name, field name, and type" When two symbols import the same thing, only one import should be emitted in the Wasm file. Fixes https://bugs.llvm.org/show_bug.cgi?id=50938 Reverted in: `16aac493e5`. Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D105519	2021-07-22 14:16:05 -07:00
Martin Storsjö	9dbc4b09af	[LLD] [COFF] Make -export-all-symbols work as intended for EXEs If some symbols are marked with dllexport, we still want to export all symbols if -export-all-symbols is specified. Previously, this only worked as it should for DLL output, not for EXE. This should fix downstream bug https://github.com/msys2/MINGW-packages/issues/9163. Differential Revision: https://reviews.llvm.org/D106245	2021-07-22 23:34:03 +03:00
Nico Weber	9d43c000e1	[lld/mac] Move handling of special undefineds later treatUndefinedSymbol() was previously called before gatherInputSections() and markLive() for these special symbols, but after them for normal undefineds. For PR50760, treatUndefinedSymbol() will have to potentially create sections, so it's good to move treatUndefinedSymbol() for special undefineds later, so that it can assume that gatherInputSections() and markLive() has already been called always. No intended behavior change, but part of PR50760 (and covered in tests in the patch for the full feature). Differential Revision: https://reviews.llvm.org/D106552	2021-07-22 11:43:49 -04:00
Douglas Yung	4e52a04833	Change requires line from arm to aarch64 since the test uses arm64_32 which is AArch64.	2021-07-21 12:51:53 -07:00
Fangrui Song	c53a5eebb1	[ELF][test] Add -DAG The guid of a local linkage variable has the module path encoded, so the order between a local linkage variable and a non-local linkage variable isn't guaranteed.	2021-07-20 15:27:52 -07:00
Martin Storsjö	e0e09481ee	[LLD] [COFF] Add a couple "MinGW only" comments re linking against DLLs. NFC. This was requested in the post-commit review of D104530.	2021-07-20 23:57:24 +03:00
Vincent Lee	33ab995617	Recommit "[lld-macho] Use DO_BIND_ADD_ADDR_IMM_SCALED for bind opcodes" Implement pass 3 of bind opcodes from ld64 (which supports both 32-bit and 64-bit). Pass 3 implementation condenses BIND_OPCODE_DO_BIND_ADD_ADDR_ULEB opcode to BIND_OPCODE_DO_BIND_ADD_ADDR_IMM_SCALED. This change is already behind an O2 flag so it shouldn't impact current performance. I verified ld64's output with x86_64 LLD and they were both emitting the same optimized bind opcodes (although in a slightly different order). Tested with arm64_32 LLD and compared that with x86 LLD that the order of the bind opcodes are the same (offset values are different which should be expected). Reviewed By: int3, #lld-macho, MaskRay Differential Revision: https://reviews.llvm.org/D106128	2021-07-20 13:45:24 -07:00
Fangrui Song	db5e078690	[LTO] Add SelectionKind to IRSymtab and use it in ld.lld/LLVMgold In PGO, a C++ external linkage function `foo` has a private counter `__profc_foo` and a private `__profd_foo` in a `comdat nodeduplicate`. A `__attribute__((weak))` function `foo` has a weak hidden counter `__profc_foo` and a private `__profd_foo` in a `comdat nodeduplicate`. In `ld.lld a.o b.o`, say a.o defines an external linkage `foo` and b.o defines a weak `foo`. Currently we treat `comdat nodeduplicate` as `comdat any`, ld.lld will incorrectly consider `b.o:__profc_foo` non-prevailing. In the worst case when `b.o:__profd_foo` is retained and `b.o:__profc_foo` isn't, there will be dangling reference causing an `undefined hidden symbol` error. Add SelectionKind to `Comdat` in IRSymtab and let linkers ignore nodeduplicate comdat. Differential Revision: https://reviews.llvm.org/D106228	2021-07-20 13:22:00 -07:00
Sam Clegg	d51f74acdf	[lld][WebAssembly] Error on import of TLS symbols in shared libraries In https://reviews.llvm.org/D102044 we made exporting a TLS symbol into an error, but we also want to error on import. See https://github.com/emscripten-core/emscripten/issues/14461 Differential Revision: https://reviews.llvm.org/D106385	2021-07-20 12:36:03 -07:00
Sam Clegg	f428693de0	Reland "[lld][WebAssembly] Cleanup duplicate fields in Symbols.h. NFC" This avoids duplication and simplifies the code in several places without increasing the size of the symbol union (at least not above the assert'd limit of 120 bytes). Originally commit: `9b965b37c7` Reverted in: `16aac493e5`. Differential Revision: https://reviews.llvm.org/D106026	2021-07-20 12:13:08 -07:00
Fangrui Song	88e2268a34	Revert D106128 "[lld-macho] Use DO_BIND_ADD_ADDR_IMM_SCALED for bind opcodes" This reverts commit `321b2bef09`. `for (BindIR *p = &opcodes[0]; p->opcode != BIND_OPCODE_DONE; ++p) {` has a heap-buffer-overflow with test/MachO/bind-opcodes.	2021-07-19 18:13:52 -07:00
Fangrui Song	16aac493e5	Revert D105519 "[WebAssembly] Deduplicate imports of the same module name, field name, and type" and its followup This reverts commit `4ae575b999` and `9b965b37c7`. There is a use-of-uninitialized-value bug in the `else` branch in ImportSection::addImport.	2021-07-19 17:09:01 -07:00
Vincent Lee	321b2bef09	[lld-macho] Use DO_BIND_ADD_ADDR_IMM_SCALED for bind opcodes Implement pass 3 of bind opcodes from ld64 (which supports both 32-bit and 64-bit). Pass 3 implementation condenses BIND_OPCODE_DO_BIND_ADD_ADDR_ULEB opcode to BIND_OPCODE_DO_BIND_ADD_ADDR_IMM_SCALED. This change is already behind an O2 flag so it shouldn't impact current performance. I verified ld64's output with x86_64 LLD and they were both emitting the same optimized bind opcodes (although in a slightly different order). Tested with arm64_32 LLD and compared that with x86 LLD that the order of the bind opcodes are the same (offset values are different which should be expected). Reviewed By: int3, #lld-macho Differential Revision: https://reviews.llvm.org/D106128	2021-07-19 16:18:33 -07:00
Sam Clegg	9b965b37c7	[lld][WebAssembly] Cleanup duplicate fields in Symbols.h. NFC This avoids duplication and simplifies the code in several places without increasing the size of the symbol union (at least not above the assert'd limit of 120 bytes). Differential Revision: https://reviews.llvm.org/D106026	2021-07-19 14:31:09 -07:00
Derek Schuff	ad1f5457d2	[WebAssembly] Generate R_WASM_FUNCTION_OFFSET relocs in debuginfo sections Debug info sections need R_WASM_FUNCTION_OFFSET_I32 relocs (with FK_Data_4 fixup kinds) to refer to functions (instead of R_WASM_TABLE_INDEX as is used in data sections). Usually this is done in a convoluted way, with unnamed temp data symbols which target the start of the function, in which case WasmObjectWriter::recordRelocation converts it to use the section symbol instead. However in some cases the function can actually be undefined; in this case the dwarf generator uses the function symbol (a named undefined function symbol) instead. In that case the section-symbol transform doesn't work and we need to generate the correct reloc type a different way. In this change WebAssemblyWasmObjectWriter::getRelocType takes the fixup section type into account to choose the correct reloc type. Fixes PR50408 Differential Revision: https://reviews.llvm.org/D103557	2021-07-19 14:02:33 -07:00
Nick Fitzgerald	4ae575b999	[WebAssembly] Deduplicate imports of the same module name, field name, and type When two symbols import the same thing, only one import should be emitted in the Wasm file. Fixes https://bugs.llvm.org/show_bug.cgi?id=50938 Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D105519	2021-07-19 13:59:02 -07:00
Leonard Grey	6ef37b640d	[lld/mac] Add test for --lto-O This belongs to `fe08e9c487`, I (thakis) forgot to `git add` it back then. Differential Revision: https://reviews.llvm.org/D105223	2021-07-19 16:45:33 -04:00
Nico Weber	fbb45947b2	[lld/mac] Resolve defined symbols before undefined symbols Ports https://reviews.llvm.org/D95985 to the MachO port. Happens to fix PR51135; see that bug for details. Also makes lld's behavior match ld64 for the included test case. Differential Revision: https://reviews.llvm.org/D106293	2021-07-19 16:37:41 -04:00
Nico Weber	bcbb3066ce	[lld/mac] Change load command order to be more like ld64 No meaningful behavior change. Makes diffing `otool -l` output a bit easier. Differential Revision: https://reviews.llvm.org/D106219	2021-07-19 15:04:32 -04:00
Wouter van Oortmerssen	670944fb20	[WebAssembly] Support R_WASM_MEMORY_ADDR_TLS_SLEB64 for wasm64 Also fixed TLS tests swapping addr & value in store op Differential Revision: https://reviews.llvm.org/D106096	2021-07-19 10:22:43 -07:00
Jez Ng	428a7c1b38	[lld-macho] Have ICF operate on all sections at once ICF previously operated only within a given OutputSection. We would merge all CFStrings first, then merge all regular code sections in a second phase. This worked fine since CFStrings would never reference regular `__text` sections. However, I would like to expand ICF to merge functions that reference unwind info. Unwind info references the LSDA section, which can in turn reference the `__text` section, so we cannot perform ICF in phases. In order to have ICF operate on InputSections spanning multiple OutputSections, we need a way to distinguish InputSections that are destined for different OutputSections, so that we don't fold across section boundaries. We achieve this by creating OutputSections early, and setting `InputSection::parent` to point to them. This is what LLD-ELF does. (This change should also make it easier to implement the `section$start$` symbols.) This diff also folds InputSections w/o checking their flags, which I think is the right behavior -- if they are destined for the same OutputSection, they will have the same flags in the output (even if their input flags differ). I.e. the `parent` pointer check subsumes the `flags` check. In practice this has nearly no effect (ICF did not become any more effective on chromium_framework). I've also updated ICF.cpp's block comment to better reflect its current status. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D105641	2021-07-17 13:42:51 -04:00
Fangrui Song	fa3231eb18	[COFF][test] Fix llvm-readobj tests	2021-07-16 13:28:46 -07:00
Fangrui Song	8f806d5f52	[test] Avoid llvm-readelf/llvm-readobj one-dash long options	2021-07-16 12:03:07 -07:00
Fangrui Song	3c9d86f951	[ELF][test] Avoid llvm-readelf/llvm-readobj one-dash long options	2021-07-16 10:02:47 -07:00
Vincent Lee	d695d0d6f6	[lld-macho] Optimize bind opcodes with multiple passes In D105866, we used an intermediate container to store a list of opcodes. Here, we use that data structure to help us perform optimization passes that would allow a more efficient encoding of bind opcodes. Currently, the functionality mirrors the optimization pass {1,2} done in ld64 for bind opcodes under optimization gate to prevent slight regressions. Reviewed By: int3, #lld-macho Differential Revision: https://reviews.llvm.org/D105867	2021-07-15 20:52:46 -07:00
Vincent Lee	f2b1264141	[lld-macho] Use intermediate arrays to store opcodes We want to incorporate some of the optimization passes in bind opcodes from ld64. This revision makes no functional changes but to start storing opcodes in intermediate containers in preparation for implementing the optimization passes in a follow-up revision. Differential Revision: https://reviews.llvm.org/D105866	2021-07-15 16:57:45 -07:00
Fangrui Song	f8cb78e99a	[ELF] Don't define __rela_iplt_start for -pie/-shared `clang -fuse-ld=lld -static-pie -fpie` produced executable currently crashes and this patch makes it work. See https://sourceware.org/bugzilla/show_bug.cgi?id=27164 and https://sourceware.org/pipermail/libc-alpha/2021-July/128810.html While it seems unreasonable to keep csu/libc-start.c ARCH_APPLY_IREL unclear in static-pie mode and have an unneeded diff -u =(ld.bfd --verbose) =(ld.bfd -pie --verbose) difference, glibc folks don't want to fix their code. I feel sad about that but this patch can remove an iffy condition for lld/ELF as well: `needsInterpSection()`.	2021-07-15 11:31:11 -07:00
Fangrui Song	80f9fd4ce3	[ELF][test] Rework non-preemptible ifunc tests	2021-07-15 11:31:05 -07:00
Fangrui Song	aa3df8ddcd	[test] Avoid llvm-readelf/llvm-readobj one-dash long options and deprecated aliases (e.g. --file-headers)	2021-07-15 10:26:21 -07:00
Wouter van Oortmerssen	4157b6033d	[WebAssembly] Fixed LLD generation of 64-bit __wasm_apply_data_relocs Differential Revision: https://reviews.llvm.org/D105863	2021-07-15 10:02:02 -07:00
Leonard Grey	c931ff72bd	[lld-macho] Add LTO cache support This adds support for the lld-only `--thinlto-cache-policy` option, as well as implementations for ld64's `-cache_path_lto`, `-prune_interval_lto`, `-prune_after_lto`, and `-max_relative_cache_size_lto`. Test is adapted from lld/test/ELF/lto/cache.ll Differential Revision: https://reviews.llvm.org/D105922	2021-07-15 12:56:13 -04:00
Fangrui Song	7299c6f635	[test] Avoid llvm-nm one-dash long options	2021-07-15 09:50:36 -07:00
Fangrui Song	7de2173c2a	[ELF] --fortran-common: prefer STB_WEAK to COMMON The ELF specification says "The link editor honors the common definition and ignores the weak ones." GNU ld and our Symbol::compare follow this, but the --fortran-common code (D86142) made a mistake on the precedence. Fixes https://bugs.llvm.org/show_bug.cgi?id=51082 Reviewed By: peter.smith, sfertile Differential Revision: https://reviews.llvm.org/D105945	2021-07-14 10:18:30 -07:00
Alexander Shaposhnikov	d21772fa21	[lld][MachO] Code cleanup Make use of ArgList::getLastArgValue. NFC. Test plan: make check-lld-macho Differential revision: https://reviews.llvm.org/D105452	2021-07-14 04:33:09 -07:00
Alexander Yermolovich	24129fbc9a	[LLD] Adding support for RELA for CG Profile. This is a follow up to https://reviews.llvm.org/D104080, and `ca3bdb57fa (diff-e64a48fabe31db213a631fdc5f2acb51bdddf3f16a8fb2928784f4c579229585)`. The implementation of call graph profile was changed from a black box section to relocation approach. This was done to be compatible with post processing tools like strip/objcopy, and llvm equivalent. When they are invoked on object file before the final linking step with this new approach the symbol indices correctness is preserved. The GNU binutils tools change the REL section to RELA section, unlike llvm tools. For example when strip -S is run on the ELF object files, as an intermediate step before linking. To preserve compatibility this patch extends implementation in LLD and ELFDumper to support both REL and RELA sections for call graph profile. Reviewed By: MaskRay, jhenderson Differential Revision: https://reviews.llvm.org/D105217	2021-07-13 13:56:30 -07:00
Hafiz Abid Qadeer	fb9c5c3dce	[lld][AMDGPU] Handle R_AMDGPU_REL16 relocation. This patch is a followup patch to https://reviews.llvm.org/D105760 which adds this relocation. This handles the relocation in lld. The s_branch family of instructions does the following: PC = PC + signext(simm * 4) + 4 so we do the opposite on the target address before writing it in the instruction stream. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D105761	2021-07-13 20:41:11 +01:00
Wouter van Oortmerssen	b568c11b40	[WebAssembly] Fixed LLD generation of 64-bit __wasm_init_memory Differential Revision: https://reviews.llvm.org/D105849	2021-07-12 15:26:11 -07:00
Nico Weber	f21801dab2	[lld/mac] Implement -application_extension Differential Revision: https://reviews.llvm.org/D105818	2021-07-12 13:42:16 -04:00
Nico Weber	396f2e9d6d	[lld/mac] Make tbd files in one test valid No behavior change, but ld64 can't load .tbd files without the trailing `...`, so include them to make it easier to run tests with ld64 too.	2021-07-12 11:13:54 -04:00
Jez Ng	0fb299072c	[lld-macho][nfc] Fix YAML input in compact-unwind-sym-relocs.s * Adjust strsize so llvm-objdump doesn't complain about it extending past the end of file * Remove symbol that was referencing a deleted section * Adjust n_sect of the remaining `_main` symbol to point at the right section	2021-07-11 21:36:24 -04:00
Jez Ng	11a0d23650	[lld-macho][nfc] clang-format	2021-07-11 18:36:59 -04:00
Jez Ng	28a2102ee3	[lld-macho][nfc] Remove unnecessary llvm:: namespace prefixes	2021-07-11 18:36:53 -04:00
Jez Ng	f6e84a84f9	[lld-macho][nfc] Avoid using std::map for PlatformKinds The mappings we were using had a small number of keys, so a vector is probably better. This allows us to remove the last usage of std::map in our codebase. I also used `removeSimulator` to simplify the code a bit further. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D105786	2021-07-11 18:24:53 -04:00
Nico Weber	c10947b5f8	[lld/mac] Unbreak objc.s after `6e05c1cd5f`	2021-07-11 13:57:15 -04:00
Nico Weber	6e05c1cd5f	[lld/mac] Always reference dyld_stub_binder when linked with libSystem lld currently only references dyld_stub_binder when it's needed. ld64 always references it when libSystem is linked. Match ld64. The (somewhat lame) motivation is that `nm` on a binary without any export writes a "no symbols" warning to stderr, and this change makes it so that every binary in practice has at least a reference to dyld_stub_binder, which suppresses that. Every "real" output file will reference dyld_stub_binder, so most of the time this shouldn't make much of a difference. And if you really don't want to have this reference for whatever reason, you can stop passing -lSystem, like you have to for ld64 anyways. (After linking any dylib, we dump the exported list of symbols to a txt file with `nm` and only relink downstream deps if that txt file changes. A nicer fix is to make lld optionally write .tbd files with the public interface of a linked dylib and use that instead, but for now the txt files are what we do.) Differential Revision: https://reviews.llvm.org/D105782	2021-07-11 13:37:48 -04:00
Nico Weber	10e28a7484	[lld/mac] Use normal Undefined machinery for dyld_stub_binder lookup This is for aesthetic reasons, I'm not aware of anything that needs this in practice. It does have a few effects: - `-undefined dynamic_lookup` now has an effect for dyld_stub_binder. This matches ld64. - `-U dyld_stub_binder` now works like you'd expect (it doesn't work in ld64). - The error message for a missing dyld_stub_binder symbol now looks like other undefined reference symbols, it changes from symbol dyld_stub_binder not found (normally in libSystem.dylib). Needed to perform lazy binding. to error: undefined symbol: dyld_stub_binder >>> referenced by lazy binding (normally in libSystem.dylib) Also add test coverage for that error message. But in practice, this should have no interesting effects since everything links in dyld_stub_binder via libSystem anyways. Differential Revision: https://reviews.llvm.org/D105781	2021-07-11 12:48:59 -04:00
Jez Ng	d5c0b9c848	[lld-macho][nfc] Expand the compact unwind symbol reloc test Add a bit more detail to the comments, and check that the final binary does indeed have a `__unwind_info` section (D105557 previously regressed this). Also rename the test to emphasize that we are testing relocations in compact unwind, not relocations in general.	2021-07-11 00:35:05 -04:00
Vy Nguyen	3822e3d5b0	[lld-macho] Fix bug in handling unwind info from ld -r Two changes: - Drop assertions that all symbols are in GOT - Set allEntriesAreOmitted correctly Related bug: 50812 Differential Revision: https://reviews.llvm.org/D105364	2021-07-09 22:44:51 -04:00
Wouter van Oortmerssen	9647a6f719	[WebAssembly] Added initial type checker to MC Assembler This is to protect against non-sensical instruction sequences being assembled, which would either cause asserts/crashes further down, or a Wasm module being output that doesn't validate. Unlike a validator, this type checker is able to give type-errors as part of the parsing process, which makes the assembler much friendlier to be used by humans writing manual input. Because the MC system is single pass (instructions aren't even stored in MC format, they are directly output) the type checker has to be single pass as well, which means that from now on .globaltype and .functype decls must come before their use. An extra pass is added to Codegen to collect information for this purpose, since AsmPrinter is normally single pass / streaming as well, and would otherwise generate this information on the fly. A `-no-type-check` flag was added to llvm-mc (and any other tools that take asm input) that suppresses type errors, as a quick escape hatch for tests that were not intended to be type correct. This is a first version of the type checker that ignores control flow, i.e. it checks that types are correct along the linear path, but not the branch path. This will still catch most errors. Branch checking could be added in the future. Differential Revision: https://reviews.llvm.org/D104945	2021-07-09 14:07:25 -07:00
Simon Pilgrim	1440d4564f	Fix MSVC "not all control paths return a value" warning. NFCI.	2021-07-09 12:07:34 +01:00
Alex Richardson	cc7cb9523e	[ELF][AArch64] Write addends for TLSDESC relocations with -z rel Since D100490 this case is diagnosed for -z rel. This commit implements R_AARCH64_TLSDESC cases for AArch64::getImplicitAddend() and AArch64::relocate(). However, there are probably further relocation types that need to be handled for full support of -z rel. Fixes https://bugs.llvm.org/show_bug.cgi?id=47009 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D100544	2021-07-09 10:41:41 +01:00
Alex Richardson	97fe637539	[ELF] Implement RISCV::getImplicitAddend() This allows checking dynamic relocation addends for -z rel and --apply-dynamic-relocs output. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101455	2021-07-09 10:41:40 +01:00
Alex Richardson	e564932842	[ELF] Write R_RISCV_IRELATIVE addends with -z rel I found this missing case with the new --check-dynamic-relocation flag while running the lld tests with --apply-dynamic-relocs enabled by default. This is the same as D101452 just for RISC-V Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101454	2021-07-09 10:41:40 +01:00
Alex Richardson	79332fb722	[ELF] Write R_X86_64_IRELATIVE addends with -z rel I found this missing case with the new --check-dynamic-relocation flag while running the lld tests with --apply-dynamic-relocs enabled by default. This also fixes a broken CHECK in lld/test/ELF/x86-64-gotpc-relax.s: The test wasn't using CHECK-NEXT, so it was passing despite the output actually containing relocations. I am not sure when this changed, but I think this behaviour is correct. Found with D101450 + enabling --apply-dynamic-relocs by default. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101452	2021-07-09 10:41:40 +01:00
Alex Richardson	f4b0c9abfb	[ELF] Implement X86_64::getImplicitAddend() This allows checking dynamic relocation addends for -z rel and --apply-dynamic-relocs output. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101451	2021-07-09 10:41:40 +01:00
Alex Richardson	35c5e564e6	[ELF] Check the Elf_Rel addends for dynamic relocations There used to be many cases where addends for Elf_Rel were not emitted in the final object file (mostly when building for MIPS64 since the input .o files use RELA but the output uses REL). These cases have been fixed since, but this patch adds a check to ensure that the written values are correct. It is based on a previous patch that I added to the CHERI fork of LLD since we were using MIPS64 as a baseline. The work has now almost entirely shifted to RISC-V and Arm Morello (which use Elf_Rela), but I thought it would be useful to upstream our local changes anyway. This patch adds a (hidden) command line flag --check-dynamic-relocations that can be used to enable these checks. It is also on by default in assertions builds for targets that handle all dynamic relocations kinds that LLD can emit in Target::getImplicitAddend(). Currently this is enabled for ARM, MIPS, and I386. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101450	2021-07-09 10:41:40 +01:00
Alex Richardson	6d87ca08ae	[ELF] Refactor DynamicReloc to fix incorrect relocation addends This patch changes the DynamicReloc class to store an enum instead of the overloaded useSymVA member to make it easier to understand and fix incorrect addends being written in some corner cases. The change is motivated by a follow-up review that checks the value of implicit Elf_Rel addends written to the output file. This patch fixes an incorrect output when using `-z rela` for i386 files with R_386_GOT32 relocations (not that this really matters since it's an unsupported configuration). Storing the relocation expression kind also addresses an incorrect addend FIXME in ppc64-abs64-dyn.s introduced in D63383. DynamicReloc now also has a special case for the MIPS TLS relocations (DynamicReloc::AgainstSymbolWithTargetVA) since the R_MIPS_TLS_TPREL{32/64} the symbol VA to the GOT for preemptible symbols. I'm not sure if the symbol value actually should be written for R_MIPS_TLS_TPREL32, but this patch does not attempt to change that behaviour. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D100490	2021-07-09 10:41:40 +01:00
David Blaikie	1def2579e1	PR51018: Remove explicit conversions from SmallString to StringRef to future-proof against C++23 C++23 will make these conversions ambiguous - so fix them to make the codebase forward-compatible with C++23 (& a follow-up change I've made will make this ambiguous/invalid even in <C++23 so we don't regress this & it generally improves the code anyway)	2021-07-08 13:37:57 -07:00
Mikael Holmen	21fd875952	[lld/mac] Fix warning about unused variable [NFC] Change "dyn_cast" to "isa" to get rid of the unused variable "bitcodeFile". gcc warned with lld/MachO/Driver.cpp:531:17: warning: unused variable 'bitcodeFile' [-Wunused-variable] 531 | if (auto *bitcodeFile = dyn_cast<BitcodeFile>(file)) { | ^~~~~~~~~~~	2021-07-08 09:46:30 +02:00
Thomas Lively	0fd5e7b2d8	[WebAssembly][lld] Fix segfault on .bss sections in mapfile When memory is declared in the Wasm module, we rely on the implicit zero initialization behavior and do not explicitly output .bss sections. This means that they do not have associated `outputSec` entries, which was causing segfaults in the mapfile support. Fix the issue by guarding against null `outputSec` and falling back to using a zero offset. Differential Revision: https://reviews.llvm.org/D102951	2021-07-07 23:31:48 -07:00
Jeremy Drake	7a7da69fbe	[LLD] [COFF] Avoid thread exhaustion on 32-bit Windows host LLD on 32-bit Windows would frequently fail on large projects with an exception "thread constructor failed: Exec format error". The stack trace pointed to this usage of std::async, and looking at the implementation in libc++ it seems using std::async with std::launch::async results in the immediate creation of a new thread for every call. This could result in a potentially unbounded number of threads, depending on the number of input files. This seems to be hitting some limit in 32-bit Windows host. I took the easy route, and only use threads on 64-bit Windows, not all Windows as before. I was thinking a more proper solution might involve using a thread pool rather than blindly spawning any number of new threads, but that may have other unforeseen consequences. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D105506	2021-07-07 22:00:18 +03:00
Vy Nguyen	e25a384055	[lld-macho][nfc] Rename test file to be more descriptive (rather than referencing the bug number) Differential Revision: https://reviews.llvm.org/D105559	2021-07-07 13:15:55 -04:00
Nico Weber	8a7b5ebf4d	[lld/mac] Don't crash when dead-stripping removes all unwind info If the input has compact unwind info but all of it is removed after dead stripping, we would crash. Now we don't write any __unwind_info section at all, like ld64. This is a bit awkward to implement because we only know the final state of unwind info after UnwindInfoSectionImpl<Ptr>::finalize(), which is called after sections are added. So add a small amount of bookkeeping to relocateCompactUnwind() instead (which runs earlier) so that we can predict what finalize() will do before it runs. Fixes PR51010. Differential Revision: https://reviews.llvm.org/D105557	2021-07-07 13:05:40 -04:00
Nico Weber	d7e65757ed	[lld/mac] Tweak reserve() argument in unwind code addEntriesForFunctionsWithoutUnwindInfo() can add entries to cuVector, so cuCount can be stale. Use cuVector.size() instead. No behavior change.	2021-07-07 11:44:22 -04:00
Nico Weber	76f734040a	[lld/mac] Give several LTO tests an "lto-" prefix Differential Revision: https://reviews.llvm.org/D105476	2021-07-06 15:23:42 -04:00
Nico Weber	3eb2fc4b50	[lld/mac] Partially implement -export_dynamic This implements the part of -export_dynamic that adds external symbols as dead strip roots even for executables. It does not yet implement the effect -export_dynamic has for LTO. I tried just replacing `config->outputType != MH_EXECUTE` with `(config->outputType != MH_EXECUTE || config->exportDynamic)` in LTO.cpp, but then local symbols make it into the symbol table too, which is too much (and also doesn't match ld64). So punt on this for now until I understand it better. (D91583 may or may not be related too). Differential Revision: https://reviews.llvm.org/D105482	2021-07-06 11:22:18 -04:00
Nico Weber	64be5b7d87	[lld/mac] Implement -arch_multiple This is the other flag clang passes when calling clang with two -arch flags (which means with this, `clang -arch x86_64 -arch arm64 -fuse-ld=lld ...` now no longer prints any warnings \o/). Since clang calls the linker several times in that setup, it's not clear to the user from which invocation the errors are. The flag's help text is Specifies that the linker should augment error and warning messages with the architecture name. In ld64, the only effect of the flag is that undefined symbols are prefaced with Undefined symbols for architecture x86_64: instead of the usual "Undefined symbols:". So for now, let's add this only to undefined symbol errors too. That's probably the most common linker diagnostic. Another idea would be to prefix errors and warnings with "ld64.lld(x86_64):" instead of the usual "ld64.lld:", but I'm not sure if people would misunderstand that as a comment about the arch of ld itself. But open to suggestions on what effect this flag should have :) And we don't have to get it perfect now, we can iterate on it. Differential Revision: https://reviews.llvm.org/D105450	2021-07-06 00:25:18 -04:00
Nico Weber	2c25f39fcc	[lld/mac] Implement -final_output This is one of two flags clang passes to the linker when calling clang with multiple -arch flags. I think it'd make sense to also use finalOutput instead of outputFile in CodeSignatureSection() and when replacing @executable_path, but ld64 doesn't do that, so I'll at least put those in separate commits. Differential Revision: https://reviews.llvm.org/D105449	2021-07-05 20:06:26 -04:00
Nico Weber	db64306d99	[lld/mac] Implement -umbrella I think this is an old way for doing what is done with -reexport_library these days, but it's e.g. still used in libunwind's build (the opensource.apple.com one, not the llvm one). Differential Revision: https://reviews.llvm.org/D105448	2021-07-05 20:06:25 -04:00
Jez Ng	718c32175b	[lld-macho] Only emit one BIND_OPCODE_SET_SYMBOL per symbol Size-wise, BIND_OPCODE_SET_SYMBOL_TRAILING_FLAGS_IMM is the most expensive opcode, since it comes with an associated symbol string. We were previously emitting it once per binding, instead of once per symbol. This diff groups all bindings for a given symbol together and ensures we only emit one such opcode per symbol. This matches ld64's behavior. While this is a relatively small win on chromium_framework (-72KiB), for programs that have more dynamic bindings, the difference can be quite large. This change is perf-neutral when linking chromium_framework. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D105075	2021-07-05 20:00:19 -04:00
Jez Ng	4aaf878750	[lld-macho][nfc] Add REQUIRES: x86 to test I didn't realize that llvm-objdump's features were arch-specific. This should fix the non-x86 buildbots.	2021-07-05 03:40:54 -04:00
Jez Ng	bcaf57cae8	[lld-macho] Parse relocations quickly by assuming sorted order clang and gcc both seem to emit relocations in reverse order of address. That means we can match relocations to their containing subsections in `O(relocs + subsections)` rather than the `O(relocs * log(subsections))` that our previous binary search implementation required. Unfortunately, `ld -r` can still emit unsorted relocations, so we have a fallback code path for that (less common) case. Numbers for linking chromium_framework on my 3.2 GHz 16-Core Intel Xeon W: N Min Max Median Avg Stddev x 20 4.04 4.11 4.075 4.0775 0.018027756 + 20 3.95 4.02 3.98 3.985 0.020900768 Difference at 95.0% confidence -0.0925 +/- 0.0124919 -2.26855% +/- 0.306361% (Student's t, pooled s = 0.0195172) Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D105410	2021-07-05 01:13:44 -04:00
Nico Weber	9e24979d73	[lld/mac] Fix function offset on 1st-level unwind table sentinel Two bugs: 1. This tries to take the address of the last symbol plus the length of the last symbol. However, the sorted vector is cuPtrVector, not cuVector. Also, cuPtrVector has tombstone values removed and cuVector doesn't. If there was a stripped value at the end, the "last" element's value was UINT64_MAX, which meant the sentinel value was one less than the length of that "last" dead symbol. 2. We have to subtract in.header->addr. For 64-bit binaries that's (1 << 32) and functionAddress is 32-bit so this is a no-op, but for 32-bit binaries the sentinel's value was too large. I believe this has no effect in practice since the first-level binary search code in libunwind (in UnwindCursor.hpp) does: uint32_t low = 0; uint32_t high = sectionHeader.indexCount(); uint32_t last = high - 1; while (low < high) { uint32_t mid = (low + high) / 2; if ((mid == last) || (topIndex.functionOffset(mid + 1) > targetFunctionOffset)) { low = mid; break; } else { low = mid + 1; } So the address of the last entry in the first-level table isn't really checked -- except for the very end, but the check against `last` means we just run the loop once more than necessary. But it makes `unwinddump` output look less confusing, and it's what it looks like was the intention here. (No test since I can't think of a way to make FileCheck check that one number is larger than another.) Differential Revision: https://reviews.llvm.org/D105404	2021-07-04 18:06:20 -04:00
Nico Weber	d2d6da3011	[lld/mac] Don't crash on 32-bit output binaries when dead-stripping Fixes PR50974. Differential Revision: https://reviews.llvm.org/D105399	2021-07-04 18:03:31 -04:00
David Blaikie	bf7f846b68	Fix test so it doesn't try to write to the test directory, only to %t	2021-07-02 14:59:50 -07:00
Vy Nguyen	c7c5a1c9ae	[lld-macho] Ignore debug symbols while preparing relocations. Details: see https://bugs.llvm.org/show_bug.cgi?id=50812 Differential Revision: https://reviews.llvm.org/D105210	2021-07-02 13:51:46 -04:00
Martin Storsjö	ce211c505b	[LLD] [COFF] Fix up missing stdcall decorations in MinGW mode If linking directly against a DLL without an import library, the DLL export symbols might not contain stdcall decorations. If we have an undefined symbol with decoration, and we happen to have a matching undecorated symbol (which either is lazy and can be loaded, or already defined), then alias it against that instead. This matches what's done in reverse, when we have a def file declaring to export a symbol without decoration, but we only have a defined decorated symbol. In that case we do a fuzzy match (SymbolTable::findMangle). This case is more straightforward; if we have a decorated undefined symbol, just strip the decoration and look for the corresponding undecorated symbol name. Add warnings and options for either silencing the warning or disabling the whole feature, corresponding to how ld.bfd does it. (This feature works for any symbol decoration mismatch, not only when linking against a DLL directly; ld.bfd also tolerates it anywhere, and also fixes up mismatches in the other direction, like SymbolTable::findMangle, for any symbol, not only exports. But in practice, at least for lld, it would primarily end up used for linking against DLLs.) Differential Revision: https://reviews.llvm.org/D104532	2021-07-02 09:49:14 +03:00
Martin Storsjö	c09e5e50b1	[LLD] [MinGW] Allow linking to DLLs directly As the COFF linker is capable of linking directly against a DLL now (after D104530, as long as it is running in mingw mode), don't error out here but successfully load libraries specified with "-l" from DLLs if that's what ld.bfd would have matched. Differential Revision: https://reviews.llvm.org/D104531	2021-07-02 09:49:13 +03:00
Martin Storsjö	a9ff1ce1b9	[LLD] [COFF] Support linking directly against DLLs in MinGW mode GNU ld.bfd supports linking directly against DLLs without using an import library, and some projects have picked up on this habit. (There's no one single unsurmountable issue with using import libraries, but this is a regularly surfacing missing feature.) As long as one is linking by name (instead of by ordinal), the DLL export table contains most of the information needed. (One can inspect what section a symbol points at, to see if it's a function or data symbol. The practical implementation of this loops over all sections for each symbol, but as long as they're not very many, that should hopefully be tolerable performance wise.) One exception where the information in the DLL isn't entirely enough is on i386 with stdcall functions; depending on how they're done, the exported function name can be a plain undecorated name, while the import library would contain the full decorated symbol name. This issue is addressed separately in a different patch. This is implemented mimicking the structure of a regular import library, with one InputFile corresponding to the static archive that just adds lazy symbols, which then are fetched when they are needed. When such a symbol is fetched, we synthesize a coff_import_header structure in memory and create a regular ImportFile out of it. The implementation could be even smaller by just creating ImportFiles for every symbol available immediately, but that would have the drawback of actually ending up importing all symbols unless running with GC enabled (and mingw mode defaults to having it disabled for historical reasons). Differential Revision: https://reviews.llvm.org/D104530	2021-07-02 09:49:13 +03:00
Jez Ng	f6b6e72143	[lld-macho] Factor out common InputSection members We have been creating many ConcatInputSections with identical values due to .subsections_via_symbols. This diff factors out the identical values into a Shared struct, to reduce memory consumption and make copying cheaper. I also changed `callSiteCount` from a uint32_t to a 31-bit field to save an extra word. All in all, this takes InputSection from 120 to 72 bytes (and ConcatInputSection from 160 to 112 bytes), i.e. 30% size reduction in ConcatInputSection. Numbers for linking chromium_framework on my 3.2 GHz 16-Core Intel Xeon W: N Min Max Median Avg Stddev x 20 4.14 4.24 4.18 4.183 0.027548999 + 20 4.04 4.11 4.075 4.0775 0.018027756 Difference at 95.0% confidence -0.1055 +/- 0.0149005 -2.52211% +/- 0.356215% (Student's t, pooled s = 0.0232803) Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D105305	2021-07-01 21:22:39 -04:00
Jez Ng	08715e6c47	[lld-macho][nfc] Remove unnecessary vertical spacing This makes NonLazyPointerSectionBase's style more in line with the rest of the classes in its file.	2021-07-01 21:22:38 -04:00
Jez Ng	ac2dd06b91	[lld-macho] Deduplicate CFStrings `__cfstring` is a special literal section, so instead of breaking it up at symbol boundaries, we break it up at fixed-width boundaries (since each literal is the same size). Symbols can only occur at one of those boundaries, so this is strictly more powerful than `.subsections_via_symbols`. With that in place, we then run the section through ICF. This change is about perf-neutral when linking chromium_framework. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D105045	2021-07-01 21:22:38 -04:00
Jez Ng	3a11528d97	[lld-macho] Move ICF earlier to avoid emitting redundant binds This is a pretty big refactoring diff, so here are the motivations: Previously, ICF ran after scanRelocations(), where we emit bind/rebase opcodes etc. So we had a bunch of redundant leftovers after ICF. Having ICF run before Writer seems like a better design, and is what LLD-ELF does, so this diff refactors it accordingly. However, ICF had two dependencies on things occurring in Writer: 1) it needs literals to be deduplicated beforehand and 2) it needs to know which functions have unwind info, which was being handled by `UnwindInfoSection::prepareRelocations()`. In order to do literal deduplication earlier, we need to add literal input sections to their corresponding output sections. So instead of putting all input sections into the big `inputSections` vector, and then filtering them by type later on, I've changed things so that literal sections get added directly to their output sections during the 'gather' phase. Likewise for compact unwind sections -- they get added directly to the UnwindInfoSection now. This latter change is not strictly necessary, but makes it easier for ICF to determine which functions have unwind info. Adding literal sections directly to their output sections means that we can no longer determine `inputOrder` from iterating over `inputSections`. Instead, we store that order explicitly on InputSection. Bloating the size of InputSection for this purpose would be unfortunate -- but LLD-ELF has already solved this problem: it reuses `outSecOff` to store this order value. One downside of this refactor is that we now make an additional pass over the unwind info relocations to figure out which functions have unwind info, since we want to know that before `processRelocations()`. I've made sure to run that extra loop only if ICF is enabled, so there should be no overhead in non-optimizing runs of the linker. 
The upside of all this is that the `inputSections` vector now contains only ConcatInputSections that are destined for ConcatOutputSections, so we can clean up a bunch of code that just existed to filter out other elements from that vector. I will test for the lack of redundant binds/rebases in the upcoming cfstring deduplication diff. While binds/rebases can also happen in the regular `.text` section, they're more common in `.data` sections, so it seems more natural to test it that way. This change is perf-neutral when linking chromium_framework. Reviewed By: oontvoo Differential Revision: https://reviews.llvm.org/D105044	2021-07-01 21:22:38 -04:00
Leonard Grey	fe08e9c487	[lld-macho] Add support for LTO optimization level Everything (including test) modified from ELF/COFF. Using the same syntax (--lto-O3, etc) as ELF. Differential Revision: https://reviews.llvm.org/D105223	2021-07-01 15:01:59 -04:00
Jez Ng	b41b4148e7	[lld-macho] Only enable `__DATA_CONST` for newer platforms Matches ld64. Reviewed By: #lld-macho, alexander-shaposhnikov Differential Revision: https://reviews.llvm.org/D105080	2021-06-30 18:55:48 -04:00
Jez Ng	0d6d35e63b	[lld-macho] -section_rename should work on synthetic sections too Previously, we only applied the renames to ConcatOutputSections. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D105079	2021-06-30 18:55:48 -04:00
Fangrui Song	03051f7ac8	[ELF] Preserve section order within an INSERT AFTER command For ``` SECTIONS { text.0 : {} text.1 : {} text.2 : {} } INSERT AFTER .data; ``` the current order is `.data text.2 text.1 text.0`. It makes more sense to preserve the specified order and thus improve compatibility with GNU ld. For ``` SECTIONS { text.0 : {} } INSERT AFTER .data; SECTIONS { text.3 : {} } INSERT AFTER .data; ``` GNU ld somehow collects sections with `INSERT AFTER .data` together (IMO inconsistent) but I think it makes more sense to execute the commands in order and get `.data text.3 text.0` instead. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D105158	2021-06-30 11:35:50 -07:00
Fangrui Song	7b06bfc49e	[ELF] -pie: produce dynamic relocations for absolute relocations referencing undef weak See the comment for my understanding of -no-pie and -shared expectation. -no-pie has freedom on choices. We choose dynamic relocations to be consistent with the handling of GOT-generating relocations. Note: GNU ld has arch-varying behaviors and its x86 -pie has a very complex rule: if there is at least one GOT-generating or PLT-generating relocation and -z dynamic-undefined-weak (enabled by default) is in effect, generate a dynamic relocation. We don't emulate its rule. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D105164	2021-06-30 09:43:28 -07:00
Peter Smith	fc1cb3104b	[LLD][ELF][ARM] Tidy up test to hook up missing filecheck patterns [NFC] A couple of filecheck patterns had not been hooked up with the patterns suffering from some drift. As this test is old and llvm-objdump has improved a lot, take this opportunity to hide the instruction encoding. I've also taken out a lot of the explanatory comments that llvm-objdump improvements make redundant, as these comments often don't get updated when addresses change. Differential Revision: https://reviews.llvm.org/D104907	2021-06-30 14:16:40 +01:00
Peter Smith	dd4d3f7406	[LLD][ELF][ARM] Fix case of patched unrelocated BLX There are a couple of problems with the code to patch unrelocated BLX instructions: 1. The calculation of the PC needs to take into account the alignment of the instruction. The Thumb BLX uses alignDown(PC, 4) for the source address. 2. The calculation of the PC bias is hard-coded to 4 which works for Thumb, but when there is a BLX the branch will be in Arm state so it needs an 8 byte PC bias. No assembler generates an unrelocated BLX instruction so these problems do not affect real world programs. However we should still fix them. Differential Revision: https://reviews.llvm.org/D104905	2021-06-30 14:07:35 +01:00
Igor Kudrin	657e067bb5	[ARMInstPrinter] Print the target address of a branch instruction This follows other patches that changed printing immediate values of branch instructions to target addresses, see D76580 (x86), D76591 (PPC), D77853 (AArch64). As observing immediate values might sometimes be useful, they are printed as comments for branch instructions. // llvm-objdump -d output (before) 000200b4 <_start>: 200b4: ff ff ff fa blx #-4 <thumb> 000200b8 <thumb>: 200b8: ff f7 fc ef blx #-8 <_start> // llvm-objdump -d output (after) 000200b4 <_start>: 200b4: ff ff ff fa blx 0x200b8 <thumb> @ imm = #-4 000200b8 <thumb>: 200b8: ff f7 fc ef blx 0x200b4 <_start> @ imm = #-8 // GNU objdump -d. 000200b4 <_start>: 200b4: faffffff blx 200b8 <thumb> 000200b8 <thumb>: 200b8: f7ff effc blx 200b4 <_start> Differential Revision: https://reviews.llvm.org/D104701	2021-06-30 16:35:28 +07:00
Nico Weber	aed0a08c69	[lld/mac] Make symbol table order deterministic SymtabSection::emitStabs() writes the symbol table in the order of externalSymbols, which has the order of symtab->getSymbols(), which is just the order symbols are added to the symbol table. In practice, symbols in the symbol files of input .o files are sorted, but since that's not guaranteed we sort them in ObjFile::parseSymbols(). To make sure several symbols with the same address keep the order they're in the input file, we have to use stable_sort(). In practice, std::sort() on already-sorted inputs won't change the order of just adjacent elements, and while in theory std::sort() could use a random pivot, in practice the code should be deterministic as it was previously too. But now lld/test/MachO/stabs.s passes with LLVM_ENABLE_EXPENSIVE_CHECKS=ON (the last test that was failing with that set). Fixes a regression from D99972. While here, remove an empty section in stabs.s and move .subsections_via_symbols to the end where it usually is (this part no behavior change). Differential Revision: https://reviews.llvm.org/D105071	2021-06-29 09:29:49 -04:00
Leonard Grey	a8a6e5b094	[lld-macho] Preserve alignment for non-deduplicated cstrings Fixes PR50637. Downstream bug: https://crbug.com/1218958 Currently, we split __cstring along symbol boundaries with .subsections_via_symbols when not deduplicating, and along null bytes when deduplicating. This change splits along null bytes unconditionally, and preserves original alignment in the non-deduplicated case. Removing subsections-section-relocs.s because with this change, __cstring is never reordered based on the order file. Differential Revision: https://reviews.llvm.org/D104919	2021-06-28 22:26:43 -04:00
Nico Weber	f1969b74a7	[lld/mac] Fix nondeterminism in output section ordering The two different thread_local_regular sections (__thread_data and more_thread_data) had nondeterministic ordering for two reasons: 1. https://reviews.llvm.org/D102972 changed concatOutputSections from MapVector to DenseMap, so when we iterate it to make output segments, we would add the two sections to the __DATA output segment in nondeterministic order. 2. The same change also moved the two stable_sort()s for segments and sections to sort(). Since sections with assigned priority (such as TLV data) have the same priority for all sections, this is incorrect -- we must use stable_sort() so that the initial (input-order-based) order remains. As a side effect, we now (deterministically) put the __common section in front of __bss (while previously we happened to put it after it). (__common and __bss are both zerofill so both have order INT_MAX, but common symbols are added to inputSections before normal sections are collected.) Makes lld/test/MachO/tlv.s and lld/test/MachO/tlv-dylib.s pass with LLVM_ENABLE_EXPENSIVE_CHECKS=ON. Differential Revision: https://reviews.llvm.org/D105054	2021-06-28 18:41:33 -04:00
Jez Ng	bf457919f2	[lld-macho][nfc] Remove unnecessary dyn_cast and simplify code	2021-06-28 14:50:44 -04:00
Jez Ng	74d5f30d83	[lld-macho][nfc] Add absolute-vs-non-absolute symbol test for ICF Make sure we don't wrongly fold two sections that refer to symbols with the same value if they are not both absolute / non-absolute. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D104876	2021-06-28 14:49:40 -04:00
Jez Ng	557e1fa02f	[lld-macho] Extend ICF to literal sections Literal sections can be deduplicated before running ICF. That makes it easy to compare them during ICF: we can tell if two literals are constant-equal by comparing their offsets in their OutputSection. LLD-ELF takes a similar approach. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D104671	2021-06-28 14:49:39 -04:00
David Spickett	6942076096	[lld][MachO] Temporarily require 64 bit build for dead-strip.s This test has always failed on 32 bit armv8 bots: https://lab.llvm.org/buildbot/#/builders/178/builds/42 Due to the output order of some symbols changing. I don't think this is an Arm specific issue so disabling on 32 bit while it's investigated.	2021-06-28 09:37:45 +00:00
Igor Kudrin	d25e572421	[llvm-objdump] Print memory operand addresses as regular comments The patch reuses the common code to print memory operand addresses as instruction comments. This helps to align the comments and enables using target-specific comment markers when `evaluateMemoryOperandAddress()` is implemented for them. Differential Revision: https://reviews.llvm.org/D104861	2021-06-28 14:25:22 +07:00
Igor Kudrin	e7fffa6f03	[llvm-objdump] Prefix memory operand addresses with '0x' This helps to avoid ambiguity when the address contains only digits 0..9. Differential Revision: https://reviews.llvm.org/D104909	2021-06-28 14:25:21 +07:00
Nico Weber	0f24ffcdfa	[lld/mac] Don't fold UNWIND_X86_64_MODE_STACK_IND unwind entries libunwind uses unwind info to find the function address belonging to the current instruction pointer. libunwind/src/CompactUnwinder.hpp's step functions read functionStart for UNWIND_X86_64_MODE_STACK_IND (and for nothing else), so these encodings need a dedicated entry per function, so that the runtime can get the stacksize off the `subq` instruction in the function's prologue. This matches ld64. (CompactUnwinder.hpp from https://opensource.apple.com/source/libunwind/ also reads functionStart in a few more cases if `SUPPORT_OLD_BINARIES` is set, but it defaults to 0, and ld64 seems to not worry about these additional cases.) Related upstream bug: https://crbug.com/1220175 Differential Revision: https://reviews.llvm.org/D104978	2021-06-27 06:49:32 -04:00
Jan Kratochvil	a7afaf9019	Fix lld testsuite after llvm-dwarfdump now errors on invalid DWARF D104271 broke buildbots for lld/test/ELF/non-abs-reloc.s .	2021-06-27 12:26:11 +02:00
Fangrui Song	2508733e1b	[ELF] --sysroot: change sysrooted script to not fall back for an absolute path Modify the D13209 logic: for a script inside the sysroot, if an absolute path does not exist, report an error instead of falling back to the path without the sysroot prefix. This matches GNU ld, which makes sense to me: we don't want to find an arbitrary file in the host. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D104894	2021-06-25 12:52:39 -07:00
Martin Storsjö	d07f43641f	[LLD] [COFF] Fix handling of LTO comdats with nontrivial selection types after `728cc0075e` Commit `728cc0075e` made comdat symbols from LTO objects be treated as any regular comdat symbol. This works great for symbols that actually are IMAGE_COMDAT_SELECT_ANY, but if the symbols have a less trivial selection type that requires comparing either the section chunk size or contents, we can't check that before actually doing the LTO compilation. Therefore bring back one aspect of handling from before; that comdat resolution with a leader from an LTO symbol is essentially skipped, like it was before `728cc0075e`. Differential Revision: https://reviews.llvm.org/D104605	2021-06-25 09:39:56 +03:00
Fangrui Song	ca3bdb57fa	[MC][ELF] Change SHT_LLVM_CALL_GRAPH_PROFILE relocations from SHT_RELA to SHT_REL ... even on targets preferring RELA. The section is only consumed by ld.lld which can handle REL. Follow-up to D104080 as I explained in the review. There are two advantages: * The D104080 code only handles RELA, so arm/i386/mips32 etc may warn for -fprofile-use=/-fprofile-sample-use= usage. * Decrease object file size for RELA targets While here, change the relocation to relocate weights, instead of 0,1,2,3,.. I failed to catch the issue during review.	2021-06-24 21:35:48 -07:00
Jez Ng	8aa17d1eae	[lld-macho] Move ICF members from InputSection to ConcatInputSection `icfEqClass` only makes sense on ConcatInputSections since (in contrast to literal sections) they are deduplicated as an atomic unit. Similarly, `hasPersonality` and `replacement` don't make sense on literal sections. This mirrors LLD-ELF, which stores `icfEqClass` only on non-mergeable sections. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D104670	2021-06-24 22:23:12 -04:00
Fangrui Song	c4ca39e0f5	[ELF] Fix .rela.llvm.call-graph-profile detection after D104080 A SHT_SYMTAB section's sh_info is the number of local symbols. sh_info may coincide with the section header index of SHT_LLVM_CALL_GRAPH_PROFILE.	2021-06-24 15:21:28 -07:00
Fangrui Song	f1e2d5851b	[OptTable] Rename PrintHelp to printHelp To be consistent with other member functions and match the coding standard.	2021-06-24 14:47:03 -07:00
Martin Storsjö	3c6f8ca7c9	[lld] Rename StringRef _lower() method calls to _insensitive()	2021-06-25 00:22:01 +03:00
Jez Ng	4a8503c8e0	[lld-macho] Align all cstrings to 16 bytes when deduplicating We previously did this only for x86_64, but it turns out that arm64 needs this too -- see PR50791. Ultimately this is a hack, and we should avoid over-aligning strings that don't need it. I'm just having a hard time figuring out how ld64 is determining the right alignment. No new test for this since we were already testing this behavior for x86_64, and extending it to arm64 seems too trivial. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104835	2021-06-24 16:53:29 -04:00
Alexander Yermolovich	a224c5199b	[LLD][LLVM] CG Graph profile using relocations Currently when .llvm.call-graph-profile is created by llvm it explicitly encodes the symbol indices. This section is basically a black box for post processing tools. For example, if we run strip -s on the object files the symbol table changes, but indices in that section do not. In the non-visible failure mode, indices point to the wrong symbols. In the visible failure mode, indices point outside of the symbol table: "invalid symbol index". This patch changes the format by using R_*_NONE relocations to indicate the from/to symbols. The Frequency (Weight) will still be in the .llvm.call-graph-profile, but symbol information will be in the relocation section. In LLD, information from both sections is used to reconstruct the call graph profile. Relocations themselves will never be applied. With this approach, post processing tools that handle relocations correctly work for this section also. Tools can add/remove symbols, and as long as they handle relocation sections, with this approach the information stays correct. In a quick experiment with clang-13, the size went up from 107KB to 322KB, aggregate of all the input sections. Size of clang-13 binary is ~118MB. For users of -fprofile-use/-fprofile-sample-use the size of object files will go up slightly, but it will not impact final binary size. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D104080	2021-06-24 09:09:33 -07:00
Greg McGary	8a8558ae27	[lld-macho] add tests for ICF, plus cleanups Add tests for pending TODOs, plus some global cleanups: * No fold: func has personality/LSDA * Fold: reference to absolute symbol with different name but identical value * No fold: reloc references to absolute symbols with different values * No fold: N_ALT_ENTRY symbols Differential Revision: https://reviews.llvm.org/D104721	2021-06-23 20:44:25 -07:00
Nico Weber	ef75358080	[lld/mac] Delete incorrect FIXME """Bitcode symbols only exist before LTO runs, and only serve the purpose of resolving visibility so LTO can better optimize. Running LTO creates ObjFiles from BitcodeFiles, and those ObjFiles contain regular Defined symbols (with isec set and all) that will replace the bitcode symbols. So things should (hopefully) work as-is :)""" -- https://reviews.llvm.org/rGdbbc8d8333f29cf4ad6f4793da1adf71bbfdac69#inline-6081	2021-06-23 16:25:34 -04:00
Nico Weber	dbbc8d8333	[lld/mac] Don't crash on absolute symbols in unwind info generation Fixes a regression from `d6565a2dbc` and PR50820.	2021-06-23 14:25:34 -04:00
Martin Storsjö	f1a18fb699	[LLD] [MinGW] Silence the printouts in one test. NFC. This particular linker invocation is only run to check that we accept options, but we don't inspect the generated command line. As all other commands in the file have their output piped to FileCheck, the lit test doesn't print any other output; therefore silence this one for consistency as well.	2021-06-23 10:44:01 +03:00
Martin Storsjö	fdf54f5c50	[LLD] [MinGW] Print the lld-link command to stderr This is consistent with how clang prints its internal commands with -### and -v. When linking with -verbose, we get log messages from the actual linking written to stderr. By printing the command to the same stream, we make sure they appear in a sensible chronological order. Differential Revision: https://reviews.llvm.org/D104527	2021-06-23 10:21:42 +03:00
Colin Cross	e387778722	[ELF] Optimize ScriptLexer::getLineNumber by caching the previous line number and offset getLineNumber() was counting the number of line feeds from the start of the buffer to the current token. For large linker scripts this became a performance bottleneck. For one 4MB linker script over 4 minutes was spent in getLineNumber's StringRef::count. Store the line number from the last token, and only count the additional line feeds since the last token. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D104137	2021-06-22 15:35:24 -07:00
Reid Kleckner	5bcbc7ee52	Add regression test for maybeMangle issue This was crbug.com/1222724, which caused D104529 to be reverted. The new test fails when D104529 is reapplied locally.	2021-06-22 12:55:25 -07:00
Reid Kleckner	8d84751ac4	Revert "[LLD] [COFF] Avoid doing repeated fuzzy symbol lookup for each iteration. NFC." This reverts commit `e1adf90826`. This appears to affect the way that C++ mangled symbols appear in the import library when using a .def file that names a C++ free function with no name decoration. I will follow up with a reduced test case shortly.	2021-06-22 11:35:14 -07:00
Nico Weber	d6565a2dbc	[lld/mac] Add explicit "no unwind info" entries for functions without unwind info Fixes PR50529. With this, lld-linked Chromium base_unittests passes on arm macs. Surprisingly, no measurable impact on link time. Differential Revision: https://reviews.llvm.org/D104681	2021-06-22 06:12:42 -04:00
Nico Weber	3a6a60f6c9	[lld/mac] Make a variable more local; no behavior change The variable used to need the wider scope, but doesn't after the reland. See LC_LINKER_OPTIONS-related discussion on https://reviews.llvm.org/D104353 for background.	2021-06-20 21:59:15 -04:00
Nico Weber	e6cb55d5ce	[lld/mac] Test zerofill sections after __thread_bss Real zerofill sections go after __thread_bss, since zerofill sections must all be at the end of their segment and __thread_bss must be right after __thread_data. Works fine already, but wasn't tested as far as I can tell. Also tweak comment about zerofill sections a bit. No behavior change. Differential Revision: https://reviews.llvm.org/D104609	2021-06-20 20:44:29 -04:00
Jez Ng	f79e7a5a48	[lld-macho] Have inputOrder default to less than INT_MAX We make it less than INT_MAX in order not to conflict with the ordering of zerofill sections, which must always be placed at the end of their segment. This is the more structural fix for the issue addressed in {D104596}. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104607	2021-06-20 19:49:27 -04:00
Fangrui Song	89e66a3ab3	[ELF] Delete --no-cref which does not exist in GNU ld Also delete the single dash form which does not appear to be used.	2021-06-20 14:28:56 -07:00
Fangrui Song	cd6b1b2b86	[ELF][test] Add missing tests for --no-export-dynamic & --no-warn-backrefs	2021-06-20 14:20:14 -07:00
Fangrui Song	50225112b5	[lld-link] Fix -Wunused-but-set-variable in -DLLVM_ENABLE_ASSERTIONS=off build. NFC	2021-06-20 11:35:02 -07:00
Martin Storsjö	e1adf90826	[LLD] [COFF] Avoid doing repeated fuzzy symbol lookup for each iteration. NFC. This is run every time around in the main linker loop. Once a match has been found, stop trying to rematch such a symbol. Not sure if this has any actual measurable performance impact though (SymbolTable::findMangle() iterates over the whole symbol table for each call and does fuzzy matching on top of that) but this makes the code more reassuring to read at least. (This is in practice run for def files listing undecorated stdcall functions to be exported.) Differential Revision: https://reviews.llvm.org/D104529	2021-06-19 22:32:37 +03:00
Martin Storsjö	1c8bb625b7	[LLD] [MinGW] Print errors/warnings in lld-link with a "ld.lld" prefix Pass the original argv[0] to the coff linker, as the coff linker uses the basename of argv[0] as the log prefix. This causes error messages to be printed with a "ld.lld:" prefix instead of "lld-link:". The current "lld-link:" prefix can be confusing to users, as they're invoking the MinGW linker (and might not even have a lld-link executable). Keep the first argument as lld-link when printing the command line, to make it an actually reproducible standalone command. Differential Revision: https://reviews.llvm.org/D104526	2021-06-19 22:32:37 +03:00
Nico Weber	c931e12b1d	[lld/mac] Make sure __thread_ptrs is in front of __thread_bss The exact location doesn't matter, but it should be in front of __thread_bss. We put it right in front of __thread_data which is where ld64 seems to put it as well. Fixes PR50769. (As mentioned on the bug, there is probably a more structural fix too, see comment 5. If we don't address this, it's likely we'll run into this again with other synthetic sections. But for now, let's fix the immediate breakage.) Differential Revision: https://reviews.llvm.org/D104596	2021-06-19 12:56:43 -04:00
Nico Weber	17271ece0d	[lld/mac] Give __DATA,__thread_ptrs type S_THREAD_LOCAL_VARIABLE_POINTERS ...instead of S_NON_LAZY_SYMBOL_POINTERS. This matches ld64. Part of PR50769. While here, also remove an old TODO that was done in D87178. Differential Revision: https://reviews.llvm.org/D104594	2021-06-19 12:56:42 -04:00
Jez Ng	4507f64165	[re-land][lld-macho] Avoid force-loading the same archive twice This reverts commit `c9b241efd6`, which was a backout diff to fix the buildbots. The real culprit of the crash is `1d31fb8d12`, which is being reverted. Differential Revision: https://reviews.llvm.org/D104353	2021-06-18 22:43:50 -04:00
Jez Ng	a79c018325	Revert "[lld-macho] Have path-related functions return std::string, not StringRef" This reverts commit `1d31fb8d12`. Making `rerootPath` return a temporary std::string caused a use-after-free: https://ci.chromium.org/ui/p/chromium/builders/try/win_upload_clang/1608/overview	2021-06-18 22:43:49 -04:00
Nico Weber	c9b241efd6	Revert "[lld-macho] Avoid force-loading the same archive twice" This reverts commit `24706cd73c`. Test seems to fail flakily. See comments on https://reviews.llvm.org/D104353 for a hypothesis for why.	2021-06-18 20:25:27 -04:00
Jez Ng	1d31fb8d12	[lld-macho] Have path-related functions return std::string, not StringRef findLibrary() returned a StringRef while findFramework & other helper functions returned std::strings. Standardize on std::string. (I initially tried making the helper functions all return StringRefs, but I realized we shouldn't return input StringRefs since their lifetimes would not be obvious from the calling code.)	2021-06-18 16:36:14 -04:00
Jez Ng	4c49f9ceaf	[lld-macho] Handle non-extern symbols marked as private extern Previously, we asserted that such a case was invalid, but in fact `ld -r` can emit such symbols if the input contained a (true) private extern, or if it contained a symbol starting with "L". Non-extern symbols marked as private extern are essentially equivalent to regular TU-scoped symbols, so no new functionality is needed. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104502	2021-06-18 16:36:14 -04:00
Nico Weber	f7366890c2	[lld/mac] Support -data_in_code_info, -function_starts flags These are on by default, but there's also an explicit flag for them. Differential Revision: https://reviews.llvm.org/D104543	2021-06-18 13:01:42 -04:00
Greg McGary	8120c9e379	Rename option -icf MODE to --icf=MODE The `icf` command-line option is not present in ld64, so it should use the LLD option syntax, which begins with double dashes and separates primary option from any suboption with the equal sign. Differential Revision: https://reviews.llvm.org/D104548	2021-06-18 09:52:15 -07:00
Muhammad Omair Javaid	9777f3fd06	Fix build failure on 32 bit Arm This patch fixes build failure caused by commit `f27e4548fc` on 32 bit arm. Differential Revision: https://reviews.llvm.org/D103292	2021-06-18 15:27:09 +00:00
Heejin Ahn	1d891d44f3	[WebAssembly] Rename event to tag We recently decided to change 'event' to 'tag', and 'event section' to 'tag section', out of the rationale that the section contains a generalized tag that references a type, which may be used for something other than exceptions, and the name 'event' can be confusing in the web context. See - https://github.com/WebAssembly/exception-handling/issues/159#issuecomment-857910130 - https://github.com/WebAssembly/exception-handling/pull/161 Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D104423	2021-06-17 20:34:19 -07:00
Sam Clegg	d01e673a9f	[lld][WebAssembly] Fix crash calling weakly undefined function in PIC code Differential Revision: https://reviews.llvm.org/D104495	2021-06-17 16:49:02 -07:00
Sam Clegg	758633f922	[lld][WebAssembly] Add new `--import-undefined` option This change revisits https://reviews.llvm.org/D79248 which originally added support for the --unresolved-symbols flag. At the time I thought it would make sense to add a third option to this flag called `import-functions` but it turns out (as was suspected by one of the reviewers IIRC) that this option can be orthogonal. Instead I've added a new option called `--import-undefined` that only operates on symbols that can be imported (for example, function symbols can always be imported as opposed to data symbols which can only be imported when compiling with PIC). This option gives us the full expressivity that emscripten needs to be able to allow reporting of undefined data symbols as well as the option to disable that. This change does remove the `--unresolved-symbols=import-functions` option, which has been in the codebase now for about a year but I would be extremely surprised if anyone was using it. Differential Revision: https://reviews.llvm.org/D103290	2021-06-17 11:44:21 -07:00
Vy Nguyen	366df11a35	[lld-macho] Rework mergeFlag to behave closer to what ld64 does. Details: I've been getting a few weird errors similar to the following from our internal tests: ``` ld64.lld.darwinnew: error: Cannot merge section __eh_frame (type=0x0) into __eh_frame (type=0xB): inconsistent types ld64.lld.darwinnew: error: Cannot merge section __eh_frame (flags=0x0) into __eh_frame (flags=0x6800000B): strict flags differ ld64.lld.darwinnew: error: Cannot merge section __eh_frame (type=0x0) into __eh_frame (type=0xB): inconsistent types ld64.lld.darwinnew: error: Cannot merge section __eh_frame (flags=0x0) into __eh_frame (flags=0x6800000B): strict flags differ ``` Differential Revision: https://reviews.llvm.org/D103971	2021-06-17 14:22:58 -04:00
Greg McGary	f27e4548fc	[lld-macho] Implement ICF ICF = Identical C(ode\|OMDAT) Folding This is the LLD ELF/COFF algorithm, adapted for MachO. So far, only `-icf all` is supported. In order to support `-icf safe`, we will need to port address-significance tables (`.addrsig` directives) to MachO, which will come in later diffs. `check-{llvm,clang,lld}` have 0 regressions for `lld -icf all` vs. baseline ld64. We only run ICF on `__TEXT,__text` for reasons explained in the block comment in `ConcatOutputSection.cpp`. Here is the perf impact for linking `chromium_framework` on a Mac Pro (16-core Xeon W) for the non-ICF case vs. pre-ICF: ``` N Min Max Median Avg Stddev x 20 4.27 4.44 4.34 4.349 0.043029977 + 20 4.37 4.46 4.405 4.4115 0.025188761 Difference at 95.0% confidence 0.0625 +/- 0.0225658 1.43711% +/- 0.518873% (Student's t, pooled s = 0.0352566) ``` Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D103292	2021-06-17 10:07:44 -07:00
Jez Ng	24706cd73c	[lld-macho] Avoid force-loading the same archive twice We need to dedup archive loads (similar to what we do for dylib loads). I noticed this issue after building some Swift stuff that used `-force_load_swift_libs`, as it caused some Swift archives to be loaded many times. Reviewed By: #lld-macho, thakis, MaskRay Differential Revision: https://reviews.llvm.org/D104353	2021-06-17 11:13:54 -04:00
Igor Kudrin	5355b8c631	[ELF] Restore arm-branch.s test After D77330, the comments are inconsistent with the disassembled code. As the value of `far` has been changed, a thunk to reach it is now generated, and target addresses of branch instructions are different from what was initially expected. The patch fixes that and makes the test closer to what it was originally. Differential Revision: https://reviews.llvm.org/D104286	2021-06-17 17:08:13 +07:00
Martin Storsjö	ceee35e3e4	[LLD] [COFF] Remove a stray duplicate comment. NFC. The following class isn't part of the export table; there's a second correctly placed comment about the things that actually belong to the export table.	2021-06-17 13:02:35 +03:00
Xuanda Yang	01cb9c5fc5	[lld][MachO] Sort symbols in parallel in -map source: https://bugs.llvm.org/show_bug.cgi?id=50689 When writing a map file, sort symbols in parallel using parallelSort. Use address name to break ties if two symbols have the same address. Reviewed By: thakis, int3 Differential Revision: https://reviews.llvm.org/D104346	2021-06-17 10:19:59 +08:00
Jez Ng	560636e549	[lld-macho] Put DATA_IN_CODE immediately after FUNCTION_STARTS codesign checks for this. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104354	2021-06-16 15:23:07 -04:00
Jez Ng	eeac6b2bec	[lld-macho] Handle multiple LC_LINKER_OPTIONs We previously only parsed the first one. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104352	2021-06-16 15:23:06 -04:00
Jez Ng	b8bbb9723a	[lld-macho][nfc] Put back shouldOmitFromOutput() asserts I removed them in rG5de7467e982 but @thakis pointed out that they were useful to keep, so here they are again. I've also converted the `!isCoalescedWeak()` asserts into `!shouldOmitFromOutput()` asserts, since the latter check subsumes the former. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104169	2021-06-16 15:23:04 -04:00
Jez Ng	d52d1b93c3	[lld-macho] Downgrade version mismatch to warning It's a warning in ld64. While having LLD be stricter would be nice, it makes it harder for it to be a drop-in replacement into existing builds. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104333	2021-06-16 11:06:26 -04:00
Nico Weber	b579938d40	[lld/mac] Add support for -no_data_in_code_info flag Differential Revision: https://reviews.llvm.org/D104345	2021-06-16 06:40:42 -04:00
Nico Weber	46ac1b213a	[lld/mac] Put lld-only flags in "LLD-SPECIFIC:" --help section Differential Revision: https://reviews.llvm.org/D104347	2021-06-16 06:39:36 -04:00
Konstantin Schwarz	5d621ed85d	[ELF] Consider that NOLOAD sections should be placed in a PT_LOAD segment During PHDR creation, the case where an output section does not require a PT_LOAD header but still occupies memory in the current VMA region was not handled. If such an output section interleaves two output sections that have the same VMA and LMA regions set, we would previously re-use the existing PT_LOAD header for the second output section. However, since the memory region is not contiguous, we need to start a new PT_LOAD segment. This fixes https://bugs.llvm.org/show_bug.cgi?id=50558 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D103815	2021-06-16 12:36:45 +02:00

15048 Commits