llvm-project

Commit Graph

Author	SHA1	Message	Date
Jez Ng	f79e7a5a48	[lld-macho] Have inputOrder default to less than INT_MAX We make it less than INT_MAX in order not to conflict with the ordering of zerofill sections, which must always be placed at the end of their segment. This is the more structural fix for the issue addressed in {D104596}. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104607	2021-06-20 19:49:27 -04:00
Fangrui Song	89e66a3ab3	[ELF] Delete --no-cref which does not exist in GNU ld Also delete the single dash form which does not appear to be used.	2021-06-20 14:28:56 -07:00
Fangrui Song	cd6b1b2b86	[ELF][test] Add missing tests for --no-export-dynamic & --no-warn-backrefs	2021-06-20 14:20:14 -07:00
Fangrui Song	50225112b5	[lld-link] Fix -Wunused-but-set-variable in -DLLVM_ENABLE_ASSERTIONS=off build. NFC	2021-06-20 11:35:02 -07:00
Martin Storsjö	e1adf90826	[LLD] [COFF] Avoid doing repeated fuzzy symbol lookup for each iteration. NFC. This is run every time around in the main linker loop. Once a match has been found, stop trying to rematch such a symbol. Not sure if this has any actual measurable performance impact though (SymbolTable::findMangle() iterates over the whole symbol table for each call and does fuzzy matching on top of that) but this makes the code more reassuring to read at least. (This is in practice run for def files listing undecorated stdcall functions to be exported.) Differential Revision: https://reviews.llvm.org/D104529	2021-06-19 22:32:37 +03:00
Martin Storsjö	1c8bb625b7	[LLD] [MinGW] Print errors/warnings in lld-link with a "ld.lld" prefix Pass the original argv[0] to the coff linker, as the coff linker uses the basename of argv[0] as the log prefix. This makes error messages to be printed with a "ld.lld:" prefix instead of "lld-link:". The current "lld-link:" prefix can be confusing to users, as they're invoking the MinGW linker (and might not even have a lld-link executable). Keep the first argument as lld-link when printing the command line, to make it an actually reproducible standalone command. Differential Revision: https://reviews.llvm.org/D104526	2021-06-19 22:32:37 +03:00
Nico Weber	c931e12b1d	[lld/mac] Make sure __thread_ptrs is in front of __thread_bss The exact location doesn't matter, but it should be in front of __thread_bss. We put it right in front of __thread_data which is where ld64 seems to put it as well. Fixes PR50769. (As mentioned on the bug, there is probably a more structural fix too, see comment 5. If we don't address this, it's likely we'll run into this again with other synthetic sections. But for now, let's fix the immediate breakage.) Differential Revision: https://reviews.llvm.org/D104596	2021-06-19 12:56:43 -04:00
Nico Weber	17271ece0d	[lld/mac] Give __DATA,__thread_ptrs type S_THREAD_LOCAL_VARIABLE_POINTERS ...instead of S_NON_LAZY_SYMBOL_POINTERS. This matches ld64. Part of PR50769. While here, also remove an old TODO that was done in D87178. Differential Revision: https://reviews.llvm.org/D104594	2021-06-19 12:56:42 -04:00
Jez Ng	4507f64165	[re-land][lld-macho] Avoid force-loading the same archive twice This reverts commit `c9b241efd6`, which was a backout diff to fix the buildbots. The real culprit of the crash is `1d31fb8d12`, which is being reverted. Differential Revision: https://reviews.llvm.org/D104353	2021-06-18 22:43:50 -04:00
Jez Ng	a79c018325	Revert "[lld-macho] Have path-related functions return std::string, not StringRef" This reverts commit `1d31fb8d12`. Making `rerootPath` return a temporary std::string caused a use-after-free: https://ci.chromium.org/ui/p/chromium/builders/try/win_upload_clang/1608/overview	2021-06-18 22:43:49 -04:00
Nico Weber	c9b241efd6	Revert "[lld-macho] Avoid force-loading the same archive twice" This reverts commit `24706cd73c`. Test seems to fail flakily. See comments on https://reviews.llvm.org/D104353 for a hypothesis for why.	2021-06-18 20:25:27 -04:00
Jez Ng	1d31fb8d12	[lld-macho] Have path-related functions return std::string, not StringRef findLibrary() returned a StringRef while findFramework & other helper functions returned std::strings. Standardize on std::string. (I initially tried making the helper functions all return StringRefs, but I realized we shouldn't return input StringRefs since their lifetimes would not be obvious from the calling code.)	2021-06-18 16:36:14 -04:00
Jez Ng	4c49f9ceaf	[lld-macho] Handle non-extern symbols marked as private extern Previously, we asserted that such a case was invalid, but in fact `ld -r` can emit such symbols if the input contained a (true) private extern, or if it contained a symbol started with "L". Non-extern symbols marked as private extern are essentially equivalent to regular TU-scoped symbols, so no new functionality is needed. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104502	2021-06-18 16:36:14 -04:00
Nico Weber	f7366890c2	[lld/mac] Support -data_in_code_info, -function_starts flags These are on by default, but there's also an explicit flag for them. Differential Revision: https://reviews.llvm.org/D104543	2021-06-18 13:01:42 -04:00
Greg McGary	8120c9e379	Rename option -icf MODE to --icf=MODE The `icf` command-line option is not present in ld64, so it should use the LLD option syntax, which begins with double dashes and separates primary option from any suboption with the equal sign. Differential Revision: https://reviews.llvm.org/D104548	2021-06-18 09:52:15 -07:00
Muhammad Omair Javaid	9777f3fd06	Fix build failure on 32 bit Arm This patch fixes build failure caused by commit `f27e4548fc` on 32 bit arm. Differential Revision: https://reviews.llvm.org/D103292	2021-06-18 15:27:09 +00:00
Heejin Ahn	1d891d44f3	[WebAssembly] Rename event to tag We recently decided to change 'event' to 'tag', and 'event section' to 'tag section', out of the rationale that the section contains a generalized tag that references a type, which may be used for something other than exceptions, and the name 'event' can be confusing in the web context. See - https://github.com/WebAssembly/exception-handling/issues/159#issuecomment-857910130 - https://github.com/WebAssembly/exception-handling/pull/161 Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D104423	2021-06-17 20:34:19 -07:00
Sam Clegg	d01e673a9f	[lld][WebAssembly] Fix crash calling weakly undefined function in PIC code Differential Revision: https://reviews.llvm.org/D104495	2021-06-17 16:49:02 -07:00
Sam Clegg	758633f922	[lld][WebAssembly] Add new `--import-undefined` option This change revisits https://reviews.llvm.org/D79248 which originally added support for the --unresolved-symbols flag. At the time I thought it would make sense to add a third option to this flag called `import-functions` but it turns out (as was suspects by on the reviewers IIRC) that this option can be authoganal. Instead I've added a new option called `--import-undefined` that only operates on symbols that can be imported (for example, function symbols can always be imported as opposed to data symbols we can only be imported when compiling with PIC). This option gives us the full expresivitiy that emscripten needs to be able allow reporting of undefined data symbols as well as the option to disable that. This change does remove the `--unresolved-symbols=import-functions` option, which is been in the codebase now for about a year but I would be extremely surprised if anyone was using it. Differential Revision: https://reviews.llvm.org/D103290	2021-06-17 11:44:21 -07:00
Vy Nguyen	366df11a35	[lld-macho] Rework mergeFlag to behave closer to what ld64 does. Details: I've been getting a few weird errors similar to the following from our internal tests: ``` ld64.lld.darwinnew: error: Cannot merge section __eh_frame (type=0x0) into __eh_frame (type=0xB): inconsistent types ld64.lld.darwinnew: error: Cannot merge section __eh_frame (flags=0x0) into __eh_frame (flags=0x6800000B): strict flags differ ld64.lld.darwinnew: error: Cannot merge section __eh_frame (type=0x0) into __eh_frame (type=0xB): inconsistent types ld64.lld.darwinnew: error: Cannot merge section __eh_frame (flags=0x0) into __eh_frame (flags=0x6800000B): strict flags differ ``` Differential Revision: https://reviews.llvm.org/D103971	2021-06-17 14:22:58 -04:00
Greg McGary	f27e4548fc	[lld-macho] Implement ICF ICF = Identical C(ode\|OMDAT) Folding This is the LLD ELF/COFF algorithm, adapted for MachO. So far, only `-icf all` is supported. In order to support `-icf safe`, we will need to port address-significance tables (`.addrsig` directives) to MachO, which will come in later diffs. `check-{llvm,clang,lld}` have 0 regressions for `lld -icf all` vs. baseline ld64. We only run ICF on `__TEXT,__text` for reasons explained in the block comment in `ConcatOutputSection.cpp`. Here is the perf impact for linking `chromium_framekwork` on a Mac Pro (16-core Xeon W) for the non-ICF case vs. pre-ICF: ``` N Min Max Median Avg Stddev x 20 4.27 4.44 4.34 4.349 0.043029977 + 20 4.37 4.46 4.405 4.4115 0.025188761 Difference at 95.0% confidence 0.0625 +/- 0.0225658 1.43711% +/- 0.518873% (Student's t, pooled s = 0.0352566) ``` Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D103292	2021-06-17 10:07:44 -07:00
Jez Ng	24706cd73c	[lld-macho] Avoid force-loading the same archive twice We need to dedup archive loads (similar to what we do for dylib loads). I noticed this issue after building some Swift stuff that used `-force_load_swift_libs`, as it caused some Swift archives to be loaded many times. Reviewed By: #lld-macho, thakis, MaskRay Differential Revision: https://reviews.llvm.org/D104353	2021-06-17 11:13:54 -04:00
Igor Kudrin	5355b8c631	[ELF] Restore arm-branch.s test After D77330, the comments are inconsistent with the disassembled code. As the value of `far` has been changed, a thunk to reach it is now generated, and target addresses of branch instructions are different from what was initially expected. The patch fixes that and makes the test closer to what it was originally. Differential Revision: https://reviews.llvm.org/D104286	2021-06-17 17:08:13 +07:00
Martin Storsjö	ceee35e3e4	[LLD] [COFF] Remove a stray duplicate comment. NFC. The following class isn't part of the export table; there's a second correctly placed comment about the things that actually belong to the export table.	2021-06-17 13:02:35 +03:00
Xuanda Yang	01cb9c5fc5	[lld][MachO] Sort symbols in parallel in -map source: https://bugs.llvm.org/show_bug.cgi?id=50689 When writing a map file, sort symbols in parallel using parallelSort. Use address name to break ties if two symbols have the same address. Reviewed By: thakis, int3 Differential Revision: https://reviews.llvm.org/D104346	2021-06-17 10:19:59 +08:00
Jez Ng	560636e549	[lld-macho] Put DATA_IN_CODE immediately after FUNCTION_STARTS codesign checks for this. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104354	2021-06-16 15:23:07 -04:00
Jez Ng	eeac6b2bec	[lld-macho] Handle multiple LC_LINKER_OPTIONs We previously only parsed the first one. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104352	2021-06-16 15:23:06 -04:00
Jez Ng	b8bbb9723a	[lld-macho][nfc] Put back shouldOmitFromOutput() asserts I removed them in rG5de7467e982 but @thakis pointed out that they were useful to keep, so here they are again. I've also converted the `!isCoalescedWeak()` asserts into `!shouldOmitFromOutput()` asserts, since the latter check subsumes the former. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104169	2021-06-16 15:23:04 -04:00
Jez Ng	d52d1b93c3	[lld-macho] Downgrade version mismatch to warning It's a warning in ld64. While having LLD be stricter would be nice, it makes it harder for it to be a drop-in replacement into existing builds. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104333	2021-06-16 11:06:26 -04:00
Nico Weber	b579938d40	[lld/mac] Add support for -no_data_in_code_info flag Differential Revision: https://reviews.llvm.org/D104345	2021-06-16 06:40:42 -04:00
Nico Weber	46ac1b213a	[lld/mac] Put lld-only flags in "LLD-SPECIFIC:" --help section Differential Revision: https://reviews.llvm.org/D104347	2021-06-16 06:39:36 -04:00
Konstantin Schwarz	5d621ed85d	[ELF] Consider that NOLOAD sections should be placed in a PT_LOAD segment During PHDR creation, the case where an output section does not require a PT_LOAD header but still occupies memory in the current VMA region was not handled. If such an output section interleaves two output sections that have the same VMA and LMA regions set, we would previously re-use the existing PT_LOAD header for the second output section. However, since the memory region is not contiguous, we need to start a new PT_LOAD segment. This fixes https://bugs.llvm.org/show_bug.cgi?id=50558 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D103815	2021-06-16 12:36:45 +02:00
Vitaly Buka	b01bfdfda6	[lld][MachO] Fix UB after D103006 ubsan detected: lld/MachO/SyntheticSections.cpp:636:15: runtime error: null pointer passed as argument 2, which is declared to never be null	2021-06-14 21:15:54 -07:00
Alexander Shaposhnikov	928394d109	[lld][MachO] Add support for LC_DATA_IN_CODE Add first bits for emitting LC_DATA_IN_CODE. Test plan: make check-lld-macho Differential revision: https://reviews.llvm.org/D103006	2021-06-14 19:21:59 -07:00
Jez Ng	cc17bfe489	[lld-macho] Fix "shift exponent too large" UBSAN error UBSAN seems to have added this check somewhere along the way... This might also fix the PPC buildbot, which is failing on the same test	2021-06-14 13:47:25 -04:00
Jez Ng	e06b9ba485	[lld-macho] Reword comment for clarity	2021-06-14 13:47:25 -04:00
Jez Ng	9c5d43fb55	[lld-macho] Try to fix MSAN "uninitialized memory" error I think this is the fix, with the regression being introduced by D104199. Not 100% sure since MSAN isn't supported on my Mac machine, and it'll take some time to spin up a Linux box... will look at the buildbots for answers	2021-06-13 23:47:09 -04:00
Jez Ng	da24e6d43e	[lld-macho][nfc] Add `final` to classes where possible I wanted to see if we would get any perf wins out of this, but it doesn't seem to be the case. But it still seems worth committing. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D104200	2021-06-13 19:52:03 -04:00
Jez Ng	c5c05ffa45	[lld-macho][nfc] Represent the image loader cache with a ConcatInputSection We don't need to define any special behavior for this section, so creating a subclass for it is redundant. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104199	2021-06-13 19:51:31 -04:00
Jez Ng	b2a0739012	[lld-macho][nfc] Remove InputSection::outSecFileOff `outSecFileOff` and the associated `getFileOffset()` accessors were unnecessary. For all the cases we care about, `outSecFileOff` is the same as `outSecOff`. The only time they deviate is if there are zerofill sections within a given segment. But since zerofill sections are always at the end of a segment, the only sections where the two values deviate are zerofill sections themselves. And we never actually query the outSecFileOff of zerofill sections. As for `getFileOffset()`, the only place it was being used was to calculate the offset of the entry symbol. However, we can compute that value by just taking the difference between the address of the entry symbol and the address of the Mach-O header. In fact, this appears to be what ld64 itself does. This difference is the same as the file offset as long as there are no intervening zerofill sections, but since `__text` is the first section in `__TEXT`, this never happens, so our previous use of `getFileOffset()` was not wrong -- just inefficient. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D104177	2021-06-13 19:51:30 -04:00
Fangrui Song	899fdf548e	[ELF] Add OVERWRITE_SECTIONS command This implements https://sourceware.org/bugzilla/show_bug.cgi?id=26404 An `OVERWRITE_SECTIONS` command is a `SECTIONS` variant which contains several output section descriptions. The output sections do not have specify an order. Similar to `INSERT [BEFORE\|AFTER]`, `LinkerScript::hasSectionsCommand` is not set, so the built-in rules (see `docs/ELF/linker_script.rst`) still apply. `OVERWRITE_SECTIONS` can be more convenient than `INSERT` because it does not need an anchor section. The initial syntax is intentionally narrow to facilitate backward compatible extensions in the future. Symbol assignments cannot be used. This feature is versatile. To list a few usage: * Use `section : { KEEP(...) }` to retain input sections under GC * Define encapsulation symbols (start/end) for an output section * Use `section : ALIGN(...) : { ... }` to overalign an output section (similar to ld64 `-sectalign`) When an output section is specified by both `OVERWRITE_SECTIONS` and `INSERT`, `INSERT` is processed after overwrite sections. To make this work, this patch changes `InsertCommand` to use name based matching instead of pointer based matching. (This may cause a difference when `INSERT` moves one output section more than once. Such duplicate commands should not be used in practice (seems that in GNU ld the output sections may just disappear).) A linker script can be used without -T/--script. The traditional `SECTIONS` commands are concatenated, so a wrong rule can be more noticeable from the section order. This feature if misused can be less noticeable, just like `INSERT`. Differential Revision: https://reviews.llvm.org/D103303	2021-06-13 12:41:11 -07:00
Nico Weber	7d4c8a2b8f	[lld/mac] clarify comment This is a "we should do X in the future" fixme, not an "X might go wrong" fixme.	2021-06-13 13:30:07 -04:00
Nico Weber	5f9bc580d8	fix comment typos to cycle bots	2021-06-13 10:18:51 -04:00
Alexander Shaposhnikov	b9095f5e1a	[lld][MachO] Fix function starts section Sort the addresses stored in FunctionStarts section. Previously we were encoding potentially large numbers (due to unsigned overflow). Test plan: make check-all Differential revision: https://reviews.llvm.org/D103662	2021-06-11 17:47:28 -07:00
Jez Ng	5de7467e98	[lld-macho] Fix debug build D103977 broke a bunch of stuff as I had only tested the release build which eliminated asserts. I've retained the asserts where possible, but I also removed a bunch instead of adding a whole lot of verbose ConcatInputSection casts.	2021-06-11 20:21:27 -04:00
Jez Ng	464d3dc3d1	[lld-macho] Have dead-stripping work with literal sections Literal sections are not atomically live or dead. Rather, liveness is tracked for each individual literal they contain. CStrings have their liveness tracked via a `live` bit in StringPiece, and fixed-width literals have theirs tracked via a BitVector. The live-marking code now needs to track the offset within each section that is to be marked live, in order to identify the literal at that particular offset. Numbers for linking chromium_framework on my 3.2 GHz 16-Core Intel Xeon W with both `-dead_strip` and `--deduplicate-literals`, with and without this diff applied: ``` N Min Max Median Avg Stddev x 20 4.32 4.44 4.375 4.372 0.03105174 + 20 4.3 4.39 4.36 4.3595 0.023277502 No difference proven at 95.0% confidence ``` This gives us size savings of about 0.4%. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D103979	2021-06-11 19:50:09 -04:00
Jez Ng	681cfeb591	[lld-macho][nfc] Have InputSection ctors take some parameters This is motivated by an upcoming diff in which the WordLiteralInputSection ctor sets itself up based on the value of its section flags. As such, it needs to be passed the `flags` value as part of its ctor parameters, instead of having them assigned after the fact in `parseSection()`. While refactoring code to make that possible, I figured it would make sense for the other InputSections to also take their initial values as ctor parameters. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D103978	2021-06-11 19:50:09 -04:00
Jez Ng	7f2ba39b16	[lld-macho][nfc] Move liveness-tracking fields into ConcatInputSection These fields currently live in the parent InputSection class, but they should be specific to ConcatInputSection, since the other InputSection classes (that contain literals) aren't atomically live or dead -- rather their component string/int literals should have individual liveness states. (An upcoming diff will add liveness bits for StringPieces and fixed-sized literals.) I also factored out some asserts for isCoalescedWeak() in MarkLive.cpp. We now avoid putting coalesced sections in the `inputSections` vector, so we don't have to check/assert against it everywhere. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D103977	2021-06-11 19:50:08 -04:00
Jez Ng	5d88f2dd94	[lld-macho] Deduplicate fixed-width literals Conceptually, the implementation is pretty straightforward: we put each literal value into a hashtable, and then write out the keys of that hashtable at the end. In contrast with ELF, the Mach-O format does not support variable-length literals that aren't strings. Its literals are either 4, 8, or 16 bytes in length. LLD-ELF dedups its literals via sorting + uniq'ing, but since we don't need to worry about overly-long values, we should be able to do a faster job by just hashing. That said, the implementation right now is far from optimal, because we add to those hashtables serially. To parallelize this, we'll need a basic concurrent hashtable (only needs to support concurrent writes w/o interleave reads), which shouldn't be to hard to implement, but I'd like to punt on it for now. Numbers for linking chromium_framework on my 3.2 GHz 16-Core Intel Xeon W: N Min Max Median Avg Stddev x 20 4.27 4.39 4.315 4.3225 0.033225703 + 20 4.36 4.82 4.44 4.4845 0.13152846 Difference at 95.0% confidence 0.162 +/- 0.0613971 3.74783% +/- 1.42041% (Student's t, pooled s = 0.0959262) This corresponds to binary size savings of 2MB out of 335MB, or 0.6%. It's not a great tradeoff as-is, but as mentioned our implementation can be signficantly optimized, and literal dedup will unlock more opportunities for ICF to identify identical structures that reference the same literals. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D103113	2021-06-11 19:50:08 -04:00
Nico Weber	f2b1a1e10c	[lld/mac] Use sectionType() more Not sure sectionType() carries its weight, but while we have it we should use it consistently. No behavior change. Differential Revision: https://reviews.llvm.org/D104027	2021-06-11 11:15:47 -04:00
Nico Weber	54418c5a35	[lld/mac] Make binaries written by lld strippable Be less clever when writing the indirect symbols in LC_DYSYMTAB: lld used to make point __stubs and __la_symbol_ptr point at the same bytes in the indirect symbol table in the __LINKEDIT segment. That confused strip, so write the same bytes twice and make __stubs and __la_symbol_ptr point at one copy each, so that they don't share data. This unconfuses strip, and seems to be what ld64 does too, so hopefully tools are generally more used to this. This makes the output binaries a bit larger, but not much: 4 bytes for roughly each called function from a dylib and each weak function. Chromium Framewoork grows by 6536 bytes, clang-format by a few hundred. With this, `strip -x Chromium\ Framework` works (244 MB before stripping to 171 MB after stripping, compared to 236 MB=>164 MB with ld64). Running strip without `-x` produces the same error message now for lld-linked Chromium Framework as for when using ld64 as a linker. `strip clang-format` also works now but didn't previously. Fixes PR50657. Differential Revision: https://reviews.llvm.org/D104081	2021-06-11 00:18:03 -04:00
Fangrui Song	0995bbdb66	[ELF] Simplify getAArch64UndefinedRelativeWeakVA. NFC	2021-06-10 13:30:16 -07:00
Fangrui Song	c03b6305d8	[ELF][RISCV] Resolve branch relocations referencing undefined weak to current location if not using PLT In a -no-pie link we optimize R_PLT_PC to R_PC. Currently we resolve a branch relocation to the link-time zero address. However such a choice tends to cause relocation overflow possibility for RISC architectures. * aarch64: GNU ld: rewrite the instruction to a NOP; ld.lld: branch to the next instruction * mips: GNU ld: branch to the start of the text segment (?); ld.lld: branch to zero * ppc32: GNU ld: rewrite the instruction to a NOP; ld.lld: branch to the current instruction * ppc64: GNU ld: rewrite the instruction to a NOP; ld.lld: branch to the current instruction * riscv: GNU ld: branch to the absolute zero address (with instruction rewriting) * i386/x86_64: GNU ld/ld.lld: branch to the link-time zero address I think that resolving to the same location is a good choice. The instruction, if triggered, is clearly an undefined behavior. Resolving to the same location can cause an infinite loop (making the user aware of the issue) while ensuring no overflow. Reviewed By: jrtc27 Differential Revision: https://reviews.llvm.org/D103001	2021-06-10 13:25:16 -07:00
Jez Ng	4b5c6c5c4b	[lld-macho][nfc] Fix uninitialized members warning from Coverity We were always assigning to this member before using it, but just to be safe... See https://lists.llvm.org/pipermail/llvm-dev/2021-June/151029.html	2021-06-10 15:09:07 -04:00
Nico Weber	e87c095af3	[lld/mac] Print dylib search details with --print-dylib-search or RC_TRACE_DYLIB_SEARCHING For debugging dylib loading, it's useful to have some insight into what the linker is doing. ld64 has the undocumented RC_TRACE_DYLIB_SEARCHING env var for this printing dylib search candidates. This adds a flag --print-dylib-search to make lld print the seame information. It's useful for users, but also for writing tests. The output is formatted slightly differently than ld64, but we still support RC_TRACE_DYLIB_SEARCHING to offer at least a compatible way to trigger this. ld64 has both `-print_statistics` and `-trace_symbol_output` to enable diagnostics output. I went with "print" since that seems like a more straightforward name. Differential Revision: https://reviews.llvm.org/D103985	2021-06-09 22:08:20 -04:00
Nico Weber	bbe6f51b72	[lld/mac] Make framework symlinks in tests more realistic In a framework Foo.framework, Foo.framework/Foo is usually a relative symbolic link to Foo.framework/Versions/Current/Foo, and Foo.framework/Versions/Current is usually a relative symbolic link to A. Our tests used absolute symbolic links. Now they use relative symbolic links. No behavior change, just makes the tests more representative of the real world. (implicit-dylib.s omits the "Current" folder too, but I'm not changing that here.) Differential Revision: https://reviews.llvm.org/D103998	2021-06-09 20:39:39 -04:00
Nico Weber	0e399eb527	[lld/mac] When handling @loader_path, use realpath() of symlinks This is important for Frameworks, which are usually symlinks. ld64 gets this right for @rpath that's replaced with @loader_path, but not for bare @loader_path -- ld64's code calls realpath() in that case too, but ignores the result. ld64 somehow manages to find libbar1.dylib in the test without the explicit `-rpath` in Foo1. I don't understand why or how. But this change is a step forward and fixes an immediate problem I'm having, so let's start with this :) Differential Revision: https://reviews.llvm.org/D103990	2021-06-09 20:36:07 -04:00
Fangrui Song	928a197d26	[ELF] Add a GRP_COMDAT test with a local signature symbol See https://groups.google.com/g/generic-abi/c/2X6mR-s2zoc Test that a local signature symbol does not suppress COMDAT deduplication.	2021-06-08 09:22:30 -07:00
David Blaikie	c5d56fec50	NFC: .clang-tidy: Inherit configs from parents to improve maintainability In the interests of disabling misc-no-recursion across LLVM (this seems like a stylistic choice that is not consistent with LLVM's style/development approach) this NFC preliminary change adjusts all the .clang-tidy files to inherit from their parents as much as possible. This change specifically preserves all the quirks of the current configs in order to make it easier to review as NFC. I validatad the change is NFC as follows: for X in `cat ../files.txt`; do mkdir -p ../tmp/$(dirname $X) touch $(dirname $X)/blaikie.cpp clang-tidy -dump-config $(dirname $X)/blaikie.cpp > ../tmp/$(dirname $X)/after rm $(dirname $X)/blaikie.cpp done (similarly for the "before" state, without this patch applied) for X in `cat ../files.txt`; do echo $X diff \ ../tmp/$(dirname $X)/before \ <(cat ../tmp/$(dirname $X)/after \ \| sed -e "s/,readability-identifier-naming$.$,-readability-identifier-naming/\1/" \ \| sed -e "s/,-llvm-include-order$.$,llvm-include-order/\1/" \ \| sed -e "s/,-misc-no-recursion$.$,misc-no-recursion/\1/" \ \| sed -e "s/,-clang-diagnostic-\$.$,clang-diagnostic-\/\1/") done (using sed to strip some add/remove pairs to reduce the diff and make it easier to read) The resulting report is: .clang-tidy clang/.clang-tidy 2c2 < Checks: 'clang-diagnostic-,clang-analyzer-,-,clang-diagnostic-,llvm-,misc-,-misc-unused-parameters,-misc-non-private-member-variables-in-classes,-readability-identifier-naming,-misc-no-recursion' --- > Checks: 'clang-diagnostic-,clang-analyzer-,-,clang-diagnostic-,llvm-,misc-,-misc-unused-parameters,-misc-non-private-member-variables-in-classes,-misc-no-recursion' compiler-rt/.clang-tidy 2c2 < Checks: 'clang-diagnostic-,clang-analyzer-,-,clang-diagnostic-,llvm-,-llvm-header-guard,misc-,-misc-unused-parameters,-misc-non-private-member-variables-in-classes' --- > Checks: 'clang-diagnostic-,clang-analyzer-,-,clang-diagnostic-,llvm-,misc-,-misc-unused-parameters,-misc-non-private-member-variables-in-classes,-llvm-header-guard' flang/.clang-tidy 2c2 < Checks: 'clang-diagnostic-,clang-analyzer-,-,llvm-,-llvm-include-order,misc-,-misc-no-recursion,-misc-unused-parameters,-misc-non-private-member-variables-in-classes' --- > Checks: 'clang-diagnostic-,clang-analyzer-,-,llvm-,misc-,-misc-unused-parameters,-misc-non-private-member-variables-in-classes,-llvm-include-order,-misc-no-recursion' flang/include/flang/Lower/.clang-tidy flang/include/flang/Optimizer/.clang-tidy flang/lib/Lower/.clang-tidy flang/lib/Optimizer/.clang-tidy lld/.clang-tidy lldb/.clang-tidy llvm/tools/split-file/.clang-tidy mlir/.clang-tidy The `clang/.clang-tidy` change is a no-op, disabling an option that was never enabled. The compiler-rt and flang changes are no-op reorderings of the same flags. (side note, the .clang-tidy file in parallel-libs is broken and crashes clang-tidy because it uses "lowerCase" as the style instead of "lower_case" - so I'll deal with that separately) Differential Revision: https://reviews.llvm.org/D103842	2021-06-08 08:25:59 -07:00
Jez Ng	447dfbe005	[lld-macho] Implement -force_load_swift_libs It causes libraries whose names start with "swift" to be force-loaded. Note that unlike the more general `-force_load`, this flag only applies to libraries specified via LC_LINKER_OPTIONS, and not those passed on the command-line. This is what ld64 does. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D103709	2021-06-07 23:48:35 -04:00
Jez Ng	04259cde15	[lld-macho] Implement cstring deduplication Our implementation draws heavily from LLD-ELF's, which in turn delegates its string deduplication to llvm-mc's StringTableBuilder. The messiness of this diff is largely due to the fact that we've previously assumed that all InputSections get concatenated together to form the output. This is no longer true with CStringInputSections, which split their contents into StringPieces. StringPieces are much more lightweight than InputSections, which is important as we create a lot of them. They may also overlap in the output, which makes it possible for strings to be tail-merged. In fact, the initial version of this diff implemented tail merging, but I've dropped it for reasons I'll explain later. Alignment Issues Mergeable cstring literals are found under the `__TEXT,__cstring` section. In contrast to ELF, which puts strings that need different alignments into different sections, clang's Mach-O backend puts them all in one section. Strings that need to be aligned have the `.p2align` directive emitted before them, which simply translates into zero padding in the object file. I think ld64 extracts the desired per-string alignment from this data by preserving each string's offset from the last section-aligned address. I'm not entirely certain since it doesn't seem consistent about doing this; but perhaps this can be chalked up to cases where ld64 has to deduplicate strings with different offset/alignment combos -- it seems to pick one of their alignments to preserve. This doesn't seem correct in general; we can in fact can induce ld64 to produce a crashing binary just by linking in an additional object file that only contains cstrings and no code. See PR50563 for details. Moreover, this scheme seems rather inefficient: since unaligned and aligned strings are all put in the same section, which has a single alignment value, it doesn't seem possible to tell whether a given string doesn't have any alignment requirements. Preserving offset+alignments for strings that don't need it is wasteful. In practice, the crashes seen so far seem to stem from x86_64 SIMD operations on cstrings. X86_64 requires SIMD accesses to be 16-byte-aligned. So for now, I'm thinking of just aligning all strings to 16 bytes on x86_64. This is indeed wasteful, but implementation-wise it's simpler than preserving per-string alignment+offsets. It also avoids the aforementioned crash after deduplication of differently-aligned strings. Finally, the overhead is not huge: using 16-byte alignment (vs no alignment) is only a 0.5% size overhead when linking chromium_framework. With these alignment requirements, it doesn't make sense to attempt tail merging -- most strings will not be eligible since their overlaps aren't likely to start at a 16-byte boundary. Tail-merging (with alignment) for chromium_framework only improves size by 0.3%. It's worth noting that LLD-ELF only does tail merging at `-O2`. By default (at `-O1`), it just deduplicates w/o tail merging. @thakis has also mentioned that they saw it regress compressed size in some cases and therefore turned it off. `ld64` does not seem to do tail merging at all. Performance Numbers CString deduplication reduces chromium_framework from 250MB to 242MB, or about a 3.2% reduction. Numbers for linking chromium_framework on my 3.2 GHz 16-Core Intel Xeon W: N Min Max Median Avg Stddev x 20 3.91 4.03 3.935 3.95 0.034641016 + 20 3.99 4.14 4.015 4.0365 0.0492336 Difference at 95.0% confidence 0.0865 +/- 0.027245 2.18987% +/- 0.689746% (Student's t, pooled s = 0.0425673) As expected, cstring merging incurs some non-trivial overhead. When passing `--no-literal-merge`, it seems that performance is the same, i.e. the refactoring in this diff didn't cost us. N Min Max Median Avg Stddev x 20 3.91 4.03 3.935 3.95 0.034641016 + 20 3.89 4.02 3.935 3.9435 0.043197831 No difference proven at 95.0% confidence Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D102964	2021-06-07 23:48:35 -04:00
Nico Weber	17c43c4045	[lld/mac] Add reexports after reexporter to inputFiles When a library "host"'s reexports change their installName with `$ld$os10.11$install_name$host`, we used to write a load command for "host" but write the version numbers of the reexport instead of "host". This fixes that. I first thought that the rule is to take the version numbers from the library that originally had that install name (implemented in D103819), but that's not what ld64 seems to be doing: It takes the version number from the first dylib with that install name it loads, and it loads the reexporting library before the reexports. We already did most of that, we just added reexports before the reexporter. After this change, we add the reexporter before the reexports. Addresses https://bugs.llvm.org/show_bug.cgi?id=49800#c11 part 1. (ld64 seems to add reexports after processing _all_ files on the command line, while we add them right after the reexporter. For the common case of reexport + $ld$ symbol changing back to the exporter name, this doesn't make a difference, but you can construct a case where it does. I expect this to not make a difference in practice though.) Differential Revision: https://reviews.llvm.org/D103821	2021-06-07 17:04:03 -04:00
Nico Weber	422544414b	[lld/mac] Add a test for -reexport_library + -dead_strip_dylibs Our behavior here already matched ld64, now we have a test for it. (ld64 even strips the library here if you also pass -needed_library bar.dylib. That seems wrong to me, and lld honors needed_library in that case.) Differential Revision: https://reviews.llvm.org/D103812	2021-06-07 13:44:58 -04:00
Nico Weber	c5ffe97988	[lld/mac] Implement support for searching dylibs with @rpath/ in install name Also adjust a few comments, and move the DylibFile comment talking about umbrella next to the parameter again. Differential Revision: https://reviews.llvm.org/D103783	2021-06-07 06:22:52 -04:00
Nico Weber	52489021cf	[lld/mac] Implement support for searching dylibs with @loader_path/ in install name Differential Revision: https://reviews.llvm.org/D103779	2021-06-06 20:19:50 -04:00
Nico Weber	a48bd587f7	[lld/mac] Implement support for searching dylibs with @executable_path/ in install name Differential Revision: https://reviews.llvm.org/D103775	2021-06-06 20:01:50 -04:00
Nico Weber	7def700667	[lld/mac] Rename DylibFile::dylibName to DylibFile::installName The flag to set it is called `-install_name`, and it's called `installName` in tbd files. No behavior change. Differential Revision: https://reviews.llvm.org/D103776	2021-06-06 20:00:35 -04:00
Nico Weber	e910437443	[lld/mac] Use fewer magic numbers in magic $ld$ handling code Also simply a conditional and de-alias a variable. Minor cleanups, no behavior change. Differential Revision: https://reviews.llvm.org/D103774	2021-06-06 18:13:16 -04:00
Alexander Shaposhnikov	5e49ee8794	[lld][MachO] Add support for $ld$install_name symbols This diff adds support for $ld$install_name symbols. Test plan: make check-lld-macho Differential revision: https://reviews.llvm.org/D103746	2021-06-05 12:58:59 -07:00
Alexander Shaposhnikov	cf29a92b90	[lld][MachO] Fix typo in special-symbol-ld-previous.s Fix typo in the test special-symbol-ld-previous.s. NFC.	2021-06-05 01:27:42 -07:00
Alexander Shaposhnikov	1309c181a8	[lld][MachO] Add first bits to support special symbols This diff adds first bits to support special symbols $ld$previous* in LLD. $ld$* symbols modify properties/behavior of the library (e.g. its install name, compatibility version or hide/add symbols) for specific target versions. Test plan: make check-lld-macho Differential revision: https://reviews.llvm.org/D103505	2021-06-04 23:32:26 -07:00
Nico Weber	1aae55ddea	[lld/mac] Add test coverage for --reproduce + -flat_namespace Works fine already, now it has a test too. Differential Revision: https://reviews.llvm.org/D103643	2021-06-03 21:00:35 -04:00
Alex Richardson	90344499ae	[lld-macho] Fix BUILD_SHARED_LIBS build `ca6751043d` added a dependency on XAR (at least for the shared libs build), so without this change we get the following linker error: Undefined symbols for architecture x86_64: "_xar_close", referenced from: lld::macho::BitcodeBundleSection::finalize() in SyntheticSections.cpp.o Reviewed By: #lld-macho, int3, thakis Differential Revision: https://reviews.llvm.org/D100999	2021-06-03 19:58:43 +01:00
Nikita Popov	d93b678abb	[lld] Add missing includes (NFC) Fix lld build after `983565a6fe`.	2021-06-03 18:55:18 +02:00
Jez Ng	6881f29a36	[lld-macho] Parse re-exports of nested TAPI documents D103423 neglected to call `parseReexports()` for nested TBD documents, leading to symbol resolution failures when trying to look up a symbol nested more than one level deep in a TBD file. This fixes the regression and adds a test. It also appears that `umbrella` wasn't being set properly when calling `parseLoadCommands` -- it's supposed to resolve to `this` if `nullptr` is passed. I didn't write a failing test case for this but I've made `umbrella` a member so the previous behavior should be preserved. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D103586	2021-06-03 12:02:30 -04:00
Martin Storsjö	728cc0075e	[LLD] [COFF] Fix autoexport from LTO objects with comdat symbols Make sure that comdat symbols also have a non-null dummy SectionChunk associated. This requires moving around an existing FIXME regarding comdats in LTO. Differential Revision: https://reviews.llvm.org/D103012	2021-06-03 15:14:49 +03:00
Nico Weber	5ecfdb5123	[lld/mac] try to fix tests after `a5645513db` My linux system doesn't like the `grep` for some reason, but FileCheck seems to work.	2021-06-02 11:33:11 -04:00
Nico Weber	a5645513db	[lld/mac] Implement -dead_strip Also adds support for live_support sections, no_dead_strip sections, .no_dead_strip symbols. Chromium Framework 345MB unstripped -> 250MB stripped (vs 290MB unstripped -> 236M stripped with ld64). Doing dead stripping is a bit faster than not, because so much less data needs to be processed: % ministat lld_* x lld_nostrip.txt + lld_strip.txt N Min Max Median Avg Stddev x 10 3.929414 4.07692 4.0269079 4.0089678 0.044214794 + 10 3.8129408 3.9025559 3.8670411 3.8642573 0.024779651 Difference at 95.0% confidence -0.144711 +/- 0.0336749 -3.60967% +/- 0.839989% (Student's t, pooled s = 0.0358398) This interacts with many parts of the linker. I tried to add test coverage for all added `isLive()` checks, so that some test will fail if any of them is removed. I checked that the test expectations for the most part match ld64's behavior (except for live-support-iterations.s, see the comment in the test). Interacts with: - debug info - export tries - import opcodes - flags like -exported_symbol(s_list) - -U / dynamic_lookup - mod_init_funcs, mod_term_funcs - weak symbol handling - unwind info - stubs - map files - -sectcreate - undefined, dylib, common, defined (both absolute and normal) symbols It's possible it interacts with more features I didn't think of, of course. I also did some manual testing: - check-llvm check-clang check-lld work with lld with this patch as host linker and -dead_strip enabled - Chromium still starts - Chromium's base_unittests still pass, including unwind tests Implemenation-wise, this is InputSection-based, so it'll work for object files with .subsections_via_symbols (which includes all object files generated by clang). I first based this on the COFF implementation, but later realized that things are more similar to ELF. I think it'd be good to refactor MarkLive.cpp to look more like the ELF part at some point, but I'd like to get a working state checked in first. Mechanical parts: - Rename canOmitFromOutput to wasCoalesced (no behavior change) since it really is for weak coalesced symbols - Add noDeadStrip to Defined, corresponding to N_NO_DEAD_STRIP (`.no_dead_strip` in asm) Fixes PR49276. Differential Revision: https://reviews.llvm.org/D103324	2021-06-02 11:09:26 -04:00
Nico Weber	66a1ecd2cf	[lld/mac] Implement -needed_framework, -needed_library, -needed-l These allow overriding dead_strip_dylibs. Differential Revision: https://reviews.llvm.org/D103499	2021-06-02 11:06:42 -04:00
Nico Weber	e14fd7d879	[lld/mac] Don't strip explicit dylib also mentioned in LC_LINKER_OPTION Noticed by Jez in D103499. Differential Revision: https://reviews.llvm.org/D103521	2021-06-02 10:59:56 -04:00
Nico Weber	476e4d65d4	[lld/mac] Address review feedback and improve a comment I forgot to move the message() call around as requested in D103428 before committing that change. Move it now. Also, improve the ordinal uniq'ing comment. I hadn't realized that the distinct-but-identical files happen with --reproduce and not in general. No behavior change. Differential Revision: https://reviews.llvm.org/D103522	2021-06-02 10:54:53 -04:00
Nico Weber	78ce89bb1e	[lld/mac] Implement -reexport_framework, -reexport_library, -reexport-l These are slightly easier-to-use versions of -sub_library and -sub_umbrella. Differential Revision: https://reviews.llvm.org/D103497	2021-06-02 06:37:34 -04:00
Nico Weber	222a88a243	[lld/mac] Make -t work correctly with -flat_namespace We used to not print dylibs referenced by other dylibs in `-t` mode. This affected reexports, and with `-flat_namespace` also just dylibs loaded by dylibs. Now we print them. Fixes PR49514. Differential Revision: https://reviews.llvm.org/D103428	2021-06-01 19:23:39 -04:00
Vy Nguyen	8f89c054af	[lld-macho][nfc] Remove unnecessary use of Optional<T*> In all of these cases, the functions could simply return a nullptr instead of {}. There is no case where Optional<nullptr> has a special meaning. Differential Revision: https://reviews.llvm.org/D103489	2021-06-01 18:35:31 -04:00
Nico Weber	aeae3e0ba9	[lld/mac] Emit only one LC_LOAD_DYLIB per dylib In some cases, we end up with several distinct DylibFiles that have the same install name. Only emit a single LC_LOAD_DYLIB in those cases. This happens in 3 cases I know of: 1. Some tbd files are symlinks. libpthread.tbd is a symlink against libSystem.tbd for example, so `-lSystem -lpthread` loads libSystem.tbd twice. We could (and maybe should) cache loaded dylibs by realpath() to catch this. 2. Some tbd files are copies of each other. For example, CFNetwork.framework/CFNetwork.tbd and CFNetwork.framework/Versions/A/CFNetwork.tbd are two distinct copies of the same file. The former is found by `-framework CFNetwork` and the latter by the reexport in CoreServices.tbd. We could conceivably catch this by making `-framework` search look in `Versions/Current` instead of in the root, and/or by using a content hash to cache tbd files, but that's starting to sound complicated. 3. Magic $ld$ symbol processing can change the install name of a dylib based on the target platform_version. Here, two truly distinct dylibs can have the same install name. So we need this code to deal with (3) anyways. Might as well use it for 1 and 2, at least for now :) With this (and D103430), clang-format links in the same dylibs when linked with lld and ld64. Differential Revision: https://reviews.llvm.org/D103488	2021-06-01 18:15:35 -04:00
Sam Clegg	c1a59fa550	[lld][WebAssemlby] Fix for string merging of -dwarf-5 sections We were mistakenly treating `.debug_str_offsets` as a string mergable section when it is not (it contains integers not strings). This is an indication that we really should find a way to store flags for custom sections. Fixes: https://bugs.llvm.org/show_bug.cgi?id=48828 Fixes: https://bugs.chromium.org/p/chromium/issues/detail?id=1172217 Differential Revision: https://reviews.llvm.org/D103486	2021-06-01 14:33:56 -07:00
Nico Weber	8d80139ccc	[lld/mac] fix test failure after `24979e111` If there is an error reading the dylib, we shouldn't try to load its reexports. Caught e.g. by https://lab.llvm.org/buildbot/#/builders/36/builds/8946	2021-06-01 16:35:25 -04:00
Nico Weber	2c1903412b	[lld/mac] Implement removal of unused dylibs This omits load commands for unreferenced dylibs if: - the dylib was loaded implicitly, - it is marked MH_DEAD_STRIPPABLE_DYLIB - or -dead_strip_dylibs is passed This matches ld64. Currently, the "is dylib referenced" state is computed before dead code stripping and is not updated after dead code stripping. This too matches ld64. We should do better here. With this, clang-format linked with lld (like with ld64) no longer has libobjc.A.dylib in `otool -L` output. (It was implicitly loaded as a reexport of CoreFoundation.framework, but it's not needed.) Differential Revision: https://reviews.llvm.org/D103430	2021-06-01 16:06:30 -04:00
Nico Weber	24979e1113	[lld/mac] Don't load DylibFiles from the DylibFile constructor loadDylib() keeps a name->DylibFile cache, but it only writes to the cache once the DylibFile constructor has completed. So dylib loads done recursively from the DylibFile constructor wouldn't use the cache. Now, we load additional dylibs after writing to the cache, which means the cache now gets used for dylibs loaded because they're referenced from other dylibs. Related to PR49514 and PR50101, but no dramatic behavior change in itself. (Technically we no longer crash when a tbd file reexports itself, but that doesn't happen in practice. We now accept it silently instead of crashing; ld64 has a diag for the reexport cycle.) Differential Revision: https://reviews.llvm.org/D103423	2021-06-01 15:31:02 -04:00
Nico Weber	0b39f055d8	[lld/mac] Don't write mtimes to N_OSO entries if ZERO_AR_DATE is set. This is important for build determinism. This matches ld64. Differential Revision: https://reviews.llvm.org/D103446	2021-06-01 15:29:38 -04:00
Nico Weber	c4053cd14e	[lld/mac] Don't crash on -order_file with assembly inputs on arm64 .s files with `-g` generate __debug_aranges on darwin/arm64 for some reason, and those lead to `nullptr` symbols. Don't crash on that. Fixes PR50517. Differential Revision: https://reviews.llvm.org/D103350	2021-05-28 21:00:46 -04:00
Fangrui Song	2644399ce7	[lld-macho][test] Simplify --allow-empty with count 0	2021-05-28 15:15:59 -07:00
Alexandre Ganea	2b9b9652ce	[LLD][COFF] Reduce the maximum size of the GHASH table Before this patch, the maximum size of the GHASH table was 2^31 buckets. However we were storing the bucket index into a TypeIndex which has an input limit of (2^31)-4095 indices, see this link. Any value above that limit will improperly set the TypeIndex's high bit, which is interpreted as DecoratedItemIdMask. This used to cause bad indices on extraction when calling TypeIndex::toArrayIndex(). Differential Revision: https://reviews.llvm.org/D103297	2021-05-28 09:46:17 -04:00
Reid Kleckner	99f023656b	[PDB] Fix ubsan complaint about memcpy from null pointer	2021-05-27 19:49:09 -07:00
Reid Kleckner	109aac9212	[PDB] Enable parallel ghash type merging by default Ghashing is probably going to be faster in most cases, even without precomputed ghashes in object files. Here is my table of results linking clang.pdb: ------------------------------- \| threads \| GHASH \| NOGHASH \| ------------------------------- \| j1 \| 51.031s \| 25.141s \| \| j2 \| 31.079s \| 22.109s \| \| j4 \| 18.609s \| 23.156s \| \| j8 \| 11.938s \| 21.984s \| \| j28 \| 8.375s \| 18.391s \| ------------------------------- This shows that ghashing is faster if at least four cores are available. This may make the linker slower if most cores are busy in the middle of a build, but in that case, the linker probably isn't on the critical path of the build. Incremental build performance is arguably more important than highly contended batch build link performance. The -time output indicates that ghash computation is the dominant factor: Input File Reading: 924 ms ( 1.8%) GC: 689 ms ( 1.3%) ICF: 527 ms ( 1.0%) Code Layout: 414 ms ( 0.8%) Commit Output File: 24 ms ( 0.0%) PDB Emission (Cumulative): 49938 ms ( 94.8%) Add Objects: 46783 ms ( 88.8%) Global Type Hashing: 38983 ms ( 74.0%) GHash Type Merging: 5640 ms ( 10.7%) Symbol Merging: 2154 ms ( 4.1%) Publics Stream Layout: 188 ms ( 0.4%) TPI Stream Layout: 18 ms ( 0.0%) Commit to Disk: 2818 ms ( 5.4%) -------------------------------------------------- Total Link Time: 52669 ms (100.0%) We can speed that up with a faster content hash (not SHA1). Differential Revision: https://reviews.llvm.org/D102888	2021-05-27 14:19:36 -07:00
Jez Ng	7599e98ab7	[lld-macho][nfc] Remove unnecessary parameterization of section sort As @alexshap pointed out [here](https://reviews.llvm.org/D102972#inline-975208), it's a bit confusing to have the option to sort OutputSections with any comparator when in practice we only use one. Reviewed By: #lld-macho, alexshap, thakis Differential Revision: https://reviews.llvm.org/D102974	2021-05-25 14:58:30 -04:00
Jez Ng	fcab06bd85	[lld-macho][nfc] Sort OutputSections based on explicit order of command-line inputs This diff paves the way for {D102964} which adds a new kind of InputSection. We previously maintained section ordering implicitly: we created InputSections as we parsed each file in command-line order, and passed on this ordering when we created OutputSections and OutputSegments by iterating over these InputSections. The implicitness of the ordering made it difficult to refactor the code to e.g. handle a new type of InputSection. As such, I've codified the ordering explicitly via `inputOrder` fields. This also allows us to use `sort` instead of `stable_sort`. Benchmarking chromium_framework on my 3.2 GHz 16-Core Intel Xeon W: N Min Max Median Avg Stddev x 20 4.23 4.35 4.27 4.274 0.030157481 + 20 4.24 4.38 4.27 4.2815 0.033759989 No difference proven at 95.0% confidence Reviewed By: #lld-macho, alexshap Differential Revision: https://reviews.llvm.org/D102972	2021-05-25 14:58:29 -04:00
Jez Ng	33706191d8	[lld-macho][nfc] Rename MergedOutputSection to ConcatOutputSection The ELF format has the concept of merge sections (marked by SHF_MERGE), which contain data that can be safely deduplicated. The Mach-O equivalents are called literal sections (marked by S_CSTRING_LITERALS or S_{4,8,16}BYTE_LITERALS). While the Mach-O format doesn't use the word 'merge', to avoid confusion, I've renamed our MergedOutputSection to ConcatOutputSection. I believe it's a more descriptive name too. This renaming sets the stage for {D102964}. Reviewed By: #lld-macho, alexshap Differential Revision: https://reviews.llvm.org/D102971	2021-05-25 14:58:29 -04:00
Jez Ng	9cc0d893f7	[lld-macho][nfc] clang-format everything	2021-05-25 14:58:29 -04:00
Jez Ng	8535834ef7	[lld-macho][nfc] Misc code cleanup * Move `static_asserts` into cpp instead of header file. I noticed they had been separated from the main class definition in the header, so I set about to clean that up, then figured it made more sense as part of the cpp file so as not to incur unnecessary compile-time overhead. * Remove unnecessary `virtual`s * Remove unnecessary comment / reword another comment	2021-05-25 14:58:29 -04:00
Nathan Lanza	2f65166056	[lld:elf] Weaken the requirement for a computed binding to be STB_LOCAL Given the following scenario: ``` // Cat.cpp struct Animal { virtual void makeNoise() const = 0; }; struct Cat : Animal { void makeNoise() const override; }; extern "C" int puts(char const ); void Cat::makeNoise() const { puts("Meow"); } void doThingWithCat(Animal a) { static_cast<Cat >(a)->makeNoise(); } // CatUser.cpp struct Animal { virtual void makeNoise() const = 0; }; struct Cat : Animal { void makeNoise() const override; }; void doThingWithCat(Animal a); void useDoThingWithCat() { Cat d = new Cat; doThingWithCat(d); } // cat.ver { global: _Z17useDoThingWithCatv; local: ; }; $ clang++ Cat.cpp CatUser.cpp -fpic -flto=thin -fwhole-program-vtables -shared -O3 -fuse-ld=lld -Wl,--lto-whole-program-visibility -Wl,--version-script,cat.ver ``` We cannot devirtualize `Cat::makeNoise`. The issue is complex: Due to `-fsplit-lto-unit` and usage of type metadata, we place the Cat vtable declaration into module 0 and the Cat vtable definition with type metadata into module 1, causing duplicate entries (Undefined followed by Defined) in the `lto::InputFile::symbols()` output. In `BitcodeFile::parse`, after processing the `Undefined` then the `Defined`, the final state is `Defined`. In `BitcodeCompiler::add`, for the first symbol, `computeBinding` returns `STB_LOCAL`, then we reset it to `Undefined` because it is prevailing (`versionId` is `preserved`). For the second symbol, because the state is now `Undefined`, `computeBinding` returns `STB_GLOBAL`, causing `ExportDynamic` to be true and suppressing devirtualization. In D77280, the `computeBinding` change used a stricter `isDefined()` condition to make weak``Lazy` symbol work. This patch relaxes the condition to weaker `!isLazy()` to keep it working while making the devirtualization work as well. Differential Revision: https://reviews.llvm.org/D98686	2021-05-24 23:32:21 -04:00
David Blaikie	e5b66a3734	lld-coff: Simplify a few lambda uses after `7975dd033c`	2021-05-24 17:26:46 -07:00
serge-sans-paille	4ab3041acb	Revert "[NFC] remove explicit default value for strboolattr attribute in tests" This reverts commit `bda6e5bee0`. See https://lab.llvm.org/buildbot/#/builders/109/builds/15424 for instance	2021-05-24 19:43:40 +02:00
serge-sans-paille	bda6e5bee0	[NFC] remove explicit default value for strboolattr attribute in tests Since `d6de1e1a71`, no attributes is quivalent to setting attribute to false. This is a preliminary commit for https://reviews.llvm.org/D99080	2021-05-24 19:31:04 +02:00
Alexander Shaposhnikov	57501e512e	[lld][MachO] Fix code formatting Apply clang-format -style=llvm to InputFile.cpp. NFC. Test plan: make check-all	2021-05-23 20:35:55 -07:00
Fangrui Song	0f298ec6cc	[ELF][test] Avoid local signature symbols for section groups to match reality If we support local signature symbols (PR43094), these tests would fail. When the support is added, new tests (local signature symbol specific) should be developed.	2021-05-22 17:48:45 -07:00
Fangrui Song	7f0acc4e4f	[docs] ld.lld.1: Mention -z nostart-stop-gc	2021-05-21 19:57:51 -07:00
Sam Clegg	8544b40b6e	[lld][WebAssembly] Fix for PIC output + TLS + non-shared-memory Prior to this change build with `-shared/-pie` and using TLS (but without -shared-memory) would hit this assert: "Currenly only a single data segment is supported in PIC mode" This is because we were not including TLS data when merging data segments. However, when we build without shared-memory (i.e. without threads) we effectively lower away TLS into a normal active data segment.. so we were ending up with two active data segments: the merged data, and the lowered TLS data. To fix this problem we can instead avoid combining data segments at all when running in shared memory mode (because in this case all segment initialization is passive). And then in non-shared memory mode we know that TLS has been lowered and therefore we can can and should combine all segments. So with this new behavior we have two different modes: 1. With shared memory / mutli-threaded: Never combine data segments since it is not necessary. (All data segments as passive already). 2. Wihout shared memory / single-threaded: Combine all data segments since we treat TLS as normal data. (We end up with a single active data segment). Differential Revision: https://reviews.llvm.org/D102937	2021-05-21 15:16:47 -07:00
Axel Y. Rivera	4fb131b497	[LLD][COFF] PR49068: Include the IMAGE_REL_BASED_HIGHLOW relocation base type when the machine is 64 bits and the relocation type is ADDR32 The COFF driver produces an ABSOLUTE relocation base for an ADDR32 relocation type and the system is 64 bits (machine=AMD64). The relocation information won't be added in the output and could produce an incorrect address access during run-time. This change set checks if the relocation type is IMAGE_REL_AMD64_ADDR32 and if so, adds the relocated symbol as IMAGE_REL_BASED_HIGHLOW base. Differential Revision: https://reviews.llvm.org/D96619	2021-05-21 23:45:55 +03:00
Reid Kleckner	e73203a561	[PDB] Check the type server guid when ghashing Previously we simply didn't check this. Prereq to make the test suite pass with ghash enabled by default. Differential Revision: https://reviews.llvm.org/D102885	2021-05-20 16:36:12 -07:00
Martin Storsjö	33b71ec9c6	[LLD] [COFF] Fix automatic export of symbols from LTO objects Differential Revision: https://reviews.llvm.org/D101569	2021-05-21 00:36:58 +03:00
Wouter van Oortmerssen	3a293cbf13	[WebAssembly] Fix PIC/GOT codegen for wasm64 __table_base is know 64-bit, since in LLVM it represents a function pointer offset __table_base32 is a copy in wasm32 for use in elem init expr, since no truncation may be used there. New reloc R_WASM_TABLE_INDEX_REL_SLEB64 added Differential Revision: https://reviews.llvm.org/D101784	2021-05-20 09:59:31 -07:00
Sam Clegg	356b85edd7	[lld][WebAssembly] Fix for string tail merging and -r/--relocatable Ensure that both SyntheticMergedChunk and all MergeInfoChunks that it comprises are assigned the correct output section. Without this we would crash when outputting relocations in --relocatable mode. Fixes: https://github.com/emscripten-core/emscripten/issues/14220 Differential Revision: https://reviews.llvm.org/D102806	2021-05-19 15:25:58 -07:00
Reid Kleckner	12dd8df38b	[PDB] Do not record PGO or coverage public symbols These symbols are long, and they tend to cause the PDB file size to overflow. They are generally not necessary when debugging problems in user code. This change reduces the size of chrome.dll.pdb with coverage from 6,937,108,480 bytes to 4,690,210,816 bytes. Differential Revision: https://reviews.llvm.org/D102719	2021-05-19 12:41:31 -07:00
Nico Weber	fd09a764eb	[lld/mac] Remove dead declaration	2021-05-19 14:18:03 -04:00
Mariusz Ceier	9383e9c1e6	Fix lld macho standalone build by including llvm/Config/llvm-config.h instead of llvm/Config/config.h lld/MachO/Driver.cpp and lld/MachO/SyntheticSections.cpp include llvm/Config/config.h which doesn't exist when building standalone lld. This patch replaces llvm/Config/config.h include with llvm/Config/llvm-config.h just like it is in lld/ELF/Driver.cpp and HAVE_LIBXAR with LLVM_HAVE_LIXAR and moves LLVM_HAVE_LIBXAR from config.h to llvm-config.h Also it adds LLVM_HAVE_LIBXAR to LLVMConfig.cmake and links liblldMachO2.so with XAR_LIB if LLVM_HAVE_LIBXAR is set. Differential Revision: https://reviews.llvm.org/D102084	2021-05-19 11:15:07 -04:00
Reid Kleckner	ac2226b0f5	[PDB] Improve error handling when writes fail Handle PDB writing errors like any other error in LLD: emit an error and continue. This allows the linker to print timing data and summary data after linking, which can be helpful for finding PDB size problems. Also report how large the file would have been. Example output: lld-link: error: Output data is larger than 4 GiB. File size would have been 6,937,108,480 lld-link: error: failed to write PDB file ./chrome.dll.pdb Summary -------------------------------------------------------------------------------- 33282 Input OBJ files (expanded from all cmd-line inputs) 4 PDB type server dependencies 0 Precomp OBJ dependencies 33396931 Input type records ... snip ... Input File Reading: 59756 ms ( 45.5%) GC: 7500 ms ( 5.7%) ICF: 3336 ms ( 2.5%) Code Layout: 6329 ms ( 4.8%) PDB Emission (Cumulative): 46192 ms ( 35.2%) Add Objects: 27609 ms ( 21.0%) Type Merging: 16740 ms ( 12.8%) Symbol Merging: 10761 ms ( 8.2%) Publics Stream Layout: 9383 ms ( 7.1%) TPI Stream Layout: 1678 ms ( 1.3%) Commit to Disk: 3461 ms ( 2.6%) -------------------------------------------------- Total Link Time: 131244 ms (100.0%) Differential Revision: https://reviews.llvm.org/D102713	2021-05-18 13:17:17 -07:00
Sam Clegg	876d49baad	[lld][WebAssembly] Convert test to assembly. NFC. Differential Revision: https://reviews.llvm.org/D102704	2021-05-18 12:31:13 -07:00
Sam Clegg	45b7cf9955	[lld][WebAssembly] Enable string tail merging in debug sections This is a followup to https://reviews.llvm.org/D97657 which applied string tail merging to data segments. Fixes: https://bugs.llvm.org/show_bug.cgi?id=48828 Differential Revision: https://reviews.llvm.org/D102436	2021-05-18 12:25:39 -07:00
Nico Weber	b4ead2c37b	[lld/mac] Correctly set nextdefsym In LC_DYSYMTAB, private externs were still emitted as exported symbols instead of as locals. Fixes PR50373. See bug for details. Differential Revision: https://reviews.llvm.org/D102662	2021-05-18 13:53:55 -04:00
Martin Storsjö	dd7575ba44	[LLD] [MinGW] Pass the canExitEarly parameter through properly The MinGW driver passed a hardcoded true to this parameter since `6f4e255219`, but when the MinGW driver got the canExitEarly parameter for consistency in `b11386f9be`, this call was missed so it wasn't passed on properly.	2021-05-18 15:09:07 +03:00
Nico Weber	095c520fb4	[lld/mac] Propagate -(un)exported_symbol(s_list) to privateExtern in Driver That way, it's done only once instead of every time shouldExportSymbol() is called. Possibly a bit faster: % ministat at_main at_symtodo x at_main + at_symtodo N Min Max Median Avg Stddev x 30 3.9732189 4.114846 4.024621 4.0304692 0.037022865 + 30 3.93766 4.0510042 3.9973931 3.991469 0.028472565 Difference at 95.0% confidence -0.0390002 +/- 0.0170714 -0.967635% +/- 0.423559% (Student's t, pooled s = 0.0330256) In other runs with n=30 it makes no perf difference, so maybe it's just noise. But being able to quickly and conveniently answer "is this symbol exported?" is useful for fixing PR50373 and for implementing -dead_strip, so this seems like a good change regardless. No behavior change. Differential Revision: https://reviews.llvm.org/D102661	2021-05-18 07:42:58 -04:00
Alexander Shaposhnikov	dc2c6cf274	[lld][MachO] Adjust isCodeSection signature This diff changes the type of the argument of isCodeSection to const InputSection *. NFC. Test plan: make check-lld-macho Differential revision: https://reviews.llvm.org/D102664	2021-05-17 22:09:47 -07:00
Sam Clegg	5a9b25e15b	[lld][WebAssembly] Refactor input chunk class hierarchy. NFC The main motivation for this refactor is to remove the subclass relationship between the InputSegment and MergeInputSegment and SyntenticMergedInputSegment so that we can use the merging classes for debug sections which are not data segments. In the process of refactoring I also remove all the virtual functions from the class hierarchy and try to reuse techniques used in the ELF linker (see `lld/ELF/InputSections.h`). Differential Revision: https://reviews.llvm.org/D102546	2021-05-17 21:01:17 -07:00
Nico Weber	bc588f9961	[lld/mac] Inline a check `match()` can only return for non-empty vectors, but at least in non-LTO builds that isn't clear to the compiler. Help it out. This is a minor but measurable speedup on my machine (but less than what we might've lost in https://reviews.llvm.org/D100818#2764272 -- bot note higher N on this measurement here, so higher confidence here): % ministat at_main at_branch x at_main + at_branch N Min Max Median Avg Stddev x 30 3.9243979 4.0395119 3.987375 3.9826236 0.027567796 + 30 3.8495831 4.0009291 3.931325 3.9347135 0.037832878 Difference at 95.0% confidence -0.0479101 +/- 0.0171102 -1.20298% +/- 0.429622% (Student's t, pooled s = 0.0331007) No behavior change. Eventually we should apply these lists at symbol parse time instead of every time shouldExportSymbol() though :) Differential Revision: https://reviews.llvm.org/D102655	2021-05-17 20:04:45 -04:00
Markus Böck	65271ffe84	[lld][MinGW] Introduce aliases for -Bdynamic and -Bstatic Besides -Bdynamic and -Bstatic, ld documents additional aliases for both of these options. Instead of -Bstatic, one may write -dn, -non_shared or -static. Instead of -Bdynamic one may write -dy or -call_shared. Source: https://sourceware.org/binutils/docs-2.36/ld/Options.html This patch adds those aliases to the MinGW driver of lld for the sake of ld compatibility. Encountered this case while compiling a static Qt 6.1 distribution and got build failures as -static was passed directly to the linker, instead of through the compiler driver. Differential Revision: https://reviews.llvm.org/D102637	2021-05-17 22:13:26 +02:00
Nico Weber	4a12248ee2	[lld/mac] Honor REFERENCED_DYAMICALLY, set it on __mh_execute_header Has the effect that `__mh_execute_header` stays in the symbol table of outputs even after running `strip` on the output. I don't know if that's important for anything -- my motivation for the patch is just is to make the output more similar to ld64. (Corresponds to symbolTableInAndNeverStrip in ld64.) Differential Revision: https://reviews.llvm.org/D102619	2021-05-17 14:22:12 -04:00
Mateusz Mikuła	84306ef9c4	[LLD][MinGW] Add --fatal-warnings and --no-fatal-warnings flags Differential Revision: https://reviews.llvm.org/D102514	2021-05-17 10:40:31 +03:00
Harald van Dijk	d62413452f	[lld][X86] Restore gotEntrySize. D62727 removed GotEntrySize and GotPltEntrySize with a comment that they are always equal to wordsize(), but that is not entirely true: X32 has a word size of 4, but needs 8-byte GOT entries. This restores gotEntrySize for both, adjusted for current naming conventions, but defaults it to config->wordsize to keep things simple for architectures other than x86_64. This partially reverts D62727. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D102509	2021-05-17 00:13:00 +01:00
Sam Clegg	119f61af3a	[lld][WebAssembly] Remove unused method declaration. NFC This method was removed in https://reviews.llvm.org/D102265 but the declaration was missed.	2021-05-14 15:54:33 -07:00
Mateusz Mikuła	f84a4cb0df	[LLD][MinGW] Ignore --no-undefined flag AFAIK this is the default behaviour when this flag is not passed. Differential Revision: https://reviews.llvm.org/D102516	2021-05-14 23:47:50 +03:00
Fangrui Song	5741dc87a5	[test] Improve x86-64-plt.s	2021-05-14 10:38:40 -07:00
Fangrui Song	4adf7a7604	[ELF] Add -Bno-symbolic This option will be available in GNU ld 2.27 (https://sourceware.org/bugzilla/show_bug.cgi?id=27834). This option can cancel previously specified -Bsymbolic and -Bsymbolic-functions. This is useful for excluding some links when the default uses -Bsymbolic-functions. Reviewed By: jhenderson, peter.smith Differential Revision: https://reviews.llvm.org/D102383	2021-05-14 09:40:32 -07:00
Fangrui Song	da9b6d0656	[ELF][test] Improve -Bsymbolic & -Bsymbolic-functions test Previously there was no test checking that -Bsymbolic-functions only applies to STT_FUNC symbols. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D102461	2021-05-14 09:33:43 -07:00
Reid Kleckner	ee23f8b36f	[COFF] Remove a truncation assertion from setRVA LLD already produces a nice error message when sections exceed 4GB, and this setRVA assertion causes LLD to crash instead of diagnosing the error properly. No test because we don't want slow tests that create 4GB files.	2021-05-13 19:37:14 -07:00
Sam Clegg	cd01430ff1	[lld][WebAssembly] Allow data symbols to extend past end of segment This fixes a bug with string merging with string symbols that contain NULLs, as is the case in the `merge-string.s` test. The bug only showed when we run with `--relocatable` and then try read the resulting object back in. In this case we would end up with string symbols that extend past the end of the segment in which they live. The problem comes from the fact that sections which are flagged as string mergable assume that all strings are NULL terminated. The merging algorithm will drop trailing chars that follow a NULL since they are essentially unreachable. However, the "size" attribute (in the symbol table) of such a truncated symbol is not updated resulting a symbol size that can overlap the end of the segment. I verified that this can happen in ELF too given the right conditions and the its harmless enough. In practice Strings that contain embedded null should not be part of a mergable section. Differential Revision: https://reviews.llvm.org/D102281	2021-05-12 13:43:37 -07:00
Sam Clegg	3041b16f73	[WebAssembly] Add TLS data segment flag: WASM_SEG_FLAG_TLS Previously the linker was relying solely on the name of the segment to imply TLS. Differential Revision: https://reviews.llvm.org/D102202	2021-05-12 13:31:02 -07:00
Fangrui Song	a8053399cd	[ELF][AVR] Add explicit relocation types to getRelExpr	2021-05-12 12:38:27 -07:00
Martin Storsjö	7e0768329c	[LLD] [COFF] Fix including the personality function for DWARF EH when linking with --gc-sections Since `c579a5b1d9` we don't traverse .eh_frame when doing GC. But the exception handling personality function needs to be included, and is only referenced from within .eh_frame. Differential Revision: https://reviews.llvm.org/D102138	2021-05-12 22:23:01 +03:00
Shoaib Meenai	56f7e5a822	[cmake] Add support for multiple distributions LLVM's build system contains support for configuring a distribution, but it can often be useful to be able to configure multiple distributions (e.g. if you want separate distributions for the tools and the libraries). Add this support to the build system, along with documentation and usage examples. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D89177	2021-05-12 11:13:18 -07:00
Greg McGary	93c8559baf	[lld-macho] Implement branch-range-extension thunks Extend the range of calls beyond an architecture's limited branch range by first calling a thunk, which loads the far address into a scratch register (x16 on ARM64) and branches through it. Other ports (COFF, ELF) use multiple passes with successively-refined guesses regarding the expansion of text-space imposed by thunk-space overhead. This MachO algorithm places thunks during MergedOutputSection::finalize() in a single pass using exact thunk-space overheads. Thunks are kept in a separate vector to avoid the overhead of inserting into the `inputs` vector of `MergedOutputSection`. FIXME: * arm64-stubs.s test is broken * add thunk tests * Handle thunks to DylibSymbol in MergedOutputSection::finalize() Differential Revision: https://reviews.llvm.org/D100818	2021-05-12 09:44:58 -07:00
Sam Clegg	19cedd3cd3	[lld][WebAssembly] Fix for string merging + negative addends Don't include the relocation addend when calculating the virtual address of a symbol. Instead just pass the symbol's offset and add the addend afterwards. Without this fix we hit the `offset is outside the section` error in MergeInputSegment::getSegmentPiece. This fixes a real world error we were are seeing in emscripten. Differential Revision: https://reviews.llvm.org/D102271	2021-05-11 17:47:57 -07:00
Sam Clegg	b49a798e71	[lld][WebAssembly] Remove relocation target verification We have this extra step in wasm-ld that doesn't exist in other lld backend which verifies the existing contents of the relocation targets. This was originally intended as an extra form of double checking and an aid to compiler developers. However it has always been somewhat controversial and there have been suggestions in the past the we simply remove it. My motivation for removing it now is that its causing me a headache when trying to fix an issue with negative addends. In the case of negative addends that final result can be wrapped/negative but this checking code would require significant modification to be able to deal with that case. For example with some test cases I'm looking at I'm seeing error like this: ``` wasm-ld: warning: /usr/local/google/home/sbc/dev/wasm/llvm-build/tools/lld/test/wasm/Output/merge-string.s.tmp.o:(.rodata_relocs): unexpected existing value for R_WASM_MEMORY_ADDR_I32: existing=FFFFFFFA expected=FFFFFFFFFFFFFFFA ``` Rather than try to refactor `calcExpectedValue` to somehow return two different types of results (32 and 64-bit) depending on the relocation type, I think we can just remove this code. Differential Revision: https://reviews.llvm.org/D102265	2021-05-11 12:05:14 -07:00
Sam Clegg	b2f227c6c8	[lld][WebAssembly] Convert test to assembly. NFC. Differential Revision: https://reviews.llvm.org/D102264	2021-05-11 11:37:53 -07:00
Nico Weber	9ab49ae55d	[lld/mac] Implement -sectalign clang sometimes passes this flag along (see D68351), so we should implement it. Differential Revision: https://reviews.llvm.org/D102247	2021-05-11 13:31:32 -04:00
Martin Storsjö	518b7f9135	[LLD] [COFF] Add an assert regarding the RVA of exported symbols. NFC. As this isn't handled as a regular relocation, the normal handling of maybeReportRelocationToDiscarded in Chunks.cpp doesn't apply here. This would have caught the issue fixed by `82de4e0753`. Differential Revision: https://reviews.llvm.org/D102115	2021-05-11 13:04:01 +03:00
Igor Kudrin	70c23e232e	[LLD] Improve reporting unresolved symbols in shared libraries Currently, when reporting unresolved symbols in shared libraries, if an undefined symbol is firstly seen in a regular object file that shadows the reference for the same symbol in a shared object. As a result, the error for the unresolved symbol in the shared library is not reported. If referencing sections in regular object files are discarded because of '--gc-sections', no reports about such symbols are generated, and the linker finishes successfully, generating an output image that fails on the run. The patch fixes the issue by keeping symbols, which should be checked, for each shared library separately. Differential Revision: https://reviews.llvm.org/D101996	2021-05-11 12:48:29 +07:00
Sam Clegg	3b8d2be527	Reland: "[lld][WebAssembly] Initial support merging string data" This change was originally landed in: `5000a1b4b9` It was reverted in: `061e071d8c` This change adds support for a new WASM_SEG_FLAG_STRINGS flag in the object format which works in a similar fashion to SHF_STRINGS in the ELF world. Unlike the ELF linker this support is currently limited: - No support for SHF_MERGE (non-string merging) - Always do full tail merging ("lo" can be merged with "hello") - Only support single byte strings (p2align 0) Like the ELF linker merging is only performed at `-O1` and above. This fixes part of https://bugs.llvm.org/show_bug.cgi?id=48828, although crucially it doesn't not currently support debug sections because they are not represented by data segments (they are custom sections) Differential Revision: https://reviews.llvm.org/D97657	2021-05-10 16:03:38 -07:00
Nico Weber	061e071d8c	Revert "[lld][WebAssembly] Initial support merging string data" This reverts commit `5000a1b4b9`. Breaks tests, see https://reviews.llvm.org/D97657#2749151 Easily repros locally with `ninja check-llvm-mc-webassembly`.	2021-05-10 18:28:28 -04:00
Sam Clegg	5000a1b4b9	[lld][WebAssembly] Initial support merging string data This change adds support for a new WASM_SEG_FLAG_STRINGS flag in the object format which works in a similar fashion to SHF_STRINGS in the ELF world. Unlike the ELF linker this support is currently limited: - No support for SHF_MERGE (non-string merging) - Always do full tail merging ("lo" can be merged with "hello") - Only support single byte strings (p2align 0) Like the ELF linker merging is only performed at `-O1` and above. This fixes part of https://bugs.llvm.org/show_bug.cgi?id=48828, although crucially it doesn't not currently support debug sections because they are not represented by data segments (they are custom sections) Differential Revision: https://reviews.llvm.org/D97657	2021-05-10 13:15:12 -07:00
Jez Ng	b1c3c2e4fc	[lld-macho] Fix order file arch filtering We had a hardcoded check and a stale TODO, written back when we only had support for one architecture. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D102154	2021-05-10 15:45:54 -04:00
Jez Ng	2516b0b526	[lld-macho] Treat undefined symbols uniformly In particular, we should apply the `-undefined` behavior to all such symbols, include those that are specified via the command line (i.e. `-e`, `-u`, and `-exported_symbol`). ld64 supports this too. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D102143	2021-05-10 15:45:54 -04:00
Jez Ng	3d5e5066f1	[lld-macho][nfc] Clean up tests * Remove unnecessary `rm -rf %t`s * Have lc-linker-option.ll use the right comment marker	2021-05-10 15:45:54 -04:00
Fangrui Song	1f44fee521	[lld-macho] Improve an external weak def test The rebase table entry is untested. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D102150	2021-05-10 10:35:44 -07:00
Sam Clegg	bda8b84884	[lld][WebAssembly] Disallow exporting of TLS symbols Cross module TLS is currently not supported by our ABI. This change makes explicitly exporting a TLS symbol into an error and prevents implicit exporting (via --export-all). See https://github.com/emscripten-core/emscripten/issues/14120 Differential Revision: https://reviews.llvm.org/D102044	2021-05-10 09:58:44 -07:00
Fangrui Song	7a0231ae59	[llvm-objdump][MachO] Print a newline before lazy bind/bind/weak/exports trie This adds a separator between two pieces of information. Reviewed By: #lld-macho, alexshap Differential Revision: https://reviews.llvm.org/D102114	2021-05-10 09:16:18 -07:00
Jez Ng	75f74f2673	[lld-macho] Add llvm-otool as a test dependency This unbreaks my local build, which is configured to build only parts of LLVM.	2021-05-09 21:12:58 -04:00
Nico Weber	7f673fcaa9	[lld/mac] Fix alignment on subsections On a section with alignment of 16, subsections aligned to 16-byte boundaries should keep their 16-byte alignment. Fixes PR50274. (The same bug could have happened with -order_file previously.) Differential Revision: https://reviews.llvm.org/D102139	2021-05-09 21:00:56 -04:00
Jez Ng	0f8854f7f5	[lld-macho] Don't reference entry symbol for non-executables This would cause us to pull in symbols (and code) that should be unused. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D102137	2021-05-09 20:30:26 -04:00
Greg McGary	4b89629403	[lld-macho][NFC] Purge stale test-output trees prior to split-file Enforce standard practice Differential Revision: https://reviews.llvm.org/D102112	2021-05-08 17:36:30 -07:00
Greg McGary	5be8502271	[lld-macho] Explicitly undefine literal exported symbols Symbols explicitly exported via command-line options `--exported_symbol SYM` and `--exported_symbols_list FILE` must be defined. Before this fix, lazy symbols defined in archives would be left to languish. We now force them to be included in the linked output. Differential Revision: https://reviews.llvm.org/D102100	2021-05-08 11:37:00 -07:00
Nico Weber	7b6dd265ce	[lld/mac] Copy some of the commit message of `d5a70db193` into a comment	2021-05-08 13:03:17 -04:00
Arthur Eubanks	34a8a437bf	[NewPM] Hide pass manager debug logging behind -debug-pass-manager-verbose Printing pass manager invocations is fairly verbose and not super useful. This allows us to remove DebugLogging from pass managers and PassBuilder since all logging (aside from analysis managers) goes through instrumentation now. This has the downside of never being able to print the top level pass manager via instrumentation, but that seems like a minor downside. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D101797	2021-05-07 21:51:47 -07:00
Nico Weber	d5a70db193	[lld/mac] Write every weak symbol only once in the output Before this, if an inline function was defined in several input files, lld would write each copy of the inline function the output. With this patch, it only writes one copy. Reduces the size of Chromium Framework from 378MB to 345MB (compared to 290MB linked with ld64, which also does dead-stripping, which we don't do yet), and makes linking it faster: N Min Max Median Avg Stddev x 10 3.9957051 4.3496981 4.1411121 4.156837 0.10092097 + 10 3.908154 4.169318 3.9712729 3.9846753 0.075773012 Difference at 95.0% confidence -0.172162 +/- 0.083847 -4.14165% +/- 2.01709% (Student's t, pooled s = 0.0892373) Implementation-wise, when merging two weak symbols, this sets a "canOmitFromOutput" on the InputSection belonging to the weak symbol not put in the symbol table. We then don't write InputSections that have this set, as long as they are not referenced from other symbols. (This happens e.g. for object files that don't set .subsections_via_symbols or that use .alt_entry.) Some restrictions: - not yet done for bitcode inputs - no "comdat" handling (`kindNoneGroupSubordinate*` in ld64) -- Frame Descriptor Entries (FDEs), Language Specific Data Areas (LSDAs) (that is, catch block unwind information) and Personality Routines associated with weak functions still not stripped. This is wasteful, but harmless. - However, this does strip weaks from __unwind_info (which is needed for correctness and not just for size) - This nopes out on InputSections that are referenced form more than one symbol (eg from .alt_entry) for now Things that work based on symbols Just Work: - map files (change in MapFile.cpp is no-op and not needed; I just found it a bit more explicit) - exports Things that work with inputSections need to explicitly check if an inputSection is written (e.g. unwind info). This patch is useful in itself, but it's also likely also a useful foundation for dead_strip. I used to have a "canoncialRepresentative" pointer on InputSection instead of just the bool, which would be handy for ICF too. But I ended up not needing it for this patch, so I removed that again for now. Differential Revision: https://reviews.llvm.org/D102076	2021-05-07 17:11:40 -04:00
LemonBoy	f876383384	[AsmParser][ARM] Make .thumb_func imply .thumb GNU as documentation states that a `.thumb_func` directive implies `.thumb`, teach the asm parser to switch mode whenever it's encountered. On the other hand the labeled form, exclusive to Apple's toolchain, doesn't switch mode at all. Reviewed By: nickdesaulniers, peter.smith Differential Revision: https://reviews.llvm.org/D101975	2021-05-07 12:13:36 +02:00
Stefan Pintilie	f0adf3a24c	[PowerPC][LLD] Make sure that the correct Thunks are used. This fixes an issue where mixed TOC / NOTOC calls can call the incorrect thunks if a previous thunk already exists. The issue appears when a TOC funciton calls a NOTOC callee and then a different NOTOC function calls the same NOTOC callee. In this case the linker would sometimes incorrectly call the same thunk for both cases. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101837	2021-05-06 12:00:04 -05:00
Jez Ng	9260760235	[lld-macho] Support loading of zippered dylibs ld64 can emit dylibs that support more than one platform (typically macOS and macCatalyst). This diff allows LLD to read in those dylibs. Note that this is a super bare-bones implementation -- in particular, I haven't added support for LLD to emit those multi-platform dylibs, nor have I added a variety of validation checks that ld64 does. Until we have a use-case for emitting zippered dylibs, I think this is good enough. Fixes PR49597. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D101954	2021-05-06 11:19:40 -04:00
Jez Ng	7654d8e1a9	[lld-macho][nfc] Convert the mock libSystem.tbd to TBDv4 It doesn't seem like TBDv3 allows for specifying multiple platforms, so I'm upgrading us to TBDv4. (We need to support multiple platforms in order to test that we can handle zippered dylibs; that functionality will be added in an upcoming diff.) Differential Revision: https://reviews.llvm.org/D101953	2021-05-06 11:19:40 -04:00
Ben Dunbobbin	5dd9f44c17	[LLD] Improve --strip-all help text This is a slight improvement to the help text, as I was slightly surprised when strip-all did more than remove the symbol table. Currently, we match gold's help text for strip-all and strip-debug. I think that the GNU documentation for these options is not particularly clear. However, I have opted to make only a minor change here and keep the help text similar to gold's as these are mature options that are well understood. ld.bfd (https://sourceware.org/binutils/docs/ld/Options.html) has a similar implication although it defines strip-debug as a subset of strip-all. However, felt that noting that strip-all implies strip-debug is better; because, with the ld.bfd approach you have to read both the --strip-debug and the --strip-all help text to understand the behaviour of --strip-all (and the --strip-all help text doesn't indicate that he --strip-debug help text is related). Differential Revision: https://reviews.llvm.org/D101890	2021-05-06 12:34:06 +01:00
Alex Reinking	7ac3fcc526	Allow /STACK in #pragma comment(linker, ...) The Halide project uses `#pragma comment(linker, "/STACK:...")` to set the stack size high enough for our embedded compiler to run in end-user programs on Windows. Unfortunately, lld-link.exe breaks on this when embedded in a COFF object, despite supporting the flag on the command line. MSVC's link.exe supports this fine. This patch extends support for this to lld-link.exe for better compatibility with MSVC projects. Differential Revision: https://reviews.llvm.org/D99680	2021-05-05 16:00:33 -07:00
Vy Nguyen	23233ad139	[lld-macho] Check simulator platforms to avoid issuing false positive errors. Currently the linker causes unnecessary errors when either the target or the config's platform is a simulator. Differential Revision: https://reviews.llvm.org/D101855	2021-05-05 18:07:58 -04:00
Isuru Fernando	662a58fa05	[lld] Convert LLVM_CMAKE_PATH to a CMake path Otherwise I get the following error on windows. ``` CMake Error at D:/bld/lld_1569206597988/work/build/CMakeFiles/CMakeTmp/CMakeLists.txt:2 (set): Syntax error in cmake code at D:/bld/lld_1569206597988/work/build/CMakeFiles/CMakeTmp/CMakeLists.txt:2 when parsing string D:\bld\lld_1569206597988\_h_env\Library\lib\cmake\llvm Invalid character escape '\b'. CMake Error at D:/bld/lld_1569206597988/_build_env/Library/share/cmake-3.15/Modules/CheckSymbolExists.cmake:100 (try_compile): Failed to configure test project build system. Call Stack (most recent call first): D:/bld/lld_1569206597988/_build_env/Library/share/cmake-3.15/Modules/CheckSymbolExists.cmake:57 (__CHECK_SYMBOL_EXISTS_IMPL) D:/bld/lld_1569206597988/_h_env/Library/lib/cmake/llvm/HandleLLVMOptions.cmake:943 (check_symbol_exists) CMakeLists.txt:56 (include) ``` Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D68158	2021-05-05 15:42:55 -05:00
Jez Ng	75ba351300	[lld-macho] Try to unbreak build Looks like the PointerUnion casting cares about const-ness...	2021-05-05 15:47:14 -04:00
Jez Ng	8806df4778	[lld-macho] Preliminary support for ARM_RELOC_BR24 ARM_RELOC_BR24 is used for BL/BLX instructions from within ARM (i.e. not Thumb) code. This diff just handles the basic case: branches from ARM to ARM, or from ARM to Thumb where no shimming is required. (See comments in ARM.cpp for why shims are required.) Note: I will likely be deprioritizing ARM work for the near future to focus on other parts of LLD. Apologies for the half-done state of this; I'm just trying to wrap up what I've already worked on. Reviewed By: #lld-macho, alexshap Differential Revision: https://reviews.llvm.org/D101814	2021-05-05 14:41:01 -04:00
Jez Ng	20f51ffe67	[lld-macho] Have --reproduce account for path rerooting We need to account for path rerooting when generating the response file. We could either reroot the paths before generating the file, or pass through the original filenames and change just the syslibroot. I've opted for the latter, in order that the reproduction run more closely mirrors the original. We must also be careful not to make an absolute path relative if it is shadowed by a rerooted path. See repro6.tar in reroot-path.s for details. I've moved the call to `createResponseFile()` after the initialization of `config->systemLibraryRoots`, since it now needs to know what those roots are. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D101224	2021-05-05 14:41:01 -04:00
Martin Storsjö	82de4e0753	[LLD] [COFF] Actually include the exported comdat symbols This is a followup to 2b01a417d7ccb001ccc1185ef5fdc967c9fac8d7; previously the RVAs of the exported symbols from comdats were left zero. Thanks to Kleis Auke Wolthuizen for the fix suggestion and pointing out the omission. Differential Revision: https://reviews.llvm.org/D101615	2021-05-04 22:13:08 +03:00
Fangrui Song	0c2e2f88fb	[llvm-objdump] Improve newline consistency between different pieces of information When dumping multiple pieces of information (e.g. --all-headers), there is sometimes no separator between two pieces. This patch uses the "\nheader:\n" style, which generally improves compatibility with GNU objdump. Note: objdump -t/-T does not add a newline before "SYMBOL TABLE:" and "DYNAMIC SYMBOL TABLE:". We add a newline to be consistent with other information. `objdump -d` prints two empty lines before the first 'Disassembly of section'. We print just one with this patch. Differential Revision: https://reviews.llvm.org/D101796	2021-05-04 09:56:07 -07:00
Greg McGary	27b426b0c8	[lld-macho] Implement builtin section renaming ld64 automatically renames many sections depending on output type and assorted flags. Here, we implement the most common configs. We can add more obscure flags and behaviors as needed. Depends on D101393 Differential Revision: https://reviews.llvm.org/D101395	2021-05-03 21:26:51 -07:00
Sam Clegg	808fcddae4	[lld][WebAssembly] Fix crash with `-pie` without `--allow-undefined` `shouldImport` was not returning true in PIC mode even though out assumption elsewhere (in Relocations.cpp:scanRelocations) is that we don't report undefined symbols in PIC mode today. This was resulting functions that were undefined and but also not imported which hits an assert later on that all functions have valid indexes. Differential Revision: https://reviews.llvm.org/D101716	2021-05-03 18:04:55 -07:00
Sam Clegg	4ef1f90e4d	[lld][WebAssembly] Convert more tests to asm format. NFC Two of these are trivial. The third (shared.s) did have some expectations changes but only due to two data symbols being re-ordered. Differential Revision: https://reviews.llvm.org/D101711	2021-05-03 17:16:31 -07:00
Sam Clegg	73332d73e1	[lld][WebAssembly] Do not merge comdat data segments When running in relocatable mode any input data segments that are part of a comdat group should not be merged with other segments of the same name. This is because the final linker needs to keep the separate so they can be included/excluded individually. Often this is not a problem since normally only one section with a given name `foo` ends up in the output object file. However, the problem occurs when one input contains `foo` which part of a comdat and another object contains a local symbol `foo` we were attempting to merge them. This behaviour matches (I believe) that of the ELF linker. See `LinkerScript.cpp:addInputSec`. Fixes: https://github.com/emscripten-core/emscripten/issues/9726 Differential Revision: https://reviews.llvm.org/D101703	2021-05-03 16:43:29 -07:00
Jez Ng	183b0dad4e	[lld-macho] Add ARM requirement to objc.s	2021-05-03 18:47:30 -04:00
Jez Ng	001ba65375	[lld-macho] De-templatize mach_header operations @thakis pointed out that `mach_header` and `mach_header_64` actually have the same set of (used) fields, with the 64-bit version having extra padding. So we can access the fields we need using the single `mach_header` type instead of using templates to switch between the two. I also spotted a potential issue where hasObjCSection tries to parse a file w/o checking if it does indeed match the target arch... As such, I've added a quick magic number check to ensure we don't access invalid memory during `findCommand()`. Addresses PR50180. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D101724	2021-05-03 18:31:23 -04:00
Fangrui Song	f9c8ebdc30	[ELF] Don't suggest alternative spelling of an empty name Fix PR50111 Differential Revision: https://reviews.llvm.org/D101698	2021-05-03 09:04:55 -07:00
Fangrui Song	818b508953	[ELF] Simplify the condition adding .got header Adopt my suggestion in https://reviews.llvm.org/D91426#2653926 , generalizing the ppc64 specific code. GNU ld and glibc ld.so has a contract about the first few entries of .got . There are somewhat complex conditions when the header is needed. This patch switches to a simpler approach: add a header unconditionally if _GLOBAL_OFFSET_TABLE_ is used or the number of entries is more than just the header.	2021-04-30 17:19:45 -07:00
Jez Ng	c00fc180ec	[llvm-readobj] Recognize N_THUMB_DEF as a symbol flag The right symbol flag mask is ~0x7, not ~0xf. Also emit string names for the other flags (we were missing some). Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D101548	2021-04-30 17:39:56 -04:00
Jez Ng	05c5363b39	[lld-macho] Parse & emit the N_ARM_THUMB_DEF symbol flag Eventually we'll use this flag to properly handle bl/blx opcodes. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D101558	2021-04-30 16:17:26 -04:00
Jez Ng	2d28100bf2	[lld-macho] Initial scaffolding for ARM32 support This just parses the `-arch armv7` and emits the right header flags. The rest will be slowly fleshed out in upcoming diffs. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D101557	2021-04-30 16:17:25 -04:00
Duncan P. N. Exon Smith	518d955f9d	Support: Stop using F_{None,Text,Append} compatibility synonyms, NFC Stop using the compatibility spellings of `OF_{None,Text,Append}` left behind by `1f67a3cba9`. A follow-up will remove them. Differential Revision: https://reviews.llvm.org/D101650	2021-04-30 11:00:03 -07:00
Nico Weber	a1a2a8e8ac	[lld/mac] Remove unused -L%t flags from tests No behavior change. Differential Revision: https://reviews.llvm.org/D101623	2021-04-30 09:37:02 -04:00
Nico Weber	4b456038e4	[lld/mac] Tweak two comments and fix style on one variable name Cosmetic, no behavior change.	2021-04-30 09:30:51 -04:00
Hans Wennborg	cbe62f2f2f	Require shell for lld/test/MachO/reproduce.s as a way of not running it on Windows, where the file paths when extracting repro2.tar can become longer than the maximum file length limit (depending on the build dir name) and cause the test to fail. (See https://crbug.com/1204463 for example test failure.)	2021-04-30 14:23:35 +02:00
Zequan Wu	6b30240288	Reland "[lld-link] Enable addrsig table in COFF lto" This reverts commit `a78fa73bcf`. The commit `cab48e2f0e` fixes the issue on `eabd55b1b2`.	2021-04-29 15:54:12 -07:00
Martin Storsjö	2b01a417d7	[LLD] [COFF] Fix the mingw --export-all-symbols behaviour with comdat symbols When looking for the "all" symbols that are supposed to be exported, we can't look at the live flag - the symbols we mark as to be exported will become GC roots even if they aren't yet marked as live. With this in place, building an LLVM library with BUILD_SHARED_LIBS produces the same set of symbols exported regardless of whether the --gc-sections flag is specified, both with and without being built with -ffunction-sections. Differential Revision: https://reviews.llvm.org/D101522	2021-04-29 23:35:10 +03:00
Jez Ng	07884152ec	[lld-macho] Remove stray file	2021-04-29 15:33:23 -04:00
Jez Ng	d9c8ffa958	[lld-macho][nfc] Clean up header.s test I don't think it's super worthwhile to test the dylib headers outputs of all the different archs when x86_64 is the only one that has interesting behavior. Motivated by my upcoming addition of arm32...	2021-04-29 15:11:23 -04:00
Jez Ng	7e115da5df	[lld-macho] Make everything PIE by default Modern versions of macOS (>= 10.7) and in general all modern Mach-O target archs want PIEs by default. ld64 defaults to PIE for iOS >= 4.3, as well as for all versions of watchOS and simulators. Basically all the platforms LLD is likely to target want PIE. So instead of cluttering LLD's code with legacy version checks, I think it's simpler to just default to PIE for everything. Note that `-no_pie` still works, so users can still opt out of it. Reviewed By: #lld-macho, thakis, MaskRay Differential Revision: https://reviews.llvm.org/D101513	2021-04-29 15:11:23 -04:00
Sam Clegg	a6f406480a	[lld][WebAssembly] Add `--export-if-defined` Unlike the existing `--export` option this will not causes errors or warnings if the specified symbol is not defined. See: https://github.com/emscripten-core/emscripten/issues/13736 Differential Revision: https://reviews.llvm.org/D99887	2021-04-29 10:58:45 -07:00
Fangrui Song	c9b1bd1012	[ELF] Support .rela.eh_frame with unordered r_offset values GNU ld -r can create .rela.eh_frame with unordered r_offset values. (With LLD, we can craft such a case by reordering sections in .eh_frame.) This is currently unsupported and will trigger `assert(pieces[i].inputOff <= off ...` in `OffsetGetter::get` (the content is corrupted in a -DLLVM_ENABLE_ASSERTIONS=off build). This patch supports this case. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D101116	2021-04-29 08:51:09 -07:00
Sam Clegg	8a4ee3b39c	Fix typo from https://reviews.llvm.org/D101399	2021-04-28 11:31:16 -07:00
Sam Clegg	3e7bc0da57	[lld][WebAssembly] Allow relocations against non-live global symbols Just like the in case for function and data symbols this is needed to support relocations in debug info sections which are allowed contains relocations against non-live symbols. The motivating use case is an object file that contains debug info that references `__stack_pointer` (a local symbol) but does not actually contain any uses of `__stack_pointer`. Fixes: https://github.com/emscripten-core/emscripten/issues/14025 Differential Revision: https://reviews.llvm.org/D101399	2021-04-28 10:29:41 -07:00
Alex Richardson	aed66d2787	[ELF] Update URL for MIPS TLS wiki page The original page no longer works, so use a web.archive.org link instead. Reviewed By: atanasyan Differential Revision: https://reviews.llvm.org/D100949	2021-04-28 12:19:19 +01:00
Greg McGary	465204d63a	[lld-macho][NFC] define more strings in section_names:: and segment_names:: As preparation for a subsequent diff that implements builtin section renaming, define more `constexpr` strings in namespaces `lld::macho::segment_names` and `lld::macho::section_names`, and use them to replace string literals. Differential Revision: https://reviews.llvm.org/D101393	2021-04-27 17:48:45 -07:00
Jez Ng	700402b00e	[lld-macho] Don't put an antivirus test file in reproduce.s It appears that some antivirii do not recognize that "this is a test": https://reviews.llvm.org/D101218#2720676 Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D101402	2021-04-27 18:02:59 -04:00
Jez Ng	7ca133c360	[lld-macho] std::sort -> llvm::sort	2021-04-27 18:02:59 -04:00
Jessica Clarke	7fefd032cb	[ELF][MIPS] Emit dynamic relocations for PIC non-preemptible static TLS This is the same problem as `127176e59e`, but for static TLS rather than dynamic TLS. Although we know the symbol will be the one in our own TLS segment, and thus the offset of it within that, we don't know where in the static TLS block our data will be allocated and thus we must emit a dynamic relocation for this case. Reviewed By: MaskRay, atanasyan Differential Revision: https://reviews.llvm.org/D101381	2021-04-27 19:04:50 +01:00
Jessica Clarke	1d505016ef	[ELF][MIPS] Don't emit dynamic relocations for PIE non-preemptible TLS Whilst not wrong (unless using static PIE where the relocations are likely not implemented by the runtime), this is inefficient, as the TLS module indices and offsets are independent of the executable's load address. Reviewed By: MaskRay, atanasyan Differential Revision: https://reviews.llvm.org/D101382	2021-04-27 19:04:50 +01:00
Vitaly Buka	f2a585e6d3	[NFC] Fix "not used" warning	2021-04-26 22:09:23 -07:00
Greg McGary	c2419aae76	[lld-macho] Add option --error-limit=N Add option to limit (or remove limits) on the number of errors printed before exiting. This option exists in the other lld ports: COFF & ELF. Differential Revision: https://reviews.llvm.org/D101274	2021-04-26 07:10:12 -07:00
Nico Weber	de266ce4f9	[lld/mac] Don't assert when using -exported_symbol with private symbol When I added this assert in D93609, it asserted that a symbol that is privateExtern is also isExternal(). In D98381 the privateExtern check moved into shouldExportSymbol() but the assert didn't -- now it checked that _every_ non-exported symbol is isExternal(), which isn't true. Move the assert into the privateExtern check where it used to be. Fixes PR50098. Differential Revision: https://reviews.llvm.org/D101223	2021-04-24 10:21:51 -04:00
Nico Weber	4ca0fbfabd	[lld/mac] simplify export-options.s test a bit - the macro seems needlessly clever -- shorter and imho clearer without it - give all filenames an extension so they look like filenames - rename .private_extern symbol from _private to _private_extern to prepare for follow-up that adds a truly private symbol No behavior change. Differential Revision: https://reviews.llvm.org/D101222	2021-04-24 08:03:55 -04:00
Nico Weber	4e2d5fcf71	[lld/mac] add test coverage for -sectcreate and -order_file with --reproduce Would've caught the (since fixed) regression in D97610. No behavior change. Differential Revision: https://reviews.llvm.org/D101218	2021-04-24 08:00:49 -04:00
Fangrui Song	9aad886e28	[ELF] Simplify a condition in addGotEntry. NFC	2021-04-23 22:11:14 -07:00
Jez Ng	3fe5c3b018	[lld-macho] Fix use-after-free in loadDylib() We were taking a reference to a value in `loadedDylibs`, which in turn called `make<DylibFile>()`, which could then recursively call `loadDylibs`, which would then potentially resize `loadedDylibs` and invalidate that reference. Fixes PR50101. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D101175	2021-04-23 18:05:49 -04:00
Jez Ng	035eb6d154	[lld-macho]][nfc] Fix some typos + rephrase a comment I was a bit confused by the comment because I thought that "Tests that..." was describing the tests contained within the same file. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D101160	2021-04-23 18:05:48 -04:00
Nico Weber	a61891d491	[lld/mac] Support more flags for --reproduce I went through the callers of `readFile()` and `addFile()` in Driver.cpp and checked that the options that use them all get rewritten in the --reproduce response file. -(un)exported_symbols_list and -bundle_loader weren't, so add them. Also spruce up the test for reproduce a bit and actually try linking with the exptracted repro archive. Motivated by the response file in PR50098 complaining abou the -exported_symbols_list path being wrong :) Differential Revision: https://reviews.llvm.org/D101182	2021-04-23 14:40:24 -04:00
Nico Weber	fcf59cc917	fix comment typo to cycle bots	2021-04-23 11:45:49 -04:00
Jez Ng	fd28f71872	[lld-macho] Have tests default to targeting macos 10.15 D101114 enforced proper version checks, which exposed a variety of version mismatch issues in our tests. We previously changed the test inputs to target 10.0, which was the simpler thing to do, but we should really just have our lit.local.cfg default to targeting 10.15, which is what is done here. We're not likely to ever have proper support for the older versions anyway, as that would require more work for unclear benefit; for instance, llvm-mc seems to generate a different compact unwind format for older macOS versions, which would cause our compact-unwind.s test to fail. Targeting 10.15 by default causes the following behavioral changes: * `__mh_execute_header` is now a section symbol instead of an absolute symbol * LC_BUILD_VERSION gets emitted instead of LC_VERSION_MIN_MACOSX. The former is 32 bytes in size whereas the latter is 16 bytes, so a bunch of hardcoded address offsets in our tests had to be updated. * >= 10.6 executables are PIE by default Note that this diff was stacked atop of a local revert of most of the test changes in rG8c17a875150f8e736e8f9061ddf084397f45f4c5, to make review easier. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D101119	2021-04-23 09:25:08 -04:00
Nico Weber	a38ebed258	[lld/mac] Implement support for .weak_def_can_be_hidden I first had a more invasive patch for this (D101069), but while trying to get that polished for review I realized that lld's current symbol merging semantics mean that only a very small code change is needed. So this goes with the smaller patch for now. This has no effect on projects that build with -fvisibility=hidden (e.g. chromium), since these see .private_extern symbols instead. It does have an effect on projects that build with -fvisibility-inlines-hidden (e.g. llvm) in -O2 builds, where LLVM's GlobalOpt pass will promote most inline functions from .weak_definition to .weak_def_can_be_hidden. Before this patch: % ls -l out/gn/bin/clang out/gn/lib/libclang.dylib -rwxr-xr-x 1 thakis staff 113059936 Apr 22 11:51 out/gn/bin/clang -rwxr-xr-x 1 thakis staff 86370064 Apr 22 11:51 out/gn/lib/libclang.dylib % out/gn/bin/llvm-objdump --macho --weak-bind out/gn/bin/clang \| wc -l 8291 % out/gn/bin/llvm-objdump --macho --weak-bind out/gn/lib/libclang.dylib \| wc -l 5698 With this patch: % ls -l out/gn/bin/clang out/gn/lib/libclang.dylib -rwxr-xr-x 1 thakis staff 111721096 Apr 22 11:55 out/gn/bin/clang -rwxr-xr-x 1 thakis staff 85291208 Apr 22 11:55 out/gn/lib/libclang.dylib thakis@MBP llvm-project % out/gn/bin/llvm-objdump --macho --weak-bind out/gn/bin/clang \| wc -l 725 thakis@MBP llvm-project % out/gn/bin/llvm-objdump --macho --weak-bind out/gn/lib/libclang.dylib \| wc -l 542 Linking clang becomes a tiny bit faster with this patch: x 100 0.67263818 0.77847815 0.69430709 0.69877208 0.017715892 + 100 0.67209601 0.73323393 0.68600798 0.68917346 0.012824377 Difference at 95.0% confidence -0.00959861 +/- 0.00428661 -1.37364% +/- 0.613449% (Student's t, pooled s = 0.0154648) This only happens if lld with the patch and lld without the patch are both linked with an lld with the patch or both linked with an lld without the patch (...or with ld64). I accidentally linked the lld with the patch with an lld without the patch and the other way round at first. In that setup, no difference is found. That makese sense, since having fewer weak imports will make the linked output a bit faster too. So not only does this make linking binaries such as clang a bit faster (since fewer exports need to be written to the export trie by lld), the linked output binary binary is also a bit faster (since dyld needs to process fewer dynamic imports). This also happens to fix the one `check-clang` failure when using lld as host linker, but mostly for silly reasons: See crbug.com/1183336, mostly comment 26. The real bug here is that c-index-test links all of LLVM both statically and dynamically, which is an ODR violation. Things just happen to work with this patch. So after this patch, check-clang, check-lld, check-llvm all pass with lld as host linker :) Differential Revision: https://reviews.llvm.org/D101080	2021-04-22 22:51:34 -04:00
Nico Weber	88b76cb130	[lld/mac] slightly improve weak-private-extern.s test - __got is in --bind output, so print that too (makes the test a bit stronger) - WEAK_DEFINES, BINDS_TO_WEAK are in the mach-o header, so --private-header is enough, no need for --all-headers (makes the test a bit easier to work with when it fails) Differential Revision: https://reviews.llvm.org/D101065	2021-04-22 22:42:48 -04:00
Jez Ng	8c17a87515	[re-land][lld-macho] Fix min version check We had got it backwards... the minimum version of the target should be higher than the min version of the object files, presumably since new platforms are backwards-compatible with older formats. Fixes PR50078. The original commit (`aa05439c9c`) broke many tests that had inputs too new for our target platform (10.0). This commit changes the inputs to target 10.0, which was the simpler thing to do, but we should really just have our lit.local.cfg default to targeting 10.15... we're not likely to ever have proper support for the older versions anyway. I will follow up with a change to that effect. Differential Revision: https://reviews.llvm.org/D101114	2021-04-22 19:35:32 -04:00
Jez Ng	75ecb804b1	Revert "[lld-macho] Fix min version check" This reverts commit `aa05439c9c`.	2021-04-22 19:07:41 -04:00
Jez Ng	aa05439c9c	[lld-macho] Fix min version check We had got it backwards... the minimum version of the target should be higher than the min version of the object files, presumably since new platforms are backwards-compatible with older formats. Fixes PR50078. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D101114	2021-04-22 18:25:44 -04:00
Jez Ng	2618eaf614	[lld-macho][nfc] Clean up some constructor declarations Remove unnecessary default ctor + add `explicit` to others	2021-04-22 18:25:44 -04:00
Nico Weber	71281462c8	[lld/mac] tweak comment in a test	2021-04-22 11:14:58 -04:00
Nico Weber	c1b2a7bfbf	[lld/mac] make a few "named parameter comments" more consistent Most of LLVM and almost all of lld/MachO uses `/foo=/bar` style. No behavior change.	2021-04-22 10:48:03 -04:00
Nico Weber	487885129c	[lld/mac] add a comment pointing to a test that took me a while to find	2021-04-22 09:09:55 -04:00
Jez Ng	ed4a4e3312	[lld-macho][nfc] Add accessors for commonly-used PlatformInfo fields As discussed here: https://reviews.llvm.org/D100523#inline-951543 Reviewed By: #lld-macho, thakis, alexshap Differential Revision: https://reviews.llvm.org/D100978	2021-04-21 15:43:56 -04:00
Nico Weber	b309f17abf	[lld/mac] add aarch64 to requirements of encryption-info.s test	2021-04-21 14:21:42 -04:00
Jez Ng	ab9c21bbab	[lld-macho] Support LC_ENCRYPTION_INFO This load command records a range spanning from the end of the load commands to the end of the `__TEXT` segment. Presumably the kernel will encrypt all this data. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D100973	2021-04-21 13:39:56 -04:00
Alexander Shaposhnikov	b5720354ef	[lld][MachO] Refactor findCommand Refactor findCommand to allow passing multiple types. NFC. Test plan: make check-lld-macho Differential revision: https://reviews.llvm.org/D100954	2021-04-21 08:38:17 -07:00
Alexander Shaposhnikov	5c835e1ae5	[lld][MachO] Add support for LC_VERSION_MIN_* load commands This diff adds initial support for the legacy LC_VERSION_MIN_* load commands. Test plan: make check-lld-macho Differential revision: https://reviews.llvm.org/D100523	2021-04-21 05:41:14 -07:00
Yang Fan	c09277b0d8	[lld][ELF] Fix "enumeral and non-enumeral type in conditional expression" warning (NFC) GCC warning: ``` /llvm-project/lld/ELF/SyntheticSections.cpp: In member function ‘virtual void lld:🧝:VersionTableSection::writeTo(uint8_t*)’: /llvm-project/lld/ELF/SyntheticSections.cpp:3128:34: warning: enumeral and non-enumeral type in conditional expression [-Wextra] 3128 \| write16(buf, s.sym->isLazy() ? VER_NDX_GLOBAL : s.sym->versionId); \| ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ```	2021-04-21 16:01:46 +08:00
Jez Ng	7208bd4b32	[lld-macho] Skip platform checks for a few libSystem re-exports XCode 12 ships with mismatched platforms for these libraries, so this hack is necessary... Fixes PR49799. Reviewed By: #lld-macho, gkm, smeenai Differential Revision: https://reviews.llvm.org/D100913	2021-04-20 19:54:53 -04:00
Zequan Wu	aa80955f63	[lld-link] Warn on exported deleting dtor MSVC linker has this [[ https://docs.microsoft.com/en-us/cpp/error-messages/tool-errors/linker-tools-warning-lnk4102?view=msvc-160 \| warning]], so lld-link should also warn on this. Differential Revision: https://reviews.llvm.org/D100606	2021-04-20 14:06:31 -07:00
Jez Ng	bb62ef9943	[lld-macho] Ensure segments are laid out contiguously codesign/libstuff checks that the `__LLVM` segment is directly before `__LINKEDIT` by checking that `fileOff + fileSize == next segment fileOff`. Previously, there would be gaps between the segments due to the fact that their fileOffs are page-aligned but their fileSizes aren't. In order to satisfy codesign, we page-align fileOff before calculating fileSize. (I don't think codesign checks for the relative ordering of other segments, so in theory we could do this just for `__LLVM`, but ld64 seems to do it for all segments.) Note that we don't round up the fileSize of the `__LINKEDIT` segment. Since it's the last segment, it doesn't need to worry about contiguity; in addition, codesign checks that the last (hidden) section in `__LINKEDIT` covers the last byte of the segment, so if we rounded up `__LINKEDIT`'s size we would have to do the same for its last section, which is a bother. While at it, I also addressed a FIXME in the linkedit-contiguity.s test to cover more `__LINKEDIT` sections. Reviewed By: #lld-macho, thakis, alexshap Differential Revision: https://reviews.llvm.org/D100848	2021-04-20 16:58:57 -04:00
Jez Ng	1aa29dffce	[lld-macho] Support subtractor relocations that reference sections The minuend (but not the subtrahend) can reference a section. Note that we do not yet properly validate that the subtrahend isn't referencing a section; I've filed PR50034 to track that. I've also extended the reloc-subtractor.s test to reorder symbols, to make sure that the addends are being associated with the minuend (and not the subtrahend) relocation. Fixes PR49999. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D100804	2021-04-20 16:58:57 -04:00
Fangrui Song	1c00530b30	[ELF] Don't set versionId on undefined weak lazy symbols An unfetched lazy symbol (undefined weak) should be considered to have its original versionId which is VER_NDX_GLOBAL, instead of the lazy symbol's versionId. (The original versionId cannot be non-VER_NDX_GLOBAL because a undefined versioned symbol is an error.) The regression was introduced in D77280 when making version scripts work with lazy symbols fetched by LTO calls. Fix PR49915 Differential Revision: https://reviews.llvm.org/D100624	2021-04-20 11:23:10 -07:00
Fangrui Song	03769d9308	[lld] Delete unused includes. NFC	2021-04-19 10:56:49 -07:00
Nico Weber	0871ce3547	fix comment typo to cycle bots	2021-04-19 12:59:20 -04:00
Fangrui Song	7c74ce3c68	[ELF] --wrap: don't clear sym->isUsedInRegularObj if real->isUsedInRegularObj; set wrap's initial binding to sym's Fix PR49897: if `__real_foo` has the isUsedInRegularObj bit set, we need to retain `foo` in .symtab, even if `foo` is undefined. The new behavior will match GNU ld. Before the patch, we produced an R_X86_64_JUMP_SLOT relocation referencing the index 0 undefined symbol, which would be erroed by glibc (see `f96ff3c0f8`). While here, fix another bug: if `__wrap_foo` does not exist, its initial binding should be `foo`'s.	2021-04-17 00:29:51 -07:00
Fangrui Song	b2a3d31eed	[ELF] Simplify R_386_TLS_GD computation. NFC	2021-04-16 19:08:23 -07:00
Jez Ng	3e1045ec04	[lld] Canonicalize HAVE_LIBXAR I think this should unbreak the build after D100650...	2021-04-16 17:21:06 -04:00
Jez Ng	bb0e1ae7c4	[lld-macho] Add separator to error message	2021-04-16 17:01:14 -04:00
Jez Ng	ca6751043d	[lld-macho] Initial groundwork for -bitcode_bundle This diff creates an empty XAR file and copies it into `__LLVM,__bundle`. Follow-up work will actually populate the contents of that XAR. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D100650	2021-04-16 16:47:14 -04:00
Fangrui Song	6d2d3bd0a6	[ELF] Default to -z start-stop-gc with a glibc "__libc_" special case Change the default to facilitate GC for metadata section usage, so that they don't need SHF_LINK_ORDER or SHF_GROUP just to drop the unhelpful rule (if they want to be unconditionally retained, use SHF_GNU_RETAIN (`__attribute__((retain))`) or linker script `KEEP`). The dropped SHF_GROUP special case makes the behavior of -z start-stop-gc and -z nostart-stop-gc closer to GNU ld>=2.37 (https://sourceware.org/PR27451). However, we default to -z start-stop-gc (which actually matches more closely to GNU ld before 2015-10 https://sourceware.org/PR19167), which is different from modern GNU ld (which has the unhelpful rule to work around glibc). As a compensation, we special case `__libc_` sections as a workaround for glibc<2.34 (https://sourceware.org/PR27492). Since -z start-stop-gc as the default actually matches the traditional GNU ld behavior, there isn't much to be aware of. There was a systemd usage which has been fixed by https://github.com/systemd/systemd/pull/19144	2021-04-16 12:18:46 -07:00
LemonBoy	7c6f177477	[lld] Fix test crashing when AVR target is missing Fixes buildbot error.	2021-04-16 11:12:29 +02:00
LemonBoy	7a781fb692	[LLD][ELF][AVR] Propagate ELF flags to the linked image The `e_flags` for a ELF file targeting the AVR ISA contains two fields at the time of writing: - A 7-bit integer field specifying the ISA revision being targeted - A 1-bit flag specifying whether the object files being linked are suited for applying the relaxations at link time The linked ELF file is blessed with the arch revision shared among all the files. The behaviour in case of mismatch is purposefully different than the one implemented in libbfd: LLD will raise a fatal error while libbfd silently picks a default value of `avr2`. The relaxation-ready flag is handled as done by libbfd, in order for it to appear in the linked object every source object must be tagged with it. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D99754	2021-04-16 10:40:18 +02:00
Jez Ng	4938b090cf	[lld-macho] Don't use arrays as template parameters MSVC from VSCode 2017 appears unhappy with it (causes an internal compiler error.) This also means that we need to avoid doing `sizeof(stubCode)` as `sizeof(int[N])` on function array parameters decays into `sizeof(int *)`. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D100605	2021-04-15 21:16:34 -04:00
Jez Ng	1acda12d00	[lld-macho] Make load relaxation work for arm64_32 arm64_32 uses 32-bit GOT loads, so we should accept those instructions in `ARM64Common::relaxGotLoad()` too. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D100229	2021-04-15 21:16:34 -04:00
Jez Ng	1460942c15	[lld-macho] Add 32-bit compact unwind support This could probably have been part of D99633, but I split it up to make things a bit more reviewable. I also fixed some bugs in the implementation that were masked through integer underflows when operating in 64-bit mode. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D99823	2021-04-15 21:16:33 -04:00
Jez Ng	3bc88eb392	[lld-macho] Add support for arm64_32 From what I can tell, it's pretty similar to arm64. The two main differences are: 1. No 64-bit relocations 2. Stub code writes to 32-bit registers instead of 64-bit Plus of course the various on-disk structures like `segment_command` are using the 32-bit instead of the 64-bit variants. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D99822	2021-04-15 21:16:33 -04:00
Jez Ng	db7a413e51	[lld-macho] Re-root absolute input file paths if -syslibroot is specified Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D100147	2021-04-15 21:16:33 -04:00
Jez Ng	eb5b7d4497	[lld-macho] LTO: Unset VisibleToRegularObj where possible This allows LLVM's LTO to internalize symbols that are not referenced directly by regular objects. Naturally, this means we need to track which symbols are referenced by regular objects. The approach taken here is similar to LLD-COFF's: like the COFF port, we extend `SymbolTable::insert()` to set the isVisibleToRegularObj bit. (LLD-ELF relies on the Symbol constructor and `Symbol::mergeProperties()`, but the Mach-O port does not have a `mergeProperties()` equivalent.) From what I can tell, ld64 (which uses libLTO) doesn't do this optimization at all. I'm not even sure libLTO provides a way to do this. Not having ld64's behavior as a reference implementation is unfortunate; instead, I am relying on LLD-ELF/COFF's behavior as references while erring on the conservative side. In particular, LLD-MachO will only do this optimization for executables right now. We also don't attempt it when `-flat_namespace` is used -- otherwise we'd need scan the symbol table to find matches for every un-namespaced symbol reference, which is expensive. internalize.ll is based off the LLD-ELF tests `internalize-basic.ll` and `internalize-undef.ll`. Looks like @davide added some of LLD-ELF's internalize tests, so adding him as a reviewer... Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D99105	2021-04-15 21:16:33 -04:00
Alex Orlov	49cbf4cd85	Fix bug in .eh_frame/.debug_frame PC offset calculation for DW_EH_PE_pcrel This fixes the following bugs: https://bugs.llvm.org/show_bug.cgi?id=27249 https://bugs.llvm.org/show_bug.cgi?id=46414 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D100328	2021-04-15 15:06:20 +04:00
Nico Weber	da0ef5ad5b	fix typo to cycle bots	2021-04-14 14:59:18 -04:00
Nico Weber	1e89f08f59	fix typo to cycle bots	2021-04-14 14:52:53 -04:00
Reid Kleckner	18a9b18087	[COFF] Simplify ICF associated comdat handling This is a different approach from D98993 that should achieve most of the same benefit. The two changes are: 1. Sort the list of associated child sections by section name 2. Do not consider associated sections to have children themselves This fixes the main issue, which was that we sometimes considered an .xdata section to have a child .pdata section. That lead to slow links and larger binaries (less xdata folding). Otherwise, this should be NFC: we go back to ignoring .debug/.gljmp and other metadata sections rather than only looking at pdata/xdata. We discovered that we do care about other associated sections, like ASan global registration metadata.	2021-04-14 10:40:16 -07:00
Bogdan Graur	7975dd033c	[NFC] Fix unused variable warning. Differential Revision: https://reviews.llvm.org/D100451	2021-04-14 09:36:12 +02:00
Pengfei Wang	184377da5c	[LLD] Implement /guard:[no]ehcont Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D99078	2021-04-14 15:06:49 +08:00
Stella Stamenova	bcef28621a	Fix resolution-err.ll chmod tries to be very helpful on some platforms and prevent naive mistakes, by warning the user. This results in the following error during the test: ```chmod: ...resolution-err.ll.tmp.resolution.txt: new permissions are r--rw-rw-, not r--r--r--``` To fix the test, call chmod with u. Differential Revision: https://reviews.llvm.org/D100417	2021-04-13 16:11:58 -07:00
Jez Ng	84cf9a7a4a	[lld-macho] rm old test directory for segments.s This should unbreak incremental builds after `8ca366935b`	2021-04-13 14:46:20 -04:00
Jez Ng	8ca366935b	Revert "[lld-macho] Add support for arm64_32" and other stacked diffs This reverts commits: * `8914902b01` * `35a745d814` * `682d1dfe09`	2021-04-13 12:40:58 -04:00
Jez Ng	84c52f3a19	[lld-macho] arm64_32 executables are always PIE This should fix the assert that's currently breaking the build.	2021-04-13 11:55:45 -04:00
Jez Ng	682d1dfe09	[lld-macho] Make load relaxation work for arm64_32 arm64_32 uses 32-bit GOT loads, so we should accept those instructions in `ARM64Common::relaxGotLoad()` too. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D100229	2021-04-13 10:43:28 -04:00
Jez Ng	3142fc3b5b	[lld-macho] Have toString() emit full path to archive files It doesn't make sense to take just the base filename for archives when we emit the full path for object files. (LLD-ELF emits the full path too.) This will also make it easier to write a proper test for {D100147}. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D100357	2021-04-13 10:43:28 -04:00
Jez Ng	35a745d814	[lld-macho] Add 32-bit compact unwind support This could probably have been part of D99633, but I split it up to make things a bit more reviewable. I also fixed some bugs in the implementation that were masked through integer underflows when operating in 64-bit mode. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D99823	2021-04-13 10:43:28 -04:00
Jez Ng	8914902b01	[lld-macho] Add support for arm64_32 From what I can tell, it's pretty similar to arm64. The two main differences are: 1. No 64-bit relocations 2. Stub code writes to 32-bit registers instead of 64-bit Plus of course the various on-disk structures like `segment_command` are using the 32-bit instead of the 64-bit variants. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D99822	2021-04-13 10:43:28 -04:00
Jez Ng	74283fc853	[lld-macho][nfc] Convert tabs to spaces	2021-04-11 23:25:23 -04:00
Jez Ng	88cb786ec2	[lld-macho][nfc] Remove DYSYM8 reloc attribute It's likely redundant, per discussion with @gkm. The BYTE8 attribute covers the bit width requirement already. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D100133	2021-04-09 19:48:08 -04:00
Alex Orlov	f47a4c0713	[lld] Fixed CodeView GuidAdapter::format to handle GUID bytes in the right order. This fixes https://bugs.llvm.org/show_bug.cgi?id=41712 bug. Reviewed By: aganea Differential Revision: https://reviews.llvm.org/D99978	2021-04-09 05:29:14 +04:00
Jez Ng	c2e76a9a6d	[lld-macho][nfc] Use varargs form of hasArg()	2021-04-08 14:12:55 -04:00
Jez Ng	c23b92acd0	[lld-macho] Support -add_ast_path Swift builds seem to use it. All it requires is emitting the corresponding paths as STABS. Fixes llvm.org/PR49385. Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D100076	2021-04-08 14:12:55 -04:00
Jez Ng	3f6753efe1	[lld-macho][nfc] Extend abs-symbol.s to test for local absolute symbols Addresses an old TODO. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D100082	2021-04-08 12:21:01 -04:00
Jez Ng	050a7a27ca	[lld-macho] Support --thinlto-jobs The test is loosely based off LLD-ELF's `thinlto.ll`. However, I found that test questionable because the the -save_temps behavior it checks for is identical regardless of whether we are running in single- or multi-threaded mode. I tried writing a test based on `--time-trace` but couldn't get it to run deterministically... so I've opted to just skip checking that behavior for now. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D99356	2021-04-08 12:21:01 -04:00
Jez Ng	d9065fe8ea	[lld-macho] Parallelize __LINKEDIT generation Benchmarking chromium_framework on a 3.2 GHz 16-Core Intel Xeon W Mac Pro: N Min Max Median Avg Stddev x 20 4.33 4.42 4.37 4.37 0.021026299 + 20 4.12 4.23 4.18 4.175 0.035318103 Difference at 95.0% confidence -0.195 +/- 0.0186025 -4.46224% +/- 0.425686% (Student's t, pooled s = 0.0290644) Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D99998	2021-04-07 19:55:52 -04:00
Vy Nguyen	db851dfb49	[lld-macho] Make time-trace* options more permissive. If either `time-trace-granularity` or `time-trace-file` is specified, then don't make users specify `-time-trace`. It seems silly that I have to type all three options, eg, `-time-trace -time-trace-file=- -time-trace-granularity=...`. Differential Revision: https://reviews.llvm.org/D100011	2021-04-07 16:00:20 -04:00
Vy Nguyen	ffc65824f0	[lld-macho][nfc] Minor refactoring + clang-tidy fixes - use "empty()" instead of "size()" - refactor the re-export code so it doesn't create a new vector every time. Differential Revision: https://reviews.llvm.org/D100019	2021-04-07 13:55:52 -04:00
Jez Ng	982e3c0510	[lld-macho] Sibling N_SO symbols must have the empty string We had been giving them a string index of zero, which actually corresponds to a string with a single space due to {D89639}. This was far from obvious in the old test because llvm-nm doesn't quote the symbol names, making the empty string look identical to a string of a single space. `dsymutil -s` quotes its strings, so I've changed the test accordingly. Fixes llvm.org/PR48714. Thanks @clayborg for the tips! Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D100003	2021-04-07 12:08:14 -04:00
Jez Ng	d855a727bb	[lld-macho][nfc] Add test for ARM64 stubs Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D99813	2021-04-07 12:08:12 -04:00
Jez Ng	2461804b48	[lld-macho] Symbol::value should always be uint64_t D98837 migrated a bunch of `value`s to uint64_t, but missed these.	2021-04-06 17:54:11 -04:00
Jez Ng	ceec610754	[lld-macho] Fix & refactor symbol size calculations I noticed two problems with the previous implementation: * N_ALT_ENTRY symbols weren't being handled correctly -- they should determine the size of the previous symbol, even though they don't cause a new section to be created * The last symbol in a section had its size calculated wrongly; the first subsection's size was used instead of the last one I decided to take the opportunity to refactor things as well, mainly to realize my observation [here](https://reviews.llvm.org/D98837#inline-931511) that we could avoid doing a binary search to match symbols with subsections. I think the resulting code is a bit simpler too. N Min Max Median Avg Stddev x 20 4.31 4.43 4.37 4.3775 0.034162922 + 20 4.32 4.43 4.38 4.3755 0.02799906 No difference proven at 95.0% confidence Reviewed By: #lld-macho, alexshap Differential Revision: https://reviews.llvm.org/D99972	2021-04-06 15:10:01 -04:00
Jez Ng	94f75202ac	[lld-macho][nfc] Remove HelpHidden from aliases to implemented flags This is a no-op. Just cleaning up Options.td... Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D99874	2021-04-06 15:10:00 -04:00
Jez Ng	9456e720ec	[lld-macho][nfc] Rename some tests "stub" is a bit too overloaded... we were using it to refer to TAPI files, but it's also the name for the PLT trampolines in Mach-O. Going ahead, let's just use "TAPI" or ".tbd" to refer to TAPI stuff. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D99807	2021-04-06 15:10:00 -04:00
Jez Ng	174deb0539	[lld-macho] clang-format cleanup find . -type f -name ".cpp" -o -name ".h" \| xargs clang-format -i	2021-04-06 14:26:13 -04:00
Jez Ng	e0df2b540a	[lld-macho] Rename SubsectionMapping to SubsectionMap We bikeshedded about it here: https://reviews.llvm.org/D98837#inline-931557 I initially suggested SubsectionMapping, but I thought the discussion landed on doing `std::vector<SubsectionEntry>`. @alexshap went and did both, but on hindsight I regret adding 3 more characters to an already long name, and I think SubsectionEntry is descriptive enough... This diff also renames `subsectionMap` to `subsecMap` for consistency with other variable names in the codebase.	2021-04-06 14:26:13 -04:00
Abhina Sreeskantharajan	82b3e28e83	[SystemZ][z/OS][Windows] Add new OF_TextWithCRLF flag and use this flag instead of OF_Text Problem: On SystemZ we need to open text files in text mode. On Windows, files opened in text mode adds a CRLF '\r\n' which may not be desirable. Solution: This patch adds two new flags - OF_CRLF which indicates that CRLF translation is used. - OF_TextWithCRLF = OF_Text \| OF_CRLF indicates that the file is text and uses CRLF translation. Developers should now use either the OF_Text or OF_TextWithCRLF for text files and OF_None for binary files. If the developer doesn't want carriage returns on Windows, they should use OF_Text, if they do want carriage returns on Windows, they should use OF_TextWithCRLF. So this is the behaviour per platform with my patch: z/OS: OF_None: open in binary mode OF_Text : open in text mode OF_TextWithCRLF: open in text mode Windows: OF_None: open file with no carriage return OF_Text: open file with no carriage return OF_TextWithCRLF: open file with carriage return The Major change is in llvm/lib/Support/Windows/Path.inc to only set text mode if the OF_CRLF is set. ``` if (Flags & OF_CRLF) CrtOpenFlags \|= _O_TEXT; ``` These following files are the ones that still use OF_Text which I left unchanged. I modified all these except raw_ostream.cpp in recent patches so I know these were previously in Binary mode on Windows. ./llvm/lib/Support/raw_ostream.cpp ./llvm/lib/TableGen/Main.cpp ./llvm/tools/dsymutil/DwarfLinkerForBinary.cpp ./llvm/unittests/Support/Path.cpp ./clang/lib/StaticAnalyzer/Core/HTMLDiagnostics.cpp ./clang/lib/Frontend/CompilerInstance.cpp ./clang/lib/Driver/Driver.cpp ./clang/lib/Driver/ToolChains/Clang.cpp Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D99426	2021-04-06 07:23:31 -04:00
Sam Clegg	dc1a08caef	[lld][WebAssembly] Rewrite exports test in assembly. NFC Differential Revision: https://reviews.llvm.org/D99885	2021-04-05 11:14:12 -07:00
Cyndy Ishida	0116d04d04	[TextAPI] move source code files out of subdirectory, NFC TextAPI/ELF has moved out into InterfaceStubs, so theres no longer a need to seperate out TextAPI between formats. Reviewed By: ributzka, int3, #lld-macho Differential Revision: https://reviews.llvm.org/D99811	2021-04-05 10:24:42 -07:00
Stefan Pintilie	660c4e57b4	[PowerPC] Fix issue where binary uses a .got but is missing a .TOC. From the PowerPC ELFv2 ABI section 4.2.3. Global Offset Table. ``` The GOT consists of an 8-byte header that contains the TOC base (the first TOC base when multiple TOCs are present), followed by an array of 8-byte addresses. ``` Due to the introduction of PC Relative code it is now possible to require a GOT without having a .TOC. symbol in the object that is being linked. Since LLD uses the .TOC. symbol to determine whether or not a GOT is required the GOT header is not setup correctly and the 8-byte header is missing. This patch allows the Power PC GOT setup to happen when an element is added to the GOT instead of at the very begining. When this header is added a .TOC. symbol is also added. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D91426	2021-04-05 09:13:20 -05:00
Jez Ng	bd115d0991	[lld-macho] Another attempt at fixing 32-bit builds	2021-04-03 11:58:23 -04:00
Jez Ng	c04e1c8b66	[lld-macho] Fix build on 32-bit systems Summary: Follow-up to D99633.	2021-04-03 11:12:11 -04:00
Nico Weber	a78fa73bcf	Revert "[lld-link] Enable addrsig table in COFF lto" This reverts commit `eabd55b1b2`. Speculative, for crbug.com/1195545	2021-04-03 10:56:33 -04:00
Fangrui Song	c318746345	[lld-macho] Fix -Wsuggest-override after D99633. NFC	2021-04-02 17:04:11 -07:00
Jez Ng	817d98d841	[lld-macho][nfc] Refactor in preparation for 32-bit support The main challenge was handling the different on-disk structures (e.g. `mach_header` vs `mach_header_64`). I tried to strike a balance between sprinkling `target->wordSize == 8` checks everywhere (branchy = slow, and ugly) and templatizing everything (causes code bloat, also ugly). I think I struck a decent balance by judicious use of type erasure. Note that LLD-ELF has a similar architecture, though it seems to use more templating. Linking chromium_framework takes about the same time before and after this change: N Min Max Median Avg Stddev x 20 4.52 4.67 4.595 4.5945 0.044423204 + 20 4.5 4.71 4.575 4.582 0.056344803 No difference proven at 95.0% confidence Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D99633	2021-04-02 18:46:39 -04:00
Greg McGary	3f8c6f493b	[lld-macho][NFC] Remove redundant member from class Defined `class Symbol` defines a data member `InputFile file;` `class Defined` inherits from `Symbol` and also defines a data member `InputFile file;` for no apparent purpose. Differential Revision: https://reviews.llvm.org/D99783	2021-04-02 09:19:15 -07:00
Yang Fan	d441dee5c2	[lld][MachO] Fix -Wsign-compare warning (NFC) GCC warning: ``` /llvm-project/lld/MachO/InputFiles.cpp:484:24: warning: comparison of integer expressions of different signedness: ‘int64_t’ {aka ‘long int’} and ‘uint64_t’ {aka ‘long unsigned int’} [-Wsign-compare] 484 \| return value < subsectionEntry.offset; \| ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~ ```	2021-04-02 11:33:56 +08:00
Yang Fan	062d4ddd22	[lld] Add missing header guard (NFC)	2021-04-02 11:12:23 +08:00
Alexander Shaposhnikov	f6ad045366	[lld][MachO] Make emitEndFunStab independent from .subsections_via_symbols This diff addresses FIXME in SyntheticSections.cpp and removes the dependency of emitEndFunStab on .subsections_via_symbols. Test plan: make check-lld-macho Differential revision: https://reviews.llvm.org/D99054	2021-04-01 17:48:09 -07:00
Alexander Shaposhnikov	f1e4e2fb20	[lld][MachO] Refactor handling of subsections This diff is a preparation for fixing FunStabs (incorrect size calculation). std::map<uint32_t, InputSection*> (SubsectionMap) is replaced with a sorted vector + binary search. If .subsections_via_symbols is set this vector will contain the list of subsections, otherwise, the offsets will be used for calculating the symbols sizes. Test plan: make check-all Differential revision: https://reviews.llvm.org/D98837	2021-03-31 16:52:53 -07:00
Jez Ng	9b6dde8af8	[lld-macho] Parallelize UUID hash computation This reuses the approach (and some code) from LLD-ELF. It's a decent win when linking chromium_framework on a Mac Pro (3.2 GHz 16-Core Intel Xeon W): N Min Max Median Avg Stddev x 20 4.58 4.83 4.66 4.6685 0.066591844 + 20 4.42 4.61 4.5 4.505 0.04751731 Difference at 95.0% confidence -0.1635 +/- 0.0370242 -3.5022% +/- 0.793064% (Student's t, pooled s = 0.0578462) The output binary is 381MB. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D99279	2021-03-31 15:48:36 -04:00
Jez Ng	09aed384ba	[lld-macho][nfc] Test that -ObjC will import bitcode with category sections The functionality was originally added in {D95265}, but the test in that diff only checked if `-ObjC` would cause bitcode containing ObjC class symbols to be loaded. It neglected to test for bitcode containing categories but no class symbols. This diff also changes the lto-archive.ll test to use `-why_load` instead of inspecting the output binary's symbol table. This is motivated by the stacked diff {D99105}, which will hide irrelevant bitcode symbols. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D99215	2021-03-31 15:48:36 -04:00
Zequan Wu	eabd55b1b2	[lld-link] Enable addrsig table in COFF lto This allow safe-icf mode to work when linking with LTO. Differential Revision: https://reviews.llvm.org/D99613	2021-03-30 15:47:53 -07:00
Greg McGary	427d359721	[lld-macho][NFC] Drop unnecessary macho:: namespace prefix on unambiguous references to Symbol Within `lld/macho/`, only `InputFiles.cpp` and `Symbols.h` require the `macho::` namespace qualifier to disambiguate references to `class Symbol`. Add braces to outer `for` of a 5-level single-line `if`/`for` nest. Differential Revision: https://reviews.llvm.org/D99555	2021-03-30 14:58:35 -07:00
Amy Huang	5127da0291	Revert "[COFF] Only consider associated EH sections during ICF" This change causes an asan error for ODR violation. This reverts commit `7ce9a3e9a9`.	2021-03-29 19:15:35 -07:00
Nico Weber	221388f451	fix comment typo to cycle bots	2021-03-29 14:50:17 -04:00
Nico Weber	742f663705	fix comment typo to cycle bots	2021-03-29 14:35:57 -04:00
Jez Ng	a43f588e01	[lld-macho] Implement -segprot Addresses llvm.org/PR49405. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D99389	2021-03-29 14:08:12 -04:00
Jez Ng	ae7aa9ed15	[lld-macho] Add time tracing for LTO The test is similar to the one used for LLD-ELF. Differential Revision: https://reviews.llvm.org/D99318	2021-03-26 18:14:10 -04:00
Jez Ng	94e369400e	[lld-macho] Fix parsing of --time-trace-{granularity,file} Summary: We needed to use `Joined` instead of `Flag`. This wasn't caught because the relevant test that was copied from LLD-ELF was still invoking LLD-ELF instead of LLD-MachO... Differential Revision: https://reviews.llvm.org/D99313	2021-03-26 18:14:10 -04:00
Jez Ng	45cdceb40c	[lld-macho] Support -no_function_starts Pretty simple code-wise. Also threw in some refactoring: * Put the functionStartSection under Writer instead of InStruct, since it doesn't need to be accessed outside of Writer * Adjusted the test to put all files under the temp dir instead of at the top-level * Added some CHECK-LABELs to make it clearer where the function starts data is Differential Revision: https://reviews.llvm.org/D99112	2021-03-26 18:14:10 -04:00
Vy Nguyen	dee5787d3e	Reland [lld-macho][nfc] minor clean up, follow up to D98559 This reverts commit `77b4230ed9`. New change: Fixed tests on windows Differential Revision: https://reviews.llvm.org/D99210	2021-03-25 16:46:37 -04:00
Vy Nguyen	e2f34cc330	[lld-macho][nfc] Removed unnecessary static_cast Differential Revision: https://reviews.llvm.org/D99365	2021-03-25 15:07:46 -04:00
Jez Ng	0113cf00b6	[lld-macho] Add support for --threads Code and test are largely identical to the LLD-ELF equivalents. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D99312	2021-03-25 14:51:31 -04:00
Jez Ng	4bcaafeb0e	[lld-macho] Add more TimeTraceScopes I added just enough to allow us to see a top-level breakdown of time taken. This is the result of loading the time-trace output into `chrome:://tracing`: `ef5e8234f3/tracing.png` Reviewed By: oontvoo Differential Revision: https://reviews.llvm.org/D99311	2021-03-25 14:51:31 -04:00
Jez Ng	53fd1ada76	[lld-macho] Fix typo in diagnostic message	2021-03-25 14:51:31 -04:00
Abhina Sreeskantharajan	c83cd8feef	[NFC] Reordering parameters in getFile and getFileOrSTDIN In future patches I will be setting the IsText parameter frequently so I will refactor the args to be in the following order. I have removed the FileSize parameter because it is never used. ``` static ErrorOr<std::unique_ptr<MemoryBuffer>> getFile(const Twine &Filename, bool IsText = false, bool RequiresNullTerminator = true, bool IsVolatile = false); static ErrorOr<std::unique_ptr<MemoryBuffer>> getFileOrSTDIN(const Twine &Filename, bool IsText = false, bool RequiresNullTerminator = true); static ErrorOr<std::unique_ptr<MB>> getFileAux(const Twine &Filename, uint64_t MapSize, uint64_t Offset, bool IsText, bool RequiresNullTerminator, bool IsVolatile); static ErrorOr<std::unique_ptr<WritableMemoryBuffer>> getFile(const Twine &Filename, bool IsVolatile = false); ``` Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D99182	2021-03-25 09:47:49 -04:00
Martin Storsjö	a88556733a	[LLD] Fix probing a MSYS based 'tar' in a Windows Container Don't run the 'tar' tool in a cleared environment with only the LANG variable set, just set LANG on top of the existing environment. If the 'tar' tool is an MSYS based tool, running it in a Windows Container hangs if all environment variables are cleared - in particular, the USERPROFILE variable needs to be kept intact. This is the same issue fixed as was fixed in other places in `9de63b2e05`, but contrary to running the actual tests, running with an as-cleared-as-possible environment here is less important. Differential Revision: https://reviews.llvm.org/D99304	2021-03-25 09:45:27 +02:00
Yolanda Chen	4f9c61ef72	[lld] add context-sensitive PGO options for COFF. Add lld CSPGO (Contex-Sensitive PGO) options for COFF target. Reference the ELF options from https://reviews.llvm.org/D56675 Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D98763	2021-03-24 23:40:09 -07:00
Vy Nguyen	d988ffc34f	[lld-macho][nfc] Fixed test so it output to %t/ rather than current directory. The a.out broke our build. Differential Revision: https://reviews.llvm.org/D99271	2021-03-24 16:55:37 -04:00
Konstantin Zhuravlyov	4f28303133	AMDGPU/LLD: Add target id and code object v4 support to linker Differential Revision: https://reviews.llvm.org/D95811	2021-03-24 13:41:10 -04:00
Konstantin Zhuravlyov	f4ace63737	AMDGPU: Add target id and code object v4 support - Add target id support (https://clang.llvm.org/docs/ClangOffloadBundler.html#target-id) - Add code object v4 support (https://llvm.org/docs/AMDGPUUsage.html#elf-code-object) - Add kernarg_size to kernel descriptor - Change trap handler ABI to no longer move queue pointer into s[0:1] - Cleanup ELF definitions - Add V2, V3, V4 suffixes to make a clear distinction for code object version - Consolidate note names Differential Revision: https://reviews.llvm.org/D95638	2021-03-24 11:54:05 -04:00
Andy Wingo	9ac5620cb8	[WebAssembly] Rename WasmLimits::Initial to ::Minimum. NFC. This patch renames the "Initial" member of WasmLimits to the name used in the spec, "Minimum". In the core WebAssembly specification, the Limits data type has one required "min" member and one optional "max" member, indicating the minimum required size of the corresponding table or memory, and the maximum size, if any. Although the WebAssembly spec does instantiate locally-defined tables and memories with the initial size being equal to the minimum size, it can't impose such a requirement for imports. It doesn't make sense to require an initial size for a memory import, for example. The compiler can only sensibly express the minimum and maximum sizes. See https://github.com/WebAssembly/js-types/blob/master/proposals/js-types/Overview.md#naming-of-size-limits for a related discussion that agrees that the right name of "initial" is "minimum" when querying the type of a table or memory from JavaScript. (Of course it still makes sense for JS to speak in terms of an initial size when it explicitly instantiates memories and tables.) Differential Revision: https://reviews.llvm.org/D99186	2021-03-24 09:10:11 +01:00
Shoaib Meenai	48d9b2fd8e	[lld] Fix test to work with and without a vendor string	2021-03-23 16:16:25 -07:00
Vy Nguyen	aa6e4cdd73	[lld-macho] Fixed lld-version expectation in test so it works on Fuchsia. On Fuchsia, it's called Fuchsia LLD Differential Revision: https://reviews.llvm.org/D99217	2021-03-23 17:56:23 -04:00
Vy Nguyen	77b4230ed9	Revert "[lld-macho][nfc] minor clean up, follow up to D98559" This reverts commit `1bc33eb6a3`. tests failed on windows	2021-03-23 17:15:36 -04:00
Vy Nguyen	1bc33eb6a3	[lld-macho][nfc] minor clean up, follow up to D98559 Differential Revision: https://reviews.llvm.org/D99210	2021-03-23 16:13:09 -04:00
Vy Nguyen	f499b932bf	Revert "Revert "Revert "Revert "Revert "Revert "[lld-macho] Implement -dependency_info (partially - more opcodes needed)"""""" This reverts commit `4876ba5b2d`. Third-attemp relanding D98559, new change: - explicitly cast enum to underlying type to avoid ambiguity (workaround to clang's bug).	2021-03-23 14:51:05 -04:00
Mehdi Amini	4876ba5b2d	Revert "Revert "Revert "Revert "Revert "[lld-macho] Implement -dependency_info (partially - more opcodes needed)""""" This reverts commit `3c21166a94`. The build is broken (clang-8 host compiler): lld/MachO/DriverUtils.cpp:271:8: error: use of overloaded operator '<<' is ambiguous (with operand types 'llvm::raw_fd_ostream' and 'lld::macho::DependencyTracker::DepOpCode') os << opcode; ~~ ^ ~~~~~~	2021-03-23 00:19:12 +00:00
Vy Nguyen	3c21166a94	Revert "Revert "Revert "Revert "[lld-macho] Implement -dependency_info (partially - more opcodes needed)"""" This reverts commit `9670d2e4af`. Second attemp to reland D98559. New changes: - inline functions removed from cpp file. - updated tests to use CHECK-DAG instead of CHECK-NEXT - fixed ambiguous "<<" operator by switching `char` to uint8_t	2021-03-22 19:34:51 -04:00
Reid Kleckner	7ce9a3e9a9	[COFF] Only consider associated EH sections during ICF The only known reason why ICF should not merge otherwise identical sections with differing associated sections has to do with exception handling tables. It's not clear what ICF should do when there are other kinds of associated sections. In every other case when this has come up, debug info and CF guard metadata, we have opted to make ICF ignore the associated sections. For comparison, ELF doesn't do anything for comdat groups. Instead, .eh_frame is parsed to figure out if a section has an LSDA, and if so, ICF is disabled. Another issue is that the order of associated sections is not defined. We have had issues in the past (crbug.com/1144476) where changing the order of the .xdata/.pdata sections in the object file lead to large ICF slowdowns. To address these issues, I decided it would be best to explicitly consider only .pdata and .xdata sections during ICF. This makes it easy to ignore the object file order, and I think it makes the intention of the code clearer. I've also made the children() accessor return an empty list for associated sections. This mostly only affects ICF and GC. This was the behavior before I made this a linked list, so the behavior change should be good. This had positive effects on chrome.dll: more .xdata sections were merged that previously could not be merged because they were associated with distinct .pdata sections. Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D98993	2021-03-22 15:36:26 -07:00
Reid Kleckner	e5646e4570	[PDB] Add missing test for `b552adf8b3`	2021-03-22 14:56:36 -07:00
Vy Nguyen	9670d2e4af	Revert "Revert "Revert "[lld-macho] Implement -dependency_info (partially - more opcodes needed)""" This reverts commit `5ad2c225f3`. bots still unhappy - revertting again	2021-03-22 14:54:01 -04:00
Vy Nguyen	5ad2c225f3	Revert "Revert "[lld-macho] Implement -dependency_info (partially - more opcodes needed)"" This reverts commit `2554b95db5`. Relanding [lld-macho] Implement -dependency_info (D98559) with changes: - inline functions removed from cpp file. - updated tests to not check libSystem.tbd with other input files (because of possible indeterministic ordering)	2021-03-22 14:41:57 -04:00
Stefan Pintilie	f21704e080	[LLD][PowerPC] Fix bug in PC-Relative initial exec There is a bug when initial exec is relaxed to local exec. In the following situation: InitExec.c ``` extern __thread unsigned TGlobal; unsigned getConst(unsigned); unsigned addVal(unsigned, unsigned); unsigned GetAddrT() { return addVal(getConst(&TGlobal), &TGlobal); } ``` Def.c ``` __thread unsigned TGlobal; unsigned getConst(unsigned* A) { return A + 3; } unsigned addVal(unsigned A, unsigned B) { return A + *B; } ``` The problem is in InitExec.c but Def.c is required if you want to link the example and see the problem. To compile everything: ``` clang -O3 -mcpu=pwr10 -c InitExec.c clang -O3 -mcpu=pwr10 -c Def.c ld.lld InitExec.o Def.o -o IeToLe ``` If you objdump the problem object file: ``` $ llvm-objdump -dr --mcpu=pwr10 InitExec.o ``` you will get the following assembly: ``` 0000000000000000 <GetAddrT>: 0: a6 02 08 7c mflr 0 4: f0 ff c1 fb std 30, -16(1) 8: 10 00 01 f8 std 0, 16(1) c: d1 ff 21 f8 stdu 1, -48(1) 10: 00 00 10 04 00 00 60 e4 pld 3, 0(0), 1 0000000000000010: R_PPC64_GOT_TPREL_PCREL34 TGlobal 18: 14 6a c3 7f add 30, 3, 13 0000000000000019: R_PPC64_TLS TGlobal 1c: 78 f3 c3 7f mr 3, 30 20: 01 00 00 48 bl 0x20 0000000000000020: R_PPC64_REL24_NOTOC getConst 24: 78 f3 c4 7f mr 4, 30 28: 30 00 21 38 addi 1, 1, 48 2c: 10 00 01 e8 ld 0, 16(1) 30: f0 ff c1 eb ld 30, -16(1) 34: a6 03 08 7c mtlr 0 38: 00 00 00 48 b 0x38 0000000000000038: R_PPC64_REL24_NOTOC addVal ``` The lines of interest are: ``` 10: 00 00 10 04 00 00 60 e4 pld 3, 0(0), 1 0000000000000010: R_PPC64_GOT_TPREL_PCREL34 TGlobal 18: 14 6a c3 7f add 30, 3, 13 0000000000000019: R_PPC64_TLS TGlobal 1c: 78 f3 c3 7f mr 3, 30 ``` Which once linked gets turned into: ``` 10010210: ff ff 03 06 00 90 6d 38 paddi 3, 13, -28672, 0 10010218: 00 00 00 60 nop 1001021c: 78 f3 c3 7f mr 3, 30 ``` The problem is that register 30 is never set after the optimization. Therefore it is not correct to relax the above instructions by replacing the add instruction with a nop. Instead the add instruction should be replaced with a copy (mr) instruction. If the add uses the same resgiter as input and as ouput then it is safe to continue to replace the add with a nop. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D95262	2021-03-22 13:15:44 -05:00
Yang Fan	0db28c0f3b	[ELF][docs] Add line breaks	2021-03-22 16:08:47 +08:00
Nico Weber	2554b95db5	Revert "[lld-macho] Implement -dependency_info (partially - more opcodes needed)" This reverts commit `c53a1322f3`. Test only passes depending on build dir having a lexicographically later name than the source dir, and doesn't link on mac/win. See https://reviews.llvm.org/D98559#2640265 onward.	2021-03-21 16:35:38 -04:00
Vy Nguyen	c53a1322f3	[lld-macho] Implement -dependency_info (partially - more opcodes needed) Bug: https://bugs.llvm.org/show_bug.cgi?id=49278 The flag is not well documented, so this implementation is based on observed behaviour. When specified, `-dependency_info <path>` produced a text file containing information pertaining to the current linkage, such as input files, output file, linker version, etc. This file's layout is also not documented, but it seems to be a series of null ('\0') terminated strings in the form `<op code><path>` `<op code>` could be: `0x00` : linker version `0x10` : input `0x11` : files not found(??) `0x40` : output `<path>` : is the file path, except for the linker-version case. (??) This part is a bit unclear. I think it means all the files the linker attempted to look at, but could not find. Differential Revision: https://reviews.llvm.org/D98559	2021-03-21 14:35:46 -04:00
Jez Ng	8757616de3	[lld-macho][nfc] Format Options.td Summary: A good chunk of it was mis-indented. Fixed by using the formatting settings from llvm/utils/vim.	2021-03-21 09:33:04 -04:00
Jez Ng	47fdaa32f9	[lld-macho] Minor touch-up to objc.s	2021-03-20 14:42:16 -04:00
Vy Nguyen	6c1ae8f2dc	[lld-macho][nfc] Fixed typo in comment Missed this one from https://reviews.llvm.org/D97007?id=331759#inline-930034 Differential Revision: https://reviews.llvm.org/D98973	2021-03-19 14:19:36 -04:00
Vy Nguyen	66f340051a	[lld-macho] Define __mh_*_header synthetic symbols. Bug: https://bugs.llvm.org/show_bug.cgi?id=49290 Differential Revision: https://reviews.llvm.org/D97007	2021-03-19 14:14:40 -04:00
Fangrui Song	16c30c3c23	[ELF] Change --shuffle-sections=<seed> to --shuffle-sections=<section-glob>=<seed> `--shuffle-sections=<seed>` applies to all sections. The new `--shuffle-sections=<section-glob>=<seed>` makes shuffling selective. To the best of my knowledge, the option is only used as debugging, so just drop the original form. `--shuffle-sections '.init_array=-1'` `--shuffle-sections '.fini_array=-1'`. reverses static constructors/destructors of the same priority. Useful to detect some static initialization order fiasco. `--shuffle-sections '.data=-1'` reverses `.data` sections. Useful to detect unfunded pointer comparison results of two unrelated objects. If certain sections have an intrinsic order, the old form cannot be used. Differential Revision: https://reviews.llvm.org/D98679	2021-03-18 10:18:19 -07:00
caoming.roy	ed8bff13dc	[lld-macho] implement options -map Implement command-line options -map Reviewed By: int3, #lld-macho Differential Revision: https://reviews.llvm.org/D98323	2021-03-18 10:39:19 -04:00
Greg McGary	74b888baad	[lld-macho][NFC] Minor refactor of Writer::run() Move some functions closer to their uses. Move detailed address-assignment logic out of the otherwise abstract `Writer::run()`. This prepares the ground for a diff to implement branch range extension thunks. * `SyntheticSections.cpp` move `needsBinding()` and `prepareBranchTarget()` into `Writer.cpp` move `addNonLazyBindingEntries()` adjacent to its use. * `Writer.cpp` move address-assignment logic from `Writer::run()` into new function `Writer::assignAddresses()` move `needsBinding()` and `prepareBranchTarget()` from `SyntheticSections.cpp` * `Target.h` ** remove orphaned decls of `prepareSymbolRelocation()` and `validateRelocationInfo()` which were moved to other files in earlier diffs. Differential Revision: https://reviews.llvm.org/D98795	2021-03-17 15:13:43 -07:00
Fangrui Song	423cb321df	[ELF] Special case --shuffle-sections=-1 to reverse input sections If the number of sections changes, which is common for re-links after incremental updates, the section order may change drastically. Special case -1 to reverse input sections. This is a stable transform. The section order is more resilient to incremental updates. Usually the code issue (e.g. Static Initialization Order Fiasco, assuming pointer comparison result of two unrelated objects) is due to the relative order between two problematic input files A and B. Checking the regular order and the reversed order is sufficient. Differential Revision: https://reviews.llvm.org/D98445	2021-03-17 09:32:44 -07:00
Greg McGary	a170533632	[lld-macho][NFC] Drop unnecessary braces around simple if/for bodies Minor cleanup Differential Revision: https://reviews.llvm.org/D98758	2021-03-16 22:39:39 -07:00
Greg McGary	db1e845a96	[lld-macho] Handle error cases properly for -exported_symbol(s_list) This fixes defects in D98223 [lld-macho] implement options -(un)exported_symbol(s_list): * disallow export of hidden symbols * verify that whitelisted literal names are defined in the symbol table * reflect export-status overrides in `nlist` attribute of `N_EXT` or `N_PEXT` Thanks to @thakis for raising these issues Differential Revision: https://reviews.llvm.org/D98381	2021-03-16 21:20:39 -07:00
Markus Böck	af2796c76d	[test] Add ability to get error messages from CMake for errc substitution Visual Studios implementation of the C++ Standard Library does not use strerror to produce a message for std::error_code unlike other standard libraries such as libstdc++ or libc++ that might be used. This patch adds a cmake script that through running a C++ program gets the error messages for the POSIX error codes and passes them onto lit through an optional config parameter. If the config parameter is not set, or getting the messages failed, due to say a cross compiling configuration without an emulator, it will fall back to using pythons strerror functions. Differential Revision: https://reviews.llvm.org/D98278	2021-03-15 20:56:08 +01:00
Jez Ng	29d4676059	[lld-macho] Place LC_FUNCTION_STARTS data at the right position This pleases the codesign (Otherwise it complains about "function starts data out of place") Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D98648	2021-03-15 14:56:31 -04:00

... 5 6 7 8 9 ...

14530 Commits