llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	bccdf9197b	Revert "[lld-macho] Work around odr-use of const non-inline static data member to fix -O0 build after D128298" This reverts commit `20b2d3260d`. The workaround is no longer needed for C++17.	2022-08-06 16:44:14 -07:00
Jez Ng	ee61dc5f6c	[lld-macho][nfc] Reduce nesting of code added in D130125	2022-07-23 13:16:00 -04:00
Kazu Hirata	1cc7f5bede	Use static_assert instead of assert (NFC) Identified with misc-static-assert.	2022-07-23 09:22:27 -07:00
Jez Ng	d23da0ec6c	[lld-macho] Fold __objc_imageinfo sections Previously, we treated it as a regular ConcatInputSection. However, ld64 actually parses its contents and uses that to synthesize a single image info struct, generating one 8-byte section instead of `8 * number of object files with ObjC code`. I'm not entirely sure what impact this section has on the runtime, so I just tried to follow ld64's semantics as closely as possible in this diff. My main motivation though was to reduce binary size. No significant perf change on chromium_framework on my 16-core Mac Pro: base diff difference (95% CI) sys_time 1.764 ± 0.062 1.748 ± 0.032 [ -2.4% .. +0.5%] user_time 5.112 ± 0.104 5.106 ± 0.046 [ -0.9% .. +0.7%] wall_time 6.111 ± 0.184 6.085 ± 0.076 [ -1.6% .. +0.8%] samples 30 32 Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D130125	2022-07-23 12:12:01 -04:00
Daniel Bertalan	54e18b2397	[lld-macho] Optimize rebase opcode generation This commit reduces the size of the emitted rebase sections by generating the REBASE_OPCODE_DO_REBASE_ADD_ADDR_ULEB and REBASE_OPCODE_DO_REBASE_ULEB_TIMES_SKIPPING_ULEB opcodes. With this change, chromium_framework's rebase section is a 40% smaller 197 kilobytes, down from the previous 320 kB. That is 6 kB smaller than what ld64 produces for the same input. Performance figures from my M1 Mac mini: x before + after N Min Max Median Avg Stddev x 10 4.2269349 4.3300061 4.2689675 4.2690016 0.031151669 + 10 4.219331 4.2914009 4.2398136 4.2448277 0.023817308 No difference proven at 95.0% confidence Differential Revision: https://reviews.llvm.org/D130180	2022-07-21 10:00:39 +02:00
Daniel Bertalan	8d29f0fdb9	[lld-macho] Emit REBASE_OPCODE_ADD_ADDR_IMM_SCALED if possible An ADD_ADDR rebase opcode's argument can be encoded as an immediate if the offset is less than 15 * word size. This change reduces the size of chromium_framework by 100+ KiB. Differential Revision: https://reviews.llvm.org/D128798	2022-06-29 22:28:39 +02:00
Fangrui Song	20b2d3260d	[lld-macho] Work around odr-use of const non-inline static data member to fix -O0 build after D128298 ``` ld.lld: error: undefined symbol: lld::macho::CodeSignatureSection::blockSize >>> referenced by SyntheticSections.cpp:1253 (/home/maskray/llvm/lld/MachO/SyntheticSections.cpp:1253) >>> tools/lld/MachO/CMakeFiles/lldMachO.dir/SyntheticSections.cpp.o:(lld::macho::CodeSignatureSection::writeHashes(unsigned char*) const::$_7::operator()(unsigned long) const) ```	2022-06-21 19:22:28 -07:00
Nico Weber	0baf13e282	[lld/mac] Parallelize code signature computation According to ministat, this is a small but measurable speedup (using the repro in PR56121): N Min Max Median Avg Stddev x 10 3.7439518 3.7783802 3.7730219 3.7655502 0.012375226 + 10 3.6149218 3.692198 3.6519327 3.6502951 0.025905601 Difference at 95.0% confidence -0.115255 +/- 0.0190746 -3.06078% +/- 0.506554% (Student's t, pooled s = 0.0203008) (Without `858e8b17f7`, this change here to use parallelFor is an 18% speedup, and doing `858e8b17f7` on top of this change is just a 2.55% +/- 0.58% win. Doing both results in a total speedup of 20.85% +/- 0.44%.) Differential Revision: https://reviews.llvm.org/D128298	2022-06-21 20:41:35 -04:00
Daniel Bertalan	5792797c5b	Reland "[lld-macho] Show source information for undefined references" The error used to look like this: ld64.lld: error: undefined symbol: _foo >>> referenced by /path/to/bar.o:(symbol _baz+0x4) If DWARF line information is available, we now show where in the source the references are coming from: ld64.lld: error: unreferenced symbol: _foo >>> referenced by: bar.cpp:42 (/path/to/bar.cpp:42) >>> /path/to/bar.o:(symbol _baz+0x4) The reland is identical to the first time this landed. The fix was in D128294. This reverts commit `0cc7ad4175`. Differential Revision: https://reviews.llvm.org/D128184	2022-06-21 18:50:06 -04:00
Nico Weber	3ade3d3724	[lld/mac] Replace while loop with for loop No behavior change. In preparation for using a parallelFor() here. Differential Revision: https://reviews.llvm.org/D128295	2022-06-21 15:42:06 -04:00
Nico Weber	858e8b17f7	[lld/mac] On Apple systems, call CC_SHA256 from libSystem It's in libSystem, so it doesn't bring in any new deps, and it's currently much faster than LLVM's current SHA256 implementation. Makes linking (arm64) Chromium Framework with ld64.lld 17% faster. See also PR56121. No behavior change. Differential Revision: https://reviews.llvm.org/D128290	2022-06-21 14:58:04 -04:00
Nico Weber	ca25baee7e	[lld/mac] Extract a sha256() function No behavior change. Differential Revision: https://reviews.llvm.org/D128289	2022-06-21 14:02:42 -04:00
Nico Weber	0cc7ad4175	Revert "[lld-macho] Show source information for undefined references" This reverts commit `cd7624f153`. See https://reviews.llvm.org/D128184#3597534	2022-06-20 19:15:57 -04:00
Daniel Bertalan	cd7624f153	[lld-macho] Show source information for undefined references The error used to look like this: ld64.lld: error: undefined symbol: _foo >>> referenced by /path/to/bar.o:(symbol _baz+0x4) If DWARF line information is available, we now show where in the source the references are coming from: ld64.lld: error: unreferenced symbol: _foo >>> referenced by: bar.cpp:42 (/path/to/bar.cpp:42) >>> /path/to/bar.o:(symbol _baz+0x4) Differential Revision: https://reviews.llvm.org/D128184	2022-06-20 18:49:42 -04:00
Vy Nguyen	82de9bb66b	[lld-macho] Addressed additional post-commit comments from D126046 - fixed newlines - renamed helper function for clarity - added additional comment Differential Revision: https://reviews.llvm.org/D126792	2022-06-03 15:48:11 -04:00
Nico Weber	815825f442	[lld/mac] clang-format after `f5709066e3`	2022-06-01 14:53:08 -04:00
Michael Eisel	f5709066e3	[lld/mac] Cache file IDs of symbols in emitStabs for faster sorting This reduces the time emitStabs() takes by about 275ms, or 3% of overall linking time for the project I'm on. Although the parent function is run in parallel, it's one of the slowest tasks in that concurrent batch (I have another optimization for another slow task as well). Differential Revision: https://reviews.llvm.org/D126785	2022-06-01 14:51:34 -04:00
Vy Nguyen	fae6bd7563	[lld-macho] Support -non_global_symbols_strip_list, -non_global_symbols_no_strip_list, -x PR/55600 Differential Revision: https://reviews.llvm.org/D126046	2022-05-25 19:22:04 +07:00
Vy Nguyen	c0ec1036d6	[lld-macho][nfc] Run clang-format on lld/MachO/*.{h,cpp} - fixed inconsistent indents and spaces - prevent extraneous formatting changes in other patches Differential Revision: https://reviews.llvm.org/D126262	2022-05-24 08:36:20 +07:00
Jez Ng	1cff723ff5	[lld-macho][nfc] Use includeInSymtab for all symtab-skipping logic {D123302} got me looking deeper at `includeInSymtab`. I thought it was a little odd that there were excluded (live) symbols for which `includeInSymtab` was false; we shouldn't have so many different ways to exclude a symbol. As such, this diff makes the `L`-prefixed-symbol exclusion code use `includeInSymtab` too. (Note that as part of our support for `__eh_frame`, we will also be excluding all `__eh_frame` symbols from the symtab in a future diff.) Another thing I noticed is that the `emitStabs` code never has to deal with excluded symbols because `SymtabSection::finalize()` already filters them out. As such, I've updated the comments and asserts from {D123302} to reflect this. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D123433	2022-04-11 15:45:46 -04:00
Nico Weber	2cb3d28b17	[lld/mac] Add some comments and asserts I was wondering if SymtabSection::emitStabs() should check defined->includeInSymtab. Add asserts and comments explaining why that's not necessary. No behavior change. Differential Revision: https://reviews.llvm.org/D123302	2022-04-07 15:43:28 -04:00
Nico Weber	8c1ea1ab81	[lld/mac] Don't emit stabs entries for functions folded during ICF This matches ld64, and makes dsymutil work better with lld's output. Fixes PR54783, see there for details. Reduces time needed to run dsymutil on Chromium Framework from 8m30s (which is already down from 26 min with D123218) to 6m30s and removes many lines of "could not find object file symbol for symbol" from dsymutil output (previously: several MB of those messages, now dsymutil is completely silent). Differential Revision: https://reviews.llvm.org/D123252	2022-04-07 08:09:32 -04:00
Simon Pilgrim	156b94c2d3	Fix "result of 32-bit shift implicitly converted to 64 bits" MSVC warning. NFC.	2022-04-07 11:25:09 +01:00
Argyrios Kyrtzidis	330268ba34	[Support/Hash functions] Change the `final()` and `result()` of the hashing functions to return an array of bytes Returning `std::array<uint8_t, N>` is better ergonomics for the hashing functions usage, instead of a `StringRef`: * When returning `StringRef`, client code is "jumping through hoops" to do string manipulations instead of dealing with fixed array of bytes directly, which is more natural * Returning `std::array<uint8_t, N>` avoids the need for the hasher classes to keep a field just for the purpose of wrapping it and returning it as a `StringRef` As part of this patch also: * Introduce `TruncatedBLAKE3` which is useful for using BLAKE3 as the hasher type for `HashBuilder` with non-default hash sizes. * Make `MD5Result` inherit from `std::array<uint8_t, 16>` which improves & simplifies its API. Differential Revision: https://reviews.llvm.org/D123100	2022-04-05 21:38:06 -07:00
Jez Ng	c9c2363048	[lld-macho][nfc] Don't mix file sizes with addresses Update DataInCode's calculation of `endAddr` to use `getSize()` instead of `getFileSize()` -- while in practice they're the same for non-zerofill sections (which code sections are), we still should treat address sizes / offsets as distinct from file sizes / offsets.	2022-03-22 17:52:53 -04:00
Jez Ng	ceff23c6e3	[lld-macho] -flat_namespace for dylibs should make all externs interposable All references to interposable symbols can be redirected at runtime to point to a different symbol definition (with the same name). For example, if both dylib A and B define symbol _foo, and we load A before B at runtime, then all references to _foo within dylib B will point to the definition in dylib A. ld64 makes all extern symbols interposable when linking with `-flat_namespace`. TODO 1: Support `-interposable` and `-interposable_list`, which should just be a matter of parsing those CLI flags and setting the `Defined::interposable` bit. TODO 2: Set Reloc::FinalDefinitionInLinkageUnit correctly with this info (we are currently not setting it at all, so we're erring on the conservative side, but we should help the LTO backend generate more optimal code.) Reviewed By: modimo, MaskRay Differential Revision: https://reviews.llvm.org/D119294	2022-03-14 22:18:32 -04:00
Jez Ng	7f3ddf8443	[lld-macho][nfc] Allow Defined symbols to be placed in binding sections Previously, we only allowed this for DylibSymbols. However, in order to properly support `-flat_namespace` as well as `-interposable`, we need to allow this for Defined symbols too. Therefore we hoist the `lazyBindOffset` and the `stubsHelperIndex` into the parent Symbol class. The actual change to support interposition under `-flat_namespace` is in {D119294}; the NFC changes here have been split out for easier review. Perf regression isn't stat sig on my 3.2 GHz 16-Core Intel Xeon W linking chromium_framework: base diff difference (95% CI) sys_time 1.227 ± 0.021 1.234 ± 0.031 [ -0.3% .. +1.5%] user_time 3.665 ± 0.036 3.674 ± 0.035 [ -0.2% .. +0.7%] wall_time 4.596 ± 0.055 4.609 ± 0.064 [ -0.3% .. +0.9%] samples 34 47 Max RSS regression is barely stat sig: base diff difference (95% CI) time 1003664356.324 ± 15404053.912 1010380403.613 ± 10578309.455 [ +0.0% .. +1.3%] samples 37 31 Reviewed By: modimo Differential Revision: https://reviews.llvm.org/D121351	2022-03-14 22:18:32 -04:00
Jez Ng	4308f031cd	[lld-macho] Align cstrings less conservatively Previously, we aligned every cstring to 16 bytes as a temporary hack to deal with https://github.com/llvm/llvm-project/issues/50135. However, it was highly wasteful in terms of binary size. To recap, in contrast to ELF, which puts strings that need different alignments into different sections, `clang`'s Mach-O backend puts them all in one section. Strings that need to be aligned have the .p2align directive emitted before them, which simply translates into zero padding in the object file. In other words, we have to infer the alignment of the cstrings from their addresses. We differ slightly from ld64 in how we've chosen to align these cstrings. Both LLD and ld64 preserve the number of trailing zeros in each cstring's address in the input object files. When deduplicating identical cstrings, both linkers pick the cstring whose address has more trailing zeros, and preserve the alignment of that address in the final binary. However, ld64 goes a step further and also preserves the offset of the cstring from the last section-aligned address. I.e. if a cstring is at offset 18 in the input, with a section alignment of 16, then both LLD and ld64 will ensure the final address is 2-byte aligned (since `18 == 16 + 2`). But ld64 will also ensure that the final address is of the form 16 * k + 2 for some k (which implies 2-byte alignment). Note that ld64's heuristic means that a dedup'ed cstring's final address is dependent on the order of the input object files. E.g. if in addition to the cstring at offset 18 above, we have a duplicate one in another file with a `.cstring` section alignment of 2 and an offset of zero, then ld64 will pick the cstring from the object file earlier on the command line (since both have the same number of trailing zeros in their address). So the final cstring may either be at some address `16 * k + 2` or at some address `2 * k`. I've opted not to follow this behavior primarily for implementation simplicity, and secondarily to save a few more bytes. It's not clear to me that preserving the section alignment + offset is ever necessary, and there are many cases that are clearly redundant. In particular, if an x86_64 object file contains some strings that are accessed via SIMD instructions, then the .cstring section in the object file will be 16-byte-aligned (since SIMD requires its operand addresses to be 16-byte aligned). However, there will typically also be other cstrings in the same file that aren't used via SIMD and don't need this alignment. They will be emitted at some arbitrary address `A`, but ld64 will treat them as being 16-byte aligned with an offset of `16 % A`. I have verified that the two repros in https://github.com/llvm/llvm-project/issues/50135 work well with the new alignment behavior. Fixes https://github.com/llvm/llvm-project/issues/54036. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D121342	2022-03-10 15:18:15 -05:00
Jez Ng	2b78ef06c2	[lld-macho][nfc] Eliminate InputSection::Shared Earlier in LLD's evolution, I tried to create the illusion that subsections were indistinguishable from "top-level" sections. Thus, even though the subsections shared many common field values, I hid those common values away in a private Shared struct (see D105305). More recently, however, @gkm added a public `Section` struct in D113241 that served as an explicit way to store values that are common to an entire set of subsections (aka InputSections). Now that we have another "common value" struct, `Shared` has been rendered redundant. All its fields can be moved into `Section` instead, and the pointer to `Shared` can be replaced with a pointer to `Section`. This `Section` pointer also has the advantage of letting us inspect other subsections easily, simplifying the implementation of {D118798}. P.S. I do think that having both `Section` and `InputSection` makes for a slightly confusing naming scheme. I considered renaming `InputSection` to `Subsection`, but that would break the symmetry with `OutputSection`. It would also make us deviate from LLD-ELF's naming scheme. This change is perf-neutral on my 3.2 GHz 16-Core Intel Xeon W machine: base diff difference (95% CI) sys_time 1.258 ± 0.031 1.248 ± 0.023 [ -1.6% .. +0.1%] user_time 3.659 ± 0.047 3.658 ± 0.041 [ -0.5% .. +0.4%] wall_time 4.640 ± 0.085 4.625 ± 0.063 [ -1.0% .. +0.3%] samples 49 61 There's also no stat sig change in RSS (as measured by `time -l`): base diff difference (95% CI) time 998038627.097 ± 13567305.958 1003327715.556 ± 15210451.236 [ -0.2% .. +1.2%] samples 31 36 Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D118797	2022-02-03 19:55:42 -05:00
Alexandre Ganea	83d59e05b2	Re-land [LLD] Remove global state in lldCommon Move all variables at file-scope or function-static-scope into a hosting structure (lld::CommonLinkerContext) that lives at lldMain()-scope. Drivers will inherit from this structure and add their own global state, in the same way as for the existing COFFLinkerContext. See discussion in https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html The previous land `f860fe3622` caused issues in https://lab.llvm.org/buildbot/#/builders/123/builds/8383, fixed by `22ee510dac`. Differential Revision: https://reviews.llvm.org/D108850	2022-01-20 14:53:26 -05:00
Keith Smiley	3f38dc5c04	[lld-macho] Silence XAR deprecation warning If you're building this on macOS 12.x+ this produces a deprecation warning. I'm not sure what this means for the bitcode format going forward, but it seems safe to silence for now. Do we need to worry about GCC for this? Differential Revision: https://reviews.llvm.org/D117718	2022-01-19 13:51:55 -08:00
Alexandre Ganea	e6b153947d	Revert [LLD] Remove global state in lldCommon It seems to be causing issues on https://lab.llvm.org/buildbot/#/builders/123/builds/8383	2022-01-16 11:03:06 -05:00
Alexandre Ganea	f860fe3622	[LLD] Remove global state in lldCommon Move all variables at file-scope or function-static-scope into a hosting structure (lld::CommonLinkerContext) that lives at lldMain()-scope. Drivers will inherit from this structure and add their own global state, in the same way as for the existing COFFLinkerContext. See discussion in https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html Differential Revision: https://reviews.llvm.org/D108850	2022-01-16 08:57:57 -05:00
Fangrui Song	7c269db779	[lld-macho] Simplify DeduplicatedCStringSection::finalizeContents. NFC Tail merge is slow and of low value. With regular string deduplication, we can just use the return value of StringTableBuilder::add. There is no noticeable performance increase because without deduplication `__cstring` is quite small (7.6MiB for chromium_framework). Reviewed By: #lld-macho, Jez Ng Differential Revision: https://reviews.llvm.org/D117273	2022-01-14 13:12:57 -08:00
Juergen Ributzka	3025c3eded	Replace PlatformKind with PlatformType. The PlatformKind/PlatformType enums contain the same information, which requires them to be kept in-sync. This commit changes over to PlatformType as the sole source of truth, which allows the removal of the redundant PlatformKind. The majority of the changes were in LLD and TextAPI. Reviewed By: cishida Differential Revision: https://reviews.llvm.org/D117163	2022-01-13 09:23:49 -08:00
Kazu Hirata	b12fd13812	Fix bugprone argument comments. Identified by bugprone-argument-comment.	2022-01-09 12:21:02 -08:00
Jez Ng	098430cd25	[lld-macho][nfc] Simplify LC_DATA_IN_CODE generation 1. After D113241, we have the section address easily accessible and no longer need to iterate across the LC_SEGMENT commands to emit LC_DATA_IN_CODE. 2. There's no need to store a pointer to the data in code entries during the parse step; we can just look it up as part of the output step. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D115556	2021-12-11 01:01:57 -05:00
Vy Nguyen	944071eca2	[lld-macho] Don't replace local personality symbol with LazySymbol Follup-up to D107533, where we replaced local syms with non-local. It doesn't make sense to replace local symbol with lazy. Differential Revision: https://reviews.llvm.org/D110040	2021-11-22 14:09:54 -05:00
Greg McGary	3a1b3c9afe	[lld-macho][nfc] rename parsed-section types & variables This is an NFC diff that prepares for pruning & relocating `__eh_frame`. Along the way, I made the following changes to ... * clarify usage of `section` vs. `subsection` * remove `map` & `vec` from type names * disambiguate class `Section` from template parameter `SectionHeader`. Differential Revision: https://reviews.llvm.org/D113241	2021-11-16 07:06:41 -07:00
Jez Ng	a271f2410f	[lld-macho][nfc] Canonicalize all pointers to InputSections early on Having to remember to call `canonical()` all over the place is error-prone; let's do it in a centralized location instead. It also appears to improve performance slightly. base diff difference (95% CI) sys_time 0.984 ± 0.009 0.983 ± 0.014 [ -0.8% .. +0.6%] user_time 6.508 ± 0.035 6.475 ± 0.036 [ -0.8% .. -0.2%] wall_time 5.321 ± 0.034 5.300 ± 0.033 [ -0.7% .. -0.1%] samples 36 23 Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D112687	2021-10-29 11:00:28 -04:00
Vincent Lee	d54360cd32	[lld-macho] Implement -S There are a couple internal builds that require the use of this flag. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D112594	2021-10-27 17:09:57 -07:00
Nuri Amari	a299b24712	Regenerate LC_CODE_SIGNATURE during llvm-objcopy operations Context: This is a second attempt at introducing signature regeneration to llvm-objcopy. In this diff: https://reviews.llvm.org/D109840, a script was introduced to test the validity of a code signature. In this diff: https://reviews.llvm.org/D109803 (now reverted), an effort was made to extract the signature generation behavior out of LLD into a common location for use in llvm-objcopy. In this diff: https://reviews.llvm.org/D109972 it was decided that there was no appropriate common location and that a small amount of duplication to bring signature generation to llvm-objcopy would be better. This diff introduces this duplication. Summary Prior to this change, if a LC_CODE_SIGNATURE load command was included in the binary passed to llvm-objcopy, the command and associated section were simply copied and included verbatim in the new binary. If rest of the binary was modified at all, this results in an invalid Mach-O file. This change regenerates the signature rather than copying it. The code_signature_lc.test test was modified to include the yaml representation of a small signed MachO executable in order to effectively test the signature generation. Reviewed By: alexander-shaposhnikov, #lld-macho Differential Revision: https://reviews.llvm.org/D111164	2021-10-26 14:51:13 -07:00
Jez Ng	002eda7056	[lld-macho] Associate compact unwind entries with function symbols Compact unwind entries (CUEs) contain pointers to their respective function symbols. However, during the link process, it's far more useful to have pointers from the function symbol to the CUE than vice versa. This diff adds that pointer in the form of `Defined::compactUnwind`. In particular, when doing dead-stripping, we want to mark CUEs live when their function symbol is live; and when doing ICF, we want to dedup sections iff the symbols in that section have identical CUEs. In both cases, we want to be able to locate the symbols within a given section, as well as locate the CUEs belonging to those symbols. So this diff also adds `InputSection::symbols`. The ultimate goal of this refactor is to have ICF support dedup'ing functions with unwind info, but that will be handled in subsequent diffs. This diff focuses on simplifying `-dead_strip` -- `findFunctionsWithUnwindInfo` is no longer necessary, and `Defined::isLive()` is now a lot simpler. Moreover, UnwindInfoSection no longer has to check for dead CUEs -- we simply avoid adding them in the first place. Additionally, we now support stripping of dead LSDAs, which follows quite naturally since `markLive()` can now reach them via the CUEs. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D109944	2021-10-26 16:04:15 -04:00
Jez Ng	622150ad5f	[lld-macho] Put GOT into `__DATA` segment where appropriate We were previously always emitting the GOT into `__DATA_CONST`, even for target platforms where it should end up in `__DATA`. I stumbled onto this while trying to use the `class-dump` tool -- with the wrong segment names, it fails to locate the ObjC runtime info and therefore fails to dump any classes. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D112500	2021-10-26 11:38:01 -04:00
Vy Nguyen	236197e2d0	[lld-macho] Implement -oso_prefix https://bugs.llvm.org/show_bug.cgi?id=50229 Differential Revision: https://reviews.llvm.org/D112291	2021-10-22 16:32:42 -04:00
Nico Weber	4e572db0c2	[lld/mac] Mark private externs with GOT relocs as LOCAL in indirect symbtab prepareSymbolRelocation() in Writer.cpp adds both symbols that need binding and symbols relocated with a pointer relocation to the got. Pointer relocations are emitted for non-movq GOTPCREL(%rip) loads. (movqs become GOT_LOADs so that the linker knows they can be relaxed to leaqs, while others, such as addq, become just GOT -- a pointer relocation -- since they can't be relaxed in that way). For example, this C file produces a private_extern GOT relocation when compiled with -O2 with clang: extern const char kString[]; const char* g(int a) { return kString + a; } Linkers need to put pointer-relocated symbols into the GOT, but ld64 marks them as LOCAL in the indirect symbol table. This matters, since `strip -x` looks at the indirect symbol table when deciding what to strip. The indirect symtab emitting code was assuming that only symbols that need binding are in the GOT, but pointer relocations where there too. Hence, the code needs to explicitly check if a symbol is a private extern. Fixes https://crbug.com/1242638, which has some more information in comments 14 and 15. With this patch, the output of `nm -U` on Chromium Framework after stripping now contains just two symbols when using lld, just like with ld64. Differential Revision: https://reviews.llvm.org/D111852	2021-10-15 13:24:47 -04:00
Daniel Rodríguez Troitiño	657f02d458	Revert "Extract LC_CODE_SIGNATURE related implementation out of LLD" This reverts commit `cc8229603b`. As discussed in the review of https://reviews.llvm.org/D109972, this was not right approach, so we are reverting to start with a different approach. Differential Revision: https://reviews.llvm.org/D110974	2021-10-01 17:19:50 -07:00
Mike Hommey	08ef24f6ab	Wrap xar/xar.h include in extern "C" block Without such wrapping, linking lld fails with missing symbols because of C++ symbol mangling with older versions of the MacOSX SDK, in which xar.h doesn't have an extern "C" block itself. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D110224	2021-09-23 09:37:30 +02:00
Nuri Amari	cc8229603b	Extract LC_CODE_SIGNATURE related implementation out of LLD Move the functionality in lld that handles writing of the LC_CODE_SIGNATURE load command and associated data section to a central reusable location. This change is in preparation for another change that modifies llvm-objcopy to reproduce the LC_CODE_SIGNATURE load command and corresponding data section to maintain the validity of signed macho object files passed through llvm-objcopy. Reviewed By: #lld-macho, int3, oontvoo Differential Revision: https://reviews.llvm.org/D109803	2021-09-16 17:43:39 -07:00
Nico Weber	9482aa98e5	[lld/mac] Let OutputSegment store its start address segment$start$/segment$end$ symbols allow creating segments without sections, so getting the segment address off the first section won't work there. Storing the address on the segment is arguably a bit simpler too. No behavior change, part of PR50760. Differential Revision: https://reviews.llvm.org/D106665	2021-07-23 11:43:25 -04:00

1 2 3 4

187 Commits