llvm-project

Commit Graph

Author	SHA1	Message	Date
Jez Ng	7bbdbacd00	[lld-macho] Use export trie instead of symtab when linking against dylibs Summary: This allows us to link against stripped dylibs. Moreover, it's simply more correct: The symbol table includes symbols that the dylib uses but doesn't export. This temporarily regresses our ability to do lazy symbol binding because dyld_stub_binder isn't in libSystem's export trie. Rather, it is in one of the sub-libraries libSystem re-exports. (This doesn't affect our tests since we are mocking out dyld_stub_binder there.) A follow-up diff will address this by adding support for sub-libraries. Depends on D79114. Reviewers: ruiu, pcc, MaskRay, smeenai, alexshap, gkm, Ktwu, christylee Subscribers: mgorny, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79226	2020-05-09 20:56:22 -07:00
Jez Ng	5d3feefa0d	[lld-macho] Dylib symbols should always replace undefined symbols Summary: Otherwise we get undefined symbol errors depending on the order of arguments on the command line. Depends on D78270. Reviewers: ruiu, pcc, MaskRay, smeenai, alexshap, gkm, Ktwu, christylee Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79114	2020-05-09 20:56:22 -07:00
Jez Ng	b3e2fc931d	[lld-macho] Support calls to functions in dylibs Summary: This diff implements lazy symbol binding -- very similar to the PLT mechanism in ELF. ELF's .plt section is broken up into two sections in Mach-O: StubsSection and StubHelperSection. Calls to functions in dylibs will end up calling into StubsSection, which contains indirect jumps to addresses stored in the LazyPointerSection (the counterpart to ELF's .plt.got). Initially, the LazyPointerSection contains addresses that point into one of the entry points in the middle of the StubHelperSection. The code in StubHelperSection will push on the stack an offset into the LazyBindingSection. The push is followed by a jump to the beginning of the StubHelperSection (similar to PLT0), which then calls into dyld_stub_binder. dyld_stub_binder is a non-lazily bound symbol, so this call looks it up in the GOT. The stub binder will look up the bind opcodes in the LazyBindingSection at the given offset. The bind opcodes will tell the binder to update the address in the LazyPointerSection to point to the symbol, so that subsequent calls don't have to redo the symbol resolution. The binder will then jump to the resolved symbol. Depends on D78269. Reviewers: ruiu, pcc, MaskRay, smeenai, alexshap, gkm, Ktwu, christylee Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78270	2020-05-09 20:56:22 -07:00
Jez Ng	db157d2733	[lld-macho] Follow-up to D77893 Summary: 1. Don't have isHidden() depend on isNeeded(). Whether a section is hidden is orthogonal from whether it is needed: hidden sections will never have a header regardless of whether they have a body. (I know we override this method with return false for synthetic sections, but regardless I think it's confusing to write it this way for non-synthetic sections.) 2. Don't call writeTo() on unneeded sections. D78270 assumes that this is true when implementing the stub helper section. 3. Filter out the unneeded sections early on to avoid having to deal with them in multiple places. 4. Remove assumption in test that the referenced file has no other symbols. (We should create separate input files for future tests to avoid such issues.) Reviewers: ruiu, pcc, MaskRay, smeenai, alexshap, gkm, Ktwu, christylee Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79460	2020-05-09 20:56:22 -07:00
Reid Kleckner	77ecf90c52	[COFF] Migrate COFFObjectFile to Expected<T> I noticed that std::error_code() does one-time initialization. Avoid that overhead with Expected<T> and llvm::Error. Also, it is consistent with the virtual interface and ELF, and generally cleaner. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D79643	2020-05-08 14:01:39 -07:00
Thomas Lively	a1ae9566ea	[WebAssembly] Disallow 'shared-mem' rather than 'atomics' Summary: The WebAssembly backend automatically lowers atomic operations and TLS to nonatomic operations and non-TLS data when either are present and the atomics or bulk-memory features are not present, respectively. The resulting object is no longer thread-safe, so the linker has to be told not to allow it to be linked into a module with shared memory. This was previously done by disallowing the 'atomics' feature, which prevented any objct with its atomic operations or TLS removed from being linked with any object containing atomics or TLS, and therefore preventing it from being linked into a module with shared memory since shared memory requires atomics. However, as of https://github.com/WebAssembly/threads/issues/144, the validation rules are relaxed to allow atomic operations to validate with unshared memories, which makes it perfectly safe to link an object with stripped atomics and TLS with another object that still contains TLS and atomics as long as the resulting module has an unshared memory. To allow this kind of link, this patch disallows a pseudo-feature 'shared-mem' rather than 'atomics' to communicate to the linker that the object is not thread-safe. This means that the 'atomics' feature is available to accurately reflect whether or not an object has atomics enabled. As a drive-by tweak, this change also requires that bulk-memory be enabled in addition to atomics in order to use shared memory. This is because initializing shared memories requires bulk-memory operations. Reviewers: aheejin, sbc100 Subscribers: dschuff, jgravelle-google, hiraditya, sunfish, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79542	2020-05-08 13:52:39 -07:00
Wei Mi	538208f6c0	[lld] Add a new output section ".text.unknown" for funtions with unknown hotness For sampleFDO, because the optimized build uses profile generated from previous release, often we couldn't tell a function without profile was truely cold or just newly created so we had to treat them conservatively and put them in .text section instead of .text.unlikely. The result was when we persue the best performance by locking .text.hot and .text in memory, we wasted a lot of memory to keep cold functions inside. This problem has been largely solved for regular sampleFDO using profile-symbol-list (https://reviews.llvm.org/D66374), but for the case when we use partial profile, we still waste a lot of memory because of it. In https://reviews.llvm.org/D62540, we propose to save functions with unknown hotness information in a special section called ".text.unknown", so that compiler will treat those functions as luck-warm, but runtime can choose not to mlock the special section in memory or use other strategy to save memory. That will solve most of the memory problem even if we use a partial profile. The patch adds the support in lld for the special section.For sampleFDO, because the optimized build uses profile generated from previous release, often we couldn't tell a function without profile was truely cold or just newly created so we had to treat them conservatively and put them in .text section instead of .text.unlikely. The result was when we persue the best performance by locking .text.hot and .text in memory, we wasted a lot of memory to keep cold functions inside. This problem has been largely solved for regular sampleFDO using profile-symbol-list (https://reviews.llvm.org/D66374), but for the case when we use partial profile, we still waste a lot of memory because of it. In https://reviews.llvm.org/D62540, we propose to save functions with unknown hotness information in a special section called ".text.unknown", so that compiler will treat those functions as luck-warm, but runtime can choose not to mlock the special section in memory or use other strategy to save memory. That will solve most of the memory problem even if we use a partial profile. The patch adds the support in lld for the special section. Differential Revision: https://reviews.llvm.org/D79590	2020-05-08 11:14:48 -07:00
Reid Kleckner	3b3e28a07c	[PDB] Optimize public symbol processing Reduces time to link PGO instrumented net_unittets.exe by 11% (9.766s -> 8.672s, best of three). Reduces peak memory by 65.7MB (2142.71MB -> 2076.95MB). Use a more compact struct, BulkPublic, for faster sorting. Sort in parallel. Construct the hash buckets in parallel. Try to use one vector to hold all the publics instead of copying them from one to another. Allocate all the memory needed to serialize publics up front, and then serialize them in place in parallel. Reviewed By: aganea, hans Differential Revision: https://reviews.llvm.org/D79467	2020-05-08 10:23:27 -07:00
Fangrui Song	e20a215992	[ELF] Add convenience TableGen classes to enforce two dashes for options not supported by GNU ld Announced on https://lists.llvm.org/pipermail/llvm-dev/2020-May/141416.html For many options, we have to support either one or two dash to be compatible with GNU ld. For newer and lld specific options, we can enforce strict double dashes. Affected options: * --thinlto-* * --lto-* * --shuffle-sections= This patch does not change `-plugin-opt=` because clang driver passes `-plugin-opt=` and I don't intend to cause churn. In 2000, GNU ld tried something similar with --omagic https://sourceware.org/git/?p=binutils-gdb.git;a=commit;h=e4897a3288f37d5f69e8acd256a6e83e607fe8d8 Reviewed By: tejohnson, psmith Differential Revision: https://reviews.llvm.org/D79371	2020-05-08 07:37:06 -07:00
Reid Kleckner	d71c3c425c	[COFF] Dump string table size for COFF file headers I couldn't find this info in any other dumper, so it might as well be here.	2020-05-06 15:48:36 -07:00
Sam Clegg	f03b6e785b	[lld][WebAssembly] Honor --allow-undefined for data symbols too This was originally the way this worked before before https://reviews.llvm.org/D60882. In retrospect it seems inconsistent that `--allow-undefined` doesn't work for all symbols. See: https://groups.google.com/g/emscripten-discuss/c/HSRgQiIq1gI/m/Kt9oFWHiAwAJ I'm also planning a followup change which implement the full `--unresolved-symbols=..` flags supported by ELF linkers (both ld and ld.lld) since it seems more standard. Differential Revision: https://reviews.llvm.org/D79247	2020-05-06 12:39:29 -07:00
Alexandre Ganea	6adc45d3fd	[LLD][COFF] Move debug info for thread-local variables into PDB global stream Before this patch, the debug record S_GTHREAD32 which represents global thread_local symbols, was emitted by LLD into the respective module stream. This makes Visual Studio unable to display thread_local symbols in the debugger. After this patch, S_GTHREAD32 is moved into the globals stream. This matches MSVC behavior. Differential Revision: https://reviews.llvm.org/D79005	2020-05-06 15:23:58 -04:00
Reid Kleckner	932f0276ea	[Support] Move LLD's parallel algorithm wrappers to support Essentially takes the lld/Common/Threads.h wrappers and moves them to the llvm/Support/Paralle.h algorithm header. The changes are: - Remove policy parameter, since all clients use `par`. - Rename the methods to `parallelSort` etc to match LLVM style, since they are no longer C++17 pstl compatible. - Move algorithms from llvm::parallel:: to llvm::, since they have "parallel" in the name and are no longer overloads of the regular algorithms. - Add range overloads - Use the sequential algorithm directly when 1 thread is requested (skips task grouping) - Fix the index type of parallelForEachN to size_t. Nobody in LLVM was using any other parameter, and it made overload resolution hard for for_each_n(par, 0, foo.size(), ...) because 0 is int, not size_t. Remove Threads.h and update LLD for that. This is a prerequisite for parallel public symbol processing in the PDB library, which is in LLVM. Reviewed By: MaskRay, aganea Differential Revision: https://reviews.llvm.org/D79390	2020-05-05 15:21:05 -07:00
Sid Manning	0e6536fd97	[Hexagon] Add R_HEX_GD_PLT_B22/32_PCREL relocations Extended versions of GD_PLT_B22_PCREL. These surface when -mlong-calls is used. Differential Revision: https://reviews.llvm.org/D79191	2020-05-05 11:47:51 -05:00
Peter Smith	48aebfc908	[ELF][ARM] Do not create .ARM.exidx sections for out of range inputs A linker will create .ARM.exidx sections for InputSections that don't have them. This can cause a relocation out of range error If the InputSection happens to be extremely far away from the other sections. This is often the case for the vector table on older ARM CPUs as the only two places that the table can be placed is 0 or 0xffff0000. We fix this by removing InputSections that need a linker generated .ARM.exidx section if that would cause an error. Differential Revision: https://reviews.llvm.org/D79289	2020-05-05 09:59:45 +01:00
Martin Storsjö	5a1c30177f	[LLD] [COFF] Fix a typo in an assert message. NFC.	2020-05-05 11:46:50 +03:00
Zakk Chen	ad5fad0ac5	[LTO] Suppress emission of empty combined module by default Summary: That unless the user requested an output object (--lto-obj-path), the an unused empty combined module is not emitted. This changed is helpful for some target (ex. RISCV-V) which encoded the ABI info in IR module flags (target-abi). Empty unused module has no ABI info so the linker would get the linking error during merging incompatible ABIs. Reviewers: tejohnson, espindola, MaskRay Subscribers: emaste, inglorion, arichardson, hiraditya, simoncook, MaskRay, steven_wu, dexonsmith, PkmX, dang, lenary, s.egerton, luismarques, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78988	2020-05-04 18:31:09 -07:00
Reid Kleckner	2868ee5b32	[PDB] Use the global BumpPtrAllocator Profiling shows that time is spent destroying the allocator member of PDBLinker, and that is unneeded.	2020-05-04 16:15:36 -07:00
Fangrui Song	6939fe6e08	[lld-macho] Support X86_64_RELOC_SIGNED_{1,2,4} We currently only support extern relocations. `X86_64_RELOC_SIGNED_{1,2,4}` are like X86_64_RELOC_SIGNED, but with the implicit addend fixed to 1, 2, and 4, respectively. See the comment in `lib/Target/X86/MCTargetDesc/X86MachObjectWriter.cpp RecordX86_64Relocation`. Reviewed By: int3 Differential Revision: https://reviews.llvm.org/D79311	2020-05-04 15:15:35 -07:00
Fangrui Song	c49f83b6e9	[ELF] Don't advance sh_offset for an empty section whose PT_LOAD is removed (due to p_memsz=0) removeEmptyPTLoad() removes empty (p_memsz=0) PT_LOAD segments. In assignFileOffsets(), setFileOffset() unnecessarily advances file offsets for containing empty sections. This is exposed by arm Linux kernel's multi_v5_defconfig (see https://bugs.llvm.org/show_bug.cgi?id=45632) ``` ld.lld (max-page-size=65536): [34] .init.data PROGBITS c0c24000 c34000 0128ac 00 WA 0 0 4096 [35] .text_itcm PROGBITS fffe0000 c50000 000000 00 WA 0 0 1 [36] .data_dtcm PROGBITS fffe8000 c58000 000000 00 WA 0 0 1 [37] .data PROGBITS c0c38000 c58000 0647a0 00 WA 0 0 32 arm-linux-gnueabi-ld (max-page-size=65536): [23] .init.data PROGBITS c0c12000 c22000 0128ac 00 WA 0 0 4096 [24] .text_itcm PROGBITS fffe0000 ca2558 000000 00 W 0 0 1 [25] .data_dtcm PROGBITS fffe8000 ca2558 000000 00 W 0 0 1 [26] .data PROGBITS c0c26000 c36000 0647a0 00 WA 0 0 32 ``` This patch clears OutputSection::ptLoad if ptLoad is removed by removeEmptyPTLoad(). Conceptually this removes "dangling" references. Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D79254	2020-05-04 08:07:34 -07:00
Reid Kleckner	fce5457a14	[COFF] Avoid allocating temporary vectors during ICF Heap profiling with ETW shows that LLD performs 4,053,721 heap allocations over its lifetime, and ~800,000 of them come from assocEquals. These vectors are created just to do a comparison, so fuse the comparison into the loop and avoid the allocation. ICF is overall a small portion of the time spent linking, and I did not measure overall throughput improvements from this change above the noise threshold. However, these show up in the heap profiler, and the work is done, so we might as well land it if the code is clear enough. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D79297	2020-05-04 07:01:14 -07:00
Peter Smith	3834385f27	[ELF] Move SHF_LINK_ORDER till OutputSection addresses are known Sections with the SHF_LINK_ORDER flag must be ordered in the same relative order as the Sections they have a link to. When using a linker script an arbitrary expression may be used for the virtual address of the OutputSection. In some cases the virtual address does not monotonically increase as the OutputSection index increases, so if we base the ordering of the SHF_LINK_ORDER sections on the index then we can get the order wrong. We fix this by moving SHF_LINK_ORDER resolution till after we have created OutputSection virtual addresses. Differential Revision: https://reviews.llvm.org/D79286	2020-05-04 14:25:25 +01:00
Reid Kleckner	9b7f6146bd	[COFF] Paritally inline Symbol::getName, NFC	2020-05-03 07:58:05 -07:00
Reid Kleckner	1e5793345b	Re-land "[PDB] Avoid calling discoverTypeIndices for a known record kind" Fixed bad usage of slice API causing assertion failures. Reverts `810c8e9b49` Reinstates `bd7ea8641e`	2020-05-02 18:39:33 -07:00
Nico Weber	810c8e9b49	Revert "[PDB] Avoid calling discoverTypeIndices for a known record kind" This reverts commit `bd7ea8641e`. Breaks check-lld everywhere.	2020-05-02 21:06:06 -04:00
Reid Kleckner	bd7ea8641e	[PDB] Avoid calling discoverTypeIndices for a known record kind This particular overload allocates memory, and we do this for every S_[GL]PROC32_ID record. Instead, hardcode the offset of the typeindex that we are looking for in the LF_[MEM]FUNC_ID record. We already assumed that looking up the item index already found a record of this kind.	2020-05-02 15:51:08 -07:00
Reid Kleckner	3542384ae9	[COFF] Use a global option table to avoid reconstructing it Otherwise an ArgumentParser is constructed for every directive section, and that involves copying the entire table of options into a vector. There is no need for this, just have one option table.	2020-05-02 15:04:19 -07:00
Thomas Preud'homme	d735c7048c	[test] Fix lld's ELF/linkerscript/thunk-gen-mips.s Summary: Lld test ELF/linkerscript/thunk-gen-mips.s was accidentally disabled due to the use of wrong FileCheck directives. As a result the test seems to have bitrotted as it fails to pass if fixing the directive. To ease updates to the test in case of change of the __start address the checks have been changed to use numeric variables to express all the addresses based on the __start address. Reviewed By: atanasyan Differential Revision: https://reviews.llvm.org/D79270	2020-05-02 22:49:23 +01:00
Reid Kleckner	270d3faf6e	[COFF] Add and use a zero-copy tokenizer for .drectve This generalizes the main Windows command line tokenizer to be able to produce StringRef substrings as well as freshly copied C strings. The implementation is still shared with the normal tokenizer, which is important, because we have unit tests for that. .drective sections can be very long. They can potentially list up to every symbol in the object file by name. It is worth avoiding these string copies. This saves a lot of memory when linking chrome.dll with PGO instrumentation: BEFORE AFTER % IMP peak memory: 6657.76MB 4983.54MB -25% real: 4m30.875s 2m26.250s -46% The time improvement may not be real, my machine was noisy while running this, but that the peak memory usage improvement should be real. This change may also help apps that heavily use dllexport annotations, because those also use linker directives in object files. Apps that do not use many directives are unlikely to be affected. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D79262	2020-05-02 10:47:02 -07:00
Kellie Medlin	6cb073133c	[lld] Merge Mach-O input sections Summary: Similar to other formats, input sections in the MachO implementation are now grouped under output sections. This is primarily a refactor, although there's some new logic (like resolving the output section's flags based on its inputs). Differential Revision: https://reviews.llvm.org/D77893	2020-05-01 16:57:18 -07:00
Fangrui Song	5c86b08a6f	[ELF][test] Improve tests Prepare for the upcomong change that removes unneeded sh_offset advancement for empty sections whose PT_LOAD are removed.	2020-05-01 11:27:51 -07:00
Sam Clegg	0a6c4d8d2e	[WebAssmebly] Add support for defined wasm globals in MC and lld This change add support for defined wasm globals in the .s format, the MC layer, and wasm-ld Currently there is no support custom initialization and all wasm globals are initialized to zero. Fixes: PR45742 Differential Revision: https://reviews.llvm.org/D79137	2020-04-30 12:43:15 -07:00
Thomas Preud'homme	9ecddde321	[test] Fix ELF/linkerscript/input-archive.s w/ @ in path Lld test ELF/linkerscript/input-archive.s fails when path contain a @ because is not accepted in unquoted token in linker scripts which leads to the path being broken in 2 around the @. This commit quotes the path used in the linker script created by this and similar testcases allowing the test to pass even in the presence of an @ sign in the path. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D79103	2020-04-30 20:14:22 +01:00
Fangrui Song	b257d3c8a8	[ELF][PPC64] Suppress toc-indirect to toc-relative relaxation if R_PPC64_TOC16_LO is seen The current implementation assumes that R_PPC64_TOC16_HA is always followed by R_PPC64_TOC16_LO_DS. This can break with R_PPC64_TOC16_LO: // Load the address of the TOC entry, instead of the value stored at that address addis 3, 2, .LC0@tloc@ha # R_PPC64_TOC16_HA addi 3, 3, .LC0@tloc@l # R_PPC64_TOC16_LO blr which is used by boringssl's util/fipstools/delocate/delocate.go https://github.com/google/boringssl/blob/master/crypto/fipsmodule/FIPS.md has some documentation. In short, this tool converts an assembly file to avoid any potential relocations. The distance to an input .toc is not a constant after linking, so it cannot use an `addis;ld` pair. Instead, it jumps to a stub which loads the TOC entry address with `addis;addi`. This patch checks the presence of R_PPC64_TOC16_LO and suppresses toc-indirect to toc-relative relaxation if R_PPC64_TOC16_LO is seen. This approach is conservative and loses some relaxation opportunities but is easy to implement. addis 3, 2, .LC0@toc@ha # no relaxation addi 3, 3, .LC0@toc@l # no relaxation li 9, 0 addis 4, 2, .LC0@toc@ha # can relax but suppressed ld 4, .LC0@toc@l(4) # can relax but suppressed Also note that interleaved R_PPC64_TOC16_HA and R_PPC64_TOC16_LO_DS is possible and this patch accounts for that. addis 3, 2, .LC1@toc@ha # can relax addis 4, 2, .LC2@toc@ha # can relax ld 3, .LC1@toc@l(3) # can relax ld 4, .LC2@toc@l(4) # can relax Reviewed By: #powerpc, sfertile Differential Revision: https://reviews.llvm.org/D78431	2020-04-30 09:16:51 -07:00
Fangrui Song	b912b887d8	[ELF] Add --print-archive-stats= gold has an option --print-symbol-counts= which prints: // For each archive archive $archive $members $fetched_members // For each object file symbols $object $defined_symbols $used_defined_symbols In most cases, `$defined_symbols = $used_defined_symbols` unless weak symbols are present. Strangely `$used_defined_symbols` includes symbols defined relative to --gc-sections discarded sections. The `symbols` lines do not appear to be useful. `archive` lines are useful: `$fetched_members=0` lines correspond to unused archives. The information can be used to trim dependencies. This patch implements --print-archive-stats= which prints the number of members and the number of fetched members for each archive. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D78983	2020-04-29 18:04:37 -07:00
Fangrui Song	e96d7b5e9e	[ELF] Add --rosegment to complement --no-rosegment This option can cancel --no-rosegment and it just seems right to have a corresponding positive option for a --no-* negative option. Anecdotally, gold had --rosegment but did not have --no-rosegment. I added --no-rosegment (https://sourceware.org/git/?p=binutils-gdb.git;a=commit;h=9a6c68caa9543e09b064b7ac7c2b658f277bc19c) for binutils>=2.35	2020-04-29 18:00:00 -07:00
Jez Ng	e82c5e17b5	[lld-macho] Support X86_64_RELOC_BRANCH Relatively straightforward diff, to set the stage for calling functions in dylibs. Differential Revision: https://reviews.llvm.org/D78269	2020-04-29 15:45:01 -07:00
Jez Ng	df92377823	[lld-macho] Have Symbol::getVA() return a non-relative virtual address Currently, getVA() returns a virtual address with the assumption that the ImageBase is zero. As I understand, this is what lld-ELF is doing. However, under our current design, it seems like an awkward setup -- I'm finding that I have to add and subtract ImageBase in several places to make things work out. As such, I think it's simpler to have getVA() return a non-relative VA, but I'm not sure if I'm missing something. Would love to hear more from folks familiar with lld-ELF. Differential Revision: https://reviews.llvm.org/D78168	2020-04-29 15:44:50 -07:00
Jez Ng	918948db4d	[lld-macho] Support reading of universal binaries Differential Revision: https://reviews.llvm.org/D77006	2020-04-29 15:44:44 -07:00
Jez Ng	89285a1a97	[lld-macho] Disable colors in errors when not printing to a pty This makes for better tests (and is just the right thing to do) Differential Revision: https://reviews.llvm.org/D79069	2020-04-29 15:44:35 -07:00
Jez Ng	9854edd817	[lld-macho] Implement basic export trie Build the trie by performing a three-way radix quicksort: We start by sorting the strings by their first characters, then sort the strings with the same first characters by their second characters, and so on recursively. Each time the prefixes diverge, we add a node to the trie. Thanks to @ruiu for the idea. I used llvm-mc's radix quicksort implementation as a starting point. The trie offset fixpoint code was taken from MachONormalizedFileBinaryWriter.cpp. Differential Revision: https://reviews.llvm.org/D76977	2020-04-29 15:44:27 -07:00
Fangrui Song	1ccde53342	[ELF] --gdb-index: support .debug_loclists --gdb-index currently crashes when reading a translation unit with DWARF v5 .debug_loclists . Call stack: ``` SyntheticSections.cpp GdbIndexSection::create SyntheticSections.cpp readAddressAreas DWARFUnit.cpp DWARFUnit::tryExtractDIEsIfNeeded DWARFListTable.cpp DWARFListTableHeader::extract ... DWARFDataExtractor.cpp DWARFDataExtractor::getRelocatedValue lld/ELF/DWARF.cpp LLDDwarfObj<ELFT>::find (sec.sec is nullptr) ... ``` This patch adds support for .debug_loclists to make `DWARFUnit::tryExtractDIEsIfNeeded` happy. Building --gdb-index does not need .debug_loclists Reviewed By: dblaikie, grimar Differential Revision: https://reviews.llvm.org/D79061	2020-04-29 15:04:13 -07:00
Fangrui Song	53ff95254d	Reland D78837 [lld] Remove special cases from default ld driver mode. Drops the behavior from rL217112. Use the Gnu driver mode by default for all platforms when ld is invoked. Other names for the program (such as link or ld64) continue working as before. Reviewed By: MaskRay, srhines, smeenai, ruiu Differential Revision: https://reviews.llvm.org/D78837	2020-04-29 14:45:44 -07:00
Dan Albert	0a78e42b1f	Revert "[lld] Remove special cases from default ld driver mode." This reverts commit `da093c388f`. Broke a test on Darwin. Will fix the test and resubmit.	2020-04-29 14:14:51 -07:00
Dan Albert	da093c388f	[lld] Remove special cases from default ld driver mode. Summary: Use the Gnu driver mode by default for all platforms when ld is invoked. Other names for the program (such as link or ld64) continue working as before. Reviewers: MaskRay, int3, srhines, smeenai, ruiu Reviewed By: MaskRay, srhines, smeenai, ruiu Subscribers: smeenai, srhines, nickdesaulniers, llvm-commits Tags: #lld, #llvm Differential Revision: https://reviews.llvm.org/D78837	2020-04-29 12:28:29 -07:00
Sean Fertile	f9106e85c4	Revert "[ELF][PPC64] Don't perform toc-indirect to toc-relative relax... " This reverts commit `03ffe58605`. Full tile of reverted commit is: [ELF][PPC64] Don't perform toc-indirect to toc-relative relaxation for R_PPC64_TOC16_HA not followed by R_PPC64_TOC16_LO_DS Breaks the multistage lld PowerPC buildbot.	2020-04-29 10:30:35 -04:00
Saleem Abdulrasool	216833b32b	Revert "Temporarily revert "build: use `find_package(Python3)` if available"" This reverts commit `35edd704e0`. Revert the revert and extend the patch further to account for the use of the `PYTHONINTERP_FOUND`.	2020-04-29 01:38:08 +00:00
Jez Ng	62b8f32f76	[lld-macho][reland] Add support for emitting dylibs with a single symbol This got reverted due to UBSAN errors in a diff lower in the stack, which is being fixed in https://reviews.llvm.org/D79050. This diff is otherwise identical to the original https://reviews.llvm.org/D76908 (which was committed in `9598778bd1` and reverted in `b52bc2653b`). Differential Revision: https://reviews.llvm.org/D79051	2020-04-28 17:08:32 -07:00
Jez Ng	4f0cccdd7a	[lld-macho][reland] Add basic symbol table output This diff implements basic support for writing a symbol table. Attributes are loosely supported for extern symbols and not at all for other types. Initial version by Kellie Medlin <kelliem@fb.com> Originally committed in `a3d95a50ee` and reverted in `fbae153ca5` due to UBSAN erroring over unaligned writes. That has been fixed in the current diff with the following changes: ``` diff --git a/lld/MachO/SyntheticSections.cpp b/lld/MachO/SyntheticSections.cpp --- a/lld/MachO/SyntheticSections.cpp +++ b/lld/MachO/SyntheticSections.cpp @@ -133,6 +133,9 @@ SymtabSection::SymtabSection(StringTableSection &stringTableSection) : stringTableSection(stringTableSection) { segname = segment_names::linkEdit; name = section_names::symbolTable; + // TODO: When we introduce the SyntheticSections superclass, we should make + // all synthetic sections aligned to WordSize by default. + align = WordSize; } size_t SymtabSection::getSize() const { diff --git a/lld/MachO/Writer.cpp b/lld/MachO/Writer.cpp --- a/lld/MachO/Writer.cpp +++ b/lld/MachO/Writer.cpp @@ -371,6 +371,7 @@ void Writer::assignAddresses(OutputSegment seg) { ArrayRef<InputSection > sections = p.second; for (InputSection isec : sections) { addr = alignTo(addr, isec->align); + // We must align the file offsets too to avoid misaligned writes of + // structs. + fileOff = alignTo(fileOff, isec->align); isec->addr = addr; addr += isec->getSize(); fileOff += isec->getFileSize(); @@ -396,6 +397,7 @@ void Writer::writeSections() { uint64_t fileOff = seg->fileOff; for (auto &sect : seg->getSections()) { for (InputSection isec : sect.second) { + fileOff = alignTo(fileOff, isec->align); isec->writeTo(buf + fileOff); fileOff += isec->getFileSize(); } ``` I don't think it's easy to write a test for alignment (that doesn't involve brittly hard-coding file offsets), so there isn't one... but UBSAN builds pass now. Differential Revision: https://reviews.llvm.org/D79050	2020-04-28 17:07:06 -07:00
Eric Christopher	35edd704e0	Temporarily revert "build: use `find_package(Python3)` if available" as it seems to be causing multiple people problems with running tests and building. This reverts commit `c4c3883b00`.	2020-04-28 16:41:22 -07:00

1 2 3 4 5 ...

12988 Commits