llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	913914f0f8	[ELF] Simplify writing the Elf_Chdr header. NFC And avoiding changing `size` in `writeTo`.	2022-01-26 10:23:56 -08:00
Benjamin Kramer	f15014ff54	Revert "Rename llvm::array_lengthof into llvm::size to match std::size from C++17" This reverts commit `ef82063207`. - It conflicts with the existing llvm::size in STLExtras, which will now never be called. - Calling it without llvm:: breaks C++17 compat	2022-01-26 16:55:53 +01:00
serge-sans-paille	ef82063207	Rename llvm::array_lengthof into llvm::size to match std::size from C++17 As a consequence, move llvm::array_lengthof from STLExtras.h to STLForwardCompat.h (which is included by STLExtras.h so no build breakage expected).	2022-01-26 16:17:45 +01:00
Fangrui Song	3704abaa16	[ELF] --gdb-index: replace vector<uint8_t> with unique_ptr<uint8_t[]>. NFC	2022-01-25 23:53:23 -08:00
Fangrui Song	571d6a7120	[ELF] Optimize .relr.dyn to not grow vector<uint64_t>. NFC	2022-01-25 23:33:40 -08:00
Fangrui Song	9fac78d0e1	[ELF] Simplify and optimize .relr.dyn NFC	2022-01-25 22:50:03 -08:00
Fangrui Song	2a80c3dbe1	[ELF] Clarify that Z_BEST_SPEED==1 in a comment. NFC	2022-01-25 22:40:53 -08:00
Fangrui Song	07bd467643	[ELF] --build-id: replace vector<uint8_t> with unique_ptr<uint8_t[]>. NFC We can't use C++20 make_unique_for_overwrite yet.	2022-01-25 22:39:43 -08:00
Fangrui Song	7438dbe078	[ELF] Cast size to size_t. NFC To fix ../../chromeclang/bin/../include/c++/v1/__algorithm/min.h:39:1: note: candidate template ignored: deduced conflicting types for parameter '_Tp' ('unsigned long' vs. 'unsigned long long') on macOS arm64.	2022-01-25 22:38:24 -08:00
Fangrui Song	223f9dea3d	[ELF] maybeCompress: replace vector<uint8_t> with unique_ptr<uint8_t[]>. NFC And mention that it is zero-initialized. I do not notice a speed-up if changed to be uninitialized by forcing the zero filler in writeTo.	2022-01-25 22:15:44 -08:00
Puyan Lotfi	227d18b3a8	[lld][macho][NFC] Make MachO/start-end.s test less brittle by checking for _main: In start-end.s there is a lit check line `# SEG: _main` to begin the check at the start of the function main where `_main` is the Darwin name mangling for C main. Because the text file that FileCheck is getting as input has the path of the compiler build in it from llvm-mc and llvm-objdump, and because of the lack of a trailing colon in this check line we end up inadvertently matching against the line of text with the compiler path in it in the case where said path contains "_main" some place. This can be very likely if the compiler branch has "main" or "_main" in it. To fix this I include the trailing : since that will match on the function label and not the path line.	2022-01-25 19:23:51 -08:00
Fangrui Song	4cdc441690	[ELF] Parallelize --compress-debug-sections=zlib When linking a Debug build clang (265MiB SHF_ALLOC sections, 920MiB uncompressed debug info), in a --threads=1 link "Compress debug sections" takes 2/3 time and in a --threads=8 link "Compress debug sections" takes ~70% time. This patch splits a section into 1MiB shards and calls zlib `deflate` in parallel. DEFLATE blocks are a bit sequence. We need to ensure every shard starts at a byte boundary for concatenation. We use Z_SYNC_FLUSH for all shards but the last to flush the output to a byte boundary. (Z_FULL_FLUSH can be used as well, but Z_FULL_FLUSH clears the hash table which just wastes time.) The last block requires the BFINAL flag. We call deflate with Z_FINISH to set the flag as well as flush the output to a byte boundary. Under the hood, all of Z_SYNC_FLUSH, Z_FULL_FLUSH, and Z_FINISH emit a non-compressed block (called stored block in zlib). RFC1951 says "Any bits of input up to the next byte boundary are ignored." In a --threads=8 link, "Compress debug sections" is 5.7x as fast and the total speed is 2.54x. Because the hash table for one shard is not shared with the next shard, the output is slightly larger. Better compression ratio can be achieved by preloading the window size from the previous shard as dictionary (`deflateSetDictionary`), but that is overkill. 
```
# 1MiB shards
% bloaty clang.new -- clang.old
    FILE SIZE        VM SIZE
 --------------  --------------
  +0.3%  +129Ki  [ = ]       0    .debug_str
  +0.1%  +105Ki  [ = ]       0    .debug_info
  +0.3%  +101Ki  [ = ]       0    .debug_line
  +0.2% +2.66Ki  [ = ]       0    .debug_abbrev
  +0.0% +1.19Ki  [ = ]       0    .debug_ranges
  +0.1%  +341Ki  [ = ]       0    TOTAL

# 2MiB shards
% bloaty clang.new -- clang.old
    FILE SIZE        VM SIZE
 --------------  --------------
  +0.2% +74.2Ki  [ = ]       0    .debug_line
  +0.1% +72.3Ki  [ = ]       0    .debug_str
  +0.0% +69.9Ki  [ = ]       0    .debug_info
  +0.1%    +976  [ = ]       0    .debug_abbrev
  +0.0%    +882  [ = ]       0    .debug_ranges
  +0.0%  +218Ki  [ = ]       0    TOTAL
```
Bonus in not using zlib::compress
* we can compress a debug section larger than 4GiB
* peak memory usage is lower because for most shards the output size is less than 50% input size (all less than 55% for a large binary I tested, but decreasing the initial output size does not decrease memory usage)
Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D117853	2022-01-25 10:29:04 -08:00
Leonard Grey	a5c9d71780	[lld-macho] Move order file and call graph sorting into SectionPriorities See https://reviews.llvm.org/D117354 for context and discussion.	2022-01-25 12:18:15 -05:00
Leonard Grey	f23d57a632	[lld-macho] Rename CallGraphSort.{h,cpp} to SectionPriorities This is in preparation for moving the code that parses and processes order files into this file. See https://reviews.llvm.org/D117354 for context and discussion.	2022-01-25 12:15:14 -05:00
Fangrui Song	c03fdd3403	[ELF] Fix the branch range computation when reusing a thunk Notation: dst is `t->getThunkTargetSym()->getVA()` On AArch64, when `src-0x8000000-r_addend <= dst < src-0x8000000`, the condition `target->inBranchRange(rel.type, src, rel.sym->getVA(rel.addend))` may incorrectly consider a thunk reusable. `rel.addend = -getPCBias(rel.type)` resets the addend to 0 for AArch64/PPC and the zero addend is used by `rel.sym->getVA(rel.addend)` to check out-of-range relocations. See the test for a case this computation is wrong: `error: a.o:(.text_high+0x4): relocation R_AARCH64_JUMP26 out of range: -134217732 is not in [-134217728, 134217727]` I have seen a real world case with r_addend=19960. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D117734	2022-01-24 09:03:21 -08:00
serge-sans-paille	5f290c090a	Move STLFunctionalExtras out of STLExtras Only using that change in StringRef already decreases the number of preprocessed lines from 7837621 to 7776151 for LLVMSupport Perhaps more interestingly, it shows that many files were relying on the inclusion of StringRef.h to have the declaration from STLExtras.h. This patch tries hard to patch relevant part of llvm-project impacted by this hidden dependency removal. Potential impact: - "llvm/ADT/StringRef.h" no longer includes <memory>, "llvm/ADT/Optional.h" nor "llvm/ADT/STLExtras.h" Related Discourse thread: https://llvm.discourse.group/t/include-what-you-use-include-cleanup/5831	2022-01-24 14:13:21 +01:00
Peter Smith	a08447d0de	[LLD][ELF][AArch64] Update test with incorrect REQUIRES line [NFC] D54759 introduced aarch64-combined-dynrel.s and aarch64-combined-dynrel-ifunc.s . Unfortunately the requires line at the top was AArch64 instead of aarch64 which means they were never run. Update the tests to use aarch64 and fix to match current lld output. Differential Revision: https://reviews.llvm.org/D117896	2022-01-24 10:04:28 +00:00
Sam Clegg	ac2f3df839	[lld][WebAssembly] Remove redundant config setting Unresolved symbols are not currently reported when building with `-shared` or `-pie` so setting unresolvedSymbols doesn't have any effect. Differential Revision: https://reviews.llvm.org/D117737	2022-01-20 15:21:56 -08:00
Roger Kim	f84023a812	[lld][macho] Stop grouping symbols by sections in mapfile. As per [Bug 50689](https://bugs.llvm.org/show_bug.cgi?id=50689), ``` 2. getSectionSyms() puts all the symbols into a map of section -> symbols, but this seems unnecessary. This was likely copied from the ELF port, which prints a section header before the list of symbols it contains. But the Mach-O map file doesn't print these headers. ``` This diff removes `getSectionSyms()` and keeps all symbols in a flat vector. What does ld64's mapfile look like? ``` $ llvm-mc -filetype=obj -triple=x86_64-apple-darwin test.s -o test.o $ llvm-mc -filetype=obj -triple=x86_64-apple-darwin foo.s -o foo.o $ ld -map map test.o foo.o -o out -L/Library/Developer/CommandLineTools/SDKs/MacOSX.sdk/usr/lib -lSystem ``` ``` [ 0] linker synthesized [ 1] test.o [ 2] foo.o 0x100003FB7 0x00000001 __TEXT __text 0x100003FB8 0x00000000 __TEXT obj 0x100003FB8 0x00000048 __TEXT __unwind_info 0x100004000 0x00000001 __DATA __common 0x100003FB7 0x00000001 [ 1] _main 0x100003FB8 0x00000000 [ 2] _foo 0x100003FB8 0x00000048 [ 0] compact unwind info 0x100004000 0x00000001 [ 1] _number ``` Perf numbers when linking chromium framework on a 16-Core Intel Xeon W Mac Pro: ``` base diff difference (95% CI) sys_time 1.406 ± 0.020 1.388 ± 0.019 [ -1.9% .. -0.6%] user_time 5.557 ± 0.023 5.914 ± 0.020 [ +6.2% .. +6.6%] wall_time 4.455 ± 0.041 4.436 ± 0.035 [ -0.8% .. -0.0%] samples 35 35 ``` Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D114735	2022-01-20 12:16:37 -08:00
Alexandre Ganea	83d59e05b2	Re-land [LLD] Remove global state in lldCommon Move all variables at file-scope or function-static-scope into a hosting structure (lld::CommonLinkerContext) that lives at lldMain()-scope. Drivers will inherit from this structure and add their own global state, in the same way as for the existing COFFLinkerContext. See discussion in https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html The previous land `f860fe3622` caused issues in https://lab.llvm.org/buildbot/#/builders/123/builds/8383, fixed by `22ee510dac`. Differential Revision: https://reviews.llvm.org/D108850	2022-01-20 14:53:26 -05:00
John Ericson	df31ff1b29	[cmake] Make include(GNUInstallDirs) always below project(..) Its defaulting logic must go after `project(..)` to work correctly, but `project(..)` is often in a standalone condition making this awkward, since the rest of the condition code may also need GNUInstallDirs. The good thing is there are the various standalone booleans, which I had missed before. This makes splitting the conditional blocks less awkward. Reviewed By: arichardson, phosek, beanz, ldionne, #libunwind, #libc, #libc_abi Differential Revision: https://reviews.llvm.org/D117639	2022-01-20 18:59:17 +00:00
Sam Clegg	feddf11502	[lld][WebAssembly] Convert test to check disassembly output. NFC Differential Revision: https://reviews.llvm.org/D117739	2022-01-20 09:32:01 -08:00
Adrian Prantl	54ba376d08	Add missing include to fix modular build	2022-01-20 08:33:44 -08:00
Jez Ng	8f811effac	[lld-macho] Fix grammar in doc	2022-01-19 23:59:35 -08:00
Fangrui Song	a7a4115bf3	[ELF] Replace .zdebug string comparison with SHF_COMPRESSED check. NFC	2022-01-19 22:33:32 -08:00
Fangrui Song	03909c4400	[ELF] Remove StringRefZ StringRefZ does not improve performance. Non-local symbols always have eagerly computed nameSize. Most local symbols' lengths will be updated in either: * shouldKeepInSymtab * SymbolTableBaseSection::addSymbol Its benefit is offset by strlen in every call site (sums up to 5KiB code in a release x86-64 build), so using StringRefZ may be slower. In a -s link (uncommon) there is minor speedup, like ~0.3% for clang and chrome. Reviewed By: alexander-shaposhnikov Differential Revision: https://reviews.llvm.org/D117644	2022-01-19 20:09:41 -08:00
Alexandre Ganea	aba5b91b69	Re-land [CodeView] Add full repro to LF_BUILDINFO record This patch writes the full -cc1 command into the resulting .OBJ, like MSVC does. This allows for external tools (Recode, Live++) to rebuild a source file without any external dependency but the .OBJ itself (other than the compiler) and without knowledge of the build system. The LF_BUILDINFO record stores a full path to the compiler, the PWD (CWD at program startup), a relative or absolute path to the source, and the full CC1 command line. The stored command line is self-standing (does not depend on the environment). In the same way, MSVC doesn't exactly store the provided command-line, but an expanded version (a somehow equivalent of CC1) which is also self-standing. For more information see PR36198 and D43002. Differential Revision: https://reviews.llvm.org/D80833	2022-01-19 19:44:37 -05:00
Jez Ng	ef95d45138	[lld-macho] Mention string literal deduplication as a difference from ld64 Reviewed By: keith Differential Revision: https://reviews.llvm.org/D117250	2022-01-19 16:30:52 -08:00
Keith Smiley	3f38dc5c04	[lld-macho] Silence XAR deprecation warning If you're building this on macOS 12.x+ this produces a deprecation warning. I'm not sure what this means for the bitcode format going forward, but it seems safe to silence for now. Do we need to worry about GCC for this? Differential Revision: https://reviews.llvm.org/D117718	2022-01-19 13:51:55 -08:00
Keith Smiley	67090e3446	[lld-macho] Implement -noall_load This flag is the default, so in ld64 it is not implemented, but it can be useful to negate previous -all_load arguments. Specifically if your build system has some global linker flags, that you may want to negate for specific links. We use something like this today to make sure some C++ symbols are automatically discovered for all links, which passing -all_load hides. Differential Revision: https://reviews.llvm.org/D117629	2022-01-19 13:12:18 -08:00
Fangrui Song	5bd38a2826	[ELF] Fix split-stack caller with hidden non-split-stack callee Fix a regression after `aabe901d57` (`[ELF] Remove one redundant computeBinding`): isLocal() does not indicate that the symbol is originally local. For simplicity, just drop this optimization.	2022-01-19 12:25:01 -08:00
Fangrui Song	0aae2bf373	[lld-macho] Add --start-lib --end-lib In ld.lld, when an ObjFile/BitcodeFile is read in --start-lib state, the file is given archive semantics. --end-lib closes the previous --start-lib. A build system can use this feature as an alternative to archives. This patch ports the feature to lld-macho. --start-lib and --end-lib are positional, unlike usual ld64 options. I think the slight drawback does not matter as (a) reusing option names make build systems convenient (b) `--start-lib a.o b.o --end-lib` conveys more information than an alternative design: `-objlib a.o -objlib b.o` because --start-lib makes it clear which objects are in the same conceptual archive. This provides flexibility (c) `-objlib`/`-filelist` interaction may be weird. Close https://github.com/llvm/llvm-project/issues/52931 Reviewed By: #lld-macho, Jez Ng, oontvoo Differential Revision: https://reviews.llvm.org/D116913	2022-01-19 10:14:49 -08:00
Fangrui Song	d838bf2adc	[ELF] Allow non-bitcode archive with an empty index When an archive with an empty index contains only bitcode files, it is handled as a group of lazy (--start-lib) object files. If there is a non-bitcode file, there will be a diagnostic a la GNU ld. For some programs, the archive member extraction ratio is high (e.g. for chrome, 79% archive members are extracted according to --print-archive-stats=). Because symbol interning is cached for ObjFile::parseLazy but not for ArchiveFile, parsing an archive as a group of --start-lib object files may be faster. If the linker speculatively creates section representations for archive members, the archive index will not be used. If we take the above view, the archive index is essentially useless. If a user wants a fast build without using --start-lib, they may just build thin archives without index (`ar rcS --thin`). Therefore, I suggest that we no longer treat the code as a hack, instead as a supported feature. I believe we will do this anyway if we add parallel symbol interning (parallel symbol interning for lazy object files is simpler than that for archives). Ecosystem issues: * parseLazy actually has nearly the same behavior as ArchiveFile::parse, but the symbol order may be different. * users may get addicted to the behavior and build archives not working with GNU ld and gold. I think it is easy to rebuild archives to be compatible. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D117284	2022-01-19 10:01:53 -08:00
Ayke van Laethem	d649faff9c	[LLD][COFF] Support GNU style == aliases D46245 added support for this in llvm-libtool, but while lld-link can also create .lib files from .def files it didn't support aliases. I compared the Inputs/library.def test against the output from llvm-libtool and it matches, except for the fact that lld-link reorders functions for some reason. I have also verified that this fixes a bug I was running into while trying to compile .def files to .lib files in MinGW-w64 (using lld-link instead of llvm-libtool). Differential Revision: https://reviews.llvm.org/D113365	2022-01-19 14:22:13 +01:00
Fangrui Song	288082d45d	[ELF] Move SHT_REL/SHT_RELA handling from createInputSection to initializeSections This simplifies the code a bit. While here, * change the `multiple relocation sections` diagnostic from `fatal` to `error` and include the relocated section name. * drop less useful name from `getRelocTarget`. Without -r/--emit-relocs we don't need to get SHT_REL/SHT_RELA names.	2022-01-18 23:31:51 -08:00
Fangrui Song	84944b63f3	[ELF] Simplify ObjFile<ELFT>::initializeSections. NFC	2022-01-18 22:45:04 -08:00
Fangrui Song	5f404a749a	[ELF] De-template InputSectionBase::getLocation. NFC	2022-01-18 17:33:58 -08:00
Fangrui Song	eafd34581f	[ELF] Simplify/optimize EhInputSection::split and change some `fatal` to `errorOrWarn`. EhFrame.cpp is a helper file. We don't place all .eh_frame implementation there, so the code move is fine.	2022-01-18 17:03:23 -08:00
Vincent Lee	e5347f2556	[lld-macho] Allow deduplicate-literals to be overridden It's still uncertain whether we want to have `deduplicate-literals` be the default flag for LLD out of the box or not. If `deduplicate-literals` is the default behavior, then we will need a way to override it and not deduplicate. Luckily, we have `no_deduplicate` to fill this gap. For now, I've set the default to be false which aligns with the existing behavior. That can always be changed after discussions on D117250. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D117387	2022-01-18 15:42:59 -08:00
Sam Clegg	ec47dba1c8	[lld][WebAssembly] Perform data relocations during start function We already perform memory initialization and apply global relocations during start. It makes sense to perform data relocations too. I think the reason we were not doing this already is solely historical. Differential Revision: https://reviews.llvm.org/D117412	2022-01-18 14:08:42 -08:00
Sam Clegg	ae1573e131	[lld][WebAssembly] Reinstate mistakenly disabled test. NFC It seems the first half of this test was disabled in error as part of https://reviews.llvm.org/D93066. Differential Revision: https://reviews.llvm.org/D117594	2022-01-18 12:22:22 -08:00
Alexander Shaposhnikov	2bb7f226af	[lld] Fix typo. NFC	2022-01-18 02:33:27 +00:00
Fangrui Song	83c7f5d3fb	[ELF] EhInputSection::split: remove unneeded check	2022-01-17 13:59:52 -08:00
Fangrui Song	ac0986f880	[ELF] Change std::vector<InputSectionBase *> to SmallVector There is no remaining std::vector<InputSectionBase> now. My x86-64 lld executable is 2KiB smaller.	2022-01-17 10:25:07 -08:00
Fangrui Song	f855074ed1	[ELF] GnuHashTableSection: replace stable_sort with 2-key sort. NFC strTabOffset stabilizes llvm::sort. My x86-64 executable is 5+KiB smaller.	2022-01-17 00:34:42 -08:00
Fangrui Song	54fe70bfba	[ELF] RelocationScanner::scanOne: replace rel.r_offset with offset. NFC	2022-01-17 00:05:27 -08:00
Fangrui Song	4c36567179	[ELF] Relocations: remove some cast<Undefined>. NFC	2022-01-17 00:02:47 -08:00
Fangrui Song	b8d4eb84d7	[ELF] De-template getAlternativeSpelling. NFC	2022-01-16 23:56:25 -08:00
Fangrui Song	9c4292a59d	[ELF] Remove unneeded SyntheticSection memset(, 0, ) After the D33630 fallout was properly fixed by `a4c5db30be`. Tested by D37462/D44986 tests, the new --no-rosegment test in build-id.s, and a few --rosegment/--no-rosegment programs.	2022-01-16 22:51:57 -08:00
Fangrui Song	a4c5db30be	[ELF] Remove redundant fillTrap and memset(, 0, ). NFC The new tests in build-id.s would catch problems if we made a mistake here.	2022-01-16 22:37:31 -08:00
Fangrui Song	d46054d75d	[ELF][test] Add --build-id tests for -z separate-loadable-segments and --no-rosegment	2022-01-16 22:36:22 -08:00
Fangrui Song	aad90763d9	[ELF] RelocationSection<ELFT>::writeTo: use unstable partition	2022-01-16 21:44:19 -08:00
Fangrui Song	769057a5d0	[ELF] Change some DenseMap<StringRef, > to DenseMap<CachedHashStringRef, >. NFC	2022-01-16 21:19:01 -08:00
Fangrui Song	e205445434	[ELF] StringTableSection: Use DenseMap<CachedHashStringRef> to avoid redundant hash computation 5~6% speedup when linking clang and chrome.	2022-01-16 21:02:05 -08:00
Alexandre Ganea	e6b153947d	Revert [LLD] Remove global state in lldCommon It seems to be causing issues on https://lab.llvm.org/buildbot/#/builders/123/builds/8383	2022-01-16 11:03:06 -05:00
Alexandre Ganea	30a4020a7d	[LLD] Supplement with more comments. Clarify the intention in `f860fe3622`.	2022-01-16 09:17:39 -05:00
Alexandre Ganea	f860fe3622	[LLD] Remove global state in lldCommon Move all variables at file-scope or function-static-scope into a hosting structure (lld::CommonLinkerContext) that lives at lldMain()-scope. Drivers will inherit from this structure and add their own global state, in the same way as for the existing COFFLinkerContext. See discussion in https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html Differential Revision: https://reviews.llvm.org/D108850	2022-01-16 08:57:57 -05:00
Fangrui Song	e7c8cd4a93	[ELF] Remove forEachRelSec. NFC	2022-01-16 00:28:47 -08:00
Fangrui Song	9e885eac54	[ELF] Remove !isLazy() condition from computeBinding Seems applicable since we demote lazy symbols to Undefined (D111365).	2022-01-15 23:58:15 -08:00
Fangrui Song	c0fc09ab91	[ELF] Remove config->relocatable condition from Symbol::computeBinding	2022-01-15 23:49:48 -08:00
Fangrui Song	b3cc47006b	[ELF] Speed up Symbol::computeBinding. NFC When computeBinding is inlined into includeInDynsym and computeIsPreemptible, the optimizer can remove the config->gnuUnique load.	2022-01-15 23:40:44 -08:00
Fangrui Song	01a51629c2	[ELF] Slightly speed up Symbol::includeInDynsym. NFC	2022-01-15 23:32:48 -08:00
Fangrui Song	7330fd236e	[ELF] Simplify Symbol::includeInDynsym	2022-01-15 23:27:45 -08:00
Fangrui Song	3736d0854a	[ELF] Optimize -z combreloc Sorting dynamic relocations is a bottleneck. Simplifying the comparator improves performance. Linking clang is 4~5% faster with --threads=8. This change may shuffle R_MIPS_REL32 for Mips and is a NFC for non-Mips.	2022-01-15 22:33:51 -08:00
Fangrui Song	102d0a2baf	[ELF] Simplify elf::link exit. NFC	2022-01-15 17:59:05 -08:00
Fangrui Song	8b2f33231c	[ELF] Make some diagnostics follow the convention	2022-01-15 10:46:25 -08:00
Phoebe Wang	0f499d1ed4	Revert "[X86][LLD] Update datalayout in LLD tests. NFCI" This reverts commit `9b43237128`.	2022-01-15 10:54:37 +08:00
Fangrui Song	7c269db779	[lld-macho] Simplify DeduplicatedCStringSection::finalizeContents. NFC Tail merge is slow and of low value. With regular string deduplication, we can just use the return value of StringTableBuilder::add. There is no noticeable performance increase because without deduplication `__cstring` is quite small (7.6MiB for chromium_framework). Reviewed By: #lld-macho, Jez Ng Differential Revision: https://reviews.llvm.org/D117273	2022-01-14 13:12:57 -08:00
Juergen Ributzka	3025c3eded	Replace PlatformKind with PlatformType. The PlatformKind/PlatformType enums contain the same information, which requires them to be kept in-sync. This commit changes over to PlatformType as the sole source of truth, which allows the removal of the redundant PlatformKind. The majority of the changes were in LLD and TextAPI. Reviewed By: cishida Differential Revision: https://reviews.llvm.org/D117163	2022-01-13 09:23:49 -08:00
Igor Kudrin	e00ac48df3	[ELF] Use tombstone values for discarded symbols in relocatable output This extends D81784. Sections can be discarded when linking a relocatable output. Before the patch, LLD did not update the content of debug sections and only replaced the corresponding relocations with R_*_NONE, which could break the debug information. Differential Revision: https://reviews.llvm.org/D116946	2022-01-13 11:38:26 +07:00
Fangrui Song	a5249c2dd2	[ELF] Change gnuHashTab/hashTab to unique_ptr. NFC and remove associated make<XXX> calls. My x86-64 `lld` is ~5KiB smaller.	2022-01-12 13:04:32 -08:00
Fangrui Song	43d927984c	[ELF] Refactor how .gnu.hash and .hash are discarded Switch to the D114180 approach which is simpler and allows gnuHashTab/hashTab to switch to unique_ptr.	2022-01-12 12:47:07 -08:00
Fangrui Song	b592cbf329	[ELF][test] Improve discard-gnu-hash.s to check DT_HASH and DT_GNU_HASH	2022-01-12 12:43:49 -08:00
Fangrui Song	bf9c8636f2	[ELF] Support discarding .relr.dyn `db08df0570` does not work because part.relrDyn is a unique_ptr and `reset` destroys the object which may still be referenced. This commit uses the D114180 approach. Also improve the test to check that there is no R_X86_64_RELATIVE.	2022-01-12 11:55:22 -08:00
Fangrui Song	d8b7ae947d	[ELF][test] Temporarily remove .relr.dyn test which is not working	2022-01-12 11:43:56 -08:00
Fangrui Song	f8476fd47b	[llvm-ar][test] Test that --plugin is ignored	2022-01-12 11:32:31 -08:00
Fangrui Song	5014d6fc53	[ELF] -Map --why-extract=: print despite errors Fix https://github.com/llvm/llvm-project/issues/53073 In case of a relocation error, GNU ld's link map includes the archive member extraction information but not output sections. Our -Map and --why-extract= are currently no-op in case of an error. This change makes the two options work. Reviewed By: ikudrin, peter.smith Differential Revision: https://reviews.llvm.org/D116838	2022-01-12 10:40:33 -08:00
Fangrui Song	db08df0570	[ELF] Support discarding .relr.dyn to prepare for D116838, otherwise for linkerscript/discard-section-err.s, there will be a null pointer dereference in `part.relrDyn->getParent()->size` in `finalizeSynthetic(part.relrDyn.get())`.	2022-01-12 10:38:59 -08:00
Leonard Grey	6db04b97e6	[lld-macho] Port CallGraphSort from COFF/ELF Depends on D112160 This adds the new options `--call-graph-profile-sort` (default), `--no-call-graph-profile-sort` and `--print-symbol-order=`. If call graph profile sorting is enabled, reads `__LLVM,__cg_profile` sections from object files and uses the resulting graph to put callees and callers close to each other in the final binary via the C3 clustering heuristic. Differential Revision: https://reviews.llvm.org/D112164	2022-01-12 10:47:04 -05:00
Phoebe Wang	9b43237128	[X86][LLD] Update datalayout in LLD tests. NFCI rG1bb0caf56168 changed the datalayout of f80 on Windows 32 bits. But it missed the related use in the LLD tests. This patch will fix the problem caught by buildbot.	2022-01-12 19:13:41 +08:00
Jez Ng	62790f366f	[lld-macho] Try and fix map-file.s' flakiness After {D117069}, map-file.s seems flaky. It seems that the "Total Write map file" section always exists, but the "Write map file" sub-section may or may not be emitted. So we check for the former.	2022-01-11 23:02:45 -08:00
Fangrui Song	bfd00ae31e	[lld-link] Change config and driver to unique_ptr Similar to D116143. My x86-64 `lld` is ~5KiB smaller. Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D116996	2022-01-11 18:31:25 -08:00
Jez Ng	e976c457c5	[lld-macho] Initialize separate time trace profiler for mapfile worker After {D115416}, the "Write map file" event no longer shows up in the time trace. Each time trace profiler instance is thread-local, but we had neglected to initialize a separate instance for the mapfile worker thread. Reviewed By: keith Differential Revision: https://reviews.llvm.org/D117069	2022-01-11 17:45:18 -08:00
Fangrui Song	97a5dccb7d	[lld-macho] Rename LazySymbol to LazyArchive. NFC D116913 will add LazyObject. Rename LazySymbol to LazyArchive to avoid confusion and mirror ELF. Reviewed By: #lld-macho, Jez Ng Differential Revision: https://reviews.llvm.org/D116914	2022-01-11 16:49:06 -08:00
Fangrui Song	37a1291885	[ELF] Add RelocationScanner. NFC Currently the way some relocation-related static functions pass around states is clumsy. Add a Resolver class to store some states as member variables. Advantages: * Avoid the parameter `InputSectionBase &sec` (this offsets the cost passing around `this` parameter) * Avoid the parameter `end` (Mips and PowerPC hacks) * `config` and `target` can be cached as member variables to reduce global state accesses. (potential speedup because the compiler didn't know `config`/`target` were not changed across function calls) * If we ever want to reduce if-else costs (e.g. `config->emachine==EM_MIPS` for non-Mips) or introduce parallel relocation scan not handling some tricky arches (PPC/Mips), we can templatize Resolver `target` isn't used as much as `config`, so I change it to a const reference during the migration. There is a minor performance improvement for elf::scanRelocations. Reviewed By: ikudrin, peter.smith Differential Revision: https://reviews.llvm.org/D116881	2022-01-11 09:54:53 -08:00
Simon Atanasyan	0199e47373	[mips][lld] Add test case to check symbol index reading on mips64el. NFC	2022-01-11 19:08:20 +03:00
Fangrui Song	5dbbd4eeb8	[ELF] Move OffsetGetter before some static functions. NFC to prepare for D116881.	2022-01-10 20:16:02 -08:00
Fangrui Song	477bc36d3b	[lld-macho] Change some global pointers to unique_ptr Similar to D116143. My x86-64 `lld` is ~8KiB smaller. Reviewed By: keith Differential Revision: https://reviews.llvm.org/D116902	2022-01-10 19:39:14 -08:00
Fangrui Song	2968467e39	[lld-macho][test] Add missing coverage for archive/dylib resolution after D115092 When `file->fetch(sym)` is replaced with a no-op, no test fails. The new test catches the case. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D116916	2022-01-10 19:36:24 -08:00
Fangrui Song	7f1955dc96	[ELF] Support mixed TLSDESC and TLS GD We only support both TLSDESC and TLS GD for x86 so this is an x86-specific problem. If both are used, only one R_X86_64_TLSDESC is produced and TLS GD accesses will incorrectly reference R_X86_64_TLSDESC. Fix this by introducing SymbolAux::tlsDescIdx. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D116900	2022-01-10 10:03:21 -08:00
Vincent Lee	7a161eb43b	[lld-macho] Fix shadowed variable This fixes a windows build failure from D115416.	2022-01-10 00:20:35 -08:00
Alexander Shaposhnikov	8acc3b4ab0	[lld][ELF] Support adrp+ldr GOT optimization for AArch64 This diff adds first bits to support relocation relaxations for AArch64 discussed on https://github.com/ARM-software/abi-aa/pull/106. In particular, the case of adrp x0, :got: symbol ldr x0, [x0, :got_lo12: symbol] is handled. Test plan: make check-all Differential revision: https://reviews.llvm.org/D112063	2022-01-10 05:20:37 +00:00
Fangrui Song	5d3bd7f360	[ELF] Move gotIndex/pltIndex/globalDynIndex to SymbolAux to decrease sizeof(SymbolUnion) by 8 on ELF64 platforms. Symbols needing such information are typically 1% or fewer (5134 out of 560520 when linking clang, 19898 out of 5550705 when linking chrome). Storing them elsewhere can decrease memory usage and symbol initialization time. There is a ~0.8% saving on max RSS when linking a large program. Future direction: * Move some of dynsymIndex/verdefIndex/versionId to SymbolAux * Support mixed TLSDESC and TLS GD without increasing sizeof(SymbolUnion) Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D116281	2022-01-09 13:43:27 -08:00
Kazu Hirata	8afcfbfb8f	Use true/false instead of 1/0 (NFC) Identified by modernize-use-bool-literals.	2022-01-09 12:21:06 -08:00
Kazu Hirata	b12fd13812	Fix bugprone argument comments. Identified by bugprone-argument-comment.	2022-01-09 12:21:02 -08:00
John Ericson	a1da5f3c2d	[lld] Deprecate using llvm-config to detect llvm installation This is continuing in the path of D51714, which did this for Clang. I have rearranged the source code Clang so one can diff the top-level CMakeLists.txt of Clang and LLD, ensuring we use the same strategy for both. Besides diffing the two files, `git diff --color-moved` on LLD also helps review. Reviewed By: beanz Differential Revision: https://reviews.llvm.org/D116492	2022-01-07 20:51:14 +00:00
John Ericson	44e3365775	[CMake] Factor out config prefix finding logic See the docs in the new function for details. I think I found every instance of this copy pasted code. Polly could also use it, but currently does something different, so I will save the behavior change for a future revision. We get the shared, non-installed CMake modules following the pattern established in D116472. It might be good to have LLD and Flang also use this, but that would be a functional change and so I leave it as future work. Reviewed By: beanz, lebedev.ri Differential Revision: https://reviews.llvm.org/D116521	2022-01-07 20:16:18 +00:00
Brian Cain	ddf1fb1f13	[Hexagon] Save results from partial compound Previously compounding was all-or-nothing. Now, the compounding attempts will iterate and yield the most compounds that still result in a valid packet.	2022-01-06 14:08:33 -08:00
Vincent Lee	a963bc490d	[lld-macho] Increase slops to prevent thunk out of range One of our internal arm64 apps hit a thunk out of range error when building with LLD. Per the comment, I'm arbitrarily increasing slop size to 256. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D116705	2022-01-06 12:29:12 -08:00
Vy Nguyen	fb9bfb2c59	[lld][macho][nfc] Make tests less brittle by not expecting ordering in symbol table dump. (partial) fixes PR/53026 Differential Revision: https://reviews.llvm.org/D116718	2022-01-06 09:45:44 -05:00
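
The sharding scheme described in commit `4cdc441690` ([ELF] Parallelize --compress-debug-sections=zlib) can be sketched in a few lines. This is a minimal Python illustration of the idea, not lld's C++ implementation: the names `compress_shard` and `sharded_zlib_compress`, the thread pool, and the `shard_size` parameter are assumptions made for this example. Each shard is compressed independently as raw DEFLATE. All shards but the last are flushed with `Z_SYNC_FLUSH` so that they end on a byte boundary, the last is finished with `Z_FINISH` to set BFINAL, and the concatenation is wrapped in a zlib header and an Adler-32 trailer.

```python
import zlib
from concurrent.futures import ThreadPoolExecutor

SHARD_SIZE = 1 << 20  # 1 MiB, the shard size quoted in the commit message


def compress_shard(shard: bytes, last: bool) -> bytes:
    # wbits=-15 selects a raw DEFLATE stream (no zlib header or trailer).
    # Level 1 corresponds to Z_BEST_SPEED.
    comp = zlib.compressobj(level=1, method=zlib.DEFLATED, wbits=-15)
    out = comp.compress(shard)
    # Z_SYNC_FLUSH pads the output to a byte boundary without setting BFINAL.
    # Z_FINISH sets BFINAL on the final block and also ends on a byte boundary.
    out += comp.flush(zlib.Z_FINISH if last else zlib.Z_SYNC_FLUSH)
    return out


def sharded_zlib_compress(data: bytes, shard_size: int = SHARD_SIZE) -> bytes:
    # Split the input into fixed-size shards; keep one empty shard for empty input
    # so that a final block is always emitted.
    shards = [data[i:i + shard_size] for i in range(0, len(data), shard_size)] or [b""]
    is_last = [i == len(shards) - 1 for i in range(len(shards))]
    # Each shard uses its own compressor, so shards can be compressed concurrently.
    with ThreadPoolExecutor() as pool:
        parts = list(pool.map(compress_shard, shards, is_last))
    header = b"\x78\x01"  # zlib header: deflate, 32 KiB window, fastest-level hint
    trailer = zlib.adler32(data).to_bytes(4, "big")  # checksum of the uncompressed data
    return header + b"".join(parts) + trailer
```

An ordinary inflater sees the result as a single zlib stream, so `zlib.decompress(sharded_zlib_compress(data))` should give back `data`. As the commit message notes, the output is slightly larger than a single-stream compression because no shard can reference data from the previous one.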


15000 Commits