llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	cb0a4bb5be	[ELF] Change (NOLOAD) section type mismatch error to warning Making a (NOLOAD) section SHT_PROGBITS is fishy (the user may expect all-zero content, but the linker does not check that), but some projects (e.g. Linux kernel https://github.com/ClangBuiltLinux/linux/issues/1597) traditionally rely on the behavior. Issue a warning to not break them.	2022-02-18 11:20:36 -08:00
Jez Ng	fd3669c256	[lld-macho] Improve hiding of unnamed_addr symbols Symbols for which `canBeOmittedFromSymbolTable()` is true should be treated as private externs. This diff tries to do that by unsetting the ExportDynamic bit. It seems to mostly work with the FullLTO backend, but with the ThinLTO backend, the `local_unnamed_addr` symbols still fail to be properly hidden. Nonetheless, this is a step in the right direction. I've documented all the remaining differences between our behavior and LD64's in the lto-internalized-unnamed-addr.ll test. See also https://discourse.llvm.org/t/mach-o-lto-handling-of-linkonce-odr-unnamed-addr/60015 Reviewed By: #lld-macho, thevinster Differential Revision: https://reviews.llvm.org/D119767	2022-02-18 12:09:38 -05:00
Fangrui Song	66f8ac8d36	[ELF] Support (TYPE=<value>) to customize the output section type The current output section type allows to set the ELF section type to SHT_PROGBITS or SHT_NOLOAD. This patch allows an arbitrary section value to be specified. Some common SHT_* literal names are supported as well. ``` SECTIONS { note (TYPE=SHT_NOTE) : { BYTE(8) *(note) } init_array ( TYPE=14 ) : { QUAD(14) } fini_array (TYPE = SHT_FINI_ARRAY) : { QUAD(15) } } ``` When `sh_type` is specified, it is an error if an input section has a different type. Our syntax is compatible with GNU ld 2.39 (https://sourceware.org/bugzilla/show_bug.cgi?id=28841). Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D118840	2022-02-17 12:10:58 -08:00
Fangrui Song	941f06282a	[lld] Make error handling functions opaque The inline `lld::error` expands to two function calls `errorHandler` and `error` where the latter is opaque. Move the functions to .cpp files to decrease code size. My x86-64 lld executable is 9KiB smaller. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D120002	2022-02-17 11:54:57 -08:00
Leonard Grey	a52b9102d1	[lld-macho] Allow order files and call graph sorting to be used together If both an order file and a call graph profile are present, the edges of the call graph which use symbols present in the order file are not used. All of the symbols in the order file will appear at the beginning of the section just as they do currently. In other words, the highest priority derived from the call graph will be below the lowest priority derived from the order file. Practically, this change renames CallGraphSort.{h,cpp} to SectionPriorities.{h,cpp}, and most order file and call graph profile related code is moved into the new file to reduce duplication. Differential Revision: https://reviews.llvm.org/D117354	2022-02-17 14:19:34 -05:00
Jez Ng	69297cf639	[lld-macho] Don't include CommandFlags.h in CommonLinkerContext.h Main motivation: including `llvm/CodeGen/CommandFlags.h` in `CommonLinkerContext.h` means that the declaration of `llvm::Reloc` is visible in any file that includes `CommonLinkerContext.h`. Since our cpp files have both `using namespace llvm` and `using namespace lld::macho`, this results in conflicts with `lld::macho::Reloc`. I suppose we could put `llvm::Reloc` into a nested namespace, but in general, I think we should avoid transitively including too many header files in a very widely used header like `CommonLinkerContext.h`. RegisterCodeGenFlags' ctor initializes a bunch of function-`static` structures and does nothing else, so it should be fine to "initialize" it as a temporary stack variable rather than as a file static. Reviewed By: aganea Differential Revision: https://reviews.llvm.org/D119913	2022-02-16 20:05:07 -05:00
Sam Clegg	dabbab6861	[lld][WebAssembly] Apply global relocs before data relocs Since the code for apply data relocations can sometimes use the values stored in he globals, they need to be relocated before the data relocations can be run. Fixes: https://github.com/emscripten-core/emscripten/issues/13398 Differential Revision: https://reviews.llvm.org/D119666	2022-02-16 14:30:39 -08:00
Arthur Eubanks	b5c9512df2	[test] Mark archive-as-start-lib.s as unsupported on Windows gnuwin32 tail does not support the `tail -c +9` syntax. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D119956	2022-02-16 10:27:43 -08:00
Fangrui Song	ae62aaa171	[ELF][test] Add --undefine-glob test to lto/duplicated.ll	2022-02-16 09:40:55 -08:00
Peter Kasting	c5fb05f663	Reland: Make lld-link work in a non-MSVC shell, add /winsysroot: This relands `73e585e44d` (and `0574b5fc65`), with a fix for the failing test (by using Optional<StringRef>s instead of making StringRef::empty() mean absence of value). Differential Revision: https://reviews.llvm.org/D118070	2022-02-16 09:22:39 -05:00
Nemanja Ivanovic	d32b875dbc	[ELF][test] Fix build break after `20bdd3e232` The added run lines build a bitcode file for x86 and an object file for whatever the default target is that is running the test. This causes an incompatibility between the files. Add the triple to the llvm-mc invocation.	2022-02-16 05:56:25 -06:00
Jez Ng	aa108fffec	[lld-macho][nfc] Clean up trailing spaces and tabs	2022-02-15 21:33:26 -05:00
Jez Ng	94c28d289a	[lld-macho][nfc] Factor out callgraph parsing code `parseSections()` is a getting a bit large unwieldy, let's factor out logic where we can. Other minor changes in this diff: * `"__cg_profile"` is now a global constexpr * We now use `checkError()` instead of `fatal()`-ing without handling the Error * Check for `callGraphProfileSort` before checking the section name, since the boolean comparison is likely cheaper Reviewed By: #lld-macho, lgrey, oontvoo Differential Revision: https://reviews.llvm.org/D119892	2022-02-15 21:13:55 -05:00
Fangrui Song	20bdd3e232	[ELF][test] Improve LTO duplicate symbol test	2022-02-15 17:54:38 -08:00
Sam Clegg	d2a0ef9844	[lld][WebAssembly] Don't force the export symbols assiged internal/dummy GOT entries Symbols with regular GOT entries do need to be exported, but those that are internalized (and have dymmy/internal GOT entries) need not be exported. This happens to fix the failures on the emscripten waterfall where extra symbols were being exported by the linker (and then later removed by wasm-opt). Differential Revision: https://reviews.llvm.org/D119902	2022-02-15 17:29:45 -08:00
Fangrui Song	132553b8c7	[ELF] --exclude-libs: skip local symbols for ET_REL. NFC Beside the optimization, this will avoid accessing nullptr entries with my planned change to parallelize initializeLocalSymbols.	2022-02-15 17:02:56 -08:00
Sam Clegg	faab70b783	[lld][WebAssemlby] Warn on unknown -z flags This code mirrors that in lld/ELF/Driver.cpp, as does the new test code. Differential Revision: https://reviews.llvm.org/D119888	2022-02-15 14:42:04 -08:00
Fangrui Song	53b59fdc52	[ELF][PPC64] Fix assertion failure for branches to hidden undefined weak for -no-pie Reported by Stefan Pintilie in D119773. For a branch to a hidden undefined weak symbol, there is an `assert(sym->getVA());` failure in PPC64LongBranchTargetSection::writeTo for a -no-pie link. The root cause is that we unnecessarily create the thunk for the -no-pie link. Fix this by changing the condition to just `s.isUndefined()`. See the inline comment. Rename ppc64-weak-undef-call.s to ppc64-undefined-weak.s to be consistent with other architectures. Reviewed By: sfertile, stefanp Differential Revision: https://reviews.llvm.org/D119787	2022-02-15 12:57:27 -08:00
Fangrui Song	467e1b3aaa	[ELF] reportDuplicate: change Symbol * to const Symbol &. NFC	2022-02-15 11:18:31 -08:00
Fangrui Song	3d85424096	[ELF] Parse archives as --start-lib object files https://maskray.me/blog/2022-01-16-archives-and-start-lib For every definition in an extracted archive member, we intern the symbol twice, once for the archive index entry, once for the .o symbol table after extraction. This is inefficient. Symbols in a --start-lib ObjFile/BitcodeFile are only interned once because the result is cached in symbols[i]. Just handle an archive using the --start-lib code path. We can therefore remove ArchiveFile and LazyArchive. For many projects, archive member extraction ratio is high and it is a net performance win. Linking a Release build of clang is 1.01x as fast. Note: --start-lib scans symbols in the same order that llvm-ar adds them to the index, so in the common case the semantics should be identical. If the archive symbol table was created in a different order, or is incomplete, this strategy may have different semantics. Such cases are considered user error. The `is neither ET_REL nor LLVM bitcode` error is changed to a warning. Previously an archive may have such members without a diagnostic. Using a warning prevents breakage. * For some tests, the diagnostics get improved where we did not consider the archive member name: `b.a:` => `b.a(b.o):`. * `no-obj.s`: the link is now allowed, matching GNU ld * `archive-no-index.s`: the `is neither ET_REL nor LLVM bitcode` diagnostic is demoted to a warning. * `incompatible.s`: even when an archive is unextracted, we may report an "incompatible with" error. --- I recently decreased sizeof(SymbolUnion) by 8 and decreased memory usage quite a bit, so retaining `symbols` for un-extracted archive members should not cause a memory usage problem. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D119074	2022-02-15 09:38:00 -08:00
Sam Clegg	37f422f4ac	[WebAssembly] Use GeneralDynamic TLS for exception handling builtins. These global TLS symbols are shared across all shared libraries and therefor should not be assumed to be local to the current module. Also add new error in the linker when TLS relocations are used against undefined symbols. TLS relocations are offsets into the current modules tls data segment, and don't make sense for undefined symbols which are modeled as global imports. Fixes: https://github.com/emscripten-core/emscripten/issues/13398 Differential Revision: https://reviews.llvm.org/D119630	2022-02-14 14:08:32 -08:00
Fangrui Song	fb40a61b2f	[ELF][docs] Document "Output section type"	2022-02-14 09:52:20 -08:00
Fangrui Song	f2fd1587bc	[ELF] Fix dead initialization. NFC Reported by scan-build.	2022-02-14 09:27:42 -08:00
Fangrui Song	8b01b638d0	[ELF] demoteSharedSymbols: make binding more appropriate for lazy symbols. NFC The binding will matter if we remove the `sym->replace(und)` kludge from initializeSymbols. While here, rename the function to be more appropriate.	2022-02-12 20:43:40 -08:00
Douglas Yung	437d4e01fe	Revert "try to fix windows build after 73e585e44d" and Revert "Reland "[lld/coff] Make lld-link work in a non-MSVC shell, add /winsysroot:"" This reverts commit `0574b5fc65` and `73e585e44d`. This change is causing the test Driver/cl-options.c to fail on Windows buildbots. https://lab.llvm.org/staging/#/builders/204/builds/1343	2022-02-11 23:47:53 -08:00
Jez Ng	103e1d934a	[lld-macho] Unset ExportDynamic where possible for LTO By unsetting this property, we are now able to internalize more symbols during LTO. I compared the output of `-save-temps` for both LLD and ld64, and we now match ld64's behavior as far as `lto-internalize.ll` is concerned. (Thanks @smeenai for working on an initial version of this diff!) Fixes https://github.com/llvm/llvm-project/issues/50574. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D119372	2022-02-11 22:26:19 -05:00
Roger Kim	dafe4c0b5c	[Mach-O][NFC] Reorder map file tests We are just grouping the files and the tests together. Reviewed By: int3, #lld-macho Differential Revision: https://reviews.llvm.org/D119456	2022-02-11 19:42:20 -05:00
Roger Kim	4f2c46c35c	Print C-string literals in mapfile This diff has the C-string literals printed into the mapfile in the symbol table like how ld64 does. Here is what ld64's mapfile looks like with C-string literals: ``` # Path: out # Arch: x86_64 # Object files: [ 0] linker synthesized [ 1] foo.o # Sections: # Address Size Segment Section 0x100003F7D 0x0000001D __TEXT __text 0x100003F9A 0x0000001E __TEXT __cstring 0x100003FB8 0x00000048 __TEXT __unwind_info # Symbols: # Address Size File Name 0x100003F7D 0x0000001D [ 1] _main 0x100003F9A 0x0000000E [ 1] literal string: Hello world!\n 0x100003FA8 0x00000010 [ 1] literal string: Hello, it's me\n 0x100003FB8 0x00000048 [ 0] compact unwind info ``` Here is what the new lld's Mach-O mapfile looks like: ``` # Path: /Users/rgr/local/llvm-project/build/Debug/tools/lld/test/MachO/Output/map-file.s.tmp/c-string-liter al-out # Arch: x86_64 # Object files: [ 0] linker synthesized [ 1] /Users/rgr/local/llvm-project/build/Debug/tools/lld/test/MachO/Output/map-file.s.tmp/c-string-literal .o # Sections: # Address Size Segment Section 0x1000002E0 0x0000001D __TEXT __text 0x1000002FD 0x0000001D __TEXT __cstring # Symbols: # Address File Name 0x1000002E0 [ 1] _main 0x1000002FD [ 1] literal string: Hello world!\n 0x10000030B [ 1] literal string: Hello, it's me\n ``` Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D118077	2022-02-11 19:42:20 -05:00
Nico Weber	73e585e44d	Reland "[lld/coff] Make lld-link work in a non-MSVC shell, add /winsysroot:" This relands commit `b3b2538df1`, except that the new files in Support are instead in a new library WindowsDriver.	2022-02-11 17:07:33 -05:00
Adrian Prantl	baac665adf	Revert "[lld/coff] Make lld-link work in a non-MSVC shell, add /winsysroot:" This reverts commit `b3b2538df1`, it introduced a cycklic module depenency that broke the -DLLVM_ENABLE_MODULES=1 build.	2022-02-11 13:07:23 -08:00
Peter Kasting	b3b2538df1	[lld/coff] Make lld-link work in a non-MSVC shell, add /winsysroot: Makes lld-link work in a non-MSVC shell by autodetecting MSVC toolchain. Also adds support for /winsysroot and a few other switches. All this is done by refactoring to share code with clang-cl's existing support for the same. Differential Revision: https://reviews.llvm.org/D118070	2022-02-11 13:55:18 -05:00
Jez Ng	4490a26a3e	[lld-macho][nfc] Rename %no_fatal_warnings_lld in tests ... to use hyphens instead of underscores, making it consistent with our other substitutions like %no-arg-lld and %lld-watchos. Reviewed By: keith Differential Revision: https://reviews.llvm.org/D119513	2022-02-11 10:06:38 -05:00
Vincent Lee	ef764ee207	[lld-macho][nfc] Centralize usages of ld64.lld in tests We have a mix of substituted lld (`%lld`) and hard-coded lld (`ld64.lld`) commands. When testing with different versions of LLD, this would require going into every place where lld is hard-coded and changing that. If we centralize it, this'll only require us to modify it in only one place and will make it easy to run the same test suite. Plus, this will make it be consistent with how we write other tests. Reviewed By: #lld-macho, int3, oontvoo Differential Revision: https://reviews.llvm.org/D119394	2022-02-10 17:27:07 -08:00
Krzysztof Drewniak	1ce314ce6b	[MLIR][GPU][lld] Use LLD bundled in ROCm, removing workaround Having clarified that executing the SerializeToHsaco pass can depend on a ROCm installation, switch from calling lld as a library to using the copy of lld guaranteed to be included in a ROCm install. This removes the workaround introduced in D119277 Reviewed By: whchung Differential Revision: https://reviews.llvm.org/D119463	2022-02-10 19:37:30 +00:00
Ben Dunbobbin	666aa43cbf	Fix comment after upstream: `9e08e92980` - [ELF] Allow STV_PROTECTED shared definition to set exportDynamic?	2022-02-09 23:51:31 +00:00
Fangrui Song	4631cba10b	[ELF][docs] Remove ignore -dc from ld.lld.1	2022-02-09 10:38:36 -08:00
Fangrui Song	ce45c95694	[ELF] Remove obscure -dp and GNU ld incompatible --[no-]define-common, ignore -d/-dc https://maskray.me/blog/2022-02-06-all-about-common-symbols#no-define-common In GNU ld, -dc only affects -r links and causes COMMON symbols to be allocated. --no-define-common is defined to make COMMON symbols undefined for -shared. AIUI --no-define-common is a workaround around glibc 2.1 time and not really useful. gold confuses --define-common with -d/FORCE_COMMON_ALLOCATION and implements --define-common with -d semantics. Its --no-define-common is incompatible with GNU ld. In ld.lld, `b2a23cf3c0` fixed the default -r behavior for COMMON symbols but ported the incompatible gold --[no-]define-common. To the best of my knowledge, no project uses -dp --[no-]define-common. So just remove these options. -d/-dc are used by the following projects: * grub grub-core/genmod.sh.in uses -Wl,-r,-d (https://lists.gnu.org/archive/html/grub-devel/2022-02/msg00088.html) * FreeBSD crunchgen uses -Wl,-dc (https://reviews.freebsd.org/D34215) A no-op implementation works for them. Only when a program inspects relocatable output by itself and does not recognize COMMON symbols, there may be a problem. This is an extremely unlikely case. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D119108	2022-02-09 10:35:53 -08:00
Fangrui Song	99580e29d8	[ELF] --warn-backrefs: suppress warnings for backward references within the archive	2022-02-08 21:45:55 -08:00
Alexandre Ganea	bb8be26a7e	[LLD] Fix issue in HIP due to unspecified order of evaluation of the function object This fixes the issue raised in https://reviews.llvm.org/D108850#3303452 Before C++17, the function object is evaluated in a unspecified order. In the following example: https://godbolt.org/z/8ao4vdsr7 the function object is either evaluated before or after the arguments, depending on the compiler. With MSVC and /std:c++14 the function object is evaluated after the arguments; with clang and gcc, it is evaluated before. With C++17, the function object is guaranteed to be evaluated before the arguments, see: https://riptutorial.com/cplusplus/example/19369/evaluation-order-of-function-arguments In our case, the issue was that the `args` conversion to `ArrayRef` was evaluated before the lambda call `link`, which internally was calling `parseFlavor()`, which in turned modified `args`. We ended with an `ArrayRef` argument that reflected the previous contents of `args`. Add coverage for `-flavor` which we didn't have before. Differential Revision: https://reviews.llvm.org/D119278	2022-02-08 19:12:15 -05:00
Alexandre Ganea	1e661e583d	[MLIR] Temporary workaround for calling the LLD ELF driver as-a-lib This fixes the situation described in https://github.com/llvm/llvm-project/issues/53475 with a repro exposed by https://github.com/ROCmSoftwarePlatform/D108850-lld-bug-reproduction This is purposely just a workaround to unblock users. This could be transplanted to the release/14.x branch if need be. A proper fix will later be provided in https://reviews.llvm.org/D119049. Differential Revision: https://reviews.llvm.org/D119277	2022-02-08 19:12:15 -05:00
Fangrui Song	f237ab0dd1	[ELF] AArch64ErrataFix: replace std::map with DenseMap. NFC There is now no <map> in lld/ELF.	2022-02-07 22:02:25 -08:00
Fangrui Song	27bb799095	[ELF] Clean up headers. NFC	2022-02-07 21:53:34 -08:00
Jez Ng	06f863ac5e	[lld-macho] Include address offsets in error messages This makes it easier to pinpoint the source of the problem. TODO: Have more relocation error messages make use of this functionality. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D118798	2022-02-07 21:06:18 -05:00
Fangrui Song	cb03ac0b5d	[ELF] Move Symbol::needsTlsLd to config->needsTlsLd to decrease sizeof(SymbolUnion) from 72 to 64 on ELF64 platforms. Use a dummy `Undefined` to prevent null pointer dereference (though unused) `*rel.sym` in InputSectionBase::relocateAlloc. The relocation order may shuffle a bit, but otherwise there is no behavior difference.	2022-02-07 10:26:16 -08:00
Alexander Kornienko	ec8a693717	Revert "[ELF] Move Symbol::needsTlsLd to config->needsTlsLd. NFC" This reverts commit `f9e3ca542e`. The commit results in internal test failures. Test case provided offline.	2022-02-07 19:00:09 +01:00
Mariusz Ceier	e8bff9ae54	Fix lld standalone build lld/ELF/OutputSections.cpp includes llvm/Config/config.h for LLVM_ENABLE_ZLIB definition, but llvm/Config/config.h doesn't exist in standalone build. To fix this, this patch moves LLVM_ENABLE_ZLIB from config.h to llvm-config.h and updates OutputSections.cpp to include llvm-config.h instead of config.h Reviewed By: MaskRay, mgorny Differential Revision: https://reviews.llvm.org/D119058	2022-02-07 09:20:03 -08:00
Jared Irwin	31626cc111	[lld-macho] Add -pagezero_size Adds `-pagezero_size`. `-pagezero_size` commonly used for kernel development. `-pagezero_size` changes the `__PAGEZERO` size, removing that segment if it is set to zero. One of the four flags from {D118570} Now with error messages and tests. Differential Revision: https://reviews.llvm.org/D118724	2022-02-06 13:15:16 -05:00
Fangrui Song	bad1b7fbb0	[ELF] Fix crash when an input is incompatible with a lazy object file The diagnostic is concise. It is ok because the case is rare.	2022-02-05 23:34:14 -08:00
Fangrui Song	5ad2aae244	[ELF] SharedFile::parse: move verdefIndex assignment outside of ctor. NFC SharedSymbol::SharedSymbol initializes verdefIndex and Symbol::replace copies verdefIndex. By move verdefIndex assignment outside of ctor, Symbol::replace can be changed to not copy verdefIndex. This can be used to decrease work for for ObjKind/BitcodeKind.	2022-02-05 20:43:51 -08:00
Fangrui Song	977a1a523c	[ELF] Symbol::replace: use the old nameData/nameSize. NFC Currently `this->getName() == newSym.getName()`. By keeping the old nameData/nameSize, newSym's nameData/nameSize will be ignored. The call sites can avoid calling getName(). printTraceSymbol needs to take the symbol name since `other`'s name is empty.	2022-02-05 16:34:02 -08:00
Fangrui Song	50460b8004	[ELF] Don't access other eSym members it st_shndx == SHN_UNDEF. NFC	2022-02-05 15:25:23 -08:00
Fangrui Song	9af90e205a	[ELF] De-template reportUndefinedSymbols. NFC My x86-64 lld executable is 16KiB smaller.	2022-02-05 15:03:56 -08:00
Fangrui Song	f9e3ca542e	[ELF] Move Symbol::needsTlsLd to config->needsTlsLd. NFC to decrease sizeof(SymbolUnion) from 72 to 64 on ELF64 platforms.	2022-02-05 14:40:15 -08:00
Fangrui Song	73f55fba76	[ELF] Reorder Symbol members to improve access locality. NFC * partition and isPreemptible are frequently used. Move it to the front * move used beside isUsedInRegularObj. They are similar and accessed together in .symtab finalizing * move auxIdx/dynsymIndex/verdefIndex to the end. This decreases code size.	2022-02-05 14:11:37 -08:00
Fangrui Song	7c675923c7	[ELF] Merge canInline into scriptDefined They perform similar tasks and are essentially the same after `d28c26bbdd`.	2022-02-05 12:00:34 -08:00
Fangrui Song	764cd491b1	[ELF] Simplify shouldKeepInSymtab after Symbol::used is false by default. NFC	2022-02-05 11:21:44 -08:00
Fangrui Song	38e6361d84	[ELF] Simplify includeInSymtab. NFC	2022-02-05 11:18:08 -08:00
Fangrui Song	bb4eacdb70	[ELF] Refactor how Symbol::used is set. NFC	2022-02-05 11:09:40 -08:00
Fangrui Song	ac2911e738	[ELF] Refactor how exportDynamic is set. NFC	2022-02-05 10:25:25 -08:00
Fangrui Song	7288b85cc8	[ELF] --wrap: don't copy exportDynamic For -no-pie/-pie, when `__real_foo` is interposable in a shared object, `foo` is exported. This rule does not match GNU ld and is unneeded because: * the exported `foo` does not interpose `__real_foo` at run-time * the similar `__wrap_foo` <-> `foo` relation does not have the rule	2022-02-05 09:56:29 -08:00
Fangrui Song	9e08e92980	[ELF] Allow STV_PROTECTED shared definition to set exportDynamic A STV_PROTECTED shared definition does not set exportDynamic of a defined symbol. This is on the basis that a protected definition cannot be preempted so the export is unnecessary. However, the condition is imperfect because we don't know whether the shared object was built with a symbolic option. Since dropping the condition simplifies code and matches GNU ld, let's do it.	2022-02-05 01:10:43 -08:00
Shoaib Meenai	997f2a56de	[ELF] Avoid wrapping unreferenced lazy symbols There's a couple of motivations here: * LLD 12 (which I was originally testing with) was adding an undefined symbol to the symbol table if you attempted to wrap an unreferenced lazy symbol, which would later break `--no-allow-shlib-undefined`. LLD on main actually produces a weak undefined symbol, so this doesn't break anyway, but it's cleaner to not have the weak undefined symbol as well. The new behavior also matches bfd and gold. * PROVIDE in a linker script referencing a wrapped symbol would think that an otherwise-unreferenced lazy symbol which was wrapped was actually referenced, and therefore proceed with the definition, which goes against expectations. The new behavior also matches bfd and gold. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D118756	2022-02-04 18:09:37 -08:00
Fangrui Song	53fc5d9b9a	[ELF] Support R_PPC_NONE/R_PPC64_NONE in getImplicitAddend Similar to `f457863ae3`	2022-02-04 15:13:37 -08:00
Fangrui Song	3fd30958b7	[ELF][test] Test getImplicitAddend for R_ARM_NONE/R_RISCV_NONE/R_386_NONE/R_X86_64_NONE Similar to `f457863ae3`	2022-02-04 13:59:18 -08:00
Byoungchan Lee	da08d50fd6	[lld][macho] Add more skip platform check for libSystem re-exports Xcode 13 comes with a mismatched platform in libcompiler_rt.dylib, so this creates a linker error on mac catalyst. Fix it by adding it to the skip list. Reviewed By: MaskRay, #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D117925	2022-02-04 16:34:56 -05:00
Fangrui Song	f457863ae3	[ELF] Support REL-format R_AARCH64_NONE relocation -fprofile-use=/-fprofile-sample-use= compiles may produce REL-format .rel.llvm.call-graph-profile even if the prevailing format is RELA on AArch64. Add R_AARCH64_NONE to getImplicitAddend to fix this linker error: ``` ld.lld: error: internal linker error: cannot read addend for relocation R_AARCH64_NONE PLEASE submit a bug report to https://crbug.com and run tools/clang/scripts/process_crashreports.py (only works inside Google) which will upload a report and include the crash backtrace. ```	2022-02-04 13:20:49 -08:00
Jez Ng	2b78ef06c2	[lld-macho][nfc] Eliminate InputSection::Shared Earlier in LLD's evolution, I tried to create the illusion that subsections were indistinguishable from "top-level" sections. Thus, even though the subsections shared many common field values, I hid those common values away in a private Shared struct (see D105305). More recently, however, @gkm added a public `Section` struct in D113241 that served as an explicit way to store values that are common to an entire set of subsections (aka InputSections). Now that we have another "common value" struct, `Shared` has been rendered redundant. All its fields can be moved into `Section` instead, and the pointer to `Shared` can be replaced with a pointer to `Section`. This `Section` pointer also has the advantage of letting us inspect other subsections easily, simplifying the implementation of {D118798}. P.S. I do think that having both `Section` and `InputSection` makes for a slightly confusing naming scheme. I considered renaming `InputSection` to `Subsection`, but that would break the symmetry with `OutputSection`. It would also make us deviate from LLD-ELF's naming scheme. This change is perf-neutral on my 3.2 GHz 16-Core Intel Xeon W machine: base diff difference (95% CI) sys_time 1.258 ± 0.031 1.248 ± 0.023 [ -1.6% .. +0.1%] user_time 3.659 ± 0.047 3.658 ± 0.041 [ -0.5% .. +0.4%] wall_time 4.640 ± 0.085 4.625 ± 0.063 [ -1.0% .. +0.3%] samples 49 61 There's also no stat sig change in RSS (as measured by `time -l`): base diff difference (95% CI) time 998038627.097 ± 13567305.958 1003327715.556 ± 15210451.236 [ -0.2% .. +1.2%] samples 31 36 Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D118797	2022-02-03 19:55:42 -05:00
Vy Nguyen	36f32ffd2a	[lld-macho] Minor clean up: use .find() to check for key existence rather than [], which would create a new entry. Differential Revision: https://reviews.llvm.org/D118945	2022-02-03 17:17:14 -05:00
John Ericson	e0eeae9a44	[lld][clang][cmake] Clean up a few things - If not using `llvm-config`, `LLVM_MAIN_SRC_DIR` now has a sane default - `LLVM_CONFIG_PATH` will continue to work for LLD for back compat. - More quoting of paths in an abundance of caution. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D118792	2022-02-03 20:01:28 +00:00
Keith Smiley	187ce07a06	[lld-macho] Fix duplicate symbols with relocatable objects In the case your framework bundles contain relocatable objects, and your objects include LC_LINKER_OPTIONs for the framework, previously they would not be deduplicated like they would have if they were static archives. This was also the case if you passed `-framework` for the framework as well. Reviewed By: #lld-macho, thakis, oontvoo Differential Revision: https://reviews.llvm.org/D114841	2022-02-02 14:54:10 -08:00
Jez Ng	9408b75ec3	[lld-macho][nfc] Hoist out creation of Section in parseSections() Simplifies the code slightly. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D118796	2022-02-02 17:09:14 -05:00
Tom Stellard	a2601c9887	Bump the trunk major version to 15	2022-02-01 23:54:52 -08:00
Tom Stellard	e80c52986e	[docs] Remove hard-coded version numbers from sphinx configs This updates all the non-runtime project release notes to use the version number from CMake instead of the hard-coded version numbers in conf.py. It also hides warnings about pre-releases when the git suffix is dropped from the LLVM version in CMake. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D112181	2022-02-01 23:14:12 -08:00
Alexander Shaposhnikov	4450a2a23d	[lld][ELF] Add support for ADRP+ADD optimization for AArch64 This diff adds support for ADRP+ADD optimization for AArch64 described in `d2ca58c54b` i.e. under appropriate constraints ADRP x0, symbol ADD x0, x0, :lo12: symbol can be turned into NOP ADR x0, symbol Test plan: make check-all Differential revision: https://reviews.llvm.org/D117614	2022-02-02 06:09:55 +00:00
Jez Ng	3e951808d5	[lld-macho][nfc] Comments and style fixes Added some comments (particularly around finalize() and finalizeContents()) as well as doing some rephrasing / grammar fixes for existing comments. Also did some minor style fixups, such as by putting methods together in a class definition and having fields of similar types next to each other. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D118714	2022-02-01 13:45:59 -05:00
Fangrui Song	fbf2f66400	[ELF] Update flag propagation rule to ignore discarded output sections See the updated insert-before.test for the effects: many synthetic sections are SHF_ALLOC\|SHF_WRITE. If they are discarded, we don't want to propagate their flags to subsequent output section descriptions. `getFirstInputSection(sec) == nullptr` can technically be merged into `isDiscardable` but I'd like to postpone that as not sharing code may give more refactoring opportunity. Depends on D118529. Reviewed By: peter.smith, bluca Differential Revision: https://reviews.llvm.org/D118530	2022-02-01 10:19:30 -08:00
Fangrui Song	a0318711c8	[ELF] Rename adjustSectionsBeforeSorting to adjustOutputSections and make it affect INSERT commands adjustSectionsBeforeSorting updates some output section attributes (alignment/flags) and removes discardable empty sections. When it is called, INSERT commands have not been processed. Therefore the flags propagation rule may not affect output sections defined in an INSERT command properly. Fix this by moving processInsertCommands before adjustSectionsBeforeSorting. adjustSectionsBeforeSorting is somewhat misnamed. The order between it and sortInputSections does not matter. With the pass shuffle, the name of adjustSectionsBeforeSorting becomes wrong. Therefore rename it. The new name is not set into stone. The function mixes several tasks and the code may be refactored in a way that we may give them more meaningful names. With this patch, I think the behavior of attribute propagation becomes more reasonable. In particular, in the absence of non-INSERT SECTIONS, inserting a section after a SHF_ALLOC one will give us a SHF_ALLOC section, not a non-SHF_ALLOC one (see linkerscript/insert-after.test). Reviewed By: peter.smith, bluca Differential Revision: https://reviews.llvm.org/D118529	2022-02-01 10:16:12 -08:00
Fangrui Song	0c3704fdbd	[ELF] Deduplicate names of local symbols only with -O2 The deduplication requires a DenseMap of the same size of the local part of .strtab . I optimized it in `e205445434` but it is still quite slow. For Release build of clang, deduplication makes .strtab 1.1% smaller and makes the link 3% slower. For chrome, deduplication makes .strtab 0.1% smaller and makes the link 6% slower. I suggest that we only perform the optimization with -O2 (default is -O1). Not deduplicating local symbol names will simplify parallel symbol table write. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D118577	2022-02-01 10:10:22 -08:00
Fangrui Song	17a39aecd1	[ELF] Simplify code with invokeELFT. NFC	2022-02-01 09:53:29 -08:00
Fangrui Song	7518d38f0a	[ELF] De-template LinkerDriver::link. NFC Replace `f<ELFT>(x)` with `InvokeELFT(f, x)`. The size reduction comes from turning `link` from 4 specializations into 1. My x86-64 lld executable is 26KiB smaller. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D118551	2022-02-01 09:47:56 -08:00
Alexander Shaposhnikov	7244901ef6	[lld][MachO] Fix typo in rename.s	2022-02-01 11:57:04 +00:00
Alexander Shaposhnikov	f131d4d0d0	[lld][ELF] Add missing RUN in aarch64-adrp-ldr-got.s	2022-02-01 11:25:16 +00:00
Fangrui Song	4d38d7684c	[ELF] Change vector<Symbol *> to SmallVector. NFC	2022-02-01 00:16:42 -08:00
Fangrui Song	196aedb843	[ELF] Change vector<InputSection *> to SmallVector. NFC My x86-64 lld executable is 8KiB smaller.	2022-02-01 00:14:21 -08:00
Fangrui Song	d97749fabc	[ELF] Switch split-stack to use SmallVector. NFC My x86-64 lld executable is 1.1KiB smaller.	2022-02-01 00:09:30 -08:00
Jez Ng	96fb7d059d	[lld-macho][test] Add test for UUID format Reviewed By: keith Differential Revision: https://reviews.llvm.org/D118646	2022-01-31 23:52:42 -05:00
Fangrui Song	7aaf024dac	[BitcodeWriter] Fix cases of some functions `WriteIndexToFile` is used by external projects so I do not touch it.	2022-01-31 16:46:11 -08:00
Fangrui Song	457273fda5	[ELF] splitStrings: replace entSize==1 special case with manual loop unswitch. NFC My x86-64 lld executable is actually smaller.	2022-01-30 17:15:45 -08:00
Fangrui Song	7cd0c45364	[ELF] Simplify SectionBase::partition handling and make it live by default. NFC Previously an InputSectionBase is dead (`partition==0`) by default. SyntheticSection calls markLive and BssSection overrides that with markDead. It is more natural to make InputSectionBase live by default and let --gc-sections mark InputSectionBase dead. When linking a Release build of clang: * --no-gc-sections:, the removed `inputSections` loop decreases markLive time from 4ms to 1ms. * --gc-sections: the extra `inputSections` loop increases markLive time from 0.181296s to 0.188526s. This is as of we lose the removing one `inputSections` loop optimization (`4374824ccf`). I believe the loss can be mitigated if we refactor markLive.	2022-01-30 15:12:09 -08:00
Fangrui Song	73fd7d2304	[ELF] Change splitSections to objectFiles based parallelForEach. NFC The work is more balanced.	2022-01-30 13:34:27 -08:00
Keith Smiley	a6298fb160	[lld-macho] Add support for -add_empty_section This is a ld64 option equivalent to `-sectcreate seg sect /dev/null` that's useful for creating sections like the RESTRICT section. Differential Revision: https://reviews.llvm.org/D117749	2022-01-30 10:03:41 -08:00
Keith Smiley	0ab09a9009	[test][lld-macho] Improve LC_FUNCTION_STARTS test coverage Previously functions that aren't included in the symtab were also excluded from the function starts. Symbols missing from function starts degrades the debugger experience in the case you don't have debug info for them. Differential Revision: https://reviews.llvm.org/D114275	2022-01-30 09:46:36 -08:00
Fangrui Song	5a2020d069	[ELF] copyShtGroup: replace unordered_set<uint32_t> with DenseSet<uint32_t>. NFC We don't need to support the empty/tombstone key section index.	2022-01-30 01:18:41 -08:00
Fangrui Song	f318fd9bf8	[ELF] crtbegin/crtend test: replace std::regex with hand-written matcher. NFC My x86-64 lld executable is 18KiB smaller.	2022-01-30 01:11:19 -08:00
Fangrui Song	a7f9c002cd	[ELF][test] Test {crtbegin,crtend}{S,T}.o	2022-01-30 01:08:10 -08:00
Fangrui Song	fcd8817da5	[ELF] Simplify maybeCompress with lld::split. NFC	2022-01-30 00:44:19 -08:00
Fangrui Song	bc1369fae3	[ELF] Optimize MergeInputSection::splitNonStrings with resize_for_overwrite. NFC	2022-01-30 00:10:52 -08:00
Fangrui Song	988a03c585	[ELF] Add some MipsSection to InStruct and change make<MipsSection> to std::make_unique Similar to D116143. My x86-64 lld executable is 20+KiB smaller.	2022-01-29 23:55:29 -08:00
Fangrui Song	c0b986aa0c	[ELF] Remove make<std::unique_ptr<MemoryBuffer>>. NFC	2022-01-29 23:35:15 -08:00
Fangrui Song	8d8fce87bb	[ELF] De-template getErrorPlace. NFC	2022-01-29 23:05:54 -08:00
Fangrui Song	72a005bf19	[ELF] De-template getAndFeatures. NFC	2022-01-29 20:11:59 -08:00
Fangrui Song	d754c0b64f	[ELF] Make errorOrWarn opaque to decrease code size. NFC In my x86-64 lld, .text is -3.08Ki smaller.	2022-01-29 19:31:09 -08:00
Fangrui Song	ee647d4c96	[ELF] Optimize obj.getSectionIndex. NFC	2022-01-29 18:01:58 -08:00
Fangrui Song	5d00d37617	[ELF] Simplify eSyms. NFC	2022-01-29 17:00:38 -08:00
Fangrui Song	d86435c230	[ELF] createInputSection: remove unneeded argument. NFC	2022-01-29 16:52:32 -08:00
Fangrui Song	ee7720acd6	[ELF] Avoid repeated getObj construction in getSectionIndex. NFC	2022-01-29 16:51:00 -08:00
Fangrui Song	94e97e668c	[ELF] Reorder InputSectionBase::parent. NFC Move it before others.	2022-01-29 16:20:40 -08:00
Fangrui Song	b204d7c459	[ELF] Reorder InputFile members. NFC `symbols` is used frequently. Moving it before others can decrease offsets.	2022-01-29 16:10:52 -08:00
Fangrui Song	469c4124ab	[ELF] --gdb-index: switch to SmallVector. NFC	2022-01-29 15:24:56 -08:00
Fangrui Song	da0e5b885b	[ELF] Refactor -z combreloc * `RelocationBaseSection::addReloc` increases `numRelativeRelocs`, which duplicates the work done by RelocationSection<ELFT>::writeTo. * --pack-dyn-relocs=android has inappropropriate DT_RELACOUNT. AndroidPackedRelocationSection does not necessarily place relative relocations in the front and DT_RELACOUNT might cause semantics error (though our implementation doesn't and Android bionic doesn't use DT_RELACOUNT anyway.) Move `llvm::partition` to a new function `partitionRels` and compute `numRelativeRelocs` there. Now `RelocationBaseSection::addReloc` is trivial and can be moved to the header to enable inlining. The rest of DynamicReloc and `-z combreloc` handling is moved to the non-template `RelocationBaseSection::computeRels` to decrease code size. My x86-64 lld executable is 44+KiB smaller. While here, rename `sort` to `combreloc`.	2022-01-29 14:45:58 -08:00
Mateusz Mikuła	460830a9c6	[LLD][MinGW] Add --heap argument support Noticed in https://github.com/msys2/MINGW-packages/pull/10567. Differential Revision: https://reviews.llvm.org/D118405	2022-01-30 00:01:45 +02:00
Fangrui Song	f097c108b8	[ELF][test] Improve INSERT [AFTER\|BEFORE] and adjustSectionsBeforeSorting tests	2022-01-28 22:21:13 -08:00
Petr Hosek	71dcd9bd04	[ELF] Change the search order for dependent libraries When processing dependent libraries, if there's a directory of the same name as the library being searched for, either in the current directory or earlier in the search order, LLD will try to open it and report an error. This is because LLD uses file existence check. To address this issue we reverse the order, searching the library by basename first and only considering search paths later, and current directory last. Differential Revision: https://reviews.llvm.org/D118498	2022-01-28 20:46:01 -08:00
Fangrui Song	33b38339a0	[lld] Add module name to LTO inline asm diagnostic Close #52781: for LTO, the inline asm diagnostic uses `<inline asm>` as the file name (lib/CodeGen/AsmPrinter/AsmPrinterInlineAsm.cpp) and it is unclear which module has the issue. With this patch, we will see the module name (say `asm.o`) before `<inline asm>` with ThinLTO. ``` % clang -flto=thin -c asm.c && myld.lld asm.o -e f ld.lld: error: asm.o <inline asm>:1:2: invalid instruction mnemonic 'invalid' invalid ^~~~~~~ ``` For regular LTO, unfortunately the original module name is lost and we only get ld-temp.o. Reviewed By: #lld-macho, ychen, Jez Ng Differential Revision: https://reviews.llvm.org/D118434	2022-01-28 11:32:42 -08:00
Roger Kim	422084332a	[lld][Macho] Include dead-stripped symbols in mapfile ld64 outputs dead stripped symbols when using the -dead-strip flag. This change mimics that behavior for lld. ld64's -dead_strip flag outputs: ``` $ ld -map map basics.o -o out -dead_strip -L/Library/Developer/CommandLineTools/SDKs/MacOSX.sdk/usr/lib -lSystem $ cat map # Path: out # Arch: x86_64 # Object files: [ 0] linker synthesized [ 1] basics.o # Sections: # Address Size Segment Section 0x100003F97 0x00000021 __TEXT __text 0x100003FB8 0x00000048 __TEXT __unwind_info 0x100004000 0x00000008 __DATA_CONST __got 0x100008000 0x00000010 __DATA __ref_section 0x100008010 0x00000001 __DATA __common # Symbols: # Address Size File Name 0x100003F97 0x00000006 [ 1] _ref_local 0x100003F9D 0x00000001 [ 1] _ref_private_extern 0x100003F9E 0x0000000C [ 1] _main 0x100003FAA 0x00000006 [ 1] _no_dead_strip_globl 0x100003FB0 0x00000001 [ 1] _ref_from_no_dead_strip_globl 0x100003FB1 0x00000006 [ 1] _no_dead_strip_local 0x100003FB7 0x00000001 [ 1] _ref_from_no_dead_strip_local 0x100003FB8 0x00000048 [ 0] compact unwind info 0x100004000 0x00000008 [ 0] non-lazy-pointer-to-local: _ref_com 0x100008000 0x00000008 [ 1] _ref_data 0x100008008 0x00000008 [ 1] l_ref_data 0x100008010 0x00000001 [ 1] _ref_com # Dead Stripped Symbols: # Size File Name <<dead>> 0x00000006 [ 1] _unref_extern <<dead>> 0x00000001 [ 1] _unref_local <<dead>> 0x00000007 [ 1] _unref_private_extern <<dead>> 0x00000001 [ 1] _ref_private_extern_u <<dead>> 0x00000008 [ 1] _unref_data <<dead>> 0x00000008 [ 1] l_unref_data <<dead>> 0x00000001 [ 1] _unref_com ``` Reviewed By: int3, #lld-macho, thevinster Differential Revision: https://reviews.llvm.org/D114737	2022-01-28 10:51:27 -08:00
Alexander Shaposhnikov	0d71f2e097	[lld][ELF] Cleanup %t directory in tests, NFC	2022-01-28 08:41:52 +00:00
Sam Clegg	875ee937ae	[lld][WebAssembly] Handle TLS symbols in older object file In older versions of llvm (e.g. llvm 13), symbols were not individually flagged as TLS. In this case, the indent was to implicitly mark any symbols defined in TLS segments as TLS. However, we were not performing this implicit conversion if the segment was explicitly marked as TLS As it happens, llvm 13 was branched between the addition of the segment flag and the addition of the symbol flag. See: - segment flag added: https://reviews.llvm.org/D102202 - symbol flag added: https://reviews.llvm.org/D109426 Testing this is tricky because the assembler will imply the TLS status of the symbol based on the segment its declared in, so we are forced to use a yaml file here. Fixes: https://github.com/emscripten-core/emscripten/issues/15891 Differential Revision: https://reviews.llvm.org/D118414	2022-01-27 17:27:09 -08:00
Fangrui Song	3bc152769d	[ELF] Parallelize computeIsPreemptible	2022-01-26 23:45:04 -08:00
Fangrui Song	1372d53639	[ELF] Optimize two vector. NFC	2022-01-26 23:10:40 -08:00
Fangrui Song	afeb4a6628	[ELF] Optimize -Map. NFC getVA is slow. Avoid calling it in the llvm::sort comparator.	2022-01-26 22:51:31 -08:00
Fangrui Song	14b7785c09	[ELF] Simplify InputSection::writeTo. NFC	2022-01-26 22:03:26 -08:00
Fangrui Song	913914f0f8	[ELF] Simplify writing the Elf_Chdr header. NFC And avoiding changing `size` in `writeTo`.	2022-01-26 10:23:56 -08:00
Benjamin Kramer	f15014ff54	Revert "Rename llvm::array_lengthof into llvm::size to match std::size from C++17" This reverts commit `ef82063207`. - It conflicts with the existing llvm::size in STLExtras, which will now never be called. - Calling it without llvm:: breaks C++17 compat	2022-01-26 16:55:53 +01:00
serge-sans-paille	ef82063207	Rename llvm::array_lengthof into llvm::size to match std::size from C++17 As a conquence move llvm::array_lengthof from STLExtras.h to STLForwardCompat.h (which is included by STLExtras.h so no build breakage expected).	2022-01-26 16:17:45 +01:00
Fangrui Song	3704abaa16	[ELF] --gdb-index: replace vector<uint8_t> with unique_ptr<uint8_t[]>. NFC	2022-01-25 23:53:23 -08:00
Fangrui Song	571d6a7120	[ELF] Optimize .relr.dyn to not grow vector<uint64_t>. NFC	2022-01-25 23:33:40 -08:00
Fangrui Song	9fac78d0e1	[ELF] Simplify and optimize .relr.dyn NFC	2022-01-25 22:50:03 -08:00
Fangrui Song	2a80c3dbe1	[ELF] Clarify that Z_BEST_SPEED==1 in a comment. NFC	2022-01-25 22:40:53 -08:00
Fangrui Song	07bd467643	[ELF] --build-id: replace vector<uint8_t> with unique_ptr<uint8_t[]>. NFC We can't use C++20 make_unique_for_overwrite yet.	2022-01-25 22:39:43 -08:00
Fangrui Song	7438dbe078	[ELF] Cast size to size_t. NFC To fix ../../chromeclang/bin/../include/c++/v1/__algorithm/min.h:39:1: note: candidate template ignored: deduced conflicting types for parameter '_Tp' ('unsigned long' vs. 'unsigned long long') on macOS arm64.	2022-01-25 22:38:24 -08:00
Fangrui Song	223f9dea3d	[ELF] maybeCompress: replace vector<uint8_t> with unique_ptr<uint8_t[]>. NFC And mention that it is zero-initialized. I do not notice a speed-up if changed to be uninitialized by forcing the zero filler in writeTo.	2022-01-25 22:15:44 -08:00
Puyan Lotfi	227d18b3a8	[lld][macho][NFC] Make MachO/start-end.s test less britle by checking for _main: In start-end.s there is a lit check line `# SEG: _main` to begin the check at the start of the function main where `_main` is the Darwin name mangling for C main. Because the text file that FileCheck is getting as input has the path of the compiler build in it from llvm-mc and llvm-objdump, and because of the lack of a trailing colon in this check line we end up inadvertently matching against the line of text with the compiler path in it in the case where said path contains "_main" some place. This can be very likely if the compiler branch has "main" or "_main" in it. To fix this I include the training : since that will match on the function label and not the path line.	2022-01-25 19:23:51 -08:00
Fangrui Song	4cdc441690	[ELF] Parallelize --compress-debug-sections=zlib When linking a Debug build clang (265MiB SHF_ALLOC sections, 920MiB uncompressed debug info), in a --threads=1 link "Compress debug sections" takes 2/3 time and in a --threads=8 link "Compress debug sections" takes ~70% time. This patch splits a section into 1MiB shards and calls zlib `deflake` parallelly. DEFLATE blocks are a bit sequence. We need to ensure every shard starts at a byte boundary for concatenation. We use Z_SYNC_FLUSH for all shards but the last to flush the output to a byte boundary. (Z_FULL_FLUSH can be used as well, but Z_FULL_FLUSH clears the hash table which just wastes time.) The last block requires the BFINAL flag. We call deflate with Z_FINISH to set the flag as well as flush the output to a byte boundary. Under the hood, all of Z_SYNC_FLUSH, Z_FULL_FLUSH, and Z_FINISH emit a non-compressed block (called stored block in zlib). RFC1951 says "Any bits of input up to the next byte boundary are ignored." In a --threads=8 link, "Compress debug sections" is 5.7x as fast and the total speed is 2.54x. Because the hash table for one shard is not shared with the next shard, the output is slightly larger. Better compression ratio can be achieved by preloading the window size from the previous shard as dictionary (`deflateSetDictionary`), but that is overkill. ``` # 1MiB shards % bloaty clang.new -- clang.old FILE SIZE VM SIZE -------------- -------------- +0.3% +129Ki [ = ] 0 .debug_str +0.1% +105Ki [ = ] 0 .debug_info +0.3% +101Ki [ = ] 0 .debug_line +0.2% +2.66Ki [ = ] 0 .debug_abbrev +0.0% +1.19Ki [ = ] 0 .debug_ranges +0.1% +341Ki [ = ] 0 TOTAL # 2MiB shards % bloaty clang.new -- clang.old FILE SIZE VM SIZE -------------- -------------- +0.2% +74.2Ki [ = ] 0 .debug_line +0.1% +72.3Ki [ = ] 0 .debug_str +0.0% +69.9Ki [ = ] 0 .debug_info +0.1% +976 [ = ] 0 .debug_abbrev +0.0% +882 [ = ] 0 .debug_ranges +0.0% +218Ki [ = ] 0 TOTAL ``` Bonus in not using zlib::compress * we can compress a debug section larger than 4GiB * peak memory usage is lower because for most shards the output size is less than 50% input size (all less than 55% for a large binary I tested, but decreasing the initial output size does not decrease memory usage) Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D117853	2022-01-25 10:29:04 -08:00
Leonard Grey	a5c9d71780	[lld-macho] Move order file and call graph sorting into SectionPriorities See https://reviews.llvm.org/D117354 for context and discussion.	2022-01-25 12:18:15 -05:00
Leonard Grey	f23d57a632	[lld-macho] Rename CallGraphSort.{h,cpp} to SectionPriorities This is in preparation for moving the code that parses and processes order files into this file. See https://reviews.llvm.org/D117354 for context and discussion.	2022-01-25 12:15:14 -05:00
Fangrui Song	c03fdd3403	[ELF] Fix the branch range computation when reusing a thunk Notation: dst is `t->getThunkTargetSym()->getVA()` On AArch64, when `src-0x8000000-r_addend <= dst < src-0x8000000`, the condition `target->inBranchRange(rel.type, src, rel.sym->getVA(rel.addend))` may incorrectly consider a thunk reusable. `rel.addend = -getPCBias(rel.type)` resets the addend to 0 for AArch64/PPC and the zero addend is used by `rel.sym->getVA(rel.addend)` to check out-of-range relocations. See the test for a case this computation is wrong: `error: a.o:(.text_high+0x4): relocation R_AARCH64_JUMP26 out of range: -134217732 is not in [-134217728, 134217727]` I have seen a real world case with r_addend=19960. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D117734	2022-01-24 09:03:21 -08:00
serge-sans-paille	5f290c090a	Move STLFunctionalExtras out of STLExtras Only using that change in StringRef already decreases the number of preoprocessed lines from 7837621 to 7776151 for LLVMSupport Perhaps more interestingly, it shows that many files were relying on the inclusion of StringRef.h to have the declaration from STLExtras.h. This patch tries hard to patch relevant part of llvm-project impacted by this hidden dependency removal. Potential impact: - "llvm/ADT/StringRef.h" no longer includes <memory>, "llvm/ADT/Optional.h" nor "llvm/ADT/STLExtras.h" Related Discourse thread: https://llvm.discourse.group/t/include-what-you-use-include-cleanup/5831	2022-01-24 14:13:21 +01:00
Peter Smith	a08447d0de	[LLD][ELF][AArch64] Update test with incorrect REQUIRES line [NFC] D54759 introduced aarch64-combined-dynrel.s and aarch64-combined-dynrel-ifunc.s . Unfortunately the requires line at the top was AArch64 instead of aarch64 which means they were never run. Update the tests to use aarch64 and fix to match current lld output. Differential Revision: https://reviews.llvm.org/D117896	2022-01-24 10:04:28 +00:00
Sam Clegg	ac2f3df839	[lld][WebAssembly] Remove redundant config setting Unresolved symbols are not currently reported when building with `-shared` or `-pie` so setting unresolvedSymbols doesn't have any effect. Differential Revision: https://reviews.llvm.org/D117737	2022-01-20 15:21:56 -08:00
Roger Kim	f84023a812	[lld][macho] Stop grouping symbols by sections in mapfile. As per [Bug 50689](https://bugs.llvm.org/show_bug.cgi?id=50689), ``` 2. getSectionSyms() puts all the symbols into a map of section -> symbols, but this seems unnecessary. This was likely copied from the ELF port, which prints a section header before the list of symbols it contains. But the Mach-O map file doesn't print these headers. ``` This diff removes `getSectionSyms()` and keeps all symbols in a flat vector. What does ld64's mapfile look like? ``` $ llvm-mc -filetype=obj -triple=x86_64-apple-darwin test.s -o test.o $ llvm-mc -filetype=obj -triple=x86_64-apple-darwin foo.s -o foo.o $ ld -map map test.o foo.o -o out -L/Library/Developer/CommandLineTools/SDKs/MacOSX.sdk/usr/lib -lSystem ``` ``` [ 0] linker synthesized [ 1] test.o [ 2] foo.o 0x100003FB7 0x00000001 __TEXT __text 0x100003FB8 0x00000000 __TEXT obj 0x100003FB8 0x00000048 __TEXT __unwind_info 0x100004000 0x00000001 __DATA __common 0x100003FB7 0x00000001 [ 1] _main 0x100003FB8 0x00000000 [ 2] _foo 0x100003FB8 0x00000048 [ 0] compact unwind info 0x100004000 0x00000001 [ 1] _number ``` Perf numbers when linking chromium framework on a 16-Core Intel Xeon W Mac Pro: ``` base diff difference (95% CI) sys_time 1.406 ± 0.020 1.388 ± 0.019 [ -1.9% .. -0.6%] user_time 5.557 ± 0.023 5.914 ± 0.020 [ +6.2% .. +6.6%] wall_time 4.455 ± 0.041 4.436 ± 0.035 [ -0.8% .. -0.0%] samples 35 35 ``` Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D114735	2022-01-20 12:16:37 -08:00
Alexandre Ganea	83d59e05b2	Re-land [LLD] Remove global state in lldCommon Move all variables at file-scope or function-static-scope into a hosting structure (lld::CommonLinkerContext) that lives at lldMain()-scope. Drivers will inherit from this structure and add their own global state, in the same way as for the existing COFFLinkerContext. See discussion in https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html The previous land `f860fe3622` caused issues in https://lab.llvm.org/buildbot/#/builders/123/builds/8383, fixed by `22ee510dac`. Differential Revision: https://reviews.llvm.org/D108850	2022-01-20 14:53:26 -05:00
John Ericson	df31ff1b29	[cmake] Make include(GNUInstallDirs) always below project(..) Its defaulting logic must go after `project(..)` to work correctly, but `project(..)` is often in a standalone condition making this awkward, since the rest of the condition code may also need GNUInstallDirs. The good thing is there are the various standalone booleans, which I had missed before. This makes splitting the conditional blocks less awkward. Reviewed By: arichardson, phosek, beanz, ldionne, #libunwind, #libc, #libc_abi Differential Revision: https://reviews.llvm.org/D117639	2022-01-20 18:59:17 +00:00
Sam Clegg	feddf11502	[lld][WebAssemlby] Convert test to check disassembly output. NFC Differential Revision: https://reviews.llvm.org/D117739	2022-01-20 09:32:01 -08:00
Adrian Prantl	54ba376d08	Add missing include to fix modular build	2022-01-20 08:33:44 -08:00
Jez Ng	8f811effac	[lld-macho] Fix grammar in doc	2022-01-19 23:59:35 -08:00
Fangrui Song	a7a4115bf3	[ELF] Replace .zdebug string comparison with SHF_COMPRESSED check. NFC	2022-01-19 22:33:32 -08:00
Fangrui Song	03909c4400	[ELF] Remove StringRefZ StringRefZ does not improve performance. Non-local symbols always have eagerly computed nameSize. Most local symbols's lengths will be updated in either: * shouldKeepInSymtab * SymbolTableBaseSection::addSymbol Its benefit is offsetted by strlen in every call site (sums up to 5KiB code in a release x86-64 build), so using StringRefZ may be slower. In a -s link (uncommon) there is minor speedup, like ~0.3% for clang and chrome. Reviewed By: alexander-shaposhnikov Differential Revision: https://reviews.llvm.org/D117644	2022-01-19 20:09:41 -08:00
Alexandre Ganea	aba5b91b69	Re-land [CodeView] Add full repro to LF_BUILDINFO record This patch writes the full -cc1 command into the resulting .OBJ, like MSVC does. This allows for external tools (Recode, Live++) to rebuild a source file without any external dependency but the .OBJ itself (other than the compiler) and without knowledge of the build system. The LF_BUILDINFO record stores a full path to the compiler, the PWD (CWD at program startup), a relative or absolute path to the source, and the full CC1 command line. The stored command line is self-standing (does not depend on the environment). In the same way, MSVC doesn't exactly store the provided command-line, but an expanded version (a somehow equivalent of CC1) which is also self-standing. For more information see PR36198 and D43002. Differential Revision: https://reviews.llvm.org/D80833	2022-01-19 19:44:37 -05:00
Jez Ng	ef95d45138	[lld-macho] Mention string literal deduplication as a difference from ld64 Reviewed By: keith Differential Revision: https://reviews.llvm.org/D117250	2022-01-19 16:30:52 -08:00
Keith Smiley	3f38dc5c04	[lld-macho] Silence XAR deprecation warning If you're building this on macOS 12.x+ this produces a deprecation warning. I'm not sure what this means for the bitcode format going forward, but it seems safe to silence for now. Do we need to worry about GCC for this? Differential Revision: https://reviews.llvm.org/D117718	2022-01-19 13:51:55 -08:00
Keith Smiley	67090e3446	[lld-macho] Implement -noall_load This flag is the default, so in ld64 it is not implemented, but it can be useful to negate previous -all_load arguments. Specifically if your build system has some global linker flags, that you may want to negate for specific links. We use something like this today to make sure some C++ symbols are automatically discovered for all links, which passing -all_load hides. Differential Revision: https://reviews.llvm.org/D117629	2022-01-19 13:12:18 -08:00
Fangrui Song	5bd38a2826	[ELF] Fix split-stack caller with hidden non-split-stack callee Fix a regression after `aabe901d57` (`[ELF] Remove one redundant computeBinding`): isLocal() does not indicate that the symbol is originally local. For simplicity, just drop this optimization.	2022-01-19 12:25:01 -08:00
Fangrui Song	0aae2bf373	[lld-macho] Add --start-lib --end-lib In ld.lld, when an ObjFile/BitcodeFile is read in --start-lib state, the file is given archive semantics. --end-lib closes the previous --start-lib. A build system can use this feature as an alternative to archives. This patch ports the feature to lld-macho. --start-lib and --end-lib are positional, unlike usual ld64 options. I think the slight drawback does not matter as (a) reusing option names make build systems convenient (b) `--start-lib a.o b.o --end-lib` conveys more information than an alternative design: `-objlib a.o -objlib b.o` because --start-lib makes it clear which objects are in the same conceptual archive. This provides flexibility (c) `-objlib`/`-filelist` interaction may be weird. Close https://github.com/llvm/llvm-project/issues/52931 Reviewed By: #lld-macho, Jez Ng, oontvoo Differential Revision: https://reviews.llvm.org/D116913	2022-01-19 10:14:49 -08:00
Fangrui Song	d838bf2adc	[ELF] Allow non-bitcode archive with an empty index When an archive with an empty index contains only bitcode files, it is handled as a group of lazy (--start-lib) object files. If there is a non-bitcode file, there will be a diagnostic a la GNU ld. For some programs, the archive member extraction ratio is high (e.g. for chrome, 79% archive members are extracted according to --print-archive-stats=). Because symbol interning is cached for ObjFile::parseLazy but not for ArchiveFile, parsing an archive as a group of --start-lib object files may be faster. If the linker speculatively creates section representations for archive members, the archive index will not be used. If we take the above view, the archive index is essentially useless. If a user wants a fast build without using --start-lib, they may just build thin archives without index (`ar rcS --thin`). Therefore, I suggest that we no longer treat the code as a hack, instead as a supported feature. I believe we will do this anyway if we add parallel symbol interning (parallel symbol interning for lazy object files is simpler than that for archives). Ecosystem issues: * parseLazy actually has nearly the same behavior as ArchiveFile::parse, but the symbol order may be different. * users may get addicted to the behavior and build archives not working with GNU ld and gold. I think it is easy to rebuild archives to be compatible. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D117284	2022-01-19 10:01:53 -08:00
Ayke van Laethem	d649faff9c	[LLD][COFF] Support GNU style == aliases D46245 added support for this in llvm-libtool, but while lld-link can also create .lib files from .def files it didn't support aliases. I compared the Inputs/library.def test against the output from llvm-libtool and it matches, except for the fact that lld-link reorders functions for some reason. I have also verified that this fixes a bug I was running into while trying to compile .def files to .lib files in MinGW-w64 (using lld-link instead of llvm-libtool). Differential Revision: https://reviews.llvm.org/D113365	2022-01-19 14:22:13 +01:00
Fangrui Song	288082d45d	[ELF] Move SHT_REL/SHT_RELA handling from createInputSection to initializeSections This simplifies the code a bit. While here, * change the `multiple relocation sections` diagnostic from `fatal` to `error` and include the relocated section name. * drop less useful name from `getRelocTarget`. Without -r/--emit-relocs we don't need to get SHT_REL/SHT_RELA names.	2022-01-18 23:31:51 -08:00
Fangrui Song	84944b63f3	[ELF] Simplify ObjFile<ELFT>::initializeSections. NFC	2022-01-18 22:45:04 -08:00
Fangrui Song	5f404a749a	[ELF] De-template InputSectionBase::getLocation. NFC	2022-01-18 17:33:58 -08:00
Fangrui Song	eafd34581f	[ELF] Simplify/optimize EhInputSection::split and change some `fatal` to `errorOrWarn`. EhFrame.cpp is a helper file. We don't place all .eh_frame implementation there, so the code move is fine.	2022-01-18 17:03:23 -08:00
Vincent Lee	e5347f2556	[lld-macho] Allow deduplicate-literals to be overridden It's still uncertain but whether we want to have `deduplicate-literals` be the default flag for LLD out of the box or not. If `deduplicate-literals` is the default behavior, then we will need a way override it and not deduplicate. Luckily, we have `no_deduplicate` to fill this gap. For now, I've set the default to be false which aligns with the existing behavior. That can only always be changed after discussions on D117250. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D117387	2022-01-18 15:42:59 -08:00
Sam Clegg	ec47dba1c8	[lld][WebAssembly] Perform data relocations during start function We already perform memory initialization and apply global relocations during start. It makes sense to performs data relocations too. I think the reason we were not doing this already is solely historical. Differential Revision: https://reviews.llvm.org/D117412	2022-01-18 14:08:42 -08:00
Sam Clegg	ae1573e131	[lld][WebAssembly] Reinstate mistakenly disabled test. NFC It seems the first half of this test was disabled in error as part of https://reviews.llvm.org/D93066. Differential Revision: https://reviews.llvm.org/D117594	2022-01-18 12:22:22 -08:00
Alexander Shaposhnikov	2bb7f226af	[lld] Fix typo. NFC	2022-01-18 02:33:27 +00:00
Fangrui Song	83c7f5d3fb	[ELF] EhInputSection::split: remove unneeded check	2022-01-17 13:59:52 -08:00
Fangrui Song	ac0986f880	[ELF] Change std::vector<InputSectionBase *> to SmallVector There is no remaining std::vector<InputSectionBase> now. My x86-64 lld executable is 2KiB small.	2022-01-17 10:25:07 -08:00
Fangrui Song	f855074ed1	[ELF] GnuHashTableSection: replace stable_sort with 2-key sort. NFC strTabOffset stabilizes llvm::sort. My x86-64 executable is 5+KiB smaller.	2022-01-17 00:34:42 -08:00
Fangrui Song	54fe70bfba	[ELF] RelocationScanner::scanOne: replace rel.r_offset with offset. NFC	2022-01-17 00:05:27 -08:00
Fangrui Song	4c36567179	[ELF] Relocations: remove some cast<Undefined>. NFC	2022-01-17 00:02:47 -08:00
Fangrui Song	b8d4eb84d7	[ELF] De-template getAlternativeSpelling. NFC	2022-01-16 23:56:25 -08:00
Fangrui Song	9c4292a59d	[ELF] Remove unneeded SyntheticSection memset(, 0, ) After the D33630 fallout was properly fixed by `a4c5db30be`. Tested by D37462/D44986 tests, the new --no-rosegment test in build-id.s, and a few --rosegment/--no-rosegment programs.	2022-01-16 22:51:57 -08:00
Fangrui Song	a4c5db30be	[ELF] Remove redundant fillTrap and memset(, 0, ). NFC The new tests in build-id.s would catch problems if we made a mistake here.	2022-01-16 22:37:31 -08:00
Fangrui Song	d46054d75d	[ELF][test] Add --build-id tests for -z separate-loadable-segments and --no-rosegment	2022-01-16 22:36:22 -08:00
Fangrui Song	aad90763d9	[ELF] RelocationSection<ELFT>::writeTo: use unstable partition	2022-01-16 21:44:19 -08:00
Fangrui Song	769057a5d0	[ELF] Change some DenseMap<StringRef, > to DenseMap<CachedHashStringRef, >. NFC	2022-01-16 21:19:01 -08:00
Fangrui Song	e205445434	[ELF] StringTableSection: Use DenseMap<CachedHashStringRef> to avoid redundant hash computation 5~6% speedup when linking clang and chrome.	2022-01-16 21:02:05 -08:00
Alexandre Ganea	e6b153947d	Revert [LLD] Remove global state in lldCommon It seems to be causing issues on https://lab.llvm.org/buildbot/#/builders/123/builds/8383	2022-01-16 11:03:06 -05:00
Alexandre Ganea	30a4020a7d	[LLD] Supplement with more comments. Clarify the intention in `f860fe3622`.	2022-01-16 09:17:39 -05:00
Alexandre Ganea	f860fe3622	[LLD] Remove global state in lldCommon Move all variables at file-scope or function-static-scope into a hosting structure (lld::CommonLinkerContext) that lives at lldMain()-scope. Drivers will inherit from this structure and add their own global state, in the same way as for the existing COFFLinkerContext. See discussion in https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html Differential Revision: https://reviews.llvm.org/D108850	2022-01-16 08:57:57 -05:00
Fangrui Song	e7c8cd4a93	[ELF] Remove forEachRelSec. NFC	2022-01-16 00:28:47 -08:00
Fangrui Song	9e885eac54	[ELF] Remove !isLazy() condition from computeBinding Seems applicable since we demote lazy symbols to Undefined (D111365).	2022-01-15 23:58:15 -08:00
Fangrui Song	c0fc09ab91	[ELF] Remove config->relocatable condition from Symbol::computeBinding	2022-01-15 23:49:48 -08:00
Fangrui Song	b3cc47006b	[ELF] Speed up Symbol::computeBinding. NFC When computeBinding is inlined into includeInDynsym and computeIsPreemptible, the optimizer can remove the config->gnuUnique load.	2022-01-15 23:40:44 -08:00
Fangrui Song	01a51629c2	[ELF] Slightly speed up Symbol::includeInDynsym. NFC	2022-01-15 23:32:48 -08:00
Fangrui Song	7330fd236e	[ELF] Simplify Symbol::includeInDynsym	2022-01-15 23:27:45 -08:00
Fangrui Song	3736d0854a	[ELF] Optimize -z combreloc Sorting dynamic relocations is a bottleneck. Simplifying the comparator improves performance. Linking clang is 4~5% faster with --threads=8. This change may shuffle R_MIPS_REL32 for Mips and is a NFC for non-Mips.	2022-01-15 22:33:51 -08:00
Fangrui Song	102d0a2baf	[ELF] Simplify elf::link exit. NFC	2022-01-15 17:59:05 -08:00
Fangrui Song	8b2f33231c	[ELF] Make some diagnostics follow the convention	2022-01-15 10:46:25 -08:00
Phoebe Wang	0f499d1ed4	Revert "[X86][LLD] Update datelayout in LLD tests. NFCI" This reverts commit `9b43237128`.	2022-01-15 10:54:37 +08:00
Fangrui Song	7c269db779	[lld-macho] Simplify DeduplicatedCStringSection::finalizeContents. NFC Tail merge is slow and of low value. With regular string deduplication, we can just use the return value of StringTableBuilder::add. There is no noticeable performance increase because without deduplication `__cstring` is quite small (7.6MiB for chromium_framework). Reviewed By: #lld-macho, Jez Ng Differential Revision: https://reviews.llvm.org/D117273	2022-01-14 13:12:57 -08:00
Juergen Ributzka	3025c3eded	Replace PlatformKind with PlatformType. The PlatformKind/PlatformType enums contain the same information, which requires them to be kept in-sync. This commit changes over to PlatformType as the sole source of truth, which allows the removal of the redundant PlatformKind. The majority of the changes were in LLD and TextAPI. Reviewed By: cishida Differential Revision: https://reviews.llvm.org/D117163	2022-01-13 09:23:49 -08:00
Igor Kudrin	e00ac48df3	[ELF] Use tombstone values for discarded symbols in relocatable output This extends D81784. Sections can be discarded when linking a relocatable output. Before the patch, LLD did not update the content of debug sections and only replaced the corresponding relocations with R_*_NONE, which could break the debug information. Differential Revision: https://reviews.llvm.org/D116946	2022-01-13 11:38:26 +07:00
Fangrui Song	a5249c2dd2	[ELF] Change gnuHashTab/hashTab to unique_ptr. NFC and remove associated make<XXX> calls. My x86-64 `lld` is ~5KiB smaller.	2022-01-12 13:04:32 -08:00
Fangrui Song	43d927984c	[ELF] Refactor how .gnu.hash and .hash are discarded Switch to the D114180 approach which is simpler and allows gnuHashTab/hashTab to switch to unique_ptr.	2022-01-12 12:47:07 -08:00
Fangrui Song	b592cbf329	[ELF][test] Improve discard-gnu-hash.s to check DT_HASH and DT_GNU_HASH	2022-01-12 12:43:49 -08:00
Fangrui Song	bf9c8636f2	[ELF] Support discarding .relr.dyn `db08df0570` does not work because part.relrDyn is a unique_ptr and `reset` destroys the object which may still be referenced. This commit uses the D114180 approach. Also improve the test to check that there is no R_X86_64_RELATIVE.	2022-01-12 11:55:22 -08:00
Fangrui Song	d8b7ae947d	[ELF][test] Temporarily remove .relr.dyn test which is not working	2022-01-12 11:43:56 -08:00
Fangrui Song	f8476fd47b	[llvm-ar][test] Test that --plugin is ignored	2022-01-12 11:32:31 -08:00
Fangrui Song	5014d6fc53	[ELF] -Map --why-extract=: print despite errors Fix https://github.com/llvm/llvm-project/issues/53073 In case of a relocation error, GNU ld's link map includes the archive member extraction information but not output sections. Our -Map and --why-extract= are currently no-op in case of an error. This change makes the two options work. Reviewed By: ikudrin, peter.smith Differential Revision: https://reviews.llvm.org/D116838	2022-01-12 10:40:33 -08:00
Fangrui Song	db08df0570	[ELF] Support discarding .relr.dyn to prepare for D116838, otherwise for linkerscript/discard-section-err.s, there will be a null pointer dereference in `part.relrDyn->getParent()->size` in `finalizeSynthetic(part.relrDyn.get())`.	2022-01-12 10:38:59 -08:00
Leonard Grey	6db04b97e6	[lld-macho] Port CallGraphSort from COFF/ELF Depends on D112160 This adds the new options `--call-graph-profile-sort` (default), `--no-call-graph-profile-sort` and `--print-symbol-order=`. If call graph profile sorting is enabled, reads `__LLVM,__cg_profile` sections from object files and uses the resulting graph to put callees and callers close to each other in the final binary via the C3 clustering heuristic. Differential Revision: https://reviews.llvm.org/D112164	2022-01-12 10:47:04 -05:00
Phoebe Wang	9b43237128	[X86][LLD] Update datelayout in LLD tests. NFCI rG1bb0caf56168 changed the datalayout of f80 on Windows 32 bits. But it missed the related use in the LLD tests. This patch will fix the problem catched by buildbot.	2022-01-12 19:13:41 +08:00
Jez Ng	62790f366f	[lld-macho] Try and fix map-file.s' flakiness After {D117069}, map-file.s seems flaky. It seems that the "Total Write map file" section always exists, but the "Write map file" sub-section may or may not be emitted. So we check for the former.	2022-01-11 23:02:45 -08:00
Fangrui Song	bfd00ae31e	[lld-link] Change config and driver to unique_ptr Similar to D116143. My x86-64 `lld` is ~5KiB smaller. Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D116996	2022-01-11 18:31:25 -08:00
Jez Ng	e976c457c5	[lld-macho] Initialize separate time trace profiler for mapfile worker After {D115416}, the "Write map file" event no longer shows up in the time trace. Each time trace profiler instance is thread-local, but we had neglected to initialize a separate instance for the mapfile worker thread. Reviewed By: keith Differential Revision: https://reviews.llvm.org/D117069	2022-01-11 17:45:18 -08:00
Fangrui Song	97a5dccb7d	[lld-macho] Rename LazySymbol to LazyArchive. NFC D116913 will add LazyObject. Rename LazySymbol to LazyArchive to avoid confusion and mirror ELF. Reviewed By: #lld-macho, Jez Ng Differential Revision: https://reviews.llvm.org/D116914	2022-01-11 16:49:06 -08:00
Fangrui Song	37a1291885	[ELF] Add RelocationScanner. NFC Currently the way some relocation-related static functions pass around states is clumsy. Add a Resolver class to store some states as member variables. Advantages: * Avoid the parameter `InputSectionBase &sec` (this offsets the cost passing around `this` paramemter) * Avoid the parameter `end` (Mips and PowerPC hacks) * `config` and `target` can be cached as member variables to reduce global state accesses. (potential speedup because the compiler didn't know `config`/`target` were not changed across function calls) * If we ever want to reduce if-else costs (e.g. `config->emachine==EM_MIPS` for non-Mips) or introduce parallel relocation scan not handling some tricky arches (PPC/Mips), we can templatize Resolver `target` isn't used as much as `config`, so I change it to a const reference during the migration. There is a minor performance inprovement for elf::scanRelocations. Reviewed By: ikudrin, peter.smith Differential Revision: https://reviews.llvm.org/D116881	2022-01-11 09:54:53 -08:00
Simon Atanasyan	0199e47373	[mips][lld] Add test case to check symbol index reading on mips64el. NFC	2022-01-11 19:08:20 +03:00
Fangrui Song	5dbbd4eeb8	[ELF] Move OffsetGetter before some static functions. NFC to prepare for D116881.	2022-01-10 20:16:02 -08:00
Fangrui Song	477bc36d3b	[lld-macho] Change some global pointers to unique_ptr Similar to D116143. My x86-64 `lld` is ~8KiB smaller. Reviewed By: keith Differential Revision: https://reviews.llvm.org/D116902	2022-01-10 19:39:14 -08:00
Fangrui Song	2968467e39	[lld-macho][test] Add missing coverage for archive/dylib resolution after D115092 When `file->fetch(sym)` is replaced with a no-op, no test fails. The new test catches the case. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D116916	2022-01-10 19:36:24 -08:00
Fangrui Song	7f1955dc96	[ELF] Support mixed TLSDESC and TLS GD We only support both TLSDESC and TLS GD for x86 so this is an x86-specific problem. If both are used, only one R_X86_64_TLSDESC is produced and TLS GD accesses will incorrectly reference R_X86_64_TLSDESC. Fix this by introducing SymbolAux::tlsDescIdx. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D116900	2022-01-10 10:03:21 -08:00
Vincent Lee	7a161eb43b	[lld-macho] Fix shadowed variable This fixes a windows build failure from D115416.	2022-01-10 00:20:35 -08:00
Alexander Shaposhnikov	8acc3b4ab0	[lld][ELF] Support adrp+ldr GOT optimization for AArch64 This diff adds first bits to support relocation relaxations for AArch64 discussed on https://github.com/ARM-software/abi-aa/pull/106. In particular, the case of adrp x0, :got: symbol ldr x0, [x0, :got_lo12: symbol] is handled. Test plan: make check-all Differential revision: https://reviews.llvm.org/D112063	2022-01-10 05:20:37 +00:00
Fangrui Song	5d3bd7f360	[ELF] Move gotIndex/pltIndex/globalDynIndex to SymbolAux to decrease sizeof(SymbolUnion) by 8 on ELF64 platforms. Symbols needing such information are typically 1% or fewer (5134 out of 560520 when linking clang, 19898 out of 5550705 when linking chrome). Storing them elsewhere can decrease memory usage and symbol initialization time. There is a ~0.8% saving on max RSS when linking a large program. Future direction: * Move some of dynsymIndex/verdefIndex/versionId to SymbolAux * Support mixed TLSDESC and TLS GD without increasing sizeof(SymbolUnion) Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D116281	2022-01-09 13:43:27 -08:00
Kazu Hirata	8afcfbfb8f	Use true/false instead of 1/0 (NFC) Identified by modernize-use-bool-literals.	2022-01-09 12:21:06 -08:00
Kazu Hirata	b12fd13812	Fix bugprone argument comments. Identified by bugprone-argument-comment.	2022-01-09 12:21:02 -08:00
John Ericson	a1da5f3c2d	[lld] Deprecate using llvm-config to detect llvm installation This is continuing in the path of D51714, which did this for Clang. I have rearranged the source code Clang so one can diff the top-level CMakeLists.txt of Clang and LLD, ensuring we use the same strategy for both. Besides diffing the two files, `git diff --color-moved` on LLD also helps review. Reviewed By: beanz Differential Revision: https://reviews.llvm.org/D116492	2022-01-07 20:51:14 +00:00
John Ericson	44e3365775	[CMake] Factor out config prefix finding logic See the docs in the new function for details. I think I found every instance of this copy pasted code. Polly could also use it, but currently does something different, so I will save the behavior change for a future revision. We get the shared, non-installed CMake modules following the pattern established in D116472. It might be good to have LLD and Flang also use this, but that would be a functional change and so I leave it as future work. Reviewed By: beanz, lebedev.ri Differential Revision: https://reviews.llvm.org/D116521	2022-01-07 20:16:18 +00:00
Brian Cain	ddf1fb1f13	[Hexagon] Save results from partial compound Previously compounding was all-or-nothing. Now, the compounding attempts will iterate and yield the most compounds that still result in a valid packet.	2022-01-06 14:08:33 -08:00
Vincent Lee	a963bc490d	[lld-macho] Increase slops to prevent thunk out of range One of our internal arm64 apps hit a thunk out of range error when building with LLD. Per the comment, I'm arbitrarily increasing slop size to 256. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D116705	2022-01-06 12:29:12 -08:00
Vy Nguyen	fb9bfb2c59	[lld][macho][nfc] Make tests less britle by not expecting ordering in symbol table dump. (parial)fixes PR/53026 Differential Revision: https://reviews.llvm.org/D116718	2022-01-06 09:45:44 -05:00
Fangrui Song	6e4bbbfcc8	[ELF] Enforce double-dash form for --color-diagnostics/--rsp-quoting/--symbol-ordering-file They are LLD-specific and by convention we enforce the double-dash form to avoid collision with short options (e.g. weird `-c olor-diagnostics` interpretation in GNU ld). They are rarely used and to the best of my investigation the undesired single-dash forms are not used in the wild.	2022-01-06 01:02:14 -08:00
Fangrui Song	bfc2f4b122	[ELF] Update help messages to prefer canonical name for some long options And improve the help message for --pop-state.	2022-01-06 00:43:46 -08:00
Nico Weber	d5b2921faf	[lld/tests] Stop setting the "asserts" and "debug" features The last use of `REQUIRES: debug` was removed in 2013 in `72c5d3d7c` in favor of `REQUIRES: asserts`. The last use of `REQUIRES: asserts` was removed in 2015 in `251b0e268` when the old COFF linker was removed. lld's test suite currently has no behavior difference with respect to assertions or debug builds (and hasn't had it for 6 years). Let's keep it that way :) Differential Revision: https://reviews.llvm.org/D115941	2022-01-05 13:39:17 -05:00
Fangrui Song	954aaf7c14	[ELF] Demote all lazy symbols. NFC This complements D111365. D111365 did not demote isUsedInRegularObj lazy symbols just to work around a --symbol-ordering-file diagnostic quirk. The quirk was dropped by `00dd2d15a4`, so we can demote all lazy symbols now, not just the isUsedInRegularObj ones.	2022-01-05 10:24:29 -08:00
Nico Weber	085f078307	Revert "Revert D109159 "[amdgpu] Enable selection of `s_cselect_b64`."" This reverts commit `859ebca744`. The change contained many unrelated changes and e.g. restored unit test failes for the old lld port.	2022-01-05 13:10:25 -05:00
David Salinas	859ebca744	Revert D109159 "[amdgpu] Enable selection of `s_cselect_b64`." This reverts commit `640beb38e7`. That commit caused performance degradtion in Quicksilver test QS:sGPU and a functional test failure in (rocPRIM rocprim.device_segmented_radix_sort). Reverting until we have a better solution to s_cselect_b64 codegen cleanup Change-Id: Ibf8e397df94001f248fba609f072088a46abae08 Reviewed By: kzhuravl Differential Revision: https://reviews.llvm.org/D115960 Change-Id: Id169459ce4dfffa857d5645a0af50b0063ce1105	2022-01-05 17:57:32 +00:00
Nico Weber	5730d11c2b	[lld-link] Consistently print all /verbose output to stderr lld-link used to consistently print all /verbose output to stdout, and that was an intentional decision: https://reviews.llvm.org/rG4bce7bcc88f3 https://reviews.llvm.org/rGe6e206d4b4814 added message() and log(), and back then `log()` morally was just `if (verbose) message(...)` and message() wrote to stdout. So that change moved most /verbose-induced writes to outs() to log(). Except for the one in printDiscardedMessage(), since the check for `verbose` for that one is in the caller, in Writer::createSections(): if (config->verbose) sc->printDiscardedMessage(); Later, https://reviews.llvm.org/D41033 changed log() to write to stderr. That moved lld-link from writing all its /verbose output to stdout to writing almost all of its /verbose output to stderr -- except for printDiscardedMessage() output. This change moves printDiscardedMessage() to call log() as well, so that all /verbose output once again consistently goes to the same stream. Differential Revision: https://reviews.llvm.org/D116667	2022-01-05 11:52:04 -05:00
Benjamin Kramer	afc14a0d17	Retire llvm::make_reverse_iterator in favor of std::make_reverse_iterator std::make_reverse_iterator is a C++14 feature, gcc has it since GCC 5.1.	2022-01-05 14:07:08 +01:00
Fangrui Song	ddea3bf7d1	[ELF] Remove redundant cast. NFC	2022-01-05 02:07:15 -08:00
Fangrui Song	0940cd18f2	[ELF] --symbol-ordering-file: use getLocalSymbols. NFC	2022-01-05 02:06:31 -08:00
Fangrui Song	00dd2d15a4	[ELF] --symbol-ordering-file: remove weird !lazy condition for "no such symbol" diagnostic The diagnostic is emitted for an unextracted lazy symbol but suppressed for an undefined symbol. Suppressing the diagnostic for unextracted lazy symbol probably makes more sense because (a) an unextracted lazy symbol is quite similar to an undefined symbol and (b) an unextracted lazy symbol is different from "no such symbol".	2022-01-05 02:04:36 -08:00
Fangrui Song	935229f66b	[ELF] Symbol::getVA: assert not called on a lazy symbol The code path is dead after D111365.	2022-01-05 00:46:48 -08:00
Xu Mingjie	b5149f4e66	[LTO] Fix assertion failed when flushing bitcode incrementally for LTO output. In https://reviews.llvm.org/D86905, we introduce an optimization, when lld emits LLVM bitcode, we allow bitcode writer flush data to disk early when buffered data size is above some threshold. But when `--plugin-opt=emit-llvm` and `-o /dev/null` are used, lld will trigger assertion `BytesRead >= 0 && static_cast<size_t>(BytesRead) == BytesFromDisk`. When we write output to /dev/null, BytesRead is zero, but at this program point BytesFromDisk is always non-zero. Reviewed By: stephan.yichao.zhao, MaskRay Differential Revision: https://reviews.llvm.org/D112297	2022-01-04 21:40:23 -08:00
Fangrui Song	292395329c	[lld-link] Remove unneeded lto::InputFile::create after D116434	2022-01-04 19:38:32 -08:00
Luís Ferreira	10e40a4ea3	[lld] Add support for other demanglers other than Itanium LLVM core library supports demangling other mangled symbols other than itanium, such as D and Rust. LLD should use those demanglers in order to output pretty demangled symbols on error messages. Reviewed By: MaskRay, #lld-macho Differential Revision: https://reviews.llvm.org/D116279	2022-01-05 03:25:41 +00:00
Fangrui Song	d496abbe2a	[lld-link] Replace LazyObjFile with lazy ObjFile/BitcodeFile Similar to ELF `3a5fb57393`. * previously when a LazyObjFile was extracted, a new ObjFile/BitcodeFile was created; now the file is reused, just with `lazy` cleared * avoid the confusing transfer of `symbols` from LazyObjFile to the new file * simpler code, smaller executable (5200+ bytes smaller on x86-64) * make eager parsing feasible (for parallel section/symbol table initialization) Reviewed By: aganea, rnk Differential Revision: https://reviews.llvm.org/D116434	2022-01-04 15:11:44 -08:00
Markus Böck	c40049d6d7	[lld][MinGW] Remove `--no-as-needed` from ignored flags In the post commit discussion of https://reviews.llvm.org/D116484 it was concluded that `--no-as-needed` should not be ignored. `--as-needed` stays ignored as it is already the default behaviour on COFF, which cannot be changed.	2022-01-03 23:01:02 +01:00
Kazu Hirata	5e1177302b	[wasm] Use nullptr instead of NULL (NFC) Identified with modernize-use-nullptr.	2022-01-02 10:20:21 -08:00
Markus Böck	1b708b67f6	[lld][MinGW] Ignore `--[no-]as-neeed` flags in MinGW driver These flags are specific to ELF, but are still accepted by GNU ld, even if it does not do anything. This patch adds them as ignored option for the sake of compatibility. Part of https://github.com/llvm/llvm-project/issues/52947 Differential Revision: https://reviews.llvm.org/D116484	2022-01-02 12:03:21 +01:00
John Ericson	b3af9fbcc9	Set the path to the shared cmake modules based on the llvm directory It’s still possible to build parts of the main llvm build (lld, clang etc) by symlinking them into llvm/tools. Reviewed By: Ericson2314 Differential Revision: https://reviews.llvm.org/D116472	2022-01-01 17:59:08 +00:00
John Ericson	896537048d	[lld][CMake] Use `GNUInstallDirs` to support custom installation dirs Extracted from D99484. My new plan is to start from the outside and work inward. Reviewed By: stephenneuendorffer Differential Revision: https://reviews.llvm.org/D115568	2021-12-31 18:57:57 +00:00
Fangrui Song	ed67d5a03a	[ELF] Switch cNamedSections to SmallVector. NFC Make it smaller	2021-12-30 16:08:26 -08:00
Fangrui Song	441de75f69	[lld][docs] Update _templates/indexsidebar.html after Bugzilla->GitHub issue migration	2021-12-30 13:34:45 -08:00
Fangrui Song	dabac5feec	[ELF][LTO] Cache symbol table of lazy BitcodeFile Similar to D62188: a BitcodeFile's symbol table may be iterated twice, once in --start-lib (lazy) state, and once in the non-lazy state. This patch makes `parseLazy` save `symbols[i]` so that the non-lazy state does not need to re-insert to the global symbol table. Avoiding a redundant `saver.save` may save memory. `Maximum resident set size (kbytes)` for a large --thinlto-index-only link: * without the patch: 10164000 * with the patch: 10095716 (0.6% decrease) Note: we can remove `saver.save` if `BitcodeCompiler::add` does not transfer the ownership of `f.obj` in `checkError(ltoObj->add(std::move(f.obj), resols));`. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D116390	2021-12-30 12:03:29 -08:00
Fangrui Song	a96fe1bf3b	[ELF][LTO] Call madvise(MADV_DONTNEED) on MemoryBuffer instances @tejohnson noticed that freeing MemoryBuffer instances right before `lto->compile` can save RSS, likely because the memory can be reused by LTO indexing (e.g. ThinLTO import/export lists).). For ELFFileBase instances, symbol and section names are backed by MemoryBuffer, so destroying MemoryBuffer would make some infrequent passes (parseSymbolVersion, reportBackrefs) crash and make debugging difficult. For a BitcodeFile, its content is completely unused, but destroying its MemoryBuffer makes the buffer identifier inaccessible and may introduce constraints for future changes. This patch leverages madvise(MADV_DONTNEED) which achieves the major gain without the latent issues. `Maximum resident set size (kbytes): ` for a large --thinlto-index-only link: * current behavior: 10146104KiB * destroy MemoryBuffer instances: 8555240KiB * madvise(MADV_DONTNEED) just bitcodeFiles and lazyBitcodeFiles: 8737372KiB * madvise(MADV_DONTNEED) all MemoryBuffers: 8739796KiB (16% decrease) Depends on D116366 Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D116367	2021-12-30 11:36:58 -08:00
Luís Ferreira	8792cd75d0	Revert "[lld] Add support for other demanglers other than Itanium" This reverts commit `e60d6dfd5a`. clang-ppc64le-rhel buildbot failed (https://lab.llvm.org/buildbot#builders/57/builds/13424): tools/lld/MachO/CMakeFiles/lldMachO.dir/Symbols.cpp.o: In function `lld::demangle(llvm::StringRef, bool)': Symbols.cpp:(.text._ZN3lld8demangleEN4llvm9StringRefEb[_ZN3lld8demangleEN4llvm9StringRefEb]+0x90): undefined reference to `llvm::demangle(std::string const&)'	2021-12-30 18:04:21 +00:00
Luís Ferreira	e60d6dfd5a	[lld] Add support for other demanglers other than Itanium LLVM core library supports demangling other mangled symbols other than itanium, such as D and Rust. LLD should use those demanglers in order to output pretty demangled symbols on error messages. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D116279	2021-12-30 17:52:38 +00:00
Fangrui Song	de92a13fec	[ELF] --gc-sections: Work around SHT_PROGBITS .init_array.N for Rust See https://github.com/rust-lang/rust/issues/92181	2021-12-28 16:40:51 -08:00
Mike Hommey	319181f767	[lld-macho] Fix alignment of TLV data sections References from thread-local variable sections are treated as offsets relative to the start of the thread-local data memory area, which is initialized via copying all the TLV data sections (which are all contiguous). If later data sections require a greater alignment than earlier ones, the offsets of data within those sections won't be guaranteed to aligned unless we normalize alignments. We therefore use the largest alignment for all TLV data sections. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D116263	2021-12-28 19:03:13 -05:00
Fangrui Song	49f646a9ed	[ELF] Change EhInputSection::pieces to SmallVector The decreased size does not matter that much as one file contributes at most one EhInputSection.	2021-12-27 21:34:38 -08:00
Fangrui Song	cb203f3f92	[ELF] Change InStruct/Partition pointers to unique_ptr and remove associated make<XXX> calls. gnuHash and sysvHash are unchanged, otherwise LinkerScript::discard would destroy the objects which may be referenced by input section descriptions. My x86-64 lld executable is 121+KiB smaller.	2021-12-27 18:15:23 -08:00
Fangrui Song	049cd480a0	[ELF] Use const reference. NFC	2021-12-27 17:05:48 -08:00
Fangrui Song	3c94d5d9d2	[ELF] addOrphanSections: avoid std::function	2021-12-27 15:57:38 -08:00
Fangrui Song	b8a4780032	[ELF] Simplify and optimize SymbolTableSection<ELFT>::writeTo	2021-12-27 15:16:14 -08:00
Fangrui Song	80c14dcc0e	[ELF] Delete stale declaration. NFC	2021-12-27 12:56:38 -08:00
Fangrui Song	e590c9bc73	[ELF] -r: move zero OutputSection::addr code into finalizeAddressDependentContent Ensure addresses are unchanged after finalizeAddressDependentContent.	2021-12-27 12:10:23 -08:00
Fangrui Song	abc388ed3c	[ELF] Move excludeLibs/redirectSymbols/replaceCommonSymbols adjacent Make post-thinlto-index symbol resolution passes closer.	2021-12-27 00:31:55 -08:00
Fangrui Song	66c550f8de	[ELF] Delete unused LazyObjKind	2021-12-27 00:03:53 -08:00
Fangrui Song	b07292f77a	[ELF] Serialize deleteFallThruJmpInsn to fix concurrency issue New deleteFallThruJmpInsn calls `make<JumpInstrMod>` which cannot be called concurrently. Losing parallelism is unfortunate but we can think of a better approach if parallelism here justifies itself.	2021-12-26 23:26:13 -08:00
Fangrui Song	315554e873	[ELF] Unify sizeof(InputSection) limits for _WIN32 and others Windows sizeof(InputSection) seems to match non-Windows now.	2021-12-26 23:02:24 -08:00
Fangrui Song	e90c8c0422	[ELF] Optimize basic block section bytesDropped/jumpInstrMods and make them more space efficient. This decreases sizeof(InputSection) from 176 to 160, and decreases peak memory usage by 0.3% when linking Chrome.	2021-12-26 22:17:30 -08:00
Fangrui Song	64038ef8c3	[ELF] ScriptParser: change std::vector to SmallVector	2021-12-26 20:12:55 -08:00
Fangrui Song	e9262edf0d	[ELF] SymbolTable:🔣 don't filter out PlaceholderKind Placeholders (-y and redirectSymbols removed versioned symbols) are very rare and the check just makes symbol table iteration slower. Most iterations filter out placeholders anyway, so this change just drops the filter behavior. For "Add symbols to symtabs", we need to ensure that redirectSymbols sets isUsedInRegularObj to false when making a symbol placeholder, to avoid an assertion failure in SymbolTableSection<ELFT>::writeTo. My .text is 2KiB smaller. The speed-up linking chrome is 0.x%.	2021-12-26 18:11:45 -08:00
Fangrui Song	7924b3814f	[ELF] Add Symbol::hasVersionSuffix "Process symbol versions" may take 2+% time. "Redirect symbols" may take 0.6% time. This change speeds up the two passes and makes `*sym.getVersionSuffix() == '@'` in the `undefined reference` diagnostic cleaner. Linking chrome (no debug info) and another large program is 1.5% faster. For empty-ver2.s: the behavior now matches GNU ld, though I'd consider the input invalid and the exact behavior does not matter.	2021-12-26 17:25:54 -08:00
Fangrui Song	469144ffa3	[ELF] De-template InputSectionBase::getEnclosingFunction	2021-12-26 15:21:22 -08:00
Fangrui Song	213896bc5a	[ELF] Remove unused InputSection::getOffsetInFile	2021-12-26 15:18:56 -08:00
Fangrui Song	a1c2ee0147	[ELF] LinkerScript/OutputSection: change other std::vector members to SmallVector 11+KiB smaller .text with both libc++ and libstdc++ builds.	2021-12-26 13:53:47 -08:00
Fangrui Song	10316a6f94	[ELF] Change InputSectionDescription members from vector to SmallVector This decreases sizeof(lld:🧝:InputSectionDescription) from 264 to 232.	2021-12-26 13:06:54 -08:00
Fangrui Song	bf7f3dd74e	[ELF] Move outSecOff addition from InputSection::writeTo to the caller Simplify the code a bit and improve consistency with SyntheticSection::writeTo.	2021-12-26 12:11:41 -08:00
Fangrui Song	aabe901d57	[ELF] Remove one redundant computeBinding This does resolve the redundancy in includeInDynsym().	2021-12-25 23:59:27 -08:00
Fangrui Song	20b4704da3	[ELF] reportRangeError: mention symbol name for non-STT_SECTION local symbols like non-global symbols	2021-12-25 23:46:47 -08:00
Fangrui Song	2c8ebab32e	[ELF] sortSymTabSymbols: change vector to SmallVector This function may take ~1% time. SmallVector<SymbolTableEntry, 0> is smaller (16 bytes instead of 24) and more efficient.	2021-12-25 23:16:27 -08:00
Fangrui Song	d5e310b154	[ELF][test] Make some TLS tests less sensitive to addresses	2021-12-25 22:05:20 -08:00
Fangrui Song	a00f480fe8	[ELF] scanReloc: remove unused start parameter. NFC This was once used as a workaround for detecting missing PPC64 TLSGD/TLSLD relocations produced by ancient IBM XL C/C++.	2021-12-25 14:34:06 -08:00
Fangrui Song	dd4f5d4ae5	[ELF] De-template handleTlsRelocation. NFC	2021-12-25 14:23:13 -08:00
Fangrui Song	70912420bb	[ELF] Move TLS dynamic relocations to postScanRelocations This temporarily increases sizeof(SymbolUnion), but allows us to mov GOT/PLT/etc index members outside Symbol in the future. Then, we can make TLSDESC and TLSGD use different indexes and support mixed TLSDESC and TLSGD (tested by x86-64-tlsdesc-gd-mixed.s). Note: needsTlsGd and needsTlsGdToIe may optionally be combined. Test updates are due to reordered GOT entries.	2021-12-24 22:36:49 -08:00
Fangrui Song	cde37a7e5a	[ELF][test] Add tests for mixed GD-to-IE and IE, mixed TLSDESC and GD Note: mixed TLSDESC and GD currently does not work.	2021-12-24 22:24:15 -08:00
Kazu Hirata	62e48ed10f	Use isa instead of dyn_cast (NFC)	2021-12-24 21:22:27 -08:00
Kazu Hirata	9c0a4227a9	Use Optional::getValueOr (NFC)	2021-12-24 20:57:40 -08:00
Fangrui Song	40fae4d8fc	[ELF] Optimize replaceCommonSymbols This decreases the 0.2% time (no debug info) to nearly no.	2021-12-24 19:01:51 -08:00
Fangrui Song	745420d3f4	[ELF] Cache global variable `target` in relocate* This avoid repeated load of the unique_ptr in hot paths.	2021-12-24 17:54:12 -08:00
Fangrui Song	b5a0f0f397	[ELF] Add ELFFileBase::{elfShdrs,numELFShdrs} to avoid duplicate llvm::object::ELFFile::sections() This mainly avoid `relsOrRelas` cost in `InputSectionBase::relocate`. `llvm::object::ELFFile::sections()` has redundant and expensive checks.	2021-12-24 17:10:38 -08:00
Fangrui Song	5e3403bd22	[ELF] parseLazy: skip local symbols	2021-12-24 13:16:34 -08:00
Fangrui Song	e694180033	[ELF] Optimize --wrap to only check non-local symbols	2021-12-24 12:28:59 -08:00
Fangrui Song	e1b6b5be46	[ELF] Avoid referencing SectionBase::repl after ICF It is fairly easy to forget SectionBase::repl after ICF. Let ICF rewrite a Defined symbol's `section` field to avoid references to SectionBase::repl in subsequent passes. This slightly improves the --icf=none performance due to less indirection (maybe for --icf={safe,all} as well if most symbols are Defined). With this change, there is only one reference to `repl` (--gdb-index D89751). We can undo `f4fb5fd752` (`Move Repl to SectionBase.`) but move `repl` to `InputSection` instead. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D116093	2021-12-24 12:09:48 -08:00
Fangrui Song	0d749e13f7	[ELF] Optimize symbol initialization and resolution Avoid repeated load of global pointer (symtab) / members (sections.size(), firstGlobal) in the hot paths. And remove some unneeded this->	2021-12-23 21:54:32 -08:00
Fangrui Song	1d285f2de0	[ELF] Simplify and optimize ObjFile<ELFT>::parseLazy	2021-12-23 20:23:13 -08:00
Fangrui Song	1abbbc7b24	[ELF] scanVersionScript: remove unused variable	2021-12-23 18:18:25 -08:00
Fangrui Song	a2baf634a1	[ELF] Simplify SymbolTable::insert. NFC	2021-12-23 17:59:25 -08:00
Fangrui Song	417cd2e5c5	[ELF] SymbolTable: change some vector<Symbol *> to SmallVector The generated assembly for Symbol::insert is much shorter (std::vector resize is inefficient) and enables some inlining.	2021-12-23 16:49:38 -08:00
Fangrui Song	464cc4c920	[ELF] Remove stale comment which was duplicated in MarkLive<ELFT>::run Pointed out by thakis	2021-12-23 15:13:46 -08:00
Kristina Bessonova	81378f7e56	Revert "[DwarfDebug] Support emitting function-local declaration for a lexical block" & dependent patches Try to revert D113741 once again. This also reverts `0ac75e82ff` (D114705) as it causes LLDB's lldb-api.lang/cpp/nsimport.TestCppNsImport.py test failure w/o D113741. This reverts commit `f9607d45f3`. Differential Revision: https://reviews.llvm.org/D116225	2021-12-24 00:47:04 +02:00
Fangrui Song	bf45624ba0	[ELF][PPC32] Support .got2 in an output section description I added `PPC32Got2Section` D62464 to support .got2 but did not implement .got2 in another output section. PR52799 has a linker script placing .got2 in .rodata, which causes a null pointer dereference because a MergeSyntheticSection's file is nullptr. Add the support.	2021-12-23 11:32:44 -08:00
Fangrui Song	4374824ccf	[ELF] --gc-sections: combine two iterations over inputSections There is a slight speed-up.	2021-12-23 09:53:08 -08:00
Fangrui Song	33319dde2a	[ELF] LTO: skip expensive usedStartStop initialization if bitcodeFiles.empty() This may cost 1.3+% of total link time.	2021-12-23 01:52:54 -08:00
Fangrui Song	61312fd5aa	[ELF] sortSections: delete unneeded outSecOff assignment Related to D45368 but outSecOff is unneeded because resolveShfLinkOrder uses stable_sort.	2021-12-23 01:24:32 -08:00
Fangrui Song	5d0be553fa	[ELF] Optimize copyLocalSymbols. NFC	2021-12-23 00:59:29 -08:00
Fangrui Song	ad26b0b233	Revert "[ELF] Make Partition/InStruct members unique_ptr and remove associate make<XXX>" This reverts commit `e48b1c8a27`. This reverts commit `d019de23a1`. The changes caused memory leaks (non-final classes cannot use unique_ptr).	2021-12-22 23:55:11 -08:00
Fangrui Song	ba948c5a9c	[ELF] Use SmallVector for some global variables (Files and Sections). NFC My lld executable is 26+KiB smaller.	2021-12-22 22:30:08 -08:00

... 4 5 6 7 8 ...

15321 Commits