llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	b483ce1228	[ELF][ARM] Fix unneeded thunk for branches to hidden undefined weak Similar to D123750 for AArch64.	2022-04-14 23:58:13 -07:00
Fangrui Song	02eab52866	[ELF][AArch64] Fix unneeded thunk for branches to hidden undefined weak Similar to D119787 for PPC64. A hidden undefined weak may change its binding to local before some `isUndefinedWeak` code, so some `isUndefinedWeak` code needs to be changed to `isUndefined`. The undefined non-weak case has been errored, so just using `isUndefined` is fine. The Linux kernel recently has a usage that a branch from 0xffff800008491ee0 references a hidden undefined weak symbol `vfio_group_set_kvm`. It relies on the behavior that a branch to undefined weak resolving to the next instruction, otherwise it'd see spurious relocation out of range errors. Fixes https://github.com/ClangBuiltLinux/linux/issues/1624 Differential Revision: https://reviews.llvm.org/D123750	2022-04-14 11:32:30 -07:00
Matt Arsenault	63fe6d7eae	lld/AMDGPU: Fix asserts if no object files are involved in link Fixes issue 47690. The reproduction steps produced a shared object from clang directly, and then fed the shared object back into lld. With no regular object files, this assert was hit. I'm not sure if we need to or should be looking for equivalent fields in shared objects.	2022-04-08 14:18:52 -04:00
Nico Weber	cd52b35ee4	fix comment typos to cycle bots	2022-04-04 08:56:18 -04:00
Jacob Lambert	71b162c4bd	[AMDGPU][LLD] Adding support for ABI version 5 option Code object version 5 will use the same EFlags as version 4, so we only need to add an additional case Differential Revision: https://reviews.llvm.org/D122190	2022-03-23 01:22:37 -07:00
Fangrui Song	767e64fc11	[ELF] Support some absolute/PC-relative relocation types for REL format ctfconvert seems to use REL-format `.rel.SUNW_dof` for 32-bit architectures. ``` Binary file usr/ports/lang/perl5.32/work/perl-5.32.1/dtrace_mini.o matches [alfredo.junior@dell-a ~/tmp/llvm-bug]$ readelf -r dtrace_mini.o Relocation section (.rel.SUNW_dof): r_offset r_info r_type st_value st_name 00000184 0000281a R_PPC_REL32 00000000 $dtrace1772974259.Perl_dtrace_probe_load ``` Support R_PPC_REL32 to fix `ld.lld: error: drti.c:(.SUNW_dof+0x4E4): internal linker error: cannot read addend for relocation R_PPC_REL32`. While here, add some common relocation types for AArch64, PPC, and PPC64. We perform minimum tests. Reviewed By: adalava, arichardson Differential Revision: https://reviews.llvm.org/D120535	2022-02-25 19:25:18 +00:00
Fangrui Song	38fbedab32	[ELF] Don't rely on Symbols.h's transitive inclusion of InputFiles.h. NFC	2022-02-23 20:44:34 -08:00
Fangrui Song	ae1ba6194f	[ELF] Replace uncompressed InputSectionBase::data() with rawData. NFC In many call sites we know uncompression cannot happen (non-SHF_ALLOC, or the data (even if compressed) must have been uncompressed by a previous pass). Prefer rawData in these cases. data() increases code size and prevents optimization on rawData.	2022-02-21 00:39:26 -08:00
Fangrui Song	53b59fdc52	[ELF][PPC64] Fix assertion failure for branches to hidden undefined weak for -no-pie Reported by Stefan Pintilie in D119773. For a branch to a hidden undefined weak symbol, there is an `assert(sym->getVA());` failure in PPC64LongBranchTargetSection::writeTo for a -no-pie link. The root cause is that we unnecessarily create the thunk for the -no-pie link. Fix this by changing the condition to just `s.isUndefined()`. See the inline comment. Rename ppc64-weak-undef-call.s to ppc64-undefined-weak.s to be consistent with other architectures. Reviewed By: sfertile, stefanp Differential Revision: https://reviews.llvm.org/D119787	2022-02-15 12:57:27 -08:00
Fangrui Song	27bb799095	[ELF] Clean up headers. NFC	2022-02-07 21:53:34 -08:00
Fangrui Song	977a1a523c	[ELF] Symbol::replace: use the old nameData/nameSize. NFC Currently `this->getName() == newSym.getName()`. By keeping the old nameData/nameSize, newSym's nameData/nameSize will be ignored. The call sites can avoid calling getName(). printTraceSymbol needs to take the symbol name since `other`'s name is empty.	2022-02-05 16:34:02 -08:00
Fangrui Song	53fc5d9b9a	[ELF] Support R_PPC_NONE/R_PPC64_NONE in getImplicitAddend Similar to `f457863ae3`	2022-02-04 15:13:37 -08:00
Fangrui Song	f457863ae3	[ELF] Support REL-format R_AARCH64_NONE relocation -fprofile-use=/-fprofile-sample-use= compiles may produce REL-format .rel.llvm.call-graph-profile even if the prevailing format is RELA on AArch64. Add R_AARCH64_NONE to getImplicitAddend to fix this linker error: ``` ld.lld: error: internal linker error: cannot read addend for relocation R_AARCH64_NONE PLEASE submit a bug report to https://crbug.com and run tools/clang/scripts/process_crashreports.py (only works inside Google) which will upload a report and include the crash backtrace. ```	2022-02-04 13:20:49 -08:00
Alexander Shaposhnikov	4450a2a23d	[lld][ELF] Add support for ADRP+ADD optimization for AArch64 This diff adds support for ADRP+ADD optimization for AArch64 described in `d2ca58c54b` i.e. under appropriate constraints ADRP x0, symbol ADD x0, x0, :lo12: symbol can be turned into NOP ADR x0, symbol Test plan: make check-all Differential revision: https://reviews.llvm.org/D117614	2022-02-02 06:09:55 +00:00
Alexandre Ganea	83d59e05b2	Re-land [LLD] Remove global state in lldCommon Move all variables at file-scope or function-static-scope into a hosting structure (lld::CommonLinkerContext) that lives at lldMain()-scope. Drivers will inherit from this structure and add their own global state, in the same way as for the existing COFFLinkerContext. See discussion in https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html The previous land `f860fe3622` caused issues in https://lab.llvm.org/buildbot/#/builders/123/builds/8383, fixed by `22ee510dac`. Differential Revision: https://reviews.llvm.org/D108850	2022-01-20 14:53:26 -05:00
Alexander Shaposhnikov	2bb7f226af	[lld] Fix typo. NFC	2022-01-18 02:33:27 +00:00
Alexandre Ganea	e6b153947d	Revert [LLD] Remove global state in lldCommon It seems to be causing issues on https://lab.llvm.org/buildbot/#/builders/123/builds/8383	2022-01-16 11:03:06 -05:00
Alexandre Ganea	f860fe3622	[LLD] Remove global state in lldCommon Move all variables at file-scope or function-static-scope into a hosting structure (lld::CommonLinkerContext) that lives at lldMain()-scope. Drivers will inherit from this structure and add their own global state, in the same way as for the existing COFFLinkerContext. See discussion in https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html Differential Revision: https://reviews.llvm.org/D108850	2022-01-16 08:57:57 -05:00
Fangrui Song	8b2f33231c	[ELF] Make some diagnostics follow the convention	2022-01-15 10:46:25 -08:00
Alexander Shaposhnikov	8acc3b4ab0	[lld][ELF] Support adrp+ldr GOT optimization for AArch64 This diff adds first bits to support relocation relaxations for AArch64 discussed on https://github.com/ARM-software/abi-aa/pull/106. In particular, the case of adrp x0, :got: symbol ldr x0, [x0, :got_lo12: symbol] is handled. Test plan: make check-all Differential revision: https://reviews.llvm.org/D112063	2022-01-10 05:20:37 +00:00
Fangrui Song	5d3bd7f360	[ELF] Move gotIndex/pltIndex/globalDynIndex to SymbolAux to decrease sizeof(SymbolUnion) by 8 on ELF64 platforms. Symbols needing such information are typically 1% or fewer (5134 out of 560520 when linking clang, 19898 out of 5550705 when linking chrome). Storing them elsewhere can decrease memory usage and symbol initialization time. There is a ~0.8% saving on max RSS when linking a large program. Future direction: * Move some of dynsymIndex/verdefIndex/versionId to SymbolAux * Support mixed TLSDESC and TLS GD without increasing sizeof(SymbolUnion) Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D116281	2022-01-09 13:43:27 -08:00
Kazu Hirata	8afcfbfb8f	Use true/false instead of 1/0 (NFC) Identified by modernize-use-bool-literals.	2022-01-09 12:21:06 -08:00
Fangrui Song	e90c8c0422	[ELF] Optimize basic block section bytesDropped/jumpInstrMods and make them more space efficient. This decreases sizeof(InputSection) from 176 to 160, and decreases peak memory usage by 0.3% when linking Chrome.	2021-12-26 22:17:30 -08:00
Fangrui Song	baa3eb0dd9	[ELF] Change some non-null pointer parameters to references. NFC	2021-12-22 20:51:11 -08:00
Fangrui Song	bee5bc9075	[ELF] #undef PPC to support GCC powerpc32 build GCC's powerpc32 port predefines `PPC` as a macro in GNU C++ mode in some configurations (Linux, FreeBSD, and some others. See `builtin_define_std ("PPC"); ` in gcc/config/rs6000). ``` % powerpc-linux-gnu-g++ -E -dM -xc++ /dev/null -o - \| grep -w PPC #define PPC 1 ``` Fixes https://bugs.gentoo.org/829599 Reviewed By: thesamesam Differential Revision: https://reviews.llvm.org/D116017	2021-12-20 10:12:51 -08:00
Fangrui Song	cf783be8d7	Reland D114783/D115603 [ELF] Split scanRelocations into scanRelocations/postScanRelocations (Fixed an issue about GOT on a copy relocated alias.) (Fixed an issue about not creating r_addend=0 IRELATIVE for unreferenced non-preemptible ifunc.) The idea is to make scanRelocations mark some actions are needed (GOT/PLT/etc) and postpone the real work to postScanRelocations. It gives some flexibility: * Make it feasible to support .plt.got (PR32938): we need to know whether GLOB_DAT and JUMP_SLOT are both needed. * Make non-preemptible IFUNC handling slightly cleaner: avoid setting/clearing sym.gotInIgot * -z nocopyrel: report all copy relocation places for one symbol * Make GOT deduplication feasible * Make parallel relocation scanning feasible (if we can avoid all stateful operations and make Symbol attributes atomic), but parallelism may not be the appealing choice Since this patch moves a large chunk of code out of ELFT templates. My x86-64 executable is actually a few hundred bytes smaller. For ppc32-ifunc-nonpreemptible-pic.s: I remove absolute relocation references to non-preemptible ifunc because absolute relocation references are incorrect in -fpie mode. Reviewed By: peter.smith, ikudrin Differential Revision: https://reviews.llvm.org/D114783	2021-12-14 16:28:41 -08:00
Fangrui Song	ea15b862d7	Revert D114783 [ELF] Split scanRelocations into scanRelocations/postScanRelocations May cause a failure for non-preemptible `bcmp` in a glibc -static link.	2021-12-14 14:33:50 -08:00
Fangrui Song	b79686c6dc	[ELF] Remove needsPltAddr in favor of needsCopy needsPltAddr is equivalent to `needsCopy && isFunc`. In many places, it is equivalent to `needsCopy` because the non-STT_FUNC cases are ruled out. Reviewed By: ikudrin, peter.smith Differential Revision: https://reviews.llvm.org/D115603	2021-12-14 09:52:43 -08:00
Ard Biesheuvel	da66263b6e	[ARM] implement support for ALU/LDR PC-relative group relocations Currently, LLD does not support the complete set of ARM group relocations. Given that I intend to start using these in the Linux kernel [0], let's add support for these. This implements the group processing as documented in the ELF psABI. Notably, this means support is dropped for very far symbol references that also carry a small component, where the immediate is rotated in such a way that only part of it wraps to the other end of the 32-bit word. To me, it seems unlikely that this is something anyone could be relying on, but of course I could be wrong. [0] https://lore.kernel.org/r/20211122092816.2865873-8-ardb@kernel.org/ Reviewed By: peter.smith, MaskRay Differential Revision: https://reviews.llvm.org/D114172	2021-11-27 10:26:37 +01:00
Fangrui Song	5922dd91f8	[ELF] Rename hasStaticTlsModel to hasTlsIe and remove unneeded atomic.	2021-11-24 21:06:04 -08:00
Fangrui Song	371290dfd4	[ELF] Remove unneeded DF_STATIC_TLS for EM_386 local-exec TLS which is also untested.	2021-11-24 20:43:58 -08:00
Fangrui Song	38ed1db7e8	[ELF] Support non-RAX/non-adjacent R_X86_64_GOTPC32_TLSDESC/R_X86_64_TLSDESC_CALL The current TLSDESC optimization code assumes: ``` leaq x@tlsdesc(%rip), %rax call x@tlscall(%rax) # adjacent ``` From https://gitlab.freedesktop.org/mesa/mesa/-/issues/5665 , it seems that the two instructions may not be adjacent in GCC 10's output: ``` leaq x@tlsdesc(%rip), %rax something else call x@tlscall(%rax) ``` This patch supports the case. While here, support non-RAX registers for R_X86_64_GOTPC32_TLSDESC, in case the compiler generates inefficient: ``` leaq x@tlsdesc(%rip), %rcx # or %rdx, %rbx, %rdi, ... movq %rcx, %rax call *x@tlscall(%rax) # GNU ld/gold error for non-RAX ``` Differential Revision: https://reviews.llvm.org/D114416	2021-11-23 10:30:11 -08:00
Fangrui Song	a05384dc89	[ELF] Make --no-relax disable R_X86_64_GOTPCRELX and R_X86_64_REX_GOTPCRELX GOT optimization This brings back the original version of D81359. I have found several use cases now. * Unlike GNU ld, LLD's relocation processing is one pass. If we decide to optimize(relax) R_X86_64_{,REX_}GOTPCRELX, we will suppress GOT generation and cannot undo the decision later. Optimizing R_X86_64_REX_GOTPCRELX can usually make it easy to hit `relocation R_X86_64_REX_GOTPCRELX out of range` because the distance to GOT is usually shorter. Without --no-relax, the user has to recompile with `-Wa,-mrelax-relocations=no`. * The option would help during my investigationg of the root cause of https://git.kernel.org/linus/09e43968db40c33a73e9ddbfd937f46d5c334924 * There is need for relaxation for AArch64 & RISC-V. Implementing this for x86-64 improves consistency with little target-specific cost (two-line X86_64.cpp change). Reviewed By: alexander-shaposhnikov Differential Revision: https://reviews.llvm.org/D113615	2021-11-12 09:47:31 -08:00
Petr Hosek	d56b171ee9	[lld][ELF] Support for R_ARM_THM_JUMP8 This change implements support for R_ARM_THM_JUMP8 relocation in addition to R_ARM_THM_JUMP11 which is already supported by LLD. Differential Revision: https://reviews.llvm.org/D21225	2021-11-11 09:06:52 -08:00
Fangrui Song	d71bb6a409	[ELF] Inline isPPC64SmallCodeModelTocReloc which is only called once. NFC	2021-11-09 20:41:05 -08:00
Fangrui Song	cebb0a64b4	[ELF][ARM] Improve error message for unknown relocation Like rLLD354040. Before: `error: unrecognized relocation Unknown (254)` Now: `error: unknown relocation (254) against symbol foo`	2021-11-08 12:39:08 -08:00
Fangrui Song	3fe4b54915	[ELF] Make getImplicitAddend return 0 for R_ARM_V4BX. NFC Will be useful if we move R_ARM_V4BX handling around.	2021-10-30 23:31:39 -07:00
Fangrui Song	aa1d32f519	[ELF][Mips] Use R_DTPREL for R_MIPS_TLS_DTPREL*	2021-10-30 21:58:43 -07:00
Fangrui Song	e39c138f45	[ELF] Implement TLSDESC for x86-32 `-z rela` is also supported. Tested with: ``` cat > ./a.c <<eof #include <assert.h> int foo(); int bar(); int main() { assert(foo() == 2); assert(foo() == 4); assert(bar() == 2); assert(bar() == 4); } eof cat > ./b.c <<eof #include <stdio.h> __thread int tls0; extern __thread int tls1; int foo() { return ++tls0 + ++tls1; } static __thread int tls2, tls3; int bar() { return ++tls2 + ++tls3; } eof echo '__thread int tls1;' > ./c.c sed 's/ /\t/' > ./Makefile <<'eof' .MAKE.MODE = meta curDirOk=true CC := gcc -m32 -g -fpic -mtls-dialect=gnu2 LDFLAGS := -m32 -Wl,-rpath=. all: a0 a1 a2 run: all ./a0 && ./a1 && ./a2 c.so: c.o; ${LINK.c} -shared $> -o $@ bc.so: b.o c.o; ${LINK.c} -shared $> -o $@ b.so: b.o c.so; ${LINK.c} -shared $> -o $@ a0: a.o b.o c.o; ${LINK.c} $> -o $@ a1: a.o b.so; ${LINK.c} $> -o $@ a2: a.o bc.so; ${LINK.c} $> -o $@ eof ``` and glibc `elf/tst-gnu2-tls1`. `/usr/local/bin/ld` points to the freshly built `lld`. `bmake run && bmake CFLAGS=-O1 run` => ok. Differential Revision: https://reviews.llvm.org/D112582	2021-10-28 17:52:03 -07:00
Fangrui Song	2b1e32410c	[ELF] Change common diagnostics to report both object file location and source file location Many diagnostics use `getErrorPlace` or `getErrorLocation` to report a location. In the presence of line table debug information, `getErrorPlace` uses a source file location and ignores the object file location. However, the object file location is sometimes more useful. This patch changes "undefined symbol" and "out of range" diagnostics to report both object/source file locations. Other diagnostics can use similar format if needed. The key idea is to let `InputSectionBase::getLocation` report the object file location and use `getSrcMsg` for source file/line information. `getSrcMsg` doesn't leverage `STT_FILE` information yet, but I think the temporary lack of the functionality is ok. For the ARM "branch and link relocation" diagnostic, I arbitrarily place the source file location at the end of the line. The diagnostic is not very common so its formatting doesn't need to be pretty. Differential Revision: https://reviews.llvm.org/D112518	2021-10-28 09:38:45 -07:00
Fangrui Song	ecc93ed2d7	[ELF] Replace InputBaseSection::{areRelocsRela,firstRelocation,numRelocation} with relSecIdx For `InputSection` `.foo`, its `InputBaseSection::{areRelocsRela,firstRelocation,numRelocation}` basically encode the information of `.rel[a].foo`. However, one uint32_t (the relocation section index) suffices. See the implementation of `relsOrRelas`. This change decreases sizeof(InputSection) from 184 to 176 on 64-bit Linux. The maximum resident set size linking a large application (1.2G output) decreases by 0.39%. Differential Revision: https://reviews.llvm.org/D112513	2021-10-27 09:51:07 -07:00
Fangrui Song	35c3f5610c	[ELF][X86] Write R_X86_64_TLSDESC addends with -z rel Similar to D100544 for AArch64. Reviewed By: arichardson Differential Revision: https://reviews.llvm.org/D112592	2021-10-27 09:35:30 -07:00
Fangrui Song	ca8105b76c	[ELF][X86] Support R_X86_64_PLTOFF64 For a function call (using the default `-fplt`), GCC `-mcmodel=large` generates an assembly modifier which leads to an R_X86_64_PLTOFF64 relocation. In real world, http://git.ageinghacker.net/jitter (used by GNU poke) uses `-mcmodel=large`. R_X86_64_PLTOFF64's formula is (if preemptible) `L - GOT + A` or (if non-preemptible) `S - GOT + A` where `GOT` is (confusingly) the address of `.got.plt` Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D112386	2021-10-25 13:05:17 -07:00
Fangrui Song	d23fd8ae89	[ELF] Replace noneRel = R__NONE with static constexpr. NFC All architectures define R__NONE to 0.	2021-09-25 15:16:44 -07:00
Fangrui Song	40cd4db442	[ELF] Default gotBaseSymInGotPlt to false (NFC for most architectures) Most architectures use .got instead of .got.plt, so switching the default can minimize customization. This fixes an issue for SPARC V9 which uses .got . AVR, AMDGPU, and MSP430 don't seem to use _GLOBAL_OFFSET_TABLE_.	2021-09-25 15:06:09 -07:00
Fangrui Song	19d53d45f2	[ELF][AArch64] Refine and fix the condition when BTI/PAC PLT needs bti c (As I mentioned in https://reviews.llvm.org/D62609#1534158 , the condition for using bti c for executable can be loosened.) In two cases the address of a PLT may escape: * canonical PLT entry for a STT_FUNC * non-preemptible STT_GNU_IFUNC which is converted to STT_FUNC The first case can be detected with `needsPltAddr`. The second case is not straightforward to detect because for the Relocations.cpp created `directSym`, it's difficult to know whether the associated `sym` has exercised the `!needsPlt(expr)` code path. Just use the conservative `isInIplt` condition. A non-preemptible ifunc not referenced by non-GOT-generating non-PLT-generating relocations will have an unneeded `bti c`, but the cost is acceptable. The second case fixes a bug as well: a -shared link may have non-preemptible ifunc. Before the patch we did not emit `bti c` and could be wrong if the PLT address escaped. GNU ld doesn't handle the case: `relocation R_AARCH64_ADR_PREL_PG_HI21 against STT_GNU_IFUNC symbol 'ifunc2' isn't handled by elf64_aarch64_final_link_relocate` (https://sourceware.org/bugzilla/show_bug.cgi?id=28370) For -shared, if BTI is enabled but PAC is disabled, the PLT entry size increases from 16 to 24 because we have to select the PLT scheme early, but the cost is acceptable. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D110217	2021-09-22 11:51:09 -07:00
Sid Manning	0d7e5daedc	[lld][Hexagon] Add checks for instructions that can have TLS relocations Several instructions with potential TLS relocations were missing. This issue was found when building the Canadian LLVM toolchain.	2021-09-01 13:15:18 -07:00
Hafiz Abid Qadeer	fb9c5c3dce	[lld][AMDGPU] Handle R_AMDGPU_REL16 relocation. This patch is a followup patch to https://reviews.llvm.org/D105760 which adds this relocation. This handles the relocation in lld. The s_branch family of instruction does the following: PC = PC + signext(simm * 4) + 4 so we we do the opposite on the target address before writing it in the instruction stream. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D105761	2021-07-13 20:41:11 +01:00
Alex Richardson	cc7cb9523e	[ELF][AArch64] Write addends for TLSDESC relocations with -z rel Since D100490 this case is diagnosed for -z rel. This commit implements R_AARCH64_TLSDESC cases for AArch64::getImplicitAddend() and AArch64::relocate(). However, there are probably further relocation types that need to be handled for full support of -z rel. Fixes https://bugs.llvm.org/show_bug.cgi?id=47009 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D100544	2021-07-09 10:41:41 +01:00
Alex Richardson	97fe637539	[ELF] Implement RISCV::getImplicitAddend() This allows checking dynamic relocation addends for -z rel and --apply-dynamic-relocs output. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101455	2021-07-09 10:41:40 +01:00

1 2 3 4 5 ...

408 Commits