llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	75e551e5d8	[ELF] Relax R_RISCV_CALL and R_RISCV_CALL_PLT A pair of auipc+jalr relocated by R_RISCV_CALL or R_RISCV_CALL_PLT can be converted to c.j, c.jal, or jal. * c.j: RVC and displacement is representable as an int12 * c.jal: RV32C and displacement is representable as an int12 * jal: displacement is representable as an int21 Use the D127581 relaxation framework to implement the relaxation. If a shorter sequence is satisfied, we record the new relocation type in `relocTypes` and saves the new instruction into `writes`. Finally let `riscvFinalizeRelax` rewrite the instruction by setting `skip`. Differential Revision: https://reviews.llvm.org/D127611	2022-07-07 10:18:45 -07:00
Fangrui Song	6611d58f5b	[ELF] Relax R_RISCV_ALIGN Alternative to D125036. Implement R_RISCV_ALIGN relaxation so that we can handle -mrelax object files (i.e. -mno-relax is no longer needed) and creates a framework for future relaxation. `relaxAux` is placed in a union with InputSectionBase::jumpInstrMod, storing auxiliary information for relaxation. In the first pass, `relaxAux` is allocated. The main data structure is `relocDeltas`: when referencing `relocations[i]`, the actual offset is `r_offset - (i ? relocDeltas[i-1] : 0)`. `relaxOnce` performs one relaxation pass. It computes `relocDeltas` for all text section. Then, adjust st_value/st_size for symbols relative to this section based on `SymbolAnchor`. `bytesDropped` is set so that `assignAddresses` knows that the size has changed. Run `relaxOnce` in the `finalizeAddressDependentContent` loop to wait for convergence of text sections and other address dependent sections (e.g. SHT_RELR). Note: extrating `relaxOnce` into a separate loop works for many cases but has issues in some linker script edge cases. After convergence, compute section contents: shrink the NOP sequence of each R_RISCV_ALIGN as appropriate. Instead of deleting bytes, we run a sequence of memcpy on the content delimitered by relocation locations. For R_RISCV_ALIGN let the next memcpy skip the desired number of bytes. Section content computation is parallelizable, but let's ensure the implementation is mature before optimizations. Technically we can save a copy if we interleave some code with `OutputSection::writeTo`, but let's not pollute the generic code (we don't have templated relocation resolving, so using conditions can impose overhead to non-RISCV.) Tested: `make ARCH=riscv CROSS_COMPILE=riscv64-linux-gnu- LLVM=1 defconfig all` built Linux kernel using -mrelax is bootable. FreeBSD RISCV64 system using -mrelax is bootable. bash/curl/firefox/libevent/vim/tmux using -mrelax works. Differential Revision: https://reviews.llvm.org/D127581	2022-07-07 10:16:09 -07:00
Tim Northover	0f4339a835	lld test fix: don't check the precise hex emitted as a comment. It can vary depending on the platform, so as with the NO-FMA test just check for "0x".	2022-07-07 13:25:24 +01:00
Tim Northover	fe62019387	lld: fix test after x86 instruction comments now end in newline	2022-07-07 13:01:32 +01:00
Jin Xin Ng	65001f5777	[LTO][ELF] Add selective --save-temps= option Allows specific “temps” to be saved, instead of the current all-or-nothing nature of --save-temps. Multiple of these “temps” can be saved by specifying the argument multiple times. Differential Revision: https://reviews.llvm.org/D127778	2022-07-06 10:06:18 -07:00
Fangrui Song	e0612c91cd	[ELF] Optimize getInputSections. NFC In the majority of cases (e.g. orphan sections), an OutputSection has at most one InputSectionDescription (isd). By changing the return type to ArrayRef<InputSection *> we can just reference the isd->sections. For OutputSections with more than one InputSectionDescription we use a caller provided SmallVector to copy the elements as before. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D129111	2022-07-05 23:31:09 -07:00
Ben Dunbobbin	c35a6454b1	[BUILD] Add missed CMakeLists.txt change from `dfb77f2` See: https://reviews.llvm.org/D128195	2022-07-05 16:04:58 +01:00
Ben Dunbobbin	dfb77f2e99	[LLD][ELF] Add FORCE_LLD_DIAGNOSTICS_CRASH to force LLD to crash Add FORCE_LLD_DIAGNOSTICS_CRASH inspired by the existing FORCE_CLANG_DIAGNOSTICS_CRASH. This is particularly useful for people customizing LLD as they may want to modify the crash reporting behavior. Differential Revision: https://reviews.llvm.org/D128195	2022-07-05 09:43:09 +01:00
Daniel Bertalan	2028fe6fbc	[lld-macho] Handle LOH_ARM64_ADRP_LDR_GOT optimization hints This hint instructs the linker to perform the AdrpLdr or AdrpAdd transformation depending on whether the GOT load has been relaxed to load a local symbol's address. Differential Revision: https://reviews.llvm.org/D129059	2022-07-05 07:33:13 +02:00
Pengxuan Zheng	b5e49cdea9	[LLD][COFF] Ignore /kernel flag There exists some description of the flag from Microsoft, but not sure if there's more to it. We ignore the flag for now until we find out more about it. https://docs.microsoft.com/en-us/cpp/build/reference/kernel-create-kernel-mode-binary?view=msvc-170 Reviewed By: thieta, hans Differential Revision: https://reviews.llvm.org/D128238	2022-07-01 10:03:02 -07:00
Daniel Bertalan	73b659ff55	[lld-macho] Fix left shift of negative value UB I introduced this mistake in `573c7e6b3c`. Fixes the failure on this UBSan bot: https://lab.llvm.org/buildbot/#/builders/5/builds/25537	2022-07-01 12:00:16 +02:00
Daniel Bertalan	573c7e6b3c	[lld-macho] Handle LOH_ARM64_ADRP_LDR linker optimization hints This linker optimization hint transforms a pair of adrp+ldr (immediate) instructions into an ldr (literal) load from a PC-relative address if it is 4-byte aligned and within +/- 1 MiB, as ldr can encode a signed 19-bit offset that gets multiplied by 4. In the wild, only a small number of these hints are applicable because not many loads end up close enough to the data segment. However, the added helper functions will be useful in implementing the rest of the LOH types. Differential Revision: https://reviews.llvm.org/D128942	2022-07-01 09:44:24 +02:00
Daniel Bertalan	a3f67f0920	[lld-macho] Initial support for Linker Optimization Hints Linker optimization hints mark a sequence of instructions used for synthesizing an address, like ADRP+ADD. If the referenced symbol ends up close enough, it can be replaced by a faster sequence of instructions like ADR+NOP. This commit adds support for 2 of the 7 defined ARM64 optimization hints: - LOH_ARM64_ADRP_ADD, which transforms a pair of ADRP+ADD into ADR+NOP if the referenced address is within +/- 1 MiB - LOH_ARM64_ADRP_ADRP, which transforms two ADRP instructions into ADR+NOP if they reference the same page These two kinds already cover more than 50% of all LOHs in chromium_framework. Differential Review: https://reviews.llvm.org/D128093	2022-06-30 06:28:42 +02:00
Fangrui Song	9a572164d5	[ELF] Move InputFiles global variables (memoryBuffers, objectFiles, etc) into Ctx. NFC	2022-06-29 18:53:38 -07:00
Fangrui Song	e980f16d52	[ELF] Move whyExtract/backwardReferences from LinkerDriver to Ctx. NFC Ctx was recently added as a more suitable place for such singletons.	2022-06-29 17:34:31 -07:00
Daniel Bertalan	8d29f0fdb9	[lld-macho] Emit REBASE_OPCODE_ADD_ADDR_IMM_SCALED if possible An ADD_ADDR rebase opcode's argument can be encoded as an immediate if the offset is less than 15 * word size. This change reduces the size of chromium_framework by 100+ KiB. Differential Revision: https://reviews.llvm.org/D128798	2022-06-29 22:28:39 +02:00
Brad Smith	84b2e04aea	[docs] Remove outdated status update for FreeBSD Reviewed By: emaste, MaskRay Differential Revision: https://reviews.llvm.org/D128592	2022-06-27 19:41:53 -04:00
Sam Clegg	53217ecb88	[lld][WebAssembly] Don't apply data relocations at static constructor time Instead, export `__wasm_apply_data_relocs` and `__wasm_call_ctors` separately. This is required since user code in a shared library (such as static constructors) should not be run until relocations have been applied to all loaded libraries. See: https://github.com/emscripten-core/emscripten/issues/17295 Differential Revision: https://reviews.llvm.org/D128515	2022-06-27 15:50:02 -07:00
Kazu Hirata	586fb81eee	[lld] Don't use Optional::hasValue (NFC) This patch replaces x.hasValue() with x where x is contextually convertible to bool.	2022-06-26 19:37:14 -07:00
Fangrui Song	0688b00fc3	[ELF] Remove deprecated -dc -dc is deprecated in release/14.x. Remove it for 15.0. The only usage I know was FreeBSD crungen which was removed by https://reviews.freebsd.org/D34215 glibc just dropped -Wl,-d today. Keep -d for now.	2022-06-26 17:26:44 -07:00
Fangrui Song	b95cca03cd	[ELF] Improve compound assignment tests Also use strchr instead of is_contained.	2022-06-25 22:30:52 -07:00
Fangrui Song	0a0effdd5b	[ELF] Support -= *= /= <<= >>= &= \|= in symbol assignments	2022-06-25 22:22:59 -07:00
Fangrui Song	77295c5486	[ELF] Allow ? without adjacent space GNU ld allows 1 ? 2?3:4 : 5?6 :7	2022-06-25 21:16:59 -07:00
Fangrui Song	e3f3d2abf0	[ELF][test] Improve expression test	2022-06-25 21:11:32 -07:00
Fangrui Song	21bf6bb3d3	[ELF] Fix assertion failure when PROVIDE/HIDDEN/PROVIDE_HIDDEN does not have =	2022-06-25 20:26:47 -07:00
Fangrui Song	fe0de25b21	[ELF] Allow an expression to follow = in a symbol assignment GNU ld doesn't require whitespace before =. Match it.	2022-06-25 20:25:34 -07:00
Fangrui Song	b0d6dd3905	[ELF] Fix precedence of ? when there are 2 or more operators on the left hand side For 1 != 1 <= 1 ? 1 : 2, the current code incorrectly considers that ? has a higher precedence than != (minPrec). Also, add a test for right associativity.	2022-06-25 13:48:52 -07:00
Fangrui Song	d479b2e4db	[ELF] Fix precedence of == and != in expressions In GNU ld, the == and != operators have lower precedence than < > <= >=. This behavior matches C.	2022-06-25 13:47:32 -07:00
Fangrui Song	4cb05dc3cb	[ELF] Support quoted name in the TARGET command	2022-06-25 12:31:20 -07:00
Fangrui Song	363b29567e	[ELF] Support quoted symbol in the ENTRY command This matches GNU ld and matches other places we unquote the symbol name. Fixes #56208	2022-06-25 12:19:45 -07:00
Fangrui Song	c5578fca16	[ELF][test] Improve linkerscript/entry.s	2022-06-25 12:14:47 -07:00
Kazu Hirata	3b7c3a654c	Revert "Don't use Optional::hasValue (NFC)" This reverts commit `aa8feeefd3`.	2022-06-25 11:56:50 -07:00
Kazu Hirata	aa8feeefd3	Don't use Optional::hasValue (NFC)	2022-06-25 11:55:57 -07:00
Peter Collingbourne	b064bc18c3	ELF: Do not relax ADRP/LDR -> ADRP/ADD for absolute symbols in PIC. GOT references to absolute symbols can't be relaxed to use ADRP/ADD in position-independent code because these instructions produce a relative address. Differential Revision: https://reviews.llvm.org/D128492	2022-06-24 08:47:23 -07:00
Daniel Bertalan	0836fc395f	[NFC][lld] Fix typos to test commit access	2022-06-24 00:19:18 +02:00
Nico Weber	a2c1f7c90d	[lld, ELF and mac] Add --time-trace=<file>, remove --time-trace-file=<file> `--time-trace=foo` has the same behavior as `--time-trace --time-trace-file=<file>` had previously. Also, for mac, make --time-trace-granularity not imply --time-trace, to match behavior of the ELF port. Differential Revision: https://reviews.llvm.org/D128451	2022-06-23 15:46:22 -04:00
Jin Xin Ng	22f1273357	[ThinLTO][ELF] Add --thinlto-emit-index-files option Allows ThinLTO indices to be written to disk on-the-fly/as-part-of “normal” linker execution. Previously ThinLTO indices could be written via --thinlto-index-only but that would cause the linker to exit early. For MLGO specifically, this enables saving the ThinLTO index files without having to restart the linker to collect data only available at later stages (i.e. output of --save-temps) of the linker's execution. Note, this option does not currently work with: --thinlto-object-suffix-replace, as this is intended to be used to consume minimized IR bitcode files while --thinlto-emit-index-files is intended to be run together with InProcessThinLTO (which cannot parse minimized IR). --thinlto-prefix-replace support is left unimplemented but can be implemented if needed Differential Revision: https://reviews.llvm.org/D127777	2022-06-23 12:35:42 -07:00
Nico Weber	0ec87addb7	[lld/mac] Add a few TimeTraceScopes Identical literal folding takes ~1.4% of the time, and was missing from the trace. Signature computation still needs ~2.2% of the time, so probably worth explicitly marking its contribution to "Write output file" (9.1%) Differential Revision: https://reviews.llvm.org/D128343	2022-06-23 11:46:57 -04:00
Daniel Bertalan	ed39fd515a	[lld-macho] Use source information in duplicate symbol errors Similarly to how undefined symbol diagnostics were changed in D128184, we now show where in the source file duplicate symbols are defined at: ld64.lld: error: duplicate symbol: _foo >> defined in bar.c:42 >> /path/to/bar.o >> defined in baz.c:1 >> /path/to/libbaz.a(baz.o) For objects that don't contain DWARF data, the format is unchanged. A slight difference to undefined symbol diagnostics is that we don't print the name of the symbol on the third line, as it's already contained on the first line. Differential Revision: https://reviews.llvm.org/D128425	2022-06-23 11:07:15 -04:00
Fangrui Song	4512dda6af	[ELF][test] Clean up thinlto*	2022-06-22 16:19:17 -07:00
Fangrui Song	20b2d3260d	[lld-macho] Work around odr-use of const non-inline static data member to fix -O0 build after D128298 ``` ld.lld: error: undefined symbol: lld::macho::CodeSignatureSection::blockSize >>> referenced by SyntheticSections.cpp:1253 (/home/maskray/llvm/lld/MachO/SyntheticSections.cpp:1253) >>> tools/lld/MachO/CMakeFiles/lldMachO.dir/SyntheticSections.cpp.o:(lld::macho::CodeSignatureSection::writeHashes(unsigned char*) const::$_7::operator()(unsigned long) const) ```	2022-06-21 19:22:28 -07:00
Nico Weber	0baf13e282	[lld/mac] Parallelize code signature computation According to ministat, this is a small but measurable speedup (using the repro in PR56121): N Min Max Median Avg Stddev x 10 3.7439518 3.7783802 3.7730219 3.7655502 0.012375226 + 10 3.6149218 3.692198 3.6519327 3.6502951 0.025905601 Difference at 95.0% confidence -0.115255 +/- 0.0190746 -3.06078% +/- 0.506554% (Student's t, pooled s = 0.0203008) (Without `858e8b17f7`, this change here to use parallelFor is an 18% speedup, and doing `858e8b17f7` on top of this change is just a 2.55% +/- 0.58% win. Doing both results in a total speedup of 20.85% +/- 0.44%.) Differential Revision: https://reviews.llvm.org/D128298	2022-06-21 20:41:35 -04:00
Daniel Bertalan	5792797c5b	Reland "[lld-macho] Show source information for undefined references" The error used to look like this: ld64.lld: error: undefined symbol: _foo >>> referenced by /path/to/bar.o:(symbol _baz+0x4) If DWARF line information is available, we now show where in the source the references are coming from: ld64.lld: error: unreferenced symbol: _foo >>> referenced by: bar.cpp:42 (/path/to/bar.cpp:42) >>> /path/to/bar.o:(symbol _baz+0x4) The reland is identical to the first time this landed. The fix was in D128294. This reverts commit `0cc7ad4175`. Differential Revision: https://reviews.llvm.org/D128184	2022-06-21 18:50:06 -04:00
Daniel Bertalan	77b6efbd82	[ADT] [lld-macho] Check for end iterator deref in filter_iterator_base If ld64.lld was supplied an object file that had a `__debug_abbrev` or `__debug_str` section, but didn't have any compile unit DIEs in `__debug_info`, it would dereference an iterator pointing to the empty array of DIEs. This underlying issue started causing segmentation faults when parsing for `__debug_info` was addded in D128184. That commit was reverted, and this one fixes the invalid dereference to allow relanding it. This commit adds an assertion to `filter_iterator_base`'s dereference operators to catch bugs like this one. Ran check-llvm, check-clang and check-lld. Differential Revision: https://reviews.llvm.org/D128294	2022-06-21 15:47:45 -04:00
Nico Weber	3ade3d3724	[lld/mac] Replace while loop with for loop No behavior change. In preparation for using a parallelFor() here. Differential Revision: https://reviews.llvm.org/D128295	2022-06-21 15:42:06 -04:00
Nico Weber	858e8b17f7	[lld/mac] On Apple systems, call CC_SHA256 from libSystem It's in libSystem, so it doesn't bring in any new deps, and it's currently much faster than LLVM's current SHA256 implementation. Makes linking (arm64) Chromium Framework with ld64.lld 17% faster. See also PR56121. No behavior change. Differential Revision: https://reviews.llvm.org/D128290	2022-06-21 14:58:04 -04:00
Nico Weber	ca25baee7e	[lld/mac] Extract a sha256() function No behavior change. Differential Revision: https://reviews.llvm.org/D128289	2022-06-21 14:02:42 -04:00
Martin Storsjö	4d2eda2bb3	Revert "[LLD] [COFF] Use StringTableBuilder to optimize the string table" This reverts commit `9ffeaaa0ea`. This fixes debugging large executables with lldb and gdb. When StringTableBuilder is used, the string offsets for any string can point anywhere in the string table - while previously, all strings were inserted in order (without deduplication and tail merging). For symbols, there's no complications in encoding the string offset; the offset is encoded as a raw 32 bit binary number in half of the symbol name field. For sections, the string table offset is written as "/<decimaloffset>", but if the decimal offset would be larger than 7 digits, it's instead written as "//<base64offset>". Tools that operate on object files can handle the base64 offset format, but apparently neither lldb nor gdb expect that syntax when locating the debug information section. Prior to the reverted commit, all long section names were located at the start of the string table, so their offset never exceeded the range for the decimal syntax. Just reverting this change for now, as the actual benefit from it was fairly modest. Longer term, lld could write all long section names unoptimized at the start of the string table, followed by all the strings for symbol names, with deduplication and tail merging. And lldb and gdb could be fixed to handle sections with the base64 offset syntax. This fixes https://github.com/mstorsjo/llvm-mingw/issues/289.	2022-06-21 13:25:08 +03:00
Kazu Hirata	ed8fceaa09	Don't use Optional::getValue (NFC)	2022-06-20 23:35:53 -07:00
Kazu Hirata	064a08cd95	Don't use Optional::hasValue (NFC)	2022-06-20 20:05:16 -07:00

1 2 3 4 5 ...

15391 Commits