
Jez Ng 403d61aedd [lld-macho] Enable EH frame relocation / pruning	2022-07-13 21:14:05 -04:00

This just removes the code that gates the logic. The main issue here is perf impact: without {D122258}, LLD takes a significant perf hit because it now has to do a lot more work in the input parsing phase. But with that change to eliminate unnecessary EH frames from input object files, the perf overhead here is minimal. Concretely, here are the numbers for some builds as measured on my 16-core Mac Pro:

chromium_framework

This is without the use of `-femit-dwarf-unwind=no-compact-unwind`:

               base            diff            difference (95% CI)
    sys_time   1.826 ± 0.019   1.962 ± 0.034   [ +6.5% .. +8.4%]
    user_time  9.306 ± 0.054   9.926 ± 0.082   [ +6.2% .. +7.1%]
    wall_time  8.225 ± 0.068   8.947 ± 0.128   [ +8.0% .. +9.6%]
    samples    15              22

With that flag enabled, the regression mostly disappears, as hoped:

               base            diff            difference (95% CI)
    sys_time   1.839 ± 0.062   1.866 ± 0.068   [ -0.9% .. +3.8%]
    user_time  9.452 ± 0.068   9.490 ± 0.067   [ -0.1% .. +0.9%]
    wall_time  8.383 ± 0.127   8.452 ± 0.114   [ -0.1% .. +1.8%]
    samples    17              21

Unnamed internal app

Without `-femit-dwarf-unwind`, this is the perf hit:

               base            diff            difference (95% CI)
    sys_time   1.372 ± 0.029   1.317 ± 0.024   [ -4.6% .. -3.5%]
    user_time  2.835 ± 0.028   2.980 ± 0.027   [ +4.8% .. +5.4%]
    wall_time  3.205 ± 0.079   3.383 ± 0.066   [ +4.9% .. +6.2%]
    samples    102             83

With `-femit-dwarf-unwind`, the perf hit almost disappears:

               base            diff            difference (95% CI)
    sys_time   1.274 ± 0.026   1.270 ± 0.025   [ -0.9% .. +0.3%]
    user_time  2.812 ± 0.023   2.822 ± 0.035   [ +0.1% .. +0.7%]
    wall_time  3.166 ± 0.047   3.174 ± 0.059   [ -0.2% .. +0.7%]
    samples    95              97

Just for fun, I measured the impact of `-femit-dwarf-unwind` on ld64 (`base` has the extra DWARF unwind info in the input object files, `diff` doesn't):

               base            diff            difference (95% CI)
    sys_time   1.128 ± 0.010   1.124 ± 0.023   [ -1.3% .. +0.6%]
    user_time  7.176 ± 0.030   7.106 ± 0.094   [ -1.5% .. -0.4%]
    wall_time  7.874 ± 0.041   7.795 ± 0.121   [ -1.7% .. -0.3%]
    samples    16              25

And for LLD:

               base            diff            difference (95% CI)
    sys_time   1.315 ± 0.019   1.280 ± 0.019   [ -3.2% .. -2.0%]
    user_time  2.980 ± 0.022   2.822 ± 0.016   [ -5.5% .. -5.0%]
    wall_time  3.369 ± 0.038   3.175 ± 0.033   [ -6.2% .. -5.3%]
    samples    47              47

So parsing the extra EH frames is a lot more expensive for us than for ld64. But given that we are quite a lot faster than ld64 to begin with, I guess this isn't entirely unexpected...

Reviewed By: #lld-macho, oontvoo

Differential Revision: https://reviews.llvm.org/D129540
COFF	[COFF] Add vfsoverlay flag	2022-07-11 21:31:01 +00:00
Common	[lld-macho] Add support for -w	2022-06-11 17:38:50 -07:00
ELF	[PowerPC][LLD] Change PPC64R2SaveStub to only use non-PC-relative code	2022-07-13 19:34:33 -05:00
MachO	[lld-macho] Enable EH frame relocation / pruning	2022-07-13 21:14:05 -04:00
MinGW	[LLD] [MinGW] Implement --disable-reloc-section, mapped to /fixed	2022-06-15 16:51:20 +03:00
cmake/modules	Revert "[cmake] Don't export `LLVM_TOOLS_INSTALL_DIR` anymore"	2022-06-10 19:26:12 +00:00
docs	[lld-macho] Enable EH frame relocation / pruning	2022-07-13 21:14:05 -04:00
include/lld/Common	[NFC][lld] Fix typos to test commit access	2022-06-24 00:19:18 +02:00
test	[PowerPC][LLD] Change PPC64R2SaveStub to only use non-PC-relative code	2022-07-13 19:34:33 -05:00
tools/lld	[LLD][ELF] Add FORCE_LLD_DIAGNOSTICS_CRASH to force LLD to crash	2022-07-05 09:43:09 +01:00
utils	…
wasm	Use has_value instead of hasValue (NFC)	2022-07-13 01:58:03 -07:00
.clang-format	…
.clang-tidy	NFC: .clang-tidy: Inherit configs from parents to improve maintainability	2021-06-08 08:25:59 -07:00
.gitignore	…
CMakeLists.txt	[NFC][lld] Fix typos to test commit access	2022-06-24 00:19:18 +02:00
CODE_OWNERS.TXT	…
LICENSE.TXT	…
README.md	[doc] Place sha256 in lld/README.md into backticks	2021-01-12 10:19:40 -08:00

README.md

LLVM Linker (lld)

This directory and its subdirectories contain source code for the LLVM Linker, a modular cross-platform linker that is built as part of the LLVM compiler infrastructure project.

lld is open source software. You may freely distribute it under the terms of the license agreement found in LICENSE.TXT.

Benchmarking

So that different developers can evaluate patches against the same tests, we provide a collection of self-contained programs.

It is hosted at https://s3-us-west-2.amazonaws.com/linker-tests/lld-speed-test.tar.xz

The current sha256 is 10eec685463d5a8bbf08d77f4ca96282161d396c65bd97dc99dbde644a31610f.
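One way to check a downloaded copy of the archive against the digest above is sketched below. This is only an illustration, not part of lld: the helper names `sha256_of_file` and `matches_expected` are hypothetical, and the archive is assumed to have already been downloaded to a local path.

```python
import hashlib

# Digest quoted above for lld-speed-test.tar.xz.
EXPECTED_SHA256 = "10eec685463d5a8bbf08d77f4ca96282161d396c65bd97dc99dbde644a31610f"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of the file at `path`.

    The file is read in chunks so a large archive is never held
    in memory all at once.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's digest equals `expected` (case-insensitive)."""
    return sha256_of_file(path) == expected.lower()
```

Assuming the archive was saved as `lld-speed-test.tar.xz` in the current directory, `matches_expected("lld-speed-test.tar.xz")` returns `True` only when the download matches the published digest.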