llvm-project

Commit Graph

Author	SHA1	Message	Date
Rui Ueyama	136d27ab4d	[Coding style change][lld] Rename variables for non-ELF ports This patch does the same thing as r365595 to other subdirectories, which completes the naming style change for the entire lld directory. With this, the naming style conversion is complete for lld. Differential Revision: https://reviews.llvm.org/D64473 llvm-svn: 365730	2019-07-11 05:40:30 +00:00
Reid Kleckner	ee4e0a2942	Re-land r361206 "[COFF] Store alignment in log2 form, NFC" The previous patch lost the call to PowerOf2Ceil, which causes LLD to crash when handling common symbols with a non-power-of-2 size. I tweaked the existing common.test to make the bsspad16 common symbol be 15 bytes to add coverage for this case. llvm-svn: 361426	2019-05-22 20:21:52 +00:00
Nico Weber	67510fac36	Revert r361206 "[COFF] Store alignment in log2 form, NFC" Makes the linker crash when linking nasm.exe. llvm-svn: 361212	2019-05-21 02:06:59 +00:00
Reid Kleckner	1a5cc629de	[COFF] Store alignment in log2 form, NFC Summary: Valid section or chunk alignments are powers of 2 in the range [1, 8192]. These can be stored more canonically in log2 form to free up some bits in Chunk. Combined with D61696, SectionChunk gets 8 bytes smaller. Reviewers: ruiu, aganea Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D61698 llvm-svn: 361206	2019-05-20 22:57:52 +00:00
Reid Kleckner	0a1b1d6e62	Shrink SectionChunk by combining Relocs and SectionName sizes SectionChunk is one of the most frequently allocated data structures in LLD, since there are about four per function when optimizations and debug info are enabled (.text, .pdata, .xdata, .debug$S). A PE COFF file cannot be larger than 2GB, so there is an inherent limit on the length of the section name and the number of relocations. Decompose the ArrayRef and StringRef into pointer and size, and put them back together in the accessors for section name and relocation list. I plan to gather complete performance numbers later by padding SectionChunk with dead data and measuring performance after all the size optimizations are done. llvm-svn: 359923	2019-05-03 20:17:14 +00:00
Fangrui Song	32c0ebe615	Use llvm::stable_sort Make some small adjustment while touching the code: make parameters const, use less_first(), etc. Differential Revision: https://reviews.llvm.org/D60989 llvm-svn: 358943	2019-04-23 02:42:06 +00:00
Reid Kleckner	cc525c97b7	[COFF] Reduce the size of Chunk and SectionChunk, NFC Summary: Reorder the fields in both to use padding more efficiently, and add more comments on the purpose of the fields. Replace `std::vector<SectionChunk*> AssociativeChildren` with a singly-linked list. This avoids the separate vector allocation to list associative children, and shrinks the 3 pointers used for the typically empty vector down to 1. In the end, this reduces the sum of heap allocations used to link browser_tests.exe with NO PDB by 13.10%, going from 2,248,728 KB to 1,954,071 KB of heap. These numbers exclude memory mapped files, which are of course a significant factor in LLD's memory usage. Reviewers: ruiu, mstorsjo, aganea Subscribers: jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59797 llvm-svn: 357535	2019-04-02 22:11:58 +00:00
Fangrui Song	4ac6d7e4b8	[COFF] Delete unused declarations and add a missing forward declaration. NFC llvm-svn: 356241	2019-03-15 09:40:03 +00:00
Peter Collingbourne	bcd08c16bb	COFF, ELF: ICF: Perform 2 rounds of relocation hash propagation. LLD's performance on PGO instrumented Windows binaries was still not great even with the fix in D56955; out of the 2m41s linker runtime, around 2 minutes were still being spent in ICF. I looked into this more closely and discovered that the vast majority of the runtime was being spent segregating .pdata sections with the following relocation chain: .pdata -> identical .text -> unique PGO counter (not eligible for ICF) This patch causes us to perform 2 rounds of relocation hash propagation, which allows the hash for the .pdata sections to incorporate the identifier from the PGO counter. With that, the amount of time spent in ICF was reduced to about 2 seconds. I also found that the same change led to a significant ICF performance improvement in a regular release build of Chromium's chrome_child.dll, where ICF time was reduced from around 1s to around 700ms. With the same change applied to the ELF linker, median of 100 runs for lld-speed-test/chrome reduced from 4.53s to 4.45s on my machine. I also experimented with increasing the number of propagation rounds further, but I did not observe any further significant performance improvements linking Chromium or Firefox. Differential Revision: https://reviews.llvm.org/D56986 llvm-svn: 351899	2019-01-22 23:54:49 +00:00
Peter Collingbourne	3426111145	COFF, ELF: Adjust ICF hash computation to account for self relocations. It turns out that sections in PGO instrumented object files on Windows contain a large number of relocations pointing to themselves. With r347429 this can cause many sections to receive the same hash (usually zero) as a result of a section's hash being xor'ed with itself. This patch causes the COFF and ELF linkers to avoid this problem by adding the hash of the relocated section instead of xor'ing it. On my machine this causes the regressing test case provided by Mozilla to terminate in 2m41s. Differential Revision: https://reviews.llvm.org/D56955 llvm-svn: 351898	2019-01-22 23:51:35 +00:00
Chandler Carruth	2946cd7010	Update the file headers across all of the LLVM projects in the monorepo to reflect the new license. We understand that people may be surprised that we're moving the header entirely to discuss the new license. We checked this carefully with the Foundation's lawyer and we believe this is the correct approach. Essentially, all code in the project is now made available by the LLVM project under our new license, so you will see that the license headers include that license only. Some of our contributors have contributed code under our old license, and accordingly, we have retained a copy of our old license notice in the top-level files in each project and repository. llvm-svn: 351636	2019-01-19 08:50:56 +00:00
Fangrui Song	4ed350d6c4	[COFF] ICF: use parallelForEach{,N} Summary: They have an additional `ThreadsEnabled` check, which does not matter much. Reviewers: pcc, ruiu, rnk Reviewed By: ruiu Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D54812 llvm-svn: 347587	2018-11-26 20:07:07 +00:00
Peter Collingbourne	b007cabb87	COFF: ICF: Include contents of referenced sections in initial partitioning hash. NFCI. Previously we were taking over 13 minutes to link Firefox's xul.dll on ARM64; this reduces link time to around 18s on my machine. The root cause of the problem was that all of the input .pdata sections had the same unrelocated section data and therefore the same hash, which made segregation quadratic in the number of .pdata sections. The reason why we weren't observing this on other architectures was that ARM has a different .pdata format. On non-ARM the format is (start address, end address, .xdata), which caused the size of the function to appear in the unrelocated section data where the end address field is. However, the ARM format omits the end address field. Fixes PR39667. Differential Revision: https://reviews.llvm.org/D54809 llvm-svn: 347429	2018-11-21 21:29:35 +00:00
Martin Storsjo	802fcb4167	[COFF] When doing automatic dll imports, replace whole .refptr.<var> chunks with __imp_<var> After fixing up the runtime pseudo relocation, the .refptr.<var> will be a plain pointer with the same value as the IAT entry itself. To save a little binary size and reduce the number of runtime pseudo relocations, redirect references to the IAT entry (via the __imp_<var> symbol) itself and discard the .refptr.<var> chunk (as long as the same section chunk doesn't contain anything else than the single pointer). As there are now cases for both setting the Live variable to true and false externally, remove the accessors and setters and just make the variable public instead. Differential Revision: https://reviews.llvm.org/D51456 llvm-svn: 341175	2018-08-31 07:45:20 +00:00
Peter Collingbourne	ab038025a5	COFF: Implement safe ICF on rodata using address-significance tables. Differential Revision: https://reviews.llvm.org/D51050 llvm-svn: 340555	2018-08-23 17:44:42 +00:00
Rui Ueyama	7f97570e79	Make ICF log output order deterministic. This patch does the same thing as r338153 for COFF. Note that this patch affects only the order of log messages. The output file is already deterministic. Differential Revision: https://reviews.llvm.org/D50023 llvm-svn: 338406	2018-07-31 18:04:58 +00:00
Peter Collingbourne	62f7af712c	COFF: Allow ICFing sections with different alignments. The combined section gets the maximum alignment of all sections. Differential Revision: https://reviews.llvm.org/D46786 llvm-svn: 332273	2018-05-14 18:36:51 +00:00
Peter Collingbourne	107f55005b	COFF: ICF a section and its associated sections as a unit. This is needed to avoid merging two functions with identical instructions but different xdata. It also reduces binary size by deduplicating identical pdata sections. Fixes PR35337. Differential Revision: https://reviews.llvm.org/D46672 llvm-svn: 332169	2018-05-12 02:12:40 +00:00
Peter Collingbourne	b6c5a3045b	COFF: Allow ICF on vtable sections. Differential Revision: https://reviews.llvm.org/D46734 llvm-svn: 332059	2018-05-10 23:31:58 +00:00
Peter Collingbourne	fa322abee9	COFF: Rename Chunk::getPermissions to getOutputCharacteristics. In an upcoming change I will need to make a distinction between section type (code, data, bss) and permissions. The term that I use for both of these things is "output characteristics". Differential Revision: https://reviews.llvm.org/D45799 llvm-svn: 330361	2018-04-19 20:03:24 +00:00
Peter Collingbourne	2f6d00612d	COFF: Make SectionChunk::Relocs field an ArrayRef. NFCI. Differential Revision: https://reviews.llvm.org/D45714 llvm-svn: 330172	2018-04-17 01:54:34 +00:00
Bob Haarman	3ddeb33e00	[lld] fix data race in ICF.cpp Summary: Fixes PR36823. Reviewers: ruiu, pcc, rnk Reviewed By: ruiu Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D44716 llvm-svn: 328610	2018-03-27 06:08:35 +00:00
Peter Collingbourne	f1a11f87a0	COFF: Implement string tail merging. In COFF, duplicate string literals are merged by placing them in a comdat whose leader symbol name contains a specific prefix followed by the hash and partial contents of the string literal. This gives us an easy way to identify sections containing string literals in the linker: check for leader symbol names with the given prefix. Any sections that are identified in this way as containing string literals may be tail merged. We do so using the StringTableBuilder class, which is also used to tail merge string literals in the ELF linker. Tail merging is enabled only if ICF is enabled, as this provides a signal as to whether the user cares about binary size. Differential Revision: https://reviews.llvm.org/D44504 llvm-svn: 327668	2018-03-15 21:14:02 +00:00
Sam Clegg	f187c4d2e5	Consistent use of header file for ICF and MarkLive Previously wasm used a separate header to declare markLive and ELF used to declare ICF. This change makes each backend consistently declare these in their own headers. Differential Revision: https://reviews.llvm.org/D43529 llvm-svn: 325631	2018-02-20 22:09:59 +00:00
Zachary Turner	727f153b6f	[coff] Print detailed timing information with /TIME. The classes used to print and update time information are in common, so other linkers could use this as well if desired. Differential Revision: https://reviews.llvm.org/D41915 llvm-svn: 322736	2018-01-17 19:16:26 +00:00
Sam Clegg	0fb6faa0be	Prefer `ArrayRef` over `const std::vector&` Differential Revision: https://reviews.llvm.org/D40993 llvm-svn: 320125	2017-12-08 01:09:21 +00:00
Peter Collingbourne	d01571353d	COFF: Stop requiring comdat sections to have an external leader to participate in ICF. This requirement was added in r254578 to fix pr25686. However, it appears to have originated from a misdiagnosis of the problem: link.exe refused to merge the two sections because they are non-executable, not because they have internal leaders. If I set up a similar scenario with functions instead of globals I see that link.exe merges them. Differential Revision: https://reviews.llvm.org/D40236 llvm-svn: 318682	2017-11-20 18:51:29 +00:00
Reid Kleckner	d99ac29a24	All .xdata sections are eligble for ICF Summary: Many small functions have identical unwind info because they push the same sets of CSRs in the same order and have the same stack and prologue size. The VC linker merges duplicate .xdata, and so should LLD. This reduces the .xdata section size of clang.exe from 1.8MB to 94KB. Reviewers: pcc, ruiu Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D40160 llvm-svn: 318547	2017-11-17 19:50:10 +00:00
Rui Ueyama	f52496e1e0	Rename SymbolBody -> Symbol Now that we have only SymbolBody as the symbol class. So, "SymbolBody" is a bit strange name now. This is a mechanical change generated by perl -i -pe s/SymbolBody/Symbol/g $(git grep -l SymbolBody lld/ELF lld/COFF) nd clang-format-diff. Differential Revision: https://reviews.llvm.org/D39459 llvm-svn: 317370	2017-11-03 21:21:47 +00:00
Bob Haarman	b8a59c8aa5	[lld] unified COFF and ELF error handling on new Common/ErrorHandler Summary: The COFF linker and the ELF linker have long had similar but separate Error.h and Error.cpp files to implement error handling. This change introduces new error handling code in Common/ErrorHandler.h, changes the COFF and ELF linkers to use it, and removes the old, separate implementations. Reviewers: ruiu Reviewed By: ruiu Subscribers: smeenai, jyknight, emaste, sdardis, nemanjai, nhaehnle, mgorny, javed.absar, kbarton, fedor.sergeev, llvm-commits Differential Revision: https://reviews.llvm.org/D39259 llvm-svn: 316624	2017-10-25 22:28:38 +00:00
Rui Ueyama	274aa2fb88	[ICF] Include section contents in section hash values. Computing section content hashes early seems like a win in terms of performance. It increases a chance that two different sections will get different class IDs from the beginning. Without threads, this patch improves Chromium link time by about 0.3 seconds. With threads, by 0.1 seconds. That's less than 1% time saving but not bad for a small patch. llvm-svn: 314644	2017-10-02 01:21:07 +00:00
Rui Ueyama	cfc2f80df6	Remove {get,set}Align accessor functions and use Alignment member variable instead. llvm-svn: 313204	2017-09-13 21:54:55 +00:00
Rui Ueyama	96cbf8bca6	Fix the sanitizer-windows bot. Looks like r303801 broke the sanitizer-windows bot. I don't fully understand what is going on, so I'll partially revert that patch. llvm-svn: 303805	2017-05-24 20:32:23 +00:00
Rui Ueyama	27abe98cfa	Close the gap between ELF and COFF ICF implementations. NFC. We originally wrote the ICF code for COFF and ported it to ELF. They started diverging since then. This patch closes the gap. llvm-svn: 303801	2017-05-24 19:56:29 +00:00
Rui Ueyama	f04c04837c	Improve parallelism of ICF. This is the only place we use threads for ICF. The intention of this code was to split an input vector into 256 shards and process them in parallel. What the code was actually doing was to split an input into 257 shards, process the first 256 shards in parallel, and the remaining one in serial. That means this code takes ceil(256/n)+1 instead of ceil(256/n) where n is the number of available CPU cores. The former converges to 2 while the latter converges to 1. This patches fixes the above issue. llvm-svn: 303797	2017-05-24 19:22:34 +00:00
Zachary Turner	3a57fbd6db	[Support] Move Parallel algorithms from LLD to LLVM. Differential Revision: https://reviews.llvm.org/D33024 llvm-svn: 302748	2017-05-11 00:03:52 +00:00
Zachary Turner	092c767745	[Core] Make parallel algorithms match C++ Parallelism TS. Differential Revision: https://reviews.llvm.org/D33016 llvm-svn: 302613	2017-05-10 01:16:22 +00:00
Rui Ueyama	88172fb603	Use the same terminology as ELF. This patch do s/color/class/g. llvm-svn: 302326	2017-05-05 23:52:24 +00:00
Rui Ueyama	a85572ebf0	COFF ICF: Merge only functions. Do not merge read-only data. This seems to be the behavior of the MSVC linker. Previously, this incompatibility caused nasty issues in chromium build a few times. Differential Revision: https://reviews.llvm.org/D30363 llvm-svn: 301598	2017-04-27 23:03:22 +00:00
Rui Ueyama	e6e206d4b4	Do not use errs() or outs() directly. Instead use message(), log() or error() LLD is a multi-threaded program. errs() or outs() are not guaranteed to be thread-safe (they are actually not). LLD's message(), log() or error() are thread-safe. We should use them. llvm-svn: 295787	2017-02-21 23:22:56 +00:00
Peter Collingbourne	79a5e6b1b7	COFF: New symbol table design. This ports the ELF linker's symbol table design, introduced in r268178, to the COFF linker. Differential Revision: http://reviews.llvm.org/D21166 llvm-svn: 289280	2016-12-09 21:55:24 +00:00
Rui Ueyama	3a618e5606	Port parallel ICF to COFF. LLD used to take 11.73 seconds to link Clang. Now it is 6.94 seconds. MSVC link takes 83.02 seconds. Note that ICF is enabled by default on Windows, so a low latency ICF is more important than in ELF. llvm-svn: 288487	2016-12-02 08:03:58 +00:00
Rui Ueyama	27498b5dd5	Fix a bug in ICF involving COFF associative sections. Associative sections are sections that need to be linked if their associated sections are linked. Associative sections are used to append auxiliary data such as debug info. Previously, we compared all associative sections when comparing two comdat sections. Because usually assocative sections are not mergeable sections, we missed a lot of mergeable sections. MSVC linker doesn't seem to check the identity of associative sections. This patch makes LLD to ignore associative sections when doing ICF. llvm-svn: 288483	2016-12-02 07:46:12 +00:00
Rui Ueyama	29d8eef440	Rename so that the function name is consistent between ELF and COFF. llvm-svn: 261914	2016-02-25 18:49:11 +00:00
Rui Ueyama	43e12900d9	COFF: Non-external COMDAT sections sholud not be merged by ICF. If a section symbol is not external, that COMDAT section should never be merge with other sections in other compilation unit. Previously, we didn't take visibility into account. Note that COMDAT sections with non-external visibility makes sense because they can be removed by dead-stripping. Fixes https://llvm.org/bugs/show_bug.cgi?id=25686 llvm-svn: 254578	2015-12-03 02:23:33 +00:00
Rui Ueyama	df985afa14	COFF: De-parallelize ICF for now. There was a threading issue in the ICF code for COFF. That seems like a venign bug in the sense that it doesn't produce an incorrect output, but it oftentimes misses reducible sections. As a result, mergeable sections could remain in outputs, which makes the output nondeterministic. Basically the algorithm we are using for ICF is this: We group sections so that identical sections will eventually be in the same group. Initially, all sections are in one group. We split the group by relocation targets until we get a convergence (if relocation targets are in different gruops, the sections are different). Once a group is split, they will never be merged. Each section has a group ID. That variable itself is atomic, so there's no threading issue at the level that we can use thread sanitizer. The point is, when we split a group, we re-assign new group IDs to group of sections. That are multiple separate writes to atomic varaibles. Thus, splitting a group is not an atomic operation, and there's a small chance that the other thread observes inconsistent group IDs. Over-splitting is always "safe", so it will never create incorrect output. I suspect that the nondeterminism stems from that point. However, I cannot prove or fix that at this moment, so I'm going to avoid using threads here. llvm-svn: 251300	2015-10-26 16:20:00 +00:00
Rui Ueyama	548d22c073	COFF: ICF should not merge sectinos if their alignments are not the same. There's actually a room to improve this patch. Instead of not merging sections that have different alignements, we can choose the section that has the largest alignment requirement among all sections that are otherwise considered the same. Then all section alignments are satisfied, so we can merge them. I don't know if that improvement could make any difference for real-world input, so I'll leave it alone. Would be interesting to revisit later. llvm-svn: 248581	2015-09-25 16:50:12 +00:00
Rui Ueyama	c9e746b9e6	COFF: Fix local varaible type. This is intended to be 64-bit integer, but size_t is not guranteed to be the same or larger type than uint64_t. llvm-svn: 248580	2015-09-25 16:38:13 +00:00
Rui Ueyama	c28a08b8d2	COFF: Remove duplicate parameter from hash value calculation. llvm-svn: 248526	2015-09-24 19:00:42 +00:00
Rui Ueyama	97d92736f5	COFF: Improve section hash value. std::distance(C->Relocs.end(), C->Relocs.begin()) is the same as NumRelocs which is already added to the hash value. What we are missing here is the section size. llvm-svn: 248202	2015-09-21 19:41:38 +00:00

1 2

74 Commits