llvm-project

Commit Graph

Author	SHA1	Message	Date
Amy Huang	6f7483b1ec	Reland "[LLD] Remove global state in lld/COFF" after fixing asan and msan test failures Original commit description: [LLD] Remove global state in lld/COFF This patch removes globals from the lldCOFF library, by moving globals into a context class (COFFLinkingContext) and passing it around wherever it's needed. See https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html for context about removing globals from LLD. I also haven't moved the `driver` or `config` variables yet. Differential Revision: https://reviews.llvm.org/D109634 This reverts commit `a2fd05ada9`. Original commits were `b4fa71eed3` and `e03c7e367a`.	2021-09-17 17:18:42 -07:00
Amy Huang	a2fd05ada9	Temporarily revert "[LLD] Remove global state in lld/COFF" and "[lld] Add test to check for timer output" Seems to be causing a number of asan test failures. This reverts commit `b4fa71eed3` and `e03c7e367a`.	2021-09-16 11:58:11 -07:00
Amy Huang	b4fa71eed3	[LLD] Remove global state in lld/COFF This patch removes globals from the lldCOFF library, by moving globals into a context class (COFFLinkingContext) and passing it around wherever it's needed. See https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html for context about removing globals from LLD. I also haven't moved the `driver` or `config` variables yet. Differential Revision: https://reviews.llvm.org/D109634	2021-09-16 11:00:23 -07:00
Amy Huang	5127da0291	Revert "[COFF] Only consider associated EH sections during ICF" This change causes an asan error for ODR violation. This reverts commit `7ce9a3e9a9`.	2021-03-29 19:15:35 -07:00
Reid Kleckner	7ce9a3e9a9	[COFF] Only consider associated EH sections during ICF The only known reason why ICF should not merge otherwise identical sections with differing associated sections has to do with exception handling tables. It's not clear what ICF should do when there are other kinds of associated sections. In every other case when this has come up, debug info and CF guard metadata, we have opted to make ICF ignore the associated sections. For comparison, ELF doesn't do anything for comdat groups. Instead, .eh_frame is parsed to figure out if a section has an LSDA, and if so, ICF is disabled. Another issue is that the order of associated sections is not defined. We have had issues in the past (crbug.com/1144476) where changing the order of the .xdata/.pdata sections in the object file lead to large ICF slowdowns. To address these issues, I decided it would be best to explicitly consider only .pdata and .xdata sections during ICF. This makes it easy to ignore the object file order, and I think it makes the intention of the code clearer. I've also made the children() accessor return an empty list for associated sections. This mostly only affects ICF and GC. This was the behavior before I made this a linked list, so the behavior change should be good. This had positive effects on chrome.dll: more .xdata sections were merged that previously could not be merged because they were associated with distinct .pdata sections. Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D98993	2021-03-22 15:36:26 -07:00
Zequan Wu	5bdc5e7efd	[lld-link] Add safe icf mode to lld-link, which does safe icf for all sections. Differential Revision: https://reviews.llvm.org/D97436	2021-03-03 14:52:33 -08:00
Andrew Paverd	0139c8af8d	[CFGuard] Add address-taken IAT tables and delay-load support This patch adds support for creating Guard Address-Taken IAT Entry Tables (.giats$y sections) in object files, matching the behavior of MSVC. These contain lists of address-taken imported functions, which are used by the linker to create the final GIATS table. Additionally, if any DLLs are delay-loaded, the linker must look through the .giats tables and add the respective load thunks of address-taken imports to the GFIDS table, as these are also valid call targets. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D87544	2020-11-17 18:24:45 -08:00
Hans Wennborg	418f18c6cd	Revert "Reland [CFGuard] Add address-taken IAT tables and delay-load support" This broke both Firefox and Chromium (PR47905) due to what seems like dllimport function not being handled correctly. > This patch adds support for creating Guard Address-Taken IAT Entry Tables (.giats$y sections) in object files, matching the behavior of MSVC. These contain lists of address-taken imported functions, which are used by the linker to create the final GIATS table. > Additionally, if any DLLs are delay-loaded, the linker must look through the .giats tables and add the respective load thunks of address-taken imports to the GFIDS table, as these are also valid call targets. > > Reviewed By: rnk > > Differential Revision: https://reviews.llvm.org/D87544 This reverts commit `cfd8481da1`.	2020-11-11 16:03:33 +01:00
Andrew Paverd	cfd8481da1	Reland [CFGuard] Add address-taken IAT tables and delay-load support This patch adds support for creating Guard Address-Taken IAT Entry Tables (.giats$y sections) in object files, matching the behavior of MSVC. These contain lists of address-taken imported functions, which are used by the linker to create the final GIATS table. Additionally, if any DLLs are delay-loaded, the linker must look through the .giats tables and add the respective load thunks of address-taken imports to the GFIDS table, as these are also valid call targets. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D87544	2020-10-13 13:20:52 -07:00
Arthur Eubanks	499260c03b	Revert "[CFGuard] Add address-taken IAT tables and delay-load support" This reverts commit `ef4e971e5e`.	2020-10-01 11:29:54 -07:00
Andrew Paverd	ef4e971e5e	[CFGuard] Add address-taken IAT tables and delay-load support This patch adds support for creating Guard Address-Taken IAT Entry Tables (.giats$y sections) in object files, matching the behavior of MSVC. These contain lists of address-taken imported functions, which are used by the linker to create the final GIATS table. Additionally, if any DLLs are delay-loaded, the linker must look through the .giats tables and add the respective load thunks of address-taken imports to the GFIDS table, as these are also valid call targets. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D87544	2020-10-01 12:45:07 +01:00
Reid Kleckner	932f0276ea	[Support] Move LLD's parallel algorithm wrappers to support Essentially takes the lld/Common/Threads.h wrappers and moves them to the llvm/Support/Paralle.h algorithm header. The changes are: - Remove policy parameter, since all clients use `par`. - Rename the methods to `parallelSort` etc to match LLVM style, since they are no longer C++17 pstl compatible. - Move algorithms from llvm::parallel:: to llvm::, since they have "parallel" in the name and are no longer overloads of the regular algorithms. - Add range overloads - Use the sequential algorithm directly when 1 thread is requested (skips task grouping) - Fix the index type of parallelForEachN to size_t. Nobody in LLVM was using any other parameter, and it made overload resolution hard for for_each_n(par, 0, foo.size(), ...) because 0 is int, not size_t. Remove Threads.h and update LLD for that. This is a prerequisite for parallel public symbol processing in the PDB library, which is in LLVM. Reviewed By: MaskRay, aganea Differential Revision: https://reviews.llvm.org/D79390	2020-05-05 15:21:05 -07:00
Reid Kleckner	fce5457a14	[COFF] Avoid allocating temporary vectors during ICF Heap profiling with ETW shows that LLD performs 4,053,721 heap allocations over its lifetime, and ~800,000 of them come from assocEquals. These vectors are created just to do a comparison, so fuse the comparison into the loop and avoid the allocation. ICF is overall a small portion of the time spent linking, and I did not measure overall throughput improvements from this change above the noise threshold. However, these show up in the heap profiler, and the work is done, so we might as well land it if the code is clear enough. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D79297	2020-05-04 07:01:14 -07:00
Nico Weber	79a8476d43	dummy comment typo fix commit to cycle the bots llvm-svn: 374270	2019-10-10 02:04:56 +00:00
Bob Haarman	19712415a5	[NFC][COFF] fix typo in comment ("algortihm" -> "algorithm") llvm-svn: 372776	2019-09-24 20:17:54 +00:00
Rui Ueyama	136d27ab4d	[Coding style change][lld] Rename variables for non-ELF ports This patch does the same thing as r365595 to other subdirectories, which completes the naming style change for the entire lld directory. With this, the naming style conversion is complete for lld. Differential Revision: https://reviews.llvm.org/D64473 llvm-svn: 365730	2019-07-11 05:40:30 +00:00
Reid Kleckner	ee4e0a2942	Re-land r361206 "[COFF] Store alignment in log2 form, NFC" The previous patch lost the call to PowerOf2Ceil, which causes LLD to crash when handling common symbols with a non-power-of-2 size. I tweaked the existing common.test to make the bsspad16 common symbol be 15 bytes to add coverage for this case. llvm-svn: 361426	2019-05-22 20:21:52 +00:00
Nico Weber	67510fac36	Revert r361206 "[COFF] Store alignment in log2 form, NFC" Makes the linker crash when linking nasm.exe. llvm-svn: 361212	2019-05-21 02:06:59 +00:00
Reid Kleckner	1a5cc629de	[COFF] Store alignment in log2 form, NFC Summary: Valid section or chunk alignments are powers of 2 in the range [1, 8192]. These can be stored more canonically in log2 form to free up some bits in Chunk. Combined with D61696, SectionChunk gets 8 bytes smaller. Reviewers: ruiu, aganea Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D61698 llvm-svn: 361206	2019-05-20 22:57:52 +00:00
Reid Kleckner	0a1b1d6e62	Shrink SectionChunk by combining Relocs and SectionName sizes SectionChunk is one of the most frequently allocated data structures in LLD, since there are about four per function when optimizations and debug info are enabled (.text, .pdata, .xdata, .debug$S). A PE COFF file cannot be larger than 2GB, so there is an inherent limit on the length of the section name and the number of relocations. Decompose the ArrayRef and StringRef into pointer and size, and put them back together in the accessors for section name and relocation list. I plan to gather complete performance numbers later by padding SectionChunk with dead data and measuring performance after all the size optimizations are done. llvm-svn: 359923	2019-05-03 20:17:14 +00:00
Fangrui Song	32c0ebe615	Use llvm::stable_sort Make some small adjustment while touching the code: make parameters const, use less_first(), etc. Differential Revision: https://reviews.llvm.org/D60989 llvm-svn: 358943	2019-04-23 02:42:06 +00:00
Reid Kleckner	cc525c97b7	[COFF] Reduce the size of Chunk and SectionChunk, NFC Summary: Reorder the fields in both to use padding more efficiently, and add more comments on the purpose of the fields. Replace `std::vector<SectionChunk*> AssociativeChildren` with a singly-linked list. This avoids the separate vector allocation to list associative children, and shrinks the 3 pointers used for the typically empty vector down to 1. In the end, this reduces the sum of heap allocations used to link browser_tests.exe with NO PDB by 13.10%, going from 2,248,728 KB to 1,954,071 KB of heap. These numbers exclude memory mapped files, which are of course a significant factor in LLD's memory usage. Reviewers: ruiu, mstorsjo, aganea Subscribers: jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59797 llvm-svn: 357535	2019-04-02 22:11:58 +00:00
Fangrui Song	4ac6d7e4b8	[COFF] Delete unused declarations and add a missing forward declaration. NFC llvm-svn: 356241	2019-03-15 09:40:03 +00:00
Peter Collingbourne	bcd08c16bb	COFF, ELF: ICF: Perform 2 rounds of relocation hash propagation. LLD's performance on PGO instrumented Windows binaries was still not great even with the fix in D56955; out of the 2m41s linker runtime, around 2 minutes were still being spent in ICF. I looked into this more closely and discovered that the vast majority of the runtime was being spent segregating .pdata sections with the following relocation chain: .pdata -> identical .text -> unique PGO counter (not eligible for ICF) This patch causes us to perform 2 rounds of relocation hash propagation, which allows the hash for the .pdata sections to incorporate the identifier from the PGO counter. With that, the amount of time spent in ICF was reduced to about 2 seconds. I also found that the same change led to a significant ICF performance improvement in a regular release build of Chromium's chrome_child.dll, where ICF time was reduced from around 1s to around 700ms. With the same change applied to the ELF linker, median of 100 runs for lld-speed-test/chrome reduced from 4.53s to 4.45s on my machine. I also experimented with increasing the number of propagation rounds further, but I did not observe any further significant performance improvements linking Chromium or Firefox. Differential Revision: https://reviews.llvm.org/D56986 llvm-svn: 351899	2019-01-22 23:54:49 +00:00
Peter Collingbourne	3426111145	COFF, ELF: Adjust ICF hash computation to account for self relocations. It turns out that sections in PGO instrumented object files on Windows contain a large number of relocations pointing to themselves. With r347429 this can cause many sections to receive the same hash (usually zero) as a result of a section's hash being xor'ed with itself. This patch causes the COFF and ELF linkers to avoid this problem by adding the hash of the relocated section instead of xor'ing it. On my machine this causes the regressing test case provided by Mozilla to terminate in 2m41s. Differential Revision: https://reviews.llvm.org/D56955 llvm-svn: 351898	2019-01-22 23:51:35 +00:00
Chandler Carruth	2946cd7010	Update the file headers across all of the LLVM projects in the monorepo to reflect the new license. We understand that people may be surprised that we're moving the header entirely to discuss the new license. We checked this carefully with the Foundation's lawyer and we believe this is the correct approach. Essentially, all code in the project is now made available by the LLVM project under our new license, so you will see that the license headers include that license only. Some of our contributors have contributed code under our old license, and accordingly, we have retained a copy of our old license notice in the top-level files in each project and repository. llvm-svn: 351636	2019-01-19 08:50:56 +00:00
Fangrui Song	4ed350d6c4	[COFF] ICF: use parallelForEach{,N} Summary: They have an additional `ThreadsEnabled` check, which does not matter much. Reviewers: pcc, ruiu, rnk Reviewed By: ruiu Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D54812 llvm-svn: 347587	2018-11-26 20:07:07 +00:00
Peter Collingbourne	b007cabb87	COFF: ICF: Include contents of referenced sections in initial partitioning hash. NFCI. Previously we were taking over 13 minutes to link Firefox's xul.dll on ARM64; this reduces link time to around 18s on my machine. The root cause of the problem was that all of the input .pdata sections had the same unrelocated section data and therefore the same hash, which made segregation quadratic in the number of .pdata sections. The reason why we weren't observing this on other architectures was that ARM has a different .pdata format. On non-ARM the format is (start address, end address, .xdata), which caused the size of the function to appear in the unrelocated section data where the end address field is. However, the ARM format omits the end address field. Fixes PR39667. Differential Revision: https://reviews.llvm.org/D54809 llvm-svn: 347429	2018-11-21 21:29:35 +00:00
Martin Storsjo	802fcb4167	[COFF] When doing automatic dll imports, replace whole .refptr.<var> chunks with __imp_<var> After fixing up the runtime pseudo relocation, the .refptr.<var> will be a plain pointer with the same value as the IAT entry itself. To save a little binary size and reduce the number of runtime pseudo relocations, redirect references to the IAT entry (via the __imp_<var> symbol) itself and discard the .refptr.<var> chunk (as long as the same section chunk doesn't contain anything else than the single pointer). As there are now cases for both setting the Live variable to true and false externally, remove the accessors and setters and just make the variable public instead. Differential Revision: https://reviews.llvm.org/D51456 llvm-svn: 341175	2018-08-31 07:45:20 +00:00
Peter Collingbourne	ab038025a5	COFF: Implement safe ICF on rodata using address-significance tables. Differential Revision: https://reviews.llvm.org/D51050 llvm-svn: 340555	2018-08-23 17:44:42 +00:00
Rui Ueyama	7f97570e79	Make ICF log output order deterministic. This patch does the same thing as r338153 for COFF. Note that this patch affects only the order of log messages. The output file is already deterministic. Differential Revision: https://reviews.llvm.org/D50023 llvm-svn: 338406	2018-07-31 18:04:58 +00:00
Peter Collingbourne	62f7af712c	COFF: Allow ICFing sections with different alignments. The combined section gets the maximum alignment of all sections. Differential Revision: https://reviews.llvm.org/D46786 llvm-svn: 332273	2018-05-14 18:36:51 +00:00
Peter Collingbourne	107f55005b	COFF: ICF a section and its associated sections as a unit. This is needed to avoid merging two functions with identical instructions but different xdata. It also reduces binary size by deduplicating identical pdata sections. Fixes PR35337. Differential Revision: https://reviews.llvm.org/D46672 llvm-svn: 332169	2018-05-12 02:12:40 +00:00
Peter Collingbourne	b6c5a3045b	COFF: Allow ICF on vtable sections. Differential Revision: https://reviews.llvm.org/D46734 llvm-svn: 332059	2018-05-10 23:31:58 +00:00
Peter Collingbourne	fa322abee9	COFF: Rename Chunk::getPermissions to getOutputCharacteristics. In an upcoming change I will need to make a distinction between section type (code, data, bss) and permissions. The term that I use for both of these things is "output characteristics". Differential Revision: https://reviews.llvm.org/D45799 llvm-svn: 330361	2018-04-19 20:03:24 +00:00
Peter Collingbourne	2f6d00612d	COFF: Make SectionChunk::Relocs field an ArrayRef. NFCI. Differential Revision: https://reviews.llvm.org/D45714 llvm-svn: 330172	2018-04-17 01:54:34 +00:00
Bob Haarman	3ddeb33e00	[lld] fix data race in ICF.cpp Summary: Fixes PR36823. Reviewers: ruiu, pcc, rnk Reviewed By: ruiu Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D44716 llvm-svn: 328610	2018-03-27 06:08:35 +00:00
Peter Collingbourne	f1a11f87a0	COFF: Implement string tail merging. In COFF, duplicate string literals are merged by placing them in a comdat whose leader symbol name contains a specific prefix followed by the hash and partial contents of the string literal. This gives us an easy way to identify sections containing string literals in the linker: check for leader symbol names with the given prefix. Any sections that are identified in this way as containing string literals may be tail merged. We do so using the StringTableBuilder class, which is also used to tail merge string literals in the ELF linker. Tail merging is enabled only if ICF is enabled, as this provides a signal as to whether the user cares about binary size. Differential Revision: https://reviews.llvm.org/D44504 llvm-svn: 327668	2018-03-15 21:14:02 +00:00
Sam Clegg	f187c4d2e5	Consistent use of header file for ICF and MarkLive Previously wasm used a separate header to declare markLive and ELF used to declare ICF. This change makes each backend consistently declare these in their own headers. Differential Revision: https://reviews.llvm.org/D43529 llvm-svn: 325631	2018-02-20 22:09:59 +00:00
Zachary Turner	727f153b6f	[coff] Print detailed timing information with /TIME. The classes used to print and update time information are in common, so other linkers could use this as well if desired. Differential Revision: https://reviews.llvm.org/D41915 llvm-svn: 322736	2018-01-17 19:16:26 +00:00
Sam Clegg	0fb6faa0be	Prefer `ArrayRef` over `const std::vector&` Differential Revision: https://reviews.llvm.org/D40993 llvm-svn: 320125	2017-12-08 01:09:21 +00:00
Peter Collingbourne	d01571353d	COFF: Stop requiring comdat sections to have an external leader to participate in ICF. This requirement was added in r254578 to fix pr25686. However, it appears to have originated from a misdiagnosis of the problem: link.exe refused to merge the two sections because they are non-executable, not because they have internal leaders. If I set up a similar scenario with functions instead of globals I see that link.exe merges them. Differential Revision: https://reviews.llvm.org/D40236 llvm-svn: 318682	2017-11-20 18:51:29 +00:00
Reid Kleckner	d99ac29a24	All .xdata sections are eligble for ICF Summary: Many small functions have identical unwind info because they push the same sets of CSRs in the same order and have the same stack and prologue size. The VC linker merges duplicate .xdata, and so should LLD. This reduces the .xdata section size of clang.exe from 1.8MB to 94KB. Reviewers: pcc, ruiu Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D40160 llvm-svn: 318547	2017-11-17 19:50:10 +00:00
Rui Ueyama	f52496e1e0	Rename SymbolBody -> Symbol Now that we have only SymbolBody as the symbol class. So, "SymbolBody" is a bit strange name now. This is a mechanical change generated by perl -i -pe s/SymbolBody/Symbol/g $(git grep -l SymbolBody lld/ELF lld/COFF) nd clang-format-diff. Differential Revision: https://reviews.llvm.org/D39459 llvm-svn: 317370	2017-11-03 21:21:47 +00:00
Bob Haarman	b8a59c8aa5	[lld] unified COFF and ELF error handling on new Common/ErrorHandler Summary: The COFF linker and the ELF linker have long had similar but separate Error.h and Error.cpp files to implement error handling. This change introduces new error handling code in Common/ErrorHandler.h, changes the COFF and ELF linkers to use it, and removes the old, separate implementations. Reviewers: ruiu Reviewed By: ruiu Subscribers: smeenai, jyknight, emaste, sdardis, nemanjai, nhaehnle, mgorny, javed.absar, kbarton, fedor.sergeev, llvm-commits Differential Revision: https://reviews.llvm.org/D39259 llvm-svn: 316624	2017-10-25 22:28:38 +00:00
Rui Ueyama	274aa2fb88	[ICF] Include section contents in section hash values. Computing section content hashes early seems like a win in terms of performance. It increases a chance that two different sections will get different class IDs from the beginning. Without threads, this patch improves Chromium link time by about 0.3 seconds. With threads, by 0.1 seconds. That's less than 1% time saving but not bad for a small patch. llvm-svn: 314644	2017-10-02 01:21:07 +00:00
Rui Ueyama	cfc2f80df6	Remove {get,set}Align accessor functions and use Alignment member variable instead. llvm-svn: 313204	2017-09-13 21:54:55 +00:00
Rui Ueyama	96cbf8bca6	Fix the sanitizer-windows bot. Looks like r303801 broke the sanitizer-windows bot. I don't fully understand what is going on, so I'll partially revert that patch. llvm-svn: 303805	2017-05-24 20:32:23 +00:00
Rui Ueyama	27abe98cfa	Close the gap between ELF and COFF ICF implementations. NFC. We originally wrote the ICF code for COFF and ported it to ELF. They started diverging since then. This patch closes the gap. llvm-svn: 303801	2017-05-24 19:56:29 +00:00
Rui Ueyama	f04c04837c	Improve parallelism of ICF. This is the only place we use threads for ICF. The intention of this code was to split an input vector into 256 shards and process them in parallel. What the code was actually doing was to split an input into 257 shards, process the first 256 shards in parallel, and the remaining one in serial. That means this code takes ceil(256/n)+1 instead of ceil(256/n) where n is the number of available CPU cores. The former converges to 2 while the latter converges to 1. This patches fixes the above issue. llvm-svn: 303797	2017-05-24 19:22:34 +00:00

1 2

89 Commits