llvm-project

Commit Graph

Author	SHA1	Message	Date
Amy Huang	6f7483b1ec	Reland "[LLD] Remove global state in lld/COFF" after fixing asan and msan test failures Original commit description: [LLD] Remove global state in lld/COFF This patch removes globals from the lldCOFF library, by moving globals into a context class (COFFLinkingContext) and passing it around wherever it's needed. See https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html for context about removing globals from LLD. I also haven't moved the `driver` or `config` variables yet. Differential Revision: https://reviews.llvm.org/D109634 This reverts commit `a2fd05ada9`. Original commits were `b4fa71eed3` and `e03c7e367a`.	2021-09-17 17:18:42 -07:00
Amy Huang	a2fd05ada9	Temporarily revert "[LLD] Remove global state in lld/COFF" and "[lld] Add test to check for timer output" Seems to be causing a number of asan test failures. This reverts commit `b4fa71eed3` and `e03c7e367a`.	2021-09-16 11:58:11 -07:00
Amy Huang	b4fa71eed3	[LLD] Remove global state in lld/COFF This patch removes globals from the lldCOFF library, by moving globals into a context class (COFFLinkingContext) and passing it around wherever it's needed. See https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html for context about removing globals from LLD. I also haven't moved the `driver` or `config` variables yet. Differential Revision: https://reviews.llvm.org/D109634	2021-09-16 11:00:23 -07:00
Alexandre Ganea	f2efb5742c	[LLD][COFF] Cover usage of LLD-as-a-library in tests In lit tests, we run each LLD invocation twice (LLD_IN_TEST=2), without shutting down the process in-between. This ensures a full cleanup is properly done between runs. Only active for the COFF driver for now. Other drivers still use LLD_IN_TEST=1 which executes just one iteration with full cleanup, like before. When the environment variable LLD_IN_TEST is unset, a shortcut is taken, only one iteration is executed, no cleanup for faster exit, like before. A public API, lld::safeLldMain(), is also available when using LLD as a library. Differential Revision: https://reviews.llvm.org/D70378	2020-09-24 15:07:50 -04:00
Rui Ueyama	136d27ab4d	[Coding style change][lld] Rename variables for non-ELF ports This patch does the same thing as r365595 to other subdirectories, which completes the naming style change for the entire lld directory. With this, the naming style conversion is complete for lld. Differential Revision: https://reviews.llvm.org/D64473 llvm-svn: 365730	2019-07-11 05:40:30 +00:00
Alexandre Ganea	09cca5b243	[LLD][COFF] Generate import modules & COFF groups in PDB Generate import modules for each imported DLL, along with its symbol stream. Also create COFF groups in the * Linker * module, one for each PartialSection (input, unmerged sections) Currently COFF groups are disabled for MINGW because it significantly increases PDB sizes. We could enable that later with an option. The overall objective for this change is to support code hot patching tools. Such tools need to know the import libraries used, from the PDB alone. Differential Revision: https://reviews.llvm.org/D54802 llvm-svn: 357308	2019-03-29 20:25:34 +00:00
Fangrui Song	4ac6d7e4b8	[COFF] Delete unused declarations and add a missing forward declaration. NFC llvm-svn: 356241	2019-03-15 09:40:03 +00:00
Chandler Carruth	2946cd7010	Update the file headers across all of the LLVM projects in the monorepo to reflect the new license. We understand that people may be surprised that we're moving the header entirely to discuss the new license. We checked this carefully with the Foundation's lawyer and we believe this is the correct approach. Essentially, all code in the project is now made available by the LLVM project under our new license, so you will see that the license headers include that license only. Some of our contributors have contributed code under our old license, and accordingly, we have retained a copy of our old license notice in the top-level files in each project and repository. llvm-svn: 351636	2019-01-19 08:50:56 +00:00
Martin Storsjo	57ddec0dd1	[COFF] Add support for creating range extension thunks for ARM This is a feature that MS link.exe lacks; it currently errors out on such relocations, just like lld did before. This allows linking clang.exe for ARM - practically, any image over 16 MB will likely run into the issue. Differential Revision: https://reviews.llvm.org/D52156 llvm-svn: 342962	2018-09-25 10:59:29 +00:00
Martin Storsjo	7a41693898	[COFF] Provide __CTOR_LIST__ and __DTOR_LIST__ symbols for MinGW MinGW uses these kind of list terminator symbols for traversing the constructor/destructor lists. These list terminators are actual pointers entries in the lists, with the values 0 and (uintptr_t)-1 (instead of just symbols pointing to the start/end of the list). (This mechanism exists in both the mingw-w64 crt startup code and in libgcc; normally the mingw-w64 one is used, but a DLL build of libgcc uses the libgcc one. Therefore it's not trivial to change the mechanism without lots of cross-project synchronization and potentially invalidating some combinations of old/new versions of them.) When mingw-w64 has been used with lld so far, the CRT startup object files have so far provided these symbols, ending up with different, incompatible builds of the CRT startup object files depending on whether binutils or lld are going to be used. In order to avoid the need of different configuration of the CRT startup object files depending on what linker to be used, provide these symbols in lld instead. (Mingw-w64 checks at build time whether the linker provides these symbols or not.) This unifies this particular detail between the two linkers. This does disallow the use of the very latest lld with older versions of mingw-w64 (the configure check for the list was added recently; earlier it simply checked whether the CRT was built with gcc or clang), and requires rebuilding the mingw-w64 CRT. But the number of users of lld+mingw still is low enough that such a change should be tolerable, and unifies this aspect of the toolchains, easing interoperability between the toolchains for the future. The actual test for this feature is added in ctors_dtors_priority.s, but a number of other tests that checked absolute output addresses are updated. Differential Revision: https://reviews.llvm.org/D52053 llvm-svn: 342294	2018-09-14 22:26:59 +00:00
Peter Collingbourne	381b3d8aa3	COFF: Use (name, output characteristics) as a key when grouping input sections into output sections. This is what link.exe does and lets us avoid needing to worry about merging output characteristics while adding input sections to output sections. With this change we can't process /merge in the same way as before because sections with different output characteristics can still be merged into one another. So this change moves the processing of /merge to just before we assign addresses. In the case where there are multiple output sections with the same name, link.exe only merges the first section with the source name into the first section with the target name, and we do the same. At the same time I also implemented transitive merging (which means that /merge:.c=.b /merge:.b=.a merges both .c and .b into .a). This isn't quite enough though because link.exe has a special case for .CRT in 32-bit mode: it processes sections whose output characteristics are DATA \| R \| W as though the output characteristics were DATA \| R (so that they get merged into things like constructor lists in the expected way). Chromium has a few such sections, and it turns out that those sections were causing the problem that resulted in r318699 (merge .xdata into .rdata) being reverted: because of the previous permission merging semantics, the .CRT sections were causing the entire .rdata section to become writable, which caused the SEH runtime to crash because it apparently requires .xdata to be read-only. This change also implements the same special case. This should unblock being able to merge .xdata into .rdata by default, as well as .bss into .data, both of which will be done in followups. Differential Revision: https://reviews.llvm.org/D45801 llvm-svn: 330479	2018-04-20 21:10:33 +00:00
Peter Collingbourne	be084eca5b	COFF: Remove OutputSection::getPermissions() and getCharacteristics(). All callers can just access the header directly. Differential Revision: https://reviews.llvm.org/D45800 llvm-svn: 330367	2018-04-19 21:48:37 +00:00
Peter Collingbourne	435b099115	COFF: Move assignment of section RVAs to assignAddresses(). NFCI. This makes the design a little more similar to the ELF linker and should allow for features such as ARM range extension thunks to be implemented more easily. Differential Revision: https://reviews.llvm.org/D44501 llvm-svn: 327667	2018-03-15 21:13:46 +00:00
Zachary Turner	727f153b6f	[coff] Print detailed timing information with /TIME. The classes used to print and update time information are in common, so other linkers could use this as well if desired. Differential Revision: https://reviews.llvm.org/D41915 llvm-svn: 322736	2018-01-17 19:16:26 +00:00
Sam Clegg	0fb6faa0be	Prefer `ArrayRef` over `const std::vector&` Differential Revision: https://reviews.llvm.org/D40993 llvm-svn: 320125	2017-12-08 01:09:21 +00:00
Rui Ueyama	cbf969eb20	Remove Symtab aliases. Various classes have `Symtab` member variables even though we have lld::coff::Symtab variable because previous attempts to make COFF lld's internal structure resemble to ELF's was incomplete. This patch finishes that job by removing member variables. llvm-svn: 311938	2017-08-28 21:51:07 +00:00
Peter Collingbourne	6f24fdb6a0	COFF: Change the /lldmap output format to be more like the ELF linker. Differential Revision: https://reviews.llvm.org/D28717 llvm-svn: 291990	2017-01-14 03:14:46 +00:00
Rui Ueyama	9f66f8277d	Re-submit r283825: Add section header stream to PDB. It was reverted because the change that depends on was reverted. Now it was submitted as r283925, so we can submit this as well. llvm-svn: 283926	2016-10-11 19:45:07 +00:00
Rui Ueyama	9aa4ab6f9b	Revert "Add section header stream to PDB." because it depends on r283823. The change this patch depends on was reverted. llvm-svn: 283837	2016-10-11 01:01:40 +00:00
Rui Ueyama	55505954fe	Add section header stream to PDB. Differential Revision: https://reviews.llvm.org/D25357 llvm-svn: 283825	2016-10-10 23:44:10 +00:00
Rui Ueyama	a5f0f758d3	COFF: Move markLive() from Writer.cpp to its own file. Conceptually, garbage collection is not part of Writer, so move the function out of the file. llvm-svn: 248099	2015-09-19 21:36:28 +00:00
Rafael Espindola	beee25e484	Make these headers as being c++. llvm-svn: 245050	2015-08-14 14:12:54 +00:00
Rafael Espindola	b835ae8e4a	Port the error functions from ELF to COFF. This has a few advantages * Less C++ code (about 300 lines less). * Less machine code (about 14 KB of text on a linux x86_64 build). * It is more debugger friendly. Just set a breakpoint on the exit function and you get the complete lld stack trace of when the error was found. * It is a more robust API. The errors are handled early and we don't get a std::error_code hot potato being passed around. * In most cases the error function in a better position to print diagnostics (it has more context). llvm-svn: 244215	2015-08-06 14:58:50 +00:00
Rui Ueyama	cb8474edae	COFF, ELF2: Pass output file path implicitly using Config global variable. Various parameters are passed implicitly using Config global variable already. Output file path is no different from others, so there was no special reason to handle that differnetly. This patch changes the signature of writeResult(SymbolTable , StringRef) to writeResult(SymbolTable ). llvm-svn: 244180	2015-08-05 23:51:50 +00:00
Rui Ueyama	685c41cd39	COFF: Simplify Writer interface by hiding Writer class. llvm-svn: 244175	2015-08-05 23:43:53 +00:00
Rui Ueyama	67fcd1a0c7	COFF: Fix bad #includes. Writer.h is intended to be included only by Writer.cpp and Driver.cpp. Use of the header in other files are bad. llvm-svn: 244106	2015-08-05 19:51:28 +00:00
Rui Ueyama	a8eed749a2	COFF: Write import library symbols to a symbol table. Previously no __imp_ symbols nor dllimport thunk functions were written to a symbol table. llvm-svn: 243350	2015-07-27 23:40:20 +00:00
Rui Ueyama	3afd5bfd7b	COFF: Handle base relocation as a tuple of relocation type and RVA. NFC. On x64 and x86, we use only one base relocation type, so we handled base relocations just as a list of RVAs. That doesn't work well for ARM becuase we have to handle two types of base relocations on ARM. This patch changes the type of base relocation from uint32_t to {reltype, uint32_t} to make it easy to port this code to ARM. llvm-svn: 243197	2015-07-25 01:44:32 +00:00
Rui Ueyama	cd3f99b6c5	COFF: Implement Safe SEH support for x86. An object file compatible with Safe SEH contains a .sxdata section. The section contains a list of symbol table indices, each of which is an exception handler function. A safe SEH-enabled executable contains a list of exception handler RVAs. So, what the linker has to do to support Safe SEH is basically to read the .sxdata section, interpret the contents as a list of symbol indices, unique-fy and sort their RVAs, and then emit that list to .rdata. This patch implements that feature. llvm-svn: 243182	2015-07-24 23:51:14 +00:00
Rui Ueyama	e59a530a6c	COFF: Split createSymbolAndSymbolTable to small functions. NFC. llvm-svn: 242066	2015-07-13 20:56:31 +00:00
Rui Ueyama	1b53ec796a	COFF: Remove Writer::Is64 and use Config::is64 instead. NFC. llvm-svn: 241819	2015-07-09 16:40:39 +00:00
David Majnemer	2c345a337c	COFF: Emit a symbol table if /debug is specified Providing a symbol table in the executable is quite useful when debugging a fully-linked executable without having to reconstruct one from DWARF. Differential Revision: http://reviews.llvm.org/D11023 llvm-svn: 241689	2015-07-08 16:37:50 +00:00
Rui Ueyama	11863b4ae1	COFF: Support x86 file header and relocations. llvm-svn: 241657	2015-07-08 01:45:29 +00:00
Rui Ueyama	88e0f9206b	COFF: Fix a bug of __imp_ symbol. The change I made in r240620 was not correct. If a symbol foo is defined, and if you use __imp_foo, __imp_foo symbol is automatically defined as a pointer (not just an alias) to foo. Now that we need to create a chunk for automatically-created symbols. I defined LocalImportChunk class for them. llvm-svn: 240622	2015-06-25 03:31:47 +00:00
Rui Ueyama	49560c7a10	COFF: Move code for ICF from Writer.cpp to ICF.cpp. llvm-svn: 240590	2015-06-24 20:40:03 +00:00
Rui Ueyama	ddf71fc370	COFF: Initial implementation of Identical COMDAT Folding. Identical COMDAT Folding (ICF) is an optimization to reduce binary size by merging COMDAT sections that contain the same metadata, actual data and relocations. MSVC link.exe and many other linkers have this feature. LLD achieves on per with MSVC in terms produced binary size with this patch. This technique is pretty effective. For example, LLD's size is reduced from 64MB to 54MB by enaling this optimization. The algorithm implemented in this patch is extremely inefficient. It puts all COMDAT sections into a set to identify duplicates. Time to self-link with/without ICF are 3.3 and 320 seconds, respectively. So this option roughly makes LLD 100x slower. But it's okay as I wanted to achieve correctness first. LLD is still able to link itself with this optimization. I'm going to make it more efficient in followup patches. Note that this optimization is not entirely safe. C/C++ require different functions have different addresses. If your program relies on that property, your program wouldn't work with ICF. However, it's not going to be an issue on Windows because MSVC link.exe turns ICF on by default. As long as your program works with default settings (or not passing /opt:noicf), your program would work with LLD too. llvm-svn: 240519	2015-06-24 04:36:52 +00:00
Rui Ueyama	a77336bd5d	COFF: Support delay-load import tables. DLLs are usually resolved at process startup, but you can delay-load them by passing /delayload option to the linker. If a /delayload is specified, the linker has to create data which is similar to regular import table. One notable difference is that the pointers in a delay-load import table are originally pointing to thunks that resolves themselves. Each thunk loads a DLL, resolve its name, and then overwrites the pointer with the result so that subsequent function calls directly call a desired function. The linker has to emit thunks. llvm-svn: 240250	2015-06-21 22:31:52 +00:00
Rui Ueyama	4d769c3a57	COFF: Support exception table. .pdata section contains a list of triplets of function start address, function end address and its unwind information. Linkers have to sort section contents by function start address and set the section address to the file header (so that runtime is able to find it and do binary search.) This change seems to resolve all but one remaining test failures in check{,-clang,-lld} when building the entire stuff with clang-cl and lld-link. llvm-svn: 240231	2015-06-21 04:00:54 +00:00
Rui Ueyama	97dff9ee3a	COFF: Support creating DLLs. DLL files are in the same format as executables but they have export tables. The format of the export table is described in PE/COFF spec section 5.3. A new class, EdataContents, takes care of creating chunks for export tables. What we need to do is to parse command line flags for dllexports, and then instantiate the class to create chunks. For the writer, export table chunks are opaque data -- it just add chunks to .edata section. llvm-svn: 239869	2015-06-17 00:16:33 +00:00
Rui Ueyama	bc2cc7d0b8	COFF: Fix .reloc section attributes. llvm-svn: 239738	2015-06-15 18:03:47 +00:00
Rui Ueyama	588e832d0a	COFF: Support base relocations. PE/COFF executables/DLLs usually contain data which is called base relocations. Base relocations are a list of addresses that need to be fixed by the loader if load-time relocation is needed. Base relocations are in .reloc section. We emit one base relocation entry for each IMAGE_REL_AMD64_ADDR64 relocation. In order to save disk space, base relocations are grouped by page. Each group is called a block. A block starts with a 32-bit page address followed by 16-bit offsets in the page. That is more efficient representation of addresses than just an array of 32-bit addresses. llvm-svn: 239710	2015-06-15 01:23:58 +00:00
Davide Italiano	d106ab263a	[COFF] Spell the namespace correctly. llvm-svn: 239641	2015-06-12 21:37:55 +00:00
Rui Ueyama	4b22fa7437	COFF: Move Windows-specific code from Chunk.{cpp,h} to DLL.{cpp,h}. llvm-svn: 239239	2015-06-07 01:15:04 +00:00
Rui Ueyama	cc608e4f35	COFF: Rename writeHeader -> writeHeaderTo. Chunk has writeTo function which takes uint8_t Buf. writeHeaderTo feels more consistent with that because this member function also takes uint8_t Buf. llvm-svn: 239236	2015-06-06 23:19:38 +00:00
Rui Ueyama	c6ea057d7f	COFF: Move .idata constructor from Writer to Chunk. Previously, half of the constructor for .idata contents was in Chunks.cpp and the rest was in Writer.cpp. This patch moves the latter to Chunks.cpp. Now IdataContents class manages everything for .idata section. llvm-svn: 239230	2015-06-06 22:46:15 +00:00
Rui Ueyama	eb262ce4b6	COFF: /include'd symbols must be preserved. Not only entry point symbol but also symbols specified by /include option must be preserved, as they will never be dead-stripped. http://reviews.llvm.org/D10220 llvm-svn: 239005	2015-06-04 02:12:16 +00:00
Rui Ueyama	bda72a4af4	COFF: Change OutputSections' type from vector<unique_ptr<T>> to vector<T*>. This is mainly for readability. OutputSection objects are still owned by the writer using SpecificBumpPtrAllocator. llvm-svn: 238936	2015-06-03 16:44:00 +00:00
Rui Ueyama	e00d651071	Use initializer instead of memset to zero out. llvm-svn: 238662	2015-05-30 19:28:58 +00:00
Rui Ueyama	bfb4aa1791	COFF: Support long section name. Section names were truncated to 8 bytes because the section table's name field is 8 byte long. This patch creates the string table to store long names. llvm-svn: 238661	2015-05-30 19:09:50 +00:00
Rui Ueyama	411c636081	COFF: Add a new PE/COFF port. This is an initial patch for a section-based COFF linker. The patch has 2300 lines of code including comments and blank lines. Before diving into details, you want to start from reading README because it should give you an overview of the design. All important things are written in the README file, so I write summary here. - The linker is already able to self-link on Windows. - It's significantly faster than the existing implementation. The existing one takes 5 seconds to link LLD on my machine, while the new one only takes 1.2 seconds, even though the new one is not multi-threaded yet. (And a proof-of-concept multi- threaded version was able to link it in 0.5 seconds.) - It uses much less memory (250MB vs. 2GB virtual memory space to self-host). - IMHO the new code is much simpler and easier to read than the existing PE/COFF port. http://reviews.llvm.org/D10036 llvm-svn: 238458	2015-05-28 19:09:30 +00:00

50 Commits