llvm-project

Commit Graph

Author	SHA1	Message	Date
Rafael Espindola	35dad13714	Update for LLVM api change. llvm-svn: 254697	2015-12-04 02:42:47 +00:00
Peter Collingbourne	70507e59fe	COFF: Destroy LTOModules as they are linked. This should help reduce memory consumption during LTO. Differential Revision: http://reviews.llvm.org/D14672 llvm-svn: 253397	2015-11-17 23:30:59 +00:00
Yunzhong Gao	fbee89263f	Fixing build failures caused by r253367. Sorry for breaking the build. llvm-svn: 253374	2015-11-17 20:48:12 +00:00
Rui Ueyama	98a98cffb6	COFF: Do not call std::async with std::launch::async if multithreading is disabled. llvm-svn: 248193	2015-09-21 19:12:36 +00:00
Rui Ueyama	997b357ac1	COFF: Run InputFile::parse() in background using std::async(). Previously, InputFile::parse() was run in batch. We construct a list of all input files and call parse() on each file using parallel_for_each. That means we cannot start parsing files until we get a complete list of input files, although InputFile::parse() is safe to call from anywhere. This patch makes it asynchronous. As soon as we add a file to the symbol table, we now start parsing the file using std::async(). This change shortens self-hosting time (650 ms) by 28 ms. It's about 4% improvement. llvm-svn: 248109	2015-09-20 03:11:16 +00:00
Rui Ueyama	1cce300843	COFF: Change Symbol::Body type from atomic pointer to regular pointer. I made the field an atomic pointer in hope that we would be able to parallelize the symbol resolver soon, but that's not going to happen soon. This patch reverts that change for the sake of readability. llvm-svn: 248104	2015-09-20 00:00:05 +00:00
Rui Ueyama	0652c59506	COFF: Actually parallelize InputFile::parse(). This is a follow-up patch to r248078. llvm-svn: 248098	2015-09-19 21:33:26 +00:00
Rui Ueyama	27e9e6540c	Remove unused #includes. llvm-svn: 248081	2015-09-19 02:28:32 +00:00
Rui Ueyama	f4d05d7a80	COFF: Parallelize InputFile::parse(). InputFile::parse() can be called in parallel with other calls of the same function. By doing that, time to self-link improves from 741 ms to 654 ms or 12% faster. This is probably the last low hanging fruit in terms of parallelism. Input file parsing and symbol table insertion takes 450 ms in total. If we want to optimize further, we probably have to parallelize symbol table insertion using concurrent hashmap or something. That's doable, but that's not easy, especially if you want to keep the exact same semantics and linking order. I'm not going to do that at least soon. Anyway, compared to r248019 (the change before the first attempt for parallelism), we achieved 36% performance improvement from 1022 ms to 654 ms. MSVC linker takes 3.3 seconds to link the same program. MSVC's ICF feature is very slow for some reason, but even if we disable the feature, it still takes about 1.2 seconds. Our number is probably good enough. llvm-svn: 248078	2015-09-19 01:48:26 +00:00
Rui Ueyama	4151972c22	Enable extra LTO verification only when build type is debug. llvm-svn: 247956	2015-09-17 22:54:08 +00:00
Rui Ueyama	63dd8766ab	COFF: Remove DefinedSymbol::isLive() and markLive(). NFC. Basically the concept of "liveness" is for sections (or chunks in LLD terminology) and not for symbols. Symbols are always available or live, or otherwise it indicates a link failure. Previously, we had isLive() and markLive() methods for DefinedSymbol. They are confusing methods. What they actually did is to act as a proxy to backing section chunks. We can simplify eliminate these methods and call section chunk's methods directly. llvm-svn: 247869	2015-09-16 23:55:52 +00:00
Duncan P. N. Exon Smith	a11f81973a	LTO: Adjust to LLVM r247735 Perhaps lld wants to disable the verifier sometimes during COFF LTO, but for now just match behaviour from before r247735. llvm-svn: 247736	2015-09-15 23:06:16 +00:00
Rafael Espindola	fb497d79f6	Remove an allocator which was used for just one allocation. llvm-svn: 246662	2015-09-02 16:07:11 +00:00
Peter Collingbourne	df5783b7a5	COFF: Implement parallel LTO code generation. This is exposed via a new flag /opt:lldltojobs=N, where N is the number of code generation threads. Differential Revision: http://reviews.llvm.org/D12309 llvm-svn: 246342	2015-08-28 22:16:09 +00:00
Peter Collingbourne	0bb50d92b6	COFF: Update for LTO API change. llvm-svn: 245892	2015-08-24 22:22:58 +00:00
Rui Ueyama	d2d2360222	COFF: Fix /lldmap option. isLive returns false if it's not COMDAT, so check for that condition. llvm-svn: 245676	2015-08-21 07:01:08 +00:00
Peter Collingbourne	526ff15546	COFF: Introduce flag /opt:lldlto=N for controlling LTO optimization level. Differential Revision: http://reviews.llvm.org/D12024 llvm-svn: 245027	2015-08-14 04:47:07 +00:00
Rafael Espindola	b835ae8e4a	Port the error functions from ELF to COFF. This has a few advantages * Less C++ code (about 300 lines less). * Less machine code (about 14 KB of text on a linux x86_64 build). * It is more debugger friendly. Just set a breakpoint on the exit function and you get the complete lld stack trace of when the error was found. * It is a more robust API. The errors are handled early and we don't get a std::error_code hot potato being passed around. * In most cases the error function in a better position to print diagnostics (it has more context). llvm-svn: 244215	2015-08-06 14:58:50 +00:00
Rui Ueyama	506f6d1ae1	COFF: _tls_used is __tls_used on x86. llvm-svn: 243495	2015-07-28 22:56:02 +00:00
Rui Ueyama	5e706b3ee3	COFF: Use short identifiers. NFC. llvm-svn: 243229	2015-07-25 21:54:50 +00:00
Rui Ueyama	35ccb0f7d4	COFF: Don't assume !is64() means i386. In many places we assumed that is64() means AMD64 and i386 otherwise. This assumption is not sound because Windows also supports ARM. The linker doesn't support ARM yet, but this is a first step. llvm-svn: 243188	2015-07-25 00:20:06 +00:00
Rui Ueyama	cd3f99b6c5	COFF: Implement Safe SEH support for x86. An object file compatible with Safe SEH contains a .sxdata section. The section contains a list of symbol table indices, each of which is an exception handler function. A safe SEH-enabled executable contains a list of exception handler RVAs. So, what the linker has to do to support Safe SEH is basically to read the .sxdata section, interpret the contents as a list of symbol indices, unique-fy and sort their RVAs, and then emit that list to .rdata. This patch implements that feature. llvm-svn: 243182	2015-07-24 23:51:14 +00:00
Rui Ueyama	3cb895c930	COFF: Fix __ImageBase symbol relocation. __ImageBase is a special symbol whose value is the image base address. Previously, we handled __ImageBase symbol as an absolute symbol. Absolute symbols point to specific locations in memory and the locations never change even if an image is base-relocated. That means that we don't have base relocation entries for absolute symbols. This is not a case for __ImageBase. If an image is base-relocated, its base address changes, and __ImageBase needs to be shifted as well. So we have to have base relocations for __ImageBase. That means that __ImageBase is not really an absolute symbol but a different kind of symbol. In this patch, I introduced a new type of symbol -- DefinedRelative. DefinedRelative is similar to DefinedAbsolute, but it has not a VA but RVA and is a subject of base relocation. Currently only __ImageBase is of the new symbol type. llvm-svn: 243176	2015-07-24 22:58:44 +00:00
Rui Ueyama	a50387f1b3	COFF: Fix entry name inference for x86. Entry name selection rule is already complicated on x64, but it's more complicated on x86 because of the underscore name mangling scheme. If one of _main, _main@<number> (a C function) or ?main@@... (a C++ function) is defined, entry name is _mainCRTStartup. If _wmain, _wmain@<number or ?wmain@@... is defined, entry name is _wmainCRTStartup. And so on. llvm-svn: 242110	2015-07-14 02:58:13 +00:00
Rui Ueyama	270960f5cd	COFF: Find C++ mangled name for symbols starting with underscore. Symbol foo is mangled as _foo in C and ?foo@@... in C++ on x86. findMangle has to remove prefix underscore before mangle a given name as a C++ symbol. llvm-svn: 241874	2015-07-09 23:03:51 +00:00
Peter Collingbourne	f5339ec035	COFF: Improve undefined symbol diagnostics. We now report the names of any files containing undefined symbol references. Differential Revision: http://reviews.llvm.org/D10982 llvm-svn: 241612	2015-07-07 18:38:39 +00:00
Peter Collingbourne	8e17451d54	COFF: Fix bug involving archives defining a symbol multiple times. Previously we were unnecessarily loading lazy symbols if they appeared in an archive multiple times, as can happen with comdat symbols. This change fixes the bug by only loading symbols from archives at load time if the original symbol was undefined. Differential Revision: http://reviews.llvm.org/D10980 llvm-svn: 241538	2015-07-07 02:15:25 +00:00
Rui Ueyama	183f53fd22	COFF: Support isa<> for Symbol::Body, whose type is std::atomic<SymbolBody *>. llvm-svn: 241477	2015-07-06 17:45:22 +00:00
Rui Ueyama	e2eb15577d	COFF: Use CAS to update Sym->Body. Note that the linker is not multi-threaded yet. This is a preparation for that. llvm-svn: 241417	2015-07-05 22:05:08 +00:00
Rui Ueyama	c80c03da6c	COFF: Use atomic pointers in preparation for parallelizing. In the new design, mutation of Symbol pointers is the name resolution operation. This patch makes them atomic pointers so that they can be mutated by multiple threads safely. I'm going to use atomic compare-exchange on these pointers. dyn_cast<> doesn't recognize atomic pointers as pointers, so we need to call load(). This is unfortunate, but in other places automatic type conversion works fine. llvm-svn: 241416	2015-07-05 21:54:42 +00:00
Peter Collingbourne	2612a32ce5	COFF: Numerous fixes for interaction between LTO and weak externals. We were previously hitting assertion failures in the writer in cases where a regular object file defined a weak external symbol that was defined by a bitcode file. Because /export and /entry name mangling were implemented using weak externals, the same problem affected mangled symbol names in bitcode files. The underlying cause of the problem was that weak external symbols were being resolved before doing LTO, so the symbol table may have contained stale references to bitcode symbols. The fix here is to defer weak external symbol resolution until after LTO. Also implement support for weak external symbols in bitcode files by modelling them as replaceable DefinedBitcode symbols. Differential Revision: http://reviews.llvm.org/D10940 llvm-svn: 241391	2015-07-04 05:28:41 +00:00
Rui Ueyama	a3d463df6f	COFF: Print directive section contents if /verbose. llvm-svn: 241384	2015-07-04 01:39:11 +00:00
Peter Collingbourne	da2f094bbb	COFF: Fix the case where an object defines a weak external and its alias. This worked before, but only by accident, and only with assertions disabled. We ended up storing a DefinedRegular symbol in the WeakAlias field, and never using it as an Undefined. Differential Revision: http://reviews.llvm.org/D10934 llvm-svn: 241376	2015-07-03 22:03:36 +00:00
Rui Ueyama	49d6cd35ad	COFF: Fix /base option. Previously, __ImageBase symbol got a different value than the one specified by /base:<number> because the symbol was created in the SymbolTable's constructor. When the constructor is called, no command line options are processed yet, so the symbol was created always with the initial value. This caused wrong relocations and thus caused mysterious crashes of some executables linked by LLD. llvm-svn: 241313	2015-07-03 00:02:19 +00:00
Rui Ueyama	6be9099140	COFF: Define SymbolTable::insert to simplify. NFC. llvm-svn: 241311	2015-07-02 22:52:33 +00:00
Rui Ueyama	458d74421b	COFF: Merge SymbolTable::find{,Symbol}. NFC llvm-svn: 241238	2015-07-02 03:59:04 +00:00
Rui Ueyama	85225b0a36	COFF: Infer entry point as early as possible, but not too early. On Windows, we have four different main functions, {w,}{main,WinMain}. The linker has to choose a corresponding entry point function among {w,}{main,WinMain}CRTStartup. These entry point functions are defined in the standard library. The linker resolves one of them by looking at which main function is defined and adding a corresponding undefined symbol to the symbol table. Object files containing entry point functions conflicts each other. For example, we cannot resolve both mainCRTStartup and WinMainCRTStartup because other symbols defined in the files conflict. Previously, we inferred CRT function name at the very end of name resolution. I found that that is sometimes too late. If the linker already linked one of these four archive member objects, it's too late to change the decision. The right thing to do here is to infer entry point name after adding all symbols from command line files and before adding any other files (which are specified by directive sections). This patch does that. llvm-svn: 241236	2015-07-02 03:15:15 +00:00
Rui Ueyama	3d4c69c04d	COFF: Resolve AlternateNames using weak aliases. Previously, we use SymbolTable::rename to resolve AlternateName symbols. This patch is to merge that mechanism with weak aliases, so that we remove that function. llvm-svn: 241230	2015-07-02 02:38:59 +00:00
Rui Ueyama	0744e87fad	COFF: Rename getReplacement -> repl. The previous name was too long to my taste. llvm-svn: 241215	2015-07-02 00:21:11 +00:00
Rui Ueyama	18f8d2c5c0	COFF: Change GCRoot member type from StringRef to Undefined. NFC. I think Undefined symbols are a bit more convenient than StringRefs since SymbolBodies are handles for symbols. You can get resolved symbols for undefined symbols just by calling getReplacmenet without looking up the symbol table. llvm-svn: 241214	2015-07-02 00:21:08 +00:00
Rui Ueyama	6bf638e688	COFF: Simplify and rename findMangle. NFC. Occasionally we have to resolve an undefined symbol to its mangled symbol. Previously, we did that on calling side of findMangle by explicitly updating SymbolBody. In this patch, mangled symbols are handled as weak aliases for undefined symbols. llvm-svn: 241213	2015-07-02 00:04:14 +00:00
Rui Ueyama	4897596728	COFF: Chagne weak alias' type from SymbolBody** to SymbolBody*. NFC. llvm-svn: 241198	2015-07-01 22:32:23 +00:00
Rui Ueyama	4b6698917d	COFF: Simplify SymbolTable::findLazy. NFC. llvm-svn: 241128	2015-06-30 23:46:52 +00:00
Rui Ueyama	8d3010a1a6	COFF: Change the order of adding symbols to the symbol table. Previously, the order of adding symbols to the symbol table was simple. We have a list of all input files. We read each file from beginning of the list and add all symbols in it to the symbol table. This patch changes that order. Now all archive files are added to the symbol table first, and then all the other object files are added. This shouldn't change the behavior in single-threading, and make room to parallelize in multi-threading. In the first step, only lazy symbols are added to the symbol table because archives contain only Lazy symbols. Member object files found to be necessary are queued. In the second step, defined and undefined symbols are added from object files. Adding an undefined symbol to the symbol table may cause more member files to be added to the queue. We simply continue reading all object files until the queue is empty. Finally, new archive or object files may be added to the queues by object files' directive sections (which contain new command line options). The above process is repeated until we get no new files. Symbols defined both in object files and in archives can make results undeterministic. If an archive is read before an object, a new member file gets linked, while in the other way, no new file would be added. That is the most popular cause of an undeterministic result or linking failure as I observed. Separating phases of adding lazy symbols and undefined symbols makes that deterministic. Adding symbols in each phase should be parallelizable. llvm-svn: 241107	2015-06-30 19:35:21 +00:00
Chandler Carruth	be6e80b012	[opt] Hoist the call throuh SymbolBody::getReplacement out of the inline method to get a SymbolBody and into the callers, and kill now dead includes. This removes the need to have the SymbolBody definition when we're defining the inline method and makes it a better inline method. That was the only reason for a lot of header includes here. Removing these and using forward declarations actually uncovers a bunch of cross-header dependencies that I've fixed while I'm here, and will allow me to introduce some important inline code into Chunks.h that requires the definition of ObjectFile. No functionality changed at this point. Differential Revision: http://reviews.llvm.org/D10789 llvm-svn: 240982	2015-06-29 18:50:11 +00:00
Rui Ueyama	6b79ed128a	COFF: Fix /export. Mangled dllexported symbols may be defined in a library. If that's the case, we have to read a member file from the library. llvm-svn: 240947	2015-06-29 14:27:10 +00:00
Rui Ueyama	45044f47d3	COFF: Fix logic to find default entry name or subsystem. The previous logic to find default entry name or subsystem does not seem correct (i.e. was not compatible with MSVC linker). Previously, default entry name was inferred from CRT functions and user-defined entry functions. Subsystem was inferred from CRT functions. Default entry name and subsystem are now inferred based on the following table. Note that we no longer use CRT functions to infer them. Entry name Subsystem main mainCRTStartup console wmain wmainCRTStartup console WinMain WinMainCRTStartup windows wWinMain wWinMainCRTStartup windows llvm-svn: 240922	2015-06-29 01:03:53 +00:00
Rui Ueyama	f5313b3498	COFF: Allow mangled symbols as arguments for /export. Usually dllexported symbols are defined with 'extern "C"', so identifying them is easy. We can just do hash table lookup to look up exported symbols. However, C++ non-member functions are also allowed to be exported, and they can be specified with unmangled name. So, if /export:foo is given, we need to look up not only "foo" but also its all mangled names. In MSVC mangling scheme, that means that we need to look up any symbol which starts with "?foo@@Y". In this patch, we scan the entire symbol table to search for a mangled symbol. The symbol table is a DenseMap, and that doesn't support table lookup by string prefix. This is of course very inefficient. But that should be probably OK because the user should always add 'extern "C"' to dllexported symbols. llvm-svn: 240919	2015-06-28 22:16:41 +00:00
Rui Ueyama	f24e6f8607	COFF: Undefined weak aliases are not fatal if /force is given. llvm-svn: 240917	2015-06-28 20:34:09 +00:00
Rui Ueyama	95925fd1ab	COFF: Support /force flag. This option is to ignore remaining undefined symbols and force the linker to create an output file anyways. The existing code assumes that there's no undefined symbol after reportRemainingUndefines(). That assumption is legitimate. I also don't want to mess up the existing code for this minor feature. In order to keep it as is, remaining undefined symbols are replaced with dummy defined symbols. llvm-svn: 240913	2015-06-28 19:35:15 +00:00

1 2

82 Commits