llvm-project

Commit Graph

Author	SHA1	Message	Date
Rui Ueyama	230d2ec75f	Remove return after report_fatal_error which has noreturn attribute. llvm-svn: 234204	2015-04-06 20:25:18 +00:00
Rui Ueyama	d18a97cb7a	PECOFF: Create layout-afters instead of layout-befores. All readers except PE/COFF reader create layout-after edges to preserve the original symbol order. PE/COFF uses layout-before edges as primary edges for no reason. This patch makes PE/COFF reader to create layout-after edges. Resolver is updated to recognize reverse edges of layout-after edges in the garbage collection pass. Now we can retire layout-before edges. I don't do that in this patch because if I do, I would have updated many tests to replace all occurrrences of "layout-before" with "layout-after". So that's a TODO. llvm-svn: 231615	2015-03-09 00:06:07 +00:00
Rui Ueyama	803150c9d0	Revert r231552: Resolver: optimize fallback atoms. This patch broke a buildbot. llvm-svn: 231611	2015-03-08 21:31:38 +00:00
Rui Ueyama	0536677ad6	Remove sectionPosition attribute. This code is simply dead. No one is using it. http://reviews.llvm.org/D8125 llvm-svn: 231583	2015-03-08 01:01:40 +00:00
Rui Ueyama	9b21ded6c8	Resolver: optimize fallback atoms. Atoms with fallback atoms are never be added to the symbol table. However, we added such atoms to _undefines array. We had to call isCoalescedAway to identify and skip them. We should just stop adding them in the first place. This seems to make the linker ~1% faster in my test case. llvm-svn: 231552	2015-03-07 04:23:46 +00:00
Rui Ueyama	abd39f0549	Resolver: Reduce number of SymbolTable::isDefined function calls. If an undefined symbol is added to the symbol table by the previous call of SymbolTable::add, SymbolTable::isDefined will always return false for the same symbol. llvm-svn: 231551	2015-03-07 03:55:32 +00:00
Rui Ueyama	923147b954	Resolver: Reduce number of hash function call. This is yet another optimization patch. Previously we called SymbolTable::isDefined() and SymbolTable::findByName() from a very frequently executed function. Because isDefined calls findByName, findByName is called twice on each iteration. findByName is not a cheap function. It computes a hash value for a given symbol name. When linking C++ programs, it can be expensive because of C++ mangled long symbols. This patch reduces the number of call from 2 to 1. Performance improvements by this patch was larger than I expected. Linking time of chrome.dll gets almost 5% shorter. llvm-svn: 231549	2015-03-07 03:22:37 +00:00
Rui Ueyama	2039fc1f42	Resolver: move code inside an assert. llvm-svn: 231518	2015-03-06 21:15:06 +00:00
Rui Ueyama	13003d7774	Core: Make the resolver faster. In the resolver, we maintain a list of undefined symbols, and when we visit an archive file, we check that file if undefined symbols can be resolved using files in the archive. The archive file class provides find() function to lookup a symbol. Previously, we call find() for each undefined symbols. Archive files may be visited multiple times if they are in a --start-group and --end-group. If we visit a file M times and if we have N undefined symbols, find() is called MN times. I found that that is one of the most significant bottlenecks in LLD when linking a large executable. find() is not a very cheap operation because it looks up a hash table for a given string. And a string, or a symbol name, can be pretty long if you are dealing with C++ symbols. We can eliminate the bottleneck. Calling find() with the same symbol multiple times is a waste. If a result of looking up a symbol is "not found", it stays "not found" forever because the symbol simply doesn't exist in the archive. Thus, we should call find() only for newly-added undefined symbols. This optimization makes O(MN) O(N). In this patch, all undefined symbols are added to a vector. For each archive/shared library file, we maintain a start position P. All symbols [0, P) are already searched. [P, end of the vector) are not searched yet. For each file, we scan the vector only once. This patch changes the order in which undefined symbols are looked for. Previously, we iterated over the result of _symbolTable.undefines(). Now we iterate over the new vector. This is a benign change but caused differences in output if remaining undefines exist. This is why some tests are updated. The performance improvement of this patch seems sometimes significant. Previously, linking chrome.dll on my workstation (Xeon 2.4GHz 8 cores) took about 70 seconds. Now it takes (only?) 30 seconds! http://reviews.llvm.org/D8091 llvm-svn: 231434	2015-03-06 00:28:41 +00:00
Rui Ueyama	25d5abdb3a	Optimize resolver by using std::unordered_multimap. _reverseRef is a multimap from atoms to atoms. The map contains reverse edges of "layout-before" and "group" edges for dead-stripping. The type of the variable was DenseMap<Atom , DenseSet<Atom >>. This patch changes that to std::unordered_multimap<Atom , Atom >. A DenseMap with a value type of DenseSet was not fast. Inserting 900k items to the map took about 1.6 seconds on my workstation. unordered_multimap on the other hand took only 0.6 seconds. Use of the map also got faster -- originally markLive took 1.3 seconds in the same test case, and it now took 1.0 seconds. In total we shove off 1.3 seconds out of 27 seconds in that test case. llvm-svn: 231432	2015-03-06 00:22:48 +00:00
Rui Ueyama	6780cea628	Early return. NFC. llvm-svn: 231403	2015-03-05 20:22:14 +00:00
Rui Ueyama	e5bf769443	Resolver: Update preload map after File::beforeLink(). We maintain a map from symbols to archive files for the archive file pre-loading. That map is created at the beginning of the resolve() and is never updated. However, the input file list may be updated by File::beforeLink(). This is a patch to update the map after beforeLink. llvm-svn: 231395	2015-03-05 19:25:58 +00:00
Rui Ueyama	394d10e34d	Make File non-const in the resolver. File objects are not really const in the resolver. We set ordinals to them and call beforeLink hooks. Also, File's member functions marked as const are not really const. ArchiveFile never returns the same member file twice, so it remembers files returned before. find() has side effects. In order to deal with the inconsistencies, we sprinkled const_casts and marked member varaibles as mutable. This patch removes const from there to reflect the reality. llvm-svn: 231212	2015-03-04 04:36:46 +00:00
Rui Ueyama	791db5f0a3	Implement our own future and use that for FileArchive::preload(). std::promise and std::future in old version of libstdc++ are buggy. I think that's the reason why LLD tests were flaky on Ubuntu 13 buildbots until we disabled file preloading. In this patch, I implemented very simple future and used that in FileArchive. Compared to std::promise and std::future, it lacks many features, but should serve our purpose. http://reviews.llvm.org/D8025 llvm-svn: 231153	2015-03-03 22:19:46 +00:00
Rui Ueyama	b2e16973df	Do s/_context/_ctx/ to Resolver.cpp. llvm-svn: 230814	2015-02-27 23:40:00 +00:00
Rui Ueyama	cb1c4d1a9a	Remove a varaible that's used only once. NFC. llvm-svn: 230813	2015-02-27 23:36:05 +00:00
Rui Ueyama	c817fd2273	Call File::beforeLink hook even if the file is in an archive. Previously we didn't call the hook on a file in an archive, which let the PE/COFF port fail to link files in archives. It was a simple mistake. Added a call to the hook and also added a test to catch that error. const_cast is an unfortunate hack. Files in the resolver are usually const, but they are not actually const objects, since they are mutated if either a file is taken from an archive (an archive file does never return the same file twice) or the beforeLink hook is called. Maybe we should just remove const from there -- because they are not const. llvm-svn: 230808	2015-02-27 23:15:11 +00:00
Rui Ueyama	0bde1de3bd	Temporarily disable FileArchive::preload(). It is observed that the function throws std::future_error on a few buildbots. That cannot be easily reproducible on local machines. Kill the feature temporarily to see if this is going to fix the buildbot issue. llvm-svn: 230735	2015-02-27 05:26:05 +00:00
Davide Italiano	a82ae9146e	[Core] Do not reclaim absolute atoms in resolver. This fixes a linker crash (found out while testing --gc-sections, testcase provided by Rafael Avila de Espindola). While this behaviour was found while testing ELF, it' not necessarily ELF specific and this change is (apparently) harmless on all the other drivers. Differential Revision: D7823 Reviewed by: ruiu llvm-svn: 230614	2015-02-26 05:39:57 +00:00
Rui Ueyama	f9a30ddcc0	Fix runtime error on Windows. I believe the original code is valid, but on Windows it failed with an assertion error saying "Expression: vector iterator is not decrementable." Don't use rbegin and rend to workaround that error. llvm-svn: 226706	2015-01-21 22:16:17 +00:00
Rui Ueyama	bcf8918013	Simplify. What we are trying to do here is to skip object files in group if group is repeated. This code is simpler than before. llvm-svn: 226688	2015-01-21 18:54:26 +00:00
Rui Ueyama	e6556a9cea	Fix --start-group/end-group. We used to manage the state whether we are in a group or not using a counter. The counter is incremented by one if we jump from end-group to start-group, and decremented by one if we don't. The counter was assumed to be either zero or one, but obviously it could be negative (if there's a group which is not repeated at all). This is a fix for that issue. llvm-svn: 226632	2015-01-21 01:26:43 +00:00
Rui Ueyama	3a8d7e2f10	[PATCH] Speculatively instantiate archive members LLD parses archive file index table only at first. When it finds a symbol it is looking for is defined in a member file in an archive file, it actually reads the member from the archive file. That's done in the core linker. That's a single-thread process since the core linker is single threaded. If your command line contains a few object files and a lot of archive files (which is quite often the case), LLD hardly utilizes hardware parallelism. This patch improves parallelism by speculatively instantiating archive file members. At the beginning of the core linking, we first create a map containing all symbols defined in all members, and each time we find a new undefined symbol, we instantiate a member file containing the symbol (if such file exists). File instantiation is side effect free, so this should not affect correctness. This is a quick benchmark result. Time to link self-link LLD executable: Linux 9.78s -> 8.50s (0.86x) Windows 6.18s -> 4.51s (0.73x) http://reviews.llvm.org/D7015 llvm-svn: 226336	2015-01-16 22:44:50 +00:00
Rui Ueyama	bd350a5cd2	Remove duplication code. llvm-svn: 226321	2015-01-16 21:11:00 +00:00
Rui Ueyama	d4730ea555	Run the resolver in parallel with the reader. This patch makes File::parse() multi-thread safe. If one thread is running File::parse(), other threads will block if they try to call the same method. File::parse() is idempotent, so you can safely call multiple times. With this change, we don't have to wait for all worker threads to finish in Driver::link(). Previously, Driver::link() calls TaskGroup::sync() to wait for all threads running File::parse(). This was not ideal because we couldn't start the resolver until we parse all files. This patch increase parallelism by making Driver::link() to not wait for worker threads. The resolver calls parse() to make sure that the file being read has been parsed, and then uses the file. In this approach, the resolver can run with the parser threads in parallel. http://reviews.llvm.org/D6994 llvm-svn: 226281	2015-01-16 15:54:13 +00:00
Rui Ueyama	883afba228	Remove InputGraph and use std::vector<Node> instead. In total we have removed more than 1000 lines! llvm-svn: 226149	2015-01-15 08:46:36 +00:00
Rui Ueyama	61635440a9	Rename InputElement Node. InputElement was named that because it's an element of an InputGraph. It's losing the origin because the InputGraph is now being removed. InputElement's subclass is FileNode, that naming inconsistency needed to be fixed. llvm-svn: 226147	2015-01-15 08:31:46 +00:00
Rui Ueyama	cdb1071be5	Remove InputGraph::size(). llvm-svn: 226140	2015-01-15 07:20:39 +00:00
Rui Ueyama	80c04431ca	Re-commit r225766, r225767, r225769, r225814, r225816, r225829, and r225832. These changes depended on r225674 and had been rolled back in r225859. Because r225674 has been re-submitted, it's safe to re-submit them. llvm-svn: 226132	2015-01-15 06:49:21 +00:00
Rui Ueyama	cfb2534ef8	Revert "Convert other drivers to use WrapperNode" and subsequent commits. r225764 broke a basic functionality on Mac OS. This change reverts r225764, r225766, r225767, r225769, r225814, r225816, r225829, and r225832. llvm-svn: 225859	2015-01-14 00:21:34 +00:00
Rui Ueyama	b34838424a	Remove InputGraph::getNextFile(). getNextFile used to have a complex logic to determine which file should be processed by the Resolver on next iteration. Now, it is just a sequential accessor to the internal array and provides no sensible feature. This patch also removes InputGraph::getGroupSize and InputGraph:: skipGroup to simplify the code. llvm-svn: 225832	2015-01-13 21:27:12 +00:00
Rui Ueyama	e9b455184f	Simplify InputGraph API. These member functions returns either no_more_files error or a File object. We could simply return a nullptr instead of a no_more_files. This function will be removed soon as a part of InputGraph cleanup. I had to do that step by step. llvm-svn: 224208	2014-12-14 02:04:01 +00:00
Rui Ueyama	00eb257f2e	Re-commit r223330: Rewrite InputGraph's Group llvm-svn: 223867	2014-12-10 00:33:00 +00:00
Rui Ueyama	5ae2050420	Revert "Rewrite InputGraph's Group" This reverts commit r223330 because it broke Darwin and ELF linkers in a way that we couldn't have caught with the existing test cases. llvm-svn: 223373	2014-12-04 18:29:03 +00:00
Rui Ueyama	60df72ff61	Rewrite InputGraph's Group The aim of this patch is to reduce the excessive abstraction from the InputGraph. We found that even a simple thing, such as sorting input files (Mach-O) or adding a new file to the input file list (PE/COFF), is nearly impossible with the InputGraph abstraction, because it hides too much information behind it. As a result, we invented complex interactions between components (e.g. notifyProgress() mechanism) and tricky code to work around that limitation. There were many occasions that we needed to write awkward code. This patch is a first step to make it cleaner. As a first step, this removes Group class from the InputGraph. The grouping feature is now directly handled by the Resolver. notifyProgress is removed since we no longer need that. I could have cleaned it up even more, but in order to keep the patch minimum, I focused on Group. SimpleFileNode class, a container of File objects, is now limited to have only one File. We shold have done this earlier. We used to allow putting multiple File objects to FileNode. Although SimpleFileNode usually has only one file, the Driver class actually used that capability. I modified the Driver class a bit, so that one FileNode is created for each input File. We should now probably remove SimpleFileNode and directly store File objects to the InputGraph in some way, because a container that can contain only one object is useless. This is a TODO. Mach-O input files are now sorted before they are passe to the Resolver. DarwinInputGraph class is no longer needed, so removed. PECOFF still has hacky code to add a new file to the input file list. This will be cleaned up in another patch. llvm-svn: 223330	2014-12-04 01:09:06 +00:00
Tim Northover	f98b1c9960	[mach-o] remove __compact_unwind atoms once __unwind_info has been generated The job of the CompactUnwind pass is to turn __compact_unwind data (and __eh_frame) into the compressed final form in __unwind_info. After it's done, the original atoms are no longer relevant and should be deleted (they cause problems during actual execution, quite apart from the fact that they're not needed). llvm-svn: 221301	2014-11-04 21:57:32 +00:00
Shankar Easwaran	2b67fca033	Sort include files according to convention. llvm-svn: 220131	2014-10-18 05:33:55 +00:00
Nick Kledzik	be43d7ef29	[mach-o] Implement -demangle. The darwin linker has the -demangle option which directs it to demangle C++ (and soon Swift) mangled symbol names. Long term we need some Diagnostics object for formatting errors and warnings. But for now we have the Core linker just writing messages to llvm::errs(). So, to enable demangling, I changed the Resolver to call a LinkingContext method on the symbol name. To make this more interesting, the demangling code is done via __cxa_demangle() which is part of the C++ ABI, which is only supported on some platforms, so I had to conditionalize the code with the config generated HAVE_CXXABI_H. llvm-svn: 218718	2014-09-30 23:15:39 +00:00
Rui Ueyama	d20d44fbe6	Use DenseMap::lookup. No functionality change. llvm-svn: 218554	2014-09-26 23:21:10 +00:00
Rui Ueyama	b774a0e750	Fix crash bug on Windows. Mutating the DenseMap here seems to cause the Windows executable to crash. Don't use operator[] to access possibly nonexistent key. llvm-svn: 218548	2014-09-26 22:27:42 +00:00
Rui Ueyama	61d7f97000	[PECOFF] Support COMDAT associative sections. COFF supports a feature similar to ELF's section groups. This patch implements it. In ELF, section groups are identified by their names, and they are treated somewhat differently from regular symbols. In COFF, the feature is realized in a more straightforward way. A section can have an annotation saying "if Nth section is linked, link this section too." I added a new reference type, kindAssociate. If a target atom is coalesced away, the referring atom is removed by Resolver, so that they are treated as a group. Differential Revision: http://reviews.llvm.org/D4028 llvm-svn: 211106	2014-06-17 16:19:33 +00:00
Rafael Espindola	b1a4d3a26c	Don't import error_code into the lld namespace. llvm-svn: 210785	2014-06-12 14:53:47 +00:00
Rui Ueyama	733b45f3b0	Add SymbolTable::isCoalescedAway isCoalescedAway(x) is faster than replacement(x) != x as the former does not follow the replacement atom chain. Also it's easier to use. llvm-svn: 210242	2014-06-05 07:37:29 +00:00
Rui Ueyama	52edc49031	Print error message in LinkOnce handler. Rather than outside of the handler function to make the code simple. llvm-svn: 210241	2014-06-05 07:37:25 +00:00
Rui Ueyama	9aee050a0c	Remove group-parent references. Previously section groups are doubly linked to their children. That is, an atom representing a group has group-child references to its group contents, and content atoms also have group-parent references to the group atom. That relationship was invariant; if X has a group-child edge to Y, Y must have a group-parent edge to X. However we were not using group-parent references at all. The resolver only needs group-child edges. This patch simplifies the section group by removing the unused reverse edge. No functionality change intended. Differential Revision: http://reviews.llvm.org/D3945 llvm-svn: 210066	2014-06-03 03:07:49 +00:00
Rui Ueyama	23487e878b	Make dead-striping to handle reverse edges. Layout-before edges are no longer used for layout, but they are still there for dead-stripping. If we would just remove them from code, LLD would wrongly remove live atoms that were referenced by layout-befores. This patch fixes the issue. Before dead-stripping, it scans all atoms to construct a reverse map for layout-after edges. Dead- stripping pass uses the map to traverse the graph. Differential Revision: http://reviews.llvm.org/D3986 llvm-svn: 210057	2014-06-03 01:59:02 +00:00
Rui Ueyama	6ffe42eee5	Simplify markLive(). Reference::target() never returns a nullptr, so NULL check is not needed and is more harmful than doing nothing. No functionality change. llvm-svn: 210008	2014-06-02 08:06:57 +00:00
Rui Ueyama	2a52251153	Fix regression introduced in r205566. In r205566, I made a change to Resolver so that Resolver revisit only archive files in --start-group and --end-group pair. That's not correct, as it also has to revisit DSO files. This patch is to fix the issue. Added a test to demonstrate the fix. I confirmed that it succeeded before r205566, failed after r205566, and is ok with this patch. Differential Revision: http://reviews.llvm.org/D3734 llvm-svn: 208797	2014-05-14 17:29:27 +00:00
Rui Ueyama	20822d924e	Return the expression result directly. llvm-svn: 207878	2014-05-02 22:32:01 +00:00
Rui Ueyama	992fdc0785	SymbolTable::size() returns an unsigned int. It's better to use the same type rather than a fixed width integer type that may be different from the return type. llvm-svn: 205597	2014-04-04 01:22:51 +00:00

1 2 3

146 Commits