llvm-project

Commit Graph

Author	SHA1	Message	Date
Rui Ueyama	27e9e6540c	Remove unused #includes. llvm-svn: 248081	2015-09-19 02:28:32 +00:00
Rui Ueyama	f4d05d7a80	COFF: Parallelize InputFile::parse(). InputFile::parse() can be called in parallel with other calls of the same function. By doing that, time to self-link improves from 741 ms to 654 ms or 12% faster. This is probably the last low hanging fruit in terms of parallelism. Input file parsing and symbol table insertion takes 450 ms in total. If we want to optimize further, we probably have to parallelize symbol table insertion using concurrent hashmap or something. That's doable, but that's not easy, especially if you want to keep the exact same semantics and linking order. I'm not going to do that at least soon. Anyway, compared to r248019 (the change before the first attempt for parallelism), we achieved 36% performance improvement from 1022 ms to 654 ms. MSVC linker takes 3.3 seconds to link the same program. MSVC's ICF feature is very slow for some reason, but even if we disable the feature, it still takes about 1.2 seconds. Our number is probably good enough. llvm-svn: 248078	2015-09-19 01:48:26 +00:00
Michael J. Spencer	acf5bdfd88	[elf2] Improve relocation-undefined-weak.s test. llvm-svn: 248071	2015-09-19 00:15:38 +00:00
Rui Ueyama	8197a4e0bf	COFF: Use parallel_sort in Writer::sortExceptionTable(). This patch saves 4 ms out of 5 ms. Very small improvement, but maybe better than nothing. llvm-svn: 248063	2015-09-18 23:17:34 +00:00
Rui Ueyama	49e72e69e5	Fix build error that std::atomic is not copy-constructible. llvm-svn: 248061	2015-09-18 22:58:12 +00:00
Rui Ueyama	e629a45531	COFF: Address review comments. - Fix race condition of `Redo` - Avoid std::distance llvm-svn: 248058	2015-09-18 22:31:15 +00:00
Michael J. Spencer	9779535c5d	[elf2] Relocate against undefined weak symbols. llvm-svn: 248056	2015-09-18 22:26:13 +00:00
Michael J. Spencer	658dccd1c8	[elf2] Relocate against common symbols. llvm-svn: 248054	2015-09-18 22:13:25 +00:00
Rui Ueyama	e0e0796d83	COFF: Parallelize Writer::writeSections(). Self-hosting took 801 ms on my machine. Of which this function took 69 ms. Now it takes 37 ms. That is about 4% overall performance improvement. llvm-svn: 248052	2015-09-18 22:07:10 +00:00
Michael J. Spencer	9567495154	[elf2] Convert if/else cascade into a covered switch. NFC. llvm-svn: 248049	2015-09-18 21:48:38 +00:00
Rui Ueyama	e8d1c59756	Style fix to make it look consistent. NFC. llvm-svn: 248044	2015-09-18 21:17:44 +00:00
Rui Ueyama	aa95e5a4cc	COFF: Parallelize ICF. The LLD's ICF algorithm is highly parallelizable. This patch does that using parallel_for_each. ICF accounted for about one third of total execution time. Previously, it took 324 ms when self-hosting. Now it takes only 62 ms. Of course your mileage may vary. My machine is a beefy 24-core Xeon machine, so you may not see this much speedup. But this optimization should be effective even for 2-core machine, since I saw speedup (324 ms -> 189 ms) when setting parallelism parameter to 2. llvm-svn: 248038	2015-09-18 21:06:34 +00:00
Davide Italiano	f7892a1f3a	[ELF2] Constify member functions. llvm-svn: 248019	2015-09-18 18:28:08 +00:00
Rafael Espindola	5c2310c30c	Start adding support for creating the GOT. With this a program can call into a shared library with jmp *foo@GOTPCREL(%rip) llvm-svn: 247992	2015-09-18 14:40:19 +00:00
Rui Ueyama	603d51104b	COFF: Reorder comparisons. This change makes equalsConstant a bit faster (193ms -> 163ms). llvm-svn: 247965	2015-09-18 02:40:54 +00:00
Rui Ueyama	8c73dfb6bf	COFF: Remove useless micro-optimization. This patch simplifies code by removing micro-optimization that doesn't contribute to speed. llvm-svn: 247964	2015-09-18 02:15:34 +00:00
Rui Ueyama	c9a6e827bd	COFF: Optimize ICF by not creating temporary vectors. Previously, ICF created a vector for each SectionChunk. The vector contained pointers to successors, which are namely associative sections and COMDAT relocation targets. The reason I created vectors is because I thought that that would make section comparison faster. It did make the comparison faster. When self-linking, for example, it saved about 10 ms on each iteration. The time we spent on constructing the vectors was 124 ms. If we iterate more than 12 times, return from the investment exceeds the initial cost. In reality, it usually needs 5 iterations. So we shouldn't construct the vectors. llvm-svn: 247963	2015-09-18 01:51:37 +00:00
Rui Ueyama	7d8263bf1d	COFF: Optimize ICF by comparing relocations before section contents. equalsConstants() is the heaviest function in ICF, and that consumes more than half of total ICF execution time. Of which, section content comparison accounts for roughly one third. Previously, we compared section contents at the beginning of the function after comparing their checksums. The comparison is very likely to succeed because when the control reaches that comparison, their checksums are always equal. And because checksums are 64-bit CRC, they are unlikely to collide. We compared relocations and associative sections after that. If they are different, the time we spent on byte-by-byte comparison of section contents were wasted. This patch moves the comparison at the end of function. If the comparison fails, the time we spent on relocation comparison are wasted, but as I wrote it's very unlikely to happen. LLD took 1198 ms to link itself to produce a 27.11 MB executable. Of which, ICF accounted for 536 ms. This patch cuts it by 90 ms, which is 17% speedup of ICF and 7.5% speedup overall. All numbers are median of ten runs. llvm-svn: 247961	2015-09-18 01:30:56 +00:00
Davide Italiano	b5b47b432b	[ELF2] Fill up local symbols fields correctly. Differential Revision: http://reviews.llvm.org/D12944 llvm-svn: 247960	2015-09-18 01:08:17 +00:00
Rui Ueyama	4151972c22	Enable extra LTO verification only when build type is debug. llvm-svn: 247956	2015-09-17 22:54:08 +00:00
Rafael Espindola	8315b1c995	Remove dead member variable. llvm-svn: 247949	2015-09-17 21:34:32 +00:00
Michael J. Spencer	879b597b9d	[elf2] Extend program-header-layout.s to check that read only sections are correctly merged into the first PT_LOAD. llvm-svn: 247947	2015-09-17 21:19:56 +00:00
Michael J. Spencer	55f56320c7	[elf2] Fix dynamic-reloc.s test. The offset of the .rela.dyn section isn't the same between hosts because a path comes before it. This test doesn't care what the offset is. llvm-svn: 247946	2015-09-17 21:13:41 +00:00
Michael J. Spencer	2f0082424f	[elf2] Combine adjacent compatible OutputSections in PT_LOADs. llvm-svn: 247925	2015-09-17 19:58:07 +00:00
Rafael Espindola	40102eb27f	Use a MapVector to output symbols in a deterministic order. We used to sort the symbols at the very end, but we need to know the order earlier so that we can create reference to them in the dynamic relocations. Thanks to Igor Kudrin for pointing out the problem. llvm-svn: 247911	2015-09-17 18:26:25 +00:00
Rafael Espindola	67a5da60ed	Add support of Elf_Rel dynamic relocations. llvm-svn: 247888	2015-09-17 14:02:10 +00:00
Denis Protivensky	18add764f0	[ELF2] Fix typo in RelocationSection::hasRelocs method llvm-svn: 247878	2015-09-17 09:54:29 +00:00
Rui Ueyama	63dd8766ab	COFF: Remove DefinedSymbol::isLive() and markLive(). NFC. Basically the concept of "liveness" is for sections (or chunks in LLD terminology) and not for symbols. Symbols are always available or live, or otherwise it indicates a link failure. Previously, we had isLive() and markLive() methods for DefinedSymbol. They are confusing methods. What they actually did is to act as a proxy to backing section chunks. We can simplify eliminate these methods and call section chunk's methods directly. llvm-svn: 247869	2015-09-16 23:55:52 +00:00
Rui Ueyama	66c06ceaca	COFF: ICF: Print out the number of iterations. NFC. llvm-svn: 247868	2015-09-16 23:55:39 +00:00
Rafael Espindola	eade07ba59	Start adding support for Elf_Rel. I don't intend to add full i686 support right now, just make sure we have all the infrastructure in place for it. llvm-svn: 247858	2015-09-16 21:57:07 +00:00
Rui Ueyama	4dbff20c91	COFF: Fix bug that not all symbols were written to symtab if /opt:noref. Only live symbols are written to the symbol table. Because isLive() returned false if dead-stripping was disabled entirely, only non-COMDAT sections were written to the symbol table. This patch fixes the issue. llvm-svn: 247856	2015-09-16 21:40:47 +00:00
Rui Ueyama	3b153e6541	COFF: Fix bug that /opt:noicf was ignored. llvm-svn: 247854	2015-09-16 21:30:55 +00:00
Rui Ueyama	4bce7bcc88	COFF: Output messages for /verbose to stdout instead of stderr. This patch also makes the message less verbose. llvm-svn: 247853	2015-09-16 21:30:40 +00:00
Davide Italiano	6d328d3841	[ELF2] Initial support for local symbols. Symbol table is now populated correctly, but some fields are missing, they'll be added in the future. This patch also adds --discard-all flag, which was the default behavior until now. Differential Revision: http://reviews.llvm.org/D12874 llvm-svn: 247849	2015-09-16 20:45:57 +00:00
Rafael Espindola	6569522f4b	Relax the test to fix some bots. The string table contains a file path, so addresses and offsets can be different on some setups. llvm-svn: 247839	2015-09-16 19:48:57 +00:00
Rafael Espindola	2b92d8f184	Move code computing NumEntries to finalize. When DynamicSection is constructed we still don't know if there will be any dynamic relocations or not. llvm-svn: 247838	2015-09-16 19:26:31 +00:00
Rafael Espindola	3887ebfc21	Add DT_RELA and DT_RELASZ to the dynamic table. llvm-svn: 247837	2015-09-16 18:52:42 +00:00
Rui Ueyama	b02a320f5e	COFF: Enable ICF by default. MSVC linker enables ICF as long as /opt:ref is eanbled, so do we. llvm-svn: 247817	2015-09-16 16:41:38 +00:00
Rui Ueyama	c04d5dbf20	COFF: Rename /opt:lldicf -> /opt:icf. Now that ICF is complete, we can rename this option so that the driver accepts the MSVC-compatible command line option. llvm-svn: 247816	2015-09-16 16:33:57 +00:00
Rafael Espindola	19e3889dba	Start creating dynamic relocations. For now we don't create got/plt and only Elf_Rela is supported. llvm-svn: 247811	2015-09-16 15:54:15 +00:00
Rui Ueyama	92298d5418	COFF: Create ICF class to move code from SectionChunk to ICF. NFC. This patch defines ICF class and defines ICF-related functions as members of the class. By doing this we can move code that are related only to ICF from SectionChunk to the newly-defined class. This also eliminates a global variable "NextID". llvm-svn: 247802	2015-09-16 14:19:10 +00:00
Rafael Espindola	37ecff14f4	Remove redundant "protected:". llvm-svn: 247797	2015-09-16 13:47:45 +00:00
Simon Atanasyan	92480f9a20	[Mips] Rejects all --hash-style arguments except 'sysv' in case of MIPS target MIPS ABI supports only --hash-style=sysv variant. llvm-svn: 247796	2015-09-16 13:36:32 +00:00
Simon Atanasyan	49c67236f1	[Mips] Do not show an error if R_MIPS_GPREL32 relocation has a non-local target This matches GNU linker behaviour. llvm-svn: 247795	2015-09-16 13:36:24 +00:00
Rafael Espindola	72373fb5b7	Fix test to actually test something. llvm-svn: 247790	2015-09-16 12:50:32 +00:00
Rui Ueyama	9cb2870ce0	ICF: Improve ICF to reduce more sections than before. This is a patch to make LLD to be on par with MSVC in terms of ICF effectiveness. MSVC produces a 27.14MB executable when linking LLD. LLD previously produced a 27.61MB when self-linking. Now the size is reduced to 27.11MB. Note that without ICF the size is 29.63MB. In r247387, I implemented an algorithm that handles section graphs as cyclic graphs and merge them using SCC. The algorithm did not always work as intended as I demonstrated in r247721. The new algortihm implemented in this patch is different from the previous one. If you are interested the details, you want to read the file comment of ICF.cpp. llvm-svn: 247770	2015-09-16 03:26:31 +00:00
Michael J. Spencer	141dd91ac5	[elf2] Simplify overflow checks. llvm-svn: 247768	2015-09-16 02:02:04 +00:00
Michael J. Spencer	75e5fda3de	[elf2] Add R_X86_64_32S. llvm-svn: 247758	2015-09-16 00:24:19 +00:00
Michael J. Spencer	dff84070da	[elf2] Add error checking for the R_X86_64_32 relocation. llvm-svn: 247745	2015-09-15 23:36:30 +00:00
Michael J. Spencer	3c1ac0a17a	[elf2] Relocate absolute symbols. llvm-svn: 247738	2015-09-15 23:12:02 +00:00

1 2 3 4 5 ...

3952 Commits