llvm-project

Commit Graph

Author	SHA1	Message	Date
Rafael Espindola	5708b2f8a6	Ignore R_X86_64_NONE. It looks like the way dtrace works is * The user creates .o files that reference magical symbol names. * dtrace reads those files, collecs the info it needs and changes the relocation to R_X86_64_NONE expecting the linker to ignore them. llvm-svn: 288485	2016-12-02 08:00:09 +00:00
Rui Ueyama	27498b5dd5	Fix a bug in ICF involving COFF associative sections. Associative sections are sections that need to be linked if their associated sections are linked. Associative sections are used to append auxiliary data such as debug info. Previously, we compared all associative sections when comparing two comdat sections. Because usually assocative sections are not mergeable sections, we missed a lot of mergeable sections. MSVC linker doesn't seem to check the identity of associative sections. This patch makes LLD to ignore associative sections when doing ICF. llvm-svn: 288483	2016-12-02 07:46:12 +00:00
Rui Ueyama	1b6bab011c	Fix the worse case performance of ICF. r288228 seems to have regressed ICF performance in some cases in which a lot of sections are actually mergeable. In r288228, I made a change to create a Range object for each new color group. So every time we split a group, we allocated and added a new group to a list of groups. This patch essentially reverted r288228 with an improvement to parallelize the original algorithm. Now the ICF main loop is entirely allocation-free and lock-free. Just like pre-r288228, we search for group boundaries by linear scan instead of managing the information using Range class. r288228 was neutral in performance-wise, and so is this patch. I confirmed that this produces the exact same result as before using chromium and clang as tests. llvm-svn: 288480	2016-12-02 05:35:46 +00:00
Rafael Espindola	103fc28961	Add a test documenting how we handle addends on Elf_Rela. llvm-svn: 288477	2016-12-02 04:20:47 +00:00
Rafael Espindola	858c092daa	Allow duplicated abs symbols with the same value. This is a fairly reasonable bfd extension since there is one obvious value. dtrace depends on this feature as it creates multiple absolute symbols with the same value. llvm-svn: 288461	2016-12-02 02:58:21 +00:00
Rafael Espindola	f4ff80c128	Write the addent to got entries when using Elf_Rel. llvm-svn: 288451	2016-12-02 01:57:24 +00:00
Rui Ueyama	395859bdb7	Fix undefined behavior. New items can be added to Ranges here, and that invalidates an iterater that previously pointed the end of the vector. llvm-svn: 288443	2016-12-02 00:38:15 +00:00
Rui Ueyama	a6cd5fe415	Add an assert instead of ignoring an impossible condition. llvm-svn: 288419	2016-12-01 21:41:06 +00:00
Rui Ueyama	91ae861af5	Updates file comments and variable names. Use "color" instead of "group id" to describe the ICF algorithm. llvm-svn: 288409	2016-12-01 19:45:22 +00:00
Rui Ueyama	c1835319c9	Parallelize ICF to make LLD's ICF really fast. ICF is short for Identical Code Folding. It is a size optimization to identify two or more functions that happened to have the same contents to merges them. It usually reduces output size by a few percent. ICF is slow because it is computationally intensive process. I tried to paralellize it before but failed because I couldn't make a parallelized version produce consistent outputs. Although it didn't create broken executables, every invocation of the linker generated slightly different output, and I couldn't figure out why. I think I now understand what was going on, and also came up with a simple algorithm to fix it. So is this patch. The result is very exciting. Chromium for example has 780,662 input sections in which 20,774 are reducible by ICF. LLD previously took 7.980 seconds for ICF. Now it finishes in 1.065 seconds. As a result, LLD can now link a Chromium binary (output size 1.59 GB) in 10.28 seconds on my machine with ICF enabled. Compared to gold which takes 40.94 seconds to do the same thing, this is an amazing number. From here, I'll describe what we are doing for ICF, what was the previous problem, and what I did in this patch. In ICF, two sections are considered identical if they have the same section flags, section data, and relocations. Relocations are tricky, becuase two relocations are considered the same if they have the same relocation type, values, and if they point to the same section _in terms of ICF_. Here is an example. If foo and bar defined below are compiled to the same machine instructions, ICF can (and should) merge the two, although their relocations point to each other. void foo() { bar(); } void bar() { foo(); } This is not an easy problem to solve. What we are doing in LLD is some sort of coloring algorithm. We color non-identical sections using different colors repeatedly, and sections in the same color when the algorithm terminates are considered identical. Here is the details: 1. First, we color all sections using their hash values of section types, section contents, and numbers of relocations. At this moment, relocation targets are not taken into account. We just color sections that apparently differ in different colors. 2. Next, for each color C, we visit sections having color C to see if their relocations are the same. Relocations are considered equal if their targets have the same color. We then recolor sections that have different relocation targets in new colors. 3. If we recolor some section in step 2, relocations that were previously pointing to the same color targets may now be pointing to different colors. Therefore, repeat 2 until a convergence is obtained. Step 2 is a heavy operation. For Chromium, the first iteration of step 2 takes 2.882 seconds, and the second iteration takes 1.038 seconds, and in total it needs 23 iterations. Parallelizing step 1 is easy because we can color each section independently. This patch does that. Parallelizing step 2 is tricky. We could work on each color independently, but we cannot recolor sections in place, because it will break the invariance that two possibly-identical sections must have the same color at any moment. Consider sections S1, S2, S3, S4 in the same color C, where S1 and S2 are identical, S3 and S4 are identical, but S2 and S3 are not. Thread A is about to recolor S1 and S2 in C'. After thread A recolor S1 in C', but before recolor S2 in C', other thread B might observe S1 and S2. Then thread B will conclude that S1 and S2 are different, and it will split thread B's sections into smaller groups wrongly. Over- splitting doesn't produce broken results, but it loses a chance to merge some identical sections. That was the cause of indeterminism. To fix the problem, I made sections have two colors, namely current color and next color. At the beginning of each iteration, both colors are the same. Each thread reads from current color and writes to next color. In this way, we can avoid threads from reading partial results. After each iteration, we flip current and next. This is a very simple solution and is implemented in less than 50 lines of code. I tested this patch with Chromium and confirmed that this parallelized ICF produces the identical output as the non-parallelized one. Differential Revision: https://reviews.llvm.org/D27247 llvm-svn: 288373	2016-12-01 17:09:04 +00:00
Sean Silva	2eed75926c	Add `isRelExprOneOf` helper In various places in LLD's hot loops, we have expressions of the form "E == R_FOO \|\| E == R_BAR \|\| ..." (E is a RelExpr). Some of these expressions are quite long, and even though they usually go just a very small number of ways and so should be well predicted, they can still occupy branch predictor resources harming other parts of the code, or they won't be predicted well if they overflow branch predictor resources or if the branches are too dense and the branch predictor can't track them all (the compiler can in theory avoid this, at a cost in text size). And some of these expressions are so large and executed so frequently that even when well-predicted they probably still have a nontrivial cost. This speedup should be pretty portable. The cost of these simple bit tests is independent of: - the target we are linking for - the distribution of RelExpr's for a given link (which can depend on how the input files were compiled) - what compiler was used to compile LLD (it is just a simple bit test; hopefully the compiler gets it right!) - adding new target-dependent relocations (e.g. needsPlt doesn't pay any extra cost checking R_PPC_PLT_OPD on x86-64 builds) I did some rough measurements on clang-fsds and this patch gives over about 4% speedup for a regular -O1 link, about 2.5% for -O3 --gc-sections and over 5% for -O0. Sorry, I don't have my current machine set up for doing really accurate measurements right now. This also is just a bit cleaner. Thanks for Joerg for suggesting for this approach. Differential Revision: https://reviews.llvm.org/D27156 llvm-svn: 288314	2016-12-01 05:43:48 +00:00
Rui Ueyama	10091b0ac2	Simplify ScriptParser. - Rename currentBuffer -> getCurrentMB to start it with verb. - Simplify containsString. - Add llvm_unreachable at end of getCurrentMB. llvm-svn: 288310	2016-12-01 04:36:54 +00:00
Rui Ueyama	3cd22d3104	Do not name a variable Ret which is not a return value. llvm-svn: 288309	2016-12-01 04:36:51 +00:00
Rui Ueyama	b5f1c3ec0c	Make get{Line,Column}Number members of StringParser. This patch also renames currentLocation getCurrentLocation. llvm-svn: 288308	2016-12-01 04:36:49 +00:00
Rui Ueyama	50fb82743e	Split getPos into getLineNumber and getColumnNumber. llvm-svn: 288306	2016-12-01 03:56:27 +00:00
Rui Ueyama	c5cb737584	Dump Codeview type information correctly. llvm-svn: 288298	2016-12-01 01:22:48 +00:00
Rui Ueyama	9dedfb1fa8	Change how we manage groups in ICF. Previously, on each iteration in ICF, we scan the entire vector of input sections to find boundaries of groups having the same ID. This patch changes the algorithm so that we now have a vector of ranges. Each range contains a starting index and an ending index of the group. So we no longer have to search boundaries on each iteration. Performance-wise, this seems neutral. Instead of searching boundaries, we now have to maintain ranges. But I think this is more readable than the previous implementation. Moreover, this makes easy to parallelize the main loop of ICF, which I'll do in a follow-up patch. llvm-svn: 288228	2016-11-30 01:50:03 +00:00
Rui Ueyama	84e65a7ca1	Use StringRefZ explicitly instead of const char . This patch is to avoid an implicit conversion from const char to StringRefZ, to make it apparent where we are using StringRefZ. llvm-svn: 288182	2016-11-29 19:11:39 +00:00
Rui Ueyama	a13efc2a73	Introduce StringRefZ class to represent null-terminated strings. StringRefZ is a class to represent a null-terminated string. String length is computed lazily, so it's more efficient than StringRef to represent strings in string table. The motivation of defining this new class is to merge functions that only differ in string types; we have many constructors that takes `const char *` or `StringRef`. With StringRefZ, we can merge them. Differential Revision: https://reviews.llvm.org/D27037 llvm-svn: 288172	2016-11-29 18:05:04 +00:00
Peter Smith	de3e73880e	[ELF] Add support for static TLS to ARM The module index dynamic relocation R_ARM_DTPMOD32 is always 1 for an executable. When static linking and when we know that we are not a shared object we can resolve the module index relocation statically. The logic in handleNoRelaxTlsRelocation remains the same for Mips as it has its own custom GOT writing code. For ARM we add the module index relocation to the GOT when it can be resolved statically. In addition the type of the RelExpr for the static resolution of TlsGotRel should be R_TLS and not R_ABS as we need to include the size of the thread control block in the calculation. Addresses the TLS part of PR30218. Differential revision: https://reviews.llvm.org/D27213 llvm-svn: 288153	2016-11-29 16:23:50 +00:00
George Rimar	9b3ae73fc8	[ELF] - Disable emiting multiple output sections when merging is disabled. When -O0 is specified, we do not do section merging. Though before this patch several sections were generated instead of single, what is useless. Differential revision: https://reviews.llvm.org/D27041 llvm-svn: 288151	2016-11-29 16:11:09 +00:00
George Rimar	3fb5a6dc9e	[ELF] - Add support of proccessing of the rest allocatable synthetic sections from linkerscript. This change continues what was started by D27040 Now all allocatable synthetics should be available from script side. Differential revision: https://reviews.llvm.org/D27131 llvm-svn: 288150	2016-11-29 16:05:27 +00:00
Simon Atanasyan	160bf723f5	[ELF][MIPS] Make stable an order of GOT page address entries llvm-svn: 288137	2016-11-29 13:26:04 +00:00
Simon Atanasyan	198dd205e6	[ELF][MIPS] Add new check to the test case in attempt to investigate Windows build-bot failure llvm-svn: 288132	2016-11-29 11:19:47 +00:00
Simon Atanasyan	9705ff74ed	[ELF][MIPS] Restore Config->Threads for MIPS targets llvm-svn: 288130	2016-11-29 10:24:00 +00:00
Simon Atanasyan	9fae3b8a2c	[ELF][MIPS] Do not change MipsGotSection state in the getPageEntryOffset method The MipsGotSection::getPageEntryOffset calculates index of GOT entry with a "page" address. Previously this method changes the state of MipsGotSection because it modifies PageIndexMap field. That leads to the unpredictable results if getPageEntryOffset called from multiple threads. The patch makes getPageEntryOffset constant. To do so it calculates GOT entry index but does not update PageIndexMap field. Later in the MipsGotSection::writeTo method linker calculates "page" addresses and writes them to the output. llvm-svn: 288129	2016-11-29 10:23:56 +00:00
Simon Atanasyan	a0efc4268c	[ELF][MIPS] Replace the magic number of GOT header entries by constant. NFC llvm-svn: 288128	2016-11-29 10:23:50 +00:00
Simon Atanasyan	80f3d9ce93	[ELF][MIPS] Fix calculation of GOT "page address" entries number If output section which referenced by R_MIPS_GOT_PAGE or R_MIPS_GOT16 relocations is small (less that 0x10000 bytes) and occupies two adjacent 0xffff-bytes pages, current formula gives incorrect number of required "page" GOT entries. The problem is that in time of calculation we do not know the section address and so we cannot calculate number of 0xffff-bytes pages exactly. This patch fix the formula. Now it gives a correct number of pages in the worst case when "small" section intersects 0xffff-bytes page boundary. From the other side, sometimes it adds one more redundant GOT entry for each output section. But usually number of output sections referenced by GOT relocations is small. llvm-svn: 288127	2016-11-29 10:23:46 +00:00
George Rimar	595a763f38	[ELF] - Implemented -N (-omagic) command line option. -N (-omagic) Set the text and data sections to be readable and writable. Also, do not page-align the data segment. Differential revision: https://reviews.llvm.org/D26888 llvm-svn: 288123	2016-11-29 09:43:51 +00:00
Eugene Leviant	84569e6caa	[ELF] Refactor target error messages Differential revision: https://reviews.llvm.org/D27097 llvm-svn: 288114	2016-11-29 08:05:44 +00:00
Rui Ueyama	bf4ddeb97c	Style fix. llvm-svn: 288113	2016-11-29 04:22:57 +00:00
Rui Ueyama	c910bc7e19	Simplify "missing argument" error message. llvm-svn: 288112	2016-11-29 04:17:31 +00:00
Rui Ueyama	e5e407beb4	Add comments. llvm-svn: 288111	2016-11-29 04:17:30 +00:00
Rui Ueyama	bd2a812ff0	Print error message header in red. llvm-svn: 288110	2016-11-29 04:09:08 +00:00
Rafael Espindola	f1e245315b	Use relocations to fill statically known got entries. Right now we just remember a SymbolBody for each got entry and duplicate a bit of logic to decide what value, if any, should be written for that SymbolBody. With ARM there will be more complicated values, and it seems better to just use the relocation code to fill the got entries. This makes it clear that each entry is filled by the dynamic linker or by the static linker. llvm-svn: 288107	2016-11-29 03:45:36 +00:00
Rafael Espindola	d3b32df3de	Sort. NFC. llvm-svn: 288102	2016-11-29 03:36:30 +00:00
Rafael Espindola	5498ba38df	Add a test. It would have found a missing case in another patch. llvm-svn: 288101	2016-11-29 03:30:07 +00:00
George Rimar	1642c5d871	[ELF] - Do not put non exec sections first when -no-rosegment That unifies handling cases when we have SECTIONS and when -no-rosegment is given in compareSectionsNonScript() Now Config->SingleRoRx is used for check, testcase is provided. llvm-svn: 288022	2016-11-28 10:26:21 +00:00
George Rimar	18a3096282	[ELF] - Set Config->SingleRoRx differently. NFC. Previously Config->SingleRoRx was set in createFiles() and used HasSections. This change moves it to readConfigs at place of common flags handling, and adds logic that sets this flag separatelly from ScriptParser if SECTIONS present. llvm-svn: 288021	2016-11-28 10:11:10 +00:00
George Rimar	63bf011003	[ELF] - Implemented -no-rosegment. --no-rosegment: Do not put read-only non-executable sections in their own segment Differential revision: https://reviews.llvm.org/D26889 llvm-svn: 288020	2016-11-28 10:05:20 +00:00
Eugene Leviant	ed30ce7ae4	[ELF] Print file:line for 'undefined section' errors Differential revision: https://reviews.llvm.org/D27108 llvm-svn: 288019	2016-11-28 09:58:04 +00:00
Rafael Espindola	8e67000f1a	Always create a PT_ARM_EXIDX if needed. Unfortunatelly PT_ARM_EXIDX is special. There is no way to create it from linker scripts, so we have to create it even if PHDRS is used. This matches bfd and is required for the lld output to survive bfd's strip. llvm-svn: 288012	2016-11-28 00:40:21 +00:00
Rui Ueyama	1dd86a664f	Add paralell_for and use it where appropriate. When we iterate over numbers as opposed to iterable elements, parallel_for fits better than parallel_for_each. llvm-svn: 288002	2016-11-27 19:28:32 +00:00
Rafael Espindola	5fcc99c27d	Also skip regular symbol assignment at the start of a script. Unfortunatelly some scripts look like kernphys = ... . = .... and the expectation in that every orphan section is after the assignment. llvm-svn: 287996	2016-11-27 09:44:45 +00:00
Rafael Espindola	7fe4ec9b3a	Don't put an orphan before the first . assignment. This is an horrible special case, but seems to match bfd's behaviour and is important for avoiding placing an orphan section before the expected start of the file. llvm-svn: 287994	2016-11-27 07:39:45 +00:00
Rui Ueyama	e8a077badf	Change return types of split{Non,}Strings. They return new vectors, but at the same time they mutate other vectors, so returning values doesn't make much sense. We should just mutate two vectors. llvm-svn: 287979	2016-11-26 15:15:11 +00:00
Rui Ueyama	72b1ee2533	Make getColorDiagnostics return a boolean value instead of an enum. Config->ColorDiagnostics was of type enum before. Now it is just a boolean flag. Thanks Rafael for suggestion. llvm-svn: 287978	2016-11-26 15:10:01 +00:00
Rui Ueyama	1880bbed39	Split MergeOutputSection::finalize. llvm-svn: 287977	2016-11-26 15:09:58 +00:00
Rafael Espindola	f93b8c29c8	Create sections with just assignments as STT_NOBITS. This matches the behaviour of bfd ld. Using 0 was causing problems with strip, which would remove these sections. llvm-svn: 287969	2016-11-26 06:55:35 +00:00
Davide Italiano	3bfa081aa9	[ELF] Be compliant with LLVM and rename Lto into LTO. NFCI. llvm-svn: 287967	2016-11-26 05:37:04 +00:00
Rui Ueyama	d873e3a694	Fix buildbots. llvm-svn: 287952	2016-11-25 20:42:39 +00:00
Rui Ueyama	1df9316922	Fix typo. llvm-svn: 287951	2016-11-25 20:41:45 +00:00
Rui Ueyama	c01321c6b8	Do not print out ARGV0 in white because it's unreadable on white background. llvm-svn: 287950	2016-11-25 20:37:16 +00:00
Rui Ueyama	8c8818a58c	Support -color-diagnostics={auto,always,never}. -color-diagnostics=auto is default because that's the same as Clang's default. When color is enabled, error or warning messages are colored like this. error: <bold>ld.lld</bold> <red>error:</red> foo.o: no such file warning: <bold>ld.lld</bold> <magenta>warning:</magenta> foo.o: no such file Differential Revision: https://reviews.llvm.org/D27117 llvm-svn: 287949	2016-11-25 20:27:32 +00:00
Rui Ueyama	6066641423	We shouldn't call parallle_for_each if -no-thread is given. llvm-svn: 287948	2016-11-25 20:20:57 +00:00
Rui Ueyama	2555952ba8	Parallelize uncompress() and splitIntoPieces(). Uncompressing section contents and spliting mergeable section contents into smaller chunks are heavy tasks. They scan entire section contents and do CPU-intensive tasks such as uncompressing zlib-compressed data or computing a hash value for each section piece. Luckily, these tasks are independent to each other, so we can do that in parallel_for_each. The number of input sections is large (as opposed to the number of output sections), so there's a large parallelism here. Actually the current design to call uncompress() and splitIntoPieces() in batch was chosen with doing this in mind. Basically what we need to do here is to replace `for` with `parallel_for_each`. It seems this patch improves latency significantly if linked programs contain debug info (which in turn contain lots of mergeable strings.) For example, the latency to link Clang (debug build) improved by 20% on my machine as shown below. Note that ld.gold took 19.2 seconds to do the same thing. Before: 30801.782712 task-clock (msec) # 3.652 CPUs utilized ( +- 2.59% ) 104,084 context-switches # 0.003 M/sec ( +- 1.02% ) 5,063 cpu-migrations # 0.164 K/sec ( +- 13.66% ) 2,528,130 page-faults # 0.082 M/sec ( +- 0.47% ) 85,317,809,130 cycles # 2.770 GHz ( +- 2.62% ) 67,352,463,373 stalled-cycles-frontend # 78.94% frontend cycles idle ( +- 3.06% ) <not supported> stalled-cycles-backend 44,295,945,493 instructions # 0.52 insns per cycle # 1.52 stalled cycles per insn ( +- 0.44% ) 8,572,384,877 branches # 278.308 M/sec ( +- 0.66% ) 141,806,726 branch-misses # 1.65% of all branches ( +- 0.13% ) 8.433424003 seconds time elapsed ( +- 1.20% ) After: 35523.764575 task-clock (msec) # 5.265 CPUs utilized ( +- 2.67% ) 159,107 context-switches # 0.004 M/sec ( +- 0.48% ) 8,123 cpu-migrations # 0.229 K/sec ( +- 23.34% ) 2,372,483 page-faults # 0.067 M/sec ( +- 0.36% ) 98,395,342,152 cycles # 2.770 GHz ( +- 2.62% ) 79,294,670,125 stalled-cycles-frontend # 80.59% frontend cycles idle ( +- 3.03% ) <not supported> stalled-cycles-backend 46,274,151,813 instructions # 0.47 insns per cycle # 1.71 stalled cycles per insn ( +- 0.47% ) 8,987,621,670 branches # 253.003 M/sec ( +- 0.60% ) 148,900,624 branch-misses # 1.66% of all branches ( +- 0.27% ) 6.747548004 seconds time elapsed ( +- 0.40% ) llvm-svn: 287946	2016-11-25 20:05:08 +00:00
Rui Ueyama	623b36e358	Move typedefs inside a class definition. llvm-svn: 287945	2016-11-25 18:51:56 +00:00
Rui Ueyama	22375f2406	Remove a parameter from ScriptParser. llvm-svn: 287944	2016-11-25 18:51:54 +00:00
Rui Ueyama	da06bfb794	Move getLocation from Relocations.cpp to InputSection.cpp. The function was used only within Relocations.cpp, but now we are using it in many places, so this patch moves it to a file that fits to the functionality. llvm-svn: 287943	2016-11-25 18:51:53 +00:00
Eugene Leviant	f04777527e	[ELF] Add explicit template instantiations for toString llvm-svn: 287938	2016-11-25 16:42:04 +00:00
Eugene Leviant	ab024a353f	[ELF] Refactor getDynRel to print error location Differential revision: https://reviews.llvm.org/D27055 llvm-svn: 287915	2016-11-25 08:56:36 +00:00
Eugene Leviant	c8c1b7bfae	[ELF] EhOutputSection improvements Differential revision: https://reviews.llvm.org/D27098 llvm-svn: 287914	2016-11-25 08:27:15 +00:00
George Rimar	11992c86d9	[ELF] - Add support for access to most of synthetic sections from linkerscript. This is important for cases like: .sdata : { *(.got.plt .got) ... } That was not supported before as there was no way to get access to synthetic sections from script. More details on review page. Differential revision: https://reviews.llvm.org/D27040 llvm-svn: 287913	2016-11-25 08:05:41 +00:00
Rui Ueyama	26081caf48	Use toString() to report incompatible files. llvm-svn: 287901	2016-11-24 20:59:44 +00:00
Rui Ueyama	d4c94d1899	Include a hint how to see all errors if error is truncated. This patch changes the error message from too many errors emitted, stopping now to too many errors emitted, stopping now (use -error-limit=0 to see all errors) Thanks for Sean for the suggestion! llvm-svn: 287900	2016-11-24 20:31:43 +00:00
Rui Ueyama	a3ac17372b	Define toString(const SymbolBody &) and remove maybeDemangle instead. Differential Revision: https://reviews.llvm.org/D27065 llvm-svn: 287899	2016-11-24 20:24:18 +00:00
Rafael Espindola	4862ae8cc5	Use a more explicit type for the sizeof. llvm-svn: 287895	2016-11-24 16:38:35 +00:00
Peter Smith	719eb8efa5	[ELF] Add terminating sentinel .ARM.exidx table entry The .ARM.exidx table has an entry for each function with the first entry giving the start address of the function, the table is sorted in ascending order of function address. Given a PC value, the unwinder will search the table for the entry that contains the PC value. If the table entry happens to be the last, the range of the addresses that the final unwinding table describes will extend to the end of the address space. To prevent an incorrect address outside the address range of the program matching the last entry we follow ld.bfd's example and add a sentinel EXIDX_CANTUNWIND entry at the end of the table. This gives the final real table entry an upper bound. In addition the llvm libunwind unwinder currently depends on the presence of a sentinel entry (PR31091). Differential revision: https://reviews.llvm.org/D26977 llvm-svn: 287869	2016-11-24 11:43:55 +00:00
George Rimar	066bf6e1b3	[ELF] - Removed unused method. NFC. llvm-svn: 287860	2016-11-24 09:42:10 +00:00
Rui Ueyama	3aaacb282f	Update comment. llvm-svn: 287850	2016-11-24 01:44:21 +00:00
Rui Ueyama	f373dd76ce	Remove HasError and use ErrorCount instead. HasError was always true if ErrorCount > 0, so we can use ErrorCount instead. llvm-svn: 287849	2016-11-24 01:43:21 +00:00
Rui Ueyama	bf9523f549	[COFF] Add DebugInfoCodeView dependency rL287555 introduces a link error when building with BUILD_SHARED_LIBS: undefined reference to llvm::codeview::CVSymbolDumper::dump(), and more... The functions are available in libDebugInfoCodeView, from LLVM. Patch by Visoiu Mistrih Francis! llvm-svn: 287837	2016-11-23 22:58:25 +00:00
Rui Ueyama	2eda6d1633	Set default entry point to .text if no entry point is found. Previously, if a symbol specified by -e or ENTRY() is not found, we didn't set entry point address. That is incompatible with GNU because GNU linkers set the first address of .text to entry. This patch implement that behavior. llvm-svn: 287836	2016-11-23 22:41:00 +00:00
Simon Atanasyan	8469b8841c	[ELF][MIPS] Fix handling of _gp/_gp_disp/__gnu_local_gp symbols Offset between beginning of a .got section and _gp symbols used in MIPS GOT relocations calculations. Usually the expression looks like VA + Offset - GP, where VA is the .got section address, Offset - offset of the GOT entry, GP - offset between .got and _gp. Also there two "magic" symbols _gp_disp and __gnu_local_gp which hold the offset mentioned above. These symbols might be referenced by MIPS relocations. Now the linker always defines _gp symbol and uses hardcoded value for its initialization. So offset between .got and _gp is 0x7ff0. The _gp_disp and __gnu_local_gp defined if required and initialized by 0x7ff0. In fact that is not correct because _gp symbol might be defined by a linker script and holds arbitrary value. In that case we need to use this value in relocation calculation and initialize _gp_disp and __gnu_local_gp properly. The patch fixes the problem and completes fixing the bug #30311. https://llvm.org/bugs/show_bug.cgi?id=30311 Differential revision: https://reviews.llvm.org/D27036 llvm-svn: 287832	2016-11-23 22:22:16 +00:00
Rui Ueyama	835bd72322	Remove trailing whitespace. llvm-svn: 287830	2016-11-23 22:10:46 +00:00
Rui Ueyama	d3a06ffb1c	Use llvm::utohexstr instead of Twine::utohexstr. They are essentially the same in this context, so I prefer the one that doesn't need `Twine::`. llvm-svn: 287814	2016-11-23 21:24:26 +00:00
Rafael Espindola	20b6d3d0d3	Fix this on 32 bit hosts. Looks like we have no 32 bit bot that builds with mips support. llvm-svn: 287799	2016-11-23 19:16:20 +00:00
Rui Ueyama	1b591cf3d3	Fix uninitialized variable access. llvm-svn: 287797	2016-11-23 19:03:35 +00:00
Rui Ueyama	0ede7f281e	Make log(), error() and fatal() thread-safe. llvm-svn: 287794	2016-11-23 18:34:28 +00:00
Rui Ueyama	ac95f6bfcc	Limit default maximum number of errors to 20. This is in the context of https://llvm.org/bugs/show_bug.cgi?id=31109. When LLD prints out errors for relocations, it tends to print out extremely large number of errors (like millions) because it would print out one error per relocation. This patch makes LLD bail out if it prints out more than 20 errors. You can configure the limitation using -error-limit argument. -error-limit=0 means no limit. I chose the flag name because Clang has the same feature as -ferror-limit. "f" doesn't make sense to us, so I omitted it. Differential Revision: https://reviews.llvm.org/D26981 llvm-svn: 287789	2016-11-23 18:15:37 +00:00
Rui Ueyama	28590b6118	Re-commit r287727: Use SHA1::hash and MD5::hash functions. r287727 was not a change that broke buildbots; the other change (r287726) that I made to LLVM broke them. llvm-svn: 287788	2016-11-23 18:11:38 +00:00
Rui Ueyama	3fc0f7e54f	Define toString() as a generic function to get a string for error message. We have different functions to stringize objects to construct error messages. For InputFile, we have getFilename, and for InputSection, we have getName. You had to memorize them. I think this is the case where the function overloading comes in handy. This patch defines toString() functions that are overloaded for all these types, so that you just call it in error(). Differential Revision: https://reviews.llvm.org/D27030 llvm-svn: 287787	2016-11-23 18:07:33 +00:00
Ed Maste	8fd0196c6f	lld: Default image base address to 0x200000 on x86-64 Align to the large page size (known as a superpage or huge page). FreeBSD automatically promotes large, superpage-aligned allocations. Differential Revision: https://reviews.llvm.org/D27042 llvm-svn: 287782	2016-11-23 17:44:02 +00:00
Ed Maste	8d0381d61f	Replace test instruction byte strings with {{.*}} An upcoming change to the image base address for x86-64 (D27042) will will change some addresses and hence the instruction encodings. We care about the disassembled instructions, not their encodings. Differential Revision: https://reviews.llvm.org/D27056 llvm-svn: 287778	2016-11-23 17:09:38 +00:00
Eugene Leviant	c3a44b2fbe	[ELF] Refactor several error messages Differential revision: https://reviews.llvm.org/D26970 llvm-svn: 287753	2016-11-23 10:07:46 +00:00
Eugene Leviant	3582ebf35e	[ELF] Fixup buffer pointer when writing synthetic sections Differential revision: https://reviews.llvm.org/D26980 llvm-svn: 287751	2016-11-23 09:47:38 +00:00
Eugene Leviant	531df4fcef	[ELF] Print error location in .eh_frame parser Differential revision: https://reviews.llvm.org/D26914 llvm-svn: 287750	2016-11-23 09:45:17 +00:00
Rui Ueyama	0cbf749397	Remove one of SymbolTable::addRegular function that forwards other addRegular. So that we have less number of overloaded functions. llvm-svn: 287745	2016-11-23 06:59:47 +00:00
Rui Ueyama	768c6f0ca6	Remove a forwarding constructor that is used only once. llvm-svn: 287742	2016-11-23 06:31:23 +00:00
Rui Ueyama	35fa6c58ad	Parse symbol versions in scanVersionScript() instead of insert(). There are two ways to set symbol versions. One way is to use symbol definition file, and the other is to embed version names to symbol names. In the latter way, symbol name is in the form of `foo@version1` where `foo` is a real name and `version1` is a version. We were parsing symbol names in insert(). That seems unnecessarily too early. We can do it later after we resolve all symbols. Doing it lazily is a good thing because it makes code easier to read (because now we have a separate pass to parse symbol names). Also it could slightly improve performance because if two identical symbols have versions, we now parse them only once. llvm-svn: 287741	2016-11-23 05:48:40 +00:00
Simon Atanasyan	399ac5c3fb	[ELF][MIPS] Turn Config->Threads off for MIPS targets For now MipsGotSection class is not ready for concurrent access from multiple threads. The problem is in the getPageEntryOffset method. It changes state of MipsGotSection object and might be called from different threads at the same time. So turn Threads off for this target. It's a temporary solution. The patch fixes MipsGotSection::getPageEntryOffset is almost ready. Differential revision: https://reviews.llvm.org/D27035 llvm-svn: 287740	2016-11-23 05:25:02 +00:00
Rui Ueyama	d14743e17f	Better formatting. If a line is too long, its error message becomes hard to read. llvm-svn: 287739	2016-11-23 05:14:01 +00:00
Simon Atanasyan	97deade619	[ELF][MIPS] clang-format the code llvm-svn: 287738	2016-11-23 05:09:36 +00:00
Rui Ueyama	c72ba3a4d7	Allow calling getName() on local symbols. Previously, we stored offsets in string tables to symbols, so you needed to pass a string table to get a symbol name. This patch stores const char pointers instead to eliminate the need to pass a string table. llvm-svn: 287737	2016-11-23 04:57:25 +00:00
Rui Ueyama	fd265a1f30	Revert r287727: Use SHA1::hash and MD5::hash functions. It broke buildbots. llvm-svn: 287730	2016-11-23 01:19:13 +00:00
Rui Ueyama	be6c2f1a36	Use SHA1::hash and MD5::hash functions. llvm-svn: 287727	2016-11-23 00:52:28 +00:00
Rui Ueyama	24625fd9b2	Dump not only type records but symbol records. llvm-svn: 287723	2016-11-22 23:51:34 +00:00
Rui Ueyama	3cc93d7a53	Fix memory leak detected by asan. llvm-svn: 287714	2016-11-22 23:13:08 +00:00
Rui Ueyama	838ff4d5c5	Accept -script=<file> in addition to -script <file>. Fixes PR31126. llvm-svn: 287711	2016-11-22 22:54:03 +00:00
Rafael Espindola	4e1a698be0	Fix test to not depend on the path size. llvm-svn: 287704	2016-11-22 21:37:38 +00:00
Rafael Espindola	3c97dc7055	move VerDef finalization before DynStrTab llvm-svn: 287701	2016-11-22 21:12:20 +00:00
Davide Italiano	f4de3b68bb	[LTO] Remove a check on datalayout. Now that lld switched to lib/LTO, which always calls setDataLayout(), we don't need this check anymore. Thanks to Peter for pointing out! llvm-svn: 287699	2016-11-22 20:37:37 +00:00
Rui Ueyama	8a44c94b8a	Remove '.' from a help message. llvm-svn: 287692	2016-11-22 20:15:35 +00:00
Rui Ueyama	bdfa155fa2	Fix build breakage. We cannot have MipsRldMap class and In<ELFT>::MipsRldMap. Renamed the class. llvm-svn: 287683	2016-11-22 19:24:52 +00:00
Meador Inge	b2d99d6a0f	[ELF] Allow `ASSERT` in output section descriptions GNU LD allows `ASSERT` commands to be in output section descriptions. Note that LD also mandates that `ASSERT` commands in this context must end with a semicolon. llvm-svn: 287677	2016-11-22 18:01:50 +00:00
Eugene Leviant	17b7a57533	[ELF] Convert .rld_map to input section Differential revision: https://reviews.llvm.org/D26958 llvm-svn: 287675	2016-11-22 17:49:14 +00:00
Rui Ueyama	81a4b26b48	Inline small function. NFC. llvm-svn: 287620	2016-11-22 04:33:01 +00:00
Rui Ueyama	1d75de00e8	Do not save unused pointers to In<ELFT>. llvm-svn: 287617	2016-11-22 04:28:39 +00:00
Rui Ueyama	8b493f1ac6	Remove default definition no one uses. llvm-svn: 287616	2016-11-22 04:17:12 +00:00
Rui Ueyama	9cfac8a849	Convert MipsOptionsSection to SyntheticSection. llvm-svn: 287615	2016-11-22 04:13:09 +00:00
Rui Ueyama	b71cae90de	Convert MipsReginfoSection to SyntheticSection. llvm-svn: 287614	2016-11-22 03:57:08 +00:00
Rui Ueyama	12f2da870e	Convert MipsAbiFlagsSection to SyntheticSection. llvm-svn: 287613	2016-11-22 03:57:06 +00:00
Rui Ueyama	bb536fee32	Remove redundant assignment. llvm-svn: 287607	2016-11-22 01:36:19 +00:00
Rui Ueyama	2d98fead25	Convert BuildId a derived class of SyntheticSection. Some synthetic sections are not derived calsses of SyntehticSection. They are derived directly from InputSection. For consistencly, we should use SyntheticSection. llvm-svn: 287606	2016-11-22 01:31:32 +00:00
Rui Ueyama	ee18d95764	Remove a parameter from getOutputLoc and rename for readability. NFC. llvm-svn: 287605	2016-11-22 01:10:34 +00:00
Rui Ueyama	c4030a1937	Merge BuildId subclasses. We had five different BuildId subclasses for five different types of build-ids. They can simply be merged to a single class. llvm-svn: 287603	2016-11-22 00:54:15 +00:00
Rui Ueyama	d30cf96203	Remove useless newlines. llvm-svn: 287595	2016-11-21 23:17:09 +00:00
Rafael Espindola	28d5f059ae	Use the correct page size. Config->MaxPageSize is what we use for the segment alignment, so that is the one that we have to use for placing the header. llvm-svn: 287569	2016-11-21 20:20:04 +00:00
Rafael Espindola	1c57007ec8	Fix address computation for headers. If the linker script has SECTIONS, the address computation is now always done in LinkerScript::assignAddresses, like for any other section. Before fixHeaders would do a tentative computation that assignAddresses would sometimes override. This patch also splits the cases where assignAddresses needs to add the headers to the first PT_LOAD and the address computation. The net effect is that we no longer create an empty page for no reason in the included test case, which matches bfd behavior. llvm-svn: 287565	2016-11-21 19:59:33 +00:00
Rui Ueyama	b38ddb15dd	Move a function definition to SyntheticSections.cpp. This should have been moved along with r287554. llvm-svn: 287564	2016-11-21 19:46:04 +00:00
Rui Ueyama	be939b3ff6	Do plumbing work for CodeView debug info. Previously, we discarded .debug$ sections. This patch adds them to files so that PDB.cpp can access them. This patch also adds a debug option, /dumppdb, to dump debug info fed to createPDB so that we can verify that valid data has been passed. llvm-svn: 287555	2016-11-21 17:22:35 +00:00
Eugene Leviant	e9bab5d857	[ELF] Convert Version*** sections to input sections Differential revision: https://reviews.llvm.org/D26918 llvm-svn: 287554	2016-11-21 16:59:33 +00:00
Eugene Leviant	952eb4d348	[ELF] Convert EhFrameHeader to input section Differential revision: https://reviews.llvm.org/D26906 llvm-svn: 287549	2016-11-21 15:52:10 +00:00
Eugene Leviant	03ff016666	[ELF] Better error reporting for linker scripts Differential revision: https://reviews.llvm.org/D26795 llvm-svn: 287547	2016-11-21 15:49:56 +00:00
Eugene Leviant	91972d7f9d	[ELF] Attempt to fix Windows buidbot llvm-svn: 287538	2016-11-21 13:57:50 +00:00
Rui Ueyama	e8785ba4d7	Change the way how we print out line numbers. LLD's error messages contain line numbers, function names or section names. Currently they are formatter as follows. foo.c (32): symbol 'foo' not found foo.c (function bar): symbol 'foo' not found foo.c (.text+0x1234): symbol 'foo' not found This patch changes them so that they are consistent with Clang's output. foo.c:32: symbol 'foo' not found foo.c:(function bar): symbol 'foo' not found foo.c:(.text+0x1234): symbol 'foo' not found Differential Revision: https://reviews.llvm.org/D26901 llvm-svn: 287537	2016-11-21 13:49:57 +00:00
Eugene Leviant	7d7ff80f4b	[ELF] Better error reporting for broken archives Differential revision: https://reviews.llvm.org/D26852 llvm-svn: 287527	2016-11-21 09:28:07 +00:00
Eugene Leviant	a113a4194c	[ELF] Convert GdbIndexSection to input section Differential revision: https://reviews.llvm.org/D26854 llvm-svn: 287526	2016-11-21 09:24:43 +00:00
Rui Ueyama	0b1b695a9e	Add comments. This patch rearranges code a bit to make it easy to explain. llvm-svn: 287515	2016-11-21 02:11:05 +00:00
Rui Ueyama	e0be2901cd	Simplify. NFC. llvm-svn: 287514	2016-11-21 02:10:12 +00:00
Rui Ueyama	60c4e36c7e	Simplify. NFC. llvm-svn: 287510	2016-11-20 23:15:56 +00:00
Rui Ueyama	7bed9eec36	Update comments. llvm-svn: 287509	2016-11-20 23:15:54 +00:00
Rui Ueyama	f94efdddc0	Add a flag to InputSectionBase for linker script. Previously, we set (uintptr_t)-1 to InputSectionBase::OutSec to record that a section has already been set to be assigned to some output section by linker scripts. Later, we restored nullptr to the pointer to use the field for the original purpose. That overloading is not very easy to understand. This patch adds a bit flag for that purpose, so that we don't need to piggyback the flag on an unrelated pointer. llvm-svn: 287508	2016-11-20 23:15:52 +00:00
Rui Ueyama	9f8cb730eb	Use auto for obvious types. llvm-svn: 287481	2016-11-20 02:43:44 +00:00
Rui Ueyama	bd1f0630a8	Do not expose ICF class from the file. Also this patch uses file-scope functions instead of class member function. Now that ICF class is not visible from outside, InputSection class can no longer be "friend" of it. So I removed the friend relation and just make it expose the features to public. llvm-svn: 287480	2016-11-20 02:39:59 +00:00
Rui Ueyama	f7dfb2e250	Remove a file that is too short to be an independent file. We have a .cpp and a .h for parseDynamicList(). This patch moves the function to DriverUtil.cpp. llvm-svn: 287468	2016-11-19 23:26:41 +00:00
Rui Ueyama	8f47556796	Remove unused #include. llvm-svn: 287467	2016-11-19 23:18:43 +00:00
Rui Ueyama	e2dfbc17c8	Refactor ICF. In order to use forEachGroup in the final loop in ICF::run, I changed some function parameter types. llvm-svn: 287466	2016-11-19 23:14:23 +00:00
Rui Ueyama	a05134e837	Use std::equal instead of hand-written loops. llvm-svn: 287460	2016-11-19 20:15:55 +00:00
Rui Ueyama	1cb63183af	Restore a comment that was accidentally changed. llvm-svn: 287457	2016-11-19 19:26:52 +00:00
Rui Ueyama	061f9286df	Use Optional<std::string> instead of "" to represent a failure. llvm-svn: 287456	2016-11-19 19:23:58 +00:00
Rui Ueyama	ec75220fe9	Make buildSysrootedPath file-scoped. This patch also changes its return type to simplify callers. llvm-svn: 287455	2016-11-19 19:23:56 +00:00
Rui Ueyama	844c1c908c	Simplify "missing argument" error message. We do not have an option taking more than one arguments, so we can just say "missing argument" instead of "missing argument(s)". llvm-svn: 287454	2016-11-19 18:49:38 +00:00
Rui Ueyama	98bdbdaedd	Split getFdeEncoding. llvm-svn: 287452	2016-11-19 18:44:09 +00:00
Rui Ueyama	a9b7514eb0	Fix typo in error message. llvm-svn: 287451	2016-11-19 18:34:55 +00:00
George Rimar	0a94bffe12	[ELF] - Exit on --version call. GNU linkers disagree here. Though both -version and -v are mentioned in help to print the version information, GNU ld just normally exits, while gold can continue linking. We are compatible with ld.bfd here. This fixes PR31057. Differential revision: https://reviews.llvm.org/D26865 llvm-svn: 287448	2016-11-19 18:14:24 +00:00
Rui Ueyama	6e68c5e5cf	Simplify. NFC. llvm-svn: 287446	2016-11-19 18:05:58 +00:00
Rui Ueyama	16068aeb58	Change filler type from ArrayRef<uint8_t> to uint32_t. Filler expressions in linker script "=fillexp" are always handled as 32-bit integers. Thus the new type is more natural. llvm-svn: 287445	2016-11-19 18:05:56 +00:00
Rui Ueyama	5c851a5a6d	Simplify. NFC. llvm-svn: 287372	2016-11-18 19:45:04 +00:00
Rafael Espindola	4d2d9c07a5	Simplify handling of SHF_LINK_ORDER. It seems a lot simpler to just sort the sections and let the relocations do the rest. llvm-svn: 287365	2016-11-18 19:02:15 +00:00
Eugene Leviant	ff23d3e741	[ELF] Convert PltSection to input section Differential revision: https://reviews.llvm.org/D26842 llvm-svn: 287346	2016-11-18 14:35:03 +00:00
Eugene Leviant	b96e809c7f	[ELF] Convert HashTableSection to input section Differential revision: https://reviews.llvm.org/D26834 llvm-svn: 287326	2016-11-18 09:06:47 +00:00
Rui Ueyama	f8f6f1e783	Update comment. llvm-svn: 287325	2016-11-18 07:03:56 +00:00
Rui Ueyama	009d174229	Omit empty parameter list. llvm-svn: 287324	2016-11-18 06:49:09 +00:00
Rui Ueyama	46247b85be	Use consume() instead of peek() and skip(). llvm-svn: 287323	2016-11-18 06:49:07 +00:00
Eugene Leviant	be809a7125	[ELF] Convert GnuHashTableSection to input section Differential revision: https://reviews.llvm.org/D26792 llvm-svn: 287322	2016-11-18 06:44:18 +00:00
Rui Ueyama	12450b20b4	Split ScriptParser::readVersionDeclaration. readVersionDeclaration was to read anonymous version definition and named version definition. Splitting it into two functions should improve readability as the two cases are different enough. I also changed a few helper functions to return values instead of mutating given references. llvm-svn: 287319	2016-11-18 06:30:09 +00:00
Rui Ueyama	8980c92dde	Use consistent variable name. llvm-svn: 287318	2016-11-18 06:30:08 +00:00
Rui Ueyama	77f2a87575	Simplify MergeOutputSection. MergeOutputSection class was a bit hard to use because it provdes a series of finalize functions that have to be called in a right way at a right time. It also intereacted with MergeInputSection, and the logic was somewhat entangled between the two classes. This patch simplifies it by providing only one finalize function. Now, all you have to do is to call MergeOutputSection::finalize when you have added all sections to the output section. Then, it internally merges strings and initliazes StringPiece objects. I think this is much easier to understand. This patch also adds comments. llvm-svn: 287314	2016-11-18 05:05:43 +00:00
Davide Italiano	2e8b2a70ab	[ELF] Rename an historical leftover, `Chunk` is now `InputSection`. llvm-svn: 287297	2016-11-18 02:23:48 +00:00
Davide Italiano	44665e7bc5	[ELF] Use std::for_each() and hoist common code in a lambda. llvm-svn: 287296	2016-11-18 02:18:04 +00:00
Rafael Espindola	25a41c1f63	Add missing REQUIRES. llvm-svn: 287284	2016-11-18 00:11:12 +00:00
Rafael Espindola	933fcab2ad	Always compute sh_link for SHF_LINK_ORDER sections. Since the output has a section table too, it is meaningful to compute the sh_link. In a more practical note, the binutils' strip crashes if sh_link is not set for SHT_ARM_EXIDX. llvm-svn: 287280	2016-11-17 23:16:39 +00:00
Simon Atanasyan	b8bfec686f	[ELF][MIPS] Remove 'mips' word from MipsGotSection fields and methods names. NFC Also add new comments with MIPS GOT description. llvm-svn: 287264	2016-11-17 21:49:14 +00:00
Rafael Espindola	dab02d4b68	Allow use define symbols to override linker defined ones. I hit an internal linker script that was defining _DYNAMIC instead of letting the linker do it. It turns out that both bfd and gold allow that. This is pretty easy to implement, just make the linker defined symbol weak. This should have no impact in the case where there is no user defined symbol: The visibility is hidden, which causes the output to still be local. llvm-svn: 287260	2016-11-17 21:20:16 +00:00
Rui Ueyama	edf75e7992	Allow SIZEOF() command on nonexistent section. Linker script doesn't create a section if it has no content. So the following script doesn't create .norelocs section if it doesn't have any .rel* sections. .norelocs : { (.rel) } Later, if you assert that the size of .norelocs is 0, LLD printed out an error message, because it didn't allow calling SIZEOF() on nonexistent sections. This patch allows SIZEOF() on nonexistent sections, so that you can do something like this. ASSERT(SIZEOF(.norelocs), "shouldn't contain .rel sections!") Note that this behavior is compatible with GNU. Differential Revision: https://reviews.llvm.org/D26810 llvm-svn: 287257	2016-11-17 20:27:10 +00:00
Rui Ueyama	d84124f043	Add single quotes to error messages. llvm-svn: 287254	2016-11-17 19:57:47 +00:00
Rui Ueyama	96db27c74f	Use consistent variable name. llvm-svn: 287253	2016-11-17 19:57:45 +00:00
Rui Ueyama	cd236a9577	Use llvm::reverse to get a reverse range. llvm-svn: 287252	2016-11-17 19:57:43 +00:00
Rui Ueyama	7ee38e00a5	Enable -threads by default. LLD supports multi-threading, and it seems to be working well as you can see in r287140. In short, LLD runs a few percent to 30% faster with -threads and more than 50% faster if you are using -build-id (your mileage may vary depending on your computer). However, I don't think most users even don't know about that because -threads is not a default option. This patch enables it by default. Discussion thread: http://lists.llvm.org/pipermail/llvm-dev/2016-November/107160.html llvm-svn: 287237	2016-11-17 17:06:51 +00:00
Rui Ueyama	bac1c3ce85	Pass StringRefs instead of StringMatcher because it's simpler. llvm-svn: 287234	2016-11-17 16:48:53 +00:00
Rafael Espindola	74fa2822f6	Simplify. NFC. llvm-svn: 287231	2016-11-17 15:29:11 +00:00
Rafael Espindola	d8b81d6663	Avoid accessing an end() iterator. llvm-svn: 287225	2016-11-17 14:18:08 +00:00
Eugene Leviant	4b84a9041a	[ELF] Remove unneeded forward declarations. NFC. llvm-svn: 287218	2016-11-17 10:34:05 +00:00
Eugene Leviant	9230db94b7	[ELF] Convert SymbolTableSection to input section Differential revision: https://reviews.llvm.org/D26740 llvm-svn: 287216	2016-11-17 09:16:34 +00:00
Chris Bieneman	aeb30257c5	[CMake] NFC. Updating CMake dependency specifications This patch updates a couple places where add_dependencies was being explicitly called to add dependencies on intrinsics_gen to instead use the DEPENDS named parameter. This cleanup is needed for a patch I'm working on to add a dependency debugging mode to the build system. llvm-svn: 287205	2016-11-17 04:36:35 +00:00
Rui Ueyama	729ac793a2	Rename a function so that that starts with a lowercase letter. llvm-svn: 287202	2016-11-17 04:10:09 +00:00
Rui Ueyama	0ee25a6973	Simplify and use consistent variable name. NFC. llvm-svn: 287200	2016-11-17 03:52:14 +00:00
Rui Ueyama	da805c4800	Use uint16_t instead of size_t for symbol version ID. Because it is uint16_t in the ELF spec. Using size_t was confusing. llvm-svn: 287198	2016-11-17 03:39:21 +00:00
Rui Ueyama	aade0e29ad	Add single quotes to a warning message for consistency. llvm-svn: 287197	2016-11-17 03:32:41 +00:00
Rui Ueyama	77d917de57	Simplify handleAnonymousVersion even more. We used to create a vector contantaining all version definitions with wildcards because doing that was efficient. All patterns were compiled to a regexp and matched against symbol names. Because a regexp can be converted to a DFA, matching against union of patterns is as cheap as matching against one patter. We are no longer converting them to regexp. Our own glob pattern handler doesn't do such optimization. Therefore, creating a vector no longer makes sense. llvm-svn: 287196	2016-11-17 03:19:34 +00:00
Rui Ueyama	4162baa4bb	Simplify. NFC. llvm-svn: 287192	2016-11-17 02:16:06 +00:00
Rui Ueyama	94bcfae26d	Split scanVersionScript. NFC. llvm-svn: 287191	2016-11-17 02:09:42 +00:00
Simon Atanasyan	725dc14bb2	[ELF][MIPS] Add MipsGotSection to handle MIPS GOT MIPS GOT handling is very different from other targets so it is better to keep the code in the separatre section class MipsGotSection. This patch introduces the new section and moves all MIPS specific code from GotSection to the new class. I did not rename fields and methods in the MipsGotSection class to reduce the diff and plan to do that by the separate commit. Differential revision: https://reviews.llvm.org/D26733 llvm-svn: 287150	2016-11-16 21:01:02 +00:00
Davide Italiano	c223d1bc6b	[ELF] Don't replace path separators on *NIX. Apparently this is wrong because it's legal to have a filename on UNIX which contains a backslash. Differential Revision: https://reviews.llvm.org/D26734 llvm-svn: 287143	2016-11-16 19:35:36 +00:00
Rui Ueyama	87ff6fef0f	Reduce number of tasks in parallel_for_each. TaskGroup has a fairly high overhead, so we don't want to partition tasks into too small tasks. This patch partition tasks into up to 1024 tasks. I compared this patch with the original LLD's parallel_for_each. I reverted r287042 locally for comparison. With this patch, time to self-link lld with debug info changed from 6.23 seconds to 4.62 seconds (-25.8%), with -threads and without -build-id. With both -threads and -build-id, it improved from 11.71 seconds to 4.94 seconds (-57.8%). Full results are below. BTW, GNU gold takes 11.65 seconds to link the same binary. NOW --no-threads --build-id=none 6789.847776 task-clock (msec) # 1.000 CPUs utilized ( +- 1.86% ) 685 context-switches # 0.101 K/sec ( +- 2.82% ) 4 cpu-migrations # 0.001 K/sec ( +- 31.18% ) 1,424,690 page-faults # 0.210 M/sec ( +- 1.07% ) 21,339,542,522 cycles # 3.143 GHz ( +- 1.49% ) 13,092,260,230 stalled-cycles-frontend # 61.35% frontend cycles idle ( +- 2.23% ) <not supported> stalled-cycles-backend 21,462,051,828 instructions # 1.01 insns per cycle # 0.61 stalled cycles per insn ( +- 0.41% ) 3,955,296,378 branches # 582.531 M/sec ( +- 0.39% ) 75,699,909 branch-misses # 1.91% of all branches ( +- 0.08% ) 6.787630744 seconds time elapsed ( +- 1.86% ) --threads --build-id=none 14767.148697 task-clock (msec) # 3.196 CPUs utilized ( +- 2.56% ) 28,891 context-switches # 0.002 M/sec ( +- 1.99% ) 905 cpu-migrations # 0.061 K/sec ( +- 5.49% ) 1,262,122 page-faults # 0.085 M/sec ( +- 1.68% ) 43,116,163,217 cycles # 2.920 GHz ( +- 3.07% ) 33,690,171,242 stalled-cycles-frontend # 78.14% frontend cycles idle ( +- 3.67% ) <not supported> stalled-cycles-backend 22,836,731,536 instructions # 0.53 insns per cycle # 1.48 stalled cycles per insn ( +- 1.13% ) 4,382,712,998 branches # 296.788 M/sec ( +- 1.33% ) 78,622,295 branch-misses # 1.79% of all branches ( +- 0.54% ) 4.621228056 seconds time elapsed ( +- 1.90% ) --threads --build-id=sha1 24594.457135 task-clock (msec) # 4.974 CPUs utilized ( +- 1.78% ) 29,902 context-switches # 0.001 M/sec ( +- 2.62% ) 1,097 cpu-migrations # 0.045 K/sec ( +- 6.29% ) 1,313,947 page-faults # 0.053 M/sec ( +- 2.36% ) 70,516,415,741 cycles # 2.867 GHz ( +- 0.78% ) 47,570,262,296 stalled-cycles-frontend # 67.46% frontend cycles idle ( +- 0.86% ) <not supported> stalled-cycles-backend 73,124,599,029 instructions # 1.04 insns per cycle # 0.65 stalled cycles per insn ( +- 0.33% ) 10,495,266,104 branches # 426.733 M/sec ( +- 0.41% ) 91,444,149 branch-misses # 0.87% of all branches ( +- 0.83% ) 4.944291711 seconds time elapsed ( +- 1.72% ) PREVIOUS --threads --build-id=none 7307.437544 task-clock (msec) # 1.160 CPUs utilized ( +- 2.34% ) 3,128 context-switches # 0.428 K/sec ( +- 4.37% ) 352 cpu-migrations # 0.048 K/sec ( +- 5.98% ) 1,354,450 page-faults # 0.185 M/sec ( +- 2.20% ) 22,081,733,098 cycles # 3.022 GHz ( +- 1.46% ) 13,709,991,267 stalled-cycles-frontend # 62.09% frontend cycles idle ( +- 1.77% ) <not supported> stalled-cycles-backend 21,634,468,895 instructions # 0.98 insns per cycle # 0.63 stalled cycles per insn ( +- 0.86% ) 3,993,062,361 branches # 546.438 M/sec ( +- 0.83% ) 76,188,819 branch-misses # 1.91% of all branches ( +- 0.19% ) 6.298101157 seconds time elapsed ( +- 2.03% ) --threads --build-id=sha1 12845.420265 task-clock (msec) # 1.097 CPUs utilized ( +- 1.95% ) 4,020 context-switches # 0.313 K/sec ( +- 2.89% ) 369 cpu-migrations # 0.029 K/sec ( +- 6.26% ) 1,464,822 page-faults # 0.114 M/sec ( +- 1.37% ) 40,668,449,813 cycles # 3.166 GHz ( +- 0.96% ) 18,863,982,388 stalled-cycles-frontend # 46.38% frontend cycles idle ( +- 1.82% ) <not supported> stalled-cycles-backend 71,560,499,058 instructions # 1.76 insns per cycle # 0.26 stalled cycles per insn ( +- 0.14% ) 10,044,152,441 branches # 781.925 M/sec ( +- 0.19% ) 87,835,773 branch-misses # 0.87% of all branches ( +- 0.09% ) 11.711773314 seconds time elapsed ( +- 1.51% ) llvm-svn: 287140	2016-11-16 19:27:33 +00:00
Rui Ueyama	28212d71f6	Export fewer functions from Error.h. Also add a comment saying that check() returns a value. llvm-svn: 287136	2016-11-16 18:54:37 +00:00
George Rimar	17c65af82f	[ELF] - Separate locals list from versions. This change separates all versioned locals to be a separate list in config, that was suggested by Rafael and simplifies the logic a bit. Differential revision: https://reviews.llvm.org/D26754 llvm-svn: 287132	2016-11-16 18:46:23 +00:00
Rafael Espindola	95eae57d78	Don't error if __tls_get_addr is defined. Turns out some systems do define it. Not producing an error in this case matches gold and bfd. llvm-svn: 287125	2016-11-16 18:01:41 +00:00
George Rimar	e0fc24210d	[ELF] - Added support for extern "c++" local symbols in version script. Previously we did not support them, patch implements this functionality Differential revision: https://reviews.llvm.org/D26604 llvm-svn: 287124	2016-11-16 17:59:10 +00:00
George Rimar	7759e5b61b	[ELF] - Change error message according to review comment. NFC. Forgot about that, I am sorry. llvm-svn: 287123	2016-11-16 17:45:45 +00:00
George Rimar	76f429b4db	[ELF] - Improve diagnostic messages. Particulaty "cannot preempt symbol" message is extended with locations now. Differential revision: https://reviews.llvm.org/D26738 llvm-svn: 287120	2016-11-16 17:24:06 +00:00
Rui Ueyama	cb87675186	Define -build-id=tree as a synonym for -build-id=sha1. Our build-id is a tree hash anyway, so I'll define this as a synonym for sha1. GNU gold takes this parameter, so this is for compatibility with that. llvm-svn: 287119	2016-11-16 17:14:11 +00:00
Eugene Leviant	a96d9027a3	[ELF] Convert RelocationSection to input section Differential revision: https://reviews.llvm.org/D26669 llvm-svn: 287092	2016-11-16 10:02:27 +00:00
Eugene Leviant	afaa934304	[ELF] Add Section() to expression object This allows making symbols containing ADDR(section) synthetic, and defining synthetic symbols outside SECTIONS block. Differential revision: https://reviews.llvm.org/D25441 llvm-svn: 287090	2016-11-16 09:49:39 +00:00
George Rimar	f3c143188d	[ELF] - Better diagnostic for "can't create dynamic relocation" error. Patch improves message to show locations for "can't create dynamic relocation" error. Differential revision: https://reviews.llvm.org/D26548 llvm-svn: 287086	2016-11-16 08:34:19 +00:00
Davide Italiano	197f07e650	[ELF] Update lld now that ELF.h in LLVM has been converted to Expected. llvm-svn: 287082	2016-11-16 05:11:30 +00:00
Rui Ueyama	23f441d3ac	Add -no-threads option that negates the effect of -threads. llvm-svn: 287072	2016-11-16 01:39:50 +00:00
Rui Ueyama	c91f716643	PDB: Add "* Linker *" module. The added module contains nothing, but it is still useful as a test to ensure that we are emitting modules that can be read back. llvm-svn: 287071	2016-11-16 01:10:46 +00:00
Rafael Espindola	e5cd5ecd2e	Use one task per iteration in parallel_for_loop. This seems far more natural. A user can create larger chunks if the overhead is too large. With this linking xul with "--threads --build-id=sha1 goes from 13.938177535 to 11.035953538 seconds on linux. llvm-svn: 287042	2016-11-15 22:13:16 +00:00

... 2 3 4 5 6 ...

7534 Commits