llvm-project

Commit Graph

Author	SHA1	Message	Date
Peter Collingbourne	ca8c994818	ELF: Compute used bit for --as-needed during symbol resolution. We can now use this to decide whether to emit a verneed during the final pass over the symbols. We were previously wrongly creating a verneed entry in the case where all references to a DSO's symbols were weak. In a future change we may also want to use the used bit to control whether shared symbols are preemptible and appear in the dynsym. This seems a little tricky to do at the moment because isNeeded() is templated. The only other functional change here is that we emit a DT_NEEDED for DSOs whose symbols are all preempted by objects that appear later in the link. But that doesn't seem too important to me. Differential Revision: http://reviews.llvm.org/D21171 llvm-svn: 272282	2016-06-09 18:01:35 +00:00
Rui Ueyama	406b469de4	Avoid doing binary search. MergedInputSection::getOffset is the busiest function in LLD if string merging is enabled and input files have lots of mergeable sections. It is usually the case when creating executable with debug info, so it is pretty common. The reason why it is slow is because it has to do faily complex computations. For non-mergeable sections, section contents are contiguous in output, so in order to compute an output offset, we only have to add the output section's base address to an input offset. But for mergeable strings, section contents are split for merging, so they are not contigous. We've got to do some lookups. We used to do binary search on the list of section pieces. It is slow because I think it's hostile to branch prediction. This patch replaces it with hash table lookup. Seems it's working pretty well. Below is "perf stat -r10" output when linking clang with debug info. In this case this patch speeds up about 4%. Before: 6584.153205 task-clock (msec) # 1.001 CPUs utilized ( +- 0.09% ) 238 context-switches # 0.036 K/sec ( +- 6.59% ) 0 cpu-migrations # 0.000 K/sec ( +- 50.92% ) 1,067,675 page-faults # 0.162 M/sec ( +- 0.15% ) 18,369,931,470 cycles # 2.790 GHz ( +- 0.09% ) 9,640,680,143 stalled-cycles-frontend # 52.48% frontend cycles idle ( +- 0.18% ) <not supported> stalled-cycles-backend 21,206,747,787 instructions # 1.15 insns per cycle # 0.45 stalled cycles per insn ( +- 0.04% ) 3,817,398,032 branches # 579.786 M/sec ( +- 0.04% ) 132,787,249 branch-misses # 3.48% of all branches ( +- 0.02% ) 6.579106511 seconds time elapsed ( +- 0.09% ) After: 6312.317533 task-clock (msec) # 1.001 CPUs utilized ( +- 0.19% ) 221 context-switches # 0.035 K/sec ( +- 4.11% ) 1 cpu-migrations # 0.000 K/sec ( +- 45.21% ) 1,280,775 page-faults # 0.203 M/sec ( +- 0.37% ) 17,611,539,150 cycles # 2.790 GHz ( +- 0.19% ) 10,285,148,569 stalled-cycles-frontend # 58.40% frontend cycles idle ( +- 0.30% ) <not supported> stalled-cycles-backend 18,794,779,900 instructions # 1.07 insns per cycle # 0.55 stalled cycles per insn ( +- 0.03% ) 3,287,450,865 branches # 520.799 M/sec ( +- 0.03% ) 72,259,605 branch-misses # 2.20% of all branches ( +- 0.01% ) 6.307411828 seconds time elapsed ( +- 0.19% ) Differential Revision: http://reviews.llvm.org/D20645 llvm-svn: 270999	2016-05-27 14:39:13 +00:00
Rui Ueyama	0fcdc730ad	Create Relocations.cpp and move scanRelocs there. scanReloc and the functions on which scanReloc depends is in total more than 600 lines of code. Since scanReloc does not depend on Writer, it is better to move it into a separate file. Differential Revision: http://reviews.llvm.org/D20554 llvm-svn: 270606	2016-05-24 20:24:43 +00:00
Rafael Espindola	fe3a2f1b81	Revert "Simplify. Thanks to Rui for the suggestion." This reverts commit r270551. Sorry, I commited the wrong branch :-( llvm-svn: 270554	2016-05-24 12:12:06 +00:00
Rafael Espindola	dba64b8ea4	Simplify. Thanks to Rui for the suggestion. llvm-svn: 270551	2016-05-24 11:53:15 +00:00
Rui Ueyama	ace4f90cf3	Do not pass the symbol table. NFC. Since the symbol table is a singleton class and globally accessible, we don't need to pass it around. llvm-svn: 270533	2016-05-24 04:25:47 +00:00
Rui Ueyama	0b9a90364b	Rename EHInputSection -> EhInputSection. llvm-svn: 270532	2016-05-24 04:19:20 +00:00
Rui Ueyama	022d8e8a86	Make scanReloc and related functions non-member functions. scanReloc does not depend on Writer, so it doesn't have to be in the class. llvm-svn: 270530	2016-05-24 03:36:07 +00:00
Rui Ueyama	afa35a2a37	Remove Writer::ensureBss(). Previously, we created a .bss section when needed. We had a function ensureBss() for that purpose. Turned out that was error-prone because it was easy to forget to call that function before accessing the .bss section. This patch always make the BSS section. The section is added to the output when it's not empty. llvm-svn: 270527	2016-05-24 03:16:51 +00:00
Rui Ueyama	98843087cb	Reject zero-sized symbols when creating copy relocations. Copy relocations are relocations to copy data from DSOs to executable's .bss segment at runtime. It doesn't make sense to create such relocations for zero-sized symbols. GNU linkers don't agree with each other. ld rejects such relocation/symbol pair. gold don't reject that but do not create copy relocations as well. I took the former approach because I don't think the latter is what user wants. llvm-svn: 270525	2016-05-24 02:37:40 +00:00
Rui Ueyama	8a6ef4e6b2	Remove dead code. Since now we always set SHT_PROGBITS to .eh_frame sections, this code path is not executed at runtime. llvm-svn: 270446	2016-05-23 16:24:22 +00:00
Rui Ueyama	3b31e6711b	Make .eh_frame a singleton output object. .eh_frame_hdr assumes that there is only one .eh_frame and ensures it by assertions. This patch makes .eh_frame a real singleton object to simplify. llvm-svn: 270445	2016-05-23 16:24:16 +00:00
Rui Ueyama	f86cb90a2d	Do not propagate section name and attributes to .eh_frame. .eh_frame is always ".eh_frame" and its attribute is fixed. No need to copy from inputs to outputs. GNU gold also sets SHT_PROGBITS. llvm-svn: 270443	2016-05-23 15:12:41 +00:00
Rui Ueyama	1e479c23aa	Rename EHOutputSection -> EhOutputSection for consistency. llvm-svn: 270442	2016-05-23 15:07:59 +00:00
Rui Ueyama	90fa3722d2	Simplify SplitInputSection::getRangeAndSize. This patch adds Size member to SectionPiece so that getRangeAndSize can just return a SectionPiece instead of a std::pair<SectionPiece *, uint_t>. Also renamed the function. llvm-svn: 270346	2016-05-22 00:41:38 +00:00
Rui Ueyama	3ea8727188	Define SectionPiece and use it instead of std::pair<uint_t, uint_t>. We were using std::pair to represents pieces of splittable section contents. It hurt readability because "first" and "second" are not meaningful. This patch give them names. One more thing is that piecewise liveness information is stored to the second element of the pair as a special value of output section offset. It was confusing, so I defiend a new bit, "Live", in the new struct. llvm-svn: 270340	2016-05-22 00:13:04 +00:00
Rafael Espindola	ebed1fe0de	Refactor R_RELAX_TLS_* value computation. This makes it explicit that each R_RELAX_TLS_* is equivalent to some other expression. With this I think we are at a sweet spot for how much is done in Target.cpp. I did experiment with moving all the value math out of it. It has the advantage that we know the final value in target independent code, but it gets quite verbose. llvm-svn: 270277	2016-05-20 21:23:52 +00:00
Rafael Espindola	6989ebf661	Simplify, NFC. llvm-svn: 269983	2016-05-18 21:05:18 +00:00
Rafael Espindola	e4c86d83fe	Drop vestigial support for UseLazyBinding=false. Lazy binding is quite important for use case like a shared build of llvm. Also, if someone wants to disable it, it is better done in the compiler (disable plt generation). The only reason to keep it is to make it easier to add a new architecture. But it doesn't really help much as it is possible to start with non lazy relocation and plt code but still let the generic part create a dedicated .got.plt and .rela.plt. llvm-svn: 269982	2016-05-18 21:03:36 +00:00
Simon Atanasyan	4e3a15c9f3	[ELF][MIPS] Rename R_MIPS_GOT_xxx relocation expression kinds New names reflect purpose of corresponding GOT entries better. Both expression types related to entries allocated in the 'local' part of MIPS GOT. R_MIPS_GOT_LOCAL_PAGE is for entries contain 'page' addresses. R_MIPS_GOT_LOCAL is for entries contain 'full' address. llvm-svn: 269597	2016-05-15 18:13:50 +00:00
Rui Ueyama	9194db78fb	Support --build-id=0x<hexstring>. If you specify the option in the form of --build-id=0x<hexstring>, that hexstring is set as a build ID. We observed that the feature is actually in use in some builds, so we want this feature. llvm-svn: 269495	2016-05-13 21:55:56 +00:00
Rafael Espindola	7229496787	When using Rela, don't write the addend to the output section. The Elf_Rela has an explicit addend. It doesn't need the addend to be written to the section being relocated. Since relative relocations are very common in the output, this is a noticeable speedup. The results I got were chromium master 4.778149487 patch 4.761120792 0.996436131802 chromium fast master 1.896253636 patch 1.840990582 0.970856718241 the gold plugin master 0.399337811 patch 0.392279276 0.982324401032 clang master 0.666873675 patch 0.665895708 0.998533504865 llvm-as master 0.037101095 patch 0.037123149 1.00059442989 the gold plugin fsds master 0.422473396 patch 0.414192879 0.980399909016 clang fsds master 0.747302008 patch 0.744843964 0.996710775599 llvm-as fsds master 0.033146245 patch 0.033064531 0.997534743377 scylla master 4.08857525 patch 4.082245184 0.998451767275 llvm-svn: 269417	2016-05-13 14:15:37 +00:00
Rafael Espindola	686ffc6f4c	Slit the relocation scan in two parts. The first part handles whatever has to be written to the r_offset position. The second part handles creating got and plt entries. llvm-svn: 269375	2016-05-12 22:51:22 +00:00
Rafael Espindola	203b0773a3	Move addend computation to a helper function. llvm-svn: 269369	2016-05-12 22:19:35 +00:00
Rafael Espindola	01f1636408	Handle thunks in adjustExpr. This is similar to the other changes this function does. With this all Relocations.push_back calls look similar. llvm-svn: 269362	2016-05-12 21:53:34 +00:00
Rafael Espindola	62cb02eef1	This reverts commit r269359 and r269360. I will commit again with a fixed commit message. llvm-svn: 269361	2016-05-12 21:51:16 +00:00
Rafael Espindola	cc42a90b76	Handle thunks in adjustExpr. This is similar to the other changes this function does. With this all Relocations.push_back calls look similar. llvm-svn: 269360	2016-05-12 21:47:26 +00:00
Rafael Espindola	01a94f8336	bra llvm-svn: 269359	2016-05-12 21:47:24 +00:00
George Rimar	fa91000290	[ELF] implemented -z defs option Just do not allow to link shared library if there are undefined symbols. This fixes PR27447 Differential revision: http://reviews.llvm.org/D20169 llvm-svn: 269183	2016-05-11 13:48:41 +00:00
George Rimar	c191acf097	[ELF] - Implemented -z combrelocs/nocombreloc. This is the option which sorts relocs to optimize dynamic linker performance. -z combelocs is the default in gold, also it ignores -z nocombreloc, this patch do the same. Patch sorts relocations by symbols only and do not create any DT_REL[A]COUNT entries. That is different with what gold/bfd do. More information about option is here: http://www.airs.com/blog/archives/186 http://people.redhat.com/jakub/prelink.pdf, p.2 Differential revision: http://reviews.llvm.org/D19528 llvm-svn: 269066	2016-05-10 15:47:57 +00:00
Rafael Espindola	78db5a9dca	Print member name in undefined symbol error. llvm-svn: 268976	2016-05-09 21:40:06 +00:00
Rafael Espindola	45a33fb799	Allow user defined __init_aray_start. Fixes pr27683. llvm-svn: 268926	2016-05-09 15:25:54 +00:00
Simon Atanasyan	9ac819860f	[ELF][MIPS] Reduce all MIPS R_GOTREL addends by MipsGPOffset in the single place. NFC llvm-svn: 268742	2016-05-06 15:02:50 +00:00
Simon Atanasyan	1a728fdf5c	[ELF][MIPS] Simplify `if` condition. NFC In case of MIPS ABI relocation has R_GOTREL expression's type iif the relocation type is either R_MIPS_GPREL16 or R_MIPS_GPREL32. So it is enough to check expression's type only. llvm-svn: 268741	2016-05-06 15:02:45 +00:00
Rafael Espindola	d39dadeb64	Don't produce a relocation to read only memory. This is hopefully last case where we would produce a relocation to a read only section. llvm-svn: 268688	2016-05-05 21:19:38 +00:00
Rafael Espindola	66434562e7	Fix copy relocations in pie. We were creating the copy relocations just fine, but then thinking that the .bss position could be preempted and creating a dynamic relocation to it, which would crash at runtime since that memory is read only. llvm-svn: 268668	2016-05-05 19:41:49 +00:00
Peter Collingbourne	3ad1c1e242	ELF: Undefine all symbols, not just those that we expect to be defined. This allows the combined LTO object to provide a definition with the same name as a symbol that was internalized without causing a duplicate symbol error. This normally happens during parallel codegen which externalizes originally-internal symbols, for example. In order to make this work, I needed to relax the undefined symbol error to only report an error for symbols that are used in regular objects. Differential Revision: http://reviews.llvm.org/D19954 llvm-svn: 268649	2016-05-05 17:13:49 +00:00
Rafael Espindola	474eb019b4	Move static function to avoid forward declaration. NFC. llvm-svn: 268646	2016-05-05 16:40:28 +00:00
Rafael Espindola	462220de47	Reuse logic for deciding whether to keep a local symbol or not. llvm-svn: 268644	2016-05-05 16:38:46 +00:00
Peter Collingbourne	e29e142a10	ELF: Do not use -1 to mark pieces of merge sections as being tail merged. We were previously using an output offset of -1 for both GC'd and tail merged pieces. We need to distinguish these two cases in order to filter GC'd symbols from the symbol table -- we were previously asserting when we asked for the VA of a symbol pointing into a dead piece, which would end up asking the tail merging string table for an offset even though we hadn't initialized it properly. This patch fixes the bug by using an offset of -1 to exclusively mean GC'd pieces, using 0 for tail merges, and distinguishing the tail merge case from an offset of 0 by asking the output section whether it is tail merge. Differential Revision: http://reviews.llvm.org/D19953 llvm-svn: 268604	2016-05-05 04:10:12 +00:00
Rafael Espindola	de17d28a32	Don't produce relative relocs to ro segments. We were already checking for non relative relocations. If we ever decide to add support for rw text segments this means we will have a single spot to add the flag. llvm-svn: 268558	2016-05-04 21:40:07 +00:00
Rafael Espindola	3fa5bbd91b	Rename isRelRelative. What it is computing is if we need a dynamic relocation or not. llvm-svn: 268556	2016-05-04 21:28:56 +00:00
Rafael Espindola	946ca27b61	Use early return. NFC. llvm-svn: 268554	2016-05-04 21:09:24 +00:00
Rafael Espindola	38bd217d0c	Delete getTlsGotRel. It was an old hack to avoid duplicating expression computation, but that is not needed with getExprRel. llvm-svn: 268515	2016-05-04 15:51:23 +00:00
Rafael Espindola	ebb04b9eb6	Simplify handling of hint relocations. llvm-svn: 268501	2016-05-04 14:44:22 +00:00
Simon Atanasyan	add74f37f2	[ELF][MIPS] Read/write .MIPS.options section MIPS N64 ABI introduces .MIPS.options section which specifies miscellaneous options to be applied to an object/shared/executable file. LLVM as well as modern versions of GNU tools read and write the only type of the options - ODK_REGINFO. It is exact copy of .reginfo section used by O32 ABI. llvm-svn: 268485	2016-05-04 10:07:38 +00:00
Peter Collingbourne	6f535b744f	Check return value of addOptionalSynthetic before calling a member function on it. Found with UBSan. llvm-svn: 268410	2016-05-03 18:03:45 +00:00
Peter Collingbourne	c357278a38	ELF: Remove the function SymbolTable<ELFT>::findFile. We already have the function SymbolBody::getSourceFile which does the same thing. llvm-svn: 268353	2016-05-03 01:48:25 +00:00
Peter Collingbourne	6a4225962d	ELF: Forbid all relative relocations to absolute symbols in PIC, except for weak undefined. Weak undefined symbols resolve to the image base. This is a little strange, but it allows us to link function calls to such symbols. Normally such a call will be guarded with a comparison, which will load a zero from the GOT. There's one example of such a function call in crti.o in Linux's CRT. As part of this change, I also needed to make the synthetic start and end symbols image base relative in the case where their sections were empty, so that PC-relative references to those symbols would continue to work. Differential Revision: http://reviews.llvm.org/D19844 llvm-svn: 268350	2016-05-03 01:21:08 +00:00
Rui Ueyama	dd368fcb05	Pass all buffers to BuildId hash function at once. NFC. This change simplifies the BuildId classes by removing a few member functions and variables from them. It should also make it easy to parallelize hash computation in future because now each BuildId object see all inputs rather than one at a time. llvm-svn: 268333	2016-05-02 23:35:59 +00:00

1 2 3 4 5 ...

668 Commits