llvm-project

Commit Graph

Author	SHA1	Message	Date
Rafael Espindola	9f0c4bb795	Parse relocations only once. Relocations are the last thing that we wore storing a raw section pointer to and parsing on demand. With this patch we parse it only once and store a pointer to the actual data. The patch also changes where we store it. It is now in InputSectionBase. Not all sections have relocations, but most do and this simplifies the logic. It also means that we now only support one relocation section per section. Given that that constraint is maintained even with -r with gold bfd and lld, I think it is OK. llvm-svn: 286459	2016-11-10 14:53:24 +00:00
Rafael Espindola	8f9026baff	Don't add null and discarded sections to the global list. Avoids having to skip them multiple times. llvm-svn: 286261	2016-11-08 18:23:02 +00:00
Rui Ueyama	8c6a5aaf15	Create a vector containing all input sections. Previously, we do this piece of code to iterate over all input sections. for (elf::ObjectFile<ELFT> F : Symtab.getObjectFiles()) for (InputSectionBase<ELFT> S : F->getSections()) It turned out that this mechanisms doesn't work well with synthetic input sections because synthetic input sections don't belong to any input file. This patch defines a vector that contains all input sections including synthetic ones. llvm-svn: 286051	2016-11-05 22:37:59 +00:00
Rafael Espindola	e19abab9a7	Now that the ELFFile constructor does nothing, create it when needed. This avoids duplicating the buffer in InputFile. llvm-svn: 285965	2016-11-03 20:44:50 +00:00
Rafael Espindola	454fe154a7	Update for llvm change. llvm-svn: 285956	2016-11-03 19:07:44 +00:00
Rafael Espindola	1854a8ebb8	Delete trivial getters. NFC. llvm-svn: 285190	2016-10-26 12:36:56 +00:00
Rafael Espindola	58139d1758	Delete getSectionHdr. We were fairly inconsistent as to what information should be accessed with getSectionHdr and what information (like alignment) was stored elsewhere. Now all section info has a dedicated getter. The code is also a bit more compact. llvm-svn: 285079	2016-10-25 16:14:25 +00:00
George Rimar	a4c7e74d4b	[ELF] - Applied clang format. NFC. llvm-svn: 284705	2016-10-20 08:36:42 +00:00
Rui Ueyama	8da7aa0894	Allow linker-script-defined entry symbols. Previously, we were checking the existence of an entry symbol too early. It was done before the linker script processor creates symbols defined in scripts. Fixes bug 30743. llvm-svn: 284676	2016-10-20 00:07:36 +00:00
Rafael Espindola	116d83fbe0	Don't call markLiveAt for non alloc sections. We don't gc them anyway, so just use an early return in Enqueue. llvm-svn: 284663	2016-10-19 23:13:40 +00:00
Peter Smith	0760605ac5	[ELF][ARM] Garbage collection support for .ARM.exidx sections .ARM.exidx sections have a reverse dependency on the section they have a SHF_LINK_ORDER dependency on. In other words a .ARM.exidx section is live only if the executable section it describes is live. We implement this with a reverse dependency field in InputSection. Adding the dependency to InputSection is the simplest implementation but it could be moved out to a separate map if it were found to decrease performance for non ARM targets. Differential revision: https://reviews.llvm.org/D25234 llvm-svn: 283734	2016-10-10 10:10:27 +00:00
Rui Ueyama	a03380808c	Early continue. NFC. llvm-svn: 283382	2016-10-05 21:06:32 +00:00
Rui Ueyama	ad9268a7b4	Early return. NFC. llvm-svn: 283173	2016-10-04 00:46:36 +00:00
Eugene Leviant	d2778848ef	Don't GC non-alloc mergeable section pieces Differential revision: https://reviews.llvm.org/D25033 llvm-svn: 282708	2016-09-29 10:27:10 +00:00
Rafael Espindola	615c5ae7ae	Read addend from the correct location. llvm-svn: 282663	2016-09-29 01:20:40 +00:00
George Rimar	74661eb040	[ELF] - Fixed linkage error when using -g --gc-sections together. r282444 introduced new issue, sample program below fails to link on assert(Piece.Live); int main() { return 0; } clang test.cpp -c -o out.o -g ld.lld -flavor gnu --gc-sections out.o -o out Problem is that .debug_info contains relocations to .debug_str: Section (7) .rela.debug_info { .. 0xC R_X86_64_32 .debug_str 0x0 0x12 R_X86_64_32 .debug_str 0x37 .. But we do not preserve .debug_str in a right way now. To fix this we should ignore relocations from non-allocatable sections to allocatable to allow GC work at full power, but still should proccess relocations from non-allocatable to non-allocatable sections as usual to mark some parts of debug sections alive to keep them so we do not end up with such assert when trying to access dead pieces. That looks like what gold/ld do, they do not strip .debug_str section from what I saw using sample provided. Thanks to Evgeny Leviant for suggestions about how to fix this. Differential revision: https://reviews.llvm.org/D24967 llvm-svn: 282495	2016-09-27 15:55:29 +00:00
Rafael Espindola	2b9008e5e5	Non alloca sections should not keep other sections live. This matches the gold behaviour and is important to prevent debug info from effectively disabling gc. llvm-svn: 282444	2016-09-26 21:34:34 +00:00
Eugene Leviant	527a84ec12	Linkerscript: don't crash when GC .debug_line Differential revision: https://reviews.llvm.org/D24870 llvm-svn: 282393	2016-09-26 09:04:16 +00:00
Eugene Leviant	8287b1bce1	Linkerscript: do not GC non-allocated sections Differential revision: https://reviews.llvm.org/D24733 llvm-svn: 282391	2016-09-26 08:32:41 +00:00
Davide Italiano	49ab2517f5	[ELF/GC] Don't crash while processing Discarded sections. The ELF spec doesn't allow relocations to point directly to a deduplicated COMDAT section but this unfortunately happens in practice. Bail out early instead of crashing. Differential Revision: https://reviews.llvm.org/D24750 llvm-svn: 282197	2016-09-22 21:08:51 +00:00
Rui Ueyama	38dbd3eea9	Simplify InputFile ownership management. Previously, all input files were owned by the symbol table. Files were created at various places, such as the Driver, the lazy symbols, or the bitcode compiler, and the ownership of new files was transferred to the symbol table using std::unique_ptr. All input files were then free'd when the symbol table is freed which is on program exit. I think we don't have to transfer ownership just to free all instance at once on exit. In this patch, all instances are automatically collected to a vector and freed on exit. In this way, we no longer have to use std::unique_ptr. Differential Revision: https://reviews.llvm.org/D24493 llvm-svn: 281425	2016-09-14 00:05:51 +00:00
Rui Ueyama	3e9037d419	Inline small funciton. llvm-svn: 281381	2016-09-13 19:56:27 +00:00
Rafael Espindola	c7e1e03498	Store an ArrayRef for Data in InputSectionData. llvm-svn: 281210	2016-09-12 13:13:53 +00:00
Rafael Espindola	042a3f209b	Compute section names only once. This simplifies error handling as there is now only one place in the code that needs to consider the possibility that the name is corrupted. Before we would do it in every access. llvm-svn: 280937	2016-09-08 14:06:08 +00:00
Benjamin Kramer	df8f196f9a	Unpollute the global namespace. lld edition. llvm-svn: 277926	2016-08-06 13:52:37 +00:00
Rafael Espindola	2deeb6093d	Fix PR28575. Not all relocations from a .eh_frame that point to an executable section should be ignored. In particular, the relocation finding the personality function should not. This is a reduction from trying to bootstrap a static lld on linux. llvm-svn: 276329	2016-07-21 20:18:30 +00:00
Rafael Espindola	3fab868675	Pass section by reference. NFC. llvm-svn: 275803	2016-07-18 16:05:09 +00:00
Rui Ueyama	03e1cc6322	Remove duplicate declaration. llvm-svn: 274429	2016-07-02 08:50:05 +00:00
Rui Ueyama	0b9a90364b	Rename EHInputSection -> EhInputSection. llvm-svn: 270532	2016-05-24 04:19:20 +00:00
Rui Ueyama	b91bf1a9a0	Do not split mergeable sections if they are gc'ed. Previously, mergeable section's constructors did more than just setting member variables; it split section contents into small pieces. It is not always computationally cheap task because if the section is a mergeable string section, it needs to scan the entire section to split them by NUL characters. If a section would be thrown away by GC, that cost ended up being a waste of time. It is going to be larger problem if the section is compressed -- the whole time to uncompress it and split it up is going to be a waste. Luckily, we can defer section splitting after GC. We just have to remember which offsets are in use during GC and apply that later. This patch implements it. Differential Revision: http://reviews.llvm.org/D20516 llvm-svn: 270455	2016-05-23 16:55:43 +00:00
Rui Ueyama	90fa3722d2	Simplify SplitInputSection::getRangeAndSize. This patch adds Size member to SectionPiece so that getRangeAndSize can just return a SectionPiece instead of a std::pair<SectionPiece *, uint_t>. Also renamed the function. llvm-svn: 270346	2016-05-22 00:41:38 +00:00
Rui Ueyama	3ea8727188	Define SectionPiece and use it instead of std::pair<uint_t, uint_t>. We were using std::pair to represents pieces of splittable section contents. It hurt readability because "first" and "second" are not meaningful. This patch give them names. One more thing is that piecewise liveness information is stored to the second element of the pair as a special value of output section offset. It was confusing, so I defiend a new bit, "Live", in the new struct. llvm-svn: 270340	2016-05-22 00:13:04 +00:00
Rafael Espindola	d89fbca21d	Fix --gc-sections when .eh_frame has a lsda. We have to add sections to the work list, not just mark them live. llvm-svn: 268628	2016-05-05 13:51:14 +00:00
Peter Collingbourne	e29e142a10	ELF: Do not use -1 to mark pieces of merge sections as being tail merged. We were previously using an output offset of -1 for both GC'd and tail merged pieces. We need to distinguish these two cases in order to filter GC'd symbols from the symbol table -- we were previously asserting when we asked for the VA of a symbol pointing into a dead piece, which would end up asking the tail merging string table for an offset even though we hadn't initialized it properly. This patch fixes the bug by using an offset of -1 to exclusively mean GC'd pieces, using 0 for tail merges, and distinguishing the tail merge case from an offset of 0 by asking the output section whether it is tail merge. Differential Revision: http://reviews.llvm.org/D19953 llvm-svn: 268604	2016-05-05 04:10:12 +00:00
Rui Ueyama	4f8d21f387	Do not pass Symtab to markLive/doICF since Symtab is globally accessible. llvm-svn: 268286	2016-05-02 19:30:42 +00:00
George Rimar	af90d54660	[ELF] - keep alive all non-text sections referenced by .eh_frame Patch implements one of suggestions from Rafael Ávila de Espíndola, to fix segfault after section that contains personality being garbage collected. Suggestion was just to keep alive all non executable sections referenced by .eh_frame. This fixes PR27529. Differential revision: http://reviews.llvm.org/D19656 llvm-svn: 268228	2016-05-02 13:49:42 +00:00
Peter Collingbourne	4f9527065c	ELF: New symbol table design. This patch implements a new design for the symbol table that stores SymbolBodies within a memory region of the Symbol object. Symbols are mutated by constructing SymbolBodies in place over existing SymbolBodies, rather than by mutating pointers. As mentioned in the initial proposal [1], this memory layout helps reduce the cache miss rate by improving memory locality. Performance numbers: old(s) new(s) Without debug info: chrome 7.178 6.432 (-11.5%) LLVMgold.so 0.505 0.502 (-0.5%) clang 0.954 0.827 (-15.4%) llvm-as 0.052 0.045 (-15.5%) With debug info: scylla 5.695 5.613 (-1.5%) clang 14.396 14.143 (-1.8%) Performance counter results show that the fewer required indirections is indeed the cause of the improved performance. For example, when linking chrome, stalled cycles decreases from 14,556,444,002 to 12,959,238,310, and instructions per cycle increases from 0.78 to 0.83. We are also executing many fewer instructions (15,516,401,933 down to 15,002,434,310), probably because we spend less time allocating SymbolBodies. The new mechanism by which symbols are added to the symbol table is by calling add* functions on the SymbolTable. In this patch, I handle local symbols by storing them inside "unparented" SymbolBodies. This is suboptimal, but if we do want to try to avoid allocating these SymbolBodies, we can probably do that separately. I also removed a few members from the SymbolBody class that were only being used to pass information from the input file to the symbol table. This patch implements the new design for the ELF linker only. I intend to prepare a similar patch for the COFF linker. [1] http://lists.llvm.org/pipermail/llvm-dev/2016-April/098832.html Differential Revision: http://reviews.llvm.org/D19752 llvm-svn: 268178	2016-05-01 04:55:03 +00:00
Peter Collingbourne	676c7cd1ed	ELF: Move code to where it is used, and related cleanups. NFC. Differential Revision: http://reviews.llvm.org/D19490 llvm-svn: 267637	2016-04-26 23:52:44 +00:00
Rafael Espindola	1b65ad1725	Don't gc symbols that have to go in the dynamic symbol table. We were only doing it for .so and --export-dynamic, but those are not the only ways a symbol ends up in the dynamic symbol table. Problem diagnostic and earlier patch version by Peter Collingbourne. llvm-svn: 267568	2016-04-26 16:26:45 +00:00
Rafael Espindola	0b9531c8e6	Bring r267164 back with a fix. The fix is to handle local symbols referring to SHF_MERGE sections. Original message: GC entries of SHF_MERGE sections. It is a fairly direct extension of the gc algorithm. For merge sections instead of remembering just a live bit, we remember which offsets were used. This reduces the .rodata sections in chromium from 9648861 to `9477472` bytes. llvm-svn: 267233	2016-04-22 22:09:35 +00:00
Rafael Espindola	46c039f2c0	Revert "GC entries of SHF_MERGE sections." This reverts commit r267164. Revert "Trying to fix the windows build." This reverts commit r267168. Debugging a bootstrap problem. llvm-svn: 267194	2016-04-22 19:31:35 +00:00
Peter Collingbourne	dadcc17ead	ELF: Move Visibility, IsUsedInRegularObj and MustBeInDynSym flags to Symbol. These are properties of a symbol name, rather than a particular instance of a symbol in an object file. We can simplify the code by collecting these properties in Symbol. The MustBeInDynSym flag has been renamed ExportDynamic, as its semantics have been changed to be the same as those of --dynamic-list and --export-dynamic-symbol, which do not cause hidden symbols to be exported. Differential Revision: http://reviews.llvm.org/D19400 llvm-svn: 267183	2016-04-22 18:42:48 +00:00
Rafael Espindola	caa831d85a	GC entries of SHF_MERGE sections. It is a fairly direct extension of the gc algorithm. For merge sections instead of remembering just a live bit, we remember which offsets were used. This reduces the .rodata sections in chromium from 9648861 to `9477472` bytes. llvm-svn: 267164	2016-04-22 16:46:08 +00:00
Rafael Espindola	197d6a882f	This reverts commit r267154 and r267161. It turns out that this will read data from the section to properly handle Elf_Rel implicit addends. Sorry for the noise. Original messages: Try to fix Windows lld build. Move getRelocTarget to ObjectFile. It doesn't use anything from the InputSection. llvm-svn: 267163	2016-04-22 16:39:59 +00:00
Nico Weber	104871f433	Try to fix Windows lld build. llvm\tools\lld\ELF\MarkLive.cpp(49): error C2872: 'ObjectFile': ambiguous symbol llvm\tools\lld\elf\InputFiles.h(100): note: could be 'lld:🧝:ObjectFile' llvm\include\llvm/Object/IRObjectFile.h(26): note: or 'llvm::object::ObjectFile' llvm\tools\lld\ELF\MarkLive.cpp(133): note: see reference to function template instantiation 'void forEachSuccessor<ELFT>(lld:🧝:InputSection<ELFT> , std::function<void (lld:🧝:InputSectionBase<ELFT> )>)' being compiled with [ ELFT=llvm::object::ELF32LE ] llvm\tools\lld\ELF\MarkLive.cpp(136): note: see reference to function template instantiation 'void lld:🧝:markLive<llvm::object::ELF32LE>(lld:🧝:SymbolTable<llvm::object::ELF32LE> *) being compiled llvm-svn: 267161	2016-04-22 15:23:02 +00:00
Rafael Espindola	ea4d177977	Move getRelocTarget to ObjectFile. It doesn't use anything from the InputSection. llvm-svn: 267154	2016-04-22 14:17:14 +00:00
Rafael Espindola	5221516a04	Simplify. NFC. llvm-svn: 267147	2016-04-22 11:46:28 +00:00
Rafael Espindola	5be6064974	Don't gc protected symbols. llvm-svn: 267081	2016-04-21 23:59:19 +00:00
Rui Ueyama	07320e4030	ELF: Template LinkerScript class. Originally, linker scripts were basically an alternative way to specify options to the command line options. But as we add more features to hanlde symbols and sections, many member functions needed to be templated. Now most the members are templated. It is probably time to template the entire class. Previously, LinkerScript is an executor of the linker script as well as a storage of linker script configurations. This is not suitable to template the class because when we are reading linker script files, we don't know the ELF type yet, so we can't instantiate ELF-templated classes. In this patch, I defined a new class, ScriptConfiguration, to store linker script configurations. ScriptParser writes parse results to it, and LinkerScript uses them. Differential Revision: http://reviews.llvm.org/D19302 llvm-svn: 266908	2016-04-20 20:13:41 +00:00
Rafael Espindola	38c67a27fe	Store a Symbol for EntrySym. This makes it impossible to forget to call repl on the SymbolBody. llvm-svn: 266432	2016-04-15 14:41:56 +00:00

1 2

74 Commits