llvm-project

Commit Graph

Author	SHA1	Message	Date
Peter Collingbourne	6a4225962d	ELF: Forbid all relative relocations to absolute symbols in PIC, except for weak undefined. Weak undefined symbols resolve to the image base. This is a little strange, but it allows us to link function calls to such symbols. Normally such a call will be guarded with a comparison, which will load a zero from the GOT. There's one example of such a function call in crti.o in Linux's CRT. As part of this change, I also needed to make the synthetic start and end symbols image base relative in the case where their sections were empty, so that PC-relative references to those symbols would continue to work. Differential Revision: http://reviews.llvm.org/D19844 llvm-svn: 268350	2016-05-03 01:21:08 +00:00
Rui Ueyama	6d0cd2b62b	Teach Undefined symbols from which file they are created from. This patch increases the size of Undefined by the size of a pointer, but it wouldn't actually increase the size of memory that LLD uses because we are not allocating the exact size but the size of the largest SymbolBody. llvm-svn: 268310	2016-05-02 21:30:42 +00:00
Peter Collingbourne	4f9527065c	ELF: New symbol table design. This patch implements a new design for the symbol table that stores SymbolBodies within a memory region of the Symbol object. Symbols are mutated by constructing SymbolBodies in place over existing SymbolBodies, rather than by mutating pointers. As mentioned in the initial proposal [1], this memory layout helps reduce the cache miss rate by improving memory locality. Performance numbers: old(s) new(s) Without debug info: chrome 7.178 6.432 (-11.5%) LLVMgold.so 0.505 0.502 (-0.5%) clang 0.954 0.827 (-15.4%) llvm-as 0.052 0.045 (-15.5%) With debug info: scylla 5.695 5.613 (-1.5%) clang 14.396 14.143 (-1.8%) Performance counter results show that the fewer required indirections is indeed the cause of the improved performance. For example, when linking chrome, stalled cycles decreases from 14,556,444,002 to 12,959,238,310, and instructions per cycle increases from 0.78 to 0.83. We are also executing many fewer instructions (15,516,401,933 down to 15,002,434,310), probably because we spend less time allocating SymbolBodies. The new mechanism by which symbols are added to the symbol table is by calling add* functions on the SymbolTable. In this patch, I handle local symbols by storing them inside "unparented" SymbolBodies. This is suboptimal, but if we do want to try to avoid allocating these SymbolBodies, we can probably do that separately. I also removed a few members from the SymbolBody class that were only being used to pass information from the input file to the symbol table. This patch implements the new design for the ELF linker only. I intend to prepare a similar patch for the COFF linker. [1] http://lists.llvm.org/pipermail/llvm-dev/2016-April/098832.html Differential Revision: http://reviews.llvm.org/D19752 llvm-svn: 268178	2016-05-01 04:55:03 +00:00
Rui Ueyama	62ee16faa8	Remove Size from Undefined symbol. There seems to be no reason to keep st_size of undefined symbols. This patch removes the member for it. This patch will change outputs in cases that undefined symbols are copied to output, but I think this is unimportant. Differential Revision: http://reviews.llvm.org/D19574 llvm-svn: 267826	2016-04-28 00:26:54 +00:00
Peter Collingbourne	60976ed7c0	ELF: Merge UndefinedBitcode and UndefinedElf. NFC. Differential Revision: http://reviews.llvm.org/D19566 llvm-svn: 267640	2016-04-27 00:05:06 +00:00
Peter Collingbourne	892d498017	ELF: Re-implement -u directly and remove CanKeepUndefined flag. The semantics of the -u flag are to load the lazy symbol named by the flag. We were previously relying on this behavior falling out of symbol resolution against a synthetic undefined symbol, but that didn't quite give us the correct behavior, so we needed a flag to mark symbols created with -u so we could treat them specially in the writer. However, it's simpler and less error prone to implement the required behavior directly and remove the flag. This fixes an issue where symbols loaded with -u would receive hidden visibility even when the definition in an object file had wider visibility. Differential Revision: http://reviews.llvm.org/D19560 llvm-svn: 267639	2016-04-27 00:05:03 +00:00
Peter Collingbourne	dbe4187d11	ELF: Simplify preemption logic. Do not include weak undefined symbols in non-DSOs. Add a test for -Bsymbolic + undefined symbols. llvm-svn: 267323	2016-04-24 04:29:59 +00:00
Peter Collingbourne	d869a040ee	ELF: Always include undefined DSO symbols in the symbol table. Fixes check-llvm when bootstrapping. Also remove mostly dead and most likely incorrect logic regarding preemption of weak undefined symbols. llvm-svn: 267314	2016-04-24 02:31:02 +00:00
Peter Collingbourne	66ac1d6152	ELF: Implement basic support for --version-script. This patch only implements support for version scripts of the form: { [ global: symbol1; symbol2; [...]; symbolN; ] local: *; }; No wildcards are supported, other than for the local entry. Symbol versioning is also not supported. It works by introducing a new Symbol flag which tracks whether a symbol appears in the global section of a version script. This patch also simplifies the logic in SymbolBody::isPreemptible(), and teaches it to handle the case where symbols with default visibility in DSOs do not appear in the dynamic symbol table because of a version script. Fixes PR27482. Differential Revision: http://reviews.llvm.org/D19430 llvm-svn: 267208	2016-04-22 20:21:26 +00:00
Rui Ueyama	8bf71066c5	Inline SymbolTable::compareCommons and add comments. NFC. llvm-svn: 267195	2016-04-22 19:34:59 +00:00
Peter Collingbourne	dadcc17ead	ELF: Move Visibility, IsUsedInRegularObj and MustBeInDynSym flags to Symbol. These are properties of a symbol name, rather than a particular instance of a symbol in an object file. We can simplify the code by collecting these properties in Symbol. The MustBeInDynSym flag has been renamed ExportDynamic, as its semantics have been changed to be the same as those of --dynamic-list and --export-dynamic-symbol, which do not cause hidden symbols to be exported. Differential Revision: http://reviews.llvm.org/D19400 llvm-svn: 267183	2016-04-22 18:42:48 +00:00
Rafael Espindola	4d480ed545	Internalize linkonce_odr more often. Since there is a copy in every translation unit that uses them, they can be omitted from the symbol table if the address is not significant. This still doesn't catch as many cases as the gold plugin. The difference is that we check canBeOmittedFromSymbolTable in each file and use lazy loading which limits what it can do. Gold checks it in the merged file. I think the correct way of getting the same results as gold is just to cache in the IR the result of canBeOmittedFromSymbolTable. llvm-svn: 267063	2016-04-21 21:44:25 +00:00
Rafael Espindola	ae605c1b0c	Start adding support for internalizing shared libraries. llvm-svn: 267045	2016-04-21 20:35:25 +00:00
Rafael Espindola	3666025880	Two small related fixes. * A hidden undefined is not preemptable. * It is always zero, so we don't need a dynamic reloc for it. llvm-svn: 266424	2016-04-15 11:57:07 +00:00
Rafael Espindola	f9d3dcf0a8	Don't set MustBeInDynSym for hidden symbols. llvm-svn: 266230	2016-04-13 19:03:34 +00:00
Peter Collingbourne	f6e9b4ec24	ELF: Use hidden visibility for all DefinedSynthetic symbols. This simplifies the code by allowing us to remove the visibility argument to functions that create synthetic symbols. The only functional change is that the visibility of the MIPS "_gp" symbol is now hidden. Because this symbol is defined in every executable or DSO, it would be difficult to observe a visibility change here. Differential Revision: http://reviews.llvm.org/D19033 llvm-svn: 266208	2016-04-13 16:57:28 +00:00
Rafael Espindola	8caf33c483	Cleanup the handling of MustBeInDynSym and IsUsedInRegularObj. Now MustBeInDynSym is only true if the symbol really must be in the dynamic symbol table. IsUsedInRegularObj is only true if the symbol is used in a .o or -u. Not a .so or a .bc. A benefit is that this is now done almost entirilly during symbol resolution. The only exception is copy relocations because of aliases. This includes a small fix in that protected symbols in .so don't force executable symbols to be exported. This also opens the way for implementing internalize for -shared. llvm-svn: 265826	2016-04-08 18:39:03 +00:00
Rafael Espindola	a15fb15b05	Don't lower the visibility because of shared symbols. If a shared library has a protected symbol 'foo', that doesn't imply that the symbol 'foo' in the output should be protected or not. llvm-svn: 265794	2016-04-08 16:11:42 +00:00
Rui Ueyama	f8baa66056	ELF: Implement --start-lib and --end-lib start-lib and end-lib are options to link object files in the same semantics as archive files. If an object is in start-lib and end-lib, the object is linked only when the file is needed to resolve undefined symbols. That means, if an object is in start-lib and end-lib, it behaves as if it were in an archive file. In this patch, I introduced a new notion, LazyObjectFile. That is analogous to Archive file type, but that works for a single object file instead of for an archive file. http://reviews.llvm.org/D18814 llvm-svn: 265710	2016-04-07 19:24:51 +00:00
Rafael Espindola	74031ba1e9	Simplify dynamic relocation creation. The position of a relocation can always be expressed as an offset in an output section. llvm-svn: 265682	2016-04-07 15:20:56 +00:00
Rafael Espindola	5e34568f79	Use a bit in SymbolBody to store CanKeepUndefined. UndefinedElf for 64 bits goes from 72 to 64 bytes. llvm-svn: 265543	2016-04-06 14:31:03 +00:00
Rafael Espindola	f47657301b	Change the type hierarchy for undefined symbols. We have to differentiate undefined symbols from bitcode and undefined symbols from other sources. Undefined symbols from bitcode should not inhibit the symbol being internalized. Undefined symbols from other sources should. llvm-svn: 265536	2016-04-06 13:22:41 +00:00
Rafael Espindola	242ffa8da1	Fix use of uninitialized. The names of undefined locals are not used, so I don't think it is possible to actually test this. llvm-svn: 265534	2016-04-06 12:19:25 +00:00
Rafael Espindola	f9b79a479e	Rename a few Visibility arguments to StOther. llvm-svn: 265533	2016-04-06 12:14:31 +00:00
Rafael Espindola	d9a1717efc	Remove redundant argument. NFC. llvm-svn: 265386	2016-04-05 11:47:46 +00:00
Peter Collingbourne	d0856a6bb2	ELF: Make SymbolBody::compare a non-template function. Differential Revision: http://reviews.llvm.org/D18781 llvm-svn: 265372	2016-04-05 00:47:58 +00:00
Peter Collingbourne	e8afa4971c	ELF: Preserve MustBeInDynSym for bitcode symbols. Make sure to copy the MustBeInDynSym field when replacing shared symbols with bitcode symbols, and when replacing bitcode symbols with regular symbols in addCombinedLtoObject. Fixes interposition of DSO symbols with bitcode symbols in the main executable. Differential Revision: http://reviews.llvm.org/D18780 llvm-svn: 265371	2016-04-05 00:47:55 +00:00
Rui Ueyama	b5792b231b	Rename Other -> StOther. "Other" as a name is too generic, so name it StOther. llvm-svn: 265332	2016-04-04 19:09:08 +00:00
Rafael Espindola	ccfe3cb3d6	Don't store an Elf_Sym for most symbols. Our symbol representation was redundant, and some times would get out of sync. It had an Elf_Sym, but some fields were copied to SymbolBody. Different parts of the code were checking the bits in SymbolBody and others were checking Elf_Sym. There are two general approaches to fix this: * Copy the required information and don't store and Elf_Sym. * Don't copy the information and always use the Elf_Smy. The second way sounds tempting, but has a big problem: we would have to template SymbolBody. I started doing it, but it requires templeting everything and creates a bit chicken and egg problem at the driver where we have to find ELFT before we can create an ArchiveFile for example. As much as possible I compared the test differences with what gold and bfd produce to make sure they are still valid. In most cases we are just adding hidden visibility to a local symbol, which is harmless. In most tests this is a small speedup. The only slowdown was scylla (1.006X). The largest speedup was clang with no --build-id, -O3 or --gc-sections (i.e.: focus on the relocations): 1.019X. llvm-svn: 265293	2016-04-04 14:04:16 +00:00
Rui Ueyama	bfc1d9d976	Remove DefinedElf class. DefinedElf was a superclass of DefinedRegular and SharedSymbol classes and represented the notion of defined symbols created for ELF symbols. It turned out that we didn't use that class often. We had only two occurrences of dyn_cast'ing to DefinedElf, and both were easily rewritten without it. The class was also a bit confusing. The concept of "created for ELF symbol" is orthogonal to defined/undefined types. However, we had two distinct classes, DefinedElf and UndefinedElf. This patch simply removes the class. Now the class hierarchy is one level shallower. llvm-svn: 265234	2016-04-02 18:06:18 +00:00
Simon Atanasyan	13f6da1d2c	[ELF] Implement infrastructure for thunk code creation Some targets might require creation of thunks. For example, MIPS targets require stubs to call PIC code from non-PIC one. The patch implements infrastructure for thunk code creation and provides support for MIPS LA25 stubs. Any MIPS PIC code function is invoked with its address in register $t9. So if we have a branch instruction from non-PIC code to the PIC one we cannot make the jump directly and need to create a small stub to save the target function address. See page 3-38 ftp://www.linux-mips.org/pub/linux/mips/doc/ABI/mipsabi.pdf - In relocation scanning phase we ask target about thunk creation necessity by calling `TagetInfo::needsThunk` method. The `InputSection` class maintains list of Symbols requires thunk creation. - Reassigning offsets performed for each input sections after relocation scanning complete because position of each section might change due thunk creation. - The patch introduces new dedicated value for DefinedSynthetic symbols DefinedSynthetic::SectionEnd. Synthetic symbol with that value always points to the end of the corresponding output section. That allows to escape updating synthetic symbols if output sections sizes changes after relocation scanning due thunk creation. - In the `InputSection::writeTo` method we write thunks after corresponding input section. Each thunk is written by calling `TargetInfo::writeThunk` method. - The patch supports the only type of thunk code for each target. For now, it is enough. Differential Revision: http://reviews.llvm.org/D17934 llvm-svn: 265059	2016-03-31 21:26:23 +00:00
Davide Italiano	f6523aecd7	Revert r264961. I didn't have asserts enable when testing. llvm-svn: 264692	2016-03-29 02:20:10 +00:00
Davide Italiano	a50e0b97f1	[LTO] Include bitcode symbol name in unreachable messages. llvm-svn: 264691	2016-03-29 01:40:07 +00:00
Rafael Espindola	5432287bad	Make needsPlt a plain function instead of a template. llvm-svn: 264267	2016-03-24 12:55:27 +00:00
Davide Italiano	901de03fe2	[ELF] Simplify code a bit. No functional change. llvm-svn: 263999	2016-03-21 22:44:24 +00:00
Rafael Espindola	8381c565c3	Make evaluation order explicit. llvm-svn: 263762	2016-03-17 23:36:19 +00:00
Rui Ueyama	9328b2cdde	Use ELFT instead of ELFFile<ELFT>. llvm-svn: 263510	2016-03-14 23:16:09 +00:00
George Rimar	343580097d	[ELF] implement --warn-common/--no-warn-common -warn-common Warn when a common symbol is combined with another common symbol or with a symbol definition. Unix linkers allow this somewhat sloppy practice, but linkers on some other operating systems do not. This option allows you to find potential problems from combining global symbols. Differential revision: http://reviews.llvm.org/D17998 llvm-svn: 263413	2016-03-14 09:19:30 +00:00
Rui Ueyama	2df72898c8	Remove `else` after `return`. llvm-svn: 263392	2016-03-13 20:54:38 +00:00
Rui Ueyama	c4466605d8	ELF: Redefine canBeDefined as a member function of SymbolBody. We want to make SymbolBody the central place to query symbol information. This patch also renames canBePreempted to isPreemptible because I feel that the latter is slightly better (the former is three words and the latter is two words.) llvm-svn: 263386	2016-03-13 19:48:18 +00:00
Rui Ueyama	7ede54310a	Redefine isGnuIfunc as a member function of SymbolBody. llvm-svn: 263365	2016-03-13 04:40:14 +00:00
George Rimar	777f96304e	Recommit of r263252, [ELF] - Change all messages to lowercase to be consistent. which was reverted because included unrelative changes by mistake. Original commit message: [ELF] - Change all messages to lowercase to be consistent. That is directly opposite to http://reviews.llvm.org/D18045, which was reverted. This patch changes all messages to start from lowercase letter if they were not before. That is done to be consistent with clang. Differential revision: http://reviews.llvm.org/D18085 llvm-svn: 263337	2016-03-12 08:31:34 +00:00
Rui Ueyama	f714955402	Revert r263252: "[ELF] - Change all messages to lowercase to be consistent." This reverts commit r263252 because the change contained unrelated changes. llvm-svn: 263272	2016-03-11 18:46:51 +00:00
George Rimar	96bcdae1a5	[ELF] - Change all messages to lowercase to be consistent. That is directly opposite to http://reviews.llvm.org/D18045, which was reverted. This patch changes all messages to start from lowercase letter if they were not before. That is done to be consistent with clang. Differential revision: http://reviews.llvm.org/D18085 llvm-svn: 263252	2016-03-11 16:40:55 +00:00
Rafael Espindola	1f5b70f64f	Represent local symbols with DefinedRegular. llvm-svn: 263237	2016-03-11 14:21:37 +00:00
Rafael Espindola	87d9f10733	Compute value of local symbol with getVA. llvm-svn: 263225	2016-03-11 12:19:05 +00:00
Rafael Espindola	67d72c02bc	Create a SymbolBody for locals. pr26878 shows a case where locals have to be in the got. llvm-svn: 263222	2016-03-11 12:06:30 +00:00
George Rimar	56e0d53e92	[ELF] - Move initSymbols() to Driver.cpp. NFC. That is followup for http://reviews.llvm.org/D18047 patch. initSymbols() moved to Driver.cpp and made static. llvm-svn: 263214	2016-03-11 10:07:18 +00:00
Rui Ueyama	17d6983a4e	Rename MaxAlignment -> Alignment. We can argue about a maximum alignment of a group of symbols, but for each symbol, there is only one alignment. So it is a bit weird that each symbol has a "maximum alignment". llvm-svn: 263151	2016-03-10 18:58:53 +00:00
George Rimar	3498c7fbd0	[ELF] - Refactor of SymbolBody::compare() That makes it a bit shorter. Differential revision: http://reviews.llvm.org/D18004 llvm-svn: 263144	2016-03-10 18:49:24 +00:00

1 2 3

122 Commits