llvm-project

Commit Graph

Author	SHA1	Message	Date
James Henderson	8dd4c06a77	[ELF] Pad x86 executable sections with 0xcc int3 instructions Executable sections should not be padded with zero by default. On some architectures, 0x00 is the start of a valid instruction sequence, so can confuse disassembly between InputSections (and indeed the start of the next InputSection in some situations). Further, in the case of misjumps into padding, padding may start to be executed silently. On x86, the "0xcc" byte represents the int3 trap instruction. It is a single byte long so can serve well as padding. This change switches x86 (and x86_64) to use this value for padding in executable sections, if no linker script directive overrides it. It also puts the behaviour into place making it easy to change the behaviour of other targets when desired. I do not know the relevant instruction sequences for trap instructions on other targets however, so somebody should add this separately. Because the old behaviour simply wrote padding in the whole section before overwriting most of it, this change also modifies the padding algorithm to write padding only where needed. This in turn has caused a small behaviour change with regards to what values are written via Fill commands in linker scripts, bringing it into line with ld.bfd. The fill value is now written starting from the end of the previous block, which means that it always starts from the first byte of the fill, whereas the old behaviour meant that the padding sometimes started mid-way through the fill value. See the test changes for more details. Reviewed by: ruiu Differential Revision: https://reviews.llvm.org/D30886 Bugzilla: http://bugs.llvm.org/show_bug.cgi?id=32227 llvm-svn: 299635	2017-04-06 09:29:08 +00:00
Rui Ueyama	63fda39e2f	Remove unnecessary virtual dtor. This class doesn't have virtual member functions, and no instances of this class is deleted through base pointers. llvm-svn: 299581	2017-04-05 19:21:35 +00:00
Rui Ueyama	03fc8d1e0d	Fix comments. llvm-svn: 299579	2017-04-05 19:20:54 +00:00
Rui Ueyama	4eb2eccb24	Rename ScriptConfig::UndefinedSymbols ReferencedSymbols. Symbols referenced by linker scripts are not necessarily be undefined, so the previous name didn't convey the meaining of the variable. llvm-svn: 299573	2017-04-05 18:02:30 +00:00
Rui Ueyama	96b3fe025a	Do not make ScriptParser class public. This class is used only within this file, so it can be file-local. llvm-svn: 299516	2017-04-05 05:08:01 +00:00
Rui Ueyama	2ec34544aa	Move the parser for the linker script to a separate file. LinkerScript.cpp contains both the linker script processor and the linker script parser. I put both into a single file, but the file grown too large, so it's time to put them into two different files. llvm-svn: 299515	2017-04-05 05:07:39 +00:00
Rui Ueyama	8f99f73c8f	Use make to create linker script command objects. It simplifies variable types. llvm-svn: 299505	2017-04-05 03:20:42 +00:00
Rui Ueyama	d379f7357d	Remove default arguments because they don't improve readability. llvm-svn: 299504	2017-04-05 03:20:22 +00:00
Rui Ueyama	72e107f302	Return a result from computeInputSections instead of mutating its argument. This should improve readability. llvm-svn: 299498	2017-04-05 02:05:48 +00:00
Petr Hosek	30f16b2339	[ELF] Allow references to reserved symbols in linker scripts This requires collectign all symbols referenced in the linker script and adding them to symbol table as undefined symbol. Differential Revision: https://reviews.llvm.org/D31147 llvm-svn: 298577	2017-03-23 03:52:34 +00:00
Rui Ueyama	a34da93847	Make elf::ScriptConfig a LinkerScript class member variable. LinkerScript used to be a template class, so we couldn't instantiate that class in elf::link. We instantiated ScriptConfig class earlier instead so that the linker script parser can store configurations to the object. Now that LinkerScript is not a template, it doesn't make sense to separate ScriptConfig from LinkerScript. This patch merges them. llvm-svn: 298457	2017-03-21 23:03:09 +00:00
Rui Ueyama	b8dd23f56e	Rename LinkerScriptBase -> LinkerScript. llvm-svn: 298456	2017-03-21 23:02:51 +00:00
George Rimar	a8dba48762	[ELF] - Combine LinkerScriptBase and LinkerScript<ELFT> Patch removes templated linkerscript class. Unfortunately that required 2 additional static methods findSymbol() and addRegularSymbol() because code depends on Symtab<ELFT>::X Differential revision: https://reviews.llvm.org/D30982 llvm-svn: 298241	2017-03-20 10:09:58 +00:00
Rafael Espindola	7ba5f47eb8	Handle & and \| of non abs values. Handling & in particular is probably important because of its use in aligning addresses. llvm-svn: 298096	2017-03-17 14:55:36 +00:00
Rafael Espindola	72dc195d78	Change our linker script expr representation. This fixes pr32031 by representing the expressions results as a SectionBase and offset. This allows us to use an input section directly instead of getting lost trying to compute an offset in an outputsection when not all the information is available yet. This also creates a struct to represent the value of and expression, allowing the expression itself to be a simple typedef. I think this is easier to read and will make it easier to extend the expression computation to handle more complicated cases. llvm-svn: 298079	2017-03-17 13:05:04 +00:00
Rui Ueyama	98e55de699	Revert r297850: [ELF] - Linkerscript: make Dot public and remove getDot(). NFC. This reverts commit r297850 because this change was made based on a miscommunication. llvm-svn: 298001	2017-03-16 21:50:30 +00:00
George Rimar	20055d4cd2	[ELF] - Linkerscript: make Dot public and remove getDot(). NFC. Suggested by Rui Ueyama, also groups member variables in a single place, while I am here. llvm-svn: 297850	2017-03-15 16:07:02 +00:00
George Rimar	503206c567	[ELF] - Move LinkerScript::discard to LinkerScriptBase. NFC. Became possible after r297844 llvm-svn: 297848	2017-03-15 15:42:44 +00:00
Petr Hosek	02ad516b2e	Support ABSOLUTE on the right hand side in linker scripts This also requires postponing the assignment the assignment of symbols defined in input linker scripts since those can refer to output sections and in case we don't have a SECTIONS command, we need to wait until all output sections have been created and assigned addresses. Differential Revision: https://reviews.llvm.org/D30851 llvm-svn: 297802	2017-03-15 03:33:23 +00:00
George Rimar	a2a1ef1abc	[ELF] - Move members of LinkerScript to LinkerScriptBase. NFC. That moves all members that s possible to move for now (all which does not depend on ELFT templating). After that change LinkerScript contains only 8 methods in total, and I believe it is possible to move them all after tweaking other parts of linker. And we will be able to have single class for linkerscript at the end. llvm-svn: 297735	2017-03-14 12:03:34 +00:00
George Rimar	d83ce1b49d	[ELF] - Devirtualize LinkerScriptBase::getOutputSectionSize. NFC. It does not use ELFT templates so can be non-virtual. llvm-svn: 297727	2017-03-14 10:24:47 +00:00
George Rimar	851dc1e84d	[ELF] - Devirtualize LinkerScriptBase::getOutputSection It does not use ELFT templates so can be non-virtual. llvm-svn: 297725	2017-03-14 10:15:53 +00:00
George Rimar	d0bee506a0	[ELF] - Simplify LinkerScriptBase::getDot(). NFC. That makes it not dependent on virtual call, keeping logic the same. llvm-svn: 297723	2017-03-14 10:05:43 +00:00
George Rimar	0c1c8085bc	[ELF] - Move ThreadBssOffset and Dot to LinkerScriptBase. NFC. One more step to combine LinkerScript and LinkerScriptBase. llvm-svn: 297722	2017-03-14 10:00:19 +00:00
George Rimar	2d2621090d	[ELF] - Step to combine LinkerScript and LinkerScriptBase We can move all not templated functionality to LinkerScriptBase. Patch do that for hasPhdrsCommands() and shows how it helps to detemplate things in other places. Probably we should be able to merge these 2 classes into single one after such steps. Even if not, it still looks as reasonable cleanup for me. Differential revision: https://reviews.llvm.org/D30895 llvm-svn: 297714	2017-03-14 09:03:53 +00:00
George Rimar	78aa270041	[ELF] - Remove unnecessary template. NFC. llvm-svn: 297622	2017-03-13 14:40:58 +00:00
Rafael Espindola	4595df94bb	Don't pass Dot to every callback. It is available from ScriptBase. llvm-svn: 297472	2017-03-10 16:04:26 +00:00
Rafael Espindola	9bd4566dac	Use SectionBase for linker script expressions. This is a small step for fixing pr32031, which needs expressions that point to input sections. llvm-svn: 297431	2017-03-10 00:47:33 +00:00
Rafael Espindola	5616adf655	Remove DefinedSynthetic. With this we have a single section hierarchy. It is a bit less code, but the main advantage will be in a future patch being able to handle foo = symbol_in_obj; in a linker script. Currently that fails since we try to find the output section of symbol_in_obj. With this we should be able to just return an InputSection from the expression. llvm-svn: 297313	2017-03-08 22:36:28 +00:00
Rui Ueyama	02a036f2e6	De-template OutputSectionFactory. Since OutputSection is no longer a template, it doesn't make much sense to tempalte its factory class. llvm-svn: 296308	2017-02-27 02:31:48 +00:00
Rafael Espindola	24e6f363c5	Merge OutputSectionBase and OutputSection. NFC. Now that all special sections are SyntheticSections, we only need one OutputSection class. llvm-svn: 296127	2017-02-24 15:07:30 +00:00
Rafael Espindola	774ea7d0a9	Make InputSection a class. NFC. With the current design an InputSection is basically anything that goes directly in a OutputSection. That includes plain input section but also synthetic sections, so this should probably not be a template. llvm-svn: 295993	2017-02-23 16:49:07 +00:00
George Rimar	2146787609	[ELF] - Refactoring of LMA offset handling code. NFC. Thanks to Rui Ueyama for suggestion. llvm-svn: 295943	2017-02-23 07:57:55 +00:00
Rafael Espindola	c404d50d7c	Merge InputSectionData and InputSectionBase. Now that InputSectionBase is not a template there is no reason to have the two. llvm-svn: 295924	2017-02-23 02:32:18 +00:00
Rafael Espindola	b4c9b81aad	Convert InputSectionBase to a class. Removing this template is not a big win by itself, but opens the way for removing more templates. llvm-svn: 295923	2017-02-23 02:28:28 +00:00
George Rimar	a8d8dcf6ef	[ELF] - Addressed post commit review comments for D30187 * Added comment. * Pass std::string copy instead using move semantic. llvm-svn: 295817	2017-02-22 09:13:04 +00:00
George Rimar	ae4761c186	[ELF] - Postpone evaluation of LMA offset. Previously we evaluated the values of LMA incorrectly for next cases: .text : AT(ADDR(.text) - 0xffffffff80000000) { ... } .data : AT(ADDR(.data) - 0xffffffff80000000) { ... } .init.begin : AT(ADDR(.init.begin) - 0xffffffff80000000) { ... } Reason was that we evaluated offset when VA was not assigned. For case above we ended up with 3 loads that has similar LMA and it was incorrect. That is critical for linux kernel. Patch updates the offset after VA calculation. That fixes the issue. Differential revision: https://reviews.llvm.org/D30163 llvm-svn: 295722	2017-02-21 15:08:18 +00:00
George Rimar	2ee2d2dcb5	[ELF] - Improve diagnostic messages for move location counter errors. Previously LLD would error out just "ld.lld: error: unable to move location counter backward" What does not really reveal the place of issue, Patch adds location to the output. Differential revision: https://reviews.llvm.org/D30187 llvm-svn: 295720	2017-02-21 14:50:38 +00:00
Rafael Espindola	679828ff92	Diagnose another case of the location counter moving backwards. This case should be possible to handle, but it is hard: * In order to create program headers correctly, we have to scan the sections in the order they are in the file. * To find that order, we have to "execute" the linker script. * The linker script can contain SIZEOF_HEADERS. So to support this we have to start with a guess of how many headers we need (3), run the linker script and try to create the program headers. If it turns out we need more headers, we run the script again with a larger SIZEOF_HEADERS. Also, running the linker script depends on knowing the size of the sections, so we have to finalize them. But creating the program headers can change the value stored in some sections, so we have to split size finalization and content finalization. Looks like the last part is also needed for range extension thunks, so we might support this at some point. For now just report an error instead of producing broken files. llvm-svn: 295458	2017-02-17 16:26:13 +00:00
Rafael Espindola	4cd7352c4f	Reject moving the location counter backwards. We were only checking when the assignment was inside a section. llvm-svn: 295454	2017-02-17 16:01:51 +00:00
Rafael Espindola	8290274c13	Share more output section creation code. We can do this now that the linker script and the writer agree on which sections should be combined. llvm-svn: 295341	2017-02-16 17:32:26 +00:00
Rui Ueyama	8a8a953e99	Rename NotFlags -> NegFlags. Negative flags are still bit flags, so I think "not flag" is a very good name. llvm-svn: 293143	2017-01-26 02:58:59 +00:00
Meador Inge	b889744e5b	[LinkerScript] Implement `MEMORY` command As specified here: * https://sourceware.org/binutils/docs/ld/MEMORY.html#MEMORY There are two deviations from what is specified for GNU ld: 1. Only integer constants and not constant expressions are allowed in `LENGTH` and `ORIGIN` initializations. 2. The `I` and `L` attributes are not implemented. With (1) there is currently no easy way to evaluate integer only constant expressions. This can be enhanced in the future. With (2) it isn't clear how these flags map to the `SHF_*` flags or if they even make sense for an ELF linker. Differential Revision: https://reviews.llvm.org/D28911 llvm-svn: 292875	2017-01-24 02:34:00 +00:00
Eugene Leviant	f6aeed3624	[ELF] Linkerscript: print location of undefined symbol usage Differential revision: https://reviews.llvm.org/D27194 llvm-svn: 290339	2016-12-22 13:13:12 +00:00
Vitaly Buka	0b7de06a23	Fix build broken by changes in StringMatcher interface r290213 llvm-svn: 290231	2016-12-21 02:27:14 +00:00
Rafael Espindola	17cb7c0a2a	Detemplate PhdrEntry. NFC. llvm-svn: 290115	2016-12-19 17:01:01 +00:00
Meador Inge	95c7d8d2a7	[ELF] Allow output section data commands to take expressions The current implementation of the output section data store commands can only handle integer literals, but it should really handle arbitrary expressions [1]. This commit fixes that. [1] https://sourceware.org/binutils/docs-2.27/ld/Output-Section-Data.html#Output-Section-Data Differential Revision: https://reviews.llvm.org/D27561 llvm-svn: 289152	2016-12-08 23:21:30 +00:00
Rafael Espindola	d0ebd84c42	Change the implementation of --dynamic-list to use linker script parsing. The feature is documented as ----------------------------- The format of the dynamic list is the same as the version node without scope and node name. See *note VERSION:: for more information. -------------------------------- And indeed qt uses a dynamic list with an 'extern "C++"' in it. With this patch we support that The change to gc-sections-shared makes us match bfd. Just because we kept bar doesn't mean it has to be in the dynamic symbol table. The changes to invalid-dynamic-list.test and reproduce.s are because of the new parser. The changes to version-script.s are the only case where we change behavior with regards to bfd, but I would like to see a mix of --version-script and --dynamic-list used in the wild before complicating the code. llvm-svn: 289082	2016-12-08 17:54:26 +00:00
Eugene Leviant	2a942c4b45	[ELF] Print file:line for unknown PHDR error Differential revision: https://reviews.llvm.org/D27335 llvm-svn: 288678	2016-12-05 16:38:32 +00:00
Eugene Leviant	ed30ce7ae4	[ELF] Print file:line for 'undefined section' errors Differential revision: https://reviews.llvm.org/D27108 llvm-svn: 288019	2016-11-28 09:58:04 +00:00

1 2 3 4

188 Commits