llvm-project

Commit Graph

Author	SHA1	Message	Date
George Rimar	ee01b1d390	[ELF] - Print LMA in a -Map file. Currently, LLD prints VA, but not LMA in a map file. It seems can be useful to print both to reveal layout details and patch implements it. Differential revision: https://reviews.llvm.org/D44899 llvm-svn: 329271	2018-04-05 10:51:06 +00:00
George Rimar	1fc9f39bd5	[ELF] - Check that output sections fit in address space. Added checks to test that we do not produce output where VA of sections overruns the address space available. Differential revision: https://reviews.llvm.org/D43820 llvm-svn: 329063	2018-04-03 12:39:28 +00:00
George Rimar	fd11560f6e	[ELF] - Linkerscript: support MIN and MAX. Sample for the OVERLAY command from the spec (https://access.redhat.com/documentation/en-US/Red_Hat_Enterprise_Linux/4/html/Using_ld_the_GNU_Linker/sections.html) uses MAX command that we do not support currently: . = 0x1000 + MAX (SIZEOF (.text0), SIZEOF (.text1)); This patch implements support for MIN and MAX. Differential revision: https://reviews.llvm.org/D44734 llvm-svn: 328696	2018-03-28 11:33:00 +00:00
George Rimar	a6ce78ece1	This is PR36799. Currently, we might have a bug with scripts like below: .foo : ALIGN(8) { *(.foo) } > ram because do not expand the memory region when doing ALIGN. This might result in file range overlaps. The patch fixes the issue. Differential revision: https://reviews.llvm.org/D44730 llvm-svn: 328479	2018-03-26 08:58:16 +00:00
George Rimar	d8281379f9	[ELF] - Do not ignore discarding of .rela.plt/.rela.dyn, allow doing custom layout for them. Currently when we build input sections list in linker script we ignore all rel[a] sections. That was done to support scripts like .rela.dyn : { (.rela.data) } for emit relocs. Though as a result following scripts were also silently ignored: /DISCARD/ : { (.rela.plt) /DISCARD/ : { *(.rela.dyn) and we produced output with this sections. That is not ideal. The solution this patch suggests is simple: do not ignore synthetic rel[a] sections. That way we can enable common discarding logic for them and report a proper error. Differential revision: https://reviews.llvm.org/D41640 llvm-svn: 328419	2018-03-24 13:10:19 +00:00
George Rimar	54634f1990	[ELF] - Another fix for "LLD crashes with --emit-relocs when trying to proccess .eh_frame" This fixes PR36367 which is about segfault when --emit-relocs is used together with .eh_frame sections which happens because of reordering of regular and .rel[a] sections. Path changes loop that iterates over input sections to create relocation target sections first. Differential revision: https://reviews.llvm.org/D44679 llvm-svn: 328299	2018-03-23 09:18:31 +00:00
Rui Ueyama	aa92fca83c	Fix linker script operator precedence. "&" should have higher priority than "\|" [1]. Previously, they had the same priority. [1] https://sourceware.org/binutils/docs/ld/Operators.html Differential Revision: https://reviews.llvm.org/D43880 llvm-svn: 327684	2018-03-15 23:12:33 +00:00
George Rimar	76f1c78dea	[ELF] - Simplify test case. NFCI. llvm-svn: 327614	2018-03-15 09:26:08 +00:00
George Rimar	84bcabcb86	[ELF] - Show data and assignment commands in the map file. Patch teaches LLD to print BYTE/SHORT/LONG/QUAD and location move commands to the map file. Differential revision: https://reviews.llvm.org/D44004 llvm-svn: 327612	2018-03-15 09:16:40 +00:00
George Rimar	271ed6eb0d	[ELF] - Convert overlapping-sections.s testcase to x86 and cleanup. Patch do the following changes: * Test case was converted from MIPS to x86. * Removed part of the test checking we are able to produce a valid output. Since we do that already in other tests, this one's intention should be only to check we are still able to report overlaps and/or produce broken output with overlaps. Differential revision: https://reviews.llvm.org/D44438 llvm-svn: 327480	2018-03-14 07:44:23 +00:00
George Rimar	b9f4b70f20	[ELF} - Fix build bots. llvm-svn: 327419	2018-03-13 16:23:48 +00:00
George Rimar	b7288836ca	[ELF] - Fix mistype in comment. NFC. llvm-svn: 327417	2018-03-13 16:11:02 +00:00
George Rimar	f95e7c6f7a	[ELF] - Rename test cases to *.test. This is a follow-up for r327410. llvm-svn: 327416	2018-03-13 16:02:45 +00:00
George Rimar	cfd2c97008	[ELF] - Represent tests as linker scripts instead of asm. This follows recently started direction and sometimes allows to fully get rid from `echo` calls. I'll rename changed files to *.test in a follow-up. llvm-svn: 327410	2018-03-13 15:47:14 +00:00
George Rimar	796684b451	[ELF] - Implement INSERT BEFORE. This finishes PR35877. INSERT BEFORE used similar to INSERT AFTER, it inserts sections before the given target section. Differential revision: https://reviews.llvm.org/D44380 llvm-svn: 327378	2018-03-13 09:18:11 +00:00
George Rimar	ebc1d1fdde	[ELF] - Fix wrong "REQUIRES" in test. Its a follow up for r327374 to fix BB. llvm-svn: 327377	2018-03-13 08:50:36 +00:00
George Rimar	2313086726	[ELF] - Restrict section offsets that exceeds file size. This is part of PR36515. With some linkerscripts it is possible to get file offset overlaps and overflows. Currently LLD checks overlaps in checkNoOverlappingSections(). And also we allow broken output with --no-inhibit-exec. Problem is that sometimes final offset of sections is completely broken and we calculate output file size wrong and might crash. Patch implements check to verify that there is no output section which offset exceeds file size. Differential revision: https://reviews.llvm.org/D43819 llvm-svn: 327376	2018-03-13 08:47:17 +00:00
George Rimar	afbf90aef9	[ELF] - Drop special flags for empty output sections. This fixes PR36598. LLD currently crashes when we have empty output section with SHF_LINK_ORDER flag. This might happen if we place an empty synthetic section in the linker script, but keep output section alive with the use of additional symbol, for example. The patch fixes the issue by dropping all special flags for empty sections. Differential revision: https://reviews.llvm.org/D44376 llvm-svn: 327374	2018-03-13 08:32:56 +00:00
George Rimar	e3f198d58a	[ELF] - Change consume()->expect() in INSERT AFTER parsing. AFTER keyword is mandatory and consume() was used by mistake here. We accepted broken script before this patch, testcase shows the issue. llvm-svn: 327260	2018-03-12 12:34:43 +00:00
George Rimar	9e2c8a9db1	[ELF] - Support "INSERT AFTER" statement. This implements INSERT AFTER in a following way: During reading scripts it collects all insert statements. After we done and read all files it inserts statements into script commands list. With that: * Rest of code does know nothing about INSERT. * Approach is straightforward and have no visible limitations. * It is also easy to support INSERT BEFORE (was seen in clang code once). * Should work for PR35877 and similar cases. Cons: * It assumes we have "main" scripts that describes sections. Differential revision: https://reviews.llvm.org/D43468 llvm-svn: 327003	2018-03-08 14:54:38 +00:00
George Rimar	bf3c384673	[ELF] - Adjust rangeToString to report ranges in a different format. It was raised during the review of D43819. LLD usually use [X, Y] for reporting ranges, like below: "relocation R_386_16 out of range: 65536 is not in [0, 65535]" Patch changes rangeToString() to do the same. Differential revision: https://reviews.llvm.org/D44207 llvm-svn: 326918	2018-03-07 17:54:25 +00:00
George Rimar	527bfd7a48	[ELF] - Recommit r326892,r326893 "[ELF] - Report LMA region overflows." With fix: add missing "RUN:" prefix to test case. Original commit message: We do not report LMA region overflows currently. Both GNU linkers do that. The patch implements it. Differential revision: https://reviews.llvm.org/D44094 llvm-svn: 326895	2018-03-07 12:44:18 +00:00
George Rimar	06846c2251	[ELF] - Revert r326892, r326893. Bots are still unhappy: http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-ubuntu-fast/builds/26259 llvm-svn: 326894	2018-03-07 12:33:00 +00:00
George Rimar	64b2ba1f81	[ELF] - Fix build bot after r326892 "[ELF] - Report LMA region overflows." Removed excessive line from testcase. llvm-svn: 326893	2018-03-07 12:16:26 +00:00
George Rimar	97e054e00d	[ELF] - Report LMA region overflows. We do not report LMA region overflows currently. Both GNU linkers do that. The patch implements it. Differential revision: https://reviews.llvm.org/D44094 llvm-svn: 326892	2018-03-07 11:54:30 +00:00
George Rimar	54baa5f45f	[ELF] - Allow discarding .hash and .gnu.hash from linker script. Currently, LLD segfaults when linker script attempts to discard one of the hash sections. This patch fixes that. Differential revision: https://reviews.llvm.org/D44012 llvm-svn: 326891	2018-03-07 11:47:15 +00:00
George Rimar	162d436c8e	[ELF] - Support moving location counter when MEMORY is used. We do not expand memory region correctly for following scripts: .foo.1 : { *(.foo.1) . += 0x1000; } > ram Patch generalizes expanding of output sections and memory regions in one place and fixes the issue. Differential revision: https://reviews.llvm.org/D43999 llvm-svn: 326688	2018-03-05 10:54:03 +00:00
George Rimar	7b91e2133e	[ELF] - Report location for div/mod by zero. "division by zero" or "modulo by zero" are not very informative errors and even probably confusing as does not let to know that error is coming from linker script. Patch adds location reporting. Differential revision: https://reviews.llvm.org/D43934 llvm-svn: 326686	2018-03-05 10:02:44 +00:00
George Rimar	97785af464	[ELF] - Report error when memory region is overflowed by data commands. LLD can not catch a memory area overflow when using a data command. If we have the script below: .foo : { *(.foo) BYTE(0x1) } > ram where BYTE overflows the ram region, we do not report it currently. Patch fixes that. Differential revision: https://reviews.llvm.org/D43948 llvm-svn: 326545	2018-03-02 08:11:58 +00:00
Rafael Espindola	e75b42ee4e	Don't allocate a header bellow address 0. With the current code if the script has a PHDRS we always obey and try to allocate a header. This can cause Min - HeaderSize to underflow. It looks like bfd actually prints an error for this case. With this patch we do the same. Found while looking at pr36515. llvm-svn: 326441	2018-03-01 15:25:46 +00:00
George Rimar	b068b03793	[ELF] - Don't crash on broken MEMORY declaration. LLD crashes with broken scripts shown in testcase, because fails to read memory regon name and accesses MemoryRegions's element which is nullptr. Patch fixes it. Differential revision: https://reviews.llvm.org/D43866 llvm-svn: 326431	2018-03-01 12:36:01 +00:00
George Rimar	c4df670dea	[ELF] - Do not remove empty sections that use symbols in expressions. This is PR36515. Currenly if we have a script like .debug_info 0 : { *(.debug_info) }, we would not remove this section and keep it in the output. That does not work, because it is common case for debug sections to have a zero address expression. Patch changes behavior so that we remove only sections that do not use symbols in its expressions. Differential revision: https://reviews.llvm.org/D43863 llvm-svn: 326430	2018-03-01 12:27:04 +00:00
George Rimar	f3e93b23f7	[ELF] - Fix eh-frame-reloc-out-of-range.test. Was broken after recent testcases changes. llvm-svn: 326427	2018-03-01 10:38:51 +00:00
Rui Ueyama	05660daced	Convert more tests as linker scripts instead of assembly. llvm-svn: 326415	2018-03-01 04:21:42 +00:00
Rui Ueyama	dc32dc1770	Convert more .s files to linker script files. Summary: This change removes large "echo" commands from the test by writing tests themselves as linker scripts. Reviewers: rafael Subscribers: emaste, javed.absar, llvm-commits, arichardson Differential Revision: https://reviews.llvm.org/D43900 llvm-svn: 326403	2018-03-01 01:19:12 +00:00
Rui Ueyama	2dfe49a441	Write some tests as linker scripts instead of assembly files. Some linker script test cases contain only a few lines of assembly and a long linker script. Such tests are easier to maintain if we write the main test file as a linkier script instead of assembly. Differential Revision: https://reviews.llvm.org/D43887 llvm-svn: 326363	2018-02-28 20:22:42 +00:00
Rui Ueyama	39ba31ff50	Add "%" operator to the linker script. This patch improves compatibility with GNU linkers. Differential Revision: https://reviews.llvm.org/D43883 llvm-svn: 326348	2018-02-28 18:38:13 +00:00
Igor Kudrin	c844524e46	[ELF] Process linker scripts deeper when declaring symbols. We should process symbols inside output section declarations the same way as top-level ones. Differential Revision: https://reviews.llvm.org/D43008 llvm-svn: 326305	2018-02-28 05:55:56 +00:00
Igor Kudrin	3345c9ac18	[ELF] Create and export symbols provided by a linker script if they referenced by DSOs. It should be possible to resolve undefined symbols in dynamic libraries using symbols defined in a linker script. Differential Revision: https://reviews.llvm.org/D43011 llvm-svn: 326176	2018-02-27 07:18:07 +00:00
Rafael Espindola	79c23eec04	Keep flags from phantom synthetic sections. This fixes pr36475. I think this code can be simplified a bit, but I would like to check in the more direct fix if we are in agreement on the direction and then refactor. This is not something that bfd does. The issue is not noticed in bfd because it keeps fewer sections from the linkerscript in the output. The reasons why it seems reasonable to do this: - As George noticed, we would still keep the flags if the output section had both an empty synthetic section and a regular section - We need an heuristic to find the flags of output sections. Using the flags of a synthetic section that would have been there seems a reasonable heuristic. llvm-svn: 326137	2018-02-26 22:32:15 +00:00
George Rimar	db1a062447	[ELF] - Do not remove empty output sections that are explicitly assigned to phdr in script. This continues direction started in D43069. We can keep sections that are explicitly assigned to segment in script. It helps to simplify code. Differential revision: https://reviews.llvm.org/D43571 llvm-svn: 325887	2018-02-23 10:53:04 +00:00
George Rimar	3cdf0d969a	[ELF] - Report error if removed empty output section declaration used undefined symbols. This is for fixing PR36297. Issue itself is that if we have SECTIONS { .bar (a+b) : { *(.stub) } }; script and no section .stub, when LLD will remove .bar, but produce output with undefined symbols a and b. Differential revision: https://reviews.llvm.org/D43069 llvm-svn: 325875	2018-02-23 10:15:54 +00:00
George Rimar	4e6f52c9a4	[ELF] - Add testcase documenting flags assigned when empty synthetic section is removed. This responds to PR36475, r325763 led to unexprected layout change, though new behavior seems to be more correct. Previously we could have following script: .foo : { (.foo) } .bar : { (.synthetic_empty) BYTE(0x11) }} where synthetic_empty is a synthetic section which is empty and hence removed by linker. Before r325763 .bar would receive section flags from .synthetic_empty, but after this revision it receives flags the same as .foo section has. It is the same as if there would not be any synthetic_empty section in a script, so looks reasonable and consistent behavior: .foo : { *(.foo) } .bar : { BYTE(0x11) }} Patch adds testcase to document it. Differential revision: https://reviews.llvm.org/D43632 llvm-svn: 325873	2018-02-23 09:57:17 +00:00
George Rimar	db7c630b01	[ELF] - Simplify testcase. NFC. This removes script input file and inlines script into testcase body. That is consistent with othet LS tests and makes testcase easier to read. llvm-svn: 325673	2018-02-21 11:56:55 +00:00
Rui Ueyama	ff59a899d6	Use toString to print out garbage-collected sections. Currently, archive file name is missing in this message. In general, we should avoid constructing strings in an ad-hoc manner and instead use toString() to get consistent output strings. Differential Revision: https://reviews.llvm.org/D43420 llvm-svn: 325416	2018-02-17 00:09:49 +00:00
George Rimar	1c08e9f5ce	[ELF] - Support COPY, INFO, OVERLAY output sections attributes. This is PR36298. (COPY), (INFO), (OVERLAY) all have the same effect: section should be marked as non-allocatable. (https://www.eecs.umich.edu/courses/eecs373/readings/Linker.pdf, 3.6.8.1 Output Section Type) Differential revision: https://reviews.llvm.org/D43071 llvm-svn: 325331	2018-02-16 10:42:58 +00:00
Igor Kudrin	25f917341e	[ELF] Simplify handling of AT section attribute. This also makes the behavior close to GNU ld's. Differential Revision: https://reviews.llvm.org/D43284 llvm-svn: 325213	2018-02-15 06:13:52 +00:00
George Rimar	308c92b25d	[ELF] - Fix BB after r324463. Test requires arm, but specified x86. llvm-svn: 324464	2018-02-07 09:41:14 +00:00
George Rimar	3d5e86e5ee	[ELF] - Remove unused synthetic sections correctly. This is PR35740 which now crashes because we remove unused synthetic sections incorrectly. We can keep input section description and corresponding output section live even if it must be empty and dead. This results in a crash because SHF_LINK_ORDER handling code tries to access first section which is nullptr in this case. Patch fixes the issue. Differential revision: https://reviews.llvm.org/D42681 llvm-svn: 324463	2018-02-07 09:11:07 +00:00
Rui Ueyama	532fa0e1ca	Make sure that --no-check-sections doesn't print out warning messages. Differential Revision: https://reviews.llvm.org/D42988 llvm-svn: 324434	2018-02-07 00:41:34 +00:00
Rui Ueyama	6a8e79b8e5	Add -{no,}-check-sections flags to enable/disable section overlchecking GNU linkers have this option. Differential Revision: https://reviews.llvm.org/D42858 llvm-svn: 324150	2018-02-02 22:24:06 +00:00
Rafael Espindola	27b2990d11	Sort each InputSectionDescription individually. This fixes pr36190. Thanks to James Henderson for the testcase and for pointing out how to fix this. llvm-svn: 323993	2018-02-01 19:30:15 +00:00
Alexander Richardson	6b367faa45	[ELF] Make overlapping output sections an error Summary: While trying to make a linker script behave the same way with lld as it did with bfd, I discovered that lld currently doesn't diagnose overlapping output sections. I was getting very strange runtime failures which I tracked down to overlapping sections in the resulting binary. When linking with ld.bfd overlapping output sections are an error unless --noinhibit-exec is passed and I believe lld should behave the same way here to avoid surprising crashes at runtime. The patch also uncovered an errors in the tests: arm-thumb-interwork-thunk was creating a binary where .got.plt was placed at an address overlapping with .got. Reviewers: ruiu, grimar, rafael Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D41046 llvm-svn: 323856	2018-01-31 09:22:44 +00:00
Rafael Espindola	a9f488588d	Run dos2unix on another file. NFC. llvm-svn: 323796	2018-01-30 18:05:56 +00:00
Rafael Espindola	c9265e81f4	Run dos2unix in a few files. NFC. llvm-svn: 323793	2018-01-30 17:24:28 +00:00
Rafael Espindola	22d533568b	Sort orphan section if --symbol-ordering-file is given. Before this patch orphan sections were not sorted. llvm-svn: 323779	2018-01-30 16:20:08 +00:00
George Rimar	c4ccfb5d93	[ELF] - Define linkerscript symbols early. Currently symbols assigned or created by linkerscript are not processed early enough. As a result it is not possible to version them or assign any other flags/properties. Patch creates Defined symbols for -defsym and linkerscript symbols early, so that issue from above can be addressed. It is based on Rafael Espindola's version of D38239 patch. Fixes PR34121. Differential revision: https://reviews.llvm.org/D41987 llvm-svn: 323729	2018-01-30 09:04:27 +00:00
Rafael Espindola	a0d7df3988	Put the header in the first PT_LOAD even if that PT_LOAD has a LMAExpr. This should fix PR36017. The root problem is that we were creating a PT_LOAD just for the header. That was technically valid, but inconvenient: we should not be making the ELF discontinuous. The solution is to allow a section with LMAExpr to be added to a PT_LOAD if that PT_LOAD doesn't already have a LMAExpr. llvm-svn: 323625	2018-01-29 03:44:44 +00:00
Rafael Espindola	db9dd5b43e	Improve LMARegion handling. This fixes the crash reported at PR36083. The issue is that we were trying to put all the sections in the same PT_LOAD and crashing trying to write past the end of the file. This also adds accounting for used space in LMARegion, without it all 3 PT_LOADs would have the same physical address. llvm-svn: 323449	2018-01-25 17:42:03 +00:00
Benjamin Kramer	3f47fcf102	[ELF] Keep tests from wrinting to the test directory. llvm-svn: 322943	2018-01-19 14:15:13 +00:00
Rafael Espindola	5e9c77624c	Handle parsing AT(ADDR(.foo-bar)). The problem we had with it is that anything inside an AT is an expression, so we failed to parse the section name because of the - in it. llvm-svn: 322801	2018-01-18 01:14:57 +00:00
George Rimar	0b89c55aea	[ELF] - Stop mixing order of -defsym/-script commands. Previously we always handled -defsym after other commands in command line. That made impossible to overload values set by -defsym from linker script: test.script: foo = 0x22; -defsym=foo=0x11 -script t.script would always set foo to 0x11. That is inconstent with common logic which allows to override command line options. it is inconsistent with bfd behavior and seems breaks assumption that -defsym is the same as linker script assignment, as -defsyms always handled out of command line order. Patch fixes the handling order. Differential revision: https://reviews.llvm.org/D42054 llvm-svn: 322625	2018-01-17 10:24:49 +00:00
Rafael Espindola	75702389bd	Fix incorrect physical address on self-referencing AT command. When a section placement (AT) command references the section itself, the physical address of the section in the ELF header was calculated incorrectly due to alignment happening right after the location pointer's value was captured. The problem was diagnosed and the first version of the patch written by Erick Reyes. llvm-svn: 322421	2018-01-12 23:26:25 +00:00
George Rimar	5d01a8be96	[ELF] - Fix for ld.lld does not accept "AT" syntax for declaring LMA region AT> lma_region expression allows to specify the memory region for section load address. Should fix PR35684. Differential revision: https://reviews.llvm.org/D41397 llvm-svn: 322359	2018-01-12 09:07:35 +00:00
James Henderson	e1689689d8	[ELF] Compress debug sections after assignAddresses and support custom layout Previously, in r320472, I moved the calculation of section offsets and sizes for compressed debug sections into maybeCompress, which happens before assignAddresses, so that the compression had the required information. However, I failed to take account of relocations that patch such sections. This had two effects: 1. A race condition existed when a debug section referred to a different debug section (see PR35788). 2. References to symbols in non-debug sections would be patched incorrectly. This is because the addresses of such symbols are not calculated until after assignAddresses (this was a partial regression caused by r320472, but they could still have been broken before, in the event that a custom layout was used in a linker script). assignAddresses does not need to know about the output section size of non-allocatable sections, because they do not affect the value of Dot. This means that there is no longer a reason not to support custom layout of compressed debug sections, as far as I'm aware. These two points allow for delaying when maybeCompress can be called, removing the need for the loop I previously added to calculate the section size, and therefore the race condition. Furthermore, by delaying, we fix the issues of relocations getting incorrect symbol values, because they have now all been finalized. llvm-svn: 321986	2018-01-08 10:17:03 +00:00
Rafael Espindola	2640a0a5e5	Align SHT_NOBITS sections is they are the first on a PT_LOAD. We normally want to ignore SHT_NOBITS sections when computing offsets. The sh_offset of section itself seems to be irrelevant and - If the section is in the middle of a PT_LOAD, it will make no difference on the computed offset of the followup section. - If it is in the end of a PT_LOAD, we want to avoid its alignment changing the offset of the followup sections. The issue is if it is at the start of the PT_LOAD. In that case we do have to align it so that the following sections have congruent address and offset module the page size. We were not handling this case. This should fix freebsd kernel link. llvm-svn: 321657	2018-01-02 16:46:30 +00:00
Rafael Espindola	6a97f80755	Fix output section offset and contents when linker script uses memory region and data commands. Advance the memory region offset when handling a linker script data command such as BYTE or LONG. Failure to advance the offset results in corrupted output with overlapping sections. Update tests to check for this combination of both a) memory regions and b) data commands. Fixes https://bugs.llvm.org/show_bug.cgi?id=35565 Patch by Owen Shaw! llvm-svn: 321418	2017-12-24 03:46:35 +00:00
Rafael Espindola	9cbb6dd1fc	Result of subtracting two symbols should be absolute. When two linker script symbols are subtracted, the result should be absolute. This is the behavior of binutils' ld. Patch by Erick Reyes! llvm-svn: 321390	2017-12-22 21:55:28 +00:00
Igor Kudrin	5966d15943	[ELF] Fix an assignment command at the end of an .ARM.exidx section. The value of the symbol in the assignment should include the sentinel entry. Differential Revision: https://reviews.llvm.org/D41234 llvm-svn: 321154	2017-12-20 08:56:10 +00:00
Rafael Espindola	4e125de4a6	Use # instead of // for comments in a test. The test was using both // and # before. llvm-svn: 321049	2017-12-19 00:53:06 +00:00
Peter Smith	96ca4f5e91	[ELF] Remove Duplicate .ARM.exidx sections The ARM.exidx section contains a table of 8-byte entries with the first word of each entry an offset to the function it describes and the second word instructions for unwinding if an exception is thrown from that function. The SHF_LINK_ORDER processing will order the table in ascending order of the functions described by the exception table entries. As the address range of an exception table entry is terminated by the next table entry, it is possible to merge consecutive table entries that have identical unwind instructions. For this implementation we define a table entry to be identical if: - Both entries are the special EXIDX_CANTUNWIND. - Both entries have the same inline unwind instructions. We do not attempt to establish if table entries that are references to .ARM.extab sections are identical. This implementation works at a granularity of a single .ARM.exidx InputSection. If all entries in the InputSection are identical to the previous table entry we can remove the InputSection. A more sophisticated but more complex implementation would rewrite InputSection contents so that duplicates within a .ARM.exidx InputSection can be merged. Differential Revision: https://reviews.llvm.org/D40967 llvm-svn: 320803	2017-12-15 11:09:41 +00:00
Igor Kudrin	f01caab4b7	[ELF] Prevent crash in writing an .ARM.exidx sentinel entry. We might crash in 'ARMExidxSentinelSection::writeTo()' because it expected the sentinel entry to be put in the same 'InputSectionDescription' as the last real entry. This assumption fails if the last output section command for .ARM.exidx is anything but an input section description, because in this case 'OutputSection::addSection()' creates a new 'InputSectionDescription'. Differential Revision: https://reviews.llvm.org/D41105 llvm-svn: 320668	2017-12-14 06:23:50 +00:00
Rui Ueyama	1ce416c635	Remove trailing whitespace. llvm-svn: 320520	2017-12-12 20:00:30 +00:00
James Henderson	8d0efdd5db	[ELF] Reset OutputSection size prior to processing linker script commands The size of an OutputSection is calculated early, to aid handling of compressed debug sections. However, subsequent to this point, unused synthetic sections are removed. In the event that an OutputSection, from which such an InputSection is removed, is still required (e.g. because it has a symbol assignment), and no longer has any InputSections, dot assignments, or BYTE()-family directives, the size member is never updated when processing the commands. If the removed InputSection had a non-zero size (such as a .got.plt section), the section ends up with the wrong size in the output. The fix is to reset the OutputSection size prior to processing the linker script commands relating to that OutputSection. This ensures that the size is correct even in the above situation. Additionally, to reduce the risk of developers misusing OutputSection Size and InputSection OutSecOff, they are set to simply the number of InputSections in an OutputSection, and the corresponding index respectively. We cannot completely stop using them, due to SHF_LINK_ORDER sections requiring them. Compressed debug sections also require the full size. This is now calculated in maybeCompress for these kinds of sections. Reviewers: ruiu, rafael Differential Revision: https://reviews.llvm.org/D38361 llvm-svn: 320472	2017-12-12 11:51:13 +00:00
Jake Ehrlich	0ca350a92d	[ELF] Change default output section type to SHT_NOBITS When an output section has no byte commands and has no input sections then it would be ideal if the type of the section is SHT_NOBITS so that the file can take up less space. This change sets the default type of of output sections to SHT_NOBITS instead of SHT_PROGBITS to allow this. This required some minor test changes (which double as tests for this new behavior) but extend-pt-load.s had be changed in a non-trivial way. Since it seems to me that the point of the test is to point out the consequences of how flags are assigned to output sections that don't have input sections I changed the test to work and still show how the memsize of the executable segment was changed. Differential Revision: https://reviews.llvm.org/D41082 llvm-svn: 320437	2017-12-11 23:25:27 +00:00
Alexander Richardson	d2481bed05	[ELF] When a relocation is out of range print the value and the range Reviewers: ruiu, grimar Reviewed By: ruiu Subscribers: emaste, nemanjai, javed.absar, kbarton, llvm-commits Differential Revision: https://reviews.llvm.org/D40962 llvm-svn: 320416	2017-12-11 20:47:21 +00:00
Rafael Espindola	63fcc5cccc	Create reserved symbols early so they can be versioned. This fixes pr35570. We were creating these symbols after parsing version scripts, so they could not be versioned. We cannot move the version script parsing later because we need it for lto. One option is to move both addReservedSymbols and createSyntheticSections earlier. The disadvantage is that some sections created by createSyntheticSections replace other input sections. For example, gdb index replaces .debug_gnu_pubnames, so it wants to run after gc sections so that it can set S->Live to false. What this patch does instead is to move just the ElfHeader creation early. llvm-svn: 320390	2017-12-11 17:23:28 +00:00
Rui Ueyama	2569edd9b8	Fix a test that didn't actually test anything. llvm-svn: 320117	2017-12-08 00:00:37 +00:00
George Rimar	78e27e830d	[ELF] - Produce relocation section name consistent with output section name when --emit-reloc used with linker script. This is for "Bug 35474 - --emit-relocs produces wrongly-named reloc sections". LLD currently for scripts like: .text.boot : { *(.text.boot) } emits relocation section with name .rela.text because does not take redefined name of output section into account and builds section name using rules for non-scripted case. Patch fixes this oddness. Differential revision: https://reviews.llvm.org/D40652 llvm-svn: 319526	2017-12-01 09:04:52 +00:00
Rafael Espindola	279e5fa715	Add missing test. NFC. We had no tests for what PROVIDE should do if there is a shared symbol with the same name. In both bfd and our existing implementation PROVIDE wins. Add a test for that. llvm-svn: 319486	2017-11-30 22:29:14 +00:00
Rafael Espindola	de38b3d22f	Handle copy relocations in symbol assignments. When a linker script has "foo = bar" and bar is the result of a copy relocation foo should point to the same location in .bss. This is part of a growing evidence that copy relocations should be implemented by using replaceSymbol to replace the SharedSymbol with a Defined. llvm-svn: 319449	2017-11-30 17:51:10 +00:00
Alexander Richardson	1de78471f5	[ELF] Fall back to search dirs for linker scripts specified with -T Summary: This matches the behaviour of ld.bfd: https://sourceware.org/binutils/docs/ld/Options.html#Options If scriptfile does not exist in the current directory, ld looks for it in the directories specified by any preceding '-L' options. Multiple '-T' options accumulate. Reviewers: ruiu, grimar Reviewed By: ruiu, grimar Subscribers: emaste, llvm-commits Differential Revision: https://reviews.llvm.org/D40129 llvm-svn: 318655	2017-11-20 15:43:20 +00:00
Alexander Richardson	f463042312	[ELF][MIPS] Fix crash in LLD when linking code that needs PIC thunks Summary: The bug triggers when the following conditions are met: - A thunk is created in a given input section S - A linker script is specified - There is at least one matcher in the linker script .text section output that does not match any of the sections in the input files, before the matcher that matches section S. The issue was found when linking the FreeBSD kernel for MIPS when built with -fPIC. Patch by Alfredo Mazzinghi. Reviewers: ruiu, psmith, atanasyan Reviewed By: ruiu Subscribers: peter.smith, emaste, sdardis, krytarowski, llvm-commits Differential Revision: https://reviews.llvm.org/D40174 llvm-svn: 318653	2017-11-20 15:37:19 +00:00
Rafael Espindola	8bc2a19ef8	Drop conflicting sh_entsize values. An output section can include elements from two input sections with different sh_entsize. When that happens the output section itself should not have a sh_entsize. llvm-svn: 318311	2017-11-15 17:35:22 +00:00
Rafael Espindola	a5d43d004a	Propagate sh_entsize out. No difference in practice other than having sh_entsize in the output. This should simplify the patch for handling SHF_MERGE in -r. Based on a patch by George Rimar. llvm-svn: 318306	2017-11-15 16:56:20 +00:00
George Rimar	8c825db25e	[ELF] - Linkerscript: fixed non-determinism when handling MEMORY. When findMemoryRegion do search to find a region for output section it iterates over MemoryRegions which is DenseMap and so does not guarantee iteration in insertion order. As a result selected region depends on its name and not on its definition position Testcase shows the issue, patch fixes it. Behavior after applying the patch seems consistent with bfd. Differential revision: https://reviews.llvm.org/D39544 llvm-svn: 317307	2017-11-03 08:21:51 +00:00
George Rimar	9814d15136	[ELF] - Implement --orphan-handling option. It is PR34946. Spec (http://man7.org/linux/man-pages/man1/ld.1.html) tells about --orphan-handling=MODE, option where MODE can be one of four: "place", "discard", "warn", "error". Currently we already report orphans when -verbose given, what becomes excessive with option implemented. Patch stops reporting orphans when -versbose is given, and support "place", "warn" and "error" modes. It is not yet clear that "discard" mode is useful so it is not supported. Differential revision: https://reviews.llvm.org/D39000 llvm-svn: 316583	2017-10-25 15:20:30 +00:00
George Rimar	f22ec9ddf6	[ELF] - Linkerscript: fix issue with SUBALIGN. This is PR34886. SUBALIGN command currently triggers failture if result expression is zero. Patch fixes the issue, treating zero as 1, what is consistent with other places and ELF spec it seems. Patch also adds "is power of 2" check for this and other expressions returning alignment. Differential revision: https://reviews.llvm.org/D38846 llvm-svn: 316580	2017-10-25 14:50:51 +00:00
Petr Hosek	2fd533db9f	[ELF] When placing orphans, handle case when last section is dead r315292 introduced a change that's supposed to consistently ignore "dead" output sections when placing orphans. Unfortunately, that change doesn't handle the special case when the orphan section is second to last section and the last section is dead (e.g. because it's being discarded) introducing a regression in some cases. This change handles this case by using the same predicate when checking the last section. Differential Revision: https://reviews.llvm.org/D39172 llvm-svn: 316307	2017-10-23 00:51:08 +00:00
George Rimar	81eca18df3	[ELF] - Linkerscript: Add `~` as separate math token. Previously we did not support following: foo = ~0xFF; and had to add space before numeric value: foo = ~ 0xFF That was constistent with ld.bfd < 2.30, which shows: script.txt:3: undefined symbol `~2' referenced in expression, but inconsistent with gold. It was fixed for ld.bfd 2.30 as well: https://sourceware.org/bugzilla/show_bug.cgi?id=22267 Differential revision: https://reviews.llvm.org/D36508 llvm-svn: 315569	2017-10-12 08:40:12 +00:00
George Rimar	26fa916deb	[ELF] - Do not set output section flags except SHF_{ALLOC,WRITE,EXECINSTR}. This is PR34546. Currently LLD creates output sections even if it has no input sections, but its command contains an assignment. Committed code just assigns the same flag that was used in previous live section. That does not work sometimes. For example if we have following script: .ARM.exidx : { (.ARM.exidx) } .foo : { _foo = 0; } } Then first section has SHF_LINK_ORDER flag. But section foo should not. That was a reason of crash in OutputSection::finalize(). LLD tried to calculate Link value, calling front() on empty input sections list. We should only keep access flags and omit all others when creating such sections. Patch fixes the crash observed. Differential revision: https://reviews.llvm.org/D37736 llvm-svn: 315441	2017-10-11 08:13:40 +00:00
James Henderson	b5ca92ef73	[ELF] Set Dot initially to --image-base value when using linker scripts When parsing linker scripts, LLD previously started with a '.' value of 0, regardless of the internal default image base for the target, and regardless of switches such as --image-base. It seems reasonable to use a different image base value when using linker scripts and --image-base is specified, since otherwise the switch has no effect. This change does this, as well as removing unnecessary initialisation of Dot where it is not used. The default image base should not be used when processing linker scripts, because this will change the behaviour for existing linker script users, and potentially result in invalid output being produced, as a subsequent assignment to Dot could move the location counter backwards. Instead, we maintain the existing behaviour of starting from 0 if --image-base is not specified. Reviewers: ruiu Differential Revision: https://reviews.llvm.org/D38360 llvm-svn: 315293	2017-10-10 10:09:35 +00:00
Andrew Ng	4d54a4b4f7	[LLD] Fix findOrphanPos to consistently ignore "dead" OutputSection's When findOrphanPos does the reverse search to find the OutputSection preceding the orphan's insertion point, look for a live OutputSection and ignore "dead" OutputSection's. This matches the behaviour of the forward search performed earlier in this function. Added test which without the above fix fails as a result of an orphan executable section being incorrectly placed in a non-executable segment. Differential Review: https://reviews.llvm.org/D38690 llvm-svn: 315292	2017-10-10 10:05:52 +00:00
George Rimar	d46753e421	[ELF] - Do --hash-style=both by default. Its PR34712, GNU linkers recently changed default values to "both" of "sysv". Patch do the same for all targets except MIPS, where .gnu.hash section is not yet supported. Code suggested by Rui Ueyama. Differential revision: https://reviews.llvm.org/D38407 llvm-svn: 315051	2017-10-06 09:37:44 +00:00
Rafael Espindola	1f0fe88a1b	Fix header location with PHDR. We were not subtracting its size, causing it to overlap with section data. Fixes PR34750. llvm-svn: 314440	2017-09-28 18:12:13 +00:00
George Rimar	347c70d782	[ELF] - Report orphan sections if -verbose given. When -verbose is specified, patch outputs names of each input orphan section assigned to output. Differential revision: https://reviews.llvm.org/D37517 llvm-svn: 314098	2017-09-25 09:41:32 +00:00
Rafael Espindola	23be5e8d70	Consider ForceAbsolute again in moveAbsRight. This patch goes back to considering ForceAbsolute in moveAbsRight, but only if the second argument is not already absolute. With this we can handle "foo + ABSOLUTE(foo)" and "ABSOLUTE(foo) + foo". llvm-svn: 313800	2017-09-20 19:24:57 +00:00
Rafael Espindola	01a409520b	Consider only A.Sec in moveAbsRight. The idea of this function is to simplify the implementation of binary operators like add. A value might be absolute because of an ABSOLUTE expression, but it still depends on the value of a section and we might not be able to evaluate it early. We should keep such values on the LHS, so that we can delay the evaluation. We can now handle both "1 + ABSOLUTE(foo)" and "ABSOLUTE(foo) + 1". llvm-svn: 313794	2017-09-20 18:56:08 +00:00
Rafael Espindola	8b250344e9	Add a special case for trivial alignment. Normally to find the offset of a value in a section, we have to compute the value since the alignment is defined on the final address. If the alignment is trivial, we can skip the value computation. This allows us to know the offset even in cases where we cannot yet know the value. llvm-svn: 313777	2017-09-20 17:43:44 +00:00
Rafael Espindola	e4bad83edb	Don't try to compute a value that is known to fail. We try to evaluate expressions early when possible, but it is not possible to evaluate them early if they are based on a section. Before we would get this wrong on ABSOLUTE expressions. llvm-svn: 313764	2017-09-20 16:42:56 +00:00
Rafael Espindola	aad64e0a1c	Tweak orphan section placement. Given a linker script that ends in .some_sec { ...} ; __stack_start = .; . = . + 0x2000; __stack_end = .; lld would put orphan sections like .comment before __stack_end, corrupting the intended meaning. The reason we don't normally move orphans past assignments to . is to avoid breaking rx_sec : { (rx_sec) } . = ALIGN(0x1000); / The RW PT_LOAD starts here*/ but in this case, there is nothing after and it seems safer to put the orphan section last. This seems to match bfd's behavior and is convenient for writing linker scripts that care about the layout of SHF_ALLOC sections, but not of any non SHF_ALLOC sections. llvm-svn: 313646	2017-09-19 17:29:58 +00:00
Rafael Espindola	a6acd23c53	Align addresses, not offsets. This fixes two more cases where we were aligning the offset in a section, instead of the final address. llvm-svn: 312983	2017-09-12 00:06:00 +00:00
Rafael Espindola	b7147ad3dd	Correct ALIGN expression when inside a section. When given foobar = ALIGN(., 0x100); my expectation from what the manual says is that the final address of foobar will be aligned. It seems that bfd aligns the offset in the section, which causes some odd results if the section is not 0x100 aligned. Gold aligns the address. This changes lld to align the final address. llvm-svn: 312979	2017-09-11 23:44:53 +00:00
Adrian Prantl	dcf890598c	Update testcases for llvm-dwarfdump command line interface change llvm-svn: 312976	2017-09-11 23:34:12 +00:00
James Henderson	4c2a3ec33b	[ELF] Fix issue with test when build path contains '@' '@' is a valid character in file paths, but the linker script tokenizer treats it as a separate token. This was leading to an unexpected test failure, on our local builds. This patch changes the test to quote the path to prevent this happening. An alternative would have been to add '@' to the list of "unquoted tokens" in ScriptLexer.cpp, but ld.bfd has the same behaviour as the current LLD. Reviewers: ruiu Differential Revision: https://reviews.llvm.org/D37689 llvm-svn: 312922	2017-09-11 15:55:54 +00:00
Dmitry Mikulin	1e30f07ce7	Currently lld creates a single section to collect all commons. There is no way to separate commons based on file name patterns. The following linker script construct does not work because commons are allocated before section placement is done and the only synthesized BssSection that holds all commons has no file associated with it: SECTIONS { .common_0 : { *file0.o(COMMON) }} This patch changes the allocation of commons to create a section per common symbol and let the section logic do the layout. Differential revision: https://reviews.llvm.org/D37489 llvm-svn: 312796	2017-09-08 16:22:43 +00:00
George Rimar	113a5ca029	[ELF] - Simplify and improve symbols.s testcase. There is no need to check anything excepr that symbol is not in output. Previously additional iformation like symbol values or flags were checked, that was not correct. For example if we would provide symbol with different value/visibility/type for case when should not provide symbol at all, testcase would not fail. llvm-svn: 312779	2017-09-08 09:31:01 +00:00
George Rimar	5f37541c73	[ELF] - Linkerscript: implement REGION_ALIAS. REGION_ALIAS(alias, region) Alias names can be added to existing memory regions created with the MEMORY command. Each name corresponds to at most one memory region. Differential revision: https://reviews.llvm.org/D37477 llvm-svn: 312777	2017-09-08 08:23:15 +00:00
Rui Ueyama	0440be4a42	Detect linker script INCLUDE cycles. Differential Revision: https://reviews.llvm.org/D37524 llvm-svn: 312656	2017-09-06 18:14:08 +00:00
George Rimar	c2dffe3aa0	[ELF] - Linkerscript: set load address correctly if MEMORY command used. Previously LLD did not calculate LMAOffset correctly when AT and MEMORY were used together. Patch fixes PR34407. Differential revision: https://reviews.llvm.org/D37469 llvm-svn: 312625	2017-09-06 09:35:09 +00:00
Petr Hosek	18821b60b0	[ELF] Generate symbol assignments for predefined symbols The problem with symbol assignments in implicit linker scripts is that they can refer synthetic symbols such as _end, _etext or _edata. The value of these symbols is currently fixed only after all linker script commands are processed, so these assignments will be using non-final and hence invalid value. Rather than fixing the symbol values after all command processing have finished, we instead change the logic to generate symbol assignment commands that set the value of these symbols while processing the commands, this ensures that the value is going to be correct by the time any reference to these symbol is processed and is equivalent to defining these symbols explicitly in linker script as BFD ld does. Differential Revision: https://reviews.llvm.org/D36986 llvm-svn: 312305	2017-09-01 02:23:31 +00:00
Dmitry Mikulin	f300ca211f	Currently lld uses base names of files to match against file patterns in linker script SECTION rules. This patch extends it to use a fully specified file name as it appears in --trace output to match agains, i.e, "<path>/<objname>.o" or "<path>/<libname>.a(<objname>.o)". Differential Revision: https://reviews.llvm.org/D37031 llvm-svn: 311713	2017-08-24 22:01:40 +00:00
Petr Hosek	b93c5b9f7e	[ELF] Don't output headers into a segment if there's no space for them Currently, LLD checks whether there's enough space for headers by checking if headers fit below the address of the first allocated section. However, that's always thue if the binary doesn't start at zero which means that LLD always emits a segment for headers, even if no other sections belong to that segment. This is a problem in cases when linker script is being used with a non-zero start address when we don't want to make the headers visible by not leaving enough space for them. This pattern is common in embedded programming but doesn't work in LLD. This patch changes the behavior of LLD in case when linker script is being to match the behavior of BFD ld and gold, which is to only place headers into a segment when they're covered by some output section. Differential Revision: https://reviews.llvm.org/D36256 llvm-svn: 311586	2017-08-23 18:44:34 +00:00
George Rimar	de2d1066ae	[ELF] - Do not report multiple errors for single one in ScriptLexer::setError. Previously up to 3 errors were reported at once, with patch we always will report only one, just like in other linker code. Differential revision: https://reviews.llvm.org/D37015 llvm-svn: 311537	2017-08-23 08:48:39 +00:00
George Rimar	5d0ea70ad5	[ELF] - Do not segfault when doing logical and/or operations on symbols that have no output sections. Previously we would crash on samples from testcase, because were trying to access zero pointer to output section. Differential revision: https://reviews.llvm.org/D36145 llvm-svn: 311311	2017-08-21 07:57:12 +00:00
George Rimar	7e5b0a5978	[ELF] - Don't segfault when accessing location counter inside MEMORY command. We would previously crash on next script: MEMORY { name : ORIGIN = .; } Patch fixes that. Differential revision: https://reviews.llvm.org/D36138 llvm-svn: 311073	2017-08-17 08:47:21 +00:00
Hafiz Abid Qadeer	6f1d954ef4	[ELF, LinkerScript] Support ! operator in linker script. Summary: This small patch adds the support for ! operator in linker scripts. Reviewers: ruiu, rafael Reviewed By: ruiu Subscribers: meadori, grimar, emaste, llvm-commits Differential Revision: https://reviews.llvm.org/D36451 llvm-svn: 310607	2017-08-10 15:25:47 +00:00
George Rimar	945cd31c4b	[ELF] - Linkerscript: disallow discarding COMMON. This patch restricts following construction: /DISCARD/ : { *(COMMON) } Previously LLD would crash. Differential revision: https://reviews.llvm.org/D36468 llvm-svn: 310554	2017-08-10 08:07:05 +00:00
George Rimar	d6bcde389a	[ELF] - Fix "--symbol-ordering-file doesn't work with linker scripts" This is PR33889, Patch adds support of combination of linkerscript and -symbol-ordering-file option. If no sorting commands are present in script inside section declaration and no --sort-section option specified, code uses sorting from ordering file if any exist. Differential revision: https://reviews.llvm.org/D35843 llvm-svn: 310045	2017-08-04 10:25:29 +00:00
George Rimar	5fb17128f7	[ELF] - Do not segfault if linkerscript tries to access Target too early. Following possible scripts triggered accessing to Target when it was not yet initialized (was nullptr). MEMORY { name : ORIGIN = DATA_SEGMENT_RELRO_END; } MEMORY { name : ORIGIN = CONSTANT(COMMONPAGESIZE); } Patch errors out instead. Differential revision: https://reviews.llvm.org/D36140 llvm-svn: 309953	2017-08-03 16:05:08 +00:00
George Rimar	60833f6e22	[ELF] - Do not crash when ALIGN/DATA_SEGMENT_ALIGN expression used with zero value. Previously we would crash when tried to ALIGN(0). Patch uses value 1 instead in this case, that looks to be consistent with GNU linkers and reasonable and simple behavior itself. Differential revision: https://reviews.llvm.org/D35942 llvm-svn: 309372	2017-07-28 09:27:49 +00:00
Meador Inge	b0e6229742	[ELF, LinkerScript] Memory region name parsing fix This patch fixes a small issue with respect to how memory region names are parsed on output section descriptions. For example, consider: .text : { (.text) } > rom That can also be written like: .text : { (.text) } >rom The latter form is accepted by GNU LD and is fairly common. Differential Revision: https://reviews.llvm.org/D35920 llvm-svn: 309191	2017-07-26 21:51:09 +00:00
George Rimar	f694d33f4c	[ELF] - Fix calculation of memory region offset. This is PR33714. Previously for each input section offset of memory region was incremented on a size of output section. That resulted in a wrong error message saying about overflow. Patch fixes that. Differential revision: https://reviews.llvm.org/D35803 llvm-svn: 308955	2017-07-25 08:29:29 +00:00
Dmitry Mikulin	97d6a80895	If user requested section alignment is greater than MaxPageSize, propagate it to segment headers correctly. Differential Revision: https://reviews.llvm.org/D35813 llvm-svn: 308930	2017-07-24 21:27:02 +00:00
Rafael Espindola	fae04ae44c	Handle a section being more aligned than a page size. llvm-svn: 308812	2017-07-22 00:17:57 +00:00
Rafael Espindola	ca5740d95a	Don't crash on an empty section with an ALIGN. llvm-svn: 308809	2017-07-22 00:00:51 +00:00
Petr Hosek	039fb8c296	[ELF] Align the value if needed when computing the expression Also add the test cases for the addition and subtraction both for the relative and absolute case. Differential Revision: https://reviews.llvm.org/D35346 llvm-svn: 308692	2017-07-20 23:11:47 +00:00
Rafael Espindola	a4148c6eed	Fix REQUIRES line. llvm-svn: 308385	2017-07-18 22:14:26 +00:00
Rafael Espindola	afbb7be6d5	Fix a crash. This is PR33821. What we really want to check in here is if the output section was created, not if the command was empty. llvm-svn: 308382	2017-07-18 21:46:27 +00:00
Jon Chesterfield	e0ca2ff070	[LLD] Mark a number of x86 only tests to require x86 Noticed while testing for an out of tree target. There are probably more tests that should be so marked. I'm not sure who owns these tests so I've added a few names I recognise from the recent history. With advice from probinson, ruiu, rafael and dramatically improved by davidb. Thank you all! Differential Revision: https://reviews.llvm.org/D34685 llvm-svn: 308335	2017-07-18 18:40:50 +00:00
Igor Kudrin	202a9f6817	[ELF] Fix writing the content of the .got section in a wrong place. In filling the .got sections, InputSection::OutSecOff was added twice when finding the position to apply a relocation: first time in InputSection::writeTo() and then in SectionBase::getOffset(). Differential revision: https://reviews.llvm.org/D34232 llvm-svn: 308003	2017-07-14 08:10:45 +00:00
George Rimar	8c804d9746	[ELF] - Allow moving location counter backward in some cases. Patch removes restriction about moving location counter backwards outside of output sections declarations. That may be useful for some apps relying on such scripts, known example is linux kernel. Differential revision: https://reviews.llvm.org/D34977 llvm-svn: 307794	2017-07-12 14:50:25 +00:00
Petr Hosek	52db9a4fe6	[ELF] Remove unused synthetic sections from script commands Script commands are processed before unused synthetic sections are removed. Therefore, if a linker script matches one of these sections it'll get emitted as an empty output section because the logic for removing unused synthetic sections ignores script commands which could have already matched and captured one of these sections. This patch fixes that by also removing the unused synthetic sections from the script commands. Differential Revision: https://reviews.llvm.org/D34800 llvm-svn: 307037	2017-07-03 15:49:25 +00:00
Andrew Ng	a020d3487e	[LLD][LinkerScript] Allow non-alloc sections to be assigned to segments. This patch makes changes to allow sections without the SHF_ALLOC bit to be assigned to segments in a linker script. The assignment of output sections to segments is performed in LinkerScript::createPhdrs. Previously, this function would bail as soon as it encountered an output section which did not have the SHF_ALLOC bit set, thus preventing any output section without SHF_ALLOC from being assigned to a segment. This restriction has now been removed from LinkerScript::createPhdrs and instead a check for SHF_ALLOC has been added to LinkerScript::adjustSectionsAfterSorting to not propagate program headers to sections without SHF_ALLOC which matches the behaviour of bfd linker scripts. Differential Revision: https://reviews.llvm.org/D34204 llvm-svn: 307013	2017-07-03 10:11:25 +00:00
Rafael Espindola	36f2edb6fd	Check the produced file instead of stderr. It is somewhat pointless to check that a specific error is not produced. That is already checked by the ld.lld exit value. Instead make the test a bit stronger by checking that the output file has the expected symbol and section. llvm-svn: 306496	2017-06-28 01:46:31 +00:00
Rafael Espindola	9c0395e39e	Prefer -Ttext over linker script values. I found this while trying to build u-boot. It uses -Ttext in combination with linker scripts. My first reaction was to change the linker scripts to have the correct value, but I found that it is actually quite convenient to have -Ttext take precedence. By having just .text : { *(.text) } In the script, they can define the text address in a single Makefile and pass it to ld with -Ttext and for the C code with -DFoo=value. Doing the same with linker scripts would require them to be generated during the build. llvm-svn: 305766	2017-06-20 01:51:50 +00:00
Andrew Ng	6e9f98c198	[LLD][LinkerScript] Add support for segment NONE. This patch adds support for segment NONE in linker scripts which enables the specification that a section should not be assigned to any segment. Note that GNU ld does not disallow the definition of a segment named NONE, which if defined, effectively overrides the behaviour described above. This feature has been copied. Differential Revision: https://reviews.llvm.org/D34203 llvm-svn: 305700	2017-06-19 15:28:58 +00:00
Rafael Espindola	4f1fca270a	Error when discarding .dynstr. We would crash before. llvm-svn: 305615	2017-06-16 23:53:36 +00:00
Rafael Espindola	656cc20f5b	Error when discarding .dynsym. We would crash instead before. llvm-svn: 305614	2017-06-16 23:50:09 +00:00
Rafael Espindola	2af64b0bf8	Error on trying to discard .dynamic. We would crash instead before. llvm-svn: 305613	2017-06-16 23:45:35 +00:00
Petr Hosek	40f2866a67	[ELF] Mark symbols referenced from linker script as live This is necessary to ensure that sections containing symbols referenced from linker scripts (e.g. in data commands) don't get GC'd. Differential Revision: https://reviews.llvm.org/D34195 llvm-svn: 305452	2017-06-15 05:34:31 +00:00
Rafael Espindola	dece28087e	Set non alloc section address to 0 earlier. Currently we do layout as if non alloc sections had an actual address and then set it to zero. This produces a few odd results where a symbol has an address that is inconsistent with the section address. The simplest way to fix it is probably to just set the address earlier. The behavior of bfd seems to be similar, but it only sets the non alloc section address is missing from the linker script or if the script has an explicit " : 0" setting the address of the output section (which the default script does). llvm-svn: 305323	2017-06-13 20:57:43 +00:00
Rafael Espindola	4c4becf83c	Also check section address in test. This shows an oddity of this output. While the section address is 0, the the symbol address is computed as if the section was allocatable. llvm-svn: 305250	2017-06-12 23:22:00 +00:00
Rui Ueyama	3271d3704a	Fix a bug in output section directive. Previously, it couldn't parse SECTIONS .text (0x1000) : { *(.text) } because "(" was interpreted as the begining of the "(NOLOAD)" directive. llvm-svn: 305006	2017-06-08 19:47:16 +00:00
George Rimar	fbb0463f39	[ELF] - Linkerscript: implement NOLOAD section type. This is PR32351 Each output section may have a type. The type is a keyword in parentheses. (https://sourceware.org/binutils/docs/ld/Output-Section-Type.html#Output-Section-Type) This patch support only one type, it is NOLOAD. If output section has such type, we force it to be SHT_NOBITS. More details are available on a review page. Differential revision: https://reviews.llvm.org/D33647 llvm-svn: 304925	2017-06-07 16:31:08 +00:00
George Rimar	990c9cb2bf	[ELF] - Do not merge relocation sections by name when using --emit-relocs. Previously we would merge relocation sections by name. That did not work in some cases, like testcase shows. Patch implements logic to merge relocation sections if their target sections were merged into the same output section. Differential revision: https://reviews.llvm.org/D33824 llvm-svn: 304886	2017-06-07 09:20:35 +00:00
George Rimar	41c7ab4a3d	[ELF] - Linkerscript: improved error reporting. When linking linux kernel LLD currently reports next errors: ld: error: unable to evaluate expression: input section .head.text has no output section assigned ld: error: At least one side of the expression must be absolute ld: error: At least one side of the expression must be absolute That does not provide file/line information and overall looks unclear. Patch adds location information to ExprValue and that allows to provide more clear error messages. Differential revision: https://reviews.llvm.org/D33943 llvm-svn: 304881	2017-06-07 08:54:43 +00:00
George Rimar	d4096140e3	[ELF] - Do not crash when linkerscript applies fill to .bss. I found that during visual inspection of code while wrote different patch. Script in testcase probably have nothing common with real life, but we segfault currently using it. If output section is known NOBITS, there is no need to create writers threads for doing nothing or proccess any filler logic that is useless here. We can just early return, that is what this patch do. DIfferential revision: https://reviews.llvm.org/D33646 llvm-svn: 304192	2017-05-30 05:48:09 +00:00
Petr Hosek	08dfd53269	[ELF] Filter out non InputSection members from InputSections InputSections may contain MergeInputSection members which trigger a segmentation fault when trying to cast them to InputSection. Differential Revision: https://reviews.llvm.org/D33628 llvm-svn: 304189	2017-05-30 05:17:58 +00:00
Petr Hosek	3c6de1a66c	[ELF] Use late evaluation for ALIGN in expression While the following expression is handled fine: PROVIDE_HIDDEN(newsym = oldsym + address); The following expression triggers an error because the expression is evaluated as absolute: PROVIDE_HIDDEN(newsym = ALIGN(oldsym, CONSTANT(MAXPAGESIZE)) + address); To avoid this error, we use late evaluation for ALIGN by making the alignment an attribute of the expression itself. Differential Revision: https://reviews.llvm.org/D33629 llvm-svn: 304185	2017-05-30 03:18:28 +00:00
Rafael Espindola	d23e9267a6	Order writable executable sections before writable ones. On SPARC, .plt is both writeable and executable. The current way sections are sorted means that lld puts it after .data/.bss. but it really needs to be close to .test to make sure branches into .plt don't overflow. I'd argue that because .bss is supposed to come last on all architectures, we should change the default sort order such that writable and executable sections come before sections that are just writeable. read-only executable sections should still come after sections that are just read-only of course. This diff makes this change. llvm-svn: 304008	2017-05-26 17:23:25 +00:00
Dmitry Mikulin	fd0c844fbb	Do not track section types of previous sections, always use PROGBITS for dummy sections. Fix for PR33029. llvm-svn: 303770	2017-05-24 16:48:31 +00:00
Rafael Espindola	5210141b07	Optimize orphan placement in a general way. We used to place orphans by just using compareSectionsNonScript. Then we noticed that since linker scripts can use another order, we should first try match the section to a given PT_LOAD. But there is nothing special about PT_LOAD. The same issue can show up for PT_GNU_RELRO for example. In general, we have to search for the most similar section and put the orphan next to it. Most similar being defined as how long they follow the same code path in compareSecitonsNonScript. That is what this patch does. We now compute a rank for each output section, with a bit for each branch in what was compareSectionsNonScript. With this findOrphanPos is now fully general and orphan placement can be optimized by placing every section with the same rank at once. The included testcase is a variation of many-sections.s that uses allocatable sections to avoid the fast path in the existing code. Without threads it goes form 46 seconds to 0.9 seconds. llvm-svn: 302903	2017-05-12 14:52:22 +00:00
George Rimar	f2cd0f9d05	[ELF] - Make text section location explicit in early-assign-symbol.s test. Testcase itself depends on .text section location, which was orphan earlier. Suggested by Rafael Espíndola llvm-svn: 302792	2017-05-11 11:53:49 +00:00
Petr Hosek	6b936bf6c7	[ELF] Define __ehdr_start unconditionally even when using linker script This behavior differs from the semantics implemented by GNU linkers which only define this symbol iff ELF headers are in the memory mapped segment. Differential Revision: https://reviews.llvm.org/D33019 llvm-svn: 302687	2017-05-10 16:20:33 +00:00
George Rimar	608cf67084	[ELF] - Don't segfault when assigning non-calculatable absolute symbol value. This is PR32664. Issue was revealed by linux kernel script which was: SECTIONS { . = (0xffffffff80000000 + ALIGN(0x1000000, 0x200000)); phys_startup_64 = ABSOLUTE(startup_64 - 0xffffffff80000000); .text : AT(ADDR(.text) - 0xffffffff80000000) { ..... *(.head.text) Where startup_64 is in .head.text. At the place of assignment to phys_startup_64 we can not calculate absolute value for startup_64 because .text section has no VA assigned. Two patches were prepared earlier to address this: D32173 and D32174. And in comments for D32173 was suggested not try to support this case, but error out. Differential revision: https://reviews.llvm.org/D32793 llvm-svn: 302668	2017-05-10 14:23:33 +00:00
Rui Ueyama	91b95b61f8	Add memory ORIGIN and LENGTH expression support Adds support for the ORIGIN and LENGTH linker script built in functions. ORIGIN(memory) Return the origin of the memory region LENGTH(memory) Return the length of the memory region Redo of D29775 for refactored linker script parsing. Patch by Robert Clarke Differential Revision: https://reviews.llvm.org/D32934 llvm-svn: 302564	2017-05-09 18:24:38 +00:00
George Rimar	d86a4e505b	[ELF] - Linkerscript: support combination of linkerscript and --compress-debug-sections. Previously it was impossible to use linkerscript with --compress-debug-sections because of assert failture: Assertion failed: isFinalized(), file C:\llvm\lib\MC\StringTableBuilder.cpp, line 64 Patch fixes the issue llvm-svn: 302413	2017-05-08 10:18:12 +00:00
Rafael Espindola	4aa2ef5b0e	Fix pr32816. When using linkerscripts we were trying to sort SHF_LINK_ORDER sections too early. Instead of always doing two runs of assignAddresses, record the section order in processCommands. llvm-svn: 301830	2017-05-01 20:32:39 +00:00
Rafael Espindola	4f013bb3b2	Create an OutputSection for each non-empty OutputSectionCommand. We were already pretty close, the one exception was when a name was reused in another SECTIONS directive: SECTIONS { .text : { (.text) } .data : { (.data) } } SECTIONS { .data : { (other) } } In this case we would create a single .data and magically output "other" while looking at the first OutputSectionCommand. We now create two .data sections. This matches what gold does. If we really want to create a single one, we should change the parser so that the above is parsed as if the user had written SECTIONS { .text : { (.text) } .data : { (.data) (other)} } That is, there should be only one OutputSectionCommand for .data and it would have two InputSectionDescriptions. By itself this patch makes the code a bit more complicated, but is an important step in allowing assignAddresses to operate just on the linker script. llvm-svn: 301484	2017-04-26 22:30:15 +00:00
Rafael Espindola	40d406534e	Use CHECK-NEXT in a test. This will simplify a future patch. llvm-svn: 301415	2017-04-26 15:05:10 +00:00
George Rimar	1022112d77	[ELF] - Linkerscript: make section with no content to be SHT_PROGBITS by default. Imagine next script: SECTIONS { BYTE(0x11); } Section content written to disk will be 0x11. Previous LLD behavior was to make this section SHT_NOBITS. What is not correct because section has content. ld.bfd makes such sections SHT_PROGBITS, this patch do the same. This fixes PR32537 Differential revision: https://reviews.llvm.org/D32016 llvm-svn: 300317	2017-04-14 09:37:00 +00:00
George Rimar	36a0b98e24	[ELF] - Cleanup of align.s testcase. NFC. llvm-svn: 300316	2017-04-14 09:30:50 +00:00
George Rimar	01aa795f82	[ELF] LinkerScript: Don't assign zero to all regular symbols This fixes an assertion `Align != 0u && "Align can't be 0."' in llvm::alignTo() when a linker script references a globally defined variable in an ALIGN() context. Patch by Alexander Richardson ! Differential revision: https://reviews.llvm.org/D31984 llvm-svn: 300315	2017-04-14 09:23:26 +00:00
Rui Ueyama	15732b718b	Fix FILL linker script command. FILL command doesn't need a semicolon. Fixes https://bugs.llvm.org/show_bug.cgi?id=32657 llvm-svn: 300280	2017-04-13 23:40:00 +00:00
Rui Ueyama	040af7deab	Allow expressions in MEMORY command. Previously, we allowed only integers in this context. Now you can write expressions there. LLD is now able to handle the following linker, for example. MEMORY { rom (rx) : ORIGIN = (1024 * 1024) } llvm-svn: 300131	2017-04-12 23:16:52 +00:00
Rui Ueyama	e9c9edf67b	Make intentional typos look more obvious. We do not check for similarities when handling unknown tokens in linker scripts, so "ORIGI" and "LENTH" are not good tokens as a test for unknown tokens, as I was tempted to "fix" them. llvm-svn: 300130	2017-04-12 23:16:33 +00:00
Rui Ueyama	732baa4d03	Remove redundant spaces. llvm-svn: 300129	2017-04-12 23:16:13 +00:00
Rui Ueyama	53e0fd33d4	Remove useless 0x prefixes. llvm-svn: 300128	2017-04-12 23:15:55 +00:00
Rui Ueyama	58c18f8f23	Make a few tests shorter. NFC. llvm-svn: 300120	2017-04-12 22:38:02 +00:00
James Henderson	9d9a663731	[ELF] Recommit r299635 to pad x86 executable sections with 0xcc This follows r299748 which fixed a latent bug the original commit exposed. llvm-svn: 299755	2017-04-07 10:36:42 +00:00
James Henderson	d983180778	Revert r299635 because it exposed a latent bug. llvm-svn: 299655	2017-04-06 15:22:58 +00:00
James Henderson	8dd4c06a77	[ELF] Pad x86 executable sections with 0xcc int3 instructions Executable sections should not be padded with zero by default. On some architectures, 0x00 is the start of a valid instruction sequence, so can confuse disassembly between InputSections (and indeed the start of the next InputSection in some situations). Further, in the case of misjumps into padding, padding may start to be executed silently. On x86, the "0xcc" byte represents the int3 trap instruction. It is a single byte long so can serve well as padding. This change switches x86 (and x86_64) to use this value for padding in executable sections, if no linker script directive overrides it. It also puts the behaviour into place making it easy to change the behaviour of other targets when desired. I do not know the relevant instruction sequences for trap instructions on other targets however, so somebody should add this separately. Because the old behaviour simply wrote padding in the whole section before overwriting most of it, this change also modifies the padding algorithm to write padding only where needed. This in turn has caused a small behaviour change with regards to what values are written via Fill commands in linker scripts, bringing it into line with ld.bfd. The fill value is now written starting from the end of the previous block, which means that it always starts from the first byte of the fill, whereas the old behaviour meant that the padding sometimes started mid-way through the fill value. See the test changes for more details. Reviewed by: ruiu Differential Revision: https://reviews.llvm.org/D30886 Bugzilla: http://bugs.llvm.org/show_bug.cgi?id=32227 llvm-svn: 299635	2017-04-06 09:29:08 +00:00
Rui Ueyama	6bd3822007	Use uint64_t to keep file size even on 32-bit machines. If an output file is too large for 32-bit, we should report an error. llvm-svn: 299592	2017-04-05 21:37:09 +00:00
Evgeniy Stepanov	1df7d9b60b	Change section flag character for SHF_LINK_ORDER to "o". See matching MC change in https://reviews.llvm.org/D31554. llvm-svn: 299480	2017-04-04 22:35:16 +00:00
Rui Ueyama	b87602032a	Change the error message format for undefined symbols. Previously, undefined symbol errors are one line like this and wasn't easy to read. /ssd/clang/bin/ld.lld: error: /ssd/llvm-project/lld/ELF/Writer.cpp:207: undefined symbol 'lld:🧝:EhFrameSection<llvm::object::ELFType<(llvm::support::endianness)0, true> >::addSection(lld:🧝:InputSectionBase*)' This patch make it more structured like this. bin/ld.lld: error: undefined symbol: lld:🧝:EhFrameSection<llvm::object::ELFType<(llvm::support::endianness)0, true> >>> Referenced by Writer.cpp:207 (/ssd/llvm-project/lld/ELF/Writer.cpp:207) >>> Writer.cpp.o in archive lib/liblldELF.a Discussion thread: http://lists.llvm.org/pipermail/llvm-dev/2017-March/111459.html Differential Revision: https://reviews.llvm.org/D31481 llvm-svn: 299097	2017-03-30 19:13:47 +00:00
Petr Hosek	30f16b2339	[ELF] Allow references to reserved symbols in linker scripts This requires collectign all symbols referenced in the linker script and adding them to symbol table as undefined symbol. Differential Revision: https://reviews.llvm.org/D31147 llvm-svn: 298577	2017-03-23 03:52:34 +00:00
Rafael Espindola	7ba5f47eb8	Handle & and \| of non abs values. Handling & in particular is probably important because of its use in aligning addresses. llvm-svn: 298096	2017-03-17 14:55:36 +00:00
Rafael Espindola	5f08a1dca8	Refuse to add two non absolute symbols. Since there is no way to produce the correct answer at runtime, it is probably better to just err. llvm-svn: 298094	2017-03-17 14:51:07 +00:00
Rafael Espindola	f2115f04c8	Support non abs values in the rhs of +. llvm-svn: 298088	2017-03-17 13:45:36 +00:00
Rafael Espindola	5d5a267830	Add a test that now passes. llvm-svn: 298083	2017-03-17 13:19:15 +00:00
Rafael Espindola	72dc195d78	Change our linker script expr representation. This fixes pr32031 by representing the expressions results as a SectionBase and offset. This allows us to use an input section directly instead of getting lost trying to compute an offset in an outputsection when not all the information is available yet. This also creates a struct to represent the value of and expression, allowing the expression itself to be a simple typedef. I think this is easier to read and will make it easier to extend the expression computation to handle more complicated cases. llvm-svn: 298079	2017-03-17 13:05:04 +00:00
Petr Hosek	02ad516b2e	Support ABSOLUTE on the right hand side in linker scripts This also requires postponing the assignment the assignment of symbols defined in input linker scripts since those can refer to output sections and in case we don't have a SECTIONS command, we need to wait until all output sections have been created and assigned addresses. Differential Revision: https://reviews.llvm.org/D30851 llvm-svn: 297802	2017-03-15 03:33:23 +00:00
Eugene Leviant	5784e96f5c	[ELF] Fix LMA offset calculation Differential revision: https://reviews.llvm.org/D30832 llvm-svn: 297713	2017-03-14 08:57:09 +00:00
Eugene Leviant	30c1b436ad	[ELF] Fix crash when .eh_frame(_hdr) is discarded lld crashes when .eh_frame or .eh_frame_hdr section is discarded in linker script and there is no PHDRS directive. Differential revision: https://reviews.llvm.org/D30885 llvm-svn: 297712	2017-03-14 08:49:09 +00:00
Eugene Leviant	2968547997	[ELF] Fix error reporting for synthetic sections Synthetic sections don't belong to any input file, but still they are input sections. Whenever problem occurs with relocations in these sections lld crashes in error reporting, trying to print input file name. Differential revision: https://reviews.llvm.org/D30889 llvm-svn: 297711	2017-03-14 08:33:45 +00:00
Petr Hosek	7b79321e88	[ELF] Propely handle .eh_frame in linker scripts Using .eh_frame input section pattern in linker script currently causes a crash; this is because .eh_frame input sections require special handling since they're all combined into a synthetic section rather than regular output section. Differential Revision: https://reviews.llvm.org/D30627 llvm-svn: 297501	2017-03-10 20:00:42 +00:00
Rafael Espindola	692b2f88d3	Fully precise gc handling of __start and __stop symbols. This puts us at parity with bfd, which could already gc this case. I noticed the sections not being gced when linking a modified freebsd kernel. A section that was not gced and not mentioned in the linker script would end up breaking the expected layout. Since fixing the gc is relatively simple and an improvement, that seems better than trying to hack the orphan placement code. There are 173 input section in the entire link whose names are valid C identifiers, so this is probably not too performance critical. llvm-svn: 297049	2017-03-06 18:48:18 +00:00
Rafael Espindola	e937a828c4	Simplify test by producing an executable. llvm-svn: 296786	2017-03-02 19:19:59 +00:00
Rafael Espindola	4368bdb270	Make gc a bit more aggressive. We were not gcing any section whose name was a C identifier. Both gold and bfd only keep those if they are used. To avoid having to create the __start/__stop symbols early or doing string lookups in resolvedReloc, this patch just looks for undefined symbols __start/__stop to decide if a section is needed or not. llvm-svn: 296723	2017-03-02 01:50:34 +00:00
Rafael Espindola	b691ccf0a5	Revert "Add terminator to .eh_frame sections" This reverts commit r296378. I am pretty sure this is incorrect. In particular, for just .cfi_startproc nop .cfi_endproc We now add an extra 4 zeros that neither bfd nor gold add. llvm-svn: 296503	2017-02-28 18:55:08 +00:00
Rui Ueyama	1720ef1343	Add terminator to .eh_frame sections Patch by Mark Kettenis. Currenlty ld.lld does not add a terminator (a CIE with its length field set to zero) to the .eh_frame sections it generates. While the relevant standards (the AMD64 SysV ABI and the Linux LSB) are not explicit about this, such a terminator is expected by some unwinder implementations and seems to be always emitted by ld.bfd. In addition to that, the Linux LSB https://refspecs.linuxfoundation.org/LSB_5.0.0/LSB-Core-generic/LSB-Core-generic/ehframechpt.html#EHFRAME explicitly says that The .eh_frame section shall contain 1 or more Call Frame Information (CFI) records. Currently, if the .eh_frame sections of the input files only contain terminators, ld.lld emits a zero=sized .eh_frame section which clearly doesn't meet that requirement. The diff makes sure a terminator gets added to each .eh_frame section and adjusts all the relevant tests to account for that. An additional test isn't needed as these adjustments mean that the existence of the terminator is tested for by several tests already. Differential Revision: https://reviews.llvm.org/D30335 llvm-svn: 296378	2017-02-27 20:44:59 +00:00
Petr Hosek	5e51f7d24e	[ELF] Insert linkerscript symbols directly into symbol table This change exposes the symbol table insert method and uses it to insert the linkerscript defined symbols directly into the symbol table to avoid unnecessarily pulling the object out of an archive. Differential Revision: https://reviews.llvm.org/D30224 llvm-svn: 295780	2017-02-21 22:32:51 +00:00
George Rimar	78ef645f94	[ELF] - Do not segfault when using --gc-sections with linker script Patch fixes PR32024. Sections that were not marked as Live has null output section. Previously we tried to access that field and segfaulted. Differential revision: https://reviews.llvm.org/D30188 llvm-svn: 295727	2017-02-21 15:46:43 +00:00
George Rimar	6d8957b979	[ELF] - Shortify at-addr.s testcase. llvm-svn: 295724	2017-02-21 15:10:30 +00:00
George Rimar	ae4761c186	[ELF] - Postpone evaluation of LMA offset. Previously we evaluated the values of LMA incorrectly for next cases: .text : AT(ADDR(.text) - 0xffffffff80000000) { ... } .data : AT(ADDR(.data) - 0xffffffff80000000) { ... } .init.begin : AT(ADDR(.init.begin) - 0xffffffff80000000) { ... } Reason was that we evaluated offset when VA was not assigned. For case above we ended up with 3 loads that has similar LMA and it was incorrect. That is critical for linux kernel. Patch updates the offset after VA calculation. That fixes the issue. Differential revision: https://reviews.llvm.org/D30163 llvm-svn: 295722	2017-02-21 15:08:18 +00:00
George Rimar	2ee2d2dcb5	[ELF] - Improve diagnostic messages for move location counter errors. Previously LLD would error out just "ld.lld: error: unable to move location counter backward" What does not really reveal the place of issue, Patch adds location to the output. Differential revision: https://reviews.llvm.org/D30187 llvm-svn: 295720	2017-02-21 14:50:38 +00:00
George Rimar	60f1fe8438	[ELF] - Make ASSERT() return Dot instead of evaluated value. Previously ASSERT we implemented returned expression value. Ex: . = ASSERT(0x100); would set Dot value to 0x100 Form of assert when it is assigned to Dot was implemented for compatibility with very old GNU ld which required it. Some scripts in the wild, including linux kernel scripts use such ASSERTs at the end for doing different checks. Currently we fail with "unable to move location counter backward" for such scripts. Patch changes ASSERT to return location counter value to fix that. Differential revision: https://reviews.llvm.org/D30171 llvm-svn: 295703	2017-02-21 07:33:38 +00:00
George Rimar	858a659a4f	[ELF] - Added support of linkerscript's "/DISCARD/" for --emit-relocs Previously LLD crashed on on provided testcases because "/DISCARD/" was not supported. Patch implements that. After this I think there is no known issues with --emit-relocs implementation required for linux kernel linking. Differential revision: https://reviews.llvm.org/D29273 llvm-svn: 295488	2017-02-17 19:46:47 +00:00
Rafael Espindola	3773bcac55	Fix --print-gc-sections with linker scripts. Before it would never print anything. Thanks to George Rimar for pointing it out. llvm-svn: 295485	2017-02-17 19:37:30 +00:00
Rafael Espindola	ecbfd871f9	Don't print DISCARD sections as gced. This is a small difference I noticed to gold and bfd. When given --print-gc-sections, we print sections a linkerscript marks DISCARD. The other linkers don't. llvm-svn: 295467	2017-02-17 17:35:07 +00:00
Rafael Espindola	679828ff92	Diagnose another case of the location counter moving backwards. This case should be possible to handle, but it is hard: * In order to create program headers correctly, we have to scan the sections in the order they are in the file. * To find that order, we have to "execute" the linker script. * The linker script can contain SIZEOF_HEADERS. So to support this we have to start with a guess of how many headers we need (3), run the linker script and try to create the program headers. If it turns out we need more headers, we run the script again with a larger SIZEOF_HEADERS. Also, running the linker script depends on knowing the size of the sections, so we have to finalize them. But creating the program headers can change the value stored in some sections, so we have to split size finalization and content finalization. Looks like the last part is also needed for range extension thunks, so we might support this at some point. For now just report an error instead of producing broken files. llvm-svn: 295458	2017-02-17 16:26:13 +00:00
Rafael Espindola	4cd7352c4f	Reject moving the location counter backwards. We were only checking when the assignment was inside a section. llvm-svn: 295454	2017-02-17 16:01:51 +00:00
George Rimar	505ac8dc41	[ELF] - Do not crash when discarding sections that are referenced by others. SHF_LINK_ORDER sections adds special ordering requirements. Such sections references other sections. Previously we would crash if section that other were referenced to was discarded by script. Patch fixes that by discarding all dependent sections in that case. It supports chained dependencies, testcase is provided. Differential revision: https://reviews.llvm.org/D30033 llvm-svn: 295332	2017-02-16 16:06:13 +00:00
Rafael Espindola	908a3d3420	Ignore relocation sections in linker scripts. Unfortunately, the common way of writing linker scripts seems to be to get the output of ld.bfd --verbose and edit it a bit. Also unfortunately, the bfd default script contains things like .rela.dyn : { *(... .rela.data ...) } but bfd actually ignores that for -emit-relocs, so we have to do the same. llvm-svn: 295324	2017-02-16 14:36:09 +00:00
Rui Ueyama	731a66ae98	Apply different tokenization rules to linker script expressions. The linker script lexer is context-sensitive. In the regular context, arithmetic operator characters are regular characters, but in the expression context, they are independent tokens. This afects how the lexer tokenizes "3*4", for example. (This kind of expression is real; the Linux kernel uses it.) This patch defines function `maybeSplitExpr`. This function splits the current token into multiple expression tokens if the lexer is in the expression context. Differential Revision: https://reviews.llvm.org/D29963 llvm-svn: 295225	2017-02-15 19:58:17 +00:00
Rui Ueyama	a4601b5d7a	Simplify operator tests. llvm-svn: 295222	2017-02-15 19:36:01 +00:00
Rui Ueyama	fd5edff8d6	Rename a test as they are tests for operators. llvm-svn: 295221	2017-02-15 19:35:41 +00:00
George Rimar	4e01c3e8cd	[ELF] - Linkerscript - fix handling of OUTPUT_ARCH command. OUTPUT_ARCH command can contain architecture values separated with ":", like: OUTPUT_ARCH(i386:x86-64) We did not support that, because got 3 lexer tokens here after recent changes. This trivial patch fixes the issue, now whole expression inside OUTPUT_ARCH is just ignored. Differential revision: https://reviews.llvm.org/D29640 llvm-svn: 294432	2017-02-08 09:59:06 +00:00
George Rimar	ffc9e41ff4	[ELF] - Rename the test. NFC. Addressing post commit comments, it do nothing relative with orphans. llvm-svn: 294429	2017-02-08 09:28:50 +00:00
Petr Hosek	165088aa5c	[ELF] Handle output section alignment in linker scripts LLD already parses ALIGN expression to specifiy alignment for output sections in linker scripts but it never applies the alignment to the output section. This change handles that. Differential Revision: https://reviews.llvm.org/D29689 llvm-svn: 294374	2017-02-07 23:42:31 +00:00
George Rimar	c6cf1f1f02	[ELF] - Assign proper values for DefinedSynthetic symbols attached to non-allocatable sections. DefinedSynthetic symbols are attached to sections, for the case when such symbol was attached to non-allocated section, we calculated its value incorrectly. We subtracted Body->Section->Addr, but non-allocatable sections should have zero VA in output and therefore result value was wrong. And at the same time we have Body->Section->Addr != 0 for them internally because use it for calculation of section size. Patch fixes calculation of such symbols values. Differential revision: https://reviews.llvm.org/D29653 llvm-svn: 294322	2017-02-07 17:51:35 +00:00
George Rimar	a5e4119184	[ELF] - Removed excessive check call from outputarch.s. NFC. For case when LLD should error out, llm-readobj was called, what worked because argument was an output from first test run. llvm-svn: 294310	2017-02-07 15:09:07 +00:00
Rafael Espindola	06f4743a48	Handle symbol assignments before the first section switch. We now create a dummy section with index 1 before processing the linker script. Thanks to George Rimar for finding the bug and providing the initial testcase. llvm-svn: 294252	2017-02-06 22:21:46 +00:00
Rafael Espindola	2532431332	Stop propagating Entsize. Now that we combine multiple synthetic merge section into one output section there is no point in trying to propagate a value. llvm-svn: 294048	2017-02-03 21:29:51 +00:00
Rafael Espindola	4524268c02	Handle numbers followed by ":" in linker scripts. This is a fix for Bugzilla 31813. The problem is that the tokenizer does not create a separate token for ":" unless there's white space before it. Changed it to always create a token for ":" and reworked some logic that relied on ":" being attached to some tokens like "global:" and "local:". llvm-svn: 294006	2017-02-03 13:24:01 +00:00
Rafael Espindola	9e9754b520	Replace MergeOutputSection with a synthetic section. With a synthetic merge section we can have, for example, a single .rodata section with stings, fixed sized constants and non merge constants. I can be simplified further by not setting Entsize, but that is probably better done is a followup patch. This should allow some cleanup in the linker script code now that every output section command maps to just one output section. llvm-svn: 294005	2017-02-03 13:06:18 +00:00
George Rimar	cc4d3e5745	[ELF] - Linkerscript: properly mark minus expression with non-absolute flag This is alternative to D28857 which was incorrect. One of linux scripts contains: vvar_start = . - 2 * (1 << 12); vvar_page = vvar_start; vvar_vsyscall_gtod_data = vvar_page + 128; Previously we did not mark first expression as non-absolute, though it contains location counter. And LLD failed with error: relocation R_X86_64_PC32 cannot refer to absolute symbol This patch should fix the issue, and opens road for doing the same for other operators (though not clear if that is needed). Differential revision: https://reviews.llvm.org/D29332 llvm-svn: 293748	2017-02-01 09:01:16 +00:00
George Rimar	2fe079233b	[ELF] - Linkerscript: do not fail on additional semicolons in linkerscript. Linux kernel linkerscript contains additional semicolon (last line): .apicdrivers : AT(ADDR(.apicdrivers) - LOAD_OFFSET) { __apicdrivers = .; (.apicdrivers); I checked that both gold and bfd are able to parse something like: .text : { ;;(.text);;S = 0;; } } Patch do the same. Differential revision: https://reviews.llvm.org/D29276 llvm-svn: 293612	2017-01-31 08:50:11 +00:00
Rafael Espindola	fe12450e8e	Revert commits r293276 and r293278. [ELF] Fixed formatting. NFC and [ELF] Bypass section type check Differential revision: https://reviews.llvm.org/D28761 They do the opposite of what was asked for in the code review. llvm-svn: 293320	2017-01-27 18:39:30 +00:00
Eugene Leviant	8b7cadcf96	[ELF] Bypass section type check Differential revision: https://reviews.llvm.org/D28761 llvm-svn: 293276	2017-01-27 11:01:43 +00:00
Meador Inge	b889744e5b	[LinkerScript] Implement `MEMORY` command As specified here: * https://sourceware.org/binutils/docs/ld/MEMORY.html#MEMORY There are two deviations from what is specified for GNU ld: 1. Only integer constants and not constant expressions are allowed in `LENGTH` and `ORIGIN` initializations. 2. The `I` and `L` attributes are not implemented. With (1) there is currently no easy way to evaluate integer only constant expressions. This can be enhanced in the future. With (2) it isn't clear how these flags map to the `SHF_*` flags or if they even make sense for an ELF linker. Differential Revision: https://reviews.llvm.org/D28911 llvm-svn: 292875	2017-01-24 02:34:00 +00:00
George Rimar	23be5d94eb	[ELF] - Committed missing ld.ldd invocation to constructor.s Thanks to Meador Ingle for noticing. llvm-svn: 292799	2017-01-23 16:55:13 +00:00
George Rimar	8e2eca229e	[ELF] - Linkerscripts: ignore CONSTRUCTORS in output section declaration. It is used in linux kernel script: http://lxr.free-electrons.com/source/arch/x86/kernel/vmlinux.lds.S#L140 Though CONSTRUCTORS is ignored for ELF. Differential revision: https://reviews.llvm.org/D28951 llvm-svn: 292777	2017-01-23 09:36:19 +00:00
Rafael Espindola	0347c0b874	Don't create a bogus PT_PHDR if we don't allocate the headers. llvm-svn: 292644	2017-01-20 20:46:15 +00:00
George Rimar	60aed44387	[ELF] - Do not crash when assign common symbol's values in script Found that during attempts of linking linux kernel, previously we partially duplicated code from getOutputSection(), and it missed commons symbol case. Differential revision: https://reviews.llvm.org/D28903 llvm-svn: 292594	2017-01-20 09:45:36 +00:00
George Rimar	7185a1acec	[ELF] - Support optional comma after output section command. I found this when tried to link linux kernel with LLD: https://github.com/torvalds/linux/blob/master/arch/x86/entry/vdso/vdso-layout.lds.S#L86 Output section command can have optional comma at the end: .text : { (.text) } :text =0x90909090, It was documented about 3 years ago for binutils: https://sourceware.org/ml/binutils/2014-04/msg00045.html Differential revision: https://reviews.llvm.org/D28803 llvm-svn: 292225	2017-01-17 15:32:12 +00:00
George Rimar	4b0253af7e	[ELF] - Fix for huge-temporary-file.s Removed mentioning of checks. Sorry for noise. llvm-svn: 292221	2017-01-17 14:06:44 +00:00
George Rimar	ad530b2ac7	[ELF] - Added huge-temporary-file.s testcase. Inputs shown in that testcase previously created a huge temporarily file under 32 bits. It was fixed by D28107. During review was suggested to add a testcase even without CHECKs for documentation purposes. Patch do that. llvm-svn: 292220	2017-01-17 14:04:16 +00:00
George Rimar	1e799942b3	[ELF] - Move the addition of synthetics from addPredefinedSections() These were 3 last synthetics that were added in addPredefinedSections() instead of createSyntheticSections(). Now it is possible to move addition to correct common place. Also patch fixes testcase which discards .shstrtab, by restricting doing that. Differential revision: https://reviews.llvm.org/D28561 llvm-svn: 291908	2017-01-13 16:18:15 +00:00
Peter Collingbourne	628ec9f193	ELF: Place relro sections after non-relro sections in r/w segment. This is in preparation for my next change, which will introduce a relro nobits section. That requires that relro sections appear at the end of the progbits part of the r/w segment so that the relro nobits section can appear contiguously. Because of the amount of churn required in the test suite, I'm making this change separately. llvm-svn: 291523	2017-01-10 01:21:30 +00:00
Meador Inge	8f1f3c40f6	[ELF] Allow defined symbols to be assigned from linker script This patch allows for linker scripts to assign a new value to a symbol that is already defined (either in an object file or the linker script itself). llvm-svn: 291459	2017-01-09 18:36:57 +00:00
Rafael Espindola	337139830e	Change which input sections we concatenate After Mark's patch I was wondering what was the rationale for the ELF spec requiring us to merge only sections with matching flags and types. I tried emailing https://groups.google.com/forum/#!forum/generic-abi, but looks like my emails are not being posted (the list is probably moderated). I emailed Cary Coutant instead. Cary pointed out that the section was a late addition and didn't got the scrutiny it deserved. Given that and the problems found by implementing the letter of the standard, I propose changing lld to merge all sections with the same name and issue errors if the types or some critical flags are different. This should allow an unmodified firefox linked with lld to run. This also merges some code with the linkerscript path. llvm-svn: 291107	2017-01-05 14:20:35 +00:00
Eugene Leviant	f6aeed3624	[ELF] Linkerscript: print location of undefined symbol usage Differential revision: https://reviews.llvm.org/D27194 llvm-svn: 290339	2016-12-22 13:13:12 +00:00
George Rimar	d450065308	[ELF] - Linkerscript: Fall back to search paths when INCLUDE not found From https://sourceware.org/binutils/docs/ld/File-Commands.html: The file will be searched for in the current directory, and in any directory specified with the -L option. Patch done by Alexander Richardson. Differential revision: https://reviews.llvm.org/D27831 llvm-svn: 290247	2016-12-21 09:42:25 +00:00
Rui Ueyama	5d804dc8f7	[ELF] - Linkerscript: Implement two argument version of ALIGN() Fixes http://llvm.org/PR31129 Patch by Alexander Richardson! Differential Revision: https://reviews.llvm.org/D27848 llvm-svn: 289968	2016-12-16 18:19:35 +00:00
George Rimar	b86448c669	[ELF] - Accept --sort-section=xxx command form. --sort-section=xxx is the same as --sort-section xxx, was found in one of FreeBSD ports. llvm-svn: 289938	2016-12-16 11:59:52 +00:00
George Rimar	14460e0216	[ELF] - Do not crash when move location counter backward. PR31335 shows that we do that in next case: SECTIONS { .text 0x2000 : {. = 0x100 ; *(.text) } } though documentations says that "If . is used inside a section description however, it refers to the byte offset from the start of that section, not an absolute address. " looks does not work as documented in bfd (as mentioned in comments for PR31335). Until we find out the expected behavior was suggested at least not to 'crash', what we do after trying to generate huge file. Differential revision: https://reviews.llvm.org/D27712 llvm-svn: 289782	2016-12-15 07:27:28 +00:00
Meador Inge	b06147db0c	[ELF] Fix test case thinko from r289152 It was pointed out in a post-commit review that the tests were structured oddly. Fixed thusly. llvm-svn: 289278	2016-12-09 21:51:37 +00:00
Rui Ueyama	df41b13b09	Remove `REQUIRES: shell` hack to workaround an echo issue. These tests are disabled on Windows, but they seem to work just fine now, so I'll enable them. llvm-svn: 289251	2016-12-09 18:49:37 +00:00
Meador Inge	95c7d8d2a7	[ELF] Allow output section data commands to take expressions The current implementation of the output section data store commands can only handle integer literals, but it should really handle arbitrary expressions [1]. This commit fixes that. [1] https://sourceware.org/binutils/docs-2.27/ld/Output-Section-Data.html#Output-Section-Data Differential Revision: https://reviews.llvm.org/D27561 llvm-svn: 289152	2016-12-08 23:21:30 +00:00
Rui Ueyama	89ccd0f31c	Split linkerscript.s into small test files. linkerscript.s is the first test file for linker script, and at the moment it contains all tests for linker scripts. Now that test file doesn't make sense. linkerscript2.s was just badly named. Renamed searchdir.s. llvm-svn: 289148	2016-12-08 22:36:12 +00:00
Rui Ueyama	3c04f8d790	Print a warning message if ENTRY() symbol is not found. llvm-svn: 289146	2016-12-08 22:26:31 +00:00
Rafael Espindola	a86a9c6fad	Use the correct MaxPageSize. Found by inspection. llvm-svn: 288970	2016-12-07 20:10:43 +00:00
George Rimar	a2a32c2cc8	[ELF] - Teach LLD to recognize PT_OPENBSD_BOOTDATA Minor patch to fix PR31288 OpenBSD commit: `d39116912b` Differential revision: https://reviews.llvm.org/D27458 llvm-svn: 288832	2016-12-06 17:57:42 +00:00
Eugene Leviant	2a942c4b45	[ELF] Print file:line for unknown PHDR error Differential revision: https://reviews.llvm.org/D27335 llvm-svn: 288678	2016-12-05 16:38:32 +00:00
George Rimar	3fb5a6dc9e	[ELF] - Add support of proccessing of the rest allocatable synthetic sections from linkerscript. This change continues what was started by D27040 Now all allocatable synthetics should be available from script side. Differential revision: https://reviews.llvm.org/D27131 llvm-svn: 288150	2016-11-29 16:05:27 +00:00
Eugene Leviant	ed30ce7ae4	[ELF] Print file:line for 'undefined section' errors Differential revision: https://reviews.llvm.org/D27108 llvm-svn: 288019	2016-11-28 09:58:04 +00:00
Rafael Espindola	8e67000f1a	Always create a PT_ARM_EXIDX if needed. Unfortunatelly PT_ARM_EXIDX is special. There is no way to create it from linker scripts, so we have to create it even if PHDRS is used. This matches bfd and is required for the lld output to survive bfd's strip. llvm-svn: 288012	2016-11-28 00:40:21 +00:00
Rafael Espindola	5fcc99c27d	Also skip regular symbol assignment at the start of a script. Unfortunatelly some scripts look like kernphys = ... . = .... and the expectation in that every orphan section is after the assignment. llvm-svn: 287996	2016-11-27 09:44:45 +00:00

... 3 4 5 6 7 ...

630 Commits