llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	98afa2c1f1	[ELF] De-template PltSection::addEntry. NFC	2019-12-16 11:03:20 -08:00
Fangrui Song	f036f1cc85	[ELF] Delete redundant isLive() check. NFC	2019-12-15 21:59:55 -08:00
Vlad Tsyrklevich	17063abd1e	Revert "[ELF] Allow getErrPlace() to work before Out::bufferStart is set" This reverts commit `2bbd32f5e8`, it was causing UBSan failures like the following: lld/ELF/Target.cpp:103:41: runtime error: applying non-zero offset 24 to null pointer	2019-12-13 09:43:51 -08:00
Fangrui Song	69d10d282e	[ELF] Update st_size when merging a common symbol with a shared symbol When a common symbol is merged with a shared symbol, increase st_size if the shared symbol has a larger st_size. At runtime, the executable's symbol overrides the shared symbol. The shared symbol may be created from common symbols in a previous link. This rule makes sure we pick the largest size among all common symbols. This behavior matches GNU ld. See https://sourceware.org/bugzilla/show_bug.cgi?id=25236 for discussions. A shared symbol does not hold alignment constraints. Ignore the alignment update. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D71161	2019-12-13 09:23:36 -08:00
Alex Richardson	2bbd32f5e8	[ELF] Allow getErrPlace() to work before Out::bufferStart is set Summary: So far it seems like the only test affected by this change is the one I recently added for R_MIPS_JALR relocations since the other test cases that use this function early (unknown-relocation-*) do not have a valid input section for the relocation offset. Reviewers: ruiu, grimar, MaskRay, espindola Reviewed By: ruiu, MaskRay Subscribers: emaste, sdardis, jrtc27, atanasyan, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70659	2019-12-13 12:19:55 +00:00
Rui Ueyama	69da7e29de	Revert an accidental commit `af5ca40b47`	2019-12-13 15:17:40 +09:00
Rui Ueyama	6faf8bdcc4	Update the man page Add a description about the compression level of the debug info. Differential Revision: https://reviews.llvm.org/D71385	2019-12-13 15:15:00 +09:00
Rui Ueyama	af5ca40b47	temporary	2019-12-13 14:35:03 +09:00
Fangrui Song	ba8149e27d	[ELF] Add a comment to handleSectionGroup(). NFC Apply suggestion in https://reviews.llvm.org/D71157#1780834 Reviewed By: grimar, ruiu Differential Revision: https://reviews.llvm.org/D71388	2019-12-12 09:23:59 -08:00
Sam Clegg	881d877846	[WebAssembly] Add new `export_name` clang attribute for controlling wasm export names This is equivalent to the existing `import_name` and `import_module` attributes which control the import names in the final wasm binary produced by lld. This maps the existing This attribute currently requires a string rather than using the symbol name for a couple of reasons: 1. Avoid confusion with static and dynamic linking which is based on symbol name. Exporting a function from a wasm module using this directive is orthogonal to both static and dynamic linking. 2. Avoids name mangling. Differential Revision: https://reviews.llvm.org/D70520	2019-12-11 11:54:57 -08:00
Fangrui Song	5a3a9e9927	[ELF][AArch64] Rename --force-bti to -z force-bti and --pac-plt to -z pac-plt Summary: The original design used --foo but the upstream complained that ELF only options should be -z foo. See https://sourceware.org/ml/binutils/2019-04/msg00151.html https://sourceware.org/git/?p=binutils-gdb.git;a=commitdiff;h=8bf6d176b0a442a8091d338d4af971591d19922c made the rename. Our --force-bti and --pac-plt implement the same functionality, so it seems wise to be consistent with GNU ld. Reviewed By: peter.smith Subscribers: emaste, arichardson, kristof.beyls, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71327	2019-12-11 09:26:32 -08:00
Peter Smith	86d24193a9	[LLD][ELF][AArch64][ARM] When errata patching, round thunk size to 4KiB. On some edge cases such as Chromium compiled with full instrumentation we have a .text section over twice the size of the maximum branch range and the instrumented code generation containing many examples of the erratum sequence. The combination of Thunks and many erratum sequences causes finalizeAddressDependentContent() to not converge. We end up with: start - Thunk Creation (disturbs addresses after thunks, creating more patches) - Patch Creation (disturbs addresses after patches, creating more thunks) - goto start In most images with few thunks and patches the mutual disturbance does not cause convergence problems. As the .text size and number of patches go up the risk increases. A way to prevent the thunk creation from interfering with patch creation is to round up the size of the thunks to a 4KiB boundary when the erratum patch is enabled. As the erratum sequence only triggers when an instruction sequence starts at 0xff8 or 0xffc modulo (4 KiB) by making the thunks not affect addresses modulo (4 KiB) we prevent thunks from interfering with the patch. The patches themselves could be aggregated in the same way that Thunks are within ThunkSections and we could round up the size in the same way. This would reduce the number of patches created in a .text section size > 128 MiB but would not likely help convergence problems. Differential Revision: https://reviews.llvm.org/D71281 fixes (remaining part of) pr44071, other part in D71242	2019-12-11 14:09:15 +00:00
Peter Smith	247b2ce11c	[LLD][ELF][AArch64][ARM] Add missing classof to patch sections. The code to insert patch section merges them with a comparison function that uses logic of the form: return (isa<PatchSection>(a) && !isa<PatchSection>(b)); If the PatchSections don't implement classof this check fails if b is also a SyntheticSection. This can result in the patches being out of range if the SyntheticSection is big, for example a ThunkSection with lots of thunks. Differential Revision: https://reviews.llvm.org/D71242 fixes (part of) pr44071	2019-12-11 14:09:15 +00:00
Fangrui Song	6e513a5382	[ELF] Move a computeIsPreemptible() pass into ICF. NFC Address post-commit review for D71163. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D71326	2019-12-10 22:21:05 -08:00
Fangrui Song	cd0ab2428f	[ELF] --icf: do not fold preemptible symbols Fixes PR44124. A preemptible symbol may refer to a different definition at runtime. When comparing a pair of relocations, if they refer to different symbols, and either symbol is preemptible, the two containing sections should be considered different. gold has a similar rule https://sourceware.org/git/gitweb.cgi?p=binutils-gdb.git;a=commit;h=ce97fa81e0c46d216b80b143ad8c02fff6906fef Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D71163	2019-12-10 09:06:08 -08:00
Fangrui Song	60ce444eaa	[ELF] Refine section group --gc-sections rules to not discard .debug_types clang/gcc -fdebug-type-sections places .debug_types and .rela.debug_types in a section group, with a signature symbol which represents the type signature. The section group is for deduplication purposes. After D70146, we will discard such section groups. Refine the rule so that we will retain the group if no member has the SHF_ALLOC flag. GNU ld has a similar rule to retain the group if all members have the SEC_DEBUGGING flag. We try to be more general for future-proof purposes: if other non-SHF_ALLOC sections have deduplication needs, they may be placed in a section group. Don't discard them. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D71157	2019-12-10 09:00:58 -08:00
Sam Clegg	b4f4e370b5	[WebAssebmly][MC] Support .import_name/.import_field asm directives Convert the MC test to use asm rather than bitcode. This is a precursor to https://reviews.llvm.org/D70520. Differential Revision: https://reviews.llvm.org/D70877	2019-12-06 15:09:56 -08:00
Fangrui Song	c8f0d3e130	[ELF][PPC64] Support long branch thunks with addends Fixes PPC64 part of PR40438 // clang -target ppc64le -c a.cc // .text.unlikely may be placed in a separate output section (via -z keep-text-section-prefix) // The distance between bar in .text.unlikely and foo in .text may be larger than 32MiB. static void foo() {} __attribute__((section(".text.unlikely"))) static int bar() { foo(); return 0; } __attribute__((used)) static int dummy = bar(); This patch makes such thunks with addends work for PPC64. AArch64: .text -> `__AArch64ADRPThunk_ (adrp x16, ...; add x16, x16, ...; br x16)` -> target PPC64: .text -> `__long_branch_ (addis 12, 2, ...; ld 12, ...(12); mtctr 12; bctr)` -> target AArch64 can leverage ADRP to jump to the target directly, but PPC64 needs to load an address from .branch_lt . Before Power ISA v3.0, the PC-relative ADDPCIS was not available. .branch_lt was invented to work around the limitation. Symbol::ppc64BranchltIndex is replaced by PPC64LongBranchTargetSection::entry_index which take addends into consideration. The tests are rewritten: ppc64-long-branch.s tests -no-pie and ppc64-long-branch-pi.s tests -pie and -shared. Reviewed By: sfertile Differential Revision: https://reviews.llvm.org/D70937	2019-12-05 10:17:45 -08:00
Fangrui Song	944f109ad7	[ELF][PPC64] Don't copy ppc64BranchltIndex in replaceWithDefined replaceWithDefined is used by canonical PLT and copy relocations, which imply that the symbol is preemptable. ppc64BranchltIndex is only used by non-preemptable cases, and it can only be the default value in replaceWithDefined.	2019-12-05 09:33:30 -08:00
Peter Smith	784f57584f	[LLD][ELF][AArch64] .note.gnu.property sections should have alignment 8 The .note.gnu.property SHT_NOTE sections on AArch64 (a 64-bit target) should have alignment 8 to more closely match the binutils implementation where alignment is 4-bytes on 32-bit machines and 8-bytes on 64-bit machines. Previously LLD was using 4 for both 32-bit and 64-bit machines. Differential Revision: https://reviews.llvm.org/D70962	2019-12-05 10:11:31 +00:00
Peter Smith	4d6c4cb426	[LLD][ELF] Add support for PT_GNU_PROPERTY The PT_GNU_PROPERTY program header describes the location of the .note.gnu.property SHT_NOTES section. The linux kernel uses this program header to find the .note.gnu.property section rather than parsing. Executables that have properties that the kernel needs to act on that don't have the PT_GNU_PROPERTY program header will not boot. Differential Revision: https://reviews.llvm.org/D70961	2019-12-05 09:54:58 +00:00
Fangrui Song	bf535ac4a2	[ELF][AArch64] Support R_AARCH64_{CALL26,JUMP26} range extension thunks with addends Fixes AArch64 part of PR40438 The current range extension thunk framework does not handle a relocation relative to a STT_SECTION symbol with a non-zero addend, which may be used by jumps/calls to local functions on some RELA targets (AArch64, powerpc ELFv1, powerpc64 ELFv2, etc). See PR40438 and the following code for examples: // clang -target $target a.cc // .text.cold may be placed in a separate output section. // The distance between bar in .text.cold and foo in .text may be larger than 128MiB. static void foo() {} __attribute__((section(".text.cold"))) static int bar() { foo(); return 0; } __attribute__((used)) static int dummy = bar(); This patch makes such thunks with addends work for AArch64. The target independent part can be reused by PPC in the future. On REL targets (ARM, MIPS), jumps/calls are not represented as STT_SECTION + non-zero addend (see MCELFObjectTargetWriter::needsRelocateWithSymbol), so they don't need this feature, but we need to make sure this patch does not affect them. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D70637	2019-12-02 10:07:24 -08:00
Fangrui Song	3d9b1128d6	[ELF][ARM] Add getPCBias() ThunkCreator::getThunk and ThunkCreator::normalizeExistingThunk currently assume that the implicit addends are -8 for ARM and -4 for Thumb. In D70637, ThunkCreator::getThunk will need to take care of the relocation addend explicitly. Add the utility function getPCBias() as a prerequisite so that the getThunk change in D70637 can be more general. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D70690	2019-11-27 09:09:46 -08:00
Georgii Rymar	3b35603a56	[llvm-readobj] - Always print "Predecessors" for version definition sections. This is a follow-up discussed in D70495 thread. The current logic is unusual for llvm-readobj. It doesn't print predecessors list when it is empty. This is not good for machine parsers. D70495 had to add this condition during refactoring to reduce amount of changes, in tests, because the original code also had a similar logic. Now seems it is time to get rid of it. This patch does it. Differential revision: https://reviews.llvm.org/D70717	2019-11-27 12:29:55 +03:00
Shoaib Meenai	75fd939bb9	[ELF] Adjust test to work for zlib 1.2.8 The previous data had the same length with compression levels 1 and 6 for zlib 1.2.8. Adjust the test to work for this library version. I've also tested this with zlib 1.2.7 and zlib 1.2.11.	2019-11-26 17:19:02 -08:00
Fangrui Song	54a366f515	[ELF] Add a corrector for case mismatch problems Reviewed By: grimar, peter.smith Differential Revision: https://reviews.llvm.org/D70506	2019-11-26 09:11:56 -08:00
Fangrui Song	a2fc964417	[ELF] Replace SymbolTable::forEachSymbol with iterator_range symbols() D62381 introduced forEachSymbol(). It seems that many call sites cannot be parallelized because the body shared some states. Replace forEachSymbol with iterator_range<filter_iterator<...>> symbols() to simplify code and improve debuggability (std::function calls take some frames). It also allows us to use early return to simplify code added in D69650. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D70505	2019-11-26 09:09:32 -08:00
Georgii Rymar	19edd675c6	[LLD][ELF] - Make compression level be dependent on -On. Currently LLD always use zlib compression level 6. This patch changes it to use 1 for -O0, -O1 and 6 for -O2. It fixes https://bugs.llvm.org/show_bug.cgi?id=44089. There was also a thread in llvm-dev on this topic: https://lists.llvm.org/pipermail/llvm-dev/2018-August/125020.html Here is a table with results of building clang mentioned there: ``` Level Time Size 0 0m17.128s 2045081496 Z_NO_COMPRESSION 1 0m31.471s 922618584 Z_BEST_SPEED 2 0m32.659s 903642376 3 0m36.749s 890805856 4 0m41.532s 876697184 5 0m48.383s 862778576 6 1m3.176s 855251640 Z_DEFAULT_COMPRESSION 7 1m15.335s 853755920 8 2m0.561s 852497560 9 2m33.972s 852397408 Z_BEST_COMPRESSION ``` It shows that it is probably not reasonable to use values greater than 6. Differential revision: https://reviews.llvm.org/D70658	2019-11-26 11:50:22 +03:00
Fangrui Song	a71c1e2a57	[ELF] Support input section description .rel[a].dyn in /DISCARD/ Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D70695	2019-11-25 21:49:46 -08:00
Fangrui Song	f0558f582a	[ELF] Delete unused Configuration::zExecstack after D56554	2019-11-25 14:44:09 -08:00
Nemanja Ivanovic	4d5c8caf9b	[LLD] Add a default copy constructor to avoid warnings This should fix the failure on the PPC64LE LLD bot.	2019-11-25 14:09:16 -06:00
Fangrui Song	6b90f2bbec	[ELF][test] Clean up some thunk tests Use llvm-objdump --no-show-raw-insn --print-imm-hex Prefer generic aarch64 triple to linux/freebsd specific triples.	2019-11-23 12:42:39 -08:00
Fangrui Song	4dc2fb123d	[ELF] Error if -Ttext-segment is specified In GNU ld, -Ttext sets the address of the .text section and -Ttext-segment sets the address of the text segment (RX). gold only supports the -Ttext-segment semantic and treats -Ttext as an alias for -Ttext-segment. lld only supports the -Ttext semantic and treats -Ttext-segment as an alias for -Ttext. The text segment will be assigned to an address less than the specified -Ttext-segment value. This patch drops the -Ttext-segment alias. The text segment is traditionally the first segment. Users who specify -Ttext-segment may actually want to specify --image-base, the lld way to express this. Unfortunately currently this is supported by GNU ld's COFF port but not by its ELF port. gold does not support this option. With -z separate-code, the behavior of GNU ld -Ttext-segment is weird (see https://sourceware.org/bugzilla/show_bug.cgi?id=25207) rL289827 introduced the alias for linking qemu's non-pie user mode binaries. As explained previously, this actually assigns the text segment to an address less than 0x60000000. I feel that a better fix is on the qemu side: https://lists.nongnu.org/archive/html/qemu-devel/2019-11/msg02480.html Reviewed By: grimar, ruiu Differential Revision: https://reviews.llvm.org/D70468	2019-11-21 09:41:55 -08:00
James Y Knight	d3fec7fb45	LLD: Don't use the stderrOS stream in link before it's reassigned. Remove the lld::enableColors function, as it just obscures which stream it's affecting, and replace with explicit calls to the stream's enable_colors. Also, assign the stderrOS and stdoutOS globals first in link function, just to ensure nothing might use them. (Either change individually fixes the issue of using the old stream, but both together seems best.) Follow-up to `b11386f9be`. Differential Revision: https://reviews.llvm.org/D70492	2019-11-21 10:55:03 -05:00
Alex Richardson	5bab291b7b	Ignore R_MIPS_JALR relocations against non-function symbols Summary: Current versions of clang would erroneously emit this relocation not only against functions (loaded from the GOT) but also against data symbols (e.g. a table of function pointers). LLD was then changing this into a branch-and-link instruction, causing the program to jump to the data symbol at run time. I discovered this problem when attempting to boot MIPS64 FreeBSD after updating the to the latest upstream master. Reviewers: atanasyan, jrtc27, espindola Reviewed By: atanasyan Subscribers: emaste, sdardis, krytarowski, MaskRay, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70406	2019-11-20 13:23:26 +00:00
Georgii Rymar	1cc78fdb6f	[llvm-readobj/llvm-readelf] - Improve dumping of versioning sections. Our elf-versioninfo.test is not perfect. It does not properly test how flags are dumped and also we have a bug: they are dumped as enums in LLVM style now, i.e not dumped properly. GNU style uses a `versionFlagToString` method to build a string from flags which seems is consistent with GNU readelf. In this patch I fixed the issues mentioned. Differential revision: https://reviews.llvm.org/D70399	2019-11-20 11:55:55 +03:00
Rui Ueyama	47feae5dd6	Use lld::make<T> to make TpiSource objects In lld we rarely use std::unique_ptr but instead allocate new instances using lld::make<T>() so that they are deallocated at the end of linking. This patch changes existing code so that that follows the convention. Differential Revision: https://reviews.llvm.org/D70420	2019-11-20 13:14:44 +09:00
Fangrui Song	ce5de93e83	[ELF] Disallow out-of-range section group indices after D70146 Exposed by invalid/sht-group-wrong-section.test http://45.33.8.238/win/2613/step_9.txt	2019-11-19 09:49:45 -08:00
Fangrui Song	6b0eb5a672	[ELF] Improve --gc-sections compatibility with GNU ld regarding section groups Based on D70020 by serge-sans-paille. The ELF spec says: > Furthermore, there may be internal references among these sections that would not make sense if one of the sections were removed or replaced by a duplicate from another object. Therefore, such groups must be included or omitted from the linked object as a unit. A section cannot be a member of more than one group. GNU ld has 2 behaviors that we don't have: - Group members (nextInSectionGroup != nullptr) are subject to garbage collection. This includes non-SHF_ALLOC SHT_NOTE sections. In particular, discarding non-SHF_ALLOC SHT_NOTE sections is an expected behavior by the Annobin project. See https://developers.redhat.com/blog/2018/02/20/annobin-storing-information-binaries/ for more information. - Groups members are retained or discarded as a unit. Members may have internal references that are not expressed as SHF_LINK_ORDER, relocations, etc. It seems that we should be more conservative here: if a section is marked live, mark all the other member within the group. Both behaviors are reasonable. This patch implements them. A new field InputSectionBase::nextInSectionGroup tracks the next member within a group. on ELF64, this increases sizeof(InputSectionBase) froms 144 to 152. InputSectionBase::dependentSections tracks section dependencies, which is used by both --gc-sections and /DISCARD/. We can't overload it for the "next member" semantic, because we should allow /DISCARD/ to discard sections independent of --gc-sections (GNU ld behavior). This behavior may be reasonably used by `/DISCARD/ : { (.ARM.exidx) }` or `/DISCARD/ : { (.note) }` (new test `linkerscript/discard-group.s`). Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D70146	2019-11-19 08:54:06 -08:00
Rui Ueyama	45f8ee5f3c	Revert "Fix shared lib build." This reverts commit `17e37ba57a` because it introduced a circular dependency between Core and Common. Because `d0371f4736` fixed a build issue, we no longer need that dependency.	2019-11-19 12:05:55 +09:00
Rui Ueyama	d0371f4736	Fix component build `b11386f9be` broke lld build if `-DBUILD_SHARED_LIBS=ON` is passed to CMake.	2019-11-19 11:20:52 +09:00
Michael Liao	17e37ba57a	Fix shared lib build.	2019-11-18 13:08:51 -05:00
Rui Ueyama	909d7bf9a7	Fix Windows buildbots `stdout` and `stderr` might be defined as macros, so we needed to avoid using them as variable names.	2019-11-18 16:45:00 +09:00
Rui Ueyama	b11386f9be	Make it possible to redirect not only errs() but also outs() This change is for those who use lld as a library. Context: https://reviews.llvm.org/D70287 This patch adds a new parmeter to lld::::link() so that we can pass an raw_ostream object representing stdout. Previously, lld::::link() took only an stderr object. Justification for making stdoutOS and stderrOS mandatory: I wanted to make link() functions to take stdout and stderr in that order. However, if we change the function signature from bool link(ArrayRef<const char > args, bool canExitEarly, raw_ostream &stderrOS = llvm::errs()); to bool link(ArrayRef<const char > args, bool canExitEarly, raw_ostream &stdoutOS = llvm::outs(), raw_ostream &stderrOS = llvm::errs()); , then the meaning of existing code that passes stderrOS silently changes (stderrOS would be interpreted as stdoutOS). So, I chose to make existing code not to compile, so that developers can fix their code. Differential Revision: https://reviews.llvm.org/D70292	2019-11-18 11:18:06 +09:00
Ayke van Laethem	57776f71fa	[ELF] Fix lld build on Windows/MinGW The patch in https://reviews.llvm.org/D64077 causes a build failure because both the Defined and SharedSymbol classes are bigger than 80 bytes on MinGW 8. This patch fixes this build failure by changing the type of the bitfields. It is a similar change to the bitfield changes in https://reviews.llvm.org/D64238, but instead of changing to bool I decided to use uint8_t because one of the bitfields takes up two bits instead of one. Note: the patch is slightly different from the one reviewed in Phabricator, but it is a trivial change to align it with LLVM master instead of LLVM 9. Also, it passes all lld tests. Differential Revision: https://reviews.llvm.org/D70266	2019-11-16 13:28:53 +01:00
Reid Kleckner	ce0f3ee5e4	[COFF] Don't error if the only inputs are from /wholearchive: Fixes PR43744 Differential Revision: https://reviews.llvm.org/D69968	2019-11-15 16:09:07 -08:00
Reid Kleckner	4c1a1d3cf9	Add missing includes needed to prune LLVMContext.h include, NFC These are a pre-requisite to removing #include "llvm/Support/Options.h" from LLVMContext.h: https://reviews.llvm.org/D70280	2019-11-14 15:23:15 -08:00
Reid Kleckner	de3fb1ec05	[COFF] Avoid CodeView include in header Most LLD/COFF files don't care about CodeView. Avoid using CodeView types in InputFiles.h.	2019-11-14 14:27:48 -08:00
Reid Kleckner	adfad4d7c8	Forward declare the DWARFCache to avoid including LLVM DWARF details LLD's DWARF.h header leaks a lot of LLVM DWARF includes that LLD doesn't need. For Chunks.cpp, I see a compile time decrease of 3.1s to 2.7s.	2019-11-14 14:17:49 -08:00
Reid Kleckner	f24c3352c9	[COFF] Don't include llvm/LTO/LTO.h in a header LLVM's LTO header includes all of llvm/IR, which most of the COFF linker doesn't need.	2019-11-14 13:47:18 -08:00
Rui Ueyama	000ff301e7	Warn on /align if used without /driver /align is not supposed to be used without /driver, so it makes sense to warn if only /align is passed. MSVC link.exe warns on this too. Differential Revision: https://reviews.llvm.org/D70163	2019-11-14 13:13:07 +09:00
Rui Ueyama	f95ed69641	Implement /driver, /driver:wdm and /driver:uponly This patch implements /driver, /driver:wdm and /driver:uponly as described in https://docs.microsoft.com/en-us/cpp/build/reference/driver-windows-nt-kernel-mode-driver?view=vs-2019. Differential Revision: https://reviews.llvm.org/D70162	2019-11-14 13:07:56 +09:00
Martin Storsjö	38bc9559ba	[LLD] [COFF] Fix automatically importing data symbols from DLLs with LTO This broke in `51dcb292cc`, "[lld-link] diagnose undefined symbols before LTO when possible" (very soon after the 9.0 branch, so luckily the 9.0 release is unaffected). The code for loading objects we believe might be needed for autoimport (loadMinGWAutomaticImports()) does run before the new reportUnresolvable() function, but it had a condition to only operate on symbols from regular object files. This condition came from resolveRemainingUndefines(), but as loadMinGWAutomaticImports() now has to operate before the LTO, it has to operate on undefineds from LTO objects as well. Differential Revision: https://reviews.llvm.org/D70166	2019-11-13 22:48:36 +02:00
Fangrui Song	8f089f2099	[MC] Emit unused undefined symbol even if its binding is not set Recommit r373168, which was reverted by r373242. This actually exposed a boringssl bug which has been fixed for more than one month. For the following two cases, we currently suppress the symbols. This patch emits them (compatible with GNU as). * `test2_a = undef`: if `undef` is otherwise unused. * `.hidden hidden`: if `hidden` is unused. This is the main point of the patch, because omitting the symbol would cause a linker semantic difference. It causes a behavior change that is not compatible with GNU as: .weakref foo1, bar1 When neither foo1 nor bar1 is used, we now emit bar1, which is arguably more consistent. Another change is that we will emit .TOC. for .TOC.@tocbase . For this directive, suppressing .TOC. can be seen as a size optimization, but we choose to drop it for simplicity and consistency.	2019-11-08 14:47:48 -08:00
Fangrui Song	5b47efa20e	[ELF] Fix stack-use-after-scope after D69592 and 69650	2019-11-08 11:21:32 -08:00
Fangrui Song	59d3fbc227	[ELF] Suggest extern "C" when the definition is mangled while an undefined reference is not The definition may be mangled while an undefined reference is not. This may come up when (1) the reference is from a C file or (2) the definition misses an extern "C". (2) is more common. Suggest an arbitrary mangled name that matches the undefined reference, if such a definition exists. ld.lld: error: undefined symbol: foo >>> referenced by a.o:(.text+0x1) >>> did you mean to declare foo(int) as extern "C"? >>> defined in: a1.o Reviewed By: dblaikie, ruiu Differential Revision: https://reviews.llvm.org/D69650	2019-11-08 09:46:45 -08:00
Fangrui Song	70e62a4fa6	[ELF] Suggest extern "C" when an undefined reference is mangled while the definition is not When missing an extern "C" declaration, an undefined reference may be mangled while the definition is not. Suggest the missing extern "C" and the base name. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D69592	2019-11-08 09:42:50 -08:00
Rui Ueyama	f95273f75a	Keep symbols passed by -init and -fini Previously, symbols passed by -init and -fini look as if they are not referenced by anyone, and the LTO might eliminate them. This patch fixes the issue. Fixes a bug reported in https://bugs.llvm.org/show_bug.cgi?id=43927 Differential Revision: https://reviews.llvm.org/D69985	2019-11-08 19:08:15 +09:00
Peter Collingbourne	2c6fae179e	ELF: Discard .ARM.exidx sections for empty functions instead of misordering them. The logic added in r372781 caused ARMExidxSyntheticSection::addSection() to return false for exidx sections without a link order dep that passed isValidExidxSectionDep(). This included exidx sections for empty functions. As a result, such exidx sections would end up treated like ordinary sections and would end up being laid out before the ARMExidxSyntheticSection, most likely in the wrong order relative to the exidx entries in the ARMExidxSyntheticSection, breaking the orderedness invariant relied upon by unwinders. Fix this by simply discarding such sections. Differential Revision: https://reviews.llvm.org/D69744	2019-11-04 09:11:14 -08:00
Reid Kleckner	deaf121b65	Warn when an output section name is longer than 8 characters Recent versions of Microsoft's dumpbin tool cannot handle such PE files. LLVM tools and GNU tools can, and use this to encode long section names like ".debug_info", which is commonly used for DWARF. Don't do this in mingw mode or when -debug:dwarf is passed, since the user probably wants long section names for DWARF sections. PR43754 Reviewers: ruiu, mstorsjo Differential Revision: https://reviews.llvm.org/D69594	2019-11-01 12:59:13 -07:00
Nico Weber	07255f81fa	comment typo fix to cycle bots	2019-10-31 07:54:16 -04:00
Nico Weber	4138fc9567	comment typo fix to cycle bots	2019-10-30 22:17:52 -04:00
Fangrui Song	db8dad20b3	[ELF][test] Change references of %T to %t.dir Test files in the same directory share the same %T. %T is easy to misuse and cause race conditions (when running concurrently) so it has been deprecated since D48842 (see docs/CommandGuide/lit.rst). While here, add `rm -rf %t.dir` so that tests cannot depend on old files lying around. Reviewed By: jhenderson, ruiu Differential Revision: https://reviews.llvm.org/D69572	2019-10-30 09:22:48 -07:00
Georgii Rymar	d213da49e1	[LLD] - Fix a test after obj2yaml change. I am not sure why obj2yaml is used to check the linkers output, but anyways, the format was changed in https://reviews.llvm.org/rG6e779e953e9d.	2019-10-30 18:28:52 +03:00
Sam Clegg	baff8ec2e1	[WebAssembly][lld] Fix for static linking of PIC code When statically linking PIC code we create an internalized __memory_base so that memory-base-relative relocation work correctly. The value of this global should be zero, and not the globalBase since the globalBase offset is already taken into account by getVirtualAddress. Fixes: https://github.com/emscripten-core/emscripten/issues/9013 Differential Revision: https://reviews.llvm.org/D69600	2019-10-29 18:58:56 -07:00
Nick Terrell	6814232429	[LLD][ELF] Support --[no-]mmap-output-file with F_no_mmap Summary: Add a flag `F_no_mmap` to `FileOutputBuffer` to support `--[no-]mmap-output-file` in ELF LLD. LLD currently explicitly ignores this flag for compatibility with GNU ld and gold. We need this flag to speed up link time for large binaries in certain scenarios. When we link some of our larger binaries we find that LLD takes 50+ GB of memory, which causes memory pressure. The memory pressure causes the VM to flush dirty pages of the output file to disk. This is normally okay, since we should be flushing cold pages. However, when using BtrFS with compression we need to write 128KB at a time when we flush a page. If any page in that 128KB block is written again, then it must be flushed a second time, and so on. Since LLD doesn't write sequentially this causes write amplification. The same 128KB block will end up being flushed multiple times, causing the linker to many times more IO than necessary. We've observed 3-5x faster builds with -no-mmap-output-file when we hit this scenario. The bad scenario only applies to compressed filesystems, which group together multiple pages into a single compressed block. I've tested BtrFS, but the problem will be present for any compressed filesystem on Linux, since it is caused by the VM. Silently ignoring --no-mmap-output-file caused a silent regression when we switched from gold to lld. We pass --no-mmap-output-file to fix this edge case, but since lld silently ignored the flag we didn't realize it wasn't being respected. Benchmark building a 9 GB binary that exposes this edge case. I linked 3 times with --mmap-output-file and 3 times with --no-mmap-output-file and took the average. The machine has 24 cores @ 2.4 GHz, 112 GB of RAM, BtrFS mounted with -compress-force=zstd, and an 80% full disk. \| Mode \| Time \| \|---------\|-------\| \| mmap \| 894 s \| \| no mmap \| 126 s \| When compression is disabled, BtrFS performs just as well with and without mmap on this benchmark. I was unable to reproduce the regression with any binaries in lld-speed-test. Reviewed By: ruiu, MaskRay Differential Revision: https://reviews.llvm.org/D69294	2019-10-29 15:49:08 -07:00
Fangrui Song	94bfa6deb0	[ELF] Delete redundant comment after D56554. NFC	2019-10-29 10:00:48 -07:00
Michał Górny	2a0fcae3d4	[lld] [ELF] Add '-z nognustack' opt to suppress emitting PT_GNU_STACK Add a new '-z nognustack' option that suppresses emitting PT_GNU_STACK segment. This segment is not supported at all on NetBSD (stack is always non-executable), and the option is meant to be used to disable emitting it. Differential Revision: https://reviews.llvm.org/D56554	2019-10-29 17:54:23 +01:00
Nico Weber	5976a3f5aa	Fix a few typos in lld/ELF to cycle bots	2019-10-28 21:41:47 -04:00
Nico Weber	b911d2db5d	lld/COFF: Simplify getOutputPath() using sys::path functions. Also mention "basename" and "dirname" in Path.h since I tried to find these functions by looking for these strings. It might help others find them faster if the comments contain these strings. No behavior change. Differential Revision: https://reviews.llvm.org/D69458	2019-10-28 10:38:32 -04:00
Sterling Augustine	118ceea5c3	Crt files are special cased by name when dealing with ctor and dtor sections, but the current code misses certain variants. In particular, those named when clang takes the code path in clang/lib/Driver/ToolChain.cpp:416, where crtfiles are named: clang_rt.<component>-<arch>-<env>.<suffix> Previously, the code only handled: clang_rt.<component>.<suffix> <component>.<suffix> This revision fixes that.	2019-10-25 11:04:56 -07:00
georgerim	e3105e71f3	[LLD][ELF] - Update test case after yaml2obj change. SHT_NOTE needs at least an empty "Content" in the YAML description. Should fix http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-ubuntu-fast	2019-10-25 13:40:44 +03:00
Fangrui Song	56d81104f1	[ELF] -r: fix crash when processing a SHT_REL[A] that relocates a SHF_MERGE after D67504/r372734 Fix PR43767 In -r mode, when processing a SHT_REL[A] that relocates a SHF_MERGE, sec->getRelocatedSection() is a MergeInputSection and its parent is an OutputSection but is asserted to be a SyntheticSection (MergeSyntheticSection) in LinkerScript.cpp:addInputSec(). ## The code path is not exercised in non -r mode because the relocated section changed from MergeInputSection to InputSection. Reorder the code to make the non -r logic apply to -r as well, thus fix the crash. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D69364	2019-10-24 11:35:29 -07:00
LLVM GN Syncbot	f86dc64bad	typo fix test commit	2019-10-22 21:32:11 +00:00
Nico Weber	e0e7d06df3	fix a few typos to test git committing	2019-10-22 16:34:00 -04:00
Martin Storsjo	150a9ad3ff	[LLD] [COFF] Fix use of uninitialized memory since SVN r375390 llvm-svn: 375400	2019-10-21 09:35:34 +00:00
Martin Storsjo	65b1c497d2	[LLD] [COFF] Use the local dwarf code instead of Symbolizer for resolving code locations. NFC. As we now have code that parses the dwarf info for variable locations, we can use that instead of relying on the higher level Symbolizer library, reducing the previous two different dwarf codepaths into one. Differential Revision: https://reviews.llvm.org/D69198 llvm-svn: 375391	2019-10-21 08:01:59 +00:00
Martin Storsjo	908b780952	[LLD] Move duplicated dwarf parsing code to the Common library. NFC. Differential Revision: https://reviews.llvm.org/D69197 llvm-svn: 375390	2019-10-21 08:01:52 +00:00
George Rimar	c4107383e5	[LLD][ELF] - Update tests after yaml2obj tool update. yaml2obj doesn't create .symtab by default anymore. llvm-svn: 375360	2019-10-20 14:47:09 +00:00
Reid Kleckner	90c64a3456	Move endian constant from Host.h to SwapByteOrder.h, prune include Works on this dependency chain: ArrayRef.h -> Hashing.h -> --CUT-- Host.h -> StringMap.h / StringRef.h ArrayRef is very popular, but Host.h is rarely needed. Move the IsBigEndianHost constant to SwapByteOrder.h. Clients of that header are more likely to need it. llvm-svn: 375316	2019-10-19 00:48:11 +00:00
Thomas Lively	393d0f799f	[WebAssembly] Allow multivalue signatures in object files Summary: Also changes the wasm YAML format to reflect the possibility of having multiple return types and to put the returns after the params for consistency with the binary encoding. Reviewers: aheejin, sbc100 Subscribers: dschuff, jgravelle-google, hiraditya, sunfish, arphaman, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69156 llvm-svn: 375283	2019-10-18 20:27:30 +00:00
Michael Liao	92fea8bb8d	[lld][coff] Add missing dependency to fix build. llvm-svn: 375238	2019-10-18 14:31:58 +00:00
Martin Storsjo	b38f577c01	[LLD] [COFF] Try to report source locations for duplicate symbols This fixes the second part of PR42407. For files with dwarf debug info, it manually loads and iterates .debug_info to find the declared location of variables, to allow reporting them. (This matches the corresponding code in the ELF linker.) For functions, it uses the existing getFileLineDwarf which uses LLVMSymbolizer for translating addresses to file lines. In object files with codeview debug info, only the source location of duplicate functions is printed. (And even there, only for the first input file. The getFileLineCodeView function requires the object file to be fully loaded and initialized to properly resolve source locations, but duplicate symbols are reported at a stage when the second object file isn't fully loaded yet.) Differential Revision: https://reviews.llvm.org/D68975 llvm-svn: 375218	2019-10-18 10:43:15 +00:00
Rui Ueyama	9a5ad9bd5a	Update release notes llvm-svn: 375206	2019-10-18 06:11:16 +00:00
Jordan Rupprecht	ce88cdf096	[lld][test] Speculative fix for lld+windows failures This updates some more places using `%T` to use `%/T` for path normalization. If this does not work, this and r375126 should be reverted together. llvm-svn: 375131	2019-10-17 16:29:03 +00:00
Jordan Rupprecht	1b6c3ca126	[lld][test] Fix use of escape character in an lld test on Windows Summary: Glob support was improved to accept `\` as an escape character in r375051, but reverted as r375052 due to a failure in this test on Windows. The reason this failure seems Windows specific is because the path separator `\` is currently being relied on to be interpreted literally instead of as an escape character. Per documentation on linker input section wildcard patterns, this seems to be a bug in lld accepting `\` as a literal instead of an escape character. For example: ``` SECTIONS{ .foo :{ /path/to/foo.o(.foo) }} # OK: standard UNIX path SECTIONS{ .foo :{ C:/path/to/foo.o(.foo) }} # OK: windows accepts slashes in either direction SECTIONS{ .foo :{ C:\\path\\to\\foo.o(.foo) }} # OK: escape character used to match a literal \ SECTIONS{ .foo :{ C:\path\to\foo.o(.foo) }} # BAD: this actually matches the path C:pathtofoo.o(.foo) ``` This avoids the problem in the test by using `%/T` in place of `%T` to normalize the path separator to `/`, which windows should also accept. This patch just fixes the test, and glob support will be be relanded separately. For a sample buildbot error, see: http://lab.llvm.org:8011/builders/clang-x64-windows-msvc/builds/11578/steps/stage%201%20check/logs/stdio Reviewers: evgeny777, ruiu, MaskRay, espindola Reviewed By: ruiu, MaskRay Subscribers: emaste, arichardson, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69074 llvm-svn: 375126	2019-10-17 15:35:28 +00:00
George Rimar	938db706fe	[LLD][ELF] - Update test cases after llvm-readobj output format change. The change was: SHT_GNU_verdef { -> VersionDefinitions [ SHT_GNU_verneed { -> VersionRequirements [ Version symbols [ -> VersionSymbols [ EH_FRAME Header [ -> EHFrameHeader { llvm-svn: 375096	2019-10-17 10:23:59 +00:00
Sam Clegg	67b055841f	[lld][WebAssebmly] Preserve custom import attributes with LTO Undefined symbols in WebAssembly can come with custom `import-module` and `import-field` attributes. However when reading symbols from bitcode object files during LTO those curtom attributes are not available. Once we compile the LTO object and read in the symbol table from the object file we have access to these custom attributes. In this case, when undefined symbols are added and a symbol already exists in the SymbolTable we can't simple return it, we may need to update the symbol's attributes. Fixes: PR43211 Differential Revision: https://reviews.llvm.org/D68959 llvm-svn: 375081	2019-10-17 05:16:54 +00:00
Sam Clegg	6c393e9d74	[lld][WebAssembly] Fix for weak references to data symbols in archives Fix a bug where were not handling relocations against weakly undefined data symbol. Add a test for this case. Also ensure that the weak references to data symbols are not pulled in from archive files by default (but are if `-u <name>` is added to the command line). Fixes: PR43696 Differential Revision: https://reviews.llvm.org/D69073 llvm-svn: 375077	2019-10-17 03:21:02 +00:00
Thomas Lively	190dacc3cc	[WebAssembly] Elide data segments for .bss sections Summary: WebAssembly memories are zero-initialized, so when module does not import its memory initializing .bss sections is guaranteed to be a no-op. To reduce binary size and initialization time, .bss sections are simply not emitted into the final binary unless the memory is imported. Reviewers: sbc100 Subscribers: dschuff, jgravelle-google, aheejin, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68965 llvm-svn: 374940	2019-10-15 19:05:11 +00:00
James Clarke	1ab27c74d4	[lld][WebAssembly] Fix static linking of -fPIC code with external undefined data Reviewers: ruiu, sbc100 Reviewed By: sbc100 Subscribers: dschuff, jgravelle-google, aheejin, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68991 llvm-svn: 374913	2019-10-15 17:05:42 +00:00
Sid Manning	ab50256544	[lld] Check for branch range overflows. Differential Revision: https://reviews.llvm.org/D68875 llvm-svn: 374891	2019-10-15 14:12:54 +00:00
Martin Storsjo	e0916f4fbe	[LLD] [COFF] Update a leftover comment after SVN r374869. NFC. llvm-svn: 374874	2019-10-15 09:46:33 +00:00
Martin Storsjo	cd8759c3c2	[LLD] [COFF] Fix -Wmissing-field-initializers warnings. NFC. llvm-svn: 374873	2019-10-15 09:33:14 +00:00
Martin Storsjo	9318c94ebb	[LLD] [COFF] Wrap file location pair<StringRef,int> in Optional<>. NFC. This makes use of it slightly clearer, and makes it match the same construct in the lld ELF linker. Differential Revision: https://reviews.llvm.org/D68935 llvm-svn: 374869	2019-10-15 09:18:18 +00:00
Jordan Rupprecht	c526ff8a62	[llvm-objdump] Adjust spacing and field width for --section-headers Summary: - Expand the "Name" column past 13 characters when any of the section names are longer. Current behavior is a staggard output instead of a nice table if a single name is longer. - Only print the required number of hex chars for addresses (i.e. 8 characters for 32-bit, 16 characters for 64-bit) - Fix trailing spaces Reviewers: grimar, jhenderson, espindola Reviewed By: grimar Subscribers: emaste, sbc100, arichardson, aheejin, seiya, llvm-commits, MaskRay Tags: #llvm Differential Revision: https://reviews.llvm.org/D68730 llvm-svn: 374795	2019-10-14 17:47:17 +00:00
George Rimar	a8346cee8a	[LLD][ELF] - Update test cases after llvm-readobj change. https://reviews.llvm.org/D68704 changed the output format. llvm-svn: 374542	2019-10-11 12:27:20 +00:00
Amy Huang	c9428a04fc	Change test case so that it accepts backslashes in file path, in the case that the test runs on Windows llvm-svn: 374473	2019-10-10 23:35:53 +00:00
Zachary Turner	02c5386811	[PDB] Fix bug when using multiple PCH header objects with the same name. A common pattern in Windows is to have all your precompiled headers use an object named stdafx.obj. If you've got a project with many different static libs, you might use a separate PCH for each one of these. During the final link step, a file from A might reference the PCH object from A, but it will have the same name (stdafx.obj) as any other PCH from another project. The only difference will be the path. For example, A might be A/stdafx.obj while B is B/stdafx.obj. The existing algorithm checks only the filename that was passed on the command line (or stored in archive), but this is insufficient in the case where relative paths are used, because depending on the command line object file / library order, it might find the wrong PCH object first resulting in a signature mismatch. The fix here is to simply check whether the absolute path of the PCH object (which is stored in the input obj file for the file that references the PCH) ends with the full relative path of whatever is specified on the command line (or is in the archive). Differential Revision: https://reviews.llvm.org/D66431 llvm-svn: 374442	2019-10-10 20:25:51 +00:00
Russell Gallop	6d6ec1b869	[LLD][ELF] Fix stale comments about doing ICF Differential Revision: https://reviews.llvm.org/D68396 llvm-svn: 374362	2019-10-10 14:50:02 +00:00
Rui Ueyama	9adea6e4fa	Make nullptr check more robust The only condition that isecLoc becomes null is Out::bufferStart == nullptr, isec->getParent()->offset == 0, and isec->outSecOff == 0. We can check the first condition only once. llvm-svn: 374332	2019-10-10 12:41:08 +00:00
Roman Lebedev	1508fbad79	[lld] getErrPlace(): don't perform arithmetics on maybe-null pointer isecLoc there can be null, but at the same time isec->getSize() may be non-null. It is UB to offset a nullptr.The most straight-forward fix here appears to perform casts+normal integral arithmetics. FAIL: lld :: ELF/invalid/invalid-relocation-aarch64.test (1158 of 2217) ****************** TEST 'lld :: ELF/invalid/invalid-relocation-aarch64.test' FAILED **************** Script: -- : 'RUN: at line 2'; /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm_build_ubsan/bin/yaml2obj /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm-project/lld/test/ELF/invalid/invalid-relocation-aarch64.test -o /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm_build_ubsan/tools/lld/test/ELF/invalid/Output/invalid-relocation-aarch64.test.tmp.o : 'RUN: at line 3'; not /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm_build_ubsan/bin/ld.lld /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm_build_ubsan/tools/lld/test/ELF/invalid/Output/invalid-relocation-aarch64.test.tmp.o -o /dev/null 2>&1 \| /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm_build_ubsan/bin/FileCheck /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm-project/lld/test/ELF/invalid/invalid-relocation-aarch64.test -- Exit Code: 1 Command Output (stderr): -- /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm-project/lld/test/ELF/invalid/invalid-relocation-aarch64.test:4:10: error: CHECK: expected string not found in input # CHECK: error: unknown relocation (1024) against symbol foo ^ <stdin>:1:1: note: scanning from here /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm-project/lld/ELF/Target.cpp💯41: runtime error: applying non-zero offset 24 to null pointer ^ <stdin>:1:118: note: possible intended match here /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm-project/lld/ELF/Target.cpp💯41: runtime error: applying non-zero offset 24 to null pointer ^ -- **************** Testing: 0.. 10.. 20.. 30.. 40.. 50. FAIL: lld :: ELF/invalid/invalid-relocation-x64.test (1270 of 2217) **************** TEST 'lld :: ELF/invalid/invalid-relocation-x64.test' FAILED **************** Script: -- : 'RUN: at line 2'; /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm_build_ubsan/bin/yaml2obj /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm-project/lld/test/ELF/invalid/invalid-relocation-x64.test -o /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm_build_ubsan/tools/lld/test/ELF/invalid/Output/invalid-relocation-x64.test.tmp1.o : 'RUN: at line 3'; echo ".global foo; foo:" > /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm_build_ubsan/tools/lld/test/ELF/invalid/Output/invalid-relocation-x64.test.tmp2.s : 'RUN: at line 4'; /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm_build_ubsan/bin/llvm-mc /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm_build_ubsan/tools/lld/test/ELF/invalid/Output/invalid-relocation-x64.test.tmp2.s -o /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm_build_ubsan/tools/lld/test/ELF/invalid/Output/invalid-relocation-x64.test.tmp2.o -filetype=obj -triple x86_64-pc-linux : 'RUN: at line 5'; not /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm_build_ubsan/bin/ld.lld /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm_build_ubsan/tools/lld/test/ELF/invalid/Output/invalid-relocation-x64.test.tmp1.o /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm_build_ubsan/tools/lld/test/ELF/invalid/Output/invalid-relocation-x64.test.tmp2.o -o /dev/null 2>&1 \| /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm_build_ubsan/bin/FileCheck /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm-project/lld/test/ELF/invalid/invalid-relocation-x64.test -- Exit Code: 1 Command Output (stderr): -- /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm-project/lld/test/ELF/invalid/invalid-relocation-x64.test:6:10: error: CHECK: expected string not found in input # CHECK: error: unknown relocation (152) against symbol foo ^ <stdin>:1:1: note: scanning from here /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm-project/lld/ELF/Target.cpp💯41: runtime error: applying non-zero offset 24 to null pointer ^ <stdin>:1:118: note: possible intended match here /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm-project/lld/ELF/Target.cpp💯41: runtime error: applying non-zero offset 24 to null pointer ^ -- **************** Testing: 0.. 10.. 20.. 30.. 40.. 50.. 60.. 70.. 80.. 90.. Testing Time: 20.73s ****************** Failing Tests (2): lld :: ELF/invalid/invalid-relocation-aarch64.test lld :: ELF/invalid/invalid-relocation-x64.test llvm-svn: 374329	2019-10-10 12:22:55 +00:00
Fangrui Song	d79c3be618	[COFF] Wrap definitions in namespace lld { namespace coff {. NFC Similar to D67323, but for COFF. Many lld/COFF/ files already use `namespace lld { namespace coff {`. Only a few need changing. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D68772 llvm-svn: 374314	2019-10-10 11:27:58 +00:00
Rui Ueyama	37bf9bb405	Use error instead of fatal to report usage errors Differential Revision: https://reviews.llvm.org/D68768 llvm-svn: 374297	2019-10-10 09:46:41 +00:00
Martin Storsjo	0226c35262	[LLD] [MinGW] Look for other library patterns with -l GNU ld looks for a number of other patterns than just lib<name>.dll.a and lib<name>.a. GNU ld does support linking directly against a DLL without using an import library. If that's the only match for a -l argument, point out that the user needs to use an import library, instead of leaving the user with a puzzling message about the -l argument not being found at all. Also convert an existing case of fatal() into error(). Differential Revision: https://reviews.llvm.org/D68689 llvm-svn: 374292	2019-10-10 08:52:39 +00:00
Martin Storsjo	e742794ffa	[LLD] [MinGW] Add a testcase for -l:name style library options. NFC. Differential Revision: https://reviews.llvm.org/D68688 llvm-svn: 374291	2019-10-10 08:52:31 +00:00
Rui Ueyama	d7ead5b58d	Improve error message for bad SHF_MERGE sections This patch adds a section name to error messages. Differential Revision: https://reviews.llvm.org/D68758 llvm-svn: 374290	2019-10-10 08:32:12 +00:00
Fangrui Song	33c59abf5c	[WebAssembly] Wrap definitions in namespace lld { namespace wasm {. NFC Similar to D68323, but for wasm. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D68759 llvm-svn: 374279	2019-10-10 05:25:39 +00:00
Sam Clegg	ad2e12a3d9	[lld][WebAssembly] Refactor markLive.cpp. NFC This pattern matches the ELF implementation add if also useful as part of a planned change where running `mark` more than once is needed. Differential Revision: https://reviews.llvm.org/D68749 llvm-svn: 374275	2019-10-10 03:23:06 +00:00
Nico Weber	79a8476d43	dummy comment typo fix commit to cycle the bots llvm-svn: 374270	2019-10-10 02:04:56 +00:00
Hans Wennborg	1e1e3ba252	Unify the two CRC implementations David added the JamCRC implementation in r246590. More recently, Eugene added a CRC-32 implementation in r357901, which falls back to zlib's crc32 function if present. These checksums are essentially the same, so having multiple implementations seems unnecessary. This replaces the CRC-32 implementation with the simpler one from JamCRC, and implements the JamCRC interface in terms of CRC-32 since this means it can use zlib's implementation when available, saving a few bytes and potentially making it faster. JamCRC took an ArrayRef<char> argument, and CRC-32 took a StringRef. This patch changes it to ArrayRef<uint8_t> which I think is the best choice, and simplifies a few of the callers nicely. Differential revision: https://reviews.llvm.org/D68570 llvm-svn: 374148	2019-10-09 09:06:30 +00:00
Rui Ueyama	07775b207a	Use lld-link instead of llvm-dlltool to create an implib Suggested by Martin Storsjö. llvm-svn: 374142	2019-10-09 07:04:38 +00:00
Rui Ueyama	c3c5e0fbbf	[lld] Don't create hints-section if Hint/Name Table is empty Fixes assert in addLinkerModuleCoffGroup() when using by-ordinal imports only. Patch by Stefan Schmidt. Differential revision: https://reviews.llvm.org/D68352 llvm-svn: 374140	2019-10-09 06:48:24 +00:00
Sid Manning	aca5d395d5	[lld][Hexagon] Support PLT relocation R_HEX_B15_PCREL_X/R_HEX_B9_PCREL_X These are sometimes generated by tail call optimizations. Differential Revision: https://reviews.llvm.org/D66542 llvm-svn: 374052	2019-10-08 14:23:49 +00:00
Rui Ueyama	d2e9dd3877	Use /dev/null for tests that we do not need outputs llvm-svn: 374023	2019-10-08 08:03:44 +00:00
Rui Ueyama	5493366729	Report error if -export-dynamic is used with -r The combination of the two flags doesn't make sense. And other linkers seem to just ignore --export-dynamic if --relocatable is given, but we probably should report it as an error to let users know that is an invalid combination. Fixes https://bugs.llvm.org/show_bug.cgi?id=43552 Differential Revision: https://reviews.llvm.org/D68441 llvm-svn: 374022	2019-10-08 08:03:40 +00:00
Fangrui Song	24ec80425a	[ELF][MIPS] De-template writeValue. NFC Depends on D68561. llvm-svn: 373886	2019-10-07 08:52:07 +00:00
Fangrui Song	bd8cfe65f5	[ELF] Wrap things in `namespace lld { namespace elf {`, NFC This makes it clear `ELF/*/.cpp` files define things in the `lld::elf` namespace and simplifies `elf::foo` to `foo`. Reviewed By: atanasyan, grimar, ruiu Differential Revision: https://reviews.llvm.org/D68323 llvm-svn: 373885	2019-10-07 08:31:18 +00:00
Fangrui Song	5761e3cef4	[ELF][MIPS] Use lld:🧝:{read,write}* instead of llvm::support::endian::{read,write}* This allows us to delete `using namespace llvm::support::endian` and simplify D68323. This change adds runtime config->endianness check but the overhead should be negligible. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D68561 llvm-svn: 373884	2019-10-07 08:30:46 +00:00
Martin Storsjo	9809ed6135	[LLD] [COFF] Always demangle the __imp_ prefix to __declspec(dllimport) Differential Revision: https://reviews.llvm.org/D68017 llvm-svn: 373781	2019-10-04 19:47:59 +00:00
Martin Storsjo	4d1405ad87	Revert r371732: "lld-link: Fix tests that do not run on macOS after r371729." This commit should be reverted along with r371729. llvm-svn: 373713	2019-10-04 08:34:26 +00:00
Fangrui Song	7588cf09da	[ELF] Use union-find set and doubly linked list in Call-Chain Clustering (C³) heuristic Before, SecToClusters[*] was used to track the belonged cluster. During a merge (From -> Into), every element of From has to be updated. Use a union-find set to speed up this use case. Also, replace `std::vector<int> Sections;` with a doubly-linked pointers: int Next, Prev; Reviewed By: Bigcheese Differential Revision: https://reviews.llvm.org/D46228 llvm-svn: 373708	2019-10-04 07:56:54 +00:00
Rui Ueyama	e4758a5c27	[MinGW] Add --reproduce option Differential Revision: https://reviews.llvm.org/D68382 llvm-svn: 373705	2019-10-04 07:27:45 +00:00
Rui Ueyama	0d53ac8096	Add /reproduce option to lld/COFF This patch adds /reproduce:<path> option to lld/COFF. This is an lld-specific option, so we can name it freely. I chose /reproduce over other names (e.g. /lldlinkrepro) for consistency with other lld ports. Differential Revision: https://reviews.llvm.org/D68381 llvm-svn: 373704	2019-10-04 07:27:38 +00:00
Rui Ueyama	6785824431	Revert r371729: lld-link: Make /linkrepro: take a filename, not a directory. This reverts commit r371729 because /linkrepro option also exists in Microsoft link.exe and their linker takes not a filename but a directory name as an argument for /linkrepro. Differential Revision: https://reviews.llvm.org/D68378 llvm-svn: 373703	2019-10-04 07:27:31 +00:00
Jordan Rupprecht	9d4a6b1bb2	[llvm-objdump] Further rearrange llvm-objdump sections for compatability Summary: rL371826 rearranged some output from llvm-objdump for GNU objdump compatability, but there still seem to be some more. I think this rearrangement is a little closer. Overview of the ordering which matches GNU objdump: * Archive headers * File headers * Section headers * Symbol table * Dwarf debugging * Relocations (if `--disassemble` is not used) * Section contents * Disassembly Reviewers: jhenderson, justice_adams, grimar, ychen, espindola Reviewed By: jhenderson Subscribers: aprantl, emaste, arichardson, jrtc27, atanasyan, seiya, llvm-commits, MaskRay Tags: #llvm Differential Revision: https://reviews.llvm.org/D68066 llvm-svn: 373671	2019-10-03 22:01:08 +00:00
Simon Atanasyan	afe7197f13	[mips] Use llvm-readobj `-A` flag in test cases. NFC llvm-svn: 373589	2019-10-03 12:08:04 +00:00
Peter Collingbourne	42ec0b777f	Add missing REQUIRES: arm. llvm-svn: 373357	2019-10-01 17:35:56 +00:00
Peter Collingbourne	0bb825d208	ELF: Add .interp synthetic sections first in createSyntheticSections(). Our .interp section is not a SyntheticSection. As a result, it terminates the loop in removeUnusedSyntheticSections(). This has at least two consequences: - The synthetic .bss and .bss.rel.ro sections are always present in dynamically linked executables, even when they are not needed. - The synthetic .ARM.exidx (and possibly other) sections are always present in partitions other than the last one, even when not needed. .ARM.exidx in particular is problematic because it assumes that its list of code sections is non-empty in getLinkOrderDep(), which can lead to a crash if the partition does not have any code sections. Fix these problems by moving the creation of the .interp sections to the top of createSyntheticSections(). While here, make the code a little less error-prone by changing the add() lambdas to take a SyntheticSection instead of an InputSectionBase. Differential Revision: https://reviews.llvm.org/D68256 llvm-svn: 373347	2019-10-01 16:10:13 +00:00
Peter Collingbourne	97e251e05a	ELF: Don't merge SHF_LINK_ORDER sections for different output sections in relocatable links. Merging SHF_LINK_ORDER sections can affect semantics if the sh_link fields point to different sections. Specifically, for SHF_LINK_ORDER sections, the sh_link field acts as a reverse dependency from the linked section, causing the SHF_LINK_ORDER section to be included if the linked section is included. Merging sections with different sh_link fields will cause the entire contents of the SHF_LINK_ORDER section to be associated with a single (arbitrarily chosen) output section, whereas the correct semantics are for the individual pieces of the SHF_LINK_ORDER section to be associated with their linked output sections. As a result we can end up incorrectly dropping SHF_LINK_ORDER section contents or including the wrong section contents, depending on which linked sections were chosen. Differential Revision: https://reviews.llvm.org/D68094 llvm-svn: 373255	2019-09-30 20:23:00 +00:00
Nico Weber	2331cd69cd	Revert "[MC] Emit unused undefined symbol even if its binding is not set" This reverts r373168. It caused PR43511. llvm-svn: 373242	2019-09-30 18:13:48 +00:00
Fangrui Song	4006434ed1	[ELF][test] Change llvm-readobj --arm-attributes to --arch-specific after r373125 llvm-svn: 373178	2019-09-30 04:24:35 +00:00
Fangrui Song	c513360662	[MC] Emit unused undefined symbol even if its binding is not set For the following two cases, we currently suppress the symbols. This patch emits them (compatible with GNU as). * `test2_a = undef`: if `undef` is otherwise unused. * `.hidden hidden`: if `hidden` is unused. This is the main point of the patch, because omitting the symbol would cause a linker semantic difference. It causes a behavior change that is not compatible with GNU as: .weakref foo1, bar1 When neither foo1 nor bar1 is used, we now emit bar1, which is arguably more consistent. Another change is that we will emit .TOC. for .TOC.@tocbase . For this directive, suppressing .TOC. can be seen as a size optimization, but we choose to drop it for simplicity and consistency. llvm-svn: 373168	2019-09-29 15:26:12 +00:00
Martin Storsjo	ac7c6d554d	[LLD] [test] Add a forgotten comment. NFC. This was requested in https://reviews.llvm.org/D68014, but I forgot to add it before pushing the commit. llvm-svn: 373145	2019-09-28 09:51:02 +00:00
Martin Storsjo	5ebab1f8f9	[LLD] Simplify the demangleItanium function. NFC. Instead of returning an optional, just return the input string if demangling fails, as that's what all callers use anyway. Differential Revision: https://reviews.llvm.org/D68015 llvm-svn: 373077	2019-09-27 12:24:18 +00:00
Martin Storsjo	dd71b2d4c3	[LLD] Convert demangleItanium to use the higher level llvm::demangle function. NFC. This avoids a few lines of boilerplate of dealing with C string allocations. Add a testcase for a case where demangling shouldn't happen. Differential Revision: https://reviews.llvm.org/D68014 llvm-svn: 373076	2019-09-27 12:24:03 +00:00
Martin Storsjo	bf6f4e9932	[LLD] [COFF] Use the unified llvm demangle frontend function. NFC. Add test cases for some cases where we don't want demangling to happen. Differential Revision: https://reviews.llvm.org/D67301 llvm-svn: 373075	2019-09-27 12:23:45 +00:00
Fangrui Song	f1e1451946	[ELF] Set SectionBase::partition in processSectionCommands Fixes PR43461 (regression caused by D67504) The partition field of a SECTIONS-specified section is not set after D67504. The 0 value affects findSection() which checks if the partition field is 1. So `Out::initArray = findSection(".init_array")` is null, and DT_INIT_ARRAYSZ is not set. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D68087 llvm-svn: 372996	2019-09-26 17:10:09 +00:00
Matt Davis	4d17cdc704	[lld][mach-o] Avoid segfaulting when handling an empty section list. Summary: The following patch avoids segfaulting if the section list is empty when writing a mach-o MH_OBJECT. I ran into this case from a more complicated example trying to dead_strip while using '-r' in lld. I'm not sure if having empty sections is a legal mach-o, but it does seem that other llvm-binutils tools can ingest such a boring object with out issue. Would it be better to emit an error, emit a warning, or do nothing? It seems that adding a warning diagnostic might be helpful to users, as I did not expect to have a section-less object when the linker was done. Reviewers: kledzik, ruiu Subscribers: llvm-commits, jrm Tags: #lld, #llvm Differential Revision: https://reviews.llvm.org/D67735 llvm-svn: 372995	2019-09-26 17:03:20 +00:00
Simon Atanasyan	fba48fcf44	[mips] Relax jalr/jr instructions using R_MIPS_JALR relocation The R_MIPS_JALR relocation denotes jalr/jr instructions in position independent code. Both these instructions take a target's address from the $25 register. If offset to the target symbol fits into the 18-bits, it's more efficient to replace jalr/jr by bal/b instructions. Differential Revision: https://reviews.llvm.org/D68057 llvm-svn: 372951	2019-09-26 09:13:20 +00:00
Martin Storsjo	c4046ff04d	[LLD] Fix testcase from SVN r372843 if executed on windows This should fix buildbot errors like this one: http://lab.llvm.org:8011/builders/clang-x64-windows-msvc/builds/10997 llvm-svn: 372852	2019-09-25 12:07:50 +00:00
Martin Storsjo	1d06d48bb3	[LLD] [COFF] Resolve source locations for undefined references using dwarf This fixes PR42407. Differential Revision: https://reviews.llvm.org/D67053 llvm-svn: 372843	2019-09-25 11:03:48 +00:00
Fangrui Song	0264950697	[ELF] Add -z separate-loadable-segments to complement separate-code and noseparate-code D64906 allows PT_LOAD to have overlapping p_offset ranges. In the default R RX RW RW layout + -z noseparate-code case, we do not tail pad segments when transiting to another segment. This can save at most 3*maxPageSize bytes. a) Before D64906, we tail pad R, RX and the first RW. b) With -z separate-code, we tail pad R and RX, but not the first RW (RELRO). In some cases, b) saves one file page. In some cases, b) wastes one virtual memory page. The waste is a concern on Fuchsia. Because it uses compressed binaries, it doesn't benefit from the saved file page. This patch adds -z separate-loadable-segments to restore the behavior before D64906. It can affect section addresses and can thus be used as a debugging mechanism (see PR43214 and ld.so partition bug in crbug.com/998712). Reviewed By: jakehehrlich, ruiu Differential Revision: https://reviews.llvm.org/D67481 llvm-svn: 372807	2019-09-25 03:39:31 +00:00
Bob Haarman	9f0f36e022	[ELF] accept thinlto options without --plugin-opt= prefix Summary: When support for ThinLTO was first added to lld, the options that control it were prefixed with --plugin-opt= for compatibility with an existing implementation as a linker plugin. This change enables shorter versions of the options to be used, as follows: New Existing -thinlto-emit-imports-files --plugin-opt=thinlto-emit-imports-files -thinlto-index-only --plugin-opt=thinlto-index-only -thinlto-index-only= --plugin-opt=thinlto-index-only= -thinlto-object-suffix-replace= --plugin-opt=thinlto-object-suffix-replace= -thinlto-prefix-replace= --plugin-opt=thinlto-prefix-replace= -lto-obj-path= --plugin-opt=obj-path= The options with the --plugin-opt= prefix have been retained as aliases for the shorter variants so that they continue to be accepted. Reviewers: tejohnson, ruiu, espindola Reviewed By: ruiu Subscribers: emaste, arichardson, MaskRay, steven_wu, dexonsmith, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67782 llvm-svn: 372798	2019-09-25 01:19:48 +00:00
Peter Smith	06b3e3421a	[ELF][ARM] Fix crash when discarding InputSections that have .ARM.exidx When /DISCARD/ is used on an input section, that input section may have a .ARM.exidx metadata section that depends on it. As the discard handling comes after the .ARM.exidx synthetic section is created we need to make sure that we account for the case where the .ARM.exidx output section should be removed because there are no more live input sections. Differential Revision: https://reviews.llvm.org/D67848 llvm-svn: 372781	2019-09-24 21:44:14 +00:00
Sam Clegg	937b955837	[lld][WebAssembly] Fix static linking of -fPIC code with external undefined functions Differential Revision: https://reviews.llvm.org/D66784 llvm-svn: 372779	2019-09-24 20:52:12 +00:00
Bob Haarman	19712415a5	[NFC][COFF] fix typo in comment ("algortihm" -> "algorithm") llvm-svn: 372776	2019-09-24 20:17:54 +00:00
George Rimar	355764e388	[LLD][ELF][MIPS] - Inline the short helper function. NFC. It was requested in a post-commit comment for r372570. llvm-svn: 372747	2019-09-24 12:53:53 +00:00
Fangrui Song	e447d5afd3	[ELF] Delete SectionBase::assigned D67504 removed uses of `assigned` from OutputSection::addSection, which makes `assigned` purely used in processSectionCommands() and its callees. By replacing its references with `parent`, we can remove `assigned`. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D67531 llvm-svn: 372735	2019-09-24 11:48:46 +00:00
Fangrui Song	e47bbd28f8	[ELF] Make MergeInputSection merging aware of output sections Fixes PR38748 mergeSections() calls getOutputSectionName() to get output section names. Two MergeInputSections may be merged even if they are made different by SECTIONS commands. This patch moves mergeSections() after processSectionCommands() and addOrphanSections() to fix the issue. The new pass is renamed to OutputSection::finalizeInputSections(). processSectionCommands() and addorphanSections() are changed to add sections to InputSectionDescription::sectionBases. finalizeInputSections() merges MergeInputSections and migrates `sectionBases` to `sections`. For the -r case, we drop an optimization that tries keeping sh_entsize non-zero. This is for the simplicity of addOrphanSections(). The updated merge-entsize2.s reflects the change. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D67504 llvm-svn: 372734	2019-09-24 11:48:31 +00:00
Sam Clegg	f6f4b98f03	[lld][WebAssembly] Preserve symbol flags in --relocatable output Fixes https://github.com/emscripten-core/emscripten/issues/8879 Differential Revision: https://reviews.llvm.org/D67729 llvm-svn: 372660	2019-09-23 21:28:29 +00:00
Simon Atanasyan	4750d79ac6	[mips] Support elf32btsmipn32_fbsd / elf32ltsmipn32_fbsd emulations Patch by Kyle Evans. llvm-svn: 372651	2019-09-23 20:32:43 +00:00
Simon Atanasyan	25a7dd8553	[mips] Add tests to check MIPS FreeBSD emulations. NFC llvm-svn: 372650	2019-09-23 20:32:33 +00:00
Simon Atanasyan	fa9a4aa197	[mips] Reformat test case to simplify addition new tests. NFC llvm-svn: 372649	2019-09-23 20:32:27 +00:00
George Rimar	c60913f162	[LLD][ELF] - Simplify getFlagsFromEmulation(). NFCI. A straightforward simplification. llvm-svn: 372570	2019-09-23 09:55:10 +00:00
Simon Atanasyan	e03007cb4e	[mips] Deduce MIPS specific ELF header flags from `emulation` In case of linking binary blobs which do not have any ELF headers, we can deduce MIPS ABI ELF header flags from an `emulation` option. Patch by Kyle Evans. llvm-svn: 372513	2019-09-22 16:26:39 +00:00
Fangrui Song	2672051495	[ELF] Error if the linked-to section of a SHF_LINK_ORDER section is discarded Summary: If st_link(A)=B, and A has the SHF_LINK_ORDER flag, we may dereference a null pointer if B is garbage collected (PR43147): 1. In Wrter.cpp:compareByFilePosition, `aOut->sectionIndex` or `bOut->sectionIndex` 2. In OutputSections::finalize, `d->getParent()->sectionIndex` Simply error and bail out to avoid null pointer dereferences. ld.bfd has a similar error: sh_link of section `.bar' points to discarded section `.foo0' of `a.o' ld.bfd is more permissive in that it just checks whether the linked-to section of the first input section is discarded. This is likely because it sets sh_link of the output section according to the first input section. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D67761 llvm-svn: 372400	2019-09-20 15:03:21 +00:00
Thomas Lively	0c3d4cfbad	[WebAssembly][NFC] Remove unnecessary braces llvm-svn: 372358	2019-09-19 21:51:52 +00:00
Mitch Phillips	f5fcf61566	Don't false-positive match against binary path. copy-rel-abs.s uses llvm-objdump to generate output that's then run through FileCheck. llvm-objdump prints the file path at the top of the file, which means that any build path that contains 'zed' will get false-matched. Ensure that 'zed' is only matched after the 'SYMBOL TABLE:' output, preventing this from failing if your build directory is ~/build/sanitized-xxx/, or similar. llvm-svn: 372351	2019-09-19 20:44:12 +00:00
Thomas Lively	21143b93a6	[WebAssembly] Sort output data sections to place .bss last Summary: This was always the intended behavior, but had not been implemented. This ordering is important for Emscripten when generating .mem files while compiling to JS, since only zeros at the end of initialized memory can be dropped. Fixes https://github.com/emscripten-core/emscripten/issues/8999 Reviewers: sbc100 Subscribers: dschuff, jgravelle-google, aheejin, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67736 llvm-svn: 372284	2019-09-19 01:14:59 +00:00
Sam Clegg	e40ef12bfa	[lld][WebAssembly] Fix use after free of archive path This was fixed in the ELF backend in https://reviews.llvm.org/D34554. Differential Revision: https://reviews.llvm.org/D67676 llvm-svn: 372266	2019-09-18 21:51:03 +00:00
Peter Smith	43d32cdd87	[ELF][AARCH64] Refactor AArchErrataFix to match changes in ARMErrataFix NFC. D67284 introduced ARMErrataFix.cpp which was derived from AArch64ErrataFix.cpp. There were some useful refactoring changes made to ARMErrataFix.cpp made as part of the review. This change applies the relevant changes back to AArch64ErrataFix.cpp. Main changes are: - Old style variable names in comments like IS, are now new style isec. - Simplify init() collection of mappingSymbols to always start with a code mapping symbol. - Simplify logic in mergeCmp(). - Fix one 80 column overflow caused by IS -> isec transformation. Differential Revision: https://reviews.llvm.org/D67622 llvm-svn: 372094	2019-09-17 09:49:30 +00:00
Fangrui Song	4816e516e5	[ELF][Hexagon] Allow PT_LOAD to have overlapping p_offset ranges on EM_HEXAGON Port the D64906 technique to EM_HEXAGON. This concludes the patch series. Differential Revision: https://reviews.llvm.org/D67605 llvm-svn: 372059	2019-09-17 02:45:38 +00:00
Steven Wu	dd63b9f570	[lld] Update lld driver to use new LTO APIs to handle libcall symbols NFC. Remove duplicated code in ELF/COFF driver and libLTO legacy interfaces. llvm-svn: 372022	2019-09-16 18:49:57 +00:00
Peter Smith	1d74940b31	[ELF][ARM] Fix -Werror buildbots NFC. Provide a missing initializer to get rid of warning provoking buildbot failures. error: missing field 'rel' initializer [-Werror,-Wmissing-field-initializers] llvm-svn: 371970	2019-09-16 10:07:53 +00:00
Peter Smith	ea99ce5e9b	[ELF][ARM] Implement --fix-cortex-a8 to fix erratum 657417 The --fix-cortex-a8 option implements a linker workaround for the coretex-a8 erratum 657417. A summary of the erratum conditions is: - A 32-bit Thumb-2 branch instruction B.w, Bcc.w, BL, BLX spans two 4KiB regions. - The destination of the branch is to the first 4KiB region. - The instruction before the branch is a 32-bit Thumb-2 non-branch instruction. The linker fix is to redirect the branch to a patch not in the first 4KiB region. The patch forwards the branch on to its target. The cortex-a8, is an old CPU, with the first implementation of this workaround in ld.bfd appearing in 2009. The cortex-a8 has been used in early Android Phones and there are some critical applications that still need to run on a cortex-a8 that have the erratum. The patch is applied roughly 10 times on LLD and 20 on Clang when they are built with --fix-cortex-a8 on an Arm system. The formal erratum description is avaliable in the ARM Core Cortex-A8 (AT400/AT401) Errata Notice document. This is available from Arm on request but it seems to be findable via a web search. Differential Revision: https://reviews.llvm.org/D67284 llvm-svn: 371965	2019-09-16 09:38:38 +00:00
Fangrui Song	d4306e90cb	[ELF][X86] Allow PT_LOAD to have overlapping p_offset ranges on EM_X86_64 Port the D64906 technique to EM_X86_64. Differential Revision: https://reviews.llvm.org/D67482 llvm-svn: 371958	2019-09-16 07:05:34 +00:00
Fangrui Song	06bb7dfbd4	[ELF] Map the ELF header at imageBase If there is no readonly section, we map: * The ELF header at imageBase+maxPageSize * Program headers at imageBase+maxPageSize+sizeof(Ehdr) * The first section .text at imageBase+maxPageSize+sizeof(Ehdr)+sizeof(program headers) Due to the interaction between Writer<ELFT>::fixSectionAlignments and LinkerScript::allocateHeaders, `alignDown(p_vaddr(R PT_LOAD)) = alignDown(p_vaddr(RX PT_LOAD))`. The RX PT_LOAD will override the R PT_LOAD at runtime, which is not ideal: ``` // PHDR at 0x401034, should be 0x400034 PHDR 0x000034 0x00401034 0x00401034 0x000a0 0x000a0 R 0x4 // R PT_LOAD contains just Ehdr and program headers. // At 0x401000, should be 0x400000 LOAD 0x000000 0x00401000 0x00401000 0x000d4 0x000d4 R 0x1000 LOAD 0x0000d4 0x004010d4 0x004010d4 0x00001 0x00001 R E 0x1000 ``` * createPhdrs allocates the headers to the R PT_LOAD. * fixSectionAlignments assigns `imageBase+maxPageSize+sizeof(Ehdr)+sizeof(program headers)` (formula: `alignTo(dot, maxPageSize) + dot % config->maxPageSize`) to addrExpr of .text * allocateHeaders computes the minimum address among SHF_ALLOC sections, i.e. addr(.text) * allocateHeaders sets address of ELF header to `addr(.text)-sizeof(Ehdr)-sizeof(program headers) = imageBase+maxPageSize` The main observation is that when the SECTIONS command is not used, we don't have to call allocateHeaders. This requires an assumption that the presence of PT_PHDR and addresses of headers can be decided regardless of address information. This may seem natural because dot is not manipulated by a linker script. The other thing is that we have to drop the special rule for -T<section> in `getInitialDot`. If -Ttext is smaller than the image base, the headers will not be allocated with the old behavior (allocateHeaders is called) but always allocated with the new behavior. The behavior change is not a problem. Whether and where headers are allocated can vary among linkers, or ld.bfd across different versions (--enable-separate-code or not). It is thus advised to use a linker script with the PHDRS command to have a consistent behavior across linkers. If PT_PHDR is needed, an explicit --image-base can be a simpler alternative. Differential Revision: https://reviews.llvm.org/D67325 llvm-svn: 371957	2019-09-16 07:04:16 +00:00
Nico Weber	c7d8cc48c1	lld-link: Make Options.td formatting more self-consistent. Also tighten up help strings for /force, --start-lib, and --end-lib. Differential Revision: https://reviews.llvm.org/D67457 llvm-svn: 371927	2019-09-14 23:41:42 +00:00
Nico Weber	d48ea5da94	lld-link: Add a flag /lldignoreenv that makes lld-link ignore env vars. This is useful for enforcing that builds are independent of the environment; it can be used when all system library paths are added via /libpath: already. It's similar ot cl.exe's /X flag. Since it should also affect %LINK% (the other caller of `Process::GetEnv` in lld/COFF), the early-option-parsing needs to move around a bit. The options are: - Add a manual loop over the argv ArrayRef and look for "/lldignoreenv". This repeats the name of the flag in both Options.td and in DriverUtils.cpp. - Add yet another table.ParseArgs() call just for /lldignoreenv before adding %LINK%. - Use the existing early ParseArgs() that's there for --rsp-quoting and use it for /lldignoreenv for %LINK% as well. This means --rsp-quoting and /lldignoreenv can't be passed via %LINK%. I went with the third approach. Differential Revision: https://reviews.llvm.org/D67456 llvm-svn: 371852	2019-09-13 13:13:52 +00:00
Fangrui Song	51ead00bf8	[ELF] Delete a redundant assignment to SectionBase::assigned. NFC LinkerScript::discard marks a section dead. It is unnecessary to set the `assigned` bit. llvm-svn: 371804	2019-09-13 02:18:04 +00:00
Amy Huang	227d85956b	[COFF] Fix to not add archive name to buffer identifiers when they come from thin archives. Currently lld adds the archive name to MemoryBufferRef identifiers in order to ensure they are unique. For thin archives, since the file name is already unique and we want to keep the original path to the file, don't add the archive name. Differential Revision: https://reviews.llvm.org/D67295 llvm-svn: 371778	2019-09-12 22:04:56 +00:00
Fangrui Song	2ad25a4aee	[ELF] ICF: change a dyn_cast<InputSection> to cast ICF is performed after EhInputSections and MergeInputSections were eliminated from inputSections. Every element of inputSections is an InputSection. llvm-svn: 371744	2019-09-12 16:46:19 +00:00
Nico Weber	d0c8004890	lld-link: Fix tests that do not run on macOS after r371729. llvm-svn: 371732	2019-09-12 12:35:34 +00:00
Nico Weber	3c44d595be	lld-link: Make /linkrepro: take a filename, not a directory. This makes lld-link behave like ld.lld. I don't see a reason for the two drivers to have different behavior here. While here, also make lld-link add a version.txt to the tar, like ld.lld does. Differential Revision: https://reviews.llvm.org/D67461 llvm-svn: 371729	2019-09-12 11:44:13 +00:00
Fangrui Song	786ce3fbd6	[ELF] Fix a common-page-size typo llvm-svn: 371716	2019-09-12 08:59:17 +00:00
Fangrui Song	60ff4dd9cd	[ELF] Support -z undefs -z undefs is the inverse of -z defs. It allows unresolved references from object files. This can be used to cancel --no-undefined or -z defs. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D67479 llvm-svn: 371715	2019-09-12 08:55:17 +00:00
Fangrui Song	2aace1ea22	[ELF][test] Make tests more tolerant to exact symbol addresses llvm-svn: 371588	2019-09-11 06:20:14 +00:00
Amy Huang	7b1d793713	Reland "Change the X86 datalayout to add three address spaces for 32 bit signed, 32 bit unsigned, and 64 bit pointers." This reverts `57076d3199`. Original review at https://reviews.llvm.org/D64931. Review for added fix at https://reviews.llvm.org/D66843. llvm-svn: 371568	2019-09-10 23:15:38 +00:00
Simon Atanasyan	6c6f5a9984	[mips] Allow PT_LOAD to have overlapping p_offset ranges on EM_MIPS Port the D64906 <https://reviews.llvm.org/D64906> technique to MIPS. Fix PR33131 llvm-svn: 371554	2019-09-10 20:19:59 +00:00
Fangrui Song	1eda21e214	[ELF][test] Make tests more tolerant to exact symbol addresses Delete relocation-local.s and relocation-shared.s - covered by various tests llvm-svn: 371514	2019-09-10 12:28:07 +00:00
Rui Ueyama	89efb03463	[LLD][COFF] Add index to disambiguate archive members when using -wholearchive Patch by Markus Böck. PR42951: When linking an archive with members that have the same name linking fails when using the -wholearchive option. This patch passes the index of the member in the archive to the offset parameter to disambiguate the member. Differential Revision: https://reviews.llvm.org/D66239 llvm-svn: 371509	2019-09-10 11:50:26 +00:00
Simon Atanasyan	2439b8b0c5	[mips] Make another set of test cases more tolerant to exact symbol addresses. NFC llvm-svn: 371458	2019-09-09 22:04:20 +00:00
Simon Atanasyan	56e4ea2bff	[mips] Fix decoding of microMIPS JALX instruction microMIPS jump and link exchange instruction stores a target in a 26-bits field. Despite other microMIPS JAL instructions these bits are target address shifted right 2 bits [1]. The patch fixes the JALX instruction decoding and uses 2-bit shift. [1] MIPS Architecture for Programmers Volume II-B: The microMIPS32 Instruction Set Differential Revision: https://reviews.llvm.org/D67320 llvm-svn: 371428	2019-09-09 17:28:45 +00:00
Fangrui Song	e8c0d93360	[ELF] nmagic or omagic: don't allocate PT_PHDR or PF_R PT_LOAD for the !hasPhdrsCommands case ``` part.phdrs = script->hasPhdrsCommands() ? script->createPhdrs() : createPhdrs(part); ``` createPhdrs() allocates a PT_PHDR and a PF_R PT_LOAD, which will be deleted later in LinkerScript::allocateHeaders, but leave a gap between the program headers and the first section. Don't allocate the segments to avoid the gap. PT_INTERP is likely not needed as well. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D67324 llvm-svn: 371398	2019-09-09 13:08:51 +00:00
Fangrui Song	298c7a09de	[ELF][AArch64] Apply some NFC cleanups to AArch64ErrataFix.cpp Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D67310 llvm-svn: 371389	2019-09-09 11:22:27 +00:00
Fangrui Song	88796a7988	[ELF][test] Improve and reorganize another set of tests Add file-level comments Replace trivial Input/*.s with echo ... \| llvm-mc Delete insignificant addresses to make them more tolerant to layout changes Simplify test output Merge merge-section-types.s into compatible-section-types.s and add a missed case Merge gnu-ifunc-gotpcrel.s (added in D19517) into gnu-ifunc-dso.s (added in D35119) and add missed cases Delete typed-undef.s - covered by executable-undefined-ignoreall.s Delete emit-relocs-shared.s - covered by emit-relocs-merge.s Replace copy-rel-pie.s and copy-rel-pie2.s with canonical-plt-pcrel.s, canonical-plt-symbolic.s and copy-rel.s: add -no-pie cases. add a case that a canonical PLT can be created for STT_GNU_IFUNC. The logic in Symbols.h was untested: // ctor of SharedSymbol if (this->type == llvm::ELF::STT_GNU_IFUNC) this->type = llvm::ELF::STT_FUNC; llvm-svn: 371361	2019-09-09 03:35:14 +00:00
Simon Atanasyan	9443c18704	[mips] Follow-up to r371313 - fix failed test case. NFC llvm-svn: 371316	2019-09-07 16:31:37 +00:00
Simon Atanasyan	fcef13344d	[mips] Make another set of test cases more tolerant to exact symbol addresses. NFC llvm-svn: 371313	2019-09-07 15:44:40 +00:00
Fangrui Song	0e79890d9b	[ELF][test] Improve tests Add file-level comments Delete insignificant addresses to make them more tolerant to layout changes Simplify test output Delete weak-undef-val.s - covered by relocation-undefined-weak.s Delete weak-undef-export.s - covered by additional test added to weak-undef.s Delete version-undef-sym.s - covered by undefined-versioned-symbol.s => version-symbol-undef.s Delete symbol-ordering-file2.s - covered by symbol-ordering-file.s Delete gotpcrelx.s - covered by gotpc-relax-und-dso.s => x86-64-gotpc-relax-und-dso.s llvm-svn: 371299	2019-09-07 10:42:11 +00:00
Fangrui Song	89f7859641	[ELF][test] Improve LTO tests Add file-level comments Delete insignificant addresses to make them more tolerant to layout changes Simplify test output llvm-svn: 371292	2019-09-07 08:20:09 +00:00
Matthew Voss	c177919409	Update lld tests dynamic-list.s and symbol-override.s to use llvm-nm The following tests failed on Windows bots due to nm not being available: lld/test/ELF/dynamic-list.s lld/test/ELF/symbol-override.s llvm-svn: 371267	2019-09-06 22:51:46 +00:00
Fangrui Song	2682bc3c9d	[ELF] Replace error() with errorOrWarn() for the ASSERT command Summary: ld.bfd produces an output with --noinhibit-exec when an ASSERT fails. Use errorOrWarn() so that we can produce an output as well. An interesting case is that symbol assignments may execute multiple times, so we probably want to suppress errors for non-final runs. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D67285 llvm-svn: 371225	2019-09-06 16:30:22 +00:00
Fangrui Song	8d30c1dcec	Reland D66717 [ELF] Do not ICF two sections with different output sections (by SECTIONS commands) Recommit r370635 (reverted by r371202), with one change: move addOrphanSections() before ICF. Before, orphan sections in two different partitions may be folded and moved to the main partition. Now, InputSection->OutputSection assignment for orphans happens before ICF. ICF does not fold input sections with different output sections. With the PR43241 reproduce, `llvm-objcopy --extract-partition libvr.so libchrome__combined.so libvr.so` => no error Updated description: Fixes PR39418. Complements D47241 (the non-linker-script case). processSectionCommands() assigns input sections to output sections. ICF is called before it, so .text.foo and .text.bar may be folded even if their output sections are made different by SECTIONS commands. ``` markLive<ELFT>() doIcf<ELFT>() // During ICF, we don't know the output sections writeResult() combineEhSections<ELFT>() script->processSectionCommands() // InputSection -> OutputSection assignment ``` This patch splits processSectionCommands() into processSectionCommands() and processSymbolAssignments(), and moves processSectionCommands()/addOrphanSections() before ICF: ``` markLive<ELFT>() combineEhSections<ELFT>() script->processSectionCommands() script->addOrphanSections(); doIcf<ELFT>() // should remove folded input sections writeResult() script->processSymbolAssignments() ``` An alternative approach is to unfold a section `sec` in processSectionCommands() when we find `sec` and `sec->repl` belong to different output sections. I feel this patch is superior because this can fold more sections and the decouple of SectionCommand/SymbolAssignment gives flexibility: * An ExprValue can't be evaluated before its section is assigned to an output section -> we can delete getOutputSectionVA and simplify another place where we had to check if the output section is null. Moreover, a case in linkerscript/early-assign-symbol.s can be handled now. * processSectionCommands/processSymbolAssignments can be freely moved around. llvm-svn: 371216	2019-09-06 15:57:44 +00:00
Fangrui Song	5d9f419a2e	Revert "Revert r370635, it caused PR43241." This reverts commit 50d2dca22b3b05d0ee4883b0cbf93d7d15f241fc. llvm-svn: 371215	2019-09-06 15:57:24 +00:00
Fangrui Song	8f10a5f42d	[ELF][test] Simplify and reorganize tests Add file-level comments Delete insignificant addresses to make them more tolerant to layout changes Simplify test output Delete simple Inputs/*.s files Delete version-script-copy-rel.s - covered by verdef-defaultver.s Delete version-wildcard.test - covered by version-script-glob.s llvm-svn: 371213	2019-09-06 15:10:31 +00:00
Nico Weber	8455294f2a	Revert r370635, it caused PR43241. llvm-svn: 371202	2019-09-06 13:23:42 +00:00
Fangrui Song	70e002b50b	[ELF][test] Update test after r371185 llvm-svn: 371189	2019-09-06 09:39:48 +00:00
Simon Atanasyan	ff2172c9f4	[mips] Make another set of test cases more tolerant to exact symbol addresses. NFC llvm-svn: 371174	2019-09-06 07:23:02 +00:00
Fangrui Song	a2028f73c2	Update SHT_LLVM_PART_EHDR test after r371157 llvm-svn: 371160	2019-09-06 01:18:01 +00:00
Fangrui Song	6dc2bd70bb	[ELF] Initialize PhdrEntry::p_align to maxPageSize for PT_LOAD ``` Writer<ELFT>::run assignFileOffsets setFileOffset computeFileOffset os->ptLoad->p_align may be smaller than config->maxPageSize setPhdrs p_align = max(p_align, config->maxPageSize) ``` If we move the config->maxPageSize logic to the constructor of PhdrEntry, computeFileOffset can be simplified. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D67211 llvm-svn: 371085	2019-09-05 16:32:31 +00:00
Simon Atanasyan	2172f3f49d	[mips] Make a few test cases more tolerant to exact symbol addresses. NFC llvm-svn: 371065	2019-09-05 13:37:16 +00:00
Rui Ueyama	e99dc4ba57	Align output segments correctly Previously, segments were aligned according to their first section's alignment requirements. That was not correct, but segments are also aligned to a page boundary, and a page boundary is usually much larger than a section alignment requirement, so no one noticed this bug before. Now, lld has --nmagic option which sets maxPageSize to 1 to effectively disable page alignment, which reveals the issue. Fixes https://bugs.llvm.org/show_bug.cgi?id=43212 Differential Revision: https://reviews.llvm.org/D67152 llvm-svn: 371013	2019-09-05 05:30:24 +00:00
Martin Storsjo	d581dd5013	[LLD] [COFF] Implement MinGW default manifest handling In mingw environments, resources are normally compiled to resource object files directly, instead of letting the linker convert them to COFF format. Since some time, GCC supports the notion of a default manifest object. When invoking the linker, GCC looks for the default manifest object file, and if found in the expected path, it is added to linker commands. The default manifest is one that indicates support for the latest known versions of windows, to implicitly unlock the modern behaviours of certain APIs. Not all mingw/gcc distributions include this file, but e.g. in msys2, the default manifest object is distributed in a separate package (which can be but might not always be installed). This means that even if user projects only use one single resource object file, the linker can end up with two resource object files, and thus needs to support merging them. The default manifest has a language id of zero, and GNU ld has got logic for dropping a manifest with a zero language id, if there's another manifest present with a nonzero language id. If there are multiple manifests with a nonzero language id, the merging process errors out. Differential Revision: https://reviews.llvm.org/D66825 llvm-svn: 370974	2019-09-04 20:34:00 +00:00
Thomas Lively	09768c5d7a	[WebAssembly] Initialize memory in start function Summary: - `__wasm_init_memory` is now the WebAssembly start function instead of being called from `__wasm_call_ctors` or called directly by the runtime. - Adds a new synthetic data symbol `__wasm_init_memory_flag` that is atomically incremented from zero to one by the thread responsible for initializing memory. - All threads now unconditionally perform data.drop on all passive segments. - Removes --passive-segments and --active-segments flags and controls segment type based on --shared-memory instead. The deleted flags were only present to ameliorate the upgrade path in Emscripten. Reviewers: sbc100, aheejin Subscribers: dschuff, jgravelle-google, sunfish, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65783 llvm-svn: 370965	2019-09-04 19:50:39 +00:00
Fangrui Song	7afffb54ea	[ELF] Don't shrink RelrSection Fixes PR43214. The size of SHT_RELR may oscillate between 2 numbers (see D53003 for a similar --pack-dyn-relocs=android issue). This can happen if the shrink of SHT_RELR causes it to take more words to encode relocation offsets (this can happen with thunks or segments with overlapping p_offset ranges), and the expansion of SHT_RELR causes it to take fewer words to encode relocation offsets. To avoid the issue, add padding 1s to the end of the relocation section if its size would decrease. Trailing 1s do not decode to more relocations. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D67164 llvm-svn: 370923	2019-09-04 16:27:35 +00:00
Fangrui Song	520bdf79b5	[ELF] Fix spell corrector: don't call elf::InputFile::getSymbols() on shared objects Exposed by pr34872.s llvm-svn: 370875	2019-09-04 11:02:58 +00:00
Fangrui Song	b4745fad24	[ELF] Add a spell corrector for "undefined symbol" diagnostics Non-undefined symbols with Levenshtein distance 1 or a transposition are suggestion candidates. This is probably good enough and it can suggest some missing/superfluous qualifiers: const, restrict, volatile, & and && ref-qualifier, e.g. error: undefined symbol: foo(int) >>> referenced by b.o:(.text+0x1) +>>> did you mean: foo(int const) +>>> defined in: a.o error: undefined symbol: foo(int&) >>> referenced by b.o:(.text+0x1) +>>> did you mean: foo(int) +>>> defined in: b.o Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D67039 llvm-svn: 370853	2019-09-04 09:04:26 +00:00
Bob Haarman	7dc5e7a0a4	reland "[lld-link] implement -start-lib and -end-lib" Summary: This is a re-land of r370487 with a fix for the use-after-free bug that rev contained. This implements -start-lib and -end-lib flags for lld-link, analogous to the similarly named options in ld.lld. Object files after -start-lib are included in the link only when needed to resolve undefined symbols. The -end-lib flag goes back to the normal behavior of always including object files in the link. This mimics the semantics of static libraries, but without needing to actually create the archive file. Reviewers: ruiu, smeenai, MaskRay Reviewed By: ruiu, MaskRay Subscribers: akhuang, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66848 llvm-svn: 370816	2019-09-03 20:32:16 +00:00
Ed Maste	5289bbe4d4	ld.lld.1: explain long options may use one or two dashes Obtained from FreeBSD r329003 llvm-svn: 370800	2019-09-03 17:58:30 +00:00
Ed Maste	174e083345	ld.lld.1: stylistic changes suggested by igor igor is an automated man page "proofreader" from FreeBSD - see http://www.wonkity.com/~wblock/igor/igor.pdf No content change. llvm-svn: 370799	2019-09-03 17:58:24 +00:00
Martin Storsjo	a66fc1c99f	[LLD] [COFF] Demangle itanium symbols in mingw mode Differential Revision: https://reviews.llvm.org/D67051 llvm-svn: 370654	2019-09-02 13:25:46 +00:00
Fangrui Song	d8bc6a48ea	[ELF] Do not ICF two sections with different output sections (by SECTIONS commands) Fixes PR39418. Complements D47241 (the non-linker-script case). processSectionCommands() assigns input sections to output sections. ICF is called before it, so .text.foo and .text.bar may be folded even if their output sections are made different by SECTIONS commands. ``` markLive<ELFT>() doIcf<ELFT>() // During ICF, we don't know the output sections writeResult() combineEhSections<ELFT>() script->processSectionCommands() // InputSection -> OutputSection assignment ``` This patch splits processSectionCommands() into processSectionCommands() and processSymbolAssignments(), and moves processSectionCommands() before ICF: ``` markLive<ELFT>() combineEhSections<ELFT>() script->processSectionCommands() doIcf<ELFT>() // should remove folded input sections writeResult() script->processSymbolAssignments() ``` An alternative approach is to unfold a section `sec` in processSectionCommands() when we find `sec` and `sec->repl` belong to different output sections. I feel this patch is superior because this can fold more sections and the decouple of SectionCommand/SymbolAssignment gives flexibility: * An ExprValue can't be evaluated before its section is assigned to an output section -> we can delete getOutputSectionVA and simplify another place where we had to check if the output section is null. Moreover, a case in linkerscript/early-assign-symbol.s can be handled now. * processSectionCommands/processSymbolAssignments can be freely moved around. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D66717 llvm-svn: 370635	2019-09-02 10:33:58 +00:00
Fangrui Song	4514ac7cfb	[ELF] Align SHT_LLVM_PART_EHDR to a maximum page size boundary Fixes https://bugs.chromium.org/p/chromium/issues/detail?id=998712 SHT_LLVM_PART_EHDR marks the start of a partition. The partition sections will be extracted to a separate file. Align to the next maximum page size boundary so that we can find the ELF header at the start. We cannot benefit from overlapping p_offset ranges with the previous segment anyway. It seems we lack some llvm-objcopy --extract-main-partition and --extract-partition sanity checks. It may place EHDR at the start even if p_offset if non zero. Anyway, the lld change is justified for the reasons above. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D67032 llvm-svn: 370629	2019-09-02 08:49:50 +00:00
Vlad Tsyrklevich	802aab5de8	Revert "[lld-link] implement -start-lib and -end-lib" This reverts commit r370487 as it is causing ASan/MSan failures on sanitizer-x86_64-linux-fast llvm-svn: 370550	2019-08-30 23:24:41 +00:00
Sam Clegg	fecfc5920a	[lld][WebAssembly] Fix spurious signature mismatch warnings Summary: This a follow up on: https://reviews.llvm.org/D62153 Handle the case where there are multiple object files that contain undefined references to the same function. We only generate a function variant if the existing symbol is directly called. See: https://github.com/emscripten-core/emscripten/issues/8995 Subscribers: dschuff, jgravelle-google, aheejin, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67015 llvm-svn: 370509	2019-08-30 19:50:59 +00:00
Martin Storsjo	8c189e50c7	[LLD] [COFF] Add a missing REQUIRES line to a recently added test. NFC. This should fix failing buildbots like http://lab.llvm.org:8011/builders/clang-cmake-aarch64-lld/builds/7180. llvm-svn: 370491	2019-08-30 17:26:54 +00:00
Bob Haarman	fd7569c8e3	[lld-link] implement -start-lib and -end-lib Summary: This implements -start-lib and -end-lib flags for lld-link, analogous to the similarly named options in ld.lld. Object files after -start-lib are included in the link only when needed to resolve undefined symbols. The -end-lib flag goes back to the normal behavior of always including object files in the link. This mimics the semantics of static libraries, but without needing to actually create the archive file. Reviewers: ruiu, smeenai, MaskRay Reviewed By: ruiu, MaskRay Subscribers: akhuang, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66848 llvm-svn: 370487	2019-08-30 16:50:10 +00:00
Fangrui Song	688183ec54	[ELF] Set `referenced` bit of Undefined created by BitcodeFile D64136 and D65584, while fixing STB_WEAK issues and improving our compatibility with ld.bfd, can cause another STB_WEAK problem related to LTO: If %tundef.o has an undefined reference on f, and %tweakundef.o has a weak undefined reference on f, %tdef.o has a definition of f ``` ld.lld %tundef.o %tweakundef.o --start-lib %tdef.o --end-lib ``` 1) `%tundef.o` doesn't set the `referenced` bit. 2) `%weakundef.o` changes the binding from STB_GLOBAL to STB_WEAK 3) `%tdef.o` is not fetched because the binding is weak. Step (1) is incorrect. This patch sets the `referenced` bit of Undefined created by bitcode files. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D66992 llvm-svn: 370437	2019-08-30 07:10:30 +00:00
Martin Storsjo	3d3a9b3b41	[LLD] [COFF] Support merging resource object files Extend WindowsResourceParser to support using a ResourceSectionRef for loading resources from an object file. Only allow merging resource object files in mingw mode; keep the existing error on multiple resource objects in link mode. If there only is one resource object file and no .res resources, don't parse and recreate the .rsrc section, but just link it in without inspecting it. This allows users to produce any .rsrc section (outside of what the parser supports), just like before. (I don't have a specific need for this, but it reduces the risk of this new feature.) Separate out the .rsrc section chunks in InputFiles.cpp, and only include them in the list of section chunks to link if we've determined that there only was one single resource object. (We need to keep other chunks from those object files, as they can legitimately contain other sections as well, in addition to .rsrc section chunks.) Differential Revision: https://reviews.llvm.org/D66824 llvm-svn: 370436	2019-08-30 06:56:33 +00:00
Dan Gohman	7cb9c8a506	[WebAssembly] Implement NO_STRIP This patch implements support for the NO_STRIP flag, which will allow __attribute__((used)) to be implemented. This accompanies https://reviews.llvm.org/D62542, which moves to setting the NO_STRIP flag, and will continue to set EXPORTED for Emscripten targets for compatibility. Differential Revision: https://reviews.llvm.org/D66968 llvm-svn: 370416	2019-08-29 22:41:05 +00:00
Nico Weber	3c1996a489	lld: Make a test not fail if "repro" is part of the build directory name r268231 made it so that the name of the --reproduce archive is no longer listed in the response file. Previously, with "--reproduce repro.tar" the response file would contain repro/home/.../llvm-build-dir/.../foo.o but after that change it contained home/.../llvm-build-dir/.../foo.o instead. The test added for this in r268231 checked that the response file doesn't contain the string "repro", but if the build dir is named e.g. "llvm-build-repro" then the test fails because of that. Change the assert to check that "repro" doesn't exist at the beginning of the line instead. I verified that the test still fails with r268231 reverted. The test technically still fails if someone builds llvm in a directory '/repro' below the root directory. Don't do that :) llvm-svn: 370211	2019-08-28 14:33:35 +00:00
Fangrui Song	523f999acf	[ELF][RISCV] Allow PT_LOAD to have overlapping p_offset ranges on EM_RISCV Port the D64906 technique to RISC-V. It deletes 3 alignments at PT_LOAD boundaries for the default case: the size of a RISC-V binary decreases by at most 12kb. llvm-svn: 370192	2019-08-28 12:06:06 +00:00
Rui Ueyama	91864f82c7	[mach-o] Extend LC_DATA_IN_CODE support to x86_64 Patch by LemonBoy Differential Revision: https://reviews.llvm.org/D62185 llvm-svn: 370183	2019-08-28 10:11:50 +00:00
Fangrui Song	54a6f6839b	[ELF][AMDGPU][SPARC] Allow PT_LOAD to have overlapping p_offset ranges on EM_AMDGPU and EM_SPARCV9 llvm-svn: 370180	2019-08-28 09:45:06 +00:00
Fangrui Song	8fbe81fb29	[ELF][RISCV] Assign st_shndx of __global_pointer$ to 1 if .sdata does not exist This essentially reverts the code change of D63132 and switches to a simpler approach. In an executable/shared object, st_shndx of a symbol can be: 1) SHN_UNDEF: undefined symbol (or canonical PLT) 2) SHN_ABS: absolute symbol 3) any other value (usually a regular section index) represents a relative symbol. The actual value does not matter. Many ld.so (musl, all archs except MIPS of FreeBSD rtld-elf) even treat 2) and 3) the same. If .sdata does not exist, it does not matter what value/section __global_pointer$ has, as long as it is relative (otherwise there will be a pedantic lld error. See D63132). Just set the st_shndx arbitrarily to 1. Dummy st_shndx=1 may be used by __rela_iplt_start, linker-script-defined symbols outside a section, __dso_handle, etc. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D66798 llvm-svn: 370172	2019-08-28 09:01:03 +00:00
Vlad Tsyrklevich	57076d3199	Revert "Change the X86 datalayout to add three address spaces for 32 bit signed," This reverts commit r370083 because it caused check-lld failures on sanitizer-x86_64-linux-fast. llvm-svn: 370142	2019-08-28 01:08:54 +00:00
Jacob Gravelle	92ed86d239	[lld][WebAssembly] Support for growable tables Adds --growable-table flag to handle building wasm modules with tables that can grow. Wasm tables that we use to store function pointers. In order to add functions to that table at runtime, we need to either preallocate space, or grow the table. In order to specify a table with no maximum size, we need some flag to handle that case, separately from a potential --max-table-size= flag. Note that the number of elements in the table isn't knowable until link-time, so it's unclear if we will want a --max-table-size= flag in the future. llvm-svn: 370127	2019-08-27 22:58:21 +00:00
Amy Huang	1299945b81	Change the X86 datalayout to add three address spaces for 32 bit signed, 32 bit unsigned, and 64 bit pointers. llvm-svn: 370083	2019-08-27 17:46:53 +00:00
Fangrui Song	024bf27ddf	[ELF][ARM] Allow PT_LOAD to have overlapping p_offset ranges on EM_ARM Port the D64906 technique to ARM. It deletes 3 alignments at PT_LOAD boundaries for the default case: the size of an arm binary decreases by at most 12kb. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D66749 llvm-svn: 370049	2019-08-27 11:52:36 +00:00
Fangrui Song	fdbc2bd2aa	[ELF][ARM] Add --no-show-raw-insn and -soname to some ARM tests Delete some insignificant addresses to make it simpler for layout changes. llvm-svn: 370048	2019-08-27 11:50:27 +00:00
Sam Clegg	040ef1091d	[lld][WebAssembly] Create optional symbols after handling --export/--undefined Handling of --export/--undefined can pull in lazy symbols which in turn can pull in referenced to optional symbols. We need to delay the creation of optional symbols until all possible references to them have been created. Differential Revision: https://reviews.llvm.org/D66768 llvm-svn: 370012	2019-08-27 04:27:57 +00:00
Sam Clegg	1a1df72a43	[lld][WebAssembly] Store table base in config rather than passing it around. NFC. I've got another change that makes more use of this value in other places. Differential Revision: https://reviews.llvm.org/D66777 llvm-svn: 370010	2019-08-27 04:19:34 +00:00
Richard Trieu	f837bb4a34	Copy test data so tests don't traverse test directories. NFC llvm-svn: 369984	2019-08-26 22:41:05 +00:00
Fangrui Song	1681ceb2c4	[ELF] EhFrameSection: postpone FDE liveness check to finalizeSections EhFrameSection::addSection checks liveness of FDE early. This makes it infeasible to move combineEhSections() before ICF. Postpone the check to EhFrameSection::finalizeContents(). This is what ARMExidxSyntheticSection does and it will make a subsequent patch D66717 simpler. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D66727 llvm-svn: 369890	2019-08-26 10:32:12 +00:00
Fangrui Song	debcac9fef	[ELF] Make LinkerScript::assignAddresses iterative PR42990. For `SECTIONS { b = a; . = 0xff00 + (a >> 8); a = .; }`, we currently set st_value(a)=0xff00 while st_value(b)=0xffff. The following call tree demonstrates the problem: ``` link<ELF64LE>(Args); Script->declareSymbols(); // insert a and b as absolute Defined Writer<ELFT>().run(); Script->processSectionCommands(); addSymbol(cmd); // a and b are re-inserted. LinkerScript::getSymbolValue // is lazily called by subsequent evaluation finalizeSections(); forEachRelSec(scanRelocations<ELFT>); processRelocAux // another problem PR42506, not affected by this patch finalizeAddressDependentContent(); // loop executed once script->assignAddresses(); // a = 0, b = 0xff00 script->assignAddresses(); // a = 0xff00, _end = 0xffff ``` We need another assignAddresses() to finalize the value of `a`. This patch 1) modifies assignAddress() to track the original section/value of each symbol and return a symbol whose section/value has changed. 2) moves the post-finalizeSections assignAddress() inside the loop of finalizeAddressDependentContent() and makes it iterative. Symbol assignment may not converge so we make a few attempts before bailing out. Note, assignAddresses() must be called at least twice. The penultimate call finalized section addresses while the last finalized symbol values. It is somewhat obscure and there was no comment. linkerscript/addr-zero.test tests this. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D66279 llvm-svn: 369889	2019-08-26 10:23:31 +00:00
Fangrui Song	8e5184af71	[ELF] Error if --strip-all and --emit-relocs are used together --strip-all suppresses the creation of in.symtab This can cause a null pointer dereference in OutputSection::finalize() // --emit-relocs => copyRelocs is true if (!config->copyRelocs \|\| (type != SHT_RELA && type != SHT_REL)) return; ... link = in.symTab->getParent()->sectionIndex; // in.symTab is null Let's just disallow the combination. In some cases the combination can cause GNU linkers to fail: * ld.bfd: final link failed: invalid operation * gold: internal error in set_no_output_symtab_entry, at ../../gold/object.h:1814 Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D66704 llvm-svn: 369878	2019-08-26 06:23:53 +00:00
Fangrui Song	76f005535a	[ELF] Delete a redundant dyn_cast<InputSection>. NFC llvm-svn: 369868	2019-08-25 14:41:18 +00:00
Fangrui Song	6d5a8c92bf	[ELF] Simplify with less_second. NFC llvm-svn: 369844	2019-08-24 08:40:20 +00:00
Fangrui Song	62083ec157	[ELF] Make member function Writer<ELFT>::removeEmptyPTLoad non-member. NFC llvm-svn: 369838	2019-08-24 06:31:34 +00:00
Fangrui Song	af47d0021c	[ELF] Align the first section of a PT_LOAD even if its type is SHT_NOBITS Reported at https://reviews.llvm.org/D64930#1642223 If the only section of a PT_LOAD is a SHT_NOBITS section (e.g. .bss), we may not align its sh_offset. p_offset of the PT_LOAD will be set to sh_offset, and we will get p_offset!=p_vaddr (mod p_align). If such executable is mapped by the Linux kernel, it will segfault. After D64906, this may happen the non-linker script case. The linker script case has had this issue for a long time. This was fixed by rL321657 (but the test linkerscript/nobits-offset.s failed to test a SHT_NOBITS section), but broken by rL345154. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D66658 llvm-svn: 369828	2019-08-24 00:41:15 +00:00
Peter Smith	7d6aa7eb7f	[ELF] Mention contents of reproduce archive and add help description. Building on D60557 mention the name of the linker generated contents of the reproduce archive, response.txt and version.txt. Also write a shorter description in the ld.lld --help that is closer to the documentation. Differential Revision: https://reviews.llvm.org/D66641 llvm-svn: 369762	2019-08-23 14:41:25 +00:00
Rui Ueyama	72d1089a3a	Explain --reproduce option I think --reproduce is no longer a debug-only option but a useful option that a common user may want to use. So, this patch updates the description of the option in the manual page. Differential Revision: https://reviews.llvm.org/D60557 llvm-svn: 369740	2019-08-23 08:52:55 +00:00
Rui Ueyama	6ef01c3e2b	Add a description about multiple linker scripts Differential Revision: https://reviews.llvm.org/D66630 llvm-svn: 369737	2019-08-23 07:50:09 +00:00
Benjamin Kramer	b3a991df3c	Fight a bit against global initializers. NFC. llvm-svn: 369695	2019-08-22 19:43:27 +00:00
Amy Huang	a1c022c791	[COFF] Add libcall symbols to the link when LTO is being used llvm-svn: 369694	2019-08-22 19:40:07 +00:00
Bob Haarman	5375b94e36	[lld-link] implement -lto-obj-path Summary: This adds the -lto-obj-path option to lld-link. This can be used to specify a path at which to write a native object file for the full LTO part when using LTO unit splitting. Reviewers: ruiu, tejohnson, pcc, rnk Reviewed By: ruiu, rnk Subscribers: mehdi_amini, steven_wu, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65964 llvm-svn: 369559	2019-08-21 18:24:59 +00:00
Fangrui Song	2d37bf843c	[ELF][ARM] Simplify some llvm-objdump tests with both ARM/Thumb states llvm-objdump can switch between ARM/Thumb states after D60927. In a few lld tests, we run both * llvm-objdump -d -triple=thumbv7a-none-linux-gnueabi %t * llvm-objdump -d -triple=armv7a-none-linux-gnueabi %t to test ARM/Thumb parts of the same file. In many cases we can just run one command. There is a problem that prevents us from cleaning more tests (e.g. test/ELF/arm-thumb-interwork-thunk.s): In llvm-objdump, while we have ARM/Thumb (primary and secondary) MCDisassembler and MCSubtargetInfo, we have just one MCInstrAnalysis which is used to resolve the targets of calls in both ARM/Thumb parts. // ThumbMCInstrAnalysis evaluating ARM parts or ARMMCInstrAnalysis evaluating Thumb parts // will have incorrect offsets. // An example of llvm-objdump -d -triple=thumbv7a on ARM part: 1304: 3d ff ff fa blx #-780 # no <...> 1308: 06 00 00 ea b #24 <arm_caller+0x24> # wrong target due to wrong offset Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D66539 llvm-svn: 369535	2019-08-21 15:30:55 +00:00
George Rimar	a50f115d36	[LLD][ELF] - Simplify the bad-archive.s test case. This removes the precompiled binary and improves the check of the error reported. Differential revision: https://reviews.llvm.org/D66523 llvm-svn: 369516	2019-08-21 11:19:50 +00:00
Fangrui Song	2d337fdc95	Reland D65242 "[ELF] More dynamic relocation packing"" This fixed a bug in r369488. When config->isRela is false, i->r_addend is not initialized (see encodeDynamicReloc). So we should check config->isRela before accessing r_addend: - if (j - i < 3 \|\| i->r_addend) + if (j - i < 3 \|\| (config->isRela && i->r_addend != 0)) Original description: Currently, with Android dynamic relocation packing, only relative relocations are grouped together. This patch implements similar packing for non-relative relocations. The implementation groups non-relative relocations with the same r_info and r_addend, if using RELA. By requiring a minimum group size of 3, this achieves smaller relocation sections. Building Android for an ARM32 device, I see the total size of /system/lib decrease by 392 KB. Grouping by r_info also allows the runtime dynamic linker to implement an 1-entry cache to reduce the number of symbol lookup required. With such 1-entry cache implemented on Android, I'm seeing 10% to 20% reduction in total time spent in runtime linker for several executables that I tested. As a simple correctness check, I've also built x86_64 Android and booted successfully. Differential Revision: https://reviews.llvm.org/D65242 Patch by Vic Yang llvm-svn: 369507	2019-08-21 09:21:37 +00:00
Fangrui Song	b2895a8cdc	Revert D65242 "[ELF] More dynamic relocation packing" This reverts r369488 and r369489. The change broke build bots: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-bootstrap-ubsan/builds/14511 http://lab.llvm.org:8011/builders/lld-x86_64-freebsd/builds/34407 llvm-svn: 369497	2019-08-21 06:50:08 +00:00
Fangrui Song	d840a9cbed	[ELF][test] Add CHECK lines omitted in r369488 Add append .o to some object file names llvm-svn: 369489	2019-08-21 03:15:57 +00:00
Fangrui Song	35f9a84a15	[ELF] More dynamic relocation packing Currently, with Android dynamic relocation packing, only relative relocations are grouped together. This patch implements similar packing for non-relative relocations. The implementation groups non-relative relocations with the same r_info and r_addend, if using RELA. By requiring a minimum group size of 3, this achieves smaller relocation sections. Building Android for an ARM32 device, I see the total size of /system/lib decrease by 392 KB. Grouping by r_info also allows the runtime dynamic linker to implement an 1-entry cache to reduce the number of symbol lookup required. With such 1-entry cache implemented on Android, I'm seeing 10% to 20% reduction in total time spent in runtime linker for several executables that I tested. As a simple correctness check, I've also built x86_64 Android and booted successfully. Differential Revision: https://reviews.llvm.org/D66491 Patch by Vic Yang! llvm-svn: 369488	2019-08-21 03:02:08 +00:00
Martin Storsjo	08a5a0aa25	[COFF] Check errorCount before committing the output file This avoids producing an output file if errors appeared late in the linking process (e.g. while fixing relocations, or as in the test, while checking for multiple resources). If an output file is produced, build tools might not retry building it on rebuilds, even if a previous build failed due to the error return code. Differential Revision: https://reviews.llvm.org/D66491 llvm-svn: 369445	2019-08-20 21:08:14 +00:00
Martin Storsjo	8a91aa53a0	[COFF] Print the file name on errors writing the pdb file This avoids confusing contextless error messages such as "No such file or directory" if e.g. the pdb output file should be written to a nonexistent directory. (This can happen with linkrepro scripts, at least old ones.) Differential Revision: https://reviews.llvm.org/D66466 llvm-svn: 369425	2019-08-20 18:56:48 +00:00
Sam Clegg	cf2b8722d4	[WebAssembly][lld] Fix crash when applying relocations to debug sections Debug sections are special in that they can contain relocations against symbols that are not present in the final output (i.e. not live). However it is also possible to have R_WASM_TABLE_INDEX relocations against symbols that don't have a table index assigned (since they are not address taken by actual code. Fixes: https://github.com/emscripten-core/emscripten/issues/9023 Differential Revision: https://reviews.llvm.org/D66435 llvm-svn: 369423	2019-08-20 18:39:24 +00:00
Martin Storsjo	6540e55067	[COFF] Require an explicit -implib option for creating implibs in mingw mode GNU ld doesn't produce implibs unless explicitly requested. Differential Revision: https://reviews.llvm.org/D66367 llvm-svn: 369363	2019-08-20 10:14:54 +00:00
Martin Storsjo	dadc6f2488	[COFF] Allow using custom .edata from input object files This is used by Wine for manually crafting export tables. If the input object contains .edata sections, GNU ld references them in the export directory instead of synthesizing an export table using either export directives or the normal auto export mechanism. (AFAIK, historically, way way back, GNU ld didn't support synthesizing the export table - one was supposed to generate it using dlltool and link it in instead.) If faced with --out-implib and --output-def, GNU ld still populates those output files with the same export info as it would have generated otherwise, disregarding the input .edata. As this isn't an intended usage combination, I'm not adding checks for that in tests. Differential Revision: https://reviews.llvm.org/D65903 llvm-svn: 369358	2019-08-20 09:53:06 +00:00
Fangrui Song	12d83b4270	[ELF][PPC] Allow PT_LOAD to have overlapping p_offset ranges on EM_PPC Ported the D64906 technique to EM_PPC. Delete ppc-rela.s that is covered by ppc32-abs-pic.s llvm-svn: 369351	2019-08-20 09:20:05 +00:00
Fangrui Song	9c371309f3	[ELF][X86] Allow PT_LOAD to have overlapping p_offset ranges on EM_386 Ported the D64906 technique to EM_386. If `sh_addralign(.tdata) < sh_addralign(.tbss)`, we can potentially make `p_vaddr(PT_TLS)%p_align(PT_TLS) != 0`. ld.so that are known to have problems if p_vaddr%p_align!=0: * FreeBSD 13.0-CURRENT rtld-elf * glibc https://sourceware.org/bugzilla/show_bug.cgi?id=24606 New test i386-tls-vaddr-align.s checks our workaround makes p_vaddr%p_align = 0. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D65865 llvm-svn: 369347	2019-08-20 08:43:47 +00:00
Fangrui Song	f66b767abe	[ELF][AArch64] Allow PT_LOAD to have overlapping p_offset ranges Ported the D64906 technique to AArch64. It deletes 3 alignments at PT_LOAD boundaries for the default case: the size of an aarch64 binary decreases by at most 192kb. If `sh_addralign(.tdata) < sh_addralign(.tbss)`, we can potentially make `p_vaddr(PT_TLS)%p_align(PT_TLS) != 0`. ld.so that are known to have problems if p_vaddr%p_align!=0: * musl<=1.1.22 * FreeBSD 13.0-CURRENT (and before) rtld-elf arm64 New test aarch64-tls-vaddr-align.s checks that our workaround makes p_vaddr%p_align = 0. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D64930 llvm-svn: 369344	2019-08-20 08:34:56 +00:00
Fangrui Song	01c7f4b606	[ELF][PPC] Allow PT_LOAD to have overlapping p_offset ranges This change affects the non-linker script case (precisely, when the `SECTIONS` command is not used). It deletes 3 alignments at PT_LOAD boundaries for the default case: the size of a powerpc64 binary can be decreased by at most 192kb. The technique can be ported to other targets. Let me demonstrate the idea with a maxPageSize=65536 example: When assigning the address to the first output section of a new PT_LOAD, if the end p_vaddr of the previous PT_LOAD is 0x10020, we advance to the next multiple of maxPageSize: 0x20000. The new PT_LOAD will thus have p_vaddr=0x20000. Because p_offset and p_vaddr are congruent modulo maxPageSize, p_offset will be 0x20000, leaving a p_offset gap [0x10020, 0x20000) in the output. Alternatively, if we advance to 0x20020, the new PT_LOAD will have p_vaddr=0x20020. We can pick either 0x10020 or 0x20020 for p_offset! Obviously 0x10020 is the choice because it leaves no gap. At runtime, p_vaddr will be rounded down by pagesize (65536 if pagesize=maxPageSize). This PT_LOAD will load additional initial contents from p_offset ranges [0x10000,0x10020), which will also be loaded by the previous PT_LOAD. This is fine if -z noseparate-code is in effect or if we are not transiting between executable and non-executable segments. ld.bfd -z noseparate-code leverages this technique to keep output small. This patch implements the technique in lld, which is mostly effective on targets with large defaultMaxPageSize (AArch64/MIPS/PPC: 65536). The 3 removed alignments can save almost 3*65536 bytes. Two places that rely on p_vaddr%pagesize = 0 have to be updated. 1) We used to round p_memsz(PT_GNU_RELRO) up to commonPageSize (defaults to 4096 on all targets). Now p_vaddr%commonPageSize may be non-zero. The updated formula takes account of that factor. 2) Our TP offsets formulae are only correct if p_vaddr%p_align = 0. Fix them. See the updated comments in InputSection.cpp for details. On targets that we enable the technique (only PPC64 now), we can potentially make `p_vaddr(PT_TLS)%p_align(PT_TLS) != 0` if `sh_addralign(.tdata) < sh_addralign(.tbss)` This exposes many problems in ld.so implementations, especially the offsets of dynamic TLS blocks. Known issues: FreeBSD 13.0-CURRENT rtld-elf (i386/amd64/powerpc/arm64) glibc (HEAD) i386 and x86_64 https://sourceware.org/bugzilla/show_bug.cgi?id=24606 musl<=1.1.22 on TLS Variant I architectures (aarch64/powerpc64/...) So, force p_vaddr%p_align = 0 by rounding dot up to p_align(PT_TLS). The technique will be enabled (with updated tests) for other targets in subsequent patches. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D64906 llvm-svn: 369343	2019-08-20 08:34:25 +00:00
Sam Clegg	e8e914e640	[lld][WebAssembly] Honor --no-export-dynamic even with -shared Differential Revision: https://reviews.llvm.org/D66359 llvm-svn: 369276	2019-08-19 16:34:51 +00:00
Fangrui Song	ec4540d8a6	[ELF] Simplify processRelocAux and allow a corner-case error After D66007/r369262, if the control flow reaches `if (sym.isUndefined())`, we know: * The relocation is not a link-time constant => symbol is preemptable => Undefined or SharedSymbol * Not an undef weak. * -no-pie. * The symbol type is neither STT_OBJECT nor STT_FUNC. ld.lld --export-dynamic --unresolved-symbols=ignore-all %t.o can satisfy these conditions. Delete the isUndefined() test so that we error `symbol '...' has no type`, because we don't know the type to make the decision to create copy relocation/canonical PLT. llvm-svn: 369271	2019-08-19 15:52:29 +00:00
Fangrui Song	ac0e6c6502	[ELF] Move (copy relocation/canonical PLT) before error checking In processRelocAux(), we handle errors before copy relocation/canonical PLT. This makes error checking a bit complex because we have to check for conditions that will be allowed by copy relocation/canonical PLT. Instead, move copy relocation/canonical PLT before error checking. This simplifies the previous clumsy error checking code `config->shared \|\| (config->pie && expr == R_ABS && type != target->symbolicRel)` to the simple `config->isPic`. Some diagnostics can be reported in different ways. The code motion changes diagnostics for some contrived test cases: * copy-rel-pie-error.s -> copy-rel-pie2.s: It was rejected before but accepted now. ld.bfd also accepts the case. * copy-errors.s: "cannot preempt symbol" changes to "symbol 'bar' has no type" * got32{,x}-i386.s: the suggestion changes from "-fPIC or -Wl,-z,notext" to "-fPIE" * x86-64-dyn-rel-error5.s: one diagnostic changes for -pie case Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D66007 llvm-svn: 369262	2019-08-19 14:30:12 +00:00
Fangrui Song	c8a1dfc484	[ELF][Hexagon] Improve error message for unknown relocations Like rLLD354040 Previously, for unknown relocation types, in -no-pie/-pie mode, we got something like: foo.o: unrecognized relocation ... In -shared mode: error: can't create dynamic relocation ... against symbol: yyy in readonly segment Delete the default case from Hexagon::getRelExpr and add the error there. We will get consistent error message like `error: unknown relocation (1024) against symbol foo` Reviewed By: sidneym Differential Revision: https://reviews.llvm.org/D66275 llvm-svn: 369260	2019-08-19 14:07:14 +00:00
Sid Manning	a0a4c6b722	[lld][Hexagon] Add GOTREL relocations. Add GOTREL relocation support. (S + A - GOT) Differential Revision: https://reviews.llvm.org/D66260 llvm-svn: 369258	2019-08-19 13:32:32 +00:00
Peter Smith	2cafd872fb	[ELF][ARM] Add a test that maxes out the thunk convergence limit Add a test that takes the maximum amount of passes permitted to converge. This will make sure that any symbol defined in a linker script gets the correct value and that any other convergence limit involving symbol address doesn't restrict Thunk convergence. Differential Revision: https://reviews.llvm.org/D66346 llvm-svn: 369246	2019-08-19 09:46:52 +00:00
Fangrui Song	a560bbf7a0	[ELF] Replace local variable hasExportDynamic with config->exportDynamic. NFC llvm-svn: 369187	2019-08-17 10:04:18 +00:00
Fangrui Song	d5d79dfd56	[ELF][PPC] Fix getRelExpr for R_PPC64_REL16_HI Fixes https://github.com/ClangBuiltLinux/linux/issues/640 R_PPC64_REL16_HI was incorrectly computed as an R_ABS relocation. rLLD368964 made it a linker failure. Change it to use R_PC to fix the failures. Add ppc64-reloc-rel.s for these R_PPC64_REL* tests. llvm-svn: 369184	2019-08-17 06:28:03 +00:00
Fangrui Song	7ca1e0c825	[ELF][Hexagon] Replace R_HEXAGON_GOT with R_GOTPLT R_GOTPLT is relative to .got.plt since D59594. Since R_HEXAGON_GOT relocations always have 0 r_addend, they can use R_GOTPLT instead. Reviewed By: sidneym Differential Revision: https://reviews.llvm.org/D66274 llvm-svn: 369128	2019-08-16 17:01:56 +00:00
Sid Manning	2d3ebeb813	[lld][Hexagon]Support HEX_32 when building shared objects Differential Revision: https://reviews.llvm.org/D66105 llvm-svn: 369121	2019-08-16 15:35:02 +00:00
JF Bastien	7a210d65ed	Fix lld on GCC 5.1 after the C++14 move Summary: libstdc++ in GCC 5.1 has some bugs. The move to C++14 in D66195 triggered one such bug caused by the new constexpr support in C++14, and the implementation doing SFINAE wrong with the comparator to std::stable_sort. Here's a small repro: https://godbolt.org/z/2QC3-n The fix is to inline the lambdas directly into the llvm::stable_sort call instead of erasing them through a std::function. The code is more readable as well. Reviewers: thakis, ruiu, espindola Subscribers: emaste, arichardson, MaskRay, jkorous, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66306 llvm-svn: 369023	2019-08-15 17:47:19 +00:00
Fangrui Song	e54ed4789f	[ELF][AArch64] Improve error message for unknown relocations Like rLLD354040. Previously, for unrecognized relocation types, in -no-pie/-pie mode, we got something like: foo.o: unrecognized relocation ... In -shared mode: error: can't create dynamic relocation ... against symbol: yyy in readonly segment Delete the default case from AArch64::getRelExpr and add the error there. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D66277 llvm-svn: 368983	2019-08-15 10:02:54 +00:00
Fangrui Song	1542ff5282	[ELF][PPC] Improve error message for unknown relocations Like rLLD354040. Previously, for unrecognized relocation types, in -no-pie mode: foo.o: unrecognized reloc 256 In -pie/-shared mode: error: can't create dynamic relocation R_PPC_xxx against symbol: yyy in readonly segment llvm-svn: 368964	2019-08-15 05:22:23 +00:00
Jonas Devlieghere	6ba7992031	[LLD] Migrate llvm::make_unique to std::make_unique Now that we've moved to C++14, we no longer need the llvm::make_unique implementation from STLExtras.h. This patch is a mechanical replacement of (hopefully) all the llvm::make_unique instances across the monorepo. Differential revision: https://reviews.llvm.org/D66259 llvm-svn: 368936	2019-08-14 22:28:17 +00:00
Fangrui Song	e220c67b7a	[ELF] --gdb-index: fix odd variable name cUs after r365730 and replace lower_bound with partition_point. NFC llvm-svn: 368845	2019-08-14 12:56:30 +00:00
Fangrui Song	b9138bc52b	[ELF][test] Update silent-ignore.test Some options are implemented now: --no-warn-common : r263413 --allow-shlib-undefined : r352826 Some are ignored but were not reflected in this test. llvm-svn: 368837	2019-08-14 12:20:21 +00:00
Martin Storsjo	0e5530abfc	[MinGW] Correct handling different forms of a few options Support the equals form of the long --entry=<symbol> option, add a test for the -e<symbol> form. Add tests for single dash forms of -exclude-all-symbols and -export-all-symbols. Support single-dash forms of -out-implib and -output-def, support the equals form of --output-def=<file>. (We previously had a test to explicitly disallow -out-implib, but it turns out that GNU ld actually does support it just fine, despite also matching the -o<file> option.) Disallow the double-dashed --u form, add a test for -u<symbol>. Differential Revision: https://reviews.llvm.org/D66066 llvm-svn: 368816	2019-08-14 09:35:44 +00:00
Martin Storsjo	ce26ad7a9d	[MinGW] Restructure Options.td to use multiclass where sensible. NFC. Differential Revision: https://reviews.llvm.org/D66065 llvm-svn: 368815	2019-08-14 09:35:40 +00:00
Martin Storsjo	df7ec1ff7b	[MinGW] Remove stray/inconsistent comment chars in test file. NFC. Test directives don't need to be in comments in this file. llvm-svn: 368814	2019-08-14 09:35:36 +00:00
Fangrui Song	f1d538cce5	[ELF] Initialize 2 fields of Symbol in SymbolTable::insert A new symbol is added to elf::symtab in 3 steps: 1) SymbolTable::insert creates a placeholder. 2) Symbol::mergeProperties 3) Symbol::replace Fields referenced by steps 2) and 3) should be initialized in SymbolTable::insert. `traced` and `referenced` were missed previously. This did not cause problems because compilers generated code that initialized them (bit fields) to 0. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D66130 llvm-svn: 368784	2019-08-14 01:52:47 +00:00
Sam Clegg	7185a7301e	[lld][WebAssembly] Allow linking of pic code into static binaries Summary: See https://github.com/emscripten-core/emscripten/issues/9013 Subscribers: dschuff, jgravelle-google, aheejin, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65922 llvm-svn: 368719	2019-08-13 17:02:02 +00:00
Jordan Rupprecht	7859641626	[lld][test] Update test to print ELF note description data llvm-svn: 368710	2019-08-13 16:11:09 +00:00
Fangrui Song	dc06b0bc9a	[ELF] Don't special case symbolic relocations with 0 addend to ifunc in writable locations Currently the following 3 relocation types do not trigger the creation of a canonical PLT (which changes STT_GNU_IFUNC to STT_FUNC and redirects all references): 1) GOT-generating (`needsGot`) 2) PLT-generating (`needsPlt`) 3) R_ABS with 0 addend in a writable location. This is used for for ifunc function pointers in writable sections such as .data and .toc. This patch deletes case 3) to simplify the R__IRELATIVE generating logic added in D57371. Other advantages: It is guaranteed no more than 1 R__IRELATIVE is created for an ifunc. PPC64: no need to special case ifunc in toc-indirect to toc-relative relaxation. See D65755 The deleted elf::addIRelativeRelocs demonstrates that one-pass scan through relocations makes several optimizations difficult. This is something we can think about in the future. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D65995 llvm-svn: 368661	2019-08-13 09:43:40 +00:00
Fangrui Song	c6cd62352c	[ELF] Simplify handling of exportDynamic and isPreemptible In Writer::includeInDynSym(), exportDynamic is used by a Defined with protected or default visibility, to record whether it is required to be exported into .dynsym. It is set when any of the following conditions hold: 1) There is an interposable symbol from a DSO (Undefined or SharedSymbol with default visibility) 2) If -shared or --export-dynamic is specified, any symbol in an object file/bitcode sets this property, unless suppressed by canBeOmittedFromSymbolTable(). 3) --dynamic-list when producing an executable 4) protected symbol from a DSO preempted by copy relocation/canonical PLT when --ignore-{data,function}-address-equality is specified 5) ifunc is exported when -z ifunc-noplt is specified Bullet points 4) and 5) are irrelevant in this patch. Bullet 3) does not play well with 1) and 2). When -shared is specified, exportDynamic of most symbols is true. This makes it incapable to record --dynamic-list marked symbols. We thus have obscure: if (!config->shared) b->exportDynamic = true; else if (b->includeInDynsym()) b->isPreemptible = true; This patch adds another bit `Symbol::inDynamicList` to record 3). We can thus simplify handleDynamicList() by unifying the DSO and executable cases. It also allows us to simplify isPreemptible - now the field is only used in finalizeSections() and later stages. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D66091 llvm-svn: 368659	2019-08-13 09:12:52 +00:00
Fangrui Song	ab04ad6af7	[ELF] Rename odd variable names "New" after r365730. NFC New -> newSym or newFlags Reviewed By: atanasyan Differential Revision: https://reviews.llvm.org/D66127 llvm-svn: 368651	2019-08-13 06:19:39 +00:00
Fangrui Song	892cdc73be	[ELF][test] Add dynamic-list-preempt2.s When producing a DSO, the isPreemptible property of a Defined with default or protected visibility is affected by the --dynamic-list file, but not by interposable symbols in other DSOs. llvm-svn: 368649	2019-08-13 05:08:46 +00:00
Bob Haarman	6e18c7f8d4	[lld] Remove unnecessary "class Lazy" llvm-svn: 368644	2019-08-13 01:02:30 +00:00
Fangrui Song	3cafdfddcb	[ELF] Remove unnecessary assignment to `isPreemptible` in replaceWithDefined() After r368535, it is no longer used in the handling of VER_NDX_LOCAL. Drop it. llvm-svn: 368550	2019-08-12 10:01:17 +00:00
Fangrui Song	cfdd4589f1	[ELF] Remove redundant isDefined() in Symbol::computeBinding() and delete one redundant call site After r367869, VER_NDX_LOCAL can only be assigned to Defined and CommonSymbol. CommonSymbol becomes Defined after replaceCommonSymbols(), thus `versionId == VER_NDX_LOCAL` will imply `isDefined()`. In maybeReportUndefined(), computeBinding() is called when the symbol is unknown to be Undefined. computeBinding() != STB_LOCAL will always be true. llvm-svn: 368536	2019-08-11 17:03:00 +00:00
Fangrui Song	635eda8bb0	[ELF] Remove redundant !isPreemptible in Symbol::computeBinding() !isPreemptible was added in r343668 to fix PR39104: symbols redefined by replaceWithDefined() might be incorrectly considered STB_LOCAL if a version script specified `local: *;`. After r367869 (`config->defaultSymbolVersion` was removed), we will assign VER_NDX_LOCAL to only regular Defined and CommonSymbol, not Defined created by replaceWithDefined() (because scanVersionScript() is called before scanRelocations()). The !isPreemptible is thus redundant and can be deleted. llvm-svn: 368535	2019-08-11 16:12:07 +00:00
Fangrui Song	38c5788548	[ELF] Remove unnecessary assignment to `used` in replaceWithDefined `Symbol::used` is used by Undefined and SharedSymbol to record if a .symtab entry is needed. It is of no use for Defined. llvm-svn: 368533	2019-08-11 13:32:46 +00:00
Igor Kudrin	b65016ddac	[ELF] For VS-style diagnostics, prefer printing full paths in the header. The filename part in the message header is used by Visual Studio to fill Error List so that a user can click on an item and jump to the mentioned location. If we use only the name of a source file and not the full path, Visual Studio might be unable to find the right file or, even worse, show a wrong one. Differential Revision: https://reviews.llvm.org/D65875 llvm-svn: 368409	2019-08-09 08:29:03 +00:00
Fangrui Song	179dc276eb	[ELF] Expand regions for gaps due to explicit address If the dot gets moved by an explicit section address, an empty gap between sections could be created. The encompassing region for the section being parsed needs to be expanded to include the gap. Differential Revision: https://reviews.llvm.org/D65722 Patch by Gabriel Smith! llvm-svn: 368379	2019-08-09 01:25:49 +00:00
David Blaikie	84b55e61dd	DebugInfo: Explicitly handle errors when parsing unit DIEs This ensures these errors produce a non-zero exit and improves the context (providing the name of the input object and section being parsed). llvm-svn: 368378	2019-08-09 01:14:36 +00:00
Peter Collingbourne	1fa1cd200b	ELF: Move sections referred to by __start_/__stop_ symbols into the main partition. In the case where C identifier sections have SHF_LINK_ORDER they will most likely be placed in the same partition as the section that they are associated with. But unless this happens to be the main partition, this will cause them to be excluded from the range covered by the __start_ and __stop_ symbols, which may lead to incorrect program behaviour. So we need to move them all into the main partition so that they will be covered by the __start_ and __stop_ symbols. We may want to refine this approach later and allow different __start_/__stop_ symbol values for different partitions. This would only make sense for relocations from SHT_NOTE sections since they are duplicated into each partition. Differential Revision: https://reviews.llvm.org/D65909 llvm-svn: 368375	2019-08-09 00:57:54 +00:00
Sam Clegg	dbfea28219	[lld][WebAssembly] Don't create optional symbols when outputing an object file Summary: This was a bug in rL368310. I'm working on a test case now. Differential Revision: https://reviews.llvm.org/D65985 llvm-svn: 368369	2019-08-08 23:56:21 +00:00
Guanzhong Chen	c5ccbf52ad	[WebAssembly][lld] control __data_end export with config->shared Summary: Emscripten expects `__data_end` to show up in PIC code as long as it's not linked with `--shared`. Currently, Emscripten breaks with latest LLVM because `__data_end` is controlled by `config->isPic` instead of `config->shared`.` Reviewers: tlively, sbc100 Reviewed By: sbc100 Subscribers: dschuff, jgravelle-google, aheejin, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65980 llvm-svn: 368361	2019-08-08 22:40:04 +00:00
Sam Clegg	ad1cc145e8	[lld][WebAssembly] Use createGlobalVariable helper function. NFC. Differential Revision: https://reviews.llvm.org/D65911 llvm-svn: 368325	2019-08-08 18:22:03 +00:00
Sam Clegg	caa0db1318	[lld][WebAssembly] Add optional symbols after input file handling This allows undefined references in input files be resolved by the optional symbols. Previously we were doing this before input file reading which means it was working only for command line symbols references (i.e. -u or --export). Also use addOptionalDataSymbol for __dso_handle and make all optional symbols hidden by default. Differential Revision: https://reviews.llvm.org/D65920 llvm-svn: 368310	2019-08-08 16:58:36 +00:00
Fangrui Song	ba4bda657e	[ELF][AArch64] Delete two unused RUN lines from aarch64-movw-tprel.s after D65882 llvm-svn: 368298	2019-08-08 13:57:53 +00:00
Peter Smith	d4695e1d75	[ELF][AArch64] Support for movz, movk tprel relocations This patch Implements the R_AARCH64_TLSLE_MOVW_TPREL_G*[_NC]. These are logically the same calculation as the existing TLSLE relocations with the result written back to mov[nz] and movk instructions. A typical code sequence is: movz x0, #:tprel_g2:foo // bits [47:32] of R_TLS with overflow check movk x0, #:tprel_g1_nc:foo // bits [31:16] of R_TLS with no overflow check movk x0, #:tprel_g0_nc:foo // bits [15:0] of R_TLS with no overflow check This type of code sequence is usually used with a large code model. Differential Revision: https://reviews.llvm.org/D65882 Fixes: PR42853 llvm-svn: 368293	2019-08-08 13:38:09 +00:00
David Blaikie	8a480d2cbf	gdb-index: Wire up str_offsets section to avoid incorrect error message about offsets_base There's still a need for a deeper fix to the way libDebugInfoDWARF error messages are propagated up to lld - if lld had exited non-zero on this error message we would've found the issue sooner. llvm-svn: 368229	2019-08-07 22:49:14 +00:00
David Blaikie	77ddf6af2b	Add a test demonstrating DWARF parse failures are not causing lld to exit non-zero This bug was/is masking other issues - committing this to demonstrate the problem/track fixing it. llvm-svn: 368220	2019-08-07 22:05:33 +00:00
David Blaikie	fa2f4395e9	API update for change to LLVM's lib/DebugInfo/DWARF llvm-svn: 368190	2019-08-07 17:18:18 +00:00
Igor Kudrin	0e1fea6754	Try to fix windows build bots after r368153. llvm-svn: 368169	2019-08-07 13:22:29 +00:00
Igor Kudrin	da41e2107e	[ELF] Fix splitting messages for duplicate symbols. D65213 (rL367536) does not work for the case when a source file path includes subdirectories. Differential Revision: https://reviews.llvm.org/D65810 llvm-svn: 368153	2019-08-07 11:32:43 +00:00
Rui Ueyama	e6a33e1f11	Handle /align option. Differential Revision: https://reviews.llvm.org/D65736 llvm-svn: 368145	2019-08-07 10:16:21 +00:00
Rui Ueyama	6c5fc94093	Simplify error message output. NFC. Differential Revision: https://reviews.llvm.org/D65855 llvm-svn: 368144	2019-08-07 10:11:24 +00:00
Fangrui Song	5ebee2558b	[ELF][X86] Add --no-show-raw-insn and -soname to some i386 tests llvm-svn: 368142	2019-08-07 09:43:54 +00:00
Rui Ueyama	cac8df1ab9	Re-submit r367649: Improve raw_ostream so that you can "write" colors using operator<< The original patch broke buildbots, perhaps because it changed the default setting whether colors are enabled or not. llvm-svn: 368131	2019-08-07 08:08:17 +00:00
Guanzhong Chen	0cb776e61a	[WebAssembly] Fix null pointer in createInitTLSFunction Summary: `createSyntheticSymbols`, which creates `WasmSym::InitTLS`, is only called when `!config->relocatable`, but this condition is not checked when calling `createInitTLSFunction`. This diff checks `!config->relocatable` before calling `createInitTLSFunction`. Fixes https://github.com/emscripten-core/emscripten/issues/9155. Reviewers: tlively, aheejin, kripken, sbc100 Subscribers: dschuff, jgravelle-google, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65785 llvm-svn: 368078	2019-08-06 20:09:04 +00:00
Fangrui Song	c55c0598f9	[ELF][PPC] Don't relax ifunc toc-indirect accesses to toc-relative Fixes PR42759. ``` // If ifunc is taken address in -fPIC code, it may have a toc entry .section .toc,"aw",@progbits .quad ifunc // ifunc may be defined as STT_GNU_IFUNC in another object file .type ifunc, %gnu_indirect_function ``` If ifunc is non-preemptable (e.g. when linking an executable), the toc entry will be relocated by R_PPC64_IRELATIVE. R_*_IRELATIVE represents the symbolic value of a non-preemptable ifunc (not associated with a canonical PLT) in a writable location. It has an unknown value at link time, so we cannot apply toc-indirect to toc-relative relaxation. Reviewed By: luporl, sfertile Differential Revision: https://reviews.llvm.org/D65755 llvm-svn: 368057	2019-08-06 16:57:54 +00:00
Peter Smith	7f320d4bf0	[ELF][ARM] Fix /DISCARD/ of section with .ARM.exidx section The combineEhSections runs, by design, before processSectionCommands so that input exception sections like .ARM.exidx and .eh_frame are not assigned to OutputSections. Unfortunately if /DISCARD/ removes InputSections that have associated .ARM.exidx sections without discarding the .ARM.exidx synthetic section then we will end up crashing when trying to sort the InputSections in ascending address order. We fix this by filtering out the sections that have been discarded prior to processing the InputSections in finalizeContents(). fixes pr42890 Differential Revision: https://reviews.llvm.org/D65759 llvm-svn: 368041	2019-08-06 14:13:38 +00:00
Fangrui Song	e49c417ed1	[ELF] Make binding (weak or non-weak) logic consistent for Undefined and SharedSymbol This is a case missed by D64136. If %t1.o has a weak reference on foo, and %t2.so has a non-weak reference on foo: ``` 0. ld.lld %t1.o %t2.so # ok; STB_WEAK; accepted since D64136 1. ld.lld %t2.so %t1.o # undefined symbol: foo; STB_GLOBAL 2. gold %t1.o %t2.so # ok; STB_WEAK 3. gold %t2.so %t1.o # undefined reference to 'foo'; STB_GLOBAL 4. ld.bfd %t1.o %t2.so # undefined reference to `foo'; STB_WEAK 5. ld.bfd %t2.so %t1.o # undefined reference to `foo'; STB_WEAK ``` It can be argued that in both cases, the binding of the undefined foo should be set to STB_WEAK, because the binding should not be affected by referenced from shared objects. --allow-shlib-undefined doesn't suppress errors (3,4,5), but -shared or --noinhibit-exec allows ld.bfd/gold to produce a binary: ``` 3. gold -shared %t2.so %t1.o # ok; STB_GLOBAL 4. ld.bfd -shared %t2.so %t1.o # ok; STB_WEAK 5. ld.bfd -shared %t1.o %t1.o # ok; STB_WEAK ``` If %t2.so has DT_NEEDED entries, ld.bfd will load them (lld/gold don't have the behavior). If one of the DSO defines foo and it is in the link-time search path (e.g. DT_NEEDED entry is an absolute path, via -rpath=, via -rpath-link=, etc), `ld.bfd %t1.o %t2.so` and `ld.bfd %t1.o %t2.so` will not error. In this patch, we make Undefined and SharedSymbol share the same binding computing logic. Case 1 will be allowed: ``` 0. ld.lld %t1.o %t2.so # ok; STB_WEAK; accepted since D64136 1. ld.lld %t2.so %t1.o # ok; STB_WEAK; changed by this patch ``` In the future, we can explore the option that turns both (0,1) into errors if --no-allow-shlib-undefined (default when linking an executable) is in action. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D65584 llvm-svn: 368038	2019-08-06 14:03:45 +00:00
Igor Kudrin	5978c9249b	[MachO] Update LLD to use 64-bit offsets with DataExtractor (3/5) Differential Revision: https://reviews.llvm.org/D65639 llvm-svn: 368032	2019-08-06 13:37:34 +00:00
Fangrui Song	8981dc3058	[ELF][test] Reorganize some tls-.s tests Some tls-.s tests do not test generic TLS behavior but rather are x86 specific. Rename them to i386-.s or x86-64-.s Delete tls-static.s: covered by tls-opt.s Delete tls-opt-no-plt.s: add --implicit-check-not=.plt to x86-64-tls-gdie.s to cover it Rename tls-dynamic-i686.s to i386-tls-dynamic.s Rename tls-i686.s to i386-tls-le.s Rename tls-opt-i686.s to i386-tls-opt.s Rename tls-opt-iele-i686-nopic.s to i386-tls-opt-iele-nopic.s Rename tls-dynamic.s to x86-64-tls-dynamic.s . IE should be split off in the future. Rename tls-error.s to x86-64-reloc-tpoff32-error.s Rename tls-opt-gdie.s to x86-64-tls-gdie.s Rename tls-opt-x86_64-noplt.s to x86-64-tls-opt-noplt.s Rename tls-opt-local.s => x86-64-tls-ie-opt-local.s . It can be merged with x86-64-tls-ie-local.s in the future. llvm-svn: 367877	2019-08-05 15:34:01 +00:00
Fangrui Song	e28a70daf4	[ELF] Consistently prioritize non-* wildcards overs "" in version scripts We prioritize non- wildcards overs VER_NDX_LOCAL/VER_NDX_GLOBAL "". This patch generalizes the rule to "" of other versions and thus fixes PR40176. I don't feel strongly about this GNU linkers' behavior but the generalization simplifies code. Delete `config->defaultSymbolVersion` which was used to special case VER_NDX_LOCAL/VER_NDX_GLOBAL "*". In `SymbolTable::scanVersionScript`, custom versions are handled the same way as VER_NDX_LOCAL/VER_NDX_GLOBAL. So merge `config->versionScript{Locals,Globals}` into `config->versionDefinitions`. Overall this seems to simplify the code. In `SymbolTable::assign{Exact,Wildcard}Versions`, `sym->verdefIndex == config->defaultSymbolVersion` is changed to `verdefIndex == UINT32_C(-1)`. This allows us to give duplicate assignment diagnostics for `{ global: foo; };` `V1 { global: foo; };` In test/linkerscript/version-script.s: vs_index of an undefined symbol changes from 0 to 1. This doesn't matter (arguably 1 is better because the binding is STB_GLOBAL) because vs_index of an undefined symbol is ignored. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D65716 llvm-svn: 367869	2019-08-05 14:31:39 +00:00
Nilanjana Basu	da60fc813c	Changing representation of .cv_def_range directives in Codeview debug info assembly format for better readability llvm-svn: 367867	2019-08-05 14:16:58 +00:00
Nilanjana Basu	b5e4d7de17	Revert "Changing representation of .cv_def_range directives in Codeview debug info assembly format for better readability" This reverts commit `a885afa9fa`. llvm-svn: 367861	2019-08-05 13:55:21 +00:00
Nilanjana Basu	a885afa9fa	Changing representation of .cv_def_range directives in Codeview debug info assembly format for better readability llvm-svn: 367850	2019-08-05 13:11:51 +00:00
Martin Storsjo	c92b3563c6	[MinGW] Add an lld specific option for requesting to delay load libraries With GNU tools, delayload is handled completely differently. (One creates a specific delayload import library using dlltool and then links against it instead of the normal import library.) Instead of requiring using -Xlink=-delayload:lib.dll, we can provide an lld specific option for this. Differential Revision: https://reviews.llvm.org/D65728 llvm-svn: 367837	2019-08-05 11:57:06 +00:00
Martin Storsjo	a0cbe16ed5	[COFF] Omit automatically imported symbols from the symbol table These symbols actually point to the symbol's IAT entry, which obviously is different from the symbol itself (which is imported from a different module and doesn't exist in the current one). Omitting this symbol helps gdb inspect automatically imported symbols, see https://sourceware.org/bugzilla/show_bug.cgi?id=24574 for discussion on the matter. Surprisingly, those extra symbols don't seem to be an issue for gdb when the sources have been built with clang, only with gcc. The actual logic in gdb that this depends on still is unknown, but omitting these symbols from the symbol table is the right thing to do in any case. Differential Revision: https://reviews.llvm.org/D65727 llvm-svn: 367836	2019-08-05 11:57:00 +00:00
Fangrui Song	d9b948b6eb	Rename F_{None,Text,Append} to OF_{None,Text,Append}. NFC F_{None,Text,Append} are kept for compatibility since r334221. llvm-svn: 367800	2019-08-05 05:43:48 +00:00
Fangrui Song	8f6cf97bee	[ELF][test] Delete redundant version-script-*.s tests Delete version-script-missing.s: it is covered by version-script-noundef.s Delete version-script-anonymous-local.s: it is covered by version-script-{glob,weak}.s etc Delete version-script-no-warn{,2}.s: add --fatal-warnings to some version-script.s commands instead llvm-svn: 367778	2019-08-04 09:05:21 +00:00
Fangrui Song	25ab1c6471	[ELF] Move R__IRELATIVE from .rel[a].plt to .rel[a].dyn unless --pack-dyn-relocs=android[+relr] An R__IRELATIVE represents the address of a STT_GNU_IFUNC symbol (redirected at runtime) which is non-preemptable and is not associated with a canonical PLT (associated with a symbol with a section index of SHN_UNDEF but a non-zero st_value). .rel[a].plt [DT_JMPREL, DT_JMPREL+DT_JMPRELSZ) contains relocations that can be lazily resolved. R__IRELATIVE are always eagerly resolved, so conceptually they do not belong to .rela.plt. "iplt" is mostly a misnomer. glibc powerpc and powerpc64 do not resolve R__IRELATIVE if they are in .rela.plt. // a.o - synthesized PLT call stub has an R__IRELATIVE void ifunc(); int main() { ifunc(); } // b.o static void real() {} asm (".type ifunc, %gnu_indirect_function"); void ifunc() { return &real; } The lld-linked executable crashes. ld.bfd places R__IRELATIVE in .rela.dyn and the executable works. glibc i386, x86_64, and aarch64 have logic (glibc/sysdeps//dl-machine.h:elf_machine_lazy_rel) to eagerly resolve R__IRELATIVE in .rel[a].plt so the lld-linked executable works. Move R__IRELATIVE from .rel[a].plt to .rel[a].dyn to fix the crashes on glibc powerpc/powerpc64. This also helps simplifying ifunc implementation in FreeBSD rtld-elf powerpc64. If --pack-dyn-relocs=android[+relr] is specified, the Android packed dynamic relocation format is used for .rela.dyn. We cannot name in.relaIplt ".rela.dyn" because the output section will have mixed formats. This can be improved in the future. Reviewed By: pcc Differential Revision: https://reviews.llvm.org/D65651 llvm-svn: 367745	2019-08-03 02:26:52 +00:00
Martin Storsjo	397a516a52	[COFF] Clarify a comment. NFC. It's the __delayLoadHelper2 function that overwrites the jump table slot, not this thunk. llvm-svn: 367674	2019-08-02 11:08:15 +00:00
Martin Storsjo	5f0077d238	[COFF] Avoid loading objects for mingw autoimport, when a defined alias exists This avoids a spurious and confusing log message in cases where both e.g. "alias" and "__imp_alias" exist. Differential Revision: https://reviews.llvm.org/D65598 llvm-svn: 367673	2019-08-02 11:02:34 +00:00
Rui Ueyama	4d41c332ef	Revert r367649: Improve raw_ostream so that you can "write" colors using operator<< This reverts commit r367649 in an attempt to unbreak Windows bots. llvm-svn: 367658	2019-08-02 07:22:34 +00:00
Rui Ueyama	c1981b2b26	Add an assert() to catch possible regexp errors. llvm-svn: 367651	2019-08-02 05:11:19 +00:00
Rui Ueyama	96a7a225f5	Add a comment for --vs-diagnostics. llvm-svn: 367650	2019-08-02 05:04:27 +00:00
Rui Ueyama	a52f982f1c	Improve raw_ostream so that you can "write" colors using operator<< 1. raw_ostream supports ANSI colors so that you can write messages to the termina with colors. Previously, in order to change and reset color, you had to call `changeColor` and `resetColor` functions, respectively. So, if you print out "error: " in red, for example, you had to do something like this: OS.changeColor(raw_ostream::RED); OS << "error: "; OS.resetColor(); With this patch, you can write the same code as follows: OS << raw_ostream::RED << "error: " << raw_ostream::RESET; 2. Add a boolean flag to raw_ostream so that you can disable colored output. If you disable colors, changeColor, operator<<(Color), resetColor and other color-related functions have no effect. Most LLVM tools automatically prints out messages using colors, and you can disable it by passing a flag such as `--disable-colors`. This new flag makes it easy to write code that works that way. Differential Revision: https://reviews.llvm.org/D65564 llvm-svn: 367649	2019-08-02 04:48:30 +00:00
Rui Ueyama	966b9a3b9d	Fix an unused variable warning. llvm-svn: 367643	2019-08-02 02:51:20 +00:00
Martin Storsjo	90b4388f56	[COFF] Fix wholearchive with thin archives The Archive object created when loading an archive specified with wholearchive got cleaned up immediately, when the owning std::unique_ptr went out of scope, even if persisted StringRefs pointed to memory that belonged to the archive, which no longer was mapped in memory. This hasn't been an issue with regular (as opposed to thin) archives, as references to the member objects has kept the mapping for the whole archive file alive - but with thin archives, all such references point to other files. Add the std::unique_ptr to the arena allocator, to retain it as long as necessary. This fixes (the last issue raised in) PR42388. Differential Revision: https://reviews.llvm.org/D65565 llvm-svn: 367599	2019-08-01 18:47:27 +00:00
Fangrui Song	5391f158c2	[ELF] Add -z separate-code and pad the last page of last PF_X PT_LOAD with traps only if -z separate-code is specified This patch 1) adds -z separate-code and -z noseparate-code (default). 2) changes the condition that the last page of last PF_X PT_LOAD is padded with trap instructions. Current condition (after D33630): if there is no `SECTIONS` commands. After this change: if -z separate-code is specified. -z separate-code was introduced to ld.bfd in 2018, to place the text segment in its own pages. There is no overlap in pages between an executable segment and a non-executable segment: 1) RX cannot load initial contents from R or RW(or non-SHF_ALLOC). 2) R and RW(or non-SHF_ALLOC) cannot load initial contents from RX. lld's current status: - Between R and RX: in `Writer<ELFT>::fixSectionAlignments()`, the start of a segment is always aligned to maxPageSize, so the initial contents loaded by R and RX do not overlap. I plan to allow overlaps in D64906 if -z noseparate-code is in effect. - Between RX and RW(or non-SHF_ALLOC if RW doesn't exist): we currently unconditionally pad the last page to commonPageSize (defaults to 4096 on all targets we support). This patch will make it effective only if -z separate-code is specified. -z separate-code is a dubious feature that intends to reduce the number of ROP gadgets (which is actually ineffective because attackers can find plenty of gadgets in the text segment, no need to find gadgets in non-code regions). With the overlapping PT_LOAD technique D64906, -z noseparate-code removes two more alignments at segment boundaries than -z separate-code. This saves at most defaultCommonPageSize2 bytes, which are significant on targets with large defaultCommonPageSize (AArch64/MIPS/PPC: 65536). Issues/feedback on alignment at segment boundaries to help understand the implication: binutils PR24490 (the situation on ld.bfd is worse because they have two R-- on both sides of R-E so more alignments.) * In binutils, the 2018-02-27 commit "ld: Add --enable-separate-code" made -z separate-code the default on Linux. `d969dea983` In musl-cross-make, binutils is configured with --disable-separate-code to address size regressions caused by -z separate-code. (lld actually has the same issue, which I plan to fix in a future patch. The ld.bfd x86 status is worse because they default to max-page-size=0x200000). * https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=237676 people want smaller code size. This patch will remove one alignment boundary. * Stef O'Rear: I'm opposed to any kind of page alignment at the text/rodata line (having a partial page of text aliased as rodata and vice versa has no demonstrable harm, and I actually care about small systems). So, make -z noseparate-code the default. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D64903 llvm-svn: 367537	2019-08-01 09:58:25 +00:00
Igor Kudrin	07ceadda25	[ELF] With --vs-diagnostics, print a separate message for each location of a duplicate symbol. We extract and print the source location in the message header so that Visual Studio is able to parse it and jump there. As duplicate symbols are defined in several locations, it is more convenient to have separate error messages, which allows a user to easily access all the locations. Differential Revision: https://reviews.llvm.org/D65213 llvm-svn: 367536	2019-08-01 09:58:03 +00:00
Fangrui Song	0cfa72eaec	[ELF][X86] Improve tests * Add --no-show-raw-insn to llvm-objdump -d tests * When linking an executable with %t.so, the path %t.so will be recorded in the DT_NEEDED entry if %t.so doesn't have DT_SONAME. .dynstr will have varying lengths on different systems. Add -soname so that the string in .dynstr is of fixed length to make tests more robust. * Rename i386-tls-initial-exec-local.s to i386-tls-ie-local.s * Refactor tls-initial-exec-local.s to x86-64-tls-ie-local.s llvm-svn: 367533	2019-08-01 09:25:34 +00:00
Igor Kudrin	510086b5e5	[ELF] Fix finding locations in messages for undefined hidden symbols. Previously, when `--vs-diagnostics` was used, the linker printed something like hidden(undef.s): error: undefined hidden symbol: foo >>> referenced by undef.s:15 Differential Revision: https://reviews.llvm.org/D65499 llvm-svn: 367515	2019-08-01 05:23:45 +00:00
Rui Ueyama	d524c17029	Return early. NFC. llvm-svn: 367200	2019-07-29 05:24:51 +00:00
Bob Haarman	51dcb292cc	[lld-link] diagnose undefined symbols before LTO when possible Summary: This allows reporting undefined symbols before LTO codegen is run. Since LTO codegen can take a long time, this improves user experience by avoiding that time spend if the link is going to fail with undefined symbols anyway. Fixes PR32400. Reviewers: ruiu Reviewed By: ruiu Subscribers: mehdi_amini, steven_wu, dexonsmith, mstorsjo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62434 llvm-svn: 367136	2019-07-26 17:56:45 +00:00
Fangrui Song	d6c448244b	[ELF] Simplify with dyn_cast_or_null. NFC llvm-svn: 367126	2019-07-26 16:29:15 +00:00
Fangrui Song	3d51d4ed6d	[ELF] Detemplate maybeReportUndefined and copySectionsIntoPartitions llvm-svn: 367117	2019-07-26 14:57:53 +00:00
Fangrui Song	3e023a6dbc	[ELF][MIPS] Improve tests * Add --no-show-raw-insn to llvm-objdump -d tests * When linking an executable with %t.so, the path %t.so will be recorded in the DT_NEEDED entry if %t.so doesn't have DT_SONAME. .dynstr will have varying lengths on different systems. Add -soname to make tests more robust. llvm-svn: 366988	2019-07-25 07:12:23 +00:00
Guanzhong Chen	87186b2447	[WebAssembly] Set __tls_align to 1 when there is no TLS Summary: We want the tool conventions to state that `__tls_align` will be a power of 2. It makes sense to not have an exception for when there is no TLS. Reviewers: tlively, sunfish Reviewed By: tlively Subscribers: dschuff, sbc100, jgravelle-google, aheejin, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65177 llvm-svn: 366948	2019-07-24 21:48:14 +00:00
Alexander Richardson	a8104b4927	[LLD] Do not print additional newlines after reaching error limit Summary: This could previously happen if errors that are emitted after reaching the error limit. In that case, the flag inside the newline() function will be set to true which causes the next call to print a newline even though the actual message will be discarded. Reviewers: ruiu, grimar, MaskRay, espindola Reviewed By: ruiu Subscribers: emaste, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65198 llvm-svn: 366944	2019-07-24 20:56:23 +00:00
Simon Atanasyan	5adbfdda2d	[mips] Assign R_TLS type to the R_MIPS_TLS_TPREL_XXX relocations. NFC That allows to remove duplicated code which subtracts 0x7000 from the R_MIPS_TLS_TPREL_XXX relocations values in the `MIPS::relocateOne` function. llvm-svn: 366888	2019-07-24 11:37:13 +00:00
Nico Weber	9c0716f116	ld.lld: Demangle symbols from archives in diagnostics This ports r366573 from COFF to ELF. There are now to toString(Archive::Symbol), one doing MSVC demangling in COFF and one doing Itanium demangling in ELF, so rename these two to toCOFFString() and to toELFString() to not get a duplicate symbol. Nothing ever passes a raw Archive::Symbol to CHECK(), so these not being part of the normal toString() machinery seems ok. There are two code paths in the ELF linker that emits this type of diagnostic: 1. The "normal" one in InputFiles.cpp. This is covered by the tweaked test. 2. An additional one that's only used for libcalls if there's at least one bitcode in the link, and if the libcall symbol is lazy, and lazily loaded from an archive (i.e. not from a lazy .o file). (This code path was added in r339301.) Since all libcall names so far are C symbols and never mangled, the change there is not observable and hence not covered by tests. Differential Revision: https://reviews.llvm.org/D65095 llvm-svn: 366836	2019-07-23 19:00:01 +00:00
Martin Storsjo	b95fcf09c0	[test] Fix the test from the previous commit when run on windows. NFC. Apparently the escaped dollar sign didn't work the same way in "echo -e" on windows buildbots. llvm-svn: 366784	2019-07-23 07:28:23 +00:00
Martin Storsjo	341a68ca2f	[COFF] Unbreak sorting of mingw comdat .tls sections after SVN r363457 Code built for mingw with -fdata-sections will store each TLS variable in a comdat section, named .tls$$<varname>. Normal TLS variables are stored in sections named .tls$ with a trailing dollar, which are sorted after a starter marker (in a later linked object file) in a section named ".tls" (with no dollar suffix), before an ending marker in a section named ".tls$ZZZ". The mingw comdat section suffix stripping introduced in SVN r363457 broke sorting of such tls sections, ending up sorting the stripped .tls$$<varname> sections (stripped to ".tls") before the start marker in the section named ".tls". We could add exceptions to the section name suffix stripping for .tls (and .CRT, where suffixes always should be honored), but the more conservative option is probably the reverse; to only apply the stripping for the normal sections where sorting shouldn't have any effect. Differential Revision: https://reviews.llvm.org/D65018 llvm-svn: 366780	2019-07-23 06:38:04 +00:00

... 5 6 7 8 9 ...

12913 Commits