llvm-project

Commit Graph

Author	SHA1	Message	Date
Rui Ueyama	a1b79dff2a	Handle input section liveness only in MarkLive.cpp. The condition whether a section is alive or not by default is becoming increasingly complex, so the decision of garbage collection is spreading over InputSection.h and MarkLive.cpp, which is not a good state. This moves the code to MarkLive.cpp, to keep the file the central place to make decisions about garbage collection. llvm-svn: 315384	2017-10-10 22:59:32 +00:00
James Henderson	b5ca92ef73	[ELF] Set Dot initially to --image-base value when using linker scripts When parsing linker scripts, LLD previously started with a '.' value of 0, regardless of the internal default image base for the target, and regardless of switches such as --image-base. It seems reasonable to use a different image base value when using linker scripts and --image-base is specified, since otherwise the switch has no effect. This change does this, as well as removing unnecessary initialisation of Dot where it is not used. The default image base should not be used when processing linker scripts, because this will change the behaviour for existing linker script users, and potentially result in invalid output being produced, as a subsequent assignment to Dot could move the location counter backwards. Instead, we maintain the existing behaviour of starting from 0 if --image-base is not specified. Reviewers: ruiu Differential Revision: https://reviews.llvm.org/D38360 llvm-svn: 315293	2017-10-10 10:09:35 +00:00
Rui Ueyama	c9a4d1c735	Add comments. llvm-svn: 315268	2017-10-10 03:58:18 +00:00
Rui Ueyama	8befefb2ea	Remove OutputSection::updateAlignment. I feel it is easier to understand without this function. llvm-svn: 315140	2017-10-07 00:58:34 +00:00
George Rimar	8f3a6c8143	[ELF] - Do not produce broken .dynamic section with --no-rosegment LLD produces broken .dynamic section when --no-rosegment and at least one of following options is present: 1) -z rodynamic is given. 2) MIPS target. That happens because code that writes .dynamic assumes target buffer is zero-filled, what can be not true after LLD fills it with trap instructions. With one of two options above, .dynamic becomes SHF_ALLOC section, so can be affected. Differential revision: https://reviews.llvm.org/D38580 llvm-svn: 315054	2017-10-06 10:06:13 +00:00
George Rimar	2727ce2c1f	[ELF] - Do not produce broken .dynsym with --no-rosegment. We produce broken output currently. Code that writes .dynsym assumes output buffer is zero-filled, though that is not always true. When --no-rosegment is given, buffer can be filled with trap instructions. Patch fixes the issue. It is relative with PR34705. Differential revision: https://reviews.llvm.org/D38579 llvm-svn: 315053	2017-10-06 09:56:24 +00:00
Rui Ueyama	0fda1d70bb	Fix typo. llvm-svn: 315042	2017-10-06 04:32:08 +00:00
Rafael Espindola	0038dfa19d	Revert "Revert r314810: Use sched_getaffinity instead of std:🧵:hardware_concurrency." This reverts commit r314924. The required llvm patch was recommitted. llvm-svn: 314933	2017-10-04 20:35:05 +00:00
Rui Ueyama	61b9ce217a	Revert r314810: Use sched_getaffinity instead of std:🧵:hardware_concurrency. This reverts commit r314810 because r314809 was reverted. llvm-svn: 314924	2017-10-04 18:39:51 +00:00
Rui Ueyama	732f4e2778	Remove BssSection::reserveSpace(). We no longer call reserveSpace more than once, so it can be merged with its constructor. llvm-svn: 314867	2017-10-04 00:21:17 +00:00
Shoaib Meenai	50d7b36f5e	[ELF] Decompress debug info sections early When reporting a symbol conflict, LLD parses the debug info to report source location information. Sections have not been decompressed at this point, so if an object file contains zlib compressed debug info, LLD ends up passing this compressed debug info to the DWARF parser, which causes debug info parsing failures and can trigger assertions in the parser (as the test case demonstrates). Decompress debug sections when constructing the LLDDwarfObj to avoid this issue. This doesn't handle GNU-style compressed debug info sections (.zdebug_*), which at present are simply ignored by LLDDwarfObj; those can be done in a follow-up. Differential Revision: https://reviews.llvm.org/D38491 llvm-svn: 314866	2017-10-04 00:19:41 +00:00
Rafael Espindola	c804bf9397	Use sched_getaffinity instead of std:🧵:hardware_concurrency. The issue with std:🧵:hardware_concurrency is that it forwards to libc and some implementations (like glibc) don't take thread affinity into consideration. With this change a llvm program that can execute in only 2 cores will use 2 threads, even if the machine has 32 cores. This makes benchmarking a lot easier, but should also help if someone doesn't want to use all cores for compilation for example. llvm-svn: 314810	2017-10-03 16:25:48 +00:00
Rui Ueyama	f3f9bae842	Add a comment. llvm-svn: 314746	2017-10-03 00:45:24 +00:00
Rui Ueyama	3f851704c1	Move new lld's code to Common subdirectory. New lld's files are spread under lib subdirectory, and it isn't easy to find which files are actually maintained. This patch moves maintained files to Common subdirectory. Differential Revision: https://reviews.llvm.org/D37645 llvm-svn: 314719	2017-10-02 21:00:41 +00:00
Simon Atanasyan	649e4d328f	[MIPS] Fix PLT entries generation in case of linking regular and microMIPS code Currently LLD calls the `isMicroMips` routine to determine type of PLT entries needs to be generated: regular or microMIPS. This routine checks ELF header flags in the `FirstObj` to retrieve type of linked object files. So if the first file does not contain microMIPS code, LLD will generate PLT entries with regular (non-microMIPS) code only. Ideally, if a PLT entry is referenced by microMIPS code only this entry should contain microMIPS code, if a PLT entry is referenced by regular code this entry should contain regular code. In a "mixed" case the PLT entry can be either microMIPS or regular, but each "cross-mode-call" has additional cost. It's rather difficult to implement this ideal solution. But we can assume that if there is an input object file with microMIPS code, the most part of the code is microMIPS too. So we need to deduce type of PLT entries based on finally calculated ELF header flags and do not check only the first input object file. This change implements this. - The `getMipsEFlags` renamed to the `calcMipsEFlags`. The function called from the `LinkerDriver::link`. Result is stored in the Configuration::MipsEFlags field. - The `isMicroMips` and `isMipsR6` routines access the `MipsEFlags` field to get and check calculated ELF flags. - New types of PLT records created when necessary. Differential revision: https://reviews.llvm.org/D37747 llvm-svn: 314675	2017-10-02 14:56:41 +00:00
NAKAMURA Takumi	70947e2224	SyntheticSections.cpp: Appease g++-4.8, s/const/constexpr/ llvm-svn: 314592	2017-09-30 13:40:22 +00:00
Rui Ueyama	fbc622d1e4	Fix buildbots. llvm-svn: 314590	2017-09-30 12:19:08 +00:00
Rui Ueyama	c97a70c6f5	Parallelize string merging. String merging is one of the most time-consuming functions in lld. This patch parallelize it to speed it up. On my 2-socket 20-core 40-threads Xeon E5-2680 @ 2.8 GHz machine, this patch shorten the clang debug build link time from 7.11s to 5.16s. It's a 27% improvement and actually pretty noticeable. In this test condition, lld is now 4x faster than gold. Differential Revision: https://reviews.llvm.org/D38266 llvm-svn: 314588	2017-09-30 11:46:26 +00:00
Ben Dunbobbin	73eabf23a4	[ELF] Simpler scheme for handling common symbols Convert all common symbols to regular symbols after scan. This means that the downstream code does not to handle common symbols as a special case. Differential Revision: https://reviews.llvm.org/D38137 llvm-svn: 314495	2017-09-29 09:08:26 +00:00
George Rimar	aaf5471429	[ELF] - Detemplate of HashTableSection<ELFT> Detemplation of one more synthetic section. Differential revision: https://reviews.llvm.org/D38241 llvm-svn: 314283	2017-09-27 09:14:59 +00:00
George Rimar	5d6efd100b	[ELF] - Speedup -r and --emit-relocs This is "Bug 34688 - lld much slower than bfd when linking the linux kernel" Inside copyRelocations() we have O(N*M) algorithm, where N - amount of relocations and M - amount of symbols in symbol table. It isincredibly slow for linking linux kernel. Patch creates local search tables to speedup. With this fix link time goes for me from 12.95s to 0.55s what is almost 23x faster. (used release LLD). Differential revision: https://reviews.llvm.org/D38129 llvm-svn: 314282	2017-09-27 09:08:53 +00:00
Rui Ueyama	e26b7aafe0	Split MergeSyntheticSection into Merge{Tail,NoTail}Section. This patch alone is neutral in terms of code readability, but this change makes a following patch easier to read. llvm-svn: 314181	2017-09-26 00:54:24 +00:00
George Rimar	19d6ce9d8e	[ELF] - Simplify removeUnusedSyntheticSections a bit. Previously`InX::Got` and InX::MipsGot synthetic sections were not removed if ElfSym::GlobalOffsetTable was defined. ElfSym::GlobalOffsetTable is a symbol for _GLOBAL_OFFSET_TABLE_. Patch moves ElfSym::GlobalOffsetTable check out from removeUnusedSyntheticSections. Also note that there was no point to check ElfSym::GlobalOffsetTable for MIPS case because InX::MipsGot::empty() always returns false for non-relocatable case, and in case of relocatable output we do not create special symbols anyways. Differential revision: https://reviews.llvm.org/D37623 llvm-svn: 314099	2017-09-25 09:46:33 +00:00
Rui Ueyama	f9da2fdc78	Do not sort CU vectors. We used to sort and uniquify CU vectors, but looks like CU vectors in .gdb_index sections created by gold are not guaranteed to be sorted. llvm-svn: 314095	2017-09-25 05:30:39 +00:00
Rui Ueyama	17cc6f6ad9	Speeds up CU vector creation. We used to use std::set to uniquify CU vector elements, but as we know, std::set is pretty slow. Fortunately we didn't actually have to use a std::set here. This patch replaces it with std::vector. With this patch, lld's -gdb-index overhead when linking a clang debug build is now about 1 second (8.65 seconds without -gdb-index vs 9.60 seconds with -gdb-index). Since gold takes more than 6 seconds to create a .gdb_index for the same output, our number isn't that bad. llvm-svn: 314094	2017-09-25 04:55:27 +00:00
Rui Ueyama	8f222b8158	Fix off-by-one error. llvm-svn: 314093	2017-09-25 03:40:45 +00:00
Rui Ueyama	bbc477c9b6	Do not use StringTableBuilder to build symbol table for .gdb_index. Previously, we had two levels of hash table lookup. The first hash lookup uses CachedHashStringRefs as keys and returns offsets in string table. Then, we did the second hash table lookup to obtain GdbSymbol pointers. But we can directly map strings to GDbSymbols. One test file is updated in this patch because we no longer have a '\0' byte at the start of the string pool, which was automatically inserted by StringTableBuilder. This patch speeds up Clang debug build (with -gdb-index) link time by 0.3 seconds. llvm-svn: 314092	2017-09-25 02:29:51 +00:00
Rui Ueyama	22125d8c84	Compute string hashes early and cache them. This change alone speeds up linking of Clang debug build with -gdb-index by 1.2 seconds, from 12.5 seconds to 11.3 seconds. (Without -gdb-index, lld takes 8.5 seconds to link the same input files.) llvm-svn: 314090	2017-09-25 01:42:57 +00:00
Rui Ueyama	9f4f490c31	Refactor GdbIndexSection. NFC. This patch rewrites a part of GdbIndexSection to address the following issues in the previous implementation: - Previously, some struct declarations were in GdbIndex.h while they were not used in GdbIndex.cpp. Such structs are moved to SyntheticSection.h. - The actual implementation were split into GdbIndexSection and GdbHash section, but that separation didn't make much sense. They are now unified as GdbIndexSection. In addition to the above changes, this patch splits functions, rename variables and remove redundant functions/variables to generally improve code quality. llvm-svn: 314084	2017-09-24 21:45:35 +00:00
George Rimar	94444b9a07	[ELF] - Fix segfault when processing .eh_frame. Its a PR34648 which was a segfault that happened because we stored pointers to elements in DenseMap. When DenseMap grows such pointers are invalidated. Solution implemented is to keep elements by pointer and not by value. Differential revision: https://reviews.llvm.org/D38034 llvm-svn: 313741	2017-09-20 09:27:41 +00:00
NAKAMURA Takumi	169dbde262	Revert rL313697, "Compact EhSectionPiece from 32 bytes to 16 bytes." It broke selfhosting. http://lab.llvm.org:8011/builders/clang-with-lto-ubuntu/builds/4896 llvm-svn: 313731	2017-09-20 08:03:18 +00:00
Rui Ueyama	014b0f24ae	Compact EhSectionPiece from 32 bytes to 16 bytes. EhSectionPiece used to have a pointer to a section, but that pointer was mostly redundant because we almost always know what the section is without using that pointer. This patch removes the pointer from the struct. This patch also use uint32_t/int32_t instead of size_t to represent offsets that are hardly be larger than 4 GiB. At the moment, I think it is OK even if we cannot handle .eh_frame sections larger than 4 GiB. Differential Revision: https://reviews.llvm.org/D38012 llvm-svn: 313697	2017-09-19 23:36:48 +00:00
Rui Ueyama	74ea1f0938	Rename CieRecord instance variables. CieRecord is a struct containing a CIE and FDEs, but oftentimes the struct itself is named `Cie` which caused some confusion. This patch renames them `CieRecords` or `Rec`. llvm-svn: 313681	2017-09-19 21:31:57 +00:00
Rui Ueyama	faa38029e2	Simplify. NFC. llvm-svn: 313667	2017-09-19 20:28:03 +00:00
George Rimar	696a7f9ac6	[ELF] - Introduce std::vector<InputFile *> global arrays. This patch removes lot of static Instances arrays from different input file classes and introduces global arrays for access instead. Similar to arrays we have for InputSections/OutputSectionCommands. It allows to iterate over input files in a non-templated code. Differential revision: https://reviews.llvm.org/D35987 llvm-svn: 313619	2017-09-19 09:20:54 +00:00
Rui Ueyama	34a0dd5283	Rename EhSectionPiece::ID -> EhSectionPiece::Sec. ID sounds like an identifier, but this is actually a pointer to a section. llvm-svn: 313588	2017-09-18 23:07:33 +00:00
Rui Ueyama	e084aac123	Do not use inheritance for EhSectionPiece. EhSectionPiece inherited from SectionPiece, but we did not actually use EhSectionPiece objects as SectionPiece ojbects. They were handled as distinct types. So it didn't make much sense to use inheritance. llvm-svn: 313587	2017-09-18 23:07:21 +00:00
Rui Ueyama	a6ff617967	Remove useless accessor. llvm-svn: 313586	2017-09-18 23:07:09 +00:00
Rui Ueyama	27a357c9d9	Remove redundant cast<> and null check. "Repl" member is guranteed to have a non-null pointer. If an input section is not merged by ICF, "Repl" points to "this". Otherwise, it points to some other section. It must not be NULL. llvm-svn: 313556	2017-09-18 19:15:54 +00:00
Rui Ueyama	56614e41a4	Add a comment about a workaround for ld.gold -r. llvm-svn: 313095	2017-09-12 23:43:45 +00:00
Rafael Espindola	67df57a242	Remove Offset from Common. It is not needed since it is always 0. llvm-svn: 313076	2017-09-12 21:19:09 +00:00
Dmitry Mikulin	1e30f07ce7	Currently lld creates a single section to collect all commons. There is no way to separate commons based on file name patterns. The following linker script construct does not work because commons are allocated before section placement is done and the only synthesized BssSection that holds all commons has no file associated with it: SECTIONS { .common_0 : { *file0.o(COMMON) }} This patch changes the allocation of commons to create a section per common symbol and let the section logic do the layout. Differential revision: https://reviews.llvm.org/D37489 llvm-svn: 312796	2017-09-08 16:22:43 +00:00
Andrew Ng	6dee736c91	[LLD] Fix padding of .eh_frame when in executable segment The default padding for an executable segment is the target trap instruction which for x86_64 is 0xCC. However, the .eh_frame section requires the padding to be zero. The code that writes the .eh_frame section assumes that its segment is zero initialized and does not explicitly write the zero padding. This does not work when the .eh_frame section is in the executable segment (for example when using -no-rosegment). This patch changes the .eh_frame writing code to explicitly write the zero padding. Differential Revision: https://reviews.llvm.org/D37462 llvm-svn: 312706	2017-09-07 08:43:56 +00:00
George Rimar	e89c5bfbc2	[ELF] - Never call splitIntoPieces() twice. NFC. Previously it was called twice for .comment synthetic section. That created 2 pieces of data, which was deduplicated anyways, but was not clean. llvm-svn: 312327	2017-09-01 12:04:52 +00:00
Rui Ueyama	2b6631bb36	Remove GdbIndexSection::finalizeContents. GdbIndexSection doesn't need lazy finalization because when an instance of the class is created, we already know all debug info sections. We can initialize the instnace in the ctor. llvm-svn: 310931	2017-08-15 17:01:39 +00:00
Rui Ueyama	e5d642cf5b	Use ArrayRef instead of std::vector&. llvm-svn: 310930	2017-08-15 17:01:28 +00:00
Rui Ueyama	2114cab93d	Update a comment and rename a function. llvm-svn: 310929	2017-08-15 17:01:17 +00:00
Rui Ueyama	43099e4409	Remove SymbolTable::findInCurrentDSO. This function doesn't seem to add value to the symbol table as it is easy to write code without it. llvm-svn: 310925	2017-08-15 16:03:11 +00:00
Rui Ueyama	9c77d27004	Garbage-collect common symbols. Liveness is usually a notion of input sections, but this patch adds "liveness" bit to common symbols because they don't belong to any input section. This patch is based on https://reviews.llvm.org/D36520 Differential Revision: https://reviews.llvm.org/D36546 llvm-svn: 310617	2017-08-10 15:54:27 +00:00
Rafael Espindola	6e93d0546a	Move File from SymbolBody to Symbol. With this Symbol has the same size as before, but DefinedRegular goes from 72 to 64 bytes. I also find this a bit easier to read. There are fewer places initializing File for example. This has a small but measurable speed improvement on all tests (1% max). llvm-svn: 310142	2017-08-04 22:31:42 +00:00

1 2 3 4 5 ...

311 Commits