llvm-project

Commit Graph

Author	SHA1	Message	Date
Reid Kleckner	8cbdd0c0f2	[PDB] Add a module descriptor for every object file Summary: Expose the module descriptor index and fill it in for section contributions. Reviewers: zturner Subscribers: llvm-commits, ruiu, hiraditya Differential Revision: https://reviews.llvm.org/D34126 llvm-svn: 305296	2017-06-13 15:49:13 +00:00
Zachary Turner	68ea80d0a7	Slightly better fix for dealing with no-id-stream PDBs. The last fix required the user to manually add the required feature. This caused an LLD test to fail because I failed to update LLD. In practice we can hide this logic so it can just be transparently added when we write the PDB. llvm-svn: 305236	2017-06-12 21:46:51 +00:00
Zachary Turner	990d0c8158	[llvm-pdbdump] Don't fail on PDBs with no ID stream. Older PDBs don't have this. Its presence is detected by using the various "feature" flags that come at the end of the PDB Stream. Detect this, and don't try to dump the ID stream if the features tells us it's not present. llvm-svn: 305235	2017-06-12 21:34:53 +00:00
Zachary Turner	d334cebac4	Fix a null pointer dereference in llvm-pdbutil pretty. Static data members were causing a problem because I mistakenly assumed all members would affect a class's layout and so the Layout member would be non-null. llvm-svn: 305229	2017-06-12 20:46:35 +00:00
Sylvestre Ledru	337804d86a	Same expressions on both sides of the return Summary: I guess we want PointerToMemberFunction & PointerToDataMember Fix coverity cid 1376038 Reviewers: zturner Reviewed By: zturner Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34110 llvm-svn: 305219	2017-06-12 18:53:46 +00:00
David Blaikie	a91885a08c	dwarfdump: Handle relocs to zlib (.zdebug*) compressed sections llvm-svn: 305152	2017-06-10 19:32:50 +00:00
Galina Kistanova	038f9854ec	Added llvm_unreachable as ReportError cannot be specified as noreturn. llvm-svn: 305143	2017-06-10 07:50:14 +00:00
Zachary Turner	3226fe95bb	[pdb] Support CoffSymbolRVA debug subsection. llvm-svn: 305108	2017-06-09 20:46:52 +00:00
Zachary Turner	7e62cd17d6	Allow VarStreamArray to use stateful extractors. Previously extractors tried to be stateless with any additional context information needed in order to parse items being passed in via the extraction method. This led to quite cumbersome implementation challenges and awkwardness of use. This patch brings back support for stateful extractors, making the implementation and usage simpler. llvm-svn: 305093	2017-06-09 17:54:36 +00:00
Bob Haarman	fdf499bf2d	[codeview] use 32-bit integer for RelocOffset in DebugLinesSubsection Summary: RelocOffset is a 32-bit value, but we previously truncated it to 16 bits. Fixes PR33335. Reviewers: zturner, hiraditya! Reviewed By: zturner Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33968 llvm-svn: 305043	2017-06-09 01:18:10 +00:00
Zachary Turner	28c22c83e3	[pdb] Don't crash on unknown debug subsections. More and more unknown debug subsection kinds are being discovered so we should make it possible to dump these and display the bytes. llvm-svn: 305041	2017-06-09 00:53:59 +00:00
Zachary Turner	deb391309c	[CodeView] Support remaining debug subsection types This adds support for Symbols, StringTable, and FrameData subsection types. Even though these subsections rarely if ever appear in a PDB file (they are usually in object files), there's no theoretical reason why they couldn't appear in a PDB. The real issue though is that in order to add support for dumping and writing them (which will be useful for object files), we need a way to test them. And since there is no support for reading and writing them to / from object files yet, making PDB support them is the best way to both add support for the underlying format and add support for tests at the same time. Later, when we go to add support for reading / writing them from object files, we'll need only minimal changes in the underlying read/write code. llvm-svn: 305037	2017-06-09 00:28:08 +00:00
Zachary Turner	1bf7762049	[llvm-pdbdump] Support native ordering of subsections in raw mode. This is the same change for the YAML Output style applied to the raw output style. Previously we would queue up all subsections until every one had been read, and then output them in a pre- determined order. This was because some subsections need to be read first in order to properly dump later subsections. This patch allows them to be dumped in the order they appear. Differential Revision: https://reviews.llvm.org/D34015 llvm-svn: 305034	2017-06-08 23:49:01 +00:00
Zachary Turner	15eb237fd3	[PDB] Don't crash on /debug:fastlink PDBs. Apparently support for /debug:fastlink PDBs isn't part of the DIA SDK (!), and it was causing llvm-pdbdump to crash because we weren't checking for a null pointer return value. This manifests when calling findChildren on the IDiaSymbol, and it returns E_NOTIMPL. llvm-svn: 304982	2017-06-08 16:00:40 +00:00
NAKAMURA Takumi	92c99cd6dc	Update libdeps to add BinaryFormat, introduced in r304864. llvm-svn: 304869	2017-06-07 04:48:49 +00:00
Zachary Turner	264b5d9e88	Move Object format code to lib/BinaryFormat. This creates a new library called BinaryFormat that has all of the headers from llvm/Support containing structure and layout definitions for various types of binary formats like dwarf, coff, elf, etc as well as the code for identifying a file from its magic. Differential Revision: https://reviews.llvm.org/D33843 llvm-svn: 304864	2017-06-07 03:48:56 +00:00
Zachary Turner	1bfb9f47af	Fix uninitialized read. llvm-svn: 304846	2017-06-06 23:54:23 +00:00
Adrian Prantl	318d1195f2	Introduce -brief command line option to llvm-dwarfdump This patch introduces a new command line option, called brief, to llvm-dwarfdump. When -brief is used, the attribute forms for the .debug_info section will not be emitted to output. Patch by Spyridoula Gravani! rdar://problem/21474365 Differential Revision: https://reviews.llvm.org/D33867 llvm-svn: 304844	2017-06-06 23:28:45 +00:00
Chandler Carruth	aaeada6c75	Fix another ordering constraint with windows.h and comment about a revers constraint that we got right (by chance). llvm-svn: 304792	2017-06-06 12:43:20 +00:00
Chandler Carruth	6bda14b313	Sort the remaining #include lines in include/... and lib/.... I did this a long time ago with a janky python script, but now clang-format has built-in support for this. I fed clang-format every line with a #include and let it re-sort things according to the precise LLVM rules for include ordering baked into clang-format these days. I've reverted a number of files where the results of sorting includes isn't healthy. Either places where we have legacy code relying on particular include ordering (where possible, I'll fix these separately) or where we have particular formatting around #include lines that I didn't want to disturb in this patch. This patch is entirely mechanical. If you get merge conflicts or anything, just ignore the changes in this patch and run clang-format over your #include lines in the files. Sorry for any noise here, but it is important to keep these things stable. I was seeing an increasing number of patches with irrelevant re-ordering of #include lines because clang-format was used. This patch at least isolates that churn, makes it easy to skip when resolving conflicts, and gets us to a clean baseline (again). llvm-svn: 304787	2017-06-06 11:49:48 +00:00
Wolfgang Pieb	77d3e938f8	[DWARF] Adding support for the DWARF v5 string offsets table (consumer/reader part only). Reviewers: dblaikie, aprantl Differential Revision: https://reviews.llvm.org/D32779 llvm-svn: 304759	2017-06-06 01:22:34 +00:00
Zachary Turner	88101dadcc	[CodeView] Fix endianness bug. We should be outputting in little endian, but we were writing in host endianness. llvm-svn: 304741	2017-06-05 22:12:23 +00:00
Zachary Turner	349c18f837	[CodeView] Handle Cross Module Imports and Exports. While it's not entirely clear why a compiler or linker might put this information into an object or PDB file, one has been spotted in the wild which was causing llvm-pdbdump to crash. This patch adds support for reading-writing these sections. Since I don't know how to get one of the native tools to generate this kind of debug info, the only test here is one in which we feed YAML into the tool to produce a PDB and then spit out YAML from the resulting PDB and make sure that it matches. llvm-svn: 304738	2017-06-05 21:40:33 +00:00
Zachary Turner	5b74ff33e7	[PDB] Fix use after free. Previously MappedBlockStream owned its own BumpPtrAllocator that it would allocate from when a read crossed a block boundary. This way it could still return the user a contiguous buffer of the requested size. However, It's not uncommon to open a stream, read some stuff, close it, and then save the information for later. After all, since the entire file is mapped into memory, the data should always be available as long as the file is open. Of course, the exception to this is when the data isn't in the file, but rather in some buffer that we temporarily allocated to present this contiguous view. And this buffer would get destroyed as soon as the strema was closed. The fix here is to force the user to specify the allocator, this way it can provide an allocator that has whatever lifetime it chooses. Differential Revision: https://reviews.llvm.org/D33858 llvm-svn: 304623	2017-06-03 00:33:35 +00:00
Zachary Turner	92dcdda623	[CodeView] Support CodeView subsections in any order. Previously we would expect certain subsections to appear in a certain order because some subsections would reference other subsections, but in practice we need to support arbitrary orderings since some object file and PDB file producers generate them this way. This also paves the way for supporting Yaml <-> Object File conversion of CodeView, since Object Files typically have quite a large number of subsections in their debug info. Differential Revision: https://reviews.llvm.org/D33807 llvm-svn: 304588	2017-06-02 19:49:14 +00:00
Zachary Turner	afb81a83a9	Fix 2 more -Wreorder warnings. llvm-svn: 304494	2017-06-01 23:24:50 +00:00
Zachary Turner	ebd3ae8371	[CodeView] Properly align symbol records on read/write. Object files have symbol records not aligned to any particular boundary (e.g. 1-byte aligned), while PDB files have symbol records padded to 4-byte aligned boundaries. Since they share the same reading / writing code, we have to provide an option to specify the alignment and propagate it up to the producer or consumer who knows what the alignment is supposed to be for the given container type. Added a test for this by modifying the existing PDB -> YAML -> PDB round-tripping code to round trip symbol records as well as types. Differential Revision: https://reviews.llvm.org/D33785 llvm-svn: 304484	2017-06-01 21:52:41 +00:00
Adrian Prantl	f4bc1f77b7	[DWARF] Introduce Dump Options This commit introduces a structure that holds all the flags that control the pretty printing of dwarf output. Patch by Spyridoula Gravani! Differential Revision: https://reviews.llvm.org/D33749 llvm-svn: 304446	2017-06-01 18:18:23 +00:00
Zachary Turner	d427383cb8	[CodeView] Move CodeView YAML code to ObjectYAML. This is the beginning of an effort to move the codeview yaml reader / writer into ObjectYAML so that it can be shared. Currently the only consumer / producer of CodeView YAML is llvm-pdbdump, but CodeView can exist outside of PDB files, and indeed is put into object files and passed to the linker to produce PDB files. Furthermore, there are subtle differences in the types of records that show up in object file CodeView vs PDB file CodeView, but they are otherwise 99% the same. By having this code in ObjectYAML, we can have llvm-pdbdump reuse this code, while teaching obj2yaml and yaml2obj to use this syntax for dealing with object files that can contain CodeView. This patch only adds support for CodeView type information to ObjectYAML. Subsequent patches will add support for CodeView symbol information. llvm-svn: 304248	2017-05-30 21:53:05 +00:00
Galina Kistanova	8c1e2f9108	Added missing break. llvm-svn: 304230	2017-05-30 19:02:49 +00:00
Zachary Turner	591312c5c1	[CodeView] Add more DebugSubsection implementations. This adds implementations for Symbols and FrameData, and renames the existing codeview::StringTable class to conform to the DebugSectionStringTable convention. llvm-svn: 304222	2017-05-30 17:13:33 +00:00
Zachary Turner	8c099fe06e	[CodeView] Rename ModuleDebugFragment -> DebugSubsection. This is more concise, and matches the terminology used in other parts of the codebase more closely. llvm-svn: 304218	2017-05-30 16:36:15 +00:00
George Rimar	a25d329b33	Recommit "[DWARF] - Make collectAddressRanges() return section index in addition to Low/High PC" With fix of uninitialized variable. Original commit message: This change is intended to use for LLD in D33183. Problem we have in LLD when building .gdb_index is that we need to know section which address range belongs to. Previously it was solved on LLD side by providing fake section addresses with use of llvm::LoadedObjectInfo interface. We assigned file offsets as addressed. Then after obtaining ranges lists, for each range we had to find section ID's. That not only was slow, but also complicated implementation and was the reason of incorrect behavior when sections share the same offsets, like D33176 shows. This patch makes DWARF parsers to return section index as well. That solves problem mentioned above. Differential revision: https://reviews.llvm.org/D33184 llvm-svn: 304078	2017-05-27 18:10:23 +00:00
George Rimar	1f9cab6b1c	Revert r304002 "[DWARF] - Make collectAddressRanges() return section index in addition to Low/High PC" Revert it again. Now another bot unhappy: http://lab.llvm.org:8011/builders/clang-s390x-linux/builds/8750 llvm-svn: 304011	2017-05-26 17:36:23 +00:00
George Rimar	bc223c63cc	[DWARF] - Make collectAddressRanges() return section index in addition to Low/High PC This change is intended to use for LLD in D33183. Problem we have in LLD when building .gdb_index is that we need to know section which address range belongs to. Previously it was solved on LLD side by providing fake section addresses with use of llvm::LoadedObjectInfo interface. We assigned file offsets as addressed. Then after obtaining ranges lists, for each range we had to find section ID's. That not only was slow, but also complicated implementation and was the reason of incorrect behavior when sections share the same offsets, like D33176 shows. This patch makes DWARF parsers to return section index as well. That solves problem mentioned above. Differential revision: https://reviews.llvm.org/D33184 llvm-svn: 304002	2017-05-26 16:26:18 +00:00
George Rimar	a8403a64ea	Revert "[DWARF] - Make collectAddressRanges() return section index in addition to Low/High PC" Broked BB again: TEST 'LLVM :: DebugInfo/X86/dbg-value-regmask-clobber.ll' FAILED ... LLVM ERROR: Section was outside of section table. llvm-svn: 303984	2017-05-26 13:20:09 +00:00
George Rimar	655b7b63f6	Recommit r303978 "[DWARF] - Make collectAddressRanges() return section index in addition to Low/High PC" With fix of test compilation. Initial commit message: This change is intended to use for LLD in D33183. Problem we have in LLD when building .gdb_index is that we need to know section which address range belongs to. Previously it was solved on LLD side by providing fake section addresses with use of llvm::LoadedObjectInfo interface. We assigned file offsets as addressed. Then after obtaining ranges lists, for each range we had to find section ID's. That not only was slow, but also complicated implementation and was the reason of incorrect behavior when sections share the same offsets, like D33176 shows. This patch makes DWARF parsers to return section index as well. That solves problem mentioned above. Differential revision: https://reviews.llvm.org/D33184 llvm-svn: 303983	2017-05-26 13:13:50 +00:00
George Rimar	7d5f12185a	Revert r303978 "[DWARF] - Make collectAddressRanges() return section index in addition to Low/High PC" It failed BB. llvm-svn: 303981	2017-05-26 12:53:41 +00:00
George Rimar	732f268aa0	[DWARF] - Make collectAddressRanges() return section index in addition to Low/High PC This change is intended to use for LLD in D33183. Problem we have in LLD when building .gdb_index is that we need to know section which address range belongs to. Previously it was solved on LLD side by providing fake section addresses with use of llvm::LoadedObjectInfo interface. We assigned file offsets as addressed. Then after obtaining ranges lists, for each range we had to find section ID's. That not only was slow, but also complicated implementation and was the reason of incorrect behavior when sections share the same offsets, like D33176 shows. This patch makes DWARF parsers to return section index as well. That solves problem mentioned above. Differential revision: https://reviews.llvm.org/D33184 llvm-svn: 303978	2017-05-26 12:46:41 +00:00
Zachary Turner	f2110283c6	Remove unused member. llvm-svn: 303942	2017-05-25 23:47:56 +00:00
Zachary Turner	fed467eefb	[CV Type Merging] Find nested type indices faster. Merging two type streams is one of the most time consuming parts of generating a PDB, and as such it needs to be as fast as possible. The visitor abstractions used for interoperating nicely with many different types of inputs and outputs have been used widely and help greatly for testability and implementing tools, but the abstractions build up and get in the way of performance. This patch removes all of the visitation stuff from the type stream merger, essentially re-inventing the leaf / member switch and loop, but at a very low level. This allows us many other optimizations, such as not actually deserializing any records (even member records which don't describe their own length), as the operation of "figure out how long this record is" is somewhat faster than "figure out how long this record and get all its fields out". Furthermore, whereas before we had to deserialize, re-write type indices, then re-serialize, now we don't have to do any of those 3 steps. We just find out where the type indices are and pull them directly out of the byte stream and re-write them. This is worth a 50-60% performance increase. On top of all other optimizations that have been applied this week, I now get the following numbers when linking lld.exe and lld.pdb MSVC: 25.67s Before This Patch: 18.59s After This Patch: 8.92s So this is a huge performance win. Differential Revision: https://reviews.llvm.org/D33564 llvm-svn: 303935	2017-05-25 23:36:16 +00:00
Zachary Turner	2897e0306e	[lld] Fix a bug where we continually re-follow type servers. Originally this was intended to be set up so that when linking a PDB which refers to a type server, it would only visit the PDB once, and on subsequent visitations it would just skip it since all the records had already been added. Due to some C++ scoping issues, this was not occurring and it was revisiting the type server every time, which caused every record to end up being thrown away on all subsequent visitations. This doesn't affect the performance of linking clang-cl generated object files because we don't use type servers, but when linking object files and libraries generated with /Zi via MSVC, this means only 1 object file has to be linked instead of N object files, so the speedup is quite large. llvm-svn: 303920	2017-05-25 21:16:03 +00:00
Zachary Turner	7f97c362a4	[CodeView Type Merging] Don't keep re-allocating temp serializer. Previously, every time we wanted to serialize a field list record, we would create a new copy of FieldListRecordBuilder, which would in turn create a temporary instance of TypeSerializer, which itself had a std::vector<> that was about 128K in size. So this 128K allocation was happening every time. We can re-use the same instance over and over, we just have to clear its internal hash table and seen records list between each run. This saves us from the constant re-allocations. This is worth an ~18.5% speed increase (3.75s -> 3.05s) in my tests. Differential Revision: https://reviews.llvm.org/D33506 llvm-svn: 303919	2017-05-25 21:15:37 +00:00
Bob Haarman	55256ada25	[pdb] pad source file name buffer at the end instead of the beginning Summary: DbiStreamBuilder calculated the offset of the source file names inside the file info substream as the size of the file info substream minus the size of the file names. Since the file info substream is padded to a multiple of 4 bytes, this caused the first file name to be aligned on a 4-byte boundary. By contrast, DbiModuleList would read the file names immediately after the file name offset table, without skipping to the next 4-byte boundary. This change makes it so that the file names are written to the location where DbiModuleList expects them, and puts any necessary padding for the file info substream after the file names instead of before it. Reviewers: amccarth, rnk, zturner Reviewed By: amccarth, zturner Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33475 llvm-svn: 303917	2017-05-25 21:12:15 +00:00
Zachary Turner	c4e4b7e31e	Fix a bug in MappedBlockStream. It was using the number of blocks of the entire PDB file as the number of blocks of each stream that was created. This was only an issue in the readLongestContiguousChunk function, which was never called prior. This bug surfaced when I updated an algorithm to use this function and the algorithm broke. llvm-svn: 303916	2017-05-25 21:12:00 +00:00
Zachary Turner	dda25b128c	[CodeView Type Merging] Avoid record deserialization when possible. A profile shows the majority of time doing type merging is spent deserializing records from sequences of bytes into friendly C++ structures that we can easily access members of in order to find the type indices to re-write. Records are prefixed with their length, however, and most records have type indices that appear at fixed offsets in the record. For these records, we can save some cycles by just looking at the right place in the byte sequence and re-writing the value, then skipping the record in the type stream. This saves us from the costly deserialization of examining every field, including potentially null terminated strings which are the slowest, even though it was unnecessary to begin with. In addition, we apply another optimization. Previously, after deserializing a record and re-writing its type indices, we would unconditionally re-serialize it in order to compute the hash of the re-written record. This would result in an alloc and memcpy for every record. If no type indices were re-written, however, this was an unnecessary allocation. In this patch re-writing is made two phase. The first phase discovers the indices that need to be rewritten and their new values. This information is passed through to the de-duplication code, which only copies and re-writes type indices in the serialized byte sequence if at least one type index is different. Some records have type indices which only appear after variable length strings, or which have lists of type indices, or various other situations that can make it tricky to make this optimization. While I'm not giving up on optimizing these cases as well, for now we can get the easy cases out of the way and lay the groundwork for more complicated cases later. This patch yields another 50% speedup on top of the already large speedups submitted over the past 2 days. In two tests I have run, I went from 9 seconds to 3 seconds, and from 16 seconds to 8 seconds. Differential Revision: https://reviews.llvm.org/D33480 llvm-svn: 303914	2017-05-25 21:06:28 +00:00
Zachary Turner	bb64231d2d	Don't do a full scan of the type stream before processing records. LazyRandomTypeCollection is designed for random access, and in order to provide this it lazily indexes ranges of types. In the case of types from an object file, there is no partial index to build off of, so it has to index the full stream up front. However, merging types only requires sequential access, and when that is needed, this extra work is simply wasted. Changing the algorithm to work on sequential arrays of types rather than random access type collections eliminates this up front scan. llvm-svn: 303707	2017-05-24 00:26:27 +00:00
Zachary Turner	7daf62e743	[CodeView] Eliminate redundant hashes and allocations. When writing field list records, we would construct a temporary type serializer that shared a bump ptr allocator with the rest of the application, so anything allocated from here would live forever. Furthermore, this temporary serializer had all the properties of a full blown serializer including record hashing and de-duplication. These features are required when you're merging multiple type streams into each other, because different streams may contain identical records, but records from the same type stream will never collide with each other. So all of this hashing was unnecessary. To solve this, two fixes are made: 1) The temporary serializer keeps its own bump ptr allocator instead of sharing a global one. When it's finished, all of its memory is freed. 2) Instead of using the same temporary serializer for the life of an entire type stream, we use it only for the life of a single field list record and delete it when the field list record is completed. This way the hash table will not grow as other records from the same type stream get inserted. Further improvements could eliminate hashing entirely from this codepath. This reduces the link time by 85% in my test, from 1 minute to 9 seconds. llvm-svn: 303676	2017-05-23 18:56:23 +00:00
Reid Kleckner	36238b15d7	Speculative build fix for non-Windows llvm-svn: 303667	2017-05-23 18:28:13 +00:00
Reid Kleckner	ded38803c5	[PDB] Hash types up front when merging types instead of using StringMap Summary: First, StringMap uses llvm::HashString, which is only good for short identifiers and really bad for large blobs of binary data like type records. Moving to `DenseMap<StringRef, TypeIndex>` with some tricks for memory allocation fixes that. Unfortunately, that didn't buy very much performance. Profiling showed that we spend a long time during DenseMap growth rehashing existing entries. Also, in general, DenseMap is faster when the keys are small. This change takes that to the logical conclusion by introducing a small wrapper value type around a pointer to key data. The key data contains a precomputed hash, the original record data (pointer and size), and the type index, which is the "value" of our original map. This reduces the time to produce llvm-as.exe and llvm-as.pdb from ~15s on my machine to 3.5s, which is about a 4x improvement. Reviewers: zturner, inglorion, ruiu Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33428 llvm-svn: 303665	2017-05-23 18:23:59 +00:00
Zachary Turner	bf35e6ab2a	Revert "Make TypeSerializer's StringMap use the same allocator." This reverts commit e34ccb7b57da25cc89ded913d8638a2906d1110a. This is causing failures on the ASAN bots. llvm-svn: 303640	2017-05-23 15:50:37 +00:00
David Blaikie	15d85fc537	libDebugInfo: Support symbolizing using DWP files llvm-svn: 303609	2017-05-23 06:48:53 +00:00
David Blaikie	37d1cff491	FIX: Remove debugging assert left in previous commit Sorry for the bot noise. llvm-svn: 303592	2017-05-23 00:31:24 +00:00
David Blaikie	f9803fb4bb	libDebugInfo: Avoid independently parsing the same .dwo file for two separate CUs residing there NFC, just an optimization. Will be building on this for DWP support shortly. llvm-svn: 303591	2017-05-23 00:30:42 +00:00
Zachary Turner	d4136e945e	Implement various flavors of type merging. Previous algotirhm assumed that types and ids are in a single unified stream. For inputs that come from object files, this is the case. But if the input is already a PDB, or is the result of a previous merge, then the types and ids will already have been split up, in which case we need an algorithm that can accept operate on independent streams of types and ids that refer across stream boundaries to each other. Differential Revision: https://reviews.llvm.org/D33417 llvm-svn: 303577	2017-05-22 21:07:43 +00:00
Zachary Turner	12f8c31c04	Make TypeSerializer's StringMap use the same allocator. llvm-svn: 303576	2017-05-22 21:07:14 +00:00
David Blaikie	d2f3a941e0	libDebugInfo/DWARF: Apply relocations for debug_addr addresses in object files llvm-symbolizer would fail to symbolize addresses in unlinked object files when handling .dwo file data because the addresses would not be relocated in the same way as the ranges in the skeleton CU in the object file. Fix that so object files can be symbolized the same as executables. llvm-svn: 303532	2017-05-22 07:02:47 +00:00
David Blaikie	8d039d40c5	llvm-symbolizer: Support multiple CUs in a single DWO file llvm-svn: 303482	2017-05-20 03:32:49 +00:00
Zachary Turner	526f4f2aa8	Resubmit "[CodeView] Provide a common interface for type collections." This was originally reverted because it was a breaking a bunch of bots and the breakage was not surfacing on Windows. After much head-scratching this was ultimately traced back to a bug in the lit test runner related to its pipe handling. Now that the bug in lit is fixed, Windows correctly reports these test failures, and as such I have finally (hopefully) fixed all of them in this patch. llvm-svn: 303446	2017-05-19 19:26:58 +00:00
Zachary Turner	1dfcf8d92c	Revert "[CodeView] Provide a common interface for type collections." This is a squash of ~5 reverts of, well, pretty much everything I did today. Something is seriously broken with lit on Windows right now, and as a result assertions that fire in tests are triggering failures. I've been breaking non-Windows bots all day which has seriously confused me because all my tests have been passing, and after running lit with -a to view the output even on successful runs, I find out that the tool is crashing and yet lit is still reporting it as a success! At this point I don't even know where to start, so rather than leave the tree broken for who knows how long, I will get this back to green, and then once lit is fixed on Windows, hopefully hopefully fix the remaining set of problems for real. llvm-svn: 303409	2017-05-19 05:57:45 +00:00
Zachary Turner	47fdc73771	Don't crash if someone tries to visit an empty type stream. llvm-svn: 303408	2017-05-19 05:18:09 +00:00
Zachary Turner	59ab6a3816	[CodeView] Reduce memory usage in TypeSerializer. We were using a BumpPtrAllocator to allocate stable storage for a record, then trying to insert that into a hash table. If a collision occurred, the bytes were never inserted and the allocation was unnecessary. At the cost of an extra hash computation, check first if it exists, and only if it does do we allocate and insert. llvm-svn: 303407	2017-05-19 04:56:48 +00:00
Zachary Turner	8f1d87a79a	Fix crasher in CodeView test. Apparently this was always broken, but previously we were more graceful about it and we would print "unknown udt" if we couldn't find the type index, whereas now we just segfault because we assume it's valid. But this exposed a real bug, which is that we weren't looking in the right place. So fix that, and also fix this crash at the same time. llvm-svn: 303397	2017-05-19 00:56:39 +00:00
Zachary Turner	7b62d7ccc0	Fix some build errors and warnings. llvm-svn: 303391	2017-05-18 23:12:42 +00:00
Zachary Turner	b32ec02b80	[CodeView] Raise the source to ID map out of the TypeStreamMerger. This map will be needed to rewrite symbol streams after re-writing the corresponding type streams. llvm-svn: 303390	2017-05-18 23:04:08 +00:00
Zachary Turner	8fb441ab9c	[llvm-pdbdump] Add the ability to merge PDBs. Merging PDBs is a feature that will be used heavily by the linker. The functionality already exists but does not have deep test coverage because it's not easily exposed through any tools. This patch aims to address that by adding the ability to merge PDBs via llvm-pdbdump. It takes arbitrarily many PDBs and outputs a single PDB. Using this new functionality, a test is added for merging type records. Future patches will add the ability to merge symbol records, module information, etc. llvm-svn: 303389	2017-05-18 23:03:41 +00:00
Zachary Turner	0c60f269fc	[CodeView] Provide a common interface for type collections. Right now we have multiple notions of things that represent collections of types. Most commonly used are TypeDatabase, which is supposed to keep mappings from TypeIndex to type name when reading a type stream, which happens when reading PDBs. And also TypeTableBuilder, which is used to build up a collection of types dynamically which we will later serialize (i.e. when writing PDBs). But often you just want to do some operation on a collection of types, and you may want to do the same operation on any kind of collection. For example, you might want to merge two TypeTableBuilders or you might want to merge two type streams that you loaded from various files. This dichotomy between reading and writing is responsible for a lot of the existing code duplication and overlapping responsibilities in the existing CodeView library classes. For example, after building up a TypeTableBuilder with a bunch of type records, if we want to dump it we have to re-invent a bunch of extra glue because our dumper takes a TypeDatabase or a CVTypeArray, which are both incompatible with TypeTableBuilder. This patch introduces an abstract base class called TypeCollection which is shared between the various type collection like things. Wherever we previously stored a TypeDatabase& in some common class, we now store a TypeCollection&. The advantage of this is that all the details of how the collection are implemented, such as lazy deserialization of partial type streams, is completely transparent and you can just treat any collection of types the same regardless of where it came from. Differential Revision: https://reviews.llvm.org/D33293 llvm-svn: 303388	2017-05-18 23:03:06 +00:00
Zachary Turner	5a83fb153f	Fix some minor issues in PDB parsing library. 1) Until now I'd never seen a valid PDB where the DBI stream and the PDB Stream disagreed on the "Age" field. Because of that, we had code to assert that they matched. Recently though I was given a PDB where they disagreed, so this assumption has proven to be incorrect. Remove this check. 2) We were walking the entire list of hash values for types up front and then throwing away the values. For large PDBs this was a significant slow down. Remove this. With this patch, I can dump the list of all compilands from a 1.5GB PDB file in just a few seconds. llvm-svn: 303351	2017-05-18 15:14:44 +00:00
George Rimar	47f84b1a3c	[DWARF] - Simplify RelocVisitor implementation. We do not need to store relocation width field. Patch removes relative code, that simplifies implementation. Differential revision: https://reviews.llvm.org/D33274 llvm-svn: 303335	2017-05-18 08:25:11 +00:00
George Rimar	f98b9ac5da	[lib/Object] - Minor API update for llvm::Decompressor. I revisited Decompressor API (issue with it was triggered during D32865 review) and found it is probably provides more then we really need. Issue was about next method's signature: Error decompress(SmallString<32> &Out); It is too strict. At first I wanted to change it to decompress(SmallVectorImpl<char> &Out), but then found it is still not flexible because sticks to SmallVector. During reviews was suggested to use templating to simplify code. Patch do that. Differential revision: https://reviews.llvm.org/D33200 llvm-svn: 303331	2017-05-18 08:00:01 +00:00
Bob Haarman	de33a63784	[llvm-pdbdump] in yaml2pdb, generate default output filename if none given Summary: llvm-pdbdump yaml2pdb used to fail with a misleading error message ("An I/O error occurred on the file system") if no output file was specified. This change adds an assert to PDBFileBuilder to check that an output file name is specified, and makes llvm-pdbdump generate an output file name based on the input file name if no output file name is explicitly specified. Reviewers: amccarth, zturner Reviewed By: zturner Subscribers: fhahn, llvm-commits Differential Revision: https://reviews.llvm.org/D33296 llvm-svn: 303299	2017-05-17 20:46:48 +00:00
Zachary Turner	1d795c451e	[CodeView] Simplify the use of visiting type records & streams. There is often a lot of boilerplate code required to visit a type record or type stream. The #1 use case is that you have a sequence of bytes that represent one or more records, and you want to deserialize each one, switch on it, and call a callback with the deserialized record that the user can examine. Currently this requires at least 6 lines of code: codeview::TypeVisitorCallbackPipeline Pipeline; Pipeline.addCallbackToPipeline(Deserializer); Pipeline.addCallbackToPipeline(MyCallbacks); codeview::CVTypeVisitor Visitor(Pipeline); consumeError(Visitor.visitTypeRecord(Record)); With this patch, it becomes one line of code: consumeError(codeview::visitTypeRecord(Record, MyCallbacks)); This is done by having the deserialization happen internally inside of the visitTypeRecord function. Since this is occasionally not desirable, the function provides a 3rd parameter that can be used to change this behavior. Hopefully this can significantly reduce the barrier to entry to using the visitation infrastructure. Differential Revision: https://reviews.llvm.org/D33245 llvm-svn: 303271	2017-05-17 16:39:06 +00:00
George Rimar	fed9f09f48	[DWARF] - Cleanup relocations proccessing. RelocAddrMap was a pair of <width, address>, where width is relocation size (4/8/x, x < 8), and width field was never used in code. Relocations proccessing loop had checks for width field. Does not look like DWARF parser should do that. There is probably no much sense to validate relocations during proccessing them in parser. Patch removes relocation's width relative code from DWARFContext. Differential revision: https://reviews.llvm.org/D33194 llvm-svn: 303251	2017-05-17 12:10:51 +00:00
George Rimar	41e656768d	[DWARF] - Add RelocAddrEntry for cleanup. NFCi. Was mentioned as possible cleanup during review of D33184. llvm-svn: 303171	2017-05-16 14:05:45 +00:00
George Rimar	4671f2e08c	[DWARF] - Use DWARFAddressRange struct instead of uint64_t pair for DWARFAddressRangesVector. Recommit of r303159 "[DWARF] - Use DWARFAddressRange struct instead of uint64_t pair for DWARFAddressRangesVector" All places were shitched to use DWARFAddressRange now. Suggested during review of D33184. llvm-svn: 303163	2017-05-16 12:30:59 +00:00
George Rimar	3824cca7b3	Revert r303159 "[DWARF] - Use DWARFAddressRange struct instead of uint64_t pair for DWARFAddressRangesVector." Something went wrong, it broke BB. http://green.lab.llvm.org/green//job/clang-stage1-cmake-RA-incremental_build/38477/consoleFull#-200034420049ba4694-19c4-4d7e-bec5-911270d8a58c llvm-svn: 303162	2017-05-16 12:05:03 +00:00
George Rimar	8680b6ee9c	[DWARF] - Use DWARFAddressRange struct instead of uint64_t pair for DWARFAddressRangesVector. Suggested during review of D33184. llvm-svn: 303159	2017-05-16 11:54:19 +00:00
George Rimar	958b01aa69	[DWARF] - Speedup handling of relocations in DWARFContextInMemory. I am working on a speedup of building .gdb_index in LLD and noticed that relocations that are proccessed in DWARFContextInMemory often uses the same symbol in a row. This patch introduces caching to reduce the relocations proccessing time. For benchmark, I took debug LLC binary objects configured with -ggnu-pubnames and linked it using LLD. Link time without --gdb-index is about 4,45s. Link time with --gdb-index: a) Without patch: 19,16s b) With patch: 15,52s That means time spent on --gdb-index in this configuration is 19,16s - 4,45s = 14,71s (without patch) vs 15,52s - 4,45s = 11,07s (with patch). Differential revision: https://reviews.llvm.org/D31136 llvm-svn: 303051	2017-05-15 11:45:28 +00:00
Zachary Turner	dd3a739d52	[CodeView] Add a random access type visitor. This adds a visitor that is capable of accessing type records randomly and caching intermediate results that it learns about during partial linear scans. This yields amortized O(1) access to a type stream even though type streams cannot normally be indexed. Differential Revision: https://reviews.llvm.org/D33009 llvm-svn: 302936	2017-05-12 19:18:12 +00:00
Wolfgang Pieb	15fa44698c	[DWARF] Fix a parsing issue with type unit headers. Reviewers: dblaikie Differential Revision: https://reviews.llvm.org/D32987 llvm-svn: 302574	2017-05-09 19:38:38 +00:00
Aaron Ballman	f22f885b66	Removing a file that is not necessary (and was causing link diagnostics with MSVC 2015); NFC. llvm-svn: 302531	2017-05-09 14:22:48 +00:00
Diana Picus	e8da53f4e0	Revert "[Dwarf] Disable reference verification for now (PR32972)" This reverts commit r302520 because it break the unit tests. llvm-svn: 302524	2017-05-09 13:05:43 +00:00
Renato Golin	94d6c8fb36	[Dwarf] Disable reference verification for now (PR32972) There is no other explanation about why this only started happening now, even though it crashes on old code (supposedly reachable from here). The only common factor between the failing bots is that they use GCC (4.9 and 5.3) to compile Clang, while the others use Clang 3.8, but the failure is while building the tests, as an assertion, on Clang. Commenting it out for now in hope the bots will go back green, but we should keep looking for the real cause, and update bugzilla. llvm-svn: 302520	2017-05-09 12:36:50 +00:00
Greg Clayton	58a2e0d90b	Add const to "DWARFDie &Die" in a few functions as they can't change the DWARFDie. llvm-svn: 302471	2017-05-08 21:29:17 +00:00
Eugene Zemtsov	3b52dbd934	Fix typo llvm-svn: 302470	2017-05-08 21:20:53 +00:00
Greg Clayton	5404f114d3	Fix typo "veify" to "verify". llvm-svn: 302466	2017-05-08 20:53:00 +00:00
Zachary Turner	1dacb24222	[CodeView] Add support for random access type visitors. Previously type visitation was done strictly sequentially, and TypeIndexes were computed by incrementing the TypeIndex of the last visited record. This works fine for situations like dumping, but not when you want to visit types in random order. For example, in a debug session someone might lookup a symbol by name, find that it has TypeIndex 10,000 and then want to go straight to TypeIndex 10,000. In order to make this work, the visitation framework needs a mode where it can plumb TypeIndices through the callback pipeline. This patch adds such a mode. In doing so, it is necessary to provide an alternative implementation of TypeDatabase that supports random access, so that is done as well. Nothing actually uses these random access capabilities yet, but this will be done in subsequent patches. Differential Revision: https://reviews.llvm.org/D32928 llvm-svn: 302454	2017-05-08 18:38:43 +00:00
Zachary Turner	8c74673388	[CodeView] Reserve TypeDatabase records up front. Most of the time we know exactly how many type records we have in a list, and we want to use the visitor to deserialize them into actual records in a database. Previously we were just using push_back() every time without reserving the space up front in the vector. This is obviously terrible from a performance standpoint, and it's not uncommon to have PDB files with half a million type records, where the performance degredation was quite noticeable. llvm-svn: 302302	2017-05-05 22:02:37 +00:00
George Rimar	2122ff64c6	[llvm-dwarfdump] - Print an error message if section decompression failed. llvm-dwarfdump currently prints no message if decompression fails for some reason. I noticed that during work on one of LLD patches where LLD produced an broken output. It was a bit confusing to see no output for section dumped and no any error message at all. Patch adds error message for such cases. Differential revision: https://reviews.llvm.org/D32865 llvm-svn: 302221	2017-05-05 10:52:39 +00:00
Zachary Turner	bedc85fb4b	[pdb] Don't verify TPI hash values up front. Verifying the hash values as we are currently doing results in iterating every type record before the user even tries to access the first one, and the API user has no control over, or ability to hook into this process. As a result, when the user wants to iterate over types to print them or index them, this results in a second iteration over the same list of types. When there's upwards of 1,000,000 type records, this is obviously quite undesirable. This patch raises the verification outside of TpiStream , and llvm-pdbdump hooks a hash verification visitor into the normal dumping process. So we still verify the hash records, but we can do it while not requiring a second iteration over the type stream. Differential Revision: https://reviews.llvm.org/D32873 llvm-svn: 302206	2017-05-04 23:53:54 +00:00
Zachary Turner	1eb9a0297c	[PDB] Don't build the entire source file list up front. I tried to run llvm-pdbdump on a very large (~1.5GB) PDB to try and identify show-stopping performance problems. This patch addresses the first such problem. When loading the DBI stream, before anyone has even tried to access a single record, we build an in memory map of every source file for every module. In the particular PDB I was using, this was over 85 million files. Specifically, the complexity is O(mn) where m is the number of modules and n is the average number of source files (including headers) per module. The whole reason for doing this was so that we could have constant time access to any module and any of its source file lists. However, we can still get O(1) access to the source file list for a given module with a simple O(m) precomputation, and access to the list of modules is already O(1) anyway. So this patches reduces the O(mn) up-front precomputation to an O(m) one, where n is ~6,500 and n*m is about 85 million in my pathological test case. Differential Revision: https://reviews.llvm.org/D32870 llvm-svn: 302205	2017-05-04 23:53:29 +00:00
Greg Clayton	48ff66a280	Don't return an invalid line table if the DW_AT_stmt_list value is not in the .debug_line section. llvm-svn: 302180	2017-05-04 18:29:44 +00:00
Paul Robinson	ae2e6f37f3	clang-format and restyle DWARFFormValue before working on it. NFC llvm-svn: 302086	2017-05-03 21:53:21 +00:00
Zachary Turner	4f145b2a59	Remove unused private field. llvm-svn: 302069	2017-05-03 19:42:06 +00:00
Greg Clayton	c5b2d561e8	Break verification down into smaller functions to keep code clean. Adrian requested that we break things down to make things clean in the DWARFVerifier. This patch breaks everything down into nice individual functions and cleans up the code quite a bit and prepares us for the next round of verifiers. Differential Revision: https://reviews.llvm.org/D32812 llvm-svn: 302062	2017-05-03 18:25:46 +00:00
Davide Italiano	2e23ce4cad	[CodeView] Remove constructor initialization of a removed field. I should've staged this with my last commit. llvm-svn: 302059	2017-05-03 18:02:46 +00:00
Zachary Turner	cf468d86f3	[CodeView] Use actual strings for dealing with checksums and lines. The raw CodeView format references strings by "offsets", but it's confusing what table the offset refers to. In the case of line number information, it's an offset into a buffer of records, and an indirection is required to get another offset into a different table to find the final string. And in the case of checksum information, there is no indirection, and the offset refers directly to the location of the string in another buffer. This would be less confusing if we always just referred to the strings by their value, and have the library be smart enough to correctly resolve the offsets on its own from the right location. This patch makes that possible. When either reading or writing, all the user deals with are strings, and the library does the appropriate translations behind the scenes. llvm-svn: 302053	2017-05-03 17:11:40 +00:00
Zachary Turner	2d5c2cd3ce	[llvm-readobj] Update readobj to re-use parsing code. llvm-readobj hand rolls some CodeView parsing code for string tables, so this patch updates it to re-use some of the newly introduced parsing code in LLVMDebugInfoCodeView. Differential Revision: https://reviews.llvm.org/D32772 llvm-svn: 302052	2017-05-03 17:11:11 +00:00
Greg Clayton	b8c162b53c	Create DWARFVerifier.cpp and .h and move all DWARF verification code over into it. Adrian requested we create a DWARFVerifier.cpp file to contain all of the DWARF verification stuff. This change simply moves the functionality over into DWARFVerifier.h and DWARFVerifier.cpp, renames the DWARFVerifier methods to start with lower case, and switches DWARFContext.cpp over to using the new functionality. Differential Revision: https://reviews.llvm.org/D32809 llvm-svn: 302044	2017-05-03 16:02:29 +00:00
Zachary Turner	c504ae3cef	Resubmit r301986 and r301987 "Add codeview::StringTable" This was reverted due to a "missing" file, but in reality what happened was that I renamed a file, and then due to a merge conflict both the old file and the new file got added to the repository. This led to an unused cpp file being in the repo and not referenced by any CMakeLists.txt but #including a .h file that wasn't in the repo. In an even more unfortunate coincidence, CMake didn't report the unused cpp file because it was in a subdirectory of the folder with the CMakeLists.txt, and not in the same directory as any CMakeLists.txt. The presence of the unused file was then breaking certain tools that determine file lists by globbing rather than by what's specified in CMakeLists.txt In any case, the fix is to just remove the unused file from the patch set. llvm-svn: 302042	2017-05-03 15:58:37 +00:00

1 2 3 4 5 ...

1012 Commits