llvm-project

Commit Graph

Author	SHA1	Message	Date
Zachary Turner	d1de2f4f5e	[llvm-pdbutil] Add support for dumping detailed module stats. This adds support for dumping a summary of module symbols and CodeView debug chunks. This option prints a table for each module of all of the symbols that occurred in the module and the number of times it occurred and total byte size. Then at the end it prints the totals for the entire file. Additionally, this patch adds the -jmc (just my code) option, which suppresses modules which are from external libraries or linker imports, so that you can focus only on the object files and libraries that originate from your own source code. llvm-svn: 311338	2017-08-21 14:53:25 +00:00
Zachary Turner	b57884e818	Fix some broken tests. These were pending in a separate patch but I forgot to squash them before comitting, and this one didn't go through. llvm-svn: 310764	2017-08-11 21:14:01 +00:00
Zachary Turner	2a77b266b5	Fix broken pdb test. For some reason I didn't see this failure the first time. The output format changed slightly, so we just have to update the test for the new format. llvm-svn: 310442	2017-08-09 04:48:16 +00:00
Zachary Turner	dd6f4368d6	Fix broken PDB tests. llvm-svn: 310130	2017-08-04 21:15:12 +00:00
Zachary Turner	fb1cd5090c	[llvm-pdbutil] Dump image section headers. Image section headers are stored in the DBI stream, but we had no way to dump them. This patch adds dumping support, along with some tests that LLD actually dumps them correctly. Differential Revision: https://reviews.llvm.org/D36332 llvm-svn: 310107	2017-08-04 20:02:38 +00:00
Reid Kleckner	14d90fd05c	[PDB] Improve GSI hash table dumping for publics and globals The PDB "symbol stream" actually contains symbol records for the publics and the globals stream. The globals and publics streams are essentially hash tables that point into a single stream of records. In order to match cvdump's behavior, we need to only dump symbol records referenced from the hash table. This patch implements that, and then implements global stream dumping, since it's just a subset of public stream dumping. Now we shouldn't see S_PROCREF or S_GDATA32 records when dumping publics, and instead we should see those record in the globals stream. llvm-svn: 309066	2017-07-26 00:40:36 +00:00
Reid Kleckner	52465615d4	Fix pdbdump-headers.test after TPI hash changes llvm-svn: 308244	2017-07-18 00:44:10 +00:00
Reid Kleckner	c50349d4c6	[PDB] Finish and simplify TPI hashing Summary: This removes the CVTypeVisitor updater and verifier classes. They were made dead by the minimal type dumping refactoring. Replace them with a single function that takes a type record and produces a hash. Call this from the minimal type dumper and compare the hash. I also noticed that the microsoft-pdb reference repository uses a basic CRC32 for records that aren't special. We already have an implementation of that CRC ready to use, because it's used in COFF for ICF. I'll make LLD call this hashing utility in a follow-up change. We might also consider using this same hash in type stream merging, so that we don't have to hash our records twice. Reviewers: inglorion, ruiu Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D35515 llvm-svn: 308240	2017-07-18 00:33:45 +00:00
Reid Kleckner	af88a910fd	[CodeView] Dump BuildInfoSym and ProcSym type indices I need to print the type index in hex so that I can match it in FileCheck for a test I'm writing. llvm-svn: 308107	2017-07-15 18:10:39 +00:00
Zachary Turner	6c4bfba8f3	[PDB] Teach libpdb to write DBI Stream ECNames. Based strictly on the name, this seems to have something to do width edit & continue. The goal of this patch has nothing to do with supporting edit and continue though. msvc link.exe writes very basic information into this area even when not compiling with support for E&C, and so the goal here is to bring lld-link to parity. Since we cannot know what assumptions standard tools make about the content of PDB files, we need to be as close as possible. This ECNames data structure is a standard PDB string hash table. link.exe puts a single string into this hash table, which is the full path to the PDB file on disk. It then references this string from the module descriptor for the compiler generated `* Linker *` module. With this patch, lld-link will generate the exact same sequence of bytes as MSVC link for this subsection for a given object file input (as reported by `llvm-pdbutil bytes -ec`). llvm-svn: 307356	2017-07-07 05:04:36 +00:00
Zachary Turner	eae44dfee9	[PDB] Add a test that verifies every known type record. We had a lot of one-off tests for this type and that type, or "every type that happens to be generated by this program I built". Eventually I got a bug report filed where we were crashing on a type that was not covered by any of these tests. So this test carefully constructs a minimal C++ program that will cause every type we support to be emitted. This ensures full coverage for type records. Differential Revision: https://reviews.llvm.org/D34915 llvm-svn: 307187	2017-07-05 18:43:25 +00:00
Zachary Turner	af8c75a8c0	[llvm-pdbutil] Output the symbol offset when dumping. Type records have a unique type index, but symbol records do not. Instead, symbol records refer to other symbol records by referencing their offset in the symbol stream. In a sense this is the analogue of the TypeIndex, but we are not printing it in the dumper. Printing it not only gives us more useful information when manually investigating the contents of a PDB, but also allows us to write better tests by enabling us to verify that fields that reference other symbol records do so correctly. Differential Revision: https://reviews.llvm.org/D34906 llvm-svn: 306890	2017-06-30 21:35:00 +00:00
Zachary Turner	990f01f8c6	Fix test broken by parameter mixup. llvm-svn: 306856	2017-06-30 18:25:07 +00:00
Zachary Turner	5f09852dfb	[llvm-pdbutil] Show what blocks a stream occupies. This is useful when you want to look at a specific chunk of a stream or look for discontinuities, and you need to know the list of blocks occupied by a stream. llvm-svn: 306150	2017-06-23 20:28:14 +00:00
Zachary Turner	9940203a2c	[llvm-pdbutil] Create a "bytes" subcommand. This idea originally came about when I was doing some deep investigation of why certain bytes in a PDB that we round-tripped differed from their original bytes in the source PDB. I found myself having to hack up the code in many places to dump the bytes of this substream, or that record. It would be nice if we could just do this for every possible stream, substream, debug chunk type, etc. It doesn't make sense to put this under dump because there's just so many options that would detract from the more common use case of just dumping deserialized records. So making a new subcommand seems like the most logical course of action. In doing so, we already have two command line options that are suitable for this new subcommand, so start out by moving them there. llvm-svn: 306056	2017-06-22 20:58:11 +00:00
Reid Kleckner	18d90e17ad	[CodeView] Fix dumping of public symbol record flags I noticed nonsensical type information while dumping PDBs produced by MSVC. llvm-svn: 305708	2017-06-19 16:54:51 +00:00
Zachary Turner	4e950647fb	[llvm-pdbutil] Add support for dumping lines and inlinee lines. llvm-svn: 305529	2017-06-15 23:56:19 +00:00
Zachary Turner	0e327d0360	[llvm-pdbutil] Add back support for dumping file checksums. When dumping module source files, also dump checksums. llvm-svn: 305526	2017-06-15 23:12:41 +00:00
Zachary Turner	f8a2e04812	[llvm-pdbutil] Add back the ability to dump hashes and index offsets. This was regressed in a previous patch that re-wrote the dumper, and I'm incrementally adding back the pieces that are missing. llvm-svn: 305524	2017-06-15 23:04:42 +00:00
Zachary Turner	6305545527	Resubmit "[llvm-pdbutil] rewrite the "raw" output style." This resubmits commit c0c249e9f2ef83e1d1e5f166b50673d92f3579d7. It was broken due to some weird template issues, which have since been fixed. llvm-svn: 305517	2017-06-15 22:24:24 +00:00
Zachary Turner	da504b794c	Revert "[llvm-pdbutil] rewrite the "raw" output style." This reverts commit 83ea17ebf2106859a51fbc2a86031b44d33696ad. This is failing due to some strange template problems, so reverting until it can be straightened out. llvm-svn: 305505	2017-06-15 20:55:51 +00:00
Zachary Turner	b560fdf3b8	[llvm-pdbutil] rewrite the "raw" output style. After some internal discussions, we agreed that the raw output style had outlived its usefulness. It was originally created before we had even thought of dumping to YAML, and it was intended to give us some insight into the internals of a PDB file. Now we have YAML mode which does almost exactly this but is more powerful in that it can round-trip back to a PDB, which the raw mode could not do. So the raw mode had become purely a maintenance burden. One option was to just delete it. However, its original goal was to be as readable as possible while staying close to the "metal" - i.e. presenting the output in a way that maps directly to the underlying file format. We don't actually need that last requirement anymore since it's covered by the yaml mode, so we could repurpose "raw" mode to actually just be as readable as possible. This patch implements about 80% of the functionality previously in raw mode, but in a completely different style that is more akin to what cvdump outputs. Records are very compressed, often times appearing on just one line. One nice thing about this is that it makes full record matching easier, because you can grep for indices, names, and leaf types on a single line often. See the tests for some examples of what the new output looks like. Note that this patch actually regresses the functionality of raw mode in a few areas, but only because the patch was already unreasonably large and going 100% would have been even worse. Specifically, this patch is missing: The ability to dump module debug subsections (checksums, lines, etc) The ability to dump section headers Aside from that everything is here. While goign through the tests fixing them all up, I found many duplicate tests. They've been deleted. In subsequent patches I will go through and re-add the missing functionality. Differential Revision: https://reviews.llvm.org/D34191 llvm-svn: 305495	2017-06-15 19:34:41 +00:00
Zachary Turner	bd336e44d8	Rename llvm-pdbdump -> llvm-pdbutil. This is to reflect the evolving nature of the tool as being useful for more than just dumping PDBs, as it can do many other things. Differential Revision: https://reviews.llvm.org/D34062 llvm-svn: 305106	2017-06-09 20:46:17 +00:00
Zachary Turner	1bf7762049	[llvm-pdbdump] Support native ordering of subsections in raw mode. This is the same change for the YAML Output style applied to the raw output style. Previously we would queue up all subsections until every one had been read, and then output them in a pre- determined order. This was because some subsections need to be read first in order to properly dump later subsections. This patch allows them to be dumped in the order they appear. Differential Revision: https://reviews.llvm.org/D34015 llvm-svn: 305034	2017-06-08 23:49:01 +00:00
Zachary Turner	3eedd16114	[llvm-pdbdump] Improve consistency among subcommands. The pdb2yaml and raw subcommands did something very similar but with a different output format, and they used a lot of the same command line options, but each one re-implemented the command line option with slightly different spellings / options. This patch merges them together into a single definition which is shared by both subcommands. This new syntax also allows for more flexibility in the way debug subsections are dumped. Differential Revision: https://reviews.llvm.org/D33996 llvm-svn: 305032	2017-06-08 23:39:33 +00:00
Zachary Turner	526f4f2aa8	Resubmit "[CodeView] Provide a common interface for type collections." This was originally reverted because it was a breaking a bunch of bots and the breakage was not surfacing on Windows. After much head-scratching this was ultimately traced back to a bug in the lit test runner related to its pipe handling. Now that the bug in lit is fixed, Windows correctly reports these test failures, and as such I have finally (hopefully) fixed all of them in this patch. llvm-svn: 303446	2017-05-19 19:26:58 +00:00
Zachary Turner	1dfcf8d92c	Revert "[CodeView] Provide a common interface for type collections." This is a squash of ~5 reverts of, well, pretty much everything I did today. Something is seriously broken with lit on Windows right now, and as a result assertions that fire in tests are triggering failures. I've been breaking non-Windows bots all day which has seriously confused me because all my tests have been passing, and after running lit with -a to view the output even on successful runs, I find out that the tool is crashing and yet lit is still reporting it as a success! At this point I don't even know where to start, so rather than leave the tree broken for who knows how long, I will get this back to green, and then once lit is fixed on Windows, hopefully hopefully fix the remaining set of problems for real. llvm-svn: 303409	2017-05-19 05:57:45 +00:00
Zachary Turner	27ac223a85	Fix a broken test. Similar to my previous fix, it turns out llvm-pdbdump has been printing an incorrect value since the beginning of time, but we didn't know it was incorrect. Specifically, we were interpreting a TypeIndex as referencing a type from the TPI stream when it actually should come from the IPI stream. So we were printing a string that looked like a valid string, but was just from the wrong place. llvm-svn: 303403	2017-05-19 03:04:08 +00:00
Zachary Turner	8a2ebfb1cd	[CodeView] Write CodeView line information. Differential Revision: https://reviews.llvm.org/D32716 llvm-svn: 301882	2017-05-01 23:27:42 +00:00
Zachary Turner	5b6e4e0aed	[llvm-pdbdump] Abstract some of the YAML/Raw printing code. There is a lot of duplicate code for printing line info between YAML and the raw output printer. This introduces a base class that can be shared between the two, and makes some minor cleanups in the process. llvm-svn: 301728	2017-04-29 01:13:21 +00:00
Rui Ueyama	0fcbb2893e	Revert r301487: Replace HashString algorithm with xxHash64 This reverts commit r301487 to make buildbots green. llvm-svn: 301491	2017-04-26 23:15:10 +00:00
Rui Ueyama	87b30ac9d3	Replace HashString algorithm with xxHash64 The previous algorithm processed one character at a time, which is very painful on a modern CPU. Replace it with xxHash64, which both already exists in the codebase and is fairly fast. Patch from Scott Smith! Differential Revision: https://reviews.llvm.org/D32509 llvm-svn: 301487	2017-04-26 22:45:04 +00:00
Reid Kleckner	a5d187b0ff	[PDB] Use two DBs when dumping the IPI stream Summary: When dumping these records from an object file section, we should use only one type database. However, when dumping from a PDB, we should use two: one for the type stream and one for the IPI stream. Certain type records that normally live in the .debug$T object file section get moved over to the IPI stream of the PDB file and they get new indices. So far, I've noticed that the MSVC linker always moves these records into IPI: - LF_FUNC_ID - LF_MFUNC_ID - LF_STRING_ID - LF_SUBSTR_LIST - LF_BUILDINFO - LF_UDT_MOD_SRC_LINE These records have index fields that can point into TPI or IPI. In particular, LF_SUBSTR_LIST and LF_BUILDINFO point to LF_STRING_ID records to describe compilation command lines. I've modified the dumper to have an optional pointer to the item DB, and to do type name lookup of these fields in that DB. See printItemIndex. The result is that our pdbdump-headers.test is more faithful to the PDB contents and the output is less confusing. Reviewers: ruiu Subscribers: amccarth, zturner, llvm-commits Differential Revision: https://reviews.llvm.org/D31309 llvm-svn: 298649	2017-03-23 21:36:25 +00:00
Reid Kleckner	45928018c5	[codeview] Use separate records for LF_SUBSTR_LIST and LF_ARGLIST They are structurally the same, but now we need to distinguish them because one record lives in the IPI stream and the other lives in TPI. llvm-svn: 298474	2017-03-22 01:37:38 +00:00
Zachary Turner	05d5e6136f	[PDB] Add support for parsing Flags from PDB Stream. This was discovered when running `llvm-pdbdump diff` against two files, the second of which was generated by running the first one through pdb2yaml and then yaml2pdb. The second one was missing some bytes from the PDB Stream, and tracking this down showed that at the end of the PDB Stream were some additional bytes that we were ignoring. Looking back to the reference code, these seem to specify some additional flags that indicate whether the PDB supports various optional features. This patch adds support for reading, writing, and round-tripping these flags through YAML and the raw dumper, and updates the tests accordingly. llvm-svn: 297984	2017-03-16 20:19:11 +00:00
Zachary Turner	840dee30d3	[pdb] Fix failing test llvm-svn: 293091	2017-01-25 21:21:02 +00:00
Zachary Turner	29da5db7a0	[pdb] Correctly parse the hash adjusters table from TPI stream. This is not a list of pairs, it is a hash table data structure. We now correctly parse this out and dump it from llvm-pdbdump. We still need to understand the conditions that lead to a type getting an entry in the hash adjuster table. That will be done in a followup investigation / patch. Differential Revision: https://reviews.llvm.org/D29090 llvm-svn: 293090	2017-01-25 21:17:40 +00:00
Zachary Turner	760ad4da60	[pdb] Write the Named Stream mapping to Yaml and binary. Differential Revision: https://reviews.llvm.org/D28919 llvm-svn: 292665	2017-01-20 22:42:09 +00:00
Zachary Turner	46225b193f	Resubmit "[CodeView] Hook CodeViewRecordIO for reading/writing symbols." The original patch was broken due to some undefined behavior as well as warnings that were triggering -Werror. llvm-svn: 290000	2016-12-16 22:48:14 +00:00
Zachary Turner	d0fffd1d14	Revert "[CodeView] Hook CodeViewRecordIO for reading/writing symbols." This reverts commit r289978, which is failing due to some rebase/merge issues. llvm-svn: 289981	2016-12-16 19:25:23 +00:00
Zachary Turner	a4e7dfbc16	[CodeView] Hook CodeViewRecordIO for reading/writing symbols. This is the 3rd of 3 patches to get reading and writing of CodeView symbol and type records to use a single codepath. Differential Revision: https://reviews.llvm.org/D26427 llvm-svn: 289978	2016-12-16 19:20:35 +00:00
Rui Ueyama	c95b46449a	Do not print out Flags field twice. llvm-svn: 285481	2016-10-28 23:57:37 +00:00
Bob Haarman	653baa2aaa	[pdb] added support for dumping globals stream Summary: This adds support for dumping the globals stream from PDB files using llvm-pdbdump, similar to the support we have for the publics stream. Reviewers: ruiu, zturner Subscribers: beanz, mgorny, modocache Differential Revision: https://reviews.llvm.org/D25801 llvm-svn: 284861	2016-10-21 19:43:19 +00:00
Zachary Turner	72c5b6451f	[pdb] Add command line options for dumping individual streams and blocks I ran into a situation where I wanted to print out the contents of page 6 of a PDB as a binary blob, and there was no straightforward way to do that. In addition to adding that, this patch also adds the ability to dump a stream by index as a binary blob, and it will stitch together all the blocks and dump the whole thing as one seemingly contiguous sequence of bytes. llvm-svn: 281070	2016-09-09 18:17:52 +00:00
Zachary Turner	ac5763eca4	Resubmit "Write the TPI stream from a PDB to Yaml." The original patch was breaking some buildbots due to an incorrect ordering of function definitions which caused some compilers to recognize a definition but others to not. llvm-svn: 279089	2016-08-18 16:49:29 +00:00
Justin Bogner	39eec466a2	Revert "Write the TPI stream from a PDB to Yaml." This is hitting a "use of undeclared identifier 'skipPadding' error locally and on some bots. This reverts r278869. llvm-svn: 278871	2016-08-16 23:37:10 +00:00
Zachary Turner	8321ba5437	Write the TPI stream from a PDB to Yaml. Reviewed By: ruiu, rnk Differential Revision: https://reviews.llvm.org/D23226 llvm-svn: 278869	2016-08-16 23:28:54 +00:00
Rui Ueyama	057625f616	Fix a test for r277545. This change should have been submitted with that commit. llvm-svn: 277548	2016-08-02 23:25:59 +00:00
Zachary Turner	d3c7b8e303	[msf] Teach LLVM to parse a split Fpm. The FPM is split at regular intervals across the MSF file, as the MS code suggests. It turns out that the value of the interval is precisely the block size. If the block size is 4096, then there are two Fpm pages every 4096 blocks. So here we teach the PDBFile class to parse a split FPM, and also add more options when dumping the FPM to display some additional information such as orphaned pages (pages which the FPM says are allocated, but which nothing appears to use), use after free pages (pages which the FPM says are not allocated, but which are referenced by a stream), and multiple use pages (pages which the FPM says are allocated but are used more than once). Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D23022 llvm-svn: 277388	2016-08-01 21:19:45 +00:00
Rui Ueyama	7a5cdc6225	pdbdump: Dump Free Page Map contents. Differential Revision: https://reviews.llvm.org/D22974 llvm-svn: 277216	2016-07-29 21:38:00 +00:00

1 2 3

108 Commits