llvm-project

Commit Graph

Author	SHA1	Message	Date
Bob Haarman	ea91fafd33	[llvm-pdbdump] [yaml2pdb] always include object file name in module info Summary: Previously, the yaml2pdb subcommand of llvm-pdbdump only included object file names in module info if a module info stream was present. This change makes it so that we include the object file name even if there is no module info stream for the module. As a result, running llvm-pdbdump pdb2yaml -dbi-module-info original.pdb > original.yaml && llvm-pdbdump yaml2pdb -pdb=new.pdb original.yaml && llvm-pdbdump pdb2yaml -dbi-module-info new.pdb > new.yaml now produces identical original.yaml and new.yaml files. Reviewers: amccarth, zturner Reviewed By: zturner Subscribers: fhahn, llvm-commits Differential Revision: https://reviews.llvm.org/D33463 llvm-svn: 303891	2017-05-25 18:04:17 +00:00
Zachary Turner	d4136e945e	Implement various flavors of type merging. Previous algotirhm assumed that types and ids are in a single unified stream. For inputs that come from object files, this is the case. But if the input is already a PDB, or is the result of a previous merge, then the types and ids will already have been split up, in which case we need an algorithm that can accept operate on independent streams of types and ids that refer across stream boundaries to each other. Differential Revision: https://reviews.llvm.org/D33417 llvm-svn: 303577	2017-05-22 21:07:43 +00:00
Zachary Turner	526f4f2aa8	Resubmit "[CodeView] Provide a common interface for type collections." This was originally reverted because it was a breaking a bunch of bots and the breakage was not surfacing on Windows. After much head-scratching this was ultimately traced back to a bug in the lit test runner related to its pipe handling. Now that the bug in lit is fixed, Windows correctly reports these test failures, and as such I have finally (hopefully) fixed all of them in this patch. llvm-svn: 303446	2017-05-19 19:26:58 +00:00
Zachary Turner	1dfcf8d92c	Revert "[CodeView] Provide a common interface for type collections." This is a squash of ~5 reverts of, well, pretty much everything I did today. Something is seriously broken with lit on Windows right now, and as a result assertions that fire in tests are triggering failures. I've been breaking non-Windows bots all day which has seriously confused me because all my tests have been passing, and after running lit with -a to view the output even on successful runs, I find out that the tool is crashing and yet lit is still reporting it as a success! At this point I don't even know where to start, so rather than leave the tree broken for who knows how long, I will get this back to green, and then once lit is fixed on Windows, hopefully hopefully fix the remaining set of problems for real. llvm-svn: 303409	2017-05-19 05:57:45 +00:00
Zachary Turner	27ac223a85	Fix a broken test. Similar to my previous fix, it turns out llvm-pdbdump has been printing an incorrect value since the beginning of time, but we didn't know it was incorrect. Specifically, we were interpreting a TypeIndex as referencing a type from the TPI stream when it actually should come from the IPI stream. So we were printing a string that looked like a valid string, but was just from the wrong place. llvm-svn: 303403	2017-05-19 03:04:08 +00:00
Zachary Turner	8fb441ab9c	[llvm-pdbdump] Add the ability to merge PDBs. Merging PDBs is a feature that will be used heavily by the linker. The functionality already exists but does not have deep test coverage because it's not easily exposed through any tools. This patch aims to address that by adding the ability to merge PDBs via llvm-pdbdump. It takes arbitrarily many PDBs and outputs a single PDB. Using this new functionality, a test is added for merging type records. Future patches will add the ability to merge symbol records, module information, etc. llvm-svn: 303389	2017-05-18 23:03:41 +00:00
Zachary Turner	0683be2ebc	[llvm-pdbdump] Add the option to sort functions and data. llvm-svn: 302998	2017-05-14 01:13:40 +00:00
Zachary Turner	edef14510e	[PDB/CodeView] Read/write codeview inlinee line information. Previously we wrote line information and file checksum information, but we did not write information about inlinee lines and functions. This patch adds support for that. llvm-svn: 301936	2017-05-02 16:56:09 +00:00
Zachary Turner	8a2ebfb1cd	[CodeView] Write CodeView line information. Differential Revision: https://reviews.llvm.org/D32716 llvm-svn: 301882	2017-05-01 23:27:42 +00:00
Zachary Turner	5b6e4e0aed	[llvm-pdbdump] Abstract some of the YAML/Raw printing code. There is a lot of duplicate code for printing line info between YAML and the raw output printer. This introduces a base class that can be shared between the two, and makes some minor cleanups in the process. llvm-svn: 301728	2017-04-29 01:13:21 +00:00
Rui Ueyama	0fcbb2893e	Revert r301487: Replace HashString algorithm with xxHash64 This reverts commit r301487 to make buildbots green. llvm-svn: 301491	2017-04-26 23:15:10 +00:00
Rui Ueyama	87b30ac9d3	Replace HashString algorithm with xxHash64 The previous algorithm processed one character at a time, which is very painful on a modern CPU. Replace it with xxHash64, which both already exists in the codebase and is fairly fast. Patch from Scott Smith! Differential Revision: https://reviews.llvm.org/D32509 llvm-svn: 301487	2017-04-26 22:45:04 +00:00
Zachary Turner	ee3b9c2558	[llvm-pdbdump] Dump File / Line Info to YAML. We were already parsing and dumping this to the human readable format, but not to the YAML format. This does so, in preparation for reading it in and reconstructing the line information from YAML. llvm-svn: 301357	2017-04-25 20:22:02 +00:00
Zachary Turner	c883a8c6dc	[llvm-pdbdump] More advanced class definition dumping. Previously the dumping of class definitions was very primitive, and it made it hard to do more than the most trivial of output formats when dumping. As such, we would only dump one line for each field, and then dump non-layout items like nested types and enums. With this patch, we do a complete analysis of the object hierarchy including aggregate types, bases, virtual bases, vftable analysis, etc. The only immediately visible effects of this are that a) we can now dump a line for the vfptr where before we would treat that as padding, and b) we now don't treat virtual bases that come at the end of a class as padding since we have a more detailed analysis of the class's storage usage. In subsequent patches, we should be able to use this analysis to display a complete graphical view of a class's layout including recursing arbitrarily deep into an object's base class / aggregate member hierarchy. llvm-svn: 300133	2017-04-12 23:18:21 +00:00
Reid Kleckner	6e545ffc4e	[PDB] Emit index/offset pairs for TPI and IPI streams Summary: This lets PDB readers lookup type record data by type index in O(log n) time. It also enables makes `cvdump -t` work on PDBs produced by LLD. cvdump will not dump a PDB that doesn't have an index-to-offset table. The table is sorted by type index, and has an entry every 8KB. Looking up a type record by index is a binary search of this table, followed by a scan of at most 8KB. Reviewers: ruiu, zturner, inglorion Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31636 llvm-svn: 299958	2017-04-11 16:26:15 +00:00
Zachary Turner	0c990bbe09	[llvm-pdbdump] Display padding bytes on record layout When dumping classes, show where padding occurs, and at the end of the class print statistics about how many bytes total of padding exist in a class. Since PDB doesn't specifically contain information about padding, we have to mimic this by sort of reversing a small portion of the record layout algorithm (e.g. looking at offsets and sizes and trying to determine whether something is part of the same field or a new field). Differential Revision: https://reviews.llvm.org/D31800 llvm-svn: 299869	2017-04-10 19:33:29 +00:00
Adrian McCarthy	08eb343cce	Improves pretty printing of variable types in llvm-pdbdump * Adds support for pointers to arrays, which was missing * Adds some tests * Improves consistency of const and volatile qualifiers * Eliminates non-composable special case code for arrays and function by using a more general recursive approach * Has a hack for getting the calling convention into the right spot for pointer-to-functions Given the rapid changes happenning in llvm-pdbdump, this may be difficult to merge. Differential Revision: https://reviews.llvm.org/D31832 llvm-svn: 299848	2017-04-10 16:43:09 +00:00
Zachary Turner	10169b6d0d	Allow specification of what kinds of class members to dump. Previously when dumping class definitions, there were only two modes - on or off. But it's useful to sometimes get a little more fine-grained. For example, you might only want to see the record layout (for example to look for extraneous padding). This patch adds a third mode, layout mode, which does exactly that. Only this-relative data members are displayed in this mode. Differential Revision: https://reviews.llvm.org/D31794 llvm-svn: 299733	2017-04-06 23:43:39 +00:00
Adrian McCarthy	4d93d66ddd	Re-land: "Make NativeExeSymbol a concrete subclass of NativeRawSymbol [PDB]" This should work on all platforms now that r299006 has landed. Tested locally on Windows and Linux. This moves exe symbol-specific method implementations out of NativeRawSymbol into a concrete subclass. Also adds implementations for hasCTypes and hasPrivateSymbols and a simple test to ensure the native reader can access the summary information for the executable from the PDB. Original Differential Revision: https://reviews.llvm.org/D31059 llvm-svn: 299019	2017-03-29 19:27:08 +00:00
Reid Kleckner	a5d187b0ff	[PDB] Use two DBs when dumping the IPI stream Summary: When dumping these records from an object file section, we should use only one type database. However, when dumping from a PDB, we should use two: one for the type stream and one for the IPI stream. Certain type records that normally live in the .debug$T object file section get moved over to the IPI stream of the PDB file and they get new indices. So far, I've noticed that the MSVC linker always moves these records into IPI: - LF_FUNC_ID - LF_MFUNC_ID - LF_STRING_ID - LF_SUBSTR_LIST - LF_BUILDINFO - LF_UDT_MOD_SRC_LINE These records have index fields that can point into TPI or IPI. In particular, LF_SUBSTR_LIST and LF_BUILDINFO point to LF_STRING_ID records to describe compilation command lines. I've modified the dumper to have an optional pointer to the item DB, and to do type name lookup of these fields in that DB. See printItemIndex. The result is that our pdbdump-headers.test is more faithful to the PDB contents and the output is less confusing. Reviewers: ruiu Subscribers: amccarth, zturner, llvm-commits Differential Revision: https://reviews.llvm.org/D31309 llvm-svn: 298649	2017-03-23 21:36:25 +00:00
Adrian McCarthy	3c0328e011	Somehow this still breaks because of ANSI color codes in test output on Linux. Reverting until I can figure out the root cause. Revert "Re-land: Make NativeExeSymbol a concrete subclass of NativeRawSymbol [PDB]" This reverts commit f461a70cc376f0f91c8b4917be79479cc86330a5. llvm-svn: 298626	2017-03-23 17:18:50 +00:00
Adrian McCarthy	e4e5a2b13e	Fix build break after r298623 Use the -color-output option explicitly to eliminate the ANSI color codes in pdb-native-summary.test. (The default should have done this.) llvm-svn: 298625	2017-03-23 16:57:53 +00:00
Adrian McCarthy	997a15c3c3	Re-land: Make NativeExeSymbol a concrete subclass of NativeRawSymbol [PDB] The new test should pass on all platforms now that llvm-pdbdump has the `-color-output` option. This moves exe symbol-specific method implementations out of NativeRawSymbol into a concrete subclass. Also adds implementations for hasCTypes and hasPrivateSymbols and a simple test to ensure the native reader can access the summary information for the executable from the PDB. Original Differential Revision: https://reviews.llvm.org/D31059 llvm-svn: 298623	2017-03-23 16:45:20 +00:00
Reid Kleckner	45928018c5	[codeview] Use separate records for LF_SUBSTR_LIST and LF_ARGLIST They are structurally the same, but now we need to distinguish them because one record lives in the IPI stream and the other lives in TPI. llvm-svn: 298474	2017-03-22 01:37:38 +00:00
Zachary Turner	2d9c082033	Revert "Make NativeExeSymbol a concrete subclass of NativeRawSymbol [PDB]" For some reason this is causing ANSI color codes to be printed even when run through FileCheck. llvm-svn: 298026	2017-03-17 00:46:42 +00:00
Zachary Turner	2ed2aa75bf	[pdb] Fix an uninitialized read, and add a test for it. This was originally reported in pr32249, uncovered by PTVS-Studio. There was no code coverage for this path because it was difficult to construct odd-case PDB files that were not generated by cl. Now that we can write construct minimal PDB files from YAML, it's easy to construct fragments that generate whatever we want. In this patch I add a test that creates 2 type records. One with a unique name, and one without. I verify that we can go from PDB to Yaml with no errors. In a future patch I'd like to add something like llvm-pdbdump raw -lookup-type that will just dump one record and nothing else, which should make it a bit cleaner to find this kind of thing. llvm-svn: 298017	2017-03-17 00:15:55 +00:00
Adrian McCarthy	21b54cf632	Make NativeExeSymbol a concrete subclass of NativeRawSymbol [PDB] This moves exe symbol-specific method implementations out of NativeRawSymbol into a concrete subclass. Also adds implementations for hasCTypes and hasPrivateSymbols and a simple test to ensure the native reader can access the summary information for the executable from the PDB. Differential Revision: https://reviews.llvm.org/D31059 llvm-svn: 298005	2017-03-16 22:28:39 +00:00
Zachary Turner	05d5e6136f	[PDB] Add support for parsing Flags from PDB Stream. This was discovered when running `llvm-pdbdump diff` against two files, the second of which was generated by running the first one through pdb2yaml and then yaml2pdb. The second one was missing some bytes from the PDB Stream, and tracking this down showed that at the end of the PDB Stream were some additional bytes that we were ignoring. Looking back to the reference code, these seem to specify some additional flags that indicate whether the PDB supports various optional features. This patch adds support for reading, writing, and round-tripping these flags through YAML and the raw dumper, and updates the tests accordingly. llvm-svn: 297984	2017-03-16 20:19:11 +00:00
Zachary Turner	ea4e60754e	[pdb] Write the module info and symbol record streams. Previously we did not have support for writing detailed module information for each module, as well as the symbol records. This patch adds support for this, and in doing so enables the ability to construct minimal PDBs from just a few lines of YAML. A test is added to illustrate this functionality. llvm-svn: 297900	2017-03-15 22:18:53 +00:00
Adrian McCarthy	65d2688842	Introduce NativeEnumModules and NativeCompilandSymbol Together, these allow lldb-pdbdump to list all the modules from a PDB using a native reader (rather than DIA). Note that I'll probably be specializing NativeRawSymbol in a subsequent patch. Differential Revision: https://reviews.llvm.org/D30956 llvm-svn: 297883	2017-03-15 20:17:58 +00:00
Zachary Turner	840dee30d3	[pdb] Fix failing test llvm-svn: 293091	2017-01-25 21:21:02 +00:00
Zachary Turner	29da5db7a0	[pdb] Correctly parse the hash adjusters table from TPI stream. This is not a list of pairs, it is a hash table data structure. We now correctly parse this out and dump it from llvm-pdbdump. We still need to understand the conditions that lead to a type getting an entry in the hash adjuster table. That will be done in a followup investigation / patch. Differential Revision: https://reviews.llvm.org/D29090 llvm-svn: 293090	2017-01-25 21:17:40 +00:00
Zachary Turner	760ad4da60	[pdb] Write the Named Stream mapping to Yaml and binary. Differential Revision: https://reviews.llvm.org/D28919 llvm-svn: 292665	2017-01-20 22:42:09 +00:00
Zachary Turner	46225b193f	Resubmit "[CodeView] Hook CodeViewRecordIO for reading/writing symbols." The original patch was broken due to some undefined behavior as well as warnings that were triggering -Werror. llvm-svn: 290000	2016-12-16 22:48:14 +00:00
Zachary Turner	d0fffd1d14	Revert "[CodeView] Hook CodeViewRecordIO for reading/writing symbols." This reverts commit r289978, which is failing due to some rebase/merge issues. llvm-svn: 289981	2016-12-16 19:25:23 +00:00
Zachary Turner	a4e7dfbc16	[CodeView] Hook CodeViewRecordIO for reading/writing symbols. This is the 3rd of 3 patches to get reading and writing of CodeView symbol and type records to use a single codepath. Differential Revision: https://reviews.llvm.org/D26427 llvm-svn: 289978	2016-12-16 19:20:35 +00:00
Zachary Turner	7251ede7c5	Add CodeViewRecordIO for reading and writing. Using a pattern similar to that of YamlIO, this allows us to have a single codepath for translating codeview records to and from serialized byte streams. The current patch only hooks this up to the reading of CodeView type records. A subsequent patch will hook it up for writing of CodeView type records, and then a third patch will hook up the reading and writing of CodeView symbols. Differential Revision: https://reviews.llvm.org/D26040 llvm-svn: 285836	2016-11-02 17:05:19 +00:00
Rui Ueyama	c95b46449a	Do not print out Flags field twice. llvm-svn: 285481	2016-10-28 23:57:37 +00:00
Bob Haarman	653baa2aaa	[pdb] added support for dumping globals stream Summary: This adds support for dumping the globals stream from PDB files using llvm-pdbdump, similar to the support we have for the publics stream. Reviewers: ruiu, zturner Subscribers: beanz, mgorny, modocache Differential Revision: https://reviews.llvm.org/D25801 llvm-svn: 284861	2016-10-21 19:43:19 +00:00
Zachary Turner	3b14764ce5	[pdb] Dump Module Symbols to Yaml. This is the first step towards round-tripping symbol information, and thusly being able to write symbol information to a PDB. This patch writes the symbol information for each compiland to the Yaml when running in pdb2yaml mode. There's still some loose ends, such as what to do about relocations (necessary in order to print linkage names), how to print enums with friendly names, and how to give the dumper access to the StringTable, but this is a good first start. llvm-svn: 283641	2016-10-08 01:12:01 +00:00
Rui Ueyama	5d6714e593	Do not pass a superblock to PDBFileBuilder. When we create a PDB file using PDBFileBuilder, the information in the superblock, such as the size of the resulting file, is not available. Previously, PDBFileBuilder::initialize took a superblock assuming that all the members of the struct are correct. That is useful when you want to restore the exact information from a YAML file, but that's probably the only use case in which that is useful. When we are creating a PDB file on the fly, we have to backfill the members. This patch redefines PDBFileBuilder::initialize to take only a block size. Now all the other members are left as default values, so that they'll be updated when commit() is called. Differential Revision: https://reviews.llvm.org/D25108 llvm-svn: 282944	2016-09-30 20:52:12 +00:00
Zachary Turner	620961deb9	[pdb] Write TPI hash values to the TPI stream. This completes being able to write all the interesting values of a PDB TPI stream. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D24370 llvm-svn: 281555	2016-09-14 23:00:02 +00:00
Zachary Turner	36efbfa6d8	[pdb] Print out some more info when dumping a raw stream. We have various command line options that print the type of a stream, the size of a stream, etc but nowhere that it can all be viewed together. Since a previous patch introduced the ability to dump the bytes of a stream, this seems like a good place to present a full view of the stream's properties including its size, what kind of data it represents, and the blocks it occupies. So I added the ability to print that information to the -stream-data command line option. llvm-svn: 281077	2016-09-09 19:00:49 +00:00
Zachary Turner	72c5b6451f	[pdb] Add command line options for dumping individual streams and blocks I ran into a situation where I wanted to print out the contents of page 6 of a PDB as a binary blob, and there was no straightforward way to do that. In addition to adding that, this patch also adds the ability to dump a stream by index as a binary blob, and it will stitch together all the blocks and dump the whole thing as one seemingly contiguous sequence of bytes. llvm-svn: 281070	2016-09-09 18:17:52 +00:00
Zachary Turner	c6d54da891	[pdb] Write PDB TPI Stream from Yaml. This writes the full sequence of type records described in Yaml to the TPI stream of the PDB file. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D24316 llvm-svn: 281063	2016-09-09 17:46:17 +00:00
Zachary Turner	c998571493	Re-add "Make FieldList records print as a YAML sequence" This was originally submitted in r280549, and reverted in r280577 due to breaking one MSVC buildbot. The issue is that MSVC 2013 doesn't synthesize move constructors. So even though i was writing std::move(A) it was copying it, leading to a bogus ArrayRef. The solution here is to simply remove the std::vector<> from the type, since it is unused and unnecessary. This way the ArrayRef continues to point into the original memory backing the CVType. llvm-svn: 280769	2016-09-06 23:45:47 +00:00
Nico Weber	05e78450be	Revert r280549. The test it added doesn't pass: http://lab.llvm.org:8011/builders/clang-x64-ninja-win7/builds/15318/steps/ninja%20check%201/logs/FAIL%3A%20LLVM%3A%3Apdbdump-yaml-types.test Command Output (stdout): -- $ "D:/buildslave/clang-x64-ninja-win7/stage1/./bin\llvm-pdbdump.EXE" "pdb2yaml" "-tpi-stream" "D:\buildslave\clang-x64-ninja-win7\llvm\test\DebugInfo\PDB/Inputs/empty.pdb" $ "D:/buildslave/clang-x64-ninja-win7/stage1/./bin\FileCheck.EXE" "-check-prefix=YAML" "D:\buildslave\clang-x64-ninja-win7\llvm\test\DebugInfo\PDB\pdbdump-yaml-types.test" # command stderr: D:\buildslave\clang-x64-ninja-win7\llvm\test\DebugInfo\PDB\pdbdump-yaml-types.test:36:7: error: expected string not found in input YAML: Name: apartment ^ <stdin>:153:10: note: scanning from here Value: 161 ^ llvm-svn: 280577	2016-09-03 03:18:49 +00:00
Zachary Turner	83849415de	[codeview] Make FieldList records print as a yaml sequence. Before we were kind of imitating the behavior of a Yaml sequence by outputting each record one after the other. This makes it a little cumbersome when we want to go the other direction -- from Yaml to Pdb. So this treats FieldList records as no different than any other list of records, by printing them as a Yaml sequence with the exact same format. llvm-svn: 280549	2016-09-02 22:19:01 +00:00
Zachary Turner	ac5763eca4	Resubmit "Write the TPI stream from a PDB to Yaml." The original patch was breaking some buildbots due to an incorrect ordering of function definitions which caused some compilers to recognize a definition but others to not. llvm-svn: 279089	2016-08-18 16:49:29 +00:00
Justin Bogner	39eec466a2	Revert "Write the TPI stream from a PDB to Yaml." This is hitting a "use of undeclared identifier 'skipPadding' error locally and on some bots. This reverts r278869. llvm-svn: 278871	2016-08-16 23:37:10 +00:00
Zachary Turner	8321ba5437	Write the TPI stream from a PDB to Yaml. Reviewed By: ruiu, rnk Differential Revision: https://reviews.llvm.org/D23226 llvm-svn: 278869	2016-08-16 23:28:54 +00:00
Rui Ueyama	7e49549d4f	pdbdump: Add a test to verify the result of PDB -> YAML -> PDB conversions. Currently not all information can be restored from YAML. This test verifies only the PDB header. llvm-svn: 277682	2016-08-03 23:54:39 +00:00
Rui Ueyama	d1d8c8312a	pdbdump: Fix crash bug. pdbdump calls DbiStreamBuilder::commit through PDBFileBuilder::commit without calling DbiStreamBuilder::finalize. Because `finalize` initializes `Header` member, `Header` remained nullptr which caused a crash bug. Differential Revision: https://reviews.llvm.org/D23143 llvm-svn: 277681	2016-08-03 23:43:23 +00:00
Rui Ueyama	057625f616	Fix a test for r277545. This change should have been submitted with that commit. llvm-svn: 277548	2016-08-02 23:25:59 +00:00
Zachary Turner	d3c7b8e303	[msf] Teach LLVM to parse a split Fpm. The FPM is split at regular intervals across the MSF file, as the MS code suggests. It turns out that the value of the interval is precisely the block size. If the block size is 4096, then there are two Fpm pages every 4096 blocks. So here we teach the PDBFile class to parse a split FPM, and also add more options when dumping the FPM to display some additional information such as orphaned pages (pages which the FPM says are allocated, but which nothing appears to use), use after free pages (pages which the FPM says are not allocated, but which are referenced by a stream), and multiple use pages (pages which the FPM says are allocated but are used more than once). Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D23022 llvm-svn: 277388	2016-08-01 21:19:45 +00:00
Rui Ueyama	7a5cdc6225	pdbdump: Dump Free Page Map contents. Differential Revision: https://reviews.llvm.org/D22974 llvm-svn: 277216	2016-07-29 21:38:00 +00:00
Rui Ueyama	d8388aaecb	[pdbdump] Use the "flow" style to print out a sequence of uint32_t. Summary: Lists can be written either with "-" or "[]" in YAML. Differential Revision: https://reviews.llvm.org/D22579 llvm-svn: 276168	2016-07-20 19:41:47 +00:00
Zachary Turner	b927e02e1b	[pdb] Teach MsfBuilder and other classes about the Free Page Map. Block 1 and 2 of an MSF file are bit vectors that represent the list of blocks allocated and free in the file. We had been using these blocks to write stream data and other data, so we mark them as the free page map now. We don't yet serialize these pages to the disk, but at least we make a note of what it is, and avoid writing random data to them. Doing this also necessitated cleaning up some of the tests to be more general and hardcode fewer values, which is nice. llvm-svn: 275629	2016-07-15 22:17:19 +00:00
Zachary Turner	5e534c7fb3	[pdb] Round trip the NameMap data structure to YAML. llvm-svn: 275628	2016-07-15 22:17:08 +00:00
Zachary Turner	faa554b2fd	[pdb] Use MsfBuilder to handle the writing PDBs. Previously we would read a PDB, then write some of it back out, but write the directory, super block, and other pertinent metadata back out unchanged. This generates incorrect PDBs since the amount of data written was not always the same as the amount of data read. This patch changes things to use the newly introduced `MsfBuilder` class to write out a correct and accurate set of Msf metadata for the data actually written, which opens up the door for adding and removing type records, symbol records, and other types of data to an existing PDB. llvm-svn: 275627	2016-07-15 22:16:56 +00:00
Zachary Turner	dbeaea7b35	Refactor the PDB writing to use a builder approach llvm-svn: 275110	2016-07-11 21:45:26 +00:00
Zachary Turner	f6b9382467	[pdb] Add a pdb2yaml option to not dump file headers. This will be useful once we start adding the ability to dump type records and symbol records, since it will allow us to generate mergeable information instead of information that specifies an entire file. llvm-svn: 275109	2016-07-11 21:45:09 +00:00
David Majnemer	7abd269aa9	[CodeView] Emit an appropriate symbol kind for globals We emitted debug info for globals/functions as if they all had external linkage. Instead, emit local symbol records when appropriate. llvm-svn: 274676	2016-07-06 21:07:47 +00:00
Zachary Turner	8848a7a6b2	[pdb] Round trip the PDB stream between YAML and binary PDB. This gets writing of the PDB stream working. llvm-svn: 274647	2016-07-06 18:05:57 +00:00
Zachary Turner	ab58ae8730	[pdb] Re-add code to write PDB files. Somehow all the functionality to write PDB files got removed, probably accidentally when uploading the patch perhaps the wrong one got uploaded. This re-adds all the code, as well as the corresponding test. llvm-svn: 274248	2016-06-30 17:43:00 +00:00
Zachary Turner	a30bd1a1bc	Update llvm-pdbdump to use subcommands. llvm-svn: 274247	2016-06-30 17:42:48 +00:00
Reid Kleckner	5b335b864b	[codeview] Add support for splitting field list records over 64KB The basic structure is that once a list record goes over 64K, the last subrecord of the list is an LF_INDEX record that refers to the next record. Because the type record graph must be toplogically sorted, this means we have to emit them in reverse order. We build the type record in order of declaration, so this means that if we don't want extra copies, we need to detect when we were about to split a record, and leave space for a continuation subrecord that will point to the eventual split top-level record. Also adds dumping support for these records. Next we should make sure that large method overload lists work properly. llvm-svn: 273294	2016-06-21 18:33:01 +00:00
Rui Ueyama	5dbea9db10	[Codeview] Add a class for LF_UDT_MOD_SRC_LINE. Differential Revision: http://reviews.llvm.org/D21406 llvm-svn: 272843	2016-06-15 21:25:29 +00:00
Zachary Turner	1dc9fd3c4a	Resubmit "[pdb] Actually write a PDB to disk from YAML."" Reviewed By: ruiu Differential Revision: http://reviews.llvm.org/D21220 llvm-svn: 272708	2016-06-14 20:48:36 +00:00
Zachary Turner	07c229c9e7	Revert "[pdb] Actually write a PDB to disk from YAML." This reverts commit 879139e1c6577b09df52de56a6bab856a19ed185. This was committed accidentally when I blindly typed git svn dcommit instead of the command to generate a patch. llvm-svn: 272693	2016-06-14 18:51:35 +00:00
Zachary Turner	fe5bc02492	[pdb] Actually write a PDB to disk from YAML. llvm-svn: 272692	2016-06-14 18:49:36 +00:00
Rui Ueyama	f14a74c102	[pdbdump] Print out # of hash buckets. In the reference code, the field name is `cHashBuckets`. llvm-svn: 272075	2016-06-07 23:53:43 +00:00
Rui Ueyama	d833917f98	[pdbdump] Print out TPI hash key size. llvm-svn: 272073	2016-06-07 23:44:27 +00:00
Rui Ueyama	4bc848047b	Fix CRLF -> LF. llvm-svn: 271960	2016-06-06 23:35:52 +00:00
Rui Ueyama	2c5384ae4c	[pdbdump] Print section header flags. llvm-svn: 271943	2016-06-06 21:34:55 +00:00
Zachary Turner	7120a478fa	[llvm-pdbdump] Dump MSF headers to YAML. This is the simplest possible patch to get some kind of YAML output. All it dumps is the MSF header fields so that in theory an empty MSF file could be reconstructed. Reviewed By: ruiu, majnemer Differential Revision: http://reviews.llvm.org/D20971 llvm-svn: 271939	2016-06-06 20:37:05 +00:00
Rui Ueyama	ef2b488482	[pdbdump] Print out New FPO stream contents. The data strucutre in the new FPO stream is described in the PE/COFF spec. There is one record per function if frame pointer is omitted. Differential Revision: http://reviews.llvm.org/D20999 llvm-svn: 271926	2016-06-06 18:39:21 +00:00
Rui Ueyama	fd97bf1f76	pdbdump: print out TPI hashes. Differential Revision: http://reviews.llvm.org/D20945 llvm-svn: 271736	2016-06-03 20:48:51 +00:00
Zachary Turner	71f725735c	[pdb] Add string table offsets to check output. llvm-svn: 271674	2016-06-03 14:22:46 +00:00
Zachary Turner	3df1bfaaec	[pdb] Print out file names instead of file offsets. When printing line information and file checksums, we were printing the file offset field from the struct header. This teaches llvm-pdbdump how to turn those numbers into the filename. In the case of file checksums, this is done by looking in the global string table. In the case of line contributions, this is done by indexing into the file names buffer of the DBI stream. Why they use a different technique I don't know. llvm-svn: 271630	2016-06-03 05:52:57 +00:00
Zachary Turner	d0563f29f9	[pdb] Dump file checksums from pdb codeview line info. llvm-svn: 271622	2016-06-03 04:01:48 +00:00
Zachary Turner	a96cce64a5	[codeview] Dump line number and column information. To facilitate this, a couple of changes had to be made: 1. `ModuleSubstream` got moved from `DebugInfo/PDB` to `DebugInfo/CodeView`, and various codeview related types are defined there. It turns out `DebugInfo/CodeView/Line.h` already defines many of these structures, but this is really old code that is not endian aware, doesn't interact well with `StreamInterface` and not very helpful for getting stuff out of a PDB. Eventually we should migrate the old readobj `COFFDumper` code to these new structures, or at least merge their functionality somehow. 2. A `ModuleSubstream` visitor is introduced. Depending on where your module substream array comes from, different subsets of record types can be expected. We are already hand parsing these substream arrays in many places especially in `COFFDumper.cpp`. In the future we can migrate these paths to the visitor as well, which should reduce a lot of code in `COFFDumper.cpp`. Differential Revision: http://reviews.llvm.org/D20936 Reviewed By: ruiu, majnemer llvm-svn: 271621	2016-06-03 03:25:59 +00:00
Zachary Turner	7eb6d358af	[llvm-pdbdump] Dump CodeView line information. This first pass only splits apart the records and dumps the line info kinds and binary data. Subsequent patches will parse out the binary data into more useful information and dump it in detail. llvm-svn: 271576	2016-06-02 20:11:22 +00:00
Rui Ueyama	90db78816b	pdbdump: print out COFF section headers. Unlike other sections that can grow to any size, the COFF section header stream has maximum length because each record is fixed size and the COFF file format limits the maximum number of sections. So I decided to not create a specific stream class for it. Instead, I added a member function to DbiStream class which returns a vector of COFF headers. Differential Revision: http://reviews.llvm.org/D20717 llvm-svn: 271557	2016-06-02 18:20:20 +00:00
Zachary Turner	93839cb4ac	[pdb] Parse and dump section map and section contribs Differential Revision: http://reviews.llvm.org/D20876 Reviewed By: rnk, ruiu llvm-svn: 271488	2016-06-02 05:07:49 +00:00
David Majnemer	8d106d5ea4	Update test to deal with non-zero exit codes llvm-svn: 271135	2016-05-28 19:02:12 +00:00
Rui Ueyama	6816367a27	pdbdump: print out the name of the stream 0. Differential Revision: http://reviews.llvm.org/D20712 llvm-svn: 270943	2016-05-27 00:32:07 +00:00
Rui Ueyama	9dc034dba1	pdbdump: Add -raw-all to enable all -raw-* flags. Differential Revision: http://reviews.llvm.org/D20707 llvm-svn: 270937	2016-05-26 23:26:55 +00:00
Zachary Turner	d3076ab36f	[llvm-pdbdump] Decipher the remaining PDB streams. We know at least know the meaning of every stream of the PDB file. Yay! llvm-svn: 270669	2016-05-25 05:49:48 +00:00
Zachary Turner	c9972c64f5	[llvm-pdbdump] Dump the IPI stream and all records. llvm-svn: 270661	2016-05-25 04:35:22 +00:00
Rui Ueyama	b12b158f20	pdbdump: fix bug in name hash table. name_ids() did not return all IDs but only the first NameCount items. The number of non-zero entries in IDs vector is NameCount, but it does not mean that all non-zero entries are at the beginning of IDs vector. Differential Revision: http://reviews.llvm.org/D20611 llvm-svn: 270656	2016-05-25 04:07:17 +00:00
Zachary Turner	c59261ca37	[llvm-pdbdump] Stream 0 isn't actually the MSF superblock. Oddly enough, I realized we don't actually know what stream 0 is (if anything). llvm-svn: 270655	2016-05-25 03:53:16 +00:00
Zachary Turner	85ed80b9e6	[llvm-pdbdump] Dump stream summary list. Try to figure out what each stream is, and dump its name. This gives us a better picture of what streams we still don't understand. llvm-svn: 270653	2016-05-25 03:43:17 +00:00
Zachary Turner	4caa1bf0bd	[codeview] Add support for new type records. This adds support for parsing and dumping the following symbol types: S_LPROCREF S_ENVBLOCK S_COMPILE2 S_REGISTER S_COFFGROUP S_SECTION S_THUNK32 S_TRAMPOLINE As of this patch, the test PDB files no longer have any unknown symbol types. llvm-svn: 270628	2016-05-24 22:58:46 +00:00
Zachary Turner	96e60f7573	[llvm-pdbdump] Rework command line options. When dumping huge PDB files, too many of the options were grouped together so you would get neverending spew of output. This patch introduces more granular display options so you can only dump the fields you actually care about. llvm-svn: 270607	2016-05-24 20:31:48 +00:00
Zachary Turner	9e33e6f89b	[codeview, pdb] Dump symbol records in publics stream Differential Revision: http://reviews.llvm.org/D20580 Reviewed By: ruiu llvm-svn: 270597	2016-05-24 18:55:14 +00:00
Zachary Turner	cac29ae038	Dump symbol record details in llvm-pdbdump This makes use of the newly introduced `CVSymbolVisitor` to dump details of each type of symbol record in the symbol streams. Future patches will bring this visitor based dumping to the publics stream, as well as creating a `SymbolDumpDelegate` to print more information about relocations etc. Differential Revision: http://reviews.llvm.org/D20545 Reviewed By: ruiu llvm-svn: 270585	2016-05-24 17:30:25 +00:00
Rui Ueyama	0fcd82605e	pdbdump: print out symbol names referred by publics stream. DBI stream contains a stream number of the symbol record stream. Symbol record streams is an array of length-type-value members. Each member represents one symbol. Publics stream contains offsets to the symbol record stream. This patch is to print out all symbols that are referenced by the publics stream. Note that even with this patch, llvm-pdbdump cannot dump all the information in a publics stream since it contains more information than symbol names. I'll improve it in followup patches. Differential Revision: http://reviews.llvm.org/D20480 llvm-svn: 270262	2016-05-20 19:55:17 +00:00
Rui Ueyama	0376b1a2d7	pdbdump: Rename NumberOfSymbols -> SymbolRecordStreamIndex. Differential Revision: http://reviews.llvm.org/D20441 llvm-svn: 270088	2016-05-19 18:05:58 +00:00
Rui Ueyama	350b29862f	pdbdump: Print out section offsets in the publics stream. llvm-svn: 269955	2016-05-18 16:24:16 +00:00

1 2 3 4

189 Commits