llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Bieneman	2e752db47a	[DWARF] [ObjectYAML] Adding APIs for unittesting Summary: This patch adds some new APIs to enable using the YAML DWARF representation in unit tests. The most basic new API is DWARFYAML::EmitDebugSections which converts a YAML string into a series of owned MemoryBuffer objects stored in a StringMap. The string map can then be used to construct a DWARFContext for parsing in place of an ObjectFile. Reviewers: dblaikie, clayborg Subscribers: mgorny, fhahn, jgosnell, aprantl, llvm-commits Differential Revision: https://reviews.llvm.org/D28828 llvm-svn: 292634	2017-01-20 19:03:14 +00:00
Zachary Turner	a332fa38e9	Fix a few more build errors. llvm-svn: 292538	2017-01-19 23:44:14 +00:00
Zachary Turner	d54deaee6c	Fix incorrectly formed assert statement. llvm-svn: 292537	2017-01-19 23:41:11 +00:00
Zachary Turner	11036a909f	[pdb] Add HashTable data structure. This was being parsed / serialized ad-hoc inside the code for a specific PDB stream. But this data structure is used in multiple ways / places within the PDB format. To be able to re-use it we need to raise this code out and make it more generic. In doing so, a number of bugs are fixed in the original implementation, and support is added for growing the hash table and deleting items from the hash table, which had either been omitted or incorrect implemented in the initial version. Differential Revision: https://reviews.llvm.org/D28715 llvm-svn: 292535	2017-01-19 23:31:24 +00:00
Rui Ueyama	dcd32937dc	PDB: Add a class to create the /names stream contents. This patch adds a new class NameHashTableBuilder which creates /names streams. This patch contains a test to confirm that a stream created by NameHashTableBuilder can be read by NameHashTable reader class. Differential Revision: https://reviews.llvm.org/D28707 llvm-svn: 292040	2017-01-15 00:36:02 +00:00
Greg Clayton	c109bbea57	Add a variant of DWARFDie::find() and DWARFDie::findRecursively() that takes a llvm::ArrayRef<dwarf::Attribute>. This allows us efficiently look for more than one attribute, something that is quite common in DWARF consumption. Differential Revision: https://reviews.llvm.org/D28704 llvm-svn: 291967	2017-01-13 22:32:12 +00:00
Greg Clayton	97d22187d0	Cleanup how DWARFDie attributes are accessed and decoded. Removed all DWARFDie::getAttributeValueAs() calls. Renamed: Optional<DWARFFormValue> DWARFDie::getAttributeValue(dwarf::Attribute); To: Optional<DWARFFormValue> DWARFDie::find(dwarf::Attribute); Added: Optional<DWARFFormValue> DWARFDie::findRecursively(dwarf::Attribute); All decoding of Optional<DWARFFormValue> values are now done using the dwarf::to() functions from DWARFFormValue.h: Old code: auto DeclLine = DWARFDie.getAttributeValueAsSignedConstant(DW_AT_decl_line).getValueOr(0); New code: auto DeclLine = toUnsigned(DWARFDie.find(DW_AT_decl_line), 0); This composition helps us since we can now easily do: auto DeclLine = toUnsigned(DWARFDie.findRecursively(DW_AT_decl_line), 0); This allows us to easily find attribute values in the current DIE only (the first new code above) or in any DW_AT_abstract_origin or DW_AT_specification Dies using the line above. Note that the code line length is shorter and more concise. Differential Revision: https://reviews.llvm.org/D28581 llvm-svn: 291959	2017-01-13 21:08:18 +00:00
Benjamin Kramer	061f4a5fe6	Apply clang-tidy's performance-unnecessary-value-param to LLVM. With some minor manual fixes for using function_ref instead of std::function. No functional change intended. llvm-svn: 291904	2017-01-13 14:39:03 +00:00
Greg Clayton	0e62ee7d60	Add the ability to iterate across all attributes in a DIE. Differential Revision: https://reviews.llvm.org/D28386 llvm-svn: 291861	2017-01-13 00:13:42 +00:00
Zachary Turner	629cb7d8cc	[CodeView] Finish decoupling TypeDatabase from TypeDumper. Previously the type dumper itself was passed around to a lot of different places and manipulated in ways that were more appropriate on the type database. For example, the entire TypeDumper was passed into the symbol dumper, when all the symbol dumper wanted to do was lookup the name of a TypeIndex so it could print it. That's what the TypeDatabase is for -- mapping type indices to names. Another example is how if the user runs llvm-pdbdump with the option to dump symbols but not types, we still have to visit all types so that we can print minimal information about the type of a symbol, but just without dumping full symbol records. The way we did this before is by hacking it up so that we run everything through the type dumper with a null printer, so that the output goes to /dev/null. But really, we don't need to dump anything, all we want to do is build the type database. Since TypeDatabaseVisitor now exists independently of TypeDumper, we can do this. We just build a custom visitor callback pipeline that includes a database visitor but not a dumper. All the hackery around printers etc goes away. After this patch, we could probably even delete the entire CVTypeDumper class since really all it is at this point is a thin wrapper that hides the details of how to build a useful visitation pipeline. It's not a priority though, so CVTypeDumper remains for now. After this patch we will be able to easily plug in a different style of type dumper by only implementing the proper visitation methods to dump one-line output and then sticking it on the pipeline. Differential Revision: https://reviews.llvm.org/D28524 llvm-svn: 291724	2017-01-11 23:24:22 +00:00
Greg Clayton	d1efea89c9	Remove all variants of DWARFDie::getAttributeValueAs...() that had parameters that specified default values. Now we only support returning Optional<> values and have changed all clients over to use Optional::getValueOr(). Differential Revision: https://reviews.llvm.org/D28569 llvm-svn: 291686	2017-01-11 17:43:37 +00:00
George Rimar	4bf308317d	[lib/Object] - Introduce Decompressor class. Decompressor intention is to reduce duplication of code. Currently LLD has own implementation of decompressor for compressed debug sections. This class helps to avoid it and share the code. LLD patch for reusing it is D28106 Differential revision: https://reviews.llvm.org/D28105 llvm-svn: 291675	2017-01-11 15:26:41 +00:00
Zachary Turner	a9054ddd9c	[CodeView/PDB] Rename a bunch of files. We were starting to get some name clashes between llvm-pdbdump and the common CodeView framework, so I took this opportunity to rename a bunch of files to more accurately describe their usage. This also helps in llvm-pdbdump to distinguish between different files and whether they are used for pretty dump mode or raw dump mode. llvm-svn: 291627	2017-01-11 00:35:43 +00:00
Zachary Turner	c640b76db5	[CodeView] Add TypeDatabase class. This creates a centralized class in which to store type records. It stores types as an array of entries, which matches the notion of a type stream being a topologically sorted DAG. Logic to build up such a database was already being used in CVTypeDumper, so CVTypeDumper is now updated to to read from a TypeDatabase which is filled out by an earlier visitor in the pipeline. Differential Revision: https://reviews.llvm.org/D28486 llvm-svn: 291626	2017-01-11 00:35:08 +00:00
Victor Leschuk	cbddae74f5	DebugInfo: support for DW_FORM_implicit_const Support for DW_FORM_implicit_const DWARFv5 feature. When this form is used attribute value goes to .debug_abbrev section (as SLEB). As this form would break any debug tool which doesn't support DWARFv5 it is guarded by dwarf version check. Attempt to use this form with dwarf version <= 4 is considered a fatal error. Differential Revision: https://reviews.llvm.org/D28456 llvm-svn: 291599	2017-01-10 21:18:26 +00:00
Greg Clayton	93e4fe8aad	Add iterator support to DWARFDie to allow child DIE iteration. Differential Revision: https://reviews.llvm.org/D28303 llvm-svn: 291194	2017-01-05 23:47:37 +00:00
Michal Gorny	89b6f16b3e	[cmake] Add LLVM_ENABLE_DIA_SDK option, and expose it in LLVMConfig Add an explicit LLVM_ENABLE_DIA_SDK option to control building support for DIA SDK-based debugging. Control its value to match whether DIA SDK support was found and expose it in LLVMConfig (alike LLVM_ENABLE_ZLIB). Its value is needed for LLDB to determine whether to run tests requiring DIA support. Currently it is obtained from llvm/Config/config.h; however, this file is not available for standalone builds. Following this change, LLDB will be modified to use the value from LLVMConfig. Differential Revision: https://reviews.llvm.org/D26255 llvm-svn: 290818	2017-01-02 18:19:35 +00:00
Chris Bieneman	e0e451d927	[ObjectYAML] Support for DWARF debug_info section This patch adds support for YAML<->DWARF for debug_info sections. This re-lands r290147, reverted in 290148, re-landed in r290204 after fixing the issue that caused bots to fail (thank you UBSan!), and reverted again in r290209 due to failures on big endian systems. After adding support for preserving endianness, this should be good now. llvm-svn: 290386	2016-12-22 22:44:27 +00:00
Greg Clayton	78a07bfa66	Add the ability for DWARFDie objects to get the parent DWARFDie. In order for the llvm DWARF parser to be used in LLDB we will need to be able to get the parent of a DIE. This patch adds that functionality by changing the DWARFDebugInfoEntry class to store a depth field instead of a sibling index. Using a depth field allows us to easily calculate the sibling and the parent without increasing the size of DWARFDebugInfoEntry. I tested llvm-dsymutil on a debug version of clang where this fully parses DWARF in over 1200 .o files to verify there was no serious regression in performance. Added a full suite of unit tests to test this functionality. Differential Revision: https://reviews.llvm.org/D27995 llvm-svn: 290274	2016-12-21 21:37:06 +00:00
Chris Bieneman	abecaa2f8c	Revert "[ObjectYAML] Support for DWARF debug_info section" This reverts commit r290204. Still breaking bots... In a meeting now, so I can't fix it immediately. Bot URL: http://lab.llvm.org:8011/builders/clang-s390x-linux/builds/2415 llvm-svn: 290209	2016-12-20 22:36:42 +00:00
Chris Bieneman	ffc4aef542	[ObjectYAML] Support for DWARF debug_info section This patch adds support for YAML<->DWARF for debug_info sections. This re-lands r290147, after fixing the issue that caused bots to fail (thank you UBSan!). llvm-svn: 290204	2016-12-20 21:35:31 +00:00
Chris Bieneman	891cbcc093	Revert "[ObjectYAML] Support for DWARF debug_info section" This reverts commit r290147. This commit is breaking a bot (http://lab.llvm.org:8011/builders/clang-atom-d525-fedora-rel/builds/621). I don't have time to investigate at the moment, so I'll revert for now. llvm-svn: 290148	2016-12-20 00:42:06 +00:00
Chris Bieneman	b5b0b23a25	[ObjectYAML] Support for DWARF debug_info section This patch adds support for YAML<->DWARF for debug_info sections. llvm-svn: 290147	2016-12-20 00:26:24 +00:00
Greg Clayton	2520c9ebee	Make a function to correctly extract the DW_AT_high_pc given the low pc value. DWARF 4 and later supports encoding the PC as an address or as as offset from the low PC. Clients using DWARFDie should be insulated from how to extract the high PC value. This function takes care of extracting the form value and looking for the correct form. Differential Revision: https://reviews.llvm.org/D27885 llvm-svn: 290131	2016-12-19 20:36:41 +00:00
David Majnemer	b7477b540e	[PDB] Don't use the long type Long is not the same size across a number of the platforms we support. Use unsigned int here instead, it is more appropriate because overflow/wrap-around is possible and, in this case, expected. llvm-svn: 290068	2016-12-18 20:10:50 +00:00
David Majnemer	1d3dcb0602	[PDB] Don't reimplement CRC32 We already have a CRC32 implementation which is compatible with the PDB hash, reuse it. llvm-svn: 290054	2016-12-18 00:41:15 +00:00
David Majnemer	9bca03bf81	[PDB] Validate superblock addresses - Validate the address of the block map. - Validate the address of the free block map. llvm-svn: 290053	2016-12-18 00:41:10 +00:00
George Rimar	e71e33fe93	[DWARF] - Introduce DWARFDebugPubTable class for dumping pub* sections. Patch implements parser of pubnames/pubtypes tables instead of static function used before. It is now should be possible to reuse it in LLD or other projects and clean up the duplication code. Differential revision: https://reviews.llvm.org/D27851 llvm-svn: 290040	2016-12-17 09:10:32 +00:00
Zachary Turner	10005d915e	Delete unused file. llvm-svn: 290021	2016-12-17 00:58:19 +00:00
Zachary Turner	46225b193f	Resubmit "[CodeView] Hook CodeViewRecordIO for reading/writing symbols." The original patch was broken due to some undefined behavior as well as warnings that were triggering -Werror. llvm-svn: 290000	2016-12-16 22:48:14 +00:00
Zachary Turner	d0fffd1d14	Revert "[CodeView] Hook CodeViewRecordIO for reading/writing symbols." This reverts commit r289978, which is failing due to some rebase/merge issues. llvm-svn: 289981	2016-12-16 19:25:23 +00:00
Zachary Turner	a4e7dfbc16	[CodeView] Hook CodeViewRecordIO for reading/writing symbols. This is the 3rd of 3 patches to get reading and writing of CodeView symbol and type records to use a single codepath. Differential Revision: https://reviews.llvm.org/D26427 llvm-svn: 289978	2016-12-16 19:20:35 +00:00
David Blaikie	7d4a5599da	Revert "dwarfdump: Support/process relocations on a CU's abbrev_off" Reverting because this breaks lld's gdb_index support - it's probably double counting the abbrev relocation offset. This reverts commit r289954. llvm-svn: 289961	2016-12-16 17:10:17 +00:00
David Blaikie	e9fda9f201	dwarfdump: Support/process relocations on a CU's abbrev_off Input can be produced by ld -r, for example (a normal LLVM workflow never hits this - LLVM only ever produces a single abbrev table in an object (shared by multiple CUs), so the reloc's always 0, and when it's linked together the relocation's resolved so it doesn't need to be handled) llvm-svn: 289954	2016-12-16 16:31:10 +00:00
Greg Clayton	52fe1f68c8	Add the ability to get attribute values as Optional<T> When getting attributes it is sometimes nicer to use Optional<T> some of the time instead of magic values. I tried to cut over to only using the Optional values but it made many of the call sites very messy, so it makes sense the leave in the calls that can return a default value. Otherwise code that looks like this: uint64_t CallColumn = Die.getAttributeValueAsAddress(DW_AT_call_line, 0); Has to be turned into: uint64_t CallColumn = 0; if (auto CallColumnValue = Die.getAttributeValueAsAddress(DW_AT_call_line)) CallColumn = *CallColumnValue; The first snippet of code looks much better. But in cases where you want an offset that may or may not be there, the following code looks better: if (auto StmtOffset = Die.getAttributeValueAsSectionOffset(DW_AT_stmt_list)) { // Use StmtOffset } Differential Revision: https://reviews.llvm.org/D27772 llvm-svn: 289731	2016-12-14 22:38:08 +00:00
Eric Christopher	ba1024cfb8	This change does two things: Adds a "Discriminator" field to struct DILineInfo, which defaults to 0. Fills out the "Discriminator" field in DILineInfo in DWARFDebugLine::LineTable::getFileLineInfoForAddress(). in order to have a slightly nicer interface in getFileLineInfoForAddress. Patch by Simon Que! Differential Revision: https://reviews.llvm.org/D27649 llvm-svn: 289683	2016-12-14 18:29:39 +00:00
Greg Clayton	1cbf3fa94a	Switch functions that returned bool and filled in a DWARFFormValue arg with ones that return Optional<DWARFFormValue> Differential Revision: https://reviews.llvm.org/D27737 llvm-svn: 289611	2016-12-13 23:20:56 +00:00
Greg Clayton	c8c1032c0c	Make a DWARFDIE class that can help avoid using the wrong DWARFUnit when extracting attributes Many places pass around a DWARFDebugInfoEntryMinimal and a DWARFUnit. It is easy to get things wrong by using the wrong DWARFUnit with a DWARFDebugInfoEntryMinimal. This patch creates a DWARFDie class that contains the DWARFUnit and DWARFDebugInfoEntryMinimal objects so that they can't get out of sync. All attribute extraction has been moved out of DWARFDebugInfoEntryMinimal and into DWARFDie. DWARFDebugInfoEntryMinimal was also renamed to DWARFDebugInfoEntry. DWARFDie objects are temporary objects that are used by clients and contain 2 pointers that you always need to have anyway. Keeping them grouped will avoid errors and simplify many of the attribute extracting APIs by not having to pass in a DWARFUnit. Differential Revision: https://reviews.llvm.org/D27634 llvm-svn: 289565	2016-12-13 18:25:19 +00:00
Greg Clayton	3462a420d1	Make a DWARF generator so we can unit test DWARF APIs with gtest. The only tests we have for the DWARF parser are the tests that use llvm-dwarfdump and expect output from textual dumps. More DWARF parser modification are coming in the next few weeks and I wanted to add tests that can verify that we can encode and decode all form types, as well as test some other basic DWARF APIs where we ask DIE objects for their children and siblings. DwarfGenerator.cpp was added in the lib/CodeGen directory. This file contains the code necessary to easily create DWARF for tests: dwarfgen::Generator DG; Triple Triple("x86_64--"); bool success = DG.init(Triple, Version); if (!success) return; dwarfgen::CompileUnit &CU = DG.addCompileUnit(); dwarfgen::DIE CUDie = CU.getUnitDIE(); CUDie.addAttribute(DW_AT_name, DW_FORM_strp, "/tmp/main.c"); CUDie.addAttribute(DW_AT_language, DW_FORM_data2, DW_LANG_C); dwarfgen::DIE SubprogramDie = CUDie.addChild(DW_TAG_subprogram); SubprogramDie.addAttribute(DW_AT_name, DW_FORM_strp, "main"); SubprogramDie.addAttribute(DW_AT_low_pc, DW_FORM_addr, 0x1000U); SubprogramDie.addAttribute(DW_AT_high_pc, DW_FORM_addr, 0x2000U); dwarfgen::DIE IntDie = CUDie.addChild(DW_TAG_base_type); IntDie.addAttribute(DW_AT_name, DW_FORM_strp, "int"); IntDie.addAttribute(DW_AT_encoding, DW_FORM_data1, DW_ATE_signed); IntDie.addAttribute(DW_AT_byte_size, DW_FORM_data1, 4); dwarfgen::DIE ArgcDie = SubprogramDie.addChild(DW_TAG_formal_parameter); ArgcDie.addAttribute(DW_AT_name, DW_FORM_strp, "argc"); // ArgcDie.addAttribute(DW_AT_type, DW_FORM_ref4, IntDie); ArgcDie.addAttribute(DW_AT_type, DW_FORM_ref_addr, IntDie); StringRef FileBytes = DG.generate(); MemoryBufferRef FileBuffer(FileBytes, "dwarf"); auto Obj = object::ObjectFile::createObjectFile(FileBuffer); EXPECT_TRUE((bool)Obj); DWARFContextInMemory DwarfContext(*Obj.get()); This code is backed by the AsmPrinter code that emits DWARF for the actual compiler. While adding unit tests it was discovered that DIEValue that used DIEEntry as their values had bugs where DW_FORM_ref1, DW_FORM_ref2, DW_FORM_ref8, and DW_FORM_ref_udata forms were not supported. These are all now supported. Added support for DW_FORM_string so we can emit inlined C strings. Centralized the code to unique abbreviations into a new DIEAbbrevSet class and made both the dwarfgen::Generator and the llvm::DwarfFile classes use the new class. Fixed comments in the llvm::DIE class so that the Offset is known to be the compile/type unit offset. DIEInteger now supports more DW_FORM values. There are also unit tests that cover: Encoding and decoding all form types and values Encoding and decoding all reference types (DW_FORM_ref1, DW_FORM_ref2, DW_FORM_ref4, DW_FORM_ref8, DW_FORM_ref_udata, DW_FORM_ref_addr) including cross compile unit references with that go forward one compile unit and backward on compile unit. Differential Revision: https://reviews.llvm.org/D27326 llvm-svn: 289010	2016-12-08 01:03:48 +00:00
Bob Haarman	a5b4358956	[pdb] handle missing pdb streams more gracefully Summary: The code we use to read PDBs assumed that streams we ask it to read exist, and would read memory outside a vector and crash if this wasn't the case. This would, for example, cause llvm-pdbdump to crash on PDBs generated by lld. This patch handles such cases more gracefully: the PDB reading code in LLVM now reports errors when asked to get a stream that is not present, and llvm-pdbdump will report missing streams and continue processing streams that are present. Reviewers: ruiu, zturner Subscribers: thakis, amccarth Differential Revision: https://reviews.llvm.org/D27325 llvm-svn: 288722	2016-12-05 22:44:00 +00:00
Eugene Zelenko	570e39a25c	[DebugInfo] Fix some Clang-tidy modernize-use-default and Include What You Use warnings; other minor fixes (NFC). Per Zachary Turner and Mehdi Amini suggestion to make only post-commit reviews. llvm-svn: 287838	2016-11-23 23:16:32 +00:00
Rui Ueyama	2b4ba04d57	Remove PDBFileBuilder::build() and related functions. PDBFileBuilder supports two different ways to create files. One is PDBFileBuilder::commit. That function takes a filename and write a result to the file. The other is PDBFileBuilder::build. That returns a new PDBFile object. This patch removes the latter because no one is using it and in a real life situation we are very unlikely to need it. Even if you need it, it'd be easy to write a new PDB to a memory buffer and read it back. Removing PDBFileBuilder::build enables us to remove other classes build transitively. Differential Revision: https://reviews.llvm.org/D26987 llvm-svn: 287697	2016-11-22 20:32:22 +00:00
Rui Ueyama	fb1e6d22a3	Align Modi and FileInfo substreams on 32-byte offsets. This is required by DbiStream, but DbiStreamBuilder didn't align these substreams, so the output of DbiSTreamBuilder couldn't be read by DbiStream. Test will be added to LLD. llvm-svn: 287067	2016-11-16 00:59:27 +00:00
Rui Ueyama	507013180e	Fix Modi and File count if there are more than 65535 modules/files. These numbers are intended to be capped at 65535, but `std::max<uint16_t>(UINT16_MAX, N)` always returns N for any N because the expression is the same as `std::max((uint16_t)UINT16_MAX, (uint16_t)N)`. llvm-svn: 287060	2016-11-16 00:38:33 +00:00
Greg Clayton	6f6e4dbd5d	Improve DWARF parsing speed by improving DWARFAbbreviationDeclaration This patch gets a DWARF parsing speed improvement by having DWARFAbbreviationDeclaration instances know if they have a fixed byte size. If an abbreviation has a fixed byte size that can be calculated given a DWARFUnit, then parsing a DIE becomes two steps: parse ULEB128 abbrev code, and then add constant size to the offset. This patch also adds a fixed byte size to each DWARFAbbreviationDeclaration::AttributeSpec so that attributes can quickly skip their values if needed without the need to lookup the fixed for size. Notable improvements: - DWARFAbbreviationDeclaration::findAttributeIndex() now returns an Optional<uint32_t> instead of a uint32_t and we no longer have to look for the magic -1U return value - Optional<uint32_t> DWARFAbbreviationDeclaration::findAttributeIndex(dwarf::Attribute attr) const; - DWARFAbbreviationDeclaration now has a getAttributeValue() function that extracts an attribute value given a DIE offset that takes advantage of the DWARFAbbreviationDeclaration::AttributeSpec::ByteSize - bool DWARFAbbreviationDeclaration::getAttributeValue(const uint32_t DIEOffset, const dwarf::Attribute Attr, const DWARFUnit &U, DWARFFormValue &FormValue) const; - A DWARFAbbreviationDeclaration instance can return a fixed byte size for itself so DWARF parsing is faster: - Optional<size_t> DWARFAbbreviationDeclaration::getFixedAttributesByteSize(const DWARFUnit &U) const; - Any functions that used to take a "const DWARFUnit *U" that would crash if U was NULL now take a "const DWARFUnit &U" and are only called with a valid DWARFUnit Differential Revision: https://reviews.llvm.org/D26567 llvm-svn: 286924	2016-11-15 01:23:06 +00:00
Rui Ueyama	a8a68a993e	Remove extra semicolon. llvm-svn: 286688	2016-11-12 00:23:32 +00:00
Rui Ueyama	f7c9c3234c	Define DbiStreamBuilder::addSectionContribs. This patch defines a new function to add a SectionContribs stream to a PDB file. Unlike SectionMap, SectionContribs contains a list of input sections as opposed to output sections. Note that this patch needs improving because currently we do not set Module field in SectionContribs entries. In a follow-up patch, I'll add Modules and then fix it after that. Differential Revision: https://reviews.llvm.org/D26210 llvm-svn: 286677	2016-11-11 23:41:13 +00:00
Greg Clayton	04c19286a1	Fixed issues found by Paul Robinson with my patch for: https://reviews.llvm.org/D26526 - Fixed DW_FORM_strp to be correctly sized and extracted for DWARF64 - Added some missing strp variants as well - Fixed comment typo llvm-svn: 286603	2016-11-11 17:38:14 +00:00
Greg Clayton	82f12b149f	Clean up DWARFFormValue by reducing duplicated code and removing DWARFFormValue::getFixedFormSizes() In preparation for a follow on patch that improves DWARF parsing speed, clean up DWARFFormValue so that we have can get the fixed byte size of a form value given a DWARFUnit or given the version, address byte size and dwarf32/64. This patch cleans up code so that everyone is using one of the new DWARFFormValue functions: static Optional<uint8_t> DWARFFormValue::getFixedByteSize(dwarf::Form Form, const DWARFUnit *U = nullptr); static Optional<uint8_t> DWARFFormValue::getFixedByteSize(dwarf::Form Form, uint16_t Version, uint8_t AddrSize, bool Dwarf32); This patch changes DWARFFormValue::skipValue() to rely on the output of DWARFFormValue::getFixedByteSize(...) instead of duplicating the code in each function. This will reduce the number of changes we need to make to DWARF to fewer places in DWARFFormValue when we add support for new form. This patch also starts to support DWARF64 so that we can get correct byte sizes for forms that vary according the DWARF 32/64. To reduce the code duplication a new FormSizeHelper pure virtual class was created that can be created as a FormSizeHelperDWARFUnit when you have a DWARFUnit, or FormSizeHelperManual where you manually specify the DWARF version, address byte size and DWARF32/DWARF64. There is now a single implementation of a function that gets the fixed byte size (instead of two where one took a DWARFUnit and one took the DWARF version, address byte size and DWARFFormat enum) and one function to skip the form values. https://reviews.llvm.org/D26526 llvm-svn: 286597	2016-11-11 16:21:37 +00:00
Zachary Turner	44728f4014	Fix some size_t / uint32_t ambiguity errors. llvm-svn: 286305	2016-11-08 22:30:11 +00:00
Zachary Turner	4efa0a4201	[CodeView] Hook up CodeViewRecordIO to type serialization path. Previously support had been added for using CodeViewRecordIO to read (deserialize) CodeView type records. This patch adds support for writing those same records. With this patch, reading and writing of CodeView type records finally uses a single codepath. Differential Revision: https://reviews.llvm.org/D26253 llvm-svn: 286304	2016-11-08 22:24:53 +00:00
Justin Bogner	f9fb2abb01	PDB: Fix some APIs to avoid use-after-frees The buffer is already owned by the PDBFile for all of these APIs, so don't pass it in separately. llvm-svn: 285953	2016-11-03 18:28:04 +00:00
Zachary Turner	7251ede7c5	Add CodeViewRecordIO for reading and writing. Using a pattern similar to that of YamlIO, this allows us to have a single codepath for translating codeview records to and from serialized byte streams. The current patch only hooks this up to the reading of CodeView type records. A subsequent patch will hook it up for writing of CodeView type records, and then a third patch will hook up the reading and writing of CodeView symbols. Differential Revision: https://reviews.llvm.org/D26040 llvm-svn: 285836	2016-11-02 17:05:19 +00:00
Rui Ueyama	ddc79225c3	Define DbiStreamBuilder::addSectionMap. This change enables LLD to construct a Section Map stream in a PDB file. I do not understand all these fields in the Section Map yet, but it seems like a copy of a COFF section header in another format. With this patch, DbiStreamBuilder can emit a Section Map which llvm-pdbdump can dump. Differential Revision: https://reviews.llvm.org/D26112 llvm-svn: 285606	2016-10-31 17:38:56 +00:00
Greg Clayton	cddab279f6	Modify DWARFFormValue to remember the DWARFUnit that it was decoded with. Modifying DWARFFormValue to remember the DWARFUnit that it was encoded with can simplify the usage of instances of this class. Previously users would have to try and pass in the same DWARFUnit that was used to decode the form value and there was a possibility that a different DWARFUnit might be supplied to the functions that extract values (strings, CU relative references, addresses) and cause problems. This fixes this potential issue by storing the DWARFUnit inside the DWARFFormValue so that this mistake can't be made. Instances of DWARFFormValue are not stored permanently and are used as temporary values, so the increase in size of an instance of DWARFFormValue isn't a big deal. This makes decoding form values more bullet proof and is a change that will be used by future modifications. https://reviews.llvm.org/D26052 llvm-svn: 285594	2016-10-31 16:46:02 +00:00
Rui Ueyama	77be2403f6	Define calculateDbgStreamSize for consistency. llvm-svn: 285487	2016-10-29 00:56:44 +00:00
Adrian Prantl	c4fbbcf9ed	Import/update constants from the DWARF 5 public review draft document. https://reviews.llvm.org/D26051 llvm-svn: 285421	2016-10-28 17:59:50 +00:00
Greg Clayton	6c273763a3	Switch all DWARF variables for tags, attributes and forms over to use the llvm::dwarf enumerations instead of using raw uint16_t values. This allows easier debugging as users can see the values of the enumerations in the variables view that will show the enumeration string instead of just a number. https://reviews.llvm.org/D26013 llvm-svn: 285309	2016-10-27 16:32:04 +00:00
Bob Haarman	26a87bd030	[codeview] support emitting indirect virtual base class information Summary: Fixes PR28281. MSVC lists indirect virtual base classes in the field list of a class, using LF_IVBCLASS records. This change makes LLVM emit such records when processing DW_TAG_inheritance tags with the DIFlagVirtual and (newly introduced) DIFlagIndirect tags. Reviewers: rnk, ruiu, zturner Differential Revision: https://reviews.llvm.org/D25578 llvm-svn: 285130	2016-10-25 22:11:52 +00:00
Bob Haarman	653baa2aaa	[pdb] added support for dumping globals stream Summary: This adds support for dumping the globals stream from PDB files using llvm-pdbdump, similar to the support we have for the publics stream. Reviewers: ruiu, zturner Subscribers: beanz, mgorny, modocache Differential Revision: https://reviews.llvm.org/D25801 llvm-svn: 284861	2016-10-21 19:43:19 +00:00
Zachary Turner	4d49eb9fa0	[CodeView] Refactor serialization to use StreamInterface. This was all using ArrayRef<>s before which presents a problem when you want to serialize to or deserialize from an actual PDB stream. An ArrayRef<> is really just a special case of what can be handled with StreamInterface though (e.g. by using a ByteStream), so changing this to use StreamInterface allows us to plug in a PDB stream and get all the record serialization and deserialization for free on a MappedBlockStream. Subsequent patches will try to remove TypeTableBuilder and TypeRecordBuilder in favor of class that operate on Streams as well, which should allow us to completely merge the reading and writing codepaths for both types and symbols. Differential Revision: https://reviews.llvm.org/D25831 llvm-svn: 284762	2016-10-20 18:31:19 +00:00
Reid Kleckner	990504e625	Remove LLVM_NOEXCEPT and replace it with noexcept Now that we have dropped MSVC 2013, all supported compilers support noexcept and we can drop this portability macro. llvm-svn: 284672	2016-10-19 23:52:38 +00:00
Zachary Turner	383803230b	[pdb] Improve error messages when DIA is not found. llvm-svn: 284610	2016-10-19 16:42:20 +00:00
David Blaikie	69494a9805	dwarfdump: add space missing from the type unit header description llvm-svn: 284540	2016-10-18 21:18:43 +00:00
David Blaikie	e4c3915a5a	dwarfdump: Include the name in the unit description, even in non-summarized mode (accidentally removed this from my previous change when I was rejecting some clang-format formatting... ) llvm-svn: 284539	2016-10-18 21:16:45 +00:00
David Blaikie	50cc27ecb9	dwarfdump: -summarize-types: print a short summary (unqualified type name, hash, length) of type units rather than dumping contents This is just a quick utility handy for getting rough summaries of types in a given object or dwo file. I've been using it to investigate the amount of type info redundancy across a project build, for example. llvm-svn: 284537	2016-10-18 21:09:48 +00:00
Reid Kleckner	edfc9dcf42	Truncate long names in type records In the MS ABI, the frontend is supposed to MD5 such pathologically long names. LLVM should still defend itself from long names, though. Fixes part of PR29098. llvm-svn: 284136	2016-10-13 17:33:22 +00:00
Reid Kleckner	fb58be862c	Update _MSC_VER equality checks for msdiaNNN.dll Use inequality instead of equality to defend against minor version increases in _MSC_VER. An _MSC_VER value of 1901 should still use msdia140.dll, as described in this blog post: https://blogs.msdn.microsoft.com/vcblog/2016/10/05/visual-c-compiler-version/ llvm-svn: 284058	2016-10-12 21:51:14 +00:00
Reid Kleckner	5d0bc63d91	Avoid braced initialization for default member initializers for MSVC 2013 llvm-svn: 283928	2016-10-11 20:02:57 +00:00
Rui Ueyama	f9904043ca	Re-submit r283823: Define DbiStreamBuilder::addDbgStream to add stream. The previous commit was failing because we filled empty slots of the debug stream index with kInvalidStreamIndex. It should've been 0. llvm-svn: 283925	2016-10-11 19:43:12 +00:00
Rui Ueyama	8af4988f35	Revert r283824 and r283823: Define DbiStreamBuilder::addDbgStream to add stream. This reverts commit r283824 and r283823 to fix buildbots. llvm-svn: 283828	2016-10-11 00:15:50 +00:00
Rui Ueyama	914eef6a64	Fix a bug in DbiStreamBuilder::addDbgStream. This feature will be tested in LLD unit tests. llvm-svn: 283824	2016-10-10 23:44:04 +00:00
Rui Ueyama	70edd9e41d	Define DbiStreamBuilder::addDbgStream to add stream. Previously, there is no way to create a stream other than pre-defined special stream such as DBI or IPI. This patch adds a new method, addDbgStream, to add a debug stream to a PDB file. Differential Revision: https://reviews.llvm.org/D25356 llvm-svn: 283823	2016-10-10 23:35:36 +00:00
Zachary Turner	3b14764ce5	[pdb] Dump Module Symbols to Yaml. This is the first step towards round-tripping symbol information, and thusly being able to write symbol information to a PDB. This patch writes the symbol information for each compiland to the Yaml when running in pdb2yaml mode. There's still some loose ends, such as what to do about relocations (necessary in order to print linkage names), how to print enums with friendly names, and how to give the dumper access to the StringTable, but this is a good first start. llvm-svn: 283641	2016-10-08 01:12:01 +00:00
Zachary Turner	0d8407447d	Refactor Symbol visitor code. Type visitor code had already been refactored previously to decouple the visitor and the visitor callback interface. This was necessary for having the flexibility to visit in different ways (for example, dumping to yaml, reading from yaml, dumping to ScopedPrinter, etc). This patch merely implements the same visitation pattern for symbol records that has already been implemented for type records. llvm-svn: 283609	2016-10-07 21:34:46 +00:00
Mehdi Amini	149f6eaed9	Re-commit "Use StringRef in Support/Darf APIs (NFC)" This reverts commit r283285 and re-commit r283275 with a fix for format("%s", Str); where Str is a StringRef. llvm-svn: 283298	2016-10-05 05:59:29 +00:00
Mehdi Amini	2bcac0fac4	Revert "Re-commit "Use StringRef in Support/Darf APIs (NFC)"" One test seems randomly broken: DebugInfo/X86/gnu-public-names.ll llvm-svn: 283285	2016-10-05 01:04:02 +00:00
Mehdi Amini	32b297a42f	Re-commit "Use StringRef in Support/Darf APIs (NFC)" This reverts commit r283278 and re-commit r283275 with the update to fix the build on the LLDB side. llvm-svn: 283281	2016-10-05 00:37:18 +00:00
Mehdi Amini	78b04ae7ac	Revert "Use StringRef in Support/Darf APIs (NFC)" This reverts commit r283275, it broke LLDB Android debug server. llvm-svn: 283278	2016-10-05 00:21:14 +00:00
Mehdi Amini	e0327be584	Use StringRef in Support/Darf APIs (NFC) llvm-svn: 283275	2016-10-04 23:55:40 +00:00
Rui Ueyama	5d6714e593	Do not pass a superblock to PDBFileBuilder. When we create a PDB file using PDBFileBuilder, the information in the superblock, such as the size of the resulting file, is not available. Previously, PDBFileBuilder::initialize took a superblock assuming that all the members of the struct are correct. That is useful when you want to restore the exact information from a YAML file, but that's probably the only use case in which that is useful. When we are creating a PDB file on the fly, we have to backfill the members. This patch redefines PDBFileBuilder::initialize to take only a block size. Now all the other members are left as default values, so that they'll be updated when commit() is called. Differential Revision: https://reviews.llvm.org/D25108 llvm-svn: 282944	2016-09-30 20:52:12 +00:00
Rui Ueyama	fc22cef98e	Pass a filename instead of a msf::WritableStream to PDBFileBuilder::commit. WritableStream needs the exact file size to open a file, but until we fix the final layout of a PDB file, we don't know the size of the file. This patch changes the parameter type of PDBFileBuilder::commit to solve that chiecken-and-egg problem. Now the function opens a file after fixing the layout, so it can create a file with the exact size. Differential Revision: https://reviews.llvm.org/D25107 llvm-svn: 282940	2016-09-30 20:34:44 +00:00
George Rimar	4f82df52ae	Revert r282238 "Revert r282235 "[llvm-dwarfdump] - Teach dwarfdump to dump gdb-index section."" Build bot issues (http://lab.llvm.org:8011/builders/clang-x64-ninja-win7/builds/15856/steps/ninja%20check%201/logs/FAIL%3A%20LLVM%3A%3Adwarfdump-dump-gdbindex.test) should be fixed in that version. Issue was that MSVS does not support "%zu". Though it works fine on MSCS 2015, Bot looks running MSVS 2013 that does not like it. MSDN also says that "z" prefix is not supported: https://msdn.microsoft.com/en-us/library/tcxf1dw6.aspx I had to use PRId64 instead. Original commit message: [llvm-dwarfdump] - Teach dwarfdump to dump gdb-index section. gold linker's --gdb-index option currently is able to create the .gdb_index section that allows GDB to locate and read the .dwo files as it needs them, this helps reduce the total size of the object files processed by the linker. More info about that: https://gcc.gnu.org/wiki/DebugFission https://sourceware.org/gdb/onlinedocs/gdb/Index-Section-Format.html Patch teaches dwarfdump tool to dump this section. Differential revision: https://reviews.llvm.org/D21503 llvm-svn: 282239	2016-09-23 11:01:53 +00:00
George Rimar	a348527186	Revert r282235 "[llvm-dwarfdump] - Teach dwarfdump to dump gdb-index section." It broke BB: http://lab.llvm.org:8011/builders/clang-x64-ninja-win7/builds/15856 llvm-svn: 282238	2016-09-23 10:12:56 +00:00
George Rimar	a77bcf5e42	[llvm-dwarfdump] - Teach dwarfdump to dump gdb-index section. gold linker's --gdb-index option currently is able to create the .gdb_index section that allows GDB to locate and read the .dwo files as it needs them, this helps reduce the total size of the object files processed by the linker. More info about that: https://gcc.gnu.org/wiki/DebugFission https://sourceware.org/gdb/onlinedocs/gdb/Index-Section-Format.html Patch teaches dwarfdump tool to dump this section. Differential revision: https://reviews.llvm.org/D21503 llvm-svn: 282235	2016-09-23 09:09:26 +00:00
Zachary Turner	de9ba15511	[pdb] Write the IPI stream. The IPI stream is structurally identical to the TPI stream, but it contains different record types. So we just re-use the TPI writing code. llvm-svn: 281638	2016-09-15 18:22:31 +00:00
Zachary Turner	a6cbfb53c2	[pdb] Fix the TPI stream size computation. We were inadvertently adding the size of the hash value stream to the size of the TPI stream, even though the hash value stream is an entirely separate stream. llvm-svn: 281636	2016-09-15 18:22:21 +00:00
Zachary Turner	c67b00c695	[pdb] Get rid of Data and RawData in CVType. The `CVType` had two redundant fields which were confusing and error-prone to fill out. By treating member records as a distinct type from leaf records, we are able to simplify this quite a bit. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D24432 llvm-svn: 281556	2016-09-14 23:00:16 +00:00
Zachary Turner	620961deb9	[pdb] Write TPI hash values to the TPI stream. This completes being able to write all the interesting values of a PDB TPI stream. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D24370 llvm-svn: 281555	2016-09-14 23:00:02 +00:00
Zachary Turner	36efbfa6d8	[pdb] Print out some more info when dumping a raw stream. We have various command line options that print the type of a stream, the size of a stream, etc but nowhere that it can all be viewed together. Since a previous patch introduced the ability to dump the bytes of a stream, this seems like a good place to present a full view of the stream's properties including its size, what kind of data it represents, and the blocks it occupies. So I added the ability to print that information to the -stream-data command line option. llvm-svn: 281077	2016-09-09 19:00:49 +00:00
Zachary Turner	9ba31a5efe	[pdb] Pass CVRecord's through the visitor as non-const references. This simplifies a lot of code, and will actually be necessary for an upcoming patch to serialize TPI record hash values. The idea before was that visitors should be examining records, not modifying them. But this is no longer true with a visitor that constructs a CVRecord from Yaml. To handle this until now, we were doing some fixups on CVRecord objects at a higher level, but the code is really awkward, and it makes sense to just have the visitor write the bytes into the CVRecord. In doing so I uncovered a few bugs related to `Data` and `RawData` and fixed those. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D24362 llvm-svn: 281067	2016-09-09 18:03:39 +00:00
Zachary Turner	c6d54da891	[pdb] Write PDB TPI Stream from Yaml. This writes the full sequence of type records described in Yaml to the TPI stream of the PDB file. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D24316 llvm-svn: 281063	2016-09-09 17:46:17 +00:00
Reid Kleckner	fa28396f97	[codeview] Use the correct max CV record length of 0xFF00 Previously we were splitting our records at 0xFFFF bytes, which the Microsoft tools don't like. Should fix failure on the new Windows self-host buildbot. This length appears in microsoft-pdb/PDB/dbi/dbiimpl.h llvm-svn: 280522	2016-09-02 18:43:27 +00:00
Reid Kleckner	d1882f2188	Fix the ASan fuse-lld.cc test after LLD r280012 With that change, images built with 'lld-link /debug' always have a debug directory. If no PDB filename was passed on the command line, then the filename in the executable is empty. PDB information would never work anyway if the PDB file name is empty, so go ahead and try DWARF in that case. llvm-svn: 280410	2016-09-01 20:28:59 +00:00
Zachary Turner	5c7c2307a8	[codeview] Properly propagate the TypeLeafKind through the pipeline. llvm-svn: 280388	2016-09-01 18:08:19 +00:00
Zachary Turner	77807637ff	[codeview] Have visitTypeBegin return the record type. Previously we were assuming that any visitation of types would necessarily be against a type we had binary data for. Reasonable assumption when were just reading PDBs and dumping them, but once we start writing PDBs from Yaml this breaks down, because we have no binary data yet, only Yaml, and from that we need to read the record kind and perform the switch based on that. So this patch does that. Instead of having the visitor switch on the kind that is already in the CVType record, we change the visitTypeBegin() method to return the Kind, and switch on the returned value. This way, the default implementation can still return the value from the CVType, but the implementation which visits Yaml records and serializes binary PDB type records can use the field in the Yaml as the source of the switch. llvm-svn: 280307	2016-08-31 23:14:31 +00:00
Zachary Turner	2f951ce9c9	[codeview] Add TypeVisitorCallbackPipeline. We were kind of hacking this together before by embedding the ability to forward requests into the TypeDeserializer. When we want to start adding more different kinds of visitor callback interfaces though, this doesn't scale well and is very inflexible. So introduce the notion of a pipeline, which itself implements the TypeVisitorCallbacks interface, but which contains an internal list of other callbacks to invoke in sequence. Also update the existing uses of CVTypeVisitor to use this new pipeline class for deserializing records before visiting them with another visitor. llvm-svn: 280293	2016-08-31 21:42:26 +00:00
Reid Kleckner	9dac47319d	[codeview] Emit vtable shape information The shape of the vtable is passed down as the size of the __vtbl_ptr_type. This special pointer type appears both as the pointee type of the vptr type, and by itself in every dynamic class. For classes with multiple vtables, only the shape of the primary vftable is included, as the shape of all secondary vftables will be the same as in the base class. Fixes PR28150 llvm-svn: 280254	2016-08-31 15:59:30 +00:00
Zachary Turner	f6884a1aac	Remove unused translation unit. llvm-svn: 279561	2016-08-23 20:08:02 +00:00
Eugene Zelenko	61a72d8850	[LLVM] Fix some Clang-tidy modernize-use-using and Include What You Use warnings Differential revision: https://reviews.llvm.org/D23675 llvm-svn: 279102	2016-08-18 17:56:27 +00:00
Vedant Kumar	c948d182e1	Fix -Wpessimizing-move error, NFC llvm-svn: 279095	2016-08-18 17:39:53 +00:00
Zachary Turner	ac5763eca4	Resubmit "Write the TPI stream from a PDB to Yaml." The original patch was breaking some buildbots due to an incorrect ordering of function definitions which caused some compilers to recognize a definition but others to not. llvm-svn: 279089	2016-08-18 16:49:29 +00:00
Justin Bogner	39eec466a2	Revert "Write the TPI stream from a PDB to Yaml." This is hitting a "use of undeclared identifier 'skipPadding' error locally and on some bots. This reverts r278869. llvm-svn: 278871	2016-08-16 23:37:10 +00:00
Zachary Turner	8321ba5437	Write the TPI stream from a PDB to Yaml. Reviewed By: ruiu, rnk Differential Revision: https://reviews.llvm.org/D23226 llvm-svn: 278869	2016-08-16 23:28:54 +00:00
Saleem Abdulrasool	015280211b	CodeView: extract the OMF Directory Header The DebugDirectory contains a pointer to the CodeView info structure which is a derivative of the OMF debug directory. The structure has evolved a bit over time, and PDB 2.0 used a slightly different definition from PDB 7.0. Both of these are specific to CodeView and not COFF. Reflect this by moving the structure definitions into the DebugInfo/CodeView headers. Define a generic DebugInfo union type that can be used to pass around a reference to the DebugInfo irrespective of the versioning. NFC. llvm-svn: 278075	2016-08-09 00:25:12 +00:00
Justin Bogner	272cbacc25	CodeView: Remove an unused variable It was breaking the -Werror build. llvm-svn: 277878	2016-08-05 21:57:10 +00:00
Zachary Turner	5e35eaac83	Fix non portable include path. llvm-svn: 277876	2016-08-05 21:50:02 +00:00
Zachary Turner	5e3e4bb26b	[CodeView] Decouple record deserialization from visitor dispatch. Until now, our use case for the visitor has been to take a stream of bytes representing a type stream, deserialize the records in sequence, and do something with them, where "something" is determined by how the user implements a particular set of callbacks on an abstract class. For actually writing PDBs, however, we want to do the reverse. We have some kind of description of the list of records in their in-memory format, and we want to process each one. Perhaps by serializing them to a byte stream, or perhaps by converting them from one description format (Yaml) to another (in-memory representation). This was difficult in the current model because deserialization and invoking the callbacks were tightly coupled. With this patch we change this so that TypeDeserializer is itself an implementation of the particular set of callbacks. This decouples deserialization from the iteration over a list of records and invocation of the callbacks. TypeDeserializer is initialized with another implementation of the callback interface, so that upon deserialization it can pass the deserialized record through to the next set of callbacks. In a sense this is like an implementation of the Decorator design pattern, where the Deserializer is a decorator. This will be useful for writing Pdbs from yaml, where we have a description of the type records in Yaml format. In this case, the visitor implementation would have each visitation callback method implemented in such a way as to extract the proper set of fields from the Yaml, and it could maintain state that builds up a list of these records. Finally at the end we can pass this information through to another set of callbacks which serializes them into a byte stream. Reviewed By: majnemer, ruiu, rnk Differential Revision: https://reviews.llvm.org/D23177 llvm-svn: 277871	2016-08-05 21:45:34 +00:00
Zachary Turner	660230eba4	[CodeView] Use llvm::Error instead of std::error_code. This eliminates the remnants of std::error_code from the DebugInfo libraries. llvm-svn: 277758	2016-08-04 19:39:55 +00:00
Rui Ueyama	d1d8c8312a	pdbdump: Fix crash bug. pdbdump calls DbiStreamBuilder::commit through PDBFileBuilder::commit without calling DbiStreamBuilder::finalize. Because `finalize` initializes `Header` member, `Header` remained nullptr which caused a crash bug. Differential Revision: https://reviews.llvm.org/D23143 llvm-svn: 277681	2016-08-03 23:43:23 +00:00
Zachary Turner	8cf51c340d	[msf] Make FPM reader use MappedBlockStream. MappedBlockSTream can work with any sequence of block data where the ordering is specified by a list of block numbers. So rather than manually stitch them together in the case of the FPM, reuse this functionality so that we can treat the FPM as if it were contiguous. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D23066 llvm-svn: 277609	2016-08-03 16:53:21 +00:00
Rui Ueyama	4ee7f3c9aa	PDB: Mark extended file pages as free by default. BitVector::extend initializes extended bits as true by default. That is not desirable because new pages should be initially free. Differential Revision: https://reviews.llvm.org/D23048 llvm-svn: 277529	2016-08-02 21:56:37 +00:00
Zachary Turner	d3c7b8e303	[msf] Teach LLVM to parse a split Fpm. The FPM is split at regular intervals across the MSF file, as the MS code suggests. It turns out that the value of the interval is precisely the block size. If the block size is 4096, then there are two Fpm pages every 4096 blocks. So here we teach the PDBFile class to parse a split FPM, and also add more options when dumping the FPM to display some additional information such as orphaned pages (pages which the FPM says are allocated, but which nothing appears to use), use after free pages (pages which the FPM says are not allocated, but which are referenced by a stream), and multiple use pages (pages which the FPM says are allocated but are used more than once). Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D23022 llvm-svn: 277388	2016-08-01 21:19:45 +00:00
Rui Ueyama	7a5cdc6225	pdbdump: Dump Free Page Map contents. Differential Revision: https://reviews.llvm.org/D22974 llvm-svn: 277216	2016-07-29 21:38:00 +00:00
Zachary Turner	a3225b0451	[msf] Resubmit "Rename Msf -> MSF". Previously this change was submitted from a Windows machine, so changes made to the case of filenames and directory names did not survive the commit, and as a result the CMake source file names and the on-disk file names did not match on case-sensitive file systems. I'm resubmitting this patch from a Linux system, which hopefully allows the case changes to make it through unfettered. llvm-svn: 277213	2016-07-29 20:56:36 +00:00
Zachary Turner	334aec4dd2	Revert "[msf] Rename Msf to MSF." This reverts commit 4d1557ffac41e079bcb1abbcf04f512474dcd6fe. llvm-svn: 277194	2016-07-29 18:38:47 +00:00
Zachary Turner	a010f5cef0	[msf] Rename Msf to MSF. In a previous patch, it was suggested to use all caps instead of rolling caps for initialisms, so this patch changes everything to do this. llvm-svn: 277190	2016-07-29 18:24:26 +00:00
Zachary Turner	9f73c20228	[pdb] Fix an ambiguity when writing size_t on x64 platforms. llvm-svn: 277025	2016-07-28 19:29:52 +00:00
Zachary Turner	e98137c47f	[pdb] Fix some warnings that break -Werror builds. llvm-svn: 277021	2016-07-28 19:18:02 +00:00
Zachary Turner	d66889cbae	[pdb] Refactor library to more clearly separate reading/writing Reviewed By: amccarth, ruiu Differential Revision: https://reviews.llvm.org/D22693 llvm-svn: 277019	2016-07-28 19:12:28 +00:00
Zachary Turner	199f48a5f0	Get rid of IMsfStreamData class. This was a pure virtual base class whose purpose was to abstract away the notion of how you retrieve the layout of a discontiguous stream of blocks in an Msf file. This led to too many layers of abstraction making it difficult to figure out what was going on and extend things. Ultimately, a stream's layout is decided by its length and the array of block numbers that it lives on. So rather than have an abstract base class which can return this in any number of ways, it's more straightforward to simply store them as fields of a trivial struct, and also to give a more appropriate name. This patch does that. It renames IMsfStreamData to MsfStreamLayout, and deletes the 2 concrete implementations, DirectoryStreamData and IndexedStreamData. MsfStreamLayout is a trivial struct with the necessary data. llvm-svn: 277018	2016-07-28 19:11:09 +00:00
Vassil Vassilev	fe68d81709	[modules] Add missing includes. llvm-svn: 276970	2016-07-28 10:26:33 +00:00
Zachary Turner	e4a4f33daf	Make PDBFile store an msf::Layout. Previously it was storing all the fields of an msf::Layout as separate members. This is a trivial cleanup to make it store an msf::Layout directly. This makes the code more readable since it becomes clear which fields of PDBFile are actually the msf specific layout information in a sea of other bookkeeping fields. llvm-svn: 276460	2016-07-22 19:56:33 +00:00
Zachary Turner	e109dc63f9	[pdb] Have builders share a single BumpPtrAllocator. This makes it easier to have the writable and readable PDB interfaces share code since the read/write and write-only interfaces now share a single allocator, you don't have to worry about a builder building a read only interface and then having the read-only interface's data become corrupt when the builder goes out of scope. Now the allocator is specified explicitly to all constructors, so all interfaces can share a single allocator that is scoped appropriately. llvm-svn: 276459	2016-07-22 19:56:26 +00:00
Zachary Turner	bac69d33d0	[msf] Create LLVMDebugInfoMsf This provides a better layering of responsibilities among different aspects of PDB writing code. Some of the MSF related code was contained in CodeView, and some was in PDB prior to this. Further, we were often saying PDB when we meant MSF, and the two are actually independent of each other since in theory you can have other types of data besides PDB data in an MSF. So, this patch separates the MSF specific code into its own library, with no dependencies on anything else, and DebugInfoCodeView and DebugInfoPDB take dependencies on DebugInfoMsf. llvm-svn: 276458	2016-07-22 19:56:05 +00:00
Zachary Turner	b383d628df	[pdb] Move file layout header structs to RawTypes.h This facilitates code reuse between the builder classes and the "frozen" read only versions of the classes used for parsing existing PDB files. llvm-svn: 276427	2016-07-22 15:46:46 +00:00
Zachary Turner	d218c26124	[pdb] Round-trip module & file info to/from YAML. This implements support for writing compiland and compiland source file info to a binary PDB. This is tested by adding support for dumping these fields from an existing PDB to yaml, reading them back in, and dumping them again and verifying the values are as expected. llvm-svn: 276426	2016-07-22 15:46:37 +00:00
Pete Cooper	b2ba776aed	Avoid dsymutil calls to getFileNameByIndex. This change adds a hasFileAtIndex method. getChildDeclContext can first call this method, and if it returns true it knows it can then lookup the resolved path cache for the given file index. If we hit that cache then we don't even have to call getFileNameByIndex. Running dsymutil against the swift executable built from github gives a 20% performance improvement without any change in the binary. Differential Revision: https://reviews.llvm.org/D22655 Reviewed by friss. llvm-svn: 276380	2016-07-22 01:41:32 +00:00
Zachary Turner	b927e02e1b	[pdb] Teach MsfBuilder and other classes about the Free Page Map. Block 1 and 2 of an MSF file are bit vectors that represent the list of blocks allocated and free in the file. We had been using these blocks to write stream data and other data, so we mark them as the free page map now. We don't yet serialize these pages to the disk, but at least we make a note of what it is, and avoid writing random data to them. Doing this also necessitated cleaning up some of the tests to be more general and hardcode fewer values, which is nice. llvm-svn: 275629	2016-07-15 22:17:19 +00:00
Zachary Turner	5e534c7fb3	[pdb] Round trip the NameMap data structure to YAML. llvm-svn: 275628	2016-07-15 22:17:08 +00:00
Zachary Turner	faa554b2fd	[pdb] Use MsfBuilder to handle the writing PDBs. Previously we would read a PDB, then write some of it back out, but write the directory, super block, and other pertinent metadata back out unchanged. This generates incorrect PDBs since the amount of data written was not always the same as the amount of data read. This patch changes things to use the newly introduced `MsfBuilder` class to write out a correct and accurate set of Msf metadata for the data actually written, which opens up the door for adding and removing type records, symbol records, and other types of data to an existing PDB. llvm-svn: 275627	2016-07-15 22:16:56 +00:00
Saleem Abdulrasool	ea6a4fe841	DebugInfo: reorder some initializers Fix a few initialization ordering warnings from gcc from `-Wreorder`. NFC. llvm-svn: 275615	2016-07-15 21:10:31 +00:00
Zachary Turner	f52a899f4a	[pdb] Introduce MsfBuilder for laying out PDB files. Reviewed by: ruiu Differential Revision: https://reviews.llvm.org/D22308 llvm-svn: 275611	2016-07-15 20:43:38 +00:00
Rui Ueyama	dbdfe62c3f	Dump enum unique names. llvm-svn: 275152	2016-07-12 03:33:48 +00:00
Rui Ueyama	ef5ec2da4a	Re-enable TPI hash verification for enum records. We didn't read unique names correctly. As a result, we computed hashes on (non-)unique names instead of unique names. llvm-svn: 275150	2016-07-12 03:25:03 +00:00
Zachary Turner	dbeaea7b35	Refactor the PDB writing to use a builder approach llvm-svn: 275110	2016-07-11 21:45:26 +00:00
Benjamin Kramer	4d09892e9a	Give helper classes/functions internal linkage. NFC. llvm-svn: 275014	2016-07-10 11:28:51 +00:00
David Majnemer	1b79e9a5b9	[pdb] Sanity check the stream map Some abstractions in LLVM "know" that they are reading in-bounds, FixedStreamArray, and provide a simple result. This breaks down if the stream map is bogus. llvm-svn: 275010	2016-07-10 05:32:05 +00:00
David Majnemer	6211b1f1f9	[llvm-pdbdump] Propagate errors a little more consistently PDBFile::getBlockData didn't really return any indication that it failed. It merely returned an empty buffer. llvm-svn: 275009	2016-07-10 03:34:47 +00:00
David Majnemer	7abd269aa9	[CodeView] Emit an appropriate symbol kind for globals We emitted debug info for globals/functions as if they all had external linkage. Instead, emit local symbol records when appropriate. llvm-svn: 274676	2016-07-06 21:07:47 +00:00
Zachary Turner	8848a7a6b2	[pdb] Round trip the PDB stream between YAML and binary PDB. This gets writing of the PDB stream working. llvm-svn: 274647	2016-07-06 18:05:57 +00:00
Zachary Turner	fbabf2d040	Disable hash verification of enums. llvm-svn: 274639	2016-07-06 17:25:12 +00:00
Reid Kleckner	dafc5d75ea	Prune RelocVisitor.h include to avoid including COFF.h from MCJIT.h This helps to mitigate the conflict between COFF.h and winnt.h, which is PR28399. llvm-svn: 274637	2016-07-06 16:56:42 +00:00
Reid Kleckner	6e96a4c64a	[pdb] Check the display name for <unnamed-tag>, not the linkage name This issue was encountered on libcmt.pdb, which has a type record that looks like this: Struct (0x1094) { TypeLeafKind: LF_STRUCTURE (0x1505) MemberCount: 3 Properties [ (0x200) HasUniqueName (0x200) ] FieldList: <field list> (0x1093) DerivedFrom: 0x0 VShape: 0x0 SizeOf: 4 Name: <unnamed-tag> LinkageName: .?AU<unnamed-tag>@@ } The checks for startswith/endswith "<unnamed-tag>" should look at the display name, not the linkage name. llvm-svn: 274376	2016-07-01 18:43:29 +00:00
Reid Kleckner	64b16171df	[pdb] Avoid reporting an error when the module symbol stream is empty llvm-svn: 274309	2016-07-01 00:37:49 +00:00
Reid Kleckner	7aa95a9fca	[PDB] Indicate which type record failed hash validation llvm-svn: 274308	2016-07-01 00:37:25 +00:00
Zachary Turner	ab58ae8730	[pdb] Re-add code to write PDB files. Somehow all the functionality to write PDB files got removed, probably accidentally when uploading the patch perhaps the wrong one got uploaded. This re-adds all the code, as well as the corresponding test. llvm-svn: 274248	2016-06-30 17:43:00 +00:00
David Majnemer	f15064871a	[CodeView] Healthy paranoia around strings Make sure strings don't get too big for a record, truncate them if need-be. llvm-svn: 273710	2016-06-24 19:34:41 +00:00
Kevin Enderby	931cb65df2	Thread Expected<...> up from libObject’s getSymbolAddress() for symbols to allow a good error message to be produced. This is nearly the last libObject interface that used ErrorOr and the last one that appears in llvm/include/llvm/Object/MachO.h . For Mach-O objects this is just a clean up because it’s version of getSymbolAddress() can’t return an error. I will leave it to the experts on COFF and ELF to actually add meaning full error messages in their tests if they wish. And also leave it to these experts to change the last two ErrorOr interfaces in llvm/include/llvm/Object/ObjectFile.h for createCOFFObjectFile() and createELFObjectFile() if they wish. Since there are no test cases for COFF and ELF error cases with respect to getSymbolAddress() in the test suite this is no functional change (NFC). llvm-svn: 273701	2016-06-24 18:24:42 +00:00
Reid Kleckner	33848faa5e	[codeview] Use one byte for S_FRAMECOOKIE CookieKind and add flags byte We bailed out while printing codeview for an MSVC compiled SemaExprCXX.cpp that used this record. The MS reference headers look incorrect here, which is probably why we had this bug. They use a 32-bit enum as the field type, but the actual record appears to use one byte for the cookie kind followed by a flags byte. llvm-svn: 273691	2016-06-24 17:23:49 +00:00
Reid Kleckner	5aba52ff21	[pdb] Treat a stream size of ~0U as 0 My PDBs always have this size for stream 11. Not sure why. llvm-svn: 273504	2016-06-22 22:42:24 +00:00
Reid Kleckner	ac460619d2	[codeview] Fix the alignment padding that we add to list records Tweak the big-types.ll test case to catch this bug. We just need an enumerator name that doesn't have a length that is a multiple of 4. llvm-svn: 273477	2016-06-22 20:59:17 +00:00
Reid Kleckner	5b335b864b	[codeview] Add support for splitting field list records over 64KB The basic structure is that once a list record goes over 64K, the last subrecord of the list is an LF_INDEX record that refers to the next record. Because the type record graph must be toplogically sorted, this means we have to emit them in reverse order. We build the type record in order of declaration, so this means that if we don't want extra copies, we need to detect when we were about to split a record, and leave space for a continuation subrecord that will point to the eventual split top-level record. Also adds dumping support for these records. Next we should make sure that large method overload lists work properly. llvm-svn: 273294	2016-06-21 18:33:01 +00:00
Rui Ueyama	1abbb31bd4	[codeview] Add an extra check for TPI hash values. This patch adds a function that corresponds to `fUDTAnon` and use that to compute TPI hash values as the reference does. llvm-svn: 273139	2016-06-20 07:31:29 +00:00
Reid Kleckner	604105bb90	[codeview] Add DIFlags for pointer to member representations Summary: This seems like the least intrusive way to pass this information through. Fixes PR28151 Reviewers: majnemer, aprantl, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D21444 llvm-svn: 273053	2016-06-17 21:31:33 +00:00
Reid Kleckner	11582c59d7	[pdb] Don't error on missing FPO streams 64-bit PDBs never have FPO data. They have xdata instead. Also improve error recovery of stream summary dumping while I'm here. llvm-svn: 273046	2016-06-17 20:38:01 +00:00
Rui Ueyama	74c4341dde	[codeview] Use hashBufferV8 to verify all type records. Differential Revision: http://reviews.llvm.org/D21393 llvm-svn: 272930	2016-06-16 18:39:17 +00:00
Zachary Turner	01ee3dae04	Resubmit "[pdb] Change type visitor pattern to be dynamic." There was a regression introduced during type stream merging when visiting a field list record. This has been fixed in this patch. llvm-svn: 272929	2016-06-16 18:22:27 +00:00
Zachary Turner	73b0b2f555	Revert "[pdb] Change type visitor pattern to be dynamic." This reverts commit fb0dd311e1ad945827b8ffd5354f4810e2be1579. This breaks some llvm-readobj tests. llvm-svn: 272927	2016-06-16 18:09:04 +00:00
Zachary Turner	1f6372c429	[pdb] Change type visitor pattern to be dynamic. This allows better catching of compiler errors since we can use the override keyword to verify that methods are actually overridden. Also in this patch I've changed from storing a boolean Error code everywhere to returning an llvm::Error, to propagate richer error information up the call stack. Reviewed By: ruiu, rnk Differential Revision: http://reviews.llvm.org/D21410 llvm-svn: 272926	2016-06-16 18:00:28 +00:00
Rui Ueyama	43ed08efa3	[codeview] Pass CVRecord to visitTypeBegin callback. Both parameters to visitTypeBegin are actually members of CVRecord, so we can just pass CVRecord instead of destructuring it. Differential Revision: http://reviews.llvm.org/D21435 llvm-svn: 272899	2016-06-16 14:47:23 +00:00
Rui Ueyama	b9095ae7ee	[codeview] Remove unused parameter. Differential Revision: http://reviews.llvm.org/D21433 llvm-svn: 272898	2016-06-16 14:41:22 +00:00
Rui Ueyama	5c7248c959	Implement pdb::hashBufferV8 hash function. llvm-svn: 272894	2016-06-16 13:48:16 +00:00
Rui Ueyama	9caea82d3e	Remove redundant namespace specifiers. llvm-svn: 272889	2016-06-16 13:17:59 +00:00
Rui Ueyama	8b0ae136e2	[codeview] Use CVTypeVisitor instead of a hand-written switch-cases. Differential Revision: http://reviews.llvm.org/D21418 llvm-svn: 272888	2016-06-16 13:14:42 +00:00
Rui Ueyama	5dbea9db10	[Codeview] Add a class for LF_UDT_MOD_SRC_LINE. Differential Revision: http://reviews.llvm.org/D21406 llvm-svn: 272843	2016-06-15 21:25:29 +00:00
Reid Kleckner	b82f08fa3d	Axe some trailing whitespace from my last commit llvm-svn: 272830	2016-06-15 20:32:42 +00:00
Reid Kleckner	828c4f64e2	[codeview] Move deserialization methods out of line They aren't performance critical and don't need to be inline. llvm-svn: 272829	2016-06-15 20:30:34 +00:00
Rui Ueyama	41974f1e4d	[pdbdump] Verify LF_{CLASS,ENUM,INTERFACE,STRUCTURE,UNION} records. Differential Revision: http://reviews.llvm.org/D21361 llvm-svn: 272815	2016-06-15 18:26:59 +00:00
Rui Ueyama	9f3e96115c	[pdbdump] Verify TPI hash for LF_ENUM type records. llvm-svn: 272728	2016-06-14 22:25:07 +00:00
Zachary Turner	1dc9fd3c4a	Resubmit "[pdb] Actually write a PDB to disk from YAML."" Reviewed By: ruiu Differential Revision: http://reviews.llvm.org/D21220 llvm-svn: 272708	2016-06-14 20:48:36 +00:00
Zachary Turner	07c229c9e7	Revert "[pdb] Actually write a PDB to disk from YAML." This reverts commit 879139e1c6577b09df52de56a6bab856a19ed185. This was committed accidentally when I blindly typed git svn dcommit instead of the command to generate a patch. llvm-svn: 272693	2016-06-14 18:51:35 +00:00
Zachary Turner	fe5bc02492	[pdb] Actually write a PDB to disk from YAML. llvm-svn: 272692	2016-06-14 18:49:36 +00:00
Zachary Turner	97609bb2fd	[pdb] Fix issues with pdb writing. This fixes an alignment issue by forcing all cached allocations to be 8 byte aligned, and also fixes an issue arising on big endian systems by writing ulittle32_t's instead of uint32_t's in the test. llvm-svn: 272437	2016-06-10 21:47:26 +00:00
Zachary Turner	b84faa8baa	Make PDBFile take a StreamInterface instead of a MemBuffer. This is the next step towards being able to write PDBs. MemoryBuffer is immutable, and StreamInterface is our replacement which can be any combination of read-only, read-write, or write-only depending on the particular implementation. The one place where we were creating a PDBFile (in RawSession) is updated to subclass ByteStream with a simple adapter that holds a MemoryBuffer, and initializes the superclass with the buffer's array, so that all the functionality of ByteStream works transparently. llvm-svn: 272370	2016-06-10 05:10:19 +00:00
Zachary Turner	5acb4ac6d7	Add support for writing through StreamInterface. This adds method and tests for writing to a PDB stream. With this, even a PDB stream which is discontiguous can be treated as a sequential stream of bytes for the purposes of writing. Reviewed By: ruiu Differential Revision: http://reviews.llvm.org/D21157 llvm-svn: 272369	2016-06-10 05:09:12 +00:00
Rui Ueyama	c41cd6dcf7	[pdbdump] Verify part of TPI hash streams. TPI hash table contains a parallel array for the type records. For each type record R, a hash value is calculated by `H(R) % NumBuckets` where H is a hash function, and the result is stored to a bucket element. H is TPI1::hashPrec function in microsoft-pdb repository. Our hash function does not support all type record types yet. Currently it supports only records for line number. I'll extend it in a follow up patch. The aim of verify the hash table is not only detect corrupted files. It ensures that our understanding of how the hash values are calculated is correct. llvm-svn: 272229	2016-06-09 00:10:19 +00:00
Rui Ueyama	f05f360deb	Function names should start with lowercase letters. llvm-svn: 272225	2016-06-08 23:15:09 +00:00
Rui Ueyama	170988f21f	[PDB] Move PDB functions to a separate file. We are going to use the hash functions from TPI streams. Differential Revision: http://reviews.llvm.org/D21142 llvm-svn: 272223	2016-06-08 23:11:14 +00:00
Benjamin Kramer	c321e53402	Apply most suggestions of clang-tidy's performance-unnecessary-value-param Avoids unnecessary copies. All changes audited & pass tests with asan. No functional change intended. llvm-svn: 272190	2016-06-08 19:09:22 +00:00
Zachary Turner	a1657a9e64	[pdb] Handle stream index errors better. Reviewed By: ruiu Differential Revision: http://reviews.llvm.org/D21128 llvm-svn: 272172	2016-06-08 17:26:39 +00:00
Rui Ueyama	ced0853b46	Remove a patch .rej file. llvm-svn: 272171	2016-06-08 16:54:31 +00:00
Zachary Turner	d2b2bfed94	[pdb] Try to fix use after free. llvm-svn: 272078	2016-06-08 00:25:08 +00:00
Rui Ueyama	f14a74c102	[pdbdump] Print out # of hash buckets. In the reference code, the field name is `cHashBuckets`. llvm-svn: 272075	2016-06-07 23:53:43 +00:00
Rui Ueyama	d833917f98	[pdbdump] Print out TPI hash key size. llvm-svn: 272073	2016-06-07 23:44:27 +00:00
Zachary Turner	e6fee88ce1	[pdb] Convert StringRefs to ArrayRef<uint8_t>s. llvm-svn: 272058	2016-06-07 20:38:37 +00:00
Zachary Turner	5839503f08	[pdb] Fix a potential overflow and remove unnecessary comments. llvm-svn: 272043	2016-06-07 18:42:39 +00:00
Zachary Turner	d8447990b0	[pdb] Use MappedBlockStream to parse the PDB directory. In order to efficiently write PDBs, we need to be able to make a StreamWriter class similar to a StreamReader, which can transparently deal with writing to discontiguous streams, and we need to use this for all writing, similar to how we use StreamReader for all reading. Most discontiguous streams are the typical numbered streams that appear in a PDB file and are described by the directory, but the exception to this, that until now has been parsed by hand, is the directory itself. MappedBlockStream works by querying the directory to find out which blocks a stream occupies and various other things, so naturally the same logic could not possibly work to describe the blocks that the directory itself resided on. To solve this, I've introduced an abstraction IPDBStreamData, which allows the client to query for the list of blocks occupied by the stream, as well as the stream length. I provide two implementations of this: one which queries the directory (for indexed streams), and one which queries the super block (for the directory stream). This has the side benefit of vastly simplifying the code to parse the directory. Whereas before a mini state machine was rolled by hand, now we simply use FixedStreamArray to read out the stream sizes, then build a vector of FixedStreamArrays for the stream map, all in just a few lines of code. Reviewed By: ruiu Differential Revision: http://reviews.llvm.org/D21046 llvm-svn: 271982	2016-06-07 05:28:55 +00:00
Rui Ueyama	4a1ebae537	Add comments. llvm-svn: 271967	2016-06-07 00:59:04 +00:00
Reid Kleckner	4ece163c92	Try one more time to pacify -Wpessimizing-move, MSVC, libstdc++4.7, and the world without a named variable llvm-svn: 271964	2016-06-06 23:46:14 +00:00
Reid Kleckner	52a155fca3	Attempt to work around lack of std::map::emplace in libstdc++4.7 llvm-svn: 271958	2016-06-06 23:28:03 +00:00
Rui Ueyama	ba0aab94cc	[pdbdump] Verify the size of TPI hash records. llvm-svn: 271954	2016-06-06 23:19:23 +00:00
Rui Ueyama	ef2b488482	[pdbdump] Print out New FPO stream contents. The data strucutre in the new FPO stream is described in the PE/COFF spec. There is one record per function if frame pointer is omitted. Differential Revision: http://reviews.llvm.org/D20999 llvm-svn: 271926	2016-06-06 18:39:21 +00:00
David Majnemer	36b7b08d4f	[DebugInfo, PDB] Use sparse bitfields for the name map The name map might not be densely packed on disk. Using a sparse map will save memory in such situations. llvm-svn: 271811	2016-06-04 22:47:39 +00:00
David Majnemer	862a8ae812	[CodeView] Fix a busted assert in TypeTableBuilder::writeClass It was checking for Union when it should have checked for Interface. llvm-svn: 271792	2016-06-04 15:40:31 +00:00
David Majnemer	067e3d0cc5	[TypeStreamMerger] visitUnknownMember was supposed to be visitUnknownType llvm-svn: 271790	2016-06-04 15:40:27 +00:00
Rui Ueyama	fd97bf1f76	pdbdump: print out TPI hashes. Differential Revision: http://reviews.llvm.org/D20945 llvm-svn: 271736	2016-06-03 20:48:51 +00:00
Reid Kleckner	ab1dfaae06	Fix non-Windows build when inserting a move only type into a map llvm-svn: 271727	2016-06-03 20:29:51 +00:00
Reid Kleckner	f27f3f8491	[Symbolize] Check if the PE file has a PDB and emit an error if we can't load it Summary: Previously we would try to load PDBs for every PE executable we tried to symbolize. If that failed, we would fall back to DWARF. If there wasn't any DWARF, we'd print mostly useless symbol information using the export table. With this change, we only try to load PDBs for executables that claim to have them. If that fails, we can now print an error rather than falling back silently. This should make it a lot easier to diagnose and fix common symbolization issues, such as not having DIA or not having a PDB. Reviewers: zturner, eugenis Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20982 llvm-svn: 271725	2016-06-03 20:25:09 +00:00
Reid Kleckner	a8d5740757	[codeview] Add basic record type translation This only translates data members for now. Translating overloaded methods is complicated, so I stopped short of doing that. Reviewers: aaboud Differential Revision: http://reviews.llvm.org/D20924 llvm-svn: 271680	2016-06-03 15:58:20 +00:00
Zachary Turner	3df1bfaaec	[pdb] Print out file names instead of file offsets. When printing line information and file checksums, we were printing the file offset field from the struct header. This teaches llvm-pdbdump how to turn those numbers into the filename. In the case of file checksums, this is done by looking in the global string table. In the case of line contributions, this is done by indexing into the file names buffer of the DBI stream. Why they use a different technique I don't know. llvm-svn: 271630	2016-06-03 05:52:57 +00:00
Zachary Turner	d0563f29f9	[pdb] Dump file checksums from pdb codeview line info. llvm-svn: 271622	2016-06-03 04:01:48 +00:00
Zachary Turner	a96cce64a5	[codeview] Dump line number and column information. To facilitate this, a couple of changes had to be made: 1. `ModuleSubstream` got moved from `DebugInfo/PDB` to `DebugInfo/CodeView`, and various codeview related types are defined there. It turns out `DebugInfo/CodeView/Line.h` already defines many of these structures, but this is really old code that is not endian aware, doesn't interact well with `StreamInterface` and not very helpful for getting stuff out of a PDB. Eventually we should migrate the old readobj `COFFDumper` code to these new structures, or at least merge their functionality somehow. 2. A `ModuleSubstream` visitor is introduced. Depending on where your module substream array comes from, different subsets of record types can be expected. We are already hand parsing these substream arrays in many places especially in `COFFDumper.cpp`. In the future we can migrate these paths to the visitor as well, which should reduce a lot of code in `COFFDumper.cpp`. Differential Revision: http://reviews.llvm.org/D20936 Reviewed By: ruiu, majnemer llvm-svn: 271621	2016-06-03 03:25:59 +00:00
Rui Ueyama	0350bf0966	Add comments. llvm-svn: 271597	2016-06-02 21:13:47 +00:00
Zachary Turner	7eb6d358af	[llvm-pdbdump] Dump CodeView line information. This first pass only splits apart the records and dumps the line info kinds and binary data. Subsequent patches will parse out the binary data into more useful information and dump it in detail. llvm-svn: 271576	2016-06-02 20:11:22 +00:00
Zachary Turner	f4e9c9ac08	[codeview] Fix a nasty use after free. StreamRef was designed to be a thin wrapper over an abstract stream interface that could itself be treated the same as any other stream interface. For this reason, it inherited publicly from StreamInterface, and stored a StreamInterface* internally. But StreamRef was also designed to be lightweight and easily copyable, similar to ArrayRef. This led to two misuses of the classes. 1) When creating a StreamRef A from another StreamRef B, it was possible to end up with A storing a pointer to B, even when B was a temporary object, leading to use after free. 2) The above situation could be repeated ad nauseum, so that A stores a pointer to B, which itself stores a pointer to another StreamRef C, and so on and so on, creating an unnecessarily level of nesting depth. This patch removes the public inheritance relationship between StreamRef and StreamInterface, making it so that we can never accidentally convert a StreamRef to a StreamInterface. llvm-svn: 271570	2016-06-02 19:51:48 +00:00
David Majnemer	b68f32f0cf	[CodeView] Use None instead of Void if there is no subprogram llvm-svn: 271566	2016-06-02 18:51:24 +00:00
Rui Ueyama	90db78816b	pdbdump: print out COFF section headers. Unlike other sections that can grow to any size, the COFF section header stream has maximum length because each record is fixed size and the COFF file format limits the maximum number of sections. So I decided to not create a specific stream class for it. Instead, I added a member function to DbiStream class which returns a vector of COFF headers. Differential Revision: http://reviews.llvm.org/D20717 llvm-svn: 271557	2016-06-02 18:20:20 +00:00
Zachary Turner	93839cb4ac	[pdb] Parse and dump section map and section contribs Differential Revision: http://reviews.llvm.org/D20876 Reviewed By: rnk, ruiu llvm-svn: 271488	2016-06-02 05:07:49 +00:00
David Majnemer	a7c29321be	[PDB] Make ModStream::symbols report errors llvm-svn: 271417	2016-06-01 18:13:04 +00:00
Zachary Turner	90b8b8db2e	[pdb] Add unit tests for PDB MappedBlockStream and zero copy Differential Revision: http://reviews.llvm.org/D20837 Reviewed By: ruiu llvm-svn: 271346	2016-05-31 22:41:52 +00:00
Kevin Enderby	9acb109930	Change llvm-objdump, llvm-nm and llvm-size when reporting an object file error when the object is from a slice of a Mach-O Universal Binary use something like "foo.o (for architecture i386)" as part of the error message when expected. Also fixed places in these tools that were ignoring object file errors from MachOUniversalBinary::getAsObjectFile() when the code moved on to see if the slice was an archive. To do this MachOUniversalBinary::getAsObjectFile() and MachOUniversalBinary::getObjectForArch() were changed from returning ErrorOr<...> to Expected<...> then that was threaded up to its users. Converting these interfaces to Expected<> from ErrorOr<> does involve touching a number of places. To contain the changes for now the use of errorToErrorCode() is still used in two places yet to be fully converted. llvm-svn: 271332	2016-05-31 20:35:34 +00:00
Reid Kleckner	fbdbe9e22b	[codeview] Improve readability of type record assembly Adds the method MCStreamer::EmitBinaryData, which is usually an alias for EmitBytes. In the MCAsmStreamer case, it is overridden to emit hex dump output like this: .byte 0x0e, 0x00, 0x08, 0x10 .byte 0x03, 0x00, 0x00, 0x00 .byte 0x00, 0x00, 0x00, 0x00 .byte 0x00, 0x10, 0x00, 0x00 Also, when verbose asm comments are enabled, this patch prints the dump output for each comment before its record, like this: # ArgList (0x1000) { # TypeLeafKind: LF_ARGLIST (0x1201) # NumArgs: 0 # Arguments [ # ] # } .byte 0x06, 0x00, 0x01, 0x12 .byte 0x00, 0x00, 0x00, 0x00 This should make debugging easier and testing more convenient. Reviewers: aaboud Subscribers: majnemer, zturner, amccarth, aaboud, llvm-commits Differential Revision: http://reviews.llvm.org/D20711 llvm-svn: 271313	2016-05-31 18:45:36 +00:00
Reid Kleckner	3b3f490f9c	[codeview] Add a CVTypeDumper::dump(ArrayRef<uint8_t>) overload This is a convenient wrapper when the type record is already laid out as bytes in memory. llvm-svn: 271309	2016-05-31 18:15:23 +00:00
David Majnemer	ba1439229a	Make sure we don't add an empty string to the stringmap llvm-svn: 271172	2016-05-29 06:18:06 +00:00
David Majnemer	c6cb2ec36e	[SymbolDumper] Validate the string table offset before using it llvm-svn: 271145	2016-05-28 20:04:46 +00:00
David Majnemer	b343310b4f	[SymbolDumper] Validate the string table offset before using it llvm-svn: 271142	2016-05-28 19:45:56 +00:00
David Majnemer	328b6d3903	Tighten some of the name map checks further llvm-svn: 271130	2016-05-28 18:03:37 +00:00
David Majnemer	869631f987	Bounds check the number of bitmap blocks in the name map llvm-svn: 271105	2016-05-28 05:59:25 +00:00
David Majnemer	7e950b261a	Make sure the directory contains info for all streams llvm-svn: 271103	2016-05-28 05:59:19 +00:00
Zachary Turner	0d43c1c339	[pdb] Finish conversion to zero copy pdb access. This converts remaining uses of ByteStream, which was still left in the symbol stream and type stream, to using the new StreamInterface zero-copy classes. RecordIterator is finally deleted, so this is the only way left now. Additionally, more error checking is added when iterating the various streams. With this, the transition to zero copy pdb access is complete. llvm-svn: 271101	2016-05-28 05:21:57 +00:00
David Majnemer	74b1fb00f7	Don't discard errors llvm-svn: 271056	2016-05-27 22:07:50 +00:00
Zachary Turner	7dd42598be	[pdb] Fix size check when reading stream bytes. We were accidentally bounds checking the read against the output ArrayRef instead of against the size of the read. llvm-svn: 271040	2016-05-27 20:17:33 +00:00
David Majnemer	6c13db402f	Make sure data is available before dereferencing it llvm-svn: 271032	2016-05-27 18:50:02 +00:00
Zachary Turner	1de49c9ffd	Resubmit "[pdb] Allow zero-copy read support for symbol streams."" Due to differences in template instantiation rules, it is not portable to static_assert(false) inside of an invalid specialization of a template. Instead I just =delete the method so that it can't be used, and leave a comment that it must be explicitly specialized. llvm-svn: 271027	2016-05-27 18:47:20 +00:00
Chad Rosier	6c247c8cc8	Revert "[pdb] Allow zero-copy read support for symbol streams." This reverts commit r271024 due to error: static_assert failed "You must either provide a specialization of VarStreamArrayExtractor or a custom extractor" llvm-svn: 271026	2016-05-27 18:31:02 +00:00
Zachary Turner	3a9a23ae62	[pdb] Allow zero-copy read support for symbol streams. This reduces the amount of memory used by llvm-pdbdump by roughly 1/3 of the size of the PDB file. Differential Revision: http://reviews.llvm.org/D20724 Reviewed By: ruiu llvm-svn: 271025	2016-05-27 18:20:20 +00:00
David Majnemer	836937ed79	Make sure these error codes are marked as checked llvm-svn: 271013	2016-05-27 16:16:56 +00:00
David Majnemer	9efba74778	Make sure there are enough blocks for the stream llvm-svn: 271012	2016-05-27 16:16:48 +00:00
David Majnemer	5d842ea68e	Make sure the directory block array fits in the file llvm-svn: 271011	2016-05-27 16:16:45 +00:00
David Majnemer	878cadb663	Validate the blocksize before using it The blocksize could be zero on disk causing later checks to divide by zero. llvm-svn: 271008	2016-05-27 15:57:38 +00:00
Benjamin Kramer	82de7d323d	Apply clang-tidy's misc-move-constructor-init throughout LLVM. No functionality change intended, maybe a tiny performance improvement. llvm-svn: 270997	2016-05-27 14:27:24 +00:00
Zachary Turner	b393d95359	[codeview] Remove StreamReader copying method. Since we want to move toward zero-copy access to stream data, we want to remove all instances of copying operations. So get rid of some of those here. Differential Revision: http://reviews.llvm.org/D20720 Reviewed By: ruiu llvm-svn: 270960	2016-05-27 03:51:53 +00:00
Zachary Turner	8dbe3629a0	[codeview,pdb] Try really hard to conserve memory when reading. PDBs can be extremely large. We're already mapping the entire PDB into the process's address space, but to make matters worse the blocks of the PDB are not arranged contiguously. So, when we have something like an array or a string embedded into the stream, we have to make a copy. Since it's convenient to use traditional data structures to iterate and manipulate these records, we need the memory to be contiguous. As a result of this, we were using roughly twice as much memory as the file size of the PDB, because every stream was copied out and re-stitched together contiguously. This patch addresses this by improving the MappedBlockStream to allocate from a BumpPtrAllocator only when a read requires a discontiguous read. Furthermore, it introduces some data structures backed by a stream which can iterate over both fixed and variable length records of a PDB. Since everything is backed by a stream and not a buffer, we can read almost everything from the PDB with zero copies. Differential Revision: http://reviews.llvm.org/D20654 Reviewed By: ruiu llvm-svn: 270951	2016-05-27 01:54:44 +00:00
Zachary Turner	d5d37dcf83	[codeview] Move StreamInterface and StreamReader to libcodeview. We have need to reuse this functionality, including making additional generic stream types that are smarter about how and when they copy memory versus referencing the original memory. So all of these structures belong in the common library rather than being pdb specific. llvm-svn: 270751	2016-05-25 20:37:03 +00:00
Zachary Turner	d3076ab36f	[llvm-pdbdump] Decipher the remaining PDB streams. We know at least know the meaning of every stream of the PDB file. Yay! llvm-svn: 270669	2016-05-25 05:49:48 +00:00
Zachary Turner	c9972c64f5	[llvm-pdbdump] Dump the IPI stream and all records. llvm-svn: 270661	2016-05-25 04:35:22 +00:00
Rui Ueyama	b12b158f20	pdbdump: fix bug in name hash table. name_ids() did not return all IDs but only the first NameCount items. The number of non-zero entries in IDs vector is NameCount, but it does not mean that all non-zero entries are at the beginning of IDs vector. Differential Revision: http://reviews.llvm.org/D20611 llvm-svn: 270656	2016-05-25 04:07:17 +00:00
Zachary Turner	c59261ca37	[llvm-pdbdump] Stream 0 isn't actually the MSF superblock. Oddly enough, I realized we don't actually know what stream 0 is (if anything). llvm-svn: 270655	2016-05-25 03:53:16 +00:00
Zachary Turner	85ed80b9e6	[llvm-pdbdump] Dump stream summary list. Try to figure out what each stream is, and dump its name. This gives us a better picture of what streams we still don't understand. llvm-svn: 270653	2016-05-25 03:43:17 +00:00
Zachary Turner	172d59c105	[codeview] Add support for new types and symbols. This patch adds support for: S_EXPORT LF_BITFIELD With this patch, I have run through a couple of gigabytes of PDB files and cannot find a type or symbol that we do not understand. llvm-svn: 270637	2016-05-25 00:12:48 +00:00
Zachary Turner	9f054d424f	[codeview] Add support for S_EXPORT symbol. llvm-svn: 270636	2016-05-25 00:12:40 +00:00
Zachary Turner	4caa1bf0bd	[codeview] Add support for new type records. This adds support for parsing and dumping the following symbol types: S_LPROCREF S_ENVBLOCK S_COMPILE2 S_REGISTER S_COFFGROUP S_SECTION S_THUNK32 S_TRAMPOLINE As of this patch, the test PDB files no longer have any unknown symbol types. llvm-svn: 270628	2016-05-24 22:58:46 +00:00
Zachary Turner	96e60f7573	[llvm-pdbdump] Rework command line options. When dumping huge PDB files, too many of the options were grouped together so you would get neverending spew of output. This patch introduces more granular display options so you can only dump the fields you actually care about. llvm-svn: 270607	2016-05-24 20:31:48 +00:00
Peter Collingbourne	4718f8b5f1	Add FIXMEs to all derived classes of std::error_category. This helps make clear that we're moving away from std::error_code. Differential Revision: http://reviews.llvm.org/D20592 llvm-svn: 270604	2016-05-24 20:13:46 +00:00
Zachary Turner	9e33e6f89b	[codeview, pdb] Dump symbol records in publics stream Differential Revision: http://reviews.llvm.org/D20580 Reviewed By: ruiu llvm-svn: 270597	2016-05-24 18:55:14 +00:00
Zachary Turner	00d847b19e	Fix build errors llvm-svn: 270587	2016-05-24 17:44:29 +00:00
Zachary Turner	cac29ae038	Dump symbol record details in llvm-pdbdump This makes use of the newly introduced `CVSymbolVisitor` to dump details of each type of symbol record in the symbol streams. Future patches will bring this visitor based dumping to the publics stream, as well as creating a `SymbolDumpDelegate` to print more information about relocations etc. Differential Revision: http://reviews.llvm.org/D20545 Reviewed By: ruiu llvm-svn: 270585	2016-05-24 17:30:25 +00:00
George Rimar	401e4e570e	Recommit r270547 ([llvm-dwarfdump] - Teach dwarfdump to decompress debug sections in zlib style.) Fix was: 1) Had to regenerate dwarfdump-test-zlib.elf-x86-64, dwarfdump-test-zlib-gnu.elf-x86-64 (because llvm-symbolizer-zlib.test uses that inputs for its purposes and failed). 2) Updated llvm-symbolizer-zlib.test (updated used call function address to match new files + added one more check for newly created dwarfdump-test-zlib-gnu.elf-x86-64 binary input). 3) Updated comment in dwarfdump-test-zlib.cc. Original commit message: [llvm-dwarfdump] - Teach dwarfdump to decompress debug sections in zlib style. Before this llvm-dwarfdump only recognized zlib-gnu compression style of headers, this patch adds support for zlib style. It looks reasonable to support both styles for dumping, even if we are not going to suport generating of deprecated gnu one. Differential revision: http://reviews.llvm.org/D20470 llvm-svn: 270557	2016-05-24 12:48:46 +00:00
George Rimar	f059dd4f76	Revert r270543 ("Recommit r270540") Failed build bot in another test. I am sorry for noise. http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental_check/23679/testReport/junit/LLVM/DebugInfo/llvm_symbolizer_zlib_test/ llvm-svn: 270547	2016-05-24 11:03:10 +00:00
George Rimar	e9b2e19109	Recommit r270540 fix: forgot to commit the updated dwarfdump-test-zlib.elf-x86-64 Original commit message: [llvm-dwarfdump] - Teach dwarfdump to decompress debug sections in zlib style. Before this llvm-dwarfdump only recognized zlib-gnu compression style of headers, this patch adds support for zlib style. It looks reasonable to support both styles for dumping, even if we are not going to suport generating of deprecated gnu one. Differential revision: http://reviews.llvm.org/D20470 llvm-svn: 270543	2016-05-24 10:46:43 +00:00
George Rimar	6a6185fd78	Revert r270540 "[llvm-dwarfdump] - Teach dwarfdump to decompress debug sections in zlib style." it broked bot: http://lab.llvm.org:8011/builders/clang-s390x-linux/builds/5036 llvm-svn: 270541	2016-05-24 09:44:44 +00:00
George Rimar	6bcbf4c572	[llvm-dwarfdump] - Teach dwarfdump to decompress debug sections in zlib style. Before this llvm-dwarfdump only recognized zlib-gnu compression style of headers, this patch adds support for zlib style. It looks reasonable to support both styles for dumping, even if we are not going to suport generating of deprecated gnu one. Differential revision: http://reviews.llvm.org/D20470 llvm-svn: 270540	2016-05-24 09:28:36 +00:00
Zachary Turner	3e78e2d43f	Remove unused variable. llvm-svn: 270516	2016-05-24 00:06:04 +00:00
Zachary Turner	aaad57440d	Make a symbol visitor and use it to dump CV symbols. Differential Revision: http://reviews.llvm.org/D20534 Reviewed By: rnk llvm-svn: 270511	2016-05-23 23:41:13 +00:00
Rui Ueyama	2a58779198	Fix struct member names and simplify. NFC. llvm-svn: 270289	2016-05-20 22:59:05 +00:00
Rui Ueyama	0fcd82605e	pdbdump: print out symbol names referred by publics stream. DBI stream contains a stream number of the symbol record stream. Symbol record streams is an array of length-type-value members. Each member represents one symbol. Publics stream contains offsets to the symbol record stream. This patch is to print out all symbols that are referenced by the publics stream. Note that even with this patch, llvm-pdbdump cannot dump all the information in a publics stream since it contains more information than symbol names. I'll improve it in followup patches. Differential Revision: http://reviews.llvm.org/D20480 llvm-svn: 270262	2016-05-20 19:55:17 +00:00
Reid Kleckner	e1587bce96	Fix -Wmicrosoft-enum-value warning llvm-svn: 270110	2016-05-19 20:20:22 +00:00
Rui Ueyama	0376b1a2d7	pdbdump: Rename NumberOfSymbols -> SymbolRecordStreamIndex. Differential Revision: http://reviews.llvm.org/D20441 llvm-svn: 270088	2016-05-19 18:05:58 +00:00
Rui Ueyama	350b29862f	pdbdump: Print out section offsets in the publics stream. llvm-svn: 269955	2016-05-18 16:24:16 +00:00
Daniel Sanders	016e6c4354	Try again to fix pdbdump-headers.test on big-endian hosts after r269861. r269898 fixed the problem with HashBuckets but the same issue occurred with AddressMap and ThunkMap too. llvm-svn: 269913	2016-05-18 12:36:25 +00:00
Daniel Sanders	c819d903e1	Attempt to fix pdbdump-headers.test on big-endian hosts after r269861. llvm-svn: 269898	2016-05-18 09:59:14 +00:00
Rui Ueyama	8dc18c5f45	pdbdump: Print out more strcutures. I don't yet fully understand the meaning of these data strcutures, but at least it seems that their sizes and types are correct. With this change, we can read publics streams till end. Differential Revision: http://reviews.llvm.org/D20343 llvm-svn: 269861	2016-05-17 23:07:48 +00:00
Reid Kleckner	fcc5550544	[codeview] Test serialization of all known type records This just checks that we emit all type records once, and then after merging the type stream with no other type streams, we still emit every kind of type record. We could test the dumper output more closely, but that would make the test very brittle. Currently we're just getting coverage. llvm-svn: 269778	2016-05-17 16:20:35 +00:00
Benjamin Kramer	a65b610bd2	Move helper classes into anonymous namespaces. NFC. llvm-svn: 269591	2016-05-15 15:18:11 +00:00
Reid Kleckner	0b269748a6	[codeview] Add type stream merging prototype Summary: This code is intended to be used as part of LLD's PDB writing. Until that exists, this is exposed via llvm-readobj for testing purposes. Type stream merging uses the following algorithm: - Begin with a new empty stream, and a new empty hash table that maps from type record contents to new type index. - For each new type stream, maintain a map from source type index to destination type index. - For each record, copy it and rewrite its type indices to be valid in the destination type stream. - If the new type record is not already present in the destination stream hash table, append it to the destination type stream, assign it the next type index, and update the two hash tables. - If the type record already exists in the destination stream, discard it and update the type index map to forward the source type index to the existing destination type index. Reviewers: zturner, ruiu Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20122 llvm-svn: 269521	2016-05-14 00:02:53 +00:00
Rui Ueyama	1f6b6e2c53	pdbdump: Print "Publics" stream. Publics stream seems to contain information as to public symbols. It actually contains a serialized hash table along with fixed-sized headers. This patch is not complete. It scans only till the end of the stream and dump the header information. I'll write code to de-serialize the hash table later. Reviewers: zturner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20256 llvm-svn: 269484	2016-05-13 21:21:53 +00:00
Reid Kleckner	4525fbe22a	[codeview] Align class and print names of types Summary: This way we can get rid of one of the fields in the .def file. Reviewers: llvm-commits Subscribers: zturner Differential Revision: http://reviews.llvm.org/D20251 llvm-svn: 269461	2016-05-13 19:37:07 +00:00
Reid Kleckner	bab3fab806	[codeview] Dump the type index on the first line of each record This will make it easier to write FileCheck tests. llvm-svn: 269444	2016-05-13 17:48:24 +00:00
Reid Kleckner	ce5196e728	[codeview] Try to handle errors better in record iterator llvm-svn: 269381	2016-05-12 23:26:23 +00:00
Zachary Turner	123a52735d	Get rid of CVLeafTypes.def and combine with TypeRecords.def This merges the functionality of the macros in `CVLeafTypes.def` and the macros in `TypeRecords.def` into a single set of macros. Differential Revision: http://reviews.llvm.org/D20190 Reviewed By: rnk, amccarth llvm-svn: 269316	2016-05-12 17:45:51 +00:00
Zachary Turner	38cc8b3f21	Make CodeView record serialization more generic. This introduces a variadic template and some helper macros to safely and correctly deserialize many types of common record fields while maintaining error checking. Differential Revision: http://reviews.llvm.org/D20183 Reviewed By: rnk, amccarth llvm-svn: 269315	2016-05-12 17:45:44 +00:00
Zachary Turner	3f61c1ab5e	Fix build breakage in DebugInfoCodeview llvm-svn: 269217	2016-05-11 17:54:20 +00:00
Zachary Turner	ae3882a19a	Refactor CodeView type records to use common code. Differential Revision: http://reviews.llvm.org/D20138 Reviewed By: rnk llvm-svn: 269216	2016-05-11 17:47:35 +00:00
Eugene Zelenko	417d4c508b	Fix some Clang-tidy modernize-deprecated-headers and Include What You Use warnings; other minor fixes. Differential revision: http://reviews.llvm.org/D20042 llvm-svn: 268989	2016-05-09 23:11:38 +00:00
Zachary Turner	06c2b4be25	[pdb] Parse the module info stream for each module. Differential Revision: http://reviews.llvm.org/D20026 Reviewed By: rnk llvm-svn: 268942	2016-05-09 17:45:21 +00:00
Zachary Turner	9073ed6e5a	Make TypeIterator generic so it can iterate symbols too. Reviewed By: amccarth Differential Revision: http://reviews.llvm.org/D20038 llvm-svn: 268941	2016-05-09 17:44:58 +00:00
Zachary Turner	5d105a977e	Drop error when trying to fallback from PDB to DWARF. llvm-svn: 268813	2016-05-06 22:29:34 +00:00
Zachary Turner	5a1b5ef9eb	Make llvm-pdbdump print CV type records This reuses the CVTypeDumper from libcodeview to dump full information about type records within a PDB file. Differential Revision: http://reviews.llvm.org/D20022 Reviewed By: rnk llvm-svn: 268808	2016-05-06 22:15:42 +00:00
Zachary Turner	2b37017c38	Add missing include. llvm-svn: 268792	2016-05-06 20:59:35 +00:00
Zachary Turner	819e77d196	Port DebugInfoPDB over to using llvm::Error. Differential Revision: http://reviews.llvm.org/D19940 Reviewed By: rnk llvm-svn: 268791	2016-05-06 20:51:57 +00:00
Reid Kleckner	745f3cbcfc	[codeview] Improve some comments This FIXME was already fixed, and these LF_* enum names were inconsistent. llvm-svn: 268683	2016-05-05 20:58:46 +00:00
Reid Kleckner	338034759a	Fix CVTypeDumperImpl formatting after class rename llvm-svn: 268678	2016-05-05 20:31:16 +00:00
Reid Kleckner	4a14bcac41	[codeview] Move dumper into lib/DebugInfo/CodeView So that we can call it from llvm-pdbdump. llvm-svn: 268580	2016-05-05 00:34:33 +00:00
Zachary Turner	ec28fc3499	Move pdb code into pdb namespace. llvm-svn: 268544	2016-05-04 20:32:13 +00:00
Reid Kleckner	7960de99db	[codeview] Add a type visitor to help abstract away type stream handling Summary: Port the dumper in llvm-readobj over to it. I'm planning to use this visitor to power type stream merging. While we're at it, try to switch from StringRef to ArrayRef<uint8_t> in some places. Reviewers: zturner, amccarth Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19899 llvm-svn: 268535	2016-05-04 19:39:28 +00:00
Zachary Turner	ce48c4d975	Remove unused variable. llvm-svn: 268455	2016-05-03 22:26:46 +00:00
Zachary Turner	2d02ceefdc	Move CodeViewTypeStream to DebugInfo/CodeView Ability to parse codeview type streams is also needed by DebugInfoPDB for parsing PDBs, so moving this into a library gives us this option. Since DebugInfoPDB had already hand rolled some code to do this, that code is now convereted over to using this common abstraction. Differential Revision: http://reviews.llvm.org/D19887 Reviewed By: dblaikie, amccarth llvm-svn: 268454	2016-05-03 22:18:17 +00:00
Zachary Turner	66635f0235	Change operation_not_supported to not_supported. Apparently operation_not_supported is... not supported everywhere. llvm-svn: 268348	2016-05-03 00:53:16 +00:00
Zachary Turner	f5c59654f7	Parse the TPI (type information) stream of PDB files. This parses the TPI stream (stream 2) from the PDB file. This stream contains some header information followed by a series of codeview records. There is some additional complexity here in that alongside this stream of codeview records is a serialized hash table in order to efficiently query the types. We parse the necessary bookkeeping information to allow us to reconstruct the hash table, but we do not actually construct it yet as there are still a few things that need to be understood first. Differential Revision: http://reviews.llvm.org/D19840 Reviewed By: ruiu, rnk llvm-svn: 268343	2016-05-03 00:28:21 +00:00
Zachary Turner	d6192f482f	[llvm-pdbdump] Fix read past EOF when file is too small. llvm-svn: 268316	2016-05-02 22:16:57 +00:00
Kevin Enderby	7bd8d99497	Thread Expected<...> up from libObject’s getType() for symbols to allow llvm-objdump to produce a good error message. Produce another specific error message for a malformed Mach-O file when a symbol’s section index is more than the number of sections. The existing test case in test/Object/macho-invalid.test for macho-invalid-section-index-getSectionRawName now reports the error with the message indicating that a symbol at a specific index has a bad section index and that bad section index value. Again converting interfaces to Expected<> from ErrorOr<> does involve touching a number of places. Where the existing code reported the error with a string message or an error code it was converted to do the same. Also there some were bugs in the existing code that did not deal with the old ErrorOr<> return values. So now with Expected<> since they must be checked and the error handled, I added a TODO and a comment: "// TODO: Actually report errors helpfully" and a call something like consumeError(NameOrErr.takeError()) so the buggy code will not crash since needed to deal with the Error. llvm-svn: 268298	2016-05-02 20:28:12 +00:00
Zachary Turner	a801dc17d9	Fix build breakage due to implicit conversion. llvm-svn: 268277	2016-05-02 18:36:58 +00:00
Zachary Turner	b56d904433	PDB - Instead of hardcoding stream numbers, use an enum. llvm-svn: 268270	2016-05-02 18:09:21 +00:00
Zachary Turner	0eace0bae5	Parse PDB Name Hash Table PDB has a lot of similar data structures. We already have code for parsing a Name Map, but PDB seems to have a different but very similar structure that is a hash table. This is the beginning of code needed in order to parse the name hash table, but it is not yet complete. It parses the basic metadata of the hash table, the bucket array, and the names buffer, but doesn't use any of these fields yet as the data structure requires a non-trivial amount of work to understand. llvm-svn: 268268	2016-05-02 18:09:14 +00:00
Zachary Turner	9213ba5304	Fix crash in PDB when loading corrupt file. There are probably hundreds of crashers we can find by fuzzing more. For now we do the simplest possible validation of the block size. Later, more complicated validations can verify that other fields of the super block such as directory size, number of blocks, agree with the size of the file etc. llvm-svn: 268084	2016-04-29 18:09:19 +00:00
Zachary Turner	2f09b5091c	Put PDB parsing code into a pdb namespace. llvm-svn: 268072	2016-04-29 17:28:47 +00:00
Zachary Turner	6ba65deeb9	Refactor the PDB Stream reading interface. The motivation for this change is that PDB has the notion of streams and substreams. Substreams often consist of variable length structures that are convenient to be able to treat as guaranteed, contiguous byte arrays, whereas the streams they are contained in are not necessarily so, as a single stream could be spread across many discontiguous blocks. So, when processing data from a substream, we want to be able to assume that we have a contiguous byte array so that we can cast pointers to variable length arrays and such. This leads to the question of how to be able to read the same data structure from either a stream or a substream using the same interface, which is where this patch comes in. We separate out the stream's read state from the underlying representation, and introduce a `StreamReader` class. Then we change the name of `PDBStream` to `MappedBlockStream`, and introduce a second kind of stream called a `ByteStream` which is simply a sequence of contiguous bytes. Finally, we update all of the std::vectors in `PDBDbiStream` to use `ByteStream` instead as a proof of concept. llvm-svn: 268071	2016-04-29 17:22:58 +00:00
David Majnemer	ca9ac4721d	[llvm-pdbdump] Try to appease the ASan bot We didn't check that the file was large enough to hold a super block. llvm-svn: 267965	2016-04-29 01:00:17 +00:00
David Majnemer	1573b242ae	[llvm-pdbdump] Restore error messages, handle bad block sizes We lost the ability to report errors, bring it back. Also, correctly validate the block size. llvm-svn: 267955	2016-04-28 23:47:27 +00:00

... 4 5 6 7 8 ...

1009 Commits