llvm-project

Commit Graph

Author	SHA1	Message	Date
James Henderson	216796f234	[DebugInfo] Fix infinite loop caused by reading past debug_line end If the claimed unit length of a debug line program is such that the line table would finish past the end of the .debug_line section, an infinite loop occurs because the data extractor will continue to "read" zeroes without changing the offset. This previously didn't hit an error because the line table program handles a series of zeroes as a bad extended opcode. This patch fixes the inifinite loop and adds a warning if the program doesn't fit in the available data. Reviewed by: JDevlieghere Differential Revision: https://reviews.llvm.org/D72279	2020-01-07 10:22:35 +00:00
James Henderson	d68904f957	[NFC] Fix trivial typos in comments Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D72143 Patch by Kazuaki Ishizaki.	2020-01-06 10:50:26 +00:00
Jonas Devlieghere	c75aac42a6	[DWARF] Don't assume optional always has a value. When getting the file name form the line table prologue we assume that a valid string form value can always be extracted as a string. If you look at the implementation of DWARFormValue this is not necessarily true. I hit this assertion from LLDB when I create a "dummy" DWARFContext that was missing the string section.	2020-01-03 09:53:44 -08:00
James Henderson	418cd8216b	[DebugInfo] Remove redundant checks for past-the-end of prologue The V5 directory and filename tables had checks in to make sure we hadn't read past the end of the line table prologue. Since previous changes to the data extractor class ensure we never read past the end, these checks are now redundant, so this patch removes them. There is still a check to show that the whole prologue remains within the prologue length. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D71768	2020-01-03 12:35:32 +00:00
Reid Kleckner	783db78835	[PDB] Print the most redundant type record indices with /summary Summary: I used this information to motivate splitting up the Intrinsic::ID enum (`5d986953c8`) and adding a key method to clang::Sema (`586f65d31f`) which saved a fair amount of object file size. Example output for clang.pdb: Top 10 types responsible for the most TPI input bytes: index total bytes count size 0x3890: 8,671,220 = 1,805 * 4,804 0xE13BE: 5,634,720 = 252 * 22,360 0x6874C: 5,181,600 = 408 * 12,700 0x2A1F: 4,520,528 = 1,574 * 2,872 0x64BFF: 4,024,020 = 469 * 8,580 0x1123: 4,012,020 = 2,157 * 1,860 0x6952: 3,753,792 = 912 * 4,116 0xC16F: 3,630,888 = 633 * 5,736 0x69DD: 3,601,160 = 985 * 3,656 0x678D: 3,577,904 = 319 * 11,216 In this case, we can see that record 0x3890 is responsible for ~8MB of total object file size for objects in clang. The user can then use llvm-pdbutil to find out what the record is: $ llvm-pdbutil dump -types -type-index 0x3890 Types (TPI Stream) ============================================================ Showing 1 records. 0x3890 \| LF_FIELDLIST [size = 4804] - LF_STMEMBER [name = `WORDTYPE_MAX`, type = 0x1001, attrs = public] - LF_MEMBER [name = `U`, Type = 0x37F0, offset = 0, attrs = private] - LF_MEMBER [name = `BitWidth`, Type = 0x0075 (unsigned), offset = 8, attrs = private] - LF_METHOD [name = `APInt`, # overloads = 8, overload list = 0x3805] ... In this case, we can see that these are members of the APInt class, which is emitted in 1805 object files. The next largest type is ASTContext: $ llvm-pdbutil dump -types -type-index 0xE13BE bin/clang.pdb 0xE13BE \| LF_FIELDLIST [size = 22360] - LF_BCLASS type = 0x653EA, offset = 0, attrs = public - LF_MEMBER [name = `Types`, Type = 0x653EB, offset = 8, attrs = private] - LF_MEMBER [name = `ExtQualNodes`, Type = 0x653EC, offset = 24, attrs = private] - LF_MEMBER [name = `ComplexTypes`, Type = 0x653ED, offset = 48, attrs = private] - LF_MEMBER [name = `PointerTypes`, Type = 0x653EE, offset = 72, attrs = private] ... ASTContext only appears 252 times, but the list of members is long, and must be repeated everywhere it is used. This was the output before I split Intrinsic::ID: Top 10 types responsible for the most TPI input: 0x686C: 69,823,920 = 1,070 * 65,256 0x686D: 69,819,640 = 1,070 * 65,252 0x686E: 69,819,640 = 1,070 * 65,252 0x686B: 16,371,000 = 1,070 * 15,300 ... These records were all lists of intrinsic enums. Reviewers: MaskRay, ruiu Subscribers: mgrang, zturner, thakis, hans, akhuang, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71437	2020-01-02 16:10:36 -08:00
James Henderson	bd402fc3f3	[DebugInfo][NFC] Use function_ref consistently in debug line parsing This patch fixes an inconsistency where we were using std::function in some places and function_ref in others to pass around the error handling callback. Reviewed by: MaskRay Differential Revision: https://reviews.llvm.org/D71762	2020-01-02 18:01:54 +00:00
Mark de Wever	8dc7b982b4	[NFC] Fixes -Wrange-loop-analysis warnings This avoids new warnings due to D68912 adds -Wrange-loop-analysis to -Wall. Differential Revision: https://reviews.llvm.org/D71857	2020-01-01 20:01:37 +01:00
David Blaikie	199700a5cf	DebugInfo: Support dumping any exprloc as an expression Now that DWARFv5 provides a way to identify DWARF expressions based on form, rather than only by attribute - use it to always provide pretty printing for any exprloc attribute, not only the attributes known to contain expressions.	2019-12-23 19:18:47 -08:00
Igor Kudrin	6f635f9092	[DWARF] Check that all fields of a Unit Header are read. Tests "dwarfdump-rnglists-dwarf64.s" and "dwarfdump-rnglists.s" were malformed because they had missing required DWO ID fields in split compilation unit headers. The patch fixes the tests and checks the reading of a unit header more thoroughly. Differential Revision: https://reviews.llvm.org/D71704	2019-12-24 09:38:20 +07:00
Yury Delendik	adf7a0a558	[WebAssembly] Use TargetIndex operands in DbgValue to track WebAssembly operands locations Extends DWARF expression language to express locals/globals locations. (via target-index operands atm) (possible variants are: non-virtual registers or address spaces) The WebAssemblyExplicitLocals can replace virtual registers to targertindex operand type at the time when WebAssembly backend introduces {get,set,tee}_local instead of corresponding virtual registers. Reviewed By: aprantl, dschuff Tags: #debug-info, #llvm Differential Revision: https://reviews.llvm.org/D52634	2019-12-20 14:39:05 -08:00
Evgenii Stepanov	b538a2aa07	llvm-symbolizer: support DW_FORM_loclistx locations. Summary: With -gdwarf-5 local variable locations are emitted as DW_FORM_loclistx form instead of the regular DW_FORM_sec_offset. Teach DWARFDie::getLocations to understand the new format and use it in llvm-symbolizer "FRAME" command. Reviewers: pcc, jdoerfert Subscribers: srhines, aprantl, hiraditya, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70756	2019-12-20 10:36:14 -08:00
Eric Christopher	3075cd5c9f	Temporarily Revert "[Dsymutil][Debuginfo][NFC] Refactor dsymutil to separate DWARF optimizing part 2." as it causes a layering violation/dependency cycle: llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp -> llvm/DebugInfo/DWARF/DWARFExpression.h llvm/include/llvm/DebugInfo/DWARF/DWARFOptimizer.h -> llvm/CodeGen/NonRelocatableStringpool.h This reverts commit `abc7f6800d`.	2019-12-19 13:29:02 -08:00
James Henderson	60cb33c9b8	[DebugInfo] Fix verbose printing of rows added via DW_LNE_end_sequence The debug line verbose printing was printing the wrong values for rows added via DW_LNE_end_sequence, because the row was being printed AFTER its state had been reset following it being appended to the line table. This patch fixes this issue by printing the row before appending it. Reviewers: dblaikie, MaskRay Differential Revision: https://reviews.llvm.org/D71664	2019-12-19 12:54:04 +00:00
Alexey Lapshin	abc7f6800d	[Dsymutil][Debuginfo][NFC] Refactor dsymutil to separate DWARF optimizing part 2. That patch is extracted from the D70709. It moves CompileUnit, DeclContext into llvm/DebugInfo/DWARF. It also adds new file DWARFOptimizer with AddressesMap class. AddressesMap generalizes functionality from RelocationManager. Differential Revision: https://reviews.llvm.org/D71271	2019-12-19 15:41:48 +03:00
David Blaikie	eed0242330	DebugInfo: Don't use implicit zero addr_base (found when LLVM fails to emit addr_base for gmlt+DWARFv5)	2019-12-18 16:28:19 -08:00
James Henderson	5666b70fd0	[DebugInfo] Only print a single blank line after an empty line table Commit `84a9756` added an extra blank line at the end of any line table. However, a blank line is also printed after the line table header, which meant that two blank lines in a row were being printed after a header, if there were no rows. This patch defers the post-header blank line printing until it has been determined that there are rows to print. Reviewed by: dblaikie Differential Revision: https://reviews.llvm.org/D71540	2019-12-17 12:04:09 +00:00
James Henderson	84a9756a72	[llvm-dwarfdump] Add blank line after printing line table This helps delineate it in the output from later tables or other output. Reviewed by: JDevlieghere Differential Revision: https://reviews.llvm.org/D71344	2019-12-12 14:06:10 +00:00
Alexey Lapshin	71aaebc824	[DWARF5][DWARFVerifier] Check that Skeleton compilation unit does not have children. That patch adds checking into DWARFVerifier that the Skeleton compilation unit does not have children. Differential Revision: https://reviews.llvm.org/D71244	2019-12-12 10:59:10 +03:00
James Henderson	2f8155023a	[DebugInfo] Fix printing of DW_LNS_set_isa The Isa register is a uint8_t, but at least on Windows this is internally an unsigned char, which meant that prior to this patch it got formatted as an ASCII character, rather than a decimal number. This patch fixes this by casting it to a uint64_t before printing. I did it this way instead of using a uint8_t formatter because a) it is simpler, and b) it allows us to change the internal type of Isa in the future without this code breaking. I also took the opportunity to test the printing of the other standard opcodes. Reviewed by: probinson Differential Revision: https://reviews.llvm.org/D71274	2019-12-11 13:38:41 +00:00
Sourabh Singh Tomar	fb4d8fe1a8	Recommit "[DWARF5] Start emitting DW_AT_dwo_name when -gdwarf-5 is specified." Reviewers: dblaikie, aprantl, probinson Tags: #debug-info #llvm Differential Revision: https://reviews.llvm.org/D71185	2019-12-11 01:24:50 +05:30
Sourabh Singh Tomar	d82b6ba21b	Revert "[DWARF5] Start emitting DW_AT_dwo_name when -gdwarf-5 is specified." This reverts commit `6ef01588f4`. Missing Differetial revision.	2019-12-11 01:20:40 +05:30
Sourabh Singh Tomar	6ef01588f4	[DWARF5] Start emitting DW_AT_dwo_name when -gdwarf-5 is specified.	2019-12-11 01:18:02 +05:30
Reid Kleckner	7f63db197e	Avoid naming variable after type to fix GCC 5.3 build GCC says: .../llvm/lib/DebugInfo/GSYM/FunctionInfo.cpp:195:12: error: ‘InfoType’ is not a class, namespace, or enumeration case InfoType::EndOfList: ^ Presumably, GCC thinks InfoType is a variable here. Work around it by using the name IT as is done above.	2019-12-06 11:25:28 -08:00
Douglas Yung	da650094b1	Fix build of LookupResult.cpp from `aeda128` with Visual C++.	2019-12-05 21:03:03 -08:00
Greg Clayton	aeda128a96	Add lookup functions for efficient lookups of addresses when using GsymReader classes. Summary: Lookup functions are designed to not fully decode a FunctionInfo, LineTable or InlineInfo, they decode only what is needed into a LookupResult object. This allows lookups to avoid costly memory allocations and avoid parsing large amounts of information one a suitable match is found. LookupResult objects contain the address that was looked up, the concrete function address range, the name of the concrete function, and a list of source locations. One for each inline function, and one for the concrete function. This allows one address to turn into multiple frames and improves the signal you get when symbolicating addresses in GSYM files. Reviewers: labath, aprantl Subscribers: mgorny, hiraditya, llvm-commits, lldb-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70993	2019-12-05 16:49:53 -08:00
Pavel Labath	4ee76a922a	[llvm/DWARF] Return section offset from DWARFUnit::get{Loc,Rng}listOffset Summary: Currently these function return the raw content of the appropriate table header, which means they are relative to the DW_AT_{loc,rng}list_base, and one has to relocate them in order to do anything. This changes the functions to perform the relocation themselves, which seems more clearer, particularly as they are sitting right next to the find{Rng,Loc}listFromOffset functions, but one cannot simply take the result of these functions and take pass them there. The only effect of this patch is to change what value is dumped for the DW_AT_ranges attribute, which I think is for the better, as previously the values appeared to point into thin air. (The main reason I am looking at this is because I was trying to implement equivalent functionality in lldb's DWARFUnit, and was stumped by this behavior. Reviewers: dblaikie, JDevlieghere, aprantl Subscribers: hiraditya, llvm-commits, SouraVX Tags: #llvm Differential Revision: https://reviews.llvm.org/D71006	2019-12-05 12:35:09 +01:00
Petr Hosek	00e436f130	[llvm-symbolizer] Support debug file lookup using build ID Build ID is a protocol for looking up debug files that's already supported by various tools including debuggers. For example, when locating debug files, gdb would check the following directories: - /usr/lib/debug/.build-id/ab/cdef1234.debug - /usr/bin/ls.debug - /usr/bin/.debug/ls.debug - /usr/lib/debug/usr/bin/ls.debug llvm-symbolizer currently consults all of these except for build ID based one. This patch implements support for build ID lookup. The set of debug directories to search is specified by the new option: --debug-file-directory, whose name matches the debug-file-directory variable used by gdb for the same purpose. Differential Revision: https://reviews.llvm.org/D70759	2019-12-04 15:07:56 -08:00
Pavel Labath	a3af3ac393	[DWARFDebugLoclists] Add support for other DW_LLE encodings Summary: lldb's loclists parser has support for DW_LLE_start_end(x) encodings. To avoid regressing when switching the implementation to llvm's, I add parsing support for all previously unsupported location list encodings. Reviewers: dblaikie, JDevlieghere, aprantl, SouraVX Subscribers: hiraditya, probinson, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70949	2019-12-04 10:38:21 +01:00
Pavel Labath	d34927e7db	[DWARFDebugRnglists] Add a callback-based version of the getAbsoluteRanges function Summary: The dump() function already accepts a callback. This makes getAbsoluteRanges do the same. The existing DWARFUnit overload is implemented on top of the new function. This enables usage of the debug_rnglists parser from within lldb (which has it's own dwarf parser). Reviewers: dblaikie, JDevlieghere, aprantl Subscribers: hiraditya, probinson, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70952	2019-12-04 10:35:57 +01:00
Pavel Labath	1fbe8a82e1	[DWARF] Add support for parsing/dumping section indices in location lists Summary: This does exactly what it says on the box. The only small gotcha is the section index computation for offset_pair entries, which can use either the base address section, or the section from the offset_pair entry. This is to support both the cases where the base address is relocated (points to the base of the CU, typically), and the case where the base address is a constant (typically zero) and relocations are on the offsets themselves. Reviewers: dblaikie, JDevlieghere, aprantl, SouraVX Subscribers: hiraditya, llvm-commits, probinson Tags: #llvm Differential Revision: https://reviews.llvm.org/D70540	2019-12-03 11:48:28 +01:00
Sourabh Singh Tomar	3f3d0f4f4b	[DebugInfo] Support for debug_macinfo.dwo section in llvm and llvm-dwarfdump. This patch adds support for debug_macinfo.dwo section[pre-standardized] to llvm and llvm-dwarfdump. Reviewers: probinson, dblaikie, aprantl, jini.susan.george, alok Differential Revision: https://reviews.llvm.org/D70705 Tags: #debug-info #llvm	2019-12-03 08:54:12 +05:30
Evgenii Stepanov	1b42cc0df1	llvm-symbolizer: fix handling of DW_AT_specification in FRAME. Summary: Use getSubroutineName() to the the subrouting name; this function knows how to handle cases when DW_TAG_subprogram refers to an earlier declaration: 0x00000050: DW_TAG_subprogram DW_AT_linkage_name ("_ZN1A1fEv") DW_AT_name ("f") ... 0x00000067: DW_TAG_subprogram DW_AT_low_pc (0x0000000000000000) DW_AT_high_pc (0x0000000000000020) DW_AT_specification (0x00000050 "_ZN1A1fEv") ... 0x0000008c: DW_TAG_variable Reviewers: pcc, vitalybuka, jdoerfert Subscribers: srhines, hiraditya, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70630	2019-11-25 15:06:07 -08:00
Evgenii Stepanov	9f60820d84	llvm-symbolizer: Support loclist in FRAME. Summary: Support location lists in FRAME command. These are used for the majority of local variables in optimized code. Also support DW_OP_breg in addition to DW_OP_fbreg when it refers to the same register as DW_AT_frame_base. Reviewers: pcc, jdoerfert Subscribers: srhines, hiraditya, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70629	2019-11-25 15:06:07 -08:00
Evgenii Stepanov	1c33d7130e	llvm-symbolizer: Fix FRAME handling of missing AT_name. Summary: llvm-symbolizer protocol is empty string means end-of-output. Do not emit empty string when a function or a variable do not have a name for any reason. Emit "??". Reviewers: pcc, vitalybuka, jdoerfert Subscribers: srhines, hiraditya, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70626	2019-11-25 14:55:11 -08:00
Dávid Bolvanský	bc2b380c0d	[pdbutil] Fixed -Wdeprecated-copy in DbiModuleDescriptor	2019-11-23 23:33:22 +01:00
Sourabh Singh Tomar	0e02977b6e	Recommit "[DWARF] Support for loclist.dwo section in llvm and llvm-dwarfdump." The original commit message follows. This patch adds support for debug_loclists.dwo section in llvm and llvm-dwarfdump. Also Fixes PR43622, PR43623. Reviewers: dblaikie, probinson, labath, aprantl, jini.susan.george Differential Revision: https://reviews.llvm.org/D69462	2019-11-23 20:10:23 +05:30
Sourabh Singh Tomar	02cb4b2fd6	Revert "[DWARF] Support for loclist.dwo section in llvm and llvm-dwarfdump." This reverts commit `81b0a3284a`. Will Re-apply, with updated Differtial Revision, for automatic closure of Phabricator review.	2019-11-23 19:46:07 +05:30
Sourabh Singh Tomar	81b0a3284a	[DWARF] Support for loclist.dwo section in llvm and llvm-dwarfdump. This patch adds support for debug_loclists.dwo section in llvm and llvm-dwarfdump. Also Fixes PR43622, PR43623. Reviewers: dblaikie, probinson, labath, aprantl, jini.susan.george https://reviews.llvm.org/D69462	2019-11-23 10:25:11 +05:30
Pavel Labath	01bb3b07c3	[DWARFVerifier] Use the new location list api Summary: Instead of going to the debug_loc section directly, use new DWARFDie::getLocations instead. This means that the code will now automatically support debug_loclists sections. This is the last usage of the old debug_loc methods, and they can now be removed. Reviewers: dblaikie, JDevlieghere, aprantl, SouraVX Subscribers: hiraditya, probinson, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70534	2019-11-22 10:08:39 +01:00
Tom Stellard	ab411801b8	[cmake] Explicitly mark libraries defined in lib/ as "Component Libraries" Summary: Most libraries are defined in the lib/ directory but there are also a few libraries defined in tools/ e.g. libLLVM, libLTO. I'm defining "Component Libraries" as libraries defined in lib/ that may be included in libLLVM.so. Explicitly marking the libraries in lib/ as component libraries allows us to remove some fragile checks that attempt to differentiate between lib/ libraries and tools/ libraires: 1. In tools/llvm-shlib, because llvm_map_components_to_libnames(LIB_NAMES "all") returned a list of all libraries defined in the whole project, there was custom code needed to filter out libraries defined in tools/, none of which should be included in libLLVM.so. This code assumed that any library defined as static was from lib/ and everything else should be excluded. With this change, llvm_map_components_to_libnames(LIB_NAMES, "all") only returns libraries that have been added to the LLVM_COMPONENT_LIBS global cmake property, so this custom filtering logic can be removed. Doing this also fixes the build with BUILD_SHARED_LIBS=ON and LLVM_BUILD_LLVM_DYLIB=ON. 2. There was some code in llvm_add_library that assumed that libraries defined in lib/ would not have LLVM_LINK_COMPONENTS or ARG_LINK_COMPONENTS set. This is only true because libraries defined lib lib/ use LLVMBuild.txt and don't set these values. This code has been fixed now to check if the library has been explicitly marked as a component library, which should now make it easier to remove LLVMBuild at some point in the future. I have tested this patch on Windows, MacOS and Linux with release builds and the following combinations of CMake options: - "" (No options) - -DLLVM_BUILD_LLVM_DYLIB=ON - -DLLVM_LINK_LLVM_DYLIB=ON - -DBUILD_SHARED_LIBS=ON - -DBUILD_SHARED_LIBS=ON -DLLVM_BUILD_LLVM_DYLIB=ON - -DBUILD_SHARED_LIBS=ON -DLLVM_LINK_LLVM_DYLIB=ON Reviewers: beanz, smeenai, compnerd, phosek Reviewed By: beanz Subscribers: wuzish, jholewinski, arsenm, dschuff, jyknight, dylanmckay, sdardis, nemanjai, jvesely, nhaehnle, mgorny, mehdi_amini, sbc100, jgravelle-google, hiraditya, aheejin, fedor.sergeev, asb, rbar, johnrusso, simoncook, apazos, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, atanasyan, steven_wu, rogfer01, MartinMosbeck, brucehoult, the_o, dexonsmith, PkmX, jocewei, jsji, dang, Jim, lenary, s.egerton, pzheng, sameer.abuasal, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70179	2019-11-21 10:48:08 -08:00
Alexey Lapshin	7b957ddc98	[Debuginfo][NFC] removes redundant semicolon.	2019-11-21 16:16:24 +03:00
Pavel Labath	a03435ec8e	Recommit "[DWARF] Add an api to get "interpreted" location lists" This recommits `089c0f5814`, which was reverted due to failing tests on big endian machines. It includes a fix which I believe (I don't have BE machine) should fix this issue. The fix consists of correcting the invocation DWARFYAML::EmitDebugSections, which was missing one (default) function arguments, and so didn't actually force the little-endian mode. The original commit message follows. Summary: This patch adds DWARFDie::getLocations, which returns the location expressions for a given attribute (typically DW_AT_location). It handles both "inline" locations and references to the external location list sections (currently only of the DW_FORM_sec_offset type). It is implemented on top of DWARFUnit::findLoclistFromOffset, which is also added in this patch. I tried to make their signatures similar to the equivalent range list functionality. The actual location list interpretation logic is in DWARFLocationTable::visitAbsoluteLocationList. This part is not equivalent to the range list code, but this deviation is motivated by a desire to reuse the same location list parsing code within lldb. The functionality is tested via a c++ unit test of the DWARFDie API. Reviewers: dblaikie, JDevlieghere, SouraVX Subscribers: mgorny, hiraditya, cmtice, probinson, llvm-commits, aprantl Tags: #llvm Differential Revision: https://reviews.llvm.org/D70394	2019-11-20 16:24:11 +01:00
Pavel Labath	72d2929c52	Revert "[DWARF] Add an api to get "interpreted" location lists" The test fails on big endian machines. This reverts commit `089c0f5814` and the subsequent attempt to fix in `82dc32e2d4`.	2019-11-20 15:15:22 +01:00
Pavel Labath	089c0f5814	[DWARF] Add an api to get "interpreted" location lists Summary: This patch adds DWARFDie::getLocations, which returns the location expressions for a given attribute (typically DW_AT_location). It handles both "inline" locations and references to the external location list sections (currently only of the DW_FORM_sec_offset type). It is implemented on top of DWARFUnit::findLoclistFromOffset, which is also added in this patch. I tried to make their signatures similar to the equivalent range list functionality. The actual location list interpretation logic is in DWARFLocationTable::visitAbsoluteLocationList. This part is not equivalent to the range list code, but this deviation is motivated by a desire to reuse the same location list parsing code within lldb. The functionality is tested via a c++ unit test of the DWARFDie API. Reviewers: dblaikie, JDevlieghere, SouraVX Subscribers: mgorny, hiraditya, cmtice, probinson, llvm-commits, aprantl Tags: #llvm Differential Revision: https://reviews.llvm.org/D70394	2019-11-20 13:25:18 +01:00
Pavel Labath	39285a0f02	Add streaming/equality operators to DWARFAddressRange/DWARFLocationExpression The main motivation for this is being able to write simpler assertions and get better error messages in unit tests. Split off from D70394.	2019-11-19 10:34:30 +01:00
Pavel Labath	dca2b36ba0	Re-commit "DWARF location lists: Add section index dumping" This reapplies `c0f6ad7d1f` with an additional fix in test/DebugInfo/X86/constant-loclist.ll, which had a slightly different output on windows targets. The test now accounts for this difference. The original commit message follows. Summary: As discussed in D70081, this adds the ability to dump section names/indices to the location list dumper. It does this by moving the range specific logic from DWARFDie.cpp:dumpRanges into the DWARFAddressRange class. The trickiest part of this patch is the backflip in the meanings of the two dump flags for the location list sections. The dumping of "raw" location list data is now controlled by "DisplayRawContents" flag. This frees up the "Verbose" flag to be used to control whether we print the section index. Additionally, the DisplayRawContents flag is set for section-based dumps whenever the --verbose option is passed, but this is not done for the "inline" dumps. Also note that the index dumping currently does not work for the DWARF v5 location lists, as the parser does not fill out the appropriate fields. This will be done in a separate patch. Reviewers: dblaikie, probinson, JDevlieghere, SouraVX Subscribers: sdardis, hiraditya, jrtc27, atanasyan, arphaman, aprantl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70227	2019-11-18 15:30:10 +01:00
Simon Pilgrim	c070a27acc	Revert rGc0f6ad7d1f3c : "DWARF location lists: Add section index dumping" This reverts commit `c0f6ad7d1f` to fix the buildbots.	2019-11-18 13:26:51 +00:00
Pavel Labath	c0f6ad7d1f	DWARF location lists: Add section index dumping Summary: As discussed in D70081, this adds the ability to dump section names/indices to the location list dumper. It does this by moving the range specific logic from DWARFDie.cpp:dumpRanges into the DWARFAddressRange class. The trickiest part of this patch is the backflip in the meanings of the two dump flags for the location list sections. The dumping of "raw" location list data is now controlled by "DisplayRawContents" flag. This frees up the "Verbose" flag to be used to control whether we print the section index. Additionally, the DisplayRawContents flag is set for section-based dumps whenever the --verbose option is passed, but this is not done for the "inline" dumps. Also note that the index dumping currently does not work for the DWARF v5 location lists, as the parser does not fill out the appropriate fields. This will be done in a separate patch. Reviewers: dblaikie, probinson, JDevlieghere, SouraVX Subscribers: sdardis, hiraditya, jrtc27, atanasyan, arphaman, aprantl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70227	2019-11-18 10:50:22 +01:00
David Blaikie	77cfcd7509	DebugInfo: Use loclistx for DWARFv5 location lists to reduce the number of relocations This only implements the non-dwo part, but loclistx is necessary to use location lists in DWARFv5, so it's a precursor to that work - and generally reduces relocations (only using one reloc, then indexes/relative offsets for all location list references) in non-split DWARF.	2019-11-15 18:51:13 -08:00
David Blaikie	d295087639	DebugInfo: Templatize rnglist header parsing to setup for reuse with loclist header parsing	2019-11-15 16:23:02 -08:00
Pavel Labath	0908093977	DWARFDebugLoc(v4): Add an incremental parsing function Summary: This adds a visitLocationList function to the DWARF v4 location lists, similar to what already exists for DWARF v5. It follows the approach outlined in previous patches (D69672), where the parsed form is always stored in the DWARF v5 format, which makes it easier for generic code to be built on top of that. v4 location lists are "upgraded" during parsing, and then this upgrade is undone while dumping. Both "inline" and section-based dumping is rewritten to reuse the existing "generic" location list dumper. This means that the output format is consistent for all location lists (the only thing one needs to implement is the function which prints the "raw" form of a location list), and that debug_loc dumping correctly processes base address selection entries, etc. The previous existing debug_loc functionality (e.g., parseOneLocationList) is rewritten on top of the new API, but it is not removed as there is still code which uses them. This will be done in follow-up patches, after I build the API to access the "interpreted" location lists in a generic way (as that is what those users really want). Reviewers: dblaikie, probinson, JDevlieghere, aprantl, SouraVX Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69847	2019-11-15 13:38:00 +01:00
Pavel Labath	eafe0cf5fa	DWARFDebugLoclists: stricter base address handling Summary: This removes the use of zero as a base address in section-based dumping. Although this will often be true for (unlinked) object files with a single compile unit, it is not true in general. This means that section-based dumping will not be able to resolve entries referencing the base address (DW_LLE_offset_pair) -- it wasn't able to do that correctly before either, but now it will be more explicit about it. One exception to that is if the location list contains an explicit DW_LLE_base_address entry -- in this case the dumper will pick it up, and resolve subsequent entries normally. The patch also removes the fallback to zero in the "inline" dumping in case the compile unit does not contain a base address. Reviewers: dblaikie, probinson, JDevlieghere, aprantl, SouraVX Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70115	2019-11-14 10:01:48 +01:00
Pavel Labath	1eea3fa063	DWARFDebugLoclists: Add an api to get the location lists of a DWARF unit Summary: This avoid the need to duplicate the location lists searching logic in various users. The "inline location list dumping" code (which is the only user actually updated to handle DWARF v5 location lists) is switched to this method. After adding v4 location list support, I'll switch other users too. Reviewers: dblaikie, probinson, JDevlieghere, aprantl, SouraVX Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70084	2019-11-13 16:26:16 +01:00
Pavel Labath	ebe2f56030	DWARFDebugLoclists: add location list "interpretation" logic Summary: This patch extracts the logic for computing the "absolute" locations, which was partially present in the debug_loclists dumper, completes it, and moves it into a separate function. This makes it possible to later reuse the same logic for uses other than dumping. The dumper is changed to reuse the location list interpreter, and its format is changed somewhat. In "verbose" mode it prints the "raw" value of a location list, the interpreted location (if available) and the expression itself. In non-verbose mode it prints only one of the location forms: it prefers the interpreted form, but falls back to the "raw" format if interpretation is not possible (for instance, because we were not given a base address, or the resolution of indirect addresses failed). This patch also undos some of the changes made in D69672, namely the part about making all functions static. The main reason for this is that I learned that the original approach (dumping only fully resolved locations) meant that it was impossible to rewrite one of the existing tests. To make that possible (and make the "inline location" dump work in more cases), I now reuse the same dumping mechanism as is used for section-based dumping. As this required having more objects know about the various location lists classes, it seemed like a good idea to create an interface abstracting the difference between them. Therefore, I now create a DWARFLocationTable class, which will serve as a base class for the location list classes. DWARFDebugLoclists is made to inherit from that. DWARFDebugLoc will follow. Another positive effect of this change is that section-based dumping code will not need to use templates (as originally) envisioned, and that the argument lists of the dumping functions become shorter. Reviewers: dblaikie, probinson, JDevlieghere, aprantl, SouraVX Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70081	2019-11-12 10:40:13 +01:00
Fangrui Song	644de3b96e	[PDB] Make pdb::DbiModuleDescriptor destructor trivial	2019-11-11 21:26:26 -08:00
David Blaikie	39c308f6b8	DebugInfo: Use separate macinfo contributions for each CU The macinfo support was broken for LTO situations, by terminating macinfo lists only once - multiple macinfo contributions were correctly labeled, but they all continued/flowed into later contributions until only one terminator appeared at the end of the section. Correctly terminate each contribution & fix the parsing to handle this situation too. The parsing fix is also necessary for dumping linked binaries - the previous code would stop at the end of the first contribution - missing all later contributions in a linked binary. It'd be nice to improve the dumping to print the offsets of each contribution so it'd be easier to know which CU AT_macro_info refers to which macinfo contribution.	2019-11-08 13:27:00 -08:00
Pavel Labath	e1f8c8a16f	DWARFDebugLoclists: Move to a incremental parsing model Summary: This patch stems from the discussion D68270 (including some offline talks). The idea is to provide an "incremental" api for parsing location lists, which will avoid caching or materializing parsed data. An additional goal is to provide a high level location list api, which abstracts the differences between different encoding schemes, and can be used by users which don't care about those (such as LLDB). This patch implements the first part. It implements a call-back based "visitLocationList" api. This function parses a single location list, calling a user-specified callback for each entry. This is going to be the base api, which other location list functions (right now, just the dumping code) are going to be based on. Future patches will do something similar for the v4 location lists, and add a mechanism to translate raw entries into concrete address ranges. Reviewers: dblaikie, probinson, JDevlieghere, aprantl, SouraVX Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69672	2019-11-06 16:25:06 +01:00
Pavel Labath	b4c5b8f3f5	DWARFDebugLoclists: Make it possible to read relocated addresses Summary: Handling relocations was not needed when the loclists section was a DWO-only thing. But since DWARF5, it is possible to use it in regular objects too, and the standard permits embedding addresses into the section directly. These addresses need to be relocated in unlinked files. Reviewers: JDevlieghere, dblaikie, probinson Subscribers: aprantl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68271	2019-11-05 10:21:39 +01:00
Reid Kleckner	22d41ba024	Fix -Wsign-compare warning with clang-cl off_t apparently is just "long" on Win64, which is 32-bits, and therefore not long enough to compare with UINT32_MAX. Use auto to follow the surrounding code. uint64_t would also be fine.	2019-10-30 15:20:43 -07:00
Benjamin Kramer	bfa3f0c316	Hide implementation details in anonymous namespaces. NFC.	2019-10-24 10:48:43 +02:00
George Rimar	78d632d105	[LLVMDebugInfoPDB] - Use cantFail() instead of assert(). Currently injected-sources-native.test fails with "Expected<T> value was in success state. (Note: Expected<T> values in success mode must still be checked prior to being destroyed)" when llvm is compiled with LLVM_ENABLE_ABI_BREAKING_CHECKS in Release. The problem is that getStringForID returns Expected<StringRef> and Expected value must always be checked, even if it is in success state. Checking with assert only helps in Debug and is wrong. Differential revision: https://reviews.llvm.org/D69251 llvm-svn: 375492	2019-10-22 08:52:45 +00:00
George Rimar	2bf01dcbaa	[llvm/Object] - Make ELFObjectFile::getRelocatedSection return Expected<section_iterator> It returns just a section_iterator currently and have a report_fatal_error call inside. This change adds a way to return errors and handle them on caller sides. The patch also changes/improves current users and adds test cases. Differential revision: https://reviews.llvm.org/D69167 llvm-svn: 375408	2019-10-21 11:06:38 +00:00
Zinovy Nis	5b8546023f	Fix minor warning in DWARFVerifier. llvm-svn: 375357	2019-10-20 07:55:50 +00:00
Martin Storsjo	a4f6b59846	[Symbolize] Use the local MSVC C++ demangler instead of relying on dbghelp. NFC. This allows making a couple llvm-symbolizer tests run in all environments. Differential Revision: https://reviews.llvm.org/D68133 llvm-svn: 375041	2019-10-16 20:38:44 +00:00
David Blaikie	be744ea54f	DebugInfo: Remove unnecessary/mistaken inclusion of Bitcode/BitcodeAnalyzer.h Introduced in r374582, Michael Spencer pointed out this broke the modules build due to a missing tblgen dependency on llvm/IR/Attributes.inc. Michael fixed the dependency in r374827. So this removes the inclusion and the new dependency (effectively reverting r374827 and including the alternative fix of removing rather than supporting the new dependency). Thanks for the quick fix/notice, Michael! llvm-svn: 374831	2019-10-14 22:12:45 +00:00
Michael J. Spencer	9585d8c11a	[Modules Build] Add missing dependency. A previous commit made libLLVMDebugInfoDWARF depend on the LLVM_Bitcode module which depends on the LLVM_intrinsic_gen module which depends on "llvm/IR/Attributes.inc" which is a generated header not depended on by libLLVMDebugInfo. Add that dependency. llvm-svn: 374827	2019-10-14 21:53:51 +00:00
David Blaikie	c8e5b90ba6	DebugInfo: Fix msan use-of-uninitialized exposed by r374600 llvm-svn: 374619	2019-10-12 00:27:12 +00:00
David Blaikie	f358c3d371	llvm-dwarfdump: Add verbose printing for debug_loclists llvm-svn: 374582	2019-10-11 19:06:35 +00:00
Zachary Turner	02c5386811	[PDB] Fix bug when using multiple PCH header objects with the same name. A common pattern in Windows is to have all your precompiled headers use an object named stdafx.obj. If you've got a project with many different static libs, you might use a separate PCH for each one of these. During the final link step, a file from A might reference the PCH object from A, but it will have the same name (stdafx.obj) as any other PCH from another project. The only difference will be the path. For example, A might be A/stdafx.obj while B is B/stdafx.obj. The existing algorithm checks only the filename that was passed on the command line (or stored in archive), but this is insufficient in the case where relative paths are used, because depending on the command line object file / library order, it might find the wrong PCH object first resulting in a signature mismatch. The fix here is to simply check whether the absolute path of the PCH object (which is stored in the input obj file for the file that references the PCH) ends with the full relative path of whatever is specified on the command line (or is in the archive). Differential Revision: https://reviews.llvm.org/D66431 llvm-svn: 374442	2019-10-10 20:25:51 +00:00
Nico Weber	cae2662104	Fix Windows build after r374381 llvm-svn: 374413	2019-10-10 18:20:16 +00:00
Reid Kleckner	f05ed6601f	Remove strings.h include to fix GSYM Windows build Fifth time's the charm. llvm-svn: 374411	2019-10-10 18:17:24 +00:00
Greg Clayton	4ae13e2a7a	Unbreak buildbots. llvm-svn: 374410	2019-10-10 18:13:13 +00:00
Greg Clayton	d665bfcf7c	Fix buildbots by using memset instead of bzero. llvm-svn: 374409	2019-10-10 18:11:49 +00:00
Michael Liao	a121891a55	Fix build by adding the missing dependency. llvm-svn: 374406	2019-10-10 18:04:52 +00:00
Greg Clayton	4c145df6a7	Unbreak llvm-clang-lld-x86_64-scei-ps4-windows10pro-fast buildbot. llvm-svn: 374398	2019-10-10 17:52:33 +00:00
Greg Clayton	6a2eff1e68	Unbreak windows buildbots. llvm-svn: 374396	2019-10-10 17:49:33 +00:00
Greg Clayton	4b6c9de868	Add GsymCreator and GsymReader. This patch adds the ability to create GSYM files with GsymCreator, and read them with GsymReader. Full testing has been added for both new classes. This patch differs from the original patch https://reviews.llvm.org/D53379 in that is uses a StringTableBuilder class from llvm instead of a custom version. Support for big and little endian files has been added. If the endianness matches the current host, we use efficient extraction for the header, address table and address info offset tables. Differential Revision: https://reviews.llvm.org/D68744 llvm-svn: 374381	2019-10-10 17:10:11 +00:00
David Blaikie	411497c6c7	llvm-dwarfdump: Support multiple debug_loclists contributions Also fixing the incorrect "offset" field being computed/printed for each location list. llvm-svn: 374232	2019-10-09 21:25:28 +00:00
David Blaikie	746174706b	DebugInfo: Shot in the dark attempt to fix ubsan error from r374122 (specifying an underlying type for the enum might also be suitable - but this seems better/as good, since there's a clear expectation this can contain values other than the actual enumerators of this enum) llvm-svn: 374196	2019-10-09 18:37:13 +00:00
Hans Wennborg	1e1e3ba252	Unify the two CRC implementations David added the JamCRC implementation in r246590. More recently, Eugene added a CRC-32 implementation in r357901, which falls back to zlib's crc32 function if present. These checksums are essentially the same, so having multiple implementations seems unnecessary. This replaces the CRC-32 implementation with the simpler one from JamCRC, and implements the JamCRC interface in terms of CRC-32 since this means it can use zlib's implementation when available, saving a few bytes and potentially making it faster. JamCRC took an ArrayRef<char> argument, and CRC-32 took a StringRef. This patch changes it to ArrayRef<uint8_t> which I think is the best choice, and simplifies a few of the callers nicely. Differential revision: https://reviews.llvm.org/D68570 llvm-svn: 374148	2019-10-09 09:06:30 +00:00
David Blaikie	5841e9af1d	DebugInfo: Move LLE enum handling to .def to match RLE handling llvm-svn: 374122	2019-10-08 21:48:46 +00:00
Martin Storsjo	b8f790234f	Revert "[Symbolize] Use the local MSVC C++ demangler instead of relying on dbghelp. NFC." This reverts SVN r373698, as it broke sanitizer tests, e.g. in http://lab.llvm.org:8011/builders/sanitizer-windows/builds/52441. llvm-svn: 373701	2019-10-04 07:22:37 +00:00
Martin Storsjo	1ca074b86a	[Symbolize] Use the local MSVC C++ demangler instead of relying on dbghelp. NFC. This allows making a couple llvm-symbolizer tests run in all environments. Differential Revision: https://reviews.llvm.org/D68133 llvm-svn: 373698	2019-10-04 07:05:42 +00:00
David Blaikie	5ca306666c	DebugInfo: Add parsing support for debug_loc base address specifiers llvm-svn: 373278	2019-10-01 00:29:13 +00:00
Pavel Labath	aaff1a631a	MCRegisterInfo: Merge getLLVMRegNum and getLLVMRegNumFromEH Summary: The functions different in two ways: - getLLVMRegNum could return both "eh" and "other" dwarf register numbers, while getLLVMRegNumFromEH only returned the "eh" number. - getLLVMRegNum asserted if the register was not found, while the second function returned -1. The second distinction was pretty important, but it was very hard to infer that from the function name. Aditionally, for the use case of dumping dwarf expressions, we needed a function which can work with both kinds of number, but does not assert. This patch solves both of these issues by merging the two functions into one, returning an Optional<unsigned> value. While the same thing could be achieved by adding an "IsEH" argument to the (renamed) getLLVMRegNumFromEH function, it seemed better to avoid the confusion of two functions and put the choice of asserting into the hands of the caller -- if he checks the Optional value, he can safely process "untrusted" input, and if he blindly dereferences the Optional, he gets the assertion. I've updated all call sites to the new API, choosing between the two options according to the function they were calling originally, except that I've updated the usage in DWARFExpression.cpp to use the "safe" method instead, and added a test case which would have previously triggered an assertion failure when processing (incorrect?) dwarf expressions. Reviewers: dsanders, arsenm, JDevlieghere Subscribers: wdng, aprantl, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67154 llvm-svn: 372710	2019-09-24 09:31:02 +00:00
Alexander Shaposhnikov	4fd11c1e45	[Object] Extend MachOUniversalBinary::getObjectForArch Make the method MachOUniversalBinary::getObjectForArch return MachOUniversalBinary::ObjectForArch and add helper methods MachOUniversalBinary::getMachOObjectForArch, MachOUniversalBinary::getArchiveForArch for those who explicitly expect to get a MachOObjectFile or an Archive. Differential revision: https://reviews.llvm.org/D67700 Test plan: make check-all llvm-svn: 372278	2019-09-19 00:02:12 +00:00
Greg Clayton	c6b156cbb8	GSYM: Add the llvm::gsym::Header header class with tests This patch adds the llvm::gsym::Header class which appears at the start of a stand alone GSYM file, or in the first bytes of the GSYM data in a GSYM section within a file. Added encode and decode methods with full error handling and full tests. Differential Revision: https://reviews.llvm.org/D67666 llvm-svn: 372149	2019-09-17 17:46:13 +00:00
Greg Clayton	b52650d57f	GSYM: add encoding and decoding to FunctionInfo This patch adds encoding and decoding of the FunctionInfo objects along with full error handling and tests. Full details of the FunctionInfo encoding format appear in the FunctionInfo.h header file. Differential Revision: https://reviews.llvm.org/D67506 llvm-svn: 372135	2019-09-17 16:15:49 +00:00
Simon Pilgrim	4f234aaf2c	[DebugInfo] Don't dereference a dyn_cast<PDBSymbolData> result. NFCI. The static analyzer is warning about a potential null dereference - but as we're in DataMemberLayoutItem we should be able to guarantee that the Symbol is a PDBSymbolData type, allowing us to use cast<PDBSymbolData> - and if not assert will fire for us. llvm-svn: 371933	2019-09-15 15:38:26 +00:00
David Blaikie	ffe5466c79	Add some missing changes to GSYM that was addressing a gcc compilation error due to a type and variable with the same name llvm-svn: 371681	2019-09-11 22:24:45 +00:00
Greg Clayton	7fcc2c2b5a	Add a LineTable class to GSYM and test it. This patch adds the ability to create a gsym::LineTable object, populate it, encode and decode it and test all functionality. The full format of the LineTable encoding is specified in the header file LineTable.h. Differential Revision: https://reviews.llvm.org/D66602 llvm-svn: 371657	2019-09-11 20:51:03 +00:00
David Bolvansky	5916799293	[GSYM][NFC] Fixed -Wdocumentation warning lib/DebugInfo/GSYM/InlineInfo.cpp:68:12: warning: parameter 'Inline' not found in the function declaration [-Wdocumentation] llvm-svn: 371125	2019-09-05 21:09:58 +00:00
Igor Kudrin	e46639620d	[DWARF] Fix referencing Range List Tables from CUs for DWARF64. As DW_AT_rnglists_base points after the header and headers have different sizes for DWARF32 and DWARF64, we have to use the format of the CU to adjust the offset correctly in order to extract the referenced range list table. The patch also changes the type of RangeSectionBase because in DWARF64 it is 8-bytes long. Differential Revision: https://reviews.llvm.org/D67098 llvm-svn: 371016	2019-09-05 07:02:28 +00:00
Igor Kudrin	991f0fb149	[DWARF] Support DWARF64 in DWARFListTableHeader. This enables 64-bit DWARF support for parsing range and location list tables. Differential Revision: https://reviews.llvm.org/D66643 llvm-svn: 371014	2019-09-05 06:49:05 +00:00
Greg Clayton	7d0a545ee6	Add encode and decode methods to InlineInfo and document encoding format to the GSYM file format. This patch adds the ability to encode and decode InlineInfo objects and adds test coverage. Error handling is introduced in the encoding and decoding which will be used from here on out for remaining patches. Differential Revision: https://reviews.llvm.org/D66600 llvm-svn: 370936	2019-09-04 17:32:51 +00:00
Pavel Labath	88b4e28a67	DWARF: Fix a regression in location list dumping Summary: While fixing the handling of some error cases, r370363 introduced new problems -- assertion failures due to unchecked errors (my excuse is that a very early version of that patch used Optional<T> instead of Expected). This patch adds proper handling of parsing errors encountered when dumping location lists from inside DWARF DIEs, and adds a bunch of additional tests. I reorder the arguments of the location list dumping functions to make them consistent, and also be able to dump the two kinds of location lists generically. Reviewers: JDevlieghere, dblaikie, probinson Subscribers: aprantl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67102 llvm-svn: 370868	2019-09-04 10:09:12 +00:00
Djordje Todorovic	5c6b82a756	[DWARFVerifier] Verify GNU extensions of call site DWARF symbols Verify that the call site DWARF symbols (added during the implementation of the debug entry values feature) are generated properly. Differential Revision: https://reviews.llvm.org/D66865 llvm-svn: 370631	2019-09-02 09:20:46 +00:00
Pavel Labath	bd546e5902	DWARFDebugLoc: Make parsing and error reporting more robust Summary: While examining this class for possible use in lldb, I noticed two things: - it spits out parsing errors directly to stderr - the loclists parser can incorrectly return valid location lists when parsing malformed (truncated) data I improve the stderr situation by making the parseOneLocationList functions return Expected<T>s. The errors are still dumped to stderr by their callers, so this is only a partial fix, but it is enough for my use case, as I intend to parse the locations lists one by one. I fix the behavior in the truncated scenario by using the newly introduced DataExtractor Cursor API. I also add tests for handling the error cases, as they currently have no coverage. Reviewers: dblaikie, JDevlieghere, probinson Subscribers: lldb-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63591 llvm-svn: 370363	2019-08-29 14:26:05 +00:00
Pavel Labath	b1f29cec25	Add error handling to the DataExtractor class Summary: This is motivated by D63591, where we realized that there isn't a really good way of telling whether a DataExtractor is reading actual data, or is it just returning default values because it reached the end of the buffer. This patch resolves that by providing a new "Cursor" class. A Cursor object encapsulates two things: - the current position/offset in the DataExtractor - an error object Storing the error object inside the Cursor enables one to use the same pattern as the std::{io}stream API, where one can blindly perform a sequence of reads and only check for errors once at the end of the operation. Similarly to the stream API, as soon as we encounter one error, all of the subsequent operations are skipped (return default values) too, even if the would suceed with clear error state. Unlike the std::stream API (but in line with other llvm APIs), we force the error state to be checked through usage of llvm::Error. Reviewers: probinson, dblaikie, JDevlieghere, aprantl, echristo Subscribers: kristina, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63713 llvm-svn: 370042	2019-08-27 11:24:08 +00:00
Nilanjana Basu	7da6f432d8	Removing block comments from CodeView records in assembly files & related code cleanup llvm-svn: 369860	2019-08-25 01:09:11 +00:00
Greg Clayton	bf9ee07afa	Add FileWriter to GSYM and encode/decode functions to AddressRange and AddressRanges The full GSYM patch started with: https://reviews.llvm.org/D53379 This patch add the ability to encode data using the new llvm::gsym::FileWriter class. FileWriter is a simplified binary data writer class that doesn't require targets, target definitions, architectures, or require any other optional compile time libraries to be enabled via the build process. This class needs the ability to seek to different spots in the binary data that it produces to fix up offsets and sizes in GSYM data. It currently uses std::ostream over llvm::raw_ostream because llvm::raw_ostream doesn't support seeking which is required when encoding and decoding GSYM data. AddressRange objects are encoded and decoded to be relative to a base address. This will be the FunctionInfo's start address if the AddressRange is directly contained in a FunctionInfo, or a base address of the containing parent AddressRange or AddressRanges. This allows address ranges to be efficiently encoded using ULEB128 encodings as we encode the offset and size of each range instead of full addresses. This also makes encoded addresses easy to relocate as we just need to relocate one base address. Differential Revision: https://reviews.llvm.org/D63828 llvm-svn: 369587	2019-08-21 21:48:11 +00:00
Nilanjana Basu	ac3851c434	Improving CodeView debug info type record's inline comments llvm-svn: 369533	2019-08-21 15:19:58 +00:00
Igor Kudrin	ed413074f2	[DWARF] Adjust return type of DWARFUnit::getLength(). DWARFUnitHeader::getLength() returns uint64_t. DWARFUnit::getLength() should do the same. Differential Revision: https://reviews.llvm.org/D66472 llvm-svn: 369529	2019-08-21 14:10:57 +00:00
Igor Kudrin	59d5abaa71	[DWARF] Fix reading 64-bit DWARF type units. The type_offset field is 8 bytes long in DWARF64. The patch extends TypeOffset to uint64_t and fixes its reading. The patch also fixes checking of TypeOffset bounds as it was inaccurate in DWARF64 case. Differential Revision: https://reviews.llvm.org/D66465 llvm-svn: 369378	2019-08-20 12:52:32 +00:00
Igor Kudrin	a33004aca7	Remove the temporary code. NFC. That should have been done in rL368156 but somehow was missed. llvm-svn: 369082	2019-08-16 03:40:04 +00:00
Jonas Devlieghere	de0ce98abe	[DebugLine] Don't try to guess the path style In r368879 I made an attempt to guess the path style from the files in the line table. After some consideration I now think this is a poor idea. This patch undoes that behavior and instead adds an optional argument to specify the path style. This allows us to make that decision elsewhere where we have more information. In case of LLDB based on the Unit. llvm-svn: 369072	2019-08-15 23:53:15 +00:00
Jonas Devlieghere	0eaee545ee	[llvm] Migrate llvm::make_unique to std::make_unique Now that we've moved to C++14, we no longer need the llvm::make_unique implementation from STLExtras.h. This patch is a mechanical replacement of (hopefully) all the llvm::make_unique instances across the monorepo. llvm-svn: 369013	2019-08-15 15:54:37 +00:00
Michael Pozulp	9abf668c08	[llvm-objdump] Add warning messages if disassembly + source for problematic inputs Summary: Addresses https://bugs.llvm.org/show_bug.cgi?id=41905 Reviewers: jhenderson, rupprecht, grimar Reviewed By: jhenderson, grimar Subscribers: RKSimon, MaskRay, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62462 llvm-svn: 368963	2019-08-15 05:15:22 +00:00
Jonas Devlieghere	c0a9b1edca	[DebugLine] Improve path handling. After switching over LLDB's line table parser to libDebugInfo, we noticed two regressions on the Windows bot. The problem is that when obtaining a file from the line table prologue, we append paths without specifying a path style. This leads to incorrect results on Windows for debug info containing Posix paths: 0x0000000000201000: /tmp\b.c, is_start_of_statement = TRUE This patch is an attempt to fix that by guessing the path style whenever possible. Differential revision: https://reviews.llvm.org/D66227 llvm-svn: 368879	2019-08-14 17:00:10 +00:00
George Rimar	bcc00e1afb	Recommit r368812 "[llvm/Object] - Convert SectionRef::getName() to return Expected<>" Changes: no changes. A fix for the clang code will be landed right on top. Original commit message: SectionRef::getName() returns std::error_code now. Returning Expected<> instead has multiple benefits. For example, it forces user to check the error returned. Also Expected<> may keep a valuable string error message, what is more useful than having a error code. (Object\invalid.test was updated to show the new messages printed.) This patch makes a change for all users to switch to Expected<> version. Note: in a few places the error returned was ignored before my changes. In such places I left them ignored. My intention was to convert the interface used, and not to improve and/or the existent users in this patch. (Though I think this is good idea for a follow-ups to revisit such places and either remove consumeError calls or comment each of them to clarify why it is OK to have them). Differential revision: https://reviews.llvm.org/D66089 llvm-svn: 368826	2019-08-14 11:10:11 +00:00
George Rimar	468919e182	Revert r368812 "[llvm/Object] - Convert SectionRef::getName() to return Expected<>" It broke clang BB: http://lab.llvm.org:8011/builders/clang-x86_64-debian-fast/builds/16455 llvm-svn: 368813	2019-08-14 08:56:55 +00:00
George Rimar	a0c6a35714	[llvm/Object] - Convert SectionRef::getName() to return Expected<> SectionRef::getName() returns std::error_code now. Returning Expected<> instead has multiple benefits. For example, it forces user to check the error returned. Also Expected<> may keep a valuable string error message, what is more useful than having a error code. (Object\invalid.test was updated to show the new messages printed.) This patch makes a change for all users to switch to Expected<> version. Note: in a few places the error returned was ignored before my changes. In such places I left them ignored. My intention was to convert the interface used, and not to improve and/or the existent users in this patch. (Though I think this is good idea for a follow-ups to revisit such places and either remove consumeError calls or comment each of them to clarify why it is OK to have them). Differential revision: https://reviews.llvm.org/D66089 llvm-svn: 368812	2019-08-14 08:46:54 +00:00
David Blaikie	0fcc1f7bac	DebugInfo/DWARF: Provide some (pretty half-hearted) error handling access when parsing units This isn't the most robust error handling API, but does allow clients to opt-in to getting Errors they can handle. I suspect the long-term solution would be to move away from the lazy unit parsing and have an explicit step that parses the unit and then allows access to the other APIs that require a parsed unit. llvm-dwarfdump could be expanded to use this (or newer/better API) to demonstrate the benefit of it - but for now lld will use this in a follow-up cl which ensures lld can exit non-zero on errors like this (& provide more descriptive diagnostics including which object file the error came from). (error access to later errors when parsing nested DIEs would be good too - but, again, exposing that without it being a hassle for every consumer may be tricky) llvm-svn: 368377	2019-08-09 01:14:33 +00:00
David Blaikie	5b9508396c	Remove else-after-return llvm-svn: 368364	2019-08-08 23:17:23 +00:00
David Blaikie	1b1f1d6677	DebugInfo/DWARF: Remove unused return type from DWARFUnit::extractDIEsIfNeeded llvm-svn: 368212	2019-08-07 21:31:33 +00:00
David Blaikie	353938ec68	Fix indentation llvm-svn: 368198	2019-08-07 19:09:31 +00:00
David Blaikie	90146cd8b9	DebugInfo/DWARF: Normalize DWARFObject members on the DWARF spec section names Some of these names were abbreviated, some were not, some pluralised, some not. Made the API difficult to use - since it's an exact 1:1 mapping to the DWARF sections - use those names (changing underscore separation for camel casing). llvm-svn: 368189	2019-08-07 17:18:11 +00:00
Igor Kudrin	45ee93323b	Remove support for 32-bit offsets in utility classes (5/5) Differential Revision: https://reviews.llvm.org/D65641 llvm-svn: 368156	2019-08-07 11:44:47 +00:00
Igor Kudrin	2836cf0b72	Try to unbreak buildbots after r368014 llvm-svn: 368018	2019-08-06 11:12:13 +00:00
Igor Kudrin	f26a70a5e7	Switch LLVM to use 64-bit offsets (2/5) This updates all libraries and tools in LLVM Core to use 64-bit offsets which directly or indirectly come to DataExtractor. Differential Revision: https://reviews.llvm.org/D65638 llvm-svn: 368014	2019-08-06 10:49:40 +00:00
Igor Kudrin	f5f35c5cd1	Support 64-bit offsets in utility classes (1/5) Using 64-bit offsets is required to fully implement 64-bit DWARF. As these classes are used in many different libraries they should temporarily support both 32- and 64-bit offsets. Differential Revision: https://reviews.llvm.org/D64006 llvm-svn: 368013	2019-08-06 10:47:20 +00:00
Peter Collingbourne	f0380bac5f	Silence ubsan after r367926. Fixes e.g. http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-bootstrap-ubsan/builds/14273 We can't left shift here because left shifting of a negative number is UB. The same doesn't apply to unsigned arithmetic, but switching to unsigned doesn't appear to stop ubsan from complaining, so we need to mask out the high bits. llvm-svn: 367959	2019-08-06 00:21:30 +00:00
Peter Collingbourne	a56d81f4fb	llvm-symbolizer: Untag addresses in object files by default. Any addresses that we pass to llvm-symbolizer are going to be untagged, while any HWASAN instrumented globals are going to be tagged in the symbol table. Therefore we need to untag the addresses before using them. Differential Revision: https://reviews.llvm.org/D65769 llvm-svn: 367926	2019-08-05 20:59:25 +00:00
Nilanjana Basu	da60fc813c	Changing representation of .cv_def_range directives in Codeview debug info assembly format for better readability llvm-svn: 367867	2019-08-05 14:16:58 +00:00
Nilanjana Basu	b5e4d7de17	Revert "Changing representation of .cv_def_range directives in Codeview debug info assembly format for better readability" This reverts commit `a885afa9fa`. llvm-svn: 367861	2019-08-05 13:55:21 +00:00
Nilanjana Basu	a885afa9fa	Changing representation of .cv_def_range directives in Codeview debug info assembly format for better readability llvm-svn: 367850	2019-08-05 13:11:51 +00:00
Michael Pozulp	3046ef5c11	Revert "[llvm-objdump] Re-commit r367284." This reverts r367776 (git commit `d34099926e`). My changes to llvm-objdump tests caused them to fail on windows: http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-windows10pro-fast/builds/27368 llvm-svn: 367816	2019-08-05 08:52:28 +00:00
Fangrui Song	db26488bf9	[DWARF] Change DWARFDebugLoc::Entry::Loc from SmallVector<char, 4> to SmallString<4> SmallString has a conversion to StringRef, which can be leveraged to simplify two use sites. llvm-svn: 367801	2019-08-05 06:33:52 +00:00
Michael Pozulp	d34099926e	[llvm-objdump] Re-commit r367284. Add warning messages if disassembly + source for problematic inputs Summary: Addresses https://bugs.llvm.org/show_bug.cgi?id=41905 Reviewers: jhenderson, rupprecht, grimar Reviewed By: jhenderson, grimar Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62462 llvm-svn: 367776	2019-08-04 06:04:00 +00:00
JF Bastien	748dac7389	Remove support for unsupported MSVC versions Re-land r367727 with the #if fixed. Reviewers: rnk, lebedev.ri Subscribers: hiraditya, jkorous, dexonsmith, lebedev.ri, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65662 llvm-svn: 367734	2019-08-02 23:09:01 +00:00
JF Bastien	21d01ea9b6	Revert "Remove support for unsupported MSVC versions" Mismatched preprocessor, I'll fix in a follow-up. llvm-svn: 367728	2019-08-02 22:02:25 +00:00
JF Bastien	dc8af80c19	Remove support for unsupported MSVC versions Reviewers: rnk, lebedev.ri Subscribers: hiraditya, jkorous, dexonsmith, lebedev.ri, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65662 llvm-svn: 367727	2019-08-02 21:52:35 +00:00
Eric Christopher	5fb56b1966	Temporarily Revert "Changing representation of cv_def_range directives in Codeview debug info assembly format for better readability" This is breaking bots and the author asked me to revert. This reverts commit 367704. llvm-svn: 367707	2019-08-02 19:10:37 +00:00
Nilanjana Basu	1c67521591	Changing representation of cv_def_range directives in Codeview debug info assembly format for better readability llvm-svn: 367704	2019-08-02 18:44:39 +00:00
Eric Christopher	5a00b0772a	Temporarily revert "Changes to improve CodeView debug info type record inline comments" due to a sanitizer failure. This reverts commit 367623. llvm-svn: 367640	2019-08-02 01:05:47 +00:00
Nilanjana Basu	ac7e5788ca	Changes to improve CodeView debug info type record inline comments Signed-off-by: Nilanjana Basu <nilanjana.basu87@gmail.com> llvm-svn: 367623	2019-08-01 22:05:14 +00:00
Djordje Todorovic	b9973f87c6	Reland "[DwarfDebug] Dump call site debug info" The build failure found after the rL365467 has been resolved. Differential Revision: https://reviews.llvm.org/D60716 llvm-svn: 367446	2019-07-31 16:51:28 +00:00
Michael Pozulp	074db9b8e9	Revert "[llvm-objdump] Add warning messages if disassembly + source for problematic inputs" This reverts r367284 (git commit `b1cbe51bdf`). My changes to LLVMSymbolizer caused a test to fail: http://lab.llvm.org:8011/builders/clang-ppc64be-linux-lnt/builds/29488 llvm-svn: 367286	2019-07-30 07:05:27 +00:00
Michael Pozulp	b1cbe51bdf	[llvm-objdump] Add warning messages if disassembly + source for problematic inputs Summary: Addresses https://bugs.llvm.org/show_bug.cgi?id=41905 Reviewers: jhenderson, rupprecht, grimar Reviewed By: jhenderson, grimar Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62462 llvm-svn: 367284	2019-07-30 05:28:26 +00:00
Igor Kudrin	3daefb0744	[DWARF][NFC] Add constants for reserved values of an initial length field. Differential Revision: https://reviews.llvm.org/D65039 llvm-svn: 366887	2019-07-24 11:34:29 +00:00
Jonas Devlieghere	f8552e67e9	[DWARF] Use 32-bit format specifier for offset This should fix PR42730. llvm-svn: 366859	2019-07-23 22:34:21 +00:00
Jonas Devlieghere	0e7ba06e82	[DWARF] Add more error handling to debug line parser. This patch exnteds the error handling in the debug line parser to get rid of the existing MD5 assertion. I want to reuse the debug line parser from LLVM in LLDB where we cannot crash on invalid input. Differential revision: https://reviews.llvm.org/D64544 llvm-svn: 366762	2019-07-22 23:23:34 +00:00
Nilanjana Basu	06b8fe8d03	Changes to emit CodeView debug info nested type records properly using MCStreamer directives llvm-svn: 366720	2019-07-22 18:22:55 +00:00
Hsiangkai Wang	18ccfadd46	[DebugInfo] Generate fixups as emitting DWARF .debug_frame/.eh_frame. It is necessary to generate fixups in .debug_frame or .eh_frame as relaxation is enabled due to the address delta may be changed after relaxation. There is an opcode with 6-bits data in debug frame encoding. So, we also need 6-bits fixup types. Differential Revision: https://reviews.llvm.org/D58335 llvm-svn: 366524	2019-07-19 02:03:34 +00:00
Hsiangkai Wang	657277e0f1	Revert "[DebugInfo] Generate fixups as emitting DWARF .debug_frame/.eh_frame." This reverts commit 17e3cbf5fe656483d9016d0ba9e1d0cd8629379e. llvm-svn: 366444	2019-07-18 15:06:50 +00:00
Hsiangkai Wang	e43ce1a958	[DebugInfo] Generate fixups as emitting DWARF .debug_frame/.eh_frame. It is necessary to generate fixups in .debug_frame or .eh_frame as relaxation is enabled due to the address delta may be changed after relaxation. There is an opcode with 6-bits data in debug frame encoding. So, we also need 6-bits fixup types. Differential Revision: https://reviews.llvm.org/D58335 llvm-svn: 366442	2019-07-18 14:47:34 +00:00
Alex Bradbury	44deaf7e54	[DWARF][RISCV] Add support for RISC-V relocations needed for debug info When code relaxation is enabled many RISC-V fixups are not resolved but instead relocations are emitted. This happens even for DWARF debug sections. Therefore, to properly support the parsing of DWARF debug info we need to be able to resolve RISC-V relocations. This patch adds: * Support for RISC-V relocations in RelocationResolver * DWARF support for two relocations per object file offset * DWARF changes to support relocations in more DIE fields The two relocations per offset change is needed because some RISC-V relocations (used for label differences) come in pairs. Relocations can also be emitted for DWARF fields where relocations were not yet evaluated. Adding relocation support for some of these fields is essencial. On the other hand, LLVM currently emits RISC-V relocations for fixups that could be safely evaluated, since they can never be affected by code relaxations. This patch also adds relocation support for the fields affected by those extraneous relocations (the DWARF unit entry Length, and the DWARF debug line entry TotalLength and PrologueLength), for testing purposes. Differential Revision: https://reviews.llvm.org/D62062 Patch by Luís Marques. llvm-svn: 366402	2019-07-18 05:22:55 +00:00
Nilanjana Basu	4e22770219	Changes to display code view debug info type records in hex format llvm-svn: 366390	2019-07-17 23:43:58 +00:00
Nico Weber	7bb5fc0583	llvm-pdbdump: Fix several smaller issues with injected source compression handling - getCompression() used to return a PDB_SourceCompression even though the docs for IDiaInjectedSource are explicit about the return value being compiler-dependent. Return an uint32_t instead, and make the printing code handle unknown values better by printing "Unknown" and the int value instead of not printing any compression. - Print compressed contents as hex dump, not as string. - Add compression type "DotNet", which is used (at least) by csc.exe, the C# compiler. Also add a lengthy comment describing the stream contents (derived from looking at the raw hex contents long enough to see the GUIDs, which led me to the roslyn and mono implementations for handling this). - The native injected source dumper was dumping the contents of the whole data stream -- but csc.exe writes a stream that's padded with zero bytes to the next 512 boundary, and the dia api doesn't display those padding bytes. So make NativeInjectedSource::getCode() do the same thing. Differential Revision: https://reviews.llvm.org/D64879 llvm-svn: 366386	2019-07-17 22:59:52 +00:00
Nilanjana Basu	6e4076699c	Adding inline comments to code view type record directives for better readability llvm-svn: 366372	2019-07-17 21:01:12 +00:00
Nico Weber	d100b5dd01	Teach `llvm-pdbutil pretty -native` about `-injected-sources` `pretty -native -injected-sources -injected-source-content` works with this patch, and produces identical output to the dia version. Differential Revision: https://reviews.llvm.org/D64428 llvm-svn: 366236	2019-07-16 18:04:26 +00:00
Igor Kudrin	f48bc01812	[DWARF] Fix the reserved values for unit length in DWARFDebugLine. The DWARF3 documentation had inconsistency concerning the reserved range for unit length values. The issue was fixed in DWARF4. Differential Revision: https://reviews.llvm.org/D64622 llvm-svn: 366190	2019-07-16 07:01:08 +00:00
Igor Kudrin	74c350af21	[DWARF] Fix an incorrect format specifier. This adjusts the format specifier because PCOffset is uint16_t. Differential Revision: https://reviews.llvm.org/D64620 llvm-svn: 366189	2019-07-16 06:56:10 +00:00
Igor Kudrin	860f7ec058	[DWARF] Simplify DWARFAttribute. NFC. The first argument in the constructor was ignored, and the remaining arguments were always passed as their defaults. Differential Revision: https://reviews.llvm.org/D64407 llvm-svn: 366188	2019-07-16 06:53:06 +00:00
Jonas Devlieghere	ca16d280f7	Re-land "[DebugInfo] Move function from line table to the prologue (NFC)" In LLDB, when parsing type units, we don't need to parse the whole line table. Instead, we only need to parse the "support files" from the line table prologue. To make that possible, this patch moves the respective functions from the LineTable into the Prologue. Because I don't think users of the LineTable should have to know that these files come from the Prologue, I've left the original methods in place, and made them redirect to the LineTable. Differential revision: https://reviews.llvm.org/D64774 llvm-svn: 366164	2019-07-16 01:21:25 +00:00
Jonas Devlieghere	01ee172e9e	Revert "[DebugInfo] Move function from line table to the prologue (NFC)" This broke LLD, which I didn't have enabled. llvm-svn: 366160	2019-07-16 00:59:04 +00:00
Jonas Devlieghere	509903e887	[DebugInfo] Move function from line table to the prologue (NFC) In LLDB, when parsing type units, we don't need to parse the whole line table. Instead, we only need to parse the "support files" from the line table prologue. To make that possible, this patch moves the respective functions from the LineTable into the Prologue. Because I don't think users of the LineTable should have to know that these files come from the Prologue, I've left the original methods in place, and made them redirect to the LineTable. Differential revision: https://reviews.llvm.org/D64774 llvm-svn: 366158	2019-07-16 00:37:17 +00:00
Nico Weber	ac6375d99d	Expand comment about how StringsToBuckets was computed, and add more entries The construction was explained in https://reviews.llvm.org/D44810?id=139526#inline-391999 but reading the code shouldn't require hunting down old reviews to understand it. The precomputed list was missing an entry for the empty list case, and one entry at the very end. (The current last entry is the last one where 3 * BucketCount fits in a signed int, but the reference implementation uses unsigneds as far as I can tell, so there's room for one more entry.) No behavior change for inputs seen in practice. Differential Revision: https://reviews.llvm.org/D64738 llvm-svn: 366107	2019-07-15 18:56:56 +00:00
Nico Weber	51a52b5893	PDB HashTable: Move TraitsT from class parameter to the methods that need it The traits object is only used by a few methods. Deserializing a hash table and walking it is possible without the traits object, so it shouldn't be required to build a dummy object for that use case. The TraitsT object used to be a function template parameter before r327647, this restores it to that state. This makes it clear that the traits object isn't needed at all in 1 of the current 3 uses of HashTable (and I am going to add another use that doesn't need it), and that the default PdbHashTraits isn't used outside of tests. While here, also re-enable 3 checks in the test that were commented out (which requires making HashTableInternals templated and giving FooBar an operator==). No intended behavior change. Differential Revision: https://reviews.llvm.org/D64640 llvm-svn: 365974	2019-07-12 23:30:55 +00:00
Nico Weber	13f7ddff17	Slightly simplify MappedBlockStream::createIndexedStream() calls All callers had a PDBFile object at hand, so call Pdb.createIndexedStream() instead, which pre-populates all the arguments (and returns nullptr for kInvalidStreamIndex). Also change safelyCreateIndexedStream() to only take the string index, and update callers. Make the method public and call it in two places that manually did the bounds checking before. No intended behavior change. Differential Revision: https://reviews.llvm.org/D64633 llvm-svn: 365936	2019-07-12 18:24:38 +00:00
Djordje Todorovic	0739ccd3b5	Revert "[DwarfDebug] Dump call site debug info" A build failure was found on the SystemZ platform. This reverts commit 9e7e73578e54cd22b3c7af4b54274d743b6607cc. llvm-svn: 365886	2019-07-12 09:45:12 +00:00
Nico Weber	96dff91998	Fix a few 'no newline at end of file' warnings that Xcode emits (Xcode even has a snazzy "Fix" button, but clicking that inserts two newlines. So close!) llvm-svn: 365789	2019-07-11 15:26:45 +00:00
Djordje Todorovic	01eaae6dd1	[DwarfDebug] Dump call site debug info Dump the DWARF information about call sites and call site parameters into debug info sections. The patch also provides an interface for the interpretation of instructions that could load values of a call site parameters in order to generate DWARF about the call site parameters. ([13/13] Introduce the debug entry values.) Co-authored-by: Ananth Sowda <asowda@cisco.com> Co-authored-by: Nikola Prica <nikola.prica@rt-rk.com> Co-authored-by: Ivan Baev <ibaev@cisco.com> Differential Revision: https://reviews.llvm.org/D60716 llvm-svn: 365467	2019-07-09 11:33:56 +00:00
Nilanjana Basu	faed8516e4	Changing CodeView debug info type record representation in assembly files to make it more human-readable & editable & fixing bug introduced in r364987 llvm-svn: 365417	2019-07-09 01:11:02 +00:00
Yuanfang Chen	5de4692cc7	Teach the symbolizer lib symbolize objects directly. Currently, the symbolizer lib can only symbolize a file on disk. This patch teaches the symbolizer lib to symbolize objects. llvm-objdump needs this to support archive disassembly with source info. https://bugs.llvm.org/show_bug.cgi?id=41871 Reviewed by: jhenderson, grimar, MaskRay Differential Revision: https://reviews.llvm.org/D63521 llvm-svn: 365376	2019-07-08 19:28:57 +00:00
Nilanjana Basu	c0b557744a	Revert Changing CodeView debug info type record representation in assembly files to make it more human-readable & editable This reverts r364982 (git commit `2082bf28eb`) llvm-svn: 364987	2019-07-03 00:51:49 +00:00
Nilanjana Basu	2082bf28eb	Changing CodeView debug info type record representation in assembly files to make it more human-readable & editable llvm-svn: 364982	2019-07-03 00:26:23 +00:00
Igor Kudrin	c310b1aaed	[DWARF] Simplify dumping of a .debug_addr section. This patch removes the part which tried to interpret addresses in that section as offsets and simplifies the remaining code. Differential Revision: https://reviews.llvm.org/D64020 llvm-svn: 364896	2019-07-02 09:57:28 +00:00
Fangrui Song	78ee2fbf98	Cleanup: llvm::bsearch -> llvm::partition_point after r364719 llvm-svn: 364720	2019-06-30 11:19:56 +00:00
Fangrui Song	493a120259	[DebugInfo] Simplify GSYM::AddressRange and GSYM::AddressRanges Delete unnecessary getters of AddressRange. Simplify AddressRange::size(): Start <= End check should be checked in an upper layer. Delete isContiguousWith() that doesn't make sense. Simplify AddressRanges::insert. Delete commented code. Fix it when more than 1 ranges are to be deleted. Delete trailing newline. llvm-svn: 364637	2019-06-28 10:06:11 +00:00
Fangrui Song	e662b6985a	[DebugInfo] GSYM cleanups after D63104/r364427 llvm-svn: 364634	2019-06-28 08:58:05 +00:00
Michael Liao	c5486b23bc	Correct the file path. NFC. llvm-svn: 364577	2019-06-27 19:05:46 +00:00
Djordje Todorovic	a0d45058eb	[DWARF] Handle the DW_OP_entry_value operand Add the IR and the AsmPrinter parts for handling of the DW_OP_entry_values DWARF operation. ([11/13] Introduce the debug entry values.) Co-authored-by: Ananth Sowda <asowda@cisco.com> Co-authored-by: Nikola Prica <nikola.prica@rt-rk.com> Co-authored-by: Ivan Baev <ibaev@cisco.com> Differential Revision: https://reviews.llvm.org/D60866 llvm-svn: 364542	2019-06-27 13:52:34 +00:00
Greg Clayton	208cce7500	Fix builbots after r364427. I was using an iterator that was equal to the end of a collection. llvm-svn: 364447	2019-06-26 16:22:58 +00:00
Michael Liao	68ea5fee21	Fix build in shared lib mode. - The newly added GSYM misses LLVMBuild.txt. Add a barely one to pass the build. llvm-svn: 364440	2019-06-26 15:46:48 +00:00
Greg Clayton	044776bf5d	Add GSYM utility files along with unit tests. The full GSYM patch started with: https://reviews.llvm.org/D53379 In that patch we wanted to split up getting GSYM into the LLVM code base so we are not committing too much code at once. This is a first in a series of patches where I only add the foundation classes along with complete unit tests. They provide the foundation for encoding and decoding a GSYM file. File entries are defined in llvm::gsym::FileEntry. This class splits the file up into a directory and filename represented by uniqued string table offsets. This allows all files that are referred to in a GSYM file to be encoded as 1 based indexes into a global file table in the GSYM file. Function information in stored in llvm::gsym::FunctionInfo. This object represents a contiguous address range that has a name and range with an optional line table and inline call stack information. Line table entries are defined in llvm::gsym::LineEntry. They store only address, file and line information to keep the line tables simple and allows the information to be efficiently encoded in a subsequent patch. Inline information is defined in llvm::gsym::InlineInfo. These structs store the name of the inline function, along with one or more address ranges, and the file and line that called this function. They also contain any child inline information. There are also utility classes for address ranges in llvm::gsym::AddressRange, and string table support in llvm::gsym::StringTable which are simple classes. The unit tests test all the APIs on these simple classes so they will be ready for the next patches where we will create GSYM files and parse GSYM files. Differential Revision: https://reviews.llvm.org/D63104 llvm-svn: 364427	2019-06-26 14:09:09 +00:00
Peter Collingbourne	9c8282a9b3	llvm-symbolizer: Add a FRAME command. This command prints a description of the referenced function's stack frame. For each formal parameter and local variable, the tool prints: - function name - variable name - file/line of declaration - FP-relative variable location (if available) - size in bytes - HWASAN tag offset This information will be used by the HWASAN runtime to identify local variables in UAR reports. Differential Revision: https://reviews.llvm.org/D63468 llvm-svn: 364225	2019-06-24 20:03:23 +00:00
Fangrui Song	22e478f054	[Symbolize] Avoid lifetime extension and simplify std::map find/insert. NFC llvm-svn: 364025	2019-06-21 11:05:26 +00:00
Fangrui Song	dc8de6037c	Simplify std::lower_bound with llvm::{bsearch,lower_bound}. NFC llvm-svn: 364006	2019-06-21 05:40:31 +00:00
Fangrui Song	102b1efd53	[llvm-dwarfdump] --gdb-index: fix uninitialized TuListOffset The test only checks the existence of the `Types CU list` line. Unfortunately I can't make a better test because {gcc,clang} -fuse-ld={lld,gold} --gdb-index do not give me a non-empty types CU list. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D63537 llvm-svn: 363800	2019-06-19 13:51:29 +00:00
Peter Collingbourne	0feb6e52f1	Symbolize: Remove dead code. NFCI. The only caller of SymbolizableObjectFile::create passes a non-null DebugInfoContext and asserts that they do so. Move the assert into SymbolizableObjectFile::create and remove null checks. Differential Revision: https://reviews.llvm.org/D63298 llvm-svn: 363334	2019-06-13 22:49:34 +00:00
Amy Huang	9970817c57	Deduplicate S_CONSTANTs in LLD. Summary: Deduplicate S_CONSTANTS when linking, if they have the same value. Reviewers: rnk Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63151 llvm-svn: 363089	2019-06-11 18:02:39 +00:00
Peter Collingbourne	e5bdedac9d	Symbolize: Make DWPName a symbolizer option instead of an argument to symbolize{,Inlined}Code. This makes the interface simpler and more consistent with the interface for .dSYM files and fixes a bug where llvm-symbolizer would not read the dwp if it was asked to symbolize data before symbolizing code. Differential Revision: https://reviews.llvm.org/D63114 llvm-svn: 363025	2019-06-11 02:32:27 +00:00
Dylan McKay	038e3b9f57	Extend the DWARFExpression address handling to support 16-bit addresses This allows the DWARFExpression class to handle addresses without crashing on targets with 16-bit pointers like AVR. This is required in order to generate assembly from clang via the '-S' flag. This fixes an error with the following message: clang: llvm/include/llvm/DebugInfo/DWARF/DWARFExpression.h:132: llvm::DWARFExpression::DWARFExpression(llvm::DataExtractor, uint16_t, uint8_t): Assertion `AddressSize == 8 \|\| AddressSize == 4' failed. llvm-svn: 362290	2019-06-01 09:18:26 +00:00
Tom Tan	eb4d6142dc	[COFF, ARM64] Add CodeView register mapping CodeView has its own register map which is defined in cvconst.h. Missing this mapping before saving register to CodeView causes debugger to show incorrect value for all register based variables, like variables in register and local variables addressed by register (stack pointer + offset). This change added mapping between LLVM register and CodeView register so the correct register number will be stored to CodeView/PDB, it aso fixed the mapping from CodeView register number to register name based on current CPUType but print PDB to yaml still assumes X86 CPU and needs to be fixed. Differential Revision: https://reviews.llvm.org/D62608 llvm-svn: 362280	2019-05-31 23:43:31 +00:00
David Blaikie	a17564c2f1	llvm-dwarfdump: Don't error on mixed units using/not using str_offsets This lead to errors when dumping binaries with v4 and v5 units linked together (but could've also errored on v5 units that did/didn't use str_offsets). Also improves error handling and messages around invalid str_offsets contributions. llvm-svn: 361683	2019-05-25 00:07:22 +00:00
Jonas Devlieghere	0da8160df3	[dwarfdump] Add flag to limit the number of parents DIEs This adds `-parent-recurse-depth` which limits the number of parent DIEs being dumped. Differential revision: https://reviews.llvm.org/D62359 llvm-svn: 361671	2019-05-24 21:11:28 +00:00
David Blaikie	fc302c2b7f	dwarfdump: Deterministically... determine whether parsing a DWARF32 or DWARF64 str_offsets header Rather than trying one and then the other - use the kind of the CU to select which kind of header to parse. llvm-svn: 361589	2019-05-24 01:41:58 +00:00
David Blaikie	79872a88a0	dwarfdump: Add a bit more DWARF64 support This test case was incorrect because it mixed DWARF32 and DWARF64 for a single unit (DWARF32 unit referencing a DWARF64 str_offsets section). So fix enough of the unit parsing for DWARF64 and make the test valid. (not sure if anyone needs DWARF64 support though - support in libDebugInfoDWARF has been added piecemeal and LLVM doesn't produce it at all) llvm-svn: 361582	2019-05-24 01:05:52 +00:00
Galina Kistanova	ed49f6d8e6	Reverted r361134 because of a failing test left unattended for a long time. http://lab.llvm.org:8011/builders/llvm-clang-x86_64-expensive-checks-win/builds/17792/steps/test-check-all/logs/stdio Failing Tests (1): LLVM :: CodeGen/AMDGPU/regbank-reassign.mir llvm-svn: 361430	2019-05-22 20:42:56 +00:00
Nick Desaulniers	bf940622c8	[DWARF] hoist nullptr checks. NFC Summary: This was flagged in https://www.viva64.com/en/b/0629/ under "Snippet No. 15" (see under #13). It looks like PVS studio flags nullptr checks where the ptr is used inbetween creation and checking against nullptr. Reviewers: JDevlieghere, probinson Reviewed By: JDevlieghere Subscribers: RKSimon, hiraditya, llvm-commits, srhines Tags: #llvm Differential Revision: https://reviews.llvm.org/D62118 llvm-svn: 361176	2019-05-20 16:58:59 +00:00
Fangrui Song	68774edcd6	Use llvm::sort. NFC llvm-svn: 361134	2019-05-20 10:18:35 +00:00
Fangrui Song	e183340c29	Recommit [Object] Change object::SectionRef::getContents() to return Expected<StringRef> r360876 didn't fix 2 call sites in clang. Expected<ArrayRef<uint8_t>> may be better but use Expected<StringRef> for now. Follow-up of D61781. llvm-svn: 360892	2019-05-16 13:24:04 +00:00
Hans Wennborg	4da9ff9fcf	Revert r360876 "[Object] Change object::SectionRef::getContents() to return Expected<StringRef>" It broke the Clang build, see llvm-commits thread. > Expected<ArrayRef<uint8_t>> may be better but use Expected<StringRef> for now. > > Follow-up of D61781. llvm-svn: 360878	2019-05-16 12:08:34 +00:00
Fangrui Song	a076ec54be	[Object] Change object::SectionRef::getContents() to return Expected<StringRef> Expected<ArrayRef<uint8_t>> may be better but use Expected<StringRef> for now. Follow-up of D61781. llvm-svn: 360876	2019-05-16 11:33:48 +00:00
Reid Kleckner	7c438c5b07	[codeview] Finish support for reading and writing S_ANNOTATION records Implement dumping via llvm-pdbutil and llvm-readobj. llvm-svn: 360813	2019-05-15 20:53:39 +00:00
David Blaikie	7598b71488	DebugInfo: Only move types out of type units if they're named or type united Follow up to r359122, after a bug was reported in it - the original change too aggressively tried to move related types out of type units, which included unnamed types (like array types) which can't reasonably be declared-but-not-defined. A step beyond that is that some types in type units can be anonymous, if they are types with a name for linkage purposes (eg: "typedef struct { } x;"). So ensure those don't get turned into plain declarations (without signatures) because, lacking names, they can't be resolved to the definition. [Also include a fix for llvm-dwarfdump/libDebugInfoDWARF to pretty print types in type units] llvm-svn: 360458	2019-05-10 19:15:29 +00:00
David Blaikie	12faa0d44b	DebugInfo/DWARF: Minor expression simplification llvm-svn: 360377	2019-05-09 21:23:40 +00:00
Simon Pilgrim	2a09a6cfe2	[DebugInfo] Fix use-after-move warning. NFCI. Don't rely on DWARFAbbreviationDeclarationSet::extract cleaning the struct up for reuse - the analyzers don't like it. llvm-svn: 360235	2019-05-08 10:09:57 +00:00
Fangrui Song	7e55672b22	DWARF v5: fix directory index in the line table Summary: Prior to DWARF v5, a directory index of 0 represents DW_AT_comp_dir. In DWARF v5, the index starts with 0 and Entry.DirIdx is the index into Prologue.IncludeDirectories. Reviewed By: labath Differential Revision: https://reviews.llvm.org/D61253 llvm-svn: 360015	2019-05-06 08:03:46 +00:00
Nico Weber	e577be4ed1	[PDB] Fix hash function used to write /src/headerblock lld-link used to write PDB files that DIA couldn't recover natvis files from if: - The global strings table was > 64kiB - There were at least 3 natvis files The cause was that the hash function for the /src/headerblock stream was incorrect: It needs to be truncated to 16 bit. If the global strings table was <= 64kiB, truncating to 16 bit is a no-op, so this wasn't needed for small programs. If there are only 1 or 2 natvis files, then the growth strategy in HashTable::grow() would mean the hash table would have 2 buckets (for 1 natvis file) or 4 buckets (for 4 natvis files), and since the hash function is used modulo number of buckets, and since 2 and 4 divide 0x10000, the missing `% 0x10000` is a no-op there too. For 3 natvis files, the hash table grows to 6 buckets, which has a factor that's not common with 0x10000 and the difference starts to matter. Fixes PR41626. Differential Revision: https://reviews.llvm.org/D61277 llvm-svn: 359515	2019-04-29 23:09:35 +00:00
Fangrui Song	97b8cd54ad	[DWARF] Fix dump of local/foreign TU lists in .debug_names Differential Revision: https://reviews.llvm.org/D61241 llvm-svn: 359425	2019-04-29 08:55:10 +00:00
Fangrui Song	cc1fec31d9	[DWARF] Delete a redundant check in getFileNameByIndex() llvm-svn: 359422	2019-04-29 08:15:13 +00:00
Fangrui Song	3153764c88	s/Dwarf 5/DWARF v5/ NFC llvm-svn: 359307	2019-04-26 13:41:19 +00:00
Fangrui Song	efd94c56ba	Use llvm::stable_sort While touching the code, simplify if feasible. llvm-svn: 358996	2019-04-23 14:51:27 +00:00
Fangrui Song	dd0e833555	[llvm-symbolizer] Fix section index at the end of a section This is very minor issue. The returned section index is only used by DWARFDebugLine as an llvm::upper_bound input and the use case shouldn't cause any behavioral change. llvm-svn: 358814	2019-04-20 13:00:09 +00:00
Fangrui Song	9a331bba2a	[DWARF] Use hasFileAtIndex to properly verify DWARF 5 after rL358732 llvm-svn: 358734	2019-04-19 03:34:28 +00:00
Ali Tamur	783d84bb39	[llvm] Prevent duplicate files in debug line header in dwarf 5: another attempt Another attempt to land the changes in debug line header to prevent duplicate files in Dwarf 5. I rolled back my previous commit because of a mistake in generating the object file in a test. Meanwhile, I addressed some offline comments and changed the implementation; the largest difference is that MCDwarfLineTableHeader does not keep DwarfVersion but gets it as a parameter. I also merged the patch to fix two lld tests that will strt to fail into this patch. Original Commit: https://reviews.llvm.org/D59515 Original Message: Motivation: In previous dwarf versions, file name indexes started from 1, and the primary source file was not explicit. Dwarf 5 standard (6.2.4) prescribes the primary source file to be explicitly given an entry with an index number 0. The current implementation honors the specification by just duplicating the main source file, once with index number 0, and later maybe with another index number. While this is compliant with the letter of the standard, the duplication causes problems for consumers of this information such as lldb. (Some files are duplicated, where only some of them have a line table although all refer to the same file) With this change, dwarf 5 debug line section files always start from 0, and the zeroth entry is not duplicated whenever possible. This requires different handling of dwarf 4 and dwarf 5 during generation (e.g. when a function returns an index zero for a file name, it signals an error in dwarf 4, but not in dwarf 5) However, I think the minor complication is worth it, because it enables all consumers (lldb, gdb, dwarfdump, objdump, and so on) to treat all files in the file name list homogenously. llvm-svn: 358732	2019-04-19 02:26:56 +00:00
Fangrui Song	a364d599ab	[DWARF] llvm::Error -> Error. NFC The unqualified name is more common and is used in the file as well. llvm-svn: 358567	2019-04-17 09:11:08 +00:00
Fangrui Song	c82e92bca8	Change some llvm::{lower,upper}_bound to llvm::bsearch. NFC llvm-svn: 358564	2019-04-17 07:58:05 +00:00
Fangrui Song	df44ff1b78	[DWARF] Pass ReferenceToDIEOffsets elements by reference llvm-svn: 358558	2019-04-17 06:33:52 +00:00
Fangrui Song	f56a436891	[DWARF] Fix DWARFVerifier::DieRangeInfo::contains It didn't handle empty LHS correctly. If two ranges of LHS were contiguous and jointly contained one range of RHS, it could also be incorrect. DWARFAddressRange::contains can be removed and its tests can be merged into DWARFVerifier::DieRangeInfo::contains llvm-svn: 358387	2019-04-15 10:02:36 +00:00
Fangrui Song	b93de4cd26	[DWARF] Fix DWARFVerifier::DieRangeInfo::intersects It was incorrect if RHS had more than 1 ranges and one of the ranges interacted with *this llvm-svn: 358376	2019-04-15 08:30:10 +00:00
Fangrui Song	50a09670f0	[DWARF] Make DWARFDebugLine::ParsingState::RowNumber a local variable llvm-svn: 358374	2019-04-15 07:40:30 +00:00
Fangrui Song	cecc435250	Use llvm::lower_bound. NFC This reapplies rL358161. That commit inadvertently reverted an exegesis file to an old version. llvm-svn: 358246	2019-04-12 02:02:06 +00:00
Ali Tamur	7822b46188	Revert "Use llvm::lower_bound. NFC" This reverts commit rL358161. This patch have broken the test: llvm/test/tools/llvm-exegesis/X86/uops-CMOV16rm-noreg.s llvm-svn: 358199	2019-04-11 17:35:20 +00:00
Fangrui Song	71cce580b9	Use llvm::lower_bound. NFC llvm-svn: 358161	2019-04-11 10:25:41 +00:00
Fangrui Song	6a285dfe71	[DWARF] Set discriminator to 0 for DW_LNS_copy Summary: Make DW_LNS_copy set the discriminator register to 0, to conform to DWARF 4 & 5: "Then it sets the discriminator register to 0, and sets the basic_block, prologue_end and epilogue_begin registers to false." Because all of DW_LNE_end_sequence, DN_LNS_copy, and special opcodes reset discriminator to 0, we can move discriminator=0 to appendRowToMatrix. Also, make DW_LNS_copy print before appending the row, as it is similar to a address+=0,line+=0 special opcode, which prints before appending the row. Reviewers: dblaikie, probinson, aprantl Reviewed By: dblaikie Subscribers: danielcdh, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60364 llvm-svn: 358148	2019-04-11 02:02:44 +00:00
Fangrui Song	b3be23d334	[DWARF] Simplify LineTable::findRowInSeq We want the last row whose address is less than or equal to Address. This can be computed as upper_bound - 1, which is simpler than lower_bound followed by skipping equal rows in a loop. Since FirstRow (LowPC) does not satisfy the predicate (OrderByAddress) while LastRow-1 (HighPC) satisfies the predicate. We can decrease the search range by two, i.e. upper_bound [FirstRow,LastRow) = upper_bound [FirstRow+1,LastRow-1) llvm-svn: 358053	2019-04-10 07:44:23 +00:00
Fangrui Song	9b22c469ca	[DWARF] DWARFDebugLine: replace Sequence::orderByLowPC with orderByHighPC In a sorted list of non-overlapping [LowPC,HighPC) ranges, locating an address with upper_bound on HighPC is simpler than lower_bound on LowPC. llvm-svn: 358012	2019-04-09 15:08:32 +00:00
Eugene Leviant	7671a1daa7	Use llvm::crc32 instead of crc32. NFC llvm-svn: 357911	2019-04-08 13:40:58 +00:00
Eugene Leviant	18873b22be	Attempt to recommit r357901 llvm-svn: 357905	2019-04-08 12:31:12 +00:00
Eugene Leviant	03d28a4490	Reverting r357901 as fails to build on some of the buildbots llvm-svn: 357902	2019-04-08 11:37:20 +00:00
Eugene Leviant	ad69bd6870	[Support] Add zlib independent CRC32 Differential revision: https://reviews.llvm.org/D59816 llvm-svn: 357901	2019-04-08 11:25:48 +00:00
Fangrui Song	c4c8bcaeec	[DWARF] DWARFDebugLine: delete unused parameter `Offset` llvm-svn: 357866	2019-04-07 13:56:14 +00:00
Fangrui Song	6a0746a92f	Change some StringRef::data() reinterpret_cast to bytes_begin() or arrayRefFromStringRef() llvm-svn: 357852	2019-04-07 03:58:42 +00:00
Fangrui Song	4be8629e49	[DWARF] Simplify DWARFDebugAranges::findAddress The current lower_bound approach has to check two iterators pos and pos-1. Changing it to upper_bound allows us to check one iterator (similar to DWARFUnitVector::getUnitFor*). llvm-svn: 357834	2019-04-06 09:12:53 +00:00
Fangrui Song	cb300f1243	[Symbolize] Uniquify sorted vector<pair<SymbolDesc, StringRef>> llvm-svn: 357833	2019-04-06 02:18:56 +00:00
Fangrui Song	afb54fd629	[Symbolize] Replace map<SymbolDesc, StringRef> with sorted vector llvm-svn: 357758	2019-04-05 12:52:04 +00:00
Fangrui Song	e2622b3e33	[Symbolize] Keep SymbolDescs with the same address and improve getNameFromSymbolTable heuristic I'll follow up with better heuristics or tests. llvm-svn: 357683	2019-04-04 11:08:45 +00:00
Igor Kudrin	0fed7b0564	[llvm-symbolizer] Add `--output-style` switch. In general, llvm-symbolizer follows the output style of GNU's addr2line. However, there are still some differences; in particular, for a requested address, llvm-symbolizer prints line and column, while addr2line prints only the line number. This patch adds a new switch to select the preferred style. Differential Revision: https://reviews.llvm.org/D60190 llvm-svn: 357675	2019-04-04 08:39:40 +00:00
Reid Kleckner	e10d00419a	[codeview] Remove Type member from CVRecord Summary: Now CVType and CVSymbol are effectively type-safe wrappers around ArrayRef<uint8_t>. Make the kind() accessor load it from the RecordPrefix, which is the same for types and symbols. Reviewers: zturner, aganea Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60018 llvm-svn: 357658	2019-04-04 00:28:48 +00:00
Jonas Devlieghere	2156797cf0	[dwarfdump] Remove bogus verifier error The standard doesn't require a DW_TAG_variable, DW_TAG_formal_parameter or DW_TAG_constant to A DW_AT_type attribute describing the type of the variable. It only specifies that it can have one. llvm-svn: 357628	2019-04-03 19:57:13 +00:00
Paul Semel	0c27bc2e1f	[DWARF] check whether the DIE is valid before querying for information Differential Revision: https://reviews.llvm.org/D60147 llvm-svn: 357607	2019-04-03 17:13:45 +00:00
Reid Kleckner	85e2cdac73	Delay initialization of three static global maps, NFC This avoids allocating a few KB of heap memory on startup, and instead allocates these maps lazily. I noticed this while profiling LLD. llvm-svn: 357192	2019-03-28 17:33:41 +00:00
Fangrui Song	3f2e29b013	[DWARF] Add D to Seen early to avoid duplicate elements in Worklist llvm-svn: 357054	2019-03-27 09:38:05 +00:00
Fangrui Song	38a4c619eb	[DWARF] Simplify DWARFVerifier::handleDebugAbbrev. NFC llvm-svn: 357053	2019-03-27 08:43:21 +00:00
Ali Tamur	02e96648d7	Revert "[llvm] Reapply "Prevent duplicate files in debug line header in dwarf 5."" This reverts commit rL357020. The commit broke the test llvm/test/tools/llvm-objdump/embedded-source.test on some builds including clang-ppc64be-linux-multistage, clang-s390x-linux, clang-with-lto-ubuntu, clang-x64-windows-msvc, llvm-clang-lld-x86_64-scei-ps4-windows10pro-fast (and others). llvm-svn: 357026	2019-03-26 20:05:27 +00:00
Ali Tamur	2f5cd03a3f	[llvm] Reapply "Prevent duplicate files in debug line header in dwarf 5." Reapply rL356941 after regenerating the object file in the failing test llvm/test/tools/llvm-objdump/embedded-source.test from source. Original commit message: [llvm] Prevent duplicate files in debug line header in dwarf 5. Motivation: In previous dwarf versions, file name indexes started from 1, and the primary source file was not explicit. Dwarf 5 standard (6.2.4) prescribes the primary source file to be explicitly given an entry with an index number 0. The current implementation honors the specification by just duplicating the main source file, once with index number 0, and later maybe with another index number. While this is compliant with the letter of the standard, the duplication causes problems for consumers of this information such as lldb. (Some files are duplicated, where only some of them have a line table although all refer to the same file) With this change, dwarf 5 debug line section files always start from 0, and the zeroth entry is not duplicated whenever possible. This requires different handling of dwarf 4 and dwarf 5 during generation (e.g. when a function returns an index zero for a file name, it signals an error in dwarf 4, but not in dwarf 5) However, I think the minor complication is worth it, because it enables all consumers (lldb, gdb, dwarfdump, objdump, and so on) to treat all files in the file name list homogenously. Tags: #llvm, #debug-info Differential Revision: https://reviews.llvm.org/D59515 llvm-svn: 357018	2019-03-26 18:53:23 +00:00
Ali Tamur	fdce82a814	Revert "[llvm] Prevent duplicate files in debug line header in dwarf 5." This reverts commit `312ab05887`. My commit broke the build; I will revert and find out what happened. llvm-svn: 356951	2019-03-25 21:09:07 +00:00
Ali Tamur	312ab05887	[llvm] Prevent duplicate files in debug line header in dwarf 5. Summary: Motivation: In previous dwarf versions, file name indexes started from 1, and the primary source file was not explicit. Dwarf 5 standard (6.2.4) prescribes the primary source file to be explicitly given an entry with an index number 0. The current implementation honors the specification by just duplicating the main source file, once with index number 0, and later maybe with another index number. While this is compliant with the letter of the standard, the duplication causes problems for consumers of this information such as lldb. (Some files are duplicated, where only some of them have a line table although all refer to the same file) With this change, dwarf 5 debug line section files always start from 0, and the zeroth entry is not duplicated whenever possible. This requires different handling of dwarf 4 and dwarf 5 during generation (e.g. when a function returns an index zero for a file name, it signals an error in dwarf 4, but not in dwarf 5) However, I think the minor complication is worth it, because it enables all consumers (lldb, gdb, dwarfdump, objdump, and so on) to treat all files in the file name list homogenously. Reviewers: dblaikie, probinson, aprantl, espindola Reviewed By: probinson Subscribers: emaste, jvesely, nhaehnle, aprantl, javed.absar, arichardson, hiraditya, MaskRay, rupprecht, jdoerfert, llvm-commits Tags: #llvm, #debug-info Differential Revision: https://reviews.llvm.org/D59515 llvm-svn: 356941	2019-03-25 20:08:00 +00:00
Fangrui Song	40483e1831	[DWARF] Delete a stray break and a stray comment. NFC llvm-svn: 356838	2019-03-23 16:15:40 +00:00
Alexey Lapshin	b2c4b8bded	[DebugInfo] follow up for "add SectionedAddress to DebugInfo interfaces" [Symbolizer] Add getModuleSectionIndexForAddress() helper routine The https://reviews.llvm.org/D58194 patch changed symbolizer interface. Particularily it requires not only Address but SectionIndex also. Note object::SectionedAddress parameter: Expected<DILineInfo> symbolizeCode(const std::string &ModuleName, object::SectionedAddress ModuleOffset, StringRef DWPName = ""); There are callers of symbolizer which do not know particular section index. That patch creates getModuleSectionIndexForAddress() routine which will detect section index for the specified address. Thus if caller set ModuleOffset.SectionIndex into object::SectionedAddress::UndefSection state then symbolizer would detect section index using getModuleSectionIndexForAddress routine. Differential Revision: https://reviews.llvm.org/D58848 llvm-svn: 356829	2019-03-23 08:08:40 +00:00
Fangrui Song	4597dce483	[DWARF] Refactor RelocVisitor and fix computation of SHT_RELA-typed relocation entries Summary: getRelocatedValue may compute incorrect value for SHT_RELA-typed relocation entries. // DWARFDataExtractor.cpp uint64_t DWARFDataExtractor::getRelocatedValue(uint32_t Size, uint32_t Off, ... // This formula is correct for REL, but may be incorrect for RELA if the value // stored in the location (getUnsigned(Off, Size)) is not zero. return getUnsigned(Off, Size) + Rel->Value; In this patch, we refactor these visit* functions to include a new parameter `uint64_t A`. Since these visit* functions are no longer used as visitors, rename them to resolve. + REL: A is used as the addend. A is the value stored in the location where the relocation applies: getUnsigned(Off, Size) + RELA: The addend encoded in RelocationRef is used, e.g. getELFAddend(R) and add another set of supports* functions to check if a given relocation type is handled. DWARFObjInMemory uses them to fail early. Reviewers: echristo, dblaikie Reviewed By: echristo Subscribers: mgorny, aprantl, aheejin, fedor.sergeev, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57939 llvm-svn: 356729	2019-03-22 02:43:11 +00:00
Reid Kleckner	cda7ff9ddc	[llvm-pdbutil] Add -type-ref-stats to help find unused type info Summary: This considers module symbol streams and the global symbol stream to be roots. Most types that this considers "unreferenced" are referenced by LF_UDT_MOD_SRC_LINE id records, which VC seems to always include. Essentially, they are types that the user can only find in the debugger if they call them by name, they cannot be found by traversing a symbol. In practice, around 80% of type information in a PDB is referenced by a symbol. That seems like a reasonable number. I don't really plan to do anything with this tool. It mostly just exists for informational purposes, and to confirm that we probably don't need to implement type reference tracking in LLD. We can continue to merge all types as we do today without wasting space. Reviewers: zturner, aganea Subscribers: mgorny, hiraditya, arphaman, jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59620 llvm-svn: 356692	2019-03-21 18:02:34 +00:00
Markus Lavin	b86ce219f4	[DebugInfo] Introduce DW_OP_LLVM_convert Introduce a DW_OP_LLVM_convert Dwarf expression pseudo op that allows for a convenient way to perform type conversions on the Dwarf expression stack. As an additional bonus it paves the way for using other Dwarf v5 ops that need to reference a base_type. The new DW_OP_LLVM_convert is used from lib/Transforms/Utils/Local.cpp to perform sext/zext on debug values but mainly the patch is about preparing terrain for adding other Dwarf v5 ops that need to reference a base_type. For Dwarf v5 the op maps to DW_OP_convert and for earlier versions a complex shift & mask pattern is generated to emulate sext/zext. This is a recommit of r356442 with trivial fixes for the failing tests. Differential Revision: https://reviews.llvm.org/D56587 llvm-svn: 356451	2019-03-19 13:16:28 +00:00
Markus Lavin	ad78768d59	Revert "[DebugInfo] Introduce DW_OP_LLVM_convert" This reverts commit 1cf4b593a7ebd666fc6775f3bd38196e8e65fafe. Build bots found failing tests not detected locally. Failing Tests (3): LLVM :: DebugInfo/Generic/convert-debugloc.ll LLVM :: DebugInfo/Generic/convert-inlined.ll LLVM :: DebugInfo/Generic/convert-linked.ll llvm-svn: 356444	2019-03-19 09:17:28 +00:00
Markus Lavin	cd8a940b37	[DebugInfo] Introduce DW_OP_LLVM_convert Introduce a DW_OP_LLVM_convert Dwarf expression pseudo op that allows for a convenient way to perform type conversions on the Dwarf expression stack. As an additional bonus it paves the way for using other Dwarf v5 ops that need to reference a base_type. The new DW_OP_LLVM_convert is used from lib/Transforms/Utils/Local.cpp to perform sext/zext on debug values but mainly the patch is about preparing terrain for adding other Dwarf v5 ops that need to reference a base_type. For Dwarf v5 the op maps to DW_OP_convert and for earlier versions a complex shift & mask pattern is generated to emulate sext/zext. Differential Revision: https://reviews.llvm.org/D56587 llvm-svn: 356442	2019-03-19 08:48:19 +00:00
Alexandre Ganea	4aeea4cc42	[DebugInfo][PDB] Don't write empty debug streams Before, empty debug streams were written as 8 bytes (4 bytes signature + 4 bytes for the GlobalRefs count). With this patch, unused empty streams aren't emitted anymore. Modules now encode 65535 as an 'unused stream' value, by convention. Also fix the * Linker * contrib section which wasn't correctly emitted previously. Differential Revision: https://reviews.llvm.org/D59502 llvm-svn: 356395	2019-03-18 19:13:23 +00:00
Mircea Trofin	2c3ab66539	[llvm] Skip over empty line table entries. Summary: This is similar to how addr2line handles consecutive entries with the same address - pick the last one. Reviewers: dblaikie, friss, JDevlieghere Reviewed By: dblaikie Subscribers: eugenis, vitalybuka, echristo, JDevlieghere, probinson, aprantl, hiraditya, rupprecht, jdoerfert, llvm-commits Tags: #llvm, #debug-info Differential Revision: https://reviews.llvm.org/D58952 llvm-svn: 356265	2019-03-15 15:00:12 +00:00
Evgeniy Stepanov	6e64a14804	Revert "[llvm] Skip over empty line table entries." This reverts commit r355972. See the discussion at https://reviews.llvm.org/D58952. llvm-svn: 356001	2019-03-13 01:37:58 +00:00
Mircea Trofin	0c29402eb4	[llvm] Skip over empty line table entries. Summary: This is similar to how addr2line handles consecutive entries with the same address - pick the last one. Reviewers: dblaikie, friss, JDevlieghere Reviewed By: dblaikie Subscribers: ormris, echristo, JDevlieghere, probinson, aprantl, hiraditya, rupprecht, jdoerfert, llvm-commits Tags: #llvm, #debug-info Differential Revision: https://reviews.llvm.org/D58952 llvm-svn: 355972	2019-03-12 20:48:45 +00:00
Nathan Lanza	cc51dc649a	Add Swift enumerator value for CodeView::SourceLanguage Summary: Swift now generates PDBs for debugging on Windows. llvm and lldb need a language enumerator value too properly handle the output emitted by swiftc. Subscribers: jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59231 llvm-svn: 355882	2019-03-11 23:27:59 +00:00
Petar Jovanovic	95817d3641	[DebugInfo] Fix the type of the formated variable Change the format type of Personality and LSDAAddress to PRIx64 since they are of type uint64_t. The problem was detected on mips builds, where it was printing junk values and causing test failure. Patch by Milos Stojanovic. Differential Revision: https://reviews.llvm.org/D58451 llvm-svn: 355607	2019-03-07 16:31:08 +00:00
Jonas Devlieghere	4cc567bb9e	[DWARFFormValue] Don't consider DW_FORM_data4/8 to be section offsets. When dumping ToT clan's debug info with dwarfdump, we were seeing an error saying that that the location list overflows the debug_loc section. After reducing the testcase we figured out that we were interpreting the DW_FORM_data4 as a section offset. In DWARF3 DW_FORM_data4 and DW_FORM_data8 served also as a section offset. Until now we didn't check check for the DWARF version, because some producers (read old versions of clang) were still emitting this. The relevant code/comment was added in 2013, and I believe it's now reasonable to start checking the version. The FormValue class is a little bit of a mess because it cashes the DWARF unit and context when it extracted the value itself. Several methods of the class rely on it being present, or return an Optional for the code path that needs it. At the same time the FormValue class also used in places where there's no DWARF unit. For this patch I went with the least invasive change: checking the version from the CU when it's available. If it's not (because the form value was created from a value directly) we default to the old behavior. Differential revision: https://reviews.llvm.org/D58698 llvm-svn: 355456	2019-03-05 23:47:22 +00:00
Vlad Tsyrklevich	53a9f1d367	Revert "[DWARFFormValue] Cleanup DWARFFormValue interface. (2/2) (NFC)" This reverts commit r355233, it was causing UBSan failures. llvm-svn: 355255	2019-03-02 01:10:00 +00:00
Jonas Devlieghere	2dc2baa8cc	[DWARFFormValue] Cleanup DWARFFormValue interface. (2/2) (NFC) Continues the work started in r354941. Changes (all but one) uses of the extractValue to static createFromData. llvm-svn: 355233	2019-03-01 22:14:24 +00:00
Adrian Prantl	fa37a00044	dsymutil support for DW_OP_convert Add support for cloning DWARF expressions that contain base type DIE references in dsymutil. <rdar://problem/48167812> Differential Revision: https://reviews.llvm.org/D58534 llvm-svn: 355148	2019-02-28 22:12:32 +00:00
Alexey Lapshin	77fc1f6049	[DebugInfo] add SectionedAddress to DebugInfo interfaces. That patch is the fix for https://bugs.llvm.org/show_bug.cgi?id=40703 "wrong line number info for obj file compiled with -ffunction-sections" bug. The problem happened with only .o files. If object file contains several .text sections then line number information showed incorrectly. The reason for this is that DwarfLineTable could not detect section which corresponds to specified address(because address is the local to the section). And as the result it could not select proper sequence in the line table. The fix is to pass SectionIndex with the address. So that it would be possible to differentiate addresses from various sections. With this fix llvm-objdump shows correct line numbers for disassembled code. Differential review: https://reviews.llvm.org/D58194 llvm-svn: 354972	2019-02-27 13:17:36 +00:00
Jonas Devlieghere	bb111152b7	[DWARFFormValue] Cleanup DWARFFormValue interface. (NFC) DWARFFormValues can be created from a data extractor or by passing its value directly. Until now this was done by member functions that modified an existing object's internal state. This patch replaces a subset of these methods with static method that return a new DWARFFormValue. llvm-svn: 354941	2019-02-27 00:58:09 +00:00
Markus Lavin	76dda218a0	[DebugInfo] Prep llvm-dwarfdump for typed DW5 ops. Adds llvm-dwarfdump support for pretty printing Dwarf5 expressions ops that reference a base type (right now only DW_OP_convert is added). Includes verification to verify that the ops operand is actually a DW_TAG_base_type DIE. Differential Revision: https://reviews.llvm.org/D58442 llvm-svn: 354552	2019-02-21 08:20:24 +00:00
Matt Davis	123be5d4c0	[symbolizer] Avoid collecting symbols belonging to invalid sections. Summary: llvm-symbolizer would originally report symbols that belonged to an invalid object file section. Specifically the case where: `*Symbol.getSection() == ObjFile.section_end()` This patch prevents the Symbolizer from collecting symbols that belong to invalid sections. The test (from PR40591) introduces a case where two symbols have address 0, one symbol is defined, 'foo', and the other is not defined, 'bar'. This patch will cause the Symbolizer to keep 'foo' and ignore 'bar'. As a side note, the logic for adding symbols to the Symbolizer's store (`SymbolizableObjectFile::addSymbol`) replaces symbols with the same <address, size> pair. At some point that logic should be revisited as in the aforementioned case, 'bar' was overwriting 'foo' in the Symbolizer's store, and 'foo' was forgotten. This fixes PR40591 Reviewers: jhenderson, rupprecht Reviewed By: rupprecht Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58146 llvm-svn: 354083	2019-02-14 23:50:35 +00:00
Jordan Rupprecht	5b7ad42729	[DebugInfo] Fix /usr/lib/debug llvm-symbolizer lookup with relative paths Summary: rL189250 added a realpath call, and rL352916 because realpath breaks assumptions with some build systems. However, the /usr/lib/debug case has been clarified, falling back to /usr/lib/debug is currently broken if the obj passed in is a relative path. Adding a call to use absolute paths when falling back to /usr/lib/debug fixes that while still not making any realpath assumptions. This also adds a --fallback-debug-path command line flag for testing (since we probably can't write to /usr/lib/debug from buildbot environments), but was also verified manually: ``` $ rm -f path/to/dwarfdump-test.elf-x86-64 $ strace llvm-symbolizer --obj=relative/path/to/dwarfdump-test.elf-x86-64.debuglink 0x40113f \|& grep dwarfdump ``` Lookups went to relative/path/to/dwarfdump-test.elf-x86-64, relative/path/to/.debug/dwarfdump-test.elf-x86-64, and then finally /usr/lib/debug/absolute/path/to/dwarfdump-test.elf-x86-64. Reviewers: dblaikie, samsonov Reviewed By: dblaikie Subscribers: krytarowski, aprantl, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57916 llvm-svn: 353730	2019-02-11 18:05:48 +00:00
Benjamin Kramer	711950c116	Move some classes into anonymous namespaces. NFC. llvm-svn: 353710	2019-02-11 15:16:21 +00:00
Alexandre Ganea	120366edc7	[CodeView] Fix cycles in debug info when merging Types with global hashes When type streams with forward references were merged using GHashes, cycles were introduced in the debug info. This was caused by GlobalTypeTableBuilder::insertRecordAs() not inserting the record on the second pass, thus yielding an empty ArrayRef at that record slot. Later on, upon PDB emission, TpiStreamBuilder::commit() would skip that empty record, thus offseting all indices that came after in the stream. This solution comes in two steps: 1. Fix the hash calculation, by doing a multiple-step resolution, iff there are forward references in the input stream. 2. Fix merge by resolving with multiple passes, therefore moving records with forward references at the end of the stream. This patch also adds support for llvm-readoj --codeview-ghash. Finally, fix dumpCodeViewMergedTypes() which previously could reference deleted memory. Fixes PR40221 Differential Revision: https://reviews.llvm.org/D57790 llvm-svn: 353412	2019-02-07 15:24:18 +00:00
James Henderson	b6b5b1a592	[DebugInfo]Print correct value for special opcode address increment The wrong variable was being used when printing the address increment in verbose output of .debug_line. This patch fixes this. Reviewed by: JDevlieghere Differential Revision: https://reviews.llvm.org/D57693 llvm-svn: 353288	2019-02-06 10:31:50 +00:00
Jordan Rupprecht	835df27f85	[DebugInfo] Don't use realpath when looking up debug binary locations. Summary: Using realpath makes assumptions about build systems that do not always hold true. The debug binary referred to from the .gnu_debuglink should exist in the same directory (or in a .debug directory, etc.), but the files may only exist as symlinks to a differently named files elsewhere, and using realpath causes that lookup to fail. This was added in r189250, and this is basically a revert + regression test case. Reviewers: dblaikie, samsonov, jhenderson Reviewed By: dblaikie Subscribers: llvm-commits, hiraditya Tags: #llvm Differential Revision: https://reviews.llvm.org/D57609 llvm-svn: 352916	2019-02-01 21:04:16 +00:00
Wolfgang Pieb	58513b7761	[DWARF v5] Fix DWARF emitter and consumer to produce/expect a uleb for a location description's length. Reviewer: davide, JDevliegere Differential Revision: https://reviews.llvm.org/D57550 llvm-svn: 352889	2019-02-01 17:11:58 +00:00
Aleksandr Urakov	d17f6ab61b	[NativePDB] Fix access to both old & new fpo data entries from dbi stream Summary: This patch fixes access to fpo streams in native pdb from DbiStream and makes code consistent with DbiStreamBuilder. Patch By: leonid.mashinskiy Reviewers: zturner, aleksandr.urakov Reviewed By: zturner Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D56725 llvm-svn: 352615	2019-01-30 10:40:45 +00:00
Zachary Turner	8371da385a	[PDB] Increase TPI hash bucket count. PDBs contain several serialized hash tables. In the microsoft-pdb repo published to support LLVM implementing PDB support, the provided initializes the bucket count for the TPI and IPI streams to the maximum size. This occurs in tpi.cpp L33 and tpi.cpp L398. In the LLVM code for generating PDBs, these streams are created with minimum number of buckets. This difference makes LLVM generated PDBs slower for when used for debugging. Patch by C.J. Hebert Differential Revision: https://reviews.llvm.org/D56942 llvm-svn: 352117	2019-01-24 22:25:55 +00:00
James Henderson	33c16a3f16	[llvm-symbolizer] Add support for --basenames/-s This fixes https://bugs.llvm.org/show_bug.cgi?id=40068. --basenames is a GNU addr2line switch which strips the directory names from the file path in the output. Reviewed by: ruiu Differential Revision: https://reviews.llvm.org/D56919 llvm-svn: 351795	2019-01-22 10:24:32 +00:00
Chandler Carruth	2946cd7010	Update the file headers across all of the LLVM projects in the monorepo to reflect the new license. We understand that people may be surprised that we're moving the header entirely to discuss the new license. We checked this carefully with the Foundation's lawyer and we believe this is the correct approach. Essentially, all code in the project is now made available by the LLVM project under our new license, so you will see that the license headers include that license only. Some of our contributors have contributed code under our old license, and accordingly, we have retained a copy of our old license notice in the top-level files in each project and repository. llvm-svn: 351636	2019-01-19 08:50:56 +00:00
Alexandre Ganea	90f4b94da3	[CodeView] More appropriate name and type for a Microsoft precompiled headers parameter. NFC llvm-svn: 350520	2019-01-07 13:53:16 +00:00
David Blaikie	b917c3a41a	llvm-dwarfdump: Skip address index info (and dump only the address, if found) when non-verbose dumping addrx forms There's a few bugs here still - demonstrated with FIXITs in the test. llvm-svn: 350046	2018-12-24 06:52:31 +00:00
David Blaikie	2a38c17b34	DebugInfo: Accurately propagate the section used by a relocation when accessing ranges defined by low/high_pc This is difficult/not possible to test in LLVM, but is visible as a crash in LLD when parsing DWARF to generate gdb-index. This function is called by llvm-dwarfdump when parsing high_pc for non-verbose output (to print the actual high_pc rather than the low_pc relative value), but in that case llvm-dwarfdump doesn't print section names (if it did, it would hit this problem). We could add some other features to llvm-dwarfdump to expose this, but nothing really springs to my mind. I will add a test to lld, though. llvm-svn: 350010	2018-12-22 22:20:40 +00:00
David Blaikie	25179613f6	llvm-dwarfdump: Dump the section name/number for addr attributes llvm-svn: 350009	2018-12-22 20:34:58 +00:00
David Blaikie	9efb0153f0	llvm-dwarfdump: Remove extraneous space between '(' and 'indexed' When dumping string or address indexes llvm-svn: 349997	2018-12-22 08:43:08 +00:00
David Blaikie	c04d2bf22a	llvm-dwarfdump: Print the section name/number for addr_index attributes (addr attributes coming shortly) llvm-svn: 349996	2018-12-22 08:33:55 +00:00
David Blaikie	87ae80fb2f	DebugInfo: Refactor named section dumping into a reusable helper Currently the section name (& possibly number) is only printed on addresses in ranges - but no reason it couldn't also be displayed on other addresses (like low/high PC). Refactor in that direction by pulling out the section lookup and name ambiguity dumping logic into a reusable helper. llvm-svn: 349995	2018-12-22 08:23:10 +00:00
David Blaikie	e4e0b9f48f	DebugInfo: Remove extra attribute lookup llvm-svn: 349985	2018-12-22 02:24:13 +00:00
David Blaikie	219c6bd388	libDebugInfo: Refactor error handling in range list parsing Propagate the llvm::Error a little further up. This is NFC for llvm-dwarfdump in this change, but allows ld.lld to emit more precise error messages about which object and archive the erroneous DWARF is in. llvm-svn: 349978	2018-12-22 00:31:02 +00:00
David Blaikie	c3f30a7fc6	Reapply: DebugInfo: Assume an absence of ranges or high_pc on a CU means the CU is empty (devoid of code addresses) Originally committed in r349333, reverted in r349353. GCC emitted these unconditionally on/before 4.4/March 2012 Clang emitted these unconditionally on/before 3.5/March 2014 This improves performance when parsing CUs (especially those using split DWARF) that contain no code ranges (such as the mini CUs that may be created by ThinLTO importing - though generally they should be/are avoided, especially for Split DWARF because it produces a lot of very small CUs, which don't scale well in a bunch of other ways too (including size)). The revert was due to a (Google internal) test that had some checked in old object files missing DW_AT_ranges. That's since been fixed. llvm-svn: 349968	2018-12-21 22:25:01 +00:00
Luke Cheeseman	41a9e53500	[Dwarf/AArch64] Return address signing B key dwarf support - When signing return addresses with -msign-return-address=<scope>{+<key>}, either the A key instructions or the B key instructions can be used. To correctly authenticate the return address, the unwinder/debugger must know which key was used to sign the return address. - When and exception is thrown or a break point reached, it may be necessary to unwind the stack. To accomplish this, the unwinder/debugger must be able to first authenticate an the return address if it has been signed. - To enable this, the augmentation string of CIEs has been extended to allow inclusion of a 'B' character. Functions that are signed using the B key variant of the instructions should have and FDE whose associated CIE has a 'B' in the augmentation string. - One must also be able to preserve these semantics when first stepping from a high level language into assembly and then, as a second step, into an object file. To achieve this, I have introduced a new assembly directive '.cfi_b_key_frame ', that tells the assembler the current frame uses return address signing with the B key. - This ensures that the FDE is associated with a CIE that has 'B' in the augmentation string. Differential Revision: https://reviews.llvm.org/D51798 llvm-svn: 349895	2018-12-21 10:45:08 +00:00
David Blaikie	ac69af7ad6	llvm-dwarfdump: Improve/fix pretty printing of array dimensions This is to address post-commit feedback from Paul Robinson on r348954. The original commit misinterprets count and upper bound as the same thing (I thought I saw GCC producing an upper bound the same as Clang's count, but GCC correctly produces an upper bound that's one less than the count (in C, that is, where arrays are zero indexed)). I want to preserve the C-like output for the common case, so in the absence of a lower bound the count (or one greater than the upper bound) is rendered between []. In the trickier cases, where a lower bound is specified, a half-open range is used (eg: lower bound 1, count 2 would be "[1, 3)" and an unknown parts use a '?' (eg: "[1, ?)" or "[?, 7)" or "[?, ? + 3)"). Reviewers: aprantl, probinson, JDevlieghere Differential Revision: https://reviews.llvm.org/D55721 llvm-svn: 349670	2018-12-19 19:34:24 +00:00
Luke Cheeseman	f57d7d8237	[AArch64] - Return address signing dwarf support - Reapply changes intially introduced in r343089 - The archtecture info is no longer loaded whenever a DWARFContext is created - The runtimes libraries (santiziers) make use of the dwarf context classes but do not intialise the target info - The architecture of the object can be obtained without loading the target info - Adding a method to the dwarf context to get this information and multiplex the string printing later on Differential Revision: https://reviews.llvm.org/D55774 llvm-svn: 349472	2018-12-18 10:37:42 +00:00
Zachary Turner	bb3d7e565f	[PDB] Add some helper functions for working with scopes. llvm-svn: 349361	2018-12-17 16:15:36 +00:00
Eric Liu	6c933a2bed	Revert "DebugInfo: Assume an absence of ranges or high_pc on a CU means the CU is empty (devoid of code addresses)" This reverts commit r349333. It caused internal test to fail. I have sent more information to the author. llvm-svn: 349353	2018-12-17 14:14:40 +00:00
David Blaikie	884deed1b3	DebugInfo: Assume an absence of ranges or high_pc on a CU means the CU is empty (devoid of code addresses) GCC emitted these unconditionally on/before 4.4/March 2012 Clang emitted these unconditionally on/before 3.5/March 2014 This improves performance when parsing CUs (especially those using split DWARF) that contain no code ranges (such as the mini CUs that may be created by ThinLTO importing - though generally they should be/are avoided, especially for Split DWARF because it produces a lot of very small CUs, which don't scale well in a bunch of other ways too (including size)). llvm-svn: 349333	2018-12-17 08:27:19 +00:00
David Blaikie	023674a9e4	DebugInfo/DWARF: Pretty print subroutine types Doesn't handle varargs and other fun things, but it's a start. (also doesn't print these strictly as valid C++ when it's a pointer to function, it'll print as "void(int)" instead of "void ()(int)") llvm-svn: 348965	2018-12-12 19:53:03 +00:00
David Blaikie	3f8f004daf	DebugInfo/DWARF: Improve dumping of pointers to members ('int foo::' rather than 'int') llvm-svn: 348962	2018-12-12 19:34:02 +00:00
David Blaikie	815cffaad8	DebugInfo/DWARF: Refactor type dumping to dump types, rather than DIEs that reference types This lays the foundation for dumping types not referenced by DW_AT_type attributes (in the near-term, that'll be DW_AT_containing_type for a DW_TAG_ptr_to_member_type - in the future, potentially dumping the pretty printed name next to the DW_TAG for the type, rather than only when the type is referenced from elsewhere) llvm-svn: 348961	2018-12-12 19:33:08 +00:00
David Blaikie	92b5493a14	DebugInfo/DWARF: Refactor getAttributeValueAsReferencedDie to accept a DWARFFormValue Save searching for the attribute again when you already have the DWARFFormValue at hand. llvm-svn: 348960	2018-12-12 19:23:55 +00:00
David Blaikie	73066d60f1	llvm-dwarfdump: Dump array dimensions in stringified type names llvm-svn: 348954	2018-12-12 18:46:25 +00:00
Zachary Turner	a42bbe3981	[NativePDB] Reconstruct function declarations from debug info. Previously we would create an lldb::Function object for each function parsed, but we would not add these to the clang AST. This is a first step towards getting local variable support working, as we first need an AST decl so that when we create local variable entries, they have the proper DeclContext. Differential Revision: https://reviews.llvm.org/D55384 llvm-svn: 348631	2018-12-07 19:34:02 +00:00
Zachary Turner	a93458b050	[PDB] Move some code around. NFC. llvm-svn: 348505	2018-12-06 17:49:15 +00:00
Zachary Turner	579264bd59	Support skewed stream arrays. VarStreamArray was built on the assumption that it is backed by a StreamRef, and offset 0 of that StreamRef is the first byte of the first record in the array. This is a logical and intuitive assumption, but unfortunately we have use cases where it doesn't hold. Specifically, a PDB module's symbol stream is prefixed by 4 bytes containing a magic value, and the first byte of record data in the array is actually at offset 4 of this byte sequence. Previously, we would just truncate the first 4 bytes and then construct the VarStreamArray with the resulting StreamRef, so that offset 0 of the underlying stream did correspond to the first byte of the first record, but this is problematic, because symbol records reference other symbol records by the absolute offset including that initial magic 4 bytes. So if another record wants to refer to the first record in the array, it would say "the record at offset 4". This led to extremely confusing hacks and semantics in loading code, and after spending 30 minutes trying to get some math right and failing, I decided to fix this in the underlying implementation of VarStreamArray. Now, we can say that a stream is skewed by a particular amount. This way, when we access a record by absolute offset, we can use the same values that the records themselves contain, instead of having to do fixups. Differential Revision: https://reviews.llvm.org/D55344 llvm-svn: 348499	2018-12-06 16:55:00 +00:00
Zachary Turner	7c6b19f49b	[PDB] Emit S_UDT records in LLD. Previously these were dropped. We now understand them sufficiently well to start emitting them. From the debugger's perspective, this now enables us to have debug info about typedefs (both global and function-locally scoped) Differential Revision: https://reviews.llvm.org/D55228 llvm-svn: 348306	2018-12-04 21:48:46 +00:00
George Rimar	7e981f330b	[llvm-dwarfdump] - Dump the older versions of .eh_frame/.debug_frame correctly. The issue is the following. DWARF 2 used version 1 for .debug_frame. (Appendix G, p. 416 http://dwarfstd.org/doc/DWARF5.pdf) lib/MC now always sets version 1 for .eh_frame (and sets 1-4 versions for .debug_frame correctly): https://github.com/llvm-mirror/llvm/blob/master/lib/MC/MCDwarf.cpp#L1530 https://github.com/llvm-mirror/llvm/blob/master/lib/MC/MCDwarf.cpp#L1562 https://github.com/llvm-mirror/llvm/blob/master/lib/MC/MCDwarf.cpp#L1602 In version 1, return_address_register was defined as ubyte, while other versions switched to uleb128. (p 62, http://www.dwarfstd.org/doc/dwarf-2.0.0.pdf) Patch teaches llvm-dwarfdump about this difference. Differential revision: https://reviews.llvm.org/D54860 llvm-svn: 348242	2018-12-04 10:01:39 +00:00
Zachary Turner	1e0cce796c	Fix issue with Tpi Stream hash map. Part of the patch to not build the hash map eagerly was omitted due to a merge conflict. Add it back, which should fix the failing tests. llvm-svn: 348166	2018-12-03 19:05:12 +00:00
Zachary Turner	f861e291d6	Don't build the Tpi Hash map by default. This is very slow and should be done for specific cases where lookups will need to happen. llvm-svn: 348160	2018-12-03 18:32:05 +00:00

... 4 5 6 7 8 ...

2043 Commits