llvm-project

Commit Graph

Author	SHA1	Message	Date
David Blaikie	11e0b79b05	llvm-dwarfdump: Don't print even an empty string when a type is unprintable	2021-09-19 17:03:10 -07:00
David Blaikie	5bfe5207ef	llvm-dwarfdump: Pretty print names qualified/with scopes	2021-09-19 16:36:01 -07:00
David Blaikie	372e2c24b6	llvm-dwarfdump: Pretty printing types including a space between const and parenthesized references/pointers to arrays	2021-09-19 13:32:53 -07:00
David Blaikie	a51fb58c55	DWARFDie.cpp: Minor follow-up clang-format	2021-09-19 13:06:18 -07:00
David Blaikie	f09ca5c646	DWARFDie: Improve type printing for function and array types - with qualifiers (cv/reference) and pointers to them	2021-09-19 12:59:31 -07:00
David Blaikie	ae0873483d	DWARFDie:DWARFTypePrinter: Add common utility function for checking where parentheses are required	2021-09-18 22:54:57 -07:00
David Blaikie	d2373c04a7	DWARFDie.cpp: Reduce indentation with early continue	2021-09-18 22:22:25 -07:00
Simon Pilgrim	72e5786281	[DebugInfo] DWARF - Use const-ref iterator in for-range loop. NFCI. Avoid unnecessary copies, reported by MSVC static analyzer.	2021-09-17 14:04:54 +01:00
Peter Collingbourne	fc08cfb888	CodeView: static_cast result of getOffset() to size_t. Silences a narrowing conversion warning on 32-bit platforms after D109923.	2021-09-16 23:39:04 -07:00
Nico Weber	646299d183	[Support] Convert BinaryStream class zoo to 64-bit offsets Most PDB fields on disk are 32-bit but describe the file in terms of MSF blocks, which are 4 kiB by default. So PDB files can be a bit larger than 4 GiB, and much larger if you create them with a block size > 4 kiB. This is a first (necessary, but by far not not sufficient) step towards supporting such PDB files. Now we don't truncate in-memory file offsets (which are in terms of bytes, not in terms of blocks). No effective behavior change. lld-link will still error out if it were to produce PDBs > 4 GiB. Differential Revision: https://reviews.llvm.org/D109923	2021-09-16 19:14:52 -04:00
David Blaikie	065bb08bb8	NFC: DWARFTypePrinter: Remove "type" from member function names to reduce redundancy	2021-09-15 14:46:28 -07:00
David Blaikie	4cabaf594a	NFC: DebugInfo: refactor pretty printing into a utility class Laying more foundation for full template name rebuilding - more complex type printing benefits from an object to carry some state rather than passing it around as parameters to every function.	2021-09-14 15:54:29 -07:00
Alexey Lapshin	3493540830	[DebugInfo][NFC] Erase capacity in DWARFUnit::clearDIEs(). DWARFUnit::clearDIEs() uses std::vector::shrink_to_fit() to make capacity of DieArray matched with its size(). The shrink_to_fit() is not binding request to make capacity match with size(). Thus the memory could still be reserved after DWARFUnit::clearDIEs() is called. This patch erases capacity when DWARFUnit::clearDIEs() is requested. So the memory occupied by dies would be freed. Differential Revision: https://reviews.llvm.org/D109499	2021-09-10 10:07:28 +03:00
Greg Clayton	14850a0628	Log to the right stream in DwarfTransformer::handleDie(). Since we might end up using multiple threads when logging information in the DWARFTransformer, the handleDie() method must use the supplied stream named "OS" when logging warnings and errors. When we use multiple threads, we log to a thread specific stream buffer and then use a mutex to ensure our output doesn't overlap when we emit warnings and errors after a thread is done. Differential Revision: https://reviews.llvm.org/D109401	2021-09-08 14:00:19 -07:00
David Blaikie	bc066e26c9	DebugInfo: Fix a few bot failures for type dumping fixes	2021-09-03 14:08:58 -07:00
David Blaikie	40f1593558	DebugInfo: Correct/improve type formatting (pointers to function types especially) This does add some extra superfluous whitespace (eg: "int *") intended to make the Simplified Template Names work easier - this makes the DIE-based names match more exactly the clang-generated names, so it's easier to identify cases that don't generate matching names. (arguably we could change clang to skip that whitespace or add some fuzzy matching to accommodate differences in certain whitespace - but this seemed easier and fairly low-impact)	2021-09-03 12:22:28 -07:00
Alexander Yermolovich	779d24e151	[DWARF] Find offset of attribute. This is used by BOLT to do patching of DebugInfo section, and Line Table. Directly by using find, and through getAttrFieldOffsetForUnit. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D107874	2021-09-01 11:49:55 -07:00
Greg Clayton	a58c2e4af0	Fix DWARFDie::getDeclFile(...) to work with DW_AT_specification. DWARFDie::getDeclFile(...) previously only supported getting the DW_AT_decl_file if the DIE itself contained the DW_AT_decl_file attribute, or if the DIE had a DW_AT_abstract_origin that pointed to another DIE that had a DW_AT_decl_file. This patch allows the function to get the right attribute value if there is a DW_AT_specification that points to another DIE. We also test that if a DW_AT_abtract_origin or DW_AT_specification points to a DIE in another CU with a DW_FORM_ref_addr, that the right line table is used to extract the file index. Full tests were added for the following cases: - DIE has a DW_AT_decl_file attribute - DIE has a DW_AT_abtract_origin that points to another die in the same CU - DIE has a DW_AT_abtract_origin that points to another die in another CU - DIE has a DW_AT_specification that points to another die in the same CU - DIE has a DW_AT_specification that points to another die in another CU Differential Revision: https://reviews.llvm.org/D108480	2021-08-23 15:43:18 -07:00
Greg Clayton	e100a41bbe	Fix fallback code that gets decl file + line. When a function has no line table, but does have debug info (DW_TAG_subprogram), we fall back to creating a line table with a single line entry that has the start address of the function and the source file and line of the function declaration. The bug in this code was that we might have a DW_TAG_subprogram that uses a DW_AT_specification or DW_AT_abstract_origin that points to another DIE, and that DIE might be in another compile unit. The bug was we were grabbing the file index value from the DIE, and that index could be from the other DIE in another compile unit that has its own and compleltely different file table, so we might be using a file index from one compile unit with the file table from another. This was causing a crash in llvm-gsymuil when run against dSYM files. dsymutil, the Apple DWARF linker, will often unique types and can end up with more absolute references across different compile units. The fix is to use the DWARFDie::getDeclFile(...) accessor as it does fetch this information correctly. Differential Revision: https://reviews.llvm.org/D108497	2021-08-23 11:06:15 -07:00
Alexey Lapshin	07d44cc0b1	[DWARF][Verifier] Do not add child DieRangeInfo with empty address range to the parent. verifyDieRanges function checks for the intersected address ranges. It adds child DieRangeInfo into parent DieRangeInfo to check whether children have overlapping address ranges. It is safe to not add DieRangeInfo with empty address range into parent's children list. This decreases the number of children which should be navigated and as a result decreases execution time(parents having a lot of children with empty ranges spend much time navigating them). For this command: "llvm-dwarfdump --verify clang-repl" execution time decreased from 220 sec till 75 sec. Differential Revision: https://reviews.llvm.org/D107554	2021-08-22 19:39:21 +03:00
Kazu Hirata	49d7b2beae	[DWARF] Remove parseListTableHeader (NFC) The last use was removed on Oct 4, 2020 in commit `6d0be74af5`.	2021-08-19 23:34:22 -07:00
Alexey Lapshin	ab9d506be3	[DWARF][Verifier][NFC] Use reference to DWARFAddressRangesVector to avoid copying. Avoid copying while access to RangesOrError.get().	2021-08-19 16:23:05 +03:00
David Blaikie	62a4c2c10e	DWARFVerifier: Check section-relative references at the end of the section This ensures that debug_types references aren't looked for in debug_info section. Behavior is still going to be questionable in an unlinked object file - since cross-cu references could refer to symbols in another .debug_info (or, in theory, .debug_types) chunk - but if a producer only uses ref_addr to refer to things within the same .debug_info chunk in an object file (eg: whole program optimization/LTO - producing two CUs into a single .debug_info section in an object file - the ref_addrs there could be resolved relative to that .debug_info chunk, not needing to consider comdat (DWARFv5 type units or other creatures) chunks of .debug_info, etc)	2021-08-15 11:40:24 -07:00
Kazu Hirata	63c566b1fd	[DWARF] Remove extractFast (NFC) The last use was removed on Dec 13, 2016 in commit `c8c1032c0c`. This patch repurposes the function comment for the other variant of extractFast.	2021-08-11 09:55:00 -07:00
Christopher Di Bella	c874dd5362	[llvm][clang][NFC] updates inline licence info Some files still contained the old University of Illinois Open Source Licence header. This patch replaces that with the Apache 2 with LLVM Exception licence. Differential Revision: https://reviews.llvm.org/D107528	2021-08-11 02:48:53 +00:00
Kazu Hirata	d9c9d13365	[DWARF] Remove collectChildrenAddressRanges (NFC) The last use was removed on Dec 21, 2018 in commit `c3f30a7fc6`.	2021-08-08 08:57:32 -07:00
Alexander Yermolovich	5a865b0b1e	[DWARF] Don't process .debug_info relocations for DWO Context When we build with split dwarf in single mode the .o files that contain both "normal" debug sections and dwo sections, along with relocaiton sections for "normal" debug sections. When we create DWARF context in DWARFObjInMemory we process relocations and store them in the map for .debug_info, etc section. For DWO Context we also do it for non dwo dwarf sections. Which I believe is not necessary. This leads to a lot of memory being wasted. We observed 70GB extra memory being used. I went with context sensitive approach, flag is passed in. I am not sure if it's always safe not to process relocations for regular debug sections if Obj contains .dwo sections. If it is alternatvie might be just to scan, in constructor, sections and if there are .dwo sections not to process regular debug ones. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D106624	2021-08-02 10:41:47 -07:00
Esme-Yi	14f6cfcf3c	[Debug-Info][llvm-dwarfdump] Don't try to dump location list for attributes that don't have the loclist class. Summary: The overflow error occurs when we try to dump location list for those attributes that do not have the loclist class, like DW_AT_count and DW_AT_byte_size. After re-reviewed the entire list, I sorted those attributes into two parts, one for dumping location list and one for dumping the location expression. Reviewed By: probinson Differential Revision: https://reviews.llvm.org/D105613	2021-07-27 07:28:59 +00:00
Matheus Izvekov	f84c70a379	[CodeView] Saturate values bigger than supported by APInt. This fixes an assert firing when compiling code which involves 128 bit integrals. This would trigger runtime checks similar to this: ``` Assertion failed: getMinSignedBits() <= 64 && "Too many bits for int64_t", file llvm/include/llvm/ADT/APInt.h, line 1646 ``` To get around this, we just saturate those big values. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D105320	2021-07-26 22:15:26 +02:00
Simon Pilgrim	9277ce7932	[DebugInfo] Remove unnecessary <string> include dependency from DebugInfo headers. NFC. At most these use the StringRef/Twine wrappers and don't have any implicit uses of std::string. Move the include down to any cpp implementation where std::string is actually used.	2021-07-17 16:56:06 +01:00
Simon Giesecke	a12000e428	Reformat files. Differential Revision: https://reviews.llvm.org/D105982	2021-07-16 07:39:21 +00:00
David Blaikie	ea91749f01	DebugInfo: Use debug_rnglists.dwo for ranges in debug_info.dwo when parsing DWARFv5 This call would incorrectly overwrite (with the .debug_rnglists.dwo from the executable, if there was one) the rnglists section instead of the correct value (from the .debug_rnglists.dwo in the .dwo file) that's applied in DWARFUnit::tryExtractDIEsIfNeeded	2021-07-12 18:15:09 -07:00
David Blaikie	b447b9dce0	Reapply "llvm-symbolizer: Fix "start file" to work with Split DWARF" Originally committed as `04c203e310` Reverted in `768510632c` due to the test failing when encountering windows directory separators. Fix the path separator platform issue with a FileCheck pattern {{[/\\]}} Original commit message: A followup to the feature added in `69da27c749` that added the optional "start file name" to match "start line" - but this didn't work with Split DWARF because of the need for the decl file number resolution code to refer back to the skeleton unit to find its .debug_line contribution. So this patch adds the necessary infrastructure to track the skeleton unit corresponding to a split full unit for the purpose of this lookup.	2021-07-10 18:50:55 -07:00
Nico Weber	768510632c	Revert "llvm-symbolizer: Fix "start file" to work with Split DWARF" This reverts commit `04c203e310`. Test fails on Windows.	2021-07-10 13:35:05 -04:00
David Blaikie	04c203e310	llvm-symbolizer: Fix "start file" to work with Split DWARF A followup to the feature added in `69da27c749` that added the optional "start file name" to match "start line" - but this didn't work with Split DWARF because of the need for the decl file number resolution code to refer back to the skeleton unit to find its .debug_line contribution. So this patch adds the necessary infrastructure to track the skeleton unit corresponding to a split full unit for the purpose of this lookup.	2021-07-09 18:31:32 -07:00
Martin Storsjö	d20b013b49	[COFF] [CodeView] Add a few new enum values These are undocumented, but are visible in the SDK headers since some versions ago. Differential Revision: https://reviews.llvm.org/D105513	2021-07-07 22:00:18 +03:00
Soham Dixit	51d969dc27	[DebugInfo] Bug 41152 - Improve dumping of empty location expressions Fixes PR41152 (https://bugs.llvm.org/show_bug.cgi?id=41152). Reviewed by: jhenderson, dblaikie, SouraVX Differential Revision: https://reviews.llvm.org/D103502	2021-06-29 09:21:00 +01:00
Greg Clayton	ab546ead3b	Fix a case where multiple symbols with zero size would cause duplicate entries in gsym files. Symbol tables can have symbols with no size in mach-o files that were failing to get combined into a single entry. This resulted in many duplicate entries for the same address and made gsym files larger. Differential Revision: https://reviews.llvm.org/D105068	2021-06-28 18:26:26 -07:00
Jan Kratochvil	c19a28919f	llvm-dwarfdump: Print warnings on invalid DWARF llvm-dwarfdump was silent even when the format of DWARF was invalid and/or llvm-dwarfdump did not understand/support some of the constructs. This can be pretty confusing as llvm-dwarfdump is a tool for DWARF producers+consumers development. Review comments also by @dblaikie. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D104271	2021-06-27 11:38:35 +02:00
Martin Storsjö	42f74e8249	[llvm] Rename StringRef _lower() method calls to _insensitive() This is a mechanical change. This actually also renames the similarly named methods in the SmallString class, however these methods don't seem to be used outside of the llvm subproject, so this doesn't break building of the rest of the monorepo.	2021-06-25 00:22:01 +03:00
CarlosAlbertoEnciso	d0a5d86119	[Debug-Info][CodeView] Fix GUID string generation for MSVC generated objects. This patch is to address https://bugs.llvm.org/show_bug.cgi?id=50459. YAML:455:28: error: GUID strings are 38 characters long The valid format for a GUID is {XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX} where X is a hex digit (0,1,2,3,4,5,6,7,8,9,A,B,C,D,E,F). The length of the individual components must be: 8, 4, 4, 4, 12. For some cases, the converted string generated by obj2yaml, does not comply with those lengths. yaml2obj checks that the GUID string must be 38 characters including the dashes and braces. Reviewed By: amccarth Differential Revision: https://reviews.llvm.org/D103089	2021-06-15 06:53:21 +01:00
RamNalamothu	167e7afcd5	Implement DW_CFA_LLVM_* for Heterogeneous Debugging Add support in MC/MIR for writing/parsing, and DebugInfo. This is part of the Extensions for Heterogeneous Debugging defined at https://llvm.org/docs/AMDGPUDwarfExtensionsForHeterogeneousDebugging.html Specifically the CFI instructions implemented here are defined at https://llvm.org/docs/AMDGPUDwarfExtensionsForHeterogeneousDebugging.html#cfa-definition-instructions Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D76877	2021-06-14 08:51:50 +05:30
Simon Pilgrim	35a12023f3	DWARFDebugFrame.cpp - remove unused <string> include. NFCI.	2021-06-13 15:19:24 +01:00
Amir Ayupov	8ec73e96b7	[ELF] getRelocatedSection: remove the check for ET_REL object file getRelocatedSection interface should not check that the object file is relocatable, as executable files may have relocations preserved with `--emit-relocs` linker flag. The relocations are useful in context of post-link binary analysis for function reference identification. For example, BOLT relies on relocations to perform function reordering. Reviewed By: MaskRay, jhenderson Differential Revision: https://reviews.llvm.org/D102296	2021-06-07 13:17:00 -07:00
Simon Giesecke	5f2d4b23b4	Add --quiet option to llvm-gsymutil to suppress output of warnings. Differential Revision: https://reviews.llvm.org/D102829	2021-05-27 12:36:34 +00:00
Alex Orlov	752385b128	Add support for DWARF embedded source to llvm-symbolizer. This patch adds DWARF embedded source printout to llvm-symbolizer. Reviewed By: jhenderson, dblaikie Differential Revision: https://reviews.llvm.org/D102355	2021-05-20 21:40:28 +04:00
Simon Giesecke	81b2fcf26f	Use a non-recursive mutex in GsymCreator. There doesn't seem to be a need to support recursive locking, and a recursive mutex is unnecessarily inefficient. Differential Revision: https://reviews.llvm.org/D102486	2021-05-19 10:06:47 +00:00
Simon Giesecke	4ea4d9c066	Move FunctionInfo in addFunctionInfo rather than copying. Differential Revision: https://reviews.llvm.org/D102485	2021-05-19 10:06:47 +00:00
Simon Giesecke	f29c4c6097	Avoid calculating the string hash twice in GsymCreator::insertString. Do the single hash calculation before acquiring the lock, to reduce lock contention. If Copy is true, and the string was not yet contained in the StringStorage, use the new address from StringStorage, but reuse the hash we already calculated. Differential Revision: https://reviews.llvm.org/D102484	2021-05-19 10:06:47 +00:00
Simon Giesecke	e102fd50f9	Reformat GSYMCreator.cpp Differential Revision: https://reviews.llvm.org/D102483	2021-05-19 10:06:47 +00:00
Alex Orlov	4fedb3a613	[symbolizer] Added StartAddress for the resolved function. In many cases it is helpful to know at what address the resolved function starts. This patch adds a new StartAddress member to the DILineInfo structure. Reviewed By: jhenderson, dblaikie Differential Revision: https://reviews.llvm.org/D102316	2021-05-19 02:38:13 +04:00
Reid Kleckner	ac2226b0f5	[PDB] Improve error handling when writes fail Handle PDB writing errors like any other error in LLD: emit an error and continue. This allows the linker to print timing data and summary data after linking, which can be helpful for finding PDB size problems. Also report how large the file would have been. Example output: lld-link: error: Output data is larger than 4 GiB. File size would have been 6,937,108,480 lld-link: error: failed to write PDB file ./chrome.dll.pdb Summary -------------------------------------------------------------------------------- 33282 Input OBJ files (expanded from all cmd-line inputs) 4 PDB type server dependencies 0 Precomp OBJ dependencies 33396931 Input type records ... snip ... Input File Reading: 59756 ms ( 45.5%) GC: 7500 ms ( 5.7%) ICF: 3336 ms ( 2.5%) Code Layout: 6329 ms ( 4.8%) PDB Emission (Cumulative): 46192 ms ( 35.2%) Add Objects: 27609 ms ( 21.0%) Type Merging: 16740 ms ( 12.8%) Symbol Merging: 10761 ms ( 8.2%) Publics Stream Layout: 9383 ms ( 7.1%) TPI Stream Layout: 1678 ms ( 1.3%) Commit to Disk: 3461 ms ( 2.6%) -------------------------------------------------- Total Link Time: 131244 ms (100.0%) Differential Revision: https://reviews.llvm.org/D102713	2021-05-18 13:17:17 -07:00
Alex Orlov	88a8965a7d	NFC. Refactored DIPrinter for support embedded source. This patch introduces source loading and pruning functions. It will allow to use the DWARF embedded source and use the same code for JSON printout. No functional changes. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D102539	2021-05-15 23:01:12 +04:00
Greg Clayton	e5bdacba2e	Optimize GSymCreator::finalize. The algorithm removing duplicates from the Funcs list used to have amortized quadratic time complexity because it was potentially removing each entry using std::vector::erase individually. This patch is now using a erase-remove idiom with an adapted removeIfBinary algorithm. Probably this was made under the assumption that these removals are rare, but there are cases where the case of duplicate entries is occurring frequently. In these cases, the actual runtime was very poor, taking hours to process a single binary of around 1 GiB size including debug info. Another factor contributing to that is the frequent output of the warning, which is now removed. It seems this is particularly an issue with GCC-compiled binaries, rather than clang-built binaries. Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D102219	2021-05-12 15:18:07 -07:00
Vitaly Buka	85a96d82ca	[symbolizer] Fix leak after D96883	2021-05-11 22:51:36 -07:00
Alex Orlov	05d1ae4e18	* Add support for JSON output style to llvm-symbolizer This patch adds JSON output style to llvm-symbolizer to better support CLI automation by providing a machine readable output. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D96883	2021-05-11 13:10:54 +04:00
RamNalamothu	223852d76f	[DebugInfo] UnwindTable::create() should not add empty rows to CFI unwind table UnwindTable::parseRows() may return successfully if the CFIProgram has either no CFI instructions or only DW_CFA_nop instructions and the UnwindRow return argument will be empty. But currently, the callers are not checking for this case which is leading to incorrect dumps in the unwind tables in such cases i.e. CFA=unspecified Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D101892	2021-05-08 10:19:02 +05:30
Brock Wyma	4f8d585fbf	[CodeView] Truncate Long Type Names With An MD5 Hash Replace long CodeView type names with an MD5 hash of the name. Differential Revision: https://reviews.llvm.org/D97881	2021-05-04 10:51:21 -04:00
Alok Kumar Sharma	5a26345fe2	[DebugInfo][llvm-dwarfdump] Fix printing of Unit header with DW_UT_partial type llvm-dwarfdump crashed for Unit header with DW_UT_partial type. ------------- llvm-dwarfdump: /tmp/llvm/include/llvm/ADT/Optional.h:197: T& llvm::optional_detail::OptionalStorage<T, true>::getValue() & [with T = long unsigned int]: Assertion `hasVal' failed. PLEASE submit a bug report to the technical support section of https://developer.amd.com/amd-aocc and include the crash backtrace. Stack dump: 0. Program arguments: llvm-dwarfdump -v /tmp/test/DebugInfo/X86/Output/dwarfdump-he ader.s.tmp.o #0 0x00007f37d5ad8838 llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) /tmp/llvm/lib/Support/Unix/Signals.inc:565:0 #1 0x00007f37d5ad88ef PrintStackTraceSignalHandler(void) /tmp/llvm/lib/Support/Unix/Signals.inc:632:0 #2 0x00007f37d5ad65bd llvm::sys::RunSignalHandlers() /tmp/llvm/lib/Support/Signals.cpp:71:0 #3 0x00007f37d5ad81b9 SignalHandler(int) /tmp/llvm/lib/Support/Unix/Signals.inc:407:0 #4 0x00007f37d4c26040 (/lib/x86_64-linux-gnu/libc.so.6+0x3f040) #5 0x00007f37d4c25fb7 raise /build/glibc-S9d2JN/glibc-2.27/signal/../sysdeps/unix/sysv/linux/raise.c:51:0 #6 0x00007f37d4c27921 abort /build/glibc-S9d2JN/glibc-2.27/stdlib/abort.c:81:0 #7 0x00007f37d4c1748a __assert_fail_base /build/glibc-S9d2JN/glibc-2.27/assert/assert.c:89:0 #8 0x00007f37d4c17502 (/lib/x86_64-linux-gnu/libc.so.6+0x30502) #9 0x00007f37d7576b81 llvm::optional_detail::OptionalStorage<unsigned long, true>::getValue() & /tmp/llvm/include/llvm/ADT/Optional.h:198:0 #10 0x00007f37d75726ac llvm::Optional<unsigned long>::operator() && /tmp/llvm/include/llvm/ADT/Optional.h:309:0 #11 0x00007f37d7582968 llvm::DWARFCompileUnit::dump(llvm::raw_ostream&, llvm::DIDumpOptions) /tmp/llvm/lib/DebugInfo/DWARF/DWARFCompileUnit.cpp:30:0 -------------- Patch by: @jini.susan Reviewed By: @probinson Differential Revision: https://reviews.llvm.org/D101255	2021-04-27 10:32:44 +05:30
Alex Orlov	49cbf4cd85	Fix bug in .eh_frame/.debug_frame PC offset calculation for DW_EH_PE_pcrel This fixes the following bugs: https://bugs.llvm.org/show_bug.cgi?id=27249 https://bugs.llvm.org/show_bug.cgi?id=46414 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D100328	2021-04-15 15:06:20 +04:00
Alexander Yermolovich	b7459a10da	[DWARF] Fix crash for DWARFDie::dump. When DIE is extracted manually, the DieArray is empty. When dump is invoked on aforementioned DIE it tries to extract child, even if Dump options say otherwise. Resulting in crash. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D99698	2021-04-14 18:46:34 -07:00
Martin Storsjö	45f8946a75	[CodeView] Fix the ARM64 CPUType enum The old, incorrect one seems to have been added in `d41ac895bb`, with a similarly placed entry added in EnumTables.cpp in `eb4d6142dc`. This matches the value documented at https://docs.microsoft.com/en-us/visualstudio/debugger/debug-interface-access/cv-cpu-type-e?view=vs-2019. This fixes running obj2yaml on an object file generated by MSVC. Differential Revision: https://reviews.llvm.org/D100306	2021-04-13 12:54:22 +03:00
Alex Orlov	f47a4c0713	[lld] Fixed CodeView GuidAdapter::format to handle GUID bytes in the right order. This fixes https://bugs.llvm.org/show_bug.cgi?id=41712 bug. Reviewed By: aganea Differential Revision: https://reviews.llvm.org/D99978	2021-04-09 05:29:14 +04:00
Alex Orlov	5f57793c4f	* NFC. Refactored DIPrinter for better support of new print styles. This patch introduces a DIPrinter interface to implement by different output style printer implementations. DIPrinterGNU and DIPrinterLLVM implement the GNU and LLVM output style printing respectively. No functional changes. This refactoring clarifies and simplifies the code, and makes a new output style addition easier. Reviewed By: jhenderson, dblaikie Differential Revision: https://reviews.llvm.org/D98994	2021-04-05 15:40:41 +04:00
Abhina Sreeskantharajan	c83cd8feef	[NFC] Reordering parameters in getFile and getFileOrSTDIN In future patches I will be setting the IsText parameter frequently so I will refactor the args to be in the following order. I have removed the FileSize parameter because it is never used. ``` static ErrorOr<std::unique_ptr<MemoryBuffer>> getFile(const Twine &Filename, bool IsText = false, bool RequiresNullTerminator = true, bool IsVolatile = false); static ErrorOr<std::unique_ptr<MemoryBuffer>> getFileOrSTDIN(const Twine &Filename, bool IsText = false, bool RequiresNullTerminator = true); static ErrorOr<std::unique_ptr<MB>> getFileAux(const Twine &Filename, uint64_t MapSize, uint64_t Offset, bool IsText, bool RequiresNullTerminator, bool IsVolatile); static ErrorOr<std::unique_ptr<WritableMemoryBuffer>> getFile(const Twine &Filename, bool IsVolatile = false); ``` Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D99182	2021-03-25 09:47:49 -04:00
Nico Weber	605a503f35	[lld-link] emit an error when writing a PDB > 4 GiB Maybe there's a way to make them work, but until I've investigated if tools can consume large PDBs, erroring out is better than slowly and silently consuming all available ram due to internal invariants being violated. (Patch to make writing larger files work at https://bugs.chromium.org/p/chromium/issues/detail?id=1179085#c25 but I haven't had time to check if windbg & co can consume these large PDBs. llvm-pdbutil can't, but we can fix that one at least :) ) Differential Revision: https://reviews.llvm.org/D98788	2021-03-17 15:15:08 -04:00
Petr Hosek	9466f9b434	[CMake] Clean up unnecessary dependency The LINK_COMPONENTS dependency between DebugInfoCodeView and DebugInfoMSF is unnecessary. Breaking them would allow a more fine-controlled distribution. Patch By: dangyi Differential Revision: https://reviews.llvm.org/D98465	2021-03-15 16:29:16 -07:00
Alexander Yermolovich	51504bc1d9	[DWARF] Check for AddrOffsetSectionBase to work with DWO Units. Context: https://lists.llvm.org/pipermail/llvm-dev/2021-February/148521.html A fix for llvm-symbolizer, and other tools like BOLT, that allows retrieving address when built with -gsplit-dwarf=single mode. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D96827	2021-03-15 14:46:09 -07:00
Reid Kleckner	b69db4a7ab	Re-land "[PDB] Defer relocating .debug$S until commit time and parallelize it" This reverts commit `bacf9cf2c5` and reinstates commit `1a9bd5b813`. Reverting this commit did not appear to make the problem go away, so we can go ahead and reland it.	2021-03-10 15:14:09 -08:00
Alex Orlov	df6d0579e1	Fix a crash in DWARFUnit::getInlinedChainForAddress in case of unexpected DWARF information. In some cases a broken or invalid debug info could cause a crash in DWARFUnit::getInlinedChainForAddress during parsing a chain of in-lined functions. This patch fixes this issue. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D98119	2021-03-09 14:20:27 +04:00
Jan Kratochvil	4289a7f1d7	llvm-dwarfdump: Fix DWARF-5 DW_FORM_implicit_const (used by GCC) Differential Revision: https://reviews.llvm.org/D98195	2021-03-09 09:26:58 +01:00
Jan Kratochvil	ba8907bf6f	[nfc] llvm-dwarfdump: DWARFAbbreviationDeclaration::AttributeSpec -> DWARFAttribute `AttributeSpec` does not contain values while `DWARFAttribute` already does. Therefore one no longer needs to pass `uint64_t *OffsetPtr`. Differential Revision: https://reviews.llvm.org/D98194	2021-03-09 08:31:23 +01:00
Rafael Auler	7a3e664db5	[DebugInfo] Do not error on unsupported CIE version D81469 introduced a check to error on CIE version different than 1 for eh_frame, but older compilers mistakenly create binaries with this version set to 3 for DWARF4 or 4 to DWARF5. Move the check to dump time instead of eh_frame parse time, so we can be tolerant with older binaries. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D97830	2021-03-08 19:39:08 -08:00
Nico Weber	301551ae8e	[PDB] Fix unsigned integer overflow When building with -fsanitize=unsigned-integer-overflow, this code causes a diagnostic like: ../../llvm/lib/DebugInfo/PDB/Native/GSIStreamBuilder.cpp:159:15: runtime error: unsigned integer overflow: 90 - 229 cannot be represented in type 'unsigned long' unsigned integer overflow is well defined and it isn't an issue in practice, but in obscure scenarios (S1.size() small, S2.size over 2GB on 32-bit systems) it could even be a bug. So use the usual idiom for implementing cmp functions instead of the gernally considered buggy idiom :) See e.g. https://www.gnu.org/software/libc/manual/html_node/Comparison-Functions.html or https://stackoverflow.com/questions/10996418/efficient-integer-compare-function/10997428#10997428 Differential Revision: https://reviews.llvm.org/D97557	2021-02-26 11:26:53 -05:00
Fangrui Song	a7ceef9254	DebugInfo/Symbolize: Exclude ARM mapping symbols for .symtab symbolization after D95916 Their names don't convey much information, so they should be excluded. The behavior matches addr2line. Differential Revision: https://reviews.llvm.org/D96617	2021-02-12 11:04:20 -08:00
Scott Linder	12999d749d	[Symbolize] Teach symbolizer to work directly on object file. This patch intended to provide additional interface to LLVMsymbolizer such that they work directly on object files. There is an existing method - symbolizecode which takes an object file, this patch provides similar overloads for symbolizeInlinedCode, symbolizeData, symbolizeFrame. This can be useful for clients who already have a in-memory object files to symbolize for. Patch By: pvellien (praveen velliengiri) Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D95232	2021-02-12 18:26:24 +00:00
Fangrui Song	0fd7c31a09	DebugInfo/Symbolize: Use stable_sort This fixes coff-dwarf.test on some build bots. The test relies on the sort order and prefers main (StorageClass: External) to .text (StorageClass: Static).	2021-02-11 22:53:56 -08:00
Fangrui Song	92ee3dd95d	DebugInfo/Symbolize: Don't differentiate function/data symbolization Before `d08bd13ac8`, only `SymbolRef::ST_Function` symbols were used for .symtab symbolization. That commit added a `"DATA"` mode to llvm-symbolizer which used `SymbolRef::ST_Data` symbols for symbolization. Since function and data symbols have different addresses, we don't need to differentiate the two modes. This patches unifies the two modes to simplify code. `"DATA"` is used by `compiler-rt/lib/sanitizer_common/sanitizer_symbolizer_libcdep.cpp`. `check-hwasan` and `check-tsan` have runtime tests. Differential Revision: https://reviews.llvm.org/D96322	2021-02-11 19:22:44 -08:00
Adrian Prantl	97dbab8797	llvm-dwarfdump: fix the counting when printing DW_OP_entry_value The block size is in bytes, and not number of operands. Differential Revision: https://reviews.llvm.org/D96472	2021-02-11 11:17:04 -08:00
Fangrui Song	04a2e12612	DebugInfo/Symbolize: Retrieve filename from the preceding STT_FILE for .symtab symbolization The ELF spec says: > STT_FILE: Conventionally, the symbol's name gives the name of the source file associated with the object file. A file symbol has STB_LOCAL binding, its section index is SHN_ABS, and it precedes the other STB_LOCAL symbols for the file, if it is present. For a local symbol, the preceding STT_FILE symbol is almost always in the same file[1]. GNU addr2line uses this heuristic to retrieve the filename associated with a local symbol (e.g. internal linkage functions in C/C++). GNU addr2line can assign STT_FILE filename to a non-local symbol, too, but the trick only works if no regular symbol precede STT_FILE. This patch does not implement this corner case (not useful for most executables which have more than one files). In case of filename mismatch between .debug_line & .symtab, arbitrarily make .debug_line win. [1]: LLD does not synthesize STT_FILE symbols (https://bugs.llvm.org/show_bug.cgi?id=48023 see also https://sourceware.org/bugzilla/show_bug.cgi?id=26822). An assembly file without `.file` directives can cause mis-attribution. This is an edge case. Differential Revision: https://reviews.llvm.org/D95927	2021-02-10 09:47:10 -08:00
Fangrui Song	6d766c8bf9	DebugInfo/Symbolize: Allow STT_NOTYPE/STT_GNU_IFUNC symbols for .symtab symbolization In assembly files, omitting `.type foo,@function` is common. Such functions have type `STT_NOTYPE` and llvm-symbolizer reports `??` for them. An ifunc symbol usually has an associated resolver symbol which is defined at the same address. Returning either one is fine for symbolization. The resolver symbol may not end up in the symbol table if (object file) `.L` is used (linked image) .symtab is stripped while .dynsym is retained. This patch allows ELF STT_NOTYPE/STT_GNU_IFUNC symbols for .symtab symbolization. I have left TODO in the test files for an unimplemented STT_FILE heuristic. Differential Revision: https://reviews.llvm.org/D95916	2021-02-08 12:29:11 -08:00
Wouter van Oortmerssen	e3c0b0fe09	[WebAssembly] locals can now be indirect in DWARF This for example to indicate that byval args are represented by a pointer to a struct. Followup to https://reviews.llvm.org/D94140 Differential Revision: https://reviews.llvm.org/D94347	2021-02-05 11:14:42 -08:00
Yang Fan	e5f258cb03	[NFC][DebugInfo] Fix Wreturn-type gcc warning GCC warning: ``` /llvm-project/llvm/lib/DebugInfo/DWARF/DWARFDebugFrame.cpp: In member function ‘llvm::Expected<long unsigned int> llvm::dwarf::CFIProgram::Instruction::getOperandAsUnsigned(const llvm::dwarf::CFIProgram&, uint32_t) const’: /llvm-project/llvm/lib/DebugInfo/DWARF/DWARFDebugFrame.cpp:425:1: warning: control reaches end of non-void function [-Wreturn-type] 425 \| } \| ^ /llvm-project/llvm/lib/DebugInfo/DWARF/DWARFDebugFrame.cpp: In member function ‘llvm::Expected<long int> llvm::dwarf::CFIProgram::Instruction::getOperandAsSigned(const llvm::dwarf::CFIProgram&, uint32_t) const’: /llvm-project/llvm/lib/DebugInfo/DWARF/DWARFDebugFrame.cpp:477:1: warning: control reaches end of non-void function [-Wreturn-type] 477 \| } \| ^ ```	2021-01-29 11:42:23 +08:00
Greg Clayton	f8122d3532	Add the ability to extract the unwind rows from DWARF Call Frame Information. This patch adds the ability to evaluate the state machine for CIE and FDE unwind objects and produce a UnwindTable with all UnwindRow objects needed to unwind registers. It will also dump the UnwindTable for each CIE and FDE when dumping DWARF .debug_frame or .eh_frame sections in llvm-dwarfdump or llvm-objdump. This allows users to see what the unwind rows actually look like for a given CIE or FDE instead of just seeing a list of opcodes. This patch adds new classes: UnwindLocation, RegisterLocations, UnwindRow, and UnwindTable. UnwindLocation is a class that describes how to unwind a register or Call Frame Address (CFA). RegisterLocations is a class that tracks registers and their UnwindLocations. It gets populated when parsing the DWARF call frame instruction opcodes for a unwind row. The registers are mapped from their register numbers to the UnwindLocation in a map. UnwindRow contains the result of evaluating a row of DWARF call frame instructions for the CIE, or a row from a FDE. The CIE can produce a set of initial instructions that each FDE that points to that CIE will use as the seed for the state machine when parsing FDE opcodes. A UnwindRow for a CIE will not have a valid address, whille a UnwindRow for a FDE will have a valid address. The UnwindTable is a class that contains a sorted (by address) vector of UnwindRow objects and is the result of parsing all opcodes in a CIE, or FDE. Parsing a CIE should produce a UnwindTable with a single row. Parsing a FDE will produce a UnwindTable with one or more UnwindRow objects where all UnwindRow objects have valid addresses. The rows in the UnwindTable will be sorted from lowest Address to highest after parsing the state machine, or an error will be returned if the table isn't sorted. To parse a UnwindTable clients can use the following methods: static Expected<UnwindTable> UnwindTable::create(const CIE Cie); static Expected<UnwindTable> UnwindTable::create(const FDE Fde); A valid table will be returned if the DWARF call frame instruction opcodes have no encoding errors. There are a few things that can go wrong during the evaluation of the state machine and these create functions will catch and return them. Differential Revision: https://reviews.llvm.org/D89845	2021-01-28 13:39:17 -08:00
Reid Kleckner	bacf9cf2c5	Revert "[PDB] Defer relocating .debug$S until commit time and parallelize it" This reverts commit `1a9bd5b813`. I suspect that this patch may have caused https://crbug.com/1171438.	2021-01-28 13:17:27 -08:00
David Blaikie	4318028cd2	DebugInfo: Add a DWARF FORM extension for addrx+offset references to reduce relocations This is an alternative to the use of complex DWARF expressions for addresses - shaving off a few extra bytes of expression overhead.	2021-01-28 10:20:02 -08:00
Simon Pilgrim	7396f720f9	[DebugInfo] Remove some unused includes. NFCI. Mainly removing a lot of <vector> includes from files that don't explicitly use std::vector	2021-01-28 11:21:35 +00:00
Kazu Hirata	0da15ea581	[llvm] Use append_range (NFC)	2021-01-27 23:25:41 -08:00
Kazu Hirata	16baad8f4e	[llvm] Use pop_back_val (NFC)	2021-01-24 12:18:57 -08:00
Kazu Hirata	551aaa24af	[llvm] Use isDigit (NFC)	2021-01-21 19:59:50 -08:00
Kazu Hirata	6de4865545	[llvm] Use hasSingleElement (NFC)	2021-01-20 21:35:55 -08:00
Reid Kleckner	1a9bd5b813	Reland "[PDB] Defer relocating .debug$S until commit time and parallelize it" This reverts commit `5b7aef6eb4` and relands `6529d7c5a4`. The ASan error was debugged and determined to be the fault of an invalid object file input in our test suite, which was fixed by my last change. LLD's project policy is that it assumes input objects are valid, so I have added a comment about this assumption to the relocation bounds check.	2021-01-20 11:53:43 -08:00
Kazu Hirata	b023cdeacc	[llvm] Use llvm::all_of (NFC)	2021-01-19 20:19:17 -08:00
Mitch Phillips	5b7aef6eb4	Revert "[PDB] Defer relocating .debug$S until commit time and parallelize it" This reverts commit `6529d7c5a4`. Reason: Broke the ASan buildbots. http://lab.llvm.org:8011/#/builders/99/builds/1567	2021-01-19 11:45:48 -08:00
Kazu Hirata	352fcfc697	[llvm] Use llvm::sort (NFC)	2021-01-17 10:39:45 -08:00
Reid Kleckner	6529d7c5a4	[PDB] Defer relocating .debug$S until commit time and parallelize it This is a pretty classic optimization. Instead of processing symbol records and copying them to temporary storage, do a first pass to measure how large the module symbol stream will be, and then copy the data into place in the PDB file. This requires defering relocation until much later, which accounts for most of the complexity in this patch. This patch avoids copying the contents of all live .debug$S sections into heap memory, which is worth about 20% of private memory usage when making PDBs. However, this is not an unmitigated performance win, because it can be faster to read dense, temporary, heap data than it is to iterate symbol records in object file backed memory a second time. Results on release chrome.dll: peak mem: 5164.89MB -> 4072.19MB (-1,092.7MB, -21.2%) wall-j1: 0m30.844s -> 0m32.094s (slightly slower) wall-j3: 0m20.968s -> 0m20.312s (slightly faster) wall-j8: 0m19.062s -> 0m17.672s (meaningfully faster) I gathered similar numbers for a debug, component build of content.dll in Chrome, and the performance impact of this change was in the noise. The memory usage reduction was visible and similar. Because of the new parallelism in the PDB commit phase, more cores makes the new approach faster. I'm assuming that most C++ developer machines these days are at least quad core, so I think this is a win. Differential Revision: https://reviews.llvm.org/D94267	2021-01-12 17:46:29 -08:00
Kazu Hirata	9850d3b10a	[CodeGen, DebugInfo] Use llvm::find_if (NFC)	2021-01-10 09:24:53 -08:00
Simon Pilgrim	028091195d	[DWARF] DWARFDebugLoc::dumpRawEntry - remove dead stores. NFCI. Don't bother zeroing local (unused) variables just before returning. Fixes clang static analyzer warning.	2021-01-07 12:53:28 +00:00
Kazu Hirata	cd088ba7e6	[llvm] Use llvm::lower_bound and llvm::upper_bound (NFC)	2021-01-05 21:15:59 -08:00
Kazu Hirata	0edbc90ec5	[DebugInfo] Use llvm::append_range (NFC)	2021-01-04 11:42:45 -08:00
Nico Weber	c8dedfe269	fix typos to cycle bots	2021-01-01 22:58:40 -05:00
Amy Huang	7e13694ac7	[llvm-symbolizer][Windows] Add start line when searching in line table sections. Fixes issue where if a line section doesn't start with a line number then the addresses at the beginning of the section don't have line numbers. For example, for a line section like this ``` 0001:00000010-00000014, line/column/addr entries = 1 7 00000013 ! ``` a line number wouldn't be found for addresses from 10 to 12. This matches behavior when using the DIA SDK. Differential Revision: https://reviews.llvm.org/D93306	2020-12-17 07:57:36 -08:00
Nico Weber	cf16437e05	fix typos to cycle bots	2020-12-12 20:19:33 -05:00
Georgii Rymar	ffbce65f95	[lib/Object, tools] - Make ELFObjectFile::getELFFile return reference. We always have an object, so we don't have to return a pointer. Differential revision: https://reviews.llvm.org/D92560	2020-12-04 16:02:29 +03:00
Amy Huang	efd1ec0dec	Recommit "[llvm-symbolizer] Switch to using native symbolizer by default on Windows" This reverts commit `1b63177a56`.	2020-11-30 17:36:12 -08:00
Amy Huang	00bbef2bb2	[llvm-symbolizer] Fix native symbolization on windows for inline sites. The existing code handles this correctly and I checked that the code in NativeInlineSiteSymbol also handles this correctly, but it was wrong in the NativeFunctionSymbol code. Differential Revision: https://reviews.llvm.org/D92134	2020-11-30 14:27:35 -08:00
Amy Huang	1b63177a56	Revert "[llvm-symbolizer] Switch to using native symbolizer by default on Windows" Breaks some asan tests on the buildbot. This reverts commit `c74b427cb2`.	2020-11-23 16:29:45 -08:00
Amy Huang	c74b427cb2	[llvm-symbolizer] Switch to using native symbolizer by default on Windows llvm-symbolizer used to use the DIA SDK for symbolization on Windows; this patch switches to using native symbolization, which was implemented recently. Users can still make the symbolizer use DIA by adding the `-dia` flag in the LLVM_SYMBOLIZER_OPTS environment variable. Differential Revision: https://reviews.llvm.org/D91814	2020-11-23 15:57:08 -08:00
Amy Huang	b4902bcd98	[NFC] remove print statement I accidentally added.	2020-11-23 10:51:09 -08:00
Georgii Rymar	9a99d23a1b	[lib/Object] - Generalize the RelocationResolver API. This allows to reuse the RelocationResolver from the code that doesn't want to deal with `RelocationRef` class. I am going to use it in llvm-readobj. See the description of D91530 for more details. Differential revision: https://reviews.llvm.org/D91533	2020-11-20 10:32:49 +03:00
Jorge Gorbe Moya	314a0d73a8	Fix crash after looking up dwo_id=0 in CU index. In the current state, if getFromHash(0) is called and there's no CU with dwo_id=0, the lookup will stop at an empty slot, then the check `Rows[H].getSignature() != S` won't cause the lookup to fail and return a nullptr (as it should), because the empty slot has a 0 in the signature field, and a pointer to the empty slot will be incorrectly returned. This patch fixes this by using the index field in the hash entry to check for empty slots: signature = 0 can match a valid hash but according to the spec the index for an occupied slot will always be non-zero. Differential Revision: https://reviews.llvm.org/D91670	2020-11-19 11:15:01 -08:00
Amy Huang	bc98034040	[llvm-symbolizer] Add inline stack traces for Windows. This adds inline stack frames for symbolizing on Windows. Differential Revision: https://reviews.llvm.org/D88988	2020-11-17 13:19:13 -08:00
serge-sans-paille	9218ff50f9	llvmbuildectomy - replace llvm-build by plain cmake No longer rely on an external tool to build the llvm component layout. Instead, leverage the existing `add_llvm_componentlibrary` cmake function and introduce `add_llvm_component_group` to accurately describe component behavior. These function store extra properties in the created targets. These properties are processed once all components are defined to resolve library dependencies and produce the header expected by llvm-config. Differential Revision: https://reviews.llvm.org/D90848	2020-11-13 10:35:24 +01:00
David Truby	81b96bb6f1	[Aarch64] Fix assumption that Windows implies x86 When compiling for Windows on Arm the amd64 debug interfce from the Visual Studio SDK is used as the cmake currently only distinguishes between x86 and amd64 by checking the pointer size. Instead we can get the target architecture for the compilier and check that to distinguish between architectures.	2020-10-30 12:11:34 +00:00
Amy Huang	7669f3c0f6	Recommit "[CodeView] Emit static data members as S_CONSTANTs." We used to only emit static const data members in CodeView as S_CONSTANTS when they were used; this patch makes it so they are always emitted. This changes CodeViewDebug.cpp to find the static const members from the class debug info instead of creating DIGlobalVariables in the IR whenever a static const data member is used. Bug: https://bugs.llvm.org/show_bug.cgi?id=47580 Differential Revision: https://reviews.llvm.org/D89072 This reverts commit `504615353f`.	2020-10-28 16:35:59 -07:00
Amy Huang	504615353f	Revert "[CodeView] Emit static data members as S_CONSTANTs." Seems like there's an assert in here that we shouldn't be running into. This reverts commit `515973222e`.	2020-10-27 11:29:58 -07:00
Amy Huang	515973222e	[CodeView] Emit static data members as S_CONSTANTs. We used to only emit static const data members in CodeView as S_CONSTANTS when they were used; this patch makes it so they are always emitted. I changed CodeViewDebug.cpp to find the static const members from the class debug info instead of creating DIGlobalVariables in the IR whenever a static const data member is used. Bug: https://bugs.llvm.org/show_bug.cgi?id=47580 Differential Revision: https://reviews.llvm.org/D89072	2020-10-26 15:30:35 -07:00
David Blaikie	0ec5baa132	llvm-dwarfdump: Support verbose printing DW_OP_convert to print the CU local offset before the resolved absolute offset	2020-10-23 18:50:15 -07:00
David Blaikie	a67d164a82	Revert several changes related to llvm-symbolizer exiting non-zero on failure. Seems users have enough different uses of the symbolizer where they might have unknown binaries and offsets such that "best effort" behavior is all that's expected of llvm-symbolizer - so even erroring on unknown executables and out of bounds offsets might not be suitable. This reverts commit `1de0199748`. This reverts commit `a7b209a6d4`. This reverts commit `338dd138ea`.	2020-10-21 15:21:44 -07:00
Luqman Aden	51892a42da	[COFF][ARM] Fix CodeView for Windows on 32bit ARM targets. Create the LLVM / CodeView register mappings for the 32-bit ARM Window targets. Reviewed By: compnerd Differential Revision: https://reviews.llvm.org/D89622	2020-10-19 22:16:16 -07:00
David Blaikie	a7b209a6d4	llvm-symbolizer: Exit non-zero when DWARF parsing errors have been rendered	2020-10-14 23:42:00 -07:00
David Blaikie	9670a45c98	libDebugInfoDWARF: Don't try to parse loclist[.dwo] headers when parsing debug_info[.dwo] There's no way to know whether there's a loclist contribution to parse if there's no loclistx encoding - and if there is one, there's no need to walk back from the loclist_base (or, uin the case of info.dwo/loclist.dwo - starting at 0 in the contribution) to parse the header, instead rely on the DWARF32/64 and address size in the CU that's already available. This would come up in split DWARF (non-split wouldn't try to read a loclist header in the absence of a loclist_base) when one unit had location lists and another does not (because the loclists.dwo section would be non-empty in that case - in the case where it's empty the parsing would silently skip). Simplify the testing a bit, rather than needing a whole dwp, etc - by creating a malformed loclists.dwo section (and use single file Split DWARF) that would trip up any attempt to parse it - but no attempt should be made.	2020-10-13 22:28:59 -07:00
Greg Clayton	a4b842e294	Show register names in DWARF unwind info. Register context information was already being passed into the DWARFDebugFrame code that dumps unwind information but it wasn't being used. This change adds the ability to dump registers names of a valid MC register context was passed in and if it knows about the register. Updated the tests to use the newly returned register names. Differential Revision: https://reviews.llvm.org/D88767	2020-10-05 15:34:33 -07:00
David Blaikie	6d0be74af5	llvm-dwarfdump: Don't try to parse rnglist tables when dumping CUs It's not possible to do this in complete generality - a CU using a sec_offset DW_AT_ranges has no way of knowing where its rnglists contribution starts, so should not attempt to parse any full rnglist table/header to do so. And even using FORM_rnglistx there's no need to parse the header - the offset can be computed using the CU's DWARF format (32 or 64) to compute offset entry sizes, and then the list parsed at that offset without ever trying to find a rnglist contribution header immediately prior to the rnglists_base.	2020-10-04 19:18:14 -07:00
David Blaikie	92c45e4ee2	llvm-dwarfdump: Add support for DW_RLE_startx_endx	2020-10-04 17:50:43 -07:00
David Blaikie	628a319475	llvm-dwarfdump: Print addresses in debug_line to the parsed address size	2020-10-04 16:05:49 -07:00
David Blaikie	ea83e0b17e	llvm-dwarfdump: Dump address forms in their encoded length rather than always in 64 bits Few places did this already - refactor them all into a common helper.	2020-10-04 15:48:57 -07:00
David Blaikie	8036cf7f54	llvm-dwarfdump: Skip tombstoned address ranges Make the dumper & API a bit more informative by using the new tombstone addresses to filter out or otherwise render more explicitly dead code ranges.	2020-10-04 13:43:29 -07:00
Reid Kleckner	5519e4da83	Re-land "[PDB] Merge types in parallel when using ghashing" Stored Error objects have to be checked, even if they are success values. This reverts commit `8d250ac3cd`. Relands commit 49b3459930655d879b2dc190ff8fe11c38a8be5f.. Original commit message: ----------------------------------------- This makes type merging much faster (-24% on chrome.dll) when multiple threads are available, but it slightly increases the time to link (+10%) when /threads:1 is passed. With only one more thread, the new type merging is faster (-11%). The output PDB should be identical to what it was before this change. To give an idea, here is the /time output placed side by side: BEFORE \| AFTER Input File Reading: 956 ms \| 968 ms Code Layout: 258 ms \| 190 ms Commit Output File: 6 ms \| 7 ms PDB Emission (Cumulative): 6691 ms \| 4253 ms Add Objects: 4341 ms \| 2927 ms Type Merging: 2814 ms \| 1269 ms -55%! Symbol Merging: 1509 ms \| 1645 ms Publics Stream Layout: 111 ms \| 112 ms TPI Stream Layout: 764 ms \| 26 ms trivial Commit to Disk: 1322 ms \| 1036 ms -300ms ----------------------------------------- -------- Total Link Time: 8416 ms 5882 ms -30% overall The main source of the additional overhead in the single-threaded case is the need to iterate all .debug$T sections up front to check which type records should go in the IPI stream. See fillIsItemIndexFromDebugT. With changes to the .debug$H section, we could pre-calculate this info and eliminate the need to do this walk up front. That should restore single-threaded performance back to what it was before this change. This change will cause LLD to be much more parallel than it used to, and for users who do multiple links in parallel, it could regress performance. However, when the user is only doing one link, it's a huge improvement. In the future, we can use NT worker threads to avoid oversaturating the machine with work, but for now, this is such an improvement for the single-link use case that I think we should land this as is. Algorithm ---------- Before this change, we essentially used a DenseMap<GloballyHashedType, TypeIndex> to check if a type has already been seen, and if it hasn't been seen, insert it now and use the next available type index for it in the destination type stream. DenseMap does not support concurrent insertion, and even if it did, the linker must be deterministic: it cannot produce different PDBs by using different numbers of threads. The output type stream must be in the same order regardless of the order of hash table insertions. In order to create a hash table that supports concurrent insertion, the table cells must be small enough that they can be updated atomically. The algorithm I used for updating the table using linear probing is described in this paper, "Concurrent Hash Tables: Fast and General(?)!": https://dl.acm.org/doi/10.1145/3309206 The GHashCell in this change is essentially a pair of 32-bit integer indices: <sourceIndex, typeIndex>. The sourceIndex is the index of the TpiSource object, and it represents an input type stream. The typeIndex is the index of the type in the stream. Together, we have something like a ragged 2D array of ghashes, which can be looked up as: tpiSources[tpiSrcIndex]->ghashes[typeIndex] By using these side tables, we can omit the key data from the hash table, and keep the table cell small. There is a cost to this: resolving hash table collisions requires many more loads than simply looking at the key in the same cache line as the insertion position. However, most supported platforms should have a 64-bit CAS operation to update the cell atomically. To make the result of concurrent insertion deterministic, the cell payloads must have a priority function. Defining one is pretty straightforward: compare the two 32-bit numbers as a combined 64-bit number. This means that types coming from inputs earlier on the command line have a higher priority and are more likely to appear earlier in the final PDB type stream than types from an input appearing later on the link line. After table insertion, the non-empty cells in the table can be copied out of the main table and sorted by priority to determine the ordering of the final type index stream. At this point, item and type records must be separated, either by sorting or by splitting into two arrays, and I chose sorting. This is why the GHashCell must contain the isItem bit. Once the final PDB TPI stream ordering is known, we need to compute a mapping from source type index to PDB type index. To avoid starting over from scratch and looking up every type again by its ghash, we save the insertion position of every hash table insertion during the first insertion phase. Because the table does not support rehashing, the insertion position is stable. Using the array of insertion positions indexed by source type index, we can replace the source type indices in the ghash table cells with the PDB type indices. Once the table cells have been updated to contain PDB type indices, the mapping for each type source can be computed in parallel. Simply iterate the list of cell positions and replace them with the PDB type index, since the insertion positions are no longer needed. Once we have a source to destination type index mapping for every type source, there are no more data dependencies. We know which type records are "unique" (not duplicates), and what their final type indices will be. We can do the remapping in parallel, and accumulate type sizes and type hashes in parallel by type source. Lastly, TPI stream layout must be done serially. Accumulate all the type records, sizes, and hashes, and add them to the PDB. Differential Revision: https://reviews.llvm.org/D87805	2020-09-30 15:44:38 -07:00
Reid Kleckner	8d250ac3cd	Revert "[PDB] Merge types in parallel when using ghashing" This reverts commit `49b3459930`.	2020-09-30 14:55:32 -07:00
Reid Kleckner	49b3459930	[PDB] Merge types in parallel when using ghashing This makes type merging much faster (-24% on chrome.dll) when multiple threads are available, but it slightly increases the time to link (+10%) when /threads:1 is passed. With only one more thread, the new type merging is faster (-11%). The output PDB should be identical to what it was before this change. To give an idea, here is the /time output placed side by side: BEFORE \| AFTER Input File Reading: 956 ms \| 968 ms Code Layout: 258 ms \| 190 ms Commit Output File: 6 ms \| 7 ms PDB Emission (Cumulative): 6691 ms \| 4253 ms Add Objects: 4341 ms \| 2927 ms Type Merging: 2814 ms \| 1269 ms -55%! Symbol Merging: 1509 ms \| 1645 ms Publics Stream Layout: 111 ms \| 112 ms TPI Stream Layout: 764 ms \| 26 ms trivial Commit to Disk: 1322 ms \| 1036 ms -300ms ----------------------------------------- -------- Total Link Time: 8416 ms 5882 ms -30% overall The main source of the additional overhead in the single-threaded case is the need to iterate all .debug$T sections up front to check which type records should go in the IPI stream. See fillIsItemIndexFromDebugT. With changes to the .debug$H section, we could pre-calculate this info and eliminate the need to do this walk up front. That should restore single-threaded performance back to what it was before this change. This change will cause LLD to be much more parallel than it used to, and for users who do multiple links in parallel, it could regress performance. However, when the user is only doing one link, it's a huge improvement. In the future, we can use NT worker threads to avoid oversaturating the machine with work, but for now, this is such an improvement for the single-link use case that I think we should land this as is. Algorithm ---------- Before this change, we essentially used a DenseMap<GloballyHashedType, TypeIndex> to check if a type has already been seen, and if it hasn't been seen, insert it now and use the next available type index for it in the destination type stream. DenseMap does not support concurrent insertion, and even if it did, the linker must be deterministic: it cannot produce different PDBs by using different numbers of threads. The output type stream must be in the same order regardless of the order of hash table insertions. In order to create a hash table that supports concurrent insertion, the table cells must be small enough that they can be updated atomically. The algorithm I used for updating the table using linear probing is described in this paper, "Concurrent Hash Tables: Fast and General(?)!": https://dl.acm.org/doi/10.1145/3309206 The GHashCell in this change is essentially a pair of 32-bit integer indices: <sourceIndex, typeIndex>. The sourceIndex is the index of the TpiSource object, and it represents an input type stream. The typeIndex is the index of the type in the stream. Together, we have something like a ragged 2D array of ghashes, which can be looked up as: tpiSources[tpiSrcIndex]->ghashes[typeIndex] By using these side tables, we can omit the key data from the hash table, and keep the table cell small. There is a cost to this: resolving hash table collisions requires many more loads than simply looking at the key in the same cache line as the insertion position. However, most supported platforms should have a 64-bit CAS operation to update the cell atomically. To make the result of concurrent insertion deterministic, the cell payloads must have a priority function. Defining one is pretty straightforward: compare the two 32-bit numbers as a combined 64-bit number. This means that types coming from inputs earlier on the command line have a higher priority and are more likely to appear earlier in the final PDB type stream than types from an input appearing later on the link line. After table insertion, the non-empty cells in the table can be copied out of the main table and sorted by priority to determine the ordering of the final type index stream. At this point, item and type records must be separated, either by sorting or by splitting into two arrays, and I chose sorting. This is why the GHashCell must contain the isItem bit. Once the final PDB TPI stream ordering is known, we need to compute a mapping from source type index to PDB type index. To avoid starting over from scratch and looking up every type again by its ghash, we save the insertion position of every hash table insertion during the first insertion phase. Because the table does not support rehashing, the insertion position is stable. Using the array of insertion positions indexed by source type index, we can replace the source type indices in the ghash table cells with the PDB type indices. Once the table cells have been updated to contain PDB type indices, the mapping for each type source can be computed in parallel. Simply iterate the list of cell positions and replace them with the PDB type index, since the insertion positions are no longer needed. Once we have a source to destination type index mapping for every type source, there are no more data dependencies. We know which type records are "unique" (not duplicates), and what their final type indices will be. We can do the remapping in parallel, and accumulate type sizes and type hashes in parallel by type source. Lastly, TPI stream layout must be done serially. Accumulate all the type records, sizes, and hashes, and add them to the PDB. Differential Revision: https://reviews.llvm.org/D87805	2020-09-30 14:22:48 -07:00
David Blaikie	0328feb086	DebugInfo: Filter DWARFv5 TUs out of the debug_info unit list when CUs requested Since DWARFv5 places TUs in debug_info, some of DWARFContext's APIs have become a bit erroneous, including TUs in the CU list by accident. Correct that by providing compile_units (& dwo_compile_units) that filter out the type units from the debug_info units. Differential Revision: https://reviews.llvm.org/D87935	2020-09-23 22:15:53 -07:00
Jonas Devlieghere	e1ef7183c6	[dwarfdump] Warn for tags with DW_CHILDREN_yes but no children. Flag DIEs that have DW_CHILDREN_yes set in their abbreviation but don't actually have any children. rdar://59809554 Differential revision: https://reviews.llvm.org/D88048	2020-09-23 22:12:04 -07:00
David Blaikie	ad68a8b952	DebugInfo: Cleanup RLE dumping, using a length-constrained DataExtractor rather than carrying the end offset separately	2020-09-18 19:32:38 -07:00
David Blaikie	51a505340d	DebugInfo: Simplify line table parsing to take all the units together, rather than CUs and TUs separately	2020-09-18 11:18:23 -07:00
David Blaikie	e0802fe016	DebugInfo: Tidy up initializing multi-section contributions in DWARFContext	2020-09-18 10:54:43 -07:00
Simon Pilgrim	ed53ff4cde	SymbolizableObjectFile.h - remove unnecessary includes. NFCI. Use forward declarations where possible, move includes down to SymbolizableObjectFile.cpp and avoid duplicate includes.	2020-09-17 13:18:53 +01:00
Reid Kleckner	e47d2927de	Include (Type\|Symbol)Record.h less Most clients only need CVType and CVSymbol, not structs for every type and symbol. Move CVSymbol and CVType to CVRecord.h to accomplish this. Update some of the common headers that need CVSymbol and CVType to use the new location.	2020-09-16 09:59:03 -07:00
Petr Hosek	9c73e55510	Revert "[DebugInfo] Remove dots from getFilenameByIndex return value" This is failing on Windows bots due to path separator normalization. This reverts commit `042c235068`.	2020-09-15 10:06:47 -07:00
Petr Hosek	042c235068	[DebugInfo] Remove dots from getFilenameByIndex return value When concatenating directory with filename in getFilenameByIndex, we might end up with a path that contains extra dots. For example, if the input is /path and ./example, we would return /path/./example. Run sys::path::remove_dots on the output to eliminate unnecessary dots. Differential Revision: https://reviews.llvm.org/D87657	2020-09-14 20:19:06 -07:00
David Blaikie	69da27c749	llvm-symbolizer: Add optional "start file" to match "start line" Since a function might have portions of its code coming from multiple different files, "start line" is ambiguous (it can't just be resolved relative to the file/line specified). Add start file to disambiguate it.	2020-09-08 15:40:58 -07:00
Xing GUO	67ce11405b	[llvm-dwarfdump] Warn user when it encounters no null terminated strings. When llvm-dwarfdump encounters no null terminated strings, we should warn user about it rather than ignore it and print nothing. Before this patch, when llvm-dwarfdump dumps a .debug_str section whose content is "abc", it prints: ``` .debug_str contents: ``` After this patch: ``` .debug_str contents: warning: no null terminated string at offset 0x0 ``` Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D86998	2020-09-03 08:49:57 +08:00
Xing GUO	369f9169a5	[DebugInfo] Simplify string table dumpers. This patch adds a helper function DumpStrSection to simplify codes. Besides, nonprintable chars in debug_str and debug_str.dwo sections are printed as escaped chars. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D86918	2020-09-02 08:41:10 +08:00
Jordan Rupprecht	202766947e	[NFC] Fix unused var in release builds. This was always unused, but the change in D86354 upgraded this to a compiler warning.	2020-09-01 16:38:24 -07:00
David Blaikie	f7a49d2aa6	[WIP][DebugInfo] Lazily parse debug_loclist offsets Parsing DWARFv5 debug_loclist offsets when a CU is parsed is weighing down memory usage of symbolizers that don't need to parse this data at all. There's not much benefit to caching these anyway - since they are O(1) lookup and reading once you know where the offset list starts (and can do bounds checking with the offset list size too). In general, I think it might be time to start paying down some of the technical debt of loc/loclist/range/rnglist parsing to try to unify it a bit more. eg: * Currently DWARFUnit has: RangeSection, RangeSectionBase, LocSection, LocSectionBase, LocTable, RngListTable, LoclistTableHeader (be nice if these were all wrapped up in two variables - one for loclists, one for rnglists) * rnglists and loclists are handled differently (see: LoclistTableHeader, but no RnglistTableHeader) * maybe all these types could be less stateful - lazily parse what they need to, even reparsing rather than caching because it doesn't seem too expensive, for instance. (though admittedly so long as it's constantcost/overead per compilatiton that's probably adequate) * Maybe implementing and using a DWARFDataExtractor that can be sub-ranged (so we could slice it up to just the single contribution) - though maybe that's not so useful because loc/ranges need to refer to it by absolute, not contribution-relative mechanisms Differential Revision: https://reviews.llvm.org/D86110	2020-08-18 10:49:39 -07:00
Igor Kudrin	95fad44e34	[DebugInfo] Avoid an infinite loop with a truncated pre-v5 .debug_str_offsets.dwo. dumpStringOffsetsSection() expects the size of a contribution to be correctly aligned. The patch adds the corresponding verifications for pre-v5 cases. Differential Revision: https://reviews.llvm.org/D85739	2020-08-14 13:11:37 +07:00
Igor Kudrin	9ceb192e14	[llvm-dwarfdump] Avoid crashing if an abbreviation offset is invalid. Note that DWARFUnit::getAbbreviations() returns nullptr if the abbreviations could not be read, but callers used the returned pointer without checking. Differential Revision: https://reviews.llvm.org/D85738	2020-08-12 16:01:53 +07:00
David Stenberg	91bd9db2cd	[DebugInfo] Allow GNU macro extension to be read Allow the GNU .debug_macro extension to be parsed and printed by llvm-dwarfdump. In an upcoming patch support will be added for emitting that format also. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D82974	2020-08-11 13:30:52 +02:00
James Henderson	ca05601cd2	[DebugInfo] Don't error for zero-length arange entries Although the DWARF specification states that .debug_aranges entries can't have length zero, these can occur in the wild. There's no particular reason to enforce this part of the spec, since functionally they have no impact. The patch removes the error and introduces a new warning for premature terminator entries which does not stop parsing. This is a relanding of `cb3a598c87`, adding the missing obj2yaml part that was needed. Fixes https://bugs.llvm.org/show_bug.cgi?id=46805. See also https://reviews.llvm.org/D71932 which originally introduced the error. Reviewed by: ikudrin, dblaikie, Higuoxing Differential Revision: https://reviews.llvm.org/D85313	2020-08-10 14:57:52 +01:00
Nico Weber	bc5d68dd8a	Revert "[DebugInfo] Don't error for zero-length arange entries" This reverts commit `cb3a598c87`. Breaks build of check-llvm dep obj2yaml everywhere.	2020-08-10 08:20:35 -04:00

1 2 3 4 5 ...

2281 Commits