llvm-project

Commit Graph

Author	SHA1	Message	Date
Esme-Yi	b66328701a	[PowerPC][llvm-objdump] enable --symbolize-operands for PowerPC ELF/XCOFF. Summary: When disassembling, symbolize a branch target operand to print a label instead of a real address. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D114492	2021-12-21 04:17:57 +00:00
Shilei Tian	a268a63b1a	Revert "[LTO] Add a function `LTOCodeGenerator::getMergedModule`" This reverts commit `75a5eaf7c6`.	2021-12-20 20:34:04 -05:00
Shilei Tian	75a5eaf7c6	[LTO] Add a function `LTOCodeGenerator::getMergedModule` One of the uses of `LTOCodeGenerator` is to take it as a middle+back end. Sometimes it is very helpful to access, especially get information from the optimized module. If the information can be changed in optimization, it cannot be get before the module is added to `LTOCodeGenerator`. This patch adds a function `LTOCodeGenerator::getMergedModule` to access the `MergedModule`. Reviewed By: steven_wu Differential Revision: https://reviews.llvm.org/D114201	2021-12-20 20:01:49 -05:00
Simon Pilgrim	67cce1ceee	[X86] Adjust some IceLake fp shuffle schedule classes (PR48110) The IceLake scheduler model is still mainly a copy of the SkylakeServer model. This patch adjusts the fp shuffle classes to account for most instructions now working on Port 1 as well as Port 5. This is based off Agner + uops.info as well as the PR48110 report. Differential Revision: https://reviews.llvm.org/D115752	2021-12-19 13:00:11 +00:00
Cyndy Ishida	4a878c14c4	[llvm-tapi-diff] rename tests for clarity, NFC	2021-12-18 16:42:45 -08:00
Cyndy Ishida	982604cc08	[llvm-tapi-diff] Replicate diff utility error handling For scripting purposes, use different error code for cases that aren't a result of different tbd files, e.g. bad input like a path to a broken symlink. This behaves more similiarly to how the unix `diff` command behaves. This also includes minor tweaks on error messages. Reviewed By: ributzka, JDevlieghere Differential Revision: https://reviews.llvm.org/D115905	2021-12-18 09:54:37 -08:00
David Blaikie	4f60a42878	DWARFVerifier: Skip resolution failures for locations in dwo files When reading location lists in dwo files the addresses cannot be resolved, but that's not a problem. Long term this probably should be fixed with a different API that exposes location expressions without the need to resolve the address ranges, since that's all the verifier (in its current state) requires. (though the verifier should probably also eventually verify the address ranges in location lists are a subset of the enclosing scope's address range)	2021-12-16 22:24:32 -08:00
David Blaikie	e5c3b97116	DWARFVerifier: Test failures to parse DW_AT_locations	2021-12-16 22:24:32 -08:00
David Blaikie	2444e305c5	DWARFVerifier: Cleanup a test, remove redundant abbreviation	2021-12-16 20:34:52 -08:00
David Blaikie	8f6d52c8db	DWARFVerifier: Don't error on missing ranges in Split DWARF When verifying dwo files address ranges won't be able to be resolved due to missing debug_addr (or missing debug_ranges in the case of DWARFv4 Split DWARF).	2021-12-16 20:34:44 -08:00
David Blaikie	ae6961d407	DWARFVerifier: Don't try to resolve decl_file in split CUs Since they refer to the debug_line in the skeleton unit, they can't be resolved from the dwo CU. But they can be resolved for split TUs, since those refer to .debug_line.dwo, which is available in the dwo file.	2021-12-16 18:12:52 -08:00
Ellis Hoag	ea5ba47407	[dSYM][NFC] Add error test case for llvm-dwarfdump Add a test to `llvm-dwarfdump` to simply test that the error messages make sense when passing bad `.dSYM`s. Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D115889	2021-12-16 15:23:18 -08:00
David Blaikie	eb12e61384	DebugInfoVerifier: Verify dwo units Initial pass - follow-up patches to address various limitations (debug ranges, location lists... )	2021-12-16 14:47:22 -08:00
Noah Shutty	35ccde20ea	[Debuginfod] Remove f-string from llvm-debuginfod-find lit test. This removes a python3 f-String from the llvm-debuginfod-find lit test script to maximize compatibility with different python versions, as it seemed to cause some issues. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D115814	2021-12-16 19:22:13 +00:00
Wenlei He	f6f0409f6f	[llvm-profgen] Turn on preinliner by default preinliner has been tuned on large server workloads and it's not ready to be turned on by default. this change also updates the thresholds based on tuning. Differential Revision: https://reviews.llvm.org/D115770	2021-12-14 17:46:57 -08:00
wlei	0f53df864e	[CSSPGO][llvm-profgen] Fix external address issues of perf reader (return to external addr part) Before we have an issue with artificial LBR whose source is a return, recalling that "an internal code(A) can return to external address, then from the external address call a new internal code(B), making an artificial branch that looks like a return from A to B can confuse the unwinder". We just ignore the LBRs after this artificial LBR which can miss some samples. This change aims at fixing this by correctly unwinding them instead of ignoring them. List some typical scenarios covered by this change. 1) multiple sequential call back happen in external address, e.g. ``` [ext, call, foo] [foo, return, ext] [ext, call, bar] ``` Unwinder should avoid having foo return from bar. Wrong call stack is like [foo, bar] 2) the call stack before and after external call should be correctly unwinded. ``` {call stack1} {call stack2} [foo, call, ext] [ext, call, bar] [bar, return, ext] [ext, return, foo ] ``` call stack 1 should be the same to call stack2. Both shouldn't be truncated 3) call stack should be truncated after call into external code since we can't do inlining with external code. ``` [foo, call, ext] [ext, call, bar] [bar, call, baz] [baz, return, bar ] [bar, return, ext] ``` the call stack of code in baz should not include foo. ### Implementation: We leverage artificial frame to fix #2 and #3: when we got a return artificial LBR, push an extra artificial frame to the stack. when we pop frame, check if the parent is an artificial frame to pop(fix #2). Therefore, call/ return artificial LBR is just the same as regular LBR which can keep the call stack. While recording context on the trie, artificial frame is used as a tag indicating that we should truncate the call stack(fix #3). To differentiate #1 and #2, we leverage `getCallAddrFromFrameAddr`. Normally the target of the return should be the next inst of a call inst and `getCallAddrFromFrameAddr` will return the address of call inst. Otherwise, getCallAddrFromFrameAddr will return to 0 which is the case of #1. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D115550	2021-12-14 16:40:54 -08:00
wlei	3dcb60db9a	[CSSPGO][llvm-profgen] Fix external address issues of perf reader (leading external LBR part) We can have the sampling just hit into the external addresses, in that case, both the top stack frame and the latest LBR target are external addresses. For example: ``` ffffffff 0x4006c8/0xffffffff/P/-/-/0 0x40069b/0x400670/M/-/-/0 ffffffff 40067e 0xffffffff/0xffffffff/P/-/-/0 0x4006c8/0xffffffff/P/-/-/0 0x40069b/0x400670/M/-/-/0 ``` Before we will ignore the entire samples. However, we found there exists some internal LBRs in the remaining part of sample, the range between them is still a valid range, we will lose some valid LBRs. Those LBRs will be unwinded based on a empty(context-less) call stack. This change tries to fix it, instead of ignoring the entire sample, we only ignore the leading external addresses. Note that the first outgoing LBR is useful since there is a valid range between it's source and next LBR's target. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D115538	2021-12-14 16:40:53 -08:00
wlei	3220571793	[llvm-profgen] Skip disassembling for PLT section Skip disassembling .plt section, then .plt section code will be treated as external code. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D115699	2021-12-14 16:40:53 -08:00
David Blaikie	51776ecaff	DebugInfo: Fix test to match comment This produced a few verifier warnings that came up while I was investigating something else here. Change the assembly to match the comment so it's warning free. Doesn't seem necessary to change the CHECKs for the test since it's just a bug in the test, not in the code under test.	2021-12-14 16:03:34 -08:00
David Blaikie	92f2d02b4a	DebugInfo: Sink string form validation down from verifier to form parsing Avoid duplicating the string decoding - improve the error messages down in form parsing (& produce an Expected<const char> instead of Optional<const char> to communicate the extra error details)	2021-12-14 15:41:53 -08:00
Hongtao Yu	5740bb801a	[CSSPGO] Use nested context-sensitive profile. CSSPGO currently employs a flat profile format for context-sensitive profiles. Such a flat profile allows for precisely manipulating contexts that is either inlined or not inlined. This is a benefit over the nested profile format used by non-CS AutoFDO. A downside of this is the longer build time due to parsing the indexing the full CS contexts. For a CS flat profile, though only the context profiles relevant to a module are loaded when that module is compiled, the cost to figure out what profiles are relevant is noticeably high when there're many contexts, since the sample reader will need to scan all context strings anyway. On the contrary, a nested function profile has its related inline subcontexts isolated from other unrelated contexts. Therefore when compiling a set of functions, unrelated contexts will never need to be scanned. In this change we are exploring using nested profile format for CSSPGO. This is expected to work based on an assumption that with a preinliner-computed profile all contexts are precomputed and expected to be inlined by the compiler. Contexts not expected to be inlined will be cut off and returned to corresponding base profiles (for top-level outlined functions). This naturally forms a nested profile where all nested contexts are expected to be inlined. The compiler will less likely optimize on derived contexts that are not precomputed. A CS-nested profile will look exactly the same with regular nested profile except that each nested profile can come with an attributes. With pseudo probes, a nested profile shown as below can also have a CFG checksum. ``` main:1968679:12 2: 24 3: 28 _Z5funcAi:18 3.1: 28 _Z5funcBi:30 3: _Z5funcAi:1467398 0: 10 1: 10 _Z8funcLeafi:11 3: 24 1: _Z8funcLeafi:1467299 0: 6 1: 6 3: 287884 4: 287864 _Z3fibi:315608 15: 23 !CFGChecksum: 138828622701 !Attributes: 2 !CFGChecksum: 281479271677951 !Attributes: 2 ``` Specific work included in this change: - A recursive profile converter to convert CS flat profile to nested profile. - Extend function checksum and attribute metadata to be stored in nested way for text profile and extbinary profile. - Unifiy sample loader inliner path for CS and preinlined nested profile. - Changes in the sample loader to support probe-based nested profile. I've seen promising results regarding build time. A nested profile can result in a 20% shorter build time than a CS flat profile while keep an on-par performance. This is with -duplicate-contexts-into-base=1. Test Plan: Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D115205	2021-12-14 14:40:25 -08:00
Simon Pilgrim	74d1fc742a	[X86] Adjust some IceLake integer shuffle schedule classes (PR48110) The IceLake scheduler model is still mainly a copy of the SkylakeServer model. This patch adjusts the integer shuffle classes to account for most instructions now working on Port 1 as well as Port 5. This is based off Agner + uops.info as well as the PR48110 report. Differential Revision: https://reviews.llvm.org/D115547	2021-12-14 18:56:13 +00:00
Noah Shutty	f0ca8d2461	[llvm] [Debuginfo] Add llvm-debuginfod-find tool and end-to-end-tests. This implements the `llvm-debuginfod-find` tool, which wraps the Debuginfod library (D112758) to query debuginfod servers for artifacts according to the [[ https://www.mankier.com/8/debuginfod#Webapi \| specification ]]. Reviewed By: labath Differential Revision: https://reviews.llvm.org/D112759	2021-12-14 18:00:18 +00:00
Ellis Hoag	0005efd455	[DebugInfo][dsymutil] Keep locations for function-local globals In debug info, we expect variables to have location info if they are used and we don't want location info for functions that are not used. However, if an unused function is inlined, we could have the scenario where a function is not in the final binary but its static variable is in the final binary. Ensure that variables in the final binary have location debug info even if their scope was inlined. Also add `--implicit-check-not` to a test for clarity. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D115565	2021-12-14 09:28:22 -08:00
Simon Atanasyan	d5d8b1c972	[llvm-objcopy] Fix handling of MIPS64 little endian files MIPS64 little endian target has a "special" encoding of `r_info` relocation record field. Instead of one 64-bit little endian number, it is a little endian 32-bit number followed by a 32-bit big endian number. For correct reading and writing such fields we must provide information about target machine into the corresponding routine. This patch does this for the `llvm-objcopy` tool and fix handling of MIPS64 little endian files. The bug was reported in the issue #52647. Differential Revision: https://reviews.llvm.org/D115635	2021-12-14 17:21:27 +03:00
David Blaikie	892742907f	DWARFVerifier: Verbosely dump DIEs in verifier reports Seems helpful when you're dealing with invalid/problematic DWARF. Some diagnostic messages are probably redundant with the verbose dumping and could be simplified with this.	2021-12-13 19:00:38 -08:00
Noah Shutty	34491ca729	[Symbolizer][Debuginfo] Add debuginfod client to llvm-symbolizer. Adds a fallback to use the debuginfod client library (386655) in `findDebugBinary`. Fixed a cast of Erorr::success() to Expected<> in debuginfod library. Added Debuginfod to Symbolize deps in gn. Updates compiler-rt/lib/sanitizer_common/symbolizer/scripts/build_symbolizer.sh to include Debuginfod library to fix sanitizer-x86_64-linux breakage. Reviewed By: jhenderson, vitalybuka Differential Revision: https://reviews.llvm.org/D113717	2021-12-13 23:00:32 +00:00
Shilei Tian	aa0981ba86	Revert "[LTO] Add a function `LTOCodeGenerator::getMergedModule`" This reverts commit `61051e42c2`.	2021-12-13 16:31:22 -05:00
Shilei Tian	61051e42c2	[LTO] Add a function `LTOCodeGenerator::getMergedModule` One of the uses of `LTOCodeGenerator` is to take it as a middle+back end. Sometimes it is very helpful to access, especially get information from the optimized module. If the information can be changed in optimization, it cannot be get before the module is added to `LTOCodeGenerator`. This patch adds a function `LTOCodeGenerator::getMergedModule` to access the `MergedModule`. Reviewed By: steven_wu Differential Revision: https://reviews.llvm.org/D114201	2021-12-13 16:19:08 -05:00
Simon Pilgrim	d9655eec05	[MCA][X86] Add AVX512 subvector broadcast instruction test coverage	2021-12-13 18:48:25 +00:00
Simon Pilgrim	fe1b5b56c6	[MCA][X86] Add AVX512 movddup/movshdup/movsldup instruction test coverage As noted on D115547	2021-12-13 18:04:56 +00:00
Simon Pilgrim	b04c646711	[MCA][X86] Add AVX512 broadcast instruction test coverage As noted on D115547	2021-12-13 17:45:16 +00:00
Simon Pilgrim	9ad5969b5e	[X86][Atom] Fix CVT uops + port usage Fix overrides to use both ports. Update the uops counts + port usage based off the most recent llvm-exegesis captures (PR36895) and what Intel AoM / Agner reports as well.	2021-12-12 22:57:53 +00:00
Simon Pilgrim	4c1d248397	[MCA][X86] Fix duplicated cvtsi2ss/cvtsi2sd i32 + i64 folded tests Specify the integer width to ensure we're testing the correct instruction	2021-12-12 22:48:45 +00:00
Simon Pilgrim	c02f9791c6	[X86][AVX512] Remove xmm->xmm vpmovsx/vpmovzx rm overrides The XMM evex cases have the same behaviour as the SSE41 versions, which already uses WriteShuffleX.Folded	2021-12-12 16:08:10 +00:00
Simon Pilgrim	fc02ceb12a	[X86][AVX512] Use WriteShuffleX for xmm->xmm extensions The XMM evex cases have the same behaviour as the SSE41 versions, which already uses WriteShuffleX	2021-12-12 15:22:32 +00:00
Jayson Yan	1f35d7b5de	[llvm-readobj] Add JSONScopedPrinter to llvm-readelf Adds JSONScopedPrinter to llvm-readelf. It includes an empty JSONELFDumper class which will be used to override any LLVMELFDumper methods which utilize startLine() which JSONScopedPrinter cannot provide. This introduces a change where calls to llvm-readelf with non-ELF object files that specify --elf-output-style=GNU will now print file summary information where it previously didn't. Fixes previous Windows test failure which occured due to JSON escaping of '\' by not relying on LIT substitution. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D114225	2021-12-10 23:34:27 +00:00
Jayson Yan	2526335595	Revert "[llvm-readobj] Add JSONScopedPrinter to llvm-readelf" This reverts commit `824eef231a`. file-summary-json.test and pretty-print.test fails on Windows.	2021-12-10 21:01:27 +00:00
Jayson Yan	824eef231a	[llvm-readobj] Add JSONScopedPrinter to llvm-readelf Adds JSONScopedPrinter to llvm-readelf. It includes an empty JSONELFDumper class which will be used to override any LLVMELFDumper methods which utilize startLine() which JSONScopedPrinter cannot provide. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D114225	2021-12-10 18:57:34 +00:00
Simon Pilgrim	7f09aee0f6	[MCA][X86] Add missing VPMOVSX/VPMOVZX from AVX512 tests	2021-12-10 18:12:57 +00:00
Simon Pilgrim	6fae235885	[MCA][X86] Add missing ALIGND/ALIGNQ from AVX512F/AVX512VL tests	2021-12-10 15:59:52 +00:00
Simon Pilgrim	b025b062d6	[MCA][X86] Add missing PALIGNR from AVX512BW/AVX512BWVL tests	2021-12-10 15:59:52 +00:00
Simon Pilgrim	ebcc92ccda	[MCA][X86] Add missing PSLLDQ/PSRLDQ from AVX512BW/AVX512BWVL tests	2021-12-10 15:59:51 +00:00
Simon Pilgrim	550bf36732	[MCA][X86] Add missing PACKSS/PACKUS from AVX512BW/AVX512BWVL tests	2021-12-10 15:59:51 +00:00
Nico Weber	30f221bba0	Revert "[Symbolizer][Debuginfo] Add debuginfod client to llvm-symbolizer." This reverts commit `5bba0fe12b`. Makes lld depend on libcurl, see comments on https://reviews.llvm.org/D113717	2021-12-10 10:33:05 -05:00
Simon Pilgrim	80ce01c6fd	[MCA][X86] Add missing PSHUFLW from AVX512BWVL tests	2021-12-10 14:02:37 +00:00
Noah Shutty	0f30012def	Revert "[llvm] [Debuginfo] Add llvm-debuginfod-find tool and end-to-end-tests." This reverts commit `350fe22f2a` because it caused buildbot breakages (https://lab.llvm.org/buildbot/#/builders/91/builds/387).	2021-12-10 11:08:15 +00:00
Noah Shutty	350fe22f2a	[llvm] [Debuginfo] Add llvm-debuginfod-find tool and end-to-end-tests. This implements the `llvm-debuginfod-find` tool, which wraps the Debuginfod library (D112758) to query debuginfod servers for artifacts according to the [[ https://www.mankier.com/8/debuginfod#Webapi \| specification ]]. Reviewed By: labath Differential Revision: https://reviews.llvm.org/D112759	2021-12-10 10:33:52 +00:00
Noah Shutty	5bba0fe12b	[Symbolizer][Debuginfo] Add debuginfod client to llvm-symbolizer. Adds a fallback to use the debuginfod client library (386655) in `findDebugBinary`. Fixed a cast of Erorr::success() to Expected<> in debuginfod library. Added Debuginfod to Symbolize deps in gn. Updates compiler-rt/lib/sanitizer_common/symbolizer/scripts/build_symbolizer.sh to include Debuginfod library to fix sanitizer-x86_64-linux breakage. Reviewed By: jhenderson, vitalybuka Differential Revision: https://reviews.llvm.org/D113717	2021-12-10 01:32:36 +00:00
Noah Shutty	afa3c14e2f	Revert "[Symbolizer][Debuginfo] Add debuginfod client to llvm-symbolizer." This reverts commit `e2ad4f1756` because it does not correctly fix the sanitizer buildbot breakage.	2021-12-10 00:59:13 +00:00

1 2 3 4 5 ...

5569 Commits