llvm-project

Commit Graph

Author	SHA1	Message	Date
Rafael Auler	c09cd64e5c	[BOLT] Fix AND evaluation bug in shrink wrapping Fix a bug where shrink-wrapping would use wrong stack offsets because the stack was being aligned with an AND instruction, hence, making its true offsets only available during runtime (we can't statically determine where are the stack elements and we must give up on this case). Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D126110	2022-05-26 14:59:28 -07:00
Amir Ayupov	f7581a3969	[BOLT][NFC] Use ListSeparator in BinaryFunction print methods Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D126243	2022-05-24 18:29:24 -07:00
Amir Ayupov	69f87b6c29	[BOLT][NFC] Customize endline character for printInstruction(s) This would be used in `BF::dumpGraph` to dump left-justified text. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D126232	2022-05-24 18:26:12 -07:00
Amir Ayupov	5d8247d4c7	[BOLT][NFC] Use for_each to simplify printLoopInfo Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D126242	2022-05-24 18:05:43 -07:00
Amir Ayupov	c907d6e0e9	[BOLT][NFC] Suppress unused variable warnings Addresses the warnings emitted by Apple Clang 13.1.6 (Xcode 13.3.1). Tip @tschuett issue #55404. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D125733	2022-05-17 14:30:23 -07:00
Amir Ayupov	a7b69dbdd1	[BOLT][NFC] Move BinaryDominatorTree out of BinaryLoop header Split up the BinaryLoop header and move BinaryDominatorTree into its own header, preparing it for a standalone use. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D125664	2022-05-17 14:20:11 -07:00
Amir Ayupov	bdba3d091c	[BOLT][CMAKE] Fix DYLIB build Move BOLT libraries out of `LLVM_LINK_COMPONENTS` to `target_link_libraries`. Addresses issue #55432. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D125568	2022-05-13 13:27:21 -07:00
Amir Ayupov	253b8f0abd	[BOLT][NFC] Use refs for loop variables to avoid copies Addresses warnings when built with Apple Clang. Reviewed By: yota9 Differential Revision: https://reviews.llvm.org/D125483	2022-05-13 20:18:29 +01:00
Amir Ayupov	139744ac53	[BOLT][NFC] Suppress unused variable warnings Address warnings in Release build without assertions. Tip @tschuett for reporting the issue #55404. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D125475	2022-05-13 20:10:19 +01:00
Amir Ayupov	d63c5a38fe	[BOLT][NFC] Use BitVector::set_bits Refactor and use `set_bits` BitVector interface. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D125374	2022-05-11 16:23:44 -07:00
Amir Ayupov	8cb7a873ab	[BOLT][NFC] Add MCPlus::primeOperands iterator_range Reviewed By: yota9 Differential Revision: https://reviews.llvm.org/D125397	2022-05-11 09:34:51 -07:00
Amir Ayupov	c2d40f1dfb	[BOLT] Add icp-inline option Add an option to only peel ICP targets that can be subsequently inlined. Yet there's no guarantee that they will be inlined. The mode is independent from the heuristic used to choose ICP targets: by exec count, mispredictions, or memory profile. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D124900	2022-05-11 03:21:24 -07:00
Alexander Yermolovich	3abb68a626	[BOLT][DWARF] Fix assert for split dwarf. Fixing a small bug where it would assert if CU does not modify .debug_addr section. Differential Revision: https://reviews.llvm.org/D125181	2022-05-08 19:18:17 -07:00
Alexander Yermolovich	ba1ac98c62	[BOLT][DWARF] Add version 5 split dwarf support Added support for DWARF5 Split Dwarf. Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D122988	2022-05-05 14:59:05 -07:00
Rahman Lavaee	733dc3e50b	[BOLT] Report per-section hotness in bolt-heatmap. This patch adds a new feature to bolt heatmap to print the hotness of each section in terms of the percentage of samples within that section. Sample output generated for the clang binary: Section Name, Begin Address, End Address, Percentage Hotness .text, 0x1a7b9b0, 0x20a2cc0, 1.4709 .init, 0x20a2cc0, 0x20a2ce1, 0.0001 .fini, 0x20a2ce4, 0x20a2cf2, 0.0000 .text.unlikely, 0x20a2d00, 0x431990c, 0.3061 .text.hot, 0x4319910, 0x4bc6927, 97.2197 .text.startup, 0x4bc6930, 0x4c10c89, 0.0058 .plt, 0x4c10c90, 0x4c12010, 0.9974 Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D124412	2022-05-05 11:37:46 -07:00
Amir Ayupov	f8d2d8b587	[BOLT][NFC] Move getInliningInfo out of Inliner class `getInliningInfo` is useful in other passes that need to check inlining eligibility for some function. Move the declaration and InliningInfo definition out of Inliner class. Prepare for subsequent use in ICP. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D124899	2022-05-04 14:08:06 -07:00
Amir Ayupov	2ad1c7540e	[BOLT][NFC] Minor cleanup in ICP getCallTargets and canPromoteCallsite Minor refactoring. NFC. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D124898	2022-05-04 14:06:53 -07:00
Amir Ayupov	68c7299f16	[BOLT][NFC] Fix MCPlusBuilder::getAliases caching behavior Caching behavior of `getAliases` causes a failure in unit tests where two MCPlusBuilder objects are created corresponding to AArch64 and X86: the alias cache is created for AArch64 but then used for X86. https://lab.llvm.org/staging/#/builders/211/builds/126 The issue only affects unit tests as we only construct one MCPlusBuilder for ELF binary. Resolve the issue by moving alias bitvectors to MCPlusBuilder object. Reviewed By: yota9 Differential Revision: https://reviews.llvm.org/D124942	2022-05-04 12:53:26 -07:00
Amir Ayupov	60957a5a08	[BOLT] Fix ICPJumpTablesTopN option use Fix non-sensical `opts::ICPJumpTablesTopN != 0 ? opts::ICPTopN : opts::ICPTopN`. Refactor/simplify another similar assignment. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D124880	2022-05-03 19:34:10 -07:00
Amir Ayupov	c3d5372093	[BOLT][NFC] Make ICP options naming uniform Rename `opts::IndirectCallPromotion` to `opts::ICP`, making option naming uniform and easier to follow. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D124879	2022-05-03 19:32:45 -07:00
Amir Ayupov	d0b1c98c96	[BOLT][NFC] ICP: simplify findTargetsIndex Unnest lambda and use `llvm::is_contained`. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D124877	2022-05-03 19:31:20 -07:00
Amir Ayupov	ec02227bf7	[BOLT][NFC] Refactor ICP::findCallTargetSymbols Reduce nesting making it easier to read. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D124876	2022-05-03 19:29:22 -07:00
Paul Kirth	625e0e611b	[BOLT] [NFC] Remove unused variable This patch fixes a warning from -Wunused-but-set-variable MismatchedBranches are counted, but are never reported. Since evaluateProfileData() should already identify and report these cases, we can safely remove the unused variable. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D124588	2022-05-03 15:15:56 +00:00
Amir Ayupov	64421e191b	[BOLT][NFC] Reduce Target/{AArch64,X86} dependencies We don't actually depend on entire X86/AArch64 components that pull in CodeGen, SelectionDAG etc., just the Desc part with opcode and other definitions. Note that it doesn't decouple BOLT from these components - we still pull in X86 and AArch64 from top-level llvm-bolt dependencies as we use assembler and disassembler. It's difficult to reduce these as this requires non-trivial changes to X86/AArch64 components themselves (e.g. moving out AsmPrinter). Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D124206	2022-04-29 20:37:53 -07:00
Paul Kirth	a0b8ab1ba3	[BOLT][NFC] Fix warning for unqualified call to std::move Fixes warning from RetpolineInsertion.cpp:171:44: warning: unqualified call to std::move [-Wunqualified-std-cast-call] Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D124482	2022-04-26 23:18:20 +00:00
Rahman Lavaee	e59e580116	[BOLT] Refactor DataAggregator::printLBRHeatMap. This also fixes some logs that were impacted by D123067. Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D124281	2022-04-25 11:39:44 -07:00
Alexander Yermolovich	014cd37f51	[BOLT][DWARF] Implement monolithic DWARF5 Added implementation to support DWARF5 in monolithic mode. Next step DWARF5 split dwarf support. Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D121876	2022-04-21 16:02:23 -07:00
Alexey Moksyakov	48e894a536	[BOLT] Add R_AARCH64_PREL16/32/64 relocations support Reviewed By: yota9, rafauler Differential Revision: https://reviews.llvm.org/D122294	2022-04-21 13:52:47 +03:00
Vladislav Khmelevsky	63686af1e1	[BOLT] Fix build with GCC 7.3.0 The gcc 7.3.0 version raises "could not covert" error without std::move used explicitly. Differential Revision: https://reviews.llvm.org/D124009	2022-04-21 13:47:58 +03:00
Maksim Panchenko	76981fbcf6	[BOLT] Add fuzzy function name matching for LLVM LTO LLVM with LTO can generate function names in the form func.llvm.<number>, where <number> could vary based on the compilation environment. As a result, if a profiled binary originated from a different build than a corresponding binary used for BOLT optimization, then profiles for such LTO functions will be ignored. To fix the problem, use "fuzzy" matching with "func.llvm.*" form. Reviewed By: yota9, Amir Differential Revision: https://reviews.llvm.org/D124117	2022-04-20 17:00:21 -07:00
Alexander Yermolovich	7d6716786f	[BOLT][DWARF] Handle Error returned by visitLocationList Looks like implementation in llvm changed, and now we need to process error being returned. Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D124133	2022-04-20 16:40:46 -07:00
Amir Ayupov	4f277f28ab	[BOLT] Check if LLVM_REVISION is defined Handle the case where LLVM_REVISION is undefined (due to LLVM_APPEND_VC_REV=OFF or otherwise) by setting "<unknown>" value as before D123549. Reviewed By: yota9 Differential Revision: https://reviews.llvm.org/D123852	2022-04-15 06:33:14 -07:00
Amir Ayupov	2a9386726b	[BOLT][NFC] Use LLVM_REVISION instead of BOLT_VERSION_STRING Remove duplicate version string identification Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D123549	2022-04-14 19:16:35 -07:00
Maksim Panchenko	77b75ca53f	[BOLT][perf2bolt] Fix base address calculation for shared objects When processing profile data for shared object or PIE, perf2bolt needs to calculate base address of the binary based on the map info reported by the perf tool. When the mapping data provided is for the second (or any other than the first) segment and the segment's file offset does not match its memory offset, perf2bolt uses wrong assumption about the binary base address. Add a function to calculate binary base address using the reported memory mapping and use the returned base for further address adjustments. Reviewed By: yota9 Differential Revision: https://reviews.llvm.org/D123755	2022-04-14 10:29:53 -07:00
Vladislav Khmelevsky	2f98c5febc	[BOLT] Update skipRelocation for aarch64 The ld might relax ADRP+ADD or ADRP+LDR sequences to the ADR+NOP, add the new case to the skipRelocation for aarch64. Vladislav Khmelevsky, Advanced Software Technology Lab, Huawei Differential Revision: https://reviews.llvm.org/D123334	2022-04-13 22:54:06 +03:00
Maksim Panchenko	36cb736665	[BOLT] Ignore PC-relative relocations from data to data BOLT expects PC-relative relocations in data sections to reference code and the relocated data to form a jump table. However, there are cases where PC-relative addressing is used for data-to-data references (e.g. clang-15 can generate such code). BOLT should recognize and ignore such relocations. Otherwise, they will be considered relocations not claimed by any jump table and cause a failure in the strict mode. Reviewed By: yota9, Amir Differential Revision: https://reviews.llvm.org/D123650	2022-04-13 11:13:51 -07:00
Amir Ayupov	bad3798113	[BOLT] Fix data race in shortenInstructions Address ThreadSanitizer warning Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D121338	2022-04-13 11:10:36 -07:00
Rahman Lavaee	0c13d97e2b	Allow building heatmaps from basic sampled events with `-nl`. I find that this is useful for finding event hotspots. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D123067	2022-04-11 15:04:44 -07:00
Amir Ayupov	9b02dc631d	[BOLT] Check MCContext errors Abort on emission errors to prevent a malformed binary being written. Example: ``` <unknown>:0: error: Undefined temporary symbol .Ltmp26310 <unknown>:0: error: Undefined temporary symbol .Ltmp26311 <unknown>:0: error: Undefined temporary symbol .Ltmp26312 <unknown>:0: error: Undefined temporary symbol .Ltmp26313 <unknown>:0: error: Undefined temporary symbol .Ltmp26314 <unknown>:0: error: Undefined temporary symbol .Ltmp26315 BOLT-ERROR: Emission failed. ``` Reviewed By: yota9 Differential Revision: https://reviews.llvm.org/D123263	2022-04-08 21:08:39 -07:00
Argyrios Kyrtzidis	330268ba34	[Support/Hash functions] Change the `final()` and `result()` of the hashing functions to return an array of bytes Returning `std::array<uint8_t, N>` is better ergonomics for the hashing functions usage, instead of a `StringRef`: * When returning `StringRef`, client code is "jumping through hoops" to do string manipulations instead of dealing with fixed array of bytes directly, which is more natural * Returning `std::array<uint8_t, N>` avoids the need for the hasher classes to keep a field just for the purpose of wrapping it and returning it as a `StringRef` As part of this patch also: * Introduce `TruncatedBLAKE3` which is useful for using BLAKE3 as the hasher type for `HashBuilder` with non-default hash sizes. * Make `MD5Result` inherit from `std::array<uint8_t, 16>` which improves & simplifies its API. Differential Revision: https://reviews.llvm.org/D123100	2022-04-05 21:38:06 -07:00
Amir Ayupov	f99398fe0e	[BOLT][NFC] Move isADD64rr and isADDri out of MCPlusBuilder class Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D123077	2022-04-05 14:32:07 -07:00
Vladislav Khmelevsky	2e51a32219	[BOLT] Check for !isTailCall in isUnconditionalBranch Add !isTailCall in isUnconditionalBranch check in order to sync the x86 and aarch64 and fix the fixDoubleJumps pass on aarch64. Vladislav Khmelevsky, Advanced Software Technology Lab, Huawei Differential Revision: https://reviews.llvm.org/D122929	2022-04-05 23:39:34 +03:00
Vladislav Khmelevsky	4956e0e197	[BOLT] Fix plt relocations symbol match The bfd linker adds the symbol versioning string to the symbol name in symtab. Skip the versioning part in order to find the registered PLT function. Vladislav Khmelevsky, Advanced Software Technology Lab, Huawei Differential Revision: https://reviews.llvm.org/D122039	2022-04-05 15:57:26 +03:00
Amir Ayupov	686406a006	[BOLT][NFC] Use X86 mnemonic checks Remove switches in X86MCPlusBuilder.cpp, use mnemonic checks instead Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D122853	2022-04-04 14:05:46 -07:00
Vladislav Khmelevsky	3b1314f4de	[BOLT] AArch64: Read all static relocations Read static relocs on the same address, as dynamic in order to update constant island data address properly. Differential Revision: https://reviews.llvm.org/D122100	2022-04-03 19:03:35 +03:00
Vladislav Khmelevsky	4c14519ecb	[BOLT] LongJmp: Check for shouldEmit Check that the function will be emitted in the final binary. Preserving old function address is needed in case it is PLT trampiline, that is currently not moved by the BOLT. Differential Revision: https://reviews.llvm.org/D122098	2022-03-31 22:33:09 +03:00
Vladislav Khmelevsky	fed958c6cc	[BOLT] AArch64: Emit text objects BOLT treats aarch64 objects located in text as empty functions with contant islands. Emit them with at least 8-byte alignment to the new text section. Vladislav Khmelevsky, Advanced Software Technology Lab, Huawei Differential Revision: https://reviews.llvm.org/D122097	2022-03-31 22:28:50 +03:00
Amir Ayupov	c31af7cfe3	[MC][BOLT] Add setter for AllowAtInName Use the setter in BOLT to allow printing names with variant kind in the name (e.g. "func@PLT"). Fixes BOLT buildbot tests that broke after D122516: https://lab.llvm.org/buildbot/#/builders/215/builds/3595 Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D122694	2022-03-30 13:04:28 -07:00
Vladislav Khmelevsky	af9bdcfc46	[BOLT] Align constant islands to 8 bytes AArch64 requires CI to be aligned to 8 bytes due to access instructions restrictions. E.g. the ldr with imm, where imm must be aligned to 8 bytes. Differential Revision: https://reviews.llvm.org/D122065	2022-03-27 22:30:42 +03:00
spupyrev	4609f60ebc	[BOLT] Avoid pointless loop rotation It seems the earlier implementation does not follow the description in LoopRotationPass.h: It rotates loops even if they are already laid out correctly. The diff adjusts the behaviour. Given that the impact of LoopInversionPass is minor, this change won't yield significant perf differences. Tested on clang-10: there seems to be a 0.1%-0.3% cpu win and a small reduction of branch misses. Before: BOLT-INFO: 120 Functions were reordered by LoopInversionPass After: BOLT-INFO: 79 Functions were reordered by LoopInversionPass Reviewed By: yota9 Differential Revision: https://reviews.llvm.org/D121921	2022-03-22 12:42:42 -07:00

1 2 3 4

166 Commits