forked from OSchip/llvm-project
Summary: We extended DynoStats to dump a histogram of execution counts per instruction opcode. The dump is turned off by default; use '-print-dyno-opcode-stats' to enable it. For each instruction opcode, BOLT also dumps the maximum execution count and the corresponding function name and basic block offset where the instruction occurs.

Below is a sample of the dump:

Opcode, Execution Count, Max Exec Count, Function Name:Offset
SHR8rCL, 232, 232, _ZNK5folly14AsyncSSLSocket4goodEv:53
VPADDDYrr, 13956, 388, chacha20_encrypt_bytes.part.0/3:736
PMOVSXBWrr, 4, 2, ares_expand_name/1:264
VMOVAPSmr, 1082, 43, chacha20_encrypt_bytes.part.0/3:2864
VPSHUFBrr, 9540, 1667, chacha20_encrypt_bytes.part.0/3:4416
VPUNPCKLDQYrr, 1102, 188, jsimd_ycc_rgb_convert_avx2/1:125
VPBROADCASTQYrm, 39, 39, chacha20_encrypt_bytes.part.0/3:400
PMOVSXWDrr, 8, 2, ares_expand_name/1:264
VPORrr, 817, 129, jsimd_idct_islow_avx2/1:41
PSLLDri, 8690752, 65644, blockmix_salsa8_xor/1:1424

(cherry picked from FBD28859624)
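The aggregation described in the summary can be sketched roughly as follows. This is a minimal illustration, not BOLT's actual implementation: `InstrSample`, `OpcodeStats`, `buildOpcodeHistogram`, and `formatRow` are hypothetical names introduced here, and the sketch assumes each instruction contributes the execution count of its enclosing basic block to the opcode's total.

```cpp
#include <cstdint>
#include <map>
#include <string>
#include <vector>

// Hypothetical input record: one instruction plus the execution count of
// the basic block that contains it. Not a BOLT data structure.
struct InstrSample {
  std::string Opcode;   // opcode name, e.g. "PSLLDri"
  std::string Function; // name of the enclosing function
  uint64_t Offset;      // basic block offset within the function
  uint64_t ExecCount;   // execution count attributed to this instruction
};

// Per-opcode summary matching the four columns of the sample dump.
struct OpcodeStats {
  uint64_t TotalCount = 0; // "Execution Count" column
  uint64_t MaxCount = 0;   // "Max Exec Count" column
  std::string MaxFunction; // function where the maximum was seen
  uint64_t MaxOffset = 0;  // basic block offset where the maximum was seen
};

// Accumulate a histogram keyed by opcode name (assumed aggregation rule:
// sum the counts, remember the location of the largest single count).
std::map<std::string, OpcodeStats>
buildOpcodeHistogram(const std::vector<InstrSample> &Samples) {
  std::map<std::string, OpcodeStats> Histogram;
  for (const InstrSample &S : Samples) {
    OpcodeStats &Stats = Histogram[S.Opcode];
    Stats.TotalCount += S.ExecCount;
    if (S.ExecCount > Stats.MaxCount) {
      Stats.MaxCount = S.ExecCount;
      Stats.MaxFunction = S.Function;
      Stats.MaxOffset = S.Offset;
    }
  }
  return Histogram;
}

// Render one row in the same comma-separated layout as the sample dump.
std::string formatRow(const std::string &Opcode, const OpcodeStats &Stats) {
  return Opcode + ", " + std::to_string(Stats.TotalCount) + ", " +
         std::to_string(Stats.MaxCount) + ", " + Stats.MaxFunction + ":" +
         std::to_string(Stats.MaxOffset);
}
```

Keeping only the single largest count per opcode keeps each row compact while still pointing at the hottest site for that opcode, which appears to be the intent of the "Max Exec Count" and "Function Name:Offset" columns.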