llvm-project

Commit Graph

Author	SHA1	Message	Date
Igor Kudrin	657e067bb5	[ARMInstPrinter] Print the target address of a branch instruction This follows other patches that changed printing immediate values of branch instructions to target addresses, see D76580 (x86), D76591 (PPC), D77853 (AArch64). As observing immediate values might sometimes be useful, they are printed as comments for branch instructions. // llvm-objdump -d output (before) 000200b4 <_start>: 200b4: ff ff ff fa blx #-4 <thumb> 000200b8 <thumb>: 200b8: ff f7 fc ef blx #-8 <_start> // llvm-objdump -d output (after) 000200b4 <_start>: 200b4: ff ff ff fa blx 0x200b8 <thumb> @ imm = #-4 000200b8 <thumb>: 200b8: ff f7 fc ef blx 0x200b4 <_start> @ imm = #-8 // GNU objdump -d. 000200b4 <_start>: 200b4: faffffff blx 200b8 <thumb> 000200b8 <thumb>: 200b8: f7ff effc blx 200b4 <_start> Differential Revision: https://reviews.llvm.org/D104701	2021-06-30 16:35:28 +07:00
Fangrui Song	814dffa4b7	[llvm-objcopy][MachO] Support LC_LINKER_OPTIMIZATION_HINT load command The load command is currently specific to arm64 and holds information for instruction rewriting, e.g. converting a GOT load to an ADR to compute a local address. (On ELF the information is usually conveyed by relocations, e.g. R_X86_64_REX_GOTPCRELX, R_PPC64_TOC16_HA) Reviewed By: alexander-shaposhnikov Differential Revision: https://reviews.llvm.org/D104968	2021-06-29 18:47:55 -07:00
Fangrui Song	d4dcb55c70	[llvm-readobj] Make -s and -t match llvm-readelf llvm-readobj is an internal testing tool for binary formats. Its output and command line options do not need to be stable. It isn't supposed to be part of a build process. llvm-readelf was created as a user-facing utility and its interface intends to be compatible with GNU readelf (unless there are good reasons not to). The two tools have mostly compatible options. -s and -t are noticeable exceptions due to history. I think the cost of keeping the inconsistency overweighs the little history-compatible benefit and hinders transition from cl::opt to OptTable, so let's change it. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D105055	2021-06-29 11:56:26 -07:00
Fangrui Song	69937a8080	[llvm-objcopy][MachO] Support ARM64_RELOC_ADDEND An ARM64_RELOC_ADDEND relocation reuses the symbol field for the addend value. We should pass through such relocations. Reviewed By: alexander-shaposhnikov Differential Revision: https://reviews.llvm.org/D104967	2021-06-29 11:23:30 -07:00
gbreynoo	56fa49878b	[llvm-objdump] Add testing for --print-imm-hex, --headers, --section-headers and --private-headers llvm-objdump had some missing coverage that is fixed by this change: - A test specifically for --print-imm-hex, and coverage of --no-print-imm-hex - section-headers.test checks the aliases --headers or --section-headers - A test for the use of --private-headers for ELF that checks the output - A test for ELF program headers Differential Revision: https://reviews.llvm.org/D103974	2021-06-29 17:03:21 +01:00
Igor Kudrin	d25e572421	[llvm-objdump] Print memory operand addresses as regular comments The patch reuses the common code to print memory operand addresses as instruction comments. This helps to align the comments and enables using target-specific comment markers when `evaluateMemoryOperandAddress()` is implemented for them. Differential Revision: https://reviews.llvm.org/D104861	2021-06-28 14:25:22 +07:00
Igor Kudrin	e7fffa6f03	[llvm-objdump] Prefix memory operand addresses with '0x' This helps to avoid ambiguity when the address contains only digits 0..9. Differential Revision: https://reviews.llvm.org/D104909	2021-06-28 14:25:21 +07:00
Igor Kudrin	c2e6bcb494	[llvm-objdump] Prevent variable locations to overlap short comments For now, the source variable locations are printed at about the same space as the comments for disassembled code, which can make some ranges for variables disappear if a line contains comments, for example: ┠─ bar = W1 0: add x0, x2, #2, lsl #12 // =8192┃ 4: add z31.d, z31.d, #65280 // =0xff00 8: nop ┻ The patch shifts the report a bit to allow printing comments up to approximately 16 characters without interferences. Differential Revision: https://reviews.llvm.org/D104700	2021-06-28 14:25:21 +07:00
Igor Kudrin	abe0fa4352	[llvm-objdump] Print comments for the disassembled code LLVM disassembler can generate comments for disassembled instructions. The patch enables printing these comments for 'llvm-objdump -d'. Differential Revision: https://reviews.llvm.org/D104699	2021-06-28 14:25:20 +07:00
Jan Kratochvil	c19a28919f	llvm-dwarfdump: Print warnings on invalid DWARF llvm-dwarfdump was silent even when the format of DWARF was invalid and/or llvm-dwarfdump did not understand/support some of the constructs. This can be pretty confusing as llvm-dwarfdump is a tool for DWARF producers+consumers development. Review comments also by @dblaikie. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D104271	2021-06-27 11:38:35 +02:00
Eric Astor	e074d580b2	[ms] [llvm-ml] Disable C-style comments	2021-06-25 23:09:13 -04:00
Eric Astor	c8d0d8a8a1	[ms] [llvm-ml] Add support for ALIGN, EVEN, and ORG directives Match ML.EXE's behavior for ALIGN, EVEN, and ORG directives both at file level and in STRUCTs. We currently reject negative offsets passed to ORG inside STRUCTs (in ML.EXE and ML64.EXE, they wrap around as for an unsigned 32-bit integer). Also, if a STRUCT is declared using an ORG directive, no value of that type can be defined. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D92507	2021-06-25 17:19:45 -04:00
Serge Pavlov	b36d214bed	[X86] Add description of FXAM instruction Previously this instruction could be used only in assembler. This change makes it available for compiler also. Scheduling information was copied from FTST instruction, hopefully this can be a satisfactory approximation. Differential Revision: https://reviews.llvm.org/D104853	2021-06-25 12:26:51 +07:00
Fangrui Song	ca3bdb57fa	[MC][ELF] Change SHT_LLVM_CALL_GRAPH_PROFILE relocations from SHT_RELA to SHT_REL ... even on targets preferring RELA. The section is only consumed by ld.lld which can handle REL. Follow-up to D104080 as I explained in the review. There are two advantages: * The D104080 code only handles RELA, so arm/i386/mips32 etc may warn for -fprofile-use=/-fprofile-sample-use= usage. * Decrease object file size for RELA targets While here, change the relocation to relocate weights, instead of 0,1,2,3,.. I failed to catch the issue during review.	2021-06-24 21:35:48 -07:00
Aakanksha Patil	3453f3dd46	[AMDGPU] Add gfx1035 target Differential Revision: https://reviews.llvm.org/D104804	2021-06-24 14:32:41 -04:00
Alexander Yermolovich	a224c5199b	[LLD][LLVM] CG Graph profile using relocations Currently when .llvm.call-graph-profile is created by llvm it explicitly encodes the symbol indices. This section is basically a black box for post processing tools. For example, if we run strip -s on the object files the symbol table changes, but indices in that section do not. In non-visible behavior indices point to wrong symbols. The visible behavior indices point outside of Symbol table: "invalid symbol index". This patch changes the format by using R_*_NONE relocations to indicate the from/to symbols. The Frequency (Weight) will still be in the .llvm.call-graph-profile, but symbol information will be in relocation section. In LLD information from both sections is used to reconstruct call graph profile. Relocations themselves will never be applied. With this approach post processing tools that handle relocations correctly work for this section also. Tools can add/remove symbols and as long as they handle relocation sections with this approach information stays correct. Doing a quick experiment with clang-13. The size went up from 107KB to 322KB, aggregate of all the input sections. Size of clang-13 binary is ~118MB. For users of -fprofile-use/-fprofile-sample-use the size of object files will go up slightly, it will not impact final binary size. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D104080	2021-06-24 09:09:33 -07:00
Jay Foad	beebe5a056	[MCA] Allow unlimited cycles in the timeline view Change --max-timeline-cycles=0 to mean no limit on the number of cycles. Use this in AMDGPU tests to show all instructions in the timeline view instead of having it arbitrarily truncated. Differential Revision: https://reviews.llvm.org/D104846	2021-06-24 12:54:57 +01:00
Bill Wendling	826947080b	[llvm-diff] Explicitly check ConstantStructs for differences A ConstantStruct is renamed when the LLVM context sees a new one. This makes global variable initializers appear different when they aren't. Instead, check the ConstantStruct for equivalence. Differential Revision: https://reviews.llvm.org/D104734	2021-06-23 16:26:34 -07:00
Andrew Litteken	9e73f7c8d2	[IRSim] Adding basic implementation of llvm-sim. This is a similarity visualization tool that accepts a Module and passes it to the IRSimilarityIdentifier. The resulting SimilarityGroups are output in a JSON file. Tests are found in test/tools/llvm-sim and check for the file not found, a bad module, and that the JSON is created correctly. Reviewers: paquette, jroelofs, MaskRay Recommit of: `15645d044b` to fix linking errors and GN build system. Differential Revision: https://reviews.llvm.org/D86974	2021-06-23 14:38:58 -05:00
Adrian Prantl	7a38a757a1	Move dwarfdump-invalid.test into the tools/llvm-dwarfdump directory.	2021-06-23 12:00:34 -07:00
Adrian Prantl	072f5180f2	Improve error handling in llvm-dwarfdump. Without this patch we're only showing a generic error message derived from the error code to the end user. rdar://79378794 Differential Revision: https://reviews.llvm.org/D104483	2021-06-23 10:44:13 -07:00
Roman Lebedev	707224ea16	[NFC] Update arm_function_name.ll after `4de0c40031`	2021-06-23 16:41:43 +03:00
Bill Wendling	46db43240f	[llvm-diff] Explicitly check ConstantArrays Global initializers may be ConstantArrays. They need to be checked explicitly, because different-yet-still-equivalent type names may be used for each, and/or a GEP instruction may appear in one.	2021-06-22 12:23:38 -07:00
Bill Wendling	ab6002871d	[llvm-diff] Add support for diffing the callbr instruction The only wrinkle is that we can't process the "blockaddress" arguments of the callbr until the blocks have been equated. So we force them to be "unified" before checking. This was left out when the callbr instruction was added. Differential Revision: https://reviews.llvm.org/D104606	2021-06-22 12:23:37 -07:00
Steven Wu	c747b7d1d9	[llvm] Fix lto tests that requires ld64 Since Xcode 13, ld64 requires linking libSystem for all the executable. Fix the tests that needs to run ld64 by linking libSystem from sysroot. rdar://77332728 Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D104332	2021-06-22 09:21:29 -07:00
Langston Barrett	a240358833	[llvm-reduce] Don't delete arguments of intrinsics The argument reduction pass shouldn't remove arguments of intrinsics, because the resulting module is ill-formed, and so inherently uninteresting. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D103129	2021-06-21 12:43:58 -07:00
Fangrui Song	ea23c38d06	[llvm-profdata] Allow omission of -o for --text output This makes it more convenient to get a text format profile. Add an error for printing non-text format output to a terminal for instrumentation profile. (It cannot be portably tested. For sample profile, raw_fd_ostream is hidden deeply so it's inconvenient to add a diagnostic.) Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D104600	2021-06-21 12:01:57 -07:00
Esme-Yi	657aa3a763	[yaml2obj] Add support for writing the long symbol name. Summary: This patch, as a follow-up of D95505, adds support for writing the long symbol name by implementing the StringTable. Only XCOFF32 is suppoted now. Reviewed By: jhenderson, shchenz Differential Revision: https://reviews.llvm.org/D103455	2021-06-21 05:09:56 +00:00
Fangrui Song	0873016cef	[llvm-cov gcov] Support GCC 12 format GCC 12 will change the length field to represent the number of bytes instead of 32-bit words. This avoids padding for strings.	2021-06-19 22:51:20 -07:00
Fangrui Song	cee85fcd76	[test] Fix nocompress.test	2021-06-19 16:27:53 -07:00
Fangrui Song	8ea2a58a2e	[llvm-profdata] Make diagnostics consistent with the (no capitalization, no period) style The format is currently inconsistent. Use the https://llvm.org/docs/CodingStandards.html#error-and-warning-messages style. And add `error:` or `warning:` to CHECK lines wherever appropriate.	2021-06-19 14:54:25 -07:00
Hongtao Yu	bd52495518	[CSSPGO] Undoing the concept of dangling pseudo probe As a follow-up to https://reviews.llvm.org/D104129, I'm cleaning up the danling probe related code in both the compiler and llvm-profgen. I'm seeing a 5% size win for the pseudo_probe section for SPEC2017 and 10% for Ciner. Certain benchmark such as 602.gcc has a 20% size win. No obvious difference seen on build time for SPEC2017 and Cinder. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D104477	2021-06-18 15:14:11 -07:00
Hongtao Yu	8c2c97287e	[CSSPGO][llvm-profgen] Ignore LBR records after interrupt transition If we have seen an inwards transition from external code to internal code, but not a following outwards transition, the inwards transition is likely due to interrupt which is usually unpaired. Ignore current and subsequent entries since they are likely from an unrelated pre-interrupt context. LBR records from different interrupt context are unrelated and they should not be mixed together. Currenlty the OS does this for task-scheduling interrupt but not for all interrupts. Reviewed By: wenlei, wlei Differential Revision: https://reviews.llvm.org/D104276	2021-06-18 12:13:53 -07:00
Igor Kudrin	85ec210751	[objdump][ARM] Fix evaluating the target address of a Thumb BLX(i) The instruction can be 16-bit aligned while targeting 32-bit aligned code. To calculate the target address correctly, the address of the instruction has to be adjusted. Differential Revision: https://reviews.llvm.org/D104446	2021-06-18 10:40:55 +07:00
Martin Storsjö	ca56b33daf	[llvm-dlltool] Imply the target arch from a tool triple prefix Also use the default LLVM target as default for dlltool. This matches how GNU dlltool behaves; it is compiled with one default target, which is used if no option is provided. Extend the anonymous namespace in the implementation file instead of using static functions. Based on a patch by Mateusz Mikuła. The effect of the default LLVM target, if neither the -m option nor a tool triple prefix is provided, isn't tested, as we can't make assumptions about what it is set to. (We could make the default be forced to one of the four supported architectures if the default triple is another arch, and then just test that llvm-dlltool without an -m option is able to produce an import library, without checking the actual architecture though.) Differential Revision: https://reviews.llvm.org/D104212	2021-06-17 13:02:35 +03:00
Martin Storsjö	675d52bc46	[llvm-dlltool] [test] Add a testcase for all machine option types. NFC. The existing tests only test that some options (but not e.g. arm) are accepted, but it doesn't test their functional effect of affecting the generated object files. Differential Revision: https://reviews.llvm.org/D104215	2021-06-17 13:02:35 +03:00
Martin Storsjö	08be746728	[llvm-dlltool] [test] Remove superfluous --coff-exports option to llvm-readobj. NFC. The --coff-exports option to llvm-readobj prints the exported symbols from a DLL/EXE, it doesn't do anything with regards to an import library. Differential Revision: https://reviews.llvm.org/D104214	2021-06-17 13:02:34 +03:00
Martin Storsjö	4fe3d5248d	[llvm-dlltool] [test] Test both short and long forms of options. NFC. Differential Revision: https://reviews.llvm.org/D104213	2021-06-17 13:02:34 +03:00
Fangrui Song	d619cf5ac5	[llvm-objcopy][MachO] Copy LC_LINKER_OPTIMIZATION_HINT This fixes `error: unsupported load command (cmd=0x2e)`	2021-06-16 12:09:50 -07:00
Hongtao Yu	cef9b96b01	[CSSPGO] Report zero-count probe in profile instead of dangling probes. Previously dangling samples were represented by INT64_MAX in sample profile while probes never executed were not reported. This was based on an observation that dangling probes were only at a smaller portion than zero-count probes. However, with compiler optimizations, dangling probes end up becoming at large portion of all probes in general and reporting them does not make sense from profile size point of view. This change flips sample reporting by reporting zero-count probes instead. This enabled dangling probe to be represented by none (missing entry in profile). This has a couple benefits: 1. Reducing sample profile size in optimize mode, even when the number of non-executed probes outperform the number of dangling probes, since INT64_MAX takes more space over 0 to encode. 2. Binary size savings. No need to encode dangling probe anymore, since missing probes are treated as dangling in the profile reader. 3. Reducing compiler work to track dangling probes. However, for probes that are real dead and removed, we still need the compiler to identify them so that they can be reported as zero-count, instead of mistreated as dangling probes. 4. Improving counts quality by respecting the counts already collected on the non-dangling copy of a probe. A probe, when duplicated, gets two copies at runtime. If one of them is dangling while the other is not, merging the two probes at profile generation time will cause the real samples collected on the non-dangling one to be discarded. Not reporting the dangling counterpart will keep the real samples. 5. Better readability. 6. Be consistent with non-CS dwarf line number based profile. Zero counts are trusted by the compiler counts inferencer while missing counts will be inferred by the compiler. Note that the current patch does include any work for #3. There will be follow-up changes. For #1, I've seen for a large Facebook service, the text profile is reduced by 7%. For extbinary profile, the size of LBRProfileSection is reduced by 35%. For #4, I have seen general counts quality for SPEC2017 is improved by 10%. Reviewed By: wenlei, wlei, wmi Differential Revision: https://reviews.llvm.org/D104129	2021-06-16 11:45:29 -07:00
Fangrui Song	1de18ad8d7	[llvm-objcopy] Make ihex writer similar to binary writer There is no need to differentiate whether `UseSegments` is true or false. Unifying the cases makes the behavior closer to BinaryWriter. This improves compatibility with objcopy because SHF_ALLOC sections not in a PT_LOAD will not be skipped. Such cases are usually erroneous input, though. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D104186	2021-06-16 10:08:20 -07:00
James Henderson	b9ce8ea454	[obj2yaml] Address D104035 review comments Accidentally missed from commit `5c1639fe06`. Differential Revision: https://reviews.llvm.org/D104035	2021-06-16 15:01:54 +01:00
Andrea Di Biagio	70b37f4c03	[MCA][InstrBuilder] Always check for implicit uses of resource units (PR50725). When instructions are issued to the underlying pipeline resources, the mca::ResourceManager should also check for the presence of extra uses induced by the explicit consumption of multiple partially overlapping group resources. Fixes PR50725	2021-06-16 14:51:12 +01:00
Ben Dunbobbin	dbc07ef5ca	[llvm-symbolizer] improve test and fix doc example after recent --print-source-context-lines behaviour change I believe that after https://reviews.llvm.org/D102355 the behaviour of --print-source-context-lines has changed. Before: --print-source-context-lines=3 prints 4 lines. After: --print-source-context-lines=3 prints 3 lines. Adjust the example in the docs for this change and make the testing a little more robust. Differential Revision: https://reviews.llvm.org/D104114	2021-06-16 13:38:22 +01:00
James Henderson	5c1639fe06	[yaml2obj][obj2yaml] Support custom ELF section header string table name This patch adds support for a new field in the FileHeader, which states the name to use for the section header string table. This also allows combining the string table with another string table in the object, e.g. the symbol name string table. The field is optional. By default, .shstrtab will continue to be used. This partially fixes https://bugs.llvm.org/show_bug.cgi?id=50506. Reviewed by: Higuoxing Differential Revision: https://reviews.llvm.org/D104035	2021-06-16 10:02:23 +01:00
James Henderson	fef3bfb1b2	[yaml2obj] Fix bug when referencing items in SectionHeaderTable There was an off-by-one error caused by an index (which included an index for the null section header) being used to check against the size of a list of sections (which didn't include the null section header). This is a partial fix for https://bugs.llvm.org/show_bug.cgi?id=50506. Reviewed by: MaskRay Differential Revision: https://reviews.llvm.org/D104098	2021-06-16 10:02:22 +01:00
Andrea Di Biagio	beb5213a2e	[MCA][InstrBuilder] Check for the presence of flag VariadicOpsAreDefs. This patch fixes the logic that checks for variadic register definitions, Before llvm-svn 348114 (commit `4cf35b4ab0`), it was not possible to explicitly mark variadic operands as definitions. By default, variadic operands of an MCInst were always assumed to be uses. A number of had-hoc checks were introduced in the InstrBuilder to fix the processing of variadic register operands of ARM ldm/stm variants. This patch simply replaces those old (and buggy) checks with a much simpler (and correct) check for MCID::Flag::VariadicOpsAreDefs.	2021-06-15 09:52:38 +01:00
CarlosAlbertoEnciso	d0a5d86119	[Debug-Info][CodeView] Fix GUID string generation for MSVC generated objects. This patch is to address https://bugs.llvm.org/show_bug.cgi?id=50459. YAML:455:28: error: GUID strings are 38 characters long The valid format for a GUID is {XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX} where X is a hex digit (0,1,2,3,4,5,6,7,8,9,A,B,C,D,E,F). The length of the individual components must be: 8, 4, 4, 4, 12. For some cases, the converted string generated by obj2yaml, does not comply with those lengths. yaml2obj checks that the GUID string must be 38 characters including the dashes and braces. Reviewed By: amccarth Differential Revision: https://reviews.llvm.org/D103089	2021-06-15 06:53:21 +01:00
wlei	863184dd69	[CSSPGO] Aggregation by the last K context frames for cold profiles This change provides the option to merge and aggregate cold context by the last k frames instead of context-less name. By default K = 1 means the context-less one. This is for better perf tuning. The more selective merging and trimming will rely on llvm-profgen's preinliner. Reviewed By: wenlei, hoy Differential Revision: https://reviews.llvm.org/D104131	2021-06-14 10:33:43 -07:00
RamNalamothu	167e7afcd5	Implement DW_CFA_LLVM_* for Heterogeneous Debugging Add support in MC/MIR for writing/parsing, and DebugInfo. This is part of the Extensions for Heterogeneous Debugging defined at https://llvm.org/docs/AMDGPUDwarfExtensionsForHeterogeneousDebugging.html Specifically the CFI instructions implemented here are defined at https://llvm.org/docs/AMDGPUDwarfExtensionsForHeterogeneousDebugging.html#cfa-definition-instructions Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D76877	2021-06-14 08:51:50 +05:30

1 2 3 4 5 ...

5170 Commits