llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	3329cec2f7	[DebugInfo] Don't join DW_AT_comp_dir and directories[0] for DWARF v5 line tables DWARF v5 6.2.4 The Line Number Program Header says: > The first entry is the current directory of the compilation. Each additional > path entry is either a full path name or is relative to the current directory of > the compilation. When forming a path, relative DW_AT_comp_dir and directories[0] are not supposed to be joined together. Fix getFileNameByIndex to special case DWARF v5 DirIdx == 0. Reviewed By: #debug-info, dblaikie Differential Revision: https://reviews.llvm.org/D131804	2022-08-12 14:01:52 -07:00
Fangrui Song	f62e60fb23	[MCDwarf] Respect -fdebug-prefix-map= for generated assembly debug info (DWARF v5) For generated assembly debug info, MCDwarfLineTableHeader::CompilationDir is an unmapped path set in MCContext::setGenDwarfRootFile. Remap it. A relative destination path of -fdebug-prefix-map= exposes a llvm-dwarfdump bug which joins relative DW_AT_comp_dir and directories[0]. Fix https://github.com/llvm/llvm-project/issues/56609 Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D131749	2022-08-12 12:52:36 -07:00
Fangrui Song	d561907f27	[MCDwarf][test] Improve debug-prefix-map.s	2022-08-11 22:40:31 -07:00
Dmitry Preobrazhensky	4e68834add	[AMDGPU][MC][GFX11][NFC] Add tests for VOP1 and VOP2 16 bit opcodes Differential Revision: https://reviews.llvm.org/D131588	2022-08-11 17:12:13 +03:00
Dmitry Preobrazhensky	eda6e49aa8	[AMDGPU][MC][GFX11][NFC] Correct tests for 16-bit VOP2 opcodes which use v128 or higher VGPRs Differential Revision: https://reviews.llvm.org/D131564	2022-08-11 15:55:58 +03:00
Dmitry Preobrazhensky	2b79b2e392	[AMDGPU][MC][GFX11][NFC] Remove test duplicates Differential Revision: https://reviews.llvm.org/D131558	2022-08-11 15:10:01 +03:00
Dmitry Preobrazhensky	c0de0e0c64	[AMDGPU][MC][GFX11][NFC] Consolidate VOP tests by encoding Differential Revision: https://reviews.llvm.org/D131559	2022-08-11 15:02:22 +03:00
Dmitry Preobrazhensky	9d1eeefbfe	[AMDGPU][MC][GFX11][NFC] Rename tests Make test names more uniform. Differential Revision: https://reviews.llvm.org/D131398	2022-08-09 13:56:05 +03:00
Dmitry Preobrazhensky	fc4c1a86f3	[AMDGPU][MC][GFX11][NFC] Split large tests Differential Revision: https://reviews.llvm.org/D131397	2022-08-09 13:24:23 +03:00
Craig Topper	eec05bfeaf	[RISCV] Add {{$}} to some CHECK lines in MC tests. This adds {{$}} to most of the CHECK lines for errors about what extension an instruction requires. This ensures we check all the extensions when there is a list. Differential Revision: https://reviews.llvm.org/D131436	2022-08-08 14:38:11 -07:00
Martin Storsjö	0c52ab3968	[MC] [Win64EH] Fix the calculation of the end of epilogs Exclude the terminating end opcode from the epilog - it doesn't correspond to an actual instruction that is included in the epilog itself (within the .seh_startepilogue/.seh_endepilogue range). In most (all?) cases, an epilog is followed by a matching terminating instruction though (a ret or a branch to a tail call), but it's not strictly within the .seh_startepilogue/.seh_endepilogue range. This fixes a number of failed asserts in cases where the codegen has incorrectly reoredered SEH opcodes so they don't match up exactly with their instructions. However this still just avoids failing the assertion; the root cause of generating unexpected epilogs is still present (and fixing that is a less obvious issue). Differential Revision: https://reviews.llvm.org/D131393	2022-08-08 23:03:17 +03:00
Sheng	64d326c33c	[M68k] Add MC support for link/unlk Reviewers: myhsu Differential Revision: https://reviews.llvm.org/D125444	2022-08-08 11:00:11 +08:00
Leon Clark	6a275cd53c	Transform illegal intrinsics to V_ILLEGAL Related tasks: - SWDEV-240194 - SWDEV-309417 - SWDEV-334876 Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D123693	2022-08-06 08:59:00 +01:00
Zhaoshi Zheng	99e50e5838	[WinEH][ARM64] Split Unwind Info for Fucntions Larger than 1MB Create function segments and emit unwind info of them. A segment must be less than 1MB and no prolog or epilog is splitted between two segments. This patch should generate correct, though not optimal, unwind info for large functions. Currently it only generate pacted info (.pdata) only for functions that are less than 1MB (single-segment functions). This is NFC from before this patch. The next step is to enable (.pdata) only unwind info for the first segment or segments that have neither prolog or epilog in a multi-segment function. Another future work item is to further split segments that require more than 255 code words or have more than 65535 epilogs. Reference: https://docs.microsoft.com/en-us/cpp/build/arm64-exception-handling#function-fragments Differential Revision: https://reviews.llvm.org/D130049	2022-08-05 11:46:41 -07:00
Dmitry Preobrazhensky	05b3aadfff	[AMDGPU][MC][GFX11] Correct v_dot2_f16_f16 and v_dot2_bf16_bf16 Enable SGPRs for the following operands of these opcodes: - src operands of VOP3 variant. - src2 operand of DPP variants. Differential Revision: https://reviews.llvm.org/D130989	2022-08-03 15:08:23 +03:00
Dmitry Preobrazhensky	ae553f9e49	[AMDGPU][MC][GFX10] Correct encoding of VOP3 v_cmpx* opcodes Encode dst=EXEC but allow disassembler accept any dst value. Differential Revision: https://reviews.llvm.org/D130978	2022-08-03 15:03:44 +03:00
Dmitry Preobrazhensky	3aae8cd842	[AMDGPU][MC] Verify selection of LDS MUBUF opcodes Differential Revision: https://reviews.llvm.org/D130761	2022-08-01 16:44:39 +03:00
Dmitry Preobrazhensky	bb901dcc5a	[AMDGPU][MC][GFX940] Correct disassembly of MFMA opcodes Add a decoder table for GFX940 MFMA opcodes. Differential Revision: https://reviews.llvm.org/D130759	2022-08-01 16:00:47 +03:00
Lucas Prates	ba9caf9170	[Arm] Fix parsing and emission of Tag_also_compatible_with eabi attribute According to the ABI for the Arm Architecture, the value for the Tag_also_compatible_with eabi attribute is represented by an NTBS entry. This string value, in turn, is composed of a pair of tag+value encoded in one of two formats: - ULEB128: tag, ULEB128: value, 0. - ULEB128: tag, NBTS: data. (See [[ `60a8eb8c55/addenda32/addenda32.rst (3373secondary-compatibility-tag)` \| section 3.3.7.3 on the Addenda to, and Errata in, the ABI for the Arm Architecture ]].) Currently the Arm assembly parser and streamer ignore the encoding of the attribute's NTBS value, which can result in incorrect attributes being emitted in both assembly and object file outputs. This patch fixes these issues by properly handing the value's encoding. An update to llvm-readobj to properly handle the attribute's value will be covered by a separate patch. Patch by Victor Campos and Lucas Prates. Reviewed By: vhscampos Differential Revision: https://reviews.llvm.org/D129500	2022-08-01 13:28:01 +01:00
Petar Avramovic	e8d260753e	[AMDGPU] gfx11 allow dlc for MUBUF atomics Add MC support for dlc in gfx11 MUBUF atomic instructions. Differential Revision: https://reviews.llvm.org/D129075	2022-08-01 12:18:01 +02:00
Mirko Brkusanin	6a1aa627fa	[AMDGPU] Enable image_gather4h instruction for gfx10 and gfx11 Differential Revision: https://reviews.llvm.org/D130764	2022-07-29 15:42:06 +02:00
Fangrui Song	9f0d5330bd	[MC][test] Rename two --compress-debug-sections=zlib tests To be clearer when zstd support is added.	2022-07-28 10:57:56 -07:00
Dmitry Preobrazhensky	2b230d69ad	[AMDGPU][MC][GFX90A] Correct MIMG dst size validation Correct validator to enable MIMG dst size checks. Differential Revision: https://reviews.llvm.org/D130512	2022-07-28 14:30:08 +03:00
Dmitry Preobrazhensky	fa7fd8ec31	[AMDGPU][MC][GFX11] Disable SGPRs for src1 of v_fma_mix*_dpp opcodes Differential Revision: https://reviews.llvm.org/D130634	2022-07-28 14:20:05 +03:00
Phoebe Wang	726d9f8e8c	[X86][MC] Avoid emitting incorrect warning for complex FMUL We will insert a new operand which is identical to the Dest for complex FMUL with a mask. https://godbolt.org/z/eTEdnYv3q Complex FMA and FMUL with maskz don't have this problem. Reviewed By: LuoYuanke, skan Differential Revision: https://reviews.llvm.org/D130638	2022-07-28 13:58:34 +08:00
David Green	39f8384964	[ARM] Correct features on pacbti instructions. Given a patch like D129506, using instructions not valid for the current feature set becomes an error. This updates the Arm hint-space instructions for pac/bti to require thumbv7m as opposed to 8.1-m.main, to make them valid when compiling for thumbv7m with -mbranch-protection. Differential Revision: https://reviews.llvm.org/D129692	2022-07-27 09:15:14 +01:00
Craig Topper	3a2d7d8ad5	[RISCV] Add Predicate to c.lw/c.sw/c.lwsp/c.swsp InstAliases with no offset. These are aliases that allow the immediate offset to be ommitted. We had predicates for the RV64, RV32+F, and D versions, but not the base versions. I've also re-ordered them to share Predicate lines to improve readability.	2022-07-26 11:06:00 -07:00
Craig Topper	bc165de1c7	[RISCV] Minor fixes to rv64c-valid.s test. -Missing CHECK-NO-EXT and CHECK-NO-RV64 on subw. -Stray CHECK-NO-RV64 on c.slli. -c.slli used immediate 1 instead of RV64 only immediate like 63. -Missing CHECK-NO-EXT on c.srli and c.srai	2022-07-26 10:42:59 -07:00
Dmitry Preobrazhensky	e43621b09c	[AMDGPU][MC][GFX11] Correct src0 for VOP3_DPP variants of v_cmpclass opcodes Disable SGPRs for src0 of these opcodes. Differential Revision: https://reviews.llvm.org/D130486	2022-07-26 17:52:34 +03:00
Dmitry Preobrazhensky	0eb9f18520	[AMDGPU][MC][GFX11] Correct encoding of VOP3/VOP3_DPP v_cmpx* opcodes Encode dst=EXEC but allow disassembler accept any dst value. Differential Revision: https://reviews.llvm.org/D130345	2022-07-26 17:36:22 +03:00
Simon Tatham	ae16b2ed98	[llvm-objdump,ARM] Fix a lot more tests. When I changed the output format of llvm-objdump for Arm and AArch64 in D130358, I hadn't realised llvm-objdump was used so much in the plain MC tests as well as tests of itself and lld. Sorry!	2022-07-26 10:22:02 +01:00
Victor Campos	b43bec19b9	[ARM] Add Tag_CPU_arch missing value descriptions in attribute parser The ARM attribute parser for Tag_CPU_arch is missing value descriptions for Armv8-A and Armv8-R. This patch adds these descriptions. Reviewed By: pratlucas Differential Revision: https://reviews.llvm.org/D129631	2022-07-26 08:32:40 +01:00
Petar Avramovic	8de1f04c77	[AMDGPU] gfx11 Fix VOP3 dot instructions Fix src modifiers for operands with bf16 type. op_sel[0:1] are ignored. Differential Revision: https://reviews.llvm.org/D129084	2022-07-22 11:43:35 +02:00
ksyx	3198364e6e	[RISCV][Clang] Add support for Zmmul extension This patch implements recently ratified extension Zmmul, a subextension of M (Integer Multiplication and Division) consisting only multiplication part of it. Differential Revision: https://reviews.llvm.org/D103313 Reviewed By: craig.topper, jrtc27, asb	2022-07-18 20:26:08 -04:00
Stanislav Mekhanoshin	523a99c0eb	[AMDGPU] Support for gfx940 fp8 smfmac Differential Revision: https://reviews.llvm.org/D129908	2022-07-18 12:12:41 -07:00
Stanislav Mekhanoshin	2695f0a688	[AMDGPU] Support for gfx940 fp8 mfma Differential Revision: https://reviews.llvm.org/D129906	2022-07-18 11:49:56 -07:00
Stanislav Mekhanoshin	9fa5a6b7e8	[AMDGPU] Support for gfx940 fp8 conversions Differential Revision: https://reviews.llvm.org/D129902	2022-07-18 11:48:43 -07:00
Petar Avramovic	c287bc4841	[AMDGPU][MC][GFX11] AsmParser for op_sel for VOP3 dpp opcodes Parse op_sel for *_e64_dpp VOP3 opcodes. Depends on D129637 and setting of VOP3_OPSEL in dpp pseudos. Differential Revision: https://reviews.llvm.org/D129767	2022-07-18 15:08:52 +02:00
Abinav Puthan Purayil	d96361d714	[AMDGPU] Add the uses_dynamic_stack field to the kernel descriptor and the kernel metadata map This change introduces the dynamic stack boolean field to code-object-v3 and above under the code properties of the kernel descriptor and under the kernel metadata map of NT_AMDGPU_METADATA. This field corresponds to the is_dynamic_callstack field of amd_kernel_code_t. Differential Revision: https://reviews.llvm.org/D128344	2022-07-18 10:07:13 +05:30
Neumann Hon	e8f9a74fbf	[SystemZ][z/OS] Implement detection and handling for XPLink Leaf procedures. This PR adds support for creating leaf functions when there are no CSRs used, no function calls are made, no stack frame is acquired, and contain no try/catch/throw statements. Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D129687	2022-07-17 14:30:33 -04:00
Tim Besard	c71d77876f	[MC] Avoid UAF in WinCOFFObjectWriter with weak symbols. When using weak symbols, the WinCOFFObjectWriter keeps a list (`WeakDefaults`) that's used to make names unique. This list should be reset when the object writer is reset, because otherwise reuse of the object writer can result in freed symbols being accessed. With some added output, this becomes clear when using `llc` in `--run-twice` mode: ``` $ ./llc --compile-twice -mtriple=x86_64-pc-win32 trivial.ll -filetype=obj DefineSymbol::WeakDefaults - .weak.foo.default - .weak.bar.default DefineSymbol::WeakDefaults - .weak.foo.default - áÑJÄ³⌂ p§┼Ø┐☺ - .debug_macinfo.dw - .weak.bar.default ``` This does not seem to leak into the output object file though, so I couldn't come up with a test. I added one that just does `--run-twice` (and verified that it does access freed memory), which should result in detecting the invalid memory accesses when running under ASAN. Observed in a Julia PR where we started using weak symbols: https://github.com/JuliaLang/julia/pull/45649 Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D129840	2022-07-16 13:24:08 +03:00
Dmitry Preobrazhensky	2a6532d542	[AMDGPU][MC][GFX11] Correct disassembly of *_e64_dpp opcodes which support op_sel These opcodes cannot be disassembled because op_sel operand is missing - it must be added manually. See https://github.com/llvm/llvm-project/issues/56512 for detailed issue analysis. Differential Revision: https://reviews.llvm.org/D129637	2022-07-15 13:11:59 +03:00
David Blaikie	61891748f5	Remove testing for zlib-gnu llvm-mc support in the absence of zlib	2022-07-14 22:55:47 +00:00
David Blaikie	d63ec445ca	Remove zlibgnu support in llvm-mc The feature's been removed from most other tools in LLVM at this point.	2022-07-14 22:38:59 +00:00
spupyrev	eecd41aa09	Revert "Rebase: [Facebook] [MC] Introduce NeverAlign fragment type" This reverts commit `6d0528636a`.	2022-07-11 09:50:47 -07:00
Rafael Auler	6d0528636a	Rebase: [Facebook] [MC] Introduce NeverAlign fragment type Summary: Introduce NeverAlign fragment type. The intended usage of this fragment is to insert it before a pair of macro-op fusion eligible instructions. NeverAlign fragment ensures that the next fragment (first instruction in the pair) does not end at a given alignment boundary by emitting a minimal size nop if necessary. In effect, it ensures that a pair of macro-fusible instructions is not split by a given alignment boundary, which is a precondition for macro-op fusion in modern Intel Cores (64B = cache line size, see Intel Architecture Optimization Reference Manual, 2.3.2.1 Legacy Decode Pipeline: Macro-Fusion). This patch introduces functionality used by BOLT when emitting code with MacroFusion alignment already in place. The use case is different from BoundaryAlign and instruction bundling: - BoundaryAlign can be extended to perform the desired alignment for the first instruction in the macro-op fusion pair (D101817). However, this approach has higher overhead due to reliance on relaxation as BoundaryAlign requires in the general case - see https://reviews.llvm.org/D97982#2710638. - Instruction bundling: the intent of NeverAlign fragment is to prevent the first instruction in a pair ending at a given alignment boundary, by inserting at most one minimum size nop. It's OK if either instruction crosses the cache line. Padding both instructions using bundles to not cross the alignment boundary would result in excessive padding. There's no straightforward way to request instruction bundling to avoid a given end alignment for the first instruction in the bundle. LLVM: https://reviews.llvm.org/D97982 Manual rebase conflict history: https://phabricator.intern.facebook.com/D30142613 Test Plan: sandcastle Reviewers: #llvm-bolt Subscribers: phabricatorlinter Differential Revision: https://phabricator.intern.facebook.com/D31361547	2022-07-11 09:31:52 -07:00
Joseph Huber	1d2ce4da84	[Object] Add ELF section type for offloading objects Currently we use the `.llvm.offloading` section to store device-side objects inside the host, creating a fat binary. The contents of these sections is currently determined by the name of the section while it should ideally be determined by its type. This patch adds the new `SHT_LLVM_OFFLOADING` section type to the ELF section types. Which should make it easier to identify this specific data format. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D129052	2022-07-07 12:20:30 -04:00
Paul Robinson	08e4fe6c61	[X86] Add RDPRU instruction Add support for the RDPRU instruction on Zen2 processors. User-facing features: - Clang option -m[no-]rdpru to enable/disable the feature - Support is implicit for znver2/znver3 processors - Preprocessor symbol __RDPRU__ to indicate support - Header rdpruintrin.h to define intrinsics - "rdpru" mnemonic supported for assembler code Internal features: - Clang builtin __builtin_ia32_rdpru - IR intrinsic @llvm.x86.rdpru Differential Revision: https://reviews.llvm.org/D128934	2022-07-06 07:17:47 -07:00
David Spickett	20741c74c5	[llvm][AArch64] Fix "+all" feature for sysreg aliases For example the predres extension adds one instruction that is a sys alias. Previously this wasn't disassembled properly with "+all". This was because a check for "+all" was added to haveFeatures in AArch64SysReg but not in SysAlias. Reviewed By: MaskRay, lenary Differential Revision: https://reviews.llvm.org/D129147	2022-07-06 08:41:53 +00:00
Joe Nash	1d27faa5fe	[AMDGPU] NFC. Add a test of the error message for assembling global_atomic_cmpswap_x2	2022-07-05 14:25:48 -04:00

1 2 3 4 5 ...

8819 Commits