Summary:
When parsing 64-bit MASM, treat memory operands with an unspecified base register as RIP-based.
Documented in several places, including https://software.intel.com/en-us/articles/introduction-to-x64-assembly: "Unfortunately, MASM does not allow this form of opcode, but other assemblers like FASM and YASM do. Instead, MASM embeds RIP-relative addressing implicitly."
Reviewed By: thakis
Differential Revision: https://reviews.llvm.org/D73227
This prevents the outlined functions from pulling in a lot of unnecessary code
in our downstream libraries/linker, which stops outlining from making code size
worse in C++ code with no-exceptions.
Differential Revision: https://reviews.llvm.org/D57254
The abbreviations for a given compilation unit end with an entry
consisting of a 0 byte for the abbreviation code.
Reviewed By: jhenderson
Differential Revision: https://reviews.llvm.org/D82933
The pass to split atomic and non-atomic RISC-V pseudo-instructions was itself
split into two passes in D79635 / commit rG2cb0644f90b7, with the splitting of
non-atomic instructions being moved to the PreSched2 phase. A comment was
added to D79635 detailing a case where this caused problems, so this commit
moves the non-atomic split pass back to the PreEmitPass2 phase. This allows
the bulk of the changes from D79635 to remain committed, while addressing
the reported problem (the pass split is now almost NFC). Once the root problem
is fixed we can move the (non-atomic) instruction splitting pass back to
earlier in the pipeline.
Summary: Support symbol with offset value as a VEMCExpr.
Reviewers: simoll, k-ishizaka
Reviewed By: simoll
Subscribers: hiraditya, llvm-commits
Tags: #llvm, #ve
Differential Revision: https://reviews.llvm.org/D82734
As per the documentation of `hasPairLoad`:
"`RequiredAlignment` gives the minimal alignment constraints that must be met to be able to select this paired load."
In this sense, `0` is strictly equivalent to `1`. We make this obvious by using `Align` instead of unsigned.
There is only one implementor of this interface.
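A minimal standalone sketch of why the switch is safe (not taken from the patch): llvm::Align cannot represent 0, and a default-constructed Align is already the "no constraint" value of 1, so the old RequiredAlignment values 0 and 1 collapse to the same meaning.

#include "llvm/Support/Alignment.h"
#include <cassert>

void alignmentSketch() {
  llvm::Align RequiredAlignment;          // default-constructs to Align(1)
  assert(RequiredAlignment.value() == 1); // 0 was never a distinct constraint
}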
Differential Revision: https://reviews.llvm.org/D82958
It's perfectly valid to do certain DAG combines where we extract
subvectors from a concat vector when we have scalable vector types.
However, we can do this in a way that avoids generating compiler
warnings by replacing calls to getVectorNumElements() with
getVectorMinNumElements(). Due to the way subvector extracts are
designed to work with scalable vector types, this is OK.
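A minimal sketch of the substitution, assuming an EVT coming from a DAG combine (the helper name is made up and the surrounding combine is elided):

#include "llvm/CodeGen/ValueTypes.h"

// Valid for both fixed and scalable vectors; for scalable types this returns
// the known-minimum element count, which is what subvector-extract index math
// is defined on.
static unsigned halfVectorElements(llvm::EVT VT) {
  return VT.getVectorMinNumElements() / 2;
}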
This eliminates some warnings from existing tests in this file:
llvm/test/CodeGen/AArch64/sve-intrinsics-loads.ll
Differential Revision: https://reviews.llvm.org/D82655
The situation where the caller uses a TOC and the callee does not
but is marked as clobbering the TOC (st_other=1) was not being compiled
correctly if both functions were in the same object file.
The call site where we had `callee` was missing a nop after the call.
This is because it was assumed that since the two functions were in
the same DSO they would be sharing a TOC. This is not the case if the
callee uses PC Relative because in that case it may clobber the TOC.
This patch makes sure that we add the nop correctly so that the
linker has a place to restore the TOC.
Reviewers: sfertile, NeHuang, saghir
Differential Revision: https://reviews.llvm.org/D81126
D82257/rG3521ecf1f8a3 was incorrectly sign-extending a constant vector from the lsb; this is fine if all the constant elements are 'allsignbits' in the active bits, but if only some of the elements are, then we are corrupting the constant values for those elements.
This fix ensures we sign extend from the msb of the active/demanded bits instead.
Summary:
This implements two hooks that attempt to avoid control flow for RISC-V. RISC-V
will lower SELECTs into control flow, which is not a great idea.
The hook `hasMultipleConditionRegisters()` turns off the following
DAGCombiner folds:
select(C0|C1, x, y) <=> select(C0, x, select(C1, x, y))
select(C0&C1, x, y) <=> select(C0, select(C1, x, y), y)
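As a small standalone C++ illustration of the first fold (plain booleans standing in for the DAG nodes, names made up): both spellings compute the same value, but the combined-condition form does the boolean logic up front with a single select, while the expanded form needs two.

#include <cassert>

static int selCombined(bool C0, bool C1, int X, int Y) { return (C0 | C1) ? X : Y; }
static int selExpanded(bool C0, bool C1, int X, int Y) { return C0 ? X : (C1 ? X : Y); }

int main() {
  // Exhaustively check the equivalence over both conditions.
  for (int C0 = 0; C0 < 2; ++C0)
    for (int C1 = 0; C1 < 2; ++C1)
      assert(selCombined(C0, C1, 1, 2) == selExpanded(C0, C1, 1, 2));
}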
The second hook `setJumpIsExpensive` controls a flag that has a similar purpose
and is used in CodeGenPrepare and the SelectionDAGBuilder.
Both of these have the effect of doing more of the logic up front in exchange for fewer jumps.
Note: with the `B` extension, we may be able to lower select into a conditional
move instruction, so at some point these hooks will need to be guarded based on
enabled extensions.
Reviewed By: luismarques
Differential Revision: https://reviews.llvm.org/D79268
Replace imm with timm in the pattern for SI_INIT_EXEC_LO and
remove regbank mappings for non-register operands.
Differential Revision: https://reviews.llvm.org/D82885
Summary:
This is a fix for PR45009.
When working on D67492 I made DwarfExpression emit a single
DW_OP_entry_value operation covering the whole composite location
description that is produced if a register does not have a valid DWARF
number, and is instead composed of multiple register pieces. Looking
closer at the standard, this appears to not be valid DWARF. A
DW_OP_entry_value operation's block can only be a DWARF expression or a
register location description, so it appears to not be valid for it to
hold a composite location description like that.
See DWARFv5 sec. 2.5.1.7:
"The DW_OP_entry_value operation pushes the value that the described
location held upon entering the current subprogram. It has two
operands: an unsigned LEB128 length, followed by a block containing a
DWARF expression or a register location description (see Section
2.6.1.1.3 on page 39)."
Here is a dwarf-discuss mail thread regarding this:
http://lists.dwarfstd.org/pipermail/dwarf-discuss-dwarfstd.org/2020-March/004610.html
There was not a strong consensus reached there, but people seem to lean
towards the view that operations specified under 2.6 (e.g. DW_OP_piece) may not
be part of a DWARF expression, and thus the DW_OP_entry_value operation
can't contain those.
Perhaps we instead want to emit an entry value operation for each
DW_OP_reg* operation, e.g.:
- DW_OP_entry_value(DW_OP_regx sub_reg0),
DW_OP_stack_value,
DW_OP_piece 8,
- DW_OP_entry_value(DW_OP_regx sub_reg1),
DW_OP_stack_value,
DW_OP_piece 8,
[...]
The question then becomes how the call site should look; should a
composite location description be emitted there, and we then leave it up
to the debugger to match those two composite location descriptions?
Another alternative could be to emit a call site parameter entry for
each sub-register, but firstly I'm unsure if that is even valid DWARF,
and secondly it seems like that would complicate the collection of call
site values quite a bit. As far as I can tell GCC does not emit any
entry values / call sites in these cases, so we do not have something to
compare with, but the former seems like the more reasonable approach.
Currently when trying to emit a call site entry for a parameter composed
of multiple DWARF registers a (DwarfRegs.size() == 1) assert is
triggered in addMachineRegExpression(). Until the call site
representation is figured out, and until there is use for these entry
values in practice, this commit simply stops the invalid DWARF from
being emitted.
Reviewers: djtodoro, vsk, aprantl
Reviewed By: djtodoro, vsk
Subscribers: jyknight, hiraditya, fedor.sergeev, jrtc27, llvm-commits
Tags: #debug-info, #llvm
Differential Revision: https://reviews.llvm.org/D75270
We currently lower SDIV to SDIV_MERGE_OP1. This forces the value
for inactive lanes in a way that can hamper register allocation;
however, the lowering has no requirement for inactive lanes.
Instead this patch replaces SDIV_MERGE_OP1 with SDIV_PRED thus
freeing the register allocator. Once done, the only user of
SDIV_MERGE_OP1 is intrinsic lowering, so I've removed the node
and now perform ISel on the intrinsic directly. This also allows
us to implement MOVPRFX based zeroing in the same manner as SUB.
This patch also renames UDIV_MERGE_OP1 and [F]ADD_MERGE_OP1 for
the same reason but in the ADD cases the ISel code is already
as required.
Differential Revision: https://reviews.llvm.org/D82783
clang-cl passes -x86-asm-syntax=intel to the cc1 invocation so that
assembly listings produced by the /FA flag are printed in Intel dialect.
That flag however should not affect the *parsing* of inline assembly in
the program. (See r322652)
When compiling normally, AsmPrinter::emitInlineAsm is used for
assembling and defaults to the AT&T dialect. However, when compiling for
ThinLTO, the code that parses module-level inline asm to find symbols
for the symbol table was failing to set the dialect. This patch fixes
that. (See the bug for more details.)
Differential revision: https://reviews.llvm.org/D82862
In the case of more than wavesize CSR SGPR spills, lanes of the reserved VGPR were
getting overwritten due to wraparound.
Reserve a VGPR (when NumVGPRSpillLanes = 0, WaveSize, 2*WaveSize, ..) and when one
of the two conditions is true:
1. One reserved VGPR being tracked by VGPRReservedForSGPRSpill is not yet reserved.
2. All spill lanes of reserved VGPR(s) are full and another spill lane is required.
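A worked example of the wraparound with illustrative numbers (WaveSize = 64, names hypothetical): the 65th CSR SGPR spill would land back in lane 64 % 64 == 0 of the same VGPR and clobber the first spill, which is why a fresh VGPR has to be reserved at each multiple of the wave size.

#include <cassert>

int main() {
  const unsigned WaveSize = 64;
  unsigned NumVGPRSpillLanes = 64;      // spills 0..63 already placed
  // All lanes of the current VGPR are full, so another VGPR is needed
  // (roughly condition 2 above).
  bool NeedAnotherVGPR = (NumVGPRSpillLanes % WaveSize) == 0;
  assert(NeedAnotherVGPR);
}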
Reviewed By: arsenm, kerbowa
Differential Revision: https://reviews.llvm.org/D82463
While validating live-out values, record instructions that look like
a reduction. This will consist of a vector op (for now only vadd),
a vorr (vmov) which stores the previous value of the vadd, and then a vpsel
in the exit block which is predicated upon a vctp. This vctp will
combine the last two iterations using the vmov and vadd into a vector
which can then be consumed by a vaddv.
Once we have determined that it's safe to perform tail-predication,
we need to change this sequence of instructions so that the
predication doesn't produce incorrect code. This involves changing
the register allocation of the vadd so it updates itself and the
predication on the final iteration will not update the falsely
predicated lanes. This mimics what the vmov, vctp and vpsel do and
so we then don't need any of those instructions.
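For reference, a hedged source-level sketch (not taken from the patch) of the kind of loop whose vectorised form contains the vadd/vorr/vpsel/vctp/vaddv sequence described above: an integer sum reduction whose trip count need not be a multiple of the vector width.

// Sum reduction; the tail iterations are what the vctp/vpsel pair handles.
int sumArray(const int *Data, int N) {
  int Sum = 0;
  for (int I = 0; I < N; ++I)
    Sum += Data[I];
  return Sum;
}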
Differential Revision: https://reviews.llvm.org/D75533
Condition `secondReg` is checked both in an outer and in an inner `if`
statement in static function `canCompareBeNewValueJump()` in file
`HexagonNewValueJump.cpp`. This patch removes the redundant inner check.
The issue was found using `clang-tidy` check under review
`misc-redundant-condition`. See https://reviews.llvm.org/D81272.
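A minimal standalone sketch of the flagged pattern (not the actual Hexagon code): the inner test repeats the outer one and can never be false.

static int sketch(bool SecondReg, int X) {
  if (SecondReg) {
    if (SecondReg) // redundant: already known to be true here
      return X + 1;
  }
  return X;
}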
Differential Revision: https://reviews.llvm.org/D82556
Condition `LiteralCount` is checked both in an outer and in an inner
`if` statement in `SIInstrInfo::verifyInstruction()`. This patch removes
the redundant inner check.
The issue was found using `clang-tidy` check under review
`misc-redundant-condition`. See https://reviews.llvm.org/D81272.
Differential Revision: https://reviews.llvm.org/D82555
Andrii discovered a problem where a simple case similar to below
will generate the wrong relocation kind:
enum { FIELD_EXISTENCE = 2, };
struct s1 { int a1; };
int test() {
  struct s1 *v = 0;
  return __builtin_preserve_field_info(v[0], FIELD_EXISTENCE);
}
The expected relocation kind should be FIELD_EXISTENCE, but the
recorded reloc kind in the final object file is FIELD_BYTE_OFFSET,
which is incorrect.
This exposed a bug in generating access strings from intrinsics.
The current access string generation has two steps:
step 1: find the base struct/union type,
step 2: traverse members in the base type.
The current implementation relies on at least one member access
in step 2 to get the correct relocation kind, which is true
in typical cases. But if there are no member accesses, the current
implementation falls back to the default info kind FIELD_BYTE_OFFSET.
This is incorrect; we should still record the reloc kind
based on the user input. This patch fixes this issue by properly
recording the reloc kind in such cases.
Differential Revision: https://reviews.llvm.org/D82932
It's pretty silly to diagnose on a scalar copy, but the build does that:
loop variable 'SibReg' of type 'const llvm::Register' creates a copy from type 'const llvm::Register' [-Wrange-loop-analysis]
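A minimal sketch of a warning-free spelling (the container and helper are hypothetical): llvm::Register is a thin wrapper around an unsigned, so taking the loop variable by value, without const, is cheap and keeps -Wrange-loop-analysis quiet.

#include "llvm/CodeGen/Register.h"
#include <vector>

static unsigned sumRegIds(const std::vector<llvm::Register> &SibRegs) {
  unsigned Sum = 0;
  for (llvm::Register SibReg : SibRegs) // by value, no const copy to diagnose
    Sum += SibReg.id();
  return Sum;
}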
With an undef operand, it's possible for getVRegDef to fail and return
null. This is an edge case very little code bothered to
consider. Proper gMIR should use G_IMPLICIT_DEF instead.
I initially tried to apply this restriction to all SSA MIR, so then
getVRegDef would never fail anywhere. However, ProcessImplicitDefs
does technically run while the function is in SSA. ProcessImplicitDefs
and DetectDeadLanes would need to either move, or a new pseudo-SSA
type of function property would need to be introduced.
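A minimal defensive sketch (the helper name is made up) of how a caller has to treat the result while undef operands remain possible:

#include "llvm/CodeGen/MachineRegisterInfo.h"

static bool hasDefiningInstr(const llvm::MachineRegisterInfo &MRI,
                             llvm::Register Reg) {
  // With an undef operand the vreg may have no defining instruction at all.
  return MRI.getVRegDef(Reg) != nullptr;
}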
Basically an NFC, but allows subclasses access to the entire PeelingModuloScheduleExpander
class. We are doing this to allow backends, particularly ones that are not necessarily
upstreamed, to inherit from PeelingModuloScheduleExpander and access its basic structures.
Renames Info to LoopInfo for consistency in PeelingModuloScheduleExpander.
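A hedged sketch of the intended use (the derived class and its contents are hypothetical): a downstream target inheriting from PeelingModuloScheduleExpander and reusing its structures.

#include "llvm/CodeGen/ModuloSchedule.h"

class DownstreamPeelingExpander : public llvm::PeelingModuloScheduleExpander {
public:
  using PeelingModuloScheduleExpander::PeelingModuloScheduleExpander;
  // Target-specific peeling heuristics can live here, reusing LoopInfo and the
  // other members this change makes visible to subclasses.
};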
Differential Revision: https://reviews.llvm.org/D82673
There are two uses of TM (instance of TargetMachine) when checking options.
These will not work once we enable GlobalISel. This patch replaces those uses of
TM with Subtarget->getTargetMachine().
This function picks the X86 opcode name based on type, masking,
and whether or not a load or broadcast has been folded, using multiple
switch statements. The contents of the switches mostly just vary in
a few characters in the instruction name. So use some macros to
build the instruction names to reduce the repetitiveness.
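A self-contained sketch of the technique (the enum values and macro are made up, not the real X86 opcode names): token-paste the few characters that differ so each switch case collapses to one macro use.

enum Opc { ADDrr, ADDrm, ADDrrk, ADDrmk };

// Splice the suffix that varies (register/memory form, masking) onto the base name.
#define PICK_OPC(Base, FoldedLoad, Masked)                                   \
  ((FoldedLoad) ? ((Masked) ? Base##rmk : Base##rm)                          \
                : ((Masked) ? Base##rrk : Base##rr))

static Opc pickAddOpcode(bool FoldedLoad, bool Masked) {
  return PICK_OPC(ADD, FoldedLoad, Masked);
}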
This patch adds the td definitions and asm/disasm tests for the
following instructions:
XXSPLTIW
XXSPLTIDP
XXSPLTI32DX
XXPERMX
XXBLENDVB
XXBLENDVH
XXBLENDVW
XXBLENDVD
VSLDBI
VSRDBI
Differential Revision: https://reviews.llvm.org/D82896
It's messy to pattern-match, and completely unnecessary: scalar indexes
work equally well.
See also discussion on D81620 and D82061.
Differential Revision: https://reviews.llvm.org/D82430
The indexing was messed up, so the result was completely broken.
Shuffle constant exprs are rare in practice; without vscale types,
constant folding generally eliminates them, so it's sort of hard to trip over.
Fixes regression from D72467.
(Recommitting after fix for memory leak.)
Differential Revision: https://reviews.llvm.org/D80330