llvm-project

Commit Graph

Author	SHA1	Message	Date
Nikita Popov	5fae613a4f	[LVI] Don't require DominatorTree in LVI (NFC) After D76797 the dominator tree is no longer used in LVI, so we can remove it as a pass dependency, and also get rid of the dominator tree enabling/disabling logic in JumpThreading. Apart from cleaning up the code, this also clarifies LVI cache consistency, in that the LVI cache can no longer depend on whether the DT was or wasn't enabled due to pending DT updates at any given time. Differential Revision: https://reviews.llvm.org/D76985	2020-05-19 20:21:46 +02:00
Craig Topper	ccba60a784	[StackColoring] When remapping alloca's move the To alloca if the From alloca is before it. If To is after From its possible that there's a use of From between them. Fixes issue reported here http://lists.llvm.org/pipermail/llvm-dev/2020-May/141421.html Differential Revision: https://reviews.llvm.org/D80101	2020-05-19 10:37:27 -07:00
Andrea Di Biagio	0980c9c6f1	[X86] Split masked integer vector stores into vXi32/vXi64 variants (PR45975). NFC This effectively splits the scheduling WriteVecMaskedStore(Y) classes into four different classes (one per each variant). The new VecMaskedStore scheduling classes are now correctly marked as 'unsupported' by the bdver2 and btver2 models. No functional change intended. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D80201	2020-05-19 17:35:10 +01:00
Florian Hahn	7cefd1b4cd	[LV] Remove duplicated return stmt (NFC).	2020-05-19 17:20:50 +01:00
Jay Foad	9bc989a48d	[InstCombine] Remove hasNoInfs check for pow(C,y) -> exp2(log2(C)*y) We already check hasNoNaNs and that x is finite and strictly positive. That only leaves the following special cases (taken from the Linux man page for pow): If x is +1, the result is 1.0 (even if y is a NaN). If the absolute value of x is less than 1, and y is negative infinity, the result is positive infinity. If the absolute value of x is greater than 1, and y is negative infinity, the result is +0. If the absolute value of x is less than 1, and y is positive infinity, the result is +0. If the absolute value of x is greater than 1, and y is positive infinity, the result is positive infinity. The first case is handled elsewhere, and this transformation preserves all the others, so there is no need to limit it to hasNoInfs. Differential Revision: https://reviews.llvm.org/D79409	2020-05-19 17:06:05 +01:00
Florian Hahn	cff9399f6b	[VPlan] Fix comment for User in VPWidenSelectRecipe (NFC). The comment was referring the arguments of the call, but the recipe widens a select.	2020-05-19 15:31:39 +01:00
Simon Pilgrim	f3b20c2ae7	MCTargetOptionsCommandFlags.h - remove unnecessary includes. NFC. Replace with MCTargetOptions forward declaration and move includes down to MCTargetOptionsCommandFlags.cpp	2020-05-19 15:15:26 +01:00
Florian Hahn	f828d75b46	[VPlan] Add & use VPValue operands for VPReplicateRecipe (NFC). This patch adds VPValue version of the instruction operands to VPReplicateRecipe and uses them during code-generation. Reviewers: Ayal, gilr, rengolin Reviewed By: gilr Differential Revision: https://reviews.llvm.org/D80114	2020-05-19 15:12:17 +01:00
Florian Hahn	66ad107452	[VPlan] Remove unique_ptr from VPBranchOnRecipeMask (NFC). We can remove a dynamic memory allocation, by checking the number of operands: no operands = all true, 1 operand = mask. Reviewers: Ayal, gilr, rengolin Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D80110	2020-05-19 15:01:37 +01:00
Matt Arsenault	a7759d1785	GlobalISel: Fix IRTranslator for constantexpr selects This was assuming a select is always an instruction, which is not true.	2020-05-19 09:52:48 -04:00
Jay Foad	c1ae72d03f	[IR] Revert r119493 r119493 protected against PHINode::hasConstantValue returning the PHI node itself, but a later fix in r159687 means that can never happen, so the workarounds are no longer required.	2020-05-19 13:17:11 +01:00
Georgii Rymar	e2b134b01a	[yaml2obj] - Stop using square brackets for unique suffixes. For describing section/symbol names we can use unique suffixes, e.g: ``` - Name: '.foo [1]` - Name: '.foo [2]` ``` It can be a problem (see https://reviews.llvm.org/D79984#inline-734829), because `[]` are sometimes used to describe a macros: ``` - Name: "[[a0]]" ``` Seems the better approach is to use something else, like "()". This patch does it and refactors the code related. Differential revision: https://reviews.llvm.org/D80123	2020-05-19 12:59:13 +03:00
Simon Pilgrim	cdafe59f95	TargetLoweringObjectFile.h - remove unnecessary includes. NFCI. Replace with forward declarations and move includes down to source files where required. I also needed to move the TargetLoweringObjectFile::SectionForGlobal wrapper implementation down into TargetLoweringObjectFile.cpp	2020-05-19 09:28:13 +01:00
Jonas Paulsson	b3bd0c37ec	[SystemZ] Eliminate the need to create a zero vector by reusing the VPERM mask. Try to avoid creating VGBMs by reusing the permutation mask if it contains a zero. If the first byte was into (any byte of) a zero vector, then the first byte of the mask can become zero and reused by putting the mask also as the first operand. If there instead was a first-byte use of the other source operand, then that zero index can be reused if the mask is placed as the second operand. Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D79925	2020-05-19 09:37:19 +02:00
Igor Kudrin	e94382ee37	[DebugInfo] Dump offsets in .debug_str_offsets according to the DWARF format (7/8). The patch changes dumping of offsets in .debug_str_offsets sections so that they are printed as 16-digit hex values if the contribution is in the DWARF64 format. Differential Revision: https://reviews.llvm.org/D79997	2020-05-19 13:35:58 +07:00
Igor Kudrin	7e9a740198	[DebugInfo] Dump values in .debug_pubnames and .debug_pubtypes according to the DWARF format (6/8). The patch changes dumping of unit_length, debug_info_offset, and debug_info_length fields in headers in .debug_pubname and .debug_pubtypes sections so that they are printed as 16-digit hex values if the contribution is in the DWARF64 format. Dumping of offsets in the tables is changed in the same way. Differential Revision: https://reviews.llvm.org/D79997	2020-05-19 13:35:48 +07:00
Igor Kudrin	2094c5d292	[DebugInfo] Dump values in .debug_loclists and .debug_rnglists according to the DWARF format (5/8). The patch changes dumping of a unit_length field and offsets in headers in .debug_loclists and .debug_rnglists sections so that they are printed as 16-digit hex values if the contribution is in the DWARF64 format. Differential Revision: https://reviews.llvm.org/D79997	2020-05-19 13:35:41 +07:00
Igor Kudrin	c9122b8f70	[DebugInfo] Dump length in .debug_line according to the DWARF format (4/8). The patch changes dumping of unit_length and header_length fields in headers in .debug_line sections so that they are printed as 16-digit hex values if the contribution is in the DWARF64 format. Differential Revision: https://reviews.llvm.org/D79997	2020-05-19 13:35:31 +07:00
Igor Kudrin	0db1684b74	[DebugInfo] Dump length of CUs and TUs according to the DWARF format (3/8). The patch changes dumping of the unit_length field in a unit header so that it is printed as a 16-digit hex value if the unit is in the DWARF64 format. Differential Revision: https://reviews.llvm.org/D79997	2020-05-19 13:35:20 +07:00
Igor Kudrin	f92a554516	[DebugInfo] Dump form values according to the DWARF format (2/8). The patch changes dumping of DWARF form values which sizes depend on the DWARF format so that they are printed as 16-digit hex values for DWARF64. Differential Revision: https://reviews.llvm.org/D79997	2020-05-19 13:35:07 +07:00
Igor Kudrin	69dfa07b4c	[DebugInfo] Dump fields in .debug_aranges according to the DWARF format (1/8). The patch changes dumping of unit_length and debug_info_offset fields in an address range header so that they are printed as 16-digit hex values if the contribution is in the DWARF64 format. Differential Revision: https://reviews.llvm.org/D79997	2020-05-19 13:34:54 +07:00
Yonghong Song	eec758825d	[BPF] fix an asan issue when disassemble an illegal instruction Commit `8e8f1bd75a` ("[BPF] Return fail if disassembled insn registers out of range") tried to fix a segfault when an illegal instruction is decoded. A test case is added to emulate such an illegal instruction. The llvm buildbot reported an asan issue with this test case. ERROR: AddressSanitizer: global-buffer-overflow on address ... decodeMemoryOpValue(llvm::MCInst&, unsigned int, ...) llvm::MCDisassembler::DecodeStatus llvm::decodeToMCInst<unsigned long>(...) llvm::MCDisassembler::DecodeStatus llvm::decodeInstruction<unsigned long>(...) in (anonymous namespace)::BPFDisassembler::getInstruction(...) ... Basically, the fix in Commit `8e8f1bd75a` is too later to prevent the asan. The fix in this patch moved the register number check earlier during decodeInstruction(). It will return fail for decodeInstruction() if the register number is out of range. Note that DecodeGPRRegisterClass() and DecodeGPR32RegisterClass() already have register number checking, so here we only check decodeMemoryOpValue().	2020-05-18 22:33:34 -07:00
Sameer Sahasrabuddhe	6c84884366	[LoopSimplify] don't separate nested loops with convergent calls Summary: When a loop has multiple backedges, loop simplification attempts to separate them out into nested loops. This results in incorrect control flow in the presence of some functions like a GPU barrier. This change skips the transformation when such "convergent" function calls are present in the loop body. Reviewed By: nhaehnle Differential Revision: https://reviews.llvm.org/D80078	2020-05-19 09:22:39 +05:30
Chen Zheng	a6be4d17e3	[PowerPC-QPX] adjust operands order of qpx fma instructions. convert %3 = QVFMADD %2, %0, %1, implicit $rm to %3 = QVFMADD %2, %1, %0, implicit $rm Reviewed By: hfinkel, steven.zhang Differential Revision: https://reviews.llvm.org/D78986	2020-05-18 22:59:51 -04:00
Eli Friedman	27b4e6931d	[NFC] Replace MaybeAlign with Align in TargetTransformInfo.	2020-05-18 19:25:49 -07:00
Yonghong Song	8e8f1bd75a	[BPF] Return fail if disassembled insn registers out of range Daniel reported a llvm-objdump segfault like below: $ llvm-objdump -D bpf_xdp.o ... 0000000000000000 <.strtab>: 0: 00 63 69 6c 69 75 6d 5f <unknown> 1: 6c 62 36 5f 61 66 66 69 w2 <<= w6 ... (llvm-objdump: lib/Target/BPF/BPFGenAsmWriter.inc:1087: static const char* llvm::BPFInstPrinter::getRegisterName(unsigned int): Assertion `RegNo && RegNo < 25 && "Invalid register number!"' failed. Stack dump: 0. Program arguments: llvm-objdump -D bpf_xdp.o ... abort ... llvm::BPFInstPrinter::getRegisterName(unsigned int) llvm::BPFInstPrinter::printMemOperand(llvm::MCInst const, int, llvm::raw_ostream&, char const) llvm::BPFInstPrinter::printInstruction(llvm::MCInst const, unsigned long, llvm::raw_ostream&) llvm::BPFInstPrinter::printInst(llvm::MCInst const, unsigned long, llvm::StringRef, llvm::MCSubtargetInfo const&, llvm::raw_ostream&) ... Basically, since -D enables disassembly for all sections, .strtab is also disassembled, but some strings are decoded as legal instructions but with illegal register numbers. When llvm-objdump tries to print register name for these illegal register numbers, assertion and segfault happens. The patch fixed the issue by returning fail for a disassembled insn if that insn contains a reg operand with illegal reg number. The insn will be printed as "<unknown>" instead of causing an assertion.	2020-05-18 18:53:23 -07:00
Chen Zheng	9971839942	fix build failure due to commit rGddcb3cf213e8	2020-05-18 21:47:40 -04:00
Chen Zheng	ddcb3cf213	[TargetInstrInfo] add override function setSpecialOperandAttr - NFC	2020-05-18 21:20:52 -04:00
Yonghong Song	ddff9799d2	[BPF] Prevent disassembly segfault for NOP insn For a simple program like below: -bash-4.4$ cat t.c int test() { asm volatile("r0 = r0" ::); return 0; } compiled with clang -target bpf -O2 -c t.c the following llvm-objdump command will segfault. llvm-objdump -d t.o 0: bf 00 00 00 00 00 00 00 nop llvm-objdump: ../include/llvm/ADT/SmallVector.h:180 ... Assertion `idx < size()' failed ... abort ... llvm::BPFInstPrinter::printOperand llvm::BPFInstPrinter::printInstruction ... The reason is both NOP and MOV_rr (r0 = r0) having the same encoding. The disassembly getInstruction() decodes to be a NOP instruciton but during printInstruction() the same encoding is interpreted as a MOV_rr instruction. Such a mismatcch caused the segfault. The fix is to make NOP instruction as CodeGen only so disassembler will skip NOP insn for disassembling. Note that instruction "r0 = r0" should not appear in non inline asm codes since BPF Machine Instruction Peephole optimization will remove it. Differential Revision: https://reviews.llvm.org/D80156	2020-05-18 17:40:18 -07:00
Reid Kleckner	47cc6db928	Re-land [Debug][CodeView] Emit fully qualified names for globals This reverts commit `525a591f0f`. Fixed an issue with pointers to members based on typedefs. In this case, LLVM would emit a second UDT. I fixed it by not passing the class type to getTypeIndex when the base type is not a function type. lowerType only uses the class type for direct function types. This suggests if we have a PMF with a function typedef, there may be an issue, but that can be solved separately.	2020-05-18 17:31:00 -07:00
Amara Emerson	665da59685	[AArch64][GlobalISel] Add legalizer & selector support for G_FREEZE. These should legalize like undefs and select into copies. The ll test is copied from the x86 test, minus the half fp case because we don't currently support that.	2020-05-18 16:25:33 -07:00
Ayal Zaks	682e739638	[LV] Fix FoldTail under user VF and UF LV considers an internally computed MaxVF to decide if a constant trip-count is a multiple of any subsequently chosen VF, and conclude that no scalar remainder iterations (tail) will be left for Fold Tail to handle. If an external VF is provided via -force-vector-width, it must be considered instead of the internal MaxVF. If an external UF is provided via -force-vector-interleave, it too must be considered in addition to MaxVF or user VF. Fixes PR45679. Differential Revision: https://reviews.llvm.org/D80085	2020-05-19 01:32:25 +03:00
Matt Arsenault	ae98939172	GlobalISel: Fold G_MUL x, 0, and G_*DIV 0, x	2020-05-18 18:08:26 -04:00
Francesco Petrogalli	b572d9b1a7	[llvm][sve] Intrinsics for SVE sudot and usdot instructions. Summary: This patch adds IR intrinsics for the mnemonics USDOT and SUDOT of the 8.6 extension of Armv8-a. Reviewers: sdesmalen, efriedma, david-arm Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79876	2020-05-18 22:02:19 +00:00
Francesco Petrogalli	01f9d8ce5c	[llvm][SVE] IR intrinscs for matrix multiplication instructions. Summary: Instructions: * SMMLA * UMMLA * USMMLA * FMMLA Reviewers: sdesmalen, efriedma, kmclaughlin Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79638	2020-05-18 22:02:19 +00:00
Amara Emerson	17842025ed	[GlobalISel] Add support for using vector values in memset inlining.	2020-05-18 14:56:16 -07:00
Stanislav Mekhanoshin	50f3bb1329	[AMDGPU] Fixed selection error for 64 bit extract_subvector Differential Revision: https://reviews.llvm.org/D80155	2020-05-18 14:17:59 -07:00
Matt Arsenault	3e315697ac	DAG: Use correct pointer size for llvm.ptrmask This was ignoring the address space, and would assert on address spaces with a different size from the default.	2020-05-18 16:46:11 -04:00
Craig Topper	c9f63297e2	Fix several places that were calling verifyFunction or verifyModule without checking the return value. verifyFunction/verifyModule don't assert or error internally. They also don't print anything if you don't pass a raw_ostream to them. So the caller needs to check the result and ideally pass a stream to get the messages. Otherwise they're just really expensive no-ops. I've filed PR45965 for another instance in SLPVectorizer that causes a lit test failure. Differential Revision: https://reviews.llvm.org/D80106	2020-05-18 13:28:46 -07:00
Nikita Popov	47a0e9f49b	[Sanitizers] Use getParamByValType() (NFC) Instead of fetching the pointer element type.	2020-05-18 22:06:18 +02:00
Jean-Michel Gorius	cd12e79e6d	[x86] Propagate memory operands during ISel DAG postprocessing Summary: Propagate memory operands when folding test instructions. This was split from D80062. Reviewers: craig.topper, rnk, lebedev.ri Reviewed By: craig.topper Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80140	2020-05-18 21:35:31 +02:00
Matt Arsenault	b27a538dda	AMDGPU: Fix illegally constant folding from V_MOV_B32_sdwa This was assumed to be a simple move, and interpreting the immediate modifier operand as a materialized immediate. Apparently the SDWA pass never produces these, but GlobalISel does emit these for some vector shuffles.	2020-05-18 15:34:33 -04:00
Matt Arsenault	bf527a1dc4	AMDGPU/GlobalISel: Fix f64 G_FDIV lowering This was using an integer multiply instead of FP.	2020-05-18 15:14:08 -04:00
Volkan Keles	63081dc6f6	LoadStoreVectorizer: Match nested adds to prove vectorization is safe If both OpA and OpB is an add with NSW/NUW and with the same LHS operand, we can guarantee that the transformation is safe if we can prove that OpA won't overflow when IdxDiff added to the RHS of OpA. Review: https://reviews.llvm.org/D79817	2020-05-18 12:13:01 -07:00
Nikita Popov	736db2f710	[Loads] Require Align in isSafeToLoadUnconditionally() (NFC) Now that load/store have required alignment, accept Align here. This also avoids uses of getPointerElementType(), which is incompatible with opaque pointers.	2020-05-18 20:50:35 +02:00
Arthur Eubanks	a7cc275e7e	Add verifier check that musttail and preallocated are not used together Summary: Currently they are not supported together. Supporting them will require a LangRef change. See discussion in https://reviews.llvm.org/D77689. Reviewers: rnk, efriedma Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80132	2020-05-18 11:24:59 -07:00
Jay Foad	bdd8c111fc	[IR] Revert r2694 in BasicBlock::removePredecessor r2694 fixed a bug where removePredecessor could create IR with a use not dominated by its def in a self loop. But this could only happen in an unreachable loop, and since that time the rules have been relaxed so that defs don't have to dominate uses in unreachable code, so the fix is unnecessary. The regression test added in r2691 still stands. Differential Revision: https://reviews.llvm.org/D80128	2020-05-18 19:13:06 +01:00
Jonas Paulsson	31ecef7627	[SystemZ] Don't create PERMUTE nodes with an undef operand. It's better to reuse the first source value than to use an undef second operand, because that will make more resulting VPERMs have identical operands and therefore MachineCSE more successful. Review: Ulrich Weigand	2020-05-18 19:42:14 +02:00
Mircea Trofin	691980ebb4	[llvm][NFC] Fixed non-compliant style in InlineAdvisor.h Changed OnPass{Entry\|Exit} -> onPass{Entry\|Exit} Also fixed a small typo in a comment.	2020-05-18 10:26:45 -07:00
Vedant Kumar	623b254244	[Local] Do not ignore zexts in salvageDebugInfo, PR45923 Summary: When salvaging a dead zext instruction, append a convert operation to the DIExpressions of the debug uses of the instruction, to prevent the salvaged value from being sign-extended. I confirmed that lldb prints out the correct unsigned result for "f" in the example from PR45923 with this changed applied. rdar://63246143 Reviewers: aprantl, jmorse, chrisjackson, davide Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80034	2020-05-18 09:52:02 -07:00
Matt Arsenault	4c70074e54	AMDGPU/GlobalISel: Fix splitting wide VALU, non-vector loads	2020-05-18 12:06:53 -04:00
Matt Arsenault	681a161ff5	AMDGPU: Remove outdated comment	2020-05-18 12:06:16 -04:00
David Sherwood	364c595403	[SVE] Ignore scalable vectors in InterleavedLoadCombinePass I have changed the pass so that we ignore shuffle vectors with scalable vector types, and replaced VectorType with FixedVectorType in the rest of the pass. I couldn't think of an easy way to test this change, since for scalable vectors we shouldn't be using shufflevectors for interleaving. This change fixes up some type size assert warnings I found in the following test: CodeGen/AArch64/sve-intrinsics-int-arith-imm.ll Differential Revision: https://reviews.llvm.org/D79700	2020-05-18 16:35:55 +01:00
Wouter van Oortmerssen	10e2e7de0c	[WebAssembly] iterate stack in DebugFixup from the top. Differential Revision: https://reviews.llvm.org/D80045	2020-05-18 08:33:36 -07:00
Max Kazantsev	e47c101e35	[InstCombine][NFC] Simplify check in sinking We just need to check that the only predecessor of user parent is BB, we don't need to iterate through BB's successors for it.	2020-05-18 18:10:40 +07:00
Dmitry Preobrazhensky	f997370d9c	[AMDGPU][MC] Corrected branch relocation handling to detect undefined labels Fixed ELF object writer to die gracefully when an undefined label is encountered in a branch instruction. See https://bugs.llvm.org/show_bug.cgi?id=41914. Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D79943	2020-05-18 14:04:58 +03:00
Hans Wennborg	525a591f0f	Revert `76c5f277f2` "Re-land [Debug][CodeView] Emit fully qualified names for globals" > Before this patch, S_[L\|G][THREAD32\|DATA32] records were emitted with a simple name, not the fully qualified name (namespace + class scope). > > Differential Revision: https://reviews.llvm.org/D79447 This causes asserts in Chromium builds: CodeViewDebug.cpp:2997: void llvm::CodeViewDebug::emitDebugInfoForUDTs(const std::vector<std::pair<std::string, const DIType *>> &): Assertion `OriginalSize == UDTs.size()' failed. I will follow up on the Phabricator issue.	2020-05-18 11:26:30 +02:00
OCHyams	709c52b955	[DebugInfo][DWARF] Emit a single location instead of a location list for variables in nested scopes (including inlined functions) if there is a single location which covers the entire scope and the scope is contained in a single block. Based on work by @jmorse. Reviewed By: vsk, aprantl Differential Revision: https://reviews.llvm.org/D79571	2020-05-18 09:43:32 +01:00
Mehdi Amini	8697d443ab	Fix warning "defined but not used" for debug function (NFC)	2020-05-17 23:50:18 +00:00
Mehdi Amini	ffc6e593d2	Replace dyn_cast with isa when the result isn't used (NFC) Fix build warning: unused variable 'BB'	2020-05-17 23:15:17 +00:00
Craig Topper	5f65faef2c	ValueMapper does not preserve inline assembly dialect when remapping the type Bug report: https://bugs.llvm.org/show_bug.cgi?id=45291 Patch by Tomasz Miąsko Differential Revision: https://reviews.llvm.org/D80066	2020-05-17 14:57:50 -07:00
Nikita Popov	52e98f620c	[Alignment] Remove unnecessary getValueOrABITypeAlignment calls (NFC) Now that load/store alignment is required, we no longer need most of them. Also switch the getLoadStoreAlignment() helper to return Align instead of MaybeAlign.	2020-05-17 22:19:15 +02:00
Roman Lebedev	fde8eb00e1	[InstCombine] visitMaskedMerge(): when unfolding, sanitize undef constants (PR45955) We can't leave undef vector element constants as-is, it is a miscompile, so we need to sanitize them. We have two vectors (C and ~C): * We can't replace undef with 0 in both of them * We can't replace undef with 0 in only one of them * We could replace undef with -1 in both of them * We could replace undef with -1 in only one(!) of them * We could replace undef with -1 in one and 0 in another one of them. Therefore, it seems best to go with the last option, since otherwise we'd loose knowledge that C and ~C have no common bits set, which seems more important than preserving partial undef knowledge. Fixes https://bugs.llvm.org/show_bug.cgi?id=45955	2020-05-17 22:53:03 +03:00
David Blaikie	a055e3856f	DebugInfo: Reduce long-distance dependence on what will/won't emit a debug_addr section This is a no-op/NFC at the moment & generally makes the code /somewhat/ cleaner/less reliant on assumptions about what will produce a debug_addr section. It's still a bit "spooky action at a distance" - the add ranges code pre-emptively inserts addresses into the address pool it knows will eventually be used by the range emission code (or low/high pc). The 'ideal' would be either to actually compute the addresses needed for range (& loc) emission earlier - which would mean decanonicalizing the range/loc representation earlier to account for whether it was going to use addrx encodings or not (which would be unfortunate, but could be refactored to be relatively unobtrusive). Alternatively, emitting the range/loc sections earlier would cause them to request the needed addresses sooner - but then you endup having to split finalizeModuleInfo because some things need to be handled there before the ranges/locs are emitted, I think...	2020-05-17 12:45:56 -07:00
Nikita Popov	39beeeff20	[LVI] Don't use dominator tree in isValidAssumeForContext() LVI and its consumers currently have quite a bit of complexity related to dominator tree management. However, it doesn't look like it is actually needed... The only use of the dominator tree is inside isValidAssumeForContext(). However, due to the way LVI queries work, it is not needed: If we query a value for some block, we will first get the edge values from all predecessor blocks, which also includes an intersection with assumptions that apply to the terminator of the predecessor. As such, we will already have processed all assumptions from predecessor blocks (this is actually stronger than what isValidAssumeForContext() does with a DT, because this is capable of combining non-dominating assumptions). The only additional assumptions we need to take into account are those in the block being queried. And we don't need a dominator tree for that. This patch only removes the use of DT, I will drop the machinery around it in a followup. Differential Revision: https://reviews.llvm.org/D76797	2020-05-17 21:39:35 +02:00
Simon Pilgrim	090cf4591f	Revert rGca18ce1a00cd8b7cb7ce0e130440f5ae1ffe86ee "GlobPattern.h - remove unnecessary BitVector.h/StringRef.h includes. NFC" Causes lld build errors	2020-05-17 18:51:21 +01:00
Simon Pilgrim	ca18ce1a00	GlobPattern.h - remove unnecessary BitVector.h/StringRef.h includes. NFC Use forward declarations (BitVector already had one) and an headers to source file that were implicitly using them.	2020-05-17 18:29:41 +01:00
Simon Pilgrim	897e926bb0	ImmutableGraph.h - remove unused raw_ostream.h include. NFC	2020-05-17 18:29:41 +01:00
Sanjay Patel	57c3fe76a3	[x86] favor vector constant load to avoid GPR to XMM transfer This build vector lowering pattern came up in D79886. I've tried to limit the improvement to cases where it looks clearly better to load, but we could remove the 'TODO' predicates already if we are willing to overlook some corner cases. Differential Revision: https://reviews.llvm.org/D80013	2020-05-17 11:56:26 -04:00
Xing GUO	42011fb1c8	[ObjectYAML][DWARF] Take into account other debug sections in DWARFYAML::Data::isEmpty().	2020-05-17 22:53:27 +08:00
Simon Pilgrim	6f02633a4f	[X86] Add getTargetConstantFromBasePtr helper. NFC. Allows us to share code from LoadSDNode and MemIntrinsicSDNode constant pool loads.	2020-05-17 14:58:31 +01:00
Simon Pilgrim	9aca5b68ee	[X86] getTargetConstantBitsFromNode - remove unnecessary X86ISD::VBROADCAST handling. We create X86ISD::VBROADCAST_LOAD for constant pool folds now.	2020-05-17 14:58:30 +01:00
Sanjay Patel	bfd512160f	[InstCombine] improve analysis of FP->int->FP to eliminate fpextend This was originally in D79116. Converting from a narrow-enough FP source value to integer and back to FP guarantees that the conversion to FP is exact because of UB/poison-on-overflow. This was suggested in PR36617: https://bugs.llvm.org/show_bug.cgi?id=36617#c19	2020-05-17 09:06:57 -04:00
Christudasan Devadasan	7c4e711ef8	[AMDGPU] Enable base pointer. When the callee requires a dynamic stack realignment, it is not possible to correcty access the incoming stack arguments using the stack pointer. We reserve a base pointer in such cases to access the function arguments inside the callee. The base pointer will hold the incoming stack pointer value before any kind of delta added to it. Reviewed By: arsenm, scott.linder Differential Revision: https://reviews.llvm.org/D78811	2020-05-17 16:13:55 +05:30
Dylan McKay	1335737ee1	[LLVM][AVR] Support for R_AVR_6 fixup Summary: Handle the emission of `R_AVR_6` ELF relocation type. Reviewers: dylanmckay Reviewed By: dylanmckay Subscribers: hiraditya, Jim, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78721 Patch by @LemonBoy https://reviews.llvm.org/p/LemonBoy/	2020-05-17 19:46:09 +12:00
Dylan McKay	1420f4efbe	[AVR] Fix I/O instructions on XMEGA Summary: On XMEGA, I/O address space is same as data address space - there is no 0x20 offset, because CPU General Purpose Registers are not mapped in data address space. From https://en.wikipedia.org/wiki/AVR_microcontrollers > In the XMEGA variant, the working register file is not mapped into the data address space; as such, it is not possible to treat any of the XMEGA's working registers as though they were SRAM. Instead, the I/O registers are mapped into the data address space starting at the very beginning of the address space. Reviewers: dylanmckay Reviewed By: dylanmckay Subscribers: hiraditya, Jim, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77207 Patch by Vlastimil Labsky.	2020-05-17 19:46:09 +12:00
Fangrui Song	3dbbbcc80e	[llvm-xray] consumeError when trying big-endian Follow-up of rL341226. Fixes "Expected<T> must be checked before access or destruction"	2020-05-16 22:44:48 -07:00
Craig Topper	796ae8cf82	[LegalizeDAG] Use MachinePointerInfo::getUnknownStack in place of MachinePointerInfo() in a couple places. NFC We know the pointer somewhere on the stack, we just don't know exactly where since the index may be variable. Differential Revision: https://reviews.llvm.org/D80060	2020-05-16 15:48:16 -07:00
Eli Friedman	4f04db4b54	AllocaInst should store Align instead of MaybeAlign. Along the lines of D77454 and D79968. Unlike loads and stores, the default alignment is getPrefTypeAlign, to match the existing handling in various places, including SelectionDAG and InstCombine. Differential Revision: https://reviews.llvm.org/D80044	2020-05-16 14:53:16 -07:00
Craig Topper	135b877874	[X86] Replace selectScalarSSELoad ComplexPattern with PatFrags to handle the 3 types of loads we currently match. This ensures we create mem operands for these instructions fixing PR45949. Unfortunately, it increases the size of X86GenDAGISel.inc, but some dag combine canonicalization could reduce the types of load we need to match.	2020-05-16 14:30:45 -07:00
Eli Friedman	0ec5f50196	Harden IR and bitcode parsers against infinite size types. If isSized is passed a SmallPtrSet, it uses that set to catch infinitely recursive types (for example, a struct that has itself as a member). Otherwise, it just crashes on such types.	2020-05-16 14:24:51 -07:00
Sanjay Patel	81e9ede3a2	[VectorCombine] forward walk through instructions to improve chaining of transforms This is split off from D79799 - where I was proposing to fully iterate over a function until there are no more transforms. I suspect we are still going to want to do something like that eventually. But we can achieve the same gains much more efficiently on the current set of regression tests just by reversing the order that we visit the instructions. This may also reduce the motivation for D79078, but we are still not getting the optimal pattern for a reduction.	2020-05-16 13:08:01 -04:00
Nikita Popov	604f44977b	[InstCombine] Clean up alignment handling (NFC) Now that load/store alignment is required, we can simplify code in some places.	2020-05-16 18:47:29 +02:00
David Green	2123bb843e	[ARM] Patterns for VQSHRN Given a VQMOVN(VSHR), we can fold that into a VQSHRN simply enough using a few tablegen patterns. Differential Revision: https://reviews.llvm.org/D77720	2020-05-16 17:46:43 +01:00
Sanjay Patel	5be37cb124	[x86][CGP] try to hoist funnel shift above select-of-splats This is basically the same patch as D63233, but converted to funnel shifts rather than regular shifts. I did not see a way to effectively share code for these 2 cases though. This follows D79718 and D79827 to re-fix PR37426 because that gets canonicalized to funnel shift intrinsics in IR. I did draft an alternative patch as an enhancement to "shouldSinkOperands()", but that was awkward because we have to key the transform from the select, but then look at both its users and its operands.	2020-05-16 10:44:47 -04:00
David Green	72f1fb2edf	[ARM] Combines for VMOVN This adds two combines for VMOVN, one to fold VMOVN[tb](c, VQMOVNb(a, b)) => VQMOVN[tb](c, b) The other to perform demand bits analysis on the lanes of a VMOVN. We know that only the bottom lanes of the second operand and the top or bottom lanes of the Qd operand are needed in the result, depending on if the VMOVN is bottom or top. Differential Revision: https://reviews.llvm.org/D77718	2020-05-16 15:13:16 +01:00
David Green	2e1fbf85b6	[ARM] MVE saturating truncates This adds some custom lowering for VQMOVN, an instruction that can be used to perform saturating truncates from a pair of min(max(X, -0x8000), 0x7fff), providing those constants are correct. This leaves a VQMOVNBs which saturates the value and inserts that into the bottom lanes of an existing vector. We then need to do something with the other lanes, extending the value using a vmovlb. Ideally, as will often be the case, only the bottom lane of what remains will be demanded, allowing the vmovlb to be removed. Which should mean the instruction is either equal or a win most of the time, and allows some extra follow-up folding to happen. Differential Revision: https://reviews.llvm.org/D77590	2020-05-16 15:10:20 +01:00
Simon Pilgrim	228913780b	DIEHash.cpp - remove headers explicitly included in DIEHash.h. NFC. Don't duplicate module header includes.	2020-05-16 15:00:57 +01:00
Simon Pilgrim	25656332f1	AggressiveAntiDepBreaker.cpp - remove headers explicitly included in AggressiveAntiDepBreaker.h. NFC. Don't duplicate module header includes.	2020-05-16 15:00:56 +01:00
Simon Pilgrim	43bf2be4d9	LLParser.cpp - remove headers explicitly included in LLParser.h. NFC. Don't duplicate module header includes.	2020-05-16 15:00:56 +01:00
Nikita Popov	d86fff6ae7	[ValueTracking] Fix computeKnownBits() with bitwidth-changing ptrtoint computeKnownBitsFromAssume() currently asserts if m_V matches a ptrtoint that changes the bitwidth. Because InstCombine canonicalizes ptrtoint instructions to use explicit zext/trunc, we never ran into the issue in practice. I'm adding unit tests, as I don't know if this can be triggered via IR anywhere. Fix this by calling anyextOrTrunc(BitWidth) on the computed KnownBits. Note that we are going from the KnownBits of the ptrtoint result to the KnownBits of the ptrtoint operand, so we need to truncate if the ptrtoint zexted and anyext if the ptrtoint truncated. Differential Revision: https://reviews.llvm.org/D79234	2020-05-16 14:17:11 +02:00
Craig Topper	13d44b2a0c	[LegalizeDAG] Use getMemBasePlusOffset to simplify some code. Use other signature of getMemBasePlusOffset in another location. NFCI The code was calculating an offset from a stack pointer SDValue. This is exactly what getMemBasePlusOffset does. I also replaced sizeof(int) with a hardcoded 4. We know the type we're operating on is 4 bytes. But the size of int that the source code is being compiled with isn't guaranteed to be 4 bytes. While here replace another use of getMemBasePlusOffset that was proceeded with a call to getConstant with the other signature that call getConstant internally.	2020-05-16 01:02:08 -07:00
Craig Topper	45c7b3fd91	[LegalizeVectorTypes] Remove non-constnat INSERT_SUBVECTOR handling. NFC Now that D79814 has landed, we can assume that subvector ops use constant, in-range indices.	2020-05-15 23:56:13 -07:00
Ten Tzen	e32f8e5d4a	[Windows EH] Fix the order of Nested try-catches in $tryMap$ table This bug is exposed by Test7 of ehthrow.cxx in MSVC EH suite where a rethrow occurs in a try-catch inside a catch (i.e., a nested Catch handlers). See the test code in https://github.com/microsoft/compiler-tests/blob/master/eh/ehthrow.cxx#L346 When an object is rethrown in a Catch handler, the copy-ctor of this object must be executed after the destructions of live objects, but BEFORE the dtors of live objects in parent handlers. Today Windows 64-bit runtime (__CxxFrameHandler3 & 4) expects nested Catch handers are stored in pre-order (outer first, inner next) in $tryMap$ table, so that given a State, its Catch's beginning State can be properly retrieved. The Catch beginning state (which is also the ending State) is the State where rethrown object's copy-ctor must take place. LLVM currently stores nested catch handlers in post-ordering because it's the natural way to compute the highest State in Catch. The fix is to simply store TryCatch handler in pre-order, but update Catch's highest State after child Catches are all processed. Differential Revision: https://reviews.llvm.org/D79474?id=263919	2020-05-15 22:03:43 -07:00
Carl Ritson	a065a01bf7	[AMDGPU] Allow use of StackPtrOffsetReg when building spills Summary: When spilling in the entry function we should be able to borrow StackPtrOffsetReg as a last resort. This restores behaviour removed in D75138, and fixes failures when shaders use all SGPRs, VGPRs and spill in the entry function. Reviewers: scott.linder, arsenm, tpr Reviewed By: scott.linder, arsenm Subscribers: qcolombet, foad, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79776	2020-05-16 11:54:43 +09:00
Diogo Sampaio	6c68f75ee4	Prevent register coalescing in functions whith setjmp Summary: In the the given example, a stack slot pointer is merged between a setjmp and longjmp. This pointer is spilled, so it does not get correctly restored, addinga undefined behaviour where it shouldn't. Change-Id: I60ec010844f2a24ce01ceccf12eb5eba5ab94abb Reviewers: eli.friedman, thanm, efriedma Reviewed By: efriedma Subscribers: MatzeB, qcolombet, tpr, rnk, efriedma, hiraditya, llvm-commits, chill Tags: #llvm Differential Revision: https://reviews.llvm.org/D77767	2020-05-16 00:36:34 +01:00
Vitaly Buka	6512cc7735	[NFC,StackSafety] Rename local function	2020-05-15 13:39:07 -07:00
Christopher Tetreault	245679b62e	[SVE] Remove usages of VectorType::getNumElements() from ARM Reviewers: efriedma, fpetrogalli, kmclaughlin, grosbach, dmgreen Reviewed By: dmgreen Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, dmgreen, danielkiss, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79816	2020-05-15 12:55:27 -07:00
Christopher Tetreault	0d5d5a75e2	[SVE] Remove usages of VectorType::getNumElements() from PowerPC Reviewers: efriedma, sdesmalen, c-rhodes, hfinkel Reviewed By: c-rhodes Subscribers: wuzish, nemanjai, tschuett, hiraditya, kbarton, rkruppe, psnobl, shchenz, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79821	2020-05-15 12:30:56 -07:00
Mircea Trofin	08e2386dee	Revert "Revert "[llvm][NFC] Cleanup uses of std::function in Inlining-related APIs"" This reverts commit `454de99a6f`. The problem was that one of the ctor arguments of CallAnalyzer was left to be const std::function<>&. A function_ref was passed for it, and then the ctor stored the value in a function_ref field. So a std::function<> would be created as a temporary, and not survive past the ctor invocation, while the field would. Tested locally by following https://github.com/google/sanitizers/wiki/SanitizerBotReproduceBuild Original Differential Revision: https://reviews.llvm.org/D79917	2020-05-15 12:29:16 -07:00
Eli Friedman	11aa3707e3	StoreInst should store Align, not MaybeAlign This is D77454, except for stores. All the infrastructure work was done for loads, so the remaining changes necessary are relatively small. Differential Revision: https://reviews.llvm.org/D79968	2020-05-15 12:26:58 -07:00
Scott Linder	03c44c7584	[NFC] Deduplicate comment in PromoteMemoryToRegister.cpp This has been duplicated since before `2372a193ba`, but that commit has it appearing twice in the space of 10 lines of the same function body. It could also be hoisted up to the point just after where the last special-case is considered, but I want to keep the intent of the original authors. Committed as obvious without a review.	2020-05-15 15:18:07 -04:00
Thomas Lively	40af48101b	[WebAssembly] Optimize splats of bitcasted vectors Summary: This new custom DAG combine fixes a codegen issue with the wasm_simd128.h intrinsics. Clang lowers the return (v128_t)(__f32x4){__a, __a, __a, __a}; body of f32x4_splat to a splat shuffle of a bitcasted vector, as seen in the new simd-shuffle-bitcast.ll test. The bitcast interfered with the target-independent DAG combine that combines splat shuffles into BUILD_VECTOR nodes, so this patch introduces a new custom DAG combine to hoist the bitcast out of the shuffle, allowing the target-independent combine to work as intended. Reviewers: aheejin, dschuff Subscribers: sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80021	2020-05-15 12:12:20 -07:00
Eli Friedman	a1ce88b4e3	[AArch64][SVE] Implement AArch64ISD::SETCC_PRED This unifies SETCC operations along the lines of other operations. Differential Revision: https://reviews.llvm.org/D79975	2020-05-15 11:53:21 -07:00
Christopher Tetreault	015e297a37	[SVE] Restore broken LLVM-C ABI compatability Reviewers: deadalnix, efriedma, rengolin, jyknight, joerg Reviewed By: joerg Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79915	2020-05-15 11:50:24 -07:00
Thomas Lively	c702d4bf41	[WebAssembly] Update latest implemented SIMD instructions Summary: Move instructions that have recently been implemented in V8 from the `unimplemented-simd128` target feature to the `simd128` target feature. The updated instructions match the update at https://github.com/WebAssembly/simd/pull/223. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D79973	2020-05-15 10:53:02 -07:00
Wouter van Oortmerssen	62efd1eca2	[WebAssembly] Fixed debugloc in DebugFixup pass BuildMI requires this debug loc to be from the same sub program as the variable metadata passed in. Differential Revision: https://reviews.llvm.org/D80019	2020-05-15 10:50:14 -07:00
Nikita Popov	f89f7da999	[IR] Convert null-pointer-is-valid into an enum attribute The "null-pointer-is-valid" attribute needs to be checked by many pointer-related combines. To make the check more efficient, convert it from a string into an enum attribute. In the future, this attribute may be replaced with data layout properties. Differential Revision: https://reviews.llvm.org/D78862	2020-05-15 19:41:07 +02:00
Simon Pilgrim	330b7491d5	[X86] Remove some duplicate ConstantSDNode casts. NFC. Avoid repeated isa<> and cast<> by just performing a dyn_cast<ConstantSDNode>	2020-05-15 18:23:35 +01:00
Jay Foad	91ef7cb508	[IR] Trivial cleanups in Use. NFC. Remove Use::setPrev. It provided no value because it had the same accessibility as the underlying field Prev, and there was no corresponding setNext anyway. Simplify Use::removeFromList.	2020-05-15 18:14:45 +01:00
Craig Topper	e288e24376	[X86] Move expansion of MASKPAIR16LOAD and MASKPAIR16STORE from X86MCInstLower to X86ExpandPseudo. It makes more sense to turn these into real instructions a little earlier in the pipeline. I've made sure to adjust the memoperand so the spill/reload comments are printed correctly.	2020-05-15 09:51:32 -07:00
Alexey Lapshin	da30c3796a	[x86][NFC] Apply clang-format to X86ISelLowering.h Summary: Apply clang-format to X86ISelLowering.h Reviewed by: aeubanks Differential Revision: https://reviews.llvm.org/D80005	2020-05-15 19:48:16 +03:00
Simon Pilgrim	9825d3daa8	[X86] Use getConstantOperandVal helper in a few places. NFC. Avoid raw cast<ConstantSDNode> calls.	2020-05-15 17:31:27 +01:00
Simon Pilgrim	4580b0f5b6	[X86] getFauxShuffle - remove (unused) ISD::TRUNCATE shuffle decoding.	2020-05-15 17:31:26 +01:00
Anna Welker	4ec340c3e9	[ARM][MVE] Add support for incrementing scatters Adds support to build pre-incrementing scatters. If the increment (i.e., add instruction) that is merged into the scatter is the loop increment, an incrementing write-back scatter can be built, which then assumes the role of the loop increment. Differential Revision: https://reviews.llvm.org/D79859	2020-05-15 17:02:00 +01:00
Anna Thomas	7cc3769adb	[VectorUtils] Expose vector-function-abi-variant mangling as a utility. Summary: This change exposes the vector name mangling with LLVM ISA (used as part of vector-function-abi-variant) as a utility. This can then be used by front-ends that add this attribute. Note that all parameters passed in to the function will be mangled with the "v" token to identify that they are of of vector type. So, it is the responsibility of the caller to confirm that all parameters in the vectorized variant is of vector type. Added unit test to show vector name mangling. Reviewed-By: fpetrogalli, simoll Differential Revision: https://reviews.llvm.org/D79867	2020-05-15 11:42:20 -04:00
Yvan Roux	3648dde3dd	[ARM][MachineOutliner] Fix memory leak #2 . Use smart pointer instead of new/delete.	2020-05-15 17:33:56 +02:00
Yonghong Song	6b01b46538	[BPF] preserve debuginfo types for builtin __builtin__btf_type_id() The builtin function u32 btf_type_id = __builtin_btf_type_id(param, 0) can help preserve type info for the following use case: extern void foo(..., void *data, int size); int test(...) { struct t { int a; int b; int c; } d; d.a = ...; d.b = ...; d.c = ...; foo(..., &d, sizeof(d)); } The function "foo" in the above only see raw data and does not know what type of the data is. In certain cases, e.g., logging, the additional type information will help pretty print. This patch handles the builtin in BPF backend. It includes an IR pass to translate the IR intrinsic to a load of a global variable which carries the metadata, and an MI pass to remove the intermediate load of the global variable. Finally, in AsmPrinter pass, proper instruction are generated. In the above example, the second argument for __builtin_btf_type_id() is 0, which means a relocation for local adjustment, i.e., w.r.t. bpf program BTF change, will be generated. The value 1 for the second argument means a relocation for remote adjustment, e.g., against vmlinux. Differential Revision: https://reviews.llvm.org/D74572	2020-05-15 08:00:44 -07:00
Jay Foad	10c10f2419	[AMDGPU] Fix assertion failure in SIInsertHardClauses This new pass failed an assertion whenever there were s_nops after the end of clause. Differential Revision: https://reviews.llvm.org/D80007	2020-05-15 15:49:52 +01:00
Alexandre Ganea	76c5f277f2	Re-land [Debug][CodeView] Emit fully qualified names for globals Before this patch, S_[L\|G][THREAD32\|DATA32] records were emitted with a simple name, not the fully qualified name (namespace + class scope). Differential Revision: https://reviews.llvm.org/D79447	2020-05-15 10:37:09 -04:00
Yvan Roux	96c4460a0b	[ARM][MachineOutliner] Fix memory leak. Fix sanitizer bots after `0e4827aa4e`	2020-05-15 16:27:14 +02:00
Dmitry Vyukov	151ed6aa38	[TSAN] Add option to allow instrumenting reads of reads-before-writes Add -tsan-instrument-read-before-write which allows instrumenting reads of reads-before-writes. This is required for KCSAN [1], where under certain configurations plain writes behave differently (e.g. aligned writes up to word size may be treated as atomic). In order to avoid missing potential data races due to plain RMW operations ("x++" etc.), we will require instrumenting reads of reads-before-writes. [1] https://github.com/google/ktsan/wiki/KCSAN Author: melver (Marco Elver) Reviewed-in: https://reviews.llvm.org/D79983	2020-05-15 16:08:44 +02:00
David Sherwood	fb1c55b57d	[CodeGen] Fix FoldConstantVectorArithmetic for scalable vectors For now I have changed FoldConstantVectorArithmetic to return early if we encounter a scalable vector, since the subsequent code assumes you can perform lane-wise constant folds. However, in future work we should be able to extend this to look at splats of a constant value and fold those if possible. I have also added the same code to FoldConstantArithmetic, since that deals with vectors too. The warnings I fixed in this patch were being generated by this existing test: CodeGen/AArch64/sve-int-arith.ll Differential Revision: https://reviews.llvm.org/D79421	2020-05-15 14:58:44 +01:00
Ties Stuij	8c24f33158	[IR][BFloat] Add BFloat IR type Summary: The BFloat IR type is introduced to provide support for, initially, the BFloat16 datatype introduced with the Armv8.6 architecture (optional from Armv8.2 onwards). It has an 8-bit exponent and a 7-bit mantissa and behaves like an IEEE 754 floating point IR type. This is part of a patch series upstreaming Armv8.6 features. Subsequent patches will upstream intrinsics support and C-lang support for BFloat. Reviewers: SjoerdMeijer, rjmccall, rsmith, liutianle, RKSimon, craig.topper, jfb, LukeGeeson, sdesmalen, deadalnix, ctetreau Subscribers: hiraditya, llvm-commits, danielkiss, arphaman, kristof.beyls, dexonsmith Tags: #llvm Differential Revision: https://reviews.llvm.org/D78190	2020-05-15 14:43:43 +01:00
Simon Pilgrim	9d4b4f344d	DAGCombiner.cpp - remove non-constant EXTRACT_SUBVECTOR/INSERT_SUBVECTOR handling. NFC. Now that D79814 has landed, we can assume that subvector ops use constant, in-range indices.	2020-05-15 12:41:35 +01:00
Konstantin Schwarz	5425cdc3ad	[GlobalISel][InlineAsm] Add early return for memory inputs that need to be indirectified Summary: D78319 introduced basic support for inline asm input operands in GlobalISel. However, that patch did not handle the case where a memory input operand still needs to be indirectified. Later code asserts that the memory operand is already indirect. This patch adds an early return false to trigger the SelectionDAG fallback for now. Reviewers: arsenm, paquette Reviewed By: arsenm Subscribers: thakis, wdng, rovka, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79955	2020-05-15 13:37:06 +02:00
Simon Pilgrim	1024e82469	X86ISelLowering.cpp - remove non-constant EXTRACT_SUBVECTOR/INSERT_SUBVECTOR handling. NFC. Now that D79814 has landed, we can assume that subvector ops use constant, in-range indices.	2020-05-15 11:50:00 +01:00
Georgii Rymar	710d9d66f8	[DebugInfo] - DWARFDebugFrame: do not call abort() on errors. Imagine we have a broken .eh_frame. Below is a possible sample output of llvm-readelf: ``` ... entry 2 { initial_location: 0x10f5 address: 0x2080 } } } .eh_frame section at offset 0x2028 address 0x2028: LLVM ERROR: Parsing entry instructions at 0 failed PLEASE submit a bug report to https://bugs.llvm.org/ and include the crash backtrace. Stack dump: 0. Program arguments: /home/umb/LLVM/LLVM/llvm-project/build/bin/llvm-readelf -a 1 #0 0x000055f4a2ff5a1a llvm::sys::PrintStackTrace(llvm::raw_ostream&) (/home/umb/LLVM/LLVM/llvm-project/build/bin/llvm-readelf+0x2b9a1a) ... #15 0x00007fdae5dc209b __libc_start_main /build/glibc-B9XfQf/glibc-2.28/csu/../csu/libc-start.c:342:3 #16 0x000055f4a2db746a _start (/home/umb/LLVM/LLVM/llvm-project/build/bin/llvm-readelf+0x7b46a) Aborted ``` I.e. it calls abort(), suggests to submit a bug report and exits with the code 134. This patch changes the logic to propagate errors to callers. This fixes the behavior for llvm-dwarfdump, llvm-readobj and other possible tools. Differential revision: https://reviews.llvm.org/D79165	2020-05-15 13:05:35 +03:00
Georgii Rymar	7ccae2cece	[yaml2obj] - Introduce the "Offset" property for sections. Currently there is no good way to set a physical offset for a section: * We have the `ShOffset` that allows to override the `sh_offset`, but it does not affect the real data written. * We can use a `Filler` to create an artificial gap, but it is more like a hack rather than a proper solution for this problem. This patch adds the `Offset` property which allows setting physical offsets for sections. It also generalizes the code, so that we set sh_offset field in one place Differential revision: https://reviews.llvm.org/D78927	2020-05-15 11:23:44 +03:00
Djordje Todorovic	170ac4be33	[CSInfo][ISEL] Call site info generation support for Mips Debug entry values functionality provides debug information about call sites and function parameters values at the call entry spot. Condition for generating this type of information is compiling with -g option and optimization level higher than zero(-O0). In ISEL phase, while lowering call instructions, collect info about registers that forward arguments into following function frame. We store such info into MachineFunction of the caller function. This is used very late, when dumping DWARF info about call site parameters. The call site info is visible at MIR level, as callSites attribute of MachineFunction. Also, when using unmodified parameter value inside callee it could be described as DW_OP_entry_value expression. To deal with callSites attribute, we should pass -emit-call-site-info option to llc. This patch enables functionality in clang frontend and adds call site info generation support for MIPS targets (mips, mipsel, mips64, mips64el). Patch by Nikola Tesic Differential Revision: https://reviews.llvm.org/D78105	2020-05-15 10:13:15 +02:00
David Sherwood	525b8e6dcb	[SVE] Fix wrong usage of getNumElements() in matchIntrinsicType I have changed the ScalableVecArgument case in matchIntrinsicType to create a new FixedVectorType. This means that the next case we hit (Vector) will not assert when calling getNumElements(), since we know that it's always a FixedVectorType. This is a temporary measure for now, and it will be fixed properly in another patch that refactors this code. The changes are covered by this existing test: CodeGen/AArch64/sve-intrinsics-fp-converts.ll In addition, I have added a new test to ensure that we correctly reject SVE intrinsics when called with fixed length vector types. Differential Revision: https://reviews.llvm.org/D79416	2020-05-15 08:44:59 +01:00
Li Rong Yi	80173566f4	[PowerPC] Add an intrinsic for Popcntb Summary: This patch adds the intrinsic llvm.ppc.popcntb for the HW instruction POPCNTB Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D79703	2020-05-15 15:19:12 +08:00
Yvan Roux	0e4827aa4e	[ARM][MachineOutliner] Add Machine Outliner support for ARM. Enables Machine Outlining for ARM and Thumb2 modes. This is the first patch of the series which adds all the basic logic for the support, and only handles tail-calls and thunks. The outliner can be turned on by using clang -moutline option or -mllvm -enable-machine-outliner one (like AArch64). Differential Revision: https://reviews.llvm.org/D76066	2020-05-15 08:44:23 +02:00
David Sherwood	8ce4a8f6df	[CodeGen] Refactor CreateStackTemporary I've created a new variant of CreateStackTemporary that takes TypeSize and Align arguments, and made the older instances of CreateStackTemporary call this new function. This refactoring is in preparation for more patches in this area related to scalable vectors and improving the alignment calculations. Differential Revision: https://reviews.llvm.org/D79933	2020-05-15 07:29:13 +01:00
Alok Kumar Sharma	4042ada1c1	[DebugInfo] support for DW_AT_data_location in llvm This patch adds support for DWARF attribute DW_AT_data_location. Summary: Dynamic arrays in fortran are described by array descriptor and data allocation address. Former is mapped to DW_AT_location and later is mapped to DW_AT_data_location. Testing: unit test cases added (hand-written) check llvm check debug-info Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D79592	2020-05-15 11:33:17 +05:30
Alok Kumar Sharma	ab699d78a2	[DebugInfo] llvm rejects DWARF operator DW_OP_push_object_address llvm rejects DWARF operator DW_OP_push_object_address.This DWARF operator is needed for Flang to support allocatable array. Summary: Currently llvm rejects DWARF operator DW_OP_push_object_address. below error is produced when llvm finds this operator. [..] invalid expression !DIExpression(151) warning: ignoring invalid debug info in pushobj.ll [..] There are some parts missing in support of this operator, need to be completed. Testing -added a unit testcase -check-debuginfo -check-llvm Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D79306	2020-05-15 11:10:35 +05:30
Mircea Trofin	454de99a6f	Revert "[llvm][NFC] Cleanup uses of std::function in Inlining-related APIs" This reverts commit `767db5be67`.	2020-05-14 22:32:44 -07:00
Mircea Trofin	767db5be67	[llvm][NFC] Cleanup uses of std::function in Inlining-related APIs Summary: Replacing uses of std::function pointers or refs, or Optional, to function_ref, since the usage pattern allows that. If the function is optional, using a default parameter value (nullptr). This led to a few parameter reshufles, to push all optionals to the end of the parameter list. Reviewers: davidxl, dblaikie Subscribers: arsenm, jvesely, nhaehnle, eraman, hiraditya, haicheng, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79917	2020-05-14 22:13:53 -07:00
Kang Zhang	aedb6615a8	[MachineVerifier] Use the for_range loop to instead llvm::any_of Summary: In the patch D78849, it uses llvm::any_of to instead of for loop to simplify the function addRequired(). It's obvious that above code is not a NFC conversion. Because any_of will return if any addRequired(Reg) is true immediately, but we want every element to call addRequired(Reg). This patch uses for_range loop to fix above any_of bug. Reviewed By: MaskRay, nickdesaulniers Differential Revision: https://reviews.llvm.org/D79872	2020-05-15 02:35:33 +00:00
Eric Christopher	dad2e92eaf	Temporarily Revert "[Support] Make UniqueStringSaver wrap a StringSet" as it's causing asan failures in clangd. Followed up offline with repro instructions. This reverts commit `29560a89dd`.	2020-05-14 19:18:20 -07:00
Joel E. Denny	5df55bc7a4	[FileCheck] Fix isalpha/isalnum calls D79276 caused the following builder to fail: http://lab.llvm.org:8011/builders/llvm-clang-x86_64-expensive-checks-win/builds/23489 Specifically, FileCheck dumped stack in the following tests: LLVM :: MC/Mips/micromips-jump-pc-region.s LLVM :: MC/Mips/mips-jump-pc-region.s Those tests contained characters encoded as 160 but that render (at least for me in vim) like a single space (32). Those characters appeared between the `#` and `RUN:` on several lines, and D79276 caused FileCheck to process those lines differently: `RUN:` is a comment directive. As a result, D79276 caused FileCheck to start calling is `isalnum` on those characters. The problem is that FileCheck calls `isalnum` on type `char` without casting to `unsigned char` first, so it sign-extends 160 beyond what `unsigned char` or `EOF` can represent. C says that has undefined behavior. This problem is general to FileCheck's prefix parsing and so exists independently of D79276. `524457edbc` fixed the above tests. This patch changes FileCheck to use LLVM's replacements for `ctype.h` functions, and it adds tests for cases that are representative with or without D79276. Reviewed By: jhenderson, thopre, efriedma Differential Revision: https://reviews.llvm.org/D79810	2020-05-14 20:24:09 -04:00
Davide Italiano	da52aa2c33	[LICM] When promoting loads to the preheader, drop the location. It's really almost going to be misleading, see the example in https://bugs.llvm.org/show_bug.cgi?id=45820 Maybe at some point we can do something fancier, but at least this will fix a bug where we step on dead code while debugging.	2020-05-14 17:05:23 -07:00
Nico Weber	e0c1554274	Revert "[GlobalISel][InlineAsm] Add early return for memory inputs that need to be indirectified" This reverts commit `887dfeec53`. It broke irtranslator-inline-asm.ll on many bots, e.g. http://lab.llvm.org:8011/builders/lld-x86_64-freebsd/builds/38606/steps/test-check-all/logs/FAIL%3A%20LLVM%3A%3Airtranslator-inline-asm.ll	2020-05-14 19:37:05 -04:00
Mircea Trofin	8a2e2a6a2b	[llvm] Fix refactoring bug introduced in D79042 Incorrectly copied over the GetAssumptionCache snippet. This patch also renames a variable for clarity.	2020-05-14 15:59:43 -07:00
Stanislav Mekhanoshin	7d16a22eb0	[AMDGPU] Peephole adjacent equivalent S_SET_GPR_IDX_ON Differential Revision: https://reviews.llvm.org/D79907	2020-05-14 15:44:33 -07:00
Stanislav Mekhanoshin	9d4cf5bd42	[AMDGPU] Make v16f64/v16i64 legal This allows indirect VGPR addressing to work. Differential Revision: https://reviews.llvm.org/D79960	2020-05-14 14:46:55 -07:00
Konstantin Schwarz	887dfeec53	[GlobalISel][InlineAsm] Add early return for memory inputs that need to be indirectified Summary: D78319 introduced basic support for inline asm input operands in GlobalISel. However, that patch did not handle the case where a memory input operand still needs to be indirectified. Later code asserts that the memory operand is already indirect. This patch adds an early return false to trigger the SelectionDAG fallback for now. Reviewers: arsenm, paquette Reviewed By: arsenm Subscribers: wdng, rovka, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79955	2020-05-14 23:42:31 +02:00
Cameron McInally	b085e51d81	[AArch64][SVE] Add some integer DestructiveBinaryComm* patterns Add DestructiveBinaryComm* patterns for ADD, SUB, and SUBR. Differential Revision: https://reviews.llvm.org/D76711	2020-05-14 16:35:49 -05:00
Stanislav Mekhanoshin	184b383457	Add v16f64 value type We need to use it to handle <16 x double> indirect indexes in the AMDGPU BE. The only visible change from adding it is in ARM cost model. To me it looks reasonable. With doubling a vector size it quadruples the cost up to the size 8 and then it did only double it. Now it also quadruples, which seems a logical progression to me. Actual AMDGPU code is to follow, this is a common part, plus load/store legalization in the AMDGPU BE not to break what works now. Differential Revision: https://reviews.llvm.org/D79952	2020-05-14 14:28:00 -07:00
Eli Friedman	accc6b5545	LoadInst should store Align, not MaybeAlign. The fact that loads and stores can have the alignment missing is a constant source of confusion: code that usually works can break down in rare cases. So fix the LoadInst API so the alignment is never missing. To reduce the number of changes required to make this work, IRBuilder and certain LoadInst constructors will grab the module's datalayout and compute the alignment automatically. This is the same alignment instcombine would eventually apply anyway; we're just doing it earlier. There's a minor risk that the way we're retrieving the datalayout could break out-of-tree code, but I don't think that's likely. This is the last in a series of patches, so most of the necessary changes have already been merged. Differential Revision: https://reviews.llvm.org/D77454	2020-05-14 13:19:21 -07:00
Wouter van Oortmerssen	2b7fe0863a	[WebAssembly] Added Debug Fixup pass This pass changes debug_value instructions referring to stackified registers into TI_OPERAND_STACK with correct stack depth.	2020-05-14 13:14:45 -07:00
Eli Friedman	4532a50899	Infer alignment of unmarked loads in IR/bitcode parsing. For IR generated by a compiler, this is really simple: you just take the datalayout from the beginning of the file, and apply it to all the IR later in the file. For optimization testcases that don't care about the datalayout, this is also really simple: we just use the default datalayout. The complexity here comes from the fact that some LLVM tools allow overriding the datalayout: some tools have an explicit flag for this, some tools will infer a datalayout based on the code generation target. Supporting this properly required plumbing through a bunch of new machinery: we want to allow overriding the datalayout after the datalayout is parsed from the file, but before we use any information from it. Therefore, IR/bitcode parsing now has a callback to allow tools to compute the datalayout at the appropriate time. Not sure if I covered all the LLVM tools that want to use the callback. (clang? lli? Misc IR manipulation tools like llvm-link?). But this is at least enough for all the LLVM regression tests, and IR without a datalayout is not something frontends should generate. This change had some sort of weird effects for certain CodeGen regression tests: if the datalayout is overridden with a datalayout with a different program or stack address space, we now parse IR based on the overridden datalayout, instead of the one written in the file (or the default one, if none is specified). This broke a few AVR tests, and one AMDGPU test. Outside the CodeGen tests I mentioned, the test changes are all just fixing CHECK lines and moving around datalayout lines in weird places. Differential Revision: https://reviews.llvm.org/D78403	2020-05-14 13:03:50 -07:00
Christopher Tetreault	920ff806d4	[SVE] Remove usages of VectorType::getNumElements() from SystemZ Reviewers: efriedma, david-arm, c-rhodes, jnspaulsson Reviewed By: david-arm Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79824	2020-05-14 12:46:51 -07:00
Eli Friedman	44ecaabc07	[BitcodeReader] datalayout must be specified before it is queried. This isn't really a new invariant; it effectively already existed due to existing DataLayout queries. But this makes it explicit. This is technically not backward-compatible with the existing bitcode reader, but it's backward-compatible with the output of the bitcode writer, which is what matters in practice. No testcase because I don't know a good way to write one: there are no existing tools that can generate a bitcode file that would trigger the error. Split off from D78403. Differential Revision: https://reviews.llvm.org/D79900	2020-05-14 12:45:17 -07:00
Craig Topper	2fdeee9c82	[X86] Add support for forming vXi16 PMULH instructions from shifts. We already form PMULH when the shift is truncated. But we can also do it from just a shift by extending the result. Unfortunately, I get regressions if I try to replace the truncate combine with this as we turn the truncate into a more complicated sequence first. Then we are unable to combine that sequence with the extend produced at the end of this combine. Differential Revision: https://reviews.llvm.org/D79682	2020-05-14 10:58:00 -07:00
Jay Foad	42a5560503	[AMDGPU] New SIInsertHardClauses pass Enable clausing of memory loads on gfx10 by adding a new pass to insert the s_clause instructions that mark the start of each hard clause. Differential Revision: https://reviews.llvm.org/D79792	2020-05-14 18:54:49 +01:00
Craig Topper	2b0b9b1148	[X86] Fix a regression caused by moving combineLoopMAddPattern to IR When I moved combineLoopMAddPattern to an IR pass. I didn't match the behavior of canReduceVMulWidth that was used in the SelectionDAG version. canReduceVMulWidth just calls computeSignBits and assumes a truncate is always profitable. The version I put in IR just looks for constants and zext/sext. Though I neglected to check the number of bits in input of the zext/sext. This patch adds a check for the number of input bits to the sext/zext. And it adds a special case for add/sub with zext/sext inputs which can be handled by combineTruncatedArithmetic. Match the original SelectionDAG behavior appears to be a regression in some cases if the truncate isn't removed and becomes pack and permq. So enabling only this specific case is the conservative approach. Differential Revision: https://reviews.llvm.org/D79909	2020-05-14 10:31:28 -07:00
Simon Pilgrim	acb6f1ae09	TargetLowering.cpp - remove non-constant EXTRACT_SUBVECTOR/INSERT_SUBVECTOR handling. NFC. Now that D79814 has landed, we can assume that subvector ops use constant, in-range indices.	2020-05-14 18:13:58 +01:00
Momchil Velikov	bc2e572f51	Re-commit: [ARM] CMSE code generation This patch implements the final bits of CMSE code generation: * emit special linker symbols * restrict parameter passing to no use memory * emit BXNS and BLXNS instructions for returns from non-secure entry functions, and non-secure function calls, respectively * emit code to save/restore secure floating-point state around calls to non-secure functions * emit code to save/restore non-secure floating-pointy state upon entry to non-secure entry function, and return to non-secure state * emit code to clobber registers not used for arguments and returns * when switching to no-secure state Patch by Momchil Velikov, Bradley Smith, Javed Absar, David Green, possibly others. Differential Revision: https://reviews.llvm.org/D76518	2020-05-14 16:46:16 +01:00
Jay Foad	17941437a2	[TargetLowering] Improve expansion of FSHL/FSHR Use an extra shift-by-1 instead of a compare and select to handle the shift-by-zero case. This sometimes saves one instruction (if the compare couldn't be combined with a previous instruction). It also works better on targets that don't have good select instructions. Note that currently this change doesn't affect most targets because expandFunnelShift is not used because funnel shift intrinsics are lowered early in SelectionDAGBuilder. But there is work afoot to change that; see D77152. Differential Revision: https://reviews.llvm.org/D77301	2020-05-14 16:36:22 +01:00
Anna Thomas	eb282be9f8	[RS4GC] Fix algorithm to avoid setting vector BDV for scalar derived pointer"" This is relanding of rGbb308b020522420413c7d3f2989a88f2fc423c56 after speculatively fixing buildbot lit test failure which was seen on two bots (I cannot reproduce the lit test failure locally either). [RS4GC] Fix algorithm to avoid setting vector BDV for scalar derived pointer Summary: This is a more general fix to `59029b9eef` (D75704). This patch does the following: updates isKnownBaseValue to account for base pointer and derived pointer having differing types. This inturn allows us to populate the lattice (States) for such derived pointers. It also updates all states where the base and derived pointers have differing types (vector versus scalar) and conservatively marks these states as conflictcs. Note that in `59029b9eef`, we were just fixing existing lattice values and that too, only for uses of extractelement. Reviewers: reames, skatkov, dantrushin Reviewed By: skatkov Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D76305	2020-05-14 11:17:45 -04:00
Benjamin Kramer	29560a89dd	[Support] Make UniqueStringSaver wrap a StringSet This is slightly more efficient while providing exactly the same semantics.	2020-05-14 17:11:44 +02:00
Ehud Katz	c6c265527d	Revert "[StructurizeCFG] Fix region nodes ordering" This reverts commit `897d8ee5cd`, due to causing an infinite loop when encountering a loop with a sub-region with an inner loop.	2020-05-14 17:56:39 +03:00
Sean Fertile	ce4ebc14a8	[PowerPC] Remove support for SplitCSR. SplitCSR was only suppored for functions with CXX_FAST_TLS calling convention. Clang only emits that calling convention for Darwin which is no longer supported by the PowerPC backend. Another IR producer could use the calling convention, but considering the calling convention is meant to be an optimization and the codegen for SplitCSR can be attrocious on Power (see the modifed lit test) it is best to remove it and codegen CXX_FAST_TLS same as the C calling convention. Differential Revision: https://reviews.llvm.org/D79018	2020-05-14 10:32:17 -04:00
Anna Thomas	f20c62741e	Revert "[RS4GC] Fix algorithm to avoid setting vector BDV for scalar derived pointer" This reverts commit `bb308b0205`. Failing a testcase.	2020-05-14 10:16:25 -04:00
Anna Thomas	bb308b0205	[RS4GC] Fix algorithm to avoid setting vector BDV for scalar derived pointer Summary: This is a more general fix to `59029b9eef` (D75704). This patch does the following: 1. updates isKnownBaseValue to account for base pointer and derived pointer having differing types. 2. This inturn allows us to populate the lattice (States) for such derived pointers. 3. It also updates all states where the base and derived pointers have differing types (vector versus scalar) and conservatively marks these states as conflictcs. Note that in `59029b9eef`, we were just fixing existing lattice values and that too, only for uses of extractelement. Reviewers: reames, skatkov, dantrushin Reviewed By: skatkov Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76305	2020-05-14 10:03:30 -04:00
Xinglong Liao	5f3f45dc53	[Hexagon] Check isInstr() before getInstr() with SUnit SUnit represent a MachineInstr in post-regalloc scheduling but SDNode in pre-regalloc scheduling. when pass -enable-hexagon-sdnode-sched to Hexagon backend with -O1 and above, this may cause an assertion failed. Fixes PR45194. Differential Revision: https://reviews.llvm.org/D76134	2020-05-14 08:47:54 -05:00
Gabor Marton	5fc05c376a	Fix Z3 function calls regarding arithmetic operations Summary: The order of Z3_mk_fpa_mul, Z3_mk_fpa_div, Z3_mk_fpa_add and Z3_mk_fpa_sub functions' arguments is: context, rounding_mode, ast1, ast2. See for example: `a14c2a3051/src/api/api_fpa.cpp (L433)` At function calls from LLVM the argument order was different: rounding_mode was passed as last argument. Unfortunately these Z3_ast and other function parameter types are technically like void* which are reinterpret_cast-ed to a specific class type. So there was no type error, but the assertions fail in runtime if something goes wrong. Such a crash happened during Z3 refutation while using StaticAnalyzer. Reviewers: Szelethus, xazax.hun, baloghadamsoftware, steakhal, martong, mikhail.ramalho Reviewed By: martong Subscribers: hiraditya, rnkovacs, mikhail.ramalho, martong, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79883 Patch by Tibor Brunner!	2020-05-14 15:46:13 +02:00
Sanjay Patel	26e742fd84	[x86][CGP] improve sinking of splatted vector shift amount operand Expands on the enablement of the shouldSinkOperands() TLI hook in: D79718 The last codegen/IR test diff shows what I suspected could happen - we were sinking all splat shift operands into a loop. But that's not what we want in general; we only want to sink the shift amount operand if it is a splat. Differential Revision: https://reviews.llvm.org/D79827	2020-05-14 08:36:03 -04:00
Simon Pilgrim	80715b7124	SelectionDAG.cpp - remove non-constant EXTRACT_SUBVECTOR/INSERT_SUBVECTOR handling. NFC. Now that D79814 has landed, we can assume that subvector ops use constant, in-range indices.	2020-05-14 13:23:00 +01:00
Florian Hahn	4c8285c750	[VPlan] Move emission of \\l\"+\n to dumpBasicBlock (NFC). The patch standardizes printing of VPRecipes a bit, by hoisting out the common emission of \\l\"+\n. It simplifies the code and is also a first step towards untangling printing from DOT format output, with the goal of making the DOT output optional and to provide a more concise debug output if DOT output is disabled. Reviewers: gilr, Ayal, rengolin Reviewed By: gilr Differential Revision: https://reviews.llvm.org/D78883	2020-05-14 13:07:59 +01:00
Konstantin Schwarz	91063cf85a	[GlobalISel][InlineAsm] Add support for basic input operand constraints Reviewers: arsenm, dsanders, aemerson, volkan, t.p.northover, paquette Reviewed By: arsenm Subscribers: gargaroff, wdng, rovka, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78319	2020-05-14 10:43:37 +02:00
Greg Clayton	6e73f12a64	Fix buildbots errors after comitting D78782. Rename "Ranges" variables to "DebugRanges" to avoid warnings/errors on machines that have extra settings enabled. https://reviews.llvm.org/D78782	2020-05-13 22:01:57 -07:00
Craig Topper	fa8c2ae76f	[X86] Return true from trySADReplacement in the partial reduction pass when a change is made. Otherwise we don't signal to the pass manager that we changed IR.	2020-05-13 17:52:29 -07:00
Christopher Tetreault	2a77d1d0ed	[SVE] Remove usages of VectorType::getNumElements() from Hexagon Reviewers: efriedma, kmclaughlin, sdesmalen, kparzysz Reviewed By: kparzysz Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79819	2020-05-13 17:13:12 -07:00
Wei Zhao	382d3a85e2	[AARch64] Add Marvell ThunderX3T110 support This is the first checkin to support Marvell ThunderX3T110. Initial definition of the micro-ops of the instructions in ThunderX3T110 is included. Differential Revision: https://reviews.llvm.org/D78129	2020-05-13 16:58:51 -07:00
Omar Ahmed	425333c23b	[Attributor] Improve the alignment of the loads This patch introduces an improvement in the Alignment of the loads generated in createReplacementValues() by querying AAAlign attribute for the best Alignment for the base. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D76550	2020-05-13 18:24:05 -05:00
Greg Clayton	6025fc2243	Add .debug_ranges support to the DWARF YAML. Summary: This allows DIEs with DW_AT_ranges to be encoded and decoded _and_ actually have their address ranges be included instead of having DW_AT_ranges with a section offset value for a section that doesn't exist. Reviewers: labath, aprantl, JDevlieghere, dblaikie, probinson Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78782	2020-05-13 16:21:45 -07:00
Christopher Tetreault	3254a001fc	[SVE] Remove usages of VectorType::getNumElements() from AMDGPU Reviewers: efriedma, arsenm, david-arm, fpetrogalli Reviewed By: efriedma Subscribers: dmgreen, arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, tschuett, hiraditya, rkruppe, psnobl, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79807	2020-05-13 15:57:55 -07:00
Florian Hahn	824a859332	[AArch64] Don't promote constants with float ConstantExpr. Currently the AsmPrinter cannot emit some floating point constant expressions in global initializers. Avoid generating them. Reviewers: dmgreen, t.p.northover, arsenm, efriedma, Gerolf Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D79865	2020-05-13 23:31:47 +01:00
Eric Christopher	bfa200ebcf	Remove an unused variable.	2020-05-13 15:13:02 -07:00
Eli Friedman	a52f10b5a3	[AArch64][SVE] Add patterns for VSELECT of immediate merged with a variable. This covers forms involving "CPY (immediate, merging)". Differential Revision: https://reviews.llvm.org/D79803	2020-05-13 15:02:08 -07:00
Stanislav Mekhanoshin	591b029f40	[AMDGPU] Optimized indirect multi-VGPR addressing SelectMOVRELOffset prevents peeling of a constant from an index if final base could be negative. isBaseWithConstantOffset() succeeds if a value is an "add" or "or" operator. In case of "or" it shall be an add-like "or" which never changes a sign of the sum given a non-negative offset. I.e. we can safely allow peeling if operator is an "or". Differential Revision: https://reviews.llvm.org/D79898	2020-05-13 14:53:16 -07:00
Kuter Dinel	e57807769b	[Attributor] Use AAValueConstantRange to infer dereferencability. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D76208	2020-05-13 16:44:15 -05:00
Eric Christopher	d6e3e55c40	Remove unused Debugging variable.	2020-05-13 14:37:26 -07:00
Reid Kleckner	4092742740	[PDB] Switch from LLVM_PACKED to LLVM_PACKED_START/END Reportedly using the pragma instead of the __attribute__ silences warnings with some GCC versions.	2020-05-13 14:24:11 -07:00
Mircea Trofin	d6695e1876	[llvm] Add interface to drive inlining decision using ML model Summary: This change introduces InliningAdvisor (and related APIs), the interface that abstracts decision making away from the inlining pass. We will use this interface to delegate decision making to a trained ML model, subsequently (see referenced RFC). RFC: http://lists.llvm.org/pipermail/llvm-dev/2020-April/140763.html Reviewers: davidxl, eraman, dblaikie Subscribers: mgorny, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79042	2020-05-13 13:27:29 -07:00
Craig Topper	028bfdd891	[X86] Only allow f32, f64, or f80 to be used with 'f' inline assembly constraint. Avoids crash when using i128. Gives better error than 'scalar-to-vector conversion failed' for other types.	2020-05-13 13:27:13 -07:00
Eli Friedman	ed428c429e	[SelectionDAG] Require constant index for INSERT/EXTRACT_SUBVECTOR. It sounds like an interesting idea in theory, but nothing is actually taking advantage of it, and specifying/implementing the edge cases is painful. So just forbid it. Differential Revision: https://reviews.llvm.org/D79814	2020-05-13 13:08:59 -07:00
Alina Sbirlea	bd541b217f	[NewPassManager] Add assertions when getting statefull cached analysis. Summary: Analyses that are statefull should not be retrieved through a proxy from an outer IR unit, as these analyses are only invalidated at the end of the inner IR unit manager. This patch disallows getting the outer manager and provides an API to get a cached analysis through the proxy. If the analysis is not stateless, the call to getCachedResult will assert. Reviewers: chandlerc Subscribers: mehdi_amini, eraman, hiraditya, zzheng, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72893	2020-05-13 12:38:38 -07:00
Alina Sbirlea	db04ff4b6b	[SimpleLoopUnswitch] Add non-empty unreachable block check to exit cases removed. Summary: Update check to include the check for unreachable. Basic blocks ending in unreachable are special cased, as these blocks may be already unswitched. Before this patch this check is only done for the default destination. The condition for the exit cases and the default case must be the same, because we should never leave edges from the switch instruction to a basic block that we are unswitching. In PR45355 we still have a remaining edge (that we're attempting to remove from the DT) because its the default edge to an unreachable-terminated block where we unswitch a case edge to that block. Resolves PR45355. Reviewers: chandlerc Subscribers: hiraditya, uabelho, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78279	2020-05-13 12:38:37 -07:00
Matt Arsenault	704b539f65	AMDGPU: Use Register	2020-05-13 15:31:54 -04:00
Craig Topper	38e0ab2f3a	[X86] Don't allow f80 to be used with the 'q', 'r', 'l', 'Q' or 'q' inline assembly constraints. It was previously trying to use the 64-bit class, but 80 isn't evenly divisible by 64 so it will trigger a crash.	2020-05-13 12:19:57 -07:00
Craig Topper	47985451ed	[X86] Make the if statement structure for inline assembly constraints 'l', 'r', 'q', 'Q', and 'R' the same. These did similar things but had slight differences. For example 'Q' didn't allow f64, but the others did.	2020-05-13 12:19:57 -07:00
Eli Friedman	fcfb3170a7	[SROA] Clean up some uses of MaybeAlign in SROA. Use Align instead of using MaybeAlign; all the operations in question have known alignment. For getSliceAlign() in particular, in the cases where we used to return None, it would be converted back to an Align by IRBuilder, so there's no functional change there. Split off from D77454. Differential Revision: https://reviews.llvm.org/D79205	2020-05-13 11:23:29 -07:00
Craig Topper	de92dc2850	[Statepoint] Mark FixupStatepointCallerSaved as preserving the CFG I'm hoping this will restore some compile time lost by D75936 and D75937. Differential Revision: https://reviews.llvm.org/D79813	2020-05-13 10:59:44 -07:00
Sylvain Audi	7a8edcb212	[Clang] Restore replace_path_prefix instead of startswith In D49466, sys::path::replace_path_prefix was used instead startswith for -f[macro/debug/file]-prefix-map options. However those were reverted later (commit rG3bb24bf25767ef5bbcef958b484e7a06d8689204) due to broken Windows tests. This patch restores those replace_path_prefix calls. It also modifies the prefix matching to be case-insensitive under Windows. Differential Revision : https://reviews.llvm.org/D76869	2020-05-13 13:49:14 -04:00
Huber, Joseph	4d4ea9ac59	OpenMPOpt Remarks Support Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D79359	2020-05-13 12:20:40 -05:00
Amy Huang	641ae73f2e	[NativeSession] Implement NativeSession::findSymbolByAddress. Summary: This implements searching for function symbols and public symbols by address. More specifically, -Implements NativeSession::findSymbolByAddress for function symbols and public symbols. I think data symbols are also searched for, but isn't implemented in this patch. -Adds classes for NativeFunctionSymbol and NativePublicSymbol -Adds a '-use-native-pdb-reader' option to llvm-symbolizer, for testing purposes. Reviewers: rnk, amccarth, labath Subscribers: mgorny, hiraditya, MaskRay, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79269	2020-05-13 09:39:25 -07:00
Benjamin Kramer	a8bf2deae4	[CodeGenPrepare] Remove a superflouos variable. NFC. Fixes a -Wunused-variable warning in Release builds.	2020-05-13 18:25:20 +02:00

... 2 3 4 5 6 ...

134713 Commits