llvm-project

Commit Graph

Author	SHA1	Message	Date
David Sherwood	0905d9f31e	[SVE][CodeGen] Fix bug with store of unpacked FP scalable vectors Fixed an incorrect pattern in lib/Target/AArch64/AArch64SVEInstrInfo.td for storing out <vscale x 2 x f32> unpacked scalable vectors. Added a couple of tests to test/CodeGen/AArch64/sve-st1-addressing-mode-reg-imm.ll Differential Revision: https://reviews.llvm.org/D85441	2020-08-07 07:19:09 +01:00
Jonas Devlieghere	dbf44b8330	[LLDB] Mark test_launch_simple as a no-debug-info test No need to run this test with the multiple variants.	2020-08-06 23:18:37 -07:00
biplmish	cce1b0e891	[PowerPC] Implement Vector Extract Low/High Order Builtins in LLVM/Clang This patch implements the function prototypes vec_extractl and vec_extracth in altivec.h to utilize the vector extract double element instructions introduced in Power10. Differential Revision: https://reviews.llvm.org/D84622	2020-08-07 01:02:29 -05:00
QingShan Zhang	55de46f3b2	[PowerPC] Support constrained fp operation for setcc The constrained fp operation fcmp was added by https://reviews.llvm.org/D69281. This patch is trying to add the support for PowerPC backend. Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D81727	2020-08-07 05:16:36 +00:00
QingShan Zhang	3359ea62ed	[Scheduling] Create the missing dependency edges for store cluster If it is a load cluster, we don't need to create the dependency edges (SUb->reg) from SUb to SUa, as they both depend on the base register "reg". [Diagram: reg, SUa "Load 0(reg)", SUb "Load 4(reg)"] But if it is a store cluster, we need to create it, as the following shows, to avoid an instruction that the store depends on being scheduled in between SUb and SUa. [Diagram: reg, y, SUa "Store x 0(reg)", SUb "Store y 4(reg)", with an edge to y labeled "Missing"] Reviewed By: evandro, arsenm, rampitec, foad, fhahn Differential Revision: https://reviews.llvm.org/D72031	2020-08-07 04:58:03 +00:00
Michał Górny	96b02808af	[Polly] Support linking ScopPassManager against LLVM dylib Link ScopPassManager to LLVM dylib target if LLVM_LINK_LLVM_DYLIB is enabled. This fixes build failures on systems where static LLVM libraries are not installed. Differential Revision: https://reviews.llvm.org/D85281	2020-08-07 06:46:35 +02:00
Sameer Sahasrabuddhe	c530539bad	[AArch64][NFC] require aarch64 support for hwasan test This was breaking builds where the target is not enabled. Reviewed By: danielkiss, eugenis Differential Revision: https://reviews.llvm.org/D85412	2020-08-07 09:24:52 +05:30
Vitaly Buka	7fb9de2c6f	[StackSafety,NFC] Fix tests in debug	2020-08-06 20:46:39 -07:00
Tim Keith	d8713523a2	[flang] Improve message for assignment to subprogram In the example below we were producing the error message "Assignment to constant 'f' is not allowed": ``` function f() result(r) f = 1.0 end ``` This changes it to a more helpful message when the LHS is a subprogram name and also mentions the function result name when it's a function. Differential Revision: https://reviews.llvm.org/D85483	2020-08-06 20:34:00 -07:00
Shinji Okumura	f13f2e16f0	[Attributor] Check violation of returned position nonnull and noundef attribute in AAUndefinedBehavior This patch is a follow up of D84733. If a function has noundef attribute in returned position, instructions that return undef or poison value cause UB. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D85178	2020-08-07 12:02:42 +09:00
Vitaly Buka	39cbcbe1b1	[StackSafety,NFC] Add more tests	2020-08-06 19:50:05 -07:00
Vitaly Buka	d97636196a	[StackSafety,NFC] Sort llvm-lto2 resolutions in tests	2020-08-06 19:46:52 -07:00
Vitaly Buka	58b95c9b2b	[StackSafety,NFC] Add debug counters	2020-08-06 19:24:02 -07:00
Vitaly Buka	92dcf12b2f	[StackSafety,NFC] Use CHECK-EMPTY in tests	2020-08-06 19:19:51 -07:00
Vitaly Buka	faeeed6f52	[LLParser,NFC] Simplify forward GV refs update Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D85238	2020-08-06 19:18:51 -07:00
Michael Kruse	1139d899d5	[polly] Unbreak buildbot. The test failed since commit `bc10888dc` "DomTree: Make PostDomTree indifferent to block successors swap" which is a re-commit of `c35585e20` "DomTree: Make PostDomTree immune to block successors swap"	2020-08-06 21:17:27 -05:00
Vitaly Buka	0b2616a804	[StackSafety] Skip ambiguous lifetime analysis If we can't identify the alloca used in a lifetime marker, we need to assume the worst-case scenario. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D84630	2020-08-06 19:10:33 -07:00
Richard Smith	b2847671b8	Reinstate check that we don't crash.	2020-08-06 19:07:50 -07:00
Richard Smith	2f1fffab73	Disable clang-tidy test that started failing after clang commit `ed5a18f`. This checker appears to be intentionally not diagnosing cases where an operator appearing in a duplicated expression might have side-effects; Clang is now modeling fold-expressions as having an unresolved operator name within them, so they now trip up this check.	2020-08-06 19:06:06 -07:00
Vitaly Buka	5c6d9b2bbf	[LTO,NFC] Skip generateParamAccessSummary when empty addGlobalValueSummary can check newly added FunctionSummary and set HasParamAccess to mark that generateParamAccessSummary is needed. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D85182	2020-08-06 19:01:19 -07:00
Arthur Eubanks	72c95b2213	[NewPM] Add callback for skipped passes Parallel to https://reviews.llvm.org/D84772. Will use this for printing when a pass is skipped. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D85478	2020-08-06 18:58:59 -07:00
Nathan Ridge	f4ba7a100a	[clangd] Semantic highlighting for dependent template name in template argument Fixes https://github.com/clangd/clangd/issues/484 Differential Revision: https://reviews.llvm.org/D85272	2020-08-06 21:23:49 -04:00
Nico Weber	ecbf2b3496	fix doc typo to cycle bots	2020-08-06 21:02:41 -04:00
Kazushi (Jam) Marukawa	f92e0d9384	[VE] Optimize trunc related instructions Change to not generate truncate instructions if all uses of a truncate operation don't care about the higher bits. For example, an i32 add instruction doesn't care about the higher 32 bits in 64-bit registers. Also updates regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D85418	2020-08-07 09:21:05 +09:00
Richard Smith	ed5a18fc03	PR30738: Implement two-phase name lookup for fold-expressions.	2020-08-06 16:56:39 -07:00
Jessica Paquette	c8a282bcf7	[GlobalISel] Fix computing known bits for loads with range metadata In GlobalISel, if you have a load into a small type with a range, you'll hit an assert if you try to compute known bits on it starting at a larger type. e.g. ``` %x:_(s8) = G_LOAD %whatever(p0) :: (load 1 ... !range !n) ... %y:_(s32) = G_SOMETHING %x ``` When we walk through G_SOMETHING and hit the load, the width of our known bits is 32. However, the width of the range is going to be 8. This will cause us to hit an assert. To fix this, make computeKnownBitsFromRangeMetadata zero extend or truncate the range type to match the bitwidth of the known bits we're calculating. Add a testcase in CodeGen/GlobalISel/KnownBitsTest.cpp to reflect that this works now. https://reviews.llvm.org/D85375	2020-08-06 16:47:07 -07:00
Adrian Prantl	243903f326	Factor out common code from the iPhone/AppleTV/WatchOS simulator platform plugins. (NFC) The implementation of these classes was copied & pasted from the iPhone simulator plugin with only a handful of configuration parameters substituted. This patch moves the redundant implementations into the base class PlatformAppleSimulator. Differential Revision: https://reviews.llvm.org/D85243	2020-08-06 16:36:58 -07:00
Matt Arsenault	1ad051dd8c	GlobalISel: Implement lower for G_INSERT_VECTOR_ELT	2020-08-06 19:29:17 -04:00
Mark Mentovai	92d5839297	[gn build] mac: use frameworks instead of libs where appropriate As of GN 3028c6a426a4, the hack that transformed "libs" ending in ".framework" from -l arguments to -framework arguments has been removed. Instead, "frameworks" must be used, and the toolchain must provide support. Differential Revision: https://reviews.llvm.org/D84219	2020-08-06 18:58:57 -04:00
Arthur Eubanks	039fb7f68a	[NewPM][GuardWidening] Fix loop guard widening tests under NPM Reviewed By: ychen, asbirlea Differential Revision: https://reviews.llvm.org/D85394	2020-08-06 15:32:59 -07:00
Fangrui Song	004be4037e	[ELF] Change tombstone values to (.debug_ranges/.debug_loc) 1 and (other .debug_*) 0 tl;dr See D81784 for the 'tombstone value' concept. This patch changes our behavior to be almost the same as GNU ld (except that we also use 1 for .debug_loc): * .debug_ranges & .debug_loc: 1 (LLD<11: 0+addend; GNU ld uses 1 for .debug_ranges) * .debug_*: 0 (LLD<11: 0+addend; GNU ld uses 0; future LLD: -1) We make the tweaks because: 1) The new tombstone is novel and needs more time to be adopted by consumers before it's the default. 2) The old (gold) strategy had problems with zero-length functions - so rather than going back to that, we're going to the GNU ld strategy which doesn't have that problem. 3) One slight tweak to (2) is to apply the .debug_ranges workaround to .debug_loc for the same reasons it applies to .debug_ranges - to avoid terminating lists early. ----- http://lists.llvm.org/pipermail/llvm-dev/2020-July/143482.html The tombstone value -1 in .debug_line caused problems to lldb (fixed by D83957; will be included in 11.0.0) and breakpad (fixed by https://crrev.com/c/2321300). It may potentially affect other DWARF consumers. For .debug_ranges & .debug_loc: 1, an argument preferring 1 (GNU ld for .debug_ranges) over -2 is that: ``` {-1, -2} <<< base address selection entry {0, length} <<< address range ``` may create a situation where low_pc is greater than high_pc. So we use 1, the GNU ld behavior for .debug_ranges. For other .debug_* sections, there haven't been many reports. One issue is that bloaty (src/dwarf.cc) can incorrectly count address ranges in .debug_ranges. To reduce similar disruption, this patch changes the tombstone values to be similar to GNU ld. This does mean another behavior change to the default trunk behavior. Sorry about it. The default trunk behavior will be similar to release/11.x while we work on a transition plan for LLD users. Reviewed By: dblaikie, echristo Differential Revision: https://reviews.llvm.org/D84825	2020-08-06 15:30:08 -07:00
Yonghong Song	c50f5dece9	BPF: fix libLLVMBPFCodeGen.so build failure Buildbot reported a build failure when building shared library libLLVMBPFCodeGen.so with unknown reference to "createCFGSimplificationPass". Commit `87cba43402` ("BPF: add a SimplifyCFG IR pass during generic Scalar/IPO optimization") added an IR pass SimplifyCFG by BPF target. The commit called function createCFGSimplificationPass() defined in "Scalar" library. Add this library in Target/BPF/LLVMBuild.txt so shared library build can succeed.	2020-08-06 15:27:15 -07:00
Tony	ce74e97d9b	[AMDGPU] Correct missing sram-ecc target feature for gfx906 Differential Revision: https://reviews.llvm.org/D85476	2020-08-06 22:12:25 +00:00
Matt Arsenault	87b2af8140	AMDGPU/GlobalISel: Enable s_{and|or}n2_{b32|b64} patterns	2020-08-06 18:00:38 -04:00
Evgenii Stepanov	aa57cabae2	[msan] Support %ms in scanf. Differential Revision: https://reviews.llvm.org/D85350	2020-08-06 13:54:43 -07:00
Michael Kruse	f81bae9ff4	[flang][msvc] Do not use gcc/clang command line options for msvc. The command line options `-Wno-error` and `-Wno-unused-parameter` are specific to gcc/clang, do not use them when compiling with other compilers. This patch is part of the series to [[ http://lists.llvm.org/pipermail/flang-dev/2020-July/000448.html | make flang compilable with MS Visual Studio ]]. Reviewed By: isuruf Differential Revision: https://reviews.llvm.org/D85355	2020-08-06 15:46:52 -05:00
Roman Lebedev	be02adfad7	[InstCombine] Fold (x + C1) * (-1<<C2) --> (-C1 - x) * (1<<C2) Negator knows how to do this, but the one-use reasoning is getting a bit muddy here. We don't really want to increase instruction count, so we need to both lie that "IsNegation" and have a one-use check on the outermost LHS value.	2020-08-06 23:40:16 +03:00
Roman Lebedev	0c1c756a31	[InstCombine] Generalize %x * (-1<<C) --> (-%x) * (1<<C) fold Multiplication is commutative, and either of operands can be negative, so if the RHS is a negated power-of-two, we should try to make it true power-of-two (which will allow us to turn it into a left-shift), by trying to sink the negation down into LHS op. But, we shouldn't re-invent the logic for sinking negation, let's just use Negator for that. Tests and original patch by: Simon Pilgrim @RKSimon! Differential Revision: https://reviews.llvm.org/D85446	2020-08-06 23:39:53 +03:00
Roman Lebedev	a404acb86a	[NFC][InstCombine] Add some more tests for negation sinking into mul	2020-08-06 23:37:17 +03:00
Roman Lebedev	7ce76b06ec	[InstCombine] Fold sdiv exact X, -1<<C --> -(ashr exact X, C) While that does increase instruction count, shift is obviously better than a division. Name: base Pre: (1<<C1) >= 0 %o0 = shl i8 1, C1 %r = sdiv exact i8 C0, %o0 => %r = ashr exact i8 C0, C1 Name: neg %o0 = shl i8 -1, C1 %r = sdiv exact i8 C0, %o0 => %t0 = ashr exact i8 C0, C1 %r = sub i8 0, %t0 Name: reverse Pre: C1 != 0 && C1 u< 8 %t0 = ashr exact i8 C0, C1 %r = sub i8 0, %t0 => %o0 = shl i8 -1, C1 %r = sdiv exact i8 C0, %o0 https://rise4fun.com/Alive/MRplf	2020-08-06 23:37:16 +03:00
Roman Lebedev	47aec80e4a	[NFC][InstCombine] Negator: add a comment about negating exact arithmetic shift	2020-08-06 23:37:16 +03:00
Roman Lebedev	442cb88f53	[InstCombine] Generalize sdiv exact X, 1<<C --> ashr exact X, C fold to handle non-splat vectors	2020-08-06 23:37:15 +03:00
Roman Lebedev	8633a0d985	[NFC][InstCombine] Better tests for x s/EXACT (1 << y) pattern	2020-08-06 23:37:15 +03:00
Roman Lebedev	1c21635c94	[NFC][InstCombine] Tests for x s/EXACT (-1 << y) pattern	2020-08-06 23:37:15 +03:00
Adrian Prantl	0fa520af67	Unify the code that updates the ArchSpec after finding a fat binary with how it is done for a lean binary In particular this affects how target create --arch is handled — it allowed us to override the deployment target (a useful feature for the expression evaluator), but the fat binary case didn't. rdar://problem/66024437 Differential Revision: https://reviews.llvm.org/D85049 (cherry picked from commit 470bdd3caaab0b6e0ffed4da304244be40b78668)	2020-08-06 13:30:17 -07:00
Richard Smith	d6492d8744	Add -Wtautological-value-range-compare warning. This warning diagnoses cases where an expression is compared to a constant, and the comparison is tautological due to the form of the expression (but not merely due to its type). This applies in cases such as comparisons of bit-fields and the result of bit-masks. The new warning is added to the Clang diagnostic group -Wtautological-constant-in-range-compare but not to the formerly-equivalent GCC-compatibility diagnostic group -Wtype-limits, which retains its old meaning of diagnosing only tautological comparisons to extremal values of a type (eg, int > INT_MAX). Reviewed By: rtrieu Differential Revision: https://reviews.llvm.org/D85256	2020-08-06 13:28:50 -07:00
Craig Topper	ffc248f3b8	[LegalTypes] Move VSELECT node creation out of WidenVSELECTAndMask and push to 2 of the 3 callers. One of the callers only wants the condition, but the vselect can be simplified by getNode making it hard or impossible to retrieve the condition. Instead, return the condition and make the other 2 callers responsible for creating the vselect node using the condition. Rename the function to WidenVSELECTMask accordingly. Differential Revision: https://reviews.llvm.org/D85468	2020-08-06 13:18:16 -07:00
Craig Topper	4df38a5589	[X86] Optimize out a few extra strlen calls in getX86TargetCPU. NFCI We had a conversion from const char * to StringRef and a conversion from const char * to std::string. These both do their own strlen call if the compiler doesn't figure out how to share them. By adding the temporary StringRef we can convert it to std::string instead. The other case is to use a StringSwitch<StringRef> instead of StringSwitch<const char *> since the output values of the switch are string literals. This allows the length to be computed at compile time. Otherwise we have to convert from const char * to std::string after the StringSwitch.	2020-08-06 13:18:15 -07:00
Craig Topper	e1cad4234c	[X86] Make getX86TargetCPU return std::string instead of const char *. Remove call to MakeArgString. NFCI I believe this function used to be called directly from X86 specific code and was used to immediately create the -target-cpu command line. A later refactoring changed it to be called from a generic getCPU function that returns std::string. So on some paths we created a string using MakeArgString, converted that to std::string, then called MakeArgString again from that. Instead just return std::string directly like the other targets.	2020-08-06 13:18:15 -07:00
Yonghong Song	87cba43402	BPF: add a SimplifyCFG IR pass during generic Scalar/IPO optimization The following bpf linux kernel selftest failed with latest llvm: $ ./test_progs -n 7/10 ... The sequence of 8193 jumps is too complex. verification time 126272 usec stack depth 320 processed 114799 insns (limit 1000000) ... libbpf: failed to load object 'pyperf600_nounroll.o' test_bpf_verif_scale:FAIL:110 #7/10 pyperf600_nounroll.o:FAIL #7 bpf_verif_scale:FAIL After some investigation, I found the following llvm patch https://reviews.llvm.org/D84108 is responsible. The patch disabled hoisting common instructions in SimplifyCFG by default. Later on, the code changes and a SimplifyCFG phase with hoisting on cannot do the work any more. A test is provided to demonstrate the problem. The IR before simplifyCFG looks like: for.cond: %i.0 = phi i32 [ 0, %entry ], [ %inc, %for.inc ] %cmp = icmp ult i32 %i.0, 6 br i1 %cmp, label %for.body, label %for.cond.cleanup for.cond.cleanup: %2 = load i8, i8* %frame_ptr, align 8, !tbaa !2 %cmp2 = icmp eq i8* %2, null %conv = zext i1 %cmp2 to i32 call void @llvm.lifetime.end.p0i8(i64 8, i8* nonnull %1) #3 call void @llvm.lifetime.end.p0i8(i64 8, i8* nonnull %0) #3 ret i32 %conv for.body: %3 = load i8, i8* %frame_ptr, align 8, !tbaa !2 %tobool.not = icmp eq i8* %3, null br i1 %tobool.not, label %for.inc, label %land.lhs.true The first two insns of `for.cond.cleanup` and `for.body`, load and icmp, can be hoisted to `for.cond` block. With Patch D84108, the optimization is delayed. But unfortunately, later on loop rotation added additional phi nodes to `for.body` and hoisting cannot be done any more. Note such a hoisting is beneficial to bpf programs as bpf verifier does path sensitive analysis and verification. The hoisting prevents reloading from stack which will assume conservative value and increase exploited insns. In this case, it caused verifier failure. To fix this problem, I added an IR pass from bpf target to perform an additional simplifycfg with hoisting common inst enabled. Differential Revision: https://reviews.llvm.org/D85434	2020-08-06 13:16:00 -07:00
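
Two of the InstCombine commits listed above (be02adfad7 and 7ce76b06ec) describe arithmetic folds that can be sanity-checked by brute force at a small bit width. The sketch below is not LLVM code and is not how the commits were verified (7ce76b06ec cites an Alive proof). It is a minimal illustration that models i8 two's-complement arithmetic in Python, with the helper names `wrap`, `check_mul_negation_fold` and `check_sdiv_exact_fold` invented for this example. It assumes that `sdiv exact` is only defined when the division has no remainder and does not overflow, so those inputs are skipped.

```python
BITS = 8
INT_MIN = -(1 << (BITS - 1))
INT_MAX = (1 << (BITS - 1)) - 1


def wrap(v):
    """Reduce a Python int to a signed two's-complement value of BITS bits."""
    v &= (1 << BITS) - 1
    return v - (1 << BITS) if v > INT_MAX else v


def check_mul_negation_fold():
    """Check (x + C1) * (-1<<C2) == (-C1 - x) * (1<<C2) for every i8 input."""
    checked = 0
    for c2 in range(BITS):
        for c1 in range(INT_MIN, INT_MAX + 1):
            for x in range(INT_MIN, INT_MAX + 1):
                before = wrap(wrap(x + c1) * wrap(-1 << c2))
                after = wrap(wrap(-c1 - x) * wrap(1 << c2))
                assert before == after, (x, c1, c2)
                checked += 1
    return checked


def check_sdiv_exact_fold():
    """Check sdiv exact X, (-1<<C) == -(ashr exact X, C) on defined inputs."""
    checked = 0
    for c in range(BITS):
        divisor = wrap(-1 << c)
        for x in range(INT_MIN, INT_MAX + 1):
            if x % divisor != 0:
                continue  # 'exact' requires a zero remainder
            if x == INT_MIN and divisor == -1:
                continue  # signed division overflow, treated as undefined here
            before = x // divisor  # exact division, so no rounding occurs
            after = wrap(-(x >> c))  # Python's >> on ints is an arithmetic shift
            assert before == after, (x, c)
            checked += 1
    return checked


if __name__ == "__main__":
    print("mul fold cases checked:", check_mul_negation_fold())
    print("sdiv exact fold cases checked:", check_sdiv_exact_fold())
```

Exhaustive checking is only practical because the width is 8 bits. It says nothing about the one-use heuristics or the non-splat vector handling that the commit messages discuss.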


362766 Commits
