llvm-project

Commit Graph

Author	SHA1	Message	Date
Jessica Paquette	c8a282bcf7	[GlobalISel] Fix computing known bits for loads with range metadata In GlobalISel, if you have a load into a small type with a range, you'll hit an assert if you try to compute known bits on it starting at a larger type. e.g. ``` %x:_(s8) = G_LOAD %whatever(p0) :: (load 1 ... !range !n) ... %y:_(s32) = G_SOMETHING %x ``` When we walk through G_SOMETHING and hit the load, the width of our known bits is 32. However, the width of the range is going to be 8. This will cause us to hit an assert. To fix this, make computeKnownBitsFromRangeMetadata zero extend or truncate the range type to match the bitwidth of the known bits we're calculating. Add a testcase in CodeGen/GlobalISel/KnownBitsTest.cpp to reflect that this works now. https://reviews.llvm.org/D85375	2020-08-06 16:47:07 -07:00
Adrian Prantl	243903f326	Factor out common code from the iPhone/AppleTV/WatchOS simulator platform plugins. (NFC) The implementation of these classes was copied & pasted from the iPhone simulator plugin with only a handful of configuration parameters substituted. This patch moves the redundant implementations into the base class PlatformAppleSimulator. Differential Revision: https://reviews.llvm.org/D85243	2020-08-06 16:36:58 -07:00
Matt Arsenault	1ad051dd8c	GlobalISel: Implement lower for G_INSERT_VECTOR_ELT	2020-08-06 19:29:17 -04:00
Mark Mentovai	92d5839297	[gn build] mac: use frameworks instead of libs where appropriate As of GN 3028c6a426a4, the hack that transformed "libs" ending in ".framework" from -l arguments to -framework arguments has been removed. Instead, "frameworks" must be used, and the toolchain must provide support. Differential Revision: https://reviews.llvm.org/D84219	2020-08-06 18:58:57 -04:00
Arthur Eubanks	039fb7f68a	[NewPM][GuardWidening] Fix loop guard widening tests under NPM Reviewed By: ychen, asbirlea Differential Revision: https://reviews.llvm.org/D85394	2020-08-06 15:32:59 -07:00
Fangrui Song	004be4037e	[ELF] Change tombstone values to (.debug_ranges/.debug_loc) 1 and (other .debug_) 0 tl;dr See D81784 for the 'tombstone value' concept. This patch changes our behavior to be almost the same as GNU ld (except that we also use 1 for .debug_loc): .debug_ranges & .debug_loc: 1 (LLD<11: 0+addend; GNU ld uses 1 for .debug_ranges) * .debug_: 0 (LLD<11: 0+addend; GNU ld uses 0; future LLD: -1) We make the tweaks because: 1) The new tombstone is novel and needs more time to be adopted by consumers before it's the default. 2) The old (gold) strategy had problems with zero-length functions - so rather than going back that, we're going to the GNU ld strategy which doesn't have that problem. 3) One slight tweak to (2) is to apply the .debug_ranges workaround to .debug_loc for the same reasons it applies to debug_ranges - to avoid terminating lists early. ----- http://lists.llvm.org/pipermail/llvm-dev/2020-July/143482.html The tombstone value -1 in .debug_line caused problems to lldb (fixed by D83957; will be included in 11.0.0) and breakpad (fixed by https://crrev.com/c/2321300). It may potentially affects other DWARF consumers. For .debug_ranges & .debug_loc: 1, an argument preferring 1 (GNU ld for .debug_ranges) over -2 is that: ``` {-1, -2} <<< base address selection entry {0, length} <<< address range ``` may create a situation where low_pc is greater than high_pc. So we use 1, the GNU ld behavior for .debug_ranges For other .debug_ sections, there haven't been many reports. One issue is that bloaty (src/dwarf.cc) can incorrectly count address ranges in .debug_ranges . To reduce similar disruption, this patch changes the tombstone values to be similar to GNU ld. This does mean another behavior change to the default trunk behavior. Sorry about it. The default trunk behavior will be similar to release/11.x while we work on a transition plan for LLD users. Reviewed By: dblaikie, echristo Differential Revision: https://reviews.llvm.org/D84825	2020-08-06 15:30:08 -07:00
Yonghong Song	c50f5dece9	BPF: fix libLLVMBPFCodeGen.so build failure Buildbot reported a build failure when building shared library libLLVMBPFCodeGen.so with unknown reference to "createCFGSimplificationPass". Commit `87cba43402` ("BPF: add a SimplifyCFG IR pass during generic Scalar/IPO optimization") added an IR pass SimplifyCFG by BPF target. The commit called function createCFGSimplificationPass() defined in "Scalar" library. Add this library in Target/BPF/LLVMBuild.txt so shared library build can succeed.	2020-08-06 15:27:15 -07:00
Tony	ce74e97d9b	[AMDGPU] Correct missing sram-ecc target feature for gfx906 Differential Revision: https://reviews.llvm.org/D85476	2020-08-06 22:12:25 +00:00
Matt Arsenault	87b2af8140	AMDGPU/GlobalISel: Enable s_{and\|or}n2_{b32\|b64} patterns	2020-08-06 18:00:38 -04:00
Evgenii Stepanov	aa57cabae2	[msan] Support %ms in scanf. Differential Revision: https://reviews.llvm.org/D85350	2020-08-06 13:54:43 -07:00
Michael Kruse	f81bae9ff4	[flang][msvc] Do not use gcc/clang command line options for msvc. The command line options `-Wno-error` and `-Wno-unused-parameter` are specific to gcc/clang, do not use them when compiling with other compilers. This patch is part of the series to [[ http://lists.llvm.org/pipermail/flang-dev/2020-July/000448.html \| make flang compilable with MS Visual Studio ]]. Reviewed By: isuruf Differential Revision: https://reviews.llvm.org/D85355	2020-08-06 15:46:52 -05:00
Roman Lebedev	be02adfad7	[InstCombine] Fold (x + C1) * (-1<<C2) --> (-C1 - x) * (1<<C2) Negator knows how to do this, but the one-use reasoning is getting a bit muddy here, we don't really want to increase instruction count, so we need to both lie that "IsNegation" and have an one-use check on the outermost LHS value.	2020-08-06 23:40:16 +03:00
Roman Lebedev	0c1c756a31	[InstCombine] Generalize %x * (-1<<C) --> (-%x) * (1<<C) fold Multiplication is commutative, and either of operands can be negative, so if the RHS is a negated power-of-two, we should try to make it true power-of-two (which will allow us to turn it into a left-shift), by trying to sink the negation down into LHS op. But, we shouldn't re-invent the logic for sinking negation, let's just use Negator for that. Tests and original patch by: Simon Pilgrim @RKSimon! Differential Revision: https://reviews.llvm.org/D85446	2020-08-06 23:39:53 +03:00
Roman Lebedev	a404acb86a	[NFC][InstCombine] Add some more tests for negation sinking into mul	2020-08-06 23:37:17 +03:00
Roman Lebedev	7ce76b06ec	[InstCombine] Fold sdiv exact X, -1<<C --> -(ashr exact X, C) While that does increases instruction count, shift is obviously better than a division. Name: base Pre: (1<<C1) >= 0 %o0 = shl i8 1, C1 %r = sdiv exact i8 C0, %o0 => %r = ashr exact i8 C0, C1 Name: neg %o0 = shl i8 -1, C1 %r = sdiv exact i8 C0, %o0 => %t0 = ashr exact i8 C0, C1 %r = sub i8 0, %t0 Name: reverse Pre: C1 != 0 && C1 u< 8 %t0 = ashr exact i8 C0, C1 %r = sub i8 0, %t0 => %o0 = shl i8 -1, C1 %r = sdiv exact i8 C0, %o0 https://rise4fun.com/Alive/MRplf	2020-08-06 23:37:16 +03:00
Roman Lebedev	47aec80e4a	[NFC][InstCombine] Negator: add a comment about negating exact arithmentic shift	2020-08-06 23:37:16 +03:00
Roman Lebedev	442cb88f53	[InstCombine] Generalize sdiv exact X, 1<<C --> ashr exact X, C fold to handle non-splat vectors	2020-08-06 23:37:15 +03:00
Roman Lebedev	8633a0d985	[NFC][InstCombine] Better tests for x s/EXACT (1 << y) pattern	2020-08-06 23:37:15 +03:00
Roman Lebedev	1c21635c94	[NFC][InstCombine] Tests for x s/EXACT (-1 << y) pattern	2020-08-06 23:37:15 +03:00
Adrian Prantl	0fa520af67	Unify the code that updates the ArchSpec after finding a fat binary with how it is done for a lean binary In particular this affects how target create --arch is handled — it allowed us to override the deployment target (a useful feature for the expression evaluator), but the fat binary case didn't. rdar://problem/66024437 Differential Revision: https://reviews.llvm.org/D85049 (cherry picked from commit 470bdd3caaab0b6e0ffed4da304244be40b78668)	2020-08-06 13:30:17 -07:00
Richard Smith	d6492d8744	Add -Wtautological-value-range-compare warning. This warning diagnoses cases where an expression is compared to a constant, and the comparison is tautological due to the form of the expression (but not merely due to its type). This applies in cases such as comparisons of bit-fields and the result of bit-masks. The new warning is added to the Clang diagnostic group -Wtautological-constant-in-range-compare but not to the formerly-equivalent GCC-compatibility diagnostic group -Wtype-limits, which retains its old meaning of diagnosing only tautological comparisons to extremal values of a type (eg, int > INT_MAX). Reviewed By: rtrieu Differential Revision: https://reviews.llvm.org/D85256	2020-08-06 13:28:50 -07:00
Craig Topper	ffc248f3b8	[LegalTypes] Move VSELECT node creation out of WidenVSELECTAndMask and push to 2 of the 3 callers. One of the callers only wants the condition, but the vselect can be simplified by getNode making it hard or impossible to retrieve the condition. Instead, return the condition and make the other 2 callers responsible for creating the vselect node using the condition. Rename the function to WidenVSELECTMask accordingly. Differential Revision: https://reviews.llvm.org/D85468	2020-08-06 13:18:16 -07:00
Craig Topper	4df38a5589	[X86] Optimize out a few extra strlen calls in getX86TargetCPU. NFCI We had a conversion from const char * to StringRef and const char * to std::string conversion. These both do their own strlen call if the compiler doens't figure out how to share them. By adding the temporary StringRef we can convert it to std::string instead. The other case is to use a StringSwitch<StringRef> instead of StringSwitch<const char > since the output values of the switch are string literals. This allows the length to be computed at compile time. Otherwise we have to convert from const char to std::string after the StringSwitch.	2020-08-06 13:18:15 -07:00
Craig Topper	e1cad4234c	[X86] Make getX86TargetCPU return std::string instead of const char *. Remove call to MakeArgString. NFCI I believe this function used to be called directly from X86 specific code and was used to immediately create -target-cpu command line. A later refactoring changed it to to be called from a generic getCPU function that returns std::string. So on some paths we created a string using MakeArgString converted that to std::string then called MakeArgString again from that. Instead just return std::string directly like the other targets.	2020-08-06 13:18:15 -07:00
Yonghong Song	87cba43402	BPF: add a SimplifyCFG IR pass during generic Scalar/IPO optimization The following bpf linux kernel selftest failed with latest llvm: $ ./test_progs -n 7/10 ... The sequence of 8193 jumps is too complex. verification time 126272 usec stack depth 320 processed 114799 insns (limit 1000000) ... libbpf: failed to load object 'pyperf600_nounroll.o' test_bpf_verif_scale:FAIL:110 #7/10 pyperf600_nounroll.o:FAIL #7 bpf_verif_scale:FAIL After some investigation, I found the following llvm patch https://reviews.llvm.org/D84108 is responsible. The patch disabled hoisting common instructions in SimplifyCFG by default. Later on, the code changes and a SimplifyCFG phase with hoisting on cannot do the work any more. A test is provided to demonstrate the problem. The IR before simplifyCFG looks like: for.cond: %i.0 = phi i32 [ 0, %entry ], [ %inc, %for.inc ] %cmp = icmp ult i32 %i.0, 6 br i1 %cmp, label %for.body, label %for.cond.cleanup for.cond.cleanup: %2 = load i8, i8* %frame_ptr, align 8, !tbaa !2 %cmp2 = icmp eq i8* %2, null %conv = zext i1 %cmp2 to i32 call void @llvm.lifetime.end.p0i8(i64 8, i8* nonnull %1) #3 call void @llvm.lifetime.end.p0i8(i64 8, i8* nonnull %0) #3 ret i32 %conv for.body: %3 = load i8, i8* %frame_ptr, align 8, !tbaa !2 %tobool.not = icmp eq i8* %3, null br i1 %tobool.not, label %for.inc, label %land.lhs.true The first two insns of `for.cond.cleanup` and `for.body`, load and icmp, can be hoisted to `for.cond` block. With Patch D84108, the optimization is delayed. But unfortunately, later on loop rotation added addition phi nodes to `for.body` and hoisting cannot be done any more. Note such a hoisting is beneficial to bpf programs as bpf verifier does path sensitive analysis and verification. The hoisting preverts reloading from stack which will assume conservative value and increase exploited insns. In this case, it caused verifier failure. To fix this problem, I added an IR pass from bpf target to performance additional simplifycfg with hoisting common inst enabled. Differential Revision: https://reviews.llvm.org/D85434	2020-08-06 13:16:00 -07:00
Snehasish Kumar	8d943a928d	[NFC] Rename BBSectionsPrepare -> BasicBlockSections. Rename the BBSectionsPrepare pass as suggested by the review comment in https://reviews.llvm.org/D85368. Differential Revision: https://reviews.llvm.org/D85380	2020-08-06 13:12:06 -07:00
Adrian Prantl	f406a90a08	Add missing override to Makefile	2020-08-06 13:07:16 -07:00
Sanjay Patel	250a167c41	[InstSimplify] avoid crashing by trying to rem-by-zero Bug was noted in the post-commit comments for: rGe8760bb9a8a3	2020-08-06 16:06:31 -04:00
Jonas Devlieghere	ba37b144e6	[LLDB] Skip test_launch_simple from TestTargetAPI.py when remote	2020-08-06 13:03:18 -07:00
Matt Arsenault	30eeb742f1	clang: Use byref for aggregate kernel arguments Add address space to indirect abi info and use it for kernels. Previously, indirect arguments assumed assumed a stack passed object in the alloca address space using byval. A stack pointer is unsuitable for kernel arguments, which are passed in a separate, constant buffer with a different address space. Start using the new byref for aggregate kernel arguments. Previously these were emitted as raw struct arguments, and turned into loads in the backend. These will lower identically, although with byref you now have the option of applying an explicit alignment. In the future, a reasonable implementation would use byref for all kernel arguments (this would be a practical problem at the moment due to losing things like noalias on pointer arguments). This is mostly to avoid fighting the optimizer's treatment of aggregate load/store. SROA and instcombine both turn aggregate loads and stores into a long sequence of element loads and stores, rather than the optimizable memcpy I would expect in this situation. Now an explicit memcpy will be introduced up-front which is better understood and helps eliminate the alloca in more situations. This skips using byref in the case where HIP kernel pointer arguments in structs are promoted to global pointers. At minimum an additional patch is needed to allow coercion with indirect arguments. This also skips using it for OpenCL due to the current workaround used to support kernels calling kernels. Distinct function bodies would need to be generated up front instead of emitting an illegal call.	2020-08-06 15:52:26 -04:00
Sanjay Patel	c9bcc237a2	[VectorCombine] add tests for load+insert; NFC	2020-08-06 15:45:02 -04:00
Adrian Prantl	05df9cc703	Correctly detect legacy iOS simulator Mach-O objectfiles The code in ObjectFileMachO didn't disambiguate between ios and ios-simulator object files for Mach-O objects using the legacy ambiguous LC_VERSION_MIN load commands. This used to not matter before taught ArchSpec that ios and ios-simulator are no longer compatible. rdar://problem/66545307 Differential Revision: https://reviews.llvm.org/D85358	2020-08-06 12:40:45 -07:00
cgyurgyik	128bf458ab	[libc] Add tolower, toupper implementation. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D85326	2020-08-06 15:21:38 -04:00
Anton Afanasyev	a7478fab6c	[SLP] Fix order of `insertelement`/`insertvalue` seed operands Summary: This patch takes the indices operands of `insertelement`/`insertvalue` into account while generation of seed elements for `findBuildAggregate()`. This function has kept the original order of `insert`s before. Also this patch optimizes `findBuildAggregate()` preventing it from redundant temporary vector allocations and its multiple reversing. Fixes llvm.org/pr44067 Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83779	2020-08-06 22:09:24 +03:00
Evgenii Stepanov	189ba3db86	Fix CFI issues in <future> This change fixes errors reported by Control Flow Integrity (CFI) checking when using `std::packaged_task`. The errors mostly stem from casting the underlying storage (`__buf_`) to `__base`, even if it is uninitialized. The solution is to wrap `__base` access to `__buf_` behind a getter marked with _LIBCPP_NO_CFI. Differential Revision: https://reviews.llvm.org/D82627	2020-08-06 12:05:22 -07:00
Matt Arsenault	87ce06e315	Add freeze keyword to IR emacs mode	2020-08-06 14:58:28 -04:00
MaheshRavishankar	25e8668e88	[mlir][SPIR-V] Fix wrongly placed Rationale section. Differential Revision: https://reviews.llvm.org/D85461	2020-08-06 11:51:42 -07:00
Jonas Devlieghere	86aa8e6363	[lldb] Use target.GetLaunchInfo() instead of creating an empty one. Update tests that were creating an empty LaunchInfo instead of using the one coming from the target. This ensures target properties are honored.	2020-08-06 11:51:26 -07:00
Aleksandr Platonov	9f24148b21	[clangd] Fix crash in bugprone-bad-signal-to-kill-thread clang-tidy check. Inside clangd, clang-tidy checks don't see preprocessor events in the preamble. This leads to `Token::PtrData == nullptr` for tokens that the macro is defined to. E.g. `#define SIGTERM 15`: - Token::Kind == tok::numeric_constant (Token::isLiteral() == true) - Token::UintData == 2 - Token::PtrData == nullptr As the result of this, bugprone-bad-signal-to-kill-thread check crashes at null-dereference inside clangd. Reviewed By: hokein Differential Revision: https://reviews.llvm.org/D85417	2020-08-06 21:45:21 +03:00
dfukalov	4ccc38813e	[AMDGPU][CostModel] Add f16, f64 and contract cases to fused costs estimation. Add cases of fused fmul+fadd/fsub with f16 and f64 operands to cost model. Also added operations with contract attribute. Fixed line endings in test. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D84995	2020-08-06 21:43:27 +03:00
Matt Arsenault	e00201539f	GlobalISel: Implement fewerElementsVector for G_EXTRACT_VECTOR_ELT Use the same basic strategy as LegalizeVectorTypes. Try to index into smaller pieces if there's a constant index, and otherwise fall back to a stack temporary.	2020-08-06 14:33:16 -04:00
Arthur Eubanks	d0acd97c68	[NewPM][LoopUnswitch] Pin loop-unswitch to legacy PM or use simple-loop-unswitch As mentioned in http://lists.llvm.org/pipermail/llvm-dev/2020-July/143395.html, loop-unswitch has not been ported to the NPM. Instead people are using simple-loop-unswitch. Pin all tests in Transforms/LoopUnswitch to legacy PM and replace all other uses of loop-unswitch with simple-loop-unswitch. One test that didn't fit into the above was 2014-06-21-congruent-constant.ll which seems to only pass with loop-unswitch. That is also pinned to legacy PM. Now all tests containing "-loop-unswitch" anywhere in the test succeed with NPM turned on by default. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D85360	2020-08-06 10:56:00 -07:00
Aaron En Ye Shi	96c2d5e99e	[HIP] Ignore invalid ar linker options Instead of accepting the same arguments as regular linker, the static linker will only accept input files. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D85442	2020-08-06 17:39:41 +00:00
Fred Riss	99298c7fc5	[lldb/testsuite] Change get_debugserver_exe to support Rosetta In order to be able to run the debugserver tests against the Rosetta debugserver, detect the Rosetta run configuration and return the system Rosetta debugserver.	2020-08-06 10:38:30 -07:00
Arthur Eubanks	5bb6b8250a	[NewPM] Pin -assumption-cache-tracker tests to legacy PM All tests have corresponding NPM RUN lines. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D85395	2020-08-06 10:38:03 -07:00
Matt Arsenault	1a0c0944c6	AMDGPU: Define raw/struct variants of buffer atomic fadd Somehow the new FP atomic buffer intrinsics ended up using the legacy style for buffer intrinsics.	2020-08-06 13:36:19 -04:00
Sterling Augustine	9dbdaea9a0	Remove unused variable "saved_opts". wattr_get is a macro, and the documentation states: "The parameter opts is reserved for future use, applications must supply a null pointer." In practice, passing a variable there is harmless, except that it is unused inside the macro, which causes unused variable warnings. The various places where	2020-08-06 10:20:21 -07:00
Matt Arsenault	eae9c54148	AArch64/GlobalISel: Fix verifier error after selecting returnaddress This was caching the wrong register to re-use later.	2020-08-06 13:18:05 -04:00
Simon Pilgrim	3b93464dcf	[SLP][X86] Regenerate sdiv test noticed in D83779. NFC.	2020-08-06 18:00:21 +01:00
Mircea Trofin	ca7973cf18	[NFC]{MLInliner] Point out the tests' model dependencies	2020-08-06 09:57:26 -07:00

1 2 3 4 5 ...

362741 Commits All Branches Search

362741 Commits

All Branches