llvm-project

Commit Graph

Author	SHA1	Message	Date
Nikita Popov	13e19d2e7c	Revert "[InstCombine] Canonicalize SPF_ABS to abs intrinc" This reverts commit `05d4c4ebc2`. mstorsjo reports a miscompile after this change in https://reviews.llvm.org/D87188#2281093. Reverting until I can investigate this.	2020-09-18 09:38:26 +02:00
Vitaly Buka	f16c4a3704	[NFC][fuzzer] Simplify StrcmpTest.cpp The test started to consistently fail after unrelated `2ffaa9a173`. Even before the patch it was possible to fail the test, e.g. -seed=1660180256 on my workstation. Also this checks do not look related to strcmp.	2020-09-18 00:36:48 -07:00
Andrew Wei	8f09cec8c9	[AArch64] Add tests for zext pattern match with AssertZext/AssertSext operand, NFC	2020-09-18 15:02:43 +08:00
Serge Pavlov	8a86261c51	[FPEnv] Use typed accessors in FPOptions Previously methods `FPOptions::get*` returned unsigned value even if the corresponding property was represented by specific enumeration type. With this change such methods return actual type of the property. It also allows printing value of a property as text rather than integer code. Differential Revision: https://reviews.llvm.org/D87812	2020-09-18 14:16:43 +07:00
Artur Bialas	5a733468e0	Revert "This is a test commit" This reverts commit `9d54b166c2`.	2020-09-18 08:43:53 +02:00
Artur Bialas	9d54b166c2	This is a test commit	2020-09-18 08:43:18 +02:00
Craig Topper	fb92f863f6	[X86] Add some demanded bits test cases for PDEP with constant mask The number of ones in the mask for the PDEP determines how many bits of the other operand are used. If the mask is constant we can use this to build a mask for SimplifyDemandedBits. This can be used to replace the extends in the test with anyextend.	2020-09-17 22:48:19 -07:00
Andrew Wei	992698cfbc	[AArch64] Emit zext move when the source of the zext is AssertZext or AssertSext When the source of the zext is AssertZext or AssertSext, it is hard to know any information about the upper 32 bits, so we should insert a zext move before emitting SUBREG_TO_REG to define the lower 32 bits. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D87771	2020-09-18 12:48:41 +08:00
Teresa Johnson	6e475e1288	Revert "[sanitizer] Add facility to print the full StackDepot" This reverts commit `2ffaa9a173`. There were 2 reported bot failures that need more investigation: http://lab.llvm.org:8011/builders/sanitizer-windows/builds/69871/steps/stage%201%20check/logs/stdio This one is in my new test. http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fuzzer/builds/39187/steps/check-fuzzer/logs/stdio This one seems completely unrelated.	2020-09-17 21:18:16 -07:00
Tue Ly	f55963d501	[libc] Add implementation for hypotf Truncating the sum of squares, and then use shift-and-add algorithm to compute its square root. Required MPFR testing infra is updated in https://reviews.llvm.org/D87514 Differential Revision: https://reviews.llvm.org/D87516	2020-09-17 23:28:36 -04:00
Teresa Johnson	2ffaa9a173	[sanitizer] Add facility to print the full StackDepot Split out of D87120 (memory profiler). Added unit testing of the new printing facility. Differential Revision: https://reviews.llvm.org/D87792	2020-09-17 18:12:22 -07:00
Vitaly Buka	55edf7039e	[NFC] clang-format one line	2020-09-17 18:03:55 -07:00
Vitaly Buka	03358becbf	[NFC][Lsan] Fix zero-sized array compilation error	2020-09-17 17:59:52 -07:00
Roland McGrath	27f34540ea	[scudo/standalone] Don't define test main function for Fuchsia Fuchsia's unit test library provides the main function by default. Reviewed By: cryptoad Differential Revision: https://reviews.llvm.org/D87809	2020-09-17 17:42:53 -07:00
Rahul Joshi	ea237e2c8e	[MLIR] Fix build failure due to https://reviews.llvm.org/D87059 . - Remove spurious ; - Make comparison object invokable as const. Differential Revision: https://reviews.llvm.org/D87872	2020-09-17 16:57:53 -07:00
Sean Silva	bae6374205	[mlir][shape] Add `shape.cstr_require %bool` This op is a catch-all for creating witnesses from various random kinds of constraints. In particular, I when dealing with extents directly, which are of `index` type, one can directly use std ops for calculating the predicates, and then use cstr_require for the final conversion to a witness. Differential Revision: https://reviews.llvm.org/D87871	2020-09-17 16:56:43 -07:00
Vedant Kumar	4926a5ee63	[lldb] Clarify docstring for SBBlock::IsInlined, NFC Previously, there was a little ambiguity about whether IsInlined should return true for an inlined lexical block, since technically the lexical block would not represent an inlined function (it'd just be contained within one). Edit suggested by Jim Ingham.	2020-09-17 16:54:58 -07:00
Amara Emerson	f5898f8c2d	[AArch64][GlobalISel] Make G_STORE <8 x s8> legal.	2020-09-17 16:42:18 -07:00
Amara Emerson	196e2f97b7	[AArch64][GlobalISel] clang-format AArch64LegalizerInfo.cpp. NFC.	2020-09-17 16:41:10 -07:00
Amy Kwan	6f3c0991bf	[PowerPC] Add Set Boolean Condition Instruction Definitions and MC Tests This patch adds the instruction definitions and assembly/disassembly tests for the set boolean condition instructions. This also includes the negative, and reverse variants of the instruction. Differential Revision: https://reviews.llvm.org/D86252	2020-09-17 18:20:54 -05:00
Amy Kwan	2c3bc918db	[PowerPC] Implement Vector Count Mask Bits builtins in LLVM/Clang This patch implements the vec_cntm function prototypes in altivec.h in order to utilize the vector count mask bits instructions introduced in Power10. Differential Revision: https://reviews.llvm.org/D82726	2020-09-17 18:20:53 -05:00
Philip Reames	b4013f9c7f	[MemorySSA] Fix an unused variable warning [NFC]	2020-09-17 16:07:59 -07:00
Rahul Joshi	8069844577	[MLIR][TableGen] Automatic detection and elimination of redundant methods - Change OpClass new method addition to find and eliminate any existing methods that are made redundant by the newly added method, as well as detect if the newly added method will be redundant and return nullptr in that case. - To facilitate that, add the notion of resolved and unresolved parameters, where resolved parameters have each parameter type known, so that redundancy checks on methods with same name but different parameter types can be done. - Eliminate existing code to avoid adding conflicting/redundant build methods and rely on this new mechanism to eliminate conflicting build methods. Fixes https://bugs.llvm.org/show_bug.cgi?id=47095 Differential Revision: https://reviews.llvm.org/D87059	2020-09-17 16:04:37 -07:00
Zhaoshi Zheng	1c466477ad	[RISCV] Support Shadow Call Stack Currenlty assume x18 is used as pointer to shadow call stack. User shall pass flags: "-fsanitize=shadow-call-stack -ffixed-x18" Runtime supported is needed to setup x18. If SCS is desired, all parts of the program should be built with -ffixed-x18 to maintain inter-operatability. There's no particuluar reason that we must use x18 as SCS pointer. Any register may be used, as long as it does not have designated purpose already, like RA or passing call arguments. Differential Revision: https://reviews.llvm.org/D84414	2020-09-17 16:02:35 -07:00
Philip Reames	b04c181ed7	[AArch64] Enable implicit null check transformation This change enables the generic implicit null transformation for the AArch64 target. As background for those unfamiliar with our implicit null check support: An implicit null check is the use of a signal handler to catch and redirect to a handler a null pointer. Specifically, it's replacing an explicit conditional branch with such a redirect. This is only done for very cold branches under frontend control w/appropriate metadata. FAULTING_OP is used to wrap the faulting instruction. It is modelled as being a conditional branch to reflect the fact it can transfer control in the CFG. FAULTING_OP does not need to be an analyzable branch to achieve it's purpose. (Or at least, that's the x86 model. I find this slightly questionable.) When lowering to MC, we convert the FAULTING_OP back into the actual instruction, record the labels, and lower the original instruction. As can be seen in the test changes, currently the AArch64 backend does not eliminate the unconditional branch to the fallthrough block. I've tried two approaches, neither of which worked. I plan to return to this in a separate change set once I've wrapped my head around the interactions a bit better. (X86 handles this via AllowModify on analyzeBranch, but adding the obvious code causing BranchFolding to crash. I haven't yet figured out if it's a latent bug in BranchFolding, or something I'm doing wrong.) Differential Revision: https://reviews.llvm.org/D87851	2020-09-17 16:00:19 -07:00
Arthur Eubanks	f2f0474c93	[test] Fix FullUnroll.ll I believe the intention of this test added in https://reviews.llvm.org/D71687 was to test LoopFullUnrollPass with clang's -fno-unroll-loops, not its interaction with optnone. Loop unrolling passes don't run under optnone/-O0. Also added back unintentionally removed -disable-loop-unrolling from https://reviews.llvm.org/D85578. Reviewed By: echristo Differential Revision: https://reviews.llvm.org/D86485	2020-09-17 15:56:13 -07:00
Quentin Colombet	99e865b618	[TargetRegisterInfo] Add a couple of target hooks for the greedy register allocator Before this patch, the last chance recoloring and deferred spilling techniques were solely controled by command line options. This patch adds target hooks for these two techniques so that it is easier for backend writers to override the default behavior. The default behavior of the hooks preserves the default values of the related command line options. NFC	2020-09-17 15:23:15 -07:00
Zhaoshi Zheng	cab780a5a0	[NFC] Test Commit	2020-09-17 15:14:14 -07:00
Derek Schuff	0ff28fa6a7	Support dwarf fission for wasm object files Initial support for dwarf fission sections (-gsplit-dwarf) on wasm. The most interesting change is support for writing 2 files (.o and .dwo) in the wasm object writer. My approach moves object-writing logic into its own function and calls it twice, swapping out the endian::Writer (W) in between calls. It also splits the import-preparation step into its own function (and skips it when writing a dwo). Differential Revision: https://reviews.llvm.org/D85685	2020-09-17 14:42:41 -07:00
Florian Hahn	a0017c2bc2	[MemorySSA] Be more conservative when traversing MemoryPhis. I think we need to be even more conservative when traversing memory phis, to make sure we catch any loop carried dependences. This approach updates fillInCurrentPair to use unknown sizes for locations when we walk over a phi, unless the location is guaranteed to be loop-invariant for any possible loop. Using an unknown size for locations should ensure we catch all memory accesses to locations after the given memory location, which includes loop-carried dependences. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D87778	2020-09-17 22:09:53 +01:00
Arthur Eubanks	179a22e807	[NewPM] Fix pr45927.ll under NPM	2020-09-17 13:57:55 -07:00
Alexander Shaposhnikov	53ba045f48	[llvm-install-name-tool] Update the command-line guide	2020-09-17 13:44:26 -07:00
Nikita Popov	05d4c4ebc2	[InstCombine] Canonicalize SPF_ABS to abs intrinc Enable canonicalization of SPF_ABS and SPF_NABS to the abs intrinsic. To be conservative, the one-use check on the comparison is retained, this may be relaxed if all goes well. It's pretty likely that this will uncover places that missing handling for the abs() intrinsic. Please report any seen performance regressions. Differential Revision: https://reviews.llvm.org/D87188	2020-09-17 22:28:34 +02:00
Whitney Tsang	1cee33e9db	[LoopUnrollAndJam] Allow unroll and jam loops forced by user. Summary: Allow unroll and jam loops forced by user. LoopUnrollAndJamPass is still disabled by default in the NPM pipeline, and can be controlled by -enable-npm-unroll-and-jam. Reviewed By: Meinersbur, dmgreen Differential Revision: https://reviews.llvm.org/D87786	2020-09-17 19:40:14 +00:00
Nikita Popov	91ce8e121b	[GVN] Use that assume(!X) implies X==false (PR47496) We already use that assume(X) implies X==true, do the same for assume(!X) implying X==false. This fixes PR47496.	2020-09-17 21:34:44 +02:00
Nikita Popov	59855b9d3b	[GVN] Add additional assume tests (NFC) The other assume tests seem to be dealing with equalities in particular. Test implication for the condition itself, especially the negated case from PR47496.	2020-09-17 21:34:43 +02:00
Florian Hahn	51973a607d	[SCEV] Add test cases for max BTC with loop guard info. This adds test cases for PR40961 and PR47247. They illustrate cases in which the max backedge-taken count can be improved by information from the loop guards.	2020-09-17 20:27:48 +01:00
Victor Huang	a4bb71b1c0	Disable hoisting MI to hotter basic blocks when using pgo This is a follow up patch for https://reviews.llvm.org/D63676 to enable the feature when using pgo. Differential Revision: https://reviews.llvm.org/D85240	2020-09-17 14:17:00 -05:00
Vitaly Buka	5813fca107	[Lsan] Use fp registers to search for pointers X86 can use xmm registers for pointers operations. e.g. for std::swap. I don't know yet if it's possible on other platforms. NT_X86_XSTATE includes all registers from NT_FPREGSET so the latter used only if the former is not available. I am not sure how reasonable to expect that but LLD has such fallback in NativeRegisterContextLinux_x86_64::ReadFPR. Reviewed By: morehouse Differential Revision: https://reviews.llvm.org/D87754	2020-09-17 12:16:28 -07:00
Jon Roelofs	c145a1ca25	AArch64::ArchKind's underlying type is uint64_t	2020-09-17 12:13:57 -07:00
LLVM GN Syncbot	667762c64e	[gn build] Port `7e4c6fb854`	2020-09-17 19:09:34 +00:00
Andrew Litteken	7e4c6fb854	[IRSim] Adding IR Instruction Mapper This introduces the IRInstructionMapper, and the associated wrapper for instructions, IRInstructionData, that maps IR level Instructions to unsigned integers. Mapping is done mainly by using the "isSameOperationAs" comparison between two instructions. If they return true, the opcode, result type, and operand types of the instruction are used to hash the instruction with an unsigned integer. The mapper accepts instruction ranges, and adds each resulting integer to a list, and each wrapped instruction to a separate list. At present, branches, phi nodes are not mapping and exception handling is illegal. Debug instructions are not considered. The different mapping schemes are tested in unittests/Analysis/IRSimilarityIdentifierTest.cpp Recommit of: `b04c1a9d31` Differential Revision: https://reviews.llvm.org/D86968	2020-09-17 14:06:16 -05:00
Cameron McInally	a35c7f3076	[SVE][WIP] Implement lowering for fixed length VSELECT to Scalable Map fixed length VSELECT to its Scalable equivalent. Differential Revision: https://reviews.llvm.org/D85364	2020-09-17 14:02:57 -05:00
Reid Kleckner	1e5b7e91aa	[PDB] Split TypeServerSource and extend type index map lifetime Extending the lifetime of these type index mappings does increase memory usage (+2% in my case), but it decouples type merging from symbol merging. This is a pre-requisite for two changes that I have in mind: - parallel type merging: speeds up slow type merging - defered symbol merging: avoid heap allocating (relocating) all symbols This eliminates CVIndexMap and moves its data into TpiSource. The maps are also split into a SmallVector and ArrayRef component, so that the ipiMap can alias the tpiMap for /Z7 object files, and so that both maps can simply alias the PDB type server maps for /Zi files. Splitting TypeServerSource establishes that all input types to be merged can be identified with two 32-bit indices: - The index of the TpiSource object - The type index of the record This is useful, because this information can be stored in a single 64-bit atomic word to enable concurrent hashtable insertion. One last change is that now all object files with debugChunks get a TpiSource, even if they have no type info. This avoids some null checks and special cases. Differential Revision: https://reviews.llvm.org/D87736	2020-09-17 11:53:10 -07:00
Amara Emerson	7d5b103483	[AArch64][GlobalISel] Widen G_EXTRACT_VECTOR_ELT element types if < 8b. In order to not unnecessarily promote the source vector to greater than our native vector size of 128b, I've added some cascading rules to widen based on the number of elements.	2020-09-17 11:50:33 -07:00
Amara Emerson	bea7749d03	[AArch64][GlobalISel] Make <8 x s16> and <16 x s8> legal for shifts.	2020-09-17 11:50:32 -07:00
Sanjay Patel	48a23bccf3	[VectorCombine] limit load+insert transform to one-use As discussed in: https://llvm.org/PR47558 ...there are several potential fixes/follow-ups visible in the test case, but this is the quickest and safest fix of the perf regression.	2020-09-17 14:29:15 -04:00
Craig Topper	3783d3bc7b	[X86] Don't match x87 register inline asm constraints unless the VT is floating point or its a clobber The register class picked will be the RFP80 register class which has a f80 VT. The code in SelectionDAGBuilder that generates copies around inline assembly doesn't know how to handle an integer and floating point type of different bit widths. The test case is derived from this https://godbolt.org/z/sEa659 which gcc accepts but clang crashes on. This patch just gives a more graceful error. I'm not sure if the single element struct case is special in gcc. Adding another field to the struct makes gcc reject it. If we want to support this correctly I think we need a change in the frontend to give us the true element type. Right now the frontend just realizes the constraint can take a memory argument so creates an integer type of the same size and bitcasts. Differential Revision: https://reviews.llvm.org/D87485	2020-09-17 11:26:50 -07:00
Navdeep Kumar	0602e8f77f	[MLIR][Affine] Add parametric tile size support for affine.for tiling Add support to tile affine.for ops with parametric sizes (i.e., SSA values). Currently supports hyper-rectangular loop nests with constant lower bounds only. Move methods - moveLoopBody() - getTileableBands() - checkTilingLegality() - tilePerfectlyNested() - constructTiledIndexSetHyperRect(*) to allow reuse with constant tile size API. Add a test pass -test-affine -parametric-tile to test parametric tiling. Differential Revision: https://reviews.llvm.org/D87353	2020-09-17 23:39:14 +05:30
Abhishek Varma	296e97ae8f	[MLIR] Support for return values in Affine.For yield Add support for return values in affine.for yield along the same lines as scf.for and affine.parallel. Signed-off-by: Abhishek Varma <abhishek.varma@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D87437	2020-09-17 23:34:59 +05:30

... 3 4 5 6 7 ...

366766 Commits All Branches Search

366766 Commits

All Branches