llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	76673c65e7	Regenerate PR19420 tests	2020-07-03 10:04:37 +01:00
Simon Pilgrim	50b25e0679	[InstCombine] Add some sext/trunc tests to show missing support for non-uniform vectors	2020-07-02 17:11:56 +01:00
Simon Pilgrim	769b979930	[InstCombine] Add (vXi1 trunc(lshr(x,c))) -> icmp_eq(and(x,c')) support for non-uniform vectors As noted on PR46531, we were only performing this transform on uniform vectors as we were using the m_APInt pattern matcher to extract the shift amount. Differential Revision: https://reviews.llvm.org/D83035	2020-07-02 16:56:33 +01:00
Simon Pilgrim	103d62e131	[InstCombine] Add some (vXi1 trunc(lshr(x,c))) -> icmp_eq(and(x,c')) tests for vectors with undef elements Suggested on D83035	2020-07-02 16:04:30 +01:00
Simon Pilgrim	23eeae5526	Regenerate sext/trunc tests and replace %tmp variable names to silence update_test_checks warnings	2020-07-02 14:37:21 +01:00
Simon Pilgrim	421c02e5c6	[InstCombine] Add some (vXi1 trunc(lshr(x,c))) -> icmp_eq(and(x,c')) tests for non-uniform vectors As noticed on PR46531	2020-07-02 11:56:51 +01:00
Simon Pilgrim	11c4bb0c7c	Regenerate apint-shift tests and replace %tmp variable names to silence update_test_checks warnings	2020-07-02 11:56:51 +01:00
Nikita Popov	a59dc55c2a	[InstSimplify] Move assume icmp test (NFC) Move this test from InstCombine into InstSimplify.	2020-07-01 23:35:52 +02:00
Hiroshi Yamauchi	6bd1db08e7	[InstCombine] Don't let an alignment assume prevent new/delete removals. Remove allocations with alignment assume. Differential Revision: https://reviews.llvm.org/D81854	2020-07-01 09:22:32 -07:00
David Green	9e49d1d9b8	[InstCombine] fma x, y, 0 -> fmul x, y If the addend of the fma is zero, common sense would suggest that we can convert fma x, y, 0.0 to fmul x, y. This comes up with some user code that was expecting the first fma in an unrolled loop to simplify to a fmul. Floating point often does not follow naive common sense though. Alive suggests that this should be guarded by nsz (as fadd -0.0, 0.0 = 0.0). fma x, y, -0.0 is always valid. Differential Revision: https://reviews.llvm.org/D82778	2020-06-30 19:56:37 +01:00
David Green	787b1a4746	[InstCombine] New FMA tests and regenerate tests. NFC	2020-06-30 18:05:13 +01:00
Nikita Popov	8758e14c6f	[InstCombine] Add tests for assume implication (NFC)	2020-06-28 16:18:44 +02:00
Fangrui Song	f31811f2dc	[BasicAA] Rename deprecated -basicaa to -basic-aa Follow-up to D82607 Revert an accidental change (empty.ll) of D82683	2020-06-26 20:41:37 -07:00
Vedant Kumar	9649c2095f	[InstCombine] Drop debug loc in TryToSinkInstruction (reland) Summary: The advice in HowToUpdateDebugInfo.rst is to "... preserve the debug location of an instruction if the instruction either remains in its basic block, or if its basic block is folded into a predecessor that branches unconditionally". TryToSinkInstruction doesn't seem to satisfy the criteria as it's sinking an instruction to some successor block. Preserving the debug loc can make single-stepping appear to go backwards, or make a breakpoint hit on that location happen "too late" (since single-stepping from that breakpoint can cause the function to return unexpectedly). So, drop the debug location. This was reverted in `ee3620643d` because it removed source locations from inlinable calls, breaking a verifier rule. I've added an exception for calls because the alternative (setting a line 0 location) is not better. I tested the updated patch by completing a stage2 RelWithDebInfo build. Reviewers: aprantl, davide Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82487	2020-06-26 17:18:15 -07:00
Vedant Kumar	ee3620643d	Revert "[InstCombine] Drop debug loc in TryToSinkInstruction" This reverts commit `903cf140d0`. This might be causing verifier failures on the bots, such as: "inlinable function call in a function with debug info must have a !dbg location" -- http://lab.llvm.org:8011/builders/sanitizer-ppc64be-linux/builds/16976/steps/bootstrap%20clang/logs/stdio	2020-06-26 14:59:40 -07:00
Vedant Kumar	903cf140d0	[InstCombine] Drop debug loc in TryToSinkInstruction Summary: The advice in HowToUpdateDebugInfo.rst is to "... preserve the debug location of an instruction if the instruction either remains in its basic block, or if its basic block is folded into a predecessor that branches unconditionally". TryToSinkInstruction doesn't seem to satisfy the criteria as it's sinking an instruction to some successor block. Preserving the debug loc can make single-stepping appear to go backwards, or make a breakpoint hit on that location happen "too late" (since single-stepping from that breakpoint can cause the function to return unexpectedly). So, drop the debug location. Reviewers: aprantl, davide Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82487	2020-06-26 13:23:24 -07:00
David Sherwood	7a834a0a4e	[SVE] Fix scalable vector bug in DataLayout::getIntPtrType Fixed an issue in DataLayout::getIntPtrType where we were assuming the input type was always a fixed vector type, which isn't true. Added a test that exposed the problem to: Transforms/InstCombine/vector_gep1.ll Differential Revision: https://reviews.llvm.org/D82294	2020-06-26 07:58:45 +01:00
Sanjay Patel	c9e8c9e3ea	[InstCombine] fold fmul/fdiv with fabs operands fabs(X) * fabs(Y) --> fabs(X * Y) fabs(X) / fabs(Y) --> fabs(X / Y) If both operands of fmul/fdiv are positive, then the result must be positive. There's a NAN corner-case that prevents removing the more specific fold just above this one: fabs(X) * fabs(X) -> X * X That fold works even with NAN because the sign-bit result of the multiply is not specified if X is NAN. We can't remove that and use the more general fold that is proposed here because once we convert to this: fabs (X * X) ...it is not legal to simplify the 'fabs' out of that expression when X is NAN. That's because fabs() guarantees that the sign-bit is always cleared - even for NAN values. So this patch has the potential to lose information, but it seems unlikely if we do the more specific fold ahead of this one. Differential Revision: https://reviews.llvm.org/D82277	2020-06-25 11:35:38 -04:00
Tyker	c95ffadb24	[AssumeBundles] Use operand bundles to encode alignment assumptions Summary: NOTE: There is a mailing list discussion on this: http://lists.llvm.org/pipermail/llvm-dev/2019-December/137632.html Complemantary to the assumption outliner prototype in D71692, this patch shows how we could simplify the code emitted for an alignemnt assumption. The generated code is smaller, less fragile, and it makes it easier to recognize the additional use as a "assumption use". As mentioned in D71692 and on the mailing list, we could adopt this scheme, and similar schemes for other patterns, without adopting the assumption outlining. Reviewers: hfinkel, xbolva00, lebedev.ri, nikic, rjmccall, spatel, jdoerfert, sstefan1 Reviewed By: jdoerfert Subscribers: yamauchi, kuter, fhahn, merge_guards_bot, hiraditya, bollu, rkruppe, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D71739	2020-06-25 12:59:44 +02:00
Max Kazantsev	4c6548222b	[Test] Add more tests for selects & phis	2020-06-25 10:54:07 +07:00
Max Kazantsev	1eeb714787	[InstCombine] Combine select & Phi by same condition This patch transforms ``` p = phi [x, y] s = select cond, z, p ``` with ``` s = phi[x, z] ``` if we can prove that the Phi node takes values basing on select's condition. Differential Revision: https://reviews.llvm.org/D82072 Reviewed By: nikic	2020-06-25 10:44:10 +07:00
Roman Lebedev	381054a989	[InstCombine] visitBitCast(): do not crash on weird `bitcast <1 x i8> to i8` Even if we know that RHS of a bitcast is a pointer, we can't assume LHS is, because it might be a single-element vector of pointer.	2020-06-25 00:58:53 +03:00
Max Kazantsev	9bff376e5c	[InstCombine] Replace selects with Phis We can sometimes replace a select with a Phi node if all of its values are available on respective incoming edges. Differential Revision: https://reviews.llvm.org/D82005 Reviewed By: nikic	2020-06-23 12:12:59 +07:00
Sanjay Patel	8953ecf22b	[InstCombine] reassociate diff of sums into sum of diffs This is the integer sibling to D81491. (a[0] + a[1] + a[2] + a[3]) - (b[0] + b[1] + b[2] +b[3]) --> (a[0] - b[0]) + (a[1] - b[1]) + (a[2] - b[2]) + (a[3] - b[3]) Removing the "experimental" from these intrinsics is likely not too far away.	2020-06-22 20:47:09 -04:00
Sanjay Patel	7e1f376f80	[InstCombine] add tests for integer reductions; NFC	2020-06-22 20:47:09 -04:00
Sanjay Patel	fc3cf48e12	[InstCombine] add tests for fmul/fdiv with fabs operands; NFC	2020-06-20 11:44:27 -04:00
Sanjay Patel	d84cdb81ed	[InstCombine] fabs(X) / fabs(X) -> X / X Also, consolidate related folds so we don't miss/repeat these.	2020-06-20 10:20:21 -04:00
Sanjay Patel	61b5773796	[InstCombine] add tests for fabs(x) / fabs (x); NFC	2020-06-20 10:17:09 -04:00
Max Kazantsev	7f0d7f3263	[Test] Add more tests on select->phi transform	2020-06-19 12:57:08 +07:00
Matt Arsenault	f0abefaf50	AMDGPU: Add IntrWillReturn to intrinsic definitions This should probably be implied for all the speculatable ones. I think the only ones where this plausibly doesn't apply is s_sendmsghalt and maybe kill.	2020-06-18 15:38:10 -04:00
Max Kazantsev	819948c443	[Test] Add more tests showing missing opportunities in Select instcombine	2020-06-18 12:32:55 +07:00
Roman Lebedev	e3d8cb1e1d	[InstCombine] Negator: cache negation results (PR46362) It is possible that we can try to negate the same value multiple times. For example, PHI nodes may happen to have multiple incoming values (all of which must be the same value) for the same incoming basic block. It may happen that we try to negate such a PHI node, and succeed, and that might result in having now-different incoming values.. To avoid that, and in general to reduce the amount of duplicated work we might be doing, let's introduce a cache where we'll track results of negating each value. The added test was previously failing -verify after -instcombine. Fixes https://bugs.llvm.org/show_bug.cgi?id=46362	2020-06-17 22:47:20 +03:00
Sam Parker	5bf0858c0b	Return "[InstCombine] Simplify compare of Phi with constant inputs against a constant" I originally reverted the patch because it was causing performance issues, but now I think it's just enabling simplify-cfg to do something that I don't want instead :) Sorry for the noise. This reverts commit `3e39760f8e`.	2020-06-17 11:38:59 +01:00
Max Kazantsev	9465dd5ddd	[Test] Add missing opportunity for replacement of select with Phi	2020-06-17 15:33:42 +07:00
Hiroshi Yamauchi	6bc2b042f4	[TLI] Add four C++17 delete variants. Summary: delete(void, unsigned int, align_val_t) delete(void, unsigned long, align_val_t) delete[](void, unsigned int, align_val_t) delete[](void, unsigned long, align_val_t) Differential Revision: https://reviews.llvm.org/D81853	2020-06-16 11:12:02 -07:00
Sam Parker	3e39760f8e	Revert "Return "[InstCombine] Simplify compare of Phi with constant inputs against a constant"" This reverts commit `23291b9863`. This caused performance regressions.	2020-06-15 07:46:28 +01:00
Max Kazantsev	344eaf7827	[Test] Update test with check script, add two more motivating cases	2020-06-15 12:41:46 +07:00
Sanjay Patel	b5fb26951a	[InstCombine] reassociate FP diff of sums into sum of diffs (a[0] + a[1] + a[2] + a[3]) - (b[0] + b[1] + b[2] +b[3]) --> (a[0] - b[0]) + (a[1] - b[1]) + (a[2] - b[2]) + (a[3] - b[3]) This should be the last step in solving PR43953: https://bugs.llvm.org/show_bug.cgi?id=43953 We started emitting reduction intrinsics with: D80867/ rGe50059f6b6b3 So it's a relatively easy pattern match now to re-order those ops. Also, I have not seen any complaints for the switch to intrinsics yet, so I'll propose to remove the "experimental" tag from the intrinsics soon. Differential Revision: https://reviews.llvm.org/D81491	2020-06-14 09:09:03 -04:00
Sanjay Patel	aeb5044801	[InstCombine] allow undef elements when comparing vector constants for min/max bailout This is a hacky, but low-risk fix to avoid the infinite loop in PR46271: https://bugs.llvm.org/show_bug.cgi?id=46271 As discussed there, the problem is that FoldOpIntoSelect() can get into a conflict with a transform that wants to pull a 'not' op through min/max via SimplifyDemandedVectorElts(). We need to relax our matching of min/max to include undefined elements in vector constants to avoid that. Alternatively, we could improve or cripple the demanded elements analysis, but that could create even more problems. The likely better, safer alternative will be to create min/max intrinsics, so we can remove all of the hacks related to min/max matching in instcombine. Differential Revision: https://reviews.llvm.org/D81698	2020-06-14 09:02:47 -04:00
EgorBo	012909dcaf	[InstCombine] "X - (X / C) * C == 0" to "X & C-1 == 0" Summary: "X % C == 0" is optimized to "X & C-1 == 0" (where C is a power-of-two) However, "X % Y" can also be represented as "X - (X / Y) * Y" so if I rewrite the initial expression: "X - (X / C) * C == 0" it's not currently optimized to "X & C-1 == 0", see godbolt: https://godbolt.org/z/KzuXUj This is my first contribution to LLVM so I hope I didn't mess things up Reviewers: lebedev.ri, spatel Reviewed By: lebedev.ri Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79369	2020-06-12 10:20:06 +03:00
EgorBo	6538b3adbe	[NFC][InstCombine] Tests for "X - (X / C) * C == 0" pattern See https://reviews.llvm.org/D79369	2020-06-12 10:20:06 +03:00
Chris Jackson	4707bc2177	[DebugInfo] Refactor SalvageDebugInfo and SalvageDebugInfoForDbgValues - Simplify the salvaging interface and the algorithm in InstCombine Reviewers: vsk, aprantl, Orlando, jmorse, TWeaver Reviewed by: Orlando Differential Revision: https://reviews.llvm.org/D79863	2020-06-11 11:13:46 +01:00
Sanjay Patel	f71a3b54f0	[InstCombine] add tests for diff-of-sums; NFC	2020-06-09 15:33:38 -04:00
Mehdi Amini	d31c9e5a46	Change filecheck default to dump input on failure Having the input dumped on failure seems like a better default: I debugged FileCheck tests for a while without knowing about this option, which really helps to understand failures. Remove `-dump-input-on-failure` and the environment variable FILECHECK_DUMP_INPUT_ON_FAILURE which are now obsolete. Differential Revision: https://reviews.llvm.org/D81422	2020-06-09 18:57:46 +00:00
Simon Pilgrim	8233439fdb	[InstCombine] Ensure allocation alignment mask is within range before applying as an attribute Fixes OSS-Fuzz #23214 https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=23214	2020-06-09 17:31:55 +01:00
Sanjay Patel	d50366d29f	[InstCombine] improve matching for sext-lshr-trunc patterns, part 2 Similar to rG42f488b63a04 This is intended to preserve the logic of the existing transform, but remove unnecessary restrictions on uses and types. https://rise4fun.com/Alive/oS0 Name: narrow input Pre: C1 <= width(C1) - 24 %B = sext i8 %A %C = lshr %B, C1 %r = trunc %C to i24 => %s = ashr i8 %A, trunc(umin(C1, 7)) %r = sext i8 %s to i24 Name: wide input Pre: C1 <= width(C1) - 24 %B = sext i24 %A %C = lshr %B, C1 %r = trunc %C to i8 => %s = ashr i24 %A, trunc(umin(C1, 23)) %r = trunc i24 %s to i8	2020-06-08 14:41:50 -04:00
Sanjay Patel	9b41821c1b	[InstCombine] add tests for sext+lshr+trunc; NFC Shows missing transforms with extra uses and vectors.	2020-06-08 14:41:50 -04:00
Sanjay Patel	42f488b63a	[InstCombine] improve matching for sext-lshr-trunc patterns This is intended to preserve the logic of the existing transform, but remove unnecessary restrictions on uses and types. https://rise4fun.com/Alive/pYfR Pre: C1 <= width(C1) - 8 %B = sext i8 %A %C = lshr %B, C1 %r = trunc %C to i8 => %r = ashr i8 %A, trunc(umin(C1, 7))	2020-06-08 11:55:30 -04:00
Sanjay Patel	2e5bba6787	[InstCombine] add tests for sext+lshr+trunc; NFC Shows missing transforms with extra uses and vectors.	2020-06-08 11:15:44 -04:00
Max Kazantsev	005db9c361	[Test] Add test showing InstCombine missing simplification opportunity	2020-06-08 13:19:09 +07:00
Sanjay Patel	2552f65183	[InstCombine] fold mask op into casted shift (PR46013) https://rise4fun.com/Alive/Qply8 Pre: C2 == (-1 u>> zext(C1)) %a = ashr %x, C1 %s = sext %a to i16 %r = and i16 %s, C2 => %s2 = sext %x to i16 %r = lshr i16 %s2, zext(C1) https://bugs.llvm.org/show_bug.cgi?id=46013	2020-06-07 09:33:18 -04:00
Sanjay Patel	c6719d0b47	[InstCombine] add tests for bitmask of casted shift; NFC (PR46013)	2020-06-07 09:33:18 -04:00
Richard Smith	f39e12a06b	PR34581: Don't remove an 'if (p)' guarding a call to 'operator delete(p)' under -Oz. Summary: This transformation is correct for a builtin call to 'free(p)', but not for 'operator delete(p)'. There is no guarantee that a user replacement 'operator delete' has no effect when called on a null pointer. However, the principle behind the transformation is correct, and can be applied more broadly: a 'delete p' expression is permitted to unconditionally call 'operator delete(p)'. So do that in Clang under -Oz where possible. We do this whether or not 'p' has trivial destruction, since the destruction might turn out to be trivial after inlining, and even for a class-specific (but non-virtual, non-destroying, non-array) 'operator delete'. Reviewers: davide, dnsampaio, rjmccall Reviewed By: dnsampaio Subscribers: hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D79378	2020-06-05 17:13:43 -07:00
Philip Reames	32c09d527c	[Tests] Migrate a number of tests to gc-live bundle representation	2020-06-05 16:44:04 -07:00
Max Kazantsev	23291b9863	Return "[InstCombine] Simplify compare of Phi with constant inputs against a constant" This reverts commit `c4b5a66e44`. Returning along with Clang test fix	2020-06-05 20:48:29 +07:00
Kadir Cetinkaya	c4b5a66e44	Revert "[InstCombine] Simplify compare of Phi with constant inputs against a constant" This reverts commit `16b7eb6dd1`. Breaks build bots, see http://lab.llvm.org:8011/builders/clang-x86_64-debian-fast/builds/29888 for an example.	2020-06-05 13:02:35 +02:00
Max Kazantsev	16b7eb6dd1	[InstCombine] Simplify compare of Phi with constant inputs against a constant We can simplify ``` icmp <pred> phi(C1, C2, ...), C ``` with ``` phi(icmp(C1, C), icmp(C2, C), ...) ``` provided that all comparison of constants are constants themselves. Differential Revision: https://reviews.llvm.org/D81151 Reviewed By: lebedev.ri	2020-06-05 17:02:47 +07:00
Sanjay Patel	192cb71836	[InstCombine] avoid crashing on select-shuffle detection As mentioned in the post-commit comments of D81013 - the mask check API has to assume the shuffle is not length-changing, but we have not ruled that out in this code. Use the ShuffleVectorInst call instead.	2020-06-04 17:27:14 -04:00
Sanjay Patel	8a96c1f627	[InstCombine] move vector select ahead of select-shuffle select Cond, (shuf_sel X, Y), X --> shuf_sel X, (select Cond, Y, X) A select of a select-shuffle ("blend" in x86 lingo) can be reversed so that the select is done first. This is a more limited version of what I was trying in D80658, but it enables existing demanded bits transforms to catch some of the motivating cases. The tricky bit in that seems to be that by moving the shuffle later, we can always guarantee that poison is correctly inhibited by the shuffle mask in the final value. Alive2 checks for the basic tests: http://volta.cs.utah.edu:8080/z/Qqd3RK http://volta.cs.utah.edu:8080/z/S4wchM http://volta.cs.utah.edu:8080/z/wf9zPL http://volta.cs.utah.edu:8080/z/wJeEGk Differential Revision: https://reviews.llvm.org/D81013	2020-06-04 14:29:13 -04:00
Max Kazantsev	18134511d9	[Test] Add test showing missing opportunity of folding ICmp(Phi(Consts...))	2020-06-04 18:26:24 +07:00
Yevgeny Rouban	417bcb8827	[Instruction] Remove setProfWeight() Remove the function Instruction::setProfWeight() and make use of Instruction::copyMetadata(.., {LLVMContext::MD_prof}). This is correct for all use cases of setProfWeight() as it is applied to CallBase instructions only. This change results in prof metadata copied intact even if the source has "VP". The old pair of calls extractProfTotalWeight() + setProfWeight() resulted in setting branch_weights if the source had "VP" data. Reviewers: yamauchi, davidxl Tags: #llvm Differential Revision: https://reviews.llvm.org/D80987	2020-06-04 15:10:55 +07:00
Sanjay Patel	a26cd73d33	[InstSimplify] add/move tests for or with not op (PR46083); NFC	2020-06-03 08:13:36 -04:00
Sanjay Patel	5a82dc62d2	[InstCombine] add tests for select-of-select-shuffle; NFC	2020-06-02 13:26:21 -04:00
Sanjay Patel	5b8c79ce76	[InstCombine] regenerate complete test checks; NFC	2020-06-02 13:26:21 -04:00
Sanjay Patel	b874dc4dda	[InstCombine] add test for select-of-shuffle; NFC This is based on an example in D80658	2020-06-01 11:52:07 -04:00
David Sherwood	f254f1d94e	[SVE] Remove getNumElements() warnings in InstCombiner::visitBitCast Whilst trying to compile this test to assembly: CodeGen/aarch64-sve-intrinsics/acle_sve_reinterpret.c I discovered some warnings were firing in InstCombiner::visitBitCast due to calls to getNumElements() for scalable vector types. These calls only really made sense for fixed width vectors so I have fixed up the code appropriately. Differential Revision: https://reviews.llvm.org/D80559	2020-05-29 08:00:08 +01:00
Philip Reames	27304b1737	[Tests] Switch a few statepoint tests to using operand bundles We've started (D80598) the process of migrating away from the inline operand lists in statepoints to using explicit operand bundles. Update a few tests to reflect the new preference. More to come, these were simply the ones outside any obvious grouping.	2020-05-28 14:36:05 -07:00
Sanjay Patel	48cb380abd	[InstCombine] add tests for vector demanded elements of select condition; NFC	2020-05-27 14:49:36 -04:00
Sanjay Patel	1a2bffaf8b	[InstCombine] reassociate sub+add to increase adds and throughput The -reassociate pass tends to transform this kind of pattern into something that is worse for vectorization and codegen. See PR43953: https://bugs.llvm.org/show_bug.cgi?id=43953 Follows-up the FP version of the same transform: rGa0ce2338a083	2020-05-26 14:49:17 -04:00
Sanjay Patel	0788392637	[InstCombine] add tests for reassociative sub/add expressions; NFC	2020-05-26 14:49:16 -04:00
Sanjay Patel	a0ce2338a0	[InstCombine] reassociate fsub+fadd with FMF to increase adds and throughput The -reassociate pass tends to transform this kind of pattern into something that is worse for vectorization and codegen. See PR43953: https://bugs.llvm.org/show_bug.cgi?id=43953	2020-05-26 13:17:15 -04:00
Serge Pavlov	4d20e31f73	[FPEnv] Intrinsic llvm.roundeven This intrinsic implements IEEE-754 operation roundToIntegralTiesToEven, and performs rounding to the nearest integer value, rounding halfway cases to even. The intrinsic represents the missed case of IEEE-754 rounding operations and now llvm provides full support of the rounding operations defined by the standard. Differential Revision: https://reviews.llvm.org/D75670	2020-05-26 19:24:58 +07:00
Sanjay Patel	c048a02b5b	[InstCombine] fold FP trunc into exact itofp Similar to D79116 and rGbfd512160fe0 - if the 1st cast is exact, then we can go directly to the destination type because there is no double-rounding.	2020-05-24 09:30:19 -04:00
Matt Arsenault	27fe841aa6	AMDGPU: Refine rcp/rsq intrinsic folding for modern FP rules We have to assume undef could be an snan, which would need quieting so returning qnan is safer than undef. Also consider strictfp, and don't care if the result rounded.	2020-05-23 13:28:36 -04:00
Sanjay Patel	2f7c24fe30	[InstCombine] (A + B) + B --> A + (B << 1) This eliminates a use of 'B', so it can enable follow-on transforms as well as improve analysis/codegen. The PhaseOrdering test was added for D61726, and that shows the limits of instcombine vs. real reassociation. We would need to run some form of CSE to collapse that further. The intermediate variable naming here is intentional because there's a test at llvm/test/Bitcode/value-with-long-name.ll that would break with the usual nameless value. I'm not sure how to improve that test to be more robust. The naming may also be helpful to debug regressions if this change exposes weaknesses in the reassociation pass for example.	2020-05-22 11:46:59 -04:00
Sanjay Patel	b603794061	[InstCombine] add tests for adds with common operand; NFC	2020-05-22 11:46:59 -04:00
Matt Arsenault	88c20fa3d2	InstCombine: Add constant folding/simplify for amdgcn.ldexp intrinsic This really belongs in InstructionSimplify since it doesn't introduce new instructions. Put it in instcombine to avoid increasing the number of passes considering target intrinsics. I also noticed that we seem to now be interpreting strictfp attributes on call sites, so try to handle that.	2020-05-22 08:21:38 -04:00
Jon Roelofs	5a8db275f8	Revert "[llvm][test] Add COM: directives before colon-less non-CHECKs in comments. NFC" This reverts commit `183d6af081`. Revert pending further consensus building: https://reviews.llvm.org/D79963#2050521	2020-05-22 05:36:15 -06:00
Max Kazantsev	403810557b	[InstCombine] Sink pure instructions down to return and unreachable blocks If the only user of `Instr` is in a return or unreachable block, we can sink `Instr` to the`User` safely (unless it reads/writes memory). Return or unreachable blocks are guaranteed to execute zero or one time, and `Instr` always dominates `User`, so they either will be executed together (execution of `User` always implies execution of `Instr`) or not executed at all. Differential Revision: https://reviews.llvm.org/D80120 Reviewed By: asbirlea, jdoerfert	2020-05-22 14:33:42 +07:00
Jon Roelofs	183d6af081	[llvm][test] Add COM: directives before colon-less non-CHECKs in comments. NFC Differential Revision: https://reviews.llvm.org/D79963	2020-05-21 09:29:27 -06:00
Eli Friedman	f26bdb539e	Make Value::getPointerAlignment() return an Align, not a MaybeAlign. If we don't know anything about the alignment of a pointer, Align(1) is still correct: all pointers are at least 1-byte aligned. Included in this patch is a bugfix for an issue discovered during this cleanup: pointers with "dereferenceable" attributes/metadata were assumed to be aligned according to the type of the pointer. This wasn't intentional, as far as I can tell, so Loads.cpp was fixed to stop making this assumption. Frontends may need to be updated. I updated clang's handling of C++ references, and added a release note for this. Differential Revision: https://reviews.llvm.org/D80072	2020-05-20 16:37:20 -07:00
Roman Lebedev	55430f53f3	[InstCombine] `insertelement` is negatible if both sources are negatible ---------------------------------------- define <2 x i4> @negate_insertelement(<2 x i4> %src, i4 %a, i32 %x, <2 x i4> %b) { %0: %t0 = sub <2 x i4> { 0, 0 }, %src %t1 = sub i4 0, %a %t2 = insertelement <2 x i4> %t0, i4 %t1, i32 %x %t3 = sub <2 x i4> %b, %t2 ret <2 x i4> %t3 } => define <2 x i4> @negate_insertelement(<2 x i4> %src, i4 %a, i32 %x, <2 x i4> %b) { %0: %t2.neg = insertelement <2 x i4> %src, i4 %a, i32 %x %t3 = add <2 x i4> %t2.neg, %b ret <2 x i4> %t3 } Transformation seems to be correct!	2020-05-20 21:44:31 +03:00
Roman Lebedev	a6097cebe9	[NFC][InstCombine] Negator: tests for insertelement negation	2020-05-20 21:44:31 +03:00
Roman Lebedev	ebed96fdbf	[InstCombine] Negator: `extractelement` is negatible if src is negatible ---------------------------------------- define i4 @negate_extractelement(<2 x i4> %x, i32 %y, i4 %z) { %0: %t0 = sub <2 x i4> { 0, 0 }, %x call void @use_v2i4(<2 x i4> %t0) %t1 = extractelement <2 x i4> %t0, i32 %y %t2 = sub i4 %z, %t1 ret i4 %t2 } => define i4 @negate_extractelement(<2 x i4> %x, i32 %y, i4 %z) { %0: %t0 = sub <2 x i4> { 0, 0 }, %x call void @use_v2i4(<2 x i4> %t0) %t1.neg = extractelement <2 x i4> %x, i32 %y %t2 = add i4 %t1.neg, %z ret i4 %t2 } Transformation seems to be correct!	2020-05-20 21:44:31 +03:00
Roman Lebedev	952e7106b3	[NFC][InstCombine] Negator: tests for extractelement negation	2020-05-20 21:44:30 +03:00
Arthur Eubanks	8a88755610	Reland [X86] Codegen for preallocated See https://reviews.llvm.org/D74651 for the preallocated IR constructs and LangRef changes. In X86TargetLowering::LowerCall(), if a call is preallocated, record each argument's offset from the stack pointer and the total stack adjustment. Associate the call Value with an integer index. Store the info in X86MachineFunctionInfo with the integer index as the key. This adds two new target independent ISDOpcodes and two new target dependent Opcodes corresponding to @llvm.call.preallocated.{setup,arg}. The setup ISelDAG node takes in a chain and outputs a chain and a SrcValue of the preallocated call Value. It is lowered to a target dependent node with the SrcValue replaced with the integer index key by looking in X86MachineFunctionInfo. In X86TargetLowering::EmitInstrWithCustomInserter() this is lowered to an %esp adjustment, the exact amount determined by looking in X86MachineFunctionInfo with the integer index key. The arg ISelDAG node takes in a chain, a SrcValue of the preallocated call Value, and the arg index int constant. It produces a chain and the pointer fo the arg. It is lowered to a target dependent node with the SrcValue replaced with the integer index key by looking in X86MachineFunctionInfo. In X86TargetLowering::EmitInstrWithCustomInserter() this is lowered to a lea of the stack pointer plus an offset determined by looking in X86MachineFunctionInfo with the integer index key. Force any function containing a preallocated call to use the frame pointer. Does not yet handle a setup without a call, or a conditional call. Does not yet handle musttail. That requires a LangRef change first. Tried to look at all references to inalloca and see if they apply to preallocated. I've made preallocated versions of tests testing inalloca whenever possible and when they make sense (e.g. not alloca related, inalloca edge cases). Aside from the tests added here, I checked that this codegen produces correct code for something like ``` struct A { A(); A(A&&); ~A(); }; void bar() { foo(foo(foo(foo(foo(A(), 4), 5), 6), 7), 8); } ``` by replacing the inalloca version of the .ll file with the appropriate preallocated code. Running the executable produces the same results as using the current inalloca implementation. Reverted due to unexpectedly passing tests, added REQUIRES: asserts for reland. Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77689	2020-05-20 11:25:44 -07:00
Arthur Eubanks	b8cbff51d3	Revert "[X86] Codegen for preallocated" This reverts commit `810567dc69`. Some tests are unexpectedly passing	2020-05-20 10:04:55 -07:00
Sanjay Patel	ad953a1ae1	[InstCombine] add tests for reassociative fsub/fadd expressions; NFC	2020-05-20 12:45:27 -04:00
Arthur Eubanks	810567dc69	[X86] Codegen for preallocated See https://reviews.llvm.org/D74651 for the preallocated IR constructs and LangRef changes. In X86TargetLowering::LowerCall(), if a call is preallocated, record each argument's offset from the stack pointer and the total stack adjustment. Associate the call Value with an integer index. Store the info in X86MachineFunctionInfo with the integer index as the key. This adds two new target independent ISDOpcodes and two new target dependent Opcodes corresponding to @llvm.call.preallocated.{setup,arg}. The setup ISelDAG node takes in a chain and outputs a chain and a SrcValue of the preallocated call Value. It is lowered to a target dependent node with the SrcValue replaced with the integer index key by looking in X86MachineFunctionInfo. In X86TargetLowering::EmitInstrWithCustomInserter() this is lowered to an %esp adjustment, the exact amount determined by looking in X86MachineFunctionInfo with the integer index key. The arg ISelDAG node takes in a chain, a SrcValue of the preallocated call Value, and the arg index int constant. It produces a chain and the pointer fo the arg. It is lowered to a target dependent node with the SrcValue replaced with the integer index key by looking in X86MachineFunctionInfo. In X86TargetLowering::EmitInstrWithCustomInserter() this is lowered to a lea of the stack pointer plus an offset determined by looking in X86MachineFunctionInfo with the integer index key. Force any function containing a preallocated call to use the frame pointer. Does not yet handle a setup without a call, or a conditional call. Does not yet handle musttail. That requires a LangRef change first. Tried to look at all references to inalloca and see if they apply to preallocated. I've made preallocated versions of tests testing inalloca whenever possible and when they make sense (e.g. not alloca related, inalloca edge cases). Aside from the tests added here, I checked that this codegen produces correct code for something like ``` struct A { A(); A(A&&); ~A(); }; void bar() { foo(foo(foo(foo(foo(A(), 4), 5), 6), 7), 8); } ``` by replacing the inalloca version of the .ll file with the appropriate preallocated code. Running the executable produces the same results as using the current inalloca implementation. Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77689	2020-05-20 09:20:38 -07:00
Jay Foad	9bc989a48d	[InstCombine] Remove hasNoInfs check for pow(C,y) -> exp2(log2(C)*y) We already check hasNoNaNs and that x is finite and strictly positive. That only leaves the following special cases (taken from the Linux man page for pow): If x is +1, the result is 1.0 (even if y is a NaN). If the absolute value of x is less than 1, and y is negative infinity, the result is positive infinity. If the absolute value of x is greater than 1, and y is negative infinity, the result is +0. If the absolute value of x is less than 1, and y is positive infinity, the result is +0. If the absolute value of x is greater than 1, and y is positive infinity, the result is positive infinity. The first case is handled elsewhere, and this transformation preserves all the others, so there is no need to limit it to hasNoInfs. Differential Revision: https://reviews.llvm.org/D79409	2020-05-19 17:06:05 +01:00
Vedant Kumar	623b254244	[Local] Do not ignore zexts in salvageDebugInfo, PR45923 Summary: When salvaging a dead zext instruction, append a convert operation to the DIExpressions of the debug uses of the instruction, to prevent the salvaged value from being sign-extended. I confirmed that lldb prints out the correct unsigned result for "f" in the example from PR45923 with this changed applied. rdar://63246143 Reviewers: aprantl, jmorse, chrisjackson, davide Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80034	2020-05-18 09:52:02 -07:00
Max Kazantsev	a2a4e5aae8	[Test] Opportunity for sinking to unreachable in InstCombine	2020-05-18 16:27:16 +07:00
Roman Lebedev	fde8eb00e1	[InstCombine] visitMaskedMerge(): when unfolding, sanitize undef constants (PR45955) We can't leave undef vector element constants as-is, it is a miscompile, so we need to sanitize them. We have two vectors (C and ~C): * We can't replace undef with 0 in both of them * We can't replace undef with 0 in only one of them * We could replace undef with -1 in both of them * We could replace undef with -1 in only one(!) of them * We could replace undef with -1 in one and 0 in another one of them. Therefore, it seems best to go with the last option, since otherwise we'd loose knowledge that C and ~C have no common bits set, which seems more important than preserving partial undef knowledge. Fixes https://bugs.llvm.org/show_bug.cgi?id=45955	2020-05-17 22:53:03 +03:00
Sanjay Patel	130a2356ae	[InstCombine] add tests for FP cast of cast; NFC A fold of casts is proposed as a backend transform in D79187, but we can also do that in IR (and that may obsolete the need for a backend transform).	2020-05-17 11:42:07 -04:00
Sanjay Patel	bfd512160f	[InstCombine] improve analysis of FP->int->FP to eliminate fpextend This was originally in D79116. Converting from a narrow-enough FP source value to integer and back to FP guarantees that the conversion to FP is exact because of UB/poison-on-overflow. This was suggested in PR36617: https://bugs.llvm.org/show_bug.cgi?id=36617#c19	2020-05-17 09:06:57 -04:00
Eli Friedman	11aa3707e3	StoreInst should store Align, not MaybeAlign This is D77454, except for stores. All the infrastructure work was done for loads, so the remaining changes necessary are relatively small. Differential Revision: https://reviews.llvm.org/D79968	2020-05-15 12:26:58 -07:00
Nikita Popov	f89f7da999	[IR] Convert null-pointer-is-valid into an enum attribute The "null-pointer-is-valid" attribute needs to be checked by many pointer-related combines. To make the check more efficient, convert it from a string into an enum attribute. In the future, this attribute may be replaced with data layout properties. Differential Revision: https://reviews.llvm.org/D78862	2020-05-15 19:41:07 +02:00
Simon Pilgrim	33d96bf7b9	[InstCombine] Add vector tests for the or(shl(zext(x),32)\|zext(y)) concat combines	2020-05-13 18:48:02 +01:00
Sanjay Patel	856cc60bc1	[InstCombine] canonicalize bitcast after insertelement into undef We have a transform in the opposite direction only for the x86 MMX type, Other types are not handled either way before this patch. The motivating case from PR45748: https://bugs.llvm.org/show_bug.cgi?id=45748 ...is the last test diff. In that example, we are triggering an existing bitcast transform, so we reduce the number of casts, and that should give us the ideal x86 codegen. Differential Revision: https://reviews.llvm.org/D79171	2020-05-10 11:37:47 -04:00
Simon Pilgrim	bab44a698e	[InstCombine] matchOrConcat - match BITREVERSE Fold or(zext(bitreverse(x)),shl(zext(bitreverse(y)),bw/2) -> bitreverse(or(zext(x),shl(zext(y),bw/2)) Practically this is the same as the BSWAP pattern so we might as well handle it.	2020-05-10 16:00:29 +01:00

1 2 3 4 5 ...

5080 Commits