llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	6080387f13	[InstSimplify] fold splat of inserted constant to vector constant shuf (inselt ?, C, IndexC), undef, <IndexC, IndexC...> --> <C, C...> This is another missing shuffle fold pattern uncovered by the shuffle correctness fix from D70246. The problem was visible in the post-commit thread example, but we managed to overcome the limitation for that particular case with D71220. This is something like the inverse of the previous fix - there we didn't demand the inserted scalar, and here we are only demanding an inserted scalar. Differential Revision: https://reviews.llvm.org/D71488	2019-12-15 09:32:03 -05:00
Sanjay Patel	940600ae41	[InstSimplify] improve test coverage for insert+splat; NFC	2019-12-13 14:03:54 -05:00
Sanjay Patel	252d3b9805	[InstSimplify] add tests for insert constant + splat; NFC	2019-12-10 17:16:58 -05:00
Johannes Doerfert	a7d992c0f2	[ValueTracking] Allow context-sensitive nullness check for non-pointers Summary: Same as D60846 and D69571 but with a fix for the problem encountered after them. Both times it was a missing context adjustment in the handling of PHI nodes. The reproducers created from the bugs that caused the old commits to be reverted are included. Reviewers: nikic, nlopes, mkazantsev, spatel, dlrobertson, uabelho, hakzsam, hans Subscribers: hiraditya, bollu, asbirlea, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71181	2019-12-09 15:15:52 -06:00
Sanjay Patel	1c4dd3ae2f	[InstSimplify] fold copysign with negated operand, part 2 This is another transform suggested in PR44153: https://bugs.llvm.org/show_bug.cgi?id=44153 Unlike rG12f39e0fede9, it doesn't look like the backend matches this variant.	2019-12-08 10:16:29 -05:00
Sanjay Patel	12f39e0fed	[InstSimplify] fold copysign with negated operand This is another transform suggested in PR44153: https://bugs.llvm.org/show_bug.cgi?id=44153 The backend for some targets already manages to get this if it converts copysign to bitwise logic.	2019-12-08 10:08:02 -05:00
Sanjay Patel	d5abaaf140	[InstSimplify] add tests for copysign with fneg operand; NFC	2019-12-06 16:23:44 -05:00
Sanjay Patel	e177c5a00d	[InstSimplify] fold copysign with same args to the arg This is correct for any value including NaN/inf. We don't have this fold directly in the backend either, but x86 manages to get it after converting things to bitops.	2019-11-26 17:35:10 -05:00
Sanjay Patel	48a3a1e090	[InstSimplify] add tests for copysign; NFC	2019-11-26 17:23:30 -05:00
Benjamin Kramer	cd4811360e	[ValueTracking] Add a basic version of isKnownNonInfinity and use it to detect more NoNaNs	2019-11-19 22:24:46 +01:00
Hans Wennborg	6ea4775900	Revert `57dd4b0` "[ValueTracking] Allow context-sensitive nullness check for non-pointers" This caused miscompiles of Chromium (https://crbug.com/1023818). The reduced repro is small enough to fit here: $ cat /tmp/a.c unsigned char f(unsigned char *p) { unsigned char result = 0; for (int shift = 0; shift < 1; ++shift) result |= p[0] << (shift * 8); return result; } $ bin/clang -O2 -S -o - /tmp/a.c | grep -A4 f: f: # @f .cfi_startproc # %bb.0: # %entry xorl %eax, %eax retq That's nicely optimized, but I don't think it's the right result :-) > Same as D60846 but with a fix for the problem encountered there which > was a missing context adjustment in the handling of PHI nodes. > > The test that caused D60846 to be reverted was added in `e15ab8f277`. > > Reviewers: nikic, nlopes, mkazantsev,spatel, dlrobertson, uabelho, hakzsam > > Subscribers: hiraditya, bollu, llvm-commits > > Tags: #llvm > > Differential Revision: https://reviews.llvm.org/D69571 This reverts commit `57dd4b03e4`.	2019-11-13 12:19:02 +01:00
aqjune	4187cb138b	Add InstCombine/InstructionSimplify support for Freeze Instruction Summary: - Add llvm::SimplifyFreezeInst - Add InstCombiner::visitFreeze - Add llvm tests Reviewers: majnemer, sanjoy, reames, lebedev.ri, spatel Reviewed By: reames, lebedev.ri Subscribers: reames, lebedev.ri, filcab, regehr, trentxintong, llvm-commits Differential Revision: https://reviews.llvm.org/D29013	2019-11-12 12:13:26 +09:00
Sanjay Patel	659bd73d13	[InstSimplify] use FMF to improve fcmp+select fold This is part of a series of patches needed to solve PR39535: https://bugs.llvm.org/show_bug.cgi?id=39535	2019-11-04 08:29:56 -05:00
Sanjay Patel	ad87f244b4	[InstSimplify] add more tests for fcmp+select; NFC The easy code fix won't catch non-canonical mismatched constant patterns, so adding extra coverage for those in case we decide that's important (but seems unlikely).	2019-11-04 08:23:08 -05:00
Sanjay Patel	499c90afe9	[InstSimplify] add more tests for fcmp+select; NFC The addition of FMF for select allows more folding for these kinds of patterns.	2019-11-04 07:38:11 -05:00
Johannes Doerfert	57dd4b03e4	[ValueTracking] Allow context-sensitive nullness check for non-pointers Same as D60846 but with a fix for the problem encountered there which was a missing context adjustment in the handling of PHI nodes. The test that caused D60846 to be reverted was added in `e15ab8f277`. Reviewers: nikic, nlopes, mkazantsev,spatel, dlrobertson, uabelho, hakzsam Subscribers: hiraditya, bollu, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69571	2019-10-31 14:37:38 -05:00
Sanjay Patel	be21ceb565	[InstSimplify] fold fma/fmuladd with a NaN or undef operand This is intended to be similar to the constant folding results from D67446 and earlier, but not all operands are constant in these tests, so the responsibility for folding is left to InstSimplify. Differential Revision: https://reviews.llvm.org/D67721 llvm-svn: 373455	2019-10-02 12:12:02 +00:00
Sanjay Patel	1b40402aa2	[InstSimplify] add tests for fma/fmuladd with undef operand; NFC llvm-svn: 373109	2019-09-27 18:38:51 +00:00
Roman Lebedev	914a3d1cf2	[InstSimplify] Handle more 'A </>/>=/<= B &&/|| (A - B) !=/== 0' patterns (PR43251) https://rise4fun.com/Alive/sl9s https://rise4fun.com/Alive/2plN https://bugs.llvm.org/show_bug.cgi?id=43251 llvm-svn: 372928	2019-09-25 22:59:41 +00:00
Roman Lebedev	26606bec9a	[NFC][InstSimplify] More exaustive test coverage for 'A </>/>=/<= B &&/|| (A - B) !=/== 0' pattern (PR43251) llvm-svn: 372927	2019-09-25 22:59:24 +00:00
Roman Lebedev	baf809811b	[InstSimplify] simplifyUnsignedRangeCheck(): X >= Y && Y == 0 --> Y == 0 https://rise4fun.com/Alive/v9Y4 llvm-svn: 372491	2019-09-21 22:27:39 +00:00
Roman Lebedev	ac4dda8052	[NFC][InstSimplify] Add exhaustive test coverage for simplifyUnsignedRangeCheck(). One case is not handled. llvm-svn: 372489	2019-09-21 22:27:18 +00:00
Sanjay Patel	e406a3f2d6	[InstSimplify] add tests for fma/fmuladd; NFC llvm-svn: 372236	2019-09-18 17:27:02 +00:00
Roman Lebedev	9c5a4a4527	[InstSimplify] simplifyUnsignedRangeCheck(): handle few tautological cases (PR43251) Summary: This is split off from D67356, since these cases produce a constant, no real need to keep them in instcombine. Alive proofs: https://rise4fun.com/Alive/u7Fk https://rise4fun.com/Alive/4lV https://bugs.llvm.org/show_bug.cgi?id=43251 Reviewers: spatel, nikic, xbolva00 Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67498 llvm-svn: 371921	2019-09-14 13:47:27 +00:00
Roman Lebedev	4cb267f9f5	[NFC][InstSimplify] Add some more tests for D67498/D67502 llvm-svn: 371877	2019-09-13 17:58:24 +00:00
Roman Lebedev	80a8a85758	[InstCombine][InstSimplify] Move constant-folding tests in result-of-usub-is-non-zero-and-no-overflow.ll llvm-svn: 371737	2019-09-12 14:12:31 +00:00
Roman Lebedev	b3e0937f0a	[NFC][InstCombine][InstSimplify] Add test for "add-of-negative is non-zero and no overflow" (PR43259) https://rise4fun.com/Alive/ska https://rise4fun.com/Alive/9iX https://bugs.llvm.org/show_bug.cgi?id=43259 llvm-svn: 371736	2019-09-12 14:12:20 +00:00
Roman Lebedev	f1286621eb	[InstSimplify] simplifyUnsignedRangeCheck(): handle more cases (PR43251) Summary: I don't have a direct motivational case for this, but it would be good to have this for completeness/symmetry. This pattern is basically the motivational pattern from https://bugs.llvm.org/show_bug.cgi?id=43251 but with different predicate that requires that the offset is non-zero. The completeness bit comes from the fact that a similar pattern (offset != zero) will be needed for https://bugs.llvm.org/show_bug.cgi?id=43259, so it'd seem to be good to not overlook very similar patterns.. Proofs: https://rise4fun.com/Alive/21b Also, there is something odd with `isKnownNonZero()`, if the non-zero knowledge was specified as an assumption, it didn't pick it up (PR43267) Reviewers: spatel, nikic, xbolva00 Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67411 llvm-svn: 371718	2019-09-12 09:26:17 +00:00
Roman Lebedev	00c1ee48e4	[InstSimplify] Pass SimplifyQuery into simplifyUnsignedRangeCheck() and use it for isKnownNonZero() This was actually the original intention in D67332, but i messed up and forgot about it. This patch was originally part of D67411, but precommitting this. llvm-svn: 371630	2019-09-11 15:32:46 +00:00
Roman Lebedev	8aeb7bb013	[NFC][InstSimplify] Add extra test for D67411 with @llvm.assume llvm-svn: 371629	2019-09-11 15:28:03 +00:00
Sanjay Patel	9c4047f267	[ConstProp] move test file from InstSimplify; NFC These are constant folding tests; there is no code directly in InstSimplify for this. llvm-svn: 371619	2019-09-11 14:01:11 +00:00
Sanjay Patel	29ba5e0817	[InstSimplify] regenerate test CHECKs; NFC llvm-svn: 371617	2019-09-11 13:56:07 +00:00
Roman Lebedev	870ffe3cee	[NFC][InstSimplify] rewrite test added in r371537 to use non-null pointer instead I only want to ensure that %offset is non-zero there, it doesn't matter how that info is conveyed. As filed in PR43267, the assumption way does not work. llvm-svn: 371546	2019-09-10 18:40:00 +00:00
Roman Lebedev	880657c97c	[NFC][InstCombine][InstSimplify] PR43251 - and some patterns with offset != 0 https://rise4fun.com/Alive/21b llvm-svn: 371537	2019-09-10 17:13:59 +00:00
Roman Lebedev	6e2c5c8710	[InstSimplify] simplifyUnsignedRangeCheck(): if we know that X != 0, handle more cases (PR43246) Summary: This is motivated by D67122 sanitizer check enhancement. That patch seemingly worsens `-fsanitize=pointer-overflow` overhead from 25% to 50%, which strongly implies missing folds. In this particular case, given ``` char* test(char& base, unsigned long offset) { return &base + offset; } ``` it will end up producing something like https://godbolt.org/z/LK5-iH which after optimizations reduces down to roughly ``` define i1 @t0(i8* nonnull %base, i64 %offset) { %base_int = ptrtoint i8* %base to i64 %adjusted = add i64 %base_int, %offset %non_null_after_adjustment = icmp ne i64 %adjusted, 0 %no_overflow_during_adjustment = icmp uge i64 %adjusted, %base_int %res = and i1 %non_null_after_adjustment, %no_overflow_during_adjustment ret i1 %res } ``` Without D67122 there was no `%non_null_after_adjustment`, and in this particular case we can get rid of the overhead: Here we add some offset to a non-null pointer, and check that the result does not overflow and is not a null pointer. But since the base pointer is already non-null, and we check for overflow, that overflow check will already catch the null pointer, so the separate null check is redundant and can be dropped. Alive proofs: https://rise4fun.com/Alive/WRzq There are more patterns of "unsigned-add-with-overflow", they are not handled here, but this is the main pattern, that we currently consider canonical, so it makes sense to handle it. https://bugs.llvm.org/show_bug.cgi?id=43246 Reviewers: spatel, nikic, vsk Reviewed By: spatel Subscribers: hiraditya, llvm-commits, reames Tags: #llvm Differential Revision: https://reviews.llvm.org/D67332 llvm-svn: 371349	2019-09-08 20:14:15 +00:00
Roman Lebedev	64965430db	[NFC][InstSimplify] Some tests for dropping null check after uadd.with.overflow of non-null (PR43246) https://rise4fun.com/Alive/WRzq Name: C <= Y && Y != 0 --> C <= Y iff C != 0 Pre: C != 0 %y_is_nonnull = icmp ne i64 %y, 0 %no_overflow = icmp ule i64 C, %y %r = and i1 %y_is_nonnull, %no_overflow => %r = %no_overflow Name: C <= Y || Y != 0 --> Y != 0 iff C != 0 Pre: C != 0 %y_is_nonnull = icmp ne i64 %y, 0 %no_overflow = icmp ule i64 C, %y %r = or i1 %y_is_nonnull, %no_overflow => %r = %y_is_nonnull Name: C > Y || Y == 0 --> C > Y iff C != 0 Pre: C != 0 %y_is_null = icmp eq i64 %y, 0 %overflow = icmp ugt i64 C, %y %r = or i1 %y_is_null, %overflow => %r = %overflow Name: C > Y && Y == 0 --> Y == 0 iff C != 0 Pre: C != 0 %y_is_null = icmp eq i64 %y, 0 %overflow = icmp ugt i64 C, %y %r = and i1 %y_is_null, %overflow => %r = %y_is_null https://bugs.llvm.org/show_bug.cgi?id=43246 llvm-svn: 371339	2019-09-08 17:50:40 +00:00
Sanjay Patel	4a2cd7be5a	[InstSimplify] guard against unreachable code (PR43218) This would crash: https://bugs.llvm.org/show_bug.cgi?id=43218 llvm-svn: 370911	2019-09-04 15:12:55 +00:00
Roman Lebedev	c584786854	[InstSimplify] Drop leftover "division-by-zero guard" around `@llvm.umul.with.overflow` inverted overflow bit Summary: Now that with D65143/D65144 we've produce `@llvm.umul.with.overflow`, and with D65147 we've flattened the CFG, we now can see that the guard may have been there to prevent division by zero is redundant. We can simply drop it: ``` ---------------------------------------- Name: no overflow or zero %iszero = icmp eq i4 %y, 0 %umul = smul_overflow i4 %x, %y %umul.ov = extractvalue {i4, i1} %umul, 1 %umul.ov.not = xor %umul.ov, -1 %retval.0 = or i1 %iszero, %umul.ov.not ret i1 %retval.0 => %iszero = icmp eq i4 %y, 0 %umul = smul_overflow i4 %x, %y %umul.ov = extractvalue {i4, i1} %umul, 1 %umul.ov.not = xor %umul.ov, -1 %retval.0 = or i1 %iszero, %umul.ov.not ret i1 %umul.ov.not Done: 1 Optimization is correct! ``` Note that this is inverted from what we have in a previous patch, here we are looking for the inverted overflow bit. And that inversion is kinda problematic - given this particular pattern we neither hoist that `not` closer to `ret` (then the pattern would have been identical to the one without inversion, and would have been handled by the previous patch), neither do the opposite transform. But regardless, we should handle this too. I've filled [[ https://bugs.llvm.org/show_bug.cgi?id=42720 | PR42720 ]]. Reviewers: nikic, spatel, xbolva00, RKSimon Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65151 llvm-svn: 370351	2019-08-29 12:48:04 +00:00
Roman Lebedev	aaf6ab4410	[InstSimplify] Drop leftover "division-by-zero guard" around `@llvm.umul.with.overflow` overflow bit Summary: Now that with D65143/D65144 we've produce `@llvm.umul.with.overflow`, and with D65147 we've flattened the CFG, we now can see that the guard may have been there to prevent division by zero is redundant. We can simply drop it: ``` ---------------------------------------- Name: no overflow and not zero %iszero = icmp ne i4 %y, 0 %umul = umul_overflow i4 %x, %y %umul.ov = extractvalue {i4, i1} %umul, 1 %retval.0 = and i1 %iszero, %umul.ov ret i1 %retval.0 => %iszero = icmp ne i4 %y, 0 %umul = umul_overflow i4 %x, %y %umul.ov = extractvalue {i4, i1} %umul, 1 %retval.0 = and i1 %iszero, %umul.ov ret %umul.ov Done: 1 Optimization is correct! ``` Reviewers: nikic, spatel, xbolva00 Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65150 llvm-svn: 370350	2019-08-29 12:47:50 +00:00
Bjorn Pettersson	d218a3326e	[InstSimplify] Report "Changed" also when only deleting dead instructions Summary: Make sure that we report that changes has been made by InstSimplify also in situations when only trivially dead instructions has been removed. If for example a call is removed the call graph must be updated. Bug seem to have been introduced by llvm-svn r367173 (commit `02b9e45a7e`), since the code in question was rewritten in that commit. Reviewers: spatel, chandlerc, foad Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65973 llvm-svn: 368401	2019-08-09 07:08:25 +00:00
Craig Topper	66c08430f6	[ValueTracking] When calculating known bits for integer abs, make sure we're looking at a negate and not just any instruction with the nsw flag set. The matchSelectPattern code can match patterns like (x >= 0) ? x : -x for absolute value. But it can also match ((x-y) >= 0) ? (x-y) : (y-x). If the latter form was matched we can only use the nsw flag if its set on both subtracts. This match makes sure we're looking at the former case only. Differential Revision: https://reviews.llvm.org/D65692 llvm-svn: 368195	2019-08-07 18:28:16 +00:00
Craig Topper	aa2810b6e7	[InstSimplify] Add test case to show bad sign bit handling for integer abs idiom in computeKnownBits. computeKnownBits will indicate the sign bit of abs is 0 if the the RHS operand returned by matchSelectPattern has the nsw flag set. For abs idioms like (X >= 0) ? X : -X, the RHS returns -X. But we can also match ((X-Y) >= 0 ? X-Y : Y-X as abs. In this case RHS will be the Y-X operand. According to Alive, the sign bit for this is only 0 if both the X-Y and Y-X operands have the nsw flag. But we're only checking the Y-X operand. llvm-svn: 367747	2019-08-03 02:54:54 +00:00
Sanjay Patel	02b9e45a7e	[InstSimplify] remove quadratic time looping (PR42771) The test case from: https://bugs.llvm.org/show_bug.cgi?id=42771 ...shows a ~30x slowdown caused by the awkward loop iteration (rL207302) that is seemingly done just to avoid invalidating the instruction iterator. We can instead delay instruction deletion until we reach the end of the block (or we could delay until we reach the end of all blocks). There's a test diff here for a degenerate case with llvm.assume that is not meaningful in itself, but serves to verify this change in logic. This change probably doesn't result in much overall compile-time improvement because we call '-instsimplify' as a standalone pass only once in the standard -O2 opt pipeline currently. Differential Revision: https://reviews.llvm.org/D65336 llvm-svn: 367173	2019-07-27 14:05:51 +00:00
Roman Lebedev	4153f17181	[InstSimplify][NFC] Tests for skipping 'div-by-0' checks before inverted @llvm.umul.with.overflow It would be already handled by the non-inverted case if we were hoisting the `not` in InstCombine, but we don't (granted, we don't sink it in this case either), so this is a separate case. llvm-svn: 366801	2019-07-23 12:42:49 +00:00
Roman Lebedev	0689427280	[InstSimplify][NFC] Tests for skipping 'div-by-0' checks before @llvm.umul.with.overflow These may remain after @llvm.umul.with.overflow was canonicalized from the code that was originally doing the check via division. llvm-svn: 366751	2019-07-22 22:09:11 +00:00
Michael Liao	543ba4e9e0	[InstructionSimplify] Apply sext/trunc after pointer stripping Summary: - As the pointer stripping could trace through `addrspacecast` now, need to sext/trunc the offset to ensure it has the same width as the pointer after stripping. Reviewers: jdoerfert Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64768 llvm-svn: 366162	2019-07-16 01:03:06 +00:00
David Bolvansky	5dca95bc4e	[NFC] Revisited tests for D64285 llvm-svn: 365815	2019-07-11 19:39:20 +00:00
David Bolvansky	e195a91d2d	[NFC] Updated tests for D64285 llvm-svn: 365765	2019-07-11 12:51:33 +00:00
David Bolvansky	901d91e5f0	[NFC] Fixed tests llvm-svn: 365506	2019-07-09 15:31:36 +00:00
David Bolvansky	e625eb9def	[NFC] Added tests for D64285 llvm-svn: 365501	2019-07-09 15:12:01 +00:00

615 Commits
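
The four Alive patterns quoted in the message for 64965430db (PR43246) can also be sanity-checked by brute force. The following is a minimal sketch, not part of the LLVM tree: it assumes a 4-bit width instead of the `i64` used in the commit so that every value can be enumerated, models `icmp ule`/`icmp ugt` with Python comparisons on non-negative integers, and uses a hypothetical function name `check_folds`.

```python
from itertools import product

BITS = 4  # assumption: small width for exhaustive enumeration (the commit uses i64)


def check_folds(bits: int = BITS) -> bool:
    """Return True if all four folds hold for every nonzero C and every Y."""
    limit = 1 << bits
    # Precondition of every pattern: C != 0, so C starts at 1.
    for c, y in product(range(1, limit), range(limit)):
        no_overflow = c <= y   # icmp ule C, %y
        overflow = c > y       # icmp ugt C, %y
        y_nonnull = y != 0     # icmp ne %y, 0
        y_null = y == 0        # icmp eq %y, 0

        # C <= Y && Y != 0  -->  C <= Y
        if (y_nonnull and no_overflow) != no_overflow:
            return False
        # C <= Y || Y != 0  -->  Y != 0
        if (y_nonnull or no_overflow) != y_nonnull:
            return False
        # C > Y || Y == 0  -->  C > Y
        if (y_null or overflow) != overflow:
            return False
        # C > Y && Y == 0  -->  Y == 0
        if (y_null and overflow) != y_null:
            return False
    return True
```

Calling `check_folds()` enumerates all 15 × 16 pairs at 4 bits. This is only an illustration of why the folds are sound under the `C != 0` precondition; the Alive links in the commit messages remain the actual proofs.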