llvm-project

Commit Graph

Author	SHA1	Message	Date
Nikita Popov	566a66703f	[InstSimplify] Add test for expand binop undef issue (NFC) Add test case from https://reviews.llvm.org/D83360#2146539.	2020-08-10 22:39:59 +02:00
Sanjay Patel	250a167c41	[InstSimplify] avoid crashing by trying to rem-by-zero Bug was noted in the post-commit comments for: rGe8760bb9a8a3	2020-08-06 16:06:31 -04:00
Sanjay Patel	60f2c6a94c	[PatternMatch] allow intrinsic form of min/max with existing matchers I skimmed the existing users of these matchers and don't see any problems (eg, the caller assumes the matched value was a select instruction without checking). So I think we can generalize the matching to allow the new intrinsics or the cmp+select idioms. I did not find any unit tests for the matchers, so added some basics there. The instsimplify tests are adapted from existing tests for the cmp+select pattern and cover the folds in simplifyICmpWithMinMax(). Differential Revision: https://reviews.llvm.org/D85230	2020-08-06 10:50:24 -04:00
Sanjay Patel	e8760bb9a8	[InstSimplify] fold icmp with mul nsw and constant operands https://rise4fun.com/Alive/slvl Name: mul nsw with icmp eq Pre: (C2 % C1) != 0 %a = mul nsw i8 %x, C1 %r = icmp eq i8 %a, C2 => %r = false Name: mul nsw with icmp ne Pre: (C2 % C1) != 0 %a = mul nsw i8 %x, C1 %r = icmp ne i8 %a, C2 => %r = true Follow-up to the 'nuw' variation added with: rGf879c9b79621	2020-08-05 14:38:39 -04:00
Sanjay Patel	f879c9b796	[InstSimplify] fold icmp with mul nuw and constant operands https://rise4fun.com/Alive/pZEr Name: mul nuw with icmp eq Pre: (C2 %u C1) != 0 %a = mul nuw i8 %x, C1 %r = icmp eq i8 %a, C2 => %r = false Name: mul nuw with icmp ne Pre: (C2 %u C1) != 0 %a = mul nuw i8 %x, C1 %r = icmp ne i8 %a, C2 => %r = true There are potentially several other transforms we need to add based on: D51625 ...but it doesn't look like there was follow-up to that patch.	2020-08-05 14:32:17 -04:00
Sanjay Patel	a569a0af0d	[InstSimplify] add vector tests for icmp with mul nuw; NFC Also, the naming was off on a couple of tests.	2020-08-05 14:32:17 -04:00
Sanjay Patel	719954eacb	[InstSimplify] add tests for icmp with 'mul nuw' operand; NFC	2020-08-05 12:46:45 -04:00
Xavier Denis	29fe3fe615	[InstSimplify] Peephole optimization for icmp (urem X, Y), X This revision adds the following peephole optimization and it's negation: %a = urem i64 %x, %y %b = icmp ule i64 %a, %x ====> %b = true With John Regehr's help this optimization was checked with Alive2 which suggests it should be valid. This pattern occurs in the bound checks of Rust code, the program const N: usize = 3; const T = u8; pub fn split_mutiple(slice: &[T]) -> (&[T], &[T]) { let len = slice.len() / N; slice.split_at(len * N) } the method call slice.split_at will check that len * N is within the bounds of slice, this bounds check is after some transformations turned into the urem seen above and then LLVM fails to optimize it any further. Adding this optimization would cause this bounds check to be fully optimized away. ref: https://github.com/rust-lang/rust/issues/74938 Differential Revision: https://reviews.llvm.org/D85092	2020-08-04 20:48:37 +02:00
Xavier Denis	b778b04b69	[InstSimplify] Add tests for icmp with urem divisor (NFC)	2020-08-04 20:45:20 +02:00
Sanjay Patel	960cef75f4	[InstSimplify] add tests for compare of min/max; NFC The test are adapted from the existing tests for cmp/select idioms.	2020-08-04 13:55:30 -04:00
Sanjay Patel	04e45ae1c6	[InstSimplify] fold nested min/max intrinsics with constant operands This is based on the existing code for the non-intrinsic idioms in InstCombine. The vector constant constraint is non-obvious: undefs should be ok in the outer call, but they can't propagate safely from the inner call in all cases. Example: https://alive2.llvm.org/ce/z/-2bVbM define <2 x i8> @src(<2 x i8> %x) { %0: %m = umin <2 x i8> %x, { 7, undef } %m2 = umin <2 x i8> { 9, 9 }, %m ret <2 x i8> %m2 } => define <2 x i8> @tgt(<2 x i8> %x) { %0: %m = umin <2 x i8> %x, { 7, undef } ret <2 x i8> %m } Transformation doesn't verify! ERROR: Value mismatch Example: <2 x i8> %x = < undef, undef > Source: <2 x i8> %m = < #x00 (0) [based on undef value], #x00 (0) > <2 x i8> %m2 = < #x00 (0), #x00 (0) > Target: <2 x i8> %m = < #x07 (7), #x10 (16) > Source value: < #x00 (0), #x00 (0) > Target value: < #x07 (7), #x10 (16) >	2020-08-04 08:44:48 -04:00
Sanjay Patel	011e15bea3	[InstSimplify] add tests for min/max with constants; NFC	2020-08-04 08:02:33 -04:00
Sanjay Patel	9e5cf6bde5	[InstSimplify] fold variations of max-of-min with common operand https://alive2.llvm.org/ce/z/ZtxpZ3	2020-08-03 15:02:46 -04:00
Sanjay Patel	7efd9ceb58	[InstSimplify] add tests for min-of-max variants; NFC	2020-08-03 15:02:46 -04:00
Sanjay Patel	4abc69c6f5	[InstSimplify] fold max (max X, Y), X --> max X, Y https://alive2.llvm.org/ce/z/VGgG3M	2020-08-02 11:50:58 -04:00
Sanjay Patel	e37987563a	[InstSimplify] add tests for max(max x,y), x) and variants; NFC	2020-08-02 11:50:47 -04:00
Craig Topper	85b5315dbe	[InstSimplify] Fold abs(abs(x)) -> abs(x) It's always safe to pick the earlier abs regardless of the nsw flag. We'll just lose it if it is on the outer abs but not the inner abs. Differential Revision: https://reviews.llvm.org/D85053	2020-08-01 13:25:00 -07:00
Sanjay Patel	04b99a4d18	[InstSimplify] simplify abs if operand is known non-negative abs() should be rare enough that using value tracking is not going to be a compile-time cost burden, so use it to reduce a variety of potential patterns. We do this in DAGCombiner too. Differential Revision: https://reviews.llvm.org/D85043	2020-08-01 07:47:06 -04:00
Sanjay Patel	1aa52d67d1	[InstSimplify] add abs test with assume; NFC	2020-08-01 07:47:06 -04:00
Sanjay Patel	77a02527dc	[InstSimplify] add tests for abs intrinsic; NFC	2020-07-31 18:49:13 -04:00
Sanjay Patel	fef513f5cc	[InstSimplify] fold min/max intrinsic with undef operand	2020-07-29 17:03:50 -04:00
Sanjay Patel	5cd695dd7f	[InstSimplify] fold min/max with opposite of limit value	2020-07-29 17:03:50 -04:00
Sanjay Patel	3e8534fbc6	[InstSimplify] allow partial undef constants for vector min/max folds	2020-07-29 11:53:41 -04:00
Sanjay Patel	3c20ede18b	[InstSimplify] fold integer min/max intrinsic with same args	2020-07-29 11:53:41 -04:00
Juneyoung Lee	672df0fc67	[InstSimplify] add tests for expandCommutativeBinOp; NFC	2020-07-29 23:21:39 +09:00
Sanjay Patel	3fb13b8484	[InstSimplify] allow undefs in icmp with vector constant folds This is the main icmp simplification shortcoming seen in D84655. Alive2 agrees that the basic examples are correct at least: define <2 x i1> @src(<2 x i8> %x) { %0: %r = icmp sle <2 x i8> { undef, 128 }, %x ret <2 x i1> %r } => define <2 x i1> @tgt(<2 x i8> %x) { %0: ret <2 x i1> { 1, 1 } } Transformation seems to be correct! define <2 x i1> @src(<2 x i32> %X) { %0: %A = or <2 x i32> %X, { 63, 63 } %B = icmp ult <2 x i32> %A, { undef, 50 } ret <2 x i1> %B } => define <2 x i1> @tgt(<2 x i32> %X) { %0: ret <2 x i1> { 0, 0 } } Transformation seems to be correct! https://alive2.llvm.org/ce/z/omt2ee https://alive2.llvm.org/ce/z/GW4nP_ Differential Revision: https://reviews.llvm.org/D84762	2020-07-28 15:13:53 -04:00
Sanjay Patel	496fc3f196	[InstSimplify] add tests for icmp with partial undef constant; NFC	2020-07-28 15:00:33 -04:00
Sanjay Patel	1ebcf03551	[InstSimplify] add tests for min/max intrinsics; NFC	2020-07-27 08:26:27 -04:00
Sanjay Patel	0481e1ae3c	[InstSimplify] fold integer min/max intrinsics with limit constant	2020-07-26 09:41:54 -04:00
Sanjay Patel	c6cf71107a	[InstSimplify] add tests for min/max intrinsics; NFC	2020-07-26 09:04:37 -04:00
Sanjay Patel	b89ae102e6	[InstSimplify] fold fcmp using isKnownNeverInfinity + isKnownNeverNaN Follow-up to D84035 / rG7393d7574c09. This sidesteps a question of FMF/poison on fcmp raised in PR46077: http://bugs.llvm.org/PR46077 https://alive2.llvm.org/ce/z/TCsyzD define i1 @src(float %x) { %0: %x42 = fadd nnan ninf float %x, 42.000000 %r = fcmp ueq float %x42, inf ret i1 %r } => define i1 @tgt(float %x) { %0: ret i1 0 } Transformation seems to be correct! https://alive2.llvm.org/ce/z/FQaH7a define i1 @src(i8 %x) { %0: %cast = uitofp i8 %x to float %r = fcmp one float inf, %cast ret i1 %r } => define i1 @tgt(i8 %x) { %0: ret i1 1 } Transformation seems to be correct!	2020-07-26 09:04:37 -04:00
Sanjay Patel	912e9e5262	[InstSimplify] add tests for fcmp with infinity constant; NFC	2020-07-26 09:04:36 -04:00
Juneyoung Lee	9f074214b7	[ValueTracking] Instruction::isBinaryOp should be used for constexprs This is a simple patch that makes canCreateUndefOrPoison use Instruction::isBinaryOp because BinaryOperator inherits Instruction. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D84596	2020-07-26 21:48:51 +09:00
Juneyoung Lee	02dadab1b4	NFC; add an example that subtracts pointers to two global vars	2020-07-26 20:47:33 +09:00
Juneyoung Lee	ace0bf7490	[ValueTracking] Fix incorrect handling of canCreateUndefOrPoison .. in isGuaranteedNotToBeUndefOrPoison. This caused early exit of isGuaranteedNotToBeUndefOrPoison, making it return imprecise result. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D84251	2020-07-22 09:31:16 +09:00
Juneyoung Lee	30201d3b61	[ValueTracking] Let isGuaranteedNotToBeUndefOrPoison use canCreateUndefOrPoison This patch adds support more operations. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D83926	2020-07-20 09:21:39 +09:00
Sanjay Patel	7393d7574c	[InstSimplify] fold fcmp with infinity constant using isKnownNeverInfinity This is a step towards trying to remove unnecessary FP compares with infinity when compiling with -ffinite-math-only or similar. I'm intentionally not checking FMF on the fcmp itself because I'm assuming that will go away eventually. The analysis part of this was added with rGcd481136 for use with isKnownNeverNaN. Similarly, that could be an enhancement here to get predicates like 'one' and 'ueq'. Differential Revision: https://reviews.llvm.org/D84035	2020-07-19 09:24:52 -04:00
Sanjay Patel	acbc688263	[InstSimplify] add tests for fcmp with infinity; NFC	2020-07-17 11:51:41 -04:00
Juneyoung Lee	582901d0b5	[ValueTracking] Let isGuaranteedNotToBeUndefOrPoison consider noundef This patch adds support for noundef arguments. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D83752	2020-07-17 12:53:08 +09:00
Juneyoung Lee	cd4953246b	Add a test for D83752	2020-07-17 12:50:40 +09:00
Matt Arsenault	0347039a6e	ValueTracking: Fix isKnownNonZero for non-0 null pointers for byval The IR doesn't have a proper concept of invalid pointers, and "null" constants are just all zeros (though it really needs one). I think it's not possible to break this for AMDGPU due to the copy semantics of byval. If you have an original stack object at 0, the byval copy will be placed above it so I don't think it's really possible to hit a 0 address.	2020-07-16 13:50:49 -04:00
Craig Topper	00f3579aea	Revert "[InstSimplify] Remove select ?, undef, X -> X and select ?, X, undef -> X transforms" and subsequent patches This reverts most of the following patches due to reports of miscompiles. I've left the added test cases with comments updated to be FIXMEs. `1cf6f210a2` [IR] Disable select ? C : undef -> C fold in ConstantFoldSelectInstruction unless we know C isn't poison. `469da663f2` [InstSimplify] Re-enable select ?, undef, X -> X transform when X is provably not poison `122b0640fc` [InstSimplify] Don't fold vectors of partial undef in SimplifySelectInst if the non-undef element value might produce poison `ac0af12ed2` [InstSimplify] Add test cases for opportunities to fold select ?, X, undef -> X when we can prove X isn't poison `9b1e95329a` [InstSimplify] Remove select ?, undef, X -> X and select ?, X, undef -> X transforms	2020-07-15 22:02:33 -07:00
Sanjay Patel	e6c016420c	[ValueTracking] fix library to intrinsic mapping to respect 'nobuiltin' attribute This is another problem raised in: http://bugs.llvm.org/PR46627	2020-07-14 10:04:24 -04:00
Sanjay Patel	9300de4d1c	[InstSimplify] add test with nobuiltin attribute (PR46627); NFC	2020-07-14 10:04:24 -04:00
Sanjay Patel	34d35d4a42	[ValueTracking] fix miscompile in maxnum case of cannotBeOrderedLessThanZeroImpl (PR46627) A miscompile with -0.0 is shown in: http://bugs.llvm.org/PR46627 This is because maxnum(-0.0, +0.0) does not specify a fixed result: http://llvm.org/docs/LangRef.html#llvm-maxnum-intrinsic So we need to tighten the constraints for when it is ok to say the result of maxnum is positive (including +0.0). Differential Revision: https://reviews.llvm.org/D83601	2020-07-14 08:08:09 -04:00
Sanjay Patel	9cc669d22d	[InstCombine][InstSimplify] add tests for sign of maxnum; NFC More coverage for D83601.	2020-07-14 08:08:09 -04:00
Sanjay Patel	351f2b3c0a	[InstSimplify] add tests for maxnum (PR46627); NFC	2020-07-10 20:20:38 -04:00
Craig Topper	1cf6f210a2	[IR] Disable select ? C : undef -> C fold in ConstantFoldSelectInstruction unless we know C isn't poison. This matches the recent change to InstSimplify from D83440. Differential Revision: https://reviews.llvm.org/D83535	2020-07-10 10:42:25 -07:00
Craig Topper	469da663f2	[InstSimplify] Re-enable select ?, undef, X -> X transform when X is provably not poison Follow up from the transform being removed in D83360. If X is probably not poison, then the transform is safe. Still plan to remove or adjust the code from ConstantFolding after this. Differential Revision: https://reviews.llvm.org/D83440	2020-07-09 12:21:03 -07:00
Craig Topper	122b0640fc	[InstSimplify] Don't fold vectors of partial undef in SimplifySelectInst if the non-undef element value might produce poison We can't fold to the non-undef value unless we know it isn't poison. So check each element with isGuaranteedNotToBeUndefOrPoison. This currently rules out all constant expressions. Differential Revision: https://reviews.llvm.org/D83442	2020-07-09 11:01:12 -07:00
Craig Topper	ac0af12ed2	[InstSimplify] Add test cases for opportunities to fold select ?, X, undef -> X when we can prove X isn't poison Part of addressing post-commit feedback from D83360	2020-07-08 15:24:55 -07:00
Craig Topper	9b1e95329a	[InstSimplify] Remove select ?, undef, X -> X and select ?, X, undef -> X transforms As noted here https://lists.llvm.org/pipermail/llvm-dev/2016-October/106182.html and by alive2, this transform isn't valid. If X is poison this potentially propagates poison when it shouldn't. This same transform still exists in DAGCombiner. Differential Revision: https://reviews.llvm.org/D83360	2020-07-08 12:53:05 -07:00
Christopher Tetreault	021d56abb9	[SVE] Make Constant::getSplatValue work for scalable vector splats Summary: Make Constant::getSplatValue recognize scalable vector splats of the form created by ConstantVector::getSplat. Add unit test to verify that C == ConstantVector::getSplat(C)->getSplatValue() for fixed width and scalable vector splats Reviewers: efriedma, spatel, fpetrogalli, c-rhodes Reviewed By: efriedma Subscribers: sdesmalen, tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82416	2020-07-07 13:45:51 -07:00
Nikita Popov	3b671022e4	[InstSimplify] Simplify comparison between zext(x) and sext(x) This is picking up a loose thread from D69006: We can simplify (zext x) ule (sext x) and (zext x) sge (sext x) to true, with various permutations. Oddly, SCEV knows about this identity, but nothing on the IR level does. Differential Revision: https://reviews.llvm.org/D83081	2020-07-04 11:03:00 +02:00
Nikita Popov	93ccb8eb52	[InstSimplify] Add additional zext/sext comparison tests (NFC) Add vector variants, and negative tests where the operand does not match.	2020-07-04 11:03:00 +02:00
Nikita Popov	cf1d9f9f49	[InstSimplify] Fold icmp with dominating assume If we assume(x > y), then we should be able to fold the basic implications of that, like x >= y. This already happens if either one of the operands is constant (LVI) or if the conditions are exactly the same (GVN), but not if we have an implication with non-constant operands. Support this by querying AssumptionCache. Fixes https://bugs.llvm.org/show_bug.cgi?id=40149. Differential Revision: https://reviews.llvm.org/D82717	2020-07-03 18:53:58 +02:00
Nikita Popov	359345d609	[InstSimplify] Add test for sext/zext comparisons (NFC)	2020-07-02 22:21:59 +02:00
Nikita Popov	a59dc55c2a	[InstSimplify] Move assume icmp test (NFC) Move this test from InstCombine into InstSimplify.	2020-07-01 23:35:52 +02:00
David Sherwood	ee26a31e7b	[SVE] Make ConstantFoldGetElementPtr work for scalable vectors of indices This patch fixes a compiler crash that was hit when trying to simplify the following code: getelementptr [2 x i64], [2 x i64]* null, i64 0, <vscale x 2 x i64> zeroinitializer For the case where we have a null pointer value like above, we just need to ensure we don't assume the indices are always fixed width. Differential Revision: https://reviews.llvm.org/D82183	2020-06-25 07:28:19 +01:00
Roman Lebedev	0c22147027	[NFCI][InstSimplify] Add CHECK-LABEL to new icmp.ll test	2020-06-25 01:10:35 +03:00
Roman Lebedev	2b8d706b19	[IR] GetUnderlyingObject(), stripPointerCastsAndOffsets(): don't crash on `bitcast <1 x i8> to i8` I'm not sure how to write standalone tests for each of two changes here. If either one of these two fixes is missing, the test fill crash.	2020-06-25 00:58:53 +03:00
Eli Friedman	90ad786947	[IR] Prefer scalar type for struct indexes in GEP constant expressions. This has two advantages: one, it's simpler, and two, it doesn't require heroic pattern matching with scalable vectors. Also includes a small fix to DataLayout to allow the scalable vector testcase to work correctly. Differential Revision: https://reviews.llvm.org/D82061	2020-06-23 16:14:36 -07:00
Sanjay Patel	2ad42c2653	[ValueTracking] improve analysis for fdiv with same operands (The 'nnan' variant of this pattern is already tested to produce '1.0'.) https://alive2.llvm.org/ce/z/D4hPBy define i1 @src(float %x, i32 %y) { %0: %d = fdiv float %x, %x %uge = fcmp uge float %d, 0.000000 ret i1 %uge } => define i1 @tgt(float %x, i32 %y) { %0: ret i1 1 } Transformation seems to be correct!	2020-06-21 09:07:59 -04:00
Sanjay Patel	97c0232621	[InstSimplify] add test for fdiv signbit; NFC	2020-06-21 09:07:59 -04:00
Tyker	b7338fb1a6	[AssumeBundles] add cannonicalisation to the assume builder Summary: this reduces significantly the number of assumes generated without aftecting too much the information that is preserved. this improves the compile-time cost of enable-knowledge-retention significantly. Reviewers: jdoerfert, sstefan1 Reviewed By: jdoerfert Subscribers: hiraditya, asbirlea, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79650	2020-06-19 10:32:26 +02:00
Tyker	d7deef1206	Revert "[AssumeBundles] add cannonicalisation to the assume builder" This reverts commit `90c50cad19`.	2020-06-16 14:34:55 +02:00
Tyker	90c50cad19	[AssumeBundles] add cannonicalisation to the assume builder Summary: this reduces significantly the number of assumes generated without aftecting too much the information that is preserved. this improves the compile-time cost of enable-knowledge-retention significantly. Reviewers: jdoerfert, sstefan1 Reviewed By: jdoerfert Subscribers: hiraditya, asbirlea, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79650	2020-06-16 13:12:35 +02:00
Dorit Nuzman	a9fe69c359	[InstSimplify] fix bug in matching or-with-not op (PR46083)	2020-06-03 13:44:29 -04:00
Sanjay Patel	a26cd73d33	[InstSimplify] add/move tests for or with not op (PR46083); NFC	2020-06-03 08:13:36 -04:00
Serge Pavlov	4d20e31f73	[FPEnv] Intrinsic llvm.roundeven This intrinsic implements IEEE-754 operation roundToIntegralTiesToEven, and performs rounding to the nearest integer value, rounding halfway cases to even. The intrinsic represents the missed case of IEEE-754 rounding operations and now llvm provides full support of the rounding operations defined by the standard. Differential Revision: https://reviews.llvm.org/D75670	2020-05-26 19:24:58 +07:00
Nikita Popov	f89f7da999	[IR] Convert null-pointer-is-valid into an enum attribute The "null-pointer-is-valid" attribute needs to be checked by many pointer-related combines. To make the check more efficient, convert it from a string into an enum attribute. In the future, this attribute may be replaced with data layout properties. Differential Revision: https://reviews.llvm.org/D78862	2020-05-15 19:41:07 +02:00
Nikita Popov	5a2265647e	Reapply [InstSimplify] Remove known bits constant folding No changes relative to last time, but after a mitigation for an AMDGPU regression landed. --- If SimplifyInstruction() does not succeed in simplifying the instruction, it will compute the known bits of the instruction in the hope that all bits are known and the instruction can be folded to a constant. I have removed a similar optimization from InstCombine in D75801, and would like to drop this one as well. On average, we spend ~1% of total compile-time performing this known bits calculation. However, if we introduce some additional statistics for known bits computations and how many of them succeed in simplifying the instruction we get (on test-suite): instsimplify.NumKnownBits: 216 instsimplify.NumKnownBitsComputed: 13828375 valuetracking.NumKnownBitsComputed: 45860806 Out of ~14M known bits calculations (accounting for approximately one third of all known bits calculations), only 0.0015% succeed in producing a constant. Those cases where we do succeed to compute all known bits will get folded by other passes like InstCombine later. On test-suite, only lencod.test and GCC-C-execute-pr44858.test show a hash difference after this change. On lencod we see an improvement (a loop phi is optimized away), on the GCC torture test a regression (a function return value is determined only after IPSCCP, preventing propagation from a noinline function.) There are various regressions in InstSimplify tests. However, all of these cases are already handled by InstCombine, and corresponding tests have already been added there. Differential Revision: https://reviews.llvm.org/D79294	2020-05-08 10:24:53 +02:00
Sanjay Patel	a954b8a363	[ValueTracking] fix CannotBeNegativeZero() to disregard 'nsz' FMF The 'nsz' flag is different than 'nnan' or 'ninf' in that it does not create poison. Make that explicit in the LangRef and fix ValueTracking analysis that misinterpreted the definition. This manifests as bugs in InstSimplify shown in the test diffs and as discussed in PR45778: https://bugs.llvm.org/show_bug.cgi?id=45778 Differential Revision: https://reviews.llvm.org/D79422	2020-05-05 16:04:59 -04:00
Nikita Popov	46ee652c70	Revert "[InstSimplify] Remove known bits constant folding" This reverts commit `08556afc54`. This breaks some AMDGPU tests.	2020-05-03 20:45:10 +02:00
Nikita Popov	08556afc54	[InstSimplify] Remove known bits constant folding If SimplifyInstruction() does not succeed in simplifying the instruction, it will compute the known bits of the instruction in the hope that all bits are known and the instruction can be folded to a constant. I have removed a similar optimization from InstCombine in D75801, and would like to drop this one as well. On average, we spend ~1% of total compile-time performing this known bits calculation. However, if we introduce some additional statistics for known bits computations and how many of them succeed in simplifying the instruction we get (on test-suite): instsimplify.NumKnownBits: 216 instsimplify.NumKnownBitsComputed: 13828375 valuetracking.NumKnownBitsComputed: 45860806 Out of ~14M known bits calculations (accounting for approximately one third of all known bits calculations), only 0.0015% succeed in producing a constant. Those cases where we do succeed to compute all known bits will get folded by other passes like InstCombine later. On test-suite, only lencod.test and GCC-C-execute-pr44858.test show a hash difference after this change. On lencod we see an improvement (a loop phi is optimized away), on the GCC torture test a regression (a function return value is determined only after IPSCCP, preventing propagation from a noinline function.) There are various regressions in InstSimplify tests. However, all of these cases are already handled by InstCombine, and corresponding tests have already been added there. Differential Revision: https://reviews.llvm.org/D79294	2020-05-03 20:26:58 +02:00
Sanjay Patel	57f0eed98d	[InstSimplify] allow insertelement-with-undef fold if poison-safe The more general fold was not poison-safe, so it was removed: rG5486e00 ...but it is ok to have this transform if analysis can determine the vector contains no poison. The test shows a simple example of that: constant integer elements are not poison.	2020-05-01 10:34:29 -04:00
Sanjay Patel	c79a366ec0	[InstSimplify] update test; NFC Missed this test diff when committing: rG5486e00dc3	2020-05-01 10:06:56 -04:00
Sanjay Patel	5486e00dc3	[InstSimplify] remove poison-unsafe insertelement of undef value PR45481: https://bugs.llvm.org/show_bug.cgi?id=45481 SDAG has an identical transform to this, so there's little chance of any real-world impact. OTOH, that means we are effectively sweeping the bug out of sight because poison exists in codegen too.	2020-05-01 09:22:05 -04:00
Juneyoung Lee	f5677fe700	[ValueTracking] Let isGuaranteedNotToBeUndefOrPoison look into more constants/instructions Summary: This patch helps isGuaranteedNotToBeUndefOrPoison look into more constants and instructions (bitcast/alloca/gep/fcmp). To deal with bitcast, Depth is added to isGuaranteedNotToBeUndefOrPoison. This patch is splitted from https://reviews.llvm.org/D75808. Checked with Alive2 Reviewers: reames, jdoerfert Reviewed By: jdoerfert Subscribers: sanwou01, spatel, llvm-commits, hiraditya Tags: #llvm Differential Revision: https://reviews.llvm.org/D76010	2020-04-25 23:29:54 +09:00
Sanjay Patel	e86eff0e82	[InstSimplify] fold and/or of compares with equality to min/max constant I found 12 (6 if we compress the DeMorganized forms) patterns for logic-of-compares with a min/max constant while looking at PR45510: https://bugs.llvm.org/show_bug.cgi?id=45510 The variations on those forms multiply the test cases by 8 (unsigned/signed, swapped compare operands, commuted logic operands). We have partial logic to deal with these for the unsigned min (zero) case, but missed everything else. We are deferring the majority of these patterns to InstCombine to allow more general handling (see D78582). We could use ConstantRange instead of predicate+constant matching here. I don't expect there's any noticeable compile-time impact for either form. Here's an abuse of Alive2 to show the 12 basic signed variants of the patterns in one function: http://volta.cs.utah.edu:8080/z/5Vpiyg declare void @use(i1, i1, i1, i1, i1, i1, i1, i1, i1, i1, i1, i1) define void @src(i8 %x, i8 %y) { %m1 = icmp eq i8 %x, 127 %c1 = icmp slt i8 %x, %y %r1 = and i1 %m1, %c1 ; (X == MAX) && (X < Y) --> false %m2 = icmp ne i8 %x, 127 %c2 = icmp sge i8 %x, %y %r2 = or i1 %m2, %c2 ; (X != MAX) \|\| (X >= Y) --> true %m3 = icmp eq i8 %x, -128 %c3 = icmp sgt i8 %x, %y %r3 = and i1 %m3, %c3 ; (X == MIN) && (X > Y) --> false %m4 = icmp ne i8 %x, -128 %c4 = icmp sle i8 %x, %y %r4 = or i1 %m4, %c4 ; (X != MIN) \|\| (X <= Y) --> true %m5 = icmp eq i8 %x, 127 %c5 = icmp sge i8 %x, %y %r5 = and i1 %m5, %c5 ; (X == MAX) && (X >= Y) --> X == MAX %m6 = icmp ne i8 %x, 127 %c6 = icmp slt i8 %x, %y %r6 = or i1 %m6, %c6 ; (X != MAX) \|\| (X < Y) --> X != MAX %m7 = icmp eq i8 %x, -128 %c7 = icmp sle i8 %x, %y %r7 = and i1 %m7, %c7 ; (X == MIN) && (X <= Y) --> X == MIN %m8 = icmp ne i8 %x, -128 %c8 = icmp sgt i8 %x, %y %r8 = or i1 %m8, %c8 ; (X != MIN) \|\| (X > Y) --> X != MIN %m9 = icmp ne i8 %x, 127 %c9 = icmp slt i8 %x, %y %r9 = and i1 %m9, %c9 ; (X != MAX) && (X < Y) --> X < Y %m10 = icmp eq i8 %x, 127 %c10 = icmp sge i8 %x, %y %r10 = or i1 %m10, %c10 ; (X == MAX) \|\| (X >= Y) --> X >= Y %m11 = icmp ne i8 %x, -128 %c11 = icmp sgt i8 %x, %y %r11 = and i1 %m11, %c11 ; (X != MIN) && (X > Y) --> X > Y %m12 = icmp eq i8 %x, -128 %c12 = icmp sle i8 %x, %y %r12 = or i1 %m12, %c12 ; (X == MIN) \|\| (X <= Y) --> X <= Y call void @use(i1 %r1, i1 %r2, i1 %r3, i1 %r4, i1 %r5, i1 %r6, i1 %r7, i1 %r8, i1 %r9, i1 %r10, i1 %r11, i1 %r12) ret void } define void @tgt(i8 %x, i8 %y) { %m5 = icmp eq i8 %x, 127 %m6 = icmp ne i8 %x, 127 %m7 = icmp eq i8 %x, -128 %m8 = icmp ne i8 %x, -128 %c9 = icmp slt i8 %x, %y %c10 = icmp sge i8 %x, %y %c11 = icmp sgt i8 %x, %y %c12 = icmp sle i8 %x, %y call void @use(i1 0, i1 1, i1 0, i1 1, i1 %m5, i1 %m6, i1 %m7, i1 %m8, i1 %c9, i1 %c10, i1 %c11, i1 %c12) ret void } Differential Revision: https://reviews.llvm.org/D78430	2020-04-23 09:16:10 -04:00
Sanjay Patel	cf30aafa2d	[Analysis] recognize the 'null' pointer constant as not poison Differential Revision: https://reviews.llvm.org/D78575	2020-04-21 14:23:06 -04:00
Sanjay Patel	a2eb55de99	[InstSimplify] add tests for logic+icmp folds for nullptr; NFC See discussion in D78430.	2020-04-19 10:42:08 -04:00
Sanjay Patel	02b070ed49	[InstSimplify] add tests for logic-of-icmp with min/max constant; NFC See PR45510: https://bugs.llvm.org/show_bug.cgi?id=45510 We had partial coverage for some of these patterns, so removing duplicate tests with the complete set in the new test file.	2020-04-19 08:24:38 -04:00
Sanjay Patel	c72f49cc57	[InstSimplify] add test for select that should not be simplified; NFC See discussion in D77868	2020-04-14 13:57:30 -04:00
Sanjay Patel	73bebc9445	[InstSimplify] add tests for folding bool select to logic; NFC	2020-04-10 09:08:00 -04:00
Simon Pilgrim	6d24dd7ed1	[InstSimplify] Regenerate compares tests to fix issue reported on D77354	2020-04-03 17:34:56 +01:00
Serge Pavlov	f398739152	[FEnv] Constfold some unary constrained operations This change implements constant folding to constrained versions of intrinsics, implementing rounding: floor, ceil, trunc, round, rint and nearbyint. Differential Revision: https://reviews.llvm.org/D72930	2020-03-28 12:28:33 +07:00
Sanjay Patel	88b493a838	[ValueTracking] improve undef/poison analysis for constant vectors Differential Revision: https://reviews.llvm.org/D76702	2020-03-24 13:35:47 -04:00
Sanjay Patel	6c3c7a0dd6	[InstSimplify] add tests for freeze(constexpr); NFC	2020-03-24 11:39:19 -04:00
Sanjay Patel	58ec867a3b	[InstSimplify] add more tests for freeze(constant); NFC These should really be moved over to a ConstantFolding test file, but since this may overlap with the in-progress D76010 and similar tests already exist here, we can do that as a later cleanup.	2020-03-24 09:53:49 -04:00
Juneyoung Lee	5792c2236d	Add test cases that are addressed by D76010	2020-03-23 13:49:29 +09:00
Simon Pilgrim	7f764fa18f	[ValueTracking] Add some initial isKnownNonZero DemandedElts support (PR36319)	2020-03-20 13:29:00 +00:00
Simon Pilgrim	c1efdbcbe0	[ValueTracking] Add computeKnownBits DemandedElts support to shift instructions (PR36319)	2020-03-20 11:08:08 +00:00
Nikita Popov	a09ff56b5b	[Tests] Regenerate some test checks; NFC	2020-03-20 12:06:53 +01:00
Nikita Popov	0372768776	[InstCombine] Simplify calls with "returned" attribute If a call argument has the "returned" attribute, we can simplify the call to the value of that argument. This was already partially handled by InstSimplify/InstCombine for the case where the argument is an integer constant, and the result is thus known via known bits. The non-constant (or non-int) argument cases weren't handled though. This previously landed as an InstSimplify transform, but was reverted due to assertion failures when compiling the Linux kernel. The reason is that simplifying a call to another call breaks assumptions in call graph updating during inlining. As the code is not easy to fix, and there is no particularly strong motivation for having this in InstSimplify, the transform is only performed in InstCombine instead. Differential Revision: https://reviews.llvm.org/D75815	2020-03-20 10:23:39 +01:00
Simon Pilgrim	95b6f62efb	[InstSimplify] Add some vector shift tests to show lack of DemandedElts support	2020-03-19 22:09:51 +00:00
Simon Pilgrim	0b458d4dca	[ValueTracking] Add computeKnownBits DemandedElts support to ADD/SUB/MUL instructions (PR36319)	2020-03-19 12:41:29 +00:00
Simon Pilgrim	7ce7f78963	[InstSimplify] Add missing vector ADD+SUB tests to show lack of DemandedElts support	2020-03-19 11:27:27 +00:00
Simon Pilgrim	d259e31a17	[InstSimplify] Add missing vector MUL tests to show lack of DemandedElts support	2020-03-19 11:27:27 +00:00
Simon Pilgrim	99336bf95a	[ValueTracking] Add computeKnownBits DemandedElts support to masked add instructions (PR36319)	2020-03-18 21:50:56 +00:00
Simon Pilgrim	49bdfd888d	[InstSimplify] Add missing vector masked add tests to show lack of DemandedElts support	2020-03-18 21:04:54 +00:00
Simon Pilgrim	9d40292a64	[ValueTracking] Add computeKnownBits DemandedElts support to XOR instructions (PR36319)	2020-03-18 20:24:14 +00:00
Simon Pilgrim	47ce1406c8	[InstSimplify] Add missing vector OR test to show lack of DemandedElts support	2020-03-18 20:24:14 +00:00
Simon Pilgrim	6bdb0efa42	[InstSimplify] Regenerate OR tests	2020-03-18 20:24:13 +00:00
Simon Pilgrim	1010c44b4c	[ValueTracking] Add computeKnownBits DemandedElts support to EXTRACTELEMENT/OR/BSWAP/BITREVERSE instructions (PR36319) These are all covered by the bswap/bitreverse vector tests.	2020-03-18 18:49:58 +00:00
Simon Pilgrim	9c6458ecf8	[InstSimplify] Add bitreverse/bswap vector tests Shows missing DemandedElts support (PR36319)	2020-03-18 18:17:10 +00:00
Huihui Zhang	1bf0c99375	[ValueTracking][SVE] Fix isGEPKnownNonNull for scalable vector. Summary: DataLayout::getTypeAllocSize() return TypeSize. For cases where the scalable property doesn't matter, we should explicitly call getKnownMinSize() to avoid implicit type conversion to uint64_t, which is not valid for scalable vector type. Reviewers: sdesmalen, efriedma, apazos, reames Reviewed By: efriedma Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76260	2020-03-17 11:31:30 -07:00
Nico Weber	623cb95eb3	Revert "[InstSimplify] Simplify calls with "returned" attribute" This reverts commit `45555c3819`. Causes clang crashes in some causes, see comments on https://reviews.llvm.org/D75815 for details (including repro steps).	2020-03-16 15:21:30 -04:00
Huihui Zhang	0616e9964b	[InstSimplify][SVE] Fix SimplifyGEPInst for scalable vector. Summary: Skip folds that rely on DataLayout::getTypeAllocSize(). For scalable vector, only minimal type alloc size is known at compile-time. Reviewers: sdesmalen, efriedma, spatel, apazos Reviewed By: efriedma Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75892	2020-03-16 11:46:12 -07:00
Huihui Zhang	118abf2017	[SVE] Update API ConstantVector::getSplat() to use ElementCount. Summary: Support ConstantInt::get() and Constant::getAllOnesValue() for scalable vector type, this requires ConstantVector::getSplat() to take in 'ElementCount', instead of 'unsigned' number of element count. This change is needed for D73753. Reviewers: sdesmalen, efriedma, apazos, spatel, huntergr, willlovett Reviewed By: efriedma Subscribers: tschuett, hiraditya, rkruppe, psnobl, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74386	2020-03-12 13:22:41 -07:00
Sanjay Patel	a66dc755db	[InstSimplify] simplify FP ops harder with FMF (part 2) This is part of the IR sibling for: D75576 Related transform committed with: rG8ec71585719d	2020-03-12 09:53:20 -04:00
Sanjay Patel	8ec7158571	[InstSimplify] simplify FP ops harder with FMF This is part of the IR sibling for: D75576 (I'm splitting part of the transform as a separate commit to reduce risk. I don't know of any bugs that might be exposed by this improved folding, but it's hard to see those in advance...)	2020-03-12 09:13:28 -04:00
Sanjay Patel	d748e759d5	[InstSimplify] add tests for FP poison; NFC Adapted from codegen tests seen in D75576.	2020-03-12 08:32:04 -04:00
Huihui Zhang	8f52573962	[InstSimplify][SVE] Fix SimplifyInsert/ExtractElementInst for scalable vector. Summary: For scalable vector, index out-of-bound can not be determined at compile-time. The same apply for VectorUtil findScalarElement(). Add test cases to check the functionality of SimplifyInsert/ExtractElementInst for scalable vector. Reviewers: sdesmalen, efriedma, spatel, apazos Reviewed By: efriedma Subscribers: cameron.mcinally, tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75782	2020-03-11 15:09:56 -07:00
Nikita Popov	45555c3819	[InstSimplify] Simplify calls with "returned" attribute If a call argument has the "returned" attribute, we can simplify the call to the value of that argument. The "-inst-simplify" pass already handled this for the constant integer argument case via known bits, which is invoked in SimplifyInstruction. However, non-constant (or non-int) arguments are not handled at all right now. This addresses one of the regressions from D75801. Differential Revision: https://reviews.llvm.org/D75815	2020-03-09 18:53:47 +01:00
Nikita Popov	829d377a98	[InstSimplify] Don't simplify musttail calls As pointed out by jdoerfert on D75815, we must be careful when simplifying musttail calls: We can only replace the return value if we can eliminate the call entirely. As we can't make this guarantee for all consumers of InstSimplify, this patch disables simplification of musttail calls. Without this patch, musttail simplification currently results in module verification errors. Differential Revision: https://reviews.llvm.org/D75824	2020-03-09 18:46:56 +01:00
Nikita Popov	d2dab92f01	[InstSimplify] Add tests for "returned" attribute; NFC	2020-03-07 17:17:21 +01:00
Juneyoung Lee	d7267ee194	[ValueTracking] Let isGuaranteedNotToBeUndefOrPoison look into branch conditions of dominating blocks' terminators Summary: ``` br i1 c, BB1, BB2: BB1: use1(c) BB2: use2(c) ``` In BB1 and BB2, c is never undef or poison because otherwise the branch would have triggered UB. This is a resubmission of `952ad47` with crash fix of llvm/test/Transforms/LoopRotate/freeze-crash.ll. Checked with Alive2 Reviewers: xbolva00, spatel, lebedev.ri, reames, jdoerfert, nlopes, sanjoy Reviewed By: reames Subscribers: jdoerfert, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75401	2020-03-06 01:08:35 +09:00
Daniil Suchkov	3db48f9324	Revert "[ValueTracking] Let isGuaranteedNotToBeUndefOrPoison look into branch conditions of dominating blocks' terminators" That commit causes SIGSEGV on some simple tests. This reverts commit `952ad4701c`.	2020-03-05 16:32:36 +07:00
Nikita Popov	c6ff3c9bad	[InstSimplify] Constant fold icmp of gep InstSimplify can fold icmps of gep where the base pointers are the same and the offsets are constant. It does so by constructing a constant expression icmp and assumes that it gets folded -- but this doesn't actually happen, because GEP expressions can usually only be folded by the target-dependent constant folding layer. As such, we need to explicitly invoke it here. Differential Revision: https://reviews.llvm.org/D75407	2020-03-04 23:16:52 +01:00
Nikita Popov	a99b97b818	[InstSimplify] Add additional icmp of gep folding test; NFC	2020-03-04 18:27:01 +01:00
Nikita Popov	0940c32385	[InstSimplify] Regenerate compare.ll checks; NFC	2020-03-04 18:26:42 +01:00
Juneyoung Lee	952ad4701c	[ValueTracking] Let isGuaranteedNotToBeUndefOrPoison look into branch conditions of dominating blocks' terminators Summary: ``` br i1 c, BB1, BB2: BB1: use1(c) BB2: use2(c) ``` In BB1 and BB2, c is never undef or poison because otherwise the branch would have triggered UB. Checked with Alive2 Reviewers: xbolva00, spatel, lebedev.ri, reames, jdoerfert, nlopes, sanjoy Reviewed By: reames Subscribers: jdoerfert, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75401	2020-03-04 11:43:31 +09:00
Sanjay Patel	7ddbf802cf	[ConstantFold] add/move tests for FP with undef operand; NFC	2020-02-20 15:07:11 -05:00
Ehud Katz	2d77e0b9f2	Fix tests of constant folding of fp operations on NaN values Fix failures introduced due to change rG0b336b6048ae.	2020-01-21 21:48:07 +02:00
Sanjay Patel	da9c93f330	[InstSimplify] fold select of vector constants that include undef elements As mentioned in D72643, we'd like to be able to assert that any select of equivalent constants has been removed before we're deep into InstCombine. But there's a loophole in that assertion for vectors with undef elements that don't match exactly. This patch should close that gap. If we have undefs, we can't safely propagate those unless both constants elements for that lane are undef. Differential Revision: https://reviews.llvm.org/D72958	2020-01-20 08:48:32 -05:00
Sanjay Patel	a8b9c93601	[InstSimplify] add test for select of vector constants; NFC	2020-01-17 16:39:55 -05:00
Sanjay Patel	3ae38d95e6	[InstSimplify] add test for select of FP constants; NFC	2020-01-17 16:39:55 -05:00
Sanjay Patel	cfe2fab708	[InstSimplify] add tests for vector select; NFC	2020-01-14 08:41:06 -05:00
Sanjay Patel	26d2ace9e2	[InstSimplify] move tests for select from InstCombine; NFC InstCombine has transforms that would enable these simplifications in an indirect way, but those transforms are unsafe and likely to be removed.	2020-01-13 09:13:21 -05:00
Sanjay Patel	f53b38d12a	[InstSimplify] select Cond, true, false --> Cond This is step 1 of damage control assuming that we need to remove several over-reaching folds for select-of-booleans because they can cause miscompiles as shown in D72396. The scalar case seems obviously safe: https://rise4fun.com/Alive/jSj And I don't think there's any danger for vectors either - if the condition is poisoned, then the select must be poisoned too, so undef elements don't make any difference. Differential Revision: https://reviews.llvm.org/D72412	2020-01-09 09:04:20 -05:00
Sanjay Patel	0b8ce37d64	[InstSimplify] add tests for select of true/false; NFC	2020-01-08 16:22:48 -05:00
Roman Lebedev	047186cc98	[ValueTracking] isKnownNonZero() should take non-null-ness assumptions into consideration (PR43267) Summary: It is pretty common to assume that something is not zero. Even optimizer itself sometimes emits such assumptions (e.g. `addAssumeNonNull()` in `PromoteMemoryToRegister.cpp`). But we currently don't deal with such assumptions :) The only way `isKnownNonZero()` handles assumptions is by calling `computeKnownBits()` which calls `computeKnownBitsFromAssume()`. But `x != 0` does not tell us anything about set bits, it only says that there are some set bits. So naturally, `KnownBits` does not get populated, and we fail to make use of this assumption. I propose to deal with this special case by special-casing it via adding a `isKnownNonZeroFromAssume()` that returns boolean when there is an applicable assumption. While there, we also deal with other predicates, mainly if the comparison is with constant. Fixes [[ https://bugs.llvm.org/show_bug.cgi?id=43267 \| PR43267 ]]. Differential Revision: https://reviews.llvm.org/D71660	2019-12-20 01:47:57 +03:00
Roman Lebedev	34dd49c86a	[NFC][InstCombine] Add some more non-zero assumption variants (D71660) https://rise4fun.com/Alive/6yR	2019-12-19 21:08:54 +03:00
Sanjay Patel	6080387f13	[InstSimplify] fold splat of inserted constant to vector constant shuf (inselt ?, C, IndexC), undef, <IndexC, IndexC...> --> <C, C...> This is another missing shuffle fold pattern uncovered by the shuffle correctness fix from D70246. The problem was visible in the post-commit thread example, but we managed to overcome the limitation for that particular case with D71220. This is something like the inverse of the previous fix - there we didn't demand the inserted scalar, and here we are only demanding an inserted scalar. Differential Revision: https://reviews.llvm.org/D71488	2019-12-15 09:32:03 -05:00
Sanjay Patel	940600ae41	[InstSimplify] improve test coverage for insert+splat; NFC	2019-12-13 14:03:54 -05:00
Sanjay Patel	252d3b9805	[InstSimplify] add tests for insert constant + splat; NFC	2019-12-10 17:16:58 -05:00
Johannes Doerfert	a7d992c0f2	[ValueTracking] Allow context-sensitive nullness check for non-pointers Summary: Same as D60846 and D69571 but with a fix for the problem encountered after them. Both times it was a missing context adjustment in the handling of PHI nodes. The reproducers created from the bugs that caused the old commits to be reverted are included. Reviewers: nikic, nlopes, mkazantsev, spatel, dlrobertson, uabelho, hakzsam, hans Subscribers: hiraditya, bollu, asbirlea, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71181	2019-12-09 15:15:52 -06:00
Sanjay Patel	1c4dd3ae2f	[InstSimplify] fold copysign with negated operand, part 2 This is another transform suggested in PR44153: https://bugs.llvm.org/show_bug.cgi?id=44153 Unlike rG12f39e0fede9, it doesn't look like the backend matches this variant.	2019-12-08 10:16:29 -05:00
Sanjay Patel	12f39e0fed	[InstSimplify] fold copysign with negated operand This is another transform suggested in PR44153: https://bugs.llvm.org/show_bug.cgi?id=44153 The backend for some targets already manages to get this if it converts copysign to bitwise logic.	2019-12-08 10:08:02 -05:00
Sanjay Patel	d5abaaf140	[InstSimplify] add tests for copysign with fneg operand; NFC	2019-12-06 16:23:44 -05:00
Sanjay Patel	e177c5a00d	[InstSimplify] fold copysign with same args to the arg This is correct for any value including NaN/inf. We don't have this fold directly in the backend either, but x86 manages to get it after converting things to bitops.	2019-11-26 17:35:10 -05:00
Sanjay Patel	48a3a1e090	[InstSimplify] add tests for copysign; NFC	2019-11-26 17:23:30 -05:00
Benjamin Kramer	cd4811360e	[ValueTracking] Add a basic version of isKnownNonInfinity and use it to detect more NoNaNs	2019-11-19 22:24:46 +01:00
Hans Wennborg	6ea4775900	Revert `57dd4b0` "[ValueTracking] Allow context-sensitive nullness check for non-pointers" This caused miscompiles of Chromium (https://crbug.com/1023818). The reduced repro is small enough to fit here: $ cat /tmp/a.c unsigned char f(unsigned char p) { unsigned char result = 0; for (int shift = 0; shift < 1; ++shift) result \|= p[0] << (shift 8); return result; } $ bin/clang -O2 -S -o - /tmp/a.c \| grep -A4 f: f: # @f .cfi_startproc # %bb.0: # %entry xorl %eax, %eax retq That's nicely optimized, but I don't think it's the right result :-) > Same as D60846 but with a fix for the problem encountered there which > was a missing context adjustment in the handling of PHI nodes. > > The test that caused D60846 to be reverted was added in `e15ab8f277`. > > Reviewers: nikic, nlopes, mkazantsev,spatel, dlrobertson, uabelho, hakzsam > > Subscribers: hiraditya, bollu, llvm-commits > > Tags: #llvm > > Differential Revision: https://reviews.llvm.org/D69571 This reverts commit `57dd4b03e4`.	2019-11-13 12:19:02 +01:00
aqjune	4187cb138b	Add InstCombine/InstructionSimplify support for Freeze Instruction Summary: - Add llvm::SimplifyFreezeInst - Add InstCombiner::visitFreeze - Add llvm tests Reviewers: majnemer, sanjoy, reames, lebedev.ri, spatel Reviewed By: reames, lebedev.ri Subscribers: reames, lebedev.ri, filcab, regehr, trentxintong, llvm-commits Differential Revision: https://reviews.llvm.org/D29013	2019-11-12 12:13:26 +09:00
Sanjay Patel	659bd73d13	[InstSimplify] use FMF to improve fcmp+select fold This is part of a series of patches needed to solve PR39535: https://bugs.llvm.org/show_bug.cgi?id=39535	2019-11-04 08:29:56 -05:00
Sanjay Patel	ad87f244b4	[InstSimplify] add more tests for fcmp+select; NFC The easy code fix won't catch non-canonical mismatched constant patterns, so adding extra coverage for those in case we decide that's important (but seems unlikely).	2019-11-04 08:23:08 -05:00
Sanjay Patel	499c90afe9	[InstSimplify] add more tests for fcmp+select; NFC The addition of FMF for select allows more folding for these kinds of patterns.	2019-11-04 07:38:11 -05:00
Johannes Doerfert	57dd4b03e4	[ValueTracking] Allow context-sensitive nullness check for non-pointers Same as D60846 but with a fix for the problem encountered there which was a missing context adjustment in the handling of PHI nodes. The test that caused D60846 to be reverted was added in `e15ab8f277`. Reviewers: nikic, nlopes, mkazantsev,spatel, dlrobertson, uabelho, hakzsam Subscribers: hiraditya, bollu, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69571	2019-10-31 14:37:38 -05:00
Sanjay Patel	be21ceb565	[InstSimplify] fold fma/fmuladd with a NaN or undef operand This is intended to be similar to the constant folding results from D67446 and earlier, but not all operands are constant in these tests, so the responsibility for folding is left to InstSimplify. Differential Revision: https://reviews.llvm.org/D67721 llvm-svn: 373455	2019-10-02 12:12:02 +00:00
Sanjay Patel	1b40402aa2	[InstSimplify] add tests for fma/fmuladd with undef operand; NFC llvm-svn: 373109	2019-09-27 18:38:51 +00:00
Roman Lebedev	914a3d1cf2	[InstSimplify] Handle more 'A </>/>=/<= B &&/\|\| (A - B) !=/== 0' patterns (PR43251) https://rise4fun.com/Alive/sl9s https://rise4fun.com/Alive/2plN https://bugs.llvm.org/show_bug.cgi?id=43251 llvm-svn: 372928	2019-09-25 22:59:41 +00:00
Roman Lebedev	26606bec9a	[NFC][InstSimplify] More exaustive test coverage for 'A </>/>=/<= B &&/\|\| (A - B) !=/== 0' pattern (PR43251) llvm-svn: 372927	2019-09-25 22:59:24 +00:00
Roman Lebedev	baf809811b	[InstSimplify] simplifyUnsignedRangeCheck(): X >= Y && Y == 0 --> Y == 0 https://rise4fun.com/Alive/v9Y4 llvm-svn: 372491	2019-09-21 22:27:39 +00:00
Roman Lebedev	ac4dda8052	[NFC][InstSimplify] Add exhaustive test coverage for simplifyUnsignedRangeCheck(). One case is not handled. llvm-svn: 372489	2019-09-21 22:27:18 +00:00
Sanjay Patel	e406a3f2d6	[InstSimplify] add tests for fma/fmuladd; NFC llvm-svn: 372236	2019-09-18 17:27:02 +00:00
Roman Lebedev	9c5a4a4527	[InstSimplify] simplifyUnsignedRangeCheck(): handle few tautological cases (PR43251) Summary: This is split off from D67356, since these cases produce a constant, no real need to keep them in instcombine. Alive proofs: https://rise4fun.com/Alive/u7Fk https://rise4fun.com/Alive/4lV https://bugs.llvm.org/show_bug.cgi?id=43251 Reviewers: spatel, nikic, xbolva00 Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67498 llvm-svn: 371921	2019-09-14 13:47:27 +00:00
Roman Lebedev	4cb267f9f5	[NFC][InstSimplify] Add some more tests for D67498/D67502 llvm-svn: 371877	2019-09-13 17:58:24 +00:00
Roman Lebedev	80a8a85758	[InstCombine][InstSimplify] Move constant-folding tests in result-of-usub-is-non-zero-and-no-overflow.ll llvm-svn: 371737	2019-09-12 14:12:31 +00:00
Roman Lebedev	b3e0937f0a	[NFC][InstCombine][InstSimplify] Add test for "add-of-negative is non-zero and no overflow" (PR43259) https://rise4fun.com/Alive/ska https://rise4fun.com/Alive/9iX https://bugs.llvm.org/show_bug.cgi?id=43259 llvm-svn: 371736	2019-09-12 14:12:20 +00:00
Roman Lebedev	f1286621eb	[InstSimplify] simplifyUnsignedRangeCheck(): handle more cases (PR43251) Summary: I don't have a direct motivational case for this, but it would be good to have this for completeness/symmetry. This pattern is basically the motivational pattern from https://bugs.llvm.org/show_bug.cgi?id=43251 but with different predicate that requires that the offset is non-zero. The completeness bit comes from the fact that a similar pattern (offset != zero) will be needed for https://bugs.llvm.org/show_bug.cgi?id=43259, so it'd seem to be good to not overlook very similar patterns.. Proofs: https://rise4fun.com/Alive/21b Also, there is something odd with `isKnownNonZero()`, if the non-zero knowledge was specified as an assumption, it didn't pick it up (PR43267) Reviewers: spatel, nikic, xbolva00 Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67411 llvm-svn: 371718	2019-09-12 09:26:17 +00:00
Roman Lebedev	00c1ee48e4	[InstSimplify] Pass SimplifyQuery into simplifyUnsignedRangeCheck() and use it for isKnownNonZero() This was actually the original intention in D67332, but i messed up and forgot about it. This patch was originally part of D67411, but precommitting this. llvm-svn: 371630	2019-09-11 15:32:46 +00:00
Roman Lebedev	8aeb7bb013	[NFC][InstSimplify] Add extra test for D67411 with @llvm.assume llvm-svn: 371629	2019-09-11 15:28:03 +00:00
Sanjay Patel	9c4047f267	[ConstProp] move test file from InstSimplify; NFC These are constant folding tests; there is no code directly in InstSimplify for this. llvm-svn: 371619	2019-09-11 14:01:11 +00:00
Sanjay Patel	29ba5e0817	[InstSimplify] regenerate test CHECKs; NFC llvm-svn: 371617	2019-09-11 13:56:07 +00:00
Roman Lebedev	870ffe3cee	[NFC][InstSimplify] rewrite test added in r371537 to use non-null pointer instead I only want to ensure that %offset is non-zero there, it doesn't matter how that info is conveyed. As filed in PR43267, the assumption way does not work. llvm-svn: 371546	2019-09-10 18:40:00 +00:00
Roman Lebedev	880657c97c	[NFC][InstCombine][InstSimplify] PR43251 - and some patterns with offset != 0 https://rise4fun.com/Alive/21b llvm-svn: 371537	2019-09-10 17:13:59 +00:00
Roman Lebedev	6e2c5c8710	[InstSimplify] simplifyUnsignedRangeCheck(): if we know that X != 0, handle more cases (PR43246) Summary: This is motivated by D67122 sanitizer check enhancement. That patch seemingly worsens `-fsanitize=pointer-overflow` overhead from 25% to 50%, which strongly implies missing folds. In this particular case, given ``` char* test(char& base, unsigned long offset) { return &base + offset; } ``` it will end up producing something like https://godbolt.org/z/LK5-iH which after optimizations reduces down to roughly ``` define i1 @t0(i8* nonnull %base, i64 %offset) { %base_int = ptrtoint i8* %base to i64 %adjusted = add i64 %base_int, %offset %non_null_after_adjustment = icmp ne i64 %adjusted, 0 %no_overflow_during_adjustment = icmp uge i64 %adjusted, %base_int %res = and i1 %non_null_after_adjustment, %no_overflow_during_adjustment ret i1 %res } ``` Without D67122 there was no `%non_null_after_adjustment`, and in this particular case we can get rid of the overhead: Here we add some offset to a non-null pointer, and check that the result does not overflow and is not a null pointer. But since the base pointer is already non-null, and we check for overflow, that overflow check will already catch the null pointer, so the separate null check is redundant and can be dropped. Alive proofs: https://rise4fun.com/Alive/WRzq There are more patterns of "unsigned-add-with-overflow", they are not handled here, but this is the main pattern, that we currently consider canonical, so it makes sense to handle it. https://bugs.llvm.org/show_bug.cgi?id=43246 Reviewers: spatel, nikic, vsk Reviewed By: spatel Subscribers: hiraditya, llvm-commits, reames Tags: #llvm Differential Revision: https://reviews.llvm.org/D67332 llvm-svn: 371349	2019-09-08 20:14:15 +00:00
Roman Lebedev	64965430db	[NFC][InstSimplify] Some tests for dropping null check after uadd.with.overflow of non-null (PR43246) https://rise4fun.com/Alive/WRzq Name: C <= Y && Y != 0 --> C <= Y iff C != 0 Pre: C != 0 %y_is_nonnull = icmp ne i64 %y, 0 %no_overflow = icmp ule i64 C, %y %r = and i1 %y_is_nonnull, %no_overflow => %r = %no_overflow Name: C <= Y \|\| Y != 0 --> Y != 0 iff C != 0 Pre: C != 0 %y_is_nonnull = icmp ne i64 %y, 0 %no_overflow = icmp ule i64 C, %y %r = or i1 %y_is_nonnull, %no_overflow => %r = %y_is_nonnull Name: C > Y \|\| Y == 0 --> C > Y iff C != 0 Pre: C != 0 %y_is_null = icmp eq i64 %y, 0 %overflow = icmp ugt i64 C, %y %r = or i1 %y_is_null, %overflow => %r = %overflow Name: C > Y && Y == 0 --> Y == 0 iff C != 0 Pre: C != 0 %y_is_null = icmp eq i64 %y, 0 %overflow = icmp ugt i64 C, %y %r = and i1 %y_is_null, %overflow => %r = %y_is_null https://bugs.llvm.org/show_bug.cgi?id=43246 llvm-svn: 371339	2019-09-08 17:50:40 +00:00
Sanjay Patel	4a2cd7be5a	[InstSimplify] guard against unreachable code (PR43218) This would crash: https://bugs.llvm.org/show_bug.cgi?id=43218 llvm-svn: 370911	2019-09-04 15:12:55 +00:00
Roman Lebedev	c584786854	[InstSimplify] Drop leftover "division-by-zero guard" around `@llvm.umul.with.overflow` inverted overflow bit Summary: Now that with D65143/D65144 we've produce `@llvm.umul.with.overflow`, and with D65147 we've flattened the CFG, we now can see that the guard may have been there to prevent division by zero is redundant. We can simply drop it: ``` ---------------------------------------- Name: no overflow or zero %iszero = icmp eq i4 %y, 0 %umul = smul_overflow i4 %x, %y %umul.ov = extractvalue {i4, i1} %umul, 1 %umul.ov.not = xor %umul.ov, -1 %retval.0 = or i1 %iszero, %umul.ov.not ret i1 %retval.0 => %iszero = icmp eq i4 %y, 0 %umul = smul_overflow i4 %x, %y %umul.ov = extractvalue {i4, i1} %umul, 1 %umul.ov.not = xor %umul.ov, -1 %retval.0 = or i1 %iszero, %umul.ov.not ret i1 %umul.ov.not Done: 1 Optimization is correct! ``` Note that this is inverted from what we have in a previous patch, here we are looking for the inverted overflow bit. And that inversion is kinda problematic - given this particular pattern we neither hoist that `not` closer to `ret` (then the pattern would have been identical to the one without inversion, and would have been handled by the previous patch), neither do the opposite transform. But regardless, we should handle this too. I've filled [[ https://bugs.llvm.org/show_bug.cgi?id=42720 \| PR42720 ]]. Reviewers: nikic, spatel, xbolva00, RKSimon Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65151 llvm-svn: 370351	2019-08-29 12:48:04 +00:00
Roman Lebedev	aaf6ab4410	[InstSimplify] Drop leftover "division-by-zero guard" around `@llvm.umul.with.overflow` overflow bit Summary: Now that with D65143/D65144 we've produce `@llvm.umul.with.overflow`, and with D65147 we've flattened the CFG, we now can see that the guard may have been there to prevent division by zero is redundant. We can simply drop it: ``` ---------------------------------------- Name: no overflow and not zero %iszero = icmp ne i4 %y, 0 %umul = umul_overflow i4 %x, %y %umul.ov = extractvalue {i4, i1} %umul, 1 %retval.0 = and i1 %iszero, %umul.ov ret i1 %retval.0 => %iszero = icmp ne i4 %y, 0 %umul = umul_overflow i4 %x, %y %umul.ov = extractvalue {i4, i1} %umul, 1 %retval.0 = and i1 %iszero, %umul.ov ret %umul.ov Done: 1 Optimization is correct! ``` Reviewers: nikic, spatel, xbolva00 Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65150 llvm-svn: 370350	2019-08-29 12:47:50 +00:00
Bjorn Pettersson	d218a3326e	[InstSimplify] Report "Changed" also when only deleting dead instructions Summary: Make sure that we report that changes has been made by InstSimplify also in situations when only trivially dead instructions has been removed. If for example a call is removed the call graph must be updated. Bug seem to have been introduced by llvm-svn r367173 (commit `02b9e45a7e`), since the code in question was rewritten in that commit. Reviewers: spatel, chandlerc, foad Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65973 llvm-svn: 368401	2019-08-09 07:08:25 +00:00
Craig Topper	66c08430f6	[ValueTracking] When calculating known bits for integer abs, make sure we're looking at a negate and not just any instruction with the nsw flag set. The matchSelectPattern code can match patterns like (x >= 0) ? x : -x for absolute value. But it can also match ((x-y) >= 0) ? (x-y) : (y-x). If the latter form was matched we can only use the nsw flag if its set on both subtracts. This match makes sure we're looking at the former case only. Differential Revision: https://reviews.llvm.org/D65692 llvm-svn: 368195	2019-08-07 18:28:16 +00:00
Craig Topper	aa2810b6e7	[InstSimplify] Add test case to show bad sign bit handling for integer abs idiom in computeKnownBits. computeKnownBits will indicate the sign bit of abs is 0 if the the RHS operand returned by matchSelectPattern has the nsw flag set. For abs idioms like (X >= 0) ? X : -X, the RHS returns -X. But we can also match ((X-Y) >= 0 ? X-Y : Y-X as abs. In this case RHS will be the Y-X operand. According to Alive, the sign bit for this is only 0 if both the X-Y and Y-X operands have the nsw flag. But we're only checking the Y-X operand. llvm-svn: 367747	2019-08-03 02:54:54 +00:00
Sanjay Patel	02b9e45a7e	[InstSimplify] remove quadratic time looping (PR42771) The test case from: https://bugs.llvm.org/show_bug.cgi?id=42771 ...shows a ~30x slowdown caused by the awkward loop iteration (rL207302) that is seemingly done just to avoid invalidating the instruction iterator. We can instead delay instruction deletion until we reach the end of the block (or we could delay until we reach the end of all blocks). There's a test diff here for a degenerate case with llvm.assume that is not meaningful in itself, but serves to verify this change in logic. This change probably doesn't result in much overall compile-time improvement because we call '-instsimplify' as a standalone pass only once in the standard -O2 opt pipeline currently. Differential Revision: https://reviews.llvm.org/D65336 llvm-svn: 367173	2019-07-27 14:05:51 +00:00
Roman Lebedev	4153f17181	[InstSimplify][NFC] Tests for skipping 'div-by-0' checks before inverted @llvm.umul.with.overflow It would be already handled by the non-inverted case if we were hoisting the `not` in InstCombine, but we don't (granted, we don't sink it in this case either), so this is a separate case. llvm-svn: 366801	2019-07-23 12:42:49 +00:00
Roman Lebedev	0689427280	[InstSimplify][NFC] Tests for skipping 'div-by-0' checks before @llvm.umul.with.overflow These may remain after @llvm.umul.with.overflow was canonicalized from the code that was originally doing the check via division. llvm-svn: 366751	2019-07-22 22:09:11 +00:00
Michael Liao	543ba4e9e0	[InstructionSimplify] Apply sext/trunc after pointer stripping Summary: - As the pointer stripping could trace through `addrspacecast` now, need to sext/trunc the offset to ensure it has the same width as the pointer after stripping. Reviewers: jdoerfert Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64768 llvm-svn: 366162	2019-07-16 01:03:06 +00:00
David Bolvansky	5dca95bc4e	[NFC] Revisited tests for D64285 llvm-svn: 365815	2019-07-11 19:39:20 +00:00
David Bolvansky	e195a91d2d	[NFC] Updated tests for D64285 llvm-svn: 365765	2019-07-11 12:51:33 +00:00
David Bolvansky	901d91e5f0	[NFC] Fixed tests llvm-svn: 365506	2019-07-09 15:31:36 +00:00
David Bolvansky	e625eb9def	[NFC] Added tests for D64285 llvm-svn: 365501	2019-07-09 15:12:01 +00:00
Sanjay Patel	b342f026a4	[InstSimplify] simplify power-of-2 (single bit set) sequences As discussed in PR42314: https://bugs.llvm.org/show_bug.cgi?id=42314 Improving the canonicalization for these patterns: rL363956 ...means we should adjust/enhance the related simplification. https://rise4fun.com/Alive/w1cp Name: isPow2 or zero %x = and i32 %xx, 2048 %a = add i32 %x, -1 %r = and i32 %a, %x => %r = i32 0 llvm-svn: 363997	2019-06-20 22:55:28 +00:00
Sanjay Patel	3207566dd6	[InstSimplify] add tests for known-not-a-power-of-2; NFC I added a canonicalization to create this general pattern in: rL363956 But as noted in PR42314: https://bugs.llvm.org/show_bug.cgi?id=42314#c11 ...we have a (potentially expensive) simplification for the version of the code that we just canonicalized away from, so we should add/adjust that code to match. llvm-svn: 363981	2019-06-20 21:04:14 +00:00
Sanjay Patel	3e03bf6921	[InstSimplify] add a phi test with 1 incoming value; NFC D63489 proposes to change this behavior, but there's no direct -instsimplify test to verify that the transform exists. llvm-svn: 363842	2019-06-19 17:23:29 +00:00
Jay Foad	45d19fb470	[ConstantFolding] Fix assertion failure on non-power-of-two vector load. Summary: The test case does an (out of bounds) load from a global constant with type <3 x float>. InstSimplify tried to turn this into an integer load of the whole alloc size of the vector, which is 128 bits due to alignment padding, and then bitcast this to <3 x vector> which failed an assertion due to the type size mismatch. The fix is to do an integer load of the normal size of the vector, with no alignment padding. Reviewers: tpr, arsenm, majnemer, dstuttard Reviewed By: arsenm Subscribers: hfinkel, wdng, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63375 llvm-svn: 363784	2019-06-19 10:28:48 +00:00
Roman Lebedev	5a663bd77a	[InstSimplify] Fix addo/subo undef folds (PR42209) Fix folds of addo and subo with an undef operand to be: `@llvm.{u,s}{add,sub}.with.overflow` all fold to `{ undef, false }`, as per LLVM undef rules. Same for commuted variants. Based on the original version of the patch by @nikic. Fixes [[ https://bugs.llvm.org/show_bug.cgi?id=42209 \| PR42209 ]] Differential Revision: https://reviews.llvm.org/D63065 llvm-svn: 363522	2019-06-16 20:39:45 +00:00
Sanjay Patel	73f5a855b3	[InstSimplify] enhance fcmp fold with never-nan operand This is another step towards correcting our usage of fast-math-flags when applied on an fcmp. In this case, we are checking for 'nnan' on the fcmp itself rather than the operand of the fcmp. But I'm leaving that clause in until we're more confident that we can stop relying on fcmp's FMF. By using the more general "isKnownNeverNaN()", we gain a simplification shown on the tests with 'uitofp' regardless of the FMF on the fcmp (uitofp never produces a NaN). On the tests with 'fabs', we are now relying on the FMF for the call fabs instruction in addition to the FMF on the fcmp. This is a continuation of D62979 / rL362879. llvm-svn: 362903	2019-06-09 13:48:59 +00:00
Sanjay Patel	de4d4d5049	[InstSimplify] add tests for fcmp with known-never-nan operands; NFC Opposite predicate for rL362742 / rL362879 / D62979 llvm-svn: 362902	2019-06-09 13:30:14 +00:00
Sanjay Patel	4329c15f11	[InstSimplify] enhance fcmp fold with never-nan operand This is 1 step towards correcting our usage of fast-math-flags when applied on an fcmp. In this case, we are checking for 'nnan' on the fcmp itself rather than the operand of the fcmp. But I'm leaving that clause in until we're more confident that we can stop relying on fcmp's FMF. By using the more general "isKnownNeverNaN()", we gain a simplification shown on the tests with 'uitofp' regardless of the FMF on the fcmp (uitofp never produces a NaN). On the tests with 'fabs', we are now relying on the FMF for the call fabs instruction in addition to the FMF on the fcmp. I'll update the 'ult' case below here as a follow-up assuming no problems here. Differential Revision: https://reviews.llvm.org/D62979 llvm-svn: 362879	2019-06-08 15:12:33 +00:00
Sanjay Patel	38c5ee1802	[InstSimplify] add tests for fcmp with known-never-nan operands; NFC llvm-svn: 362742	2019-06-06 20:14:06 +00:00
Sanjay Patel	8869a98e82	[InstSimplify] fold insertelement-of-extractelement This was partly handled in InstCombine (only the constant index case), so delete that and zap it more generally in InstSimplify. llvm-svn: 361576	2019-05-24 00:13:58 +00:00
Sanjay Patel	3e15f83381	[InstSimplify] add tests for insert-of-extract; NFC llvm-svn: 361575	2019-05-24 00:11:23 +00:00
Sanjay Patel	e60cb7d1be	[InstSimplify] insertelement V, undef, ? --> V This was part of InstCombine, but it's better placed in InstSimplify. InstCombine also had an unreachable but weaker fold for insertelement with undef index, so that is deleted. llvm-svn: 361559	2019-05-23 21:49:47 +00:00
Cameron McInally	2d2a46db8e	[InstSimplify] Teach fsub -0.0, (fneg X) ==> X about unary fneg Differential Revision: https://reviews.llvm.org/D62077 llvm-svn: 361151	2019-05-20 13:13:35 +00:00
Sanjay Patel	9ef99b4b11	[InstSimplify] fold fcmp (maxnum, X, C1), C2 This is the sibling transform for rL360899 (D61691): maxnum(X, GreaterC) == C --> false maxnum(X, GreaterC) <= C --> false maxnum(X, GreaterC) < C --> false maxnum(X, GreaterC) >= C --> true maxnum(X, GreaterC) > C --> true maxnum(X, GreaterC) != C --> true llvm-svn: 361118	2019-05-19 14:26:39 +00:00
Cameron McInally	12de5425c1	[NFC][InstSimplify] Add more unary fneg tests to floating-point-arithmetic.ll llvm-svn: 361076	2019-05-17 21:10:11 +00:00
Cameron McInally	bebc7d6a4e	[NFC][InstSimplify] Precommit new unary fneg test llvm-svn: 361060	2019-05-17 18:34:35 +00:00
Cameron McInally	19dc8c7280	[NFC][InstSImplify] Fix flip-flopped comments and test names In test/Transforms/InstSimplify/floating-point-arithmetic.ll Differential Revision: https://reviews.llvm.org/D62069 llvm-svn: 361057	2019-05-17 17:59:17 +00:00
Cameron McInally	067e946859	[InstSimplify] Add unary fneg to `fsub 0.0, (fneg X) ==> X` transform Differential Revision: https://reviews.llvm.org/D62013 llvm-svn: 361047	2019-05-17 16:47:00 +00:00
Cameron McInally	f637bb6ebd	[NFC][InstSimplify] Update fast-math.ll tests I botched in r360808. These were new tests I added in r360808. I made a mistake while converting the exisiting binary FNeg test into the new unary FNeg tests. Correct that. llvm-svn: 360928	2019-05-16 19:00:57 +00:00
Sanjay Patel	3413035477	[InstSimplify] add tests for fcmp of maxnum with constants; NFC Sibling tests for rL360899 (D61691). llvm-svn: 360905	2019-05-16 15:00:11 +00:00
Sanjay Patel	152f81fae8	[InstSimplify] fold fcmp (minnum, X, C1), C2 minnum(X, LesserC) == C --> false minnum(X, LesserC) >= C --> false minnum(X, LesserC) > C --> false minnum(X, LesserC) != C --> true minnum(X, LesserC) <= C --> true minnum(X, LesserC) < C --> true maxnum siblings will follow if there are no problems here. We should be able to perform some other combines when the constants are equal or greater-than too, but that would go in instcombine. We might also generalize this by creating an FP ConstantRange (similar to what we do for integers). Differential Revision: https://reviews.llvm.org/D61691 llvm-svn: 360899	2019-05-16 14:03:10 +00:00
Cameron McInally	14a90661f8	Revert llvm-svn: 360807 Somehow submitted this patch twice. llvm-svn: 360812	2019-05-15 20:48:50 +00:00
Cameron McInally	b8df789ff3	Pre-commit unary fneg tests to InstSimplify llvm-svn: 360808	2019-05-15 20:27:37 +00:00
Cameron McInally	94f16bfaba	Add unary fneg to InstSimplify/fp-nan.ll llvm-svn: 360807	2019-05-15 20:27:35 +00:00
Cameron McInally	a4d29b8e20	Add unary fneg to InstSimplify/fp-nan.ll llvm-svn: 360797	2019-05-15 19:37:03 +00:00
Cameron McInally	0c82d9b5a2	Teach InstSimplify -X + X --> 0.0 about unary FNeg Differential Revision: https://reviews.llvm.org/D61916 llvm-svn: 360777	2019-05-15 14:31:33 +00:00
Sanjay Patel	b64c48597f	[InstSimplify] add tests for fcmp+minnum; NFC llvm-svn: 360275	2019-05-08 17:53:18 +00:00
Nikita Popov	9fd02a71a3	Revert "[ValueTracking] Improve isKnowNonZero for Ints" This reverts commit `3b137a4956`. As reported in https://reviews.llvm.org/D60846, this is causing miscompiles. llvm-svn: 360260	2019-05-08 14:50:01 +00:00
Dan Robertson	3b137a4956	[ValueTracking] Improve isKnowNonZero for Ints Improve isKnownNonZero for integers in order to improve cttz optimizations. Differential Revision: https://reviews.llvm.org/D60846 llvm-svn: 360222	2019-05-08 02:25:08 +00:00
Sanjay Patel	e088d03b9c	[ValueTracking] add logic for known-never-nan with minnum/maxnum From the LangRef: "Returns NaN only if both operands are NaN." llvm-svn: 360206	2019-05-07 22:58:31 +00:00
Sanjay Patel	9a1c2b7776	[InstSimplify] add tests for minnum/maxnum and NaN; NFC llvm-svn: 360197	2019-05-07 21:50:09 +00:00
Cameron McInally	c3167696bc	Add FNeg support to InstructionSimplify Differential Revision: https://reviews.llvm.org/D61573 llvm-svn: 360053	2019-05-06 16:05:10 +00:00
Dan Robertson	9e441aee50	[NFC] Add baseline tests for int isKnownNonZero Add baseline tests for improvements of isKnownNonZero for integer types. Differential Revision: https://reviews.llvm.org/D60932 llvm-svn: 359267	2019-04-26 02:55:54 +00:00
Eric Christopher	cee313d288	Revert "Temporarily Revert "Add basic loop fusion pass."" The reversion apparently deleted the test/Transforms directory. Will be re-reverting again. llvm-svn: 358552	2019-04-17 04:52:47 +00:00
Eric Christopher	a863435128	Temporarily Revert "Add basic loop fusion pass." As it's causing some bot failures (and per request from kbarton). This reverts commit r358543/ab70da07286e618016e78247e4a24fcb84077fda. llvm-svn: 358546	2019-04-17 02:12:23 +00:00
Fangrui Song	d5c404246f	[ConstantFold] Don't evaluate FP or FP vector casts or truncations when simplifying icmp Fix PR41476 llvm-svn: 358262	2019-04-12 07:34:30 +00:00
Nikita Popov	3db93ac5d6	Reapply [ValueTracking] Support min/max selects in computeConstantRange() Add support for min/max flavor selects in computeConstantRange(), which allows us to fold comparisons of a min/max against a constant in InstSimplify. This fixes an infinite InstCombine loop, with the test case taken from D59378. Relative to the previous iteration, this contains some adjustments for AMDGPU med3 tests: The AMDGPU target runs InstSimplify prior to codegen, which ends up constant folding some existing med3 tests after this change. To preserve these tests a hidden -amdgpu-scalar-ir-passes option is added, which allows disabling scalar IR passes (that use InstSimplify) for testing purposes. Differential Revision: https://reviews.llvm.org/D59506 llvm-svn: 357870	2019-04-07 17:22:16 +00:00
Matt Arsenault	03e7492876	InstSimplify: Fold round intrinsics from sitofp/uitofp https://godbolt.org/z/gEMRZb llvm-svn: 357549	2019-04-03 00:25:06 +00:00
Matt Arsenault	fa0a2c529b	InstSimplify: Add missing case from r357386 llvm-svn: 357443	2019-04-02 00:46:19 +00:00
Matt Arsenault	0276b94356	InstSimplify: Add baseline test for upcoming change llvm-svn: 357386	2019-04-01 14:03:44 +00:00
Nikita Popov	b86576a5b9	[InstSimplify] Add tests for signed icmp of and/or; NFC Even if a signed predicate is used, the ranges computed for and/or are unsigned, resulting in missed simplifications. llvm-svn: 356720	2019-03-21 21:13:08 +00:00
Nikita Popov	00b5ecab5d	[ValueTracking] Compute range for abs without nsw This is a small followup to D59511. The code that was moved into computeConstantRange() there is a bit overly conversative: If the abs is not nsw, it does not compute any range. However, abs without nsw still has a well-defined contiguous unsigned range from 0 to SIGNED_MIN. This is a lot less useful than the usual 0 to SIGNED_MAX range, but if we're already here we might as well specify it... Differential Revision: https://reviews.llvm.org/D59563 llvm-svn: 356586	2019-03-20 18:16:02 +00:00
Nikita Popov	2dd1566e8b	[InstSimplify] Add additional cmp of abs without nsw tests; NFC llvm-svn: 356520	2019-03-19 21:12:21 +00:00
Nikita Popov	3e9770d2dc	Revert "[ValueTracking][InstSimplify] Support min/max selects in computeConstantRange()" This reverts commit `106f0cdefb`. This change impacts the AMDGPU smed3.ll and umed3.ll codegen tests. llvm-svn: 356424	2019-03-18 22:26:27 +00:00
Nikita Popov	106f0cdefb	[ValueTracking][InstSimplify] Support min/max selects in computeConstantRange() Add support for min/max flavor selects in computeConstantRange(), which allows us to fold comparisons of a min/max against a constant in InstSimplify. This was suggested by spatel as an alternative approach to D59378. I've also added the infinite looping test from that revision here. Differential Revision: https://reviews.llvm.org/D59506 llvm-svn: 356415	2019-03-18 21:35:19 +00:00
Nikita Popov	05baa9ee1a	[InstSimplify] Add additional icmp of min/max tests; NFC These are baseline tests for D59506. llvm-svn: 356408	2019-03-18 21:19:56 +00:00
Sanjay Patel	9dada83d6c	[InstSimplify] remove zero-shift-guard fold for general funnel shift As discussed on llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2019-February/130491.html We can't remove the compare+select in the general case because we are treating funnel shift like a standard instruction (as opposed to a special instruction like select/phi). That means that if one of the operands of the funnel shift is poison, the result is poison regardless of whether we know that the operand is actually unused based on the instruction's particular semantics. The motivating case for this transform is the more specific rotate op (rather than funnel shift), and we are preserving the fold for that case because there is no chance of introducing extra poison when there is no anonymous extra operand to the funnel shift. llvm-svn: 354905	2019-02-26 18:26:56 +00:00
Sanjay Patel	421c6e6864	[InstSimplify] add tests for rotate; NFC Rotate is a special-case of funnel shift that has different poison constraints than the general case. That's not visible yet in the existing tests, but it needs to be corrected. llvm-svn: 354894	2019-02-26 16:44:08 +00:00
Sanjay Patel	68171e3cd6	[InstSimplify] use any-zero matcher for fcmp folds The m_APFloat matcher does not work with anything but strict splat vector constants, so we could miss these folds and then trigger an assertion in instcombine: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=13201 The previous attempt at this in rL354406 had a logic bug that actually triggered a regression test failure, but I failed to notice it the first time. llvm-svn: 354467	2019-02-20 14:34:00 +00:00
Sanjay Patel	49f97395ab	Revert "[InstSimplify] use any-zero matcher for fcmp folds" This reverts commit `058bb83513`. Forgot to update another test affected by this change. llvm-svn: 354408	2019-02-20 00:20:38 +00:00
Sanjay Patel	058bb83513	[InstSimplify] use any-zero matcher for fcmp folds The m_APFloat matcher does not work with anything but strict splat vector constants, so we could miss these folds and then trigger an assertion in instcombine: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=13201 llvm-svn: 354406	2019-02-20 00:09:50 +00:00
Sanjay Patel	9cf04addf3	[InstSimplify] add vector tests for fcmp+fabs; NFC llvm-svn: 354404	2019-02-19 23:58:02 +00:00
Dmitry Venikov	aaa709f2ec	[InstSimplify] Missed optimization in math expression: log10(pow(10.0,x)) == x, log2(pow(2.0,x)) == x Summary: This patch enables folding following instructions under -ffast-math flag: log10(pow(10.0,x)) -> x, log2(pow(2.0,x)) -> x Reviewers: hfinkel, spatel, efriedma, craig.topper, zvi, majnemer, lebedev.ri Reviewed By: spatel, lebedev.ri Subscribers: lebedev.ri, llvm-commits Differential Revision: https://reviews.llvm.org/D41940 llvm-svn: 352981	2019-02-03 03:48:30 +00:00
Dmitry Venikov	4c81a2b2ec	Commit tests for changes in revision D41940 llvm-svn: 352734	2019-01-31 07:38:19 +00:00
Nikita Popov	f17421e595	[ConstantFolding] Consolidate and extend bitcount intrinsic tests; NFC Move constant folding tests into ConstantFolding/bitcount.ll and drop various tests in other places. Add coverage for undefs. llvm-svn: 349806	2018-12-20 19:46:52 +00:00
Nikita Popov	221f3fc750	[InstSimplify] Simplify saturating add/sub + icmp If a saturating add/sub has one constant operand, then we can determine the possible range of outputs it can produce, and simplify an icmp comparison based on that. The implementation is based on a similar existing mechanism for simplifying binary operator + icmps. Differential Revision: https://reviews.llvm.org/D55735 llvm-svn: 349369	2018-12-17 17:45:18 +00:00
Nikita Popov	32083bd072	[InstCombine] Add additional saturating add/sub + icmp tests; NFC These test comparisons with saturating add/sub in non-canonical form. llvm-svn: 349309	2018-12-16 17:45:25 +00:00
Nikita Popov	f61567f42a	[InstSimplify] Add tests for saturating add/sub + icmp; NFC If a saturating add/sub with a constant operand is compared to another constant, we should be able to determine that the condition is always true/false in some cases (but currently don't). llvm-svn: 349261	2018-12-15 10:37:01 +00:00
Sanjay Patel	998ececef0	[InstCombine] remove dead code from visitExtractElement Extracting from a splat constant is always handled by InstSimplify. Move the test for this from InstCombine to InstSimplify to make sure that stays true. llvm-svn: 348423	2018-12-05 23:09:33 +00:00
Sanjay Patel	398728732e	[InstSimplify] add tests for undef + partial undef constant folding; NFC These tests should probably go under a separate test file because they should fold with just -constprop, but they're similar to the scalar tests already in here. llvm-svn: 348045	2018-11-30 22:51:34 +00:00
Sanjay Patel	d802270808	[InstSimplify] fold select with implied condition This is an almost direct move of the functionality from InstCombine to InstSimplify. There's no reason not to do this in InstSimplify because we never create a new value with this transform. (There's a question of whether any dominance-based transform belongs in either of these passes, but that's a separate issue.) I've changed 1 of the conditions for the fold (1 of the blocks for the branch must be the block we started with) into an assert because I'm not sure how that could ever be false. We need 1 extra check to make sure that the instruction itself is in a basic block because passes other than InstCombine may be using InstSimplify as an analysis on values that are not wired up yet. The 3-way compare changes show that InstCombine has some kind of phase-ordering hole. Otherwise, we would have already gotten the intended final result that we now show here. llvm-svn: 347896	2018-11-29 18:44:39 +00:00
Sanjay Patel	14ab9170b8	[InstSimplify] fold funnel shifts with undef operands Splitting these off from the D54666. Patch by: nikic (Nikita Popov) llvm-svn: 347332	2018-11-20 17:34:59 +00:00
Sanjay Patel	2778f56a40	[InstSimplify] add tests for funnel shift with undef operands; NFC These are part of D54666, so adding them here before the patch to show the baseline (currently unoptimized) results. Patch by: @nikic (Nikita Popov) llvm-svn: 347331	2018-11-20 17:30:09 +00:00
Sanjay Patel	eea21da12a	[InstructionSimplify] Add support for saturating add/sub Add support for saturating add/sub in InstructionSimplify. In particular, the following simplifications are supported: sat(X + 0) -> X sat(X + undef) -> -1 sat(X uadd MAX) -> MAX (and commutative variants) sat(X - 0) -> X sat(X - X) -> 0 sat(X - undef) -> 0 sat(undef - X) -> 0 sat(0 usub X) -> 0 sat(X usub MAX) -> 0 Patch by: @nikic (Nikita Popov) Differential Revision: https://reviews.llvm.org/D54532 llvm-svn: 347330	2018-11-20 17:20:26 +00:00
Sanjay Patel	f5ead29b78	[PatternMatch] Handle undef vectors consistently This patch fixes the issue noticed in D54532. The problem is that cst_pred_ty-based matchers like m_Zero() currently do not match scalar undefs (as expected), but do match vector undefs. This may lead to optimization inconsistencies in rare cases. There is only one existing test for which output changes, reverting the change from D53205. The reason here is that vector fsub undef, %x is no longer matched as an m_FNeg(). While I think that the new output is technically worse than the previous one, it is consistent with scalar, and I don't think it's really important either way (generally that undef should have been folded away prior to reassociation.) I've also added another test case for this issue based on InstructionSimplify. It took some effort to find that one, as in most cases undef folds are either checked first -- and in the cases where they aren't it usually happens to not make a difference in the end. This is the only case I was able to come up with. Prior to this patch the test case simplified to undef in the scalar case, but zeroinitializer in the vector case. Patch by: @nikic (Nikita Popov) Differential Revision: https://reviews.llvm.org/D54631 llvm-svn: 347318	2018-11-20 16:08:19 +00:00
Sanjay Patel	f967328e24	[InstSimplify] add tests for saturating add/sub; NFC These are baseline tests for D54532. Patch based on the original tests by: @nikic (Nikita Popov) llvm-svn: 347060	2018-11-16 16:32:34 +00:00

... 3 4 5 6 7 ...

949 Commits