llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	058bb83513	[InstSimplify] use any-zero matcher for fcmp folds The m_APFloat matcher does not work with anything but strict splat vector constants, so we could miss these folds and then trigger an assertion in instcombine: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=13201 llvm-svn: 354406	2019-02-20 00:09:50 +00:00
Sanjay Patel	9cf04addf3	[InstSimplify] add vector tests for fcmp+fabs; NFC llvm-svn: 354404	2019-02-19 23:58:02 +00:00
Dmitry Venikov	aaa709f2ec	[InstSimplify] Missed optimization in math expression: log10(pow(10.0,x)) == x, log2(pow(2.0,x)) == x Summary: This patch enables folding following instructions under -ffast-math flag: log10(pow(10.0,x)) -> x, log2(pow(2.0,x)) -> x Reviewers: hfinkel, spatel, efriedma, craig.topper, zvi, majnemer, lebedev.ri Reviewed By: spatel, lebedev.ri Subscribers: lebedev.ri, llvm-commits Differential Revision: https://reviews.llvm.org/D41940 llvm-svn: 352981	2019-02-03 03:48:30 +00:00
Dmitry Venikov	4c81a2b2ec	Commit tests for changes in revision D41940 llvm-svn: 352734	2019-01-31 07:38:19 +00:00
Nikita Popov	f17421e595	[ConstantFolding] Consolidate and extend bitcount intrinsic tests; NFC Move constant folding tests into ConstantFolding/bitcount.ll and drop various tests in other places. Add coverage for undefs. llvm-svn: 349806	2018-12-20 19:46:52 +00:00
Nikita Popov	221f3fc750	[InstSimplify] Simplify saturating add/sub + icmp If a saturating add/sub has one constant operand, then we can determine the possible range of outputs it can produce, and simplify an icmp comparison based on that. The implementation is based on a similar existing mechanism for simplifying binary operator + icmps. Differential Revision: https://reviews.llvm.org/D55735 llvm-svn: 349369	2018-12-17 17:45:18 +00:00
Nikita Popov	32083bd072	[InstCombine] Add additional saturating add/sub + icmp tests; NFC These test comparisons with saturating add/sub in non-canonical form. llvm-svn: 349309	2018-12-16 17:45:25 +00:00
Nikita Popov	f61567f42a	[InstSimplify] Add tests for saturating add/sub + icmp; NFC If a saturating add/sub with a constant operand is compared to another constant, we should be able to determine that the condition is always true/false in some cases (but currently don't). llvm-svn: 349261	2018-12-15 10:37:01 +00:00
Sanjay Patel	998ececef0	[InstCombine] remove dead code from visitExtractElement Extracting from a splat constant is always handled by InstSimplify. Move the test for this from InstCombine to InstSimplify to make sure that stays true. llvm-svn: 348423	2018-12-05 23:09:33 +00:00
Sanjay Patel	398728732e	[InstSimplify] add tests for undef + partial undef constant folding; NFC These tests should probably go under a separate test file because they should fold with just -constprop, but they're similar to the scalar tests already in here. llvm-svn: 348045	2018-11-30 22:51:34 +00:00
Sanjay Patel	d802270808	[InstSimplify] fold select with implied condition This is an almost direct move of the functionality from InstCombine to InstSimplify. There's no reason not to do this in InstSimplify because we never create a new value with this transform. (There's a question of whether any dominance-based transform belongs in either of these passes, but that's a separate issue.) I've changed 1 of the conditions for the fold (1 of the blocks for the branch must be the block we started with) into an assert because I'm not sure how that could ever be false. We need 1 extra check to make sure that the instruction itself is in a basic block because passes other than InstCombine may be using InstSimplify as an analysis on values that are not wired up yet. The 3-way compare changes show that InstCombine has some kind of phase-ordering hole. Otherwise, we would have already gotten the intended final result that we now show here. llvm-svn: 347896	2018-11-29 18:44:39 +00:00
Sanjay Patel	14ab9170b8	[InstSimplify] fold funnel shifts with undef operands Splitting these off from the D54666. Patch by: nikic (Nikita Popov) llvm-svn: 347332	2018-11-20 17:34:59 +00:00
Sanjay Patel	2778f56a40	[InstSimplify] add tests for funnel shift with undef operands; NFC These are part of D54666, so adding them here before the patch to show the baseline (currently unoptimized) results. Patch by: @nikic (Nikita Popov) llvm-svn: 347331	2018-11-20 17:30:09 +00:00
Sanjay Patel	eea21da12a	[InstructionSimplify] Add support for saturating add/sub Add support for saturating add/sub in InstructionSimplify. In particular, the following simplifications are supported: sat(X + 0) -> X sat(X + undef) -> -1 sat(X uadd MAX) -> MAX (and commutative variants) sat(X - 0) -> X sat(X - X) -> 0 sat(X - undef) -> 0 sat(undef - X) -> 0 sat(0 usub X) -> 0 sat(X usub MAX) -> 0 Patch by: @nikic (Nikita Popov) Differential Revision: https://reviews.llvm.org/D54532 llvm-svn: 347330	2018-11-20 17:20:26 +00:00
Sanjay Patel	f5ead29b78	[PatternMatch] Handle undef vectors consistently This patch fixes the issue noticed in D54532. The problem is that cst_pred_ty-based matchers like m_Zero() currently do not match scalar undefs (as expected), but do match vector undefs. This may lead to optimization inconsistencies in rare cases. There is only one existing test for which output changes, reverting the change from D53205. The reason here is that vector fsub undef, %x is no longer matched as an m_FNeg(). While I think that the new output is technically worse than the previous one, it is consistent with scalar, and I don't think it's really important either way (generally that undef should have been folded away prior to reassociation.) I've also added another test case for this issue based on InstructionSimplify. It took some effort to find that one, as in most cases undef folds are either checked first -- and in the cases where they aren't it usually happens to not make a difference in the end. This is the only case I was able to come up with. Prior to this patch the test case simplified to undef in the scalar case, but zeroinitializer in the vector case. Patch by: @nikic (Nikita Popov) Differential Revision: https://reviews.llvm.org/D54631 llvm-svn: 347318	2018-11-20 16:08:19 +00:00
Sanjay Patel	f967328e24	[InstSimplify] add tests for saturating add/sub; NFC These are baseline tests for D54532. Patch based on the original tests by: @nikic (Nikita Popov) llvm-svn: 347060	2018-11-16 16:32:34 +00:00
Sanjay Patel	5ebd2a785e	[InstSimplify] add test to demonstrate undef matching differences; NFC This is a baseline test for D54631. Patch by: @nikic (Nikita Popov) llvm-svn: 347055	2018-11-16 15:35:58 +00:00
Sanjay Patel	e98ec77a95	[InstSimplify] delete shift-of-zero guard ops around funnel shifts This is a problem seen in common rotate idioms as noted in: https://bugs.llvm.org/show_bug.cgi?id=34924 Note that we are not canonicalizing standard IR (shifts and logic) to the intrinsics yet. (Although I've written this before...) I think this is the last step before we enable that transform. Ie, we could regress code by doing that transform without this simplification in place. In PR34924, I questioned whether this is a valid transform for target-independent IR, but I convinced myself this is ok. If we're speculating a funnel shift by turning cmp+br into select, then SimplifyCFG has already determined that the transform is justified. It's possible that SimplifyCFG is not taking into account profile or other metadata, but if that's true, then it's a bug independent of funnel shifts. Also, we do have CGP code to restore a guard like this around an intrinsic if it can't be lowered cheaply. But that isn't necessary for funnel shift because the default expansion in SelectionDAGBuilder includes this same cmp+select. Differential Revision: https://reviews.llvm.org/D54552 llvm-svn: 346960	2018-11-15 14:53:37 +00:00
Sanjay Patel	4832ffee39	[InstSimplify] add more tests for funnel shift with select; NFC The cases are just different enough that we should have complete tests to avoid bugs from typos in the code. llvm-svn: 346902	2018-11-14 22:34:25 +00:00
Sanjay Patel	7d028670f6	[InstSimplify] add tests for funnel shift with select; NFC llvm-svn: 346881	2018-11-14 19:12:54 +00:00
Sanjay Patel	1440107821	[InstSimplify] fold select (fcmp X, Y), X, Y This is NFCI for InstCombine because it calls InstSimplify, so I left the tests for this transform there. As noted in the code comment, we can allow this fold more often by using FMF and/or value tracking. llvm-svn: 346169	2018-11-05 21:51:39 +00:00
Sanjay Patel	72c2d355b7	[InstSimplify] add tests for select+fcmp; NFC These are translated from InstCombine's test file with the same name. We should move the transform from InstCombine to InstSimplify. llvm-svn: 346168	2018-11-05 21:42:01 +00:00
Sanjay Patel	746ebb4ee8	[InstSimplify] fold icmp based on range of abs/nabs (2nd try) This is retrying the fold from rL345717 (reverted at rL347780) ...with a fix for the miscompile demonstrated by PR39510: https://bugs.llvm.org/show_bug.cgi?id=39510 Original commit message: This is a fix for PR39475: https://bugs.llvm.org/show_bug.cgi?id=39475 We managed to get some of these patterns using computeKnownBits in https://reviews.llvm.org/D47041, but that can't be used for nabs(). Instead, put in some range-based logic, so we can fold both abs/nabs with icmp with a constant value. Alive proofs: https://rise4fun.com/Alive/21r Name: abs_nsw_is_positive %cmp = icmp slt i32 %x, 0 %negx = sub nsw i32 0, %x %abs = select i1 %cmp, i32 %negx, i32 %x %r = icmp sgt i32 %abs, -1 => %r = i1 true Name: abs_nsw_is_not_negative %cmp = icmp slt i32 %x, 0 %negx = sub nsw i32 0, %x %abs = select i1 %cmp, i32 %negx, i32 %x %r = icmp slt i32 %abs, 0 => %r = i1 false Name: nabs_is_negative_or_0 %cmp = icmp slt i32 %x, 0 %negx = sub i32 0, %x %nabs = select i1 %cmp, i32 %x, i32 %negx %r = icmp slt i32 %nabs, 1 => %r = i1 true Name: nabs_is_not_over_0 %cmp = icmp slt i32 %x, 0 %negx = sub i32 0, %x %nabs = select i1 %cmp, i32 %x, i32 %negx %r = icmp sgt i32 %nabs, 0 => %r = i1 false Differential Revision: https://reviews.llvm.org/D53844 llvm-svn: 345832	2018-11-01 14:07:39 +00:00
Sanjay Patel	056807b01e	[InstSimplify] add tests for icmp fold bug (PR39510); NFC Verify that set intersection/subset are not confused. llvm-svn: 345831	2018-11-01 14:03:22 +00:00
Sanjay Patel	72fe03f93b	revert rL345717 : [InstSimplify] fold icmp based on range of abs/nabs This can miscompile as shown in PR39510: https://bugs.llvm.org/show_bug.cgi?id=39510 llvm-svn: 345780	2018-10-31 21:37:40 +00:00
Sanjay Patel	d4dc30c20d	[InstSimplify] fold 'fcmp nnan ult X, 0.0' when X is not negative This is the inverted case for the transform added with D53874 / rL345725. llvm-svn: 345728	2018-10-31 15:35:46 +00:00
Sanjay Patel	85cba3b6fb	[InstSimplify] fold 'fcmp nnan oge X, 0.0' when X is not negative This re-raises some of the open questions about how to apply and use fast-math-flags in IR from PR38086: https://bugs.llvm.org/show_bug.cgi?id=38086 ...but given the current implementation (no FMF on casts), this is likely the only way to predicate the transform. This is part of solving PR39475: https://bugs.llvm.org/show_bug.cgi?id=39475 Differential Revision: https://reviews.llvm.org/D53874 llvm-svn: 345725	2018-10-31 14:57:23 +00:00
Sanjay Patel	1cd9917edf	[InstSimplify] add tests for fcmp and known positive; NFC llvm-svn: 345722	2018-10-31 14:29:21 +00:00
Sanjay Patel	2efccd2cf2	[InstSimplify] fold icmp based on range of abs/nabs This is a fix for PR39475: https://bugs.llvm.org/show_bug.cgi?id=39475 We managed to get some of these patterns using computeKnownBits in D47041, but that can't be used for nabs(). Instead, put in some range-based logic, so we can fold both abs/nabs with icmp with a constant value. Alive proofs: https://rise4fun.com/Alive/21r Name: abs_nsw_is_positive %cmp = icmp slt i32 %x, 0 %negx = sub nsw i32 0, %x %abs = select i1 %cmp, i32 %negx, i32 %x %r = icmp sgt i32 %abs, -1 => %r = i1 true Name: abs_nsw_is_not_negative %cmp = icmp slt i32 %x, 0 %negx = sub nsw i32 0, %x %abs = select i1 %cmp, i32 %negx, i32 %x %r = icmp slt i32 %abs, 0 => %r = i1 false Name: nabs_is_negative_or_0 %cmp = icmp slt i32 %x, 0 %negx = sub i32 0, %x %nabs = select i1 %cmp, i32 %x, i32 %negx %r = icmp slt i32 %nabs, 1 => %r = i1 true Name: nabs_is_not_over_0 %cmp = icmp slt i32 %x, 0 %negx = sub i32 0, %x %nabs = select i1 %cmp, i32 %x, i32 %negx %r = icmp sgt i32 %nabs, 0 => %r = i1 false Differential Revision: https://reviews.llvm.org/D53844 llvm-svn: 345717	2018-10-31 13:25:10 +00:00
Sanjay Patel	bcde5afcac	[InstSimplify] add tests for fcmp folds; NFC This is part of a problem noted in PR39475: https://bugs.llvm.org/show_bug.cgi?id=39475 llvm-svn: 345615	2018-10-30 16:58:43 +00:00
Sanjay Patel	33603198f2	[InstSimplify] add tests for abs/nabs+icmp folding; NFC llvm-svn: 345541	2018-10-29 21:05:41 +00:00
Thomas Lively	c339250e12	[InstCombine] InstCombine and InstSimplify for minimum and maximum Summary: Depends on D52765 Reviewers: aheejin, dschuff Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52766 llvm-svn: 344799	2018-10-19 19:01:26 +00:00
Sanjay Patel	ce3f1915f3	[InstCombine] move/add tests for sub/neg; NFC These should all be handled using "dyn_castNegVal", but that misses vectors with undef elements. llvm-svn: 344790	2018-10-19 17:26:22 +00:00
Cameron McInally	bea5967e8c	[FPEnv] PatternMatcher support for checking FNEG ignoring signed zeros https://reviews.llvm.org/D52934 llvm-svn: 344084	2018-10-09 21:48:00 +00:00
Sanjay Patel	01daf62a0d	[InstSimplify] add vector test for fneg+fdiv; NFC This should be fixed with D52934. llvm-svn: 343936	2018-10-07 14:46:33 +00:00
Sanjay Patel	f3ae9cc33e	[InstSimplify] use isKnownNeverNaN to fold more fcmp ord/uno Remove duplicate tests from InstCombine that were added with D50582. I left negative tests there to verify that nothing in InstCombine tries to go overboard. If isKnownNeverNaN is improved to handle the FP binops or other cases, we should have coverage under InstSimplify, so we could remove more duplicate tests from InstCombine at that time. llvm-svn: 340279	2018-08-21 14:45:13 +00:00
Sanjay Patel	6bb09a4291	[InstSimplify] add tests for FP uno/ord with nnan; NFC This is a slight modification of the tests from D50582; change half of the predicates to 'uno' so we have coverage for that side too. All of the positive tests can fold to a constant (true/false), so that should happen in instsimplify. llvm-svn: 340276	2018-08-21 13:33:13 +00:00
Sanjay Patel	c6944f795d	[InstSimplify] move minnum/maxnum with Inf folds from instcombine llvm-svn: 339396	2018-08-09 22:20:44 +00:00
Sanjay Patel	9b07347033	[InstSimplify] fold fsub+fadd with common operand llvm-svn: 339176	2018-08-07 20:32:55 +00:00
Sanjay Patel	4364d604c2	[InstSimplify] fold fadd+fsub with common operand llvm-svn: 339174	2018-08-07 20:23:49 +00:00
Sanjay Patel	f7a8fb2dee	[InstSimplify] fold fsub+fsub with common operand llvm-svn: 339171	2018-08-07 20:14:27 +00:00
Sanjay Patel	50976393ed	[InstSimplify] add tests for fadd/fsub; NFC Instcombine gets some, but not all, of these cases via it's internal reassociation transforms. It fails in all cases with vector types. llvm-svn: 339168	2018-08-07 19:49:13 +00:00
Sanjay Patel	948ff87d7d	[InstSimplify] move minnum/maxnum with common op fold from instcombine llvm-svn: 339144	2018-08-07 14:36:27 +00:00
Sanjay Patel	b06d283909	[InstSimplify] add tests for minnum/maxnum with shared op; NFC llvm-svn: 339142	2018-08-07 14:13:40 +00:00
Sanjay Patel	b802d18df7	[InstSimplify] move misplaced minnum/maxnum tests; NFC llvm-svn: 339141	2018-08-07 14:12:08 +00:00
Matt Arsenault	56b31d8d75	ValueTracking: Handle canonicalize in CannotBeNegativeZero Also fix apparently missing test coverage for any of the handling here. llvm-svn: 339023	2018-08-06 15:16:26 +00:00
Hiroshi Inoue	73f8b255b6	[InstSimplify] fold extracting from std::pair (2/2) This is the second patch of the series which intends to enable jump threading for an inlined method whose return type is std::pair<int, bool> or std::pair<bool, int>. The first patch is https://reviews.llvm.org/rL338485. This patch handles code sequences that merges two values using `shl` and `or`, then extracts one value using `and`. Differential Revision: https://reviews.llvm.org/D49981 llvm-svn: 338817	2018-08-03 05:39:48 +00:00
Sanjay Patel	3f6e9a71f7	[InstSimplify] move minnum/maxnum with undef fold from instcombine llvm-svn: 338719	2018-08-02 14:33:40 +00:00
Sanjay Patel	f9a0d593e9	[ValueTracking] fix maxnum miscompile for cannotBeOrderedLessThanZero (PR37776) This adds the NAN checks suggested in PR37776: https://bugs.llvm.org/show_bug.cgi?id=37776 If both operands to maxnum are NAN, that should get constant folded, so we don't have to handle that case. This is the same assumption as other FP ops in this function. Returning 'false' is always conservatively correct. Copying from the bug report: Currently, we have this for "when is cannotBeOrderedLessThanZero (mustBePositiveOrNaN) true for maxnum": L ------------------- \| Pos \| Neg \| NaN \| ------------------------ \|Pos \| x \| x \| x \| ------------------------ R \|Neg \| x \| \| x \| ------------------------ \|NaN \| x \| x \| x \| ------------------------ The cases with (Neg & NaN) are wrong. We should have: L ------------------- \| Pos \| Neg \| NaN \| ------------------------ \|Pos \| x \| x \| x \| ------------------------ R \|Neg \| x \| \| \| ------------------------ \|NaN \| x \| \| x \| ------------------------ Differential Revision: https://reviews.llvm.org/D50081 llvm-svn: 338716	2018-08-02 13:46:20 +00:00
Sanjay Patel	28c7e41c09	[InstSimplify] move minnum/maxnum with same arg fold from instcombine llvm-svn: 338652	2018-08-01 23:05:55 +00:00
Hiroshi Inoue	02f79eae06	[InstSimplify] fold extracting from std::pair (1/2) This patch intends to enable jump threading when a method whose return type is std::pair<int, bool> or std::pair<bool, int> is inlined. For example, jump threading does not happen for the if statement in func. std::pair<int, bool> callee(int v) { int a = dummy(v); if (a) return std::make_pair(dummy(v), true); else return std::make_pair(v, v < 0); } int func(int v) { std::pair<int, bool> rc = callee(v); if (rc.second) { // do something } SROA executed before the method inlining replaces std::pair by i64 without splitting in both callee and func since at this point no access to the individual fields is seen to SROA. After inlining, jump threading fails to identify that the incoming value is a constant due to additional instructions (like or, and, trunc). This series of patch add patterns in InstructionSimplify to fold extraction of members of std::pair. To help jump threading, actually we need to optimize the code sequence spanning multiple BBs. These patches does not handle phi by itself, but these additional patterns help NewGVN pass, which calls instsimplify to check opportunities for simplifying instructions over phi, apply phi-of-ops optimization to result in successful jump threading. SimplifyDemandedBits in InstCombine, can do more general optimization but this patch aims to provide opportunities for other optimizers by supporting a simple but common case in InstSimplify. This first patch in the series handles code sequences that merges two values using shl and or and then extracts one value using lshr. Differential Revision: https://reviews.llvm.org/D48828 llvm-svn: 338485	2018-08-01 04:40:32 +00:00
David Bolvansky	16d8a69b90	[InstSimplify] Fold another Select with And/Or pattern Summary: Proof: https://rise4fun.com/Alive/L5J Reviewers: lebedev.ri, spatel Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D49975 llvm-svn: 338383	2018-07-31 14:17:15 +00:00
Hiroshi Inoue	2f6769be60	[InstSimplify] tests for D48828, D49981: fold extraction from std::pair Minor touch up in the previous comment. llvm-svn: 338351	2018-07-31 05:29:20 +00:00
Hiroshi Inoue	5427d3efc2	[InstSimplify] tests for D48828, D49981: fold extraction from std::pair Updated unit tests for D48828 and D49981. llvm-svn: 338350	2018-07-31 05:10:36 +00:00
David Bolvansky	76e95865f3	[InstSimplify] [NFC] Tests for Select with AND/OR fold llvm-svn: 338285	2018-07-30 18:22:18 +00:00
Sanjay Patel	54421ce918	[InstSimplify] fold funnel shifts with 0-shift amount llvm-svn: 338218	2018-07-29 16:36:38 +00:00
Sanjay Patel	46af5835af	[InstSimplify] add tests for funnel shift intrinsics; NFC llvm-svn: 338217	2018-07-29 16:27:17 +00:00
David Bolvansky	9800b710c2	[InstSimplify] Moved Select + AND/OR tests from InstCombine Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D49957 llvm-svn: 338195	2018-07-28 13:52:45 +00:00
Hiroshi Inoue	eeab694cea	[InstSimplify] tests for D48828: fold extraction from std::pair This commit includes unit tests for D48828, which enhances InstSimplify to enable jump threading with a method whose return type is std::pair<int, bool> or std::pair<bool, int>. I am going to commit the actual transformation later. llvm-svn: 338107	2018-07-27 07:21:02 +00:00
Chen Zheng	69bb064539	[InstrSimplify] fold sdiv if two operands are negated and non-overflow Differential Revision: https://reviews.llvm.org/D49382 llvm-svn: 337642	2018-07-21 12:27:54 +00:00
Chen Zheng	0f609db61a	[NFC][testcases] fold sdiv if two operands are negated and non-overflow llvm-svn: 337549	2018-07-20 13:38:59 +00:00
Chen Zheng	f801d0fea9	[InstSimplify] fold srem instruction if its two operands are negated. Differential Revision: https://reviews.llvm.org/D49423 llvm-svn: 337545	2018-07-20 13:00:47 +00:00
Chen Zheng	35395a6773	[NFC][testcases] more testcases for folding srem if its two operands are negatived. llvm-svn: 337543	2018-07-20 12:53:45 +00:00
Chen Zheng	fcfcf07104	[NFC][testcases] add testcases for folding srem whose operands are negatived. Finish same optimization for add instruction in D49216 and sdiv instruction in D49382. This patch is for srem instruction. llvm-svn: 337270	2018-07-17 12:31:54 +00:00
Chen Zheng	c992cc4e97	[testcases] move testcases to right place - NFC Differential Revision: https://reviews.llvm.org/D49409 llvm-svn: 337230	2018-07-17 01:04:41 +00:00
Sanjay Patel	fae8ed0104	[InstSimplify] add fixme comment for PR37776; NFC llvm-svn: 337129	2018-07-15 16:13:58 +00:00
Sanjay Patel	92d0c1c129	[InstSimplify] fold minnum/maxnum with NaN arg This fold is repeated/misplaced in instcombine, but I'm not sure if it's safe to remove that yet because some other folds appear to be asserting that the transform has occurred within instcombine itself. This isn't the best fix for PR37776, but it probably hides the bug with the given code example: https://bugs.llvm.org/show_bug.cgi?id=37776 We have another test to demonstrate the more general bug. llvm-svn: 337127	2018-07-15 14:52:16 +00:00
Sanjay Patel	ef71b704c2	[InstSimplify] add tests for minnum/maxnum; NFC This isn't the best fix for PR37776, but it probably hides the bug with the given code example: https://bugs.llvm.org/show_bug.cgi?id=37776 We have another test to demonstrate the more general bug. llvm-svn: 337126	2018-07-15 14:46:48 +00:00
Chen Zheng	fdf13ef342	[InstSimplify] simplify add instruction if two operands are negative Differential Revision: https://reviews.llvm.org/D49216 llvm-svn: 336881	2018-07-12 03:06:04 +00:00
Sanjay Patel	14eeb5a5c0	[InstSimplify] add/move tests for add folds; NFC isKnownNegation() is currently proposed as part of D48754, but it could be used to make InstSimplify stronger independently of any abs() improvements. llvm-svn: 336822	2018-07-11 16:52:18 +00:00
Manoj Gupta	77eeac3d9e	llvm: Add support for "-fno-delete-null-pointer-checks" Summary: Support for this option is needed for building Linux kernel. This is a very frequently requested feature by kernel developers. More details : https://lkml.org/lkml/2018/4/4/601 GCC option description for -fdelete-null-pointer-checks: This Assume that programs cannot safely dereference null pointers, and that no code or data element resides at address zero. -fno-delete-null-pointer-checks is the inverse of this implying that null pointer dereferencing is not undefined. This feature is implemented in LLVM IR in this CL as the function attribute "null-pointer-is-valid"="true" in IR (Under review at D47894). The CL updates several passes that assumed null pointer dereferencing is undefined to not optimize when the "null-pointer-is-valid"="true" attribute is present. Reviewers: t.p.northover, efriedma, jyknight, chandlerc, rnk, srhines, void, george.burgess.iv Reviewed By: efriedma, george.burgess.iv Subscribers: eraman, haicheng, george.burgess.iv, drinkcat, theraven, reames, sanjoy, xbolva00, llvm-commits Differential Revision: https://reviews.llvm.org/D47895 llvm-svn: 336613	2018-07-09 22:27:23 +00:00
Sanjay Patel	ad0bfb844d	[InstSimplify] fold shifts by sext bool https://rise4fun.com/Alive/c3Y llvm-svn: 335633	2018-06-26 17:31:38 +00:00
Sanjay Patel	3d1e4d6fa6	[InstSimplify] add tests for shifts by sext bool; NFC llvm-svn: 335631	2018-06-26 17:15:07 +00:00
Sanjay Patel	2b7e31095d	[InstSimplify] fold srem with sext bool divisor llvm-svn: 335616	2018-06-26 15:32:54 +00:00
Sanjay Patel	0e0dbebeed	[InstSimplify] add tests for srem with sext bool divisor; NFC llvm-svn: 335609	2018-06-26 14:47:31 +00:00
Sanjay Patel	1e911fa746	[InstSimplify] fold div/rem of zexted bool I was looking at an unrelated fold and noticed that we don't have this simplification (because the other fold would break existing tests). Name: zext udiv %z = zext i1 %x to i32 %r = udiv i32 %y, %z => %r = %y Name: zext urem %z = zext i1 %x to i32 %r = urem i32 %y, %z => %r = 0 Name: zext sdiv %z = zext i1 %x to i32 %r = sdiv i32 %y, %z => %r = %y Name: zext srem %z = zext i1 %x to i32 %r = srem i32 %y, %z => %r = 0 https://rise4fun.com/Alive/LZ9 llvm-svn: 335512	2018-06-25 18:51:21 +00:00
Sanjay Patel	a46bcbec58	[InstSimplify] add tests for div/rem with bool divisor; NFC llvm-svn: 335509	2018-06-25 18:27:14 +00:00
Sanjay Patel	0c57de4c21	[InstSimplify] Fix missed optimization in simplifyUnsignedRangeCheck() For both operands are unsigned, the following optimizations are valid, and missing: 1. X > Y && X != 0 --> X > Y 2. X > Y \|\| X != 0 --> X != 0 3. X <= Y \|\| X != 0 --> true 4. X <= Y \|\| X == 0 --> X <= Y 5. X > Y && X == 0 --> false unsigned foo(unsigned x, unsigned y) { return x > y && x != 0; } should fold to x > y, but I found we haven't done it right now. besides, unsigned foo(unsigned x, unsigned y) { return x < y && y != 0; } Has been folded to x < y, so there may be a bug. Patch by: Li Jia He! Differential Revision: https://reviews.llvm.org/D47922 llvm-svn: 335129	2018-06-20 14:22:49 +00:00
Sanjay Patel	c1d2177f9d	[InstSimplify] Add tests for missed optimizations in simplifyUnsignedRangeCheck (NFC) These are the baseline tests for the functional change in D47922. Patch by Li Jia He! Differential Revision: https://reviews.llvm.org/D48000 llvm-svn: 335128	2018-06-20 14:03:13 +00:00
Roman Lebedev	b060ce45ca	[InstSimplify] add nuw %x, -1 -> -1 fold. Summary: `%ret = add nuw i8 %x, C` From [[ https://llvm.org/docs/LangRef.html#add-instruction \| langref ]]: nuw and nsw stand for “No Unsigned Wrap” and “No Signed Wrap”, respectively. If the nuw and/or nsw keywords are present, the result value of the add is a poison value if unsigned and/or signed overflow, respectively, occurs. So if `C` is `-1`, `%x` can only be `0`, and the result is always `-1`. I'm not sure we want to use `KnownBits`/`LVI` here, because there is exactly one possible value (all bits set, `-1`), so some other pass should take care of replacing the known-all-ones with constant `-1`. The `test/Transforms/InstCombine/set-lowbits-mask-canonicalize.ll` change is confusing. What happening is, before this: (omitting `nuw` for simplicity) 1. First, InstCombine D47428/rL334127 folds `shl i32 1, %NBits`) to `shl nuw i32 -1, %NBits` 2. Then, InstSimplify D47883/rL334222 folds `shl nuw i32 -1, %NBits` to `-1`, 3. `-1` is inverted to `0`. But now: 1. This InstSimplify fold `%ret = add nuw i32 %setbit, -1` -> `-1` happens first, before InstCombine D47428/rL334127 fold could happen. Thus we now end up with the opposite constant, and it is all good: https://rise4fun.com/Alive/OA9 https://rise4fun.com/Alive/sldC Was mentioned in D47428 review. Follow-up for D47883. Reviewers: spatel, craig.topper Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47908 llvm-svn: 334298	2018-06-08 15:44:47 +00:00
Roman Lebedev	188a619e56	[NFC][InstSimplify] Add tests for add nuw %x, -1 -> -1 fold. %ret = add nuw i8 %x, C From langref: nuw and nsw stand for “No Unsigned Wrap” and “No Signed Wrap”, respectively. If the nuw and/or nsw keywords are present, the result value of the add is a poison value if unsigned and/or signed overflow, respectively, occurs. So if C is -1, %x can only be 0, and the result is always -1. https://rise4fun.com/Alive/sldC Was mentioned in D47428 review. llvm-svn: 334236	2018-06-07 21:19:50 +00:00
Roman Lebedev	fdd90f2fc6	[NFC][InstSimplify] One more negative test for shl nuw C, %x -> C fold. Follow-up for rL334200, rL334206. llvm-svn: 334235	2018-06-07 21:19:45 +00:00
Roman Lebedev	2683802ba0	[InstSimplify] shl nuw C, %x -> C iff signbit is set on C. Summary: `%r = shl nuw i8 C, %x` As per langref: ``` If the nuw keyword is present, then the shift produces a poison value if it shifts out any non-zero bits. ``` Thus, if the sign bit is set on `C`, then `%x` can only be `0`, which means that `%r` can only be `C`. Or in other words, set sign bit means that the signed value is negative, so the constant is `<= 0`. https://rise4fun.com/Alive/WMk https://rise4fun.com/Alive/udv Was mentioned in D47428 review. We already handle the `0` constant, https://godbolt.org/g/UZq1sJ, so this only handles negative constants. Could use computeKnownBits() / LazyValueInfo, but the cost-benefit analysis (https://reviews.llvm.org/D47891) suggests it isn't worth it. Reviewers: spatel, craig.topper Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47883 llvm-svn: 334222	2018-06-07 20:03:45 +00:00
Roman Lebedev	86d376f516	[NFC][InstSimplify] Add more tests for shl nuw C, %x -> C fold. Follow-up for rL334200. For these, KnownBits will be needed. llvm-svn: 334206	2018-06-07 16:18:26 +00:00
Roman Lebedev	847938925b	[NFC][InstSimplify] Add tests for shl nuw C, %x -> C fold. %r = shl nuw i8 C, %x As per langref: If the nuw keyword is present, then the shift produces a poison value if it shifts out any non-zero bits. Thus, if the sign bit is set on C, then %x can only be 0, which means that %r can only be C. https://rise4fun.com/Alive/WMk Was mentioned in D47428 review. llvm-svn: 334200	2018-06-07 14:18:38 +00:00
Sanjay Patel	3ef8f858da	[InstCombine] move misplaced test file and regenerate checks; NFC llvm-svn: 333039	2018-05-22 23:15:56 +00:00
Sanjay Patel	a003c728a5	[InstCombine] choose 1 form of abs and nabs as canonical We already do this for min/max (see the blob above the diff), so we should do the same for abs/nabs. A sign-bit check (<s 0) is used as a predicate for other IR transforms and it's likely the best for codegen. This might solve the motivating cases for D47037 and D47041, but I think those patches still make sense. We can't guarantee this canonicalization if the icmp has more than one use. Differential Revision: https://reviews.llvm.org/D47076 llvm-svn: 332819	2018-05-20 14:23:23 +00:00
Shiva Chen	2c864551df	[DebugInfo] Add DILabel metadata and intrinsic llvm.dbg.label. In order to set breakpoints on labels and list source code around labels, we need collect debug information for labels, i.e., label name, the function label belong, line number in the file, and the address label located. In order to keep these information in LLVM IR and to allow backend to generate debug information correctly. We create a new kind of metadata for labels, DILabel. The format of DILabel is !DILabel(scope: !1, name: "foo", file: !2, line: 3) We hope to keep debug information as much as possible even the code is optimized. So, we create a new kind of intrinsic for label metadata to avoid the metadata is eliminated with basic block. The intrinsic will keep existing if we keep it from optimized out. The format of the intrinsic is llvm.dbg.label(metadata !1) It has only one argument, that is the DILabel metadata. The intrinsic will follow the label immediately. Backend could get the label metadata through the intrinsic's parameter. We also create DIBuilder API for labels to be used by Frontend. Frontend could use createLabel() to allocate DILabel objects, and use insertLabel() to insert llvm.dbg.label intrinsic in LLVM IR. Differential Revision: https://reviews.llvm.org/D45024 Patch by Hsiangkai Wang. llvm-svn: 331841	2018-05-09 02:40:45 +00:00
George Burgess IV	8e807bf3fa	Reland r301880(!): "[InstSimplify] Handle selects of GEPs with 0 offset" I was reminded today that this patch got reverted in r301885. I can no longer reproduce the failure that caused the revert locally (...almost one year later), and the patch applied pretty cleanly, so I guess we'll see if the bots still get angry about it. The original breakage was InstSimplify complaining (in "assertion failed" form) about getting passed some crazy IR when running `ninja check-sanitizer`. I'm unable to find traces of what, exactly, said crazy IR was. I suppose we'll find out pretty soon if that's still the case. :) Original commit: Author: gbiv Date: Mon May 1 18:12:08 2017 New Revision: 301880 URL: http://llvm.org/viewvc/llvm-project?rev=301880&view=rev Log: [InstSimplify] Handle selects of GEPs with 0 offset In particular (since it wouldn't fit nicely in the summary): (select (icmp eq V 0) P (getelementptr P V)) -> (getelementptr P V) Differential Revision: https://reviews.llvm.org/D31435 llvm-svn: 330667	2018-04-24 00:25:01 +00:00
Sanjay Patel	30be665e82	[PatternMatch] allow undef elements when matching a vector zero This is the last step in getting constant pattern matchers to allow undef elements in constant vectors. I'm adding a dedicated m_ZeroInt() function and building m_Zero() from that. In most cases, calling code can be updated to use m_ZeroInt() directly when there's no need to match pointers, but I'm leaving that efficiency optimization as a follow-up step because it's not always clear when that's ok. There are just enough icmp folds in InstSimplify that can be used for integer or pointer types, that we probably still want a generic m_Zero() for those cases. Otherwise, we could eliminate it (and possibly add a m_NullPtr() as an alias for isa<ConstantPointerNull>()). We're conservatively returning a full zero vector (zeroinitializer) in InstSimplify/InstCombine on some of these folds (see diffs in InstSimplify), but I'm not sure if that's actually necessary in all cases. We may be able to propagate an undef lane instead. One test where this happens is marked with 'TODO'. llvm-svn: 330550	2018-04-22 17:07:44 +00:00
Sanjay Patel	e187cd3273	[InstSimplify, InstCombine] add vector tests with undef elts; NFC llvm-svn: 330543	2018-04-22 14:19:37 +00:00
Sanjay Patel	5f845732ed	[InstSimplify] move tests for shifts; NFC llvm-svn: 330516	2018-04-21 16:58:00 +00:00
Sanjay Patel	d0b27a1156	[InstSimplify] move/add/regenerate checks for tests; NFC llvm-svn: 330515	2018-04-21 16:23:47 +00:00
Sanjay Patel	93e64dd9a1	[PatternMatch] allow undef elements when matching vector FP +0.0 This continues the FP constant pattern matching improvements from: https://reviews.llvm.org/rL327627 https://reviews.llvm.org/rL327339 https://reviews.llvm.org/rL327307 Several integer constant matchers also have this ability. I'm separating matching of integer/pointer null from FP positive zero and renaming/commenting to make the functionality clearer. llvm-svn: 328461	2018-03-25 21:16:33 +00:00
Sanjay Patel	c84b48ec29	[InstSimplify, InstCombine] add/update tests with FP +0.0 vector with undef; NFC llvm-svn: 328455	2018-03-25 17:48:20 +00:00
Sanjay Patel	3547dcb3ae	[InstSimplify] regenerate checks, move tests; NFC llvm-svn: 328327	2018-03-23 15:31:31 +00:00
Sanjay Patel	e235942a1e	[InstSimplify] fp_binop X, NaN --> NaN We propagate the existing NaN value when possible. Differential Revision: https://reviews.llvm.org/D44521 llvm-svn: 328140	2018-03-21 19:31:53 +00:00
Sanjay Patel	95ec4a4dfe	[InstSimplify] loosen FMF for sqrt(X) * sqrt(X) --> X As shown in the code comment, we don't need all of 'fast', but we do need reassoc + nsz + nnan. Differential Revision: https://reviews.llvm.org/D43765 llvm-svn: 327796	2018-03-18 14:12:25 +00:00
Sanjay Patel	5a5c33d8b5	[InstSimplify] add NaN constant diversity; NFC llvm-svn: 327743	2018-03-16 20:55:55 +00:00
Roman Lebedev	6aca33534b	[InstSimplify] peek through unsigned FP casts for sign-bit compares (PR36682) This pattern came up in PR36682 / D44390 https://bugs.llvm.org/show_bug.cgi?id=36682 https://reviews.llvm.org/D44390 https://godbolt.org/g/oKvT5H See also D44421, D44424 Reviewers: spatel, majnemer, efriedma, arsenm Reviewed By: spatel Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D44425 llvm-svn: 327642	2018-03-15 16:17:46 +00:00
Matthew Simpson	c1c4ad6e64	[ConstantFolding, InstSimplify] Handle more vector GEPs This patch addresses some additional cases where the compiler crashes upon encountering vector GEPs. This should fix PR36116. Differential Revision: https://reviews.llvm.org/D44219 Reference: https://bugs.llvm.org/show_bug.cgi?id=36116 llvm-svn: 327638	2018-03-15 16:00:29 +00:00
Sanjay Patel	43f71eade0	[InstSimplify] add tests with NaN operand for fp binops; NFC llvm-svn: 327631	2018-03-15 14:48:39 +00:00
Sanjay Patel	a4f42f2cfd	[PatternMatch, InstSimplify] allow undef elements when matching any vector FP zero This matcher implementation appears to be slightly more efficient than the generic constant check that it is replacing because every use was for matching FP patterns, but the previous code would check int and pointer type nulls too. llvm-svn: 327627	2018-03-15 14:29:27 +00:00
Sanjay Patel	8f063d0c70	[InstSimplify] remove 'nsz' requirement for frem 0, X From the LangRef definition for frem: "The value produced is the floating-point remainder of the two operands. This is the same output as a libm ‘fmod‘ function, but without any possibility of setting errno. The remainder has the same sign as the dividend. This instruction is assumed to execute in the default floating-point environment." llvm-svn: 327626	2018-03-15 14:04:31 +00:00
Sanjay Patel	e4e3f79b83	[InstSimplify] add tests for frem and vectors with undef; NFC These should all be folded. The vector tests need to have m_AnyZero updated to ignore undef elements, but we need to be careful not to return the existing value in that case and unintentionally propagate undef. llvm-svn: 327585	2018-03-14 22:45:58 +00:00
Sanjay Patel	11f7f9908b	[InstSimplify] fix folds for (0.0 - X) + X --> 0 (PR27151) As shown in: https://bugs.llvm.org/show_bug.cgi?id=27151 ...the existing fold could miscompile when X is NaN. The fold was also dependent on 'ninf' but that's not necessary. From IEEE-754 (with default rounding which we can assume for these opcodes): "When the sum of two operands with opposite signs (or the difference of two operands with like signs) is exactly zero, the sign of that sum (or difference) shall be +0...However, x + x = x − (−x) retains the same sign as x even when x is zero." llvm-svn: 327575	2018-03-14 21:23:27 +00:00
Sanjay Patel	fd82fd000c	[InstSimplify] add tests to show missing/broken fadd folds (PR27151, PR26958); NFC llvm-svn: 327554	2018-03-14 18:52:40 +00:00
Sanjay Patel	e011d7964d	[InstSimplify] regenerate checks; NFC llvm-svn: 327553	2018-03-14 18:49:57 +00:00
Roman Lebedev	60d24445dd	[InstSimplify] [NFC] cast-unsigned-icmp-cmp-0.ll - don't run instcombine As disscussed in post-commit review of D44421, there is simply no reason to run instcombine on this testcase. llvm-svn: 327541	2018-03-14 17:59:12 +00:00
Roman Lebedev	978aae7614	[InstSimplify] [NFC] Add tests for peeking through unsigned FP casts for sign compares (PR36682) Summary: This pattern came up in PR36682 / D44390 https://bugs.llvm.org/show_bug.cgi?id=36682 https://reviews.llvm.org/D44390 https://godbolt.org/g/oKvT5H Looking at the IR pattern in question, as per [[ https://github.com/rutgers-apl/alive-nj \| alive-nj ]], for all the type combinations i checked (input: `i16`, `i32`, `i64`; intermediate: `half`/`i16`, `float`/`i32`, `double`/`i64`) for the following `icmp` comparisons the `uitofp`+`bitcast`+`icmp` can be evaluated to a boolean: * `slt 0` * `sgt -1` I did not check vectors, but i'm guessing it's the same there. {F5889242} Thus all these cases are in the testcase (along with the vector variant with additional `undef` element in the middle). There are no negative patterns here (unless alive-nj lied/is broken), all of these should be optimized. Reviewers: spatel, majnemer, efriedma, arsenm Reviewed By: spatel Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D44421 llvm-svn: 327535	2018-03-14 17:31:08 +00:00
Sanjay Patel	1dbb6b7282	[PatternMatch] enhance m_NaN() to ignore undef elements in vectors llvm-svn: 327339	2018-03-12 22:18:47 +00:00
Sanjay Patel	5b034c83d6	[InstSimplify] add fcmp tests for constant NaN vector with undef elt; NFC llvm-svn: 327335	2018-03-12 21:44:17 +00:00
Sanjay Patel	a0d8d127c6	[PatternMatch, InstSimplify] allow undef elements when matching vector -0.0 This is the FP equivalent of D42818. Use it for the few cases in InstSimplify with -0.0 folds (that's the only current use of m_NegZero()). Differential Revision: https://reviews.llvm.org/D43792 llvm-svn: 327307	2018-03-12 18:17:01 +00:00
Sanjay Patel	49b7dc2968	[InstSimplify] add test for m_NegZero with undef elt; NFC llvm-svn: 327287	2018-03-12 15:47:32 +00:00
Sanjay Patel	4222716822	[InstSimplify] fp_binop X, undef --> NaN The variable operand could be NaN, so it's always safe to propagate NaN. llvm-svn: 327212	2018-03-10 16:51:28 +00:00
Sanjay Patel	e5606b4fa5	[ConstantFold] fp_binop AnyConstant, undef --> NaN With the updated LangRef ( D44216 / rL327138 ) in place, we can proceed with more constant folding. I'm intentionally taking the conservative path here: no matter what the constant or the FMF, we can always fold to NaN. This is because the undef operand can be chosen as NaN, and in our simplified default FP env, nothing else happens - NaN just propagates to the result. If we find some way/need to propagate undef instead, that can be added subsequently. The tests show that we always choose the same quiet NaN constant (0x7FF8000000000000 in IR text). There were suggestions to improve that with a 'NaN' string token or not always print a 64-bit hex value, but those are independent changes. We might also consider setting/propagating the payload of NaN constants as an enhancement. Differential Revision: https://reviews.llvm.org/D44308 llvm-svn: 327208	2018-03-10 15:56:25 +00:00
Sanjay Patel	3675b8cece	[InstSimplify] fix FP infinite hex constant values in tests; NFC Really should improve this... llvm-svn: 327144	2018-03-09 16:14:02 +00:00
Sanjay Patel	2ee7b9349d	[ConstantFold] fp_binop undef, undef --> undef These are uncontroversial and independent of a proposed LangRef edits (D44216). I tried to fix tests that would fold away: rL327004 rL327028 rL327030 rL327034 I'm not sure if the Reassociate tests are meaningless yet, but they probably will be as we add more folds, so if anyone has suggestions or wants to fix those, please do. Differential Revision: https://reviews.llvm.org/D44258 llvm-svn: 327058	2018-03-08 20:42:49 +00:00
Sanjay Patel	a87b74f72b	[InstSimplify] add more tests for FP undef; NFC llvm-svn: 327012	2018-03-08 15:39:39 +00:00
Sanjay Patel	bf28a8fc01	[InstSimplify] add tests for FP with undef operand; NFC Are any of these correct? llvm-svn: 326241	2018-02-27 20:17:18 +00:00
Craig Topper	301991080e	[ValueTracking] Teach cannotBeOrderedLessThanZeroImpl to look through ExtractElement. This is similar to what's done in computeKnownBits and computeSignBits. Don't do anything fancy just collect information valid for any element. Differential Revision: https://reviews.llvm.org/D43789 llvm-svn: 326237	2018-02-27 19:53:45 +00:00
Sanjay Patel	66911b16e6	[InstCombine, InstSimplify] add tests with undef elements in constant FP vectors; NFC llvm-svn: 326148	2018-02-26 23:23:02 +00:00
Craig Topper	69c8972fd1	[ValueTracking] Teach cannotBeOrderedLessThanZeroImpl to handle vector constants. Summary: This allows vector fabs to be removed in more cases. Reviewers: spatel, arsenm, RKSimon Reviewed By: spatel Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D43739 llvm-svn: 326138	2018-02-26 22:33:17 +00:00
Craig Topper	aee341ef28	[InstSimplify] Add test cases for removal of vector fabs on known positive. llvm-svn: 326050	2018-02-25 06:51:52 +00:00
Craig Topper	2b8f051aaa	[InstSimplify] Remove unused parameter from test cases. llvm-svn: 326049	2018-02-25 06:51:51 +00:00
Sanjay Patel	db53d1847b	[InstSimplify] sqrt(X) * sqrt(X) --> X This was misplaced in InstCombine. We can loosen the FMF as a follow-up step. llvm-svn: 325965	2018-02-23 22:20:13 +00:00
Sanjay Patel	e29caaa9c5	[PatternMatch] enhance m_SignMask() to ignore undef elements in vectors llvm-svn: 325623	2018-02-20 21:02:40 +00:00
Sanjay Patel	ff7b777bbe	[InstSimplify] add tests for m_SignMask with undef vector elements; NFC llvm-svn: 325622	2018-02-20 20:53:35 +00:00
Sanjay Patel	adf6e88c74	[PatternMatch, InstSimplify] enhance m_AllOnes() to ignore undef elements in vectors Loosening the matcher definition reveals a subtle bug in InstSimplify (we should not assume that because an operand constant matches that it's safe to return it as a result). So I'm making that change here too (that diff could be independent, but I'm not sure how to reveal it before the matcher change). This also seems like a good reason to not include matchers that capture the value. We don't want to encourage the potential misstep of propagating undef values when it's not allowed/intended. I didn't include the capture variant option here or in the related rL325437 (m_One), but it already exists for other constant matchers. llvm-svn: 325466	2018-02-18 18:05:08 +00:00
Sanjay Patel	7faceaed31	[InstSimplify] add tests with vector undef elts; NFC llvm-svn: 325465	2018-02-18 17:39:09 +00:00
Sanjay Patel	f569578373	[PatternMatch] enhance m_One() to ignore undef elements in vectors llvm-svn: 325437	2018-02-17 16:00:42 +00:00
Sanjay Patel	a6a1426cf1	[InstSimplify, InstCombine] add tests with vector undef elts; NFC These would fold if the m_One pattern matcher accounted for undef elts. llvm-svn: 325436	2018-02-17 15:55:40 +00:00
Sanjay Patel	841ca95219	[InstSimplify] add vector select tests with undef elts in condition; NFC llvm-svn: 325419	2018-02-17 01:18:53 +00:00
Sanjay Patel	b13fcd52ed	[InstCombine, InstSimplify] (re)move tests, regenerate checks; NFC The InstCombine integer mul test file had tests that belong in InstSimplify (including fmul tests). Move things to where they belong and auto-generate complete checks for everything. llvm-svn: 325037	2018-02-13 18:22:53 +00:00
Sanjay Patel	246d769232	[InstSimplify] allow exp/log simplifications with only 'reassoc' FMF These intrinsic folds were added with D41381, but only allowed with isFast(). That's more than necessary because FMF has 'reassoc' to apply to these kinds of folds after D39304, and that's all we need in these cases. Differential Revision: https://reviews.llvm.org/D43160 llvm-svn: 324967	2018-02-12 23:51:23 +00:00
Sanjay Patel	c5d5933bd5	[InstSimplify] change tests to 'fast' to reflect current folds The diff to use 'reassoc' is part of D43160; it should not have been made with rL324961. Reverting that part here, so we'll see the intended diff with the code change. llvm-svn: 324963	2018-02-12 23:39:10 +00:00
Sanjay Patel	de3e889a88	[InstSimplify] consolidate tests for log-exp inverse folds Some tests didn't add much value because we already show stronger constraints for the folds in other tests, so the weaker versions were deleted. Moved the remaining tests into 1 file because the folds are very similar and handled from 1 place in the code. llvm-svn: 324961	2018-02-12 23:18:11 +00:00
Sanjay Patel	a60aec1ab7	[ValueTracking] don't crash when assumptions conflict (PR36270) The last assume in the test says that %B12 is 0. The first assume says that %and1 is less than %B12. Therefore, %and1 is unsigned less than 0...does not compute. That means this line: Known.Zero.setHighBits(RHSKnown.countMinLeadingZeros() + 1); ...tries to set more bits than exist. Differential Revision: https://reviews.llvm.org/D43052 llvm-svn: 324610	2018-02-08 14:52:40 +00:00
Sanjay Patel	83f056604c	[InstSimplify] (X * Y) / Y --> X for relaxed floating-point ops This is the FP counterpart that was mentioned in PR35709: https://bugs.llvm.org/show_bug.cgi?id=35709 Differential Revision: https://reviews.llvm.org/D42385 llvm-svn: 323716	2018-01-30 00:18:37 +00:00
Zvi Rackover	51f0d64b9c	InstSimplify: If divisor element is undef simplify to undef Summary: If any vector divisor element is undef, we can arbitrarily choose it be zero which would make the div/rem an undef value by definition. Reviewers: spatel, reames Reviewed By: spatel Subscribers: magabari, llvm-commits Differential Revision: https://reviews.llvm.org/D42485 llvm-svn: 323343	2018-01-24 17:22:00 +00:00
Anton Bikineev	82f61151b3	[InstSimplify] (X << Y) % X -> 0 llvm-svn: 323182	2018-01-23 09:27:47 +00:00
Sanjay Patel	7a44e4d594	[InstSimplify] add baseline tests for (X << Y) % X -> 0; NFC This is the 'rem' counterpart to D42032 and would be folded by D42341. Patch by Anton Bikineev. Differential Revision: https://reviews.llvm.org/D42342 llvm-svn: 323067	2018-01-21 15:36:15 +00:00
Sanjay Patel	a19b748f6d	[InstSimplify] regenerate checks and add tests for commutes; NFC llvm-svn: 322907	2018-01-18 23:11:24 +00:00
Sanjay Patel	4158eff0f8	[InstSimplify] fold implied null ptr check (PR35790) This extends rL322327 to handle the pointer cast and should solve: https://bugs.llvm.org/show_bug.cgi?id=35790 Name: or_eq_zero %isnull = icmp eq i64* %p, null %x = ptrtoint i64* %p to i64 %somebits = and i64 %x, %y %somebits_are_zero = icmp eq i64 %somebits, 0 %or = or i1 %somebits_are_zero, %isnull => %or = %somebits_are_zero Name: and_ne_zero %isnotnull = icmp ne i64* %p, null %x = ptrtoint i64* %p to i64 %somebits = and i64 %x, %y %somebits_are_not_zero = icmp ne i64 %somebits, 0 %and = and i1 %somebits_are_not_zero, %isnotnull => %and = %somebits_are_not_zero https://rise4fun.com/Alive/CQ3 llvm-svn: 322439	2018-01-13 15:44:44 +00:00
Sanjay Patel	6691e40980	[InstSimplify] add tests for implied ptr cmp with null (PR35790); NFC llvm-svn: 322411	2018-01-12 22:16:26 +00:00
Sanjay Patel	6ef6aa987c	[InstSimplify] fold implied cmp with zero (PR35790) This doesn't handle the more complicated case in the bug report yet: https://bugs.llvm.org/show_bug.cgi?id=35790 For that, we have to match / look through a cast. llvm-svn: 322327	2018-01-11 23:27:37 +00:00
Sanjay Patel	ac0edcb3f3	[InstSimplify] add tests for implied cmp with zero (PR35790); NFC llvm-svn: 322323	2018-01-11 22:48:07 +00:00
Dmitry Venikov	3d8cd34a5d	[InstSimplify] Missed optimization in math expression: squashing exp(log), log(exp) Summary: This patch enables folding following expressions under -ffast-math flag: exp(log(x)) -> x, exp2(log2(x)) -> x, log(exp(x)) -> x, log2(exp2(x)) -> x Reviewers: spatel, hfinkel, davide Reviewed By: spatel, hfinkel, davide Subscribers: scanon, llvm-commits Differential Revision: https://reviews.llvm.org/D41381 llvm-svn: 321710	2018-01-03 14:37:42 +00:00
Philip Reames	e499bc3042	[instsimplify] consistently handle undef and out of bound indices for insertelement and extractelement In one case, we were handling out of bounds, but not undef indices. In the other, we were handling undef (with the comment making the analogy to out of bounds), but not out of bounds. Be consistent and treat both undef and constant out of bounds indices as producing undefined results. As a side effect, this also protects instcombine from having to handle large constant indices as we always simplify first. llvm-svn: 321575	2017-12-30 05:54:22 +00:00
Philip Reames	3e9c671923	Move tests associated with transforms moved in r321467 llvm-svn: 321572	2017-12-30 03:13:00 +00:00

1 2 3 4 5 ...

615 Commits