llvm-project

Commit Graph

Author	SHA1	Message	Date
Roman Lebedev	be612ea471	[InstCombine] Fold "x ?% y ==/!= 0" to "x & (y-1) ==/!= 0" iff y is power-of-two Summary: I have stumbled into this by accident while preparing to extend backend `x s% C ==/!= 0` handling. While we did happen to handle this fold in most of the cases, the folding is indirect - we fold `x u% y` to `x & (y-1)` (iff `y` is power-of-two), or first turn `x s% -y` to `x u% y`; that does handle most of the cases. But we can't turn `x s% INT_MIN` to `x u% -INT_MIN`, and thus we end up being stuck with `(x s% INT_MIN) == 0`. There is no such restriction for the more general fold: https://rise4fun.com/Alive/IIeS To be noted, the fold does not enforce that `y` is a constant, so it may indeed increase instruction count. This is consistent with what `x u% y`->`x & (y-1)` already does. I think it makes sense, it's at most one (simple) extra instruction, while `rem`ainder is really much more un-simple (and likely very costly). Reviewers: spatel, RKSimon, nikic, xbolva00, craig.topper Reviewed By: RKSimon Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65046 llvm-svn: 367322	2019-07-30 15:28:22 +00:00
Roman Lebedev	8a431874e9	[NFC][InstCombine] Add a few extra srem-by-power-of-two tests - extra uses llvm-svn: 366652	2019-07-21 09:05:49 +00:00
Roman Lebedev	056640f8b3	[NFC][InstCombine] Add srem-by-signbit tests - still can fold to bittest https://rise4fun.com/Alive/IIeS llvm-svn: 366642	2019-07-20 21:33:50 +00:00
Eric Christopher	cee313d288	Revert "Temporarily Revert "Add basic loop fusion pass."" The reversion apparently deleted the test/Transforms directory. Will be re-reverting again. llvm-svn: 358552	2019-04-17 04:52:47 +00:00
Eric Christopher	a863435128	Temporarily Revert "Add basic loop fusion pass." As it's causing some bot failures (and per request from kbarton). This reverts commit r358543/ab70da07286e618016e78247e4a24fcb84077fda. llvm-svn: 358546	2019-04-17 02:12:23 +00:00
Sanjay Patel	3575f0c0b3	[InstCombine] fold urem with sext bool divisor Similar to other patches in this series: https://reviews.llvm.org/rL335512 https://reviews.llvm.org/rL335527 https://reviews.llvm.org/rL335597 https://reviews.llvm.org/rL335616 ...this is filling a gap in analysis that is exposed by an unrelated select-of-constants transform. I didn't see a way to unify the sext cases because each div/rem opcode results in a different fold. Note that in this case, the backend might want to convert the select into math: Name: sext urem %e = sext i1 %x to i32 %r = urem i32 %y, %e => %c = icmp eq i32 %y, -1 %z = zext i1 %c to i32 %r = add i32 %z, %y llvm-svn: 335622	2018-06-26 16:30:00 +00:00
Sanjay Patel	0f44759b0d	[InstCombine] add tests for urem with sext bool divisor; NFC llvm-svn: 335619	2018-06-26 16:01:24 +00:00
Sanjay Patel	1e911fa746	[InstSimplify] fold div/rem of zexted bool I was looking at an unrelated fold and noticed that we don't have this simplification (because the other fold would break existing tests). Name: zext udiv %z = zext i1 %x to i32 %r = udiv i32 %y, %z => %r = %y Name: zext urem %z = zext i1 %x to i32 %r = urem i32 %y, %z => %r = 0 Name: zext sdiv %z = zext i1 %x to i32 %r = sdiv i32 %y, %z => %r = %y Name: zext srem %z = zext i1 %x to i32 %r = srem i32 %y, %z => %r = 0 https://rise4fun.com/Alive/LZ9 llvm-svn: 335512	2018-06-25 18:51:21 +00:00
Roman Lebedev	cbf8446359	[InstCombine] PR37603: low bit mask canonicalization Summary: This is [[ https://bugs.llvm.org/show_bug.cgi?id=37603 \| PR37603 ]]. https://godbolt.org/g/VCMNpS https://rise4fun.com/Alive/idM When doing bit manipulations, it is quite common to calculate some bit mask, and apply it to some value via `and`. The typical C code looks like: ``` int mask_signed_add(int nbits) { return (1 << nbits) - 1; } ``` which is translated into (with `-O3`) ``` define dso_local i32 @mask_signed_add(int)(i32) local_unnamed_addr #0 { %2 = shl i32 1, %0 %3 = add nsw i32 %2, -1 ret i32 %3 } ``` But there is a second, less readable variant: ``` int mask_signed_xor(int nbits) { return ~(-(1 << nbits)); } ``` which is translated into (with `-O3`) ``` define dso_local i32 @mask_signed_xor(int)(i32) local_unnamed_addr #0 { %2 = shl i32 -1, %0 %3 = xor i32 %2, -1 ret i32 %3 } ``` Since we created such a mask, it is quite likely that we will use it in `and` next. And then we may get rid of `not` op by folding into `andn`. But now that i have actually looked: https://godbolt.org/g/VTUDmU _some_ backend changes will be needed too. We clearly loose `bzhi` recognition. Reviewers: spatel, craig.topper, RKSimon Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47428 llvm-svn: 334127	2018-06-06 19:38:27 +00:00
Sanjay Patel	e7b6654711	[InstCombine] refine select-of-constants to bitwise ops Add logic for the special case when a cmp+select can clearly be reduced to just a bitwise logic instruction, and remove an over-reaching chunk of general purpose bit magic. The primary goal is to remove cases where we are not improving the IR instruction count when doing these select transforms, and in all cases here that is true. In the motivating 3-way compare tests, there are further improvements because we can combine/propagate select values (not sure if that belongs in instcombine, but it's there for now). DAGCombiner has folds to turn some of these selects into bit magic, so there should be no difference in the end result in those cases. Not all constant combinations are handled there yet, however, so it is possible that some targets will see more cmov/csel codegen with this change in IR canonicalization. Ideally, we'll go further to not turn selects into multiple logic/math ops in instcombine, and we'll canonicalize to selects. But we should make sure that this step does not result in regressions first (and if it does, we should fix those in the backend). The general direction for this change was discussed here: http://lists.llvm.org/pipermail/llvm-dev/2016-September/105373.html http://lists.llvm.org/pipermail/llvm-dev/2017-July/114885.html Alive proofs for the new bit magic: https://rise4fun.com/Alive/XG7 Differential Revision: https://reviews.llvm.org/D46086 llvm-svn: 331486	2018-05-03 21:58:44 +00:00
Roman Lebedev	3d7b22621c	[NFC][InstCombine] rem.ll: add a few commutative tests. This closes a gap in missing test coverage in isKnownToBeAPowerOfTwo() from ValueTracking.cpp llvm-svn: 330975	2018-04-26 18:44:37 +00:00
Roman Lebedev	e117e1a440	[NFC][InstCombine] Regenerate rem.ll test llvm-svn: 330974	2018-04-26 18:44:32 +00:00
Sanjay Patel	d0b27a1156	[InstSimplify] move/add/regenerate checks for tests; NFC llvm-svn: 330515	2018-04-21 16:23:47 +00:00
Sanjay Patel	14adbacd8a	[InstCombine] fix miscompile of frem with 0.0 operand (PR34870) We might want to select NAN here or do this transform with fast-math, but this should at least fix the miscompile. llvm-svn: 321461	2017-12-26 22:12:20 +00:00
Sanjay Patel	546c43fd1a	[InstCombine] add test for frem with 0.0 (PR34870); NFC llvm-svn: 321460	2017-12-26 22:06:57 +00:00
Matt Arsenault	90e4f719e1	Fix some misc. -enable-var-scope violations llvm-svn: 318006	2017-11-13 01:47:52 +00:00
Craig Topper	1a18a7c51e	[InstCombine] Add support for vector srem->urem. llvm-svn: 300437	2017-04-17 01:51:24 +00:00
Craig Topper	b60f300afb	[InstCombine] Add missing testcases for srem->urem conversion. The vector version isn't currently supported. NFC llvm-svn: 300436	2017-04-17 01:51:21 +00:00
Craig Topper	18f9e424e7	[InstCombine] Support weird size element types in dyn_castNegVal. llvm-svn: 299915	2017-04-11 05:42:47 +00:00
Sanjay Patel	1c8c6a457d	[InstCombine] consolidate rem tests and update checks; NFC llvm-svn: 297747	2017-03-14 16:27:46 +00:00
Sanjay Patel	9deec85c34	[InstCombine] regenerate checks; NFC llvm-svn: 297746	2017-03-14 16:16:40 +00:00
David Majnemer	cb892e9066	[InstCombine] Move casts around shift operations It is possible to perform a left shift before zero extending if the shift would only shift out zeros. llvm-svn: 290928	2017-01-04 02:21:34 +00:00
Sanjay Patel	2e9675ff52	[InstCombine] use m_APInt to allow icmp eq (srem X, C1), C2 folds for splat constant vectors llvm-svn: 277638	2016-08-03 19:48:40 +00:00
Sanjay Patel	296a776a5b	add tests for icmp vector folds llvm-svn: 276464	2016-07-22 20:11:08 +00:00
Sanjoy Das	b7e861a488	Add safety check to InstCombiner::commonIRemTransforms Since FoldOpIntoPhi speculates the binary operation to potentially each of the predecessors of the PHI node (pulling it out of arbitrary control dependence in the process), we can FoldOpIntoPhi only if we know the operation doesn't have UB. This also brings up an interesting profitability question -- the way it is written today, commonIRemTransforms will hoist out work from dynamically dead code into code that will execute at runtime. Perhaps that isn't the best canonicalization? Fixes PR27968. llvm-svn: 271857	2016-06-05 21:17:04 +00:00
Sanjoy Das	0dcd1d859c	Add test case for InstCombiner::commonIRemTransforms; NFC The PHI case in commonIRemTransforms was untested; add a trivial test case. llvm-svn: 271856	2016-06-05 21:17:00 +00:00
Sanjay Patel	e2e89ef936	[ValueTracking, InstCombine] extend isKnownToBeAPowerOfTwo() to handle vector splat constants We could try harder to handle non-splat vector constants too, but that seems much rarer to me. Note that the div test isn't resolved because there's a check for isIntegerTy() guarding that transform. Differential Revision: http://reviews.llvm.org/D20497 llvm-svn: 270369	2016-05-22 15:41:53 +00:00
Sanjay Patel	54acedf88f	add tests for vector urem llvm-svn: 270271	2016-05-20 20:55:17 +00:00
Rafael Espindola	d1a2c2d905	Add back commit r210029. The code was actually correct. Sorry for the confusion. I have expanded the comment saying why the analysis is valid to avoid me misunderstaning it again in the future. llvm-svn: 210052	2014-06-02 22:01:04 +00:00
Rafael Espindola	582c890fbe	Revert "Add the nsw flag when we detect that an add will not signed overflow." This reverts commit r210029. It was not correctly handling cases where LHS and RHS had multiple but different sign bits. llvm-svn: 210048	2014-06-02 21:12:19 +00:00
Rafael Espindola	82899febf0	Add the nsw flag when we detect that an add will not signed overflow. We already had a function for checking this, we were just using it only in specialized cases. llvm-svn: 210029	2014-06-02 14:32:58 +00:00
Benjamin Kramer	72196f3ae5	InstCombine: Teach most integer add/sub/mul/div combines how to deal with vectors. llvm-svn: 199602	2014-01-19 15:24:22 +00:00
David Majnemer	b7d5409ad2	isKnownToBeAPowerOfTwo: Strengthen isKnownToBeAPowerOfTwo's analysis on add instructions Call into ComputeMaskedBits to figure out which bits are set on both add operands and determine if the value is a power-of-two-or-zero or not. llvm-svn: 187445	2013-07-30 21:01:36 +00:00
Stephen Lin	c1c7a1309c	Update Transforms tests to use CHECK-LABEL for easier debugging. No functionality change. This update was done with the following bash script: find test/Transforms -name ".ll" \| \ while read NAME; do echo "$NAME" if ! grep -q "^; RUN: llc" $NAME; then TEMP=`mktemp -t temp` cp $NAME $TEMP sed -n "s/^define [^@]@$[A-Za-z0-9_]$(.$/\1/p" < $NAME \| \ while read FUNC; do sed -i '' "s/;$.$$[A-Za-z0-9_]$:$ $@$FUNC$[( ]$\$/;\1\2-LABEL:\3@$FUNC(/g" $TEMP done mv $TEMP $NAME fi done llvm-svn: 186268	2013-07-14 01:42:54 +00:00
Nick Lewycky	7459be6dc7	Add a microoptimization for urem. llvm-svn: 186235	2013-07-13 01:16:47 +00:00
David Majnemer	a92b3c914e	ValueTracking: Fix bugs in isKnownToBeAPowerOfTwo (add nsw x, (and x, y)) isn't a power of two if x is zero, it's zero (add nsw x, (xor x, y)) isn't a power of two if y has bits set that aren't set in x llvm-svn: 185954	2013-07-09 18:11:10 +00:00
David Majnemer	7a69d2c06a	ValueTracking: Teach isKnownToBeAPowerOfTwo about (ADD X, (XOR X, Y)) where X is a power of two This allows us to simplify urem instructions involving the add+xor to turn into simpler math. llvm-svn: 185272	2013-06-29 23:44:53 +00:00
David Majnemer	beab5678a3	isKnownToBeAPowerOfTwo: (X & Y) + Y is a power of 2 or zero if y is also. This is useful if something that looks like (x & (1 << y)) ? 64 : 32 is the divisor in a modulo operation. llvm-svn: 182200	2013-05-18 19:30:37 +00:00
David Majnemer	6c30f49af3	InstCombine: Flip the order of two urem transforms There are two transforms in visitUrem that conflict with each other. ) One, if a divisor is a power of two, subtracts one from the divisor and turns it into a bitwise-and. ) The other unwraps both operands if they are surrounded by zext instructions. Flipping the order allows the subtraction to go beneath the sign extension. llvm-svn: 181668	2013-05-12 00:07:05 +00:00
David Majnemer	470b077bca	InstCombine: Turn urem to bitwise-and more often Use isKnownToBeAPowerOfTwo in visitUrem so that we may more aggressively fold away urem instructions. llvm-svn: 181661	2013-05-11 09:01:28 +00:00
Benjamin Kramer	07726c7d52	InstCombine: Add a missing irem identity (X % X -> 0). llvm-svn: 119538	2010-11-17 19:11:46 +00:00
Dan Gohman	72a13d2476	Use opt -S instead of piping bitcode output through llvm-dis. llvm-svn: 81257	2009-09-08 22:34:10 +00:00
Dan Gohman	9737a63ed8	Change these tests to feed the assembly files to opt directly, instead of using llvm-as, now that opt supports this. llvm-svn: 81226	2009-09-08 16:50:01 +00:00
Nick Lewycky	d0b62a1552	Don't try to simplify urem and srem using arithmetic rules that don't work under modulo (overflow). Fixes PR1933. llvm-svn: 47987	2008-03-06 06:48:30 +00:00
Tanya Lattner	5640bd186a	Remove llvm-upgrade and update test cases. llvm-svn: 47793	2008-03-01 09:15:35 +00:00
Reid Spencer	ede8c3b92c	For PR1319: Make use of the END. facility on all files > 1K so that we aren't wasting CPU cycles searching for RUN: lines that we'll never find. llvm-svn: 36059	2007-04-15 07:38:21 +00:00
Reid Spencer	91948d4cad	For PR1319: Upgrade tests to work with new llvm.exp version of llvm_runtest. llvm-svn: 36013	2007-04-14 20:13:02 +00:00
Reid Spencer	af6a408117	For PR411: Update these tests to not use the same name even though the type of the value differs. After PR411 hits, type planes will be gone and it will be illegal for a name to be used twice, regardless of type. llvm-svn: 33660	2007-01-30 16:16:01 +00:00
Reid Spencer	83b3d82672	Regression is gone, don't try to find it on clean target. llvm-svn: 33296	2007-01-17 07:59:14 +00:00

49 Commits