llvm-project

Commit Graph

Author	SHA1	Message	Date
Philip Reames	cdb72f369f	Introduce a 'nonnull' metadata on Load instructions. The newly introduced 'nonnull' metadata is analogous to existing 'nonnull' attributes, but applies to load instructions rather than call arguments or returns. Long term, it would be nice to combine these into a single construct. The value of the load is allowed to vary between successive loads, but null is not a valid value to be loaded by any load marked nonnull. Reviewed by: Hal Finkel Differential Revision: http://reviews.llvm.org/D5220 llvm-svn: 220240	2014-10-20 22:40:55 +00:00
Hal Finkel	171c2ec008	Revert "r216914 - Revert: [APFloat] Fixed a bug in method 'fusedMultiplyAdd'" Reapply r216913, a fix for PR20832 by Andrea Di Biagio. The commit was reverted because of buildbot failures, and credit goes to Ulrich Weigand for isolating the underlying issue (which can be confirmed by Valgrind, which does helpfully light up like the fourth of July). Uli explained the problem with the original patch as: It seems the problem is calling multiplySignificand with an addend of category fcZero; that is not expected by this routine. Note that for fcZero, the significand parts are simply uninitialized, but the code in (or rather, called from) multiplySignificand will unconditionally access them -- in effect using uninitialized contents. This version avoids using a category == fcZero addend within multiplySignificand, which avoids this problem (the Valgrind output is also now clean). Original commit message: [APFloat] Fixed a bug in method 'fusedMultiplyAdd'. When folding a fused multiply-add builtin call, make sure that we propagate the correct result in the case where the addend is zero, and the two other operands are finite non-zero. Example: define double @test() { %1 = call double @llvm.fma.f64(double 7.0, double 8.0, double 0.0) ret double %1 } Before this patch, the instruction simplifier wrongly folded the builtin call in function @test to constant 'double 7.0'. With this patch, method 'fusedMultiplyAdd' correctly evaluates the multiply and propagates the expected result (i.e. 56.0). Added test fold-builtin-fma.ll with the reproducible from PR20832 plus extra test cases to verify the behavior of method 'fusedMultiplyAdd' in the presence of NaN/Inf operands. This fixes PR20832. llvm-svn: 219708	2014-10-14 19:23:07 +00:00
David Majnemer	cb9d596655	InstCombine, InstSimplify: (%X /s C1) /s C2 isn't always 0 when C1 * C2 overflow consider: C1 = INT_MIN C2 = -1 C1 * C2 overflows without a doubt but consider the following: %x = i32 INT_MIN This means that (%X /s C1) is 1 and (%X /s C1) /s C2 is -1. N. B. Move the unsigned version of this transform to InstSimplify, it doesn't create any new instructions. This fixes PR21243. llvm-svn: 219567	2014-10-11 10:20:01 +00:00
David Majnemer	b435a4214e	InstSimplify: Don't allow (x srem y) urem y -> x srem y Let's consider the case where: %x i16 = 32768 %y i16 = 384 %x srem %y = 65408 (%x srem %y) urem %y = 128 llvm-svn: 217939	2014-09-17 04:16:35 +00:00
David Majnemer	ac717f0972	InstSimplify: ((X % Y) % Y) -> (X % Y) Patch by Sonam Kumari! Differential Revision: http://reviews.llvm.org/D5350 llvm-svn: 217937	2014-09-17 03:34:34 +00:00
David Majnemer	a315bd80c2	InstSimplify: Simplify trivial and/or of icmps Some ICmpInsts when anded/ored with another ICmpInst trivially reduces to true or false depending on whether or not all integers or no integers satisfy the intersected/unioned range. This sort of trivial looking code can come about when InstCombine performs a range reduction-type operation on sdiv and the like. This fixes PR20916. llvm-svn: 217750	2014-09-15 08:15:28 +00:00
Andrea Di Biagio	b9de900788	Revert: [APFloat] Fixed a bug in method 'fusedMultiplyAdd'. This reverts revision 216913; the new test added at revision 216913 caused regression failures on a couple of buildbots. llvm-svn: 216914	2014-09-02 17:22:49 +00:00
Andrea Di Biagio	7676fe1878	[APFloat] Fixed a bug in method 'fusedMultiplyAdd'. When folding a fused multiply-add builtin call, make sure that we propagate the correct result in the case where the addend is zero, and the two other operands are finite non-zero. Example: define double @test() { %1 = call double @llvm.fma.f64(double 7.0, double 8.0, double 0.0) ret double %1 } Before this patch, the instruction simplifier wrongly folded the builtin call in function @test to constant 'double 7.0'. With this patch, method 'fusedMultiplyAdd' correctly evaluates the multiply and propagates the expected result (i.e. 56.0). Added test fold-builtin-fma.ll with the reproducible from PR20832 plus extra test cases to verify the behavior of method 'fusedMultiplyAdd' in the presence of NaN/Inf operands. This fixes PR20832. Differential Revision: http://reviews.llvm.org/D5152 llvm-svn: 216913	2014-09-02 16:44:56 +00:00
David Majnemer	76d06bc613	InstSimplify: Move a transform from InstCombine to InstSimplify Several combines involving icmp (shl C2, %X) C1 can be simplified without introducing any new instructions. Move them to InstSimplify; while we are at it, make them more powerful. llvm-svn: 216642	2014-08-28 03:34:28 +00:00
David Majnemer	11ca2971e8	InstSimplify: Don't simplify gep X, (Y-X) to Y if types differ It's incorrect to perform this simplification if the types differ. A bitcast would need to be inserted for this to work. This fixes PR20771. llvm-svn: 216597	2014-08-27 20:08:34 +00:00
Nico Weber	48c82400ed	Reland r216439 r216441, majnemer has a real fix for PR20771. llvm-svn: 216586	2014-08-27 20:06:19 +00:00
Nico Weber	7b343e3cc6	Revert r216439 (and r216441, else the former doesn't revert cleanly). It caused PR 20771. I'll land a test on the clang side. llvm-svn: 216582	2014-08-27 20:00:13 +00:00
David Majnemer	d6d1671c1e	InstSimplify: Compute comparison ranges for left shift instructions 'shl nuw CI, x' produces [CI, CI << CLZ(CI)] 'shl nsw CI, x' produces [CI << CLO(CI)-1, CI] if CI is negative 'shl nsw CI, x' produces [CI, CI << CLZ(CI)-1] if CI is non-negative llvm-svn: 216570	2014-08-27 18:03:46 +00:00
David Majnemer	788d0ab8c8	InstSimplify: Fold gep X, (sub 0, ptrtoint(X)) to null Save InstCombine some work if we can perform this fold during InstSimplify. llvm-svn: 216441	2014-08-26 07:08:03 +00:00
David Majnemer	bc4981323f	InstSimplify: Simplify trivial pointer expressions like b + (e - b) consider: long long f(long long b, long long e) { return b + (e - b); } we would lower this to something like: define i64 @f(i64* %b, i64* %e) { %1 = ptrtoint i64* %e to i64 %2 = ptrtoint i64* %b to i64 %3 = sub i64 %1, %2 %4 = ashr exact i64 %3, 3 %5 = getelementptr inbounds i64* %b, i64 %4 ret i64* %5 } This should fold away to just 'e'. N.B. This adds m_SpecificInt as a convenient way to match against a particular 64-bit integer when using LLVM's match interface. llvm-svn: 216439	2014-08-26 05:55:16 +00:00
David Majnemer	97ddca3224	ValueTracking: Figure out more bits when looking at add/sub Given something like X01XX + X01XX, we know that the result must look like X1XXX. Adapted from a patch by Richard Smith, test-case written by me. llvm-svn: 216250	2014-08-22 00:40:43 +00:00
David Majnemer	cd4fbcd1bb	InstSimplify: Simplify (X - (0 - Y)) if the second sub is NUW If the NUW bit is set for 0 - Y, we know that all values for Y other than 0 would produce a poison value. This allows us to replace (0 - Y) with 0 in the expression (X - (0 - Y)) which will ultimately leave us with X. This partially fixes PR20189. llvm-svn: 214384	2014-07-31 04:49:18 +00:00
Hal Finkel	930469107d	Add @llvm.assume, lowering, and some basic properties This is the first commit in a series that add an @llvm.assume intrinsic which can be used to provide the optimizer with a condition it may assume to be true (when the control flow would hit the intrinsic call). Some basic properties are added here: - llvm.invariant(true) is dead. - llvm.invariant(false) is unreachable (this directly corresponds to the documented behavior of MSVC's __assume(0)), so is llvm.invariant(undef). The intrinsic is tagged as writing arbitrarily, in order to maintain control dependencies. BasicAA has been updated, however, to return NoModRef for any particular location-based query so that we don't unnecessarily block code motion. llvm-svn: 213973	2014-07-25 21:13:35 +00:00
Hal Finkel	b0407ba071	Add a dereferenceable attribute This attribute indicates that the parameter or return pointer is dereferenceable. Practically speaking, loads from such a pointer within the associated byte range are safe to speculatively execute. Such pointer parameters are common in source languages (C++ references, for example). llvm-svn: 213385	2014-07-18 15:51:28 +00:00
Suyog Sarda	68862414b5	Move ashr optimization from InstCombineShift to InstSimplify. Refactor code, no functionality change, test case moved from instcombine to instsimplify. Differential Revision: http://reviews.llvm.org/D4102 llvm-svn: 213231	2014-07-17 06:28:15 +00:00
David Majnemer	b8f435ca70	Fix a test broken in r212981 @icmp_sdiv_neg1 should have referred to %a instead of %call, it was renamed at the last second. llvm-svn: 212983	2014-07-14 20:46:04 +00:00
David Majnemer	af9180fd04	InstSimplify: Correct sdiv x / -1 Determining the bounds of x / -1 would start off with us dividing it by INT_MIN. Suffice to say, this would not work very well. Instead, handle it upfront by checking for -1 and mapping it to the range: [INT_MIN + 1, INT_MAX]. This means that the result of our division can be any value other than INT_MIN. llvm-svn: 212981	2014-07-14 20:38:45 +00:00
David Majnemer	5ea4fc0b33	InstSimplify: The upper bound of X / C was missing a rounding step Summary: When calculating the upper bound of X / -8589934592, we would perform the following calculation: Floor[INT_MAX / 8589934592] However, flooring the result would make us wrongly come to the conclusion that 1073741824 was not in the set of possible values. Instead, use the ceiling of the result. Reviewers: nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4502 llvm-svn: 212976	2014-07-14 19:49:57 +00:00
David Majnemer	651ed5e8fd	InstSimplify: Fix a bug when INT_MIN is in a sdiv When INT_MIN is the numerator in a sdiv, we would not properly handle overflow when calculating the bounds of possible values; abs(INT_MIN) is not a meaningful number. Instead, check and handle INT_MIN by reasoning that the largest value is INT_MIN/-2 and the smallest value is INT_MIN. This fixes PR20199. llvm-svn: 212307	2014-07-04 00:23:39 +00:00
Dinesh Dwivedi	99281a0615	This patch removed duplicate code for matching patterns which are now handled in SimplifyUsingDistributiveLaws() (after r211261) Differential Revision: http://reviews.llvm.org/D4253 llvm-svn: 211768	2014-06-26 08:57:33 +00:00
Nick Lewycky	8561a49c27	Move optimization of some cases of (A & C1)|(B & C2) from instcombine to instsimplify. Patch by Rahul Jain, plus some last minute changes by me -- you can blame me for any bugs. llvm-svn: 211252	2014-06-19 03:51:46 +00:00
Nick Lewycky	c961030ac2	Make instsimplify's analysis of icmp eq/ne use computeKnownBits to determine whether the icmp is always true or false. Patch by Suyog Sarda! llvm-svn: 211251	2014-06-19 03:35:49 +00:00
Nick Lewycky	ec373545b8	Teach isKnownNonNull that a nonnull return is not null. Add a test for this case as well as the case of a nonnull attribute (already handled but not tested). llvm-svn: 209193	2014-05-20 05:13:21 +00:00
David Majnemer	78910fc4da	InstSimplify: Improve handling of ashr/lshr Summary: Analyze the range of values produced by ashr/lshr cst, %V when it is being used in an icmp. Reviewers: nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3774 llvm-svn: 209000	2014-05-16 17:14:03 +00:00
David Majnemer	ea8d5dbf24	InstSimplify: Optimize using dividend in sdiv Summary: The dividend in an sdiv tells us the largest and smallest possible results. Use this fact to optimize comparisons against an sdiv with a constant dividend. Reviewers: nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3795 llvm-svn: 208999	2014-05-16 16:57:04 +00:00
David Majnemer	2d6c023576	InstSimplify: Optimize signed icmp of -(zext V) Summary: We know that -(zext V) will always be <= zero, simplify signed icmps that have these. Uncovered using http://www.cs.utah.edu/~regehr/souper/ Reviewers: nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3754 llvm-svn: 208809	2014-05-14 20:16:28 +00:00
Gerolf Hoflehner	af7a87d2e3	RecursivelyDeleteTriviallyDeadInstructions() could remove more than 1 instruction. The caller needs to be aware of this and adjust instruction iterators accordingly. rdar://16679376 Repaired r207302. llvm-svn: 207309	2014-04-26 05:58:11 +00:00
Gerolf Hoflehner	c46e9b0423	Revert commit r207302 since build failures have been reported. llvm-svn: 207303	2014-04-26 02:03:17 +00:00
Gerolf Hoflehner	34210108b3	RecursivelyDeleteTriviallyDeadInstructions() could remove more than 1 instruction. The caller needs to be aware of this and adjust instruction iterators accordingly. rdar://16679376 llvm-svn: 207302	2014-04-26 01:19:16 +00:00
Benjamin Kramer	5e1794eedb	InstSimplify: Make shift, select and GEP simplifications vector-aware. llvm-svn: 200016	2014-01-24 17:09:53 +00:00
Benjamin Kramer	d75c8ebdd1	Add a test that large offsets on GEPs on 32 bits targets are handled correctly. llvm-svn: 191628	2013-09-28 21:27:49 +00:00
Benjamin Kramer	d59bf255d5	Verify that we don't optimize null return checks to the nothrow_t version of operator new. llvm-svn: 191325	2013-09-24 18:37:49 +00:00
Benjamin Kramer	4d4df04353	MemoryBuiltins: Fix operator new bits. We really don't want to optimize malloc return value checks away. llvm-svn: 191313	2013-09-24 17:15:14 +00:00
Benjamin Kramer	fd4777c046	Teach MemoryBuiltins and InstructionSimplify that operator new never returns NULL. This is safe per C++11 18.6.1.1p3: [operator new returns] a non-null pointer to suitably aligned storage (3.7.4), or else throw a bad_alloc exception. This requirement is binding on a replacement version of this function. Brings us a tiny bit closer to eliminating more vector push_backs. llvm-svn: 191310	2013-09-24 16:37:51 +00:00
Benjamin Kramer	b517194f33	Expand test case a bit. llvm-svn: 191205	2013-09-23 14:41:35 +00:00
Benjamin Kramer	942dfe625b	InstSimplify: Fold equality comparisons between non-inbounds GEPs. Overflow doesn't affect the correctness of equalities. Computing this is cheap, we just reuse the computation for the inbounds case and try to peel off more non-inbounds GEPs. This pattern is unlikely to ever appear in code generated by Clang, but SCEV occasionally produces it. llvm-svn: 191200	2013-09-23 14:16:38 +00:00
Daniel Dunbar	9efbedfd35	[tests] Cleanup initialization of test suffixes. - Instead of setting the suffixes in a bunch of places, just set one master list in the top-level config. We now only modify the suffix list in a few suites that have one particular unique suffix (.ml, .mc, .yaml, .td, .py). - Aside from removing the need for a bunch of lit.local.cfg files, this enables 4 tests that were inadvertently being skipped (one in Transforms/BranchFolding, a .s file each in DebugInfo/AArch64 and CodeGen/PowerPC, and one in CodeGen/SI which is now failing and has been XFAILED). - This commit also fixes a bunch of config files to use config.root instead of older copy-pasted code. llvm-svn: 188513	2013-08-16 00:37:11 +00:00
Stephen Lin	c1c7a1309c	Update Transforms tests to use CHECK-LABEL for easier debugging. No functionality change. This update was done with the following bash script: find test/Transforms -name "*.ll" | \ while read NAME; do echo "$NAME" if ! grep -q "^; *RUN: *llc" $NAME; then TEMP=`mktemp -t temp` cp $NAME $TEMP sed -n "s/^define [^@]*@\([A-Za-z0-9_]*\)(.*$/\1/p" < $NAME | \ while read FUNC; do sed -i '' "s/;\(.*\)\([A-Za-z0-9_]*\):\( *\)@$FUNC\([( ]*\)\$/;\1\2-LABEL:\3@$FUNC(/g" $TEMP done mv $TEMP $NAME fi done llvm-svn: 186268	2013-07-14 01:42:54 +00:00
Nick Lewycky	35aeea993b	Fix logic error optimizing "icmp pred (urem X, Y), Y" where pred is signed. Fixes PR16605. llvm-svn: 186229	2013-07-12 23:42:57 +00:00
David Majnemer	a80fed7e58	InstSimplify: X >> X -> 0 llvm-svn: 185973	2013-07-09 22:01:22 +00:00
Benjamin Kramer	ec1bb4fdaf	ConstantFolding: ComputeMaskedBits wants the scalar size for vectors. Fixes PR15791. llvm-svn: 179859	2013-04-19 16:56:24 +00:00
Benjamin Kramer	b50682e156	Add missing colons to check lines. llvm-svn: 179277	2013-04-11 12:41:41 +00:00
Manman Ren	1217112d11	Check whether a pointer is non-null (isKnownNonNull) in isKnownNonZero. This handles the case where we have an inbounds GEP with alloca as the pointer. This fixes the regression in PR12750 and rdar://13286434. Note that we can also fix this by handling some GEP cases in isKnownNonNull. llvm-svn: 177321	2013-03-18 21:23:25 +00:00
Chad Rosier	22d275f7b8	[SimplifyLibCalls] Library call simplification doesn't work if the call site isn't using the default calling convention. However, if the transformation is from a call to inline IR, then the calling convention doesn't matter. rdar://13157990 llvm-svn: 174724	2013-02-08 18:00:14 +00:00
Michael Ilseman	5485729b9a	Identify and simplify idempotent intrinsics. Test case included. llvm-svn: 174650	2013-02-07 19:26:05 +00:00
Benjamin Kramer	a5a9ec5755	ConstantFolding: Fix a crash when encountering a truncating inttoptr. This was introduced in r173293. llvm-svn: 174424	2013-02-05 19:04:36 +00:00
Dan Gohman	9ee4bc1abc	Add a testcase for some past-the-end address subtleties. llvm-svn: 174210	2013-02-01 19:37:52 +00:00
Benjamin Kramer	c05aa958b1	InstSimplify: stripAndComputeConstantOffsets can be called with vectors of pointers too. Prepare it for vectors of pointers and handle simple cases. We don't handle complicated cases because accumulateConstantOffset bails on pointer vectors. Fixes selfhost on i386. llvm-svn: 174179	2013-02-01 15:21:10 +00:00
Dan Gohman	b3e2d3a638	Rewrite instsimplify's handling of icmp on pointer values to remove the remaining use of AliasAnalysis concepts such as isIdentifiedObject to prove pointer inequality. @external_compare in test/Transforms/InstSimplify/compare.ll shows a simple case where a noalias argument can be equal to a global variable address, and while AliasAnalysis can get away with saying that these pointers don't alias, instsimplify cannot say that they are not equal. llvm-svn: 174122	2013-02-01 00:11:13 +00:00
Dan Gohman	995d40e1e2	An alloca can be equal to an argument. It can't alias an alloca, but it could be equal, since there's nothing preventing a caller from correctly predicting the stack location of an alloca. llvm-svn: 174119	2013-01-31 23:49:33 +00:00
Dmitri Gribenko	56bf2e1830	Tests: rewrite 'opt ... %s' to 'opt ... < %s' so that opt does not emit a ModuleID This is done to avoid odd test failures, like the one fixed in r171243. llvm-svn: 171250	2012-12-30 02:33:22 +00:00
Chandler Carruth	f6182155f6	Teach instsimplify to use the constant folder where appropriate for constant folding calls. Add the initial tests for this which show that now instsimplify can simplify blindingly obvious code patterns expressed with both intrinsics and library calls. llvm-svn: 171194	2012-12-28 14:23:29 +00:00
Michael Ilseman	bb6f691b01	Added a slew of SimplifyInstruction floating-point optimizations, many of which take advantage of fast-math flags. Test cases included. fsub X, +0 ==> X fsub X, -0 ==> X, when we know X is not -0 fsub +/-0.0, (fsub -0.0, X) ==> X fsub nsz +/-0.0, (fsub +/-0.0, X) ==> X fsub nnan ninf X, X ==> 0.0 fadd nsz X, 0 ==> X fadd [nnan ninf] X, (fsub [nnan ninf] 0, X) ==> 0 where nnan and ninf have to occur at least once somewhere in this expression fmul X, 1.0 ==> X llvm-svn: 169940	2012-12-12 00:27:46 +00:00
Chandler Carruth	80d3e56c73	Add support to ValueTracking for determining that a pointer is non-null by virtue of inbounds GEPs that preclude a null pointer. This is a very common pattern in the code generated by std::vector and other standard library routines which use allocators that test for null pervasively. This is one step closer to teaching Clang+LLVM to be able to produce an empty function for: void f() { std::vector<int> v; v.push_back(1); v.push_back(2); v.push_back(3); v.push_back(4); } Which is related to getting them to completely fold SmallVector push_back sequences into constants when inlining and other optimizations make that a possibility. llvm-svn: 169573	2012-12-07 02:08:58 +00:00
Michael Ilseman	6cdacff2d0	Fast-math test for SimplifyInstruction: fold multiply by 0 Applied the patch, rather than committing it. llvm-svn: 168656	2012-11-27 01:00:22 +00:00
Duncan Sands	c41076c07c	InstructionSimplify should be able to simplify A+B==B+A to 'true' but wasn't due to the same logic bug that caused PR14361. llvm-svn: 168186	2012-11-16 19:41:26 +00:00
Chad Rosier	e27081d348	Revert r153521 as it's causing large regressions on the nightly testers. Original commit message for r153521 (aka r153423): Use the new range metadata in computeMaskedBits and add a new optimization to instruction simplify that lets us remove an and when loading a boolean value. llvm-svn: 153587	2012-03-28 18:42:50 +00:00
Chad Rosier	8e6dbccd03	Reapply r153423; the original commit was fine. The failing test, distray, had undefined behavior, which Rafael was kind enough to fix. Original commit message for r153423: Use the new range metadata in computeMaskedBits and add a new optimization to instruction simplify that lets us remove an and when loading a boolean value. llvm-svn: 153521	2012-03-27 17:44:52 +00:00
Chad Rosier	08e57e5ccf	Revert r153423 as this is causing failures on our internal nightly testers. Original commit message: Use the new range metadata in computeMaskedBits and add a new optimization to instruction simplify that lets us remove an and when loading a boolean value. llvm-svn: 153452	2012-03-26 18:07:14 +00:00
Rafael Espindola	df9b4adb82	Use the new range metadata in computeMaskedBits and add a new optimization to instruction simplify that lets us remove an and when loading a boolean value. llvm-svn: 153423	2012-03-26 01:44:11 +00:00
Chandler Carruth	8059c84af1	Teach instsimplify how to simplify comparisons of pointers which are constant-offsets of a common base using the generic GEP-walking logic I added for computing pointer differences in the same situation. llvm-svn: 153419	2012-03-25 21:28:14 +00:00
Chandler Carruth	2741aae80b	Switch the pointer-difference simplification logic to only work with inbounds GEPs. This isn't really necessary for simplifying pointer differences, but I'm planning to re-use the same code to simplify pointer comparisons where it is necessary. Since real code almost exclusively uses inbounds GEPs, it doesn't seem worth it to support the extra complexity of turning it on and off. If anyone would like that back, feel free to shout. Note that instcombine will still catch any of these patterns. llvm-svn: 153418	2012-03-25 20:43:07 +00:00
Duncan Sands	395ac42dd2	Generalize the "trunc(ptrtoint(x)) - trunc(ptrtoint(y)) -> trunc(ptrtoint(x-y))" optimization introduced by Chandler. llvm-svn: 152626	2012-03-13 14:07:05 +00:00
Chandler Carruth	a0796555e2	Teach instsimplify how to constant fold pointer differences. Typically instcombine has handled this, but pointer differences show up in several contexts where we would like to get constant folding, and cannot afford to run instcombine. Specifically, I'm working on improving the constant folding of arguments used in inline cost analysis with instsimplify. Doing this in instsimplify implies some algorithm changes. We have to handle multiple layers of all-constant GEPs because instsimplify cannot fold them into a single GEP the way instcombine can. Also, we're only interested in all-constant GEPs. The result is that this doesn't really replace the instcombine logic, it's just complementary and focused on constant folding. Reviewed on IRC by Benjamin Kramer. llvm-svn: 152555	2012-03-12 11:19:31 +00:00
Eli Friedman	a8b75ac798	Make sure we don't return bits outside the mask in ComputeMaskedBits. PR12189. llvm-svn: 152066	2012-03-05 23:09:40 +00:00
Nick Lewycky	3db143ea8c	Reinstate the optimization from r151449 with a fix to not turn 'gep %x' into 'gep null' when the icmp predicate is unsigned (or is signed without inbounds). llvm-svn: 151467	2012-02-26 02:09:49 +00:00
Nick Lewycky	7bbd72da46	Roll these back to r151448 until I figure out how they're breaking MultiSource/Applications/lua. llvm-svn: 151463	2012-02-25 23:01:19 +00:00
Nick Lewycky	eeeffbb497	An argument and a local identified object (eg. a noalias call) could turn out equal if both are null. In the test, scope type %t and global @y by adding a 'gep' prefix to them. llvm-svn: 151452	2012-02-25 20:19:07 +00:00
Nick Lewycky	51f2be8bff	Teach instsimplify to be more aggressive when analyzing comparisons of pointers by using llvm::isIdentifiedObject. Also teach it to handle GEPs that have the same base pointer and constant operands. Fixes PR11238! llvm-svn: 151449	2012-02-25 19:07:42 +00:00
Chris Lattner	01990f0e1c	fix PR12075, a regression in a recent transform I added. In unreachable code, gep chains can be infinite. Just like "stripPointerCasts", use a set to keep track of visited instructions so we don't recurse infinitely. llvm-svn: 151383	2012-02-24 19:01:58 +00:00
Chris Lattner	445d8c6b50	fold comparisons of gep'd alloca points with null to false, implementing PR12013. We now compile the testcase to: __Z4testv: ## @_Z4testv ## BB#0: ## %_ZN4llvm15SmallVectorImplIiE9push_backERKi.exit pushq %rbx subq $64, %rsp leaq 32(%rsp), %rbx movq %rbx, (%rsp) leaq 64(%rsp), %rax movq %rax, 16(%rsp) movl $1, 32(%rsp) leaq 36(%rsp), %rax movq %rax, 8(%rsp) leaq (%rsp), %rdi callq __Z1gRN4llvm11SmallVectorIiLj8EEE movq (%rsp), %rdi cmpq %rbx, %rdi je LBB0_2 ## BB#1: callq _free LBB0_2: ## %_ZN4llvm11SmallVectorIiLj8EED1Ev.exit addq $64, %rsp popq %rbx ret instead of: __Z4testv: ## @_Z4testv ## BB#0: pushq %rbx subq $64, %rsp xorl %eax, %eax leaq (%rsp), %rbx addq $32, %rbx movq %rbx, (%rsp) movq %rbx, 8(%rsp) leaq 64(%rsp), %rcx movq %rcx, 16(%rsp) je LBB0_2 ## BB#1: movl $1, 32(%rsp) movq %rbx, %rax LBB0_2: ## %_ZN4llvm15SmallVectorImplIiE9push_backERKi.exit addq $4, %rax movq %rax, 8(%rsp) leaq (%rsp), %rdi callq __Z1gRN4llvm11SmallVectorIiLj8EEE movq (%rsp), %rdi cmpq %rbx, %rdi je LBB0_4 ## BB#3: callq _free LBB0_4: ## %_ZN4llvm11SmallVectorIiLj8EED1Ev.exit addq $64, %rsp popq %rbx ret This doesn't shrink clang noticeably though. llvm-svn: 150944	2012-02-20 00:42:49 +00:00
Eli Friedman	952d1f9f40	Fix a rather nasty regression from r150690: LHS != RHS does not imply LHS->stripPointerCasts() != RHS->stripPointerCasts(). llvm-svn: 150863	2012-02-18 03:29:25 +00:00
Benjamin Kramer	ea51f62e4b	InstSimplify: Ignore pointer casts when constant folding compares between pointers. llvm-svn: 150690	2012-02-16 13:49:39 +00:00
Eli Bendersky	924f9a671d	Replace all instances of dg.exp file with lit.local.cfg, since all tests are run with LIT now and not Dejagnu. dg.exp is no longer needed. Patch reviewed by Daniel Dunbar. It will be followed by additional cleanup patches. llvm-svn: 150664	2012-02-16 06:28:33 +00:00
Duncan Sands	26641d7c02	Fix PR11948: the result type of an icmp may be a vector of boolean - don't assume it is a boolean. llvm-svn: 150247	2012-02-10 14:31:24 +00:00
Duncan Sands	bf48ac622a	Revert commit 149912 (lattner) and add a testcase that shows the problem (which is that patterns no longer match for vectors of booleans, because you only get ConstantDataVector when the vector element type is i8, i16, etc, not when it is i1). Original commit message: Remove some dead code and tidy things up now that vectors use ConstantDataVector instead of always using ConstantVector. llvm-svn: 150246	2012-02-10 14:26:42 +00:00
Nadav Rotem	3924cb0267	Add support for vectors of pointers. llvm-svn: 145801	2011-12-05 06:29:09 +00:00
Duncan Sands	81a2af12d6	Fix a crash in which a multiplication was being reported as being both negative and positive: positive, because it could be directly computed to be positive; negative, because the nsw flag means it is either negative or undefined (the multiplication always overflowed). llvm-svn: 145104	2011-11-23 16:26:47 +00:00
Eli Friedman	0bae8b2cfb	Fix code to match comment. Fixes PR11340, a regression from r143209. llvm-svn: 144121	2011-11-08 21:08:02 +00:00
Dan Gohman	ce3d6248b2	Add tests for existing InstSimplify features. llvm-svn: 143721	2011-11-04 18:39:16 +00:00
Dan Gohman	85977e6ab4	Teach instsimplify to simplify calls to undef. llvm-svn: 143719	2011-11-04 18:32:42 +00:00
Duncan Sands	3d5692a475	Reapply commit 143214 with a fix: m_ICmp doesn't match conditions with the given predicate, it matches any condition and returns the predicate - d'oh! Original commit message: The expression icmp eq (select (icmp eq x, 0), 1, x), 0 folds to false. Spotted by my super-optimizer in 186.crafty and 450.soplex. We really need a proper infrastructure for handling generalizations of this kind of thing (which occur a lot), however this case is so simple that I decided to go ahead and implement it directly. llvm-svn: 143318	2011-10-30 19:56:36 +00:00
Eli Friedman	3af3c046a9	Revert r143214; it's breaking a bunch of stuff. llvm-svn: 143265	2011-10-29 00:56:07 +00:00
Duncan Sands	280bc553b3	The expression icmp eq (select (icmp eq x, 0), 1, x), 0 folds to false. Spotted by my super-optimizer in 186.crafty and 450.soplex. We really need a proper infrastructure for handling generalizations of this kind of thing (which occur a lot), however this case is so simple that I decided to go ahead and implement it directly. llvm-svn: 143214	2011-10-28 19:01:20 +00:00
Duncan Sands	985ba6386d	A shift of a power of two is a power of two or zero. For completeness - not spotted in the wild. llvm-svn: 143211	2011-10-28 18:30:05 +00:00
Duncan Sands	92af0a8a7f	Fold icmp ugt (udiv X, Y), X to false. Spotted by my super-optimizer in 186.crafty. llvm-svn: 143209	2011-10-28 18:17:44 +00:00
Duncan Sands	7cb61e5a0e	Reapply commit 143028 with a fix: the problem was casting a ConstantExpr Mul using BinaryOperator (which only works for instructions) when it should have been a cast to OverflowingBinaryOperator (which also works for constants). While there, correct a few other dubious looking uses of BinaryOperator. Thanks to Chad Rosier for the testcase. Original commit message: My super-optimizer noticed that we weren't folding this expression to true: (x *nsw x) sgt 0, where x = (y | 1). This occurs in 464.h264ref. llvm-svn: 143125	2011-10-27 19:16:21 +00:00
Bob Wilson	1455ce27e4	Revert Duncan's r143028 expression folding which appears to be the culprit behind a compile failure on 483.xalancbmk. llvm-svn: 143102	2011-10-27 15:47:25 +00:00
Duncan Sands	ba286d7c73	The maximum power of 2 dividing a power of 2 is itself. This occurs in 403.gcc and was spotted by my super-optimizer. llvm-svn: 143054	2011-10-26 20:55:21 +00:00
Duncan Sands	1d2bb9882d	My super-optimizer noticed that we weren't folding this expression to true: (x *nsw x) sgt 0, where x = (y | 1). This occurs in 464.h264ref. llvm-svn: 143028	2011-10-26 15:31:51 +00:00
Benjamin Kramer	4b79c21ef2	InstSimplify: Don't try to replace an extractvalue/insertvalue pair with the original value if types don't match. Fixes clang selfhost. llvm-svn: 139120	2011-09-05 18:16:19 +00:00
Duncan Sands	fd26a954a8	Add some simple insertvalue simplifications, for the purpose of cleaning up do-nothing exception handling code produced by dragonegg. llvm-svn: 139113	2011-09-05 06:52:48 +00:00
Nick Lewycky	56e99c7933	Remove bogus test: for all possible inputs of %X, the 'sub nsw' is guaranteed to perform a signed wrap. Don't rely on any particular handling of that case. llvm-svn: 135471	2011-07-19 08:22:57 +00:00
Dan Gohman	54664ed714	Improve constant folding of undef for cmp and select operators. llvm-svn: 134223	2011-07-01 01:03:43 +00:00
Dan Gohman	ca8d9e1341	Improve constant folding of undef for binary operators. llvm-svn: 134221	2011-07-01 00:42:17 +00:00
Duncan Sands	af32728a57	The comparison "max(x,y)==x" is equivalent to "x>=y". Since the max is often expressed as "x >= y ? x : y", there is a good chance we can extract the existing "x >= y" from it and use that as a replacement for "max(x,y)==x". llvm-svn: 131049	2011-05-07 16:56:49 +00:00
Duncan Sands	a228785526	Add variations on: max(x,y) >= min(x,z) folds to true. This isn't that common, but according to my super-optimizer there are only two missed simplifications of -instsimplify kind when compiling bzip2, and this is one of them. It amuses me to have bzip2 be perfectly optimized as far as instsimplify goes! llvm-svn: 130840	2011-05-04 16:05:05 +00:00
Duncan Sands	0a9c1246d7	Implement some basic simplifications involving min/max, for example max(a,b) >= a -> true. According to my super-optimizer, these are by far the most common simplifications (of the -instsimplify kind) that occur in the testsuite and aren't caught by -std-compile-opts. llvm-svn: 130780	2011-05-03 19:53:10 +00:00
Duncan Sands	a3e3699c88	Move some rem transforms out of instcombine and into instsimplify. This automagically provides a transform noticed by my super-optimizer as occurring quite often: "rem x, (select cond, x, 1)" -> 0. llvm-svn: 130694	2011-05-02 16:27:02 +00:00
Benjamin Kramer	5acc751b6f	Teach ComputeMaskedBits about sub nsw. llvm-svn: 127548	2011-03-12 17:18:11 +00:00
Nick Lewycky	cc79973856	Teach ComputeMaskedBits about nsw on add. I don't think there's anything we can do with nuw here, but sub and mul should be given similar treatment. Fixes PR9343 #15! llvm-svn: 127463	2011-03-11 09:00:19 +00:00
Benjamin Kramer	1885d21700	Fix mistyped CHECK lines. llvm-svn: 127366	2011-03-09 22:07:31 +00:00
Nick Lewycky	980104d1d6	Add another micro-optimization. Apologies for the lack of refactoring, but I gave up when I realized I couldn't come up with a good name for what the refactored function would be, to describe what it does. This is PR9343 test12, which is test3 with arguments reordered. Whoops! llvm-svn: 127318	2011-03-09 06:26:03 +00:00
Nick Lewycky	9719a719c7	Thread comparisons over udiv/sdiv/ashr/lshr exact and lshr nuw/nsw whenever possible. This goes into instcombine and instsimplify because instsimplify doesn't need to check hasOneUse since it returns (almost exclusively) constants. This fixes PR9343 #4 #5 and #8! llvm-svn: 127064	2011-03-05 05:19:11 +00:00
Nick Lewycky	41c529bd09	Revert broken srem logic from r126991. llvm-svn: 127021	2011-03-04 19:26:08 +00:00
Nick Lewycky	8e3a79da9f	Fold "icmp pred (srem X, Y), Y" like we do for urem. Handle signed comparisons in the urem case, though not the other way around. This is enough to get #3 from PR9343! llvm-svn: 126991	2011-03-04 10:06:52 +00:00
Nick Lewycky	3cec6f5563	Teach instruction simplify to use constant ranges to solve problems of the form "icmp pred %X, CI" and a number of examples where "%X = binop %Y, CI2". Some of these cases (div and rem) used to make it through opt -O2, but the others are probably now making code elsewhere redundant (probably instcombine). llvm-svn: 126988	2011-03-04 07:00:57 +00:00
Nick Lewycky	c9d20067cd	Optimize "icmp pred (urem X, Y), Y" --> true/false depending on pred. There's more work to do here, "icmp ult (urem X, 10), 11" doesn't optimize away yet. Fixes example 3 from PR9343! llvm-svn: 126741	2011-03-01 08:15:50 +00:00
Duncan Sands	d114ab331c	Teach instsimplify that X+Y>=X+Z is the same as Y>=Z if neither side overflows, plus some variations of this. According to my auto-simplifier this occurs a lot but usually in combination with max/min idioms. Because max/min aren't handled yet this unfortunately doesn't have much effect in the testsuite. llvm-svn: 125462	2011-02-13 17:15:40 +00:00
Chris Lattner	9e4aa0259f	Teach instsimplify some tricks about exact/nuw/nsw shifts. improve interfaces to instsimplify to take this info. llvm-svn: 125196	2011-02-09 17:15:04 +00:00
Chris Lattner	6e57b15228	teach instsimplify to transform (X / Y) * Y to X when the div is an exact udiv. llvm-svn: 124994	2011-02-06 22:05:31 +00:00
Chris Lattner	9c70414551	rename test. llvm-svn: 124993	2011-02-06 21:59:10 +00:00
Duncan Sands	06504025d2	Improve threading of comparisons over select instructions (spotted by my auto-simplifier). This has a big impact on Ada code, but not much else. Unfortunately the impact is mostly negative! This is due to PR9004 (aka SCCP failing to resolve conditional branch conditions in the destination blocks of the branch), in which simple correlated expressions are not resolved but complicated ones are, so simplifying has a bad effect! llvm-svn: 124788	2011-02-03 09:37:39 +00:00
Duncan Sands	5747abab10	Reenable the transform "(X*Y)/Y->X" when the multiplication is known not to overflow (nsw flag), which was disabled because it breaks 254.gap. I have informed the GAP authors of the mistake in their code, and arranged for the testsuite to use -fwrapv when compiling this benchmark. llvm-svn: 124746	2011-02-02 20:52:00 +00:00
Duncan Sands	cf0ff030a8	Have m_One also match constant vectors for which every element is 1. llvm-svn: 124655	2011-02-01 08:39:12 +00:00
Duncan Sands	2e5a58da8f	Commit 124487 broke 254.gap. See if disabling the part that might be triggered by PR9088 fixes things. llvm-svn: 124561	2011-01-30 18:24:20 +00:00
Duncan Sands	b67edc6a29	Transform (X/Y)*Y into X if the division is exact. Instcombine already knows how to do this and more, but would only do it if X/Y had only one use. Spotted as the most common missed simplification in SPEC by my auto-simplifier, now that it knows about nuw/nsw/exact flags. This removes a bunch of multiplications from 447.dealII and 483.xalancbmk. It also removes a lot from tramp3d-v4, which results in much more inlining. llvm-svn: 124560	2011-01-30 18:03:50 +00:00
Frits van Bommel	c2549661af	Move InstCombine's knowledge of fdiv to SimplifyInstruction(). llvm-svn: 124534	2011-01-29 15:26:31 +00:00
Duncan Sands	2e9e4f1be3	Fix typo: should have been testing that X was odd, not V. llvm-svn: 124533	2011-01-29 13:27:00 +00:00
Duncan Sands	771e82a863	My auto-simplifier noticed that ((X/Y)*Y)/Y occurs several times in SPEC benchmarks, and that it can be simplified to X/Y. (In general you can only simplify (Z*Y)/Y to Z if the multiplication did not overflow; if Z has the form "X/Y" then this is the case). This patch implements that transform and moves some Div logic out of instcombine and into InstructionSimplify. Unfortunately instcombine gets in the way somewhat, since it likes to change (X/Y)*Y into X-(X rem Y), so I had to teach instcombine about this too. Finally, thanks to the NSW/NUW flags, sometimes we know directly that "Z*Y" does not overflow, because the flag says so, so I added that logic too. This eliminates a bunch of divisions and subtractions in 447.dealII, and has good effects on some other benchmarks too. It seems to have quite an effect on tramp3d-v4 but it's hard to say if it's good or bad because inlining decisions changed, resulting in massive changes all over. llvm-svn: 124487	2011-01-28 16:51:11 +00:00
Duncan Sands	9e9d5b25e2	In which I discover that zero+zero is zero, d'oh! llvm-svn: 124188	2011-01-25 15:14:15 +00:00
Duncan Sands	c78548d791	Turn off this test - the corresponding instsimplify logic has been disabled. llvm-svn: 124185	2011-01-25 12:31:43 +00:00
Duncan Sands	d395108394	According to my auto-simplifier the most common missed simplifications in optimized code are: (non-negative number)+(power-of-two) != 0 -> true and (x | 1) != 0 -> true Instcombine knows about the second one of course, but only does it if X|1 has only one use. These fire thousands of times in the testsuite. llvm-svn: 124183	2011-01-25 09:38:29 +00:00
Duncan Sands	8fb2c3827c	At -O123 the early-cse pass is run before instcombine has run. According to my auto-simplifier the transform most missed by early-cse is (zext X) != 0 -> X != 0. This patch adds this transform and some related logic to InstructionSimplify and removes some of the logic from instcombine (unfortunately not all because there are several situations in which instcombine can improve things by making new instructions, whereas instsimplify is not allowed to do this). At -O2 this often results in more than 15% more simplifications by early-cse, and results in hundreds of lines of bitcode being eliminated from the testsuite. I did see some small negative effects in the testsuite, for example a few additional instructions in three programs. One program, 483.xalancbmk, got an additional 35 instructions, which seems to be due to a function getting an additional instruction and then being inlined all over the place. llvm-svn: 123911	2011-01-20 13:21:55 +00:00
Duncan Sands	99589d07e9	For completeness, generalize the (X + Y) - Y -> X transform and add X - (X + 1) -> -1. These were not recommended by my auto-simplifier since they don't fire often enough. However they do fire from time to time, for example they remove one subtraction from the final bitcode for 483.xalancbmk. llvm-svn: 123755	2011-01-18 11:50:19 +00:00
Duncan Sands	9b8e2bd8ef	Simplify (X<<1)-X into X. According to my auto-simplifier this is the most common missed simplification in fully optimized code. It occurs sporadically in the testsuite, and many times in 403.gcc: the final bitcode has 131 fewer subtractions after this change. The reason that the multiplies are not eliminated is the same reason that instcombine did not catch this: they are used by other instructions (instcombine catches this with a more general transform which in general is only profitable if the operands have only one use). llvm-svn: 123754	2011-01-18 09:24:58 +00:00
Duncan Sands	d6f1a9584d	Turn X-(X-Y) into Y. According to my auto-simplifier this is the most common simplification present in fully optimized code (I think instcombine fails to transform some of these when "X-Y" has more than one use). Fires here and there all over the test-suite, for example it eliminates 8 subtractions in the final IR for 445.gobmk, 2 subs in 447.dealII, 2 in paq8p etc. llvm-svn: 123442	2011-01-14 15:26:10 +00:00
Duncan Sands	571fd9a606	Factorize common code out of the InstructionSimplify shift logic. Add in threading of shifts over selects and phis while there. This fires here and there in the testsuite, to not much effect. For example when compiling spirit it fires 5 times, during early-cse, resulting in 6 more cse simplifications, and 3 more terminators being folded by jump threading, but the final bitcode doesn't change in any interesting way: other optimizations would have caught the opportunity anyway, only later. llvm-svn: 123441	2011-01-14 14:44:12 +00:00
Duncan Sands	c3eb0f4b2e	Rename this test. llvm-svn: 123440	2011-01-14 14:16:33 +00:00
Duncan Sands	8d25a7c3a0	The most common simplification missed by instsimplify in unoptimized bitcode is "X != 0 -> X" when X is a boolean. This occurs a lot because of the way llvm-gcc converts gcc's conditional expressions. Add this, and a few other similar transforms for completeness. llvm-svn: 123372	2011-01-13 08:56:29 +00:00
Duncan Sands	772749aea1	Revert commit 122654 at the request of Chris, who reckons that instsimplify is the wrong hammer for this nail, and is probably right. llvm-svn: 122661	2011-01-01 20:08:02 +00:00
Duncan Sands	e3c539581c	Fix a README item by having InstructionSimplify do a mild form of value numbering, in which it considers (for example) "%a = add i32 %x, %y" and "%b = add i32 %x, %y" to be equal because the operands are equal and the result of the instructions only depends on the values of the operands. This has almost no effect (it removes 4 instructions from gcc-as-one-file), and perhaps slows down compilation: I measured a 0.4% slowdown on the large gcc-as-one-file testcase, but it wasn't statistically significant. llvm-svn: 122654	2011-01-01 16:12:09 +00:00
Duncan Sands	a45cfbd405	When determining whether the new instruction was already present in the original instruction, half the cases were missed (making it not wrong but suboptimal). Also correct a typo (A <-> B) in the second chunk. llvm-svn: 122414	2010-12-22 17:15:25 +00:00
Duncan Sands	76befde93a	Add an additional InstructionSimplify factorization test. llvm-svn: 122333	2010-12-21 15:12:22 +00:00
Duncan Sands	fecc642224	While I don't think any later transforms can fire, it seems cleaner to not assume this (for example in case more transforms get added below it). Suggested by Frits van Bommel. llvm-svn: 122332	2010-12-21 15:03:43 +00:00
Duncan Sands	07c17132d7	Fix typo in comment, spotted by Deewiant. llvm-svn: 122329	2010-12-21 13:39:20 +00:00
Duncan Sands	ee3ec6eb94	Teach InstructionSimplify about distributive laws. These transforms fire quite often, but don't make much difference in practice presumably because instcombine also knows them and more. llvm-svn: 122328	2010-12-21 13:32:22 +00:00
Duncan Sands	6c7a52cf80	Add generic simplification of associative operations, generalizing a couple of existing transforms. This fires surprisingly often, for example when compiling gcc "(X+(-1))+1->X" fires quite a lot as well as various "and" simplifications (usually with a phi node operand). Most of the time this doesn't make a real difference since the same thing would have been done elsewhere anyway, eg: by instcombine, but there are a few places where this results in simplifications that we were not doing before. llvm-svn: 122326	2010-12-21 08:49:00 +00:00

293 Commits