llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjoy Das	dcc84db264	Bugfix: SCEVExpander incorrectly marks increment operations as no-wrap (The change was landed in r230280 and caused the regression PR22674. This version contains a fix and a test-case for PR22674). When emitting the increment operation, SCEVExpander marks the operation as nuw or nsw based on the flags on the preincrement SCEV. This is incorrect because, for instance, it is possible that {-6,+,1} is <nuw> while {-6,+,1}+1 = {-5,+,1} is not. This change teaches SCEV to mark the increment as nuw/nsw only if it can explicitly prove that the increment operation won't overflow. Apart from the attached test case, another (more realistic) manifestation of the bug can be seen in Transforms/IndVarSimplify/pr20680.ll. Differential Revision: http://reviews.llvm.org/D7778 llvm-svn: 230533	2015-02-25 20:02:59 +00:00
Hans Wennborg	953d6fb84e	Revert r230280: "Bugfix: SCEVExpander incorrectly marks increment operations as no-wrap" This caused PR22674, failing this assert: Instructions.h:2281: llvm::Value* llvm::PHINode::getOperand(unsigned int) const: Assertion `i_nocapture < OperandTraits<PHINode>::operands(this) && "getOperand() out of range!"' failed. llvm-svn: 230341	2015-02-24 16:19:29 +00:00
Sanjoy Das	b14010d28b	Fix bug 22641 The bug was a result of getPreStartForExtend interpreting nsw/nuw flags on an add recurrence more strongly than is legal. {S,+,X}<nsw> implies S+X is nsw only if the backedge of the loop is taken at least once. NOTE: I had accidentally committed an unrelated change with the commit message of this change in r230275 (r230275 was reverted in r230279). This is the correct change for this commit message. Differential Revision: http://reviews.llvm.org/D7808 llvm-svn: 230291	2015-02-24 01:02:42 +00:00
Sanjoy Das	18c243b933	Bugfix: SCEVExpander incorrectly marks increment operations as no-wrap When emitting the increment operation, SCEVExpander marks the operation as nuw or nsw based on the flags on the preincrement SCEV. This is incorrect because, for instance, it is possible that {-6,+,1} is <nuw> while {-6,+,1}+1 = {-5,+,1} is not. This change teaches SCEV to mark the increment as nuw/nsw only if it can explicitly prove that the increment operation won't overflow. Apart from the attached test case, another (more realistic) manifestation of the bug can be seen in Transforms/IndVarSimplify/pr20680.ll. NOTE: this change was landed with an incorrect commit message in rL230275 and was reverted for that reason in rL230279. This commit message is the correct one. Differential Revision: http://reviews.llvm.org/D7778 llvm-svn: 230280	2015-02-23 23:22:58 +00:00
Sanjoy Das	c9cf0151cf	Revert 230275. 230275 got committed with an incorrect commit message due to a mixup on my side. Will re-land in a few moments with the correct commit message. llvm-svn: 230279	2015-02-23 23:13:22 +00:00
Sanjoy Das	913dfd8f7f	Fix bug 22641 The bug was a result of getPreStartForExtend interpreting nsw/nuw flags on an add recurrence more strongly than is legal. {S,+,X}<nsw> implies S+X is nsw only if the backedge of the loop is taken at least once. Differential Revision: http://reviews.llvm.org/D7808 llvm-svn: 230275	2015-02-23 22:55:13 +00:00
Sanjoy Das	4153f47026	Generalize getExtendAddRecStart to work with both sign and zero extensions. This change also removes `DEBUG(dbgs() << "SCEV: untested prestart overflow check\n");` because that case has a unit test now. Differential Revision: http://reviews.llvm.org/D7645 llvm-svn: 229600	2015-02-18 01:47:07 +00:00
Sanjoy Das	bf5d870dfa	Bugfix: SCEV incorrectly marks certain add recurrences as nsw When creating a scev for sext({X,+,Y}), scev checks if the expression is equivalent to {sext X,+,zext Y}. If it can prove that, it also tags the original {X,+,Y} as <nsw>, which is not correct. In the test case I run `-scalar-evolution` twice because the bug manifests only once SCEV has run through and seen the `sext` expressions (and then does a in-place mutation on {X,+,Y}). Differential Revision: http://reviews.llvm.org/D7495 llvm-svn: 228586	2015-02-09 18:34:55 +00:00
Johannes Doerfert	2683e5676c	Allow ScalarEvolution to catch more min/max cases For the attached test case different types are used in the ICmpInst and SelectInst that represent the min/max expressions. However, if the ICmpInst type is smaller a comparison with the sign/zero extended operands would have yielded the same result. This situation might arise after the instruction combination pass was applied. Differential Revision: http://reviews.llvm.org/D7338 llvm-svn: 228572	2015-02-09 12:34:23 +00:00
Sanjoy Das	f2e931cae9	Bugfix: ScalarEvolution incorrectly assumes that the start of certain add recurrences don't overflow. This change makes the optimization more restrictive. It still assumes that an overflowing `add nsw` is undefined behavior; and this change will need revisiting once we have a consistent semantics for poison values. Differential Revision: http://reviews.llvm.org/D7331 llvm-svn: 228552	2015-02-08 22:52:17 +00:00
Sanjoy Das	cb47366366	Make ScalarEvolution less aggressive with respect to no-wrap flags. ScalarEvolution currently lowers a subtraction recurrence to an add recurrence with the same no-wrap flags as the subtraction. This is incorrect because `sub nsw X, Y` is not the same as `add nsw X, -Y` and `sub nuw X, Y` is not the same as `add nuw X, -Y`. This patch fixes the issue, and adds two test cases demonstrating the bug. Differential Revision: http://reviews.llvm.org/D7081 llvm-svn: 226755	2015-01-22 00:48:47 +00:00
Sanjoy Das	81401d4b19	Fix PR22179. We were incorrectly inferring nsw for certain SCEVs. We can be more aggressive here (see Richard Smith's comment on http://llvm.org/bugs/show_bug.cgi?id=22179) but this change just focuses on correctness. Differential Revision: http://reviews.llvm.org/D6914 llvm-svn: 225591	2015-01-10 23:41:24 +00:00
Duncan P. N. Exon Smith	be7ea19b58	IR: Make metadata typeless in assembly Now that `Metadata` is typeless, reflect that in the assembly. These are the matching assembly changes for the metadata/value split in r223802. - Only use the `metadata` type when referencing metadata from a call intrinsic -- i.e., only when it's used as a `Value`. - Stop pretending that `ValueAsMetadata` is wrapped in an `MDNode` when referencing it from call intrinsics. So, assembly like this: define @foo(i32 %v) { call void @llvm.foo(metadata !{i32 %v}, metadata !0) call void @llvm.foo(metadata !{i32 7}, metadata !0) call void @llvm.foo(metadata !1, metadata !0) call void @llvm.foo(metadata !3, metadata !0) call void @llvm.foo(metadata !{metadata !3}, metadata !0) ret void, !bar !2 } !0 = metadata !{metadata !2} !1 = metadata !{i32* @global} !2 = metadata !{metadata !3} !3 = metadata !{} turns into this: define @foo(i32 %v) { call void @llvm.foo(metadata i32 %v, metadata !0) call void @llvm.foo(metadata i32 7, metadata !0) call void @llvm.foo(metadata i32* @global, metadata !0) call void @llvm.foo(metadata !3, metadata !0) call void @llvm.foo(metadata !{!3}, metadata !0) ret void, !bar !2 } !0 = !{!2} !1 = !{i32* @global} !2 = !{!3} !3 = !{} I wrote an upgrade script that handled almost all of the tests in llvm and many of the tests in cfe (even handling many `CHECK` lines). I've attached it (or will attach it in a moment if you're speedy) to PR21532 to help everyone update their out-of-tree testcases. This is part of PR21532. llvm-svn: 224257	2014-12-15 19:07:53 +00:00
David Majnemer	0df1d12476	ScalarEvolution: HowFarToZero was wrongly using signed division HowFarToZero was supposed to use unsigned division in order to calculate the backedge taken count. However, SCEVDivision::divide performs signed division. Unless I am mistaken, no users of SCEVDivision actually want signed arithmetic: switch to udiv and urem. This fixes PR21578. llvm-svn: 222093	2014-11-16 07:30:35 +00:00
Rafael Espindola	1c99344156	Use FileCheck in a few tests. llvm-svn: 221459	2014-11-06 15:05:51 +00:00
Bradley Smith	9992b167ae	[SCEV] Improve Scalar Evolution's use of no {un,}signed wrap flags In a case where we have a no {un,}signed wrap flag on the increment, if RHS - Start is constant then we can avoid inserting a max operation bewteen the two, since we can statically determine which is greater. This allows us to unroll loops such as: void testcase3(int v) { for (int i=v; i<=v+1; ++i) f(i); } llvm-svn: 220960	2014-10-31 11:40:32 +00:00
Sanjoy Das	1f05c51e5e	This patch teaches ScalarEvolution to pick and use !range metadata. It also makes it more aggressive in querying range information by adding a call to isKnownPredicateWithRanges to isLoopBackedgeGuardedByCond and isLoopEntryGuardedByCond. phabricator: http://reviews.llvm.org/D5638 Reviewed by: atrick, hfinkel llvm-svn: 219532	2014-10-10 21:22:34 +00:00
Mark Heffernan	2beab5f0b4	This patch de-pessimizes the calculation of loop trip counts in ScalarEvolution in the presence of multiple exits. Previously all loops exits had to have identical counts for a loop trip count to be considered computable. This pessimization was implemented by calling getBackedgeTakenCount(L) rather than getExitCount(L, ExitingBlock) inside of ScalarEvolution::getSmallConstantTripCount() (see the FIXME in the comments of that function). The pessimization was added to fix a corner case involving undefined behavior (pr/16130). This patch more precisely handles the undefined behavior case allowing the pessimization to be removed. ControlsExit replaces IsSubExpr to more precisely track the case where undefined behavior is expected to occur. Because undefined behavior is tracked more precisely we can remove MustExit from ExitLimit. MustExit was used to track the case where the limit was computed potentially assuming undefined behavior even if undefined behavior didn't necessarily occur. llvm-svn: 219517	2014-10-10 17:39:11 +00:00
Hal Finkel	cebf0cc210	Make use @llvm.assume for loop guards in ScalarEvolution This adds a basic (but important) use of @llvm.assume calls in ScalarEvolution. When SE is attempting to validate a condition guarding a loop (such as whether or not the loop count can be zero), this check should also include dominating assumptions. llvm-svn: 217348	2014-09-07 21:37:59 +00:00
Dinesh Dwivedi	c0e6703360	Adding testcase for PR18886. Differential Revision: http://reviews.llvm.org/D3837 llvm-svn: 209645	2014-05-27 06:44:25 +00:00
Andrew Trick	b429083aff	Test case comments. Fix sloppiness. llvm-svn: 209551	2014-05-23 20:46:21 +00:00
Andrew Trick	839e30b2c0	Fix and improve SCEV ComputeBackedgeTankCount. This is a follow-up to r209358: PR19799: Indvars miscompile due to an incorrect max backedge taken count from SCEV. That fix was incomplete as pointed out by Arnold and Michael Z. The code was also too confusing. It needed a careful rewrite with more unit tests. This version will also happen to optimize more cases. <rdar://17005101> PR19799: Indvars miscompile... llvm-svn: 209545	2014-05-23 19:47:13 +00:00
Andrew Trick	e255359b57	Fix a bug in SCEV's backedge taken count computation from my prior fix in Jan. This has to do with the trip count computation for loops with multiple exits, which is quite subtle. Most passes just ask for a single trip count number, so we must be conservative assuming any exit could be taken. Normally, we rely on the "exact" trip count, which was correctly given as "unknown". However, SCEV also gives a "max" back-edge taken count. The loops max BE taken count is conservatively a maximum over the max of each exit's non-exiting iterations count. Note that some exit tests can be skipped so the max loop back-edge taken count can actually exceed the max non-exiting iterations for some exits. However, when we know the loop latch cannot be skipped, we can directly use its max taken count disregarding other exits. I previously took the minimum here without checking whether the other exit could be skipped. The correct, and simpler thing to do here is just to directly use the loop latch's max non-exiting iterations as the loops max back-edge count. In the problematic test case, the first loop exit had a max of zero non-exiting iterations, but could be skipped. The loop latch was known not to be skipped but had max of one non-exiting iteration. We incorrectly claimed the loop back-edge could be taken zero times, when it is actually taken one time. Fixes Loop %for.body.i: <multiple exits> Unpredictable backedge-taken count. Loop %for.body.i: max backedge-taken count is 1. llvm-svn: 209358	2014-05-22 00:37:03 +00:00
Benjamin Kramer	e75eaca32f	ScalarEvolution: Compute exit counts for loops with a power-of-2 step. If we have a loop of the form for (unsigned n = 0; n != (k & -32); n += 32) {} then we know that n is always divisible by 32 and the loop must terminate. Even if we have a condition where the loop counter will overflow it'll always hold this invariant. PR19183. Our loop vectorizer creates this pattern and it's also occasionally formed by loop counters derived from pointers. llvm-svn: 204728	2014-03-25 16:25:12 +00:00
Nico Rieck	35a237d4ed	Actually call FileCheck in tests llvm-svn: 201491	2014-02-16 13:27:39 +00:00
Benjamin Kramer	5a188549ad	ScalarEvolution: Analyze trip count of loops with a switch guarding the exit. llvm-svn: 201159	2014-02-11 15:44:32 +00:00
Nick Lewycky	629199ccb3	Fix crasher introduced in r200203 and caught by a libc++ buildbot. Don't assume that getMulExpr returns a SCEVMulExpr, it may have simplified it to something else! llvm-svn: 200210	2014-01-27 10:47:44 +00:00
Nick Lewycky	31eaca5513	Teach SCEV to handle more cases of 'and X, CST', specifically where CST is any number of contiguous 1 bits in a row, with any number of leading and trailing 0 bits. Unfortunately, this in turn led to some lower quality SCEVs due to some different paths through expression simplification, so add getUDivExactExpr and use it. This fixes all instances of the problems that I found, but we can make that function smarter as necessary. Merge test "xor-and.ll" into "and-xor.ll" since I needed to update it anyways. Test 'nsw-offset.ll' analyzes a little deeper, %n now gets a scev in terms of %no instead of a SCEVUnknown. llvm-svn: 200203	2014-01-27 10:04:03 +00:00
Alp Toker	cb40291100	Fix known typos Sweep the codebase for common typos. Includes some changes to visible function names that were misspelt. llvm-svn: 200018	2014-01-24 17:20:08 +00:00
Stepan Dyatkovskiy	431993b57b	Fixed old typo in ScalarEvolution, that caused wrong SCEVs zext operation. Detailed description is here: http://llvm.org/bugs/show_bug.cgi?id=18000#c16 For participation in bugfix process special thanks to David Wiberg. llvm-svn: 198863	2014-01-09 12:26:12 +00:00
Andrew Trick	34e2f0c4ea	Rewrite SCEV's backedge taken count computation. Patch by Michele Scandale! Rewrite of the functions used to compute the backedge taken count of a loop on LT and GT comparisons. I decided to split the handling of LT and GT cases becasue the trick "a > b == -a < -b" in some cases prevents the trip count computation due to the multiplication by -1 on the two operands of the comparison. This issue comes from the conservative computation of value range of SCEVs: taking the negative SCEV of an expression that have a small positive range (e.g. [0,31]), we would have a SCEV with a fullset as value range. Indeed, in the new rewritten function I tried to better handle the maximum backedge taken count computation when MAX/MIN expression are used to handle the cases where no entry guard is found. Some test have been modified in order to check the new value correctly (I manually check them and reasoning on possible overflow the new values seem correct). I finally added a new test case related to the multiplication by -1 issue on GT comparisons. llvm-svn: 194116	2013-11-06 02:08:26 +00:00
Benjamin Kramer	6094f30da2	SCEV: Make the final add of an inbounds GEP nuw if we know that the index is positive. We can't do this for the general case as saying a GEP with a negative index doesn't have unsigned wrap isn't valid for negative indices. %gep = getelementptr inbounds i32* %p, i64 -1 But an inbounds GEP cannot run past the end of address space. So we check for the very common case of a positive index and make GEPs derived from that NUW. Together with Andy's recent non-unit stride work this lets us analyze loops like void foo3(int a, int b) { for (; a < b; a++) {} } PR12375, PR12376. Differential Revision: http://llvm-reviews.chandlerc.com/D2033 llvm-svn: 193514	2013-10-28 07:30:06 +00:00
Matt Arsenault	be18b8a3ca	Fix creating bitcasts between address spaces in SCEV. The test before wasn't successfully testing this since it was missing the datalayout piece to change the size of the second address space. llvm-svn: 193102	2013-10-21 18:41:10 +00:00
Andrew Trick	768b917dc8	SCEV should use NSW to get trip count for positive nonunit stride loops. SCEV currently fails to compute loop counts for nonunit stride loops. This comes up frequently. It prevents loop optimization and forces vectorization to insert extra loop checks. For example: void foo(int n, int *x) { for (int i = 0; i < n; i += 3) { x[i] = i; x[i+1] = i+1; x[i+2] = i+2; } } We need to properly handle the case in which limit > INT_MAX-stride. In the above case: n > INT_MAX-3. In this case the loop counter will step beyond the limit and overflow at the same time. However, knowing that signed integer overlow in undefined, we can assume the loop test behavior is arbitrary after overflow. This obeys both C undefined behavior rules, and the more strict LLVM poison value rules. I'm finally fixing this in response to Hal Finkel's persistence. The most probable reason that we never optimized this before is that we were being careful to handle case where the developer expected a side-effect free infinite loop relying on overflow: for (int i = 0; i < n; i += s) { ++j; } return j; If INT_MAX+1 is a multiple of s and n > INT_MAX-s, then we might expect an infinite loop. However there are plenty of ways to achieve this effect without relying on undefined behavior of signed overflow. llvm-svn: 193015	2013-10-18 23:43:53 +00:00
Matt Arsenault	a90a18e0ea	Teach ScalarEvolution about pointer address spaces llvm-svn: 190425	2013-09-10 19:55:24 +00:00
Bill Wendling	a08bb49c61	FileCheck-ize tests. llvm-svn: 188971	2013-08-22 00:51:19 +00:00
Daniel Dunbar	9efbedfd35	[tests] Cleanup initialization of test suffixes. - Instead of setting the suffixes in a bunch of places, just set one master list in the top-level config. We now only modify the suffix list in a few suites that have one particular unique suffix (.ml, .mc, .yaml, .td, .py). - Aside from removing the need for a bunch of lit.local.cfg files, this enables 4 tests that were inadvertently being skipped (one in Transforms/BranchFolding, a .s file each in DebugInfo/AArch64 and CodeGen/PowerPC, and one in CodeGen/SI which is now failing and has been XFAILED). - This commit also fixes a bunch of config files to use config.root instead of older copy-pasted code. llvm-svn: 188513	2013-08-16 00:37:11 +00:00
Bill Wendling	dc17270968	FileCheckize some of the testcases. llvm-svn: 187756	2013-08-05 23:43:18 +00:00
Stephen Lin	6dd347b39f	Add newlines at end of test files, no functionality change llvm-svn: 186263	2013-07-13 22:00:58 +00:00
Andrew Trick	3c944ba2f0	Unit test for SCEV fix r182989, PR16130. llvm-svn: 183017	2013-05-31 16:42:41 +00:00
Manman Ren	662ece49de	TBAA: remove !tbaa from testing cases if not used. This will make it easier to turn on struct-path aware TBAA since the metadata format will change. llvm-svn: 180743	2013-04-29 22:42:01 +00:00
Andrew Trick	9093e15066	Fix SCEV forgetMemoizedResults should search and destroy backedge exprs. Fixes PR15570: SEGV: SCEV back-edge info invalid after dead code removal. Indvars creates a SCEV expression for the loop's back edge taken count, then determines that the comparison is always true and removes it. When loop-unroll asks for the expression, it contains a NULL SCEVUnknkown (as a CallbackVH). forgetMemoizedResults should invalidate the loop back edges expression. llvm-svn: 177986	2013-03-26 03:14:53 +00:00
Dmitri Gribenko	56bf2e1830	Tests: rewrite 'opt ... %s' to 'opt ... < %s' so that opt does not emit a ModuleID This is done to avoid odd test failures, like the one fixed in r171243. llvm-svn: 171250	2012-12-30 02:33:22 +00:00
Dmitri Gribenko	10c4b4d249	Add a check to the test Analysis/ScalarEvolution/2010-09-03-RequiredTransitive.ll This test did not test anything at all (except for opt crashing, but that was not the reason why it was added). llvm-svn: 171248	2012-12-30 01:42:34 +00:00
Dmitri Gribenko	b137c9e551	Tests: rewrite 'opt ... %s' to 'opt ... < %s' so that opt does not emit a ModuleID This is done to avoid odd test failures, like the one fixed in r171243. llvm-svn: 171246	2012-12-30 01:28:40 +00:00
Benjamin Kramer	2f47a3fb07	Fix broken check lines. I really need to find a way to automate this, but I can't come up with a regex that has no false positives while handling tricky cases like custom check prefixes. llvm-svn: 162097	2012-08-17 12:28:26 +00:00
Nick Lewycky	fb78083b1c	Stay rational; don't assert trying to take the square root of a negative value. If it's negative, the loop is already proven to be infinite. Fixes PR13489! llvm-svn: 161107	2012-08-01 09:14:36 +00:00
Chandler Carruth	ff123d5c63	Fix the remaining TCL-style quotes found in the testsuite. This is another mechanical change accomplished though the power of terrible Perl scripts. I have manually switched some "s to 's to make escaping simpler. While I started this to fix tests that aren't run in all configurations, the massive number of tests is due to a really frustrating fragility of our testing infrastructure: things like 'grep -v', 'not grep', and 'expected failures' can mask broken tests all too easily. Essentially, I'm deeply disturbed that I can change the testsuite so radically without causing any change in results for most platforms. =/ llvm-svn: 159547	2012-07-02 19:09:46 +00:00
Chandler Carruth	5da53436d5	Convert the uses of '\|&' to use '2>&1 \|' instead, which works on old versions of Bash. In addition, I can back out the change to the lit built-in shell test runner to support this. This should fix the majority of fallout on Darwin, but I suspect there will be a few straggling issues. llvm-svn: 159544	2012-07-02 18:37:59 +00:00
Chandler Carruth	a5a29f970e	Convert all tests using TCL-style quoting to use shell-style quoting. This was done through the aid of a terrible Perl creation. I will not paste any of the horrors here. Suffice to say, it require multiple staged rounds of replacements, state carried between, and a few nested-construct-parsing hacks that I'm not proud of. It happens, by luck, to be able to deal with all the TCL-quoting patterns in evidence in the LLVM test suite. If anyone is maintaining large out-of-tree test trees, feel free to poke me and I'll send you the steps I used to convert things, as well as answer any painful questions etc. IRC works best for this type of thing I find. Once converted, switch the LLVM lit config to use ShTests the same as Clang. In addition to being able to delete large amounts of Python code from 'lit', this will also simplify the entire test suite and some of lit's architecture. Finally, the test suite runs 33% faster on Linux now. ;] For my 16-hardware-thread (2x 4-core xeon e5520): 36s -> 24s llvm-svn: 159525	2012-07-02 12:47:22 +00:00
Nick Lewycky	474112d82c	If the step value is a constant zero, the loop isn't going to terminate. Fixes the assert reported in PR13228! llvm-svn: 159393	2012-06-28 23:44:57 +00:00
Andrew Trick	a3f9043196	SCEV: Handle a corner case reducing AddRecExpr * AddRecExpr If integer overflow causes one of the terms to reach zero, that can force the entire expression to zero. Fixes PR12929: cast<Ty>() argument of incompatible type llvm-svn: 157673	2012-05-30 03:35:20 +00:00
Andrew Trick	7fa4e0fea6	SCEV: Add MarkPendingLoopPredicates to avoid recursive isImpliedCond. getUDivExpr attempts to simplify by checking for overflow. isLoopEntryGuardedByCond then evaluates the loop predicate which may lead to the same getUDivExpr causing endless recursion. Fixes PR12868: clang 3.2 segmentation fault. llvm-svn: 157092	2012-05-19 00:48:25 +00:00
Benjamin Kramer	e364d195e9	Revert "SCEV: When expanding a GEP the final addition to the base pointer has NUW but not NSW." This isn't right either, reverting for now. llvm-svn: 154910	2012-04-17 06:33:57 +00:00
Benjamin Kramer	e1f4ca1b0f	SCEV: When expanding a GEP the final addition to the base pointer has NUW but not NSW. Found by inspection. llvm-svn: 154262	2012-04-07 17:19:26 +00:00
Andrew Trick	7004e4b95e	SCEV fix: Handle loop invariant loads. Fixes PR11882: NULL dereference in ComputeLoadConstantCompareExitLimit. llvm-svn: 153480	2012-03-26 22:33:59 +00:00
Andrew Trick	bd11257df7	Test scalar evolution directly instead of testing the result of canonical indvars. llvm-svn: 153256	2012-03-22 17:09:31 +00:00
Eli Bendersky	924f9a671d	Replace all instances of dg.exp file with lit.local.cfg, since all tests are run with LIT now and now Dejagnu. dg.exp is no longer needed. Patch reviewed by Daniel Dunbar. It will be followed by additional cleanup patches. llvm-svn: 150664	2012-02-16 06:28:33 +00:00
Andrew Trick	d25089f8e0	SCEV fix. In general, Add/Mul expressions should not inherit NSW/NUW. This reverts r139450, fixes r139453, and adds much needed comments and a unit test. llvm-svn: 145367	2011-11-29 02:16:38 +00:00
Andrew Trick	5ec136c57e	Filecheckize. llvm-svn: 145363	2011-11-29 02:05:23 +00:00
Nick Lewycky	0485d51a76	Don't forget to check FlagNW when determining whether an AddRecExpr will wrap or not. Patch by Brendon Cahoon! llvm-svn: 144173	2011-11-09 07:11:37 +00:00
Benjamin Kramer	652f576a70	2>&1 doesn't work here, it just creates an empty file called "&1" llvm-svn: 143117	2011-10-27 18:27:45 +00:00
Duncan Sands	a370f3e34e	Restore commits 142790 and 142843 - they weren't breaking the build bots. Original commit messages: - Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } - Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142919	2011-10-25 12:28:52 +00:00
Duncan Sands	805c5b92c8	Speculatively revert commits 142790 and 142843 to see if it fixes the dragonegg and llvm-gcc self-host buildbots. Original commit messages: - Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } - Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142916	2011-10-25 09:26:43 +00:00
Nick Lewycky	a58fb48a55	Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142843	2011-10-24 21:02:38 +00:00
Nick Lewycky	9be7f277e4	Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } llvm-svn: 142790	2011-10-24 06:57:05 +00:00
Nick Lewycky	9d28c26d77	Speculatively revert r142781. Bots are showing Assertion `i_nocapture < OperandTraits<PHINode>::operands(this) && "getOperand() out of range!"' failed. coming out of indvars. llvm-svn: 142786	2011-10-24 04:00:25 +00:00
Nick Lewycky	1700007ecc	Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } llvm-svn: 142781	2011-10-23 23:43:14 +00:00
Nick Lewycky	a6674c7fc9	Make SCEV's brute force analysis stronger in two ways. Firstly, we should be able to constant fold load instructions where the argument is a constant. Second, we should be able to watch multiple PHI nodes through the loop; this patch only supports PHIs in loop headers, more can be done here. With this patch, we now constant evaluate: static const int arr[] = {1, 2, 3, 4, 5}; int test() { int sum = 0; for (int i = 0; i < 5; ++i) sum += arr[i]; return sum; } llvm-svn: 142731	2011-10-22 19:58:20 +00:00
Andrew Trick	887a111e31	Missing test case for r141164. llvm-svn: 141166	2011-10-05 06:23:32 +00:00
Nick Lewycky	3155552461	Reapply r140979 with fix! We never did get a testcase, but careful review of the logic by David Meyer revealed this bug. llvm-svn: 140992	2011-10-03 07:10:45 +00:00
Nick Lewycky	b1dbce1406	Revert r140979 due to reports of bootstrap failure. llvm-svn: 140980	2011-10-03 05:14:59 +00:00
Nick Lewycky	3c624b8d0d	Add one more case we compute a max trip count. llvm-svn: 140979	2011-10-03 01:03:57 +00:00
Andrew Trick	57d8afde93	This test only makes sense with -enable-iv-rewrite. llvm-svn: 139576	2011-09-13 02:45:26 +00:00
Nick Lewycky	e0aa54bb98	This transform only handles two-operand AddRec's. Prevent it from trying to handle anything more complex. Fixes PR10383 again! llvm-svn: 139186	2011-09-06 21:42:18 +00:00
Nick Lewycky	658bdb5133	The logic inside getMulExpr to simplify {a,+,b}*{c,+,d} was wrong, which was visible given a=b=c=d=1, on iteration #1 (the second iteration). Replace it with correct math. Fixes PR10383! llvm-svn: 139133	2011-09-06 05:05:14 +00:00
Nick Lewycky	b1438c763a	Revert r139126 due to selfhost failures reported by buildbots. llvm-svn: 139130	2011-09-06 02:43:13 +00:00
Nick Lewycky	c4c43fbb07	Teach SCEV to report a max backedge count in one interesting case in HowFarToZero; the case for a canonical loop. llvm-svn: 139126	2011-09-05 23:25:16 +00:00
Chris Lattner	8936d2bfbc	Remove support for parsing the "type i32" syntax for defining a numbered top level type without a specified number. This syntax isn't documented and blocks forward progress. llvm-svn: 133371	2011-06-19 00:03:46 +00:00
Chris Lattner	80ed9dc9e5	rip out a ton of intrinsic modernization logic from AutoUpgrade.cpp, which is for pre-2.9 bitcode files. We keep x86 unaligned loads, movnt, crc32, and the target indep prefetch change. As usual, updating the testsuite is a PITA. llvm-svn: 133337	2011-06-18 06:05:24 +00:00
Chris Lattner	def1949c00	Remove support for using "foo" as symbols instead of %"foo". This is ancient syntax and has been long obsolete. As usual, updating the tests is the nasty part of this. llvm-svn: 133242	2011-06-17 06:36:20 +00:00
Andrew Trick	01eff820ae	Test case and comment for PR9633. llvm-svn: 130294	2011-04-27 05:42:17 +00:00
Andrew Trick	f6b01ff422	Propagate SCEV no-wrap flags whenever possible. This needs review. llvm-svn: 127638	2011-03-15 00:37:00 +00:00
Andrew Trick	2afa325811	When SCEV can determine the loop test is X < X, set ExactBECount=0. When ExactBECount is a constant, use it for MaxBECount. When MaxBECount cannot be computed, replace it with ExactBECount. Fixes PR9424. llvm-svn: 127342	2011-03-09 17:29:58 +00:00
Chris Lattner	4f23f2be15	teach SCEV that the scale and addition of an inbounds gep don't NSW. This fixes a FIXME in scev-aa.ll (allowing a new no-alias result) and generally makes things more precise. llvm-svn: 125449	2011-02-13 03:14:49 +00:00
Chris Lattner	7936a8a488	Per discussion with Dan G, inbounds geps certainly can have unsigned overflow (e.g. "gep P, -1"), and while they can have signed wrap in theoretical situations, modelling an AddRec as not having signed wrap is going enough for any case we can think of today. In the future if this isn't enough, we can revisit this. Modeling them as having NUW isn't causing any known problems either FWIW. llvm-svn: 125410	2011-02-11 21:43:33 +00:00
Nick Lewycky	d4192f71b5	Simplify some code with no functionality change. Make the test a lot more robust against smarter optimizations, using the power of FileCheck. llvm-svn: 124081	2011-01-23 20:06:05 +00:00
Nick Lewycky	bc98f5b78e	Use value ranges to fold ext(trunc) in SCEV when possible. llvm-svn: 124062	2011-01-23 06:20:19 +00:00
Tobias Grosser	f07426b40d	Implement requiredTransitive The PassManager did not implement the transitivity of requiredTransitive. This was unnoticed since 2006. llvm-svn: 123942	2011-01-20 21:03:22 +00:00
Nick Lewycky	5c901f3489	Similarly, analyze truncate through multiply. llvm-svn: 123842	2011-01-19 18:56:00 +00:00
Nick Lewycky	5143f0f09b	Add a missed SCEV fold that is required to continue analyzing the IR produced by indvars through the scev expander. trunc(add x, y) --> add(trunc x, y). Currently SCEV largely folds the other way which is probably wrong, but preserved to minimize churn. Instcombine doesn't do this fold either, demonstrating a missed optz'n opportunity on code doing add+trunc+add. llvm-svn: 123838	2011-01-19 16:59:46 +00:00
Nick Lewycky	e9ea75e3fc	Add a missing SCEV simplification sext(zext x) --> zext x. llvm-svn: 123832	2011-01-19 15:56:12 +00:00
Eric Christopher	31bb4c5811	Revert the testcase from the previous reverted commit. llvm-svn: 123227	2011-01-11 09:20:44 +00:00
Chris Lattner	1032965cbe	add a testcase I missed in previous commit. llvm-svn: 123143	2011-01-09 23:52:31 +00:00
Chris Lattner	10223a3fbf	teach SCEV analysis of PHI nodes that PHI recurences formed with GEP instructions are always NUW, because PHIs cannot wrap the end of the address space. llvm-svn: 123105	2011-01-09 02:28:48 +00:00
Chris Lattner	a337f5ec5c	reduce indentation. Print <nuw> and <nsw> when dumping SCEV AddRec's that have the bit set. llvm-svn: 123104	2011-01-09 02:16:18 +00:00
Chris Lattner	bb451461ec	remove some noise from tests. llvm-svn: 112889	2010-09-02 22:35:33 +00:00
Dan Gohman	f7495f286a	When analyzing loop exit conditions combined with and and or, don't make any assumptions about when the two conditions will agree on when to permit the loop to exit. This fixes PR7845. llvm-svn: 110758	2010-08-11 00:12:36 +00:00
Dan Gohman	c0cca7fdda	Revert the part of r107257 which introduced new logic for using nsw and nuw flags from IR Instructions. On further consideration, this isn't valid. llvm-svn: 107298	2010-06-30 17:27:11 +00:00
Dan Gohman	725ed0364b	Add a testcase for scev-aa's new capability. llvm-svn: 107258	2010-06-30 07:17:47 +00:00
Dan Gohman	f820bd327d	Allow "exhaustive" trip count evaluation on phi nodes with all constant operands. llvm-svn: 106537	2010-06-22 13:15:46 +00:00
Dan Gohman	866971ed3d	Fix ScalarEvolution's "exhaustive" trip count evaluation code to avoid assuming that loops are in canonical form, as ScalarEvolution doesn't depend on LoopSimplify itself. Also, with indirectbr not all loops can be simplified. This fixes PR7416. llvm-svn: 106389	2010-06-19 14:17:24 +00:00
Dan Gohman	24ceda8eb0	Revert r106304 (105548 and friends), which are the SCEVComplexityCompare optimizations. There is still some nondeterminism remaining. llvm-svn: 106306	2010-06-18 19:54:20 +00:00
Dan Gohman	4c807fca97	Reapply 105540, 105542, and 105548, and revert r105732. llvm-svn: 106304	2010-06-18 19:26:04 +00:00
Daniel Dunbar	e16d569932	Workaround SCEV non-determinism on this test, for now, to get buildbots back to green. Dan, please revert this once the real problem is fixed. llvm-svn: 105732	2010-06-09 17:54:40 +00:00
Dan Gohman	70910a6ab6	Optimize ScalarEvolution's SCEVComplexityCompare predicate: don't go scrounging through SCEVUnknown contents and SCEVNAryExpr operands; instead just do a simple deterministic comparison of the precomputed hash data. Also, since this is more precise, it eliminates the need for the slow N^2 duplicate detection code. llvm-svn: 105540	2010-06-07 19:06:13 +00:00
Dan Gohman	d07d2f9774	Add a comment to this test. llvm-svn: 102387	2010-04-26 21:37:43 +00:00
Dan Gohman	f33bac3afe	ScalarEvolution support for <= and >= loops. Also, generalize ScalarEvolutions's min and max recognition to handle some new forms of min and max that this change makes more common. llvm-svn: 102234	2010-04-24 03:09:42 +00:00
Dan Gohman	acd700a24b	Don't attempt to analyze values which are obviously undef. This fixes some assertion failures in extreme cases. llvm-svn: 102042	2010-04-22 01:35:11 +00:00
Dan Gohman	6635bb26a6	Generalize ScalarEvolution's PHI analysis to handle loops that don't have preheaders or dedicated exit blocks, as clients may not otherwise need to run LoopSimplify. llvm-svn: 101030	2010-04-12 07:49:36 +00:00
Dan Gohman	69451a0950	Avoid analyzing instructions in blocks not reachable from the entry block. They are lots of trouble, and they don't matter. This fixes PR6559. llvm-svn: 98103	2010-03-09 23:46:50 +00:00
Dan Gohman	6b1e2a829d	Teach ScalarEvolution how to compute a tripcount for a loop with true or false as its exit condition. These are usually eliminated by SimplifyCFG, but the may be left around during a pass which wishes to preserve the CFG. llvm-svn: 96683	2010-02-19 18:12:07 +00:00
Dan Gohman	80386c10d4	-disable-output is no longer needed with -analyze. llvm-svn: 94574	2010-01-26 19:25:59 +00:00
Dan Gohman	51aaf02821	Fix the the ceiling-division used in computing the MaxBECount so that it doesn't have trouble with an intermediate add overflowing. Also, be more conservative about the case where the induction variable in an SLT loop exit can step past the RHS of the SLT and overflow in a single step. Make getSignedRange more aggressive, to recover for some common cases which the above fixes pessimized. This addresses rdar://7561161. llvm-svn: 94512	2010-01-26 04:40:18 +00:00
Dan Gohman	bc694918cc	Use WriteAsOperand instead of getName() to print loop header names, so that unnamed blocks are handled. llvm-svn: 93059	2010-01-09 18:17:45 +00:00
Dan Gohman	03f90ab0a9	Add a comment about A[i+(j+1)]. llvm-svn: 90185	2009-12-01 01:38:10 +00:00
Chris Lattner	ba0014a44c	update status of this. basicaa is much improved now, only missing the one form (in this testcase). Dan, do you consider this example to be important? llvm-svn: 89953	2009-11-26 16:42:00 +00:00
Kenneth Uildriks	90fedc6ef9	Make opt default to not adding a target data string and update tests that depend on target data to supply it within the test llvm-svn: 85900	2009-11-03 15:29:06 +00:00
Edward O'Callaghan	f96ce30236	Convert Analysis tests to FileCheck in regards to PR5307. llvm-svn: 85241	2009-10-27 14:54:46 +00:00
Dan Gohman	36bad00bef	Teach ScalarEvolution how to reason about no-wrap flags on loops where the induction variable has a non-unit stride, such as {0,+,2}, and there are expressions such as {1,+,2} inside the loop formed with or or add nsw operators. llvm-svn: 82151	2009-09-17 18:05:20 +00:00
Dan Gohman	1880092722	Change tests from "opt %s" to "opt < %s" so that opt doesn't see the input filename so that opt doesn't print the input filename in the output so that grep lines in the tests don't unintentionally match strings in the input filename. llvm-svn: 81537	2009-09-11 18:01:28 +00:00
Dan Gohman	72a13d2476	Use opt -S instead of piping bitcode output through llvm-dis. llvm-svn: 81257	2009-09-08 22:34:10 +00:00
Dan Gohman	9737a63ed8	Change these tests to feed the assembly files to opt directly, instead of using llvm-as, now that opt supports this. llvm-svn: 81226	2009-09-08 16:50:01 +00:00
Dan Gohman	d926b985df	Create a ScalarEvolution-based AliasAnalysis implementation. This is a simple AliasAnalysis implementation which works by making ScalarEvolution queries. ScalarEvolution has a more complete understanding of arithmetic than BasicAA's collection of ad-hoc checks, so it handles some cases that BasicAA misses, for example p[i] and p[i+1] within the same iteration of a loop. This is currently experimental. It may be that the main use for this pass will be to help find cases where BasicAA can be profitably extended, or to help in the development of the overall AliasAnalysis infrastructure, however it's also possible that it could grow up to become a directly useful pass. llvm-svn: 80098	2009-08-26 14:53:06 +00:00
Dan Gohman	463d3407e2	Loosen up the regex for this test so that it doesn't implicitly depend on TargetData information. llvm-svn: 79491	2009-08-19 23:19:36 +00:00
Dan Gohman	e274526d78	Make LLVM Assembly dramatically easier to read by aligning the comments, using formatted_raw_ostream's PadToColumn. Before: bb1: ; preds = %bb %2 = sext i32 %i.01 to i64 ; <i64> [#uses=1] %3 = getelementptr double* %p, i64 %2 ; <double> [#uses=1] %4 = load double %3, align 8 ; <double> [#uses=1] %5 = fmul double %4, 1.100000e+00 ; <double> [#uses=1] %6 = sext i32 %i.01 to i64 ; <i64> [#uses=1] %7 = getelementptr double* %p, i64 %6 ; <double> [#uses=1] After: bb1: ; preds = %bb %2 = sext i32 %i.01 to i64 ; <i64> [#uses=1] %3 = getelementptr double %p, i64 %2 ; <double> [#uses=1] %4 = load double %3, align 8 ; <double> [#uses=1] %5 = fmul double %4, 1.100000e+00 ; <double> [#uses=1] %6 = sext i32 %i.01 to i64 ; <i64> [#uses=1] %7 = getelementptr double* %p, i64 %6 ; <double*> [#uses=1] Several tests required whitespace adjustments. llvm-svn: 78816	2009-08-12 17:23:50 +00:00
Dan Gohman	9c7f808201	Change the assembly syntax for nsw, nuw, and exact, putting them after their associated opcodes rather than before. This makes them a little easier to read. llvm-svn: 77194	2009-07-27 16:11:46 +00:00
Dan Gohman	534d66a426	When attempting to sign-extend an addrec by interpreting the step value as unsigned, the start value and the addrec itself still need to be treated as signed. llvm-svn: 77078	2009-07-25 16:03:30 +00:00
Dan Gohman	62ef6a7f1c	Teach ScalarEvolution to make use of no-overflow flags when analyzing add recurrences. llvm-svn: 77034	2009-07-25 01:22:26 +00:00
Dan Gohman	430f0cc544	Replace the original ad-hoc code for determining whether (v pred w) implies (x pred y) with more thorough code that does more complete canonicalization before resorting to range checks. This helps it find more cases where the canonicalized expressions match. llvm-svn: 76671	2009-07-21 23:03:19 +00:00
Dan Gohman	52e14d2272	Add a testcase for PR4569, which is now fixed. llvm-svn: 76526	2009-07-21 00:50:52 +00:00
Dan Gohman	054d2a7837	Add testcases for PR4538, PR4537, and PR4534. llvm-svn: 75533	2009-07-13 22:30:31 +00:00
Nick Lewycky	3292908132	When comparing constants, consider a less wide constant to be "less complex" than a wider one, before trying to compare their contents which will crash if their sizes are different. llvm-svn: 74792	2009-07-04 17:24:52 +00:00
Dan Gohman	5f71a2886a	Add a testcase demoing some of ScalarEvolution's new trip count logic. llvm-svn: 74049	2009-06-24 01:22:30 +00:00
Dan Gohman	53efeb0e45	Fix a bug in the trip-count computation with And/Or. If either of the sides is CouldNotCompute, the resulting exact count must be CouldNotCompute. llvm-svn: 73920	2009-06-22 23:28:56 +00:00
Dan Gohman	2636693a3c	Fix llvm::ComputeNumSignBits to handle pointer types conservatively correctly, instead of aborting. llvm-svn: 73908	2009-06-22 22:02:32 +00:00
Dan Gohman	96212b661c	Teach ScalarEvolution how to analyze loops with multiple exit blocks, and also exit blocks with multiple conditions (combined with (bitwise) ands and ors). It's often infeasible to compute an exact trip count in such cases, but a useful upper bound can often be found. llvm-svn: 73866	2009-06-22 00:31:57 +00:00
Dan Gohman	0104842ee3	Fix ScalarEvolution's backedge-taken count computations to check for overflow when computing a integer division to round up. Thanks to Nick Lewycky for noticing this! llvm-svn: 73862	2009-06-21 23:46:38 +00:00
Dan Gohman	eddf77123a	Teach ScalarEvolution how to recognize another xor(and(x, C), C) case. If C is a single bit and the and gets analyzed as a truncate and zero-extend, the xor can be represnted as an add. llvm-svn: 73664	2009-06-18 00:00:20 +00:00
Dan Gohman	432af7ace0	Add -disable-output to a bunch of tests that don't care about the output. llvm-svn: 73633	2009-06-17 20:56:26 +00:00
Dan Gohman	b50f5a46e0	Fix ScalarEvolution's Xor handling to not assume that an And that gets recognized with a SCEVZeroExtendExpr must be an And with a low-bits mask. With r73540, this is no longer the case. llvm-svn: 73594	2009-06-17 01:22:39 +00:00
Dan Gohman	a5b9645c4b	Split the Add, Sub, and Mul instruction opcodes into separate integer and floating-point opcodes, introducing FAdd, FSub, and FMul. For now, the AsmParser, BitcodeReader, and IRBuilder all preserve backwards compatability, and the Core LLVM APIs preserve backwards compatibility for IR producers. Most front-ends won't need to change immediately. This implements the first step of the plan outlined here: http://nondot.org/sabre/LLVMNotes/IntegerOverflow.txt llvm-svn: 72897	2009-06-04 22:49:04 +00:00
Dan Gohman	6350296efc	Teach ScalarEvolution to recognize x^-1 in the case where non-demanded bits have been stripped out by instcombine. llvm-svn: 72010	2009-05-18 16:29:04 +00:00
Dan Gohman	8c77f1a275	Make ScalarEvolution::isLoopGuardedByCond work even when the edge entering a loop is a non-split critical edge. llvm-svn: 72004	2009-05-18 15:36:09 +00:00
Dan Gohman	b81dd48fd2	Add nounwind to a few tests. llvm-svn: 72002	2009-05-18 15:16:49 +00:00
Eli Friedman	ebf98b0212	Allow scalar evolution to compute iteration counts for loops with a pointer-based condition. This fixes PR3171. llvm-svn: 71354	2009-05-09 12:32:42 +00:00
Dan Gohman	35dc9b65ed	Fix bogus overflow checks by replacing them with actual overflow checks. llvm-svn: 71284	2009-05-08 23:11:16 +00:00
Dan Gohman	2e55cc5a4a	Fold trunc casts into add-recurrence expressions, allowing the add-recurrence to be exposed. Add a new SCEV folding rule to help simplify expressions in the presence of these extra truncs. llvm-svn: 71264	2009-05-08 21:03:19 +00:00
Dan Gohman	7227bc88f0	When printing a SCEVUnknown with pointer type, don't print an artificial "ptrtoint", as it tends to clutter up complicated expressions. The cast operators now print both source and destination types, which is usually sufficient. llvm-svn: 70554	2009-05-01 17:02:22 +00:00
Dan Gohman	2b8da35f9d	Extend ScalarEvolution's getBackedgeTakenCount to be able to compute an upper-bound value for the trip count, in addition to the actual trip count. Use this to allow getZeroExtendExpr and getSignExtendExpr to fold casts in more cases. This may eventually morph into a more general value-range analysis capability; there are certainly plenty of places where more complete value-range information would allow more folding. llvm-svn: 70509	2009-04-30 20:47:05 +00:00
Dan Gohman	494dac3f84	Generalize the cast-of-addrec folding to handle folding of SCEVs like (sext i8 {-128,+,1} to i64) to i64 {-128,+,1}, where the iteration crosses from negative to positive, but is still safe if the trip count is within range. llvm-svn: 70421	2009-04-29 22:28:28 +00:00
Dan Gohman	d9775a3be1	Fix this test to match the new output from scalar-evolution. llvm-svn: 70410	2009-04-29 21:06:20 +00:00
Dan Gohman	d9b11b2ef4	Include the source type in SCEV cast expression debug output, and print sext, zext, and trunc, instead of signextend, zeroextend, and truncate, respectively, for consistency with the main IR. llvm-svn: 70405	2009-04-29 20:27:52 +00:00
Dan Gohman	807dff7486	Fix a grammaro in a comment. llvm-svn: 70331	2009-04-28 21:54:23 +00:00
Nick Lewycky	b4d9f7a9b3	Simplify trunc(extend(x)) in SCEVs, just for completeness. Also fix some odd whitespace in the same file. llvm-svn: 69870	2009-04-23 05:15:08 +00:00
Owen Anderson	7d82244be7	Testcase for PR3909. llvm-svn: 69868	2009-04-23 04:33:42 +00:00
Dan Gohman	e14efcc9f4	When turning (ashr(shl(x, n), n)) into sext(trunc(x)), the width of the type to truncate to should be the number of bits of the value that are preserved, not the number that are clobbered with sign-extension. This fixes regressions in ldecod. llvm-svn: 69704	2009-04-21 20:18:36 +00:00
Dan Gohman	0bddac16a8	Rename ScalarEvolution's getIterationCount to getBackedgeTakenCount, to more accurately describe what it does. Expand its doxygen comment to describe what the backedge-taken count is and how it differs from the actual iteration count of the loop. Adjust names and comments in associated code accordingly. llvm-svn: 65382	2009-02-24 18:55:53 +00:00
Nick Lewycky	52348300a4	Wind SCEV back in time, to Nov 18th. This 'fixes' PR3275, PR3294, PR3295, PR3296 and PR3302. llvm-svn: 62160	2009-01-13 09:18:58 +00:00
Nick Lewycky	380292a51a	Don't try to analyze this "backward" case. This is overly conservative pending a correct solution. llvm-svn: 61589	2009-01-02 18:54:17 +00:00
Nick Lewycky	69c9aa4ce5	Generalize support for analyzing loops to include SLE/SGE loop exit conditions and support for non-unit strides with signed exit conditions. llvm-svn: 61082	2008-12-16 08:30:01 +00:00
Nick Lewycky	729bf137a8	Revert my re-instated reverted commit, fixes the bootstrap build on x86-64 linux. llvm-svn: 60951	2008-12-12 17:09:07 +00:00
Nick Lewycky	6a344e097c	Sneaky, sneaky: move the -1 to the outside of the SMax. Reinstate the optimization of SGE/SLE with unit stride, now that it works properly. llvm-svn: 60881	2008-12-11 17:40:14 +00:00
Evan Cheng	058522f1da	xfail this for now. llvm-svn: 60777	2008-12-09 18:43:00 +00:00
Nick Lewycky	f545749f2b	It's easy to handle SLE/SGE when the loop has a unit stride. llvm-svn: 60748	2008-12-09 07:25:04 +00:00
Nick Lewycky	1c451ae43e	Add a utility function that detects whether a loop is guaranteed to be finite. Use it to safely handle less-than-or-equals-to exit conditions in loops. These also occur when the loop exit branch is exit on true because SCEV inverses the icmp predicate. Use it again to handle non-zero strides, but only with an unsigned comparison in the exit condition. llvm-svn: 59528	2008-11-18 15:10:54 +00:00
Nick Lewycky	625c6f79b2	Don't brute-force analyze cubic or higher polynomials. If this patch causes a performance regression for anyone, please let me know, and it can be fixed in a different way with much more effort. llvm-svn: 59384	2008-11-16 04:14:25 +00:00
Nick Lewycky	7b14e20a5e	Don't crash analyzing certain quadratics (addrec of {X,+,Y,+,1}). We're still waiting on code that actually analyzes them properly. llvm-svn: 58592	2008-11-03 02:43:49 +00:00
Dan Gohman	dc5f5cbe59	Finally re-apply r46959. This is made feasible by the combination of r56230, r56232, and r56246. llvm-svn: 56247	2008-09-16 18:52:57 +00:00
Dan Gohman	162568842e	Fix spacing in the grep line for this test, following the recent SCEV-whitespace changes. llvm-svn: 56234	2008-09-16 01:37:08 +00:00
Dan Gohman	f9081a2cd5	Teach ScalarEvolution to consider loop preheaders in the search for an if statement that guards a loop, to allow indvars to avoid smax operations in more situations. llvm-svn: 56232	2008-09-15 22:18:04 +00:00
Dan Gohman	81313fd8d1	Fix WriteAsOperand to not emit a leading space character. Adjust its callers to emit a space character before calling it when a space is needed. This fixes several spurious whitespace issues in ScalarEvolution's debug dumps. See the test changes for examples. This also fixes odd space-after-tab indentation in the output for switch statements, and changes calls from being printed like this: call void @foo( i32 %x ) to this: call void @foo(i32 %x) llvm-svn: 56196	2008-09-14 17:21:12 +00:00
Dan Gohman	2a62fd96a6	Extend ScalarEvolution's executesAtLeastOnce logic to be able to continue past the first conditional branch when looking for a relevant test. This helps it avoid using MAX expressions in loop trip counts in more cases. llvm-svn: 54697	2008-08-12 20:17:31 +00:00
Eli Friedman	61f67624c3	PR2621: Improvements to the SCEV AddRec binomial expansion. This version uses a new algorithm for evaluating the binomial coefficients which is significantly more efficient for AddRecs of more than 2 terms (see the comments in the code for details on how the algorithm works). It also fixes some bugs: it removes the arbitrary length restriction for AddRecs, it fixes the silent generation of incorrect code for AddRecs which require a wide calculation width, and it fixes an issue where we were incorrectly truncating the iteration count too far when evaluating an AddRec expression narrower than the induction variable. There are still a few related issues I know of: I think there's still an issue with the SCEVExpander expansion of AddRec in terms of the width of the induction variable used. The hack to avoid generating too-wide integers shouldn't be necessary; instead, the callers should be considering the cost of the expansion before expanding it (in addition to not expanding too-wide integers, we might not want to expand expressions that are really expensive, especially when optimizing for size; calculating an length-17 32-bit AddRec currently generates about 250 instructions of straight-line code on X86). Also, for long 32-bit AddRecs on X86, CodeGen really sucks at scheduling the code. I'm planning on filing follow-up PRs for these issues. llvm-svn: 54332	2008-08-04 23:49:06 +00:00
Eli Friedman	4736916aa6	Another SCEV issue from PR2607; essentially the same issue, but this time applying to the implicit comparison in smin expressions. The correct way to transform an inequality into the opposite inequality, either signed or unsigned, is with a not expression. I looked through the SCEV code, and I don't think there are any more occurrences of this issue. llvm-svn: 54194	2008-07-30 04:36:32 +00:00
Eli Friedman	5ae90441c4	Fix for PR2607: SCEV miscomputing the loop count for loops with an SGT exit condition. Essentially, the correct way to flip an inequality in 2's complement is the not operator, not the negation operator. That said, the difference only affects cases involving INT_MIN. Also, enhance the pre-test search logic to be a bit smarter about inequalities flipped with a not operator, so it can eliminate the smax from the iteration count for simple loops. llvm-svn: 54184	2008-07-30 00:04:08 +00:00
Wojciech Matyjewicz	f0d21cdd19	Fix PR2088. Use modulo linear equation solver to compute loop iteration count. llvm-svn: 53810	2008-07-20 15:55:14 +00:00
Nick Lewycky	82510bf0e3	XFAIL this test. llvm-svn: 53793	2008-07-19 15:52:06 +00:00
Wojciech Matyjewicz	a78669c2cf	While testing particular algorithms to compute loop iteration count the brute force evaluation (ComputeIterationCountExhaustively) should be turned off. It doesn't apply to trip-count2.ll because this file tests the brute force evaluation. The test for PR2364 (2008-05-25-NegativeStepToZero.ll) currently fails showing that the patch for this bug doesn't work. I'll fix it in a few hours with a patch for PR2088. llvm-svn: 53792	2008-07-19 13:26:15 +00:00
Nick Lewycky	b5688ccf57	Stop creating extraneous smax/umax in SCEV. This removes a regression where we started complicating many loops ('for' loops, in fact). llvm-svn: 53508	2008-07-12 07:41:32 +00:00
Nick Lewycky	ed169d531d	Crash less. The i64 restriction in BinomialCoefficient caused some problems with code that was expecting different bit widths for different values. Make getTruncateOrZeroExtend a method on ScalarEvolution, and use it. llvm-svn: 52248	2008-06-13 04:38:55 +00:00
Nick Lewycky	a61cc6ece0	Whoops -- forgot PR reference on this test. llvm-svn: 51569	2008-05-26 20:23:33 +00:00
Nick Lewycky	be993358a7	Use {} instead of "" in RUN lines. llvm-svn: 51561	2008-05-26 01:27:08 +00:00
Nick Lewycky	3195b393d6	Don't treat values as signed when looking at loop steppings in HowForToNonZero. llvm-svn: 51560	2008-05-25 23:43:32 +00:00
Evan Cheng	01d6257e81	Temporarily reverting 46959. llvm-svn: 47542	2008-02-25 03:57:32 +00:00
Nick Lewycky	1c44ebcf86	Add 'umax' similar to 'smax' SCEV. Closes PR2003. Parse reversed smax and umax as smin and umin and express them with negative or binary-not SCEVs (which are really just subtract under the hood). Parse 'xor %x, -1' as (-1 - %x). Remove dead code (ConstantInt::get always returns a ConstantInt). Don't use getIntegerSCEV(-1, Ty). The first value is an int, then it gets passed into a uint64_t. Instead, create the -1 directly from ConstantInt::getAllOnesValue(). llvm-svn: 47360	2008-02-20 06:48:22 +00:00
Wojciech Matyjewicz	ddb265b905	Now that ScalarEvolution::print writes to the correct stream, there is no need to redirect stderr into stdout. llvm-svn: 47009	2008-02-12 15:12:40 +00:00
Wojciech Matyjewicz	995624f44d	Change negative grep into positive one in my yesterday's testcase. llvm-svn: 47008	2008-02-12 15:10:35 +00:00
Wojciech Matyjewicz	1d2c27b23e	Fix PR2002. Suppose n is the initial value for the induction variable (with step 1) and m is its final value. Then, the correct trip count is SMAX(m,n)-n. Previously, we used SMAX(0,m-n), but m-n may overflow and can't in general be interpreted as signed. Patch by Nick Lewycky. llvm-svn: 47007	2008-02-12 15:09:36 +00:00
Wojciech Matyjewicz	adae053b53	If the LHS of the comparison is a loop-invariant we also want to move it to the RHS. This simple change allows to compute loop iteration count for loops with condition similar to the one in the testcase (which seems to be quite common). llvm-svn: 46959	2008-02-11 18:37:34 +00:00
Wojciech Matyjewicz	d2d9764cc8	Fix PR1798 - an error in the evaluation of SCEVAddRecExpr at an arbitrary iteration. The patch: 1) changes SCEVSDivExpr into SCEVUDivExpr, 2) replaces PartialFact() function with BinomialCoefficient(); the computations (essentially, the division) in BinomialCoefficient() are performed with the apprioprate bitwidth necessary to avoid overflow; unsigned division is used instead of the signed one. Computations in BinomialCoefficient() require support from the code generator for APInts. Currently, we use a hack rounding up the neccessary bitwidth to the nearest power of 2. The hack is easy to turn off in future. One remaining issue: we assume the divisor of the binomial coefficient formula can be computed accurately using 16 bits. It means we can handle AddRecs of length up to 9. In future, we should use APInts to evaluate the divisor. Thanks to Nicholas for cooperation! llvm-svn: 46955	2008-02-11 11:03:14 +00:00
Tanya Lattner	8f342f8ef3	Fix bug in regression tests that ignored stderr output in RUN lines. Updated tests and fixed broken run lines. XFAILed 3 arm regressions (will file bugs) llvm-svn: 44389	2007-11-28 04:57:00 +00:00
Dan Gohman	2dba0788a5	Change grep '' to grep {}. Change 2>&1 \| to \|&. llvm-svn: 44344	2007-11-27 00:10:35 +00:00
Nick Lewycky	cdb7e54ca7	Add new SCEV, SCEVSMax. This allows LLVM to analyze do-while loops. llvm-svn: 44319	2007-11-25 22:41:31 +00:00
Nick Lewycky	5b18bd3368	Be more careful when transforming \| to +. Patch from Wojciech Matyjewicz. llvm-svn: 44248	2007-11-20 08:24:44 +00:00
Anton Korobeynikov	6a7ddfdb8f	Reverted r44163 per request llvm-svn: 44177	2007-11-15 18:33:16 +00:00
Nick Lewycky	fbb24817cc	Fix handling of overflow in loop calculation by adding new UDiv SCEV. This SCEV is disabled in the sense that it will refuse to create one from a UDiv instruction, until the code is better tested. llvm-svn: 44163	2007-11-15 06:30:50 +00:00
Nick Lewycky	3934961878	Build the correct range for loops with unusual bounds. Fix from Jay Foad. llvm-svn: 42394	2007-09-27 14:12:54 +00:00
Nick Lewycky	76fb226fd4	Add reference to problem report. llvm-svn: 40889	2007-08-07 12:27:03 +00:00
Nick Lewycky	f921f1bc4a	Fix the dates on these tests. It's not September yet. Thanks Reid! llvm-svn: 40869	2007-08-06 20:00:11 +00:00
Nick Lewycky	96606cec20	Let scalar-evolution analyze loops with an unsigned comparison for the exit condition. Fixes 1597. llvm-svn: 40867	2007-08-06 19:21:00 +00:00
Nick Lewycky	b9819f3a8b	Don't assume it's safe to transform a loop just because it's dominated by any comparison. Fixes bug 1598. llvm-svn: 40866	2007-08-06 18:33:46 +00:00
Nick Lewycky	5246026c8c	Handle decrementing loops properly. Fixes PR1533. Always pass the constant as the second parameter to HowManyLessThans. Remove obsolete "isSigned" parameter. llvm-svn: 39893	2007-07-16 02:08:00 +00:00
John Criswell	2660cef6d7	Convert .cvsignore files llvm-svn: 37801	2007-06-29 16:35:07 +00:00
Zhou Sheng	6d207a761f	Add two test cases to cover apintification change. llvm-svn: 36476	2007-04-26 16:44:48 +00:00
Reid Spencer	5a77babd81	For PR1319: Upgrade to use new Tcl exec based test harness. llvm-svn: 36066	2007-04-15 09:31:07 +00:00
Reid Spencer	d029c7e666	Make the llvm-runtest function much more amenable by eliminating all the global variables that needed to be passed in. This makes it possible to add new global variables with only a couple changes (Makefile and llvm-dg.exp) instead of touching every single dg.exp file. llvm-svn: 35918	2007-04-11 19:56:59 +00:00
Reid Spencer	44259a29c0	Remove use of implementation keyword. llvm-svn: 35412	2007-03-28 02:38:26 +00:00
Reid Spencer	af6a408117	For PR411: Update these tests to not use the same name even though the type of the value differs. After PR411 hits, type planes will be gone and it will be illegal for a name to be used twice, regardless of type. llvm-svn: 33660	2007-01-30 16:16:01 +00:00
Reid Spencer	ce380568b5	For PR761: Remove "target endian/pointersize" or add "target datalayout" to make the test parse properly or set the datalayout because defaults changes. For PR645: Make global names use the @ prefix. For llvm-upgrade changes: Fix test cases or completely remove use of llvm-upgrade for test cases that cannot survive the new renaming or upgrade capabilities. llvm-svn: 33533	2007-01-26 08:25:06 +00:00
Reid Spencer	83b3d82672	Regression is gone, don't try to find it on clean target. llvm-svn: 33296	2007-01-17 07:59:14 +00:00

... 3 4 5 6 7 ...

411 Commits