llvm-project

Commit Graph

Author	SHA1	Message	Date
Andrew Trick	7fa4e0fea6	SCEV: Add MarkPendingLoopPredicates to avoid recursive isImpliedCond. getUDivExpr attempts to simplify by checking for overflow. isLoopEntryGuardedByCond then evaluates the loop predicate which may lead to the same getUDivExpr causing endless recursion. Fixes PR12868: clang 3.2 segmentation fault. llvm-svn: 157092	2012-05-19 00:48:25 +00:00
Bill Wendling	1981c0e533	FileCheck-ize tests. llvm-svn: 155434	2012-04-24 10:45:44 +00:00
Bill Wendling	4cf911c0cd	FileCheck-ize these tests. llvm-svn: 155433	2012-04-24 10:36:42 +00:00
Bill Wendling	cd6df16cb4	FileCheck-ize these tests. Harden some of them. llvm-svn: 155432	2012-04-24 09:15:38 +00:00
Benjamin Kramer	e364d195e9	Revert "SCEV: When expanding a GEP the final addition to the base pointer has NUW but not NSW." This isn't right either, reverting for now. llvm-svn: 154910	2012-04-17 06:33:57 +00:00
Benjamin Kramer	e1f4ca1b0f	SCEV: When expanding a GEP the final addition to the base pointer has NUW but not NSW. Found by inspection. llvm-svn: 154262	2012-04-07 17:19:26 +00:00
Rafael Espindola	5054ee82cc	Handle intrinsics in GlobalsModRef. Fixes pr12351. llvm-svn: 153604	2012-03-28 21:31:24 +00:00
Andrew Trick	7004e4b95e	SCEV fix: Handle loop invariant loads. Fixes PR11882: NULL dereference in ComputeLoadConstantCompareExitLimit. llvm-svn: 153480	2012-03-26 22:33:59 +00:00
Andrew Trick	bd11257df7	Test scalar evolution directly instead of testing the result of canonical indvars. llvm-svn: 153256	2012-03-22 17:09:31 +00:00
Eli Friedman	0774902a00	Duncan pointed out that if the alignment isn't explicitly specified, it defaults to the ABI alignment. Given that, make this code a bit more aggressive in such cases. llvm-svn: 151584	2012-02-27 23:16:46 +00:00
Eli Friedman	8bc169c3c5	Teach BasicAA about the LLVM IR rules that allow reading past the end of an object given sufficient alignment. Fixes PR12098. llvm-svn: 151553	2012-02-27 20:46:07 +00:00
Rafael Espindola	94df267db3	Change the implementation of dominates(inst, inst) to one based on what the verifier does. This correctly handles invoke. Thanks to Duncan, Andrew and Chris for the comments. Thanks to Joerg for the early testing. llvm-svn: 151469	2012-02-26 02:19:19 +00:00
Eli Bendersky	924f9a671d	Replace all instances of dg.exp file with lit.local.cfg, since all tests are run with LIT now and not Dejagnu. dg.exp is no longer needed. Patch reviewed by Daniel Dunbar. It will be followed by additional cleanup patches. llvm-svn: 150664	2012-02-16 06:28:33 +00:00
Nick Lewycky	4c378a4453	Change CaptureTracking to pass a Use* instead of a Value* when a value is captured. This allows the tracker to look at the specific use, which may be especially interesting for function calls. Use this to fix 'nocapture' deduction in FunctionAttrs. The existing one does not iterate until a fixpoint and does not guarantee that it produces the same result regardless of iteration order. The new implementation builds up a graph of how arguments are passed from function to function, and uses a bottom-up walk on the argument-SCCs to assign nocapture. This gets us nocapture more often, and does so rather efficiently and independent of iteration order. llvm-svn: 147327	2011-12-28 23:24:21 +00:00
Chandler Carruth	b024aa021d	Make the unreachable probability much much heavier. The previous probability wouldn't be considered "hot" in some weird loop structures or other compounding probability patterns. This makes it much harder to confuse, but isn't really a principled fix. I'd actually like it if we could model a zero probability, as it would make this much easier to reason about. Suggestions for how to do this better are welcome. llvm-svn: 147142	2011-12-22 09:26:37 +00:00
Chandler Carruth	6b0e34c445	Manually upgrade the test suite to specify the flag to cttz and ctlz. I followed three heuristics for deciding whether to set 'true' or 'false': - Everything target independent got 'true' as that is the expected common output of the GCC builtins. - If the target arch only has one way of implementing this operation, set the flag in the way that exercises the most of codegen. For most architectures this is also the likely path from a GCC builtin, with 'true' being set. It will (eventually) require lowering away that difference, and then lowering to the architecture's operation. - Otherwise, set the flag differently depending on which target operation should be tested. Let me know if anyone has any issue with this pattern or would like specific tests of another form. This should allow the x86 codegen to just iteratively improve as I teach the backend how to differentiate between the two forms, and everything else should remain exactly the same. llvm-svn: 146370	2011-12-12 11:59:10 +00:00
Andrew Trick	d25089f8e0	SCEV fix. In general, Add/Mul expressions should not inherit NSW/NUW. This reverts r139450, fixes r139453, and adds much needed comments and a unit test. llvm-svn: 145367	2011-11-29 02:16:38 +00:00
Andrew Trick	5ec136c57e	Filecheckize. llvm-svn: 145363	2011-11-29 02:05:23 +00:00
Chris Lattner	6a144a2227	Upgrade syntax of tests using volatile instructions to use 'load volatile' instead of 'volatile load', which is archaic. llvm-svn: 145171	2011-11-27 06:54:59 +00:00
Nick Lewycky	0485d51a76	Don't forget to check FlagNW when determining whether an AddRecExpr will wrap or not. Patch by Brendon Cahoon! llvm-svn: 144173	2011-11-09 07:11:37 +00:00
Benjamin Kramer	652f576a70	2>&1 doesn't work here, it just creates an empty file called "&1" llvm-svn: 143117	2011-10-27 18:27:45 +00:00
Duncan Sands	a370f3e34e	Restore commits 142790 and 142843 - they weren't breaking the build bots. Original commit messages: - Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode *next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode *n = &node3; n != 0; n = n->next) sum += n->i; return sum; } - Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142919	2011-10-25 12:28:52 +00:00
Chandler Carruth	32f46e7c07	Fix the API usage in loop probability heuristics. It was incorrectly classifying many edges as exiting which were in fact not. These mainly formed edges into sub-loops. It was also not correctly classifying all returning edges out of loops as leaving the loop. With this match most of the loop heuristics are more rational. Several serious regressions on loop-intensive benchmarks like perlbench's loop tests when built with -enable-block-placement are fixed by these updated heuristics. Unfortunately they in turn uncover some other regressions. There are still several improvements that should be made to loop heuristics including trip-count, and early back-edge management. llvm-svn: 142917	2011-10-25 09:47:41 +00:00
Duncan Sands	805c5b92c8	Speculatively revert commits 142790 and 142843 to see if it fixes the dragonegg and llvm-gcc self-host buildbots. Original commit messages: - Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode *next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode *n = &node3; n != 0; n = n->next) sum += n->i; return sum; } - Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142916	2011-10-25 09:26:43 +00:00
Nick Lewycky	a58fb48a55	Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142843	2011-10-24 21:02:38 +00:00
Chandler Carruth	7111f4564c	Remove return heuristics from the static branch probabilities, and introduce no-return or unreachable heuristics. The return heuristics from the Ball and Larus paper don't work well in practice as they pessimize early return paths. The only good hitrate return heuristics are those for: - NULL return - Constant return - negative integer return Only the last of these three can possibly require significant code for the returning block, and even the last is fairly rare and usually also a constant. As a consequence, even for the cold return paths, there is little code on that return path, and so little code density to be gained by sinking it. The places where sinking these blocks is valuable (inner loops) will already be weighted appropriately as the edge is a loop-exit branch. All of this aside, early returns are nearly as common as all three of these return categories, and should actually be predicted as taken! Rather than muddy the waters of the static predictions, just remain silent on returns and let the CFG itself dictate any layout or other issues. However, the return heuristic was flagging one very important case: unreachable. Unfortunately it still gave a 1/4 chance of the branch-to-unreachable occurring. It also didn't do a rigorous job of finding those blocks which post-dominate an unreachable block. This patch builds a more powerful analysis that should flag all branches to blocks known to then reach unreachable. It also has better worst-case runtime complexity by not looping through successors for each block. The previous code would perform an N^2 walk in the event of a single entry block branching to N successors with a switch where each successor falls through to the next and they finally fall through to a return. Test case added for noreturn heuristics. Also doxygen comments improved along the way. llvm-svn: 142793	2011-10-24 12:01:08 +00:00
Nick Lewycky	9be7f277e4	Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode *next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode *n = &node3; n != 0; n = n->next) sum += n->i; return sum; } llvm-svn: 142790	2011-10-24 06:57:05 +00:00
Nick Lewycky	9d28c26d77	Speculatively revert r142781. Bots are showing Assertion `i_nocapture < OperandTraits<PHINode>::operands(this) && "getOperand() out of range!"' failed. coming out of indvars. llvm-svn: 142786	2011-10-24 04:00:25 +00:00
Nick Lewycky	1700007ecc	Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode *next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode *n = &node3; n != 0; n = n->next) sum += n->i; return sum; } llvm-svn: 142781	2011-10-23 23:43:14 +00:00
Chandler Carruth	1c8ace0e89	Teach the BranchProbabilityInfo pass to print its results, and use that to bring it under direct test instead of merely indirectly testing it in the BlockFrequencyInfo pass. The next step is to start adding tests for the various heuristics employed, and to start fixing those heuristics once they're under test. llvm-svn: 142778	2011-10-23 21:21:50 +00:00
Nick Lewycky	a6674c7fc9	Make SCEV's brute force analysis stronger in two ways. Firstly, we should be able to constant fold load instructions where the argument is a constant. Second, we should be able to watch multiple PHI nodes through the loop; this patch only supports PHIs in loop headers, more can be done here. With this patch, we now constant evaluate: static const int arr[] = {1, 2, 3, 4, 5}; int test() { int sum = 0; for (int i = 0; i < 5; ++i) sum += arr[i]; return sum; } llvm-svn: 142731	2011-10-22 19:58:20 +00:00
Chandler Carruth	deac50cba9	Generalize the reading of probability metadata to work for both branches and switches, with arbitrary numbers of successors. Still optimized for the common case of 2 successors for a conditional branch. Add a test case for switch metadata showing up in the BlockFrequencyInfo pass. llvm-svn: 142493	2011-10-19 10:32:19 +00:00
Chandler Carruth	d27a7a947b	Teach the BranchProbabilityInfo analysis pass to read any metadata encoding of probabilities. In the absence of metadata, it continues to fall back on static heuristics. This allows __builtin_expect, after lowering through llvm.expect a branch instruction's metadata, to actually enter the branch probability model. This is one component of resolving PR2577. llvm-svn: 142492	2011-10-19 10:30:30 +00:00
Chandler Carruth	343fad44ea	Add pass printing support to BlockFrequencyInfo pass. The implementation layer already had support for printing the results of this analysis, but the wiring was missing. Now that printing the analysis works, actually bring some of this analysis, and the BranchProbabilityInfo analysis that it wraps, under test! I'm planning on fixing some bugs and doing other work here, so having a nice place to add regression tests and a way to observe the results is really useful. llvm-svn: 142491	2011-10-19 10:12:41 +00:00
Andrew Trick	887a111e31	Missing test case for r141164. llvm-svn: 141166	2011-10-05 06:23:32 +00:00
Nick Lewycky	3155552461	Reapply r140979 with fix! We never did get a testcase, but careful review of the logic by David Meyer revealed this bug. llvm-svn: 140992	2011-10-03 07:10:45 +00:00
Nick Lewycky	b1dbce1406	Revert r140979 due to reports of bootstrap failure. llvm-svn: 140980	2011-10-03 05:14:59 +00:00
Nick Lewycky	3c624b8d0d	Add one more case we compute a max trip count. llvm-svn: 140979	2011-10-03 01:03:57 +00:00
Eli Friedman	5f476dc3ef	PR10628: Fix getModRefInfo so it queries the underlying alias() implementation correctly while checking nocapture calls. llvm-svn: 140666	2011-09-28 00:34:27 +00:00
Eli Friedman	5c91891cf3	Enhance alias analysis for atomic instructions a bit. Upgrade a couple alias-analysis tests to the new atomic instructions. llvm-svn: 140557	2011-09-26 20:15:28 +00:00
Andrew Trick	57d8afde93	This test only makes sense with -enable-iv-rewrite. llvm-svn: 139576	2011-09-13 02:45:26 +00:00
Eli Friedman	3d1b307672	Fix the logic in BasicAliasAnalysis::aliasGEP for comparing GEP's with variable differences so that it actually does something sane. Fixes PR10881. llvm-svn: 139276	2011-09-08 02:23:31 +00:00
Owen Anderson	653cb03191	Teach BasicAA about the aliasing properties of memset_pattern16. Fixes PR10872 and <rdar://problem/10065079>. llvm-svn: 139204	2011-09-06 23:33:25 +00:00
Nick Lewycky	e0aa54bb98	This transform only handles two-operand AddRec's. Prevent it from trying to handle anything more complex. Fixes PR10383 again! llvm-svn: 139186	2011-09-06 21:42:18 +00:00
Nick Lewycky	658bdb5133	The logic inside getMulExpr to simplify {a,+,b}*{c,+,d} was wrong, which was visible given a=b=c=d=1, on iteration #1 (the second iteration). Replace it with correct math. Fixes PR10383! llvm-svn: 139133	2011-09-06 05:05:14 +00:00
Nick Lewycky	b1438c763a	Revert r139126 due to selfhost failures reported by buildbots. llvm-svn: 139130	2011-09-06 02:43:13 +00:00
Nick Lewycky	c4c43fbb07	Teach SCEV to report a max backedge count in one interesting case in HowFarToZero; the case for a canonical loop. llvm-svn: 139126	2011-09-05 23:25:16 +00:00
Rafael Espindola	7161661863	Move the loads after the calls so that the fix for PR10292 doesn't show that the loads don't alias the allocas. llvm-svn: 134852	2011-07-09 23:53:58 +00:00
Rafael Espindola	7fbab4dcc7	Use CHECK-NEXT. llvm-svn: 134850	2011-07-09 22:56:50 +00:00
Chris Lattner	8936d2bfbc	Remove support for parsing the "type i32" syntax for defining a numbered top level type without a specified number. This syntax isn't documented and blocks forward progress. llvm-svn: 133371	2011-06-19 00:03:46 +00:00
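
The getMulExpr entry in the log above (llvm-svn: 139133) concerns the product {a,+,b}*{c,+,d}. As a rough illustration only, and not the code from that commit, the sketch below works the arithmetic by hand: on iteration n the product is (a + b*n)*(c + d*n), which expands to the quadratic recurrence {a*c,+,a*d+b*c+b*d,+,2*b*d}. The helper names affine_at, quadratic_at, and product_at are hypothetical and do not exist in LLVM.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: value of the affine recurrence {start,+,step}
   on iteration n, where n = 0 is the first iteration. */
static int64_t affine_at(int64_t start, int64_t step, int64_t n) {
    return start + step * n;
}

/* Hypothetical helper: value of the quadratic recurrence {c0,+,c1,+,c2}
   on iteration n, i.e. c0 + c1*C(n,1) + c2*C(n,2). */
static int64_t quadratic_at(int64_t c0, int64_t c1, int64_t c2, int64_t n) {
    assert(n >= 0);
    /* n*(n-1) is always even, so the division is exact. */
    return c0 + c1 * n + c2 * (n * (n - 1) / 2);
}

/* Hypothetical helper: {a,+,b}*{c,+,d} on iteration n, evaluated through
   the expanded recurrence {a*c,+,a*d+b*c+b*d,+,2*b*d}. */
static int64_t product_at(int64_t a, int64_t b, int64_t c, int64_t d,
                          int64_t n) {
    return quadratic_at(a * c, a * d + b * c + b * d, 2 * b * d, n);
}
```

For a=b=c=d=1, the case named in the commit message, the expanded form is {1,+,3,+,2}; on iteration #1 product_at(1, 1, 1, 1, 1) gives 4, which agrees with affine_at(1, 1, 1) * affine_at(1, 1, 1).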

395 Commits