llvm-project

Commit Graph

Author	SHA1	Message	Date
Richard Smith	ad9c8e839e	Don't bind a reference to a dereferenced null pointer (for return value of WeakVH::operator*). llvm-svn: 162309	2012-08-21 20:35:14 +00:00
Andrew Trick	c803706c18	Reapply r160340. LSR: Limit CollectSubexprs. Speculatively fix crashes by code inspection. Can't reproduce them yet. llvm-svn: 160344	2012-07-17 05:30:37 +00:00
Andrew Trick	e834cb465a	Revert "LSR: try not to blow up solving combinatorial problems brute force." Some units tests crashed on a different platform. llvm-svn: 160341	2012-07-17 05:05:21 +00:00
Andrew Trick	7cd6d426b3	LSR: try not to blow up solving combinatorial problems brute force. This places limits on CollectSubexprs to constrains the number of reassociation possibilities. It limits the recursion depth and skips over chains of nested recurrences outside the current loop. Fixes PR13361. Although underlying SCEV behavior is still potentially bad. llvm-svn: 160340	2012-07-17 05:00:56 +00:00
Andrew Trick	653513b8dd	LSR Fix: check SCEV expression safety before expansion. All SCEV expressions used by LSR formulae must be safe to expand. i.e. they may not contain UDiv unless we can prove nonzero denominator. Fixes PR11356: LSR hoists UDiv. llvm-svn: 160205	2012-07-13 23:33:10 +00:00
Andrew Trick	8370c7c38f	LSR: fix expansion of scaled reg in non-address type formulae. For non-address users, Base and Scaled registers are not specially associated to fit an address mode, so SCEVExpander should apply normal expansion rules. Otherwise we may sink computation into inner loops that have already been optimized. llvm-svn: 158537	2012-06-15 20:07:29 +00:00
Andrew Trick	aca8fb3c45	LSR fix: "Special" users are just like "Basic" users but allow -1 scale. llvm-svn: 158536	2012-06-15 20:07:26 +00:00
Benjamin Kramer	bde9176663	Fix typos found by http://github.com/lyda/misspell-check llvm-svn: 157885	2012-06-02 10:20:22 +00:00
Rafael Espindola	dd48931461	Make sure HoistInsertPosition finds a position that is dominated by all inputs. llvm-svn: 155809	2012-04-30 03:53:06 +00:00
Jakob Stoklund Olesen	c90abc8956	Break up getProfitableChainIncrement(). The required checks are moved to ChainInstruction() itself and the policy decisions are moved to IVChain::isProfitableInc(). Also cache the ExprBase in IVChain to avoid frequent recomputations. No functional change intended. llvm-svn: 155676	2012-04-26 23:33:11 +00:00
Jakob Stoklund Olesen	a0337d7bd9	Turn IVChain into a struct. No functional change intended. llvm-svn: 155675	2012-04-26 23:33:09 +00:00
Jakob Stoklund Olesen	293673d788	Print IV chain numbers while collecting them. llvm-svn: 155567	2012-04-25 18:01:32 +00:00
Andrew Trick	19f80c1e7e	loop-reduce: Add an early bailout to catch extremely large loops. This introduces a threshold of 200 IV Users, which is very conservative but should be sufficient to avoid serious compile time sink or stack overflow. The llvm test-suite with LTO never exceeds 190 users per loop. The bug doesn't relate to a specific type of loop. Checking in an arbitrary giant loop as a unit test would be silly. Fixes rdar://11262507. llvm-svn: 154983	2012-04-18 04:00:10 +00:00
Jakob Stoklund Olesen	f2390e8303	Pass the right sign to TLI->isLegalICmpImmediate. LSR can fold three addressing modes into its ICmpZero node: ICmpZero BaseReg + Offset => ICmp BaseReg, -Offset ICmpZero -1ScaleReg + Offset => ICmp ScaleReg, Offset ICmpZero BaseReg + -1ScaleReg => ICmp BaseReg, ScaleReg The first two cases are only used if TLI->isLegalICmpImmediate() likes the offset. Make sure the right Offset sign is passed to this method in the second case. The ARM version is not symmetric. <rdar://problem/11184260> llvm-svn: 154079	2012-04-05 03:10:56 +00:00
Andrew Trick	14779cc49e	LSR ivchain bug fix: corner case with ConstantExpr. Fixes PR11950. llvm-svn: 153463	2012-03-26 20:28:37 +00:00
Andrew Trick	356a896394	comment typo llvm-svn: 153462	2012-03-26 20:28:35 +00:00
Andrew Trick	e51feea79c	LSR cleanup: potential bug caught by PVS-Studio. Thanks Andrey. llvm-svn: 153451	2012-03-26 18:03:16 +00:00
Andrew Trick	e3502cb204	Remove -enable-lsr-retry in time for 3.1. llvm-svn: 153287	2012-03-22 22:42:51 +00:00
Andrew Trick	d97b83e320	Remove -enable-lsr-nested in time for 3.1. Tests cases have been removed but attached to open PR12330. llvm-svn: 153286	2012-03-22 22:42:45 +00:00
Andrew Trick	070e540a3e	LSR fix: Add isSimplifiedLoopNest to IVUsers analysis. Only record IVUsers that are dominated by simplified loop headers. Otherwise SCEVExpander will crash while looking for a preheader. I previously tried to work around this in LSR itself, but that was insufficient. This way, LSR can continue to run if some uses are not in simple loops, as long as we don't attempt to analyze those users. Fixes <rdar://problem/11049788> Segmentation fault: 11 in LoopStrengthReduce llvm-svn: 152892	2012-03-16 03:16:56 +00:00
David Blaikie	46a9f016c5	More dead code removal (using -Wunreachable-code) llvm-svn: 148578	2012-01-20 21:51:11 +00:00
Andrew Trick	b9c822ab0b	Handle a corner case with IV chain collection with bailout instead of assert. Fixes PR11783: bad cast to AddRecExpr. llvm-svn: 148572	2012-01-20 21:23:40 +00:00
Andrew Trick	c908b43d9f	SCEVExpander fixes. Affects LSR and indvars. LSR has gradually been improved to more aggressively reuse existing code, particularly existing phi cycles. This exposed problems with the SCEVExpander's sloppy treatment of its insertion point. I applied some rigor to the insertion point problem that will hopefully avoid an endless bug cycle in this area. Changes: - Always used properlyDominates to check safe code hoisting. - The insertion point provided to SCEV is now considered a lower bound. This is usually a block terminator or the use itself. Under no cirumstance may SCEVExpander insert below this point. - LSR is reponsible for finding a "canonical" insertion point across expansion of different expressions. - Robust logic to determine whether IV increments are in "expanded" form and/or can be safely hoisted above some insertion point. Fixes PR11783: SCEVExpander assert. llvm-svn: 148535	2012-01-20 07:41:13 +00:00
Andrew Trick	12728f04ca	LSR fix: broaden the check for loop preheaders. It's becoming clear that LoopSimplify needs to unconditionally create loop preheaders. But that is a bigger fix. For now, continuing to hack LSR. Fixes rdar://10701050 "Cannot split an edge from an IndirectBrInst" assert. llvm-svn: 148288	2012-01-17 06:45:52 +00:00
Andrew Trick	d5d2db9af9	Enable LSR IV Chains with sufficient heuristics. These heuristics are sufficient for enabling IV chains by default. Performance analysis has been done for i386, x86_64, and thumbv7. The optimization is rarely important, but can significantly speed up certain cases by eliminating spill code within the loop. Unrolled loops are prime candidates for IV chains. In many cases, the final code could still be improved with more target specific optimization following LSR. The goal of this feature is for LSR to make the best choice of induction variables. Instruction selection may not completely take advantage of this feature yet. As a result, there could be cases of slight code size increase. Code size can be worse on x86 because it doesn't support postincrement addressing. In fact, when chains are formed, you may see redundant address plus stride addition in the addressing mode. GenerateIVChains tries to compensate for the common cases. On ARM, code size increase can be mitigated by using postincrement addressing, but downstream codegen currently misses some opportunities. llvm-svn: 147826	2012-01-10 01:45:08 +00:00
Andrew Trick	248d410e3e	Adding IV chain generation to LSR. After collecting chains, check if any should be materialized. If so, hide the chained IV users from the LSR solver. LSR will only solve for the head of the chain. GenerateIVChains will then materialize the chained IV users by computing the IV relative to its previous value in the chain. In theory, chained IV users could be exposed to LSR's solver. This would be considerably complicated to implement and I'm not aware of a case where we need it. In practice it's more important to intelligently prune the search space of nontrivial loops before running the solver, otherwise the solver is often forced to prune the most optimal solutions. Hiding the chained users does this well, so that LSR is more likely to find the best IV for the chain as a whole. llvm-svn: 147801	2012-01-09 21:18:52 +00:00
Andrew Trick	29fe5f03d7	Adding collection of IV chains to LSR. This collects a set of IV uses within the loop whose values can be computed relative to each other in a sequence. Following checkins will make use of this information. llvm-svn: 147797	2012-01-09 19:50:34 +00:00
Andrew Trick	4dc3eff5ae	"Minor LSR debugging stuff" llvm-svn: 147785	2012-01-09 18:58:16 +00:00
Andrew Trick	06f6c05d08	Enable redundant phi elimination after LSR. This will be more important as we extend the LSR pass in ways that don't rely on the formula solver. In particular, we need it for constructing IV chains. llvm-svn: 147724	2012-01-07 07:08:17 +00:00
Andrew Trick	732ad80dbb	LSR: Don't optimize loops if an outer loop has no preheader. LoopSimplify may not run on some outer loops, e.g. because of indirect branches. SCEVExpander simply cannot handle outer loops with no preheaders. Fixes rdar://10655343 SCEVExpander segfault. llvm-svn: 147718	2012-01-07 03:16:50 +00:00
Andrew Trick	2ec61a896b	LSR: run DeleteDeadPhis before replaceCongruentPhis. llvm-svn: 147711	2012-01-07 01:36:44 +00:00
Andrew Trick	5adedf5d47	Extended replaceCongruentPhis to handle mixed phi types. llvm-svn: 147707	2012-01-07 01:12:09 +00:00
Andrew Trick	dc18e383b7	Cleanup. Clarify LSRInstance public methods. llvm-svn: 146459	2011-12-13 00:55:33 +00:00
Andrew Trick	e8b4f409b2	LSR: ignore strides in outer loops. Since we're not rewriting IVs in other loops, there's not much reason to consider their stride when generating formulae. This should reduce the number of useless formulas considered by LSR. llvm-svn: 146302	2011-12-10 00:25:00 +00:00
Andrew Trick	5df9096584	LSR: prune undesirable formulae early. It's always good to prune early, but formulae that are unsatisfactory in their own right need to be removed before running any other pruning heuristics. We easily avoid generating such formulae, but we need them as an intermediate basis for forming other good formulae. llvm-svn: 145906	2011-12-06 03:13:31 +00:00
Benjamin Kramer	7ba71be392	Move code into anonymous namespaces. llvm-svn: 145154	2011-11-26 23:01:57 +00:00
Nadav Rotem	51f71054b6	Fix MSVC warnings by adding a cast. llvm-svn: 144721	2011-11-15 22:54:21 +00:00
Eli Friedman	b46345d7c1	Avoid undefined behavior in negation in LSR. Patch by Ahmed Charles. Someone more familiar with LSR should double-check that the extra cast is actually doing the right thing in the overflow cases; I'm not completely confident that's that case. llvm-svn: 141916	2011-10-13 23:48:33 +00:00
Andrew Trick	ecbe22bb8d	Add experimental -enable-lsr-phielim option. I'm not sure we will need it in the long run, but the option is currently useful for checking if the output of LSR is "clean". llvm-svn: 141634	2011-10-11 02:30:45 +00:00
Andrew Trick	7fb669ab48	LSR should only reuse phis that match its formula. Fixes rdar://problem/5064068 llvm-svn: 141442	2011-10-07 23:46:21 +00:00
Andrew Trick	8de329a9fc	LSR should avoid redundant edge splitting. This handles the case in which LSR rewrites an IV user that is a phi and splits critical edges originating from a switch. Fixes <rdar://problem/6453893> LSR is not splitting edges "nicely" llvm-svn: 141059	2011-10-04 03:50:44 +00:00
Andrew Trick	168dfffdb8	typo + pasto llvm-svn: 140769	2011-09-29 01:53:08 +00:00
Andrew Trick	bc6de90a5f	LSR: rewrite inner loops only. Rewriting the entire loop nest now requires -enable-lsr-nested. See PR11035 for some performance data. A few unit tests specifically test nested LSR, and are now under a flag. llvm-svn: 140762	2011-09-29 01:33:38 +00:00
Andrew Trick	581243919d	Disable LSR retry by default. Disabling aggressive LSR saves compilation time, and with the new indvars behavior usually improves performance. llvm-svn: 140590	2011-09-27 00:44:14 +00:00
Andrew Trick	8868faec63	LSR, one of the new Cost::isLoser() checks did not get merged in the previous checkin. llvm-svn: 140583	2011-09-26 23:35:25 +00:00
Andrew Trick	784729d408	LSR cost metric minor fix and verification. The minor bug heuristic was noticed by inspection. I added the isLoser/isValid helpers because they will become more important with subsequent checkins. llvm-svn: 140580	2011-09-26 23:11:04 +00:00
Andrew Trick	8b2fe2f744	LSR minor bug fix in RateRegister. No test case. Noticed by inspection and I doubt it ever affects the outcome of the overall heuristic, let alone final codegen. llvm-svn: 140431	2011-09-23 23:05:19 +00:00
Bill Wendling	3fb137f7ef	LSR wants to split the landing pad's critical edge. Let it do it, but use the proper function to do it. llvm-svn: 138550	2011-08-25 05:55:40 +00:00
Bill Wendling	07efd6f1e0	When inserting new instructions, use getFirstInsertionPt instead of getFirstNonPHI so that it will skip over the landingpad instructions as well. llvm-svn: 138537	2011-08-25 01:08:34 +00:00
Bill Wendling	86c5cbe613	Skip the landingpad instruction when determining the insertion point. llvm-svn: 138481	2011-08-24 21:06:46 +00:00

1 2 3 4 5 ...

562 Commits