llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	95057f6ad1	Okay, so there is no reasonable way for tail duplication to update SSA form, as it is making effectively arbitrary modifications to the CFG and we don't have a domset/domfrontier implementations that can handle the dynamic updates. Instead of having a bunch of code that doesn't actually work in practice, just demote any potentially tricky values to the stack (causing the problem to go away entirely). Later invocations of mem2reg will rebuild SSA for us. This fixes all of the major performance regressions with tail duplication from LLVM 1.1. For example, this loop: --- int popcount(int x) { int result = 0; while (x != 0) { result = result + (x & 0x1); x = x >> 1; } return result; } --- Used to be compiled into: int %popcount(int %X) { entry: br label %loopentry loopentry: ; preds = %entry, %no_exit %x.0 = phi int [ %X, %entry ], [ %tmp.9, %no_exit ] ; <int> [#uses=3] %result.1.0 = phi int [ 0, %entry ], [ %tmp.6, %no_exit ] ; <int> [#uses=2] %tmp.1 = seteq int %x.0, 0 ; <bool> [#uses=1] br bool %tmp.1, label %loopexit, label %no_exit no_exit: ; preds = %loopentry %tmp.4 = and int %x.0, 1 ; <int> [#uses=1] %tmp.6 = add int %tmp.4, %result.1.0 ; <int> [#uses=1] %tmp.9 = shr int %x.0, ubyte 1 ; <int> [#uses=1] br label %loopentry loopexit: ; preds = %loopentry ret int %result.1.0 } And is now compiled into: int %popcount(int %X) { entry: br label %no_exit no_exit: ; preds = %entry, %no_exit %x.0.0 = phi int [ %X, %entry ], [ %tmp.9, %no_exit ] ; <int> [#uses=2] %result.1.0.0 = phi int [ 0, %entry ], [ %tmp.6, %no_exit ] ; <int> [#uses=1] %tmp.4 = and int %x.0.0, 1 ; <int> [#uses=1] %tmp.6 = add int %tmp.4, %result.1.0.0 ; <int> [#uses=2] %tmp.9 = shr int %x.0.0, ubyte 1 ; <int> [#uses=2] %tmp.1 = seteq int %tmp.9, 0 ; <bool> [#uses=1] br bool %tmp.1, label %loopexit, label %no_exit loopexit: ; preds = %no_exit ret int %tmp.6 } llvm-svn: 12457	2004-03-16 23:29:09 +00:00
Chris Lattner	7a7b114871	Do not try to optimize PHI nodes with incredibly high degree. This reduces SCCP time from 615s to 1.49s on a large testcase that has a gigantic switch statement that all of the blocks in the function go to (an intepreter). llvm-svn: 12442	2004-03-16 19:49:59 +00:00
Chris Lattner	a64923ad26	Do not copy gigantic switch instructions llvm-svn: 12441	2004-03-16 19:45:22 +00:00
Chris Lattner	db5b8f4d6b	Fix a regression from this patch: http://mail.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20040308/013095.html Basically, this patch only updated the immediate dominatees of the header node to tell them that the preheader also dominated them. In practice, ALL dominatees of the header node are also dominated by the preheader. This fixes: LoopSimplify/2004-03-15-IncorrectDomUpdate. and PR293 llvm-svn: 12434	2004-03-16 06:00:15 +00:00
Chris Lattner	cd83282df1	Add counters for the number of calls elimianted llvm-svn: 12420	2004-03-15 05:46:59 +00:00
Chris Lattner	20cda2645e	Implement LICM of calls in simple cases. This is sufficient to move around sin/cos/strlen calls and stuff. This implements: LICM/call_sink_pure_function.ll LICM/call_sink_const_function.ll llvm-svn: 12415	2004-03-15 04:11:30 +00:00
Chris Lattner	b68659552a	Do not create empty basic blocks when the lowerswitch pass expects blocks to be non-empty! This fixes LowerSwitch/2004-03-13-SwitchIsDefaultCrash.ll llvm-svn: 12384	2004-03-14 04:14:31 +00:00
Chris Lattner	d078812f96	If a block is dead, dominators will not be calculated for it. Because of this loop information won't see it, and we could have unreachable blocks pointing to the non-header node of blocks in a natural loop. This isn't tidy, so have the loopsimplify pass clean it up. llvm-svn: 12380	2004-03-14 03:59:22 +00:00
Chris Lattner	7d2a539735	Add some debugging output Fix InstCombine/2004-03-13-InstCombineInfLoop.ll which caused an infinite loop compiling (I think) povray. llvm-svn: 12365	2004-03-13 23:54:27 +00:00
Chris Lattner	797cb2f6c1	This little patch speeds up the loop used to update the dominator set analysis. On the testcase from GCC PR12440, which has a LOT of loops (1392 of which require preheaders to be inserted), this speeds up the loopsimplify pass from 1.931s to 0.1875s. The loop in question goes from 1.65s -> 0.0097s, which isn't bad. All of these times are a debug build. This adds a dependency on DominatorTree analysis that was not there before, but we always had dominatortree available anyway, because LICM requires both loop simplify and DT, so this doesn't add any extra analysis in practice. llvm-svn: 12362	2004-03-13 22:01:26 +00:00
Chris Lattner	022167f13b	Implement sub.ll:test14 llvm-svn: 12355	2004-03-13 00:11:49 +00:00
Chris Lattner	92295c5031	Implement InstCombine/sub.ll:test12 & test13 llvm-svn: 12353	2004-03-12 23:53:13 +00:00
Chris Lattner	59db22dcd4	Add sccp support for select instructions llvm-svn: 12318	2004-03-12 05:52:44 +00:00
Chris Lattner	b909e8b0d4	Add trivial optimizations for select instructions llvm-svn: 12317	2004-03-12 05:52:32 +00:00
Chris Lattner	538fee7aa2	Since 'load null' is undefined, we can make it do whatever we want. Returning a zero value is the most likely way to cause further simplification, so we do it. llvm-svn: 12197	2004-03-07 22:16:24 +00:00
Chris Lattner	7abcc387de	Don't emit things like malloc(16*1). Allocation instructions are fixed arity now. llvm-svn: 12086	2004-03-03 01:40:53 +00:00
Chris Lattner	5cf39339d1	Disable tail duplication in a case that breaks on Olden/tsp llvm-svn: 12021	2004-03-01 01:12:13 +00:00
Chris Lattner	bf2963ef91	Fix PR255: [tailduplication] Single basic block loops are very rare Note that this is a band-aid put over a band-aid. This just undisables tail duplication in on very specific case that it seems to work in. llvm-svn: 11989	2004-02-29 06:41:20 +00:00
Chris Lattner	772eafa332	if there is already a prototype for malloc/free, use it, even if it's incorrect. Do not just inject a new prototype. llvm-svn: 11951	2004-02-28 18:51:45 +00:00
Chris Lattner	51ea127bf3	Rename AddUsesToWorkList -> AddUsersToWorkList, as that is what it does. Create a new AddUsesToWorkList method optimize memmove/set/cpy of zero bytes to a noop. llvm-svn: 11941	2004-02-28 05:22:00 +00:00
Chris Lattner	f3a366062c	Turn 'free null' into nothing llvm-svn: 11940	2004-02-28 04:57:37 +00:00
Chris Lattner	4f7accab96	Implement test/Regression/Transforms/InstCombine/canonicalize_branch.ll This is a really minor thing, but might help out the 'switch statement induction' code in simplifycfg. llvm-svn: 11900	2004-02-27 06:27:46 +00:00
Chris Lattner	9c6833c5ca	Fix incorrect debug code llvm-svn: 11821	2004-02-25 15:15:04 +00:00
Chris Lattner	8ee0593f0d	Fix a faulty optimization on FP values llvm-svn: 11801	2004-02-24 18:10:14 +00:00
Chris Lattner	ae739aefd7	Generate much more efficient code in programs like pifft llvm-svn: 11775	2004-02-23 21:46:58 +00:00
Chris Lattner	c40b9d7d51	Fix a small typeo in my checkin last night that broke vortex and other programs :( llvm-svn: 11774	2004-02-23 21:46:42 +00:00
Chris Lattner	f5ce254692	Fix InstCombine/2004-02-23-ShiftShiftOverflow.ll Also, turn 'shr int %X, 1234' into 'shr int %X, 31' llvm-svn: 11768	2004-02-23 20:30:06 +00:00
Chris Lattner	2b55ea38bc	Implement cast.ll::test14/15 llvm-svn: 11742	2004-02-23 07:16:20 +00:00
Chris Lattner	e79e854c5c	Refactor some code. In the mul - setcc folding case, we really care about whether this is the sign bit or not, so check unsigned comparisons as well. llvm-svn: 11740	2004-02-23 06:38:22 +00:00
Chris Lattner	c8a10c4b6a	Implement mul.ll:test11 llvm-svn: 11737	2004-02-23 06:00:11 +00:00
Chris Lattner	59611149ee	Implement "strength reduction" of X <= C and X >= C llvm-svn: 11735	2004-02-23 05:47:48 +00:00
Chris Lattner	2635b52d4e	Implement InstCombine/mul.ll:test10, which is a case that occurs when dealing with "predication" llvm-svn: 11734	2004-02-23 05:39:21 +00:00
Chris Lattner	8d0bacbb9e	Implement Transforms/InstCombine/cast.ll:test13, a case which occurs in a hot 164.gzip loop. llvm-svn: 11702	2004-02-22 05:25:17 +00:00
Chris Lattner	4db2d22bea	Fold PHI nodes of constants which are only used by a single cast. This implements phi.ll:test4 llvm-svn: 11494	2004-02-16 05:07:08 +00:00
Chris Lattner	b36d908f7b	Teach LLVM to unravel the "swap idiom". This implements: Regression/Transforms/InstCombine/xor.ll:test20 llvm-svn: 11492	2004-02-16 03:54:20 +00:00
Chris Lattner	c207635fd5	Implement Transforms/InstCombine/xor.ll:test19 llvm-svn: 11490	2004-02-16 01:20:27 +00:00
Chris Lattner	d85e061575	Instead of producing calls to setjmp/longjmp, produce uses of the llvm.setjmp/llvm.longjmp intrinsics. llvm-svn: 11482	2004-02-15 22:24:27 +00:00
Chris Lattner	76b2ff4ded	Adjustments to support the new ConstantAggregateZero class llvm-svn: 11474	2004-02-15 05:55:15 +00:00
Chris Lattner	7cbb22abe6	Expose a pass ID that can be 'required' llvm-svn: 11376	2004-02-13 16:16:16 +00:00
Chris Lattner	d4b36cf9bc	Remove obsolete comment. Unreachable blocks will automatically be left at the end of the function. llvm-svn: 11313	2004-02-11 05:20:50 +00:00
Chris Lattner	5add05129e	Add an _embarassingly simple_ implementation of basic block layout. This is more of a testcase for profiling information than anything that should reasonably be used, but it's a starting point. When I have more time I will whip this into better shape. llvm-svn: 11311	2004-02-11 04:53:20 +00:00
Chris Lattner	37d46f4815	Only add the global variable with the abort message if an unwind actually occurs in the program. llvm-svn: 11249	2004-02-09 22:48:47 +00:00
Misha Brukman	3480e935d0	Fix grammar-o. llvm-svn: 11210	2004-02-08 22:27:33 +00:00
Chris Lattner	3b7f6b2217	Improve compatibility with programs that already have a prototype for 'write', even if it is wierd in some way. llvm-svn: 11207	2004-02-08 22:14:44 +00:00
Chris Lattner	fae8ab3088	rename the "exceptional" destination of an invoke instruction to the 'unwind' dest llvm-svn: 11202	2004-02-08 21:44:31 +00:00
Chris Lattner	108cadc274	Implement proper invoke/unwind lowering. This fixed PR16 "[lowerinvoke] The -lowerinvoke pass does not insert calls to setjmp/longjmp" llvm-svn: 11195	2004-02-08 19:53:56 +00:00
Chris Lattner	476488e669	Add a call to 'write' right before the call to abort() in the unwind path. This causes the JIT, or LLC'd program to print out a nice message, explaining WHY the program aborted. llvm-svn: 11184	2004-02-08 07:30:29 +00:00
Chris Lattner	2dd1c8d8ce	Fix another dominator update bug. These bugs keep getting exposed because GCSE keeps finding more code motion opportunities now that the dominators are correct! llvm-svn: 11142	2004-02-05 23:20:59 +00:00
Chris Lattner	c0c953f0bc	Fix bug updating dominators llvm-svn: 11140	2004-02-05 22:33:26 +00:00
Chris Lattner	f978c421e5	Add debug output llvm-svn: 11139	2004-02-05 22:33:19 +00:00

1 2 3 4 5 ...

565 Commits