llvm-project

Commit Graph

Author	SHA1	Message	Date
Evan Cheng	89553cc42e	Do not speculatively execute an instruction by hoisting it to its predecessor BB if any of its operands are defined but not used in BB. The transformation will prevent the operand from being sunk into the use block. llvm-svn: 52244	2008-06-12 21:15:59 +00:00
Evan Cheng	933c743042	For now, avoid generating FP select instructions in order to speculatively execute integer arithmetic instructions. FP selects are more likely to be expensive (even compared to branch on fcmp). This is not a wonderful solution but I rather err on the side of conservative. This fixes the heapsort performance regressions. llvm-svn: 52224	2008-06-11 19:18:20 +00:00
Gabor Greif	945f2f7fed	op_iterator-ify loops llvm-svn: 52191	2008-06-10 22:03:26 +00:00
Evan Cheng	89200c9177	Speculatively execute a block when the the block is the then part of a triangle shape and it contains a single, side effect free, cheap instruction. The branch is eliminated by adding a select instruction. i.e. Turn BB: %t1 = icmp br i1 %t1, label %BB1, label %BB2 BB1: %t3 = add %t2, c br label BB2 BB2: => BB: %t1 = icmp %t4 = add %t2, c %t3 = select i1 %t1, %t2, %t3 llvm-svn: 52073	2008-06-07 08:52:29 +00:00
Devang Patel	8549e4ca07	LoopSimplify preserves AA. llvm-svn: 52053	2008-06-06 17:50:58 +00:00
Owen Anderson	2df82e7cec	LoopIndexSplit can sometimes result in cases where a block in its own domfrontier. Don't crash when we encounter one of these. llvm-svn: 51915	2008-06-03 18:29:48 +00:00
Dan Gohman	2ad7e7341c	Fix whitespace in whitespace-significant pseudocode in a comment. llvm-svn: 51890	2008-06-03 00:57:21 +00:00
Gabor Greif	5df4326d78	rewrite operand loops to use iterators llvm-svn: 51789	2008-05-30 21:24:22 +00:00
Owen Anderson	1f59d9937f	Since LCSSA switched over to DenseMap, we have to be more careful to avoid iterator invalidation. Fixes PR2385. llvm-svn: 51777	2008-05-30 17:31:01 +00:00
Duncan Sands	dd7daee850	Factor code to copy global value attributes like the section or the visibility from one global value to another: copyAttributesFrom. This is particularly useful for duplicating functions: previously this was done by explicitly copying each attribute in turn at each place where a new function was created out of an old one, with the result that obscure attributes were regularly forgotten (like the collector or the section). Hopefully now everything is uniform and nothing is forgotten. llvm-svn: 51567	2008-05-26 19:58:59 +00:00
Owen Anderson	d3f21d165f	Use a DenseMap instead of an std::map, speeding up the testcase in PR2368 by about a third. llvm-svn: 51565	2008-05-26 10:07:43 +00:00
Dan Gohman	f96e1371e8	Tidy up BasicBlock::getFirstNonPHI, and change a bunch of places to use it instead of duplicating its functionality. llvm-svn: 51499	2008-05-23 21:05:58 +00:00
Matthijs Kooijman	aef2b8198b	Restucture a part of the SimplifyCFG pass and include a testcase. The SimplifyCFG pass looks at basic blocks that contain only phi nodes, followed by an unconditional branch. In a lot of cases, such a block (BB) can be merged into their successor (Succ). This merging is performed by TryToSimplifyUncondBranchFromEmptyBlock. It does this by taking all phi nodes in the succesor block Succ and expanding them to include the predecessors of BB. Furthermore, any phi nodes in BB are moved to Succ and expanded to include the predecessors of Succ as well. Before attempting this merge, CanPropagatePredecessorsForPHIs checks to see if all phi nodes can be properly merged. All functional changes are made to this function, only comments were updated in TryToSimplifyUncondBranchFromEmptyBlock. In the original code, CanPropagatePredecessorsForPHIs looks quite convoluted and more like stack of checks added to handle different kinds of situations than a comprehensive check. In particular the first check in the function did some value checking for the case that BB and Succ have a common predecessor, while the last check in the function simply rejected all cases where BB and Succ have a common predecessor. The first check was still useful in the case that BB did not contain any phi nodes at all, though, so it was not completely useless. Now, CanPropagatePredecessorsForPHIs is restructured to to look a lot more similar to the code that actually performs the merge. Both functions now look at the same phi nodes in about the same order. Any conflicts (phi nodes with different values for the same source) that could arise from merging or moving phi nodes are detected. If no conflicts are found, the merge can happen. Apart from only restructuring the checks, two main changes in functionality happened. Firstly, the old code rejected blocks with common predecessors in most cases. The new code performs some extra checks so common predecessors can be handled in a lot of cases. Wherever common predecessors still pose problems, the blocks are left untouched. Secondly, the old code rejected the merge when values (phi nodes) from BB were used in any other place than Succ. However, it does not seem that there is any situation that would require this check. Even more, this can be proven. Consider that BB is a block containing of a single phi node "%a" and a branch to Succ. Now, since the definition of %a will dominate all of its uses, BB will dominate all blocks that use %a. Furthermore, since the branch from BB to Succ is unconditional, Succ will also dominate all uses of %a. Now, assume that one predecessor of Succ is not dominated by BB (and thus not dominated by Succ). Since at least one use of %a (but in reality all of them) is reachable from Succ, you could end up at a use of %a without passing through it's definition in BB (by coming from X through Succ). This is a contradiction, meaning that our original assumption is wrong. Thus, all predecessors of Succ must also be dominated by BB (and thus also by Succ). This means that moving the phi node %a from BB to Succ does not pose any problems when the two blocks are merged, and any use checks are not needed. llvm-svn: 51478	2008-05-23 09:09:41 +00:00
Gabor Greif	e1f6e4b21d	API change for {BinaryOperator\|CmpInst\|CastInst}::create*() --> Create. Legacy interfaces will be in place for some time. (Merge from use-diet branch.) llvm-svn: 51200	2008-05-16 19:29:10 +00:00
Gabor Greif	697e94cc22	Fix a bunch of 80col violations that arose from the Create API change. Tweak makefile targets to find these better. llvm-svn: 51143	2008-05-15 10:04:30 +00:00
Dan Gohman	3dc2d92ebd	Split the loop unroll mechanism logic out into a utility function. Patch by Matthijs Kooijman! llvm-svn: 51083	2008-05-14 00:24:14 +00:00
Dan Gohman	0479aa5c0b	Change class' public PassInfo variables to by initialized with the address of the PassInfo directly instead of calling getPassInfo. This eliminates a bunch of dynamic initializations of static data. Also, fold RegisterPassBase into PassInfo, make a bunch of its data members const, and rearrange some code to initialize data members in constructors instead of using setter member functions. llvm-svn: 51022	2008-05-13 02:05:11 +00:00
Dan Gohman	d78c400b5b	Clean up the use of static and anonymous namespaces. This turned up several things that were neither in an anonymous namespace nor static but not intended to be global. llvm-svn: 51017	2008-05-13 00:00:25 +00:00
Dan Gohman	6a2da37c0e	Make several variable declarations static. llvm-svn: 50696	2008-05-06 01:53:16 +00:00
Dan Gohman	a8b7e78f54	Remove uses of llvm/System/IncludeFile.h that are no longer needed. llvm-svn: 50695	2008-05-06 01:32:53 +00:00
Devang Patel	fa0e3c4a92	Handle multiple return values. llvm-svn: 50604	2008-05-03 01:12:15 +00:00
Chris Lattner	8be72700b8	Fix PR2256, yet another miscompilation in simplifycfg of i multiple return values. Bill, please pull this into Tak. llvm-svn: 50332	2008-04-28 00:19:07 +00:00
Nate Begeman	ca270ad96f	Feedback from chris llvm-svn: 50271	2008-04-25 17:45:52 +00:00
Nick Lewycky	4d43d3c72c	Remove 'unwinds to' support from mainline. This patch undoes r47802 r47989 r48047 r48084 r48085 r48086 r48088 r48096 r48099 r48109 and r48123. llvm-svn: 50265	2008-04-25 16:53:59 +00:00
Nate Begeman	6fed3b2038	Teach the PruningFunctionCloner how to look through loads with ConstantExpression GEPs pointing into constant globals. llvm-svn: 50256	2008-04-25 06:37:06 +00:00
Evan Cheng	608eeef5ce	Adjust inline cost computation to be less aggressive. llvm-svn: 50222	2008-04-24 18:42:47 +00:00
Chris Lattner	86bbf338e5	Split some code out of the main SimplifyCFG loop into its own function. Fix said code to handle merging return instructions together correctly when handling multiple return values. llvm-svn: 50199	2008-04-24 00:01:19 +00:00
Devang Patel	8f83081fea	Check type instead of no. of operands. llvm-svn: 50179	2008-04-23 20:18:29 +00:00
Chris Lattner	a5b11705b6	Move SplitBlockPredecessors out of loopsimplify into BasicBlockUtils.h as a global helper function. At the same type, switch it from taking a vector of predecessors to an arbitrary sequential input. This allows us to switch LoopSimplify to use a SmallVector for various temporary vectors that it passed into SplitBlockPredecessors. llvm-svn: 50020	2008-04-21 01:28:02 +00:00
Chris Lattner	d418b06abf	Move domtree/frontier updating earlier, allowing us to use it to update phi nodes, removing a hack. llvm-svn: 50019	2008-04-21 01:05:08 +00:00
Chris Lattner	96e9e22269	Factor dominator tree and frontier updating into SplitBlockPredecessors instead of doing it after every call. llvm-svn: 50018	2008-04-21 00:54:38 +00:00
Chris Lattner	aca912d793	simplify code, fit in 80 cols. llvm-svn: 50015	2008-04-21 00:23:14 +00:00
Chris Lattner	38806c3e9c	fit in 80 cols llvm-svn: 50014	2008-04-21 00:19:16 +00:00
Scott Michel	376acf4aaa	Remove unused variable llvm-svn: 49838	2008-04-17 01:30:44 +00:00
Scott Michel	f66cb3696a	Workaround for PR2207, in which pred_iterator assert gets triggered due to a wee problem in Xcode 2.[45]/gcc 4.0.1. llvm-svn: 49831	2008-04-16 23:46:39 +00:00
Chuck Rose III	c6a47e8a79	VisualStudio project files updated. #include <algorithm> added to make VisualStudio happy. Also had to undefine setjmp because of #include <csetjmp> turning setjmp into _setjmp in VisualStudio. llvm-svn: 49743	2008-04-15 21:27:11 +00:00
Owen Anderson	7629b71dd4	Revert r49614. As Dan pointed out, some of these aren't correct. llvm-svn: 49657	2008-04-14 17:38:21 +00:00
Owen Anderson	1f6fbc4bc3	Replace calls of the form V1->setName(V2->getName()) with V1->takeName(V2), which is significantly more efficient. llvm-svn: 49614	2008-04-13 19:15:17 +00:00
Devang Patel	8cd2a3ae2a	Fix insert point handling for multiple return values. llvm-svn: 49367	2008-04-08 02:24:08 +00:00
Duncan Sands	1416ebf1fe	The "stacksave is not nounwind problem" no longer needs to be fixed here - a previous commit made sure that intrinsics always get the right attributes. So remove no-longer needed code, and while there use Intrinsic::getDeclaration rather than getOrInsertFunction. llvm-svn: 49337	2008-04-07 13:43:58 +00:00
Duncan Sands	fbc6adcc59	Use Intrinsic::getDeclaration to get hold of intrinsics. Fix up the argument type (should be i8, was an array). llvm-svn: 49336	2008-04-07 13:41:19 +00:00
Dale Johannesen	87e484f08b	Mark calls to llvm.stacksave, llvm.stackrestore as nounwind. When such calls are inlined into something else that is invoked, they were getting changed to invokes, which is badness. llvm-svn: 49299	2008-04-07 00:08:48 +00:00
Gabor Greif	e9ecc68d8f	API changes for class Use size reduction, wave 1. Specifically, introduction of XXX::Create methods for Users that have a potentially variable number of Uses. llvm-svn: 49277	2008-04-06 20:25:17 +00:00
Evan Cheng	ac38d444e2	1. Drop default inline threshold back down to 200. 2. Do not use # of basic blocks as part of the cost computation since it doesn't really figure into function size. 3. More aggressively inline function with vector code. llvm-svn: 49061	2008-04-01 23:59:29 +00:00
Dale Johannesen	5e4e051c2a	Revert 49006 for the moment. llvm-svn: 49046	2008-04-01 20:00:57 +00:00
Dale Johannesen	7d02cf3c9c	Emit exception handling info for functions which are not marked nounwind, or for all functions when -enable-eh is set, provided the target supports Dwarf EH. llvm-gcc generates nounwind in the right places; other FEs will need to do so also. Given such a FE, -enable-eh should no longer be needed. llvm-svn: 49006	2008-03-31 23:40:23 +00:00
Evan Cheng	3471ae8c5d	Increasing the inline limit from (overly conservative) 200 to 300. Given each BB costs 20 and each instruction costs 5, 200 means a 4 BB function + 24 instructions (actually less because caller's size also contributes to it). Furthermore, double the limit when more than 10% of the callee instructions are vector instructions. Multimedia kernels tend to love inlining. llvm-svn: 48725	2008-03-24 06:37:48 +00:00
Anton Korobeynikov	d38b3fb127	Preserve calling convention during function cloning llvm-svn: 48708	2008-03-23 16:03:00 +00:00
Evan Cheng	5daf090a1a	80 col violation. llvm-svn: 48573	2008-03-20 00:20:23 +00:00
Nick Lewycky	7698bfbe16	Update -mem2reg to use succ_iterator instead of iterating across TerminatorInst successors. This makes it support nounwind. llvm-svn: 48320	2008-03-13 02:42:41 +00:00

1 2 3 4 5 ...

816 Commits