llvm-project

Commit Graph

Author	SHA1	Message	Date
Reid Spencer	2452c94df4	Fix a doxygen problem and break lines at 80 columns llvm-svn: 28395	2006-05-19 19:09:46 +00:00
Chris Lattner	2e266807c3	Add a CloneModule call that exposes the mapping of values from the old module to the new module. Patch provided by Nick Lewycky! llvm-svn: 28349	2006-05-17 18:05:35 +00:00
Chris Lattner	35515557c7	remove some dead code identified by coverity llvm-svn: 28289	2006-05-14 18:45:44 +00:00
Chris Lattner	3237da073e	remove dead variables llvm-svn: 28286	2006-05-14 18:33:57 +00:00
Chris Lattner	4fe87d67c4	Patch to make some xforms preserve each other. Patch contributed by Domagoj Babic! llvm-svn: 28181	2006-05-09 04:13:41 +00:00
Chris Lattner	f98b4aa2e7	Fix some nondeterminstic behavior in the mem2reg pass that (in addition to nondeterminism being bad) could cause some trivial missed optimizations (dead phi nodes being left around for later passes to clean up). With this, llvm-gcc4 now bootstraps and correctly compares. I don't know why I never tried to do it before... :) llvm-svn: 27984	2006-04-27 01:14:43 +00:00
Chris Lattner	17bd60588c	Add supprot for shufflevector llvm-svn: 27513	2006-04-08 01:19:12 +00:00
Chris Lattner	8ec0205de4	Fix inlining of insert/extract element constantexprs llvm-svn: 27478	2006-04-07 04:41:03 +00:00
Chris Lattner	70ec96fa32	Adjust to change in Intrinsics.gen interface. llvm-svn: 27344	2006-04-02 03:35:01 +00:00
Chris Lattner	1b2436a624	add valuemapper support for inline asm llvm-svn: 27332	2006-04-01 23:17:11 +00:00
Chris Lattner	42e0ba09aa	teach the inliner to work with packed constants llvm-svn: 27161	2006-03-27 05:50:18 +00:00
Chris Lattner	60f6833376	use autogenerated side-effect information llvm-svn: 26673	2006-03-09 22:38:10 +00:00
Chris Lattner	d95665188b	Fix Transforms/SimplifyCFG/2006-02-17-InfiniteUnroll.ll llvm-svn: 26275	2006-02-18 00:33:17 +00:00
Chris Lattner	9c5693fb2a	Canonicalize inner loops before outer loops. Inner loop canonicalization can provide work for the outer loop to canonicalize. This fixes a case that breaks unswitching. llvm-svn: 26189	2006-02-14 23:06:02 +00:00
Chris Lattner	cffbbee8d1	When splitting exit edges to canonicalize loops, make sure to put the new block in the appropriate loop nest. Third time is the charm, right? llvm-svn: 26187	2006-02-14 22:34:08 +00:00
Chris Lattner	02f53ad3a2	Revert my last patch. It too breaks stuff llvm-svn: 26128	2006-02-12 01:59:10 +00:00
Chris Lattner	35248e06bc	Fix for my previously reverted patch llvm-svn: 26126	2006-02-11 21:24:54 +00:00
Chris Lattner	b24ce3a2a8	revert my previous change, it exposed other problems. llvm-svn: 26121	2006-02-11 08:47:47 +00:00
Chris Lattner	05bf90dddf	Make this check stricter. Disallow loop exit blocks from being shared by loops and their subloops. llvm-svn: 26118	2006-02-11 02:13:17 +00:00
Chris Lattner	a6ae101afa	remove dead expr llvm-svn: 26116	2006-02-11 01:43:37 +00:00
Chris Lattner	120f31b1fd	teach the cloner to handle inline asms llvm-svn: 25633	2006-01-26 01:55:22 +00:00
Chris Lattner	00fcdfef0d	rename method llvm-svn: 25572	2006-01-24 04:16:34 +00:00
Chris Lattner	37992b34c2	When cloning a module, clone the inline asm. llvm-svn: 25559	2006-01-23 23:06:28 +00:00
Chris Lattner	469640e506	Add explicit #includes of <iostream> llvm-svn: 25509	2006-01-22 22:53:01 +00:00
Robert Bocchino	027c18da98	ConstantFoldLoadThroughGEPConstantExpr wasn't handling pointers to packed types correctly. llvm-svn: 25470	2006-01-19 23:53:23 +00:00
Chris Lattner	b98282d2d6	Make sure that cloning a module clones its target triple and dependent library list as well. This should help bugpoint. llvm-svn: 25424	2006-01-18 21:32:45 +00:00
Robert Bocchino	e6336a9b69	Constant folding support for the insertelement operation. llvm-svn: 25407	2006-01-17 20:07:07 +00:00
Reid Spencer	b4f9a6f110	For PR411: This patch is an incremental step towards supporting a flat symbol table. It de-overloads the intrinsic functions by providing type-specific intrinsics and arranging for automatically upgrading from the old overloaded name to the new non-overloaded name. Specifically: llvm.isunordered -> llvm.isunordered.f32, llvm.isunordered.f64 llvm.sqrt -> llvm.sqrt.f32, llvm.sqrt.f64 llvm.ctpop -> llvm.ctpop.i8, llvm.ctpop.i16, llvm.ctpop.i32, llvm.ctpop.i64 llvm.ctlz -> llvm.ctlz.i8, llvm.ctlz.i16, llvm.ctlz.i32, llvm.ctlz.i64 llvm.cttz -> llvm.cttz.i8, llvm.cttz.i16, llvm.cttz.i32, llvm.cttz.i64 New code should not use the overloaded intrinsic names. Warnings will be emitted if they are used. llvm-svn: 25366	2006-01-16 21:12:35 +00:00
Chris Lattner	0841fb1d4c	Teach the inliner to update the CallGraph itself, and have it add edges to llvm.stacksave/restore when it inserts calls to them. llvm-svn: 25320	2006-01-14 20:07:50 +00:00
Nate Begeman	82049eba2c	Add bswap intrinsics as documented in the Language Reference llvm-svn: 25309	2006-01-14 01:25:24 +00:00
Chris Lattner	5fba6e6696	it is ok to dce stacksave. llvm-svn: 25295	2006-01-13 21:31:54 +00:00
Chris Lattner	2be0607a8d	If inlining a call to a function that contains dynamic allocas, wrap the resultant code with llvm.stacksave/llvm.stackrestore intrinsics. llvm-svn: 25286	2006-01-13 19:34:14 +00:00
Chris Lattner	e24f79a032	Use ClonedCodeInfo to avoid another walk over the inlined code, this this time in common C cases. llvm-svn: 25285	2006-01-13 19:18:11 +00:00
Chris Lattner	19e6a08d78	Use the ClonedCodeInfo object to avoid scans of the inlined code when it doesn't contain any calls. This is a fairly common case for C++ code, so it will probably speed up the inliner marginally in these cases. llvm-svn: 25284	2006-01-13 19:15:15 +00:00
Chris Lattner	908d79556d	Refactor a bunch of invoke handling stuff out into a new function "HandleInlinedInvoke". No functionality change. llvm-svn: 25283	2006-01-13 19:05:59 +00:00
Chris Lattner	edad1288fd	Allow the code cloning interfaces to capture some important info about the code being cloned if the client wants. llvm-svn: 25281	2006-01-13 18:39:17 +00:00
Chris Lattner	257492c0ab	Fix a bug I noticed by inspection: if the first instruction in the inlined function was not an alloca, we wouldn't check the entry block for any allocas, leading to increased stack space in some cases. In practice, allocas are almost always at the top of the block, so this was never noticed. llvm-svn: 25280	2006-01-13 18:16:48 +00:00
Chris Lattner	0770d8e326	Preserve and update ETForest. Patch by Daniel Berlin llvm-svn: 25203	2006-01-11 05:11:13 +00:00
Robert Bocchino	230044839d	Added support for the extractelement operation. llvm-svn: 25181	2006-01-10 19:05:34 +00:00
Chris Lattner	cda4aa6eb4	Teach loopsimplify to update et-forest. Patch contributed by Daniel Berlin! llvm-svn: 25153	2006-01-09 08:03:08 +00:00
Chris Lattner	2820b8c855	Fix SimplifyCFG/2005-12-03-IncorrectPHIFold.ll llvm-svn: 24581	2005-12-03 18:25:58 +00:00
Chris Lattner	3e9e8bd25c	Implement a refinement to the mem2reg algorithm for cases where an alloca has a single def. In this case, look for uses that are dominated by the def and attempt to rewrite them to directly use the stored value. This speeds up mem2reg on these values and reduces the number of phi nodes inserted. This should address PR665. llvm-svn: 24411	2005-11-18 07:31:42 +00:00
Chris Lattner	31dc3827d3	This needs proper dominance llvm-svn: 24410	2005-11-18 07:29:44 +00:00
Chris Lattner	479911f971	Fix #include order llvm-svn: 24044	2005-10-27 16:34:00 +00:00
John Criswell	fe5f33b120	Move some constant folding code shared by Analysis and Transform passes into the LLVMAnalysis library. This allows LLVMTranform and LLVMTransformUtils to be archives and linked with LLVMAnalysis.a, which provides any missing definitions. llvm-svn: 24036	2005-10-27 15:54:34 +00:00
John Criswell	94b7bea733	1. Remove libraries no longer created from the list of libraries linked into the SparcV9 JIT. 2. Make LLVMTransformUtils a relinked object file and always link it before LLVMAnalysis.a. These two libraries have circular dependencies on each other which creates problem when building the SparcV9 JIT. This change fixes the dependency on all platforms problems with a minimum of fuss. llvm-svn: 24023	2005-10-26 20:35:13 +00:00
Jeff Cohen	2b8cbf319c	Update Visual Studio projects to reflect moved file. llvm-svn: 23998	2005-10-26 05:36:51 +00:00
Chris Lattner	bde3845548	DONT_BUILD_RELINKED is gone and implied by BUILD_ARCHIVE now llvm-svn: 23940	2005-10-24 02:26:13 +00:00
Chris Lattner	8c087e962c	Only build .a file versions of these libraries, instead of .a and .o versions. This should speed up build times. llvm-svn: 23933	2005-10-24 01:59:48 +00:00
Chris Lattner	20b0754c41	Fix DemoteRegToStack on an invoke. This fixes PR634. llvm-svn: 23618	2005-10-04 00:44:01 +00:00
Chris Lattner	4c3b2b536c	Clean up the code a bit. Use isInstructionTriviallyDead to be more aggressive and more correct than use_empty(). This fixes PR635 and SimplifyCFG/2005-10-02-InvokeSimplify.ll llvm-svn: 23616	2005-10-03 23:43:43 +00:00
Chris Lattner	ea7214b23d	Constant fold llvm.sqrt llvm-svn: 23487	2005-09-28 01:34:32 +00:00
Chris Lattner	16cd356fb2	allow demotion to volatile values, add support for invoke llvm-svn: 23473	2005-09-27 19:39:00 +00:00
Chris Lattner	c13c7b9376	Move the ConstantFoldLoadThroughGEPConstantExpr function out of the InstCombine pass. llvm-svn: 23444	2005-09-26 05:27:10 +00:00
Chris Lattner	499e33646e	remove some debugging code llvm-svn: 23411	2005-09-23 18:49:09 +00:00
Chris Lattner	c59a371d45	Fold two consequtive branches that share a common destination between them. This implements SimplifyCFG/branch-fold.ll, and is useful on ?:/min/max heavy code llvm-svn: 23410	2005-09-23 18:47:20 +00:00
Chris Lattner	3a978bf66d	simplify some logic further llvm-svn: 23408	2005-09-23 07:23:18 +00:00
Chris Lattner	cc14ebc17b	pull a bunch of logic out of SimplifyCFG into a helper fn llvm-svn: 23407	2005-09-23 06:39:30 +00:00
Chris Lattner	6c70106053	Start threading across blocks with code in them, so long as the code does not define a value that is used outside of it's block. This catches many more simplifications, e.g. 854 in 176.gcc, 137 in vpr, etc. This implements branch-phi-thread.ll:test3.ll llvm-svn: 23397	2005-09-20 01:48:40 +00:00
Chris Lattner	f0bd8d0107	Implement merging of blocks with the same condition if the block has multiple predecessors. This implements branch-phi-thread.ll::test1 llvm-svn: 23395	2005-09-20 00:43:16 +00:00
Chris Lattner	049cb4482f	Reject a case we don't handle yet llvm-svn: 23393	2005-09-19 23:57:04 +00:00
Chris Lattner	a160924d57	remove debugging code :-/ llvm-svn: 23392	2005-09-19 23:50:15 +00:00
Chris Lattner	748f903046	Implement SimplifyCFG/branch-phi-thread.ll, the most trivial case of threading control across branches with determined outcomes. More generality to follow. This triggers a couple thousand times in specint. llvm-svn: 23391	2005-09-19 23:49:37 +00:00
Chris Lattner	89c1dfc733	Teach SplitCriticalEdge to update LoopInfo if it is alive. This fixes a problem in LoopStrengthReduction, where it would split critical edges then confused itself with outdated loop information. llvm-svn: 22776	2005-08-13 01:38:43 +00:00
Chris Lattner	b7ebe65c56	Change break critical edges to not remove, then insert, PHI node entries. Instead, just update the BB in-place. This is both faster, and it prevents split-critical-edges from shuffling the PHI argument list unneccesarily. llvm-svn: 22765	2005-08-12 21:58:07 +00:00
Chris Lattner	257efb2ad3	This code can handle non-dominating instructions llvm-svn: 22667	2005-08-05 00:57:45 +00:00
Nate Begeman	b392321cae	Fix a fixme in CondPropagate.cpp by moving a PhiNode optimization into BasicBlock's removePredecessor routine. This requires shuffling around the definition and implementation of hasContantValue from Utils.h,cpp into Instructions.h,cpp llvm-svn: 22664	2005-08-04 23:24:19 +00:00
Chris Lattner	d683bdd0f8	Fix Transforms/SimplifyCFG/2005-08-03-PHIFactorCrash.ll, a problem that occurred while bugpointing another testcase llvm-svn: 22621	2005-08-03 17:59:45 +00:00
Chris Lattner	2dbf1960ff	Finally, add the required constraint checks to fix Transforms/SimplifyCFG/2005-08-01-PHIUpdateFail.ll the right way llvm-svn: 22615	2005-08-03 00:59:12 +00:00
Chris Lattner	908036942c	Simplify some code, add the correct pred checks llvm-svn: 22613	2005-08-03 00:38:27 +00:00
Chris Lattner	982b75c061	Refactor code out of PropagatePredecessorsForPHIs, turning it into a pure function with no side-effects llvm-svn: 22612	2005-08-03 00:29:26 +00:00
Chris Lattner	1f047fd513	use splice instead of remove/insert to avoid some symtab operations llvm-svn: 22611	2005-08-03 00:23:42 +00:00
Chris Lattner	76dc204488	move two functions up in the file, use SafeToMergeTerminators to eliminate some duplicated code llvm-svn: 22610	2005-08-03 00:19:45 +00:00
Chris Lattner	733d6704ce	Rip some code out of the main SimplifyCFG function into a subfunction and call it from the only place it is live. No functionality changes. llvm-svn: 22609	2005-08-03 00:11:16 +00:00
Chris Lattner	ac594de8dc	Disable this patch: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20050801/027345.html This breaks real programs and only fixes an obscure regression testcase. A real fix is in development. llvm-svn: 22606	2005-08-02 23:31:38 +00:00
Chris Lattner	eee90f7eb4	Change a place to use an arbitrary value instead of null, when possible llvm-svn: 22605	2005-08-02 23:29:23 +00:00
Chris Lattner	4fd3e16cbd	This code was very close, but not quite right. It did not take into consideration the case where a reference in an unreachable block could occur. This fixes Transforms/SimplifyCFG/2005-08-01-PHIUpdateFail.ll, something I ran into while bugpoint'ing another pass. llvm-svn: 22584	2005-08-02 03:24:05 +00:00
Jeff Cohen	5f4ef3c5a8	Eliminate all remaining tabs and trailing spaces. llvm-svn: 22523	2005-07-27 06:12:32 +00:00
Chris Lattner	937c71f2b3	Fix PR590 and Transforms/Mem2Reg/2005-06-30-ReadBeforeWrite.ll. The optimization for locally used allocas was not safe for allocas that were read before they were written. This change disables that optimization in that case. llvm-svn: 22318	2005-06-30 07:29:44 +00:00
Andrew Lenharth	d4b103107e	prevent DCE of vaarg intrinsics. This should take care of most regressions llvm-svn: 22263	2005-06-19 14:41:20 +00:00
Chris Lattner	5e735294bf	Don't crash on: X = phi (X, X). This fixes PR584 and Transforms/SimplifyCFG/2005-06-16-PHICrash.ll llvm-svn: 22232	2005-06-17 01:45:53 +00:00
Chris Lattner	19f9f32a5c	Fix Transforms/SimplifyCFG/switch-simplify-crash.ll llvm-svn: 22158	2005-05-20 22:19:54 +00:00
Chris Lattner	bcefcf8552	Make sure to preserve the calling convention when changing an invoke into a call. This fixes Prolangs-C++/deriv2, kimwitu++, and Misc-C++/bigfib on X86 with -enable-x86-fastcc. llvm-svn: 22023	2005-05-14 12:21:56 +00:00
Chris Lattner	21d1dde72a	wrap long lines, preserve calling conventions when cloning functions and turning calls into invokes llvm-svn: 21797	2005-05-09 01:04:34 +00:00
Chris Lattner	7effa0ed06	BAD typeo which caused many testsuite failures last night. Note to self, do not change code after testing it without retesting! llvm-svn: 21741	2005-05-06 17:13:16 +00:00
Chris Lattner	9f3dced2c7	Implement Transforms/Inline/inline-tail.ll llvm-svn: 21736	2005-05-06 06:47:52 +00:00
Chris Lattner	a36d525741	DCE intrinsic instructions without side effects. llvm-svn: 21719	2005-05-06 05:27:34 +00:00
Reid Spencer	c968ea0495	Remove optimizations that don't require both operands to be constant. These are moved to simplify-libcalls pass. llvm-svn: 21614	2005-04-29 05:55:35 +00:00
Reid Spencer	ed55a6b5e0	* Add constant folding for additional floating point library calls such as sinh, cosh, etc. * Make the name comparisons for the fp libcalls a little more efficient by switching on the first character of the name before doing comparisons. llvm-svn: 21611	2005-04-28 23:01:59 +00:00
Jeff Cohen	82639853c0	Eliminate tabs and trailing spaces llvm-svn: 21480	2005-04-23 21:38:35 +00:00
Misha Brukman	b1c9317bb4	Remove trailing whitespace llvm-svn: 21427	2005-04-21 23:48:37 +00:00
Chris Lattner	7ceb081f3f	Improve doxygen documentation, patch contributed by Evan Jones! llvm-svn: 21393	2005-04-21 16:04:49 +00:00
Chris Lattner	b38b443b15	Teach simplifycfg that setcc is cheap and non-trapping, so that it can convert this: %tmp.1 = seteq int %i, 0 ; <bool> [#uses=1] br bool %tmp.1, label %shortcirc_done, label %shortcirc_next shortcirc_next: ; preds = %entry %tmp.4 = seteq int %j, 0 ; <bool> [#uses=1] br label %shortcirc_done shortcirc_done: ; preds = %shortcirc_next, %entry %shortcirc_val = phi bool [ %tmp.4, %shortcirc_next ], [ true, %entry ] ; <bool> [#uses=1] to this: %tmp.1 = seteq int %i, 0 ; <bool> [#uses=1] %tmp.4 = seteq int %j, 0 ; <bool> [#uses=1] %shortcirc_val = select bool %tmp.1, bool true, bool %tmp.4 ; <bool> [#uses=1] ... which is later simplified by instcombine into an or. llvm-svn: 21388	2005-04-21 05:31:13 +00:00
Chris Lattner	95f16a3ac4	Get rid of this for_each loop llvm-svn: 21253	2005-04-12 18:51:33 +00:00
Chris Lattner	7b9020a059	Fix the missing symbols problem Bill was hitting. Patch contributed by Bill Wendling!! llvm-svn: 20649	2005-03-17 15:38:16 +00:00
Chris Lattner	531f9e92d4	This mega patch converts us from using Function::a{iterator\|begin\|end} to using Function::arg_{iterator\|begin\|end}. Likewise Module::g* -> Module::global_*. This patch is contributed by Gabor Greif, thanks! llvm-svn: 20597	2005-03-15 04:54:21 +00:00
Chris Lattner	6d0a24c608	second argument to Value::setName is now gone. llvm-svn: 20463	2005-03-05 19:05:20 +00:00
Chris Lattner	0ce80cd542	Fix spelling, patch contributed by Gabor Greif! llvm-svn: 20343	2005-02-27 06:18:25 +00:00
Chris Lattner	cc6d75fddf	remove extraneous cast llvm-svn: 20334	2005-02-26 18:33:28 +00:00
Chris Lattner	1cca959e5d	Implement Transforms/SimplifyCFG/switch_thread.ll This does a simple form of "jump threading", which eliminates CFG edges that are provably dead. This triggers 90 times in the external tests, and eliminating CFG edges is always always a good thing! :) llvm-svn: 20300	2005-02-24 06:17:52 +00:00
Chris Lattner	a35dfcedd3	switchinst ctor now takes a hint for the number of cases that it will have. llvm-svn: 19898	2005-01-29 00:38:26 +00:00
Chris Lattner	3215bb6049	Implement SimplifyCFG/DeadSetCC.ll SimplifyCFG is one of those passes that we use for final cleanup: it should not rely on other passes to clean up its garbage. This fixes the "why are trivially dead setcc's in the output of gccas" problem. llvm-svn: 19212	2005-01-01 16:02:12 +00:00
Chris Lattner	ffefea0772	The split bb is really the exit of the old function llvm-svn: 18799	2004-12-11 16:59:54 +00:00
Chris Lattner	4fc998da2e	Fix Regression/Transforms/SimplifyCFG/2004-12-10-SimplifyCFGCrash.ll, and the failure on make_dparser last night. llvm-svn: 18766	2004-12-10 17:42:31 +00:00
Chris Lattner	019445715e	Squelch warning llvm-svn: 18381	2004-11-30 07:47:34 +00:00
Chris Lattner	868ae13dc0	Fix test/Regression/Transforms/LICM/2004-09-14-AliasAnalysisInvalidate.llx This only fails on darwin or on X86 under valgrind. llvm-svn: 18377	2004-11-30 07:01:15 +00:00
Chris Lattner	fd8cbc257e	Alkis noticed that this variable is dead. Thanks! llvm-svn: 18369	2004-11-30 04:01:44 +00:00
Chris Lattner	389cfac0d1	If we have something like this: if (x) { code ... } else { code ... } Turn it into: code if (x) { ... } else { ... } This reduces code size and in some common cases allows us to completely eliminate the conditional. This turns several if/then/else blocks in loops into straightline code in 179.art, turning the loops into single basic blocks (good for modsched even!). Maybe now brg will leave me alone ;-) llvm-svn: 18366	2004-11-30 00:29:14 +00:00
Misha Brukman	72a57c3259	Allow constructor parameter to override aggregating args; fix spacing llvm-svn: 18028	2004-11-20 02:20:27 +00:00
Chris Lattner	9b0291b18d	Fix: CodeExtractor/2004-11-12-InvokeExtract.ll llvm-svn: 17699	2004-11-13 00:06:45 +00:00
Chris Lattner	5bcca6058a	Fix a bug where the code extractor would get a bit confused handling invoke instructions, setting DefBlock to a block it did not have dom info for. llvm-svn: 17697	2004-11-12 23:50:44 +00:00
Chris Lattner	93d1e39f3e	Do not compute the predecessor list for a block unless we need it. This speeds up simplifycfg on this program, from 44.87s to 0.29s (with a profiled build): #define CL0(a) case a: goto c; #define CL1(a) CL0(a##0) CL0(a##1) CL0(a##2) CL0(a##3) CL0(a##4) CL0(a##5) \ CL0(a##6) CL0(a##7) CL0(a##8) CL0(a##9) #define CL2(a) CL1(a##0) CL1(a##1) CL1(a##2) CL1(a##3) CL1(a##4) CL1(a##5) \ CL1(a##6) CL1(a##7) CL1(a##8) CL1(a##9) #define CL3(a) CL2(a##0) CL2(a##1) CL2(a##2) CL2(a##3) CL2(a##4) CL2(a##5) \ CL2(a##6) CL2(a##7) CL2(a##8) CL2(a##9) #define CL4(a) CL3(a##0) CL3(a##1) CL3(a##2) CL3(a##3) CL3(a##4) CL3(a##5) \ CL3(a##6) CL3(a##7) CL3(a##8) CL3(a##9) void f(); void a() { int b; c: switch (b) { CL4(1) } } This testcase is contrived to expose N^2 behavior, but this patch should speedup simplifycfg on any programs that use large switch statements. This testcase comes from GCC PR17895. llvm-svn: 17389	2004-11-01 06:53:58 +00:00
Reid Spencer	57cbe39d1e	Change Library Names Not To Conflict With Others When Installed llvm-svn: 17286	2004-10-27 23:18:45 +00:00
Reid Spencer	fad217c847	Eliminate compilation warning on uninitialized variable. llvm-svn: 17163	2004-10-22 16:10:39 +00:00
Reid Spencer	c1c320c335	We won't use automake llvm-svn: 17155	2004-10-22 03:35:04 +00:00
Reid Spencer	6a11a75f31	Initial automake generated Makefile template llvm-svn: 17136	2004-10-18 23:55:41 +00:00
Reid Spencer	ce0783318b	Correction to allow compilation with Visual C++. Patch contributed by Morten Ofstad. Thanks Morten! llvm-svn: 17123	2004-10-18 14:38:48 +00:00
Chris Lattner	5edb2f32d0	Simplify code by deleting instructions that preceed unreachable instructions. Simplify code by simplifying terminators that branch to blocks that start with an unreachable instruction. llvm-svn: 17116	2004-10-18 04:07:22 +00:00
Chris Lattner	34ae670706	Fix a bug that occurs when the constant value is the result of an invoke. In particular, invoke ret values are only live in the normal dest of the invoke not in the unwind dest. llvm-svn: 17108	2004-10-18 01:21:17 +00:00
Chris Lattner	6e79e55aea	Fix Regression/Transforms/Inline/2004-10-17-InlineFunctionWithoutReturn.ll If a function had no return instruction in it, and the result of the inlined call instruction was used, we would crash. llvm-svn: 17104	2004-10-17 23:21:07 +00:00
Chris Lattner	215c7ebaa6	When inserting PHI nodes, don't insert any phi nodes that are obviously unneccesary. This allows us to delete several hundred phi nodes of the form PHI(x,x,x,undef) from 253.perlbmk and probably other programs as well. This implements Mem2Reg/UndefValuesMerge.ll llvm-svn: 17098	2004-10-17 21:25:56 +00:00
Chris Lattner	96db59e48a	Enhance hasConstantValue to ignore undef values in phi nodes. This allows it to think that PHI[4, undef] == 4. llvm-svn: 17096	2004-10-17 21:23:26 +00:00
Chris Lattner	98e541457b	Add support for unreachable llvm-svn: 17056	2004-10-16 18:21:33 +00:00
Chris Lattner	7e6d4a12b5	Add support for UndefValue llvm-svn: 17046	2004-10-16 18:10:31 +00:00
Chris Lattner	c0e2e82477	When promoting mem2reg, make uninitialized values become undef isntead of 0. llvm-svn: 17045	2004-10-16 18:10:06 +00:00
Chris Lattner	45c35b1d1f	When converting phi nodes into select instructions, we shouldn't promote PHI nodes unless we KNOW that we are able to promote all of them. This fixes: test/Regression/Transforms/SimplifyCFG/PhiNoEliminate.ll llvm-svn: 16973	2004-10-14 05:13:36 +00:00
Reid Spencer	ace94df71f	Update to reflect changes in Makefile rules. llvm-svn: 16950	2004-10-13 11:46:52 +00:00
Reid Spencer	97327f05fc	Initial version of automake Makefile.am file. llvm-svn: 16893	2004-10-10 22:20:40 +00:00
Chris Lattner	879ce7894c	Do not insert trivially dead select instructions, which allows us to potentially fold more in one pass. llvm-svn: 16583	2004-09-29 05:43:32 +00:00
Chris Lattner	855a4ff4dd	Remove a whole bunch of horrible hacky code that was used to promote allocas whose addresses where used by trivial phi nodes and select instructions. This is now performed by the instcombine pass, which is more powerful, is much simpler, and is faster. This allows the deletion of a bunch of code, two FIXME's and two gotos. llvm-svn: 16406	2004-09-19 18:51:51 +00:00
Chris Lattner	475dc2c93d	Make sure to remove the Select instruction as well llvm-svn: 16389	2004-09-18 00:32:40 +00:00
Reid Spencer	6614946443	Convert code to compile with vc7.1. Patch contributed by Paolo Invernizzi. Thanks Paolo! llvm-svn: 16368	2004-09-15 17:06:42 +00:00
Chris Lattner	9b9932bd94	If given an AliasSetTracker object to update, update it. llvm-svn: 16347	2004-09-15 01:02:54 +00:00
Alkis Evlogimenos	a5c04ee50f	Fixes to make LLVM compile with vc7.1. Patch contributed by Paolo Invernizzi! llvm-svn: 16152	2004-09-03 18:19:51 +00:00
Reid Spencer	7c16caa336	Changes For Bug 352 Move include/Config and include/Support into include/llvm/Config, include/llvm/ADT and include/llvm/Support. From here on out, all LLVM public header files must be under include/llvm/. llvm-svn: 16137	2004-09-01 22:55:40 +00:00
Chris Lattner	56273827b1	If we are extracting a block that has multiple successors that are the same block (common in a switch), make sure to remove extra edges in successor blocks. This fixes CodeExtractor/2004-08-12-BlockExtractPHI.ll and should be pulled into LLVM 1.3 (though the regression test need not be, as that would require pulling in the LoopExtract.cpp changes). llvm-svn: 15717	2004-08-13 03:27:07 +00:00
Chris Lattner	f06b043204	When we code extract some stuff, leave the codeRepl block in the place where the extracted code was, instead of putting it at the end of the function llvm-svn: 15716	2004-08-13 03:17:39 +00:00
Chris Lattner	889d346e6e	Add value mapper support for select constant exprs. This should fix a bug Nate ran into when bugpointing siod. This fix should go into LLVM 1.3 llvm-svn: 15712	2004-08-13 02:43:19 +00:00
Chris Lattner	7f1c7ede5b	Fix code extraction of unwind blocks. This fixed bugs that bugpoint can run into. This should go into 1.3 llvm-svn: 15679	2004-08-12 03:17:02 +00:00
Alkis Evlogimenos	832437255d	Stop using getValues(). llvm-svn: 15487	2004-08-04 08:44:43 +00:00
Chris Lattner	7471b96a05	Expose this as a functionpass llvm-svn: 15369	2004-07-31 10:01:58 +00:00
Misha Brukman	f4a410f907	Fix #includes of i*.h => Instructions.h as per PR403. llvm-svn: 15337	2004-07-29 17:30:57 +00:00
Misha Brukman	63b38bd2ed	Fix #includes of i*.h => Instructions.h as per PR403. llvm-svn: 15334	2004-07-29 17:30:56 +00:00
Alkis Evlogimenos	fd7a2d4477	Merge i*.h headers into Instructions.h as part of bug403. llvm-svn: 15325	2004-07-29 12:17:34 +00:00
Brian Gaeke	902dcf0729	These files don't need to include <iostream> since they include "Support/Debug.h". llvm-svn: 15089	2004-07-21 20:50:33 +00:00
Chris Lattner	45b50d14c9	Fix a serious code pessimization problem. If an inlined function has a single return, clone the 'ret' BB code into the block AFTER the inlined call, not the other way around. llvm-svn: 15030	2004-07-20 05:45:24 +00:00
Chris Lattner	5823ac1c21	Implement SimplifyCFG/BrUnwind.ll llvm-svn: 15022	2004-07-20 01:17:38 +00:00
Reid Spencer	3b4e83ec83	Remove an if statement that would never be reached. llvm-svn: 14968	2004-07-18 08:41:47 +00:00
Reid Spencer	cb3fb5d4f5	bug 122: - Replace ConstantPointerRef usage with GlobalValue usage llvm-svn: 14953	2004-07-18 00:44:37 +00:00
Reid Spencer	ef784f01dd	bug 122: - Minimize redundant isa<GlobalValue> usage llvm-svn: 14948	2004-07-18 00:32:14 +00:00
Chris Lattner	32c518e526	Progress on PR341 llvm-svn: 14840	2004-07-15 02:06:12 +00:00
Reid Spencer	eb04d9bcb4	Add #include <iostream> since Value.h does not #include it any more. llvm-svn: 14622	2004-07-04 12:19:56 +00:00
Misha Brukman	49bb82a4b8	Moved to lib/VMCore llvm-svn: 14348	2004-06-23 17:21:17 +00:00
Brian Gaeke	1ea8447089	Use new IsNAN() wrapper. llvm-svn: 14340	2004-06-23 00:25:35 +00:00
Chris Lattner	f12c4a3d37	FINALLY Fix a really nasty nondeterministic bug that has been haunting us since May 1st. In this code, the pred iterator was being invalidated sometimes causing the wrong entries to be added to PHI nodes. The fix for this is to defererence and safe the *PI value before we hack on branch instructions, which changes use/def chains, which SOMETIMES invalidates the iterator. llvm-svn: 14278	2004-06-21 07:19:01 +00:00
Chris Lattner	46f60890a3	Comment out the isnan stuff until we get a proper autoconf test for it breaking the build on sparc is not acceptable. llvm-svn: 14277	2004-06-21 06:17:21 +00:00
Chris Lattner	9734fd0980	Add some DEBUG output to the simplifycfg routines Fix another non-deterministic behavior, this one should actually speed up the code though as it was doing silly things. llvm-svn: 14258	2004-06-20 01:13:18 +00:00
Chris Lattner	4db0f8260a	Change to use the StableBasicBlockNumbering class llvm-svn: 14247	2004-06-19 08:42:40 +00:00
Chris Lattner	a52ab6f57f	Do not let the numbering of PHI nodes placed in the function depend on non-deterministic things like the ordering of blocks in the dominance frontier of a BB. Unfortunately, I don't know of a better way to solve this problem than to explicitly sort the BB's in function-order before processing them. This is guaranteed to slow the pass down a bit, but is absolutely necessary to get usable diffs between two different tools executing the mem2reg or scalarrepl pass. Before this, bazillions of spurious diff failures occurred all over the place due to the different order of processing PHIs: - %tmp.111 = getelementptr %struct.Connector_struct* %upcon.0.0, uint 0, uint 0 + %tmp.111 = getelementptr %struct.Connector_struct* %upcon.0.1, uint 0, uint 0 Now, the diffs match. llvm-svn: 14244	2004-06-19 07:40:14 +00:00
Chris Lattner	b2b151d297	Do not sort by the address of LLVM ConstantInt* objects. This produces nondeterministic results that depend on where these objects land in memory. Instead, sort by the value of the constant, which is stable. Before this patch, the -simplifycfg pass run from two different compilers could cause different code to be generated, though it was semantically the same: @@ -12258,8 +12258,8 @@ %s_addr.1 = phi sbyte* [ %s, %entry ], [ %inc.0, %no_exit ] ; <sbyte> [#uses=5] %tmp.1 = load sbyte %s_addr.1 ; <sbyte> [#uses=1] switch sbyte %tmp.1, label %no_exit [ - sbyte 0, label %loopexit sbyte 46, label %loopexit + sbyte 0, label %loopexit ] We need to stomp all of this stuff out. llvm-svn: 14243	2004-06-19 07:02:14 +00:00
Brian Gaeke	27b13253d9	I love the smell of a freshly broken PowerPC build in the morning. llvm-svn: 14206	2004-06-17 22:27:04 +00:00
Chris Lattner	f03f320b79	Fix compilation problem on freebsd. Problem noted by Vladimir Merzliakov in PR371 llvm-svn: 14203	2004-06-17 21:20:52 +00:00
Chris Lattner	6b7275996c	Rename Type::PrimitiveID to TypeId and ::getPrimitiveID() to ::getTypeID() llvm-svn: 14201	2004-06-17 18:19:28 +00:00
Brian Gaeke	20e09e5c7b	Um, did someone make a typo or something? llvm-svn: 14192	2004-06-15 23:09:50 +00:00
Chris Lattner	5a542aadc8	Remove support for the isnan intrinsic llvm-svn: 14186	2004-06-15 21:37:54 +00:00
Brian Gaeke	21370771ba	Quick hack to get this file compiling again on Mac OS X. The right thing to do is write an autoconf macro that checks whether __isnan or isnan actually works using the C++ compiler after #include <cmath>, instead of doing it the easy way with AC_CHECK_FUNCS(). llvm-svn: 14171	2004-06-14 06:33:19 +00:00
Alkis Evlogimenos	e395468ae5	Add constant folding capabilities to the isunordered intrinsic. llvm-svn: 14168	2004-06-13 01:23:56 +00:00
Chris Lattner	ec941f7abb	Constant fold the isnan intrinsic llvm-svn: 14150	2004-06-11 06:16:23 +00:00
Chris Lattner	c6e21fbd5c	Implement constant folding of fmod, which is used a lot in povray llvm-svn: 13823	2004-05-27 07:25:00 +00:00
Chris Lattner	06158d140c	Restructure call constant folding code a bit to make it simpler Add support for acos/asin/atan. 188.ammp contains three calls to acos with constant arguments. Constant folding it allows elimination of those 3 calls and three FP divisions of the results. llvm-svn: 13821	2004-05-27 06:26:28 +00:00
Alkis Evlogimenos	0eefdcd73f	Do not pass a null pointer if this instruction is not prepended or appended anywhere. llvm-svn: 13798	2004-05-26 22:50:28 +00:00
Alkis Evlogimenos	9e84b503f0	Use one destination constructor for the unconditional branch. llvm-svn: 13792	2004-05-26 21:38:14 +00:00
Reid Spencer	e7e9671cad	Convert to SymbolTable's new iteration interface. llvm-svn: 13754	2004-05-25 08:53:40 +00:00
Reid Spencer	abb6f008ca	Convert to SymbolTable's new lookup and iteration interfaces. llvm-svn: 13751	2004-05-25 08:52:20 +00:00
Brian Gaeke	72185765bc	Add CloneTraceInto(), which is based on (and has mostly the same effects as) CloneFunctionInto(). llvm-svn: 13601	2004-05-19 09:08:14 +00:00
Brian Gaeke	6182acf92a	Move RemapInstruction() to ValueMapper, so that it can be shared with CloneTrace, and because it is primarily an operation on ValueMaps. It is now a global (non-static) function which can be pulled in using ValueMapper.h. llvm-svn: 13600	2004-05-19 09:08:12 +00:00
Chris Lattner	66219abac7	Do not pass in the same argument to the extracted function more than once, and give the extracted function a more useful name than just foo_code. llvm-svn: 13493	2004-05-12 16:26:18 +00:00
Chris Lattner	13d2ddfe9c	Implement support for code extracting basic blocks that have a return instruction in them. llvm-svn: 13490	2004-05-12 16:07:41 +00:00
Chris Lattner	795c9933e2	Implement splitting of PHI nodes, allowing block extraction of BB's that have PHI node entries from multiple outside-the-region blocks. This also fixes extraction of the entry block in a function. Yaay. This has successfully block extracted all (but one) block from the score_move function in obsequi (out of 33). Hrm, I wonder which block the bug is in. :) llvm-svn: 13489	2004-05-12 15:29:13 +00:00
Chris Lattner	3b2917bfcf	* Pull some code out into the definedInRegion/definedInCaller methods * Add a stub for the severSplitPHINodes which will allow us to bbextract bb's with PHI nodes in them soon. * Remove unused arguments from findInputsOutputs * Dramatically simplify the code in findInputsOutputs. In particular, nothing really cares whether or not a PHI node is using something. * Move moveCodeToFunction to after emitCallAndSwitchStatement as that's the order they get called. * Fix a bug where we would code extract a region that included a call to vastart. Like 'alloca', calls to vastart must stay in the function that they are defined in. * Add some comments. llvm-svn: 13482	2004-05-12 06:01:40 +00:00
Chris Lattner	ffc4926263	Generate substantially better code when there are a limited number of exits from the extracted region. If the return has 0 or 1 exit blocks, the new function returns void. If it has 2 exits, it returns bool, otherwise it returns a ushort as before. This allows us to use a conditional branch instruction when there are two exit blocks, as often happens during block extraction. llvm-svn: 13481	2004-05-12 04:14:24 +00:00
Chris Lattner	3d1ca67fdd	Two minor improvements: 1. Get rid of the silly abort block. When doing bb extraction, we get one abort block for every block extracted, which is kinda annoying. 2. If the switch ends up having a single destination, turn it into an unconditional branch. I would like to add support for conditional branches, but to do this we will want to have the function return a bool instead of a ushort. llvm-svn: 13478	2004-05-12 03:22:33 +00:00
Chris Lattner	be6f06818c	Do not clone arbitrary condition instructions. llvm-svn: 13316	2004-05-02 05:19:36 +00:00
Chris Lattner	51a6dbcb65	Do not infinitely "unroll" single BB loops. llvm-svn: 13315	2004-05-02 05:02:03 +00:00
Chris Lattner	1e94ed606e	Dont' merge terminators that are needed to select PHI node values. llvm-svn: 13312	2004-05-02 01:00:44 +00:00
Chris Lattner	2e93c4275e	Implement SimplifyCFG/branch-cond-merge.ll Turning "if (A < B && B < C)" into "if (A < B & B < C)" llvm-svn: 13311	2004-05-01 23:35:43 +00:00
Chris Lattner	4cbd160b45	Fix my missing parens llvm-svn: 13307	2004-05-01 22:41:51 +00:00
Chris Lattner	88da6f7b52	Implement SimplifyCFG/branch-cond-prop.ll llvm-svn: 13306	2004-05-01 22:36:37 +00:00
Misha Brukman	3596f0a180	* Allow aggregating extracted function arguments (controlled by flag) * Commandline option (for now) controls that flag that is passed in llvm-svn: 13141	2004-04-23 23:54:17 +00:00
Alkis Evlogimenos	f68f40ea42	Include cerrno (gcc-3.4 fix) llvm-svn: 13091	2004-04-21 16:11:40 +00:00
Chris Lattner	a43312d30b	Add support for evaluation of exp/log/log10/pow llvm-svn: 13011	2004-04-16 22:35:33 +00:00
Brian Gaeke	174633b078	Include <cmath> for compatibility with gcc 3.0.x (the system compiler on Debian.) llvm-svn: 12986	2004-04-16 15:57:32 +00:00
Chris Lattner	0328d75c83	Fix Inline/2004-04-15-InlineDeletesCall.ll Basically we were using SimplifyCFG as a huge sledgehammer for a simple optimization. Because simplifycfg does so many things, we can't use it for this purpose. llvm-svn: 12977	2004-04-16 05:17:59 +00:00
Chris Lattner	ca52d0468e	Add a simple call constant propagation interface. llvm-svn: 12919	2004-04-13 19:28:52 +00:00
Chris Lattner	0aa565647c	Fold code like: if (C) V1 \|= V2; into: Vx = V1 \| V2; V1 = select C, V1, Vx when the expression can be evaluated unconditionally and is cheap to execute. This limited form of if conversion is quite handy in lots of cases. For example, it turns this testcase into straight-line code: int in0 ; int in1 ; int in2 ; int in3 ; int in4 ; int in5 ; int in6 ; int in7 ; int in8 ; int in9 ; int in10; int in11; int in12; int in13; int in14; int in15; long output; void mux(void) { output = (in0 ? 0x00000001 : 0) \| (in1 ? 0x00000002 : 0) \| (in2 ? 0x00000004 : 0) \| (in3 ? 0x00000008 : 0) \| (in4 ? 0x00000010 : 0) \| (in5 ? 0x00000020 : 0) \| (in6 ? 0x00000040 : 0) \| (in7 ? 0x00000080 : 0) \| (in8 ? 0x00000100 : 0) \| (in9 ? 0x00000200 : 0) \| (in10 ? 0x00000400 : 0) \| (in11 ? 0x00000800 : 0) \| (in12 ? 0x00001000 : 0) \| (in13 ? 0x00002000 : 0) \| (in14 ? 0x00004000 : 0) \| (in15 ? 0x00008000 : 0) ; } llvm-svn: 12798	2004-04-09 22:50:22 +00:00
Chris Lattner	3b3861d305	Implement ScalarRepl/select_promote.ll llvm-svn: 12779	2004-04-08 19:59:34 +00:00
Chris Lattner	eed034bcd3	Fix the obvious bug in my previous checkin llvm-svn: 12618	2004-04-02 18:15:10 +00:00
Chris Lattner	9f0db32625	Implement Transforms/SimplifyCFG/return-merge.ll This actually causes us to turn code like: return C ? A : B; into a select instruction. llvm-svn: 12617	2004-04-02 18:13:43 +00:00
Chris Lattner	c24019c825	Fix PR310 and TailDup/2004-04-01-DemoteRegToStack.llx llvm-svn: 12597	2004-04-01 20:28:45 +00:00
Chris Lattner	709f03e2dd	Fix linking of constant expr casts due to type resolution changes. With this and the other patches 253.perlbmk links again. llvm-svn: 12565	2004-03-31 02:58:28 +00:00
Chris Lattner	81bdcb90ce	Now that all the code generators support the select instruction, and the instcombine pass can eliminate many nasty cases of them, start generating them in the optimizers llvm-svn: 12545	2004-03-30 19:44:05 +00:00
Chris Lattner	232155dc1b	Fix bug: CodeExtractor/2004-03-17-MissedLiveIns.ll With this fix we now successfully extract all 149 loops from 256.bzip2 without crashing or miscompiling the program! llvm-svn: 12493	2004-03-18 05:56:32 +00:00
Chris Lattner	acd75986ee	Fix CodeExtractor/2004-03-17-UpdatePHIsOutsideRegion.ll llvm-svn: 12489	2004-03-18 05:38:31 +00:00
Chris Lattner	320d59f4cd	Seriously simplify and correct the PHI node handling code. llvm-svn: 12487	2004-03-18 05:28:49 +00:00
Chris Lattner	d8017a340d	Fix CodeExtractor/2004-03-17-OutputMismatch.ll llvm-svn: 12486	2004-03-18 04:12:05 +00:00
Chris Lattner	37de257ef0	Fix several bugs in the extractor: 1. Names were not put on the new arguments created (ok, this just helps sanity :) 2. Fix outgoing pointer values 3. Do not insert stores for values that had not been computed 4. Fix some wierd problems with the outset calculation This fixes CodeExtractor/2004-03-14-DominanceProblem.ll, making the extractor work on at least one simple case! llvm-svn: 12484	2004-03-18 03:49:40 +00:00
Chris Lattner	cee3404d0a	Prune #includes, moving the module interface to the front. Note that this exposed the fact that the header was not self-contained. There is a reason we do things :) llvm-svn: 12481	2004-03-18 03:15:29 +00:00
Chris Lattner	a078f47b39	Fix compilation of mesa, which I broke earlier today llvm-svn: 12465	2004-03-17 02:02:47 +00:00
Chris Lattner	bb1a2cc7ab	This code was both incredibly complex and incredibly broken. Fix it. llvm-svn: 12456	2004-03-16 23:23:11 +00:00
Chris Lattner	a64923ad26	Do not copy gigantic switch instructions llvm-svn: 12441	2004-03-16 19:45:22 +00:00
Chris Lattner	fb87cdecd8	Mostly cosmetic improvements. Do fix the bug where a global value was considered an input. llvm-svn: 12406	2004-03-15 01:26:44 +00:00
Chris Lattner	73ab1fa7c8	Assert that input blocks meet the invariants we expect Simplify the input/output finder. All elements of a basic block are instructions. Any used arguments are also inputs. An instruction can only be used by another instruction. llvm-svn: 12405	2004-03-15 01:18:23 +00:00
Chris Lattner	5b2072ecd3	No correctness fixes here, just minor qoi fixes: * Don't insert a branch to the switch instruction after the call, just make it a single block. * Insert the new alloca instructions in the entry block of the original function instead of having them execute dynamically * Don't make the default edge of the switch instruction go back to the switch. The loop extractor shouldn't create new loops! * Give meaningful names to the alloca slots and the reload instructions * Some minor code simplifications llvm-svn: 12402	2004-03-14 23:43:24 +00:00
Chris Lattner	b4d8bf365c	Simplify code a bit, and fix bug CodeExtractor/2004-03-14-NoSwitchSupport.ll This also implements a two minor improvements: * Don't insert live-out stores IN the region, insert them on the code path that exits the region * If the region is exited to the same block from multiple paths, share the switch statement entry, live-out store code, and the basic block. llvm-svn: 12401	2004-03-14 23:05:49 +00:00
Chris Lattner	9c431f6c44	Simplify the code a bit by making the collection of basic blocks to extract a member of the class. While we're at it, turn the collection into a set instead of a vector to improve efficiency and make queries simpler. llvm-svn: 12400	2004-03-14 22:34:55 +00:00
Chris Lattner	4fca71eb44	Minor random cleanups llvm-svn: 12382	2004-03-14 04:01:47 +00:00
Chris Lattner	3684469326	Verify functions as they are produced if -debug is specified. Reduce curly braceage llvm-svn: 12378	2004-03-14 03:17:22 +00:00
Chris Lattner	41ec709e00	Move to the IPO library. Utils shouldn't contain passes. llvm-svn: 12372	2004-03-14 02:32:27 +00:00
Chris Lattner	8eebc49884	DemoteRegToStack got moved from DemoteRegToStack.h to Local.h llvm-svn: 12368	2004-03-14 02:13:38 +00:00
Chris Lattner	cb015ee6c0	Add constant folding wrapper support for select instructions. llvm-svn: 12319	2004-03-12 05:53:03 +00:00
Misha Brukman	f44acae31e	Implement ExtractCodeRegion() llvm-svn: 12070	2004-03-02 00:20:57 +00:00
Misha Brukman	f272f9b3d5	Make a note that this is usually used via bugpoint. llvm-svn: 12068	2004-03-02 00:19:09 +00:00
Misha Brukman	5af2be7d09	* Add implementation of ExtractBasicBlock() * Add comments to ExtractLoop() llvm-svn: 12053	2004-03-01 18:28:34 +00:00
Chris Lattner	2de229f31b	Fix bug: test/Regression/Transforms/LowerInvoke/2004-02-29-PHICrash.llx ... which tickled the lowerinvoke pass because it used the BCE routines. llvm-svn: 12012	2004-02-29 22:24:41 +00:00
Chris Lattner	d3e6ae263c	Implement switch->br and br->switch folding by ripping out the switch->switch and br->br code and generalizing it. This allows us to compile code like this: int test(Instruction I) { if (isa<CastInst>(I)) return foo(7); else if (isa<BranchInst>(I)) return foo(123); else if (isa<UnwindInst>(I)) return foo(1241); else if (isa<SetCondInst>(I)) return foo(1); else if (isa<VAArgInst>(I)) return foo(42); return foo(-1); } into: int %_Z4testPN4llvm11InstructionE("struct.llvm::Instruction" %I) { entry: %tmp.1.i.i.i.i.i.i.i = getelementptr "struct.llvm::Instruction"* %I, long 0, ubyte 4 ; <uint> [#uses=1] %tmp.2.i.i.i.i.i.i.i = load uint %tmp.1.i.i.i.i.i.i.i ; <uint> [#uses=2] %tmp.2.i.i.i.i.i.i = seteq uint %tmp.2.i.i.i.i.i.i.i, 27 ; <bool> [#uses=0] switch uint %tmp.2.i.i.i.i.i.i.i, label %endif.0 [ uint 27, label %then.0 uint 2, label %then.1 uint 5, label %then.2 uint 14, label %then.3 uint 15, label %then.3 uint 16, label %then.3 uint 17, label %then.3 uint 18, label %then.3 uint 19, label %then.3 uint 32, label %then.4 ] ... As well as handling the cases in 176.gcc and many other programs more effectively. llvm-svn: 11964	2004-02-28 21:28:10 +00:00
Misha Brukman	8a2c28fdda	Right, it's really Extractor, not Extraction. llvm-svn: 11939	2004-02-28 03:37:58 +00:00
Misha Brukman	03a11340ff	A pass that uses the generic CodeExtractor to rip out every loop in every function, as long as the loop isn't the only one in that function. This should help debugging passes easier with BugPoint. llvm-svn: 11936	2004-02-28 03:33:01 +00:00
Misha Brukman	caa1a5abeb	A generic code extractor: given a list of BasicBlocks, it will rip them out into a new function, taking care of inputs and outputs. llvm-svn: 11935	2004-02-28 03:26:20 +00:00
Chris Lattner	21e941fbfd	turn things like: if (X == 0 \|\| X == 2) ...where the comparisons and branches are in different blocks... into a switch instruction. This comes up a lot in various programs, and works well with the switch/switch merging code I checked earlier. For example, this testcase: int switchtest(int C) { return C == 0 ? f(123) : C == 1 ? f(3123) : C == 4 ? f(312) : C == 5 ? f(1234): f(444); } is converted into this: switch int %C, label %cond_false.3 [ int 0, label %cond_true.0 int 1, label %cond_true.1 int 4, label %cond_true.2 int 5, label %cond_true.3 ] instead of a whole bunch of conditional branches. Admittedly the code is ugly, and incomplete. To be complete, we need to add br -> switch merging and switch -> br merging. For example, this testcase: struct foo { int Q, R, Z; }; #define A (X->Q+X->R * 123) int test(struct foo X) { return A == 123 ? X1() : A == 12321 ? X2(): (A == 111 \|\| A == 222) ? X3() : A == 875 ? X4() : X5(); } Gets compiled to this: switch int %tmp.7, label %cond_false.2 [ int 123, label %cond_true.0 int 12321, label %cond_true.1 int 111, label %cond_true.2 int 222, label %cond_true.2 ] ... cond_false.2: ; preds = %entry %tmp.52 = seteq int %tmp.7, 875 ; <bool> [#uses=1] br bool %tmp.52, label %cond_true.3, label %cond_false.3 where the branch could be folded into the switch. This kind of thing occurs ALL OF THE TIME, especially in programs like 176.gcc, which is a horrible mess of code. It contains stuff like shudder*: #define SWITCH_TAKES_ARG(CHAR) \ ( (CHAR) == 'D' \ \|\| (CHAR) == 'U' \ \|\| (CHAR) == 'o' \ \|\| (CHAR) == 'e' \ \|\| (CHAR) == 'u' \ \|\| (CHAR) == 'I' \ \|\| (CHAR) == 'm' \ \|\| (CHAR) == 'L' \ \|\| (CHAR) == 'A' \ \|\| (CHAR) == 'h' \ \|\| (CHAR) == 'z') and #define CONST_OK_FOR_LETTER_P(VALUE, C) \ ((C) == 'I' ? SMALL_INTVAL (VALUE) \ : (C) == 'J' ? SMALL_INTVAL (-(VALUE)) \ : (C) == 'K' ? (unsigned)(VALUE) < 32 \ : (C) == 'L' ? ((VALUE) & 0xffff) == 0 \ : (C) == 'M' ? integer_ok_for_set (VALUE) \ : (C) == 'N' ? (VALUE) < 0 \ : (C) == 'O' ? (VALUE) == 0 \ : (C) == 'P' ? (VALUE) >= 0 \ : 0) and #define LEGITIMIZE_ADDRESS(X,OLDX,MODE,WIN) \ { \ if (GET_CODE (X) == PLUS && CONSTANT_ADDRESS_P (XEXP (X, 1))) \ (X) = gen_rtx (PLUS, SImode, XEXP (X, 0), \ copy_to_mode_reg (SImode, XEXP (X, 1))); \ if (GET_CODE (X) == PLUS && CONSTANT_ADDRESS_P (XEXP (X, 0))) \ (X) = gen_rtx (PLUS, SImode, XEXP (X, 1), \ copy_to_mode_reg (SImode, XEXP (X, 0))); \ if (GET_CODE (X) == PLUS && GET_CODE (XEXP (X, 0)) == MULT) \ (X) = gen_rtx (PLUS, SImode, XEXP (X, 1), \ force_operand (XEXP (X, 0), 0)); \ if (GET_CODE (X) == PLUS && GET_CODE (XEXP (X, 1)) == MULT) \ (X) = gen_rtx (PLUS, SImode, XEXP (X, 0), \ force_operand (XEXP (X, 1), 0)); \ if (GET_CODE (X) == PLUS && GET_CODE (XEXP (X, 0)) == PLUS) \ (X) = gen_rtx (PLUS, Pmode, force_operand (XEXP (X, 0), NULL_RTX),\ XEXP (X, 1)); \ if (GET_CODE (X) == PLUS && GET_CODE (XEXP (X, 1)) == PLUS) \ (X) = gen_rtx (PLUS, Pmode, XEXP (X, 0), \ force_operand (XEXP (X, 1), NULL_RTX)); \ if (GET_CODE (X) == SYMBOL_REF \|\| GET_CODE (X) == CONST \ \|\| GET_CODE (X) == LABEL_REF) \ (X) = legitimize_address (flag_pic, X, 0, 0); \ if (memory_address_p (MODE, X)) \ goto WIN; } and others. These macros get used multiple times of course. These are such lovely candidates for macros, aren't they? :) This code also nicely handles LLVM constructs that look like this: if (isa<CastInst>(I)) ... else if (isa<BranchInst>(I)) ... else if (isa<SetCondInst>(I)) ... else if (isa<UnwindInst>(I)) ... else if (isa<VAArgInst>(I)) ... where the isa can obviously be a dyn_cast as well. Switch instructions are a good thing. llvm-svn: 11870	2004-02-26 07:13:46 +00:00
Chris Lattner	90ea78edba	If a block is made dead, make sure to promptly remove it. llvm-svn: 11799	2004-02-24 16:09:21 +00:00
Chris Lattner	a2ab489135	Implement SimplifyCFG/switch_switch_fold.ll This case occurs many times in various benchmarks, especially when combined with the previous patch. This allows it to get stuff like: if (X == 4 \|\| X == 3) if (X == 5 \|\| X == 8) and switch (X) { case 4: case 5: case 6: if (X == 4 \|\| X == 5) llvm-svn: 11797	2004-02-24 07:23:58 +00:00
Chris Lattner	3cd98f054a	Rearrange code a bit llvm-svn: 11793	2004-02-24 05:54:22 +00:00
Chris Lattner	6f4b45acf5	Implement: test/Regression/Transforms/SimplifyCFG/switch_create.ll This turns code like this: if (X == 4 \| X == 7) and if (X != 4 & X != 7) into switch instructions. llvm-svn: 11792	2004-02-24 05:38:11 +00:00
Chris Lattner	693e393fee	Fix PR245: Linking weak and strong global variables is dependent on link order llvm-svn: 11565	2004-02-17 21:56:04 +00:00
Chris Lattner	e42732e75f	Implement test/Regression/Transforms/SimplifyCFG/UncondBranchToReturn.ll, see the testcase for the reasoning. llvm-svn: 11496	2004-02-16 06:35:48 +00:00
Chris Lattner	76b2ff4ded	Adjustments to support the new ConstantAggregateZero class llvm-svn: 11474	2004-02-15 05:55:15 +00:00
Chris Lattner	283ffdfac5	Fix compilation of 126.gcc: intrinsic functions cannot throw, so they are not allowed in invoke instructions. Thus, if we are inlining a call to an intrinsic function into an invoke site, we don't need to turn the call into an invoke! llvm-svn: 11384	2004-02-13 16:47:35 +00:00
Chris Lattner	18d1f19fba	Implement SimplifyCFG/PhiEliminate.ll Having a proper 'select' instruction would allow the elimination of a lot of the special case cruft in this patch, but we don't have one yet. llvm-svn: 11307	2004-02-11 03:36:04 +00:00
Chris Lattner	838b845781	The hasConstantReferences predicate always returns false. llvm-svn: 11301	2004-02-11 01:17:07 +00:00
Chris Lattner	fae8ab3088	rename the "exceptional" destination of an invoke instruction to the 'unwind' dest llvm-svn: 11202	2004-02-08 21:44:31 +00:00
Chris Lattner	39ad6f2772	Minor speedup, don't query ValueMap each time through the loop llvm-svn: 11123	2004-02-04 21:44:26 +00:00
Chris Lattner	6f8865bf9f	Two changes: 1. Don't scan to the end of alloca instructions in the caller function to insert inlined allocas, just insert at the top. This saves a lot of time inlining into functions with a lot of allocas. 2. Use splice to move the alloca instructions over, instead of remove/insert. This allows us to transfer a block at a time, and eliminates a bunch of silly symbol table manipulations. This speeds up the inliner on the testcase in PR209 from 1.73s -> 1.04s (67%) llvm-svn: 11118	2004-02-04 21:33:42 +00:00
Chris Lattner	0fa8c7c321	Optimize the case where we are inlining a function that contains only one basic block, and that basic block ends with a return instruction. In this case, we can just splice the cloned "body" of the function directly into the source basic block, avoiding a lot of rearrangement and splitBasicBlock's linear scan over the split block. This speeds up the inliner on the testcase in PR209 from 2.3s to 1.7s, a 35% reduction. llvm-svn: 11116	2004-02-04 04:17:06 +00:00
Chris Lattner	18ef3fda57	More refactoring. Move alloca instructions and handle invoke instructions before we delete the original call site, allowing slight simplifications of code, but nothing exciting. llvm-svn: 11109	2004-02-04 02:51:48 +00:00
Chris Lattner	9fc977eac4	Move the cloning of the function body much earlier in the inlinefunction process. The only optimization we did so far is to avoid creating a PHI node, then immediately destroying it in the common case where the callee has one return statement. Instead, we just don't create the return value. This has no noticable performance impact, but paves the way for future improvements. llvm-svn: 11108	2004-02-04 01:41:09 +00:00
Chris Lattner	a6578ef318	Give CloneBasicBlock an optional function argument to specify which function to add the cloned block to. This allows the block to be added to the function immediately, and all of the instructions to be immediately added to the function symbol table, which speeds up the inliner from 3.7 -> 3.38s on the PR209. llvm-svn: 11107	2004-02-04 01:19:43 +00:00
Chris Lattner	ae51cae111	Bunch up all locally used allocas by the block they are allocated in, and process them all as a group. This speeds up SRoA/mem2reg from 28.46s to 0.62s on the testcase from PR209. llvm-svn: 11100	2004-02-03 22:34:12 +00:00
Chris Lattner	3784188620	Handle extremely trivial cases extremely efficiently. This speeds up SRoA/mem2reg from 41.2s to 27.5s on the testcase in PR209. llvm-svn: 11099	2004-02-03 22:00:33 +00:00
Chris Lattner	6b052f2154	Clean up #includes llvm-svn: 10799	2004-01-12 19:56:36 +00:00
Chris Lattner	429963742e	Remove use of ConstantExpr::getShift llvm-svn: 10792	2004-01-12 19:10:58 +00:00
Chris Lattner	2853a7ed22	Remove use of ConstantHandling llvm-svn: 10789	2004-01-12 18:35:03 +00:00
Chris Lattner	fc6c859a0c	Move llvm::ConstantFoldInstruction from VMCore to here, next to ConstantFoldTerminator llvm-svn: 10785	2004-01-12 18:25:22 +00:00
Chris Lattner	fafa2ff2d6	Implement Transforms/ScalarRepl/phinodepromote.ll, which is an important case that the C/C++ front-end generates. llvm-svn: 10761	2004-01-12 01:18:32 +00:00
Chris Lattner	df3c342a4c	Finegrainify namespacification llvm-svn: 10727	2004-01-09 06:12:26 +00:00
Chris Lattner	04efa4b155	Add new function llvm-svn: 10529	2003-12-19 05:56:28 +00:00
Chris Lattner	a29600046d	Minor cleanups and simplifications llvm-svn: 10127	2003-11-21 16:52:05 +00:00
Chris Lattner	2af517281d	Start using the nicer terminator auto-insertion API llvm-svn: 10111	2003-11-20 18:25:24 +00:00
Chris Lattner	63a0ccff44	Spew symbolic types! llvm-svn: 10110	2003-11-20 18:23:14 +00:00
Brian Gaeke	960707c335	Put all LLVM code into the llvm namespace, as per bug 109. llvm-svn: 9903	2003-11-11 22:41:34 +00:00
Chris Lattner	1e6d3053f2	Reorganize code for locality, improve comments llvm-svn: 9857	2003-11-10 04:42:42 +00:00
Chris Lattner	4474336166	Adjust to new critical edge interface llvm-svn: 9853	2003-11-10 04:10:50 +00:00
Chris Lattner	38cd27e450	Various cleanups and efficiency improvements llvm-svn: 9753	2003-11-06 19:46:29 +00:00
Chris Lattner	8055fb3afa	Yet more fixes for constant expr shifts llvm-svn: 9739	2003-11-05 20:43:58 +00:00
Chris Lattner	ba55bd37fe	Further fixes for PR93 llvm-svn: 9738	2003-11-05 20:37:01 +00:00
John Criswell	81587e798a	Checking in Chris's suggestions: Added assert() to ensure symbol table is well formed. Added code to remember the value that was found; resolving types can change the symbol table and invalidate the value of the iterator. Added comments to the ResolveTypes() function (mainly for my own benefit). Please feel free to correct the comments if they are not accurate. llvm-svn: 9693	2003-11-04 15:22:26 +00:00
Chris Lattner	b727fb2663	Fix test: Linker/2003-10-27-LinkOncePromote.ll Fix PR58 llvm-svn: 9530	2003-10-27 16:39:39 +00:00
Chris Lattner	d9f4ffdf5e	Get the list of PHI node values before the basic block is split. Also, add PHI node entries for unwind instructions just like for call instructions which became invokes! This fixes PR57, tested by Inline/2003-10-26-InlineInvokeExceptionDestPhi.ll llvm-svn: 9526	2003-10-27 05:33:09 +00:00
Chris Lattner	4f2581f828	Fix bug: Linker/2003-10-21-ConflictingTypesTolerance.ll llvm-svn: 9357	2003-10-21 22:46:38 +00:00
Chris Lattner	9bc22b7439	Fix message to make more sense and confuse Chris less llvm-svn: 9354	2003-10-21 21:52:20 +00:00
John Criswell	29265fe981	Added LLVM copyright header. llvm-svn: 9321	2003-10-21 15:17:13 +00:00
John Criswell	4436c49787	Added LLVM copyright notice to Makefiles. llvm-svn: 9312	2003-10-20 22:26:57 +00:00
John Criswell	482202a601	Added LLVM project notice to the top of every C++ source file. Header files will be on the way. llvm-svn: 9298	2003-10-20 19:43:21 +00:00
Chris Lattner	b32f5748b7	Fix PR#50 llvm-svn: 9227	2003-10-18 06:14:59 +00:00
Chris Lattner	068ad84038	Add support for 'weak' linkage. llvm-svn: 9171	2003-10-16 18:29:00 +00:00
Chris Lattner	f77a856f3b	Cleanup llvm-svn: 9133	2003-10-15 16:42:21 +00:00
Chris Lattner	b4778c73c9	Do not move variable sized allocations to the top of the caller, which might break dominance relationships, and is otherwise bad. This fixes bug: Inline/2003-10-13-AllocaDominanceProblem.ll. This also fixes miscompilation of 3 176.gcc source files (reload1.c, global.c, flow.c) llvm-svn: 9109	2003-10-14 01:11:07 +00:00
Chris Lattner	72272a70b8	Rename loop preheaders pass to loop simplify llvm-svn: 9061	2003-10-12 21:52:28 +00:00
Misha Brukman	8b2bd4ed47	Fix spelling. llvm-svn: 9027	2003-10-10 17:57:28 +00:00
Chris Lattner	6aa34b0d0b	Avoid doing pointless work. Amazingly, this makes us go faster. Running the inliner on 252.eon used to take 48.4763s, now it takes 14.4148s. In release mode, it went from taking 25.8741s to taking 11.5712s. This also fixes a FIXME. llvm-svn: 8890	2003-10-06 15:23:43 +00:00
Chris Lattner	c30f22f57c	This changes the PromoteMemToReg function to create "pruned" SSA form, not "minimal" SSA form (in other words, it doesn't insert dead PHIs). This speeds up the mem2reg pass very significantly because it doesn't have to do a lot of frivolous work in many common cases. In the 252.eon function I have been playing with, this doesn't even insert the 120 PHI nodes that it used to which were trivially dead (in the process of promoting 356 alloca instructions overall). This speeds up the mem2reg pass from 1.2459s to 0.1284s. More significantly, the DCE pass used to take 2.4138s to remove the 120 dead PHI nodes that mem2reg constructed, now it takes 0.0134s (which is the time to scan the function and decide that there is nothing dead). So overall, on this one function, we speed things up a total of 3.5179s, which is a 24.8x speedup! :) This change is tested by the Mem2Reg/2003-10-05-DeadPHIInsertion.ll test, which now passes. llvm-svn: 8884	2003-10-05 22:19:20 +00:00
Chris Lattner	a906bacfdd	Change the interface to PromoteMemToReg to also take a DominatorTree llvm-svn: 8883	2003-10-05 21:20:13 +00:00
Chris Lattner	8047152977	Speed up the mem2reg transform for allocas which are only read/written in a single basic block. This is amazingly common in code generated by the C/C++ front-ends. This change makes it not have to insert ANY phi nodes, whereas before it would insert a ton of dead ones which DCE would have to clean up. Thus, this fix improves compile-time performance of these trivial allocas in two ways: 1. It doesn't have to do the walking and book-keeping for renaming 2. It does not insert dead phi nodes for them which would have to subsequently be cleaned up. On my favorite testcase from 252.eon, this special case handles 305 out of 356 promoted allocas in the function. It speeds up the mem2reg pass from 7.5256s to 1.2505s. It inserts 677 fewer dead PHI nodes, which speeds up a subsequent -dce pass from 18.7524s to 2.4806s. There are still 120 trivially dead PHI nodes being inserted for variables used in multiple basic blocks, but they are not handled by this patch. llvm-svn: 8881	2003-10-05 20:54:03 +00:00
Chris Lattner	a5721d3d03	The first PHI node may be null, scan for the first non-null one llvm-svn: 8865	2003-10-05 05:34:39 +00:00
Chris Lattner	203bc011e5	The VersionNumbers vector is only used during PHI placement. Turn it into an argument, allowing us to get rid of the vector. llvm-svn: 8864	2003-10-05 04:33:22 +00:00
Chris Lattner	7d9692df22	* Update file header comment *** Revamp the code which handled unreachable code in the function. Now the code is much more efficient for high-degree basic blocks, such as those that occur in the 252.eon SPEC benchmark. For the interested, the time to promote a SINGLE alloca in _ZN7mrScene4ReadERSi function used to be > 3.5s. Now it is < .075s. The function has a LOT of allocas in it, so it appeared to be infinite looping, this should make it much nicer. :) llvm-svn: 8863	2003-10-05 04:26:39 +00:00
Chris Lattner	db1f81bcb5	Simplify the loop a bit llvm-svn: 8862	2003-10-05 03:45:44 +00:00
Chris Lattner	2093012a03	There is no need for separate WriteSets and PhiNodeBlocks lists. It is just a work-list of value definitions. This allows elimination of the explicit 'iterative' step of the algorithm, and also reuses temporary memory better. llvm-svn: 8861	2003-10-05 03:39:10 +00:00
Chris Lattner	9408b1e8b9	The PhiNodes 2D vector is only used during PHI node placement. It doesn't need to be an instance variable! llvm-svn: 8860	2003-10-05 03:26:25 +00:00
Chris Lattner	417bc173ee	* Document instance vars better * Fuse two parallel loops * Use a more specific type for AllocaLookup llvm-svn: 8859	2003-10-05 03:16:07 +00:00
Chris Lattner	c360f135be	Two small cleanups/speedups: * Do not insert a new entry into NewPhiNodes during the rename pass if there are no PHIs in a block. * Do not compute WriteSets in parallel llvm-svn: 8858	2003-10-05 02:37:36 +00:00
Chris Lattner	c3d57b5923	* Minor cleanups * Eliminate the KillList instance variable, instead, just delete loads and stores as they are "renamed", and delete allocas when they are done * Make the 'visited' set an instance variable to avoid passing it on the stack. llvm-svn: 8857	2003-10-05 01:52:53 +00:00
Chris Lattner	50ac360a34	Fix bugs in the last change llvm-svn: 8667	2003-09-22 23:30:59 +00:00
Chris Lattner	d6002b8e15	Fix bug: Inline/2003-09-22-PHINodesInNormalInvokeDest.ll llvm-svn: 8666	2003-09-22 22:35:39 +00:00
Chris Lattner	0178d262e6	Fix bug: Inline/2003-09-22-PHINodesInExceptionDest.ll ... by making sure to update PHI nodes to take into consideration the extra edges we get if we inline a call instruction through an invoke. llvm-svn: 8664	2003-09-22 21:59:27 +00:00
Chris Lattner	4413e4362c	Minor cleanups, no functional changes Rename Function::getEntryNode -> getEntryBlock llvm-svn: 8623	2003-09-20 14:36:23 +00:00
Chris Lattner	7f89ea75e1	Fix bug: Inline/2003-09-14-InlineValue.ll llvm-svn: 8514	2003-09-15 02:10:16 +00:00
Chris Lattner	f941396cce	Expand the pass to unify all of the unwind blocks as well llvm-svn: 8456	2003-09-10 20:34:51 +00:00
Chris Lattner	04ecefe232	Eliminate support for the llvm.unwind intrinisic, using the Unwind instruction instead llvm-svn: 8411	2003-09-08 19:44:26 +00:00
Chris Lattner	caa12cfa75	Should invokify is always true llvm-svn: 8409	2003-09-08 19:00:30 +00:00
Chris Lattner	66cc0141b0	Fix bug where we considered function types equivalent even if they had differing numbers of arguments llvm-svn: 8178	2003-08-28 16:42:50 +00:00

... 4 5 6 7 8 ...

724 Commits