llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	2d9c117e21	Reorder for minor efficiency gain llvm-svn: 9285	2003-10-20 05:54:26 +00:00
Chris Lattner	b94550e537	Change the Opcode enum for PHI nodes from "Instruction::PHINode" to "Instruction::PHI" to be more consistent with the other instructions. llvm-svn: 9269	2003-10-19 21:34:28 +00:00
Chris Lattner	b32f5748b7	Fix PR#50 llvm-svn: 9227	2003-10-18 06:14:59 +00:00
Chris Lattner	f0fc9be634	ADd support for the new varargs instructions llvm-svn: 9225	2003-10-18 05:56:52 +00:00
Chris Lattner	1facf89eaf	Do not crash on empty structures llvm-svn: 9195	2003-10-17 18:03:54 +00:00
Chris Lattner	068ad84038	Add support for 'weak' linkage. llvm-svn: 9171	2003-10-16 18:29:00 +00:00
Chris Lattner	50b6858e2e	This code does not require random access use_lists llvm-svn: 9156	2003-10-16 16:49:12 +00:00
Chris Lattner	5dbb244edb	Eliminate using declaration Rewrite code to work with use_lists what are either random access or bidirectional llvm-svn: 9155	2003-10-16 16:48:53 +00:00
Chris Lattner	f95d9b99b3	Decrease usage of use_size() llvm-svn: 9135	2003-10-15 16:48:29 +00:00
Chris Lattner	f77a856f3b	Cleanup llvm-svn: 9133	2003-10-15 16:42:21 +00:00
Chris Lattner	b4778c73c9	Do not move variable sized allocations to the top of the caller, which might break dominance relationships, and is otherwise bad. This fixes bug: Inline/2003-10-13-AllocaDominanceProblem.ll. This also fixes miscompilation of 3 176.gcc source files (reload1.c, global.c, flow.c) llvm-svn: 9109	2003-10-14 01:11:07 +00:00
Chris Lattner	612cafef0a	Whoops, we inserted into the wrong set. What's up with the dead set anyway? llvm-svn: 9094	2003-10-13 16:49:21 +00:00
Chris Lattner	951e7329e8	Use external df iterators to avoid revisiting blocks in functions with multiple setjmp calls. llvm-svn: 9093	2003-10-13 16:44:50 +00:00
Chris Lattner	178957028b	Wrap code at 80 columns llvm-svn: 9073	2003-10-13 05:04:27 +00:00
Chris Lattner	44d2c3514a	Regularize header file comments llvm-svn: 9071	2003-10-13 03:32:08 +00:00
Chris Lattner	7cce14bfbf	Regularize header file comment, eliminate using's llvm-svn: 9069	2003-10-13 03:30:47 +00:00
Chris Lattner	f7a60cd9fa	Minor cleanups llvm-svn: 9067	2003-10-13 01:02:33 +00:00
Chris Lattner	56b8526083	Checkin an improvement contributed by Bill: Only transform call sites in a setjmp'ing function which are reachable from the setjmp. If the call dominates the setjmp (for example), the called function cannot longjmp to the setjmp. This dramatically reduces the number of invoke instructions created in some large testcases. llvm-svn: 9066	2003-10-13 00:57:16 +00:00
Chris Lattner	c4622a6955	Add support to the loop canonicalization pass to make it transform loops to have a SINGLE backedge. This is useful to, for example, the -indvars pass. This implements testcase LoopSimplify/single-backedge.ll and closes PR#34 llvm-svn: 9065	2003-10-13 00:37:13 +00:00
Chris Lattner	72272a70b8	Rename loop preheaders pass to loop simplify llvm-svn: 9061	2003-10-12 21:52:28 +00:00
Chris Lattner	55d4788397	File is renamed to LoopSimplify.cpp llvm-svn: 9059	2003-10-12 21:44:18 +00:00
Chris Lattner	154e4d5dea	First step in renaming the preheaders pass to loopsimplify llvm-svn: 9058	2003-10-12 21:43:28 +00:00
Chris Lattner	9703c02ce4	The preheader insertion pass only depends on the CFG. Mark it as such, which allows GCCAS to only run it once. llvm-svn: 9056	2003-10-12 19:33:10 +00:00
Brian Gaeke	b8a4ed6543	Include <cstdio> instead of <stdio.h>. llvm-svn: 9032	2003-10-10 18:46:52 +00:00
Brian Gaeke	cc31fddf13	Don't include Config/stdio.h or <stdio.h>. llvm-svn: 9031	2003-10-10 18:46:29 +00:00
Misha Brukman	8b2bd4ed47	Fix spelling. llvm-svn: 9027	2003-10-10 17:57:28 +00:00
Misha Brukman	b3acb4027e	Fixing the spelling of this filename. llvm-svn: 9009	2003-10-10 16:57:31 +00:00
Chris Lattner	35e56e7372	Update comment llvm-svn: 8965	2003-10-08 16:56:11 +00:00
Chris Lattner	0bbbe5d4c8	Use a set to keep track of which edges have been noticed as executable already to avoid reprocessing PHI nodes needlessly. This speeds up the big bad PHI testcase 43%: from 104.9826 to 73.5157s llvm-svn: 8964	2003-10-08 16:55:34 +00:00
Chris Lattner	7324f7cd03	Minor fixes here and there llvm-svn: 8963	2003-10-08 16:21:03 +00:00
Chris Lattner	71ac22ffb5	Avoid building data structures we don't really need. This improves the runtime of a test that Bill Wendling sent me from 228.5s to 105s. Obviously there is more improvement to be had, but this is a nice speedup which should be "felt" by many programs. llvm-svn: 8962	2003-10-08 15:47:41 +00:00
Chris Lattner	950fc785ae	whoops, don't accidentally lose variable names llvm-svn: 8955	2003-10-07 22:58:41 +00:00
Chris Lattner	75b4d1deec	Fix bug: InstCombine/cast.ll:test11 / PR#7 llvm-svn: 8954	2003-10-07 22:54:13 +00:00
Chris Lattner	aec3d948cf	Refactor code a bit llvm-svn: 8952	2003-10-07 22:32:43 +00:00
Chris Lattner	f8492537eb	Fix bugzilla bug #5 llvm-svn: 8930	2003-10-07 19:33:31 +00:00
Chris Lattner	ed922162e1	Bill contributed this major rewrite of the -lowerswitch pass to make it generate logarithmic conditional branch sequences instead of linear sequences. Thanks Bill! llvm-svn: 8928	2003-10-07 18:46:23 +00:00
Chris Lattner	800aaaf207	Fix bug in previous checkin llvm-svn: 8922	2003-10-07 15:17:02 +00:00
Chris Lattner	e8ed4ef039	Minor speedups for the instcombine pass llvm-svn: 8894	2003-10-06 17:11:01 +00:00
Chris Lattner	6dc0ae2d18	Speed up the predicate used to decide when to inline by caching the size of callees between executions. On eon, in release mode, this changes the inliner from taking 11.5712s to taking 2.2066s. In debug mode, it went from taking 14.4148s to taking 7.0745s. In release mode, this is a 24.7% speedup of gccas, in debug mode, it's a total speedup of 11.7%. This also makes it slightly more aggressive. This could be because we are not judging the size of the functions quite as accurately as before. When we start looking at the performance of the generated code, this can be investigated further. llvm-svn: 8893	2003-10-06 15:52:43 +00:00
Chris Lattner	6aa34b0d0b	Avoid doing pointless work. Amazingly, this makes us go faster. Running the inliner on 252.eon used to take 48.4763s, now it takes 14.4148s. In release mode, it went from taking 25.8741s to taking 11.5712s. This also fixes a FIXME. llvm-svn: 8890	2003-10-06 15:23:43 +00:00
Chris Lattner	c30f22f57c	This changes the PromoteMemToReg function to create "pruned" SSA form, not "minimal" SSA form (in other words, it doesn't insert dead PHIs). This speeds up the mem2reg pass very significantly because it doesn't have to do a lot of frivolous work in many common cases. In the 252.eon function I have been playing with, this doesn't even insert the 120 PHI nodes that it used to which were trivially dead (in the process of promoting 356 alloca instructions overall). This speeds up the mem2reg pass from 1.2459s to 0.1284s. More significantly, the DCE pass used to take 2.4138s to remove the 120 dead PHI nodes that mem2reg constructed, now it takes 0.0134s (which is the time to scan the function and decide that there is nothing dead). So overall, on this one function, we speed things up a total of 3.5179s, which is a 24.8x speedup! :) This change is tested by the Mem2Reg/2003-10-05-DeadPHIInsertion.ll test, which now passes. llvm-svn: 8884	2003-10-05 22:19:20 +00:00
Chris Lattner	a906bacfdd	Change the interface to PromoteMemToReg to also take a DominatorTree llvm-svn: 8883	2003-10-05 21:20:13 +00:00
Chris Lattner	8047152977	Speed up the mem2reg transform for allocas which are only read/written in a single basic block. This is amazingly common in code generated by the C/C++ front-ends. This change makes it not have to insert ANY phi nodes, whereas before it would insert a ton of dead ones which DCE would have to clean up. Thus, this fix improves compile-time performance of these trivial allocas in two ways: 1. It doesn't have to do the walking and book-keeping for renaming 2. It does not insert dead phi nodes for them which would have to subsequently be cleaned up. On my favorite testcase from 252.eon, this special case handles 305 out of 356 promoted allocas in the function. It speeds up the mem2reg pass from 7.5256s to 1.2505s. It inserts 677 fewer dead PHI nodes, which speeds up a subsequent -dce pass from 18.7524s to 2.4806s. There are still 120 trivially dead PHI nodes being inserted for variables used in multiple basic blocks, but they are not handled by this patch. llvm-svn: 8881	2003-10-05 20:54:03 +00:00
Chris Lattner	a43b8f4b2f	Initial checkin of the LLVM->LLVM transform to support code generators which do not support stack unwinding yet llvm-svn: 8869	2003-10-05 19:14:42 +00:00
Chris Lattner	5ed281d7d7	simplify-cfg is really a function pass llvm-svn: 8868	2003-10-05 19:14:16 +00:00
Chris Lattner	a5721d3d03	The first PHI node may be null, scan for the first non-null one llvm-svn: 8865	2003-10-05 05:34:39 +00:00
Chris Lattner	203bc011e5	The VersionNumbers vector is only used during PHI placement. Turn it into an argument, allowing us to get rid of the vector. llvm-svn: 8864	2003-10-05 04:33:22 +00:00
Chris Lattner	7d9692df22	* Update file header comment *** Revamp the code which handled unreachable code in the function. Now the code is much more efficient for high-degree basic blocks, such as those that occur in the 252.eon SPEC benchmark. For the interested, the time to promote a SINGLE alloca in _ZN7mrScene4ReadERSi function used to be > 3.5s. Now it is < .075s. The function has a LOT of allocas in it, so it appeared to be infinite looping, this should make it much nicer. :) llvm-svn: 8863	2003-10-05 04:26:39 +00:00
Chris Lattner	db1f81bcb5	Simplify the loop a bit llvm-svn: 8862	2003-10-05 03:45:44 +00:00
Chris Lattner	2093012a03	There is no need for separate WriteSets and PhiNodeBlocks lists. It is just a work-list of value definitions. This allows elimination of the explicit 'iterative' step of the algorithm, and also reuses temporary memory better. llvm-svn: 8861	2003-10-05 03:39:10 +00:00

1 2 3 4 5 ...

1017 Commits