llvm-project

Commit Graph

Author	SHA1	Message	Date
Chandler Carruth	dcf5dacb2c	Don't leave pointers uninitialized in the default constructor. GCC complains about the potential use of these uninitialized members under certain conditions. llvm-svn: 91239	2009-12-13 07:04:45 +00:00
Bob Wilson	895f364ae6	Revise scalar replacement to be more flexible about handling bitcasts and GEPs. While scanning through the uses of an alloca, keep track of the current offset relative to the start of the alloca, and check memory references to see if the offset & size correspond to a component within the alloca. This has the nice benefit of unifying much of the code from isSafeUseOfAllocation, isSafeElementUse, and isSafeUseOfBitCastedAllocation. The code to rewrite the uses of a promoted alloca, after it is determined to be safe, is reorganized in the same way. Also, when rewriting GEP instructions, mark them as "in-bounds" since all the indices are known to be safe. llvm-svn: 91184	2009-12-11 23:47:40 +00:00
Eric Christopher	22889c049d	Make sure the immediate dominator isn't NULL through iterations of the loop. We could get to this condition via indirect branches. llvm-svn: 91009	2009-12-10 00:25:41 +00:00
Chris Lattner	9ccc879006	Fix PR5744, a case where we were getting the pointer size instead of the value size. This only manifested when memdep imprecisely returns clobber, which is due to a caching issue in the PR5744 testcase. We can 'efficiently emulate' this by using '-no-aa'. llvm-svn: 91004	2009-12-10 00:11:45 +00:00
Chris Lattner	3ddf804f78	allow this to build when the #if 0's are enabled. No functionality change. llvm-svn: 90999	2009-12-10 00:04:46 +00:00
Dan Gohman	72c367fb52	Dereference loopHeader after checking for null rather than before. llvm-svn: 90990	2009-12-09 22:55:01 +00:00
Chris Lattner	ca5f9cb18b	fix the last remaining known (by me) phi translation bug. When we reanalyze clobbers to forward pieces of large stores to small loads, we need to consider the properly phi translated pointer in the store block. llvm-svn: 90978	2009-12-09 18:21:46 +00:00
Chris Lattner	f8ba1253f1	change GetStoreValueForLoad to use IRBuilder, which is cleaner and implicitly constant folds. llvm-svn: 90977	2009-12-09 18:13:28 +00:00
Bob Wilson	1c5a6fb299	Fix a comment. llvm-svn: 90975	2009-12-09 18:05:27 +00:00
Chris Lattner	07df9efb35	change AnalyzeLoadFromClobberingMemInst/AnalyzeLoadFromClobberingStore to require the load ty/ptr to be passed in, no functionality change. llvm-svn: 90960	2009-12-09 07:37:07 +00:00
Chris Lattner	0def861ee9	change AnalyzeLoadFromClobberingWrite and clients to pass in type and pointer instead of the load. No functionality change. llvm-svn: 90959	2009-12-09 07:34:10 +00:00
Chris Lattner	0c31547168	change NonLocalDepEntry from being a typedef for an std::pair to be its own small class. No functionality change. llvm-svn: 90956	2009-12-09 07:08:01 +00:00
Chris Lattner	946b58dd90	add some aborts to #if 0's. llvm-svn: 90929	2009-12-09 02:41:54 +00:00
Chris Lattner	972e6d8d00	Switch GVN and memdep to use PHITransAddr, which correctly handles phi translation of complex expressions like &A[i+1]. This has the following benefits: 1. The phi translation logic is all contained in its own class with a strong interface and verification that it is self consistent. 2. The logic is more correct than before. Previously, if intermediate expressions got PHI translated, we'd miss the update and scan for the wrong pointers in predecessor blocks. @phi_trans2 is a testcase for this. 3. We have a lot less code in memdep. We can handle phi translation across blocks of things like @phi_trans3, which is pretty insane :). This patch should fix the miscompiles of 255.vortex, and I tested it with a bootstrap of llvm-gcc, llvm-test and dejagnu of course. llvm-svn: 90926	2009-12-09 01:59:31 +00:00
Bob Wilson	c5d082fd5d	Some superficial cleanups. llvm-svn: 90866	2009-12-08 18:27:03 +00:00
Bob Wilson	2029ea04f9	Clean up dead operands left around after SROA replaces a mem intrinsic. I'm not aware that this does anything significant on its own, but it's needed for another patch that I'm working on. llvm-svn: 90864	2009-12-08 18:22:03 +00:00
Duncan Sands	6a3df7b0c7	Teach GlobalOpt to delete aliases with internal linkage (after forwarding any uses). GlobalDCE can also do this, but is only run at -O3. llvm-svn: 90850	2009-12-08 10:10:20 +00:00
Nick Lewycky	8bca014d7f	Remove unnecessary #include "llvm/LLVMContext.h". llvm-svn: 90836	2009-12-08 05:45:41 +00:00
Chris Lattner	6d6f10fe91	fix PR5698 llvm-svn: 90708	2009-12-06 17:17:23 +00:00
Chris Lattner	778cb92235	constant fold loads from memcpy's from global constants. This is important because clang lowers nontrivial automatic struct/array inits to memcpy from a global array. llvm-svn: 90698	2009-12-06 05:29:56 +00:00
Chris Lattner	93236ba327	add support for forwarding mem intrinsic values to non-local loads. llvm-svn: 90697	2009-12-06 04:54:31 +00:00
Chris Lattner	42376066eb	Handle forwarding local memsets to loads. For example, we optimize this: short x(short *A) { memset(A, 1, sizeof(*A)*100); return A[42]; } to 'return 257' instead of doing the load. llvm-svn: 90695	2009-12-06 01:57:02 +00:00
Nick Lewycky	a0e9d700dc	Generalize this optimization to work on equality comparisons between any two integers that are constant except for a single bit (the same n-th bit in each). llvm-svn: 90646	2009-12-05 05:00:00 +00:00
Bob Wilson	050b812fe7	Fix up some comments. llvm-svn: 90603	2009-12-04 21:57:37 +00:00
Bob Wilson	5ca37b274c	Fix 80-column violations. llvm-svn: 90601	2009-12-04 21:51:35 +00:00
Chris Lattner	2bd9609992	add an assert to make it really clear what this is doing. Return singularval as a compile time perf optimization to avoid a load. llvm-svn: 90507	2009-12-04 01:03:32 +00:00
Bob Wilson	53bdae3802	Fix a comment typo. llvm-svn: 90487	2009-12-03 21:47:07 +00:00
Owen Anderson	0b6e260066	Fix this crasher, and add a FIXME for a missed optimization. llvm-svn: 90408	2009-12-03 03:43:29 +00:00
Chris Lattner	a48f44d9ee	improve portability to avoid conflicting with std::next in C++0x. Patch by Howard Hinnant! llvm-svn: 90365	2009-12-03 00:50:42 +00:00
Jim Grosbach	d831ef4945	Move EliminateDuplicatePHINodes() from SimplifyCFG.cpp to Local.cpp llvm-svn: 90324	2009-12-02 17:06:45 +00:00
Andreas Neustifter	3d207290fe	Cheap, mostly strict, stable sorting. This is necessary for tests so the results are comparable. llvm-svn: 90320	2009-12-02 15:57:15 +00:00
Owen Anderson	b9878ee6b6	Cleanup/remove some parts of the lifetime region handling code in memdep and GVN, per Chris' comments. Adjust testcases to match. llvm-svn: 90304	2009-12-02 07:35:19 +00:00
Chris Lattner	c468025ac9	factor some code better. llvm-svn: 90299	2009-12-02 06:44:58 +00:00
Chris Lattner	2764b4dc55	formatting cleanups. llvm-svn: 90298	2009-12-02 06:35:55 +00:00
Chris Lattner	eea42c7b51	tidy up, remove dependence on order of evaluation of function args from EmitMemCpy. llvm-svn: 90297	2009-12-02 06:05:42 +00:00
Chris Lattner	3c9aca9079	fix PR5640 by tracking whether a block is the header of a loop more precisely, which prevents us from infinitely peeling the loop. llvm-svn: 90211	2009-12-01 06:04:43 +00:00
Benjamin Kramer	3efc050ac4	Revert r90089 for now, it's breaking selfhost. llvm-svn: 90097	2009-11-29 21:17:48 +00:00
Benjamin Kramer	bfa993ab20	Fix two FIXMEs. llvm-svn: 90089	2009-11-29 20:29:30 +00:00
Chris Lattner	1cc4cca193	add testcases for the foo_with_overflow op xforms added recently and fix bugs exposed by the tests. Testcases from Alastair Lynn! llvm-svn: 90056	2009-11-29 02:57:29 +00:00
Chris Lattner	cd261c9c26	Implement PR5634. llvm-svn: 90046	2009-11-29 00:51:17 +00:00
Chris Lattner	32140312ca	reenable load address insertion in load pre. This allows us to handle cases like this: void test(int N, double* G) { long j; for (j = 1; j < N - 1; j++) G[j+1] = G[j] + G[j+1]; } where G[1] isn't live into the loop. llvm-svn: 90041	2009-11-28 16:08:18 +00:00
Chris Lattner	44da5bd837	Enhance InsertPHITranslatedPointer to be able to return a list of newly inserted instructions. No functionality change until someone starts using it. llvm-svn: 90039	2009-11-28 15:39:14 +00:00
Chris Lattner	cf0b198827	disable value insertion for now, I need to figure out how to inform GVN about the newly inserted values. This fixes PR5631. llvm-svn: 90022	2009-11-27 22:50:07 +00:00
Chris Lattner	2be52e72ae	Rework InsertPHITranslatedPointer to handle the recursive case, this fixes PR5630 and sets the stage for the next phase of goodness (testcase pending). llvm-svn: 90019	2009-11-27 22:05:15 +00:00
Chris Lattner	3d9823b9cf	factor some logic out of instcombine into a new SimplifyAddInst method. llvm-svn: 90011	2009-11-27 17:42:22 +00:00
Chris Lattner	2226db66ab	fix PR5436 by making the 'simple' case of SRoA not promote out of range array indexes. The "complex" case of SRoA still handles them, and correctly. This fixes a weirdness where we'd correctly avoid transforming A[0][42] if the 42 was too large, but we'd only do it if it was one gep, not two separate ones. llvm-svn: 90007	2009-11-27 16:37:41 +00:00
Chris Lattner	25be93dfed	teach GVN's load PRE to insert computations of the address in predecessors where it is not available. It's unclear how to get this inserted computation into GVN's scalar availability sets, Owen, help? :) llvm-svn: 89997	2009-11-27 08:25:10 +00:00
Chris Lattner	a9a76ccf56	Fix phi translation in load PRE to agree with the phi translation done by memdep, and reenable gep translation again. llvm-svn: 89992	2009-11-27 06:31:14 +00:00
Chris Lattner	8574aba4ea	factor some instcombine simplifications for getelementptr out to a new SimplifyGEPInst method in InstructionSimplify.h. No functionality change. llvm-svn: 89980	2009-11-27 00:29:05 +00:00
Chris Lattner	a5bc618a91	fix crash on Transforms/InstCombine/intrinsics.ll introduced by r89970 llvm-svn: 89972	2009-11-26 22:08:06 +00:00
Chris Lattner	a73ecf0b00	Fix PR5471 by removing an instcombine xform. Some pieces of the code generate a store to undef and some generate a store to null as the idiom for undefined behavior. Since simplifycfg zaps both, don't remove the undefined behavior in instcombine. llvm-svn: 89971	2009-11-26 22:04:42 +00:00
Chris Lattner	5b83ba215d	implement a bunch of xforms for overflow intrinsics, based on a patch by Alastair Lynn. llvm-svn: 89970	2009-11-26 21:42:47 +00:00
Edward O'Callaghan	2b8fed15e0	Reverting patch in revision 89758, initial attempt at fixing PR5373 has proven to be bogus. llvm-svn: 89844	2009-11-25 05:38:41 +00:00
Edward O'Callaghan	5fd452d596	Fix for PR5373, Credit to Jakub Staszak. llvm-svn: 89758	2009-11-24 11:51:52 +00:00
Dan Gohman	580b80d6d9	Make ConstantFoldConstantExpression recursively visit the entire ConstantExpr, not just the top-level operator. This allows it to fold many more constants. Also, make GlobalOpt call ConstantFoldConstantExpression on GlobalVariable initializers. llvm-svn: 89659	2009-11-23 16:22:21 +00:00
Dan Gohman	1f522d98f8	Fix a use of an invalidated iterator in the case where there are multiple adjacent uses of a dead basic block from the same user. This fixes PR5596. llvm-svn: 89658	2009-11-23 16:13:39 +00:00
Nick Lewycky	15a1287c1f	Pull LLVMContext out of PromoteMemToReg. llvm-svn: 89645	2009-11-23 03:50:44 +00:00
Nick Lewycky	621fe5614e	Remove LLVMContext and its include. llvm-svn: 89644	2009-11-23 03:34:29 +00:00
Nick Lewycky	39dbfd3c58	Remove unused LLVMContext. llvm-svn: 89642	2009-11-23 03:29:18 +00:00
Nick Lewycky	922d4ab574	Reapply r88830 with a bugfix: this transform only applies to icmp eq/ne. This fixes part of PR5438. llvm-svn: 89639	2009-11-23 03:17:33 +00:00
Eric Christopher	0c7bd96de2	Add more optimizations for object size checking, enable handling of object size intrinsic and verify return type is correct. Collect various code in one place. llvm-svn: 89523	2009-11-21 01:01:30 +00:00
Dan Gohman	fbffe63528	Make Loop::getLoopLatch() work on loops which don't have preheaders, as it may be used in contexts where preheader insertion may have failed due to an indirectbr. Make LoopSimplify's LoopSimplify::SeparateNestedLoop properly fail in the case that it would require splitting an indirectbr edge. These fix PR5502. llvm-svn: 89484	2009-11-20 20:51:18 +00:00
Dan Gohman	d15302afa0	Fix IPSCCP's code for deleting dead blocks to tolerate outstanding blockaddress users. This fixes PR5569. llvm-svn: 89483	2009-11-20 20:19:14 +00:00
Daniel Dunbar	f87c75706f	Revert "Add some rough optimizations for checking routines.", it buildeth not. llvm-svn: 89482	2009-11-20 20:17:30 +00:00
Eric Christopher	cf97d01dff	Add some rough optimizations for checking routines. llvm-svn: 89479	2009-11-20 19:57:37 +00:00
Duncan Sands	9e26aac773	Fix PR5563, an expensive checks failure when running on tests/Transforms/InstCombine/shufflemask-undef.ll. If anyone cares, the use of 2*e here (and the equivalent all over the place in instcombine) seems wrong, though harmless: it should really be twice the length of the input vector. I think shufflevector used to require that the mask have the same length as the input, but I don't think that's true any more. I don't care enough about vectors to do anything about this... llvm-svn: 89456	2009-11-20 13:19:51 +00:00
Dan Gohman	94e617627d	Extend CaptureTracking to indicate when a value is never stored, even if it is not ultimately captured. Teach BasicAliasAnalysis that a local object address which does not escape and is never stored does not alias with a value resulting from a load. llvm-svn: 89398	2009-11-19 21:57:48 +00:00
Dan Gohman	cbc6ebb6fd	Enable hoisting of loads from constant memory by default. In cases where they are lowered to instruction sequences more complex than a simple load, such that CodeGen cannot rematerialize them, a reload from a spill slot is likely to be cheaper than the complex sequence. llvm-svn: 89374	2009-11-19 19:00:10 +00:00
Jim Grosbach	dcef55b2ef	Eliminate duplicate phi nodes in loops. Loop rotation, for example, can introduce these, and it's beneficial to later passes to clean them up. llvm-svn: 89298	2009-11-19 02:03:18 +00:00
Jim Grosbach	cc69a1ba9a	Make EliminateDuplicatePHINodes() available as a utility function llvm-svn: 89297	2009-11-19 02:02:10 +00:00
Jim Grosbach	6bf5305f5d	grammar llvm-svn: 89145	2009-11-17 21:37:04 +00:00
Jim Grosbach	e4e018ae67	80-column violations llvm-svn: 89123	2009-11-17 19:05:35 +00:00
Evan Cheng	ba4e5da727	Generalize OptimizeLoopTermCond to optimize more loop terminating icmp to use postinc iv. llvm-svn: 89116	2009-11-17 18:10:11 +00:00
Jim Grosbach	60f4854c76	Remove trailing whitespace llvm-svn: 89110	2009-11-17 17:53:56 +00:00
Devang Patel	12144a2348	Remove debug info attached with an instruction. llvm-svn: 89016	2009-11-17 00:47:06 +00:00
David Greene	a3ce7828b2	Fix an expensive-checks error. The Mask and LHSMask may not be of the same size, so don't do the transformation if they're different. llvm-svn: 88972	2009-11-16 21:52:23 +00:00
Duncan Sands	e5de4a9ad6	CreateIntCast takes an "isSigned" parameter. Pass "true" for it, rather than a name. llvm-svn: 88908	2009-11-16 12:32:28 +00:00
Chris Lattner	9d9812a636	make PRE of loads preserve the alignment of the moved load instruction. llvm-svn: 88865	2009-11-15 19:58:31 +00:00
Chris Lattner	5f037b6439	fix a bug handling 'not x' when x is undef. llvm-svn: 88864	2009-11-15 19:57:43 +00:00
Nick Lewycky	95148689c9	Revert r88830 and r88831 which appear to have caused a selfhost buildbot some grief. I suspect this patch merely exposed a bug elsewhere. llvm-svn: 88841	2009-11-15 07:47:32 +00:00
Nick Lewycky	e29fa4c7a1	Teach instcombine to look for booleans in wider integers when it encounters a zext(icmp). It may be able to optimize that away. This fixes one of the cases in PR5438. llvm-svn: 88830	2009-11-15 05:55:17 +00:00
Nick Lewycky	7935bcb0fe	Remove LLVMContext from reassociate. It was threaded through every function but ultimately never used. llvm-svn: 88763	2009-11-14 07:25:54 +00:00
Dan Gohman	81132465d3	Add an option for running GVN with redundant load processing disabled. llvm-svn: 88742	2009-11-14 02:27:51 +00:00
Owen Anderson	e96b2111b1	Re-enable this code, since redundant PHIs are now being better nuked. llvm-svn: 87042	2009-11-12 23:22:41 +00:00
Chris Lattner	5c89f4b4ef	use isInstructionTriviallyDead, as pointed out by Duncan llvm-svn: 87035	2009-11-12 21:58:18 +00:00
Chris Lattner	eb9acbfb05	implement a nice little efficiency hack in the inliner. Since we're now running IPSCCP early, and we run functionattrs interlaced with the inliner, we often (particularly for small or noop functions) completely propagate all of the information about a call to its call site in IPSCCP (making a call dead) and functionattrs is smart enough to realize that the function is readonly (because it is interlaced with inliner). To improve compile time and make the inliner threshold more accurate, realize that we don't have to inline dead readonly function calls. Instead, just delete the call. This happens all the time for C++ codes, here are some counters from opt/llvm-ld counting the number of times calls were deleted vs inlined on various apps: Tramp3d opt: 5033 inline - Number of call sites deleted, not inlined 24596 inline - Number of functions inlined llvm-ld: 667 inline - Number of functions deleted because all callers found 699 inline - Number of functions inlined 483.xalancbmk opt: 8096 inline - Number of call sites deleted, not inlined 62528 inline - Number of functions inlined llvm-ld: 217 inline - Number of allocas merged together 2158 inline - Number of functions inlined 471.omnetpp: 331 inline - Number of call sites deleted, not inlined 8981 inline - Number of functions inlined llvm-ld: 171 inline - Number of functions deleted because all callers found 629 inline - Number of functions inlined Deleting a call is much faster than inlining it, and is insensitive to the size of the callee. :) llvm-svn: 86975	2009-11-12 07:56:08 +00:00
Evan Cheng	85a9f430e9	- Teach LSR to avoid changing cmp iv stride if it will create an immediate that cannot be folded into target cmp instruction. - Avoid a phase ordering issue where early cmp optimization would prevent the later count-to-zero optimization. - Add missing checks which could cause LSR to reuse stride that does not have users. - Fix a bug in count-to-zero optimization code which failed to find the pre-inc iv's phi node. - Remove, tighten, loosen some incorrect checks that disable valid transformations. - Quite a bit of code clean up. llvm-svn: 86969	2009-11-12 07:35:05 +00:00
Chris Lattner	5f6b8b2bcb	use getPredicateOnEdge to fold comparisons through PHI nodes, which implements GCC PR18046. This also gets us 360 more jump threads on 176.gcc. llvm-svn: 86953	2009-11-12 05:24:05 +00:00
Chris Lattner	22db4b5e0c	various fixes to the lattice transfer functions. llvm-svn: 86952	2009-11-12 04:57:13 +00:00
Chris Lattner	c893c4ed10	switch jump threading to use getPredicateOnEdge in one place making the new LVI stuff smart enough to subsume some special cases in the old code. Disable them when LVI is around, the testcase still passes. llvm-svn: 86951	2009-11-12 04:37:50 +00:00
Daniel Dunbar	11881e2283	Add the braces gcc suggested. llvm-svn: 86933	2009-11-12 02:52:56 +00:00
Chris Lattner	ba45616958	with the new code we can thread non-instruction values. This allows us to handle the test10 testcase. llvm-svn: 86924	2009-11-12 01:41:34 +00:00
Chris Lattner	3f80d85191	this argument can be an arbitrary value, it doesn't need to be an instruction. llvm-svn: 86923	2009-11-12 01:37:43 +00:00
Chris Lattner	d5e25436a1	expose edge information and switch j-t to use it. llvm-svn: 86920	2009-11-12 01:29:10 +00:00
Chris Lattner	67146695b6	pass TD into a SimplifyCmpInst call. Add another case that uses LVI info when -enable-jump-threading-lvi is passed. llvm-svn: 86886	2009-11-11 22:31:38 +00:00
Duncan Sands	ba61fed5d3	Don't trivially delete unused calls to llvm.invariant.start. This allows llvm.invariant.start to be used without necessarily being paired with a call to llvm.invariant.end. If you run the entire optimization pipeline then such calls are in fact deleted (adce does it), but that's actually a good thing since we probably do want them to be zapped late in the game. There should really be an integration test that checks that the llvm.invariant.start call lasts long enough that all passes that do interesting things with it get to do their stuff before it is deleted. But since no passes do anything interesting with it yet this will have to wait for later. llvm-svn: 86840	2009-11-11 15:34:13 +00:00
Chris Lattner	852f2653c4	remove the now dead condprop pass, PR3906. llvm-svn: 86810	2009-11-11 05:56:35 +00:00
Chris Lattner	fde1f8d0d8	stub out some LazyValueInfo interfaces, and have JumpThreading start using them in a trivial way when -enable-jump-threading-lvi is passed. enable-jump-threading-lvi will be my playground for awhile. llvm-svn: 86789	2009-11-11 02:08:33 +00:00
Chris Lattner	3a2ae908fe	add a fixme llvm-svn: 86766	2009-11-11 00:21:58 +00:00
Evan Cheng	12f146d8f7	Block terminator may be a switch. llvm-svn: 86761	2009-11-11 00:00:21 +00:00
Devang Patel	f6eeaebd76	Implement support to debug inlined functions. llvm-svn: 86748	2009-11-10 23:06:00 +00:00
Chris Lattner	9518fbb54e	implement a TODO by teaching jump threading about "xor x, 1". llvm-svn: 86739	2009-11-10 22:39:16 +00:00
Chris Lattner	852d6d64ff	move some generally useful functions out of jump threading into libanalysis and transformutils. llvm-svn: 86735	2009-11-10 22:26:15 +00:00
Chris Lattner	02e2cee7dc	fix a crash in SCCP handling extractvalue of an array, pointed out and tracked down by Stephan Reiter! llvm-svn: 86726	2009-11-10 22:02:09 +00:00
Chris Lattner	40b15f220d	improve comment. llvm-svn: 86723	2009-11-10 21:45:09 +00:00
Chris Lattner	80e7e5a429	Make jump threading eliminate blocks that just contain phi nodes, debug intrinsics, and an unconditional branch when possible. This reuses the TryToSimplifyUncondBranchFromEmptyBlock function split out of simplifycfg. llvm-svn: 86722	2009-11-10 21:40:01 +00:00
Evan Cheng	87fe40b32d	Generalize LSR code that optimizes loops to count down towards zero. llvm-svn: 86715	2009-11-10 21:14:05 +00:00
Duncan Sands	23344095de	Add defensive break. llvm-svn: 86705	2009-11-10 19:36:40 +00:00
Duncan Sands	8d4cde2b55	Fix obvious typo. llvm-svn: 86694	2009-11-10 18:21:37 +00:00
Chris Lattner	b8f79ba10e	clarify logic. llvm-svn: 86689	2009-11-10 17:00:47 +00:00
Duncan Sands	1925d3a1d1	Teach DSE to eliminate useless trampolines. llvm-svn: 86683	2009-11-10 13:49:50 +00:00
Duncan Sands	04e0c95248	Add brackets to make gcc-4.4 happy. llvm-svn: 86681	2009-11-10 09:32:10 +00:00
Victor Hernandez	fcc77b1c02	Update computeArraySize() to use ComputeMultiple() to determine the array size associated with a malloc; also extend PerformHeapAllocSRoA() to check if the optimized malloc's arg had its highest bit set, so that it is safe for ComputeMultiple() to look through sext instructions while determining the optimized malloc's array size llvm-svn: 86676	2009-11-10 08:32:25 +00:00
Chris Lattner	1559bedcc7	unify the code that determines whether it is a good idea to change the type of a computation. This fixes some infinite loops when dealing with TD that has no native types. llvm-svn: 86670	2009-11-10 07:23:37 +00:00
Nick Lewycky	5b3def9b86	Simplify. llvm-svn: 86668	2009-11-10 07:00:43 +00:00
Nick Lewycky	9027147fb1	Reapply r86359, "Teach dead store elimination that certain intrinsics write to memory just like a store" with bug fixed (partial-overwrite.ll is the regression test). llvm-svn: 86667	2009-11-10 06:46:40 +00:00
Chris Lattner	cbd18fc93d	refactor TryToSimplifyUncondBranchFromEmptyBlock out of SimplifyCFG. llvm-svn: 86666	2009-11-10 05:59:26 +00:00
Oscar Fuentes	bbc1067001	CMake: Support for building llvm loadable modules. llvm-svn: 86656	2009-11-10 02:45:37 +00:00
Chris Lattner	38c44ea6b0	make jump threading recursively simplify expressions instead of doing it just one level deep. On the testcase we go from getting this: F1: ; preds = %T2 %F = and i1 true, %cond ; <i1> [#uses=1] br i1 %F, label %X, label %Y to a fully threaded: F1: ; preds = %T2 br label %Y This change gets us to the point where we're forming (too many) switch instructions on doug's strswitch testcase. llvm-svn: 86646	2009-11-10 01:57:31 +00:00
Chris Lattner	be11db6894	don't invalidate PN, rewrite of this code is in progress anyway. llvm-svn: 86639	2009-11-10 01:19:06 +00:00
Chris Lattner	fb7f87d5a3	add a new SimplifyInstruction API, which is like ConstantFoldInstruction, except that the result may not be a constant. Switch jump threading to use it so that it gets things like (X & 0) -> 0, which occur when phi preds are deleted and the remaining phi pred was a zero. llvm-svn: 86637	2009-11-10 01:08:51 +00:00
Jeffrey Yasskin	b40d3f76a0	Fix DenseMap iterator constness. This patch forbids implicit conversion of DenseMap::const_iterator to DenseMap::iterator which was possible because DenseMapIterator inherited (publicly) from DenseMapConstIterator. Conversion the other way around is now allowed as one may expect. The template DenseMapConstIterator is removed and the template parameter IsConst which specifies whether the iterator is constant is added to DenseMapIterator. Actually IsConst parameter is not necessary since the constness can be determined from KeyT but this is not relevant to the fix and can be addressed later. Patch by Victor Zverovich! llvm-svn: 86636	2009-11-10 01:02:17 +00:00
Chris Lattner	a71e9d61be	factor simplification logic for AND and OR out to InstSimplify from instcombine. llvm-svn: 86635	2009-11-10 00:55:12 +00:00
Chris Lattner	ccfdceb22c	pull a bunch of logic out of instcombine into instsimplify for compare simplification, this handles the foldable fcmp x,x cases among many others. llvm-svn: 86627	2009-11-09 23:55:12 +00:00
Chris Lattner	beadc6e8c7	inline a simple function. llvm-svn: 86625	2009-11-09 23:31:49 +00:00
Chris Lattner	c1f19071f8	rename SimplifyCompare -> SimplifyCmpInst and split it into Simplify[IF]Cmp pieces. Add some predicates to CmpInst to determine whether a predicate is fp or int. llvm-svn: 86624	2009-11-09 23:28:39 +00:00
Chris Lattner	cdfb80de16	fix ConstantFoldCompareInstOperands to take the LHS/RHS as individual operands instead of taking a temporary array llvm-svn: 86619	2009-11-09 23:06:58 +00:00
Chris Lattner	800aad3dda	use instructionsimplify instead of a weak clone of ad-hoc folding stuff. llvm-svn: 86616	2009-11-09 23:00:14 +00:00
Chris Lattner	2978ca7b79	stub out a new form of BasicBlock::RemovePredecessorAndSimplify which simplifies instruction users of PHIs when the phi is eliminated. This will be moved to transforms/utils after some other refactoring. llvm-svn: 86603	2009-11-09 22:32:36 +00:00
Dan Gohman	f324dd65f8	Fix a typo in a comment that Duncan noticed. llvm-svn: 86575	2009-11-09 18:59:22 +00:00
Dan Gohman	c146c78060	Generalize LCSSA to handle loops with exits with predecessors outside the loop. This is needed because with indirectbr it may not be possible for LoopSimplify to guarantee that all loop exit predecessors are inside the loop. This fixes PR5437. LCSSA no longer actually requires LoopSimplify form, but for now it must still have the dependency because the PassManager doesn't know how to schedule LoopSimplify otherwise. llvm-svn: 86569	2009-11-09 18:28:24 +00:00
Chris Lattner	39c07b2eef	if a 'with overflow' intrinsic just has the normal result used, simplify it to a normal binop. Patch by Alastair Lynn, testcase by me. llvm-svn: 86524	2009-11-09 07:07:56 +00:00
Chris Lattner	feeabde753	fix PR5104: when printing a single character, return the result of putchar in case there is an error. llvm-svn: 86515	2009-11-09 04:57:04 +00:00
Chris Lattner	0685be3441	enhance PHI slicing to handle the case when a sliceable PHI is being used by a chain of other PHIs. llvm-svn: 86503	2009-11-09 01:38:00 +00:00
Owen Anderson	939ea35244	Small cleanups. llvm-svn: 86499	2009-11-09 00:48:15 +00:00
Owen Anderson	73fc616838	Revert my previous patch to ABCD and fix things the right way. There are two problems addressed here: 1) We need to avoid processing sigma nodes as phi nodes for constraint generation. 2) We need to generate constraints for comparisons against constants properly. This includes our first working ABCD test! llvm-svn: 86498	2009-11-09 00:44:44 +00:00
Chris Lattner	ea465e221e	comment typos pointed out by Duncan llvm-svn: 86497	2009-11-09 00:41:49 +00:00
Owen Anderson	058088f219	Fix an issue where the ordering of blocks within a function could lead to different constraint graphs being produced. The cause was that we were incorrectly marking sigma instructions as processed after handling the sigma-specific constraints for them, potentially neglecting to process them as normal instructions as well. Unfortunately, the testcase that inspired this still doesn't work because of a bug in the solver, which is next on the list to debug. llvm-svn: 86486	2009-11-08 22:36:55 +00:00
Chris Lattner	2299d4b6d8	Teach an instcombine to not pull trunc instructions through PHI nodes when both the source and dest are illegal types, since it would cause the phi to grow (for example, we shouldn't transform test14b's phi to a phi on i320). This fixes an infinite loop on i686 bootstrap with phi slicing turned on, so turn it back on. llvm-svn: 86483	2009-11-08 21:20:06 +00:00
Chris Lattner	a837e4db6b	reapply r8644[3-5] with only the scary part (SliceUpIllegalIntegerPHI) disabled. llvm-svn: 86480	2009-11-08 19:23:30 +00:00
Daniel Dunbar	4c41373c56	Speculatively revert r8644[3-5], they seem to be leading to infinite loops in llvm-gcc bootstrap. llvm-svn: 86478	2009-11-08 17:52:47 +00:00
Chris Lattner	c7a450b5b2	teach a couple of instcombine transformations involving PHIs to not turn a PHI in a legal type into a PHI of an illegal type, and add a new optimization that breaks up insane integer PHI nodes into small pieces (PR3451). llvm-svn: 86443	2009-11-08 08:21:13 +00:00
Nick Lewycky	b9397262b7	Improve tail call elimination to handle the switch statement. llvm-svn: 86403	2009-11-07 21:10:15 +00:00
Chris Lattner	c77d24b792	make instcombine only rewrite a chain of computation (eliminating some extends) if the new type of the computation is legal or if both the source and dest are illegal. This prevents instcombine from changing big chains of computation into i64 on 32-bit targets for example. llvm-svn: 86398	2009-11-07 19:11:46 +00:00
Chris Lattner	431000da21	Revert r86359, it is breaking the self host on the llvm-gcc-i386-darwin9 build bot. llvm-svn: 86391	2009-11-07 17:59:32 +00:00
Nick Lewycky	b6a3dd48f4	Teach dead store elimination that certain intrinsics write to memory just like a store. llvm-svn: 86359	2009-11-07 08:34:40 +00:00
Chris Lattner	5ff7f5672e	reapply 86289, 86278, 86270, 86267, 86266 & 86264 plus a fix (making pred factoring only happen if threading is guaranteed to be successful). This now survives an X86-64 bootstrap of llvm-gcc. llvm-svn: 86355	2009-11-07 08:05:03 +00:00
Nick Lewycky	9b669b3c4f	Oops, FunctionContainsEscapingAllocas is really used to mean two different things. Back out part of r86349 for a moment. llvm-svn: 86353	2009-11-07 07:42:38 +00:00
Nick Lewycky	5091272fdf	Dust off tail recursion elimination. Fix a fixme by applying CaptureTracking and add a .ll to demo the new capability. llvm-svn: 86349	2009-11-07 07:10:01 +00:00
Devang Patel	3a42e7ac65	Revert following patches to fix llvmgcc bootstrap. 86289, 86278, 86270, 86267, 86266 & 86264 Chris, please take a look. llvm-svn: 86321	2009-11-07 01:32:59 +00:00
Victor Hernandez	bde558c536	- new SROA mallocs should have the mallocs running-or'ed, not the malloc's bitcast - fix ProcessInternalGlobal() debug output llvm-svn: 86317	2009-11-07 00:41:19 +00:00
Jeffrey Yasskin	8f77e948e5	Avoid "ambiguous 'else'" warning from gcc. llvm-svn: 86314	2009-11-07 00:26:47 +00:00
Victor Hernandez	f3db915294	Re-commit r86077 now that r86290 fixes the 179.art and 175.vpr ARM regressions. Here is the original commit message: This commit updates malloc optimizations to operate on malloc calls that have constant int size arguments. Update CreateMalloc so that its callers specify the size to allocate: MallocInst-autoupgrade users use non-TargetData-computed allocation sizes. Optimization uses use TargetData to compute the allocation size. Now that malloc calls can have constant sizes, update isArrayMallocHelper() to use TargetData to determine the size of the malloced type and the size of malloced arrays. Extend getMallocType() to support malloc calls that have non-bitcast uses. Update OptimizeGlobalAddressOfMalloc() to optimize malloc calls that have non-bitcast uses. The bitcast use of a malloc call has to be treated specially here because the uses of the bitcast need to be replaced and the bitcast needs to be erased (just like the malloc call) for OptimizeGlobalAddressOfMalloc() to work correctly. Update PerformHeapAllocSRoA() to optimize malloc calls that have non-bitcast uses. The bitcast use of the malloc is not handled specially here because ReplaceUsesOfMallocWithGlobal replaces through the bitcast use. Update OptimizeOnceStoredGlobal() to not care about the malloc calls' bitcast use. Update all globalopt malloc tests to not rely on autoupgraded-MallocInsts, but instead use explicit malloc calls with correct allocation sizes. llvm-svn: 86311	2009-11-07 00:16:28 +00:00
Chris Lattner	eb690feaef	Fix a bug where we'd call SplitBlockPredecessors with a pred in the set only once even if it has multiple edges to BB. llvm-svn: 86299	2009-11-06 23:19:58 +00:00
Eli Friedman	a70917b2f4	Remove function left over from other jump threading cleanup. llvm-svn: 86289	2009-11-06 21:24:57 +00:00
Chris Lattner	a8b9ce3f07	Fix a problem discovered on self host. llvm-svn: 86278	2009-11-06 19:21:48 +00:00
Chris Lattner	d91a7960bf	remove more code subsumed by r86264 llvm-svn: 86270	2009-11-06 18:24:32 +00:00
Chris Lattner	899ef22acb	eliminate some more code subsumed by r86264 llvm-svn: 86267	2009-11-06 18:22:54 +00:00
Chris Lattner	2f6184f6aa	remove now redundant code, r86264 handles this case. llvm-svn: 86266	2009-11-06 18:20:58 +00:00
Chris Lattner	68d2417e05	Extend jump threading to support much more general threading predicates. This allows us to jump thread things like: _ZN12StringSwitchI5ColorE4CaseILj7EEERS1_RAT__KcRKS0_.exit119: %tmp1.i24166 = phi i8 [ 1, %bb5.i117 ], [ %tmp1.i24165, %_Z....exit ], [ %tmp1.i24165, %bb4.i114 ] %toBoolnot.i87 = icmp eq i8 %tmp1.i24166, 0 ; <i1> [#uses=1] %tmp4.i90 = icmp eq i32 %tmp2.i, 6 ; <i1> [#uses=1] %or.cond173 = and i1 %toBoolnot.i87, %tmp4.i90 ; <i1> [#uses=1] br i1 %or.cond173, label %bb4.i96, label %_ZN12... Where it is "obvious" that when coming from %bb5.i117 the 'and' is always false. This triggers a surprisingly high number of times in the testsuite, and gets us closer to generating good code for doug's strswitch testcase. This also makes a bunch of other code in jump threading redundant, which I'll rip out in the next patch. This survived an enable-checking llvm-gcc bootstrap. llvm-svn: 86264	2009-11-06 18:15:14 +00:00
Chris Lattner	8c12bb8cd7	remove some more Context arguments. llvm-svn: 86235	2009-11-06 05:59:53 +00:00
Chris Lattner	46b5c642b9	remove a bunch of extraneous LLVMContext arguments from various APIs, addressing PR5325. llvm-svn: 86231	2009-11-06 04:27:31 +00:00
Victor Hernandez	b9f5899779	Revert r86077 because it caused crashes in 179.art and 175.vpr on ARM llvm-svn: 86213	2009-11-06 01:33:24 +00:00
Dan Gohman	a1bf0c0acc	Teach LSR to avoid calling SplitCriticalEdge on edges with indirectbr. llvm-svn: 86193	2009-11-05 23:34:59 +00:00
Dan Gohman	928068a886	Avoid calling getUniqueExitBlocks from within LoopSimplify, as it depends on loops having dedicated exits, which LoopSimplify can no longer always guarantee. llvm-svn: 86181	2009-11-05 21:48:32 +00:00
Dan Gohman	dca7ac335b	LoopDeletion depends on loops having dedicated exits. llvm-svn: 86180	2009-11-05 21:47:04 +00:00
Dan Gohman	1ef784db67	The introduction of indirectbr meant the introduction of unsplittable critical edges, which means the introduction of loops which cannot be transformed to LoopSimplify form. Fix LoopSimplify to avoid transforming such loops into invalid code. llvm-svn: 86176	2009-11-05 21:14:46 +00:00
Dan Gohman	a83ac2d9e7	Update various Loop optimization passes to cope with the possibility that LoopSimplify form may not be available. llvm-svn: 86175	2009-11-05 21:11:53 +00:00
Dan Gohman	415c64ea3f	Teach LoopUnroll how to bail if LoopSimplify can't give it what it needs. llvm-svn: 86164	2009-11-05 19:44:06 +00:00
Dan Gohman	d9fa1c9c1e	Call getAnalysis<LoopInfo> the normal way, instead of asking passed-in LoopPassManager for it. llvm-svn: 86163	2009-11-05 19:43:25 +00:00
Dan Gohman	885c46e387	Delete an unused member variable. llvm-svn: 86160	2009-11-05 19:33:15 +00:00
Dan Gohman	00c793822e	Add an assertion to catch indirectbr in SplitBlockPredecessors. This makes several optimization passes abort in cases where they're currently silently miscompiling code. Remove the indirectbr assertion from SplitEdge. Indirectbr is only a problem for critical edges, and SplitEdge defers to SplitCriticalEdge to handle those, and SplitCriticalEdge has its own assertion for indirectbr. llvm-svn: 86147	2009-11-05 18:25:44 +00:00
Benjamin Kramer	b971445ab7	Teach SimplifyLibCalls to fold memcmp calls with constant arguments. llvm-svn: 86141	2009-11-05 17:44:22 +00:00
Benjamin Kramer	3fcbb82151	Do map insert+find in one step. TODO -= 2. llvm-svn: 86133	2009-11-05 14:33:27 +00:00
Victor Hernandez	492ed30a32	Update CreateMalloc so that its callers specify the size to allocate: MallocInst-autoupgrade users use non-TargetData-computed allocation sizes. Optimization uses use TargetData to compute the allocation size. Now that malloc calls can have constant sizes, update isArrayMallocHelper() to use TargetData to determine the size of the malloced type and the size of malloced arrays. Extend getMallocType() to support malloc calls that have non-bitcast uses. Update OptimizeGlobalAddressOfMalloc() to optimize malloc calls that have non-bitcast uses. The bitcast use of a malloc call has to be treated specially here because the uses of the bitcast need to be replaced and the bitcast needs to be erased (just like the malloc call) for OptimizeGlobalAddressOfMalloc() to work correctly. Update PerformHeapAllocSRoA() to optimize malloc calls that have non-bitcast uses. The bitcast use of the malloc is not handled specially here because ReplaceUsesOfMallocWithGlobal replaces through the bitcast use. Update OptimizeOnceStoredGlobal() to not care about the malloc calls' bitcast use. Update all globalopt malloc tests to not rely on autoupgraded-MallocInsts, but instead use explicit malloc calls with correct allocation sizes. llvm-svn: 86077	2009-11-05 00:03:03 +00:00
Chris Lattner	a09062758b	improve DSE when TargetData is not around, based on work by Hans Wennborg! llvm-svn: 86067	2009-11-04 23:20:12 +00:00
Chris Lattner	762b56fa8c	Fix an iterator invalidation bug that happens when a hashtable resizes in IPSCCP. This fixes PR5394. llvm-svn: 86036	2009-11-04 18:57:42 +00:00
Chris Lattner	cb3c64ee3c	move two functions up higher in the file. Delete a useless argument to EmitGEPOffset. Implement some new transforms for optimizing subtracts of two pointer to ints into the same vector. This happens for C++ iterator idioms for example, stringmap takes a const char* that points to the start and end of a string. Once inlined, we want the pointer difference to turn back into a length. This is rdar://7362831. llvm-svn: 86021	2009-11-04 08:05:20 +00:00
Chris Lattner	156b8c7109	reimplement multiple return value handling in IPSCCP, making it more aggressive and correct. This survives building llvm in 64-bit mode with optimizations and the built llvm passes make check. llvm-svn: 85973	2009-11-03 23:40:48 +00:00
Chris Lattner	2c427233d4	finish half thunk thought llvm-svn: 85937	2009-11-03 20:52:57 +00:00
Chris Lattner	cde8de519d	fix an IPSCCP bug I introduced when I changed IPSCCP to start working on functions that don't have local linkage. Basically, we need to be more careful about propagating argument information to functions whose results we aren't tracking. This fixes a miscompilation of LLVMCConfigurationEmitter.cpp when built with an llvm-gcc that has ipsccp enabled. llvm-svn: 85923	2009-11-03 19:24:51 +00:00
Chris Lattner	e1d5cd9f48	fix a subtle bug I introduced when refactoring SCCP. Testcase to follow. llvm-svn: 85903	2009-11-03 16:50:11 +00:00
Benjamin Kramer	5573971453	Eliminate some temporaries. llvm-svn: 85896	2009-11-03 12:52:50 +00:00
Chris Lattner	5a3832496a	remove an isFreeCall check: it is a callinst that can write to memory already. llvm-svn: 85863	2009-11-03 05:33:46 +00:00
Ted Kremenek	2124f0d43f	Alphabetize. llvm-svn: 85859	2009-11-03 04:01:53 +00:00
Chris Lattner	fb14181b18	turn IPSCCP back on now that the iterator invalidation bug is fixed. llvm-svn: 85858	2009-11-03 03:42:51 +00:00
Chris Lattner	b70ef3c8c7	fix a nasty iterator invalidation bug from my conversion from std::map to DenseMap, exposed on release llvm-gcc bootstrap. llvm-svn: 85840	2009-11-02 23:25:39 +00:00
Chris Lattner	a15cc59dcb	revert r8579[56], which are causing unhappiness in buildbot land. llvm-svn: 85818	2009-11-02 19:31:10 +00:00
Chris Lattner	a3d794ebbb	disable IPSCCP support for multiple return values, it is buggy, so just disable it until I can fix it. llvm-svn: 85810	2009-11-02 18:22:51 +00:00
Chris Lattner	9d49f0c858	improve IPSCCP to be able to propagate the result of "!mayBeOverridden" function to calls of that function, regardless of whether it has local linkage or has its address taken. Not escaping should only affect whether we make an aggressive assumption about the arguments to a function, not whether we can track the result of it. llvm-svn: 85795	2009-11-02 07:33:59 +00:00
Chris Lattner	47837c5182	don't mark the arguments of prototype overdefined, they will never be queried. llvm-svn: 85793	2009-11-02 06:34:04 +00:00
Chris Lattner	5503328332	restore some code I removed in r85788, refactor it into a shared place instead of duplicating it 4 times. llvm-svn: 85792	2009-11-02 06:28:16 +00:00
Chris Lattner	4910b656b2	remove some confused code that dates from when we had "multiple return values" but not "first class aggregates" llvm-svn: 85791	2009-11-02 06:17:06 +00:00
Chris Lattner	809aee2f40	avoid redundant lookups in BBExecutable, and make it a SmallPtrSet. llvm-svn: 85790	2009-11-02 06:11:23 +00:00
Chris Lattner	e77c9aa04a	Use the libanalysis 'ConstantFoldLoadFromConstPtr' function instead of reinventing SCCP-specific logic. This gives us new powers. llvm-svn: 85789	2009-11-02 06:06:14 +00:00
Chris Lattner	f548403989	switch the main 'ValueState' map from being an std::map to being a DenseMap. Doing this required being aware of subtle iterator invalidation issues, but it provides a big speedup. In a release-asserts build, this sped up optimizing 403.gcc from 1.34s -> 0.79s (IPSCCP) and 1.11s -> 0.44s (SCCP). This commit also conflates in a bunch of general cleanups, sorry. llvm-svn: 85788	2009-11-02 05:55:40 +00:00
Chris Lattner	4e849162ef	fix a bug exposed by moving SRoA earlier which caused a crash building kc++ llvm-svn: 85786	2009-11-02 04:37:17 +00:00
Chris Lattner	e82b087ae6	only IPSCCP incoming arguments if the function is executable, this fixes an assertion on the buildbot. llvm-svn: 85784	2009-11-02 03:25:55 +00:00
Chris Lattner	9e97fbe114	add a new ValueState::getConstantInt() helper, use it to simplify some code. llvm-svn: 85783	2009-11-02 03:21:36 +00:00
Chris Lattner	7ccf1a6df6	tidy up some more: remove some extraneous inline specifiers, return harder. llvm-svn: 85780	2009-11-02 03:03:42 +00:00


6197 Commits