llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	1cc4cca193	add testcases for the foo_with_overflow op xforms added recently and fix bugs exposed by the tests. Testcases from Alastair Lynn! llvm-svn: 90056	2009-11-29 02:57:29 +00:00
Chris Lattner	0d39613f65	add PR# llvm-svn: 90049	2009-11-29 01:28:58 +00:00
Chris Lattner	73d45454be	Add a testcase for: void test(int N, double* G) { long j; for (j = 1; j < N - 1; j++) G[j] = G[j] + G[j+1] + G[j-1]; } which we now compile to one load in the loop: LBB1_2: ## %bb movsd 16(%rsi,%rax,8), %xmm2 incq %rdx addsd %xmm2, %xmm1 addsd %xmm1, %xmm0 movapd %xmm2, %xmm1 movsd %xmm0, 8(%rsi,%rax,8) incq %rax cmpq %rcx, %rax jne LBB1_2 instead of: LBB1_2: ## %bb movsd 8(%rsi,%rax,8), %xmm0 addsd 16(%rsi,%rax,8), %xmm0 addsd (%rsi,%rax,8), %xmm0 movsd %xmm0, 8(%rsi,%rax,8) incq %rax cmpq %rcx, %rax jne LBB1_2 llvm-svn: 90048	2009-11-29 01:15:43 +00:00
Chris Lattner	a73adac52e	add a testcase for void test9(int N, double* G) { long j; for (j = 1; j < N - 1; j++) G[j+1] = G[j] + G[j+1]; } llvm-svn: 90047	2009-11-29 01:04:40 +00:00
Chris Lattner	cd261c9c26	Implement PR5634. llvm-svn: 90046	2009-11-29 00:51:17 +00:00
Nick Lewycky	218a3393f4	Teach memdep to look for memory use intrinsics during dependency queries. Fixes PR5574. llvm-svn: 90045	2009-11-28 21:27:49 +00:00
Chris Lattner	32140312ca	reenable load address insertion in load pre. This allows us to handle cases like this: void test(int N, double* G) { long j; for (j = 1; j < N - 1; j++) G[j+1] = G[j] + G[j+1]; } where G[1] isn't live into the loop. llvm-svn: 90041	2009-11-28 16:08:18 +00:00
Chris Lattner	c7bc66dfc6	implement a FIXME: limit the depth that DecomposeGEPExpression goes the same way that getUnderlyingObject does it. This fixes the 'DecomposeGEPExpression and getUnderlyingObject disagree!' assertion on sqlite3. llvm-svn: 90038	2009-11-28 15:12:41 +00:00
Chris Lattner	cf0b198827	disable value insertion for now, I need to figure out how to inform GVN about the newly inserted values. This fixes PR5631. llvm-svn: 90022	2009-11-27 22:50:07 +00:00
Chris Lattner	d141f885a1	I accidentally implemented this :) llvm-svn: 90014	2009-11-27 19:56:00 +00:00
Chris Lattner	2f0354ecf0	add support for recursive phi translation and phi translation of add with immediate. This allows us to optimize this function: void test(int N, double* G) { long j; G[1] = 1; for (j = 1; j < N - 1; j++) G[j+1] = G[j] + G[j+1]; } to only do one load every iteration of the loop. llvm-svn: 90013	2009-11-27 19:11:31 +00:00
Chris Lattner	e66f84e012	add two simple test cases we now optimize (to one load in the loop each) and one we don't (corresponding to the fixme I added yesterday). llvm-svn: 90012	2009-11-27 18:08:30 +00:00
Chris Lattner	2226db66ab	fix PR5436 by making the 'simple' case of SRoA not promote out of range array indexes. The "complex" case of SRoA still handles them, and correctly. This fixes a weirdness where we'd correctly avoid transforming A[0][42] if the 42 was too large, but we'd only do it if it was one gep, not two separate ones. llvm-svn: 90007	2009-11-27 16:37:41 +00:00
Chris Lattner	92ba18e9e4	filecheckize llvm-svn: 90006	2009-11-27 16:31:59 +00:00
Chris Lattner	25be93dfed	teach GVN's load PRE to insert computations of the address in predecessors where it is not available. It's unclear how to get this inserted computation into GVN's scalar availability sets, Owen, help? :) llvm-svn: 89997	2009-11-27 08:25:10 +00:00
Chris Lattner	41a5bba4e0	add some tests for memdep phi translation + PRE. llvm-svn: 89996	2009-11-27 06:42:42 +00:00
Chris Lattner	fa76d23c1d	this test is failing, and is expected to. llvm-svn: 89995	2009-11-27 06:36:28 +00:00
Chris Lattner	4f1552bde7	filecheckize llvm-svn: 89994	2009-11-27 06:33:09 +00:00
Chris Lattner	66426c70e6	rename test. llvm-svn: 89993	2009-11-27 06:31:55 +00:00
Chris Lattner	a9a76ccf56	Fix phi translation in load PRE to agree with the phi translation done by memdep, and reenable gep translation again. llvm-svn: 89992	2009-11-27 06:31:14 +00:00
Chris Lattner	b018bda665	redisable this, my bootstrap worked because it wasn't an optimized build, whoops. llvm-svn: 89991	2009-11-27 05:53:01 +00:00
Chris Lattner	fb8a718fc3	try again. llvm-svn: 89990	2009-11-27 05:19:56 +00:00
Chris Lattner	14444f5c1a	this is causing buildbot failures, disable for now. llvm-svn: 89985	2009-11-27 01:52:22 +00:00
Chris Lattner	5030c6ab21	teach phi translation of GEPs to simplify geps like 'gep x, 0'. This allows us to compile the example from PR5313 into: LBB1_2: ## %bb incl %ecx movb %al, (%rsi) movslq %ecx, %rax movb (%rdi,%rax), %al testb %al, %al jne LBB1_2 instead of: LBB1_2: ## %bb movslq %eax, %rcx incl %eax movb (%rdi,%rcx), %cl movb %cl, (%rsi) movslq %eax, %rcx cmpb $0, (%rdi,%rcx) jne LBB1_2 llvm-svn: 89981	2009-11-27 00:34:38 +00:00
Chris Lattner	4c88e814b8	teach memdep to do trivial PHI translation of GEPs. More to come. llvm-svn: 89979	2009-11-27 00:07:37 +00:00
Chris Lattner	9bd2136ca3	Teach memdep to phi translate bitcasts. This allows us to compile the example in GCC PR16799 to: LBB1_2: ## %bb1 movl %eax, %eax subq %rax, %rdi movq %rdi, (%rcx) movl (%rdi), %eax testl %eax, %eax je LBB1_2 instead of: LBB1_2: ## %bb1 movl (%rdi), %ecx subq %rcx, %rdi movq %rdi, (%rax) cmpl $0, (%rdi) je LBB1_2 llvm-svn: 89978	2009-11-26 23:41:07 +00:00
Chris Lattner	dfaa592de1	convert to filecheck llvm-svn: 89977	2009-11-26 23:32:59 +00:00
Chris Lattner	a73ecf0b00	Fix PR5471 by removing an instcombine xform. Some pieces of the code generate a store to undef and some generate a store to null as the idiom for undefined behavior. Since simplifycfg zaps both, don't remove the undefined behavior in instcombine. llvm-svn: 89971	2009-11-26 22:04:42 +00:00
Edward O'Callaghan	2b8fed15e0	Reverting patch in revision 89758, initial attempt at fixing PR5373 has proven to be bogus. llvm-svn: 89844	2009-11-25 05:38:41 +00:00
Edward O'Callaghan	5fd452d596	Fix for PR5373, Credit to Jakub Staszak. llvm-svn: 89758	2009-11-24 11:51:52 +00:00
Dan Gohman	580b80d6d9	Make ConstantFoldConstantExpression recursively visit the entire ConstantExpr, not just the top-level operator. This allows it to fold many more constants. Also, make GlobalOpt call ConstantFoldConstantExpression on GlobalVariable initializers. llvm-svn: 89659	2009-11-23 16:22:21 +00:00
Dan Gohman	1f522d98f8	Fix a use of an invalidated iterator in the case where there are multiple adjacent uses of a dead basic block from the same user. This fixes PR5596. llvm-svn: 89658	2009-11-23 16:13:39 +00:00
Nick Lewycky	922d4ab574	Reapply r88830 with a bugfix: this transform only applies to icmp eq/ne. This fixes part of PR5438. llvm-svn: 89639	2009-11-23 03:17:33 +00:00
Dan Gohman	fbffe63528	Make Loop::getLoopLatch() work on loops which don't have preheaders, as it may be used in contexts where preheader insertion may have failed due to an indirectbr. Make LoopSimplify's LoopSimplify::SeparateNestedLoop properly fail in the case that it would require splitting an indirectbr edge. These fix PR5502. llvm-svn: 89484	2009-11-20 20:51:18 +00:00
Dan Gohman	d15302afa0	Fix IPSCCP's code for deleting dead blocks to tolerate outstanding blockaddress users. This fixes PR5569. llvm-svn: 89483	2009-11-20 20:19:14 +00:00
Benjamin Kramer	e986c44a9b	Try to work around grep's "Binary file (standard input) matches" complaints seen on ppc buildbot. llvm-svn: 89452	2009-11-20 09:53:25 +00:00
Dan Gohman	62167b9516	Teach getSmallConstantTripMultiple about Shl operators. llvm-svn: 89426	2009-11-20 01:09:34 +00:00
Dan Gohman	94e617627d	Extend CaptureTracking to indicate when a value is never stored, even if it is not ultimately captured. Teach BasicAliasAnalysis that a local object address which does not escape and is never stored does not alias with a value resulting from a load. llvm-svn: 89398	2009-11-19 21:57:48 +00:00
Dan Gohman	cbc6ebb6fd	Enable hoisting of loads from constant memory by default. In cases where they are lowered to instruction sequences more complex than a simple load, such that CodeGen cannot rematerialize them, a reload from a spill slot is likely to be cheaper than the complex sequence. llvm-svn: 89374	2009-11-19 19:00:10 +00:00
Evan Cheng	ba4e5da727	Generalize OptimizeLoopTermCond to optimize more loop terminating icmp to use postinc iv. llvm-svn: 89116	2009-11-17 18:10:11 +00:00
Nick Lewycky	95148689c9	Revert r88830 and r88831 which appear to have caused a selfhost buildbot some grief. I suspect this patch merely exposed a bug elsewhere. llvm-svn: 88841	2009-11-15 07:47:32 +00:00
Nick Lewycky	6a6ac7e105	Correct typo. llvm-svn: 88831	2009-11-15 06:16:57 +00:00
Nick Lewycky	e29fa4c7a1	Teach instcombine to look for booleans in wider integers when it encounters a zext(icmp). It may be able to optimize that away. This fixes one of the cases in PR5438. llvm-svn: 88830	2009-11-15 05:55:17 +00:00
Nick Lewycky	c53e2ecf02	Teach BasicAA that a constant expression can't alias memory provably not allocated until runtime (such as an alloca). Patch by Hans Wennborg! llvm-svn: 88760	2009-11-14 06:15:14 +00:00
Gabor Greif	13431c6cdf	typo llvm-svn: 86980	2009-11-12 09:44:17 +00:00
Chris Lattner	eb9acbfb05	implement a nice little efficiency hack in the inliner. Since we're now running IPSCCP early, and we run functionattrs interlaced with the inliner, we often (particularly for small or noop functions) completely propagate all of the information about a call to its call site in IPSCCP (making a call dead) and functionattrs is smart enough to realize that the function is readonly (because it is interlaced with inliner). To improve compile time and make the inliner threshold more accurate, realize that we don't have to inline dead readonly function calls. Instead, just delete the call. This happens all the time for C++ codes, here are some counters from opt/llvm-ld counting the number of times calls were deleted vs inlined on various apps: Tramp3d opt: 5033 inline - Number of call sites deleted, not inlined 24596 inline - Number of functions inlined llvm-ld: 667 inline - Number of functions deleted because all callers found 699 inline - Number of functions inlined 483.xalancbmk opt: 8096 inline - Number of call sites deleted, not inlined 62528 inline - Number of functions inlined llvm-ld: 217 inline - Number of allocas merged together 2158 inline - Number of functions inlined 471.omnetpp: 331 inline - Number of call sites deleted, not inlined 8981 inline - Number of functions inlined llvm-ld: 171 inline - Number of functions deleted because all callers found 629 inline - Number of functions inlined Deleting a call is much faster than inlining it, and is insensitive to the size of the callee. :) llvm-svn: 86975	2009-11-12 07:56:08 +00:00
Chris Lattner	5f6b8b2bcb	use getPredicateOnEdge to fold comparisons through PHI nodes, which implements GCC PR18046. This also gets us 360 more jump threads on 176.gcc. llvm-svn: 86953	2009-11-12 05:24:05 +00:00
Chris Lattner	380ccbaeaa	should not commit when distracted. llvm-svn: 86929	2009-11-12 02:04:17 +00:00
Chris Lattner	e2a63f2798	We now thread some impossible condition information with LVI. llvm-svn: 86927	2009-11-12 01:55:20 +00:00
Chris Lattner	ba45616958	with the new code we can thread non-instruction values. This allows us to handle the test10 testcase. llvm-svn: 86924	2009-11-12 01:41:34 +00:00
Chris Lattner	b584d1e456	move some stuff into DEBUG's and turn on lazy-value-info for the basic.ll testcase. llvm-svn: 86918	2009-11-12 01:22:16 +00:00
Duncan Sands	ba61fed5d3	Don't trivially delete unused calls to llvm.invariant.start. This allows llvm.invariant.start to be used without necessarily being paired with a call to llvm.invariant.end. If you run the entire optimization pipeline then such calls are in fact deleted (adce does it), but that's actually a good thing since we probably do want them to be zapped late in the game. There should really be an integration test that checks that the llvm.invariant.start call lasts long enough that all passes that do interesting things with it get to do their stuff before it is deleted. But since no passes do anything interesting with it yet this will have to wait for later. llvm-svn: 86840	2009-11-11 15:34:13 +00:00
Chris Lattner	3e308fb0ee	remove condprop testcases. llvm-svn: 86804	2009-11-11 05:25:16 +00:00
Chris Lattner	6e960c8657	oops, didn't mean to commit this, no harm, but add a todo llvm-svn: 86768	2009-11-11 00:27:54 +00:00
Chris Lattner	741c94c719	Stub out a new lazy value info pass, which will eventually vend value constraint information to the optimizer. llvm-svn: 86767	2009-11-11 00:22:30 +00:00
Evan Cheng	12f146d8f7	Block terminator may be a switch. llvm-svn: 86761	2009-11-11 00:00:21 +00:00
Chris Lattner	9518fbb54e	implement a TODO by teaching jump threading about "xor x, 1". llvm-svn: 86739	2009-11-10 22:39:16 +00:00
Chris Lattner	02e2cee7dc	fix a crash in SCCP handling extractvalue of an array, pointed out and tracked down by Stephan Reiter! llvm-svn: 86726	2009-11-10 22:02:09 +00:00
Chris Lattner	80e7e5a429	Make jump threading eliminate blocks that just contain phi nodes, debug intrinsics, and an unconditional branch when possible. This reuses the TryToSimplifyUncondBranchFromEmptyBlock function split out of simplifycfg. llvm-svn: 86722	2009-11-10 21:40:01 +00:00
Evan Cheng	87fe40b32d	Generalize lsr code that optimizes loops to count down towards zero. llvm-svn: 86715	2009-11-10 21:14:05 +00:00
Dan Gohman	1f31f6e265	Optimize test more. llvm-svn: 86714	2009-11-10 21:02:18 +00:00
Duncan Sands	1925d3a1d1	Teach DSE to eliminate useless trampolines. llvm-svn: 86683	2009-11-10 13:49:50 +00:00
Chris Lattner	17529ac0c5	optimize test llvm-svn: 86672	2009-11-10 07:44:36 +00:00
Chris Lattner	1559bedcc7	unify the code that determines whether it is a good idea to change the type of a computation. This fixes some infinite loops when dealing with TD that has no native types. llvm-svn: 86670	2009-11-10 07:23:37 +00:00
Nick Lewycky	9027147fb1	Reapply r86359, "Teach dead store elimination that certain intrinsics write to memory just like a store" with bug fixed (partial-overwrite.ll is the regression test). llvm-svn: 86667	2009-11-10 06:46:40 +00:00
Chris Lattner	38c44ea6b0	make jump threading recursively simplify expressions instead of doing it just one level deep. On the testcase we go from getting this: F1: ; preds = %T2 %F = and i1 true, %cond ; <i1> [#uses=1] br i1 %F, label %X, label %Y to a fully threaded: F1: ; preds = %T2 br label %Y This change gets us to the point where we're forming (too many) switch instructions on doug's strswitch testcase. llvm-svn: 86646	2009-11-10 01:57:31 +00:00
Dan Gohman	0d401124d1	Trim a bunch of unneeded code from this testcase. llvm-svn: 86640	2009-11-10 01:33:08 +00:00
Dan Gohman	ccb4584edd	Default-addressspace null pointers don't alias anything. This allows GVN to be more aggressive. Patch by Hans Wennborg! (with a comment added by me) llvm-svn: 86582	2009-11-09 19:29:11 +00:00
Dan Gohman	c146c78060	Generalize LCSSA to handle loops with exits with predecessors outside the loop. This is needed because with indirectbr it may not be possible for LoopSimplify to guarantee that all loop exit predecessors are inside the loop. This fixes PR5437. LCSSA no longer actually requires LoopSimplify form, but for now it must still have the dependency because the PassManager doesn't know how to schedule LoopSimplify otherwise. llvm-svn: 86569	2009-11-09 18:28:24 +00:00
Chris Lattner	39c07b2eef	if a 'with overflow' intrinsic just has the normal result used, simplify it to a normal binop. Patch by Alastair Lynn, testcase by me. llvm-svn: 86524	2009-11-09 07:07:56 +00:00
Chris Lattner	0685be3441	enhance PHI slicing to handle the case when a slicable PHI is being used by a chain of other PHIs. llvm-svn: 86503	2009-11-09 01:38:00 +00:00
Owen Anderson	73fc616838	Revert my previous patch to ABCD and fix things the right way. There are two problems addressed here: 1) We need to avoid processing sigma nodes as phi nodes for constraint generation. 2) We need to generate constraints for comparisons against constants properly. This includes our first working ABCD test! llvm-svn: 86498	2009-11-09 00:44:44 +00:00
Chris Lattner	2299d4b6d8	Teach an instcombine to not pull trunc instructions through PHI nodes when both the source and dest are illegal types, since it would cause the phi to grow (for example, we shouldn't transform test14b's phi to a phi on i320). This fixes an infinite loop on i686 bootstrap with phi slicing turned on, so turn it back on. llvm-svn: 86483	2009-11-08 21:20:06 +00:00
Chris Lattner	a837e4db6b	reapply r8644[3-5] with only the scary part (SliceUpIllegalIntegerPHI) disabled. llvm-svn: 86480	2009-11-08 19:23:30 +00:00
Daniel Dunbar	4c41373c56	Speculatively revert r8644[3-5], they seem to be leading to infinite loops in llvm-gcc bootstrap. llvm-svn: 86478	2009-11-08 17:52:47 +00:00
Chris Lattner	99db7963b4	another more interesting test. llvm-svn: 86445	2009-11-08 08:36:40 +00:00
Chris Lattner	7c8b29ef61	feature test for the new transformation in r86443 llvm-svn: 86444	2009-11-08 08:30:58 +00:00
Chris Lattner	c7a450b5b2	teach a couple of instcombine transformations involving PHIs to not turn a PHI in a legal type into a PHI of an illegal type, and add a new optimization that breaks up insane integer PHI nodes into small pieces (PR3451). llvm-svn: 86443	2009-11-08 08:21:13 +00:00
Nick Lewycky	b9397262b7	Improve tail call elimination to handle the switch statement. llvm-svn: 86403	2009-11-07 21:10:15 +00:00
Chris Lattner	c77d24b792	make instcombine only rewrite a chain of computation (eliminating some extends) if the new type of the computation is legal or if both the source and dest are illegal. This prevents instcombine from changing big chains of computation into i64 on 32-bit targets for example. llvm-svn: 86398	2009-11-07 19:11:46 +00:00
Chris Lattner	acc83d10bd	remove empty files. llvm-svn: 86392	2009-11-07 18:03:32 +00:00
Chris Lattner	431000da21	Revert r86359, it is breaking the self host on the llvm-gcc-i386-darwin9 build bot. llvm-svn: 86391	2009-11-07 17:59:32 +00:00
Nick Lewycky	b6a3dd48f4	Teach dead store elimination that certain intrinsics write to memory just like a store. llvm-svn: 86359	2009-11-07 08:34:40 +00:00
Chris Lattner	5ff7f5672e	reapply 86289, 86278, 86270, 86267, 86266 & 86264 plus a fix (making pred factoring only happen if threading is guaranteed to be successful). This now survives an X86-64 bootstrap of llvm-gcc. llvm-svn: 86355	2009-11-07 08:05:03 +00:00
Nick Lewycky	9b669b3c4f	Oops, FunctionContainsEscapingAllocas is really used to mean two different things. Back out part of r86349 for a moment. llvm-svn: 86353	2009-11-07 07:42:38 +00:00
Nick Lewycky	5091272fdf	Dust off tail recursion elimination. Fix a fixme by applying CaptureTracking and add a .ll to demo the new capability. llvm-svn: 86349	2009-11-07 07:10:01 +00:00
Devang Patel	3a42e7ac65	Revert following patches to fix llvmgcc bootstrap. 86289, 86278, 86270, 86267, 86266 & 86264 Chris, please take a look. llvm-svn: 86321	2009-11-07 01:32:59 +00:00
Victor Hernandez	f3db915294	Re-commit r86077 now that r86290 fixes the 179.art and 175.vpr ARM regressions. Here is the original commit message: This commit updates malloc optimizations to operate on malloc calls that have constant int size arguments. Update CreateMalloc so that its callers specify the size to allocate: MallocInst-autoupgrade users use non-TargetData-computed allocation sizes. Optimization uses use TargetData to compute the allocation size. Now that malloc calls can have constant sizes, update isArrayMallocHelper() to use TargetData to determine the size of the malloced type and the size of malloced arrays. Extend getMallocType() to support malloc calls that have non-bitcast uses. Update OptimizeGlobalAddressOfMalloc() to optimize malloc calls that have non-bitcast uses. The bitcast use of a malloc call has to be treated specially here because the uses of the bitcast need to be replaced and the bitcast needs to be erased (just like the malloc call) for OptimizeGlobalAddressOfMalloc() to work correctly. Update PerformHeapAllocSRoA() to optimize malloc calls that have non-bitcast uses. The bitcast use of the malloc is not handled specially here because ReplaceUsesOfMallocWithGlobal replaces through the bitcast use. Update OptimizeOnceStoredGlobal() to not care about the malloc calls' bitcast use. Update all globalopt malloc tests to not rely on autoupgraded-MallocInsts, but instead use explicit malloc calls with correct allocation sizes. llvm-svn: 86311	2009-11-07 00:16:28 +00:00
Chris Lattner	a8b9ce3f07	Fix a problem discovered on self host. llvm-svn: 86278	2009-11-06 19:21:48 +00:00
Chris Lattner	68d2417e05	Extend jump threading to support much more general threading predicates. This allows us to jump thread things like: _ZN12StringSwitchI5ColorE4CaseILj7EEERS1_RAT__KcRKS0_.exit119: %tmp1.i24166 = phi i8 [ 1, %bb5.i117 ], [ %tmp1.i24165, %_Z....exit ], [ %tmp1.i24165, %bb4.i114 ] %toBoolnot.i87 = icmp eq i8 %tmp1.i24166, 0 ; <i1> [#uses=1] %tmp4.i90 = icmp eq i32 %tmp2.i, 6 ; <i1> [#uses=1] %or.cond173 = and i1 %toBoolnot.i87, %tmp4.i90 ; <i1> [#uses=1] br i1 %or.cond173, label %bb4.i96, label %_ZN12... Where it is "obvious" that when coming from %bb5.i117 that the 'and' is always false. This triggers a surprisingly high number of times in the testsuite, and gets us closer to generating good code for doug's strswitch testcase. This also makes a bunch of other code in jump threading redundant, I'll rip out in the next patch. This survived an enable-checking llvm-gcc bootstrap. llvm-svn: 86264	2009-11-06 18:15:14 +00:00
Victor Hernandez	b9f5899779	Revert r86077 because it caused crashes in 179.art and 175.vpr on ARM llvm-svn: 86213	2009-11-06 01:33:24 +00:00
Dan Gohman	1ef784db67	The introduction of indirectbr meant the introduction of unsplittable critical edges, which means the introduction of loops which cannot be transformed to LoopSimplify form. Fix LoopSimplify to avoid transforming such loops into invalid code. llvm-svn: 86176	2009-11-05 21:14:46 +00:00
Benjamin Kramer	b971445ab7	Teach SimplifyLibCalls to fold memcmp calls with constant arguments. llvm-svn: 86141	2009-11-05 17:44:22 +00:00
Chris Lattner	046dff7acf	merge a few crash tests into crash.ll llvm-svn: 86119	2009-11-05 05:57:34 +00:00
Victor Hernandez	492ed30a32	Update CreateMalloc so that its callers specify the size to allocate: MallocInst-autoupgrade users use non-TargetData-computed allocation sizes. Optimization uses use TargetData to compute the allocation size. Now that malloc calls can have constant sizes, update isArrayMallocHelper() to use TargetData to determine the size of the malloced type and the size of malloced arrays. Extend getMallocType() to support malloc calls that have non-bitcast uses. Update OptimizeGlobalAddressOfMalloc() to optimize malloc calls that have non-bitcast uses. The bitcast use of a malloc call has to be treated specially here because the uses of the bitcast need to be replaced and the bitcast needs to be erased (just like the malloc call) for OptimizeGlobalAddressOfMalloc() to work correctly. Update PerformHeapAllocSRoA() to optimize malloc calls that have non-bitcast uses. The bitcast use of the malloc is not handled specially here because ReplaceUsesOfMallocWithGlobal replaces through the bitcast use. Update OptimizeOnceStoredGlobal() to not care about the malloc calls' bitcast use. Update all globalopt malloc tests to not rely on autoupgraded-MallocInsts, but instead use explicit malloc calls with correct allocation sizes. llvm-svn: 86077	2009-11-05 00:03:03 +00:00
Chris Lattner	a09062758b	improve DSE when TargetData is not around, based on work by Hans Wennborg! llvm-svn: 86067	2009-11-04 23:20:12 +00:00
Chris Lattner	cb3c64ee3c	move two functions up higher in the file. Delete a useless argument to EmitGEPOffset. Implement some new transforms for optimizing subtracts of two pointer to ints into the same vector. This happens for C++ iterator idioms for example, stringmap takes a const char* that points to the start and end of a string. Once inlined, we want the pointer difference to turn back into a length. This is rdar://7362831. llvm-svn: 86021	2009-11-04 08:05:20 +00:00
Chris Lattner	e3cdf2ed3b	filecheckize this test. llvm-svn: 86020	2009-11-04 07:57:05 +00:00
Chris Lattner	156b8c7109	reimplement multiple return value handling in IPSCCP, making it more aggressive and correct. This survives building llvm in 64-bit mode with optimizations and the built llvm passes make check. llvm-svn: 85973	2009-11-03 23:40:48 +00:00
Chris Lattner	9122fa2d1e	fix test llvm-svn: 85946	2009-11-03 21:26:26 +00:00
Chris Lattner	69c523c813	merge a test into ipsccp-basic. running llvm-ld to get one pass is... bad. llvm-svn: 85945	2009-11-03 21:25:50 +00:00
Chris Lattner	cde8de519d	fix an IPSCCP bug I introduced when I changed IPSCCP to start working on functions that don't have local linkage. Basically, we need to be more careful about propagating argument information to functions whose results we aren't tracking. This fixes a miscompilation of LLVMCConfigurationEmitter.cpp when built with an llvm-gcc that has ipsccp enabled. llvm-svn: 85923	2009-11-03 19:24:51 +00:00
Chris Lattner	6ec614e15e	testcase for r85903 llvm-svn: 85906	2009-11-03 17:03:02 +00:00
Kenneth Uildriks	90fedc6ef9	Make opt default to not adding a target data string and update tests that depend on target data to supply it within the test llvm-svn: 85900	2009-11-03 15:29:06 +00:00
Chris Lattner	e364a32a65	merge 2008-03-10-sret.ll into ipsccp-basic.ll, and upgrade its syntax. llvm-svn: 85811	2009-11-02 18:27:22 +00:00
Chris Lattner	a3d794ebbb	disable IPSCCP support for multiple return values, it is buggy, so just disable it until I can fix it. llvm-svn: 85810	2009-11-02 18:22:51 +00:00
Chris Lattner	9d49f0c858	improve IPSCCP to be able to propagate the result of "!mayBeOverridden" function to calls of that function, regardless of whether it has local linkage or has its address taken. Not escaping should only affect whether we make an aggressive assumption about the arguments to a function, not whether we can track the result of it. llvm-svn: 85795	2009-11-02 07:33:59 +00:00
Chris Lattner	e77c9aa04a	Use the libanalysis 'ConstantFoldLoadFromConstPtr' function instead of reinventing SCCP-specific logic. This gives us new powers. llvm-svn: 85789	2009-11-02 06:06:14 +00:00
Chris Lattner	4e849162ef	fix a bug exposed by moving SRoA earlier which caused a crash building kc++ llvm-svn: 85786	2009-11-02 04:37:17 +00:00
Chris Lattner	3cd6a61b27	fix instcombine to only do store sinking when the alignments of the two loads agree. Propagate that onto the new store. llvm-svn: 85772	2009-11-02 02:06:37 +00:00
Chris Lattner	db3311edc7	merge a test into store.ll llvm-svn: 85771	2009-11-02 02:00:18 +00:00
Chris Lattner	d263dbec7a	convert to filecheck llvm-svn: 85770	2009-11-02 01:58:03 +00:00
Chris Lattner	3e6398baa5	merge phi-merge.ll into phi.ll I don't know what Dan wants to do with phi-merge-gep.ll, I'll let him deal with it because instcombine may end up sinking these. llvm-svn: 85739	2009-11-01 20:10:11 +00:00
Chris Lattner	328ef89bd1	when merging two loads, make sure to take the min of their alignment, not the max. This didn't matter until the previous patch because instcombine would refuse to sink loads with differing alignments. llvm-svn: 85738	2009-11-01 20:07:07 +00:00
Chris Lattner	0b40a8bc0e	fix a bug noticed by inspection: when instcombine sinks loads through phis, it didn't preserve the alignment of the load. This is a missed optimization if the alignment is high and a miscompilation when the alignment is low. llvm-svn: 85736	2009-11-01 19:50:13 +00:00
Chris Lattner	d162b5c955	convert to filecheck. llvm-svn: 85734	2009-11-01 19:22:20 +00:00
Dan Gohman	2d02ff8cbb	Revert r85667. LoopUnroll currently can't call utility functions which auto-update the DominatorTree because it doesn't keep the DominatorTree current while it works. llvm-svn: 85670	2009-10-31 17:33:01 +00:00
Dan Gohman	041e2dbad1	Merge the enhancements from LoopUnroll's FoldBlockIntoPredecessor into MergeBlockIntoPredecessor. This makes SimplifyCFG slightly more aggressive, and makes it unnecessary for LoopUnroll to have its own copy of this code. llvm-svn: 85667	2009-10-31 16:08:00 +00:00
Dan Gohman	56998cdc5b	Add a testcase for the recent duplicate PHI elimination changes. llvm-svn: 85636	2009-10-30 23:16:10 +00:00
Chris Lattner	dd5d035302	if basic blocks are destroyed while there are just BlockAddress' hanging around, then zap them. This is analogous to dangling constantexprs hanging off functions. llvm-svn: 85627	2009-10-30 22:39:36 +00:00
Victor Hernandez	0d025421cd	Extend getMallocArraySize() to determine the array size if the malloc argument is: ArraySize * ElementSize ElementSize * ArraySize ArraySize << log2(ElementSize) ElementSize << log2(ArraySize) Refactor isArrayMallocHelper and delete isSafeToGetMallocArraySize, so that there is only 1 copy of the malloc array determining logic. Update users of getMallocArraySize() to not bother calling isArrayMalloc() as well. llvm-svn: 85421	2009-10-28 20:18:55 +00:00
Owen Anderson	2b2bd28973	Treat lifetime begin/end markers as allocations/frees respectively for the purposes of GVN/DSE. llvm-svn: 85383	2009-10-28 07:05:35 +00:00
Owen Anderson	fc16e5a98f	Be more careful about invariance reasoning on "store" queries. Stores still need to depend on Ref and ModRef calls within the invariant region. llvm-svn: 85380	2009-10-28 06:30:52 +00:00
Owen Anderson	d0e86d57c1	Add trivial support for the invariance intrinsics to memdep. This logic is purely local for now. llvm-svn: 85378	2009-10-28 06:18:42 +00:00
Chris Lattner	c6b3b25f94	Fix a pretty serious misfeature of the inliner: if it inlines a function with multiple return values it inserts a PHI to merge them all together. However, if the return values are all the same, it ends up with a pointless PHI and this pointless PHI happens to really block SRoA from happening in at least a silly C++ example written by Doug, but probably others. This fixes rdar://7339069. llvm-svn: 85206	2009-10-27 05:39:41 +00:00
Chris Lattner	58ee24c8bf	convert to filecheck. llvm-svn: 85205	2009-10-27 05:35:35 +00:00
Edward O'Callaghan	e45ac76ee4	Convert a few tests to FileCheck for PR5307. llvm-svn: 85171	2009-10-26 22:52:03 +00:00
Dan Gohman	672927f393	Code that checks WillNotOverflowSignedAdd before creating an Add can safely use the NSW bit on the Add. llvm-svn: 85164	2009-10-26 22:14:22 +00:00
Chris Lattner	683eed3286	reapply r85085 with a bugfix to avoid infinite looping. All of the 'demorgan' related xforms need to use dyn_castNotVal, not m_Not. llvm-svn: 85119	2009-10-26 15:40:07 +00:00
Evan Cheng	8014a728b9	Revert 85085. It causes infinite looping during llvm-gcc build. llvm-svn: 85090	2009-10-26 03:51:32 +00:00
Chris Lattner	2e6564d6ff	Implement PR3266 & PR5276, folding: not (or (icmp, icmp)) -> and(icmp, icmp) llvm-svn: 85085	2009-10-26 01:06:31 +00:00
Chris Lattner	52880b29d2	convert or.ll to filecheck and merge or2 into it. llvm-svn: 85083	2009-10-25 23:47:55 +00:00
Dan Gohman	a484d17ec5	Make these tests more interesting by using -verify-dom-info and -verify-loop-info, which enable additional (expensive) consistency checks. llvm-svn: 85017	2009-10-24 23:23:04 +00:00
Chris Lattner	9e2d5b3b8e	fix PR5287, a serious regression from my previous patches. Thanks to Duncan for the nice tiny testcase. llvm-svn: 84992	2009-10-24 05:22:15 +00:00
Victor Hernandez	e297149e26	Auto-upgrade free instructions to calls to the builtin free function. Update all analysis passes and transforms to treat free calls just like FreeInst. Remove RaiseAllocations and all its tests since FreeInst no longer needs to be raised. llvm-svn: 84987	2009-10-24 04:23:03 +00:00
Dan Gohman	41d00ac45b	Make LoopDeletion check the maximum backedge taken count, rather than the exact backedge taken count, when checking for infinite loops. This allows it to delete loops with multiple exit conditions. llvm-svn: 84952	2009-10-23 17:10:01 +00:00
Chris Lattner	ccf1e84779	teach libanalysis to simplify vector loads with bitcast sources. This implements something out of Target/README.txt producing: _foo: ## @foo movl 4(%esp), %eax movapd LCPI1_0, %xmm0 movapd %xmm0, (%eax) ret $4 instead of: _foo: ## @foo movl 4(%esp), %eax movapd _b, %xmm0 mulpd LCPI1_0, %xmm0 addpd _a, %xmm0 movapd %xmm0, (%eax) ret $4 llvm-svn: 84942	2009-10-23 06:57:37 +00:00
Chris Lattner	59f94c01dd	enhance FoldReinterpretLoadFromConstPtr to handle loads of up to 32 bytes (i256). llvm-svn: 84941	2009-10-23 06:50:36 +00:00
Chris Lattner	ed00b80bf8	teach libanalysis to fold int and fp loads from almost arbitrary non-type-safe constant initializers. This sort of thing happens quite a bit for 4-byte loads out of string constants, unions, bitfields, and an interesting endianness check from sqlite, which is something like this: const int sqlite3one = 1; # define SQLITE_BIGENDIAN (*(char *)(&sqlite3one)==0) # define SQLITE_LITTLEENDIAN (*(char *)(&sqlite3one)==1) # define SQLITE_UTF16NATIVE (SQLITE_BIGENDIAN?SQLITE_UTF16BE:SQLITE_UTF16LE) all of these macros now constant fold away. This implements PR3152 and is based on a patch started by Eli, but heavily modified and extended. llvm-svn: 84936	2009-10-23 06:23:49 +00:00
Chris Lattner	c7a962d3b3	fix PR5262. llvm-svn: 84810	2009-10-22 00:17:26 +00:00
Chris Lattner	966526cbfb	revert r84754, it isn't the right approach. Edwin, please propose patches for fixes like this instead of committing them directly. llvm-svn: 84799	2009-10-21 23:41:58 +00:00
Victor Hernandez	be9e179104	Make changes to rev 84292 as requested by Chris Lattner. Most changes are cleanup, but there is 1 correctness fix: I fixed InstCombine so that the icmp is removed only if the malloc call is removed (which requires explicit removal because the Worklist won't DCE any calls since they can have side-effects). llvm-svn: 84772	2009-10-21 19:11:40 +00:00
Torok Edwin	1539a352a6	Fix PR5262: when folding select into PHI, make sure all operands are available in the PHI's Basic Block. This uses a conservative approach, because we don't have dominator info in instcombine. llvm-svn: 84754	2009-10-21 10:49:00 +00:00
Chris Lattner	0f15e03c5a	add a real testcase for PR4313 llvm-svn: 84676	2009-10-20 21:04:26 +00:00
Chris Lattner	582d056b14	add a test similar to that needed for PR4313, but that doesn't fail without the patch. llvm-svn: 84675	2009-10-20 21:00:47 +00:00
Chris Lattner	8468c8e857	the date on this testcase is wrong, it is unreduced, and it passes without the fix for PR4313. llvm-svn: 84674	2009-10-20 20:57:58 +00:00
Chris Lattner	c702b6ab37	merge and filecheckize llvm-svn: 84672	2009-10-20 20:39:43 +00:00
Chris Lattner	591d4da790	merge two tests and convert to filecheck. llvm-svn: 84671	2009-10-20 20:33:46 +00:00
Chris Lattner	7f903681ac	alternate fix for PR5258 which avoids worklist problems, with reduced testcase. llvm-svn: 84667	2009-10-20 20:27:49 +00:00
Torok Edwin	cf10ec951d	Fix PR5258, jump-threading creating invalid PHIs. When an incoming value for a PHI is updated, we must also update all other incoming values for the same BB to match, otherwise we create invalid PHIs. llvm-svn: 84638	2009-10-20 15:42:00 +00:00
Torok Edwin	729d92bd74	Fix PR4313: IPSCCP was not setting the lattice value for the invoke instruction when the invoke had multiple return values: it set the lattice value only on the extractvalue. This caused the invoke's lattice value to remain the default (undefined), and later propagated to extractvalue's operand, which incorrectly introduces undefined behavior. llvm-svn: 84637	2009-10-20 15:15:09 +00:00
Dan Gohman	8f986672a1	Fix SplitBlockPredecessors' LoopInfo updating code to handle the case where a loop's header is being split and it has predecessors which are not contained by the most-nested loop which contains the loop. This fixes PR5235. llvm-svn: 84505	2009-10-19 16:04:50 +00:00
Chris Lattner	8054401989	remove a now-pointless regtest llvm-svn: 84409	2009-10-18 05:20:17 +00:00
Chris Lattner	00c6ac7bc2	remove testcase for dead pass llvm-svn: 84406	2009-10-18 05:03:41 +00:00
Chris Lattner	f67d297eda	Teach vm core to more aggressively fold 'trunc' constantexprs, allowing it to simplify the crazy constantexprs in the testcases down to something sensible. This allows -std-compile-opts to completely "devirtualize" the pointers to member functions in the testcase from PR5176. llvm-svn: 84368	2009-10-17 21:53:27 +00:00
Chris Lattner	6f463f9ad4	remove # uses from FileCheck lines. llvm-svn: 84367	2009-10-17 21:51:19 +00:00
Chris Lattner	965fe98af6	rename test llvm-svn: 84364	2009-10-17 21:31:19 +00:00
Chris Lattner	88b36f1140	Simplify some code (first hunk) and fix PR5208 (second hunk) by updating the callgraph when introducing a call. llvm-svn: 84310	2009-10-17 05:39:39 +00:00
Victor Hernandez	c7d6a8327c	Autoupgrade malloc insts to malloc calls. Update testcases that rely on malloc insts being present. Also prematurely remove MallocInst handling from IndMemRemoval and RaiseAllocations to help pass tests in this incremental step. llvm-svn: 84292	2009-10-17 00:00:19 +00:00
Victor Hernandez	264da3274e	HeapAllocSRoA also needs to check if malloc array size can be computed. llvm-svn: 84288	2009-10-16 23:12:25 +00:00
Victor Hernandez	c81923e07c	Invert isSafeToGetMallocArraySize check because we return NULL when we don't know the size. Thanks to Duncan Sands for noticing this bug. llvm-svn: 84260	2009-10-16 18:07:17 +00:00
Duncan Sands	de3f2c26c6	Check that GVN performs this transform even if the calls themselves are not marked readonly, but only the called functions. llvm-svn: 84253	2009-10-16 12:18:23 +00:00
Chris Lattner	6b9044db01	make instcombine's instruction sinking more aggressive in the presence of PHI nodes. llvm-svn: 84103	2009-10-14 15:21:58 +00:00
Chris Lattner	19788ca686	change simplifycfg to not duplicate 'unwind' instructions. Hopefully this will increase the likelihood of common code getting sunk towards the unwind. llvm-svn: 83996	2009-10-13 18:13:05 +00:00
Chris Lattner	8d6d09379d	convert to filecheck llvm-svn: 83995	2009-10-13 18:10:05 +00:00
Chris Lattner	6f55a81bb9	rename test llvm-svn: 83994	2009-10-13 18:08:21 +00:00
Victor Hernandez	70e8505eb1	Memory dependence analysis was incorrectly stopping to scan for stores to a pointer at bitcast uses of a malloc call. It should continue scanning until the malloc call, and this patch fixes that. llvm-svn: 83931	2009-10-13 01:42:53 +00:00
Edward O'Callaghan	1c591f74c7	Missing CHECK: lines makes test exit abnormally. llvm-svn: 83835	2009-10-12 09:01:26 +00:00
Edward O'Callaghan	8720e8c8f3	FileCheck not CheckFile, oops. llvm-svn: 83834	2009-10-12 08:51:28 +00:00
Edward O'Callaghan	6d01608662	Convert InstCombine/call.ll to CheckFile. llvm-svn: 83833	2009-10-12 08:46:47 +00:00
Edward O'Callaghan	cbf75a5dc3	Convert the rest of the InstCombine tests from notcast to FileCheck. llvm-svn: 83828	2009-10-12 07:18:14 +00:00
Nick Lewycky	31a57ea0dd	Remove this part of the test, it never actually tested anything anyways. This unbreaks make check after evocallaghan's changes. llvm-svn: 83827	2009-10-12 06:32:42 +00:00
Edward O'Callaghan	940da903e2	Fix syntax error missed in converting zext.ll test. Convert 2003-11-13-ConstExprCastCall.ll to FileCheck from notcast. llvm-svn: 83826	2009-10-12 06:23:56 +00:00
Edward O'Callaghan	484b6c2cfc	Convert InstCombine tests from notcast to FileCheck. llvm-svn: 83825	2009-10-12 06:14:06 +00:00
Chris Lattner	06462efb47	reduce vec_shuffle2 and merge into vec_shuffle. llvm-svn: 83807	2009-10-11 22:54:48 +00:00
Chris Lattner	6373045e7d	filecheckize vec_shuffle.ll and merge shuffle.ll into it. llvm-svn: 83806	2009-10-11 22:52:15 +00:00
Chris Lattner	79a2f91f65	filecheckize llvm-svn: 83805	2009-10-11 22:45:17 +00:00
Chris Lattner	8308fd9aab	rename test llvm-svn: 83804	2009-10-11 22:44:16 +00:00
Chris Lattner	e660ee0a3b	remove old testcase llvm-svn: 83803	2009-10-11 22:42:06 +00:00
Chris Lattner	1fe15dbbbb	merge test into shift.ll, this also eliminates awful grepping on -stats output llvm-svn: 83802	2009-10-11 22:39:58 +00:00
Chris Lattner	d7969a2796	convert to filecheck. llvm-svn: 83801	2009-10-11 22:36:59 +00:00
Chris Lattner	c6cdbfbfdd	teach instcombine to simplify xor's harder, catching the new testcase. llvm-svn: 83799	2009-10-11 22:22:13 +00:00
Chris Lattner	7db5b7893d	convert xor2 to filecheck, merge in a random regtest llvm-svn: 83796	2009-10-11 21:42:08 +00:00
Chris Lattner	fd27f8a5b3	generalize a transformation even more: we don't care whether the input to the mul is a zext from bool, just that it is all zeros other than the low bit. This fixes some phase ordering issues that would cause us to miss some xforms in mul.ll when the worklist is visited differently. llvm-svn: 83794	2009-10-11 21:29:45 +00:00
Chris Lattner	406cb75c6b	simplify a transformation by making it more general. llvm-svn: 83792	2009-10-11 21:22:21 +00:00
Torok Edwin	907ec36943	LICM shouldn't sink/delete debug information. Fix this and add a testcase. For now the metadata of sunk/hoisted instructions is still wrong, but that'll be fixed when instructions have debug metadata directly attached. llvm-svn: 83786	2009-10-11 19:15:54 +00:00
Chris Lattner	85c85c5e04	when folding duplicate conditions, delete the now-probably-dead instruction tree feeding it. llvm-svn: 83778	2009-10-11 18:39:58 +00:00
Chris Lattner	e374382b8f	implement rdar://7293527, a trivial instcombine that llvm-gcc gets but clang doesn't, because it is implemented in GCC's fold routine. llvm-svn: 83761	2009-10-11 07:53:15 +00:00
Chris Lattner	97b1405207	implement a transformation in jump threading that is currently done by condprop, but do it in a much more general form. The basic idea is that we can do a limited form of tail duplication in the case when we have a branch on a phi. Moving the branch up in to the predecessor block makes instruction selection much easier and encourages chained jump threadings. llvm-svn: 83759	2009-10-11 07:24:57 +00:00
Chris Lattner	4140d8bd5c	another testcase jump threading shouldn't crash on. llvm-svn: 83758	2009-10-11 07:11:11 +00:00
Chris Lattner	ece16f2335	rename a file, remove a poorly reduced testcase. llvm-svn: 83757	2009-10-11 07:10:28 +00:00
Chris Lattner	f99a74e24b	make jump threading on a phi with undef inputs happen. llvm-svn: 83754	2009-10-11 04:18:15 +00:00
Chris Lattner	8d186bfafb	merge two tests. llvm-svn: 83751	2009-10-11 03:55:30 +00:00
Chris Lattner	041c1dca8b	simplify some run lines, convert a test to filecheck. llvm-svn: 83750	2009-10-11 03:54:21 +00:00
Chris Lattner	b6c65faa64	switch GVN to use SSAUpdater. Besides removing a lot of complexity from GVN, this also speeds it up, inserts fewer PHI nodes (see the testcase) and allows it to remove more loads (due to fewer PHI nodes standing in the way). llvm-svn: 83746	2009-10-10 23:50:30 +00:00
Dale Johannesen	3059924bdd	When considering whether to inline Callee into Caller, and that will make Caller too big to inline, see if it might be better to inline Caller into its callers instead. This situation is described in PR 2973, although I haven't tried the specific case in SPASS. llvm-svn: 83602	2009-10-09 00:11:32 +00:00
Chris Lattner	a893f5bdf5	remove predicate simplifier, it never got the last bugs beaten out of it, and jump threading, condprop and gvn are now getting most of the benefit. This was approved by Nicholas and Nicolas. llvm-svn: 83390	2009-10-06 16:59:46 +00:00
Evan Phoenix	44e5dbcaf0	Extend ConstantFolding to understand signed overflow variants llvm-svn: 83338	2009-10-05 22:53:52 +00:00
Chris Lattner	59d939894b	teach the optimizer how to constant fold uadd/usub intrinsics. llvm-svn: 83295	2009-10-05 05:26:04 +00:00
Chris Lattner	463716d559	instcombine shouldn't delete all null checks for mallocs. This fixes PR5130. llvm-svn: 83290	2009-10-05 02:47:47 +00:00
Chris Lattner	5f3cc06cd2	remove the GVNPRE pass. It has been subsumed by the GVN pass. Ok'd by Owen. llvm-svn: 83193	2009-10-01 02:18:36 +00:00
Dan Gohman	82ef61857e	Add a testcase for r83011. llvm-svn: 83012	2009-09-28 21:03:02 +00:00
Dan Gohman	21c0774ba9	Add a testcase to help test analysis preservation. llvm-svn: 83002	2009-09-28 18:40:27 +00:00
Chris Lattner	0261b5d2d2	The select instruction is not necessarily in the same block as the phi nodes. Make sure to phi translate from the right block. This fixes a llvm-building-llvm failure on GVN-PRE.cpp llvm-svn: 82970	2009-09-28 06:49:44 +00:00
Dan Gohman	4dbb301f17	Move the dominator verification code out of special code embedded within the PassManager code into a regular verifyAnalysis method. Also, reorganize loop verification. Make the LoopPass infrastructure call verifyLoop as needed instead of having LoopInfo::verifyAnalysis check every loop in the function after each loop pass. Add a new command-line argument, -verify-loop-info, to enable the expensive full checking. llvm-svn: 82952	2009-09-28 00:27:48 +00:00
Chris Lattner	ae289632ef	Enhance the previous fix for PR4895 to allow more values than just simple constants for the true/false value of the select. We now do phi translation etc. This really fixes PR4895 :) llvm-svn: 82917	2009-09-27 20:18:49 +00:00
Chris Lattner	facb867af3	implement PR4895, by making FoldOpIntoPhi handle select conditions that are phi nodes. Also tighten up FoldOpIntoPhi to treat constantexpr operands to phis just like other variables, avoiding moving constantexpr computations around. Patch by Daniel Dunbar. llvm-svn: 82913	2009-09-27 19:57:57 +00:00
Nick Lewycky	b56e1ab033	Filecheckify this one test. llvm-svn: 82888	2009-09-27 06:25:05 +00:00
Dan Gohman	62995c71a2	Fix SimplifyLibCalls to transfer attributes from callees rather than calls, since direct calls don't always reflect the attributes of their callees. llvm-svn: 82867	2009-09-26 18:10:13 +00:00
Dan Gohman	5bafe38916	Fix a case where ScalarEvolution was expanding pointer arithmetic to inttoptr/ptrtoint unnecessarily. llvm-svn: 82864	2009-09-26 16:11:57 +00:00
Dan Gohman	48f7da742a	I put the wrong rdar number in this test. llvm-svn: 82829	2009-09-26 01:11:57 +00:00
Dan Gohman	5ffd53892d	Transform pow(x, 0.5) to (x == -inf ? inf : fabs(sqrt(x))), which is typically faster than doing a general pow. llvm-svn: 82819	2009-09-25 23:10:17 +00:00
Dale Johannesen	f6a987b784	Handle sqrt in CannotBeNegativeZero. absf and absl appear to be misspellings, removed in favor of fabs*. llvm-svn: 82796	2009-09-25 20:54:50 +00:00
Victor Hernandez	e6ff7662b6	Revert 82694 "Auto-upgrade malloc instructions to malloc calls." because it causes regressions in the nightly tests. llvm-svn: 82784	2009-09-25 18:11:52 +00:00
Torok Edwin	21bd8c9fc5	Constant propagating byval pointer is safe if function is readonly. llvm-svn: 82700	2009-09-24 18:33:42 +00:00
Victor Hernandez	46cd467310	Auto-upgrade malloc instructions to malloc calls. Reviewed by Devang Patel. llvm-svn: 82694	2009-09-24 17:47:49 +00:00
Torok Edwin	f95a450ef9	Don't constant propagate byval pointers, since they are not really pointers, but rather structs passed by value. This fixes PR5038. llvm-svn: 82689	2009-09-24 09:47:18 +00:00
Chris Lattner	cf295039e4	Fix PR5023: The instruction form of DominatorTree::dominates did not take into consideration that the result of an invoke is only valid in the normal dest, not the unwind dest. This caused 'PHINode::hasConstantValue' to return true in an invalid situation, causing mem2reg to delete a phi that was actually needed. This caused a crash building 483.xalancbmk. llvm-svn: 82491	2009-09-21 22:39:35 +00:00
Chris Lattner	9045f235d2	fix PR5016, a crash I introduced in GVN handling first class arrays and structs, which cannot be bitcast to integers. llvm-svn: 82460	2009-09-21 17:24:04 +00:00
Chris Lattner	4d8af2f1ae	enable non-local analysis and PRE of large store -> little load. This doesn't kick in too much because of phi translation issues, but this can be resolved in the future. llvm-svn: 82447	2009-09-21 06:48:08 +00:00
Chris Lattner	e2b8a80487	add pr# llvm-svn: 82440	2009-09-21 05:57:47 +00:00
Chris Lattner	0a9616d906	Improve GVN to be able to forward substitute a small load from a piece of a large store when both are in the same block. This allows clang to compile the testcase in PR4216 to this code: _test_bitfield: movl 4(%esp), %eax movl %eax, %ecx andl $-65536, %ecx orl $32962, %eax andl $40186, %eax orl %ecx, %eax ret This is not ideal, but is a whole lot better than the code produced by llvm-gcc: _test_bitfield: movw $-32574, %ax orw 4(%esp), %ax andw $-25350, %ax movw %ax, 4(%esp) movw 7(%esp), %cx shlw $8, %cx movzbl 6(%esp), %edx orw %cx, %dx movzwl %dx, %ecx shll $16, %ecx movzwl %ax, %eax orl %ecx, %eax ret and dramatically better than that produced by gcc 4.2: _test_bitfield: pushl %ebx call L3 "L00000000001$pb": L3: popl %ebx movl 8(%esp), %eax leal 0(,%eax,4), %edx sarb $7, %dl movl %eax, %ecx andl $7168, %ecx andl $-7201, %ebx movzbl %dl, %edx andl $1, %edx sall $5, %edx orl %ecx, %ebx orl %edx, %ebx andl $24, %eax andl $-58336, %ebx orl %eax, %ebx orl $32962, %ebx movl %ebx, %eax popl %ebx ret llvm-svn: 82439	2009-09-21 05:57:11 +00:00
Chris Lattner	b9f2bf46f7	fix a FileCheck bug where: ; CHECK: foo ; CHECK-NOT: foo ; CHECK: bar would always fail. llvm-svn: 82424	2009-09-21 02:30:42 +00:00
Daniel Dunbar	ffb60d566f	Work around a FileCheck bug, for now. llvm-svn: 82416	2009-09-20 23:30:31 +00:00
Chris Lattner	7e6d56ebc5	Revert r82404, it is causing a bootstrap miscompile. This is very very scary, as it indicates a lurking bug. yay. llvm-svn: 82411	2009-09-20 22:44:26 +00:00
Chris Lattner	973f14c8fa	this was not supposed to be committed llvm-svn: 82409	2009-09-20 22:36:11 +00:00
Chris Lattner	236d2d5e7b	implement and document support for CHECK-NOT llvm-svn: 82408	2009-09-20 22:35:26 +00:00
Chris Lattner	eea16a168a	improve memdep to eliminate bitcasts (and aliases, and noop geps) early for the stated reasons: this allows it to find more equivalences and depend less on code layout. llvm-svn: 82404	2009-09-20 21:00:18 +00:00
Chris Lattner	a0aa8fb6a6	Move CoerceAvailableValueToLoadType earlier in GVN.cpp. Hook it up so that nonlocal and partially redundant loads can use it as well. The testcase shows examples of craziness this can handle. This triggers many times in 176.gcc. llvm-svn: 82403	2009-09-20 20:09:34 +00:00
Chris Lattner	1dd48c34e5	enhance GVN to forward substitute a stored value to a load (and load -> load) when the base pointers must alias but when they are different types. This occurs very very frequently in 176.gcc and other code that uses bitfields a lot. llvm-svn: 82399	2009-09-20 19:03:47 +00:00
Nick Lewycky	9b3ed87506	Peer through zext and sext to eliminate them when it is safe to do so. llvm-svn: 82389	2009-09-20 07:31:25 +00:00
Nick Lewycky	b0225ba289	Fold 'icmp eq (icmp), true' into an xor(icmp). llvm-svn: 82386	2009-09-20 07:21:39 +00:00
Nick Lewycky	22fc051bd7	Rewrite this check so that it checks what it's supposed to and doesn't use CHECK-NOT. llvm-svn: 82383	2009-09-20 07:00:24 +00:00
Nick Lewycky	28260409f2	Teach the constant folder how to not a cmpinst. llvm-svn: 82378	2009-09-20 06:24:51 +00:00
Nick Lewycky	4a03452077	Try turning icmp(bitcast(x), bitcast(y)) into icmp(bitcast(bitcast(x)), y) in the hopes that the two bitcasts will merge. llvm-svn: 82371	2009-09-20 05:48:50 +00:00
Nick Lewycky	605109d151	Teach the constant folder how to handle a few simple i1 cases. llvm-svn: 82340	2009-09-20 00:04:02 +00:00
Dan Gohman	e5acc61f03	Fix the comment in this test. llvm-svn: 82051	2009-09-16 16:33:59 +00:00
Dan Gohman	3b7ce109ec	Don't sink gep operators through phi nodes if the result would require more than one phi, since that leads to higher register pressure on entry to the phi. This is especially problematic when the phi is in a loop header, as it increases register pressure throughout the loop. llvm-svn: 81993	2009-09-16 02:01:52 +00:00
Chris Lattner	d7490a4763	convert to filecheck llvm-svn: 81848	2009-09-15 06:34:29 +00:00
Dan Gohman	f9eafce3af	When extending a memset range past the front, set the alignment of the memset region to the alignment of the new start address. llvm-svn: 81810	2009-09-14 23:39:10 +00:00
Dan Gohman	a080159a7c	Convert more tests to avoid llvm-as. llvm-svn: 81545	2009-09-11 18:36:27 +00:00
Dan Gohman	0f3ef7be50	Eliminate more redundant llvm-as calls. llvm-svn: 81540	2009-09-11 18:17:12 +00:00
Dan Gohman	1880092722	Change tests from "opt %s" to "opt < %s" so that opt doesn't see the input filename so that opt doesn't print the input filename in the output so that grep lines in the tests don't unintentionally match strings in the input filename. llvm-svn: 81537	2009-09-11 18:01:28 +00:00
Chris Lattner	7158513fe0	another random update llvm-svn: 81531	2009-09-11 17:07:01 +00:00
Chris Lattner	e54242dc02	fix a bunch of spurious failures for people whose home directory is sabre. llvm-svn: 81528	2009-09-11 17:02:12 +00:00
Dan Gohman	21c6216c87	Teach lib/VMCore/ConstantFold.cpp how to set the inbounds keyword and how to fold notionally-out-of-bounds array getelementptr indices instead of just doing these in lib/Analysis/ConstantFolding.cpp, because it can be done in a fairly general way without TargetData, and because not all constants are visited by lib/Analysis/ConstantFolding.cpp. This enables more constant folding. Also, set the "inbounds" flag when the getelementptr indices are one-past-the-end. llvm-svn: 81483	2009-09-11 00:04:14 +00:00
Dan Gohman	7190d48075	Factor out the code for checking that all indices in a getelementptr are within the notional bounds of the static type of the getelementptr (which is not the same as "inbounds") from GlobalOpt into a utility routine, and use it in ConstantFold.cpp to check whether there are any mis-behaved indices. llvm-svn: 81478	2009-09-10 23:37:55 +00:00
Dan Gohman	ec4557f324	Fix SplitCriticalEdge to properly update LCSSA form when splitting a loop exit edge -- new PHIs may be needed not only for the additional splits that are made to preserve LoopSimplify form, but also for the original split. Factor out the code that inserts new PHIs so that it can be used for both. Remove LoopRotation.cpp's code for manually updating LCSSA form, as it is now redundant. This fixes PR4934. llvm-svn: 81363	2009-09-09 18:18:18 +00:00
Daniel Dunbar	d556bc48d7	Update test. llvm-svn: 81314	2009-09-09 02:41:50 +00:00
Dan Gohman	c466e31309	Use "opt < %s" instead of "opt %s" to keep the testname away from the grep. llvm-svn: 81299	2009-09-09 00:22:49 +00:00


1603 Commits