llvm-project

Author	SHA1	Message	Date
Chris Lattner	f5f944aeaa	no reason for simplifylibcalls to simplify intrinsics, instcombine does a fine job. llvm-svn: 50470	2008-04-30 06:12:15 +00:00
Chris Lattner	4b20032b08	remove redundant check. llvm-svn: 50469	2008-04-30 06:06:37 +00:00
Owen Anderson	ff7d7b18e5	Fix a bug in memcpyopt where the memcpy-memcpy transform was never being applied because we were checking for it in the wrong order. This caused a miscompilation because the return slot optimization assumes that the call it is dealing with is NOT a memcpy. llvm-svn: 50444	2008-04-29 21:26:06 +00:00
Chris Lattner	d9e3b5c5bd	don't eliminate load from volatile value on paths where the load is dead. This fixes the second half of PR2262 llvm-svn: 50430	2008-04-29 17:28:22 +00:00
Chris Lattner	53bcf3609a	make this test reduced and valid llvm-svn: 50429	2008-04-29 17:25:32 +00:00
Chris Lattner	9233c124c9	fix a subtle volatile handling bug. llvm-svn: 50428	2008-04-29 17:13:43 +00:00
Chris Lattner	e331a65c79	don't delete the last store to an alloca if the store is volatile. llvm-svn: 50390	2008-04-29 04:58:38 +00:00
Dan Gohman	8cb19d967f	Fix DSE to not eliminate volatile loads with no uses. llvm-svn: 50370	2008-04-28 19:51:27 +00:00
Dan Gohman	72ec3f4562	Teach InstCombine's ComputeMaskedBits what SelectionDAG's ComputeMaskedBits knows about cttz, ctlz, and ctpop. Teach SelectionDAG's ComputeMaskedBits what InstCombine's knows about SRem. And teach them both some things about high bits in Mul, UDiv, URem, and Sub. This allows instcombine and dagcombine to eliminate sign-extension operations in several new cases. llvm-svn: 50358	2008-04-28 17:02:21 +00:00
Chris Lattner	8be72700b8	Fix PR2256, yet another miscompilation in simplifycfg of multiple return values. Bill, please pull this into Tak. llvm-svn: 50332	2008-04-28 00:19:07 +00:00
Chris Lattner	67ca6f6347	When SRoA'ing a global variable, make sure the new globals get the appropriate alignment. This fixes a miscompilation of 252.eon on x86-64 (rdar://5891920). Bill, please pull this into Tak. llvm-svn: 50308	2008-04-26 07:40:11 +00:00
Nick Lewycky	4d43d3c72c	Remove 'unwinds to' support from mainline. This patch undoes r47802 r47989 r48047 r48084 r48085 r48086 r48088 r48096 r48099 r48109 and r48123. llvm-svn: 50265	2008-04-25 16:53:59 +00:00
Chris Lattner	f7de528463	Don't infinitely thread branches when a threaded edge goes back to the block, e.g.: Threading edge through bool from 'bb37.us.thread3829' to 'bb37.us' with cost: 1, across block: bb37.us: ; preds = %bb37.us.thread3829, %bb37.us, %bb33 %D1361.1.us = phi i32 [ %tmp36, %bb33 ], [ %D1361.1.us, %bb37.us ], [ 0, %bb37.us.thread3829 ] ; <i32> [#uses=2] %tmp39.us = icmp eq i32 %D1361.1.us, 0 ; <i1> [#uses=1] br i1 %tmp39.us, label %bb37.us, label %bb42.us llvm-svn: 50251	2008-04-25 04:12:29 +00:00
Chris Lattner	86bbf338e5	Split some code out of the main SimplifyCFG loop into its own function. Fix said code to handle merging return instructions together correctly when handling multiple return values. llvm-svn: 50199	2008-04-24 00:01:19 +00:00
Chris Lattner	5a58a4dc6d	Rewrite multiple return value handling in SCCP. Before, the -sccp pass would turn every getresult instruction into undef. This helps with rdar://5778210 llvm-svn: 50140	2008-04-23 05:38:20 +00:00
Chris Lattner	14f41bfc49	remove this testcase. It isn't testing loop rotate, it is testing all of -std-compile-opts and is now failing because other passes are generating IR that looks different to input of loop rotate. Devang, please introduce a testcase that only runs loop rotate. llvm-svn: 50136	2008-04-23 05:36:04 +00:00
Chris Lattner	3376d6d824	make this test more interesting. llvm-svn: 50128	2008-04-23 03:49:32 +00:00
Chris Lattner	2161d6c075	distill down the essence of this test. llvm-svn: 50125	2008-04-23 03:03:42 +00:00
Dale Johannesen	c4d3c1cbe0	new test llvm-svn: 50123	2008-04-23 01:22:22 +00:00
Evan Cheng	1c89ca7295	Don't do: "(X & 4) >> 1 == 2 --> (X & 4) == 4" if there is more than one use of the shift result. llvm-svn: 50118	2008-04-23 00:38:06 +00:00
Chris Lattner	37e9c187b0	Start doing the significantly useful part of jump threading: handle cases where a comparison has a phi input and that phi is a constant. For example, stuff like: Threading edge through bool from 'bb2149' to 'bb2231' with cost: 1, across block: bb2237: ; preds = %bb2231, %bb2149 %tmp2328.rle = phi i32 [ %tmp2232, %bb2231 ], [ %tmp2232439, %bb2149 ] ; <i32> [#uses=2] %done.0 = phi i32 [ %done.2, %bb2231 ], [ 0, %bb2149 ] ; <i32> [#uses=1] %tmp2239 = icmp eq i32 %done.0, 0 ; <i1> [#uses=1] br i1 %tmp2239, label %bb2231, label %bb2327 or bb38.i298: ; preds = %bb33.i295, %bb1693 %tmp39.i296.rle = phi %struct.ibox* [ null, %bb1693 ], [ %tmp39.i296.rle1109, %bb33.i295 ] ; <%struct.ibox> [#uses=2] %minspan.1.i291.reg2mem.1 = phi i32 [ 32000, %bb1693 ], [ %minspan.0.i288, %bb33.i295 ] ; <i32> [#uses=1] %tmp40.i297 = icmp eq %struct.ibox %tmp39.i296.rle, null ; <i1> [#uses=1] br i1 %tmp40.i297, label %implfeeds.exit311, label %bb43.i301 This triggers thousands of times in spec. llvm-svn: 50110	2008-04-22 21:40:39 +00:00
Chris Lattner	d5425e8f8d	Dig through multiple levels of AND to thread jumps if needed. llvm-svn: 50106	2008-04-22 20:46:09 +00:00
Chris Lattner	3df4c15dc7	Teach jump threading to thread through blocks like: br (and X, phi(Y, Z, false)), label L1, label L2 This triggers once on 252.eon and 6 times on 176.gcc. Blocks in question often look like this: bb262: ; preds = %bb261, %bb248 %iftmp.251.0 = phi i1 [ true, %bb261 ], [ false, %bb248 ] ; <i1> [#uses=4] %tmp270 = icmp eq %struct.rtx_def* %tmp.0.i, null ; <i1> [#uses=1] %bothcond = or i1 %iftmp.251.0, %tmp270 ; <i1> [#uses=1] br i1 %bothcond, label %bb288, label %bb273 In this case, it is clear that it doesn't matter if tmp.0.i is null when coming from bb261. When coming from bb248, it is all that matters. Another random example: check_asm_operands.exit: ; preds = %check_asm_operands.exit.thr_comm, %bb30.i, %bb12.i, %bb6.i413 %tmp.0.i420 = phi i1 [ true, %bb6.i413 ], [ true, %bb12.i ], [ true, %bb30.i ], [ false, %check_asm_operands.exit.thr_comm ; <i1> [#uses=1] call void @llvm.stackrestore( i8* %savedstack ) nounwind %tmp4389 = icmp eq i32 %added_sets_1.0, 0 ; <i1> [#uses=1] %tmp4394 = icmp eq i32 %added_sets_2.0, 0 ; <i1> [#uses=1] %bothcond80 = and i1 %tmp4389, %tmp4394 ; <i1> [#uses=1] %bothcond81 = and i1 %bothcond80, %tmp.0.i420 ; <i1> [#uses=1] br i1 %bothcond81, label %bb4398, label %bb4397 Here is the case from 252.eon: bb290.i.i: ; preds = %bb23.i57.i.i, %bb8.i39.i.i, %bb100.i.i, %bb100.i.i, %bb85.i.i110 %myEOF.1.i.i = phi i1 [ true, %bb100.i.i ], [ true, %bb100.i.i ], [ true, %bb85.i.i110 ], [ true, %bb8.i39.i.i ], [ false, %bb23.i57.i.i ] ; <i1> [#uses=2] %i.4.i.i = phi i32 [ %i.1.i.i, %bb85.i.i110 ], [ %i.0.i.i, %bb100.i.i ], [ %i.0.i.i, %bb100.i.i ], [ %i.3.i.i, %bb8.i39.i.i ], [ %i.3.i.i, %bb23.i57.i.i ] ; <i32> [#uses=3] %tmp292.i.i = load i8* %tmp16.i.i100, align 1 ; <i8> [#uses=1] %tmp293.not.i.i = icmp ne i8 %tmp292.i.i, 0 ; <i1> [#uses=1] %bothcond.i.i = and i1 %tmp293.not.i.i, %myEOF.1.i.i ; <i1> [#uses=1] br i1 %bothcond.i.i, label %bb202.i.i, label %bb301.i.i Factoring out 3 common predecessors. 
On the path from any blocks other than bb23.i57.i.i, the load and compare are dead. llvm-svn: 50096	2008-04-22 07:05:46 +00:00
Chris Lattner	3cc28ce1ed	add a basic testcase. llvm-svn: 50093	2008-04-22 06:35:14 +00:00
Chris Lattner	c3a439351c	optimize "p != gep p, ..." better. This allows us to compile getelementptr-seteq.ll into: define i1 @test(i64 %X, %S* %P) { %C = icmp eq i64 %X, -1 ; <i1> [#uses=1] ret i1 %C } instead of: define i1 @test(i64 %X, %S* %P) { %A.idx.mask = and i64 %X, 4611686018427387903 ; <i64> [#uses=1] %C = icmp eq i64 %A.idx.mask, 4611686018427387903 ; <i1> [#uses=1] ret i1 %C } And fixes the second half of PR2235. This speeds up the insertion sort case by 45%, from 1.12s to 0.77s. In practice, this will significantly speed up for loops structured like: for (double *P = Base + N; P != Base; --P) ... Which happens frequently for C++ iterators. llvm-svn: 50079	2008-04-22 02:53:33 +00:00
Owen Anderson	6a7355caa2	Refactor memcpyopt based on Chris' suggestions. Consolidate several functions and simplify code that was fallout from the separation of memcpyopt and gvn. llvm-svn: 50034	2008-04-21 07:45:10 +00:00
Chris Lattner	b839c05a05	rename .llx -> .ll, last batch. llvm-svn: 49971	2008-04-19 22:32:52 +00:00
Owen Anderson	81f7584c4e	XFAIL this test for the moment. The real solution is to prevent ADCE from transforming loops and to add a separate loop pass for removing loops with known trip counts. Until that happens, ADCE is miscompiling this code. llvm-svn: 49769	2008-04-16 04:25:42 +00:00
Owen Anderson	90bde997b3	Add testcase for PR2213. llvm-svn: 49517	2008-04-11 05:13:32 +00:00
Dan Gohman	99b7b3f03b	Teach InstCombine's ComputeMaskedBits to handle pointer expressions in addition to integer expressions. Rewrite GetOrEnforceKnownAlignment as a ComputeMaskedBits problem, moving all of its special alignment knowledge to ComputeMaskedBits as low-zero-bits knowledge. Also, teach ComputeMaskedBits a few basic things about Mul and PHI instructions. This improves ComputeMaskedBits-based simplifications in a few cases, but more noticeably it significantly improves instcombine's alignment detection for loads, stores, and memory intrinsics. llvm-svn: 49492	2008-04-10 18:43:06 +00:00
Chris Lattner	802134fc02	Generalize getUnaryFloatFunction to handle any FP unary function, automatically figuring out the suffix to use. implement pow(2,x) -> exp2(x). llvm-svn: 49437	2008-04-09 17:48:11 +00:00
Chris Lattner	091afc7714	remove capital letter from test name. llvm-svn: 49436	2008-04-09 17:46:36 +00:00
Owen Anderson	ef9a6fd5c2	Factor a bunch of functionality related to memcpy and memset transforms out of GVN and into its own pass. llvm-svn: 49419	2008-04-09 08:23:16 +00:00
Chris Lattner	b859fb49ed	many cleanups to the pow optimizer. Allow it to handle powf, add support for pow(x, 2.0) -> x*x. llvm-svn: 49411	2008-04-09 00:07:45 +00:00
Gabor Greif	00fcdeddd3	merge r48768 from branches/ggreif/parallelized-test llvm-svn: 49382	2008-04-08 15:22:41 +00:00
Chris Lattner	28e7b57605	add a testcase for forming memset from noncontiguous stores. llvm-svn: 48938	2008-03-29 04:51:35 +00:00
Evan Cheng	2b72c05992	Handle a special case xor undef, undef -> 0. Technically this should be transformed to undef. But this is such a common idiom (misuse) we are going to handle it. llvm-svn: 48791	2008-03-25 20:07:13 +00:00
Tanya Lattner	8bf97c2324	Byebye llvm-upgrade! llvm-svn: 48762	2008-03-25 04:26:08 +00:00
Devang Patel	a38f58aa5c	Add incoming value from header only if phi node has any use inside the loop. llvm-svn: 48738	2008-03-24 20:16:14 +00:00
Chris Lattner	c2c0c8303c	apparently tclsh doesn't lex like bash. Weird. llvm-svn: 48732	2008-03-24 17:41:57 +00:00
Chris Lattner	9ca6bb4f16	pass the option so this test tests the right thing. llvm-svn: 48731	2008-03-24 17:36:38 +00:00
Evan Cheng	c3cf9f872a	Transform (zext (or (icmp), (icmp))) to (or (zext (icmp)), (zext (icmp))) if at least one of the (zext icmp) can be transformed to eliminate an icmp. llvm-svn: 48715	2008-03-24 00:21:34 +00:00
Owen Anderson	e3605ac108	Use normal naming convention for test. llvm-svn: 48693	2008-03-22 21:08:33 +00:00
Chris Lattner	53ccb62712	implement an initial hack at a straight-line store -> memset optimization. This fires dozens of times across spec and multisource, but I don't know if it actually speeds stuff up. Hopefully the testers will show something nice :) llvm-svn: 48680	2008-03-22 05:37:16 +00:00
Chris Lattner	c44160ce6e	Teach masked value is zero about add and sub, and use MVIZ to simplify things like (X & 4) >> 1 == 2 --> (X & 4) == 4. since it is obvious that the shift doesn't remove any bits. llvm-svn: 48631	2008-03-21 05:19:58 +00:00
Tanya Lattner	ab7872c06c	Upgrade tests. llvm-svn: 48538	2008-03-19 07:28:33 +00:00
Tanya Lattner	f9d25185d5	Upgrade tests. llvm-svn: 48536	2008-03-19 05:39:35 +00:00
Tanya Lattner	0ea4c8d706	Upgrade tests to not use llvm-upgrade. llvm-svn: 48530	2008-03-19 04:36:04 +00:00
Tanya Lattner	1d526b90aa	Upgrade tests to not use llvm-upgrade. llvm-svn: 48529	2008-03-19 04:14:49 +00:00
Tanya Lattner	f73582b17c	Remove llvm-upgrade and update tests. llvm-svn: 48527	2008-03-19 03:47:13 +00:00
Tanya Lattner	4e59897d3d	Upgrade tests to not use llvm-upgrade. llvm-svn: 48484	2008-03-18 04:14:37 +00:00
Tanya Lattner	baa370b37a	Upgrade tests to not use llvm-upgrade. llvm-svn: 48483	2008-03-18 03:45:45 +00:00
Bill Wendling	68a930b33e	The inst combining of inttoptr into GEP with one index was using the bit size of the type instead of the byte size. This was causing troublesome mis-compilations. True to form, this took 2 days to find and is a one-line fix. :-P llvm-svn: 48354	2008-03-14 05:12:19 +00:00
Owen Anderson	7a69e3aef3	Fix a bug in GVN that Duncan noticed, where we potentially need to insert a pointer bitcast when performing return slot optimization. llvm-svn: 48343	2008-03-13 22:07:10 +00:00
Owen Anderson	6ff0b822b4	Improve the return slot optimization to be both more aggressive (not limited to sret parameters), and safer (when the passed pointer might be invalid). Thanks to Duncan and Chris for the idea behind this, and extra thanks to Duncan for helping me work out the trap-safety. llvm-svn: 48280	2008-03-12 07:37:44 +00:00
Devang Patel	fa8667a2dd	Fix attribute handling. llvm-svn: 48262	2008-03-12 00:07:03 +00:00
Devang Patel	7358165c99	Handle multiple ret values. llvm-svn: 48254	2008-03-11 22:24:29 +00:00
Dan Gohman	20af5a0fe7	Check to see if a two-entry PHI block can be simplified before trying to merge the block into its predecessors. This allows two-entry-phi-return.ll to be simplified into a single basic block. llvm-svn: 48252	2008-03-11 21:53:06 +00:00
Dan Gohman	8e9ae96a4a	Make this test more challenging to help it avoid being optimized away before it tests what it is intended to test. llvm-svn: 48251	2008-03-11 21:47:57 +00:00
Devang Patel	a7a2075ab8	Initial multiple return values support. llvm-svn: 48210	2008-03-11 05:46:42 +00:00
Dan Gohman	319234d67c	Upgrade this test. llvm-svn: 48207	2008-03-11 02:19:59 +00:00
Devang Patel	741f491d90	Simplify llvm-svn: 48163	2008-03-10 18:38:30 +00:00
Tanya Lattner	5f4b355f20	Remove llvm-upgrade and update tests. llvm-svn: 48137	2008-03-10 07:21:50 +00:00
Nick Lewycky	fb2c1a999a	Turn unwind_to into "unwinds to". llvm-svn: 48123	2008-03-10 02:20:00 +00:00
Tanya Lattner	aa6f5c9ddd	Remove llvm-upgrade and update tests. llvm-svn: 48103	2008-03-09 08:16:40 +00:00
Nick Lewycky	42445be0df	Firstly, having a BranchInst isn't exclusive with having an unwind_to. Secondly, we have to check whether the branch is actually pointing to the block with the unwind in it. We could have gotten here because of the unwind_to alone. llvm-svn: 48099	2008-03-09 07:50:37 +00:00
Nick Lewycky	f3d637fa14	A BB that unwind_to an "unwind" inst is the same as one that doesn't unwind_to at all. llvm-svn: 48096	2008-03-09 07:36:38 +00:00
Nick Lewycky	5ce9b521d7	Update the inliner and simplifycfg to handle unwind_to. llvm-svn: 48086	2008-03-09 05:10:13 +00:00
Nick Lewycky	4d0ed842b1	Prune the unwind_to labels on BBs that don't need them. Another step in the removal of invoke, PR1269. llvm-svn: 48084	2008-03-09 04:55:16 +00:00
Devang Patel	780b3ca64b	Update inliner to handle functions that return multiple values. llvm-svn: 48020	2008-03-07 20:06:16 +00:00
Devang Patel	47d774b2c8	Place for sret promotion tests. llvm-svn: 48016	2008-03-07 20:00:15 +00:00
Nick Lewycky	3e2d7c9f85	Commit the testcase too. llvm-svn: 47988	2008-03-06 06:50:03 +00:00
Nick Lewycky	d0b62a1552	Don't try to simplify urem and srem using arithmetic rules that don't work under modulo (overflow). Fixes PR1933. llvm-svn: 47987	2008-03-06 06:48:30 +00:00
Devang Patel	941ab37ea8	Use cast instead of dyn_cast. Update test to use multiple return value directly, instead of relying on -sretpromotion. llvm-svn: 47907	2008-03-04 21:45:28 +00:00
Devang Patel	841322b32a	Handle multiple return values. llvm-svn: 47904	2008-03-04 21:15:15 +00:00
Tanya Lattner	5640bd186a	Remove llvm-upgrade and update test cases. llvm-svn: 47793	2008-03-01 09:15:35 +00:00
Chris Lattner	c966cebe93	fix a bug Anders ran into where scalarrepl would crash when promoting a union containing a vector and an array whose elements were smaller than the vector elements. this means we need to compile the load of the array elements into an extract element plus a truncate. llvm-svn: 47752	2008-02-29 07:12:06 +00:00
Chris Lattner	c612571555	Folding or(fcmp,fcmp) only works if the operands of the fcmps are the same fp type. llvm-svn: 47750	2008-02-29 06:09:11 +00:00
Owen Anderson	e41c19c987	Add PR number to testcase. llvm-svn: 47640	2008-02-26 23:16:11 +00:00
Owen Anderson	d29ed0b122	Fix an issue where GVN had the sizes of the two memcpy's reversed, resulting in an invalid transformation. llvm-svn: 47639	2008-02-26 23:06:17 +00:00
Chris Lattner	a39cff3aaa	fix this test so that the fn name doesn't match the regex llvm-svn: 47608	2008-02-26 18:13:51 +00:00
Gabor Greif	3d9755f6ca	Really feed llvm-as with the testcase, do not let it read from stdin. This fixes the hangs seen on solaris10. llvm-svn: 47604	2008-02-26 13:37:13 +00:00
Owen Anderson	df1d2b02f9	Fix an issue where GVN was performing the return slot optimization when it was not safe. This is fixed by more aggressively checking that the return slot is not used elsewhere in the function. llvm-svn: 47544	2008-02-25 04:08:09 +00:00
Owen Anderson	40dca46ddb	Fix an issue where GVN would try to use an instruction before its definition when performing return slot optimization. llvm-svn: 47541	2008-02-25 00:40:41 +00:00
Zhou Sheng	aae582ba99	Testcase for Revision 47478. llvm-svn: 47531	2008-02-23 10:59:51 +00:00
Nick Lewycky	fefd0202c9	Correctly fold divide-by-constant, even when faced with overflow. llvm-svn: 47287	2008-02-18 22:48:05 +00:00
Chris Lattner	23fe6630e3	make this just a bit more strict. llvm-svn: 47274	2008-02-18 17:33:10 +00:00
Owen Anderson	3549553262	Add support to GVN for performing sret return slot optimization. This means that, if an sret function tail calls another sret function, it should pass its own sret parameter to the tail callee, allowing it to fill in the correct return value. llvm-gcc does not emit this by default. Instead, it allocates space in the caller for the sret of the tail call and then uses memcpy to copy the result into the caller's sret parameter. This optimization detects and optimizes that case. llvm-svn: 47265	2008-02-18 09:24:53 +00:00
Chris Lattner	024f8c8f09	optimize away stackrestore calls that have no intervening alloca or call. llvm-svn: 47258	2008-02-18 06:12:38 +00:00
Chris Lattner	c8ec470b52	upgrade this test. llvm-svn: 47257	2008-02-18 06:11:00 +00:00
Chris Lattner	cc22601bc3	Fold (-x + -y) -> -(x+y) which promotes better association, fixing the second half of PR2047 llvm-svn: 47244	2008-02-17 21:03:36 +00:00
Chris Lattner	a70d138457	Split up subtracts into add+negate if they have a reassociable use or operand that is also a subtract. This implements PR2047 and Transforms/Reassociate/subtest2.ll llvm-svn: 47241	2008-02-17 20:51:26 +00:00
Chris Lattner	2de8c2d41f	upgrade and simplify this test. llvm-svn: 47240	2008-02-17 20:48:43 +00:00
Duncan Sands	573b3f89e4	Remove any 'nest' parameter attributes if the function is not passed as an argument to a trampoline intrinsic. llvm-svn: 47220	2008-02-16 20:56:04 +00:00
Devang Patel	2e622e4c2b	If loop header is also loop exiting block then OrigPN is incoming value for B loop header. Fixes PR 2030. llvm-svn: 47141	2008-02-14 23:18:47 +00:00
Chris Lattner	70e294660a	Fix PR2029 llvm-svn: 47129	2008-02-14 19:18:13 +00:00
Nick Lewycky	9592bb0390	Testcase for PR2032. llvm-svn: 47113	2008-02-14 07:15:11 +00:00
Devang Patel	0ecb76d820	A loop latch phi node may have uses inside loop, not just in loop header. llvm-svn: 47093	2008-02-13 22:23:07 +00:00
Devang Patel	22c3caab6e	While moving exit condition, do not drop loop latch on the floor. llvm-svn: 47089	2008-02-13 22:06:36 +00:00
Devang Patel	c281d8031b	Keep track of exit value operand number when operands are swapped. llvm-svn: 47082	2008-02-13 19:48:48 +00:00
Eli Friedman	460648abde	Add a note pointing to PR1996. llvm-svn: 47055	2008-02-13 07:56:04 +00:00
Eli Friedman	03ec63f29d	Add test for PR1996. (This is my first time adding a test for a transform, so please review.) llvm-svn: 47050	2008-02-13 06:55:57 +00:00
Owen Anderson	00dba4f734	Re-apply the patch to improve the optimizations of memcpy's, with several bugs fixed. This now passes PPC bootstrap. llvm-svn: 47026	2008-02-12 21:15:18 +00:00
Devang Patel	26f75e2576	Fix PR 1995. llvm-svn: 46898	2008-02-08 22:49:13 +00:00
Bill Wendling	c676a0329c	Temporarily reverting: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20080128/057882.html This is causing a miscompilation on PPC G5 and just now seeing it on iMac x86-64. llvm-svn: 46822	2008-02-06 20:03:07 +00:00
Chris Lattner	682a7dc653	Fix a bug compiling PR1978 (perhaps not the only one though) which was incorrectly simplifying "x == (gep x, 1, i)" into false, even though i could be negative. As it turns out, all the code to handle this already existed, we just need to disable the incorrect optimization case and let the general case handle it. llvm-svn: 46739	2008-02-05 04:45:32 +00:00
Owen Anderson	1a78ae76e4	Make this test more aggressive, to cover recent improvements. llvm-svn: 46695	2008-02-04 04:55:24 +00:00
Owen Anderson	c4a7c41869	Allow GVN to hack on memcpy's, making them open to further optimization. llvm-svn: 46693	2008-02-04 02:59:58 +00:00
Nick Lewycky	56178bc6ad	Tag this test with the PR reference. llvm-svn: 46688	2008-02-03 16:35:19 +00:00
Nick Lewycky	3b59214320	There are some cases where icmp(add) can be folded into a new icmp. Handle them. llvm-svn: 46687	2008-02-03 16:33:09 +00:00
Duncan Sands	9aa789fda3	Don't drop function/call return attributes like 'nounwind'. llvm-svn: 46645	2008-02-01 20:37:16 +00:00
Owen Anderson	4e4b116750	Make DSE much more aggressive by performing DCE earlier. Update a testcase to reflect this increased aggressiveness. llvm-svn: 46542	2008-01-30 01:24:47 +00:00
Chris Lattner	b9e5b8fb9e	Fix a bug where scalarrepl would discard offset if type would match. In practice this can only happen on code with already undefined behavior, but this is still a good thing to handle correctly. llvm-svn: 46539	2008-01-30 00:39:15 +00:00
Chris Lattner	ade0abb498	Don't let globalopt hack on volatile loads or stores. llvm-svn: 46523	2008-01-29 19:01:37 +00:00
Chris Lattner	17819d971e	eliminate additions of 0.0 when they are obviously dead. This has to be careful to avoid turning -0.0 + 0.0 -> -0.0 which is incorrect. llvm-svn: 46499	2008-01-29 06:52:45 +00:00
Owen Anderson	95bf1d4d7b	Add a testcase for eliminating memcpy's at the end of functions. Forgot to commit this with my last commit. llvm-svn: 46497	2008-01-29 06:40:32 +00:00
Devang Patel	67fa0521b6	Filter loops that subtract induction variables. These loops are not yet handled. Fix PR 1912. llvm-svn: 46484	2008-01-29 02:20:41 +00:00
Chris Lattner	a116071547	this test is now compiled into the right thing. llvm-svn: 46454	2008-01-28 17:38:46 +00:00
Nick Lewycky	8ea81e8ba4	Handle some more combinations of extend and icmp. Fixes PR1940. llvm-svn: 46431	2008-01-28 03:48:02 +00:00
Chris Lattner	710b441174	Fix PR1932 by disabling an xform invalid for fdiv. llvm-svn: 46429	2008-01-28 00:58:18 +00:00
Chris Lattner	1b706dd680	Fix PR1938 by forcing the code that uses an undefined value to branch one way or the other. Rewriting the code itself prevents subsequent analysis passes from making contradictory conclusions about the code that could cause an infeasible path to be made feasible. llvm-svn: 46427	2008-01-28 00:32:30 +00:00
Nick Lewycky	efb16f7057	Be more careful modifying the use_list while also iterating through it. llvm-svn: 46417	2008-01-27 18:35:00 +00:00
Duncan Sands	053c9871cd	Revert r46393: readonly/readnone functions are no longer allowed to write through byval arguments. llvm-svn: 46416	2008-01-27 18:12:58 +00:00
Bill Wendling	8c491162d2	The CorrelatedExpressions pass is now no more. llvm-svn: 46409	2008-01-27 06:13:32 +00:00
Chris Lattner	fa1e7eef30	Fold fptrunc(add (fpextend x), (fpextend y)) -> add(x,y), as GCC does. llvm-svn: 46406	2008-01-27 05:29:54 +00:00
Duncan Sands	dc157a4f0a	Invert this test, because it is wrong if we allow readonly functions to use byval parameters as local storage (how much do we want this?). llvm-svn: 46399	2008-01-26 12:33:01 +00:00
Owen Anderson	6af19fd1e2	DeadStoreElimination can treat byval parameters as if there were alloca's for the purpose of removing end-of-function stores. llvm-svn: 46351	2008-01-25 10:10:33 +00:00
Nick Lewycky	78712e5b59	Multiply can be evaluated in a different type, so long as the target type has a smaller bitwidth. llvm-svn: 46244	2008-01-22 05:08:48 +00:00
Evan Cheng	9a93dc9565	Test case for varargs parameter attribute issue I just fixed. llvm-svn: 46127	2008-01-17 07:26:31 +00:00
Chris Lattner	5630c4f217	Fix arg promotion to propagate the correct attrs on the calls to promoted functions. This is important for varargs calls in particular. Thanks to duncan for providing a great testcase. llvm-svn: 46108	2008-01-17 01:17:03 +00:00
Devang Patel	b3696e4f14	Do not strip llvm.used values. llvm-svn: 46045	2008-01-16 03:33:05 +00:00
Chris Lattner	f3e1155c41	add a test to ensure that argpromote of one argument doesn't break the byval attr on some other argument. llvm-svn: 46025	2008-01-15 22:38:12 +00:00
Duncan Sands	b5ca2e9fcb	I noticed that the trampoline straightening transformation could drop attributes on varargs call arguments. Also, it could generate invalid IR if the transformed call already had the 'nest' attribute somewhere (this can never happen for code coming from llvm-gcc, but it's a theoretical possibility). Fix both problems. llvm-svn: 45973	2008-01-14 19:52:09 +00:00
Chris Lattner	26fe7ebc03	Fix the miscompilation of MiBench/consumer-lame that was exposed by Evan's byval work. This miscompilation is due to the program indexing an array out of range and us doing a transformation that broke this. llvm-svn: 45949	2008-01-14 02:09:12 +00:00
Chris Lattner	92bd785323	Turn a memcpy from a double* into a load/store of double instead of a load/store of i64. The latter prevents promotion/scalarrepl of the source and dest in many cases. This fixes the 300% performance regression of the byval stuff on stepanov_v1p2. llvm-svn: 45945	2008-01-14 00:28:35 +00:00
Chris Lattner	5bc253c8f2	Fix PR1907, a nasty miscompilation because instcombine didn't realize that ne & sgt was a signed comparison (it was only looking at whether the left compare was signed). llvm-svn: 45937	2008-01-13 20:59:02 +00:00
Duncan Sands	781f6549db	When turning a call to a bitcast function into a direct call, if this becomes a varargs call then deal correctly with any parameter attributes on the newly vararg call arguments. llvm-svn: 45931	2008-01-13 08:02:44 +00:00
Chris Lattner	4f6c81ac68	we don't have to make an explicit copy of a byval argument when inlining a function if we know that the function does not write to any memory. This implements test/Transforms/Inline/byval2.ll llvm-svn: 45912	2008-01-12 18:54:29 +00:00
Duncan Sands	5b721fc21d	When DAE drops the varargs part of a function, ensure any attributes on the vararg call arguments are also dropped. llvm-svn: 45892	2008-01-11 23:13:45 +00:00
Chris Lattner	b5bd924e83	Teach argpromote to ruthlessly hack small byval structs when it can get away with it, which exposes opportunities to eliminate the memory objects entirely. For example, we now compile byval.ll to: define internal void @f1(i32 %b.0, i64 %b.1) { entry: %tmp2 = add i32 %b.0, 1 ; <i32> [#uses=0] ret void } define i32 @main() nounwind { entry: call void @f1( i32 1, i64 2 ) ret i32 0 } This seems like it would trigger a lot for code that passes around small structs (e.g. SDOperand's or _Complex)... llvm-svn: 45886	2008-01-11 22:31:41 +00:00
Chris Lattner	908117bf69	When inlining a function with a byval argument, make an explicit copy of it in case the callee modifies the struct. llvm-svn: 45853	2008-01-11 06:09:30 +00:00
Chris Lattner	2940c5c56d	Implement PR1795, an instcombine hack for forming GEPs with integer pointer arithmetic. llvm-svn: 45745	2008-01-08 07:23:51 +00:00
Duncan Sands	404eb05247	The transform that tries to turn calls to bitcast functions into direct calls bails out unless caller and callee have essentially equivalent parameter attributes. This is illogical - the callee's attributes should be of no relevance here. Rework the logic, which incidentally fixes a crash when removed arguments have attributes. llvm-svn: 45658	2008-01-06 18:27:01 +00:00
Duncan Sands	55e5090fe8	When transforming a call to a bitcast function into a direct call with cast parameters and cast return value (if any), instcombine was prepared to cast any non-void return value into any other, whether castable or not. Add a new predicate for testing whether casting is valid, and check it both for the return value and (as a cleanup) for the parameters. llvm-svn: 45657	2008-01-06 10:12:28 +00:00
Chris Lattner	e666bc272d	remove a couple more unsafe xforms in the face of overflow. llvm-svn: 45613	2008-01-05 01:22:42 +00:00
Chris Lattner	bdd6acfb59	Fix PR1896 llvm-svn: 45568	2008-01-04 05:04:53 +00:00
Chris Lattner	f391883670	don't hoist FP additions into unconditional adds + selects. This could theoretically introduce a trap, but is also a performance issue. This speeds up ptrdist/ks by 8%. llvm-svn: 45533	2008-01-03 07:25:26 +00:00
Bill Wendling	6f8c9a8372	Update this testcase. The output needs to be disabled to pass. llvm-svn: 45478	2008-01-01 01:34:36 +00:00
Chris Lattner	e96658392d	dead calls to llvm.stacksave can be deleted, even though they have potential side-effects. llvm-svn: 45392	2007-12-29 00:59:12 +00:00
Chris Lattner	bc03f70a07	upgrade this test llvm-svn: 45391	2007-12-29 00:57:06 +00:00
Devang Patel	b57ff068cd	Test -simplifycfg only. llvm-svn: 45389	2007-12-28 22:59:48 +00:00
Owen Anderson	3de3f9981e	Add a testcase for my recent InstCombine fix, written by Nicholas. llvm-svn: 45386	2007-12-28 21:08:43 +00:00
Chris Lattner	74b2ab59fd	implement InstCombine/shift-trunc-shift.ll. This allows us to compile: #include <math.h> int t1(double d) { return signbit(d); } into: _t1: movd %xmm0, %rax shrq $63, %rax ret instead of: _t1: movd %xmm0, %rax shrq $32, %rax shrl $31, %eax ret on x86-64. llvm-svn: 45311	2007-12-22 09:07:47 +00:00
Devang Patel	7a2c66b11e	If succ has succ itself as one of the predecessors then do not merge current bb and succ even if bb's terminator is unconditional branch to succ. llvm-svn: 45305	2007-12-22 01:32:53 +00:00
Duncan Sands	6a7703ed63	Make DAE not wipe out attributes on calls, and not drop return attributes on the floor. In the case of a call to a varargs function where the varargs arguments are being removed, any call attributes on those arguments need to be dropped. I didn't do this because I plan to make it illegal to have such attributes (see next patch). With this change, compiling the gcc filter2 eh test at -O0 and then running opt -std-compile-opts on it results in a correctly working program (compiling at -O1 or higher results in the test failing due to a problem with how we output eh info into the IR). llvm-svn: 45285	2007-12-21 19:16:16 +00:00
Christopher Lamb	7d82bc46b8	Implement review feedback, including additional transforms (icmp slt (sub A B) 1) -> (icmp sle A B) and (icmp sgt (sub A B) -1) -> (icmp sge A B), and add testcase. llvm-svn: 45256	2007-12-20 07:21:11 +00:00
Duncan Sands	aa31b92508	When inlining through an 'nounwind' call, mark inlined calls 'nounwind'. It is important for correct C++ exception handling that nounwind markings do not get lost, so this transformation is actually needed for correctness. llvm-svn: 45218	2007-12-19 21:13:37 +00:00
Christopher Lamb	74dbad9216	Remove an orthogonal transformation of the selection condition from my most recent submission. llvm-svn: 45169	2007-12-18 20:30:28 +00:00
Christopher Lamb	30291f4a30	Fix typos. llvm-svn: 45159	2007-12-18 09:45:40 +00:00
Christopher Lamb	8b09a464b4	Fold certain additions through selects (and their compares) so as to eliminate subtractions. This code is often produced by the SMAX expansion in SCEV. This implements test/Transforms/InstCombine/2007-12-18-AddSelCmpSub.ll llvm-svn: 45158	2007-12-18 09:34:41 +00:00
Duncan Sands	b5a79d0eaa	Make invokes of inline asm legal. Teach codegen how to lower them (with no attempt made to be efficient, since they should only occur for unoptimized code). llvm-svn: 45108	2007-12-17 18:08:19 +00:00
Duncan Sands	8e4847ee95	Make instcombine promote inline asm calls to 'nounwind' calls. Remove special casing of inline asm from the inliner. There is a potential problem: the verifier rejects invokes of inline asm (not sure why). If an asm call is not marked "nounwind" in some .ll, and instcombine is not run, but the inliner is run, then an illegal module will be created. This is bad but I'm not sure what the best approach is. I'm tempted to remove the check in the verifier... llvm-svn: 45073	2007-12-16 15:51:49 +00:00
Wojciech Matyjewicz	309e5a723b	1. "Upgrage" comments. 2. Using zero-extended value of Scale and unsigned division is safe provided that Scale doesn't have the sign bit set. Previously these 2 instructions: %p = bitcast [100 x {i8,i8,i8}]* %x to i8* %q = getelementptr i8* %p, i32 -4 were combined into: %q = getelementptr [100 x { i8, i8, i8 }]* %x, i32 0, i32 1431655764, i32 0 which was incorrect. llvm-svn: 44936	2007-12-12 15:21:32 +00:00
Chris Lattner	6a6b3fb62b	Implement constant folding of vector<->vector bitcasts where the number of source/dest elements changes. This implements test/Transforms/InstCombine/bitcast-vector-fold.ll llvm-svn: 44855	2007-12-11 07:29:44 +00:00
Chris Lattner	d2265b45ae	Fix PR1850 by removing an unsafe transformation from VMCore/ConstantFold.cpp. Reimplement the xform in Analysis/ConstantFolding.cpp where we can use targetdata to validate that it is safe. While I'm in there, fix some const correctness issues and generalize the interface to the "operand folder". llvm-svn: 44817	2007-12-10 22:53:04 +00:00
Duncan Sands	9f76be61d1	Make PruneEH update the nounwind/noreturn attributes on functions as it calculates them. llvm-svn: 44802	2007-12-10 19:09:40 +00:00
Devang Patel	bd75910fa7	If ExitValue operand is also defined in Loop header then insert new ExitValue after this operand definition. This fixes PR1828. llvm-svn: 44539	2007-12-03 19:17:21 +00:00
Duncan Sands	5208d1ab4a	Add some convenience methods for querying attributes, and use them. llvm-svn: 44403	2007-11-28 17:07:01 +00:00
Duncan Sands	ad0ea2d430	Fix PR1146: parameter attributes are no longer part of the function type, instead they belong to functions and function calls. This is an updated and slightly corrected version of Reid Spencer's original patch. The only known problem is that auto-upgrading of bitcode files doesn't seem to work properly (see test/Bitcode/AutoUpgradeIntrinsics.ll). Hopefully a bitcode guru (who might that be? :) ) will fix it. llvm-svn: 44359	2007-11-27 13:23:08 +00:00
Nick Lewycky	cdb7e54ca7	Add new SCEV, SCEVSMax. This allows LLVM to analyze do-while loops. llvm-svn: 44319	2007-11-25 22:41:31 +00:00
Chris Lattner	c00e8adfe0	Implement PR1822 llvm-svn: 44318	2007-11-25 21:27:53 +00:00
Duncan Sands	185eeac0f8	Fix PR1816. If a bitcast of a function only exists because of a trivial difference in function attributes, allow calls to it to be converted to direct calls. Based on a patch by Török Edwin. While there, move the various lists of mutually incompatible parameters etc out of the verifier and into ParameterAttributes.h. llvm-svn: 44315	2007-11-25 14:10:56 +00:00
Chris Lattner	893fe3bbd1	Fix PR1816, by correcting the broken definition of APInt::countTrailingZeros. llvm-svn: 44296	2007-11-23 22:42:31 +00:00
Duncan Sands	8a3e9d2bee	Ding dong, the DoesntAccessMemoryFns and OnlyReadsMemoryFns tables are dead! We get more, and more accurate, information from gcc via the readnone and readonly function attributes. llvm-svn: 44288	2007-11-23 19:30:27 +00:00
Chris Lattner	a8fbde3f78	Fix a bug where we'd try to find a scev value for a bitcast operand, even though the bitcast operand did not have integer type. This fixes PR1814. llvm-svn: 44286	2007-11-23 08:46:22 +00:00
Chris Lattner	1985d96dc9	Fix PR1817. llvm-svn: 44284	2007-11-22 23:47:13 +00:00
Duncan Sands	a915b538d3	Turn invokes of nounwind functions into ordinary calls. llvm-svn: 44280	2007-11-22 22:24:59 +00:00
Duncan Sands	1c97d752df	Readonly/readnone functions are allowed to throw exceptions, so don't turn invokes of them into calls. llvm-svn: 44278	2007-11-22 21:40:06 +00:00
Chris Lattner	c53b18362a	Fix PR1800 by correcting mistaken logic. llvm-svn: 44188	2007-11-16 06:04:17 +00:00
Chris Lattner	a77e74edba	Implement PR1796 and Transforms/SimplifyCFG/noreturn-call.ll by inserting unreachable after no-return calls. llvm-svn: 44099	2007-11-14 06:19:25 +00:00
Chris Lattner	f150ace6cb	upgrade test llvm-svn: 44067	2007-11-13 21:42:48 +00:00
Chris Lattner	61ce4dff7a	Implement PR1786 by iterating between dead cycle elimination and simplifycfg in the rare cases when it is needed. llvm-svn: 44044	2007-11-13 07:32:38 +00:00
Chris Lattner	f9c0fd7488	Tighten up a check for folding away loads from (newly constant) globals. This fixes a crash on Transforms/GlobalOpt/2007-11-09-GEP-GEP-Crash.ll and rdar://5585488. llvm-svn: 43949	2007-11-09 17:33:02 +00:00
Andrew Lenharth	19ca5c7021	Better check llvm-svn: 43897	2007-11-08 18:45:15 +00:00
Andrew Lenharth	8cf11aa330	Fix PR1780 llvm-svn: 43893	2007-11-08 17:39:28 +00:00
Chris Lattner	d8515f8e80	Implement PR1777 by detecting dependent phis that all compute the same value. llvm-svn: 43777	2007-11-06 21:52:06 +00:00
Dan Gohman	4decbc5002	Fix an abort in instcombine when folding creates a vector rem instruction. llvm-svn: 43743	2007-11-05 23:16:33 +00:00
Devang Patel	b98d2050a2	If a value is incoming from outside the loop then the value does not need remapping and the value is never tracked through LastValueMap. llvm-svn: 43728	2007-11-05 19:32:30 +00:00
Duncan Sands	399d97987b	Change uses of getTypeSize to getABITypeSize, getTypeStoreSize or getTypeSizeInBits as appropriate in ScalarReplAggregates. The right change to make was not always obvious, so it would be good to have an sroa guru review this. While there I noticed some bugs, and fixed them: (1) arrays of x86 long double have holes due to alignment padding, but this wasn't being spotted by HasStructPadding (renamed to HasPadding). The same goes for arrays of oddly sized ints. Vectors also suffer from this, in fact the problem for vectors is much worse because basic vector assumptions seem to be broken by vectors of type with alignment padding. I didn't try to fix any of these vector problems. (2) The code for extracting smaller integers from larger ones (in the "int union" case) was wrong on big-endian machines for integers with size not a multiple of 8, like i1. Probably this is impossible to hit via llvm-gcc, but I fixed it anyway while there and added a testcase. I also got rid of some trailing whitespace and changed a function name which had an obvious typo in it. llvm-svn: 43672	2007-11-04 14:43:57 +00:00
Owen Anderson	2ed651ace7	Fix test/Transforms/DeadStoreElimination/PartialStore.ll, which had been silently failing because of an incorrect run line for some time. llvm-svn: 43605	2007-11-01 05:29:16 +00:00
Chris Lattner	6ab19ed78d	Fix InstCombine/2007-10-31-StringCrash.ll by removing an obvious (in hindsight) infinite recursion. Simplify the code. llvm-svn: 43597	2007-11-01 02:30:35 +00:00
Chris Lattner	74709473ed	Fix InstCombine/2007-10-31-RangeCrash.ll llvm-svn: 43596	2007-11-01 02:18:41 +00:00
Dan Gohman	9f39660c20	Add support for folding binary operators with vector zero operands. llvm-svn: 43510	2007-10-30 19:00:49 +00:00
Chris Lattner	00860d7574	update testcase llvm-svn: 43452	2007-10-29 17:06:35 +00:00
Chris Lattner	c541c3ee15	Model stacksave and stackrestore as both writing memory, since we don't model their dependences on allocas correctly. This fixes PR1745. llvm-svn: 43442	2007-10-29 05:47:52 +00:00
Chris Lattner	9a641510bd	Fix PR1749 and InstCombine/2007-10-28-EmptyField.ll by handling zero-length fields better. llvm-svn: 43427	2007-10-29 02:40:02 +00:00
Chris Lattner	4a15e04aee	Fix PR1752 and LoopSimplify/2007-10-28-InvokeCrash.ll: terminators can have uses too. Wouldn't it be nice if invoke didn't exist? :) llvm-svn: 43426	2007-10-29 02:30:37 +00:00
Chris Lattner	c62877e9da	Implement a couple of foldings for ordered and unordered comparisons, implementing cases related to PR1738. llvm-svn: 43289	2007-10-24 05:38:08 +00:00
Bill Wendling	ac5c93040f	Don't branch fold inline asm statements. llvm-svn: 43191	2007-10-19 21:09:55 +00:00
Devang Patel	c0ced49a14	This test now passes. llvm-svn: 43183	2007-10-19 17:11:01 +00:00

609 Commits