llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	4efe20a103	Fold: (X + (X << C2)) --> X * ((1 << C2) + 1) ((X << C2) + X) --> X * ((1 << C2) + 1) This means that we now canonicalize "Y+Y+Y" into: %tmp.2 = mul long %Y, 3 ; <long> [#uses=1] instead of: %tmp.10 = shl long %Y, ubyte 1 ; <long> [#uses=1] %tmp.6 = add long %Y, %tmp.10 ; <long> [#uses=1] llvm-svn: 17701	2004-11-13 19:31:40 +00:00
Chris Lattner	33eb909939	Fix some warnings on VC++ llvm-svn: 17481	2004-11-05 04:45:43 +00:00
Chris Lattner	96f6616479	* Rearrange code slightly * Disable broken transforms for simplifying (setcc (cast X to larger), CI) where CC is not != or == llvm-svn: 17422	2004-11-02 03:50:32 +00:00
Chris Lattner	70c2039b39	Hrm, this code was severely botched. As it turns out, this patch: http://mail.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20041018/019708.html exposed ANOTHER latent bug in this xform, which caused Prolangs-C/bison to fill the zion nightly tester disk up and make the tester barf. This is obviously not a good thing, so lets fix this bug shall we? :) llvm-svn: 17276	2004-10-27 05:57:15 +00:00
Chris Lattner	5c3c21e10a	Fix a bug Nate noticed, where we miscompiled a simple testcase llvm-svn: 17157	2004-10-22 04:53:16 +00:00
Chris Lattner	8ba9ec9bbb	Turn things with obviously undefined semantics into 'store -> null' llvm-svn: 17110	2004-10-18 02:59:09 +00:00
Chris Lattner	3b92f17165	My friend the invoke instruction does not dominate all basic blocks if it occurs in the entry node of a function llvm-svn: 17109	2004-10-18 01:48:31 +00:00
Chris Lattner	107c15c33d	Remove printout, realize that instructions in the entry block dominate all other blocks. llvm-svn: 17099	2004-10-17 21:31:34 +00:00
Chris Lattner	e29d634a94	hasConstantValue will soon return instructions that don't dominate the PHI node, so prepare for this. llvm-svn: 17095	2004-10-17 21:22:38 +00:00
Chris Lattner	67f0545daf	Fix a type violation llvm-svn: 17069	2004-10-16 23:28:04 +00:00
Chris Lattner	684c5c6587	Kill the bogon that slipped into my buffer before I committed. llvm-svn: 17067	2004-10-16 19:46:33 +00:00
Chris Lattner	6580e09fef	Implement InstCombine/getelementptr.ll:test9, which is the source of many ugly and giant constnat exprs in some programs. llvm-svn: 17066	2004-10-16 19:44:59 +00:00
Chris Lattner	81a7a23494	Optimize instructions involving undef values. For example X+undef == undef. llvm-svn: 17047	2004-10-16 18:11:37 +00:00
Chris Lattner	00648e1f86	Transform memmove -> memcpy when the source is obviously constant memory. llvm-svn: 16932	2004-10-12 04:52:52 +00:00
Chris Lattner	a92af96c56	Reenable the transform, turning X/-10 < 1 into X > -10 llvm-svn: 16918	2004-10-11 19:40:04 +00:00
Chris Lattner	4ad08352b4	Implement sub.ll:test17, -X/C -> X/-C llvm-svn: 16863	2004-10-09 02:50:40 +00:00
Chris Lattner	0b41e861b6	Temporarily disable a buggy transformation until it can be fixed. This fixes 254.gap. llvm-svn: 16853	2004-10-08 19:15:44 +00:00
Chris Lattner	bff91d9a2e	Instcombine (X & FF00) + xx00 -> (X+xx00) & FF00, implementing and.ll:test27 This comes up when doing adds to bitfield elements. llvm-svn: 16836	2004-10-08 05:07:56 +00:00
Chris Lattner	44bd392cbf	Little patch to turn (shl (add X, 123), 4) -> (add (shl X, 4), 123 << 4) This triggers in cases of bitfield additions, opening opportunities for future improvements. llvm-svn: 16834	2004-10-08 03:46:20 +00:00
Chris Lattner	0aee4b7947	Instcombine: -(X sdiv C) -> (X sdiv -C), tested by sub.ll:test16 llvm-svn: 16769	2004-10-06 15:08:25 +00:00
Chris Lattner	abae776b18	Hrm, debugging printouts do not need to be in here llvm-svn: 16598	2004-09-29 21:21:14 +00:00
Chris Lattner	6862fbd2cf	* Pull range optimization code out into new InsertRangeTest function. * SubOne/AddOne functions always return ConstantInt, declare them as such * Pull code for handling setcc X, cst, where cst is at the end of the range, or cc is LE or GE up earlier in visitSetCondInst. This reduces #iterations in some cases. * Fold: (div X, C1) op C2 -> range check, implementing div.ll:test6 - test9. llvm-svn: 16588	2004-09-29 17:40:11 +00:00
Chris Lattner	6a4adcda4c	Fold binary expressions and casts into PHI nodes that have all constant inputs. This takes something like this: %A = phi int [ 3, %cond_false.0 ], [ 2, %endif.0.i ], [ 2, %endif.1.i ] %B = div int %tmp.243, 4 and turns it into: %A = phi int [ 3/4, %cond_false.0 ], [ 2/4, %endif.0.i ], [ 2/4, %endif.1.i ] which is later simplified (in this case) into %A = 0. This triggers thousands of times in spec, for example, 269 times in 176.gcc. This is tested by InstCombine/add.ll:test23 and set.ll:test18. llvm-svn: 16582	2004-09-29 05:07:12 +00:00
Chris Lattner	c949128b2f	Hrm, really, all tests passed without this, but it is scary to think how... llvm-svn: 16568	2004-09-29 03:16:24 +00:00
Chris Lattner	be7a69ebd8	Remove debugging printout Instcombine (setcc (truncate X), C1). This occurs THOUSANDS of times in many benchmarks. Particularlly common seem to be things like (seteq (cast bool X to int), int 0) This turns it into (seteq bool %X, false), which then becomes (not %X). llvm-svn: 16567	2004-09-29 03:09:18 +00:00
Chris Lattner	dcf756ec22	Fold (X setcc C1) \| (X setcc C2) This implements or.ll:test1[89] llvm-svn: 16561	2004-09-28 22:33:08 +00:00
Chris Lattner	623826c888	Fold (and (setcc X, C1), (setcc X, C2)) This is important for several reasons: 1. Benchmarks have lots of code that looks like this (perlbmk in particular): %tmp.2.i = setne int %tmp.0.i, 128 ; <bool> [#uses=1] %tmp.6343 = seteq int %tmp.0.i, 1 ; <bool> [#uses=1] %tmp.63 = and bool %tmp.2.i, %tmp.6343 ; <bool> [#uses=1] we now fold away the setne, a clear improvement. 2. In the more important cases, such as (X >= 10) & (X < 20), we now produce smaller code: (X-10) < 10. 3. Perhaps the nicest effect of this patch is that it really helps out the code generators. In particular, for a 'range test' like the above, instead of generating this on X86 (the difference on PPC is even more pronounced): cmp %EAX, 50 setge %CL cmp %EAX, 100 setl %AL and %CL, %AL cmp %CL, 0 we now generate this: add %EAX, -50 cmp %EAX, 50 Furthermore, this causes setcc's to be folded into branches more often. These combinations trigger dozens of times in the spec benchmarks, particularly in 176.gcc, 186.crafty, 253.perlbmk, 254.gap, & 099.go. llvm-svn: 16559	2004-09-28 21:48:02 +00:00
Chris Lattner	272d5ca9e0	Implement X / C1 / C2 folding Implement (setcc (shl X, C1), C2) folding. The second one occurs several dozen times in spec. The first was added just in case. :) These are tested by shift.ll:test2[12], and div.ll:test5 llvm-svn: 16549	2004-09-28 18:22:15 +00:00
Chris Lattner	6afc02f816	shl is always zero extending, so always use a zero extending shift right. This latent bug was exposed by recent changes, and is tested as: llvm/test/Regression/Transforms/InstCombine/2004-09-28-BadShiftAndSetCC.llx llvm-svn: 16546	2004-09-28 17:54:07 +00:00
Chris Lattner	bfff18a869	Fix two bugs: one where a condition was mistakenly swapped, and another where we folded (X & 254) -> X < 1 instead of X < 2. These problems were latent problems exposed by the latest patch. llvm-svn: 16528	2004-09-27 19:29:18 +00:00
Chris Lattner	1023b8726e	Fold: (setcc (shr X, ShAmt), CI), where 'cc' is eq or ne. This xform triggers often, for example: 6x in povray, 1x in gzip, 279x in gcc, 1x in crafty, 8x in eon, 11x in perlbmk, 362x in gap, 4x in vortex, 14 in m88ksim, 211x in 126.gcc, 1x in compress, 11x in ijpeg, and 4x in 147.vortex. llvm-svn: 16521	2004-09-27 16:18:50 +00:00
Chris Lattner	7e794273f5	Implement shift-and combinations, implementing InstCombine/and.ll:test19-21 These combinations trigger 4 times in povray, 7x in gcc, 4x in gap, and 2x in bzip2. llvm-svn: 16508	2004-09-24 15:21:34 +00:00
Chris Lattner	e1b4d2a470	Move LHSI->hasOneUse() into the arms of the conditional, reindenting code. No functionality changes here. llvm-svn: 16505	2004-09-23 21:52:49 +00:00
Chris Lattner	8fc5af4da9	Implement Transforms/InstCombine/and.ll:test18, a case that occurs 20 times in perlbmk llvm-svn: 16504	2004-09-23 21:46:38 +00:00
Chris Lattner	bdcf41a8a2	Implement select.ll:test16: fold load (select C, X, null) -> load X llvm-svn: 16499	2004-09-23 15:46:00 +00:00
Chris Lattner	b121ae1cec	Do not fold (X + C1 != C2) if there are other users of the add. Doing this transformation used to take a loop like this: int Array[1000]; void test(int X) { int i; for (i = 0; i < 1000; ++i) Array[i] += X; } Compiled to LLVM is: no_exit: ; preds = %entry, %no_exit %indvar = phi uint [ 0, %entry ], [ %indvar.next, %no_exit ] ; <uint> [#uses=2] %tmp.4 = getelementptr [1000 x int]* %Array, int 0, uint %indvar ; <int> [#uses=2] %tmp.7 = load int %tmp.4 ; <int> [#uses=1] %tmp.9 = add int %tmp.7, %X ; <int> [#uses=1] store int %tmp.9, int* %tmp.4 * %indvar.next = add uint %indvar, 1 ; <uint> [#uses=2] * %exitcond = seteq uint %indvar.next, 1000 ; <bool> [#uses=1] br bool %exitcond, label %return, label %no_exit and turn it into a loop like this: no_exit: ; preds = %entry, %no_exit %indvar = phi uint [ 0, %entry ], [ %indvar.next, %no_exit ] ; <uint> [#uses=3] %tmp.4 = getelementptr [1000 x int]* %Array, int 0, uint %indvar ; <int> [#uses=2] %tmp.7 = load int %tmp.4 ; <int> [#uses=1] %tmp.9 = add int %tmp.7, %X ; <int> [#uses=1] store int %tmp.9, int* %tmp.4 * %indvar.next = add uint %indvar, 1 ; <uint> [#uses=1] * %exitcond = seteq uint %indvar, 999 ; <bool> [#uses=1] br bool %exitcond, label %return, label %no_exit Note that indvar.next and indvar can no longer be coallesced. In machine code terms, this patch changes this code: .LBBtest_1: # no_exit mov %EDX, OFFSET Array mov %ESI, %EAX add %ESI, DWORD PTR [%EDX + 4%ECX] mov %EDX, OFFSET Array mov DWORD PTR [%EDX + 4%ECX], %ESI mov %EDX, %ECX inc %EDX cmp %ECX, 999 mov %ECX, %EDX jne .LBBtest_1 # no_exit into this: .LBBtest_1: # no_exit mov %EDX, OFFSET Array mov %ESI, %EAX add %ESI, DWORD PTR [%EDX + 4%ECX] mov %EDX, OFFSET Array mov DWORD PTR [%EDX + 4%ECX], %ESI inc %ECX cmp %ECX, 1000 jne .LBBtest_1 # no_exit We need better instruction selection to get this: .LBBtest_1: # no_exit add DWORD PTR [Array + 4*%ECX], EAX inc %ECX cmp %ECX, 1000 jne .LBBtest_1 # no_exit ... but at least there is less register juggling llvm-svn: 16473	2004-09-21 21:35:23 +00:00
Chris Lattner	42618551d5	Fix potential miscompilations: InstCombine/2004-09-20-BadLoadCombine*.llx llvm-svn: 16447	2004-09-20 10:15:10 +00:00
Alkis Evlogimenos	d59cebf87a	Fix loop condition so that we don't decrement off the beginning of the list. llvm-svn: 16440	2004-09-20 06:42:58 +00:00
Chris Lattner	e6f13093e6	Make isSafeToLoadUnconditionally a bit smarter, implementing PR362 and Regression/Transforms/InstCombine/CPP_min_max.llx llvm-svn: 16409	2004-09-19 19:18:10 +00:00
Chris Lattner	f62ea8ef4b	Make instruction combining a bit more aggressive in the face of volatile loads, and implement two new transforms: InstCombine/load.ll:test[56]. llvm-svn: 16404	2004-09-19 18:43:46 +00:00
Reid Spencer	7c16caa336	Changes For Bug 352 Move include/Config and include/Support into include/llvm/Config, include/llvm/ADT and include/llvm/Support. From here on out, all LLVM public header files must be under include/llvm/. llvm-svn: 16137	2004-09-01 22:55:40 +00:00
Chris Lattner	4456da6a4c	Fix InstCombine/2004-08-10-BoolSetCC.ll, a bug that is miscompiling 176.gcc. Note that this is apparently not the only bug miscompiling gcc though. :( llvm-svn: 15639	2004-08-11 00:50:51 +00:00
Chris Lattner	8e7260652b	Fix InstCombine/2004-08-09-RemInfLoop.llx This should go into the 1.3 branch llvm-svn: 15593	2004-08-09 21:05:48 +00:00
Alkis Evlogimenos	832437255d	Stop using getValues(). llvm-svn: 15487	2004-08-04 08:44:43 +00:00
Chris Lattner	7aa2d4747a	Fix a regression in InstCombine/xor.ll llvm-svn: 15410	2004-08-01 19:42:59 +00:00
Misha Brukman	9c003d8f65	Fix De Morgan's name. llvm-svn: 15343	2004-07-30 12:50:08 +00:00
Chris Lattner	d4252a7c64	Start using the PatternMatcher a bit. llvm-svn: 15342	2004-07-30 07:50:03 +00:00
Robert Bocchino	7b5b86cd0f	This change fixed a bug in the function visitMul. The prior version assumed that a constant on the RHS of a multiplication was either an IntConstant or an FPConstant. It checked for an IntConstant and then, if it did not find one, did a hard cast to an FPConstant. That code would crash if the RHS were a ConstantExpr that was neither an IntConstant nor an FPConstant. This version replaces the hard cast with a dyn_cast. It performs the same way for IntConstants and FPConstants but does nothing, instead of crashing, for constant expressions. The regression test for this change is 2004-07-27-ConstantExprMul.ll. llvm-svn: 15291	2004-07-27 21:02:21 +00:00
Brian Gaeke	38b79e8fbc	Make the create...() functions for some of these passes return a FunctionPass *. llvm-svn: 15276	2004-07-27 17:43:21 +00:00
Chris Lattner	d8f5e2ccac	* Further cleanup. * Test for whether bits are shifted out during the optzn. If so, the fold is illegal, though it can be handled explicitly for setne/seteq This fixes the miscompilation of 254.gap last night, which was a latent bug exposed by other optimizer improvements. llvm-svn: 15085	2004-07-21 20:14:10 +00:00
Chris Lattner	1638de4499	Make cast-cast code a bit more defensive "simplify" a bit of code for comparison/and folding llvm-svn: 15082	2004-07-21 19:50:44 +00:00
Chris Lattner	4fbad968f8	Remove special casing of pointers and treat them generically as integers of the appopriate size. This gives us the ability to eliminate int -> ptr -> int llvm-svn: 15063	2004-07-21 04:27:24 +00:00
Chris Lattner	11ffd59e37	Implement Transforms/InstCombine/IntPtrCast.ll llvm-svn: 15029	2004-07-20 05:21:00 +00:00
Chris Lattner	44d0b9502a	Implement InstCombine/GEPIdxCanon.ll llvm-svn: 15024	2004-07-20 01:48:15 +00:00
Chris Lattner	4e2dbc6b4a	Rewrite cast->cast elimination code completely based on the information we actually care about. Someday when the cast instruction is gone, we can do better here, but this will do for now. This implements instcombine/cast.ll:test17/18 as well. llvm-svn: 15018	2004-07-20 00:59:32 +00:00
Chris Lattner	f3edc49ae2	Minor cleanup, no functionality change llvm-svn: 14972	2004-07-18 18:59:44 +00:00
Reid Spencer	539429d9b5	Delete a no-op loop. llvm-svn: 14965	2004-07-18 08:32:43 +00:00
Reid Spencer	874368790f	bug 122: - Replace ConstantPointerRef usage with GlobalValue usage - Minimize redundant isa<GlobalValue> usage - Correct isa<Constant> for GlobalValue subclass llvm-svn: 14950	2004-07-18 00:38:32 +00:00
Chris Lattner	35e24774eb	Factor some code to handle "load (constantexpr cast foo)" just like "load (cast foo)". This allows us to compile C++ code like this: class Bclass { public: virtual int operator()() { return 666; } }; class Dclass: public Bclass { public: virtual int operator()() { return 667; } } ; int main(int argc, char argv) { Dclass x; return x(); } Into this: int %main(int %argc, sbyte %argv) { entry: call void %__main( ) ret int 667 } Instead of this: int %main(int %argc, sbyte** %argv) { entry: %x = alloca "struct.std::bad_typeid" ; <"struct.std::bad_typeid"> [#uses=3] call void %__main( ) %tmp.1.i.i = getelementptr "struct.std::bad_typeid" %x, uint 0, uint 0, uint 0 ; <int (...)*> [#uses=1] store int (...) getelementptr ([3 x int (...)] %vtable for Bclass, int 0, long 2), int (...)*** %tmp.1.i.i %tmp.3.i = getelementptr "struct.std::bad_typeid"* %x, int 0, uint 0, uint 0 ; <int (...)*> [#uses=1] store int (...) getelementptr ([3 x int (...)] %vtable for Dclass, int 0, long 2), int (...)*** %tmp.3.i %tmp.5 = load int ("struct.std::bad_typeid")* cast (int (...)** getelementptr ([3 x int (...)] %vtable for Dclass, int 0, long 2) to int ("struct.std::bad_typeid")) ; <int ("struct.std::bad_typeid")> [#uses=1] %tmp.6 = call int %tmp.5( "struct.std::bad_typeid" %x ) ; <int> [#uses=1] ret int %tmp.6 ret int 0 } In order words, we now resolve the virtual function call. llvm-svn: 14783	2004-07-13 01:49:43 +00:00
Chris Lattner	9eb9ccd9f6	Check to make sure types are sized before calling getTypeSize on them. llvm-svn: 14649	2004-07-06 19:28:42 +00:00
Chris Lattner	23b47b6af9	Implement rem.ll:test3 llvm-svn: 14640	2004-07-06 07:38:18 +00:00
Chris Lattner	98c6bdf251	Fix a minor bug where we would go into infinite loops on some constants llvm-svn: 14638	2004-07-06 07:11:42 +00:00
Chris Lattner	7fd5f0745a	Implement InstCombine/sub.ll:test15: X % -Y === X % Y Also, remove X % -1 = 0, because it's not true for unsigneds, and the signed case is superceeded by this new handling. llvm-svn: 14637	2004-07-06 07:01:22 +00:00
Chris Lattner	4c9c20af28	Implement add.ll:test22, a common case in MSIL files llvm-svn: 14587	2004-07-03 00:26:11 +00:00
Chris Lattner	49df6cefa5	Do not call getTypeSize on a type that has no size llvm-svn: 14584	2004-07-02 22:55:47 +00:00
Chris Lattner	6e07936ed2	Implement InstCombine/add.ll:test21 llvm-svn: 14443	2004-06-27 22:51:36 +00:00
Chris Lattner	023a483c76	Implement Transforms/InstCombine/and.ll:test17, a common case that occurs due to unordered comparison macros in math.h llvm-svn: 14221	2004-06-18 06:07:51 +00:00
Chris Lattner	97bfcea262	Rename Type::PrimitiveID to TypeId and ::getPrimitiveID() to ::getTypeID() Delete two functions that are now methods on the Type class llvm-svn: 14200	2004-06-17 18:16:02 +00:00
Chris Lattner	ee59d4bf04	Fix a bug in my checkin from last night that caused miscompilations of 186.crafty, fhourstones and 132.ijpeg. Bugpoint makes really nasty miscompilations embarassingly easy to find. It narrowed it down to the instcombiner and this testcase (from fhourstones): bool %l7153_l4706_htstat_loopentry_2E_4_no_exit_2E_4(int* %i, [32 x int]* %works, int* %tmp.98.out) { newFuncRoot: %tmp.96 = load int* %i ; <int> [#uses=1] %tmp.97 = getelementptr [32 x int]* %works, long 0, int %tmp.96 ; <int> [#uses=1] %tmp.98 = load int %tmp.97 ; <int> [#uses=2] %tmp.99 = load int* %i ; <int> [#uses=1] %tmp.100 = and int %tmp.99, 7 ; <int> [#uses=1] %tmp.101 = seteq int %tmp.100, 7 ; <bool> [#uses=2] %tmp.102 = cast bool %tmp.101 to int ; <int> [#uses=0] br bool %tmp.101, label %codeRepl4.exitStub, label %codeRepl3.exitStub codeRepl4.exitStub: ; preds = %newFuncRoot store int %tmp.98, int* %tmp.98.out ret bool true codeRepl3.exitStub: ; preds = %newFuncRoot store int %tmp.98, int* %tmp.98.out ret bool false } ... which only has one combination performed on it: $ llvm-as < t.ll \| opt -instcombine -debug \| llvm-dis IC: Old = %tmp.101 = seteq int %tmp.100, 7 ; <bool> [#uses=1] New = setne int %tmp.100, 0 ; <bool>:<badref> [#uses=0] IC: MOD = br bool %tmp.101, label %codeRepl3.exitStub, label %codeRepl4.exitStub IC: MOD = %tmp.97 = getelementptr [32 x int]* %works, uint 0, int %tmp.96 ; <int*> [#uses=1] It doesn't get much better than this. :) llvm-svn: 14109	2004-06-10 02:33:20 +00:00
Chris Lattner	c8e7e298c1	More minor cleanups llvm-svn: 14108	2004-06-10 02:12:35 +00:00
Chris Lattner	df20a4d589	Eliminate many occurrances of Instruction:: llvm-svn: 14107	2004-06-10 02:07:29 +00:00
Chris Lattner	35167c3087	Implement InstCombine/select.ll:test15* llvm-svn: 14095	2004-06-09 07:59:58 +00:00
Chris Lattner	396dbfe327	Be more careful about the order we put stuff onto the worklist. This allow us to collapse this: bool %le(int %A, int %B) { %c1 = setgt int %A, %B %tmp = select bool %c1, int 1, int 0 %c2 = setlt int %A, %B %result = select bool %c2, int -1, int %tmp %c3 = setle int %result, 0 ret bool %c3 } into: bool %le(int %A, int %B) { %c3 = setle int %A, %B ; <bool> [#uses=1] ret bool %c3 } which is handy, because the Java FE makes these sequences all over the place. This is tested as: test/Regression/Transforms/InstCombine/JavaCompare.ll llvm-svn: 14086	2004-06-09 05:08:07 +00:00
Chris Lattner	2dd017402b	Implement select.ll:test14* llvm-svn: 14083	2004-06-09 04:24:29 +00:00
Chris Lattner	ed79d8af53	Fix InstCombine/load.ll & PR347. This code hadn't been updated after the "structs with more than 256 elements" related changes to the GEP instruction. Also it was not handling the ConstantAggregateZero class. Now it does! llvm-svn: 13834	2004-05-27 17:30:27 +00:00
Chris Lattner	e1e10e1883	Implement InstCombine:shift.ll:test16, which turns (X >> C1) & C2 != C3 into (X & (C2 << C1)) != (C3 << C1), where the shift may be either left or right and the compare may be any one. This triggers 1546 times in 176.gcc alone, as it is a common pattern that occurs for bitfield accesses. llvm-svn: 13740	2004-05-25 06:32:08 +00:00
Chris Lattner	03841659a4	Implement instcombine/cast.ll:test16: Canonicalize cast X to bool into a setne instruction llvm-svn: 13736	2004-05-25 04:29:21 +00:00
Chris Lattner	0026512bac	This was not meant to be committed llvm-svn: 13565	2004-05-13 20:56:34 +00:00
Chris Lattner	c12c945cc4	Fix a nasty bug that caused us to unroll EXTREMELY large loops due to overflow in the size calculation. This is not something you want to see: Loop Unroll: F[main] Loop %no_exit Loop Size = 2 Trip Count = 2147483648 - UNROLLING! The problem was that 2*2147483648 == 0. Now we get: Loop Unroll: F[main] Loop %no_exit Loop Size = 2 Trip Count = 2147483648 - TOO LARGE: 4294967296>100 Thanks to some anonymous person playing with the demo page that repeatedly caused zion to go into swapping land. That's one way to ensure you'll get a quick bugfix. :) Testcase here: Transforms/LoopUnroll/2004-05-13-DontUnrollTooMuch.ll llvm-svn: 13564	2004-05-13 20:43:31 +00:00
Chris Lattner	8ec5f88c79	Fix stupid bug in my checkin yesterday llvm-svn: 13429	2004-05-08 22:41:42 +00:00
Chris Lattner	5f667a6f58	Implement folding of GEP's like: %tmp.0 = getelementptr [50 x sbyte]* %ar, uint 0, int 5 ; <sbyte> [#uses=2] %tmp.7 = getelementptr sbyte %tmp.0, int 8 ; <sbyte*> [#uses=1] together. This patch actually allows us to simplify and generalize the code. llvm-svn: 13415	2004-05-07 22:09:22 +00:00
Chris Lattner	d9e5813821	Fix PR336: The instcombine pass asserts when visiting load instruction llvm-svn: 13400	2004-05-07 15:35:56 +00:00
Chris Lattner	dd1a86d858	Minor efficiency tweak, suggested by Patrick Meredith llvm-svn: 13341	2004-05-04 15:19:33 +00:00
Chris Lattner	63d75af920	Make sure to reprocess instructions used by deleted instructions to avoid missing opportunities for combination. llvm-svn: 13309	2004-05-01 23:27:23 +00:00
Chris Lattner	b643a9e675	Make sure the instruction combiner doesn't lose track of instructions when replacing them, missing the opportunity to do simplifications llvm-svn: 13308	2004-05-01 23:19:52 +00:00
Chris Lattner	652064e3b8	Fix a major pessimization in the instcombiner. If an allocation instruction is only used by a cast, and the casted type is the same size as the original allocation, it would eliminate the cast by folding it into the allocation. Unfortunately, it was placing the new allocation instruction right before the cast, which could pull (for example) alloca instructions into the body of a function. This turns statically allocatable allocas into expensive dynamically allocated allocas, which is bad bad bad. This fixes the problem by placing the new allocation instruction at the same place the old one was, duh. :) llvm-svn: 13289	2004-04-30 04:37:52 +00:00
Chris Lattner	2d3a7a6ff0	Changes to fix up the inst_iterator to pass to boost iterator checks. This patch was graciously contributed by Vladimir Prus. llvm-svn: 13185	2004-04-27 15:13:33 +00:00
Chris Lattner	e20c334e65	Instcombine X/-1 --> 0-X llvm-svn: 13172	2004-04-26 14:01:59 +00:00
Chris Lattner	1e9ac1a45e	Fix a HUGE pessimization on X86. The indvars pass was taking this (familiar) function: int _strlen(const char str) { int len = 0; while (str++) len++; return len; } And transforming it to use a ulong induction variable, because the type of the pointer index was left as a constant long. This is obviously very bad. The fix is to shrink long constants in getelementptr instructions to intptr_t, making the indvars pass insert a uint induction variable, which is much more efficient. Here's the before code for this function: int %_strlen(sbyte* %str) { entry: %tmp.13 = load sbyte* %str ; <sbyte> [#uses=1] %tmp.24 = seteq sbyte %tmp.13, 0 ; <bool> [#uses=1] br bool %tmp.24, label %loopexit, label %no_exit no_exit: ; preds = %entry, %no_exit * %indvar = phi uint [ %indvar.next, %no_exit ], [ 0, %entry ] ; <uint> [#uses=2] * %indvar = phi ulong [ %indvar.next, %no_exit ], [ 0, %entry ] ; <ulong> [#uses=2] %indvar1 = cast ulong %indvar to uint ; <uint> [#uses=1] %inc.02.sum = add uint %indvar1, 1 ; <uint> [#uses=1] %inc.0.0 = getelementptr sbyte* %str, uint %inc.02.sum ; <sbyte> [#uses=1] %tmp.1 = load sbyte %inc.0.0 ; <sbyte> [#uses=1] %tmp.2 = seteq sbyte %tmp.1, 0 ; <bool> [#uses=1] %indvar.next = add ulong %indvar, 1 ; <ulong> [#uses=1] %indvar.next = add uint %indvar, 1 ; <uint> [#uses=1] br bool %tmp.2, label %loopexit.loopexit, label %no_exit loopexit.loopexit: ; preds = %no_exit %indvar = cast uint %indvar to int ; <int> [#uses=1] %inc.1 = add int %indvar, 1 ; <int> [#uses=1] ret int %inc.1 loopexit: ; preds = %entry ret int 0 } Here's the after code: int %_strlen(sbyte* %str) { entry: %inc.02 = getelementptr sbyte* %str, uint 1 ; <sbyte> [#uses=1] %tmp.13 = load sbyte %str ; <sbyte> [#uses=1] %tmp.24 = seteq sbyte %tmp.13, 0 ; <bool> [#uses=1] br bool %tmp.24, label %loopexit, label %no_exit no_exit: ; preds = %entry, %no_exit *** %indvar = phi uint [ %indvar.next, %no_exit ], [ 0, %entry ] ; <uint> [#uses=3] %indvar = cast uint %indvar to int ; <int> [#uses=1] %inc.0.0 = getelementptr sbyte* %inc.02, uint %indvar ; <sbyte> [#uses=1] %inc.1 = add int %indvar, 1 ; <int> [#uses=1] %tmp.1 = load sbyte %inc.0.0 ; <sbyte> [#uses=1] %tmp.2 = seteq sbyte %tmp.1, 0 ; <bool> [#uses=1] %indvar.next = add uint %indvar, 1 ; <uint> [#uses=1] br bool %tmp.2, label %loopexit, label %no_exit loopexit: ; preds = %entry, %no_exit %len.0.1 = phi int [ 0, %entry ], [ %inc.1, %no_exit ] ; <int> [#uses=1] ret int %len.0.1 } llvm-svn: 13016	2004-04-17 18:16:10 +00:00
Chris Lattner	284d3b0311	Fix some really nasty dominance bugs that were exposed by my patch to make the verifier more strict. This fixes building zlib llvm-svn: 13002	2004-04-16 18:08:07 +00:00
Chris Lattner	6679e46b59	ADd a trivial instcombine: load null -> null llvm-svn: 12940	2004-04-14 03:28:36 +00:00
Chris Lattner	24cf0200c7	Fix a bug in my select transformation llvm-svn: 12826	2004-04-11 01:39:19 +00:00
Chris Lattner	623fba1107	Implement InstCombine/select.ll:test13* llvm-svn: 12821	2004-04-10 22:21:27 +00:00
Chris Lattner	cf4a996cba	Implement InstCombine/add.ll:test20 Canonicalize add of sign bit constant into a xor llvm-svn: 12819	2004-04-10 22:01:55 +00:00
Chris Lattner	f9d9665138	Fix spurious warning in release mode llvm-svn: 12816	2004-04-10 19:15:56 +00:00
Chris Lattner	56e4d3d8ad	Implement select.ll:test12* This transforms code like this: %C = or %A, %B %D = select %cond, %C, %A into: %C = select %cond, %B, 0 %D = or %A, %C Since B is often a constant, the select can often be eliminated. In any case, this reduces the usage count of A, allowing subsequent optimizations to happen. This xform applies when the operator is any of: add, sub, mul, or, xor, and, shl, shr llvm-svn: 12800	2004-04-09 23:46:01 +00:00
Chris Lattner	183b336a54	Fold binary operators with a constant operand into select instructions that have a constant operand. This implements add.ll:test19, shift.ll:test15*, and others that are not tested llvm-svn: 12794	2004-04-09 19:05:30 +00:00
Chris Lattner	cf7baf3519	Implement select.ll:test11 llvm-svn: 12793	2004-04-09 18:19:44 +00:00
Chris Lattner	e228ee5870	Implement InstCombine/cast-propagate.ll llvm-svn: 12784	2004-04-08 20:39:49 +00:00
Chris Lattner	1c631e813d	Implement InstCombine/select.ll:test[7-10] llvm-svn: 12769	2004-04-08 04:43:23 +00:00

1 2 3 4 5 ...

330 Commits