llvm-project

Commit Graph

Author	SHA1	Message	Date
Eric Christopher	f9553572b7	Reformat my last patch slightly. llvm-svn: 95147	2010-02-02 22:29:26 +00:00
Eric Christopher	4264e7e46f	Re-add strcmp and known size object size checking optimization. Passed bootstrap and nightly test run here. llvm-svn: 95145	2010-02-02 22:10:43 +00:00
Chris Lattner	8e2c471614	don't turn (A & (C0?-1:0)) | (B & ~(C0?-1:0)) -> C0 ? A : B for vectors. Codegen is generating awful code or segfaulting in various cases (e.g. PR6204). llvm-svn: 95058	2010-02-02 02:43:51 +00:00
Chris Lattner	302240d73e	fix a crash in loop unswitch on a loop invariant vector condition. llvm-svn: 95055	2010-02-02 02:26:54 +00:00
Dan Gohman	949458d014	LangRef.html says that inttoptr and ptrtoint always use zero-extension when the cast is extending. llvm-svn: 95046	2010-02-02 01:44:02 +00:00
Eric Christopher	14dfc3f6df	Don't need to check the last argument since it'll always be bool. We also don't use TargetData here. llvm-svn: 95040	2010-02-02 00:51:45 +00:00
Eric Christopher	9afa973203	More indentation/tabification fixes. llvm-svn: 95036	2010-02-02 00:13:06 +00:00
Eric Christopher	1408234753	Untabify previous commit. llvm-svn: 95035	2010-02-02 00:06:55 +00:00
Eric Christopher	56e4182c49	Formatting. llvm-svn: 95027	2010-02-01 23:25:03 +00:00
Bob Wilson	d517b52012	Add an option to GVN to remove all partially redundant loads. This is currently disabled by default. This divides the existing load PRE code into 2 phases: first it checks that it is safe to move the load to each of the predecessors where it is unavailable, and then if it is safe, the code is changed to move the load. Radar 7571861. llvm-svn: 95007	2010-02-01 21:17:14 +00:00
Chris Lattner	9306ffa05a	cleanups. llvm-svn: 94995	2010-02-01 19:54:45 +00:00
Chris Lattner	846a52e228	fix rdar://7590304, a miscompilation of objc apps on arm. The caller of objc message send was getting marked arm_apcscc, but the prototype isn't. This is fine at runtime because objcmsgsend is implemented in assembly. Only turn a mismatched caller and callee into 'unreachable' if the callee is a definition. llvm-svn: 94986	2010-02-01 18:11:34 +00:00
Chris Lattner	2cecedf081	fix rdar://7590304, an infinite loop in instcombine. In the invoke case, instcombine can't zap the invoke for fear of changing the CFG. However, we have to do something to prevent the next iteration of instcombine from inserting another store -> undef before the invoke thereby getting into infinite iteration between dead store elim and store insertion. Just zap the callee to null, which will prevent the next iteration from doing anything. llvm-svn: 94985	2010-02-01 18:04:58 +00:00
Bob Wilson	f65ba356e1	Fix pr6198 by moving the isSized() check to an outer conditional. The testcase from pr6198 does not crash for me -- I don't know what's up with that -- so I'm not adding it to the tests. llvm-svn: 94984	2010-02-01 17:41:44 +00:00
Eli Friedman	a2cc2875fc	Simplify/generalize the xor+add->sign-extend instcombine. llvm-svn: 94943	2010-01-31 04:29:12 +00:00
Eli Friedman	37a8197b61	Add a small transform: transform -(X<<Y) to (-X<<Y) when the shift has a single use and X is free to negate. llvm-svn: 94941	2010-01-31 02:30:23 +00:00
Evan Cheng	d86d3fe0c3	Do not mark no-return calls tail calls. It'll screw up special calls like longjmp and it doesn't make much sense for performance reason. If my logic is faulty, please let me know. llvm-svn: 94937	2010-01-31 00:59:31 +00:00
Bob Wilson	56600a15ad	Check alignment of loads when deciding whether it is safe to execute them unconditionally. Besides checking the offset, also check that the underlying object is aligned as much as the load itself. llvm-svn: 94875	2010-01-30 04:42:39 +00:00
Bob Wilson	4b71b6c179	Use more specific types to avoid casts. No functionality change. llvm-svn: 94863	2010-01-30 00:41:10 +00:00
Jakob Stoklund Olesen	e27dc727e2	Keep iterating over all uses when meeting a phi node in AllUsesOfValueWillTrapIfNull(). This bug was exposed by my inliner cost changes in r94615, and caused failures of lencod on most architectures when building with LTO. This patch fixes lencod and 464.h264ref on x86-64 (and likely others). llvm-svn: 94858	2010-01-29 23:54:14 +00:00
Bob Wilson	1b8453067b	Preserve load alignment in instcombine transformations. I've been unable to create a testcase where this matters. The select+load transformation only occurs when isSafeToLoadUnconditionally is true, and in those situations, instcombine also changes the underlying objects to be aligned. This seems like a good idea regardless, and I've verified that it doesn't pessimize the subsequent realignment. llvm-svn: 94850	2010-01-29 22:39:21 +00:00
Eric Christopher	5a0e174863	Revert my last couple of patches. They appear to have broken bison. llvm-svn: 94841	2010-01-29 21:16:24 +00:00
Bob Wilson	34e10c2218	Use uint64_t instead of unsigned for offsets and sizes. llvm-svn: 94835	2010-01-29 20:34:28 +00:00
Bob Wilson	7c42b9d51e	Improve isSafeToLoadUnconditionally to recognize that GEPs with constant indices are safe if the result is known to be within the bounds of the underlying object. llvm-svn: 94829	2010-01-29 19:19:08 +00:00
Duncan Sands	c8a3e56870	Having RHSKnownZero and RHSKnownOne be alternative names for KnownZero and KnownOne (via APInt &RHSKnownZero = KnownZero, etc) seems dangerous and confusing to me: it is easy not to notice this, and then wonder why KnownZero/RHSKnownZero changed underneath you when you modified RHSKnownZero/KnownZero etc. So get rid of this. No intended functionality change (tested with "make check" + llvm-gcc bootstrap). llvm-svn: 94802	2010-01-29 06:18:46 +00:00
Eric Christopher	9b3c02b7da	Make strcpy_chk lower to strcpy if we have a safe size. llvm-svn: 94783	2010-01-29 01:37:11 +00:00
Eric Christopher	997f7ca8c5	Add constant support to object size handling and remove default lowering. We'll either figure it out, or not and be lowered by SelectionDAGBuild. Add test. llvm-svn: 94775	2010-01-29 01:09:57 +00:00
Bill Wendling	48816a0b3f	Generic reformatting and comment fixing. No functionality change. llvm-svn: 94771	2010-01-29 00:52:43 +00:00
Bill Wendling	8277838cf8	Add newline to debugging output, and fix some grammar-os in comment. llvm-svn: 94765	2010-01-29 00:27:39 +00:00
Victor Hernandez	006b53f199	mem2reg erases the dbg.declare intrinsics that it converts to dbg.val intrinsics llvm-svn: 94763	2010-01-29 00:01:35 +00:00
Duncan Sands	3a48b87c54	Fix PR6165. The bug was that LHSKnownZero was being and'd with DemandedMask when it should have been and'd with LowBits. Fix that and while there beef up the logic in the case of a negative LHS. llvm-svn: 94745	2010-01-28 17:22:42 +00:00
Bob Wilson	7577e948e4	Avoid creating redundant PHIs in SSAUpdater::GetValueInMiddleOfBlock. This was already being done in SSAUpdater::GetValueAtEndOfBlock so I've just changed SSAUpdater to check for existing PHIs in both places. llvm-svn: 94690	2010-01-27 22:01:02 +00:00
Jeffrey Yasskin	091217be6f	Kill ModuleProvider and ghost linkage by inverting the relationship between Modules and ModuleProviders. Because the "ModuleProvider" simply materializes GlobalValues now, and doesn't provide modules, it's renamed to "GVMaterializer". Code that used to need a ModuleProvider to materialize Functions can now materialize the Functions directly. Functions no longer use a magic linkage to record that they're materializable; they simply ask the GVMaterializer. Because the C ABI must never change, we can't remove LLVMModuleProviderRef or the functions that refer to it. Instead, because Module now exposes the same functionality ModuleProvider used to, we store a Module* in any LLVMModuleProviderRef and translate in the wrapper methods. The bindings to other languages still use the ModuleProvider concept. It would probably be worth some time to update them to follow the C++ more closely, but I don't intend to do it. Fixes http://llvm.org/PR5737 and http://llvm.org/PR5735. llvm-svn: 94686	2010-01-27 20:34:15 +00:00
Benjamin Kramer	1266d46d32	Don't bother with sprintf, just pass the Twine through. llvm-svn: 94684	2010-01-27 19:58:47 +00:00
Benjamin Kramer	40582a891c	Use the less expensive getName function instead of getNameStr. llvm-svn: 94683	2010-01-27 19:46:52 +00:00
Chris Lattner	65f4733b77	some cleanups. llvm-svn: 94649	2010-01-27 02:12:20 +00:00
Chris Lattner	711e701f1c	no need to check for null llvm-svn: 94648	2010-01-27 02:04:20 +00:00
Victor Hernandez	477d9274bb	When converting dbg.declare to dbg.value, attach promoted store's debug metadata to dbg.value llvm-svn: 94634	2010-01-27 00:44:36 +00:00
Victor Hernandez	2b17e2a452	Avoid extra calls to MD->getNumOperands() llvm-svn: 94618	2010-01-26 23:29:09 +00:00
Victor Hernandez	9ecd2f039f	Switch AllocaDbgDeclares to SmallVector and don't leak DIFactory llvm-svn: 94567	2010-01-26 18:57:53 +00:00
Victor Hernandez	cd94410152	In mem2reg, for all alloca/stores that get promoted where the alloca has an associated llvm.dbg.declare intrinsic, insert an llvm.dbg.var intrinsic before each store. llvm-svn: 94493	2010-01-26 02:42:15 +00:00
Bob Wilson	70c8fe5e4e	Remove check for an impossible condition: the condition of the while loop has already checked that TmpBB->getSinglePredecessor() is non-null. llvm-svn: 94451	2010-01-25 21:28:05 +00:00
Bob Wilson	fc060e4337	Change Value::getUnderlyingObject to have the MaxLookup value specified as a parameter with a default value, instead of just hardcoding it in the implementation. The limit of MaxLookup = 6 was introduced in r69151 to fix a performance problem with O(n^2) behavior in instcombine, but the scalarrepl pass is relying on getUnderlyingObject to go all the way back to an AllocaInst. Making the limit part of the method signature makes it clear that by default the result is limited and should help avoid similar problems in the future. This fixes pr6126. llvm-svn: 94433	2010-01-25 18:26:54 +00:00
Victor Hernandez	8a588e1444	Revert r94260 until findDbgDeclare() is made more efficient llvm-svn: 94432	2010-01-25 17:52:13 +00:00
Chris Lattner	823aed16f9	make -fno-rtti the default unless a directory builds with REQUIRES_RTTI. llvm-svn: 94378	2010-01-24 20:43:08 +00:00
Chris Lattner	1b35bbe813	change the canonical form of "cond ? -1 : 0" to be "sext cond" instead of a select. This simplifies some instcombine code, matches the policy for zext (cond ? 1 : 0 -> zext), and allows us to generate better code for a testcase on ppc. llvm-svn: 94339	2010-01-24 00:09:49 +00:00
Chris Lattner	e112ff64c5	fix a potential overflow issue Eli pointed out. llvm-svn: 94336	2010-01-23 23:31:46 +00:00
Nick Lewycky	7e7ed8b9e5	Speculatively revert r94322 to see if it fixes darwin selfhost buildbot. llvm-svn: 94331	2010-01-23 20:32:12 +00:00
Chris Lattner	29b15c5cfd	third bug from PR6119: the xor dupe extension allows for arbitrary terminators in predecessors, don't assume it is a conditional or uncond branch. The testcase shows an example where they can happen with switches. llvm-svn: 94323	2010-01-23 19:21:31 +00:00
Nick Lewycky	32966aed9d	Teach DAE that even though it can't modify the function signature of an externally visible function, it can still find all callers of it and replace the parameters to a dead argument with undef. llvm-svn: 94322	2010-01-23 19:19:34 +00:00
Chris Lattner	ba2d0b89ff	add an early out to ProcessBranchOnXOR to speed it up, handle the case when we can infer an input to the xor from all inputs that agree, instead of going into an infinite loop. Another part of PR6199 llvm-svn: 94321	2010-01-23 19:16:25 +00:00
Chris Lattner	de5ab4860f	fix a crash in jump threading, PR6119 llvm-svn: 94319	2010-01-23 18:56:07 +00:00
Chris Lattner	249da5cb73	implement a simple instcombine xform that has been in the readme forever. llvm-svn: 94318	2010-01-23 18:49:30 +00:00
Eric Christopher	ba7cd4c393	Reapply 94059 while fixing the calling convention setup for strcpy. llvm-svn: 94287	2010-01-23 05:29:06 +00:00
Victor Hernandez	5006e43faf	In mem2reg, for all alloca/stores that get promoted where the alloca has an associated llvm.dbg.declare intrinsic, insert an llvm.dbg.var intrinsic before each store llvm-svn: 94260	2010-01-23 00:17:34 +00:00
Benjamin Kramer	3838dfbaea	Another strncmp -> StringRef.startswith simplification. llvm-svn: 94203	2010-01-22 20:00:21 +00:00
Bob Wilson	6c0c8d41b4	Revert 94059. It is breaking the MultiSource/Benchmarks/Prolangs-C/bison test on ARM. llvm-svn: 94198	2010-01-22 19:16:40 +00:00
Victor Hernandez	5f8c8c034a	Keep ignoring pointer-to-pointer bitcasts llvm-svn: 94194	2010-01-22 19:05:05 +00:00
Chris Lattner	7ba0661f27	Stop building RTTI information for most llvm libraries. Notable missing ones are libsupport, libsystem and libvmcore. libvmcore is currently blocked on bugpoint, which uses EH. Once it stops using EH, we can switch it off. This #if 0's out 3 unit tests, because gtest requires RTTI information. Suggestions welcome on how to fix this. llvm-svn: 94164	2010-01-22 06:49:46 +00:00
Dan Gohman	045f81981a	Revert LoopStrengthReduce.cpp to pre-r94061 for now. llvm-svn: 94123	2010-01-22 00:46:49 +00:00
Victor Hernandez	7b151e9f06	No need to look through bitcasts for DbgInfoIntrinsic llvm-svn: 94114	2010-01-21 23:09:12 +00:00
Victor Hernandez	ae4d949721	DbgInfoIntrinsic no longer appear in an instruction's use list llvm-svn: 94113	2010-01-21 23:08:36 +00:00
Victor Hernandez	5f5abd598c	No need to look through bitcasts for DbgInfoIntrinsic llvm-svn: 94112	2010-01-21 23:07:15 +00:00
Victor Hernandez	1df65186d1	DbgInfoIntrinsics no longer appear in an instruction's use list; so clean up looking for them in use iterations and remove OnlyUsedByDbgInfoIntrinsics() llvm-svn: 94111	2010-01-21 23:05:53 +00:00
Dan Gohman	b1ee154b6b	When inserting expressions for post-increment users which contain loop-variant components, adds must be inserted after the increment. Keep track of the increment position for this case, and insert these adds in the correct location. llvm-svn: 94110	2010-01-21 23:01:22 +00:00
Dan Gohman	cb8d577eb2	Include IVUsers information in LSR's debug output. llvm-svn: 94108	2010-01-21 22:46:32 +00:00
Dan Gohman	29916e023d	Prune the search for candidate formulae if the number of register operands exceeds the number of registers used in the initial solution, as that wouldn't lead to a profitable solution anyway. llvm-svn: 94107	2010-01-21 22:42:49 +00:00
Dan Gohman	c903499ff8	Add a comment. llvm-svn: 94104	2010-01-21 21:31:09 +00:00
Chris Lattner	24716b6c63	It turns out that this #include is needed because otherwise ValueMapper.cpp ends up calling an out of line __ZNK4llvm12PATypeHolder3getEv, which is a template and llvm-config determines arbitrarily to use the one in libipo. This sucks, but keeping the #include is a reasonable workaround. llvm-svn: 94103	2010-01-21 21:29:25 +00:00
Chris Lattner	9889b4be04	unbreak the build, apparently without this transformutils starts depending on libipa? llvm-svn: 94102	2010-01-21 21:20:51 +00:00
Chris Lattner	e39837d5ee	tidy up llvm-svn: 94101	2010-01-21 21:05:54 +00:00
Victor Hernandez	a9ad174b49	Don't need to include IntrinsicInst.h any more llvm-svn: 94092	2010-01-21 19:33:59 +00:00
Victor Hernandez	d089f4e10b	No need to map NULL operands of metadata llvm-svn: 94091	2010-01-21 19:26:20 +00:00
Dan Gohman	51ad99d2c5	Re-implement the main strength-reduction portion of LoopStrengthReduction. This new version is much more aggressive about doing "full" reduction in cases where it reduces register pressure, and also more aggressive about rewriting induction variables to count down (or up) to zero when doing so reduces register pressure. It currently uses fairly simplistic algorithms for finding reuse opportunities, but it introduces a new framework that allows it to combine multiple strategies at once to form hybrid solutions, instead of doing all full-reduction or all base+index. llvm-svn: 94061	2010-01-21 02:09:26 +00:00
Eric Christopher	fa863258d0	Add strcpy_chk -> strcpy support for "don't know" object size answers. This will update as object size checking gets better information. llvm-svn: 94059	2010-01-21 01:04:38 +00:00
Chris Lattner	3c5bf71353	simplify this code. llvm-svn: 94048	2010-01-20 23:30:28 +00:00
Jakob Stoklund Olesen	8a19d3c96c	Move per-function inline threshold calculation to a method. No functional change except the forgotten test for InlineLimit.getNumOccurrences() == 0 in the CurrentThreshold2 calculation. llvm-svn: 94007	2010-01-20 17:51:28 +00:00
Victor Hernandez	f2462407ee	Switch Elts from vector to SmallVector llvm-svn: 93989	2010-01-20 06:56:16 +00:00
Victor Hernandez	5fa88d4e30	Map operands of all function-local metadata, not just metadata passed to llvm.dbg.declare intrinsics llvm-svn: 93979	2010-01-20 05:49:59 +00:00
Dan Gohman	ca19445d08	When doing address-mode sinking, expand the base register first, rather than the scaled register. This makes it more likely that subsequent AddrModeMatcher queries will match the new address the same way as the old, instead of accidentally matching what had been the base register as the new scaled register, and then failing to match the scaled register. This fixes some problems with address-mode sinking multiple muls into a block, which will be a lot more common with some upcoming LoopStrengthReduction changes. llvm-svn: 93935	2010-01-19 22:45:06 +00:00
Chris Lattner	18f49ce2d3	optimize ~(~X >>s Y) --> (X >>s Y), patch by Edmund Grimley Evans! llvm-svn: 93884	2010-01-19 18:16:19 +00:00
Bob Wilson	58d59fe394	Fix a crash in scalarrepl for memcpy/memmove where the source and destination are the same. I had already fixed a similar problem where the source and destination were different bitcasts derived from the same alloca, but the previous fix still did not handle the case where both operands are exactly the same value. Radar 7552893. llvm-svn: 93848	2010-01-19 04:32:48 +00:00
Eric Christopher	84bd316bd6	Fix comment. llvm-svn: 93831	2010-01-19 01:20:15 +00:00
Chris Lattner	43f2fa6201	my instcombine transformations to make extension elimination more aggressive changed the canonical form from sext(trunc(x)) to ashr(lshr(x)), make sure to transform a couple more things into that canonical form, and catch a case where we missed turning zext/shl/ashr into a single sext. llvm-svn: 93787	2010-01-18 22:19:16 +00:00
Devang Patel	696cb8d410	While mapping llvm.dbg.declare intrinsic manually map its operand, if possible, because it points to an alloca instruction through metadata. llvm-svn: 93757	2010-01-18 19:52:14 +00:00
Owen Anderson	cdea3572fa	Convert some of the dynamic opcode lookups into static ones. llvm-svn: 93693	2010-01-17 19:33:27 +00:00
Owen Anderson	fa1edea9ce	Fix comment. llvm-svn: 93679	2010-01-17 06:49:03 +00:00
Bob Wilson	e0da4b6cff	Fix a comment typo. llvm-svn: 93560	2010-01-15 21:55:02 +00:00
Bill Wendling	ad7a5b07a7	When the visitSub method was split into visitSub and visitFSub, this xform was added to the FSub version. However, the original version of this xform guarded against doing this for floating point (!Op0->getType()->isFPOrFPVector()). This is causing LLVM to perform incorrect xforms for code like: void func(double *rhi, double *rlo, double xh, double xl, double yh, double yl){ double mh, ml; double c = 134217729.0; double up, u1, u2, vp, v1, v2; up = xh*c; u1 = (xh - up) + up; u2 = xh - u1; vp = yh*c; v1 = (yh - vp) + vp; v2 = yh - v1; mh = xh*yh; ml = (((u1*v1 - mh) + (u1*v2)) + (u2*v1)) + (u2*v2); ml += xh*yl + xl*yh; *rhi = mh + ml; *rlo = (mh - (*rhi)) + ml; } The last line was optimized away, but rl is intended to be the difference between the infinitely precise result of mh + ml and after it has been rounded to double precision. llvm-svn: 93369	2010-01-13 23:23:17 +00:00
Chris Lattner	573da8ac90	1) Use the new SimplifyInstructionsInBlock routine instead of the copy in JT. 2) When cloning blocks for PHI or xor conditions, use instsimplify to simplify the code as we go. This allows us to squish common cases early in JT which opens up opportunities for subsequent iterations, and allows it to completely simplify the testcase. llvm-svn: 93253	2010-01-12 20:41:47 +00:00
Chris Lattner	7c743f2c74	add a helper function. llvm-svn: 93251	2010-01-12 19:40:54 +00:00
Chris Lattner	af7855d571	tidy up llvm-svn: 93222	2010-01-12 02:07:50 +00:00
Chris Lattner	eb73bdb2e1	Teach jump threading to duplicate small blocks when the branch condition is a xor with a phi node. This eliminates nonsense like this from 176.gcc in several places: LBB166_84: testl %eax, %eax - setne %al - xorb %cl, %al - notb %al - testb $1, %al - je LBB166_85 + je LBB166_69 + jmp LBB166_85 This is rdar://7391699 llvm-svn: 93221	2010-01-12 02:07:17 +00:00
Chris Lattner	6a19ed0b86	some cleanup, and make it obvious that ProcessJumpOnPHI only works on branches by renaming it and checking for a branch at the call site. llvm-svn: 93208	2010-01-11 23:41:09 +00:00
Chris Lattner	d1a3efedd8	reenable the piece that turns trunc(zext(x)) -> x even if zext has multiple uses, codegen has no apparent problem with the trunc version of this, because it turns into a simple subreg idiom llvm-svn: 93202	2010-01-11 22:49:40 +00:00
Chris Lattner	a6b1356cf9	Disable folding sext(trunc(x)) -> x (and other similar cast/cast cases) when the trunc has multiple uses. Codegen is not able to coalesce the subreg case correctly and so this leads to higher register pressure and spilling (see PR5997). This speeds up 256.bzip2 from 8.60 -> 8.04s on my machine, ~7%. llvm-svn: 93200	2010-01-11 22:45:25 +00:00
Chris Lattner	9518869423	add one more bitfield optimization, allowing clang to generate good code on PR4216: _test_bitfield: ## @test_bitfield orl $32962, %edi movl $4294941946, %eax andq %rdi, %rax ret instead of: _test_bitfield: movl $4294941696, %ecx movl %edi, %eax orl $194, %edi orl $32768, %eax andq $250, %rdi andq %rax, %rcx movq %rdi, %rax orq %rcx, %rax ret Evan is looking into the remaining andq+imm -> andl optimization. llvm-svn: 93147	2010-01-11 06:55:24 +00:00
Chris Lattner	0a85420409	Extend CanEvaluateZExtd to handle and/or/xor more aggressively in the BitsToClear case. This allows it to promote expressions which have an and/or/xor after the lshr, promoting cases like test2 (from PR4216) and test3 (random example extracted from a spec benchmark). clang now compiles the code in PR4216 into: _test_bitfield: ## @test_bitfield movl %edi, %eax orl $194, %eax movl $4294902010, %ecx andq %rax, %rcx orl $32768, %edi andq $39936, %rdi movq %rdi, %rax orq %rcx, %rax ret instead of: _test_bitfield: ## @test_bitfield movl %edi, %eax orl $194, %eax movl $4294902010, %ecx andq %rax, %rcx shrl $8, %edi orl $128, %edi shlq $8, %rdi andq $39936, %rdi movq %rdi, %rax orq %rcx, %rax ret which is still not great, but is progress. llvm-svn: 93145	2010-01-11 04:05:13 +00:00
Chris Lattner	12bd8992b3	Remove the dead TD argument to CanEvaluateZExtd, and add a new BitsToClear result which allows us to start promoting expressions that end with a lshr-by-constant. This is conservatively correct and better than what we had before (see testcases) but still needs to be extended further. llvm-svn: 93144	2010-01-11 03:32:00 +00:00
Chris Lattner	172630abd2	improve comments, remove dead TD argument to CanEvaluateSExtd. llvm-svn: 93143	2010-01-11 02:43:35 +00:00
Chris Lattner	7dd540ee24	teach sext optimization to handle truncs from types that are not the dest of the sext. llvm-svn: 93128	2010-01-10 20:30:41 +00:00
Chris Lattner	39d2daa94c	teach zext optimization how to deal with truncs that don't come from the zext dest type. This allows us to handle test52/53 in cast.ll, and allows llvm-gcc to generate much better code for PR4216 in -m64 mode: _test_bitfield: ## @test_bitfield orl $32962, %edi movl %edi, %eax andl $-25350, %eax ret This also fixes a bug handling vector extends, ensuring that the mask produced is a vector constant, not an integer constant. llvm-svn: 93127	2010-01-10 20:25:54 +00:00
Chris Lattner	1a05fddcdc	simplify CanEvaluateSExtd to return a bool now that we have a simpler profitability predicate. llvm-svn: 93111	2010-01-10 07:57:20 +00:00
Chris Lattner	d7816780e2	the NumCastsRemoved argument to CanEvaluateSExtd is dead, remove it. llvm-svn: 93110	2010-01-10 07:42:21 +00:00
Chris Lattner	2fff10c424	now that the cost model has changed, we can always consider elimination of a sign extend to be a win, which simplifies the client of CanEvaluateSExtd, and allows us to eliminate more casts (examples taken from real code). llvm-svn: 93109	2010-01-10 07:40:50 +00:00
Chris Lattner	d8509424a4	change the preferred canonical form for a sign extension to be lshr+ashr instead of trunc+sext. We want to avoid type conversions whenever possible, it is easier to codegen expressions without truncates and extensions. llvm-svn: 93107	2010-01-10 07:08:30 +00:00
Chris Lattner	2b459fe7e1	fix indentation of switch statements, no functionality change. llvm-svn: 93106	2010-01-10 06:59:55 +00:00
Chris Lattner	127bbc715e	fix pasto that broke bootstrap. llvm-svn: 93105	2010-01-10 06:50:04 +00:00
Chris Lattner	b7be7cc486	simplify CanEvaluateZExtd now that we don't care about the number of bits known clear in the result and don't care about the # casts eliminated. TD is also dead but keeping it for now. llvm-svn: 93098	2010-01-10 02:50:04 +00:00
Chris Lattner	49d2c9764d	two changes: 1) don't try to optimize a sext or zext that is only used by a trunc, let the trunc get optimized first. This avoids some pointless effort in some common cases since instcombine scans down a block in the first pass. 2) Change the cost model for zext elimination to consider an 'and' cheaper than a zext. This allows us to do it more aggressively, and for the next patch to simplify the code quite a bit. llvm-svn: 93097	2010-01-10 02:39:31 +00:00
Chris Lattner	f0af17dab3	enhance CanEvaluateZExtd to handle shift left and sext, allowing more expressions to be promoted and casts eliminated. llvm-svn: 93096	2010-01-10 02:22:12 +00:00
Chris Lattner	7723e2b10f	remove an xform subsumed by EvaluateInDifferentType. llvm-svn: 93095	2010-01-10 01:35:55 +00:00
Julien Lerouge	321098ebec	Fix nondeterministic behavior. llvm-svn: 93093	2010-01-10 01:07:22 +00:00
Chris Lattner	c95a7a21b7	clean up this xform by using m_Trunc. llvm-svn: 93092	2010-01-10 01:04:31 +00:00
Chris Lattner	883550afe8	inline and remove the rest of commonIntCastTransforms. llvm-svn: 93091	2010-01-10 01:00:46 +00:00
Chris Lattner	c3aca38468	Inline the expression type promotion/demotion stuff out of commonIntCastTransforms into the callers, eliminating a switch, and allowing the static predicate methods to be moved down to live next to the corresponding function. No functionality change. llvm-svn: 93089	2010-01-10 00:58:42 +00:00
Chris Lattner	ab7087ad66	only factor from expressions whose uses are empty and whose base is the right expression type. This fixes PR5981. llvm-svn: 93045	2010-01-09 06:01:36 +00:00
Julien Lerouge	f50a3f19da	Fix nondeterministic behavior. llvm-svn: 93038	2010-01-09 01:06:49 +00:00
Eric Christopher	4a1d7e1506	Remove unnecessary dyn_cast and add a comment. Part of a WIP. llvm-svn: 93026	2010-01-08 21:37:11 +00:00
Chris Lattner	9242ae047c	implement a theoretical fixme. llvm-svn: 93024	2010-01-08 19:28:47 +00:00
Chris Lattner	10840e9e13	rename CanEvaluateInDifferentType -> CanEvaluateTruncated and simplify it now that it is only used for truncates. llvm-svn: 93021	2010-01-08 19:19:23 +00:00
Chris Lattner	a1e223ea10	teach instcombine to delete sign extending shift pairs (sra(shl X, C), C) when the input is already sign extended. llvm-svn: 93019	2010-01-08 19:04:21 +00:00
Duncan Sands	4a8b15dc74	Suppress an unused variable warning when assertions are off; remove some trailing whitespace while there. llvm-svn: 93008	2010-01-08 17:51:48 +00:00
Chris Lattner	8c92b57df9	tidy up some stuff duncan pointed out. llvm-svn: 93007	2010-01-08 17:48:19 +00:00
Chris Lattner	35d3b9dcd0	teach ComputeNumSignBits to look through PHI nodes. llvm-svn: 92964	2010-01-07 23:44:37 +00:00
Chris Lattner	3057c37959	Enhance instcombine to reason more strongly about promoting computation that feeds into a zext, similar to the patch I did yesterday for sext. There is a lot of room for extension beyond this patch. llvm-svn: 92962	2010-01-07 23:41:00 +00:00
Benjamin Kramer	76e2766442	Use a do-while loop instead of while + boolean. llvm-svn: 92912	2010-01-07 13:50:07 +00:00
Duncan Sands	f117880ab0	Be less stingy as to how many selects and phi nodes we are prepared to look through. llvm-svn: 92898	2010-01-07 05:48:42 +00:00
Chris Lattner	9855a6bb7c	handle ConstantVector while I'm in here. llvm-svn: 92892	2010-01-07 01:20:20 +00:00
Chris Lattner	64ecc468bd	fix a globalopt crash on 'bullet' (handling evaluation of a store to an element of a vector in a static ctor) which occurs with an unrelated patch I'm testing. Annoyingly, EvaluateStoreInto basically does exactly the same stuff as InsertElement constant folding, but it now handles vectors, and you can't insertelement into a vector. It would be 'really nice' if GEP into a vector were not legal. llvm-svn: 92889	2010-01-07 01:16:21 +00:00
Eric Christopher	2cdb806fd8	Move the object size intrinsic optimization to inst-combine and make it work for any integer size return type. llvm-svn: 92853	2010-01-06 20:04:44 +00:00
Duncan Sands	c8493da5b1	Fix a README item: have functionattrs look through selects and phi nodes when deciding which pointers point to local memory. I actually checked long ago how useful this is, and it isn't very: it hardly ever fires in the testsuite, but since Chris wants it here it is! llvm-svn: 92836	2010-01-06 15:37:47 +00:00
Mikhail Glushenkov	40d2429b28	Formatting. llvm-svn: 92831	2010-01-06 09:20:39 +00:00
Duncan Sands	78376ad7e1	Partially address a README by having functionattrs consider calls to memcpy, memset and other intrinsics that only access their arguments to be readnone if the intrinsic's arguments all point to local memory. This improves the testcase in the README to readonly, but it could in theory be made readnone, however this would involve more sophisticated analysis that looks through the memcpy. llvm-svn: 92829	2010-01-06 08:45:52 +00:00
Chris Lattner	4339f2abdb	tweaks suggested by Duncan llvm-svn: 92824	2010-01-06 05:32:15 +00:00
Chris Lattner	98748c0964	Teach instcombine's sext elimination logic to be more aggressive. Previously, instcombine would only promote an expression tree to the larger type if doing so eliminated two casts. This is because of the need to manually do the sign extend after the promoted expression tree with two shifts. Now, we keep track of whether the result of the computation is going to be properly sign extended already. If so, we can unconditionally promote the expression, which allows us to zap more sext's. This implements rdar://6598839 (aka gcc pr38751) llvm-svn: 92815	2010-01-06 01:56:21 +00:00
Chris Lattner	8600dd3d7c	simplify this code. llvm-svn: 92800	2010-01-05 23:00:30 +00:00
Chris Lattner	554d0564ff	make this a static function instead of a method. llvm-svn: 92795	2010-01-05 22:30:42 +00:00
Chris Lattner	a93c63c22d	more rearrangement and cleanup, fix my test failure. llvm-svn: 92792	2010-01-05 22:21:18 +00:00
Chris Lattner	f476ef502c	cleanup llvm-svn: 92790	2010-01-05 22:07:33 +00:00
Chris Lattner	f88dd5ed64	remove two trunc xforms that are subsumed by EvaluateInDifferentType. The only difference is that EvaluateInDifferentType checks to ensure they are profitable before doing them :) llvm-svn: 92788	2010-01-05 22:01:41 +00:00
Chris Lattner	44a63815b9	just remove this xform which is subsumed by others. llvm-svn: 92775	2010-01-05 21:16:30 +00:00
Chris Lattner	b82a840eb2	move a trunc-specific transform out of commonIntCastTransforms into visitTrunc. llvm-svn: 92773	2010-01-05 21:11:17 +00:00
Benjamin Kramer	d2564e3afb	Move remaining stuff to the isInteger predicate. llvm-svn: 92771	2010-01-05 21:05:54 +00:00
Chris Lattner	fd7e42b65d	move a zext specific xform out of commonIntCastTransforms into visitZExt and modernize it. llvm-svn: 92770	2010-01-05 21:04:47 +00:00
Chris Lattner	aaccc8de62	move a trunc-specific xform out of commonIntCastTransforms into visitTrunc llvm-svn: 92768	2010-01-05 20:57:30 +00:00
Chris Lattner	dec6847bf6	reduce indentation llvm-svn: 92766	2010-01-05 20:56:24 +00:00
Benjamin Kramer	a81a6dff0d	Convert a ton of simple integer type equality tests to the new predicate. llvm-svn: 92760	2010-01-05 20:07:06 +00:00
Chris Lattner	54f4e39956	optimize comparisons against cttz/ctlz/ctpop, patch by Alastair Lynn! llvm-svn: 92745	2010-01-05 18:09:56 +00:00
Dan Gohman	c3c031bb37	Nick Lewycky pointed out that this code makes changes unconditionally. llvm-svn: 92739	2010-01-05 17:50:58 +00:00
Dan Gohman	b5358003fb	Set Changed properly after calling DeleteDeadPHIs. llvm-svn: 92735	2010-01-05 16:31:45 +00:00
Dan Gohman	28943873e6	Use do+while instead of while for loops which obviously have a non-zero trip count. Use SmallVector's pop_back_val(). llvm-svn: 92734	2010-01-05 16:27:25 +00:00
Dan Gohman	92fdb96474	Fix indentation. llvm-svn: 92733	2010-01-05 16:20:55 +00:00
Dan Gohman	cb99fe9839	Make RecursivelyDeleteTriviallyDeadInstructions, RecursivelyDeleteDeadPHINode, and DeleteDeadPHIs return a flag indicating whether they made any changes. llvm-svn: 92732	2010-01-05 15:45:31 +00:00
Benjamin Kramer	f7cc698b69	Add newline at EOF. llvm-svn: 92727	2010-01-05 13:32:48 +00:00
Benjamin Kramer	ccce8bae14	Avoid going through the LLVMContext for type equality where it's safe to dereference the type pointer. llvm-svn: 92726	2010-01-05 13:12:22 +00:00
Chris Lattner	223812d547	prune some #includes. llvm-svn: 92712	2010-01-05 07:54:43 +00:00
Chris Lattner	0a8191ee88	split and/or/xor out into one overly-large (2000LOC) file. However, I think it does make sense to keep them together, at least for now. llvm-svn: 92711	2010-01-05 07:50:36 +00:00
Chris Lattner	ed41b14f54	missed file with previous commit. llvm-svn: 92710	2010-01-05 07:45:02 +00:00
Chris Lattner	dc67e13442	split instcombine of shifts out to its own file. llvm-svn: 92709	2010-01-05 07:44:46 +00:00
Chris Lattner	e903f38b4d	eliminate getBitCastOperand and simplify some over-complex inbounds stuff. llvm-svn: 92708	2010-01-05 07:42:10 +00:00
Chris Lattner	7a9e47ac4b	split call handling out to InstCombineCalls.cpp llvm-svn: 92707	2010-01-05 07:32:13 +00:00
Chris Lattner	9da1cb243b	optimize cttz and ctlz when we can prove something about the leading/trailing bits. Patch by Alastair Lynn! llvm-svn: 92706	2010-01-05 07:23:56 +00:00
Chris Lattner	85e65e58ac	this inline function moved to addsub llvm-svn: 92705	2010-01-05 07:20:54 +00:00
Chris Lattner	82aa888e8c	split add/sub out to its own file. Eliminate use of dyn_castNotVal in the X+~X transform. dyn_castNotVal is dramatic overkill for what the xform needed. llvm-svn: 92704	2010-01-05 07:18:46 +00:00
Chris Lattner	c7de92ae15	all the places we use hasOneUse() we know are instructions, so inline and simplify. llvm-svn: 92700	2010-01-05 07:04:23 +00:00
Chris Lattner	c6493f070e	eliminate AssociativeOpt and its last uses. llvm-svn: 92697	2010-01-05 07:01:16 +00:00
Chris Lattner	94694c7f0b	inline the FoldICmpLogical functor. llvm-svn: 92695	2010-01-05 06:59:49 +00:00
Chris Lattner	98d48a0b76	inline the 'AddRHS' transformation, simplifying things significantly. Eliminate the 'AddMaskingAnd' transformation, it is redundant with this more general code right below it: // A+B --> A\|B iff A and B have no bits set in common. llvm-svn: 92693	2010-01-05 06:29:13 +00:00
Chris Lattner	39b063bf37	remove massive over-generality manifested as a big template that got instantiated. There is no reason for instcombine to try this hard for simple associative optimizations. Next up, eliminate the template completely. llvm-svn: 92692	2010-01-05 06:24:06 +00:00
Chris Lattner	dc054bf39a	split mul/div/rem instructions out to their own file. llvm-svn: 92689	2010-01-05 06:09:35 +00:00
Chris Lattner	1e7b7b50b1	clean up header. llvm-svn: 92688	2010-01-05 06:05:07 +00:00
Chris Lattner	8f771cb78f	split select out to its own file. llvm-svn: 92687	2010-01-05 06:03:12 +00:00
Chris Lattner	a65e2f7304	split out load/store/alloca. llvm-svn: 92685	2010-01-05 05:57:49 +00:00
Chris Lattner	841af4f03d	reduce indentation llvm-svn: 92684	2010-01-05 05:42:08 +00:00
Chris Lattner	ec97a90221	split vector stuff out to InstCombineVectorOps.cpp llvm-svn: 92683	2010-01-05 05:36:20 +00:00
Chris Lattner	de1feded32	split PHI node stuff out to InstCombinePHI.cpp llvm-svn: 92682	2010-01-05 05:31:55 +00:00
Chris Lattner	27acfcd1c4	convert various IntrinsicInst's to use class instead of struct. llvm-svn: 92681	2010-01-05 05:21:26 +00:00
Chris Lattner	f741d72b84	fix an infinite loop in reassociate building emacs. llvm-svn: 92679	2010-01-05 04:55:35 +00:00
David Greene	cf0addf927	Change errs() to dbgs(). llvm-svn: 92639	2010-01-05 01:28:37 +00:00
David Greene	6ef94ad615	Change errs() to dbgs(). llvm-svn: 92636	2010-01-05 01:28:29 +00:00
David Greene	74e8bd05cc	Change errs() to dbgs(). llvm-svn: 92633	2010-01-05 01:28:12 +00:00
David Greene	9fcfd96da9	Change errs() to dbgs(). llvm-svn: 92631	2010-01-05 01:28:07 +00:00
David Greene	44cb8ade45	Change errs() to dbgs(). llvm-svn: 92629	2010-01-05 01:28:05 +00:00
David Greene	8306b60d56	Change errs() to dbgs(). llvm-svn: 92627	2010-01-05 01:27:54 +00:00
David Greene	0122fc495d	Change errs() to dbgs(). llvm-svn: 92625	2010-01-05 01:27:51 +00:00
David Greene	241992382e	Change errs() to dbgs(). llvm-svn: 92624	2010-01-05 01:27:47 +00:00
David Greene	e0b9789593	Change errs() to dbgs(). llvm-svn: 92623	2010-01-05 01:27:44 +00:00
David Greene	6bc0776343	Change errs() to dbgs(). llvm-svn: 92622	2010-01-05 01:27:39 +00:00
David Greene	3a79df0993	Change errs() to dbgs(). llvm-svn: 92620	2010-01-05 01:27:33 +00:00
David Greene	0fd862254e	Change errs() to dbgs(). llvm-svn: 92619	2010-01-05 01:27:30 +00:00
David Greene	d17c3916d0	Change errs() to dbgs(). llvm-svn: 92617	2010-01-05 01:27:24 +00:00
David Greene	9ddc6e2e12	Change errs() to dbgs(). llvm-svn: 92615	2010-01-05 01:27:21 +00:00
David Greene	1efdb45562	Change errs() to dbgs(). llvm-svn: 92614	2010-01-05 01:27:19 +00:00
David Greene	2e6efc441f	Change errs() to dbgs(). llvm-svn: 92613	2010-01-05 01:27:17 +00:00
David Greene	389fc3b9f6	Change errs() to dbgs(). llvm-svn: 92612	2010-01-05 01:27:15 +00:00
David Greene	74e2d4917d	Change errs() to dbgs(). llvm-svn: 92611	2010-01-05 01:27:11 +00:00
David Greene	48c86bedbd	Change errs() to dbgs(). llvm-svn: 92610	2010-01-05 01:27:09 +00:00
David Greene	0dd384cfd0	Change errs() to dbgs(). llvm-svn: 92609	2010-01-05 01:27:06 +00:00
David Greene	d9c355d590	Change errs() to dbgs(). llvm-svn: 92608	2010-01-05 01:27:04 +00:00
David Greene	b72ad95ecf	Change errs() to dbgs(). llvm-svn: 92607	2010-01-05 01:27:01 +00:00
David Greene	084b0dde9d	Change errs() to dbgs(). llvm-svn: 92606	2010-01-05 01:26:57 +00:00
David Greene	76a4e852f8	Change errs() to dbgs(). llvm-svn: 92605	2010-01-05 01:26:54 +00:00
David Greene	725c7c3f2e	Change errs() to dbgs(). llvm-svn: 92604	2010-01-05 01:26:52 +00:00
David Greene	3774a38fdf	Change errs() to dbgs(). llvm-svn: 92603	2010-01-05 01:26:49 +00:00
David Greene	50c54238e4	Change errs() to dbgs(). llvm-svn: 92602	2010-01-05 01:26:45 +00:00
David Greene	0ad6dce031	Change errs() to dbgs(). llvm-svn: 92601	2010-01-05 01:26:44 +00:00
David Greene	627f40a9f2	Change errs() to dbgs(). llvm-svn: 92600	2010-01-05 01:26:41 +00:00
David Greene	a8a32dd987	Change errs() to dbgs(). llvm-svn: 92599	2010-01-05 01:26:39 +00:00
Devang Patel	be94f23992	Remove dead debug info intrinsics. Intrinsic::dbg_stoppoint Intrinsic::dbg_region_start Intrinsic::dbg_region_end Intrinsic::dbg_func_start AutoUpgrade simply ignores these intrinsics now. llvm-svn: 92557	2010-01-05 01:10:40 +00:00
Daniel Dunbar	72a87448c1	Fix some struct/class specifier mismatches. llvm-svn: 92550	2010-01-05 00:15:58 +00:00
Chris Lattner	a751d09c08	Truncate GEP indexes larger than the pointer size down to pointer size when doing this transform if the GEP is not inbounds. No testcase because it is very difficult to trigger this: instcombine already canonicalizes GEP indices to pointer size, so it relies on specific permutations of the instcombine worklist. Thanks to Duncan for pointing this possible problem out. llvm-svn: 92495	2010-01-04 18:57:15 +00:00
Chris Lattner	2cb08e69b1	silence a bogus 'might be used uninit' warning from GCC. llvm-svn: 92494	2010-01-04 18:48:26 +00:00
Chris Lattner	59d95743c8	move some more cast-related stuff llvm-svn: 92471	2010-01-04 07:59:07 +00:00
Mikhail Glushenkov	6a8ac8ce8f	80-col violations, trailing whitespace. llvm-svn: 92470	2010-01-04 07:55:25 +00:00
Chris Lattner	92be2adba6	move the [Can]EvaluateInDifferentType functions out to InstCombineCasts.cpp llvm-svn: 92469	2010-01-04 07:54:59 +00:00
Chris Lattner	2b295a0eba	split 943 lines of instcombine out to a new InstCombineCasts.cpp file. InstructionCombining.cpp is now down to a svelte 9300 lines :) llvm-svn: 92468	2010-01-04 07:53:58 +00:00
Chris Lattner	2188e40e4c	split instcombine of compares (visit[FI]Cmp) out to a new InstCombineCompares.cpp file. llvm-svn: 92467	2010-01-04 07:37:31 +00:00
Chris Lattner	6ea40f1542	update cmakefile llvm-svn: 92466	2010-01-04 07:19:55 +00:00
Chris Lattner	7e0449172c	move the 'SimplifyDemandedFoo' methods out to their own file, cutting 1K lines out of instcombine.cpp llvm-svn: 92465	2010-01-04 07:17:19 +00:00
Chris Lattner	35522b7465	split the instcombine class definition out to a header shared among the instcombine library. llvm-svn: 92463	2010-01-04 07:12:23 +00:00
Chris Lattner	b8906bda13	remove a ton of unneeded LLVMContext stuff. llvm-svn: 92462	2010-01-04 07:02:48 +00:00
Chris Lattner	66c2e54bcd	move InstCombineWorklist out to its own header. llvm-svn: 92461	2010-01-04 06:30:00 +00:00
Chris Lattner	e2b9da98b0	forgot to svn add these. llvm-svn: 92460	2010-01-04 06:28:20 +00:00
Chris Lattner	c0e6640d3a	move instcombine to its own library, it's past time. llvm-svn: 92459	2010-01-04 06:23:24 +00:00
Chris Lattner	2d91231d82	implement an instcombine xform needed by clang's codegen on the example in PR4216. This doesn't trigger in the testsuite, so I'd really appreciate someone scrutinizing the logic for correctness. llvm-svn: 92458	2010-01-04 06:03:59 +00:00
Chris Lattner	48218e42cd	pull my debug hooks out, I'm done with this xform for now. llvm-svn: 92446	2010-01-03 06:58:48 +00:00
Nick Lewycky	475d3d1215	Small cleanups, refactor some duplicated code into a single method. No functionality change. llvm-svn: 92445	2010-01-03 04:39:07 +00:00
Chris Lattner	fca0c8f93a	generalize the previous transformation to handle indexing into arrays of structs and other arrays, so long as all the subsequent indexes are constants. This triggers frequently for stuff like: @divisions = internal constant [29 x [2 x i32]] [[2 x i32] zeroinitializer, [2 x i32] [i32 0, i32 1], [2 x i32] [i32 0, i32 2], [2 x i32] [i32 0, i32 1], [2 x i32] zeroinitializer, [2 x i32] [i32 0, i32 1], [2 x i32] [i32 0, i32 1], [2 x i32] [i32 0, i32 2], [2 x i32] [i32 0, i32 2], [2 x i32] zeroinitializer, [2 x i32] zeroinitializer, [2 x i32] zeroinitializer, [2 x i32] [i32 0, i32 2], [2 x i32] [i32 0, i32 1], [2 x i32] zeroinitializer, [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 2]], align 32 ; <[29 x [2 x i32]]> [#uses=50] %623 = getelementptr inbounds [29 x [2 x i32]] @divisions, i64 0, i64 %619, i64 0 ; <i32*> [#uses=1] %684 = icmp eq i32 %683, 999 also for the "my_defs" table in 'gs', etc. llvm-svn: 92444	2010-01-03 03:03:27 +00:00
Nick Lewycky	ff9cd7ace7	Cleanup. llvm-svn: 92436	2010-01-03 00:55:31 +00:00
Chris Lattner	98ad2b56cc	teach instcombine to optimize idioms like A[i]&42 == 0. This occurs in 403.gcc in mode_mask_array, in safe-ctype.c (which is copied in multiple apps) in _sch_istable, etc. llvm-svn: 92427	2010-01-02 22:08:28 +00:00
Chris Lattner	b56bef45f8	Teach the table lookup optimization to generate range compares when a consecutive sequence of elements all satisfies the predicate. Like the double compare case, this generates better code than the magic constant case and generalizes to more than 32/64 element array lookups. Here are some examples where it triggers. From 403.gcc, most accesses to the rtx_class array are handled, e.g.: @rtx_class = constant [153 x i8] c"xxxxxmmmmmmmmxxxxxxxxxxxxmxxxxxxiiixxxxxxxxxxxxxxxxxxxooxooooooxxoooooox3x2c21c2222ccc122222ccccaaaaaa<<<<<<<<<<<<<<<<<<111111111111bbooxxxxxxxxxxcc2211x", align 32 ; <[153 x i8]> [#uses=547] %142 = icmp eq i8 %141, 105 @rtx_class = constant [153 x i8] c"xxxxxmmmmmmmmxxxxxxxxxxxxmxxxxxxiiixxxxxxxxxxxxxxxxxxxooxooooooxxoooooox3x2c21c2222ccc122222ccccaaaaaa<<<<<<<<<<<<<<<<<<111111111111bbooxxxxxxxxxxcc2211x", align 32 ; <[153 x i8]> [#uses=543] %165 = icmp eq i8 %164, 60 Also, most of the 59-element arrays (mode_class/rid_to_yy, etc) optimized before are actually range compares. This lets 32-bit machines optimize them. 
400.perlbmk has stuff like this: 400.perlbmk: PL_regkind, even for 32-bit: @PL_regkind = constant [62 x i8] c"\00\00\02\02\02\06\06\06\06\09\09\0B\0B\0D\0E\0E\0E\11\12\12\14\14\16\16\18\18\1A\1A\1C\1C\1E\1F !!!$$&'((((,-.///88886789:;8$", align 32 ; <[62 x i8]> [#uses=4] %811 = icmp ne i8 %810, 33 @PL_utf8skip = constant [256 x i8] c"\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\03\03\03\03\03\03\03\03\03\03\03\03\03\03\03\03\04\04\04\04\04\04\04\04\05\05\05\05\06\06\07\0D", align 32 ; <[256 x i8]> [#uses=94] %12 = icmp ult i8 %10, 2 etc. llvm-svn: 92426	2010-01-02 21:50:18 +00:00
Chris Lattner	e199d2df80	theoretically the negate we find could be in a different function, check for this case. llvm-svn: 92425	2010-01-02 21:46:33 +00:00
Chris Lattner	2fa4ec70fc	use enums for the over/underdefined markers for clarity. Switch to using -2/-3 instead of -1/-2 for a future xform. llvm-svn: 92423	2010-01-02 20:20:33 +00:00
Chris Lattner	351e22aa36	remove the random sampling framework, which is not maintained anymore. If there is interest, it can be resurrected from SVN. PR4912. llvm-svn: 92422	2010-01-02 20:07:03 +00:00
Nick Lewycky	a67519be12	Fix logic error in previous commit. The != case needs to become an or, not an and. llvm-svn: 92419	2010-01-02 16:14:56 +00:00
Nick Lewycky	357d41b3c1	Optimize pointer comparison into the typesafe form, now that the backends will handle them efficiently. This is the opposite direction of the transformation we used to have here. llvm-svn: 92418	2010-01-02 15:25:44 +00:00
Chris Lattner	cfda435c73	Generalize the previous xform to handle cases where exactly two elements match or don't match with two comparisons. For example, the testcase compiles into: define i1 @test5(i32 %X) { %1 = icmp eq i32 %X, 2 ; <i1> [#uses=1] %2 = icmp eq i32 %X, 7 ; <i1> [#uses=1] %R = or i1 %1, %2 ; <i1> [#uses=1] ret i1 %R } This generalizes the previous xforms when the array is larger than 64 elements (and this case matches) and generates better code for cases where it overlaps with the magic bitshift case. This generalizes more cases than you might expect. For example, 400.perlbmk has: @PL_utf8skip = constant [256 x i8] c"\01\01\01\... %15 = icmp ult i8 %7, 7 403.gcc has: @rid_to_yy = internal constant [114 x i16] [i16 259, i16 260, ... %18 = icmp eq i16 %16, 295 and xalancbmk has a bunch of examples, such as _ZN11xercesc_2_5L15gCombiningCharsE and _ZN11xercesc_2_5L10gBaseCharsE. llvm-svn: 92417	2010-01-02 09:35:17 +00:00
Chris Lattner	c6ac078423	fix a miscompilation I introduced of cdecl with a late change. llvm-svn: 92416	2010-01-02 09:22:13 +00:00
Chris Lattner	935a4a606a	enhance the compare/load/index optimization to work on any load from a global with 32/64 elements or less (depending on whether i64 is native on the target), generating a bitshift idiom to determine the result. For example, on test4 we produce: define i1 @test4(i32 %X) { %1 = lshr i32 933, %X ; <i32> [#uses=1] %2 = and i32 %1, 1 ; <i32> [#uses=1] %R = icmp ne i32 %2, 0 ; <i1> [#uses=1] ret i1 %R } This triggers in a number of interesting cases, for example, here's an fp case: @A.3255 = internal constant [4 x double] [double 4.100000e+00, double -3.900000e+00, double -1.000000e+00, double 1.000000e+00], align 32 ; <[4 x double]> [#uses=7] ... %7 = fcmp olt double %3, 0.000000e+00 In this case we make the slen2_tab global dead, which is nice: @slen2_tab = internal constant [16 x i32] [i32 0, i32 1, i32 2, i32 3, i32 0, i32 1, i32 2, i32 3, i32 1, i32 2, i32 3, i32 1, i32 2, i32 3, i32 2, i32 3], align 32 ; <[16 x i32]> [#uses=1] ... %204 = icmp eq i32 %46, 0 Perl has a bunch of these, also on the 'Perl_regkind' array: @Perl_yygindex = internal constant [51 x i16] [i16 0, i16 0, i16 0, i16 0, i16 374, i16 351, i16 0, i16 -12, i16 0, i16 946, i16 413, i16 -83, i16 0, i16 0, i16 0, i16 -311, i16 -13, i16 4007, i16 2893, i16 0, i16 0, i16 0, i16 0, i16 0, i16 372, i16 -8, i16 0, i16 0, i16 246, i16 -131, i16 43, i16 86, i16 208, i16 -45, i16 -169, i16 987, i16 0, i16 0, i16 0, i16 0, i16 308, i16 0, i16 -271, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0], align 32 ; <[51 x i16]> [#uses=1] ... 
%1364 = icmp eq i16 %1361, 0 186.crafty really likes this on 64-bit machines, because it triggers on a bunch of globals like this: @white_outpost = internal constant [64 x i8] c"\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\02\02\00\00\00\00\00\04\05\05\04\00\00\00\00\03\06\06\03\00\00\00\00\00\01\01\00\00\00\00\00\00\00\00\00\00\00", align 32 ; <[64 x i8]> [#uses=2] However the big winner is 403.gcc, which triggers hundreds of times, eliminating all the accesses to the 57-element arrays 'mode_class', mode_unit_size, mode_bitsize, regclass_map, etc. go 64-bit machines :) llvm-svn: 92415	2010-01-02 08:56:52 +00:00
Chris Lattner	b1567bd584	enhance the previous optimization to work with fcmp in addition to icmp. llvm-svn: 92412	2010-01-02 08:20:51 +00:00
Chris Lattner	a061859ccc	Teach instcombine to fold compares of loads from constant arrays with variable indices into a comparison of the index with a constant. The most common occurrence of this that I see by far is stuff like: if ("foobar"[i] == '\0') ... which we compile into: if (i == 6), saving a load and materialization of the global address. This also exposes loop trip count information to later passes in many cases. This triggers hundreds of times in xalancbmk, which is where I first noticed it, but it also triggers in many other apps. Here are a few interesting ones from various apps: @must_be_connected_without = internal constant [8 x i8] [i8 getelementptr inbounds ([3 x i8]* @.str64320, i64 0, i64 0), i8* getelementptr inbounds ([3 x i8]* @.str27283, i64 0, i64 0), i8* getelementptr inbounds ([4 x i8]* @.str71327, i64 0, i64 0), i8* getelementptr inbounds ([4 x i8]* @.str72328, i64 0, i64 0), i8* getelementptr inbounds ([3 x i8]* @.str18274, i64 0, i64 0), i8* getelementptr inbounds ([6 x i8]* @.str11267, i64 0, i64 0), i8* getelementptr inbounds ([3 x i8]* @.str32288, i64 0, i64 0), i8* null], align 32 ; <[8 x i8]> [#uses=2] %scevgep.i = getelementptr [8 x i8] @must_be_connected_without, i64 0, i64 %indvar.i ; <i8*> [#uses=1] %17 = load ... %18 = icmp eq i8 %17, null ; <i1> [#uses=1] -> icmp eq i64 %indvar.i, 7 @yytable1095 = internal constant [84 x i8] c"\12\01(\05\06\07\08\09\0A\0B\0C\0D\0E1\0F\10\11266\1D: \10\11,-,0\03'\10\11B6\04\17&\18\1945\05\06\07\08\09\0A\0B\0C\0D\0E\1E\0F\10\11\1A\1B\1C$3+>#%;<IJ=ADFEGH9KL\00\00\00C", align 32 ; <[84 x i8]> [#uses=2] %57 = getelementptr inbounds [84 x i8]* @yytable1095, i64 0, i64 %56 ; <i8> [#uses=1] %mode.0.in = getelementptr inbounds [9 x i32] @mb_mode_table, i64 0, i64 %.pn ; <i32> [#uses=1] load ... 
%64 = icmp eq i8 %58, 4 ; <i1> [#uses=1] -> icmp eq i64 %.pn, 35 ; <i1> [#uses=0] @gsm_DLB = internal constant [4 x i16] [i16 6554, i16 16384, i16 26214, i16 32767] %scevgep.i = getelementptr [4 x i16] @gsm_DLB, i64 0, i64 %indvar.i ; <i16*> [#uses=1] %425 = load %scevgep.i %426 = icmp eq i16 %425, -32768 ; <i1> [#uses=0] -> false llvm-svn: 92411	2010-01-02 08:12:04 +00:00
Chris Lattner	2e4be2c340	remove the instcombine transformations that are inserting nasty pointer to int casts that confuse later optimizations. See PR3351 for details. This improves but doesn't completely fix 483.xalancbmk because llvm-gcc does this xform in GCC's "fold" routine as well. Clang++ will do better I guess. llvm-svn: 92408	2010-01-02 00:31:05 +00:00
Chris Lattner	faf1337acb	add a simple instcombine xform, simplify another one to use hasAllZeroIndices() instead of hand rolling a loop. llvm-svn: 92403	2010-01-01 23:09:08 +00:00
Chris Lattner	30c0a2833d	generalize the pointer difference optimization to handle a constantexpr gep on the 'base' side of the expression. This completes comment #4 in PR3351, which comes from 483.xalancbmk. llvm-svn: 92402	2010-01-01 22:42:29 +00:00
Chris Lattner	4394f71752	teach instcombine to optimize pointer difference idioms involving constant expressions. This is a step towards comment #4 in PR3351. llvm-svn: 92401	2010-01-01 22:29:12 +00:00
Chris Lattner	9d4c5414bb	use 'match' to simplify some code. llvm-svn: 92400	2010-01-01 22:12:03 +00:00
Chris Lattner	25c87e9cf9	implement the transform requested in PR5284 llvm-svn: 92398	2010-01-01 18:34:40 +00:00
Chris Lattner	ee1f861d81	add missing line. llvm-svn: 92384	2010-01-01 01:54:08 +00:00
Chris Lattner	8330daf733	add a few trivial instcombines for llvm.powi. llvm-svn: 92383	2010-01-01 01:52:15 +00:00
Chris Lattner	0c59ac3f41	When factoring multiply expressions across adds, factor both positive and negative forms of constants together. This allows us to compile: int foo(int x, int y) { return (x-y) + (x-y) + (x-y); } into: _foo: ## @foo subl %esi, %edi leal (%rdi,%rdi,2), %eax ret instead of (where the 3 and -3 were not factored): _foo: imull $-3, 8(%esp), %ecx imull $3, 4(%esp), %eax addl %ecx, %eax ret this started out as: movl 12(%ebp), %ecx imull $3, 8(%ebp), %eax subl %ecx, %eax subl %ecx, %eax subl %ecx, %eax ret This comes from PR5359. llvm-svn: 92381	2010-01-01 01:13:15 +00:00
Chris Lattner	a552683fd4	clean up some comments. llvm-svn: 92377	2010-01-01 00:04:26 +00:00
Chris Lattner	17229a7cb8	switch from std::map to DenseMap for rank data structures. llvm-svn: 92375	2010-01-01 00:01:34 +00:00
Chris Lattner	fed3397654	reuse negates where possible instead of always creating them from scratch. This allows us to optimize test12 into: define i32 @test12(i32 %X) { %factor = mul i32 %X, -3 ; <i32> [#uses=1] %Z = add i32 %factor, 6 ; <i32> [#uses=1] ret i32 %Z } instead of: define i32 @test12(i32 %X) { %Y = sub i32 6, %X ; <i32> [#uses=1] %C = sub i32 %Y, %X ; <i32> [#uses=1] %Z = sub i32 %C, %X ; <i32> [#uses=1] ret i32 %Z } llvm-svn: 92373	2009-12-31 20:34:32 +00:00
Chris Lattner	60c2ca743d	we don't need a smallptrset to detect duplicates, the values are sorted, so we can just do a linear scan. llvm-svn: 92372	2009-12-31 19:49:01 +00:00
Chris Lattner	1d8979422a	make reassociate more careful about not leaving around dead mul's llvm-svn: 92370	2009-12-31 19:34:45 +00:00
Chris Lattner	ed18917665	remove debug llvm-svn: 92369	2009-12-31 19:25:19 +00:00
Chris Lattner	60b71b5c4d	teach reassociate to factor x+x+x -> x*3. While I'm at it, fix RemoveDeadBinaryOp to actually do something. llvm-svn: 92368	2009-12-31 19:24:52 +00:00
Chris Lattner	38abecbad0	change reassociate to use SmallVector for its key datastructures instead of std::vector. llvm-svn: 92366	2009-12-31 18:40:32 +00:00
Chris Lattner	ac61550504	change an if to an assert, fix comment. llvm-svn: 92364	2009-12-31 18:18:46 +00:00
Chris Lattner	177140ad12	move the rest of the add optimization code out to OptimizeAdd, improve some comments, simplify a bit of code. llvm-svn: 92363	2009-12-31 18:17:13 +00:00
Chris Lattner	ba1f36aa99	factor statistic updating better. llvm-svn: 92362	2009-12-31 17:51:05 +00:00
Chris Lattner	4e3a5678af	simple fix for an incorrect factoring which causes a miscompilation, PR5458. llvm-svn: 92354	2009-12-31 08:33:49 +00:00
Chris Lattner	5f8a005d38	factor code out into helper functions. llvm-svn: 92347	2009-12-31 07:59:34 +00:00
Chris Lattner	f5c2b8b8d7	switch some std::vector's to smallvector. Reduce nesting. llvm-svn: 92346	2009-12-31 07:48:51 +00:00
Chris Lattner	9039ff8912	use more modern datastructures. llvm-svn: 92344	2009-12-31 07:33:14 +00:00
Chris Lattner	bc1512c8d1	clean up -debug output. llvm-svn: 92343	2009-12-31 07:17:37 +00:00
Chris Lattner	6a0ca6aa90	fix Analysis/DebugInfo.h to not include Metadata.h. Do this by moving one method out of line and eliminating redundant checks from other methods. llvm-svn: 92337	2009-12-31 03:02:08 +00:00
Chris Lattner	9b493028df	rename "elements" of metadata to "operands". "Elements" are things that occur in types. "operands" are things that occur in values. llvm-svn: 92322	2009-12-31 01:22:29 +00:00
Benjamin Kramer	756d7086c1	Use an array instead of a SmallVector. llvm-svn: 92264	2009-12-29 11:04:52 +00:00
Chris Lattner	22e13ba4e5	prune #includes. llvm-svn: 92260	2009-12-29 09:12:29 +00:00
Chris Lattner	a0566979b7	Final step in the metadata API restructuring: move the getMDKindID/getMDKindNames methods to LLVMContext (and add convenience methods to Module), eliminating MetadataContext. Move the state that it maintains out to LLVMContext. llvm-svn: 92259	2009-12-29 09:01:33 +00:00
Chris Lattner	6311212bf9	remove useless argument. llvm-svn: 92256	2009-12-29 08:03:58 +00:00
Chris Lattner	2f2aa2b067	This is a major cleanup of the instruction metadata interfaces that I asked Devang to do back on Sep 27. Instead of going through the MetadataContext class with methods like getMD() and getMDs(), just ask the instruction directly for its metadata with getMetadata() and getAllMetadata(). This includes a variety of other fixes and improvements: previously all Value*'s were bloated because the HasMetadata bit was thrown into value, adding a 9th bit to a byte. Now this is properly sunk down to the Instruction class (the only place where it makes sense) and it will be folded away somewhere soon. This also fixes some confusion in getMDs and its clients about whether the returned list is indexed by the MDID or densely packed. This is now returned sorted and densely packed and the comments make this clear. This introduces a number of fixme's which I'll follow up on. llvm-svn: 92235	2009-12-28 23:41:32 +00:00
Chris Lattner	17079fc0fa	split code that doesn't need to be templated out of IRBuilder into a new non-templated IRBuilderBase class. Move that large CreateGlobalString out of line, eliminating the need to #include GlobalVariable.h in IRBuilder.h llvm-svn: 92227	2009-12-28 21:28:46 +00:00
Chris Lattner	7093946ab1	rename getMDKind -> getMDKindID, make it autoinsert if an MD Kind doesn't exist already, eliminate registerMDKind. Tidy up a bunch of random stuff. llvm-svn: 92225	2009-12-28 20:45:51 +00:00
Chris Lattner	f8d22fc77d	Metadata.h doesn't need to include ValueHandle.h anymore. llvm-svn: 92211	2009-12-28 08:20:46 +00:00
Chris Lattner	1a32ede6fd	move an optimization for memcmp out of simplifylibcalls and into SDISel. This optimization was causing simplifylibcalls to introduce type-unsafe nastiness. This is the first step, I'll be expanding the memcmp optimizations shortly, covering things that we really really wouldn't want simplifylibcalls to do. llvm-svn: 92098	2009-12-24 00:37:38 +00:00
Chris Lattner	efebb234b7	reorder to follow a normal fall-through style, no functionality change. llvm-svn: 92084	2009-12-23 23:24:51 +00:00
David Greene	2330f78075	Remove dump routine and the associated Debug.h from a header. Patch up other files to compensate. llvm-svn: 92075	2009-12-23 22:58:38 +00:00
Eric Christopher	fdb33458fc	Update objectsize intrinsic and associated dependencies. Fix lowering code and update testcases. llvm-svn: 91979	2009-12-23 02:51:48 +00:00
Chris Lattner	c0f6402a94	Fix the Convert to scalar to not insert dead loads in the store case. The load is needed when we have a small store into a large alloca (at which point we get a load/insert/store sequence), but when you do a full-sized store, this load ends up being dead. This dead load is bad in really large nasty testcases where the load ends up causing mem2reg to insert large chains of dependent phi nodes which only ADCE can delete. Instead of doing this, just don't insert the dead load. This fixes rdar://6864035 llvm-svn: 91917	2009-12-22 19:33:28 +00:00
Chris Lattner	fda3b559e6	fix some fixme's by using twines llvm-svn: 91916	2009-12-22 19:23:33 +00:00
Bob Wilson	62a84ea8e3	Generalize SROA to allow the first index of a GEP to be non-zero. Add a missing check that an array reference doesn't go past the end of the array, and remove some redundant checks for in-bound array and vector references that are no longer needed. llvm-svn: 91897	2009-12-22 06:57:14 +00:00
Chris Lattner	f21a220bcd	Implement PR5795 by merging duplicated return blocks. This could go further by merging all returns in a function into a single one, but simplifycfg currently likes to duplicate the return (an unfortunate choice!) llvm-svn: 91890	2009-12-22 06:07:30 +00:00
Chris Lattner	9b7d99eb76	The phi translated pointer can be computed when returning a partially cached result instead of stored. This reduces memdep memory usage, and also eliminates a bunch of weakvh's. This speeds up gvn on gcc.c-torture/20001226-1.c from 23.9s to 8.45s (2.8x) on a different machine than earlier. llvm-svn: 91885	2009-12-22 04:25:02 +00:00
Eric Christopher	ab6a0d60d5	Whitespace fixes. llvm-svn: 91875	2009-12-22 01:23:51 +00:00
Daniel Dunbar	c661a2d4d8	Add suggested parentheses. llvm-svn: 91853	2009-12-21 23:27:57 +00:00
Chris Lattner	bf20018423	Add a fastpath to Load GVN to special case when we have exactly one dominating load to avoid even messing around with SSAUpdate at all. In this case (which is very common), we can just use the input value directly. This speeds up GVN time on gcc.c-torture/20001226-1.c from 36.4s to 16.3s, which still isn't great, but substantially better and this is a simple speedup that applies to lots of different cases. llvm-svn: 91851	2009-12-21 23:15:48 +00:00
Chris Lattner	927b0ac4b2	refactor some code out to a new helper method. llvm-svn: 91849	2009-12-21 23:04:33 +00:00
Chris Lattner	eaa25da8bb	improve indentation, avoid a pointless conversion from weakvh to trackingvh, no functionality change. llvm-svn: 91848	2009-12-21 22:43:03 +00:00
Bob Wilson	88a0598fe8	Remove special-case SROA optimization of variable indexes to one-element and two-element arrays. After restructuring the SROA code, it was not safe to do this without adding more checking. It is not clear that this special-case has really been useful, and removing this simplifies the code quite a bit. llvm-svn: 91828	2009-12-21 18:39:47 +00:00
Chris Lattner	4edfcb88e1	revert r89298, which was committed without a testcase. I think the underlying PHI node insertion issue in SSAUpdate is fixed. llvm-svn: 91821	2009-12-21 07:45:57 +00:00
Chris Lattner	8fb07c5a21	fix PR5837 by having SSAUpdate reuse phi nodes for the 'GetValueInMiddleOfBlock' case, instead of inserting duplicates. A similar fix is almost certainly needed by the machine-level SSAUpdate implementation. llvm-svn: 91820	2009-12-21 07:16:11 +00:00
Chris Lattner	d4fb4296df	give instcombine some helper functions for matching MIN and MAX, and implement some optimizations for MIN(MIN()) and MAX(MAX()) and MIN(MAX()) etc. This substantially improves the code in PR5822 but doesn't kick in much elsewhere. 2 max's were optimized in pairlocalalign and one in smg2000. llvm-svn: 91814	2009-12-21 06:03:05 +00:00
Chris Lattner	ffbd02829c	enhance x-(-A) -> x+A to preserve NUW/NSW. Use the presence of NSW/NUW to fold "icmp (x+cst), x" to a constant in cases where it would otherwise be undefined behavior. Surprisingly (to me at least), this triggers hundreds of times in a few benchmarks: lencode, ldecod, and 466.h264ref seem to really like this. llvm-svn: 91812	2009-12-21 04:04:05 +00:00
Chris Lattner	900ce231f9	Optimize all cases of "icmp (X+Cst), X" to something simpler. This triggers a bunch in lencode, ldecod, spass, 176.gcc, 252.eon, among others. It is also the first part of PR5822 llvm-svn: 91811	2009-12-21 03:19:28 +00:00
Douglas Gregor	740ab38bb7	Fix a bunch of little errors that Clang complains about when it's being pedantic llvm-svn: 91764	2009-12-19 07:05:23 +00:00
Chris Lattner	4ad5eba568	fix PR5827 by disabling the phi slicing transformation in a case where instcombine would have to split a critical edge due to a phi node of an invoke. Since instcombine can't change the CFG, it has to bail out from doing the transformation. llvm-svn: 91763	2009-12-19 07:01:15 +00:00
Bob Wilson	c16811b575	Update my SROA changes in response to review. * change FindElementAndOffset to return a uint64_t instead of unsigned, and to identify the type to be used for that result in a GEP instruction. * move "isa<ConstantInt>" to be first in conditional. * replace some dyn_casts with casts. * add a comment about handling mem intrinsics. llvm-svn: 91762	2009-12-19 06:53:17 +00:00
Bob Wilson	532cd232fb	Reapply 91459 with a simple fix for the problem that broke the x86_64-darwin bootstrap. This also replaces the WeakVH references that Chris objected to with normal Value references. llvm-svn: 91711	2009-12-18 20:14:40 +00:00
Eli Friedman	86b9d75dc8	Optimize icmp of null and select of two constants even if the select has multiple uses. (The construct in question was found in gcc.) llvm-svn: 91675	2009-12-18 08:22:35 +00:00
Dan Gohman	57e808628c	Eliminate unnecessary uses of <cstdio>. llvm-svn: 91666	2009-12-18 03:25:51 +00:00
Dan Gohman	18fa5686f6	Add Loop contains utility methods for testing whether a loop contains another loop, or an instruction. The loop form is substantially more efficient on large loops than the typical code it replaces. llvm-svn: 91654	2009-12-18 01:24:09 +00:00
Dan Gohman	fd7231f1fe	Minor code simplification. llvm-svn: 91653	2009-12-18 01:20:44 +00:00
Dan Gohman	b1924e8a0f	Don't pass const pointers by reference. llvm-svn: 91647	2009-12-18 00:38:08 +00:00
Dan Gohman	1af1954852	Update a comment. llvm-svn: 91645	2009-12-18 00:28:43 +00:00
Dan Gohman	92c3696524	Reapply LoopStrengthReduce and IVUsers cleanups, excluding the part of 91296 that caused trouble -- the Processed list needs to be preserved for the lifetime of the pass, as AddUsersIfInteresting is called from other passes. llvm-svn: 91641	2009-12-18 00:06:20 +00:00
Eli Friedman	250b119d98	Allow instcombine to combine "sext(a) >u const" to "a >u trunc(const)". llvm-svn: 91631	2009-12-17 22:42:29 +00:00
Eli Friedman	7cc86b4cc6	Make the ptrtoint comparison simplification work if one side is a global. llvm-svn: 91624	2009-12-17 21:27:47 +00:00
Eli Friedman	5842c9968a	Slightly generalize transformation of memmove(a,a,n) so that it also applies to memcpy. (Such a memcpy is technically illegal, but in practice is safe and is generated by struct self-assignment in C code.) llvm-svn: 91621	2009-12-17 21:07:31 +00:00
Bob Wilson	f3927b7994	Re-revert 91459. It's breaking the x86_64 darwin bootstrap. llvm-svn: 91607	2009-12-17 18:34:24 +00:00
Evan Cheng	090ac0865a	Revert 91280-91283, 91286-91289, 91291, 91293, 91295-91296. It apparently introduced a non-deterministic behavior in the optimizer somewhere. llvm-svn: 91598	2009-12-17 09:39:49 +00:00
Daniel Dunbar	ab42d42390	Reapply r91459, it was only unmasking the bug, and since TOT is still broken having it reverted does no good. llvm-svn: 91559	2009-12-16 20:09:53 +00:00
Daniel Dunbar	133efc317e	Revert "Reapply 91184 with fixes and an addition to the testcase to cover the problem", this broke llvm-gcc bootstrap for release builds on x86_64-apple-darwin10. This reverts commit db22309800b224a9f5f51baf76071d7a93ce59c9. llvm-svn: 91534	2009-12-16 10:56:17 +00:00
Chris Lattner	f278addbdc	reapply my strstr optimization. I have reproduced the x86-64 bootstrap miscompile (i386.o miscompares) but it happens both with and without this patch. llvm-svn: 91532	2009-12-16 09:32:05 +00:00
Chris Lattner	177be32334	revert my strstr optimization, I'm told it breaks x86-64 bootstrap. Will reapply with a fix when I get a chance. llvm-svn: 91486	2009-12-16 00:46:02 +00:00
Bob Wilson	e44756d7c2	Reapply 91184 with fixes and an addition to the testcase to cover the problem found last time. Instead of trying to modify the IR while iterating over it, I've changed it to keep a list of WeakVH references to dead instructions, and then delete those instructions later. I also added some special case code to detect and handle the situation when both operands of a memcpy intrinsic are referencing the same alloca. llvm-svn: 91459	2009-12-15 22:00:51 +00:00
Chris Lattner	26ab363361	optimize strstr, PR5783 llvm-svn: 91438	2009-12-15 19:14:40 +00:00
Dan Gohman	265ce318b8	Delete an unused function. llvm-svn: 91432	2009-12-15 16:30:09 +00:00
Chris Lattner	24aba42d04	add some other xforms that should be done as part of PR5783 llvm-svn: 91428	2009-12-15 09:05:13 +00:00
Chris Lattner	45d040bd85	Remove isPod() from DenseMapInfo, splitting it out to its own isPodLike type trait. This is a generally useful type trait for more than just DenseMap, and we really care about whether something acts like a pod, not whether it really is a pod. llvm-svn: 91421	2009-12-15 07:26:43 +00:00
Dan Gohman	fbeec7270c	Fix a thinko; isNotAlreadyContainedIn had a built-in negative, so the condition was inverted when the code was converted to contains(). llvm-svn: 91295	2009-12-14 17:31:01 +00:00
Dan Gohman	416d5b7361	Remove unnecessary #includes. llvm-svn: 91293	2009-12-14 17:19:06 +00:00
Dan Gohman	163fb26927	Instead of having a ScalarEvolution pointer member in BasedUser, just pass the ScalarEvolution pointer into the functions which need it. llvm-svn: 91289	2009-12-14 17:12:51 +00:00
Dan Gohman	8dbd4e3d16	Don't bother cleaning up if there's nothing to clean up. llvm-svn: 91288	2009-12-14 17:10:44 +00:00
Dan Gohman	88c7e61c5b	Delete an unused variable. llvm-svn: 91287	2009-12-14 17:08:09 +00:00
Dan Gohman	838f604543	LSR itself doesn't need LoopInfo. llvm-svn: 91283	2009-12-14 17:02:34 +00:00
Dan Gohman	273e692952	LSR itself doesn't need DominatorTree. llvm-svn: 91282	2009-12-14 16:57:08 +00:00
Dan Gohman	c3513095cf	Remove the code in LSR that manually hoists expansions out of loops; SCEVExpander does this automatically. llvm-svn: 91281	2009-12-14 16:52:55 +00:00
Dan Gohman	ec2a7c58e8	Minor code cleanups. llvm-svn: 91280	2009-12-14 16:37:29 +00:00
Chris Lattner	aaa6ac10a6	revert r91184, because it causes a crash on a .bc file I just sent to Bob. llvm-svn: 91268	2009-12-14 05:11:02 +00:00
Chandler Carruth	dcf5dacb2c	Don't leave pointers uninitialized in the default constructor. GCC complains about the potential use of these uninitialized members under certain conditions. llvm-svn: 91239	2009-12-13 07:04:45 +00:00
Bob Wilson	895f364ae6	Revise scalar replacement to be more flexible about handling bitcasts and GEPs. While scanning through the uses of an alloca, keep track of the current offset relative to the start of the alloca, and check memory references to see if the offset & size correspond to a component within the alloca. This has the nice benefit of unifying much of the code from isSafeUseOfAllocation, isSafeElementUse, and isSafeUseOfBitCastedAllocation. The code to rewrite the uses of a promoted alloca, after it is determined to be safe, is reorganized in the same way. Also, when rewriting GEP instructions, mark them as "in-bounds" since all the indices are known to be safe. llvm-svn: 91184	2009-12-11 23:47:40 +00:00
Eric Christopher	22889c049d	Make sure the immediate dominator isn't NULL through iterations of the loop. We could get to this condition via indirect branches. llvm-svn: 91009	2009-12-10 00:25:41 +00:00
Chris Lattner	9ccc879006	Fix PR5744, a case where we were getting the pointer size instead of the value size. This only manifested when memdep imprecisely returns clobber, which is due to a caching issue in the PR5744 testcase. We can 'efficiently emulate' this by using '-no-aa' llvm-svn: 91004	2009-12-10 00:11:45 +00:00
Chris Lattner	3ddf804f78	allow this to build when the #if 0's are enabled. No functionality change. llvm-svn: 90999	2009-12-10 00:04:46 +00:00
Dan Gohman	72c367fb52	Dereference loopHeader after checking for null rather than before. llvm-svn: 90990	2009-12-09 22:55:01 +00:00
Chris Lattner	ca5f9cb18b	fix the last remaining known (by me) phi translation bug. When we reanalyze clobbers to forward pieces of large stores to small loads, we need to consider the properly phi translated pointer in the store block. llvm-svn: 90978	2009-12-09 18:21:46 +00:00
Chris Lattner	f8ba1253f1	change GetStoreValueForLoad to use IRBuilder, which is cleaner and implicitly constant folds. llvm-svn: 90977	2009-12-09 18:13:28 +00:00
Bob Wilson	1c5a6fb299	Fix a comment. llvm-svn: 90975	2009-12-09 18:05:27 +00:00
Chris Lattner	07df9efb35	change AnalyzeLoadFromClobberingMemInst/AnalyzeLoadFromClobberingStore to require the load ty/ptr to be passed in, no functionality change. llvm-svn: 90960	2009-12-09 07:37:07 +00:00
Chris Lattner	0def861ee9	change AnalyzeLoadFromClobberingWrite and clients to pass in type and pointer instead of the load. No functionality change. llvm-svn: 90959	2009-12-09 07:34:10 +00:00
Chris Lattner	0c31547168	change NonLocalDepEntry from being a typedef for an std::pair to be its own small class. No functionality change. llvm-svn: 90956	2009-12-09 07:08:01 +00:00
Chris Lattner	946b58dd90	add some aborts to #if 0's. llvm-svn: 90929	2009-12-09 02:41:54 +00:00
Chris Lattner	972e6d8d00	Switch GVN and memdep to use PHITransAddr, which correctly handles phi translation of complex expressions like &A[i+1]. This has the following benefits: 1. The phi translation logic is all contained in its own class with a strong interface and verification that it is self consistent. 2. The logic is more correct than before. Previously, if intermediate expressions got PHI translated, we'd miss the update and scan for the wrong pointers in predecessor blocks. @phi_trans2 is a testcase for this. 3. We have a lot less code in memdep. We can handle phi translation across blocks of things like @phi_trans3, which is pretty insane :). This patch should fix the miscompiles of 255.vortex, and I tested it with a bootstrap of llvm-gcc, llvm-test and dejagnu of course. llvm-svn: 90926	2009-12-09 01:59:31 +00:00
Bob Wilson	c5d082fd5d	Some superficial cleanups. llvm-svn: 90866	2009-12-08 18:27:03 +00:00
Bob Wilson	2029ea04f9	Clean up dead operands left around after SROA replaces a mem intrinsic. I'm not aware that this does anything significant on its own, but it's needed for another patch that I'm working on. llvm-svn: 90864	2009-12-08 18:22:03 +00:00
Duncan Sands	6a3df7b0c7	Teach GlobalOpt to delete aliases with internal linkage (after forwarding any uses). GlobalDCE can also do this, but is only run at -O3. llvm-svn: 90850	2009-12-08 10:10:20 +00:00
Nick Lewycky	8bca014d7f	Remove unnecessary #include "llvm/LLVMContext.h". llvm-svn: 90836	2009-12-08 05:45:41 +00:00

6679 Commits