llvm-project

Commit Graph

Author	SHA1	Message	Date
Jakob Stoklund Olesen	8a19d3c96c	Move per-function inline threshold calculation to a method. No functional change except the forgotten test for InlineLimit.getNumOccurrences() == 0 in the CurrentThreshold2 calculation. llvm-svn: 94007	2010-01-20 17:51:28 +00:00
Victor Hernandez	f2462407ee	Switch Elts from vector to SmallVector llvm-svn: 93989	2010-01-20 06:56:16 +00:00
Victor Hernandez	5fa88d4e30	Map operands of all function-local metadata, not just metadata passed to llvm.dbg.declare intrinsics llvm-svn: 93979	2010-01-20 05:49:59 +00:00
Dan Gohman	ca19445d08	When doing address-mode sinking, expand the base register first, rather than the scaled register. This makes it more likely that subsequent AddrModeMatcher queries will match the new address the same way as the old, instead of accidentally matching what had been the base register as the new scaled register, and then failing to match the scaled register. This fixes some problems with address-mode sinking multiple muls into a block, which will be a lot more common with some upcoming LoopStrengthReduction changes. llvm-svn: 93935	2010-01-19 22:45:06 +00:00
Chris Lattner	18f49ce2d3	optimize ~(~X >>s Y) --> (X >>s Y), patch by Edmund Grimley Evans! llvm-svn: 93884	2010-01-19 18:16:19 +00:00
Bob Wilson	58d59fe394	Fix a crash in scalarrepl for memcpy/memmove where the source and destination are the same. I had already fixed a similar problem where the source and destination were different bitcasts derived from the same alloca, but the previous fix still did not handle the case where both operands are exactly the same value. Radar 7552893. llvm-svn: 93848	2010-01-19 04:32:48 +00:00
Eric Christopher	84bd316bd6	Fix comment. llvm-svn: 93831	2010-01-19 01:20:15 +00:00
Chris Lattner	43f2fa6201	my instcombine transformations to make extension elimination more aggressive changed the canonical form from sext(trunc(x)) to ashr(lshr(x)), make sure to transform a couple more things into that canonical form, and catch a case where we missed turning zext/shl/ashr into a single sext. llvm-svn: 93787	2010-01-18 22:19:16 +00:00
Devang Patel	696cb8d410	While mapping llvm.dbg.declare intrinsic manually map its operand, if possible, because it points to an alloca instruction through metadata. llvm-svn: 93757	2010-01-18 19:52:14 +00:00
Owen Anderson	cdea3572fa	Convert some of the dynamic opcode lookups into static ones. llvm-svn: 93693	2010-01-17 19:33:27 +00:00
Owen Anderson	fa1edea9ce	Fix comment. llvm-svn: 93679	2010-01-17 06:49:03 +00:00
Bob Wilson	e0da4b6cff	Fix a comment typo. llvm-svn: 93560	2010-01-15 21:55:02 +00:00
Bill Wendling	ad7a5b07a7	When the visitSub method was split into visitSub and visitFSub, this xform was added to the FSub version. However, the original version of this xform guarded against doing this for floating point (!Op0->getType()->isFPOrFPVector()). This is causing LLVM to perform incorrect xforms for code like: void func(double rhi, double rlo, double xh, double xl, double yh, double yl){ double mh, ml; double c = 134217729.0; double up, u1, u2, vp, v1, v2; up = xhc; u1 = (xh - up) + up; u2 = xh - u1; vp = yhc; v1 = (yh - vp) + vp; v2 = yh - v1; mh = xhyh; ml = (((u1v1 - mh) + (u1v2)) + (u2v1)) + (u2v2); ml += xhyl + xlyh; rhi = mh + ml; rlo = (mh - (rhi)) + ml; } The last line was optimized away, but rl is intended to be the difference between the infinitely precise result of mh + ml and after it has been rounded to double precision. llvm-svn: 93369	2010-01-13 23:23:17 +00:00
Chris Lattner	573da8ac90	1) Use the new SimplifyInstructionsInBlock routine instead of the copy in JT. 2) When cloning blocks for PHI or xor conditions, use instsimplify to simplify the code as we go. This allows us to squish common cases early in JT which opens up opportunities for subsequent iterations, and allows it to completely simplify the testcase. llvm-svn: 93253	2010-01-12 20:41:47 +00:00
Chris Lattner	7c743f2c74	add a helper function. llvm-svn: 93251	2010-01-12 19:40:54 +00:00
Chris Lattner	af7855d571	tidy up llvm-svn: 93222	2010-01-12 02:07:50 +00:00
Chris Lattner	eb73bdb2e1	Teach jump threading to duplicate small blocks when the branch condition is a xor with a phi node. This eliminates nonsense like this from 176.gcc in several places: LBB166_84: testl %eax, %eax - setne %al - xorb %cl, %al - notb %al - testb $1, %al - je LBB166_85 + je LBB166_69 + jmp LBB166_85 This is rdar://7391699 llvm-svn: 93221	2010-01-12 02:07:17 +00:00
Chris Lattner	6a19ed0b86	some cleanup, and make it obvious that ProcessJumpOnPHI only works on branches by renaming it and checking for a branch at the call site. llvm-svn: 93208	2010-01-11 23:41:09 +00:00
Chris Lattner	d1a3efedd8	reenable the piece that turns trunc(zext(x)) -> x even if zext has multiple uses, codegen has no apparent problem with the trunc version of this, because it turns into a simple subreg idiom llvm-svn: 93202	2010-01-11 22:49:40 +00:00
Chris Lattner	a6b1356cf9	Disable folding sext(trunc(x)) -> x (and other similar cast/cast cases) when the trunc has multiple uses. Codegen is not able to coalesce the subreg case correctly and so this leads to higher register pressure and spilling (see PR5997). This speeds up 256.bzip2 from 8.60 -> 8.04s on my machine, ~7%. llvm-svn: 93200	2010-01-11 22:45:25 +00:00
Chris Lattner	9518869423	add one more bitfield optimization, allowing clang to generate good code on PR4216: _test_bitfield: ## @test_bitfield orl $32962, %edi movl $4294941946, %eax andq %rdi, %rax ret instead of: _test_bitfield: movl $4294941696, %ecx movl %edi, %eax orl $194, %edi orl $32768, %eax andq $250, %rdi andq %rax, %rcx movq %rdi, %rax orq %rcx, %rax ret Evan is looking into the remaining andq+imm -> andl optimization. llvm-svn: 93147	2010-01-11 06:55:24 +00:00
Chris Lattner	0a85420409	Extend CanEvaluateZExtd to handle and/or/xor more aggressively in the BitsToClear case. This allows it to promote expressions which have an and/or/xor after the lshr, promoting cases like test2 (from PR4216) and test3 (random extample extracted from a spec benchmark). clang now compiles the code in PR4216 into: _test_bitfield: ## @test_bitfield movl %edi, %eax orl $194, %eax movl $4294902010, %ecx andq %rax, %rcx orl $32768, %edi andq $39936, %rdi movq %rdi, %rax orq %rcx, %rax ret instead of: _test_bitfield: ## @test_bitfield movl %edi, %eax orl $194, %eax movl $4294902010, %ecx andq %rax, %rcx shrl $8, %edi orl $128, %edi shlq $8, %rdi andq $39936, %rdi movq %rdi, %rax orq %rcx, %rax ret which is still not great, but is progress. llvm-svn: 93145	2010-01-11 04:05:13 +00:00
Chris Lattner	12bd8992b3	Remove the dead TD argument to CanEvaluateZExtd, and add a new BitsToClear result which allows us to start promoting expressions that end with a lshr-by-constant. This is conservatively correct and better than what we had before (see testcases) but still needs to be extended further. llvm-svn: 93144	2010-01-11 03:32:00 +00:00
Chris Lattner	172630abd2	improve comments, remove dead TD argument to CanEvaluateSExtd. llvm-svn: 93143	2010-01-11 02:43:35 +00:00
Chris Lattner	7dd540ee24	teach sext optimization to handle truncs from types that are not the dest of the sext. llvm-svn: 93128	2010-01-10 20:30:41 +00:00
Chris Lattner	39d2daa94c	teach zext optimization how to deal with truncs that don't come from the zext dest type. This allows us to handle test52/53 in cast.ll, and allows llvm-gcc to generate much better code for PR4216 in -m64 mode: _test_bitfield: ## @test_bitfield orl $32962, %edi movl %edi, %eax andl $-25350, %eax ret This also fixes a bug handling vector extends, ensuring that the mask produced is a vector constant, not an integer constant. llvm-svn: 93127	2010-01-10 20:25:54 +00:00
Chris Lattner	1a05fddcdc	simplify CanEvaluateSExtd to return a bool now that we have a simpler profitability predicate. llvm-svn: 93111	2010-01-10 07:57:20 +00:00
Chris Lattner	d7816780e2	the NumCastsRemoved argument to CanEvaluateSExtd is dead, remove it. llvm-svn: 93110	2010-01-10 07:42:21 +00:00
Chris Lattner	2fff10c424	now that the cost model has changed, we can always consider elimination of a sign extend to be a win, which simplifies the client of CanEvaluateSExtd, and allows us to eliminate more casts (examples taken from real code). llvm-svn: 93109	2010-01-10 07:40:50 +00:00
Chris Lattner	d8509424a4	change the preferred canonical form for a sign extension to be lshr+ashr instead of trunc+sext. We want to avoid type conversions whenever possible, it is easier to codegen expressions without truncates and extensions. llvm-svn: 93107	2010-01-10 07:08:30 +00:00
Chris Lattner	2b459fe7e1	fix indentation of switch statements, no functionality change. llvm-svn: 93106	2010-01-10 06:59:55 +00:00
Chris Lattner	127bbc715e	fix pasto that broke bootstrap. llvm-svn: 93105	2010-01-10 06:50:04 +00:00
Chris Lattner	b7be7cc486	simplify CanEvaluateZExtd now that we don't care about the number of bits known clear in the result and don't care about the # casts eliminated. TD is also dead but keeping it for now. llvm-svn: 93098	2010-01-10 02:50:04 +00:00
Chris Lattner	49d2c9764d	two changes: 1) don't try to optimize a sext or zext that is only used by a trunc, let the trunc get optimized first. This avoids some pointless effort in some common cases since instcombine scans down a block in the first pass. 2) Change the cost model for zext elimination to consider an 'and' cheaper than a zext. This allows us to do it more aggressively, and for the next patch to simplify the code quite a bit. llvm-svn: 93097	2010-01-10 02:39:31 +00:00
Chris Lattner	f0af17dab3	enhance CanEvaluateZExtd to handle shift left and sext, allowing more expressions to be promoted and casts eliminated. llvm-svn: 93096	2010-01-10 02:22:12 +00:00
Chris Lattner	7723e2b10f	remove an xform subsumed by EvaluateInDifferentType. llvm-svn: 93095	2010-01-10 01:35:55 +00:00
Julien Lerouge	321098ebec	Fix nondeterministic behavior. llvm-svn: 93093	2010-01-10 01:07:22 +00:00
Chris Lattner	c95a7a21b7	clean up this xform by using m_Trunc. llvm-svn: 93092	2010-01-10 01:04:31 +00:00
Chris Lattner	883550afe8	inline and remove the rest of commonIntCastTransforms. llvm-svn: 93091	2010-01-10 01:00:46 +00:00
Chris Lattner	c3aca38468	Inline the expression type promotion/demotion stuff out of commonIntCastTransforms into the callers, eliminating a switch, and allowing the static predicate methods to be moved down to live next to the corresponding function. No functionality change. llvm-svn: 93089	2010-01-10 00:58:42 +00:00
Chris Lattner	ab7087ad66	only factor from expressions whose uses are empty and whose base is the right expression type. This fixes PR5981. llvm-svn: 93045	2010-01-09 06:01:36 +00:00
Julien Lerouge	f50a3f19da	Fix nondeterministic behavior. llvm-svn: 93038	2010-01-09 01:06:49 +00:00
Eric Christopher	4a1d7e1506	Remove unnecessary dyn_cast and add a comment. Part of a WIP. llvm-svn: 93026	2010-01-08 21:37:11 +00:00
Chris Lattner	9242ae047c	mplement a theoretical fixme. llvm-svn: 93024	2010-01-08 19:28:47 +00:00
Chris Lattner	10840e9e13	rename CanEvaluateInDifferentType -> CanEvaluateTruncated and simplify it now that it is only used for truncates. llvm-svn: 93021	2010-01-08 19:19:23 +00:00
Chris Lattner	a1e223ea10	teach instcombine to delete sign extending shift pairs (sra(shl X, C), C) when the input is already sign extended. llvm-svn: 93019	2010-01-08 19:04:21 +00:00
Duncan Sands	4a8b15dc74	Suppress an unused variable warning when assertions are off; remove some trailing whitespace while there. llvm-svn: 93008	2010-01-08 17:51:48 +00:00
Chris Lattner	8c92b57df9	tidy up some stuff duncan pointed out. llvm-svn: 93007	2010-01-08 17:48:19 +00:00
Chris Lattner	35d3b9dcd0	teach ComputeNumSignBits to look through PHI nodes. llvm-svn: 92964	2010-01-07 23:44:37 +00:00
Chris Lattner	3057c37959	Enhance instcombine to reason more strongly about promoting computation that feeds into a zext, similar to the patch I did yesterday for sext. There is a lot of room for extension beyond this patch. llvm-svn: 92962	2010-01-07 23:41:00 +00:00

1 2 3 4 5 ...

6303 Commits