llvm-project

Commit Graph

Author	SHA1	Message	Date
Dan Gohman	c380cca7ae	Don't attempt to simplify an non-affine IV expression if it can't be simplified to a loop-invariant value. This fixes PR4315. llvm-svn: 72798	2009-06-03 19:11:31 +00:00
Dan Gohman	760377effc	Fix CodeGenPrepare's address-mode sinking to handle unusual addresses, involving Base values which do not have Pointer type. This fixes PR4297. llvm-svn: 72739	2009-06-02 21:29:13 +00:00
Eli Friedman	ee94e3cc9e	PR4286: Make RewriteLoadUserOfWholeAlloca and RewriteStoreUserOfWholeAlloca deal with tail padding because isSafeUseOfBitCastedAllocation expects them to. Otherwise, we crash trying to erase the bitcast. llvm-svn: 72688	2009-06-01 09:14:32 +00:00
Owen Anderson	cc0c75c74d	Be more aggressive in doing LoadPRE by tracing backwards when a block only has a single predecessor. Patch by Jakub Staszak. llvm-svn: 72661	2009-05-31 09:03:40 +00:00
Chris Lattner	221895303c	fix PR4284, a bug in simplifylibcalls handling memcmp. Patch by Benjamin Kramer! llvm-svn: 72625	2009-05-30 18:43:04 +00:00
Bill Wendling	006459ecd4	Enable GVN Load PRE. llvm-svn: 72589	2009-05-29 20:38:16 +00:00
Torok Edwin	0b0ddb21fe	just show the instruction, its not that slow. llvm-svn: 72577	2009-05-29 16:58:36 +00:00
Torok Edwin	6a94624a1b	for instructions with void type we have no choice but print the instruction as is, otherwise we get a <badref>. llvm-svn: 72567	2009-05-29 10:28:44 +00:00
Torok Edwin	72070282eb	Add a DEBUG() output to GVN that prints the instruction clobbering a load. This is useful when trying to figure out why GVN didn't eliminate redundant loads. llvm-svn: 72565	2009-05-29 09:46:03 +00:00
Owen Anderson	04cfdd38a2	Fix an issue where phiMap was not being updated properly when doing load PRE. Diagnosis and patch thanks to Jakub Staszak. llvm-svn: 72562	2009-05-29 05:37:54 +00:00
Nick Lewycky	206876e2da	Use Operands.data() instead of &Operands[0] where Operands is a potentially empty SmallVector. llvm-svn: 72512	2009-05-28 04:08:10 +00:00
Dan Gohman	4d1823680d	Revert 72493 and replace it with a more conservative fix, for now: don't rewrite the comparison if there is any implicit extension or truncation on the induction variable. I'm planning for IVUsers to eventually take over some of the work of this code, and for it to be generalized. llvm-svn: 72496	2009-05-27 21:10:47 +00:00
Dan Gohman	f4d85325c0	In ChangeCompareStride, when the stride to be reused is truncated to a smaller type, promoted its offset back up to the type of the new comparison. This fixes PR4222. llvm-svn: 72493	2009-05-27 20:00:18 +00:00
Dan Gohman	8ca0885d69	Change ScalarEvolution::getSCEVAtScope to always return the original value in the case where a loop exit value cannot be computed, instead of only in some cases while using SCEVCouldNotCompute in others. This simplifies getSCEVAtScope's callers. llvm-svn: 72375	2009-05-24 23:25:42 +00:00
Torok Edwin	26895b518b	Move Rewriter.clear() earlier, to avoid triggerring the AssertingVH by one of the RecursivelyDeleteTriviallyDeadInstructions. Add a comment explaining why the cache needs to be cleared. llvm-svn: 72372	2009-05-24 20:08:21 +00:00
Torok Edwin	5349cf5f4b	Instead of clearing the rewriter, don't attempt to rewrite dead phi nodes. Also fix 80 column violation. llvm-svn: 72371	2009-05-24 19:36:09 +00:00
Dan Gohman	4486da5b78	When rewriting the loop exit test with the canonical induction variable, leave the original comparison in place if it has other uses, since the other uses won't be dominated by the new comparison instruction. llvm-svn: 72369	2009-05-24 19:11:38 +00:00
Dan Gohman	fb56cf1b1d	When replacing a floating-point comparison with an integer comparison, use takeName to give the integer comparison a name. llvm-svn: 72367	2009-05-24 18:09:01 +00:00
Torok Edwin	d184bc209c	The rewriter may hold references to instructions that are deleted because they are trivially dead. Fix by clearing the rewriter cache before deleting the trivially dead instructions. Also make InsertedExpressions use an AssertingVH to catch these bugs easier. llvm-svn: 72364	2009-05-24 14:23:16 +00:00
Evan Cheng	a838a40bc4	Fix bug in FoldFCmp_IntToFP_Cst. If inttofp is a uintofp, use unsigned instead of signed integer constant. llvm-svn: 72300	2009-05-22 23:10:53 +00:00
Dan Gohman	781b75a7df	Teach IndVarSimplify's FixUsesBeforeDefs to handle InvokeInsts by assuming that the use of the value is in a block dominated by the "normal" destination. LangRef.html and other documentation sources don't explicitly guarantee this, but it seems to be assumed in other places in LLVM at least. This fixes an assertion failure on the included testcase, which is derived from the Ada testsuite. FixUsesBeforeDefs is a temporary measure which I'm looking to replace with a more capable solution. llvm-svn: 72266	2009-05-22 16:47:11 +00:00
Eli Friedman	0cf811df82	Fix loop-index-split to correctly preserve dominance frontiers. Part of PR4238. llvm-svn: 72244	2009-05-22 03:22:46 +00:00
Dan Gohman	bf0002e7c1	Teach ValueTracking a new way to analyze PHI nodes, and and teach Instcombine to be more aggressive about using SimplifyDemandedBits on shift nodes. This allows a shift to be simplified to zero in the included test case. llvm-svn: 72204	2009-05-21 02:28:33 +00:00
Dan Gohman	7248923a5d	Suppress the IV reversal transformation in the case that the RHS of the comparison is defined inside the loop. This fixes a use-before-def problem, because the transformation puts a use of the RHS outside the loop. llvm-svn: 72149	2009-05-20 00:34:08 +00:00
Dan Gohman	67587ce2e9	Remove an irrelevant comment. llvm-svn: 72132	2009-05-19 20:38:47 +00:00
Dan Gohman	97f70add3c	Add some more comments to the top of this file. llvm-svn: 72131	2009-05-19 20:37:36 +00:00
Dan Gohman	adc70d6806	Trim unneeded #includes. llvm-svn: 72130	2009-05-19 20:35:26 +00:00
Dan Gohman	2649491f9c	Teach SCEVExpander to expand arithmetic involving pointers into GEP instructions. It attempts to create high-level multi-operand GEPs, though in cases where this isn't possible it falls back to casting the pointer to i8* and emitting a GEP with that. Using GEP instructions instead of ptrtoint+arithmetic+inttoptr helps pointer analyses that don't use ScalarEvolution, such as BasicAliasAnalysis. Also, make the AddrModeMatcher more aggressive in handling GEPs. Previously it assumed that operand 0 of a GEP would require a register in almost all cases. It now does extra checking and can do more matching if operand 0 of the GEP is foldable. This fixes a problem that was exposed by SCEVExpander using GEPs. llvm-svn: 72093	2009-05-19 02:15:55 +00:00
Dan Gohman	14d1339579	Rename UseTy to AccessTy, for consistency with getAccessType, and to avoid ambiguity with the word "use" in IVStrideUse. llvm-svn: 72012	2009-05-18 16:45:28 +00:00
Dale Johannesen	f241df9abe	Use abs64 in one more place. llvm-svn: 71775	2009-05-14 16:47:34 +00:00
Chris Lattner	149546a6a0	calls in nothrow functions can be marked nothrow even if the callee is not known to be nothrow. This allows readnone/readonly functions to be deleted even if we don't know whether the callee can throw. llvm-svn: 71676	2009-05-13 17:39:14 +00:00
Chris Lattner	7e335a763a	Fix PR4206 - crash in simplify lib calls llvm-svn: 71644	2009-05-13 06:26:11 +00:00
Dale Johannesen	536de01bcf	Add an int64_t variant of abs, for host environments without one. Use it where we were using abs on int64_t objects. (I strongly suspect the casts to unsigned in the fragments in LoopStrengthReduce are not doing whatever the original intent was, but the obvious change to uint64_t doesn't work. Maybe later.) llvm-svn: 71612	2009-05-13 00:24:22 +00:00
Dan Gohman	d76d71a291	Factor the code for collecting IV users out of LSR into an IVUsers class, and generalize it so that it can be used by IndVarSimplify. Implement the base IndVarSimplify transformation code using IVUsers. This removes TestOrigIVForWrap and associated code, as ScalarEvolution now has enough builtin overflow detection and folding logic to handle all the same cases, and more. Run "opt -iv-users -analyze -disable-output" on your favorite loop for an example of what IVUsers does. This lets IndVarSimplify eliminate IV casts and compute trip counts in more cases. Also, this happens to finally fix the remaining testcases in PR1301. Now that IndVarSimplify is being more aggressive, it occasionally runs into the problem where ScalarEvolutionExpander's code for avoiding duplicate expansions makes it difficult to ensure that all expanded instructions dominate all the instructions that will use them. As a temporary measure, IndVarSimplify now uses a FixUsesBeforeDefs function to fix up instructions inserted by SCEVExpander. Fortunately, this code is contained, and can be easily removed once a more comprehensive solution is available. llvm-svn: 71535	2009-05-12 02:17:14 +00:00
Evan Cheng	78a4eb844b	Teach LSR to optimize more loop exit compares, i.e. change them to use postinc iv value. Previously LSR would only optimize those which are in the loop latch block. However, if LSR can prove it is safe (and profitable), it's now possible to change those not in the latch blocks to use postinc values. Also, if the compare is the only use, LSR would place the iv increment instruction before the compare instead in the latch. llvm-svn: 71485	2009-05-11 22:33:01 +00:00
Dale Johannesen	02cb2bf2e3	Reverse a loop that is counting up to a maximum to count down to 0 instead, under very restricted circumstances. Adjust 4 testcases in which this optimization fires. llvm-svn: 71439	2009-05-11 17:15:42 +00:00
Duncan Sands	af9eaa830a	Rename PaddedSize to AllocSize, in the hope that this will make it more obvious what it represents, and stop it being confused with the StoreSize. llvm-svn: 71349	2009-05-09 07:06:46 +00:00
Evan Cheng	b9dcc2c0c9	Factor out code that optimize loop terminating condition. llvm-svn: 71305	2009-05-09 01:08:24 +00:00
Chris Lattner	c48091f141	fix RewriteStoreUserOfWholeAlloca to use the correct type size method, fixing a crash on PR4146. While the store will ultimately overwrite the "padded size" number of bits in memory, the stored value may be a subset of this size. This function only wants to handle the case where all bits are stored. llvm-svn: 71224	2009-05-08 15:54:41 +00:00
Nick Lewycky	702fbf94a0	This transform requires valid TargetData info. Wrap it in 'if (TD)' in preparation for the day we use null TargetData when no target is specified. llvm-svn: 71210	2009-05-08 06:47:37 +00:00
Dan Gohman	140a6f24f0	Perform constant folding on operands of instructions with non-void types, such as loads and calls. llvm-svn: 71175	2009-05-07 19:43:39 +00:00
Evan Cheng	342053cd27	Unbreak the build. llvm-svn: 71091	2009-05-06 18:00:56 +00:00
David Greene	0dec5b9a75	Make sure to use signed arithmetic in APInt to fix a regression. llvm-svn: 71090	2009-05-06 17:39:26 +00:00
Duncan Sands	1efabaaa2a	Allow readonly functions to unwind exceptions. Teach the optimizers about this. For example, a readonly function with no uses cannot be removed unless it is also marked nounwind. llvm-svn: 71071	2009-05-06 06:49:50 +00:00
Dan Gohman	e58fc20f8d	Fix a copy+pasto in a comment. llvm-svn: 71035	2009-05-05 23:02:38 +00:00
Dan Gohman	96b18ccdd3	Delete a FIXME which is no longer relevant, and add a FIXME that is. llvm-svn: 71033	2009-05-05 22:59:55 +00:00
Bill Wendling	5e2ac0cd9c	Temporarily reverting r71008. It was causing this failure: Running /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/ CodeGen/X86/dg.exp ... FAIL: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/ CodeGen/X86/change-compare-stride-1.ll Failed with exit(1) at line 2 while running: grep {cmpq $-478,} change-compare-stride-1.ll.tmp child process exited abnormally llvm-svn: 71013	2009-05-05 20:49:46 +00:00
David Greene	246a3dfb10	Handle overflow of 64-bit loop conditions. llvm-svn: 71008	2009-05-05 20:22:36 +00:00
Dan Gohman	48f8222293	Re-apply 70645, converting ScalarEvolution to use CallbackVH, with fixes. allUsesReplacedWith need to walk the def-use chains and invalidate all users of a value that is replaced. SCEVs of users need to be recalcualted even if the new value is equivalent. Also, make forgetLoopPHIs walk def-use chains, since any SCEV that depends on a PHI should be recalculated when more information about that PHI becomes available. llvm-svn: 70927	2009-05-04 22:30:44 +00:00
Dan Gohman	a30370bc33	Constify a bunch of SCEV-using code. llvm-svn: 70919	2009-05-04 22:02:23 +00:00
Chris Lattner	fa552d728d	fix some problems spotted by Duncan and Nicolas Geoffray llvm-svn: 70872	2009-05-04 16:29:24 +00:00
Chris Lattner	d579cb1167	* Sink 4 duplicates of edge threading validity checks and DOUT prints into ThreadEdge directly. This shares the code, but is just a refactoring. * Make JumpThreading compute the set of loop headers and avoid threading across them. This prevents jump threading from forming irreducible loops (goodness) but also prevents it from threading in other cases that are beneficial (see the comment above FindFunctionBackedges). llvm-svn: 70820	2009-05-04 02:28:08 +00:00
Chris Lattner	351134ba93	Factor loop backedge finding out of CodeGenPrepare into a new FindFunctionBackedges function. llvm-svn: 70819	2009-05-04 02:25:58 +00:00
Dan Gohman	5036695c32	Revert r70645 for now; it's causing a variety of regressions. llvm-svn: 70661	2009-05-03 05:46:20 +00:00
Dan Gohman	e9a38d16fe	Convert ScalarEvolution to use CallbackVH for its internal map. This makes ScalarEvolution::deleteValueFromRecords, and it's code that subtly needed to be called before ReplaceAllUsesWith, unnecessary. It also makes ValueDeletionListener unnecessary. llvm-svn: 70645	2009-05-02 21:19:20 +00:00
Dan Gohman	ff08995589	Previously, RecursivelyDeleteDeadInstructions provided an option of returning a list of pointers to Values that are deleted. This was unsafe, because the pointers in the list are, by nature of what RecursivelyDeleteDeadInstructions does, always dangling. Replace this with a simple callback mechanism. This may eventually be removed if all clients can reasonably be expected to use CallbackVH. Use this to factor out the dead-phi-cycle-elimination code from LSR utility function, and generalize it to use the RecursivelyDeleteTriviallyDeadInstructions utility function. This makes LSR more aggressive about eliminating dead PHI cycles; adjust tests to either be less trivial or to simply expect fewer instructions. llvm-svn: 70636	2009-05-02 18:29:22 +00:00
Dan Gohman	c27345f0b4	Tell ScalarEvolution that the loop is being deleted before actually deleting it. This will let ScalarEvolution be more complete about updating its records. llvm-svn: 70632	2009-05-02 17:29:26 +00:00
Dan Gohman	6409e7d4e9	Don't split critical edges during the AddUsersIfInteresting phase of LSR. This makes the AddUsersIfInteresting phase of LSR a pure analysis instead of a phase that potentially does CFG modifications. The conditions where this code would actually perform a split are rare, and in the cases where it actually would do a split the split is usually undone by CodeGenPrepare, and in cases where splits actually survive into codegen, they appear to hurt more often than they help. llvm-svn: 70625	2009-05-02 05:36:01 +00:00
Dan Gohman	65dbe7874f	Make RequiresTypeConversion canonicalize the types before calling the target hooks canLosslesslyBitCastTo and isTruncateFree. This allows targets to avoid worrying about handling all combinations of integer and pointer types. llvm-svn: 70555	2009-05-01 17:07:43 +00:00
Dan Gohman	d3aa4215ef	Minor whitespace fix. llvm-svn: 70551	2009-05-01 16:56:32 +00:00
Dan Gohman	6be8530158	Fix some code to work if TargetLowering is not available. llvm-svn: 70546	2009-05-01 16:29:14 +00:00
Dale Johannesen	f4031bd01e	Print correct instruction in dump. llvm-svn: 70427	2009-04-29 22:57:20 +00:00
Dan Gohman	8ddd0b3599	Reword and tidy up some comments. llvm-svn: 70416	2009-04-29 22:01:05 +00:00
Dan Gohman	3e6e188ee3	Remove an obsolete comment. llvm-svn: 70262	2009-04-27 22:12:34 +00:00
Dale Johannesen	27b4f222cf	Fix PR 4086, a bug in FP IV elimination. llvm-svn: 70247	2009-04-27 21:03:15 +00:00
Dan Gohman	e99f98262c	Permit ChangeCompareStride to rewrite a comparison when the factor between the comparison's iv stride and the candidate stride is exactly -1. llvm-svn: 70244	2009-04-27 20:35:32 +00:00
Dan Gohman	1b5055ab7f	Return null instead of false, as appropriate. llvm-svn: 70054	2009-04-25 17:28:45 +00:00
Dan Gohman	5638e0d642	Add several more icmp simplifications. Transform signed comparisons into unsigned ones when the operands are known to have the same sign bit value. llvm-svn: 70053	2009-04-25 17:12:48 +00:00
Sanjiv Gupta	46c97e626f	Allow i16 type indices to gep. llvm-svn: 69946	2009-04-24 02:37:54 +00:00
Dan Gohman	86bcd97014	Change SCEVExpander's expandCodeFor to provide more flexibility with the persistent insertion point, and change IndVars to make use of it. This fixes a bug where IndVars was holding on to a stale insertion point and forcing the SCEVExpander to continue to use it. This fixes PR4038. llvm-svn: 69892	2009-04-23 15:16:49 +00:00
Evan Cheng	d8174d3d09	Make sure both operands have binary instructions have the same type. llvm-svn: 69844	2009-04-22 23:39:28 +00:00
Evan Cheng	59ca33053b	A few more places where the check of use_empty is needed. llvm-svn: 69842	2009-04-22 23:09:16 +00:00
Evan Cheng	cbfe9df096	Avoid deferencing use_begin() if value does not have a use. llvm-svn: 69836	2009-04-22 22:45:37 +00:00
Chris Lattner	69223bb7f5	fix a crash on a pointless but valid zero-length memset, rdar://6808691 llvm-svn: 69680	2009-04-21 16:52:12 +00:00
Dan Gohman	4860db61be	Factor out a common base class from SCEVTruncateExpr, SCEVZeroExtendExpr, and SCEVSignExtendExpr. llvm-svn: 69649	2009-04-21 01:25:57 +00:00
Dan Gohman	b397e1a7a2	Introduce encapsulation for ScalarEvolution's TargetData object, and refactor the code to minimize dependencies on TargetData. llvm-svn: 69644	2009-04-21 01:07:12 +00:00
Dale Johannesen	1238220473	Adjust loop size estimate for full unrolling; GEP's don't usually become instructions. llvm-svn: 69631	2009-04-20 22:19:33 +00:00
Sanjiv Gupta	428d490332	Before trying to introduce/eliminate cast/ext/trunc to make indices type as pointer type, make sure that the pointer size is a valid sequential index type. llvm-svn: 69574	2009-04-20 06:05:54 +00:00
Dan Gohman	056857aa21	Use more const qualifiers with SCEV interfaces. llvm-svn: 69450	2009-04-18 17:56:28 +00:00
Dan Gohman	d2d6fd806c	Don't create ConstantInts with pointer type. This fixes a regression in 403.gcc in PIC_CODEGEN=1 and DISABLE_LTO=1 mode. llvm-svn: 69344	2009-04-17 02:02:52 +00:00
Dan Gohman	fec1d086e0	Use TargetData::getTypeSizeInBits instead of getPrimitiveSizeInBits() to get the correct answer for pointer types. llvm-svn: 69321	2009-04-16 22:35:57 +00:00
Dan Gohman	8b6ebb1112	Minor code simplifications. Don't attempt LSR on theoretical targets with pointers larger than 64 bits, due to the code not yet being APInt clean. llvm-svn: 69296	2009-04-16 16:49:48 +00:00
Dan Gohman	e2ead2c328	LSR is no longer a GEP optimizer. It is now an IV expression optimizer, which just happen to frequently involve optimizing GEPs. llvm-svn: 69295	2009-04-16 16:46:01 +00:00
Dan Gohman	a8be04b2db	Use ConstantExpr::getIntToPtr instead of SCEVExpander::InsertCastOfTo, since the operand is always a constant. llvm-svn: 69291	2009-04-16 15:48:38 +00:00
Dan Gohman	71bccd3e0e	Use a SCEV expression cast instead of immediately inserting a new instruction with SCEVExpander::InsertCastOfTo. llvm-svn: 69290	2009-04-16 15:47:35 +00:00
Dan Gohman	0a40ad93a9	Expand GEPs in ScalarEvolution expressions. SCEV expressions can now have pointer types, though in contrast to C pointer types, SCEV addition is never implicitly scaled. This not only eliminates the need for special code like IndVars' EliminatePointerRecurrence and LSR's own GEP expansion code, it also does a better job because it lets the normal optimizations handle pointer expressions just like integer expressions. Also, since LLVM IR GEPs can't directly index into multi-dimensional VLAs, moving the GEP analysis out of client code and into the SCEV framework makes it easier for clients to handle multi-dimensional VLAs the same way as other arrays. Some existing regression tests show improved optimization. test/CodeGen/ARM/2007-03-13-InstrSched.ll in particular improved to the point where if-conversion started kicking in; I turned it off for this test to preserve the intent of the test. llvm-svn: 69258	2009-04-16 03:18:22 +00:00
Dale Johannesen	a71daa83c6	Eliminate zext over (iv \| const) or (signed iv), and sext over (iv \| const), if a longer iv is available. Allow expressions to have more than one zext/sext parent. All from OpenSSL. llvm-svn: 69241	2009-04-15 23:31:51 +00:00
Dale Johannesen	82230b5b17	Eliminate zext over (iv & const) or ((iv+const)&const) if a longer iv is available. These subscript forms are not common; they're a bottleneck in OpenSSL. llvm-svn: 69215	2009-04-15 20:41:02 +00:00
Dale Johannesen	7ffb7d5728	Enhance induction variable code to remove the sext around sext(shorter IV + constant), using a longer IV instead, when it can figure out the add can't overflow. This comes up a lot in subscripting; mainly affects 64 bit. llvm-svn: 69123	2009-04-15 01:10:12 +00:00
Evan Cheng	ffb83a155e	Avoid making the transformation enabled by my last patch if the new destinations have phi nodes. llvm-svn: 69121	2009-04-15 00:43:54 +00:00
Evan Cheng	5ebf2acd84	Optimize conditional branch on i1 phis with non-constant inputs. This turns: eq: %3 = icmp eq i32 %1, %2 br label %join ne: %4 = icmp ne i32 %1, %2 br label %join join: %5 = phi i1 [%3, %eq], [%4, %ne] br i1 %5, label %yes, label %no => eq: %3 = icmp eq i32 %1, %2 br i1 %3, label %yes, label %no ne: %4 = icmp ne i32 %1, %2 br i1 %4, label %yes, label %no llvm-svn: 69102	2009-04-14 23:40:03 +00:00
Owen Anderson	a1902318e3	LoopIndexSplit needs to inform the loop pass manager of the instructions it is deleting, not just the basic block. llvm-svn: 69011	2009-04-14 01:04:19 +00:00
Chris Lattner	6cd82fb430	"There was a typo in my previous patch which leads to miscompilation of strncat :( strncat(foo, "bar", 99) would be optimized to memcpy(foo+strlen(foo), "bar", 100, 1) instead of memcpy(foo+strlen(foo), "bar", 4, 1)" Patch by Benjamin Kramer! llvm-svn: 68905	2009-04-12 18:22:33 +00:00
Chris Lattner	91b6af24ac	add some optimizations for strncpy/strncat and factor some code. Patch by Benjamin Kramer! llvm-svn: 68885	2009-04-12 05:06:39 +00:00
Chris Lattner	eb510d6b3d	Instcombine should not promote whole computation trees to "strange" integer types, unless they are already strange. This prevents it from turning the code produced by SROA into crazy libcalls and stuff that the code generator can't handle. In the attached example, the result was an i96 multiply that caused the x86 backend to assert. Note that if TargetData had an idea of what the legal types are for a target that this could be used to stop instcombine from introducing i64 muls, as Scott wanted. llvm-svn: 68598	2009-04-08 05:41:03 +00:00
Chris Lattner	321741af5f	fix rdar://6762290, a crash compiling cxx filt with clang. llvm-svn: 68500	2009-04-07 05:03:34 +00:00
Chris Lattner	47d6e7b93e	remove empty section llvm-svn: 68485	2009-04-07 02:55:53 +00:00
Ed Schouten	01aa6ec97a	Let the strcat optimizer return the pointer to the start of the buffer, instead of the place where it started to perform the string copy. - PR3661 - Patch by Benjamin Kramer! llvm-svn: 68443	2009-04-06 13:06:48 +00:00
Owen Anderson	98f912bf13	Reapply r68211, with the miscompilations it caused fixed. llvm-svn: 68262	2009-04-01 23:53:49 +00:00
Dan Gohman	c4971721ea	Revert r68172. It caused regressions in Applications/Burg/burg Applications/ClamAV/clamscan and many other tests. llvm-svn: 68211	2009-04-01 16:37:47 +00:00
Owen Anderson	ff5961b46c	Enhance GVN to propagate simple conditionals. This fixes PR3921. llvm-svn: 68172	2009-04-01 01:20:45 +00:00
Chris Lattner	f72ce6ea8b	Make the key of ValueRankMap an AssertingVH, so that we die violently if it dangles. llvm-svn: 68150	2009-03-31 22:13:29 +00:00
Evan Cheng	826b6f0f7c	Throttle back "fold select into operand" transformation. InstCombine should not generate selects of two constants unless they are selects of 0 and 1. e.g. define i32 @t1(i32 %c, i32 %x) nounwind { %t1 = icmp eq i32 %c, 0 %t2 = lshr i32 %x, 18 %t3 = select i1 %t1, i32 %t2, i32 %x ret i32 %t3 } was turned into define i32 @t2(i32 %c, i32 %x) nounwind { %t1 = icmp eq i32 %c, 0 %t2 = select i1 %t1, i32 18, i32 0 %t3 = lshr i32 %x, %t2 ret i32 %t3 } For most targets, that means materializing two constants and then a select. e.g. On x86-64 movl %esi, %eax shrl $18, %eax testl %edi, %edi cmovne %esi, %eax ret => xorl %eax, %eax testl %edi, %edi movl $18, %ecx cmovne %eax, %ecx movl %esi, %eax shrl %cl, %eax ret Also, the optimizer and codegen can reason about shl / and / add, etc. by a constant. This optimization will hinder optimizations using ComputeMaskedBits. llvm-svn: 68142	2009-03-31 20:42:45 +00:00
Devang Patel	6e68bd007a	Loop Index Split can eliminate a loop if it can determin if loop body is executed only once. There was a bug in determining IV based value of the iteration for which the loop body is executed. Fix it. llvm-svn: 68071	2009-03-30 22:24:10 +00:00
Duncan Sands	3241b74f69	Revert r67798: it breaks llvm-gcc bootstrap on x86-64-linux, presumably due to a miscompilation. make[4]: Entering directory `gcc-4.2.llvm-objects/x86_64-unknown-linux-gnu/libstdc++-v3/include' if [ ! -d "./x86_64-unknown-linux-gnu/bits/stdtr1c++.h.gch" ]; then \ mkdir -p ./x86_64-unknown-linux-gnu/bits/stdtr1c++.h.gch; \ fi; \ gcc-4.2.llvm-objects/./gcc/xgcc -shared-libgcc -Bgcc-4.2.llvm-objects/./gcc -nostdinc++ -Lgcc-4.2.llvm-objects/x86_64-unknown-linux-gnu/libstdc++-v3/src -Lgcc-4.2.llvm-objects/x86_64-unknown-linux-gnu/libstdc++-v3/src/.libs -B/usr/local/gnat-llvm/x86_64-unknown-linux-gnu/bin/ -B/usr/local/gnat-llvm/x86_64-unknown-linux-gnu/lib/ -isystem /usr/local/gnat-llvm/x86_64-unknown-linux-gnu/include -isystem /usr/local/gnat-llvm/x86_64-unknown-linux-gnu/sys-include -Winvalid-pch -Wno-deprecated -x c++-header -g -O2 -D_GNU_SOURCE -Igcc-4.2.llvm-objects/x86_64-unknown-linux-gnu/libstdc++-v3/include/x86_64-unknown-linux-gnu -Igcc-4.2.llvm-objects/x86_64-unknown-linux-gnu/libstdc++-v3/include -Igcc-4.2.llvm/libstdc++-v3/libsupc++ -O2 -g gcc-4.2.llvm/libstdc++-v3/include/precompiled/stdtr1c++.h -o x86_64-unknown-linux-gnu/bits/stdtr1c++.h.gch/O2g.gch In file included from gcc-4.2.llvm-objects/x86_64-unknown-linux-gnu/libstdc++-v3/include/tr1/repeat.h:247, from gcc-4.2.llvm-objects/x86_64-unknown-linux-gnu/libstdc++-v3/include/tr1/functional:1098, from gcc-4.2.llvm/libstdc++-v3/include/precompiled/stdtr1c++.h:53: gcc-4.2.llvm-objects/x86_64-unknown-linux-gnu/libstdc++-v3/include/tr1/functional_iterate.h:417: internal compiler error: in ggc_recalculate_in_use_p, at ggc-page.c:1602 Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://llvm.org/bugs/> for instructions. make[4]: *** [x86_64-unknown-linux-gnu/bits/stdtr1c++.h.gch/O2g.gch] Error 1 llvm-svn: 67839	2009-03-27 14:56:47 +00:00
Dale Johannesen	4026b041ce	One more place to skip debug info. llvm-svn: 67811	2009-03-27 01:13:37 +00:00
Devang Patel	fe7c0492a0	While hoisting an instruction, update alias info set tracker. llvm-svn: 67798	2009-03-26 23:48:52 +00:00
Dale Johannesen	db90560c1c	Skip debug info one more place. (This one gets called from llc, not opt, but it's an IR level optimization nevertheless.) llvm-svn: 67724	2009-03-26 01:15:07 +00:00
Devang Patel	4555618854	Before deleting a basic block, give other loop passes a chance cleanup analysis values, related to the instructions in the basic block. llvm-svn: 67719	2009-03-25 23:57:48 +00:00
Chris Lattner	c3b2111d97	Fix PR3874 by restoring a condition I removed, but making it more precise than it used to be. llvm-svn: 67662	2009-03-25 00:28:58 +00:00
Chris Lattner	9e94538005	oops, I intended to remove this, not comment it out. Thanks Duncan! llvm-svn: 67657	2009-03-24 23:48:25 +00:00
Chris Lattner	306813cbbb	canonicalize inttoptr and ptrtoint instructions which cast pointers to/from integer types that are not intptr_t to convert to intptr_t then do an integer conversion to the dest type. This exposes the cast to the optimizer. llvm-svn: 67638	2009-03-24 18:35:40 +00:00
Chris Lattner	d9eb41177a	two changes: 1. Make instcombine always canonicalize trunc x to i1 into an icmp(x&1). This exposes the AND to other instcombine xforms and is more of what the code generator expects. 2. Rewrite the remaining trunc pattern match to use 'match', which simplifies it a lot. llvm-svn: 67635	2009-03-24 18:15:30 +00:00
Duncan Sands	1f15ca7c7a	Factorize out a concept - no functionality change. llvm-svn: 67454	2009-03-21 21:27:31 +00:00
Chris Lattner	0a981d1d36	Fix instcombine to not introduce undefined shifts when merging two shifts together. This fixes PR3851. llvm-svn: 67411	2009-03-20 22:41:15 +00:00
Duncan Sands	a09e0afe74	Don't load values out of global constants with weak linkage: the value may be replaced with something different at link time. (Frontends that want to allow values to be loaded out of weak constants can give their constants weak_odr linkage). llvm-svn: 67407	2009-03-20 21:53:29 +00:00
Dale Johannesen	52bc2aac8a	This pass keeps a map of Instructions to Rank numbers, and was deleting Instructions without clearing the corresponding map entry. This led to nondeterministic behavior if the same address got allocated to another Instruction within a short time. llvm-svn: 67306	2009-03-19 17:22:53 +00:00
Nick Lewycky	bfd4ad67c7	Remove strange extra semicolons. llvm-svn: 67287	2009-03-19 05:51:39 +00:00
Chris Lattner	595923ff75	Fix PR3826 - InstComb assert with vector shift, by not calling ComputeNumSignBits on a vector. llvm-svn: 67211	2009-03-18 16:32:19 +00:00
Zhou Sheng	4e2af3cb55	Explicitly check for StoreInst, do not lose the chance to delete unused loads or bitcasts. llvm-svn: 67202	2009-03-18 12:48:48 +00:00
Zhou Sheng	05bea906c1	Revert my previous change on Local.cpp, instead, fix the bug on scalarrepl. If the instruction has no users, it is also not only used by debug info and should not be deleted. llvm-svn: 67194	2009-03-18 10:13:08 +00:00
Chris Lattner	42e9ca42ce	LSR shouldn't ever try to hack on integer IV's larger than 64-bits. Right now it is not APInt clean, but even when it is it needs to be evaluated carefully to determine whether it is actually profitable. This fixes a crash on PR3806 llvm-svn: 67134	2009-03-17 23:58:30 +00:00
Chris Lattner	e549493a55	Remove a condition which is always true. llvm-svn: 67089	2009-03-17 17:55:15 +00:00
Dale Johannesen	87077356be	Fix a debug info dependency in jump threading. llvm-svn: 67064	2009-03-17 00:38:24 +00:00
Evan Cheng	94419d6fdd	Fix PR3784: If the source of a phi comes from a bb ended with an invoke, make sure the copy is inserted before the try range (unless it's used as an input to the invoke, then insert it after the last use), not at the end of the bb. Also re-apply r66140 which was disabled as a workaround. llvm-svn: 66976	2009-03-13 22:59:14 +00:00
Bill Wendling	4bb96e9a50	Revert r66920. It was causing failures in the self-hosting buildbot (in release mode). Running /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/test/BugPoint/dg.exp ... FAIL: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/test/BugPoint/crash-narrowfunctiontest.ll Failed with signal(SIGBUS) at line 1 while running: bugpoint /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/test/BugPoint/crash-narrowfunctiontest.ll -bugpoint-crashcalls -silence-passes > /dev/null 0 bugpoint 0x0035dd25 llvm::sys::SetInterruptFunction(void ()()) + 85 1 bugpoint 0x0035e382 llvm::sys::RemoveFileOnSignal(llvm::sys::Path const&, std::string) + 706 2 libSystem.B.dylib 0x92f112bb _sigtramp + 43 3 libSystem.B.dylib 0xffffffff _sigtramp + 1829694831 4 bugpoint 0x00021d1c main + 92 5 bugpoint 0x00002106 start + 54 6 bugpoint 0x00000004 start + 18446744073709543220 Stack dump: 0. Program arguments: bugpoint /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/test/BugPoint/crash-narrowfunctiontest.ll -bugpoint-crashcalls -silence-passes FAIL: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/test/BugPoint/misopt-basictest.ll Failed with signal(SIGBUS) at line 1 while running: bugpoint /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/test/BugPoint/misopt-basictest.ll -dce -bugpoint-deletecalls -simplifycfg -silence-passes 0 bugpoint 0x0035dd25 llvm::sys::SetInterruptFunction(void ()()) + 85 1 bugpoint 0x0035e382 llvm::sys::RemoveFileOnSignal(llvm::sys::Path const&, std::string) + 706 2 libSystem.B.dylib 0x92f112bb _sigtramp + 43 3 libSystem.B.dylib 0xffffffff _sigtramp + 1829694831 4 bugpoint 0x00021d1c main + 92 5 bugpoint 0x00002106 start + 54 6 bugpoint 0x00000006 start + 18446744073709543222 Stack dump: 0. Program arguments: bugpoint /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/test/BugPoint/misopt-basictest.ll -dce -bugpoint-deletecalls -simplifycfg -silence-passes FAIL: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/test/BugPoint/remove_arguments_test.ll Failed with signal(SIGBUS) at line 1 while running: bugpoint /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/test/BugPoint/remove_arguments_test.ll -bugpoint-crashcalls -silence-passes 0 bugpoint 0x0035dd25 llvm::sys::SetInterruptFunction(void ()()) + 85 1 bugpoint 0x0035e382 llvm::sys::RemoveFileOnSignal(llvm::sys::Path const&, std::string) + 706 2 libSystem.B.dylib 0x92f112bb _sigtramp + 43 3 libSystem.B.dylib 0xffffffff _sigtramp + 1829694831 4 bugpoint 0x00021d1c main + 92 5 bugpoint 0x00002106 start + 54 Stack dump: 0. Program arguments: bugpoint /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/test/BugPoint/remove_arguments_test.ll -bugpoint-crashcalls -silence-passes --- Reverse-merging (from foreign repository) r66920 into '.': U include/llvm/Support/CallSite.h U include/llvm/Instructions.h U lib/Analysis/IPA/GlobalsModRef.cpp U lib/Analysis/IPA/Andersens.cpp U lib/Bitcode/Writer/BitcodeWriter.cpp U lib/VMCore/Instructions.cpp U lib/VMCore/Verifier.cpp U lib/VMCore/AsmWriter.cpp U lib/Transforms/Utils/LowerInvoke.cpp U lib/Transforms/Scalar/SimplifyCFGPass.cpp U lib/Transforms/IPO/PruneEH.cpp U lib/Transforms/IPO/DeadArgumentElimination.cpp llvm-svn: 66953	2009-03-13 21:15:59 +00:00
Dale Johannesen	c65830519e	One more place where debug info affects codegen. llvm-svn: 66930	2009-03-13 19:23:20 +00:00
Gabor Greif	258232fb80	Second installment of "BasicBlock operands to the back" changes. For InvokeInst now all arguments begin at op_begin(). The Callee, Cont and Fail are now faster to get by access relative to op_end(). This patch introduces some temporary uglyness in CallSite. Next I'll bring CallInst up to a similar scheme and then the uglyness will magically vanish. This patch also exposes all the reliance of the libraries on InvokeInst's operand ordering. I am thinking of taking care of that too. llvm-svn: 66920	2009-03-13 18:27:29 +00:00
Bill Wendling	fa54bc2052	Oops...I committed too much. llvm-svn: 66867	2009-03-13 04:39:26 +00:00
Bill Wendling	b02eadf660	Temporarily XFAIL this test. llvm-svn: 66866	2009-03-13 04:37:11 +00:00
Duncan Sands	1f853d6a2a	Revert commit 66140 since it caused several failures in the Ada testcase. Reverting this only covers up the real problem, which is a nasty conceptual difficulty in the phi elimination pass: when eliminating phi nodes in landing pads, the register copies need to come before the invoke, not at the end of the basic block which is too late... See PR3784. llvm-svn: 66826	2009-03-12 21:13:42 +00:00
Dale Johannesen	08ccba73a7	Skip interleaved debug info when fast-forwarding through allocations. Apparently the assumption is there is an instruction (terminator?) following the allocation so I am allowing the same assumption. llvm-svn: 66716	2009-03-11 22:19:43 +00:00
Dale Johannesen	703703aacb	Removing a dead debug intrinsic shouldn't trigger another instcombine pass if we weren't going to make one without debug info. llvm-svn: 66576	2009-03-10 21:19:49 +00:00
John Criswell	073e4d16c5	Do not attempt to do parial redundancy elimination on void values. Also fixed a punctuation error in the header comment. This fixes PR3775. llvm-svn: 66542	2009-03-10 15:04:53 +00:00
Dan Gohman	f12436891e	Don't record the increment instruction; just recompute it from the Phi if needed. This simplifies the code a little, and is needed for an upcoming refactoring. llvm-svn: 66479	2009-03-09 22:04:01 +00:00
Dan Gohman	b855164751	Fix a few more places where induction variable types were used where memory access types are needed. llvm-svn: 66470	2009-03-09 21:22:12 +00:00
Dan Gohman	5a4e31666d	Use ReplacedTy instead of recomputing the same value. llvm-svn: 66469	2009-03-09 21:19:58 +00:00
Dan Gohman	34e52ddb7d	Use LoopInfo's getLoopLatch() instead of doing what it does manualy. llvm-svn: 66467	2009-03-09 21:14:16 +00:00
Dan Gohman	70cc9875d8	Don't use an induction variable type as a memory access type. Use VoidTy instead, to be properly conservative. llvm-svn: 66463	2009-03-09 21:04:19 +00:00
Dan Gohman	917ffe4592	Factor out the code that determines the memory access type of an instruction into a helper function. llvm-svn: 66460	2009-03-09 21:01:17 +00:00
Dan Gohman	e201f8ff1d	Move the sorting of the StrideOrder array earlier so that it doesn't have to be done twice. llvm-svn: 66449	2009-03-09 20:46:50 +00:00
Dan Gohman	b5001909b0	Delete the isOnlyStride argument, which is unused. llvm-svn: 66446	2009-03-09 20:41:15 +00:00
Dan Gohman	85875f7120	Tidy some LSR debug output: announce the loop it's about to process before it does any processing. llvm-svn: 66443	2009-03-09 20:34:59 +00:00
Chris Lattner	0eab5ecb71	reimplement AliasSetTracker in terms of DenseMap instead of hash_map, hopefully no functionality change. llvm-svn: 66398	2009-03-09 05:11:09 +00:00
Chris Lattner	21a84f3054	teach SROA to handle promoting vector allocas with a memset into them into a vector type instead of into an integer type. llvm-svn: 66368	2009-03-08 04:17:04 +00:00
Chris Lattner	c009757761	Enhance SROA to "promote to scalar" allocas which are memcpy/memmove'd into or out of. This fixes a serious perf issue that Nate ran into. llvm-svn: 66366	2009-03-08 04:04:21 +00:00
Chris Lattner	dc35e5b43a	change the MemIntrinsic get/setAlignment method to take an unsigned instead of a Constant*, which is what the clients of it really want. llvm-svn: 66364	2009-03-08 03:59:00 +00:00
Chris Lattner	334268a211	Introduce a new MemTransferInst pseudo class, which is a common parent between MemCpyInst and MemMoveInst, simplify some code to use it. llvm-svn: 66361	2009-03-08 03:37:16 +00:00
Chris Lattner	e48f897ca7	add a bunch more passes to the C bindings (PR3734), patch by Lennart Augustsson! llvm-svn: 66272	2009-03-06 16:52:18 +00:00
Devang Patel	25b625165f	While converting an aggregate to scalare, ignore and remove aggregate's debug info. llvm-svn: 66262	2009-03-06 07:03:54 +00:00
Chris Lattner	e6d1e8d0cc	this wasn't intended to go in. llvm-svn: 66252	2009-03-06 05:42:30 +00:00
Chris Lattner	e3fc2d13be	Change various llvm utilities to use PrettyStackTraceProgram in their main routines. This makes the tools print their argc/argv commands if they crash. llvm-svn: 66248	2009-03-06 05:34:10 +00:00
Devang Patel	bab43b4c91	Do not count DbgInfoIntrinsic while estimating loop header size. llvm-svn: 66245	2009-03-06 03:51:30 +00:00
Devang Patel	e8c6d3102d	Skip DbgInfoIntrinsic. llvm-svn: 66244	2009-03-06 02:59:27 +00:00
Dale Johannesen	fb1caf3e1f	Don't assign rank numbers to debug intrinsic "calls". This is needed so debug info doesn't change codegen. llvm-svn: 66235	2009-03-06 01:41:59 +00:00
Evan Cheng	5fd4fc76bf	SRThreshold is meant to be inclusive. llvm-svn: 66227	2009-03-06 00:56:43 +00:00
Evan Cheng	b7922dee15	Do not split edges to EH landing pads. It will cause code size explosion. llvm-svn: 66140	2009-03-05 06:31:26 +00:00
Dale Johannesen	78ab338024	Fix another case where debug info was affecting codegen. I convinced myself it was OK to skip all pointer bitcasts here too. llvm-svn: 66122	2009-03-05 02:06:48 +00:00
Bill Wendling	0bf1ded7bd	Add comment to emphasize that the while body is empty. llvm-svn: 66115	2009-03-05 01:08:35 +00:00
Dale Johannesen	ad6b47377f	Fix another case where a dbg.declare meant something had 2 uses instead of 1. llvm-svn: 66112	2009-03-05 00:39:02 +00:00
Dale Johannesen	df4226c0e2	Re-commit 65975 and a fix for the problem that was causing llvm-gcc to fail to build. I've verified it bootstraps now; good enough for me. llvm-svn: 66073	2009-03-04 21:24:04 +00:00
Dan Gohman	66476b582d	Fix this comment. llvm-svn: 66065	2009-03-04 20:50:23 +00:00
Dan Gohman	ae0035ee15	Add an assertion for a condition that's always true, and not immediately obvious. llvm-svn: 66062	2009-03-04 20:49:01 +00:00
Chris Lattner	a41bb40458	complete comment. llvm-svn: 66055	2009-03-04 19:23:25 +00:00
Chris Lattner	b5b0c87be6	this wasn't intended to be committed. llvm-svn: 66054	2009-03-04 19:22:30 +00:00
Chris Lattner	5c204c92a4	Fix PR3720 by properly propagating alignment information from memcpy/memmove onto element accesses. llvm-svn: 66053	2009-03-04 19:20:50 +00:00
Dale Johannesen	c8b5a6ef7d	Always skip ptr-to-ptr bitcasts when counting, per Chris' suggestion. Slightly faster. llvm-svn: 65999	2009-03-04 01:53:05 +00:00
Dale Johannesen	0365d3b8b5	Make my earlier patch to skip debug intrinsics when counting work; it was only off by 1. llvm-svn: 65993	2009-03-04 01:20:34 +00:00
Dale Johannesen	09c3e8ec00	Instruction counters must skip the bitcasts that feed into llvm.dbg.declare nodes, as well as the debug directives themselves. llvm-svn: 65976	2009-03-03 22:36:47 +00:00
Dale Johannesen	77456b7ab4	When removing a store to an alloca that has only one use, check also for the case where it has two uses, the other being a llvm.dbg.declare. This is needed so debug info doesn't affect codegen. llvm-svn: 65970	2009-03-03 21:26:39 +00:00
Bill Wendling	a68fc7af63	Use > instead of >=. We want to promote aggregates of 128-bytes. llvm-svn: 65960	2009-03-03 19:18:49 +00:00
Bill Wendling	3e44bf3c4b	Reapply r65755, but reversing "<" to ">=". llvm-svn: 65945	2009-03-03 12:12:58 +00:00
Dan Gohman	92b551bc2b	Fix a bunch of Doxygen syntax issues. Escape special characters, and put @file directives on their own comment line. llvm-svn: 65920	2009-03-03 02:55:14 +00:00
Dale Johannesen	0192552340	Don't count DebugInfo instructions in another limit (lest they affect codegen). llvm-svn: 65915	2009-03-03 01:43:03 +00:00
Dale Johannesen	e1bb2f86f9	When sinking an insn in InstCombine bring its debug info with it. Don't count debug info insns against the scan maximum in FindAvailableLoadedValue (lest they affect codegen). llvm-svn: 65910	2009-03-03 01:09:07 +00:00
Devang Patel	d50ebbdf3f	If branch conditions' one successor is dominating another non-latch successor then this loop's iteration space can not be restricted. In this example block bb5 is always executed. llvm-svn: 65902	2009-03-02 23:39:14 +00:00
Duncan Sands	5795a6091d	Fix PR3694: add an instcombine micro-optimization that helps clean up when using variable length arrays in llvm-gcc. llvm-svn: 65832	2009-03-02 09:18:21 +00:00
Bill Wendling	38eae046cf	Temporarily revert r65755. It was causing failures in the self-hosting testsuite: Running /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/test/CodeGen/X86/dg.exp ... FAIL: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/test/CodeGen/X86/nancvt.ll Failed with exit(1) at line 2 while running: grep 2147027116 nancvt.ll.tmp \| count 3 count: expected 3 lines and got 0. child process exited abnormally FAIL: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/test/CodeGen/X86/vec_ins_extract.ll Failed with exit(1) at line 1 while running: llvm-as < /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/test/CodeGen/X86/vec_ins_extract.ll \| opt -scalarrepl -instcombine \| llc -march=x86 -mcpu=yonah \| not /usr/bin/grep sub.*esp subl $28, %esp subl $28, %esp child process exited abnormally And more. llvm-svn: 65758	2009-03-01 03:55:12 +00:00
Chris Lattner	e2bb5e31c8	hoist the check for alloca size up so that it controls CanConvertToScalar as well as isSafeAllocaToScalarRepl. llvm-svn: 65755	2009-03-01 02:26:47 +00:00
Nick Lewycky	34709f84d8	Silence compiler warning about use of uninitialized variables (in reality these are always set by reference on the path that uses them.) No functional change. llvm-svn: 65621	2009-02-27 06:37:39 +00:00
Chris Lattner	af618171f4	Fix PR3667 llvm-svn: 65464	2009-02-25 18:20:01 +00:00
Dan Gohman	0bddac16a8	Rename ScalarEvolution's getIterationCount to getBackedgeTakenCount, to more accurately describe what it does. Expand its doxygen comment to describe what the backedge-taken count is and how it differs from the actual iteration count of the loop. Adjust names and comments in associated code accordingly. llvm-svn: 65382	2009-02-24 18:55:53 +00:00
Dan Gohman	4f356bb9b0	Fix a ValueTracking rule: RHS means operand 1, not 0. Add a simple ashr instcombine to help expose this code. And apply the fix to SelectionDAG's copy of this code too. llvm-svn: 65364	2009-02-24 02:00:40 +00:00
Dan Gohman	5d1f458f0f	Generalize the ChangeCompareStride code, in preparation for handling non-constant strides. No functionality change. llvm-svn: 65363	2009-02-24 01:58:00 +00:00
Dan Gohman	e669884749	Preserve the DominanceFrontier analysis in the LoopDeletion pass. llvm-svn: 65359	2009-02-24 01:21:53 +00:00
Dan Gohman	f6e8c77e1c	Back out the change in 64918 that used sign-extensions when promoting trip counts that use signed comparisons. It's not obviously the best approach for preserving trip count information, and at any rate there isn't anything in the tree right now that makes use of that, so for now always using zero-extensions is preferable. llvm-svn: 65347	2009-02-23 23:20:35 +00:00
Dan Gohman	e591411fd6	LoopDeletion needs to inform ScalarEvolution when a loop is deleted, so that ScalarEvolution doesn't hang onto a dangling Loop*, which could be a problem if another Loop happens to get allocated at the same address. llvm-svn: 65323	2009-02-23 17:10:29 +00:00
Dan Gohman	42987f528a	IndVarSimplify preserves ScalarEvolution. In the -std-compile-opts sequence, this avoids the need for ScalarEvolution to be rerun before LoopDeletion. llvm-svn: 65318	2009-02-23 16:29:41 +00:00
Zhou Sheng	3a86bcf134	Should reset DBI_Prev if DBI_Next == 0. llvm-svn: 65314	2009-02-23 10:14:11 +00:00
Chris Lattner	d5420f0957	fix some typos that Duncan noticed llvm-svn: 65306	2009-02-23 05:56:17 +00:00
Dan Gohman	648c5e9c99	Revert the part of 64623 that attempted to align the source in a memcpy to match the alignment of the destination. It isn't necessary for making loads and stores handled like the SSE loadu/storeu intrinsics, and it was causing a performance regression in MultiSource/Applications/JM/lencod. The problem appears to have been a memcpy that copies from some highly aligned array into an alloca; the alloca was then being assigned a large alignment, which required codegen to perform dynamic stack-pointer re-alignment, which forced the enclosing function to have a frame pointer, which led to increased spilling. llvm-svn: 65289	2009-02-22 18:06:32 +00:00
Dan Gohman	f394e58af5	Properly parenthesize this expression, fixing a real bug in the new -full-lsr code, as well as a GCC warning. llvm-svn: 65288	2009-02-22 16:40:52 +00:00
Evan Cheng	69decbf0b2	Only try to sink immediate when TLI is not null. It needs to check if immediate would fit in target addressing field. llvm-svn: 65268	2009-02-22 07:31:19 +00:00
Nick Lewycky	d44e80d7fc	Don't sign extend the char when expanding char -> int during load(bitcast(char[4] to i32*)) evaluation. llvm-svn: 65246	2009-02-21 20:50:42 +00:00
Evan Cheng	107b06c4b9	Teach LSR sink to sink the immediate portion of the common expression back into uses if they fit in address modes of all the uses. llvm-svn: 65215	2009-02-21 02:06:47 +00:00
Chris Lattner	bef6b2098e	rename a function to indicate that it checks for profitability as well as legality. Make load sinking and gep sinking more careful: we only do it when it won't pessimize loads from the stack. This has the added benefit of not producing code that is unanalyzable to SROA. llvm-svn: 65209	2009-02-21 00:46:50 +00:00
Evan Cheng	8a9481d50d	Fix strange logic in CollectIVUsers used to determine whether all uses are addresses, part 1. This fixes an obvious logic bug. Previously if the only in-loop use is a PHI, it would return AllUsesAreAddresses as true. llvm-svn: 65178	2009-02-20 22:16:49 +00:00
Dan Gohman	5e309a5bbb	Simplify code and reduce indentation. No functionality change. llvm-svn: 65167	2009-02-20 21:27:23 +00:00
Dan Gohman	2c8cb5b4ec	Fix 80-column violations. llvm-svn: 65159	2009-02-20 21:06:57 +00:00
Dan Gohman	addc50b4ee	It's not necessary to check if Base is null here. llvm-svn: 65157	2009-02-20 21:05:23 +00:00
Dan Gohman	1608df5319	Add a comment about how Imm can be used for loop-variant values. llvm-svn: 65147	2009-02-20 20:29:04 +00:00
Evan Cheng	c380864d2c	Factor address mode matcher out of codegen prepare to make it available to other passes, e.g. loop strength reduction. llvm-svn: 65134	2009-02-20 18:24:38 +00:00
Dan Gohman	2a12ae7d1f	Implement "superhero" strength reduction, or full strength reduction of address calculations down to basic pointer arithmetic. This is currently off by default, as it needs a few other features before it becomes generally useful. And even when enabled, full strength reduction is only performed when it doesn't increase register pressure, and when several other conditions are true. This also factors out a bunch of exisiting LSR code out of StrengthReduceStridedIVUsers into separate functions, and tidies up IV insertion. This actually decreases register pressure even in non-superhero mode. The change in iv-users-in-other-loops.ll is an example of this; there are two more adds because there are two fewer leas, and there is less spilling. llvm-svn: 65108	2009-02-20 04:17:46 +00:00
Dan Gohman	a34d7adefb	Use DEBUG() instead of passing *DOUT to WriteAsOperand, since the latter just passes a null reference when debugging is not enabled. llvm-svn: 65060	2009-02-19 19:32:06 +00:00
Dan Gohman	30a2959367	Make the debug output of LSR less cryptic and more informative. llvm-svn: 65057	2009-02-19 19:23:27 +00:00
Dan Gohman	8078b8bddc	Use a sign-extend instead of a zero-extend when promoting a trip count value when the original loop iteration condition is signed and the canonical induction variable won't undergo signed overflow. This isn't required for correctness; it just preserves more information about original loop iteration values. Add a getTruncateOrSignExtend method to ScalarEvolution, following getTruncateOrZeroExtend. llvm-svn: 64918	2009-02-18 17:22:41 +00:00
Dan Gohman	aa0f01929b	Simplify by using dyn_cast instead of isa and cast. llvm-svn: 64917	2009-02-18 16:54:33 +00:00
Dan Gohman	38a9631d5f	Eliminate several more unnecessary intptr_t casts. llvm-svn: 64888	2009-02-18 05:09:16 +00:00
Dan Gohman	8212ebb5cf	Fix a corner case in the new indvars promotion logic: if there are multiple IV's in a loop, some of them may under go signed or unsigned wrapping even if the IV that's used in the loop exit condition doesn't. Restrict sign-extension-elimination and zero-extension-elimination to only those that operate on the original loop-controlling IV. llvm-svn: 64866	2009-02-18 00:52:00 +00:00
Dan Gohman	d0b1fbd983	Fix a typo in a comment. llvm-svn: 64859	2009-02-18 00:08:39 +00:00
Dan Gohman	d90415555e	LoopIndexSplit doesn't actually use ScalarEvolution. llvm-svn: 64811	2009-02-17 20:50:11 +00:00
Dan Gohman	4330034160	Add a method to ScalarEvolution for telling it when a loop has been modified in a way that may effect the trip count calculation. Change IndVars to use this method when it rewrites pointer or floating-point induction variables instead of using a doInitialization method to sneak these changes in before ScalarEvolution has a chance to see the loop. This eliminates the need for LoopPass to depend on ScalarEvolution. llvm-svn: 64810	2009-02-17 20:49:49 +00:00
Chris Lattner	24f31a0e59	commit a tweaked version of Daniel's patch for PR3599. We now eliminate all the extensions and all but the one required truncate from the testcase, but the or/and/shift stuff still isn't zapped. llvm-svn: 64809	2009-02-17 20:47:23 +00:00
Dan Gohman	f84d42f282	Delete trailing whitespace. llvm-svn: 64784	2009-02-17 19:13:57 +00:00
Dan Gohman	efe65e547b	Fix 80-column violation. llvm-svn: 64766	2009-02-17 15:57:39 +00:00
Evan Cheng	161861deb0	Strengthen the "non-constant stride must dominate loop preheader" check. llvm-svn: 64703	2009-02-17 00:13:06 +00:00
Dan Gohman	2cd8982002	Simplify; fix some 80-column violations. llvm-svn: 64702	2009-02-17 00:10:53 +00:00
Dan Gohman	f68d29edd5	Fix EnforceKnownAlignment so that it doesn't ever reduce the alignment of an alloca or global variable. llvm-svn: 64693	2009-02-16 23:02:21 +00:00
Dan Gohman	136aa1fb96	Delete this long-commented-out code. The situation it seems to have been written for is no longer relevant with the elimination of signed and unsigned types. llvm-svn: 64625	2009-02-16 02:57:42 +00:00
Dan Gohman	9cdfd44521	Change these tests to use regular loads instead of llvm.x86.sse2.loadu.dq. Enhance instcombine to use the preferred field of GetOrEnforceKnownAlignment in more cases, so that regular IR operations are optimized in the same way that the intrinsics currently are. llvm-svn: 64623	2009-02-16 00:44:23 +00:00
Nick Lewycky	8f4a097f15	Update the list of function annotations for nocapture. All of these came up when I was looking at functions used by python. Highlights include, better largefile support (64-bit file sizes on 32-bit systems), fputs string is nocapture, popen/pclose added (popen being noalias return), modf and frexp and friends. Also added some missing 'break' statements and combined identical sections. llvm-svn: 64615	2009-02-15 22:47:25 +00:00
Evan Cheng	e79841adbb	Fix pr3571: If stride is a value defined by an instruction, make sure it dominates the loop preheader. When IV users are strength reduced, the stride is inserted into the preheader. It could create a use before def situation. llvm-svn: 64579	2009-02-15 06:06:15 +00:00
Evan Cheng	fe151ba135	ifdef out unneeded if statement. llvm-svn: 64575	2009-02-15 03:20:37 +00:00
Dan Gohman	671f2c085f	Extend the IndVarSimplify support for promoting induction variables: - Test for signed and unsigned wrapping conditions, instead of just testing for non-negative induction ranges. - Handle loops with GT comparisons, in addition to LT comparisons. - Support more cases of induction variables that don't start at 0. llvm-svn: 64532	2009-02-14 02:31:09 +00:00
Dan Gohman	47ff6aad23	Clarify debug output. llvm-svn: 64531	2009-02-14 02:26:50 +00:00
Dan Gohman	4bfa1d4c63	Simplify some code. hasComputableLoopEvolution is overkill in this case. No functionality change. llvm-svn: 64530	2009-02-14 02:25:19 +00:00
Dan Gohman	55ea72179c	In CodeGenPrepare's debug output, use WriteAsOperand instead of printing getName(), so that unnamed values are printed correctly. llvm-svn: 64468	2009-02-13 17:45:12 +00:00
Dan Gohman	a2730abaaa	Complete the sentance in this comment. I have reservations about the code it describes, but at least now the comment is right. llvm-svn: 64465	2009-02-13 17:36:42 +00:00
Nick Lewycky	d234a845f9	Mark strto* as readonly when the endptr is null. llvm-svn: 64460	2009-02-13 17:08:33 +00:00
Nick Lewycky	a0e83a0952	On strtod and friends, mark 'endptr' nocapture in the function prototype, and mark the first argument nocapture if endptr=NULL for each particular call. llvm-svn: 64453	2009-02-13 15:31:46 +00:00
Dan Gohman	f71a473720	Fix the code that checked if a SCEVAddRecExpr Start contains an addrec in a different loop to check the value being added to the accumulated Start value, not the Start value before it has the new value added to it. This prevents LSR from going crazy on the included testcase. Dale, please review. llvm-svn: 64440	2009-02-13 03:58:31 +00:00
Dan Gohman	ba83228cdb	Fix LSR's IV sorting function to explicitly sort by bitwidth after sorting by stride value. This prevents it from missing IV reuse opportunities in a host-sensitive manner. llvm-svn: 64415	2009-02-13 00:26:43 +00:00
Dan Gohman	eb6be650ce	Teach IndVarSimplify to optimize code using the C "int" type for loop induction on LP64 targets. When the induction variable is used in addressing, IndVars now is usually able to inserst a 64-bit induction variable and eliminates the sign-extending cast. This is also useful for code using C "short" types for induction variables on targets with 32-bit addressing. Inserting a wider induction variable is easy; the tricky part is determining when trunc(sext(i)) expressions are no-ops. This requires range analysis of the loop trip count. A common case is when the original loop iteration starts at 0 and exits when the induction variable is signed-less-than a fixed value; this case is now handled. This replaces IndVarSimplify's OptimizeCanonicalIVType. It was doing the same optimization, but it was limited to loops with constant trip counts, because it was running after the loop rewrite, and the information about the original induction variable is lost by that point. Rename ScalarEvolution's executesAtLeastOnce to isLoopGuardedByCond, generalize it to be able to test for ICMP_NE conditions, and move it to be a public function so that IndVars can use it. llvm-svn: 64407	2009-02-12 22:19:27 +00:00
Dan Gohman	656b097b8a	Add a utility function to LoopInfo to return the exit block when the loop has exactly one exit, and make use of it in LoopIndexSplit. llvm-svn: 64388	2009-02-12 18:08:24 +00:00
Dan Gohman	e0d32c490a	This code doesn't actually use the ExitingBlocks list. llvm-svn: 64376	2009-02-12 16:36:26 +00:00
Chris Lattner	096f44de61	improve naming of values in GVN, patch by Jay Foad! llvm-svn: 64363	2009-02-12 07:00:35 +00:00
Chris Lattner	5297c63565	fix PR3537: if resetting bbi back to the start of a block, we need to forget about already inserted expressions. llvm-svn: 64362	2009-02-12 06:56:08 +00:00
Nick Lewycky	b92c4d72a7	Don't mark all args to strtod and friends as nocapture. llvm-svn: 64352	2009-02-12 03:18:34 +00:00
Nate Begeman	318aea93bf	the two non-mask arguments to a shufflevector must be the same width, but they do not have to be the same width as the result value. llvm-svn: 64335	2009-02-11 22:36:25 +00:00
Devang Patel	da1a632a87	Use early exits. Reduce indentation. llvm-svn: 64226	2009-02-10 19:28:07 +00:00
Devang Patel	caf4485781	Enable scalar replacement of AllocaInst whose one of the user is dbg info. llvm-svn: 64207	2009-02-10 07:00:59 +00:00
Dale Johannesen	cd19967754	Fix PR 3471, and some cleanups. llvm-svn: 64177	2009-02-09 22:14:15 +00:00
Bill Wendling	415515077b	Mistakenly turned this on. llvm-svn: 64065	2009-02-08 01:32:00 +00:00
Bill Wendling	5469ec1072	Revert r63999. It was breaking self-hosting builds. llvm-svn: 64062	2009-02-08 00:58:05 +00:00
Mon P Wang	21eb52a74f	Instrcombine should not change load(cast p) to cast(load p) if the cast changes the address space of the pointer. llvm-svn: 64035	2009-02-07 22:19:29 +00:00
Mike Stump	f009a51794	Insert space to avoid warning and make code more readable. llvm-svn: 64003	2009-02-07 03:36:02 +00:00
Devang Patel	7cb8df4ce7	Ignore DbgInfoIntrinsics. llvm-svn: 63923	2009-02-06 06:19:06 +00:00
Chris Lattner	bbbb74372b	fix PR3489, use bits instead of bytes. llvm-svn: 63916	2009-02-06 04:34:07 +00:00
Devang Patel	409b794cfe	Ignore dbg intrinsics while propagating conditional expression info. Take 2. llvm-svn: 63898	2009-02-05 23:32:52 +00:00
Devang Patel	02f58e1e8d	Revert rev. 63876. It is causing llvm-gcc bootstrap failure. llvm-svn: 63888	2009-02-05 21:46:41 +00:00
Devang Patel	58cb603d2a	Remove dead blocks in the end. llvm-svn: 63880	2009-02-05 19:59:42 +00:00
Devang Patel	5922e26d1a	Ignore dbg intrinsics while propagating conditional expression info. llvm-svn: 63876	2009-02-05 19:15:39 +00:00
Devang Patel	43a1161379	If "optimize for size" attribute is set then block non-trivial loop unswitches but allow trivial loop unswitches. llvm-svn: 63670	2009-02-03 22:04:27 +00:00
Chris Lattner	ef37dc8511	teach "convert from scalar" to handle loads of fca's. llvm-svn: 63659	2009-02-03 21:08:45 +00:00
Chris Lattner	f5df53cb46	refactor the interface to ConvertUsesOfLoadToScalar, renaming it to ConvertScalar_ExtractValue llvm-svn: 63658	2009-02-03 21:01:03 +00:00
Chris Lattner	576baa4adf	convert ConvertUsesOfLoadToScalar to use IRBuilder, no functionality change. llvm-svn: 63652	2009-02-03 19:45:44 +00:00
Chris Lattner	c1fb96d347	switch ConvertScalar_InsertValue to use an IRBuilder, no functionality change. llvm-svn: 63651	2009-02-03 19:41:50 +00:00
Chris Lattner	18f56c295c	make scalar conversion handle stores of first class aggregate values. loads are not yet handled (coming soon to an sroa near you). llvm-svn: 63649	2009-02-03 19:30:11 +00:00
Chris Lattner	73eff2e6e8	Make SROA produce a vector only when the alloca is actually accessed at least once as a vector. This prevents it from compiling the example in not-a-vector into: define double @test(double %A, double %B) { %tmp4 = insertelement <7 x double> undef, double %A, i32 0 %tmp = insertelement <7 x double> %tmp4, double %B, i32 4 %tmp2 = extractelement <7 x double> %tmp, i32 4 ret double %tmp2 } instead, producing the integer code. Producing vectors when they aren't otherwise in the program is dangerous because a lot of other code treats them carefully and doesn't want to break them down. OTOH, many things want to break down tasty i448's. llvm-svn: 63638	2009-02-03 18:15:05 +00:00
Evan Cheng	8542caa3f7	APInt'fy SimplifyDemandedVectorElts so it can analyze vectors with more than 64 elements. llvm-svn: 63631	2009-02-03 10:05:09 +00:00
Chris Lattner	80810b4c2d	add another case of undefined behavior without crashing, PR3466. llvm-svn: 63620	2009-02-03 07:08:57 +00:00
Chris Lattner	6aa6b1f263	Teach ConvertUsesToScalar to handle memset, allowing it to handle crazy cases like: struct f { int A, B, C, D, E, F; }; short test4() { struct f A; A.A = 1; memset(&A.B, 2, 12); return A.C; } llvm-svn: 63596	2009-02-03 02:01:43 +00:00
Chris Lattner	09b65ab288	rearrange how SRoA handles promotion of allocas to vectors. With the new world order, it can handle cases where the first store into the alloca is an element of the vector, instead of requiring the first analyzed store to have the vector type itself. This allows us to un-xfail test/CodeGen/X86/vec_ins_extract.ll. llvm-svn: 63590	2009-02-03 01:30:09 +00:00
Chris Lattner	43cecd7c26	inline SROA::ConvertToScalar, no functionality change. llvm-svn: 63544	2009-02-02 20:44:45 +00:00
Chris Lattner	18eba4f211	Fix a bug which caused us to miscompile a couple of Ada tests. Thanks for the beautiful reduced testcase Duncan! llvm-svn: 63529	2009-02-02 18:02:59 +00:00
Duncan Sands	6f361ff345	Fix a comment (bytes -> bits), reformat a comment and remove trailing whitespace. No functionality change. llvm-svn: 63511	2009-02-02 10:06:20 +00:00
Duncan Sands	33d6e97e33	Fix an obvious thinko. llvm-svn: 63510	2009-02-02 09:53:14 +00:00
Chris Lattner	1aafe4cece	reduce indentation, (~XorCST->getValue()).isSignBit() -> isMaxSignedValue() llvm-svn: 63500	2009-02-02 07:15:30 +00:00
Nick Lewycky	f23908151a	Reinstate this optimization to fold icmp of xor when possible. Don't try to turn icmp eq a+x, b+x into icmp eq a, b if a+x or b+x has other uses. This may have been increasing register pressure leading to the bzip2 slowdown. llvm-svn: 63487	2009-01-31 21:30:05 +00:00
Chris Lattner	9e2b9f3234	Fix PR3452 (an infinite loop bootstrapping) by disabling the recent improvements to the EvaluateInDifferentType code. This code works by just inserted a bunch of new code and then seeing if it is useful. Instcombine is not allowed to do this: it can only insert new code if it is useful, and only when it is converging to a more canonical fixed point. Now that we iterate when DCE makes progress, this causes an infinite loop when the code ends up not being used. llvm-svn: 63483	2009-01-31 19:05:27 +00:00
Chris Lattner	76a63ed099	now that all the pieces are in place, teach instcombine's simplifydemandedbits to simplify instructions with multiple uses in contexts where it can get away with it. This allows it to simplify the code in multi-use-or.ll into a single 'add double'. This change is particularly interesting because it will cover up for some common codegen bugs with large integers created due to the recent SROA patch. When working on fixing those bugs, this should be disabled. llvm-svn: 63481	2009-01-31 08:40:03 +00:00
Chris Lattner	3e2cb66c56	simplify/clarify control flow and improve comments, no functionality change. llvm-svn: 63480	2009-01-31 08:24:16 +00:00
Chris Lattner	83c6a141b8	make some fairly meaty internal changes to how SimplifyDemandedBits works. Now, if it detects that "V" is the same as some other value, SimplifyDemandedBits returns the new value instead of RAUW'ing it immediately. This has two benefits: 1) simpler code in the recursive SimplifyDemandedBits routine. 2) it allows future fun stuff in instcombine where an operation has multiple uses and can be simplified in one context, but not all. #2 isn't implemented yet, this patch should have no functionality change. llvm-svn: 63479	2009-01-31 08:15:18 +00:00
Chris Lattner	585cfb2ce7	minor cleanups llvm-svn: 63477	2009-01-31 07:26:06 +00:00
Chris Lattner	94cfb281c3	make sure to set Changed=true when instcombine hacks on the code, not doing so prevents it from properly iterating and prevents it from deleting the entire body of dce-iterate.ll llvm-svn: 63476	2009-01-31 07:04:22 +00:00
Chris Lattner	ec99c46d44	Simplify and generalize the SROA "convert to scalar" transformation to be able to handle ANY alloca that is poked by loads and stores of bitcasts and GEPs with constant offsets. Before the code had a number of annoying limitations and caused it to miss cases such as storing into holes in structs and complex casts (as in bitfield-sroa) where we had unions of bitfields etc. This also handles a number of important cases that are exposed due to the ABI lowering stuff we do to pass stuff by value. One case that is pretty great is that we compile 2006-11-07-InvalidArrayPromote.ll into: define i32 @func(<4 x float> %v0, <4 x float> %v1) nounwind { %tmp10 = call <4 x i32> @llvm.x86.sse2.cvttps2dq(<4 x float> %v1) %tmp105 = bitcast <4 x i32> %tmp10 to i128 %tmp1056 = zext i128 %tmp105 to i256 %tmp.upgrd.43 = lshr i256 %tmp1056, 96 %tmp.upgrd.44 = trunc i256 %tmp.upgrd.43 to i32 ret i32 %tmp.upgrd.44 } which turns into: _func: subl $28, %esp cvttps2dq %xmm1, %xmm0 movaps %xmm0, (%esp) movl 12(%esp), %eax addl $28, %esp ret Which is pretty good code all things considering :). One effect of this is that SROA will start generating arbitrary bitwidth integers that are a multiple of 8 bits. In the case above, we got a 256 bit integer, but the codegen guys assure me that it can handle the simple and/or/shift/zext stuff that we're doing on these operations. This addresses rdar://6532315 llvm-svn: 63469	2009-01-31 02:28:54 +00:00
Chris Lattner	df17987c19	Fix some issues with volatility, move "CanConvertToScalar" check after the others. llvm-svn: 63227	2009-01-28 20:16:43 +00:00
Duncan Sands	5a913d61e3	Rename getAnalysisToUpdate to getAnalysisIfAvailable. llvm-svn: 63198	2009-01-28 13:14:17 +00:00
Mon P Wang	3537a62704	Fixed optimization of combining two shuffles where the first shuffle inputs has a different number of elements than the output. llvm-svn: 62998	2009-01-26 04:39:00 +00:00
Chris Lattner	9449991c4f	Handle single-entry phi nodes gracefully in condprop. llvm-svn: 62985	2009-01-26 02:18:20 +00:00
Chris Lattner	7b6647c178	Fix PR3408 by making a non-obvious assumption very obvious, and handling the flaw inherent in that assumption. :) llvm-svn: 62984	2009-01-26 02:11:30 +00:00
Chris Lattner	57cb472b56	More cleanups and simplifications, no functionality change. llvm-svn: 62983	2009-01-26 01:57:01 +00:00
Chris Lattner	d67aaa6560	tidy asserts llvm-svn: 62982	2009-01-26 01:38:24 +00:00
Torok Edwin	f4395ea97a	testcase for PR3381. Also it was an empty struct, not a void after all. llvm-svn: 62920	2009-01-24 17:16:04 +00:00
Torok Edwin	73ff92272f	void* is represented as pointer to empty struct {}. Thus we need to check whether the struct is empty before trying to index into it. This fixes PR3381. llvm-svn: 62918	2009-01-24 11:30:49 +00:00
Chris Lattner	72cd68fe64	Make InstCombineStoreToCast handle aggregates more aggressively, handling the case in Transforms/InstCombine/cast-store-gep.ll, which is a heavily reduced testcase from Clang on x86-64. llvm-svn: 62904	2009-01-24 01:00:13 +00:00
Gabor Greif	eb61fcf2a1	Simplify the logic of getting hold of a PHI predecessor block. There is now a direct way from value-use-iterator to incoming block in PHINode's API. This way we avoid the iterator->index->iterator trip, and especially the costly getOperandNo() invocation. Additionally there is now an assertion that the iterator really refers to one of the PHI's Uses. llvm-svn: 62869	2009-01-23 19:40:15 +00:00
Chris Lattner	77527f5812	Remove uses of uint32_t in favor of 'unsigned' for better compatibility with cygwin. Patch by Jay Foad! llvm-svn: 62695	2009-01-21 18:09:24 +00:00
Dale Johannesen	b5721632ee	Make special cases (0 inf nan) work for frem. Besides APFloat, this involved removing code from two places that thought they knew the result of frem(0., x) but were wrong. llvm-svn: 62645	2009-01-21 00:35:19 +00:00
Chris Lattner	73d7fe5a34	improve compatibility with cygwin, patch by Jay Foad! llvm-svn: 62535	2009-01-19 22:00:18 +00:00
Chris Lattner	6f34e317e9	Fix PR3353, infinitely jump threading an infinite loop make from switches. llvm-svn: 62529	2009-01-19 21:20:34 +00:00
Chris Lattner	64b7bd7f9e	Fix rdar://6505632, an llc crash on 483.xalancbmk llvm-svn: 62470	2009-01-18 20:35:00 +00:00
Nick Lewycky	3ced0dfa69	Fix copy and pasted typos that prevented strtok_r, realloc, getenv, ungetc, putc, puts, perror, vscanf and vsscanf from getting annotations. Add annotations for eight printf functions, memalign, pread and pwrite. On Linux, llvm-gcc sometimes renames strdup, getc, putc, strtok_r, scanf and sscanf. Match the alternate function names. Fix a crash annotating opendir. Don't mark fsetpos's second parameter as nocapture. It's supposed to be captured. Do mark fopen's path and mode strings as nocapture. Mark ferror as readonly, but not fileno which may set errno. llvm-svn: 62456	2009-01-18 04:34:36 +00:00
Chris Lattner	db2d9613d2	Fix PR3335 by not turning a store to one address space into a store to another. llvm-svn: 62351	2009-01-16 20:12:52 +00:00
Chris Lattner	733256fe31	reduce indentation by using early exits, no functionality change. llvm-svn: 62350	2009-01-16 20:08:59 +00:00
Evan Cheng	beac6f8b0c	Clean up previous cast optimization a bit. Also make zext elimination a bit more aggressive: if it's not necessary to emit an AND (i.e. high bits are already zero), it's profitable to evaluate the operand at a different type. llvm-svn: 62297	2009-01-16 02:11:43 +00:00
Rafael Espindola	6de96a1b5d	Add the private linkage. llvm-svn: 62279	2009-01-15 20:18:42 +00:00
Evan Cheng	ff716cb342	Eliminate a redundant check. llvm-svn: 62264	2009-01-15 17:09:07 +00:00
Evan Cheng	60e19a46f2	- Teach CanEvaluateInDifferentType of this xform: sext (zext ty1), ty2 -> zext ty2 - Looking at the number of sign bits of the a sext instruction to determine whether new trunc + sext pair should be added when its source is being evaluated in a different type. llvm-svn: 62263	2009-01-15 17:01:23 +00:00
Chris Lattner	8fb9480ed2	Fix PR3325, a miscompilation of invokes by IPSCCP. Patch by Jay Foad! llvm-svn: 62244	2009-01-14 21:01:16 +00:00

... 4 5 6 7 8 ...

3545 Commits