llvm-project

Commit Graph

Author	SHA1	Message	Date
Duncan Sands	61c5708b51	Fix the other problem reported in PR8582. Testcase and patch by Nadav Rotem. llvm-svn: 122983	2011-01-06 23:45:22 +00:00
Eric Christopher	e516af753b	Add some fairly duplicated code to let type legalization split illegal typed atomics. This will lower exclusively to libcalls at the moment. llvm-svn: 122979	2011-01-06 22:28:56 +00:00
Chris Lattner	84184b7207	With Benjamin's recent amazing patches, we should be able to do even better things :) llvm-svn: 122978	2011-01-06 22:25:00 +00:00
Chris Lattner	171608e738	use isNullValue() to simplify code, add an assert. llvm-svn: 122977	2011-01-06 22:24:29 +00:00
Devang Patel	70eb982843	Emit 128 bit constant. This fixes PR 8913 crash. llvm-svn: 122971	2011-01-06 21:39:25 +00:00
Bob Wilson	914df82a2e	PR8921: LDM/POP do not support interworking prior to v5t. llvm-svn: 122970	2011-01-06 19:24:41 +00:00
Bob Wilson	e0bafd93b0	Remove extra whitespace. llvm-svn: 122969	2011-01-06 19:24:36 +00:00
Bob Wilson	7c2c626805	Fix comment typo. llvm-svn: 122968	2011-01-06 19:24:32 +00:00
Benjamin Kramer	1e01ade2e8	Add a note from llvmdev, this time with more info. llvm-svn: 122966	2011-01-06 17:35:50 +00:00
Abramo Bagnara	a41d7aebee	Fixed parsing of hex floats. llvm-svn: 122963	2011-01-06 16:55:14 +00:00
Rafael Espindola	9f9a10691a	Correctly disassemble truncated asm. Patch by Richard Simth. llvm-svn: 122962	2011-01-06 16:48:42 +00:00
Benjamin Kramer	ae67cc13a9	InstCombine: Turn _chk functions into the "unsafe" variant if length and max langth are equal. This happens when we take the (non-constant) length from a malloc. llvm-svn: 122961	2011-01-06 14:22:52 +00:00
Benjamin Kramer	605f21a6c8	EarlyCSE does this now (and GVN always did it). llvm-svn: 122960	2011-01-06 13:19:46 +00:00
Benjamin Kramer	799b011276	InstCombine: If we call llvm.objectsize on a malloc call we can replace it with the size passed to malloc. llvm-svn: 122959	2011-01-06 13:11:05 +00:00
Benjamin Kramer	a76cc117e0	InstCombine: Teach llvm.objectsize folding to look through GEPs. llvm-svn: 122958	2011-01-06 13:07:49 +00:00
Benjamin Kramer	3aa955e906	Remove dead code and silence warnings. llvm-svn: 122957	2011-01-06 13:01:02 +00:00
Evan Cheng	7998b1d6fe	Use movups to lower memcpy and memset even if it's not fast (like corei7). The theory is it's still faster than a pair of movq / a quad of movl. This will probably hurt older chips like P4 but should run faster on current and future Intel processors. rdar://8817010 llvm-svn: 122955	2011-01-06 07:58:36 +00:00
Chris Lattner	245de78e06	add a note about object size from drystone, add a poorly optimized loop from 179.art. llvm-svn: 122954	2011-01-06 07:41:22 +00:00
Chris Lattner	73552c2cce	add a trivial instcombine missed in Dhrystone llvm-svn: 122953	2011-01-06 07:09:23 +00:00
Evan Cheng	3ae2b79aa3	Re-implement r122936 with proper target hooks. Now getMaxStoresPerMemcpy etc. takes an option OptSize. If OptSize is true, it would return the inline limit for functions with attribute OptSize. llvm-svn: 122952	2011-01-06 06:52:41 +00:00
Chris Lattner	5858e091a6	implement constant folding support for an exotic constant expr: ret i64 ptrtoint (i8* getelementptr ([1000 x i8]* @X, i64 1, i64 sub (i64 0, i64 ptrtoint ([1000 x i8]* @X to i64))) to i64) to "ret i64 1000". This allows us to correctly compute the trip count on a loop in PR8883, which occurs with std::fill on a char array. This allows us to transform it into a memset with a constant size. llvm-svn: 122950	2011-01-06 06:19:46 +00:00
Evan Cheng	c052ba7ff3	Revert r122936. I'll re-implement the change. llvm-svn: 122949	2011-01-06 06:17:53 +00:00
Cameron Zwarich	9ec19ea06a	Add the CallInst optimizations that don't involve expanding inline assembly to OptimizeInst() so that they can be used on a worklist instruction. llvm-svn: 122945	2011-01-06 02:56:42 +00:00
Cameron Zwarich	d28c78eb4f	Move the GEP handling in CodeGenPrepare to OptimizeInst(). llvm-svn: 122944	2011-01-06 02:44:52 +00:00
Cameron Zwarich	14ac865ca9	Split the optimizations in CodeGenPrepare that don't manipulate the iterators into a separate function, so that it can be called from a loop using a worklist rather than a loop traversing a whole basic block. llvm-svn: 122943	2011-01-06 02:37:26 +00:00
Jakob Stoklund Olesen	70be93a200	Zap the last two -Wself-assign warnings in llvm. Simplify RALinScan::DowngradeRegister with TRI::getOverlaps while we are there. llvm-svn: 122940	2011-01-06 01:33:22 +00:00
Jakob Stoklund Olesen	8e236eac74	Add the SpillPlacement analysis pass. This pass precomputes CFG block frequency information that can be used by the register allocator to find optimal spill code placement. Given an interference pattern, placeSpills() will compute which basic blocks should have the current variable enter or exit in a register, and which blocks prefer the stack. The algorithm is ready to consume block frequencies from profiling data, but for now it gets by with the static estimates used for spill weights. This is a work in progress and still not hooked up to RegAllocGreedy. llvm-svn: 122938	2011-01-06 01:21:53 +00:00
Evan Cheng	06536e7158	r105228 reduced the memcpy / memset inline limit to 4 with -Os to avoid blowing up freebsd bootloader. However, this doesn't make much sense for Darwin, whose -Os is meant to optimize for size only if it doesn't hurt performance. rdar://8821501 llvm-svn: 122936	2011-01-06 01:04:47 +00:00
Evan Cheng	ac730dd2d1	Avoid zero extend bit test operands to pointer type if all the masks fit in the original type of the switch statement key. rdar://8781238 llvm-svn: 122935	2011-01-06 01:02:44 +00:00
Bill Wendling	3b949bcaf3	PR8919 - LLVM incorrectly generates "_alloca" as the stack probing call. That works only on MinGW32. On 64-bit, the function to call is "__chkstk". Patch by KS Sreeram! llvm-svn: 122934	2011-01-06 00:50:34 +00:00
Bill Wendling	81d40711f3	PR8918 - When used with MinGW64, LLVM generates a "calll __main" at the beginning of the "main" function. The assembler complains about the invalid suffix for the 'call' instruction. The right instruction is "callq __main". Patch by KS Sreeram! llvm-svn: 122933	2011-01-06 00:47:10 +00:00
Cameron Zwarich	ce3b930a98	Stop reallocating SunkAddrs for each basic block. When we move to an instruction worklist, the key will need to become std::pair<BasicBlock, Value>. llvm-svn: 122932	2011-01-06 00:42:50 +00:00
Owen Anderson	6f060afbbd	Reorder, rename, and document some members to make this easier to follow. llvm-svn: 122929	2011-01-05 23:26:22 +00:00
Evan Cheng	260acf32ee	Optimize: r1025 = s/zext r1024, 4 r1026 = extract_subreg r1025, 4 to: r1026 = copy r1024 llvm-svn: 122925	2011-01-05 23:06:49 +00:00
Chris Lattner	872908fdeb	fix PR8900, a shuffle miscompilation. Patch by Nadav Rotem! llvm-svn: 122921	2011-01-05 22:28:46 +00:00
Chris Lattner	2d7df02670	silence more self assignment warnings. llvm-svn: 122920	2011-01-05 22:26:52 +00:00
Jakob Stoklund Olesen	f3ac733684	Add a hidden command line option to display edge bundle graphs as they are calculated. llvm-svn: 122912	2011-01-05 21:50:24 +00:00
Jakob Stoklund Olesen	88b4b2794c	Silence a warning from non-standard warning avoidance code. llvm-svn: 122911	2011-01-05 21:50:21 +00:00
Eric Christopher	c673b21a87	80-cols. llvm-svn: 122909	2011-01-05 21:45:56 +00:00
Owen Anderson	e86dacf449	When computing the value on an edge, in certain cases LVI would fail to compute the value range in the predecessor block, leading to an incorrect conclusion for the edge value. Found by inspection. llvm-svn: 122908	2011-01-05 21:37:18 +00:00
Owen Anderson	118ac80c81	Re-convert several of LazyValueInfo's internal maps to Dense{Map\|Set}, and fix the issue in hasBlockValue() that was causing iterator invalidations. Many thanks to Dimitry Andric for tracking down those invalidations! llvm-svn: 122906	2011-01-05 21:15:29 +00:00
Chris Lattner	1b3f5b9f74	fix some -Wself-assign warnings. llvm-svn: 122893	2011-01-05 18:41:05 +00:00
Cameron Zwarich	b62ccb241b	Add some more statistics to CodeGenPrepare. llvm-svn: 122891	2011-01-05 17:47:38 +00:00
Wesley Peck	07fd1efcfa	Commit 122778 broke DWARF debug output when using the MBlaze backend. Fixed by overriding TargetFrameInfo::getFrameIndexOffset to take into account the new frame index information. llvm-svn: 122889	2011-01-05 17:34:20 +00:00
Cameron Zwarich	ced753fadf	Add some stats to CodeGenPrepare to make it easier to speed it up without regressing code quality. llvm-svn: 122887	2011-01-05 17:27:27 +00:00
Michael J. Spencer	61d6f739a8	Support/PathV2: Implement remove_all. llvm-svn: 122884	2011-01-05 16:39:38 +00:00
Michael J. Spencer	98879d7641	Support/Windows/PathV2: Make directory iteration ignore . and .. llvm-svn: 122883	2011-01-05 16:39:30 +00:00
Michael J. Spencer	153749b30b	Support/Windows/PathV2: Fix remove to handle both files and directories. llvm-svn: 122882	2011-01-05 16:39:22 +00:00
Michael J. Spencer	f8dc1868e2	Support/PathV2: Implement directory_entry::status. llvm-svn: 122881	2011-01-05 16:39:13 +00:00
Michael J. Spencer	52714863db	Support/PathV2: Implement directory iteration on POSIX. llvm-svn: 122879	2011-01-05 16:38:57 +00:00
Cameron Zwarich	6a78995369	Use pop_back_val instead of back followed by pop_back. llvm-svn: 122876	2011-01-05 16:08:47 +00:00
Cameron Zwarich	5a2bb998ac	Use a worklist for later iterations just like ordinary instsimplify. The next step is to only process instructions in subloops if they have been modified by an earlier simplification. llvm-svn: 122869	2011-01-05 05:47:47 +00:00
Cameron Zwarich	4c51d122d5	Change LoopInstSimplify back to a LoopPass. It revisits subloops rather than skipping them, but it should probably use a worklist and only revisit those instructions in subloops that have actually changed. It should probably also use a worklist after the first iteration like instsimplify now does. Regardless, it's only 0.3% of opt -O2 time on 403.gcc if it replaces the instcombine placed in the middle of the loop passes. llvm-svn: 122868	2011-01-05 05:15:53 +00:00
Eric Christopher	988518109d	Remove TODO, these appear to be implemented. llvm-svn: 122849	2011-01-04 22:31:50 +00:00
Owen Anderson	7b25ff04bd	Don't bother value numbering instructions with void types in GVN. In theory this should allow us to insert fewer things into the value numbering maps, but any speedup is beneath the noise threshold on my machine on 403.gcc. llvm-svn: 122844	2011-01-04 22:15:21 +00:00
Jakob Stoklund Olesen	01d4d86585	Use the EdgeBundles analysis in X86FloatingPoint instead of recomputing CFG bundles in the pass. llvm-svn: 122833	2011-01-04 21:10:11 +00:00
Jakob Stoklund Olesen	f96ae684c4	Turn the EdgeBundles class into a stand-alone machine CFG analysis pass. The analysis will be needed by both the greedy register allocator and the X86FloatingPoint pass. It only needs to be computed once when the CFG doesn't change. This pass is very fast, usually showing up as 0.0% wall time. llvm-svn: 122832	2011-01-04 21:10:05 +00:00
Dale Johannesen	e45a2389ce	Eliminate a warning compiling with llvm-gcc. (IMO the warning is overzealous but gcc is what it is.) llvm-svn: 122829	2011-01-04 19:31:24 +00:00
Owen Anderson	e39cb57b09	Complete the NumberTable --> LeaderTable rename. llvm-svn: 122828	2011-01-04 19:29:46 +00:00
Owen Anderson	d7d06d3aaf	Fix typo in a comment. llvm-svn: 122827	2011-01-04 19:25:18 +00:00
Owen Anderson	51489b3b28	Prune #include's. llvm-svn: 122826	2011-01-04 19:24:57 +00:00
Owen Anderson	c7c3bc63f7	Clarify terminology, settling on referring to what was the "number table" as the "leader table", and rename methods to make it much more clear what they're doing. llvm-svn: 122823	2011-01-04 19:13:25 +00:00
Owen Anderson	83546f2fe0	When removing a value from GVN's leaders list, don't drop the Next pointer in a corner case. llvm-svn: 122822	2011-01-04 19:10:54 +00:00
Dale Johannesen	a71d2cc88d	Improve the accuracy of the inlining heuristic looking for the case where a static caller is itself inlined everywhere else, and thus may go away if it doesn't get too big due to inlining other things into it. If there are references to the caller other than calls, it will not be removed; account for this. This results in same-day completion of the case in PR8853. llvm-svn: 122821	2011-01-04 19:01:54 +00:00
Owen Anderson	41a1550ef5	Branch instructions don't produce values, so there's no need to generate a value number for them. This avoids adding them to the various value numbering tables, resulting in a minor (~3%) speedup for GVN on 40.gcc. llvm-svn: 122819	2011-01-04 18:54:18 +00:00
Owen Anderson	22c53e277a	Remove commented out code. llvm-svn: 122817	2011-01-04 18:22:08 +00:00
Cameron Zwarich	b2a41e9388	Switch to the new style of asterisk placement. llvm-svn: 122815	2011-01-04 18:19:19 +00:00
Chris Lattner	c86e67e110	fix an off-by-one bug that caused a crash analyzing ashr's with huge shift amounts, PR8896 llvm-svn: 122814	2011-01-04 18:19:15 +00:00
Cameron Zwarich	5cd3d718f6	Switch to path halving from path compression for a small speedup. This also makes getLeader() nonrecursive. llvm-svn: 122811	2011-01-04 16:24:51 +00:00
Duncan Sands	95c4eccbe9	These methods should be "const"; make them so. llvm-svn: 122809	2011-01-04 12:52:29 +00:00
Chris Lattner	8643810ede	Teach loop-idiom to turn a loop containing a memset into a larger memset when safe. The testcase is basically this nested loop: void foo(char X) { for (int i = 0; i != 100; ++i) for (int j = 0; j != 100; ++j) X[j+i100] = 0; } which gets turned into a single memset now. clang -O3 doesn't optimize this yet though due to a phase ordering issue I haven't analyzed yet. llvm-svn: 122806	2011-01-04 07:46:33 +00:00
Chris Lattner	a62b01dc37	restructure this a bit. Initialize the WeakVH with "I", the instruction after the store. The store will always be deleted if the transformation kicks in, so we'd do an N^2 scan of every loop block. Whoops. llvm-svn: 122805	2011-01-04 07:27:30 +00:00
Cameron Zwarich	82e8332a22	Eliminate repeated allocation of a per-BB DenseMap for a 4.6% reduction of time spent in StrongPHIElimination on 403.gcc. llvm-svn: 122803	2011-01-04 06:42:27 +00:00
Cameron Zwarich	f4e13699e7	Avoid finding loop back edges when we are not splitting critical edges in CodeGenPrepare (which is the default behavior). llvm-svn: 122801	2011-01-04 04:43:31 +00:00
Owen Anderson	2e28697c60	Clean up a funky pass registration that got passed over when I got rid of static constructors. llvm-svn: 122795	2011-01-04 00:55:21 +00:00
Andrew Trick	163a24420a	Fix the ARM IIC_iCMPsi itinerary and add an important assert. llvm-svn: 122794	2011-01-04 00:32:57 +00:00
Cameron Zwarich	e924969380	Address most of Duncan's review comments. Also, make LoopInstSimplify a simple FunctionPass. It probably doesn't have a reason to be a LoopPass, as it will probably drop the simple fixed point and either use RPO iteration or Duncan's approach in instsimplify of only revisiting instructions that have changed. The next step is to preserve LoopSimplify. This looks like it won't be too hard, although the pass manager doesn't actually seem to respect when non-loop passes claim to preserve LCSSA or LoopSimplify. This will have to be fixed. llvm-svn: 122791	2011-01-04 00:12:46 +00:00
Chris Lattner	0ba473c218	use the very-handy getTruncateOrZeroExtend helper function, and stop setting NSW: signed overflow is possible. Thanks to Dan for pointing these out. llvm-svn: 122790	2011-01-04 00:06:55 +00:00
Bill Wendling	4466e981f9	Formatting changes. No functionality change. llvm-svn: 122789	2011-01-03 23:59:05 +00:00
Owen Anderson	0839d3930a	Fix comment. llvm-svn: 122788	2011-01-03 23:51:56 +00:00
Owen Anderson	d62d37225a	Use the new addEscapingValue callback to update GlobalsModRef when GVN adds PHIs of GEPs. For the moment, have GlobalsModRef handle this conservatively by simply removing the value from its maps. llvm-svn: 122787	2011-01-03 23:51:43 +00:00
Chris Lattner	bde6ec1db6	Duncan deftly points out that readnone functions aren't invalidated by stores, so they can be handled as 'simple' operations. llvm-svn: 122785	2011-01-03 23:38:13 +00:00
Evan Cheng	65089fc6c7	Use pushq / popq instead of subq $8, %rsp / addq $8, %rsp to adjust stack in prologue and epilogue if the adjustment is 8. Similarly, use pushl / popl if the adjustment is 4 in 32-bit mode. In the epilogue, takes care to pop to a caller-saved register that's not live at the exit (either return or tailcall instruction). rdar://8771137 llvm-svn: 122783	2011-01-03 22:53:22 +00:00
Wesley Peck	6941044c72	Fix more stack layout issues in the MBlaze backend. llvm-svn: 122778	2011-01-03 21:40:26 +00:00
Owen Anderson	b6e4ff0d85	Stub out a new updating interface to AliasAnalysis, allowing stateful analyses to be informed when a pointer value has potentially become escaping. Implementations can choose to either fall back to conservative responses for that value, or may recompute their analysis to accomodate the change. llvm-svn: 122777	2011-01-03 21:38:41 +00:00
Chris Lattner	16e42128c2	fix rdar://8813415 - a miscompilation of 164.gzip that loop-idiom exposed. It turns out to be a latent bug in basicaa, scary. llvm-svn: 122772	2011-01-03 21:03:33 +00:00
Owen Anderson	3a33d0cc4a	Simplify GVN's value expression structure, allowing the elimination of a lot of almost-but-not-quite-identical code. No intended functionality change. llvm-svn: 122760	2011-01-03 19:00:11 +00:00
Chris Lattner	16ca19ffc5	stength reduce my previous patch a bit. The only instructions that are allowed to have metadata operands are intrinsic calls, and the only ones that take metadata currently return void. Just reject all void instructions, which should not be value numbered anyway. To future proof things, add an assert to the getHashValue impl for calls to check that metadata operands aren't present. llvm-svn: 122759	2011-01-03 18:43:03 +00:00
Chris Lattner	142f1cd251	fix PR8895: metadata operands don't have a strong use of their nested values, so they can change and drop to null, which can change the hash and cause havok. It turns out that it isn't a good idea to value number stuff with metadata operands anyway, so... don't. llvm-svn: 122758	2011-01-03 18:28:15 +00:00
Duncan Sands	697de77339	Speed up instsimplify by about 10-15% by not bothering to retry InstructionSimplify on instructions that didn't change since the last time round the loop. llvm-svn: 122745	2011-01-03 10:50:04 +00:00
Cameron Zwarich	43cecb1200	Switch a worklist in CodeGenPrepare to SmallVector and increase the inline capacity on the Visited SmallPtrSet. On 403.gcc, this is about a 4.5% speedup of CodeGenPrepare time (which itself is 10% of time spent in the backend). This is progress towards PR8889. llvm-svn: 122741	2011-01-03 06:33:01 +00:00
Chris Lattner	9e5e9ed79a	earlycse can do trivial with-a-block dead store elimination as well. This deletes 60 stores in 176.gcc that largely come from bitfield code. llvm-svn: 122736	2011-01-03 04:17:24 +00:00
Cameron Zwarich	18f164f7c9	Use a RecyclingAllocator to allocate values for MachineCSE's ScopedHashTable for a 28% speedup of MachineCSE time on 403.gcc. llvm-svn: 122735	2011-01-03 04:07:46 +00:00
Chris Lattner	4b9a525742	switch the load table to use a recycling bump pointer allocator, speeding earlycse up by 6%. llvm-svn: 122733	2011-01-03 03:53:50 +00:00
Chris Lattner	e0e32a9ef0	now that loads are in their own table, we can implement store->load forwarding. This allows EarlyCSE to zap 600 more loads from 176.gcc. llvm-svn: 122732	2011-01-03 03:46:34 +00:00
Chris Lattner	92bb0f9f9d	split loads and calls into separate tables. Loads are now just indexed by their pointer instead of using MemoryValue to wrap it. llvm-svn: 122731	2011-01-03 03:41:27 +00:00
Chris Lattner	4cb365414f	various cleanups, no functionality change. llvm-svn: 122729	2011-01-03 03:28:23 +00:00
Nick Lewycky	0f87ca7733	Add spliceFunction to the CallGraph interface. This allows users to efficiently update a callGraph when performing the common operation of splicing the body to a new function and updating all callers (such as via RAUW). No users yet, though this is intended for DeadArgumentElimination as part of PR8887. llvm-svn: 122728	2011-01-03 03:19:35 +00:00
Chris Lattner	b9a8efc960	Teach EarlyCSE to do trivial CSE of loads and read-only calls. On 176.gcc, this catches 13090 loads and calls, and increases the number of simple instructions CSE'd from 29658 to 36208. llvm-svn: 122727	2011-01-03 03:18:43 +00:00
Chris Lattner	79d83067ee	rename InstValue to SimpleValue, add some comments. llvm-svn: 122725	2011-01-03 02:20:48 +00:00
Michael J. Spencer	edb5bcdde5	CMake: Add missing source file. llvm-svn: 122724	2011-01-03 02:13:05 +00:00
Chris Lattner	d815f69b30	Allocate nodes for the scoped hash table from a recyling bump pointer allocator. This speeds up early cse by about 20% llvm-svn: 122723	2011-01-03 01:42:46 +00:00
Chris Lattner	02a9776b64	reduce redundancy in the hashing code and other misc cleanups. llvm-svn: 122720	2011-01-03 01:10:08 +00:00
Cameron Zwarich	cab9a0abab	Add a new loop-instsimplify pass, with the intention of replacing the instance of instcombine that is currently in the middle of the loop pass pipeline. This commit only checks in the pass; it will hopefully be enabled by default later. llvm-svn: 122719	2011-01-03 00:25:16 +00:00
Chris Lattner	0844c76f9a	fix some pastos llvm-svn: 122718	2011-01-02 23:29:58 +00:00
Chris Lattner	8fac5db251	add DEBUG and -stats output to earlycse. Teach it to CSE the rest of the non-side-effecting instructions. llvm-svn: 122716	2011-01-02 23:19:45 +00:00
Chris Lattner	18ae5436b1	Enhance earlycse to do CSE of casts, instsimplify and die. Add a testcase. llvm-svn: 122715	2011-01-02 23:04:14 +00:00
Chris Lattner	bf0aa927cc	split dom frontier handling stuff out to its own DominanceFrontier header, so that Dominators.h is just domtree. Also prune #includes a bit. llvm-svn: 122714	2011-01-02 22:09:33 +00:00
Chris Lattner	704541bb23	sketch out a new early cse pass. No functionality yet. llvm-svn: 122713	2011-01-02 21:47:05 +00:00
Chris Lattner	9c69406f2b	fix a miscompilation of tramp3d-v4: when forming a memcpy, we have to make sure that the loop we're promoting into a memcpy doesn't mutate the input of the memcpy. Before we were just checking that the dest of the memcpy wasn't mod/ref'd by the loop. llvm-svn: 122712	2011-01-02 21:14:18 +00:00
Chris Lattner	5702a43c09	If a loop iterates exactly once (has backedge count = 0) then don't mess with it. We'd rather peel/unroll it than convert all of its stores into memsets. llvm-svn: 122711	2011-01-02 20:24:21 +00:00
Benjamin Kramer	25e6e06e42	Try to reuse the value when lowering memset. This allows us to compile: void test(char *s, int a) { __builtin_memset(s, a, 15); } into 1 mul + 3 stores instead of 3 muls + 3 stores. llvm-svn: 122710	2011-01-02 19:57:05 +00:00
Benjamin Kramer	2fdea4c8f1	Lower the i8 extension in memset to a multiply instead of a potentially long series of shifts and ors. We could implement a DAGCombine to turn x * 0x0101 back into logic operations on targets that doesn't support the multiply or it is slow (p4) if someone cares enough. Example code: void test(char *s, int a) { __builtin_memset(s, a, 4); } before: _test: ## @test movzbl 8(%esp), %eax movl %eax, %ecx shll $8, %ecx orl %eax, %ecx movl %ecx, %eax shll $16, %eax orl %ecx, %eax movl 4(%esp), %ecx movl %eax, 4(%ecx) movl %eax, (%ecx) ret after: _test: ## @test movzbl 8(%esp), %eax imull $16843009, %eax, %eax ## imm = 0x1010101 movl 4(%esp), %ecx movl %eax, 4(%ecx) movl %eax, (%ecx) ret llvm-svn: 122707	2011-01-02 19:44:58 +00:00
Oscar Fuentes	68b7bb95d4	A workaround for a bug in cmake 2.8.3 diagnosed on PR 8885. llvm-svn: 122706	2011-01-02 19:32:31 +00:00
Nick Lewycky	5361b84184	Also remove functions that use complex constant expressions in terms of another function. llvm-svn: 122705	2011-01-02 19:16:44 +00:00
Chris Lattner	8455b6e45e	enhance loop idiom recognition to scan all unconditionally executed blocks in a loop, instead of just the header block. This makes it more aggressive, able to handle Duncan's Ada examples. llvm-svn: 122704	2011-01-02 19:01:03 +00:00
Chris Lattner	0cdc6f62a5	make inSubLoop much more efficient. llvm-svn: 122703	2011-01-02 18:53:08 +00:00
Chris Lattner	27497ece96	rip out isExitBlockDominatedByBlockInLoop, calling DomTree::dominates instead. isExitBlockDominatedByBlockInLoop is a relic of the days when domtree was just a tree and didn't have DFS numbers. Checking DFS numbers is faster and easier than "limiting the search of the tree". llvm-svn: 122702	2011-01-02 18:45:39 +00:00
Chris Lattner	0469e01c02	add a list of opportunities for future improvement. llvm-svn: 122701	2011-01-02 18:32:09 +00:00
Chris Lattner	51415d26f1	update a bunch of entries. llvm-svn: 122700	2011-01-02 18:31:38 +00:00
Duncan Sands	64f1c0dcda	Fix PR8702 by not having LoopSimplify claim to preserve LCSSA form. As described in the PR, the pass could break LCSSA form when inserting preheaders. It probably would be easy enough to fix this, but since currently we always go into LCSSA form after running this pass, doing so is not urgent. llvm-svn: 122695	2011-01-02 13:38:21 +00:00
Chris Lattner	ddf58010bd	Allow loop-idiom to run on multiple BB loops, but still only scan the loop header for now for memset/memcpy opportunities. It turns out that loop-rotate is successfully rotating loops, but DOESN'T MERGE THE BLOCKS, turning "for loops" into 2 basic block loops that loop-idiom was ignoring. With this fix, we form many many more memcpy and memsets than before, including on the "history" loops in the viterbi benchmark, which look like this: for (j=0; j<MAX_history; ++j) { history_new[i][j+1] = history[2*i][j]; } Transforming these loops into memcpy's speeds up the viterbi benchmark from 11.98s to 3.55s on my machine. Woo. llvm-svn: 122685	2011-01-02 07:58:36 +00:00
Chris Lattner	5b5a043d82	remove debugging code. llvm-svn: 122683	2011-01-02 07:37:13 +00:00
Chris Lattner	12f91befce	add some -stats output. llvm-svn: 122682	2011-01-02 07:36:44 +00:00
Chris Lattner	679572e584	improve loop rotation to use CodeMetrics to analyze the size of a loop header instead of its own code size estimator. This allows it to handle bitcasts etc more precisely. llvm-svn: 122681	2011-01-02 07:35:53 +00:00
Chris Lattner	85b6d81d41	teach loop idiom recognition to form memcpy's from simple loops. llvm-svn: 122678	2011-01-02 03:37:56 +00:00
Nick Lewycky	4e250c8245	Remove functions from the FnSet when one of their callee's is being merged. This maintains the guarantee that the DenseSet expects two elements it contains to not go from inequal to equal under its nose. As a side-effect, this also lets us switch from iterating to a fixed-point to actually maintaining a work queue of functions to look at again, and we don't add thunks to our work queue so we don't need to detect and ignore them. llvm-svn: 122677	2011-01-02 02:46:33 +00:00
Chris Lattner	6c3fc0a52d	a missed __builtin_object_size case. llvm-svn: 122676	2011-01-01 22:57:31 +00:00
Chris Lattner	e5d5a41a58	various updates. llvm-svn: 122675	2011-01-01 22:52:11 +00:00
Chris Lattner	1903c42b97	fix a globalopt crash on two Adobe-C++ testcases that the recent loop idiom pass exposed. llvm-svn: 122674	2011-01-01 22:31:46 +00:00
Rafael Espindola	d606e54757	Add support for the 'H' modifier. llvm-svn: 122667	2011-01-01 20:58:46 +00:00
Anton Korobeynikov	62acecd7e1	Model operand restrictions of mul-like instructions on ARMv5 via earlyclobber stuff. This should fix PRs 2313 and 8157. Unfortunately, no testcase, since it'd be dependent on register assignments. llvm-svn: 122663	2011-01-01 20:38:38 +00:00
Chris Lattner	a3514441e0	add a validity check that was missed, fixing a crash on the new testcase. llvm-svn: 122662	2011-01-01 20:12:04 +00:00
Duncan Sands	772749aea1	Revert commit 122654 at the request of Chris, who reckons that instsimplify is the wrong hammer for this nail, and is probably right. llvm-svn: 122661	2011-01-01 20:08:02 +00:00
Chris Lattner	91a4435875	improve validity check to handle constant-trip-count loops more aggressively. In practice, this doesn't help anything though, see the todo. llvm-svn: 122660	2011-01-01 19:54:22 +00:00
Chris Lattner	8b3baf6d75	implement the "no aliasing accesses in loop" safety check. This pass should be correct now. llvm-svn: 122659	2011-01-01 19:39:01 +00:00
Rafael Espindola	3686473578	Fix PR8878. llvm-svn: 122658	2011-01-01 19:05:35 +00:00
Duncan Sands	e3c539581c	Fix a README item by having InstructionSimplify do a mild form of value numbering, in which it considers (for example) "%a = add i32 %x, %y" and "%b = add i32 %x, %y" to be equal because the operands are equal and the result of the instructions only depends on the values of the operands. This has almost no effect (it removes 4 instructions from gcc-as-one-file), and perhaps slows down compilation: I measured a 0.4% slowdown on the large gcc-as-one-file testcase, but it wasn't statistically significant. llvm-svn: 122654	2011-01-01 16:12:09 +00:00
Che-Liang Chiou	5451fc9195	ptx: remove reg-reg addressing mode and st.const llvm-svn: 122653	2011-01-01 11:58:58 +00:00
Che-Liang Chiou	15e8d2c5e7	ptx: add store instruction llvm-svn: 122652	2011-01-01 10:50:37 +00:00
Oscar Fuentes	a8eb60436b	Add to the list of cmake files the object file, not the asm file. This is necessary for executing the custom command that runs the assember. Fixes PR8877. llvm-svn: 122649	2010-12-31 20:15:37 +00:00
Duncan Sands	2c440fa403	Simplify this pass by using a depth-first iterator to ensure that all operands are visited before the instructions themselves. llvm-svn: 122647	2010-12-31 17:49:05 +00:00
Duncan Sands	6cc7126ed9	Zap dead instructions harder. llvm-svn: 122645	2010-12-31 16:17:54 +00:00
Benjamin Kramer	570dd787a6	Make a bunch of symbols internal. llvm-svn: 122642	2010-12-30 22:34:44 +00:00
Nick Lewycky	ee0432ce08	Add another non-commutable instruction that gas accepts commuted forms for. Fixes PR8861. llvm-svn: 122641	2010-12-30 22:10:49 +00:00
Che-Liang Chiou	3ee0501338	ptx: add state spaces llvm-svn: 122638	2010-12-30 10:41:27 +00:00
Chris Lattner	e240995e41	include the module identifier when emitting this warning, PR8865. llvm-svn: 122637	2010-12-30 02:49:45 +00:00
Chris Lattner	1556522a34	print the right string, thanks for Frits for noticing. llvm-svn: 122636	2010-12-30 01:07:20 +00:00
Cameron Zwarich	2f6dc10ccc	Use getVRegDef() instead of def_iterator. This leads to fewer defs being added with 2-address instructions, for about a 3.5% speedup of StrongPHIElimination on 403.gcc. llvm-svn: 122635	2010-12-30 00:42:23 +00:00
Chris Lattner	bd530a8e83	improve warning message to at least say what the triples are. llvm-svn: 122632	2010-12-29 22:41:18 +00:00
Wesley Peck	1f289afc25	Fix stack layout error in MBlaze backend. llvm-svn: 122631	2010-12-29 19:46:28 +00:00
Daniel Dunbar	ab14a6f174	MC/Mach-O/Thumb: Set the thumb bit in the symbol table. llvm-svn: 122630	2010-12-29 14:14:06 +00:00
Cameron Zwarich	329cd49ce6	None of the other pass names in CodeGen have terminating periods. llvm-svn: 122628	2010-12-29 11:49:10 +00:00
Cameron Zwarich	0507f44669	Instead of processing every instruction when splitting interferences, only process those instructions that define phi sources. This is a 47% speedup of StrongPHIElimination compile time on 403.gcc. llvm-svn: 122627	2010-12-29 11:00:09 +00:00
Venkatraman Govindaraju	0a09160e51	SPARC backend fix: correctly passing arguments through stack llvm-svn: 122626	2010-12-29 05:37:15 +00:00
Cameron Zwarich	bfef075140	Add a missing word to a comment. llvm-svn: 122625	2010-12-29 04:42:39 +00:00
Rafael Espindola	46a5b05207	Correctly encode pcrel\|indirect. llvm-svn: 122624	2010-12-29 04:31:26 +00:00
NAKAMURA Takumi	de8fda8908	CMake: Add disabling optimization on MSVC8 and MSVC10 as workaround for some files in Target/ARM and Target/X86. llvm-svn: 122623	2010-12-29 03:59:27 +00:00
Cameron Zwarich	458fd305d4	Add text explaining an assertion. llvm-svn: 122617	2010-12-29 03:52:51 +00:00
Rafael Espindola	2ae93b2c9c	Remove second return. llvm-svn: 122616	2010-12-29 02:42:33 +00:00
Rafael Espindola	563301dfdb	Fix bug when trying to output uint16_t or uint32_t. llvm-svn: 122615	2010-12-29 02:30:49 +00:00
Rafael Espindola	290d71671e	Implement cfi_def_cfa. Also don't convert to dwarf reg numbers twice. Looks like 6 is a fixed point of that and so the previous tests were OK :-) llvm-svn: 122614	2010-12-29 01:42:56 +00:00
Chris Lattner	1be1fe033c	fix PR8867: a crash handling fp128. Thanks to Nick for the testcase. llvm-svn: 122613	2010-12-29 01:33:36 +00:00
Rafael Espindola	426e68f793	Implement cfi_def_cfa_register. llvm-svn: 122612	2010-12-29 00:26:06 +00:00
Rafael Espindola	86d347dd31	Initial .cfi_offset implementation. llvm-svn: 122611	2010-12-29 00:09:59 +00:00
Cameron Zwarich	6fe33fdd63	Simplify some code in MachineVerifier that was doing the correct thing, but not in the most obvious way. llvm-svn: 122610	2010-12-28 23:45:38 +00:00
Rafael Espindola	6bbfb6c06c	Don't produce a "DW_CFA_advance_loc 0". llvm-svn: 122609	2010-12-28 23:38:03 +00:00
Cameron Zwarich	146666eabb	Revert the optimization in r122596. It is correct for all current targets, but it relies on assumptions that may not be true in the future. llvm-svn: 122608	2010-12-28 23:02:56 +00:00
Venkatraman Govindaraju	2f15503d5a	Multiple SPARC backend fixes: added Y register; updated select_cc, subx, subxcc defs/uses; and fixed CustomInserter. llvm-svn: 122607	2010-12-28 20:39:17 +00:00
Chris Lattner	65a699d4d0	simplify this, isBytewiseValue handles the extra check. We still check for "multiple of a byte" in size to make it clear that the >> 3 below is safe. llvm-svn: 122604	2010-12-28 18:53:48 +00:00
Chris Lattner	102bc01900	add a note from llvmdev llvm-svn: 122603	2010-12-28 18:45:02 +00:00
Rafael Espindola	85d91982ca	Implement .cfi_remember_state and .cfi_restore_state. llvm-svn: 122602	2010-12-28 18:36:23 +00:00
Benjamin Kramer	b6d52b8b64	Cast away "comparison between signed and unsigned integer" warnings. llvm-svn: 122598	2010-12-28 13:52:52 +00:00
Benjamin Kramer	110442d8a0	Fix a signed/unsigned comparison warning. llvm-svn: 122597	2010-12-28 13:05:13 +00:00
Cameron Zwarich	92f6e4290c	Avoid iterating every operand of an instruction in StrongPHIElimination, since we are only interested in the defs when discovering interferences. This is a 28% speedup running StrongPHIElimination on 403.gcc. llvm-svn: 122596	2010-12-28 10:49:33 +00:00
Duncan Sands	496770debc	Pacify the compiler. BestWeight cannot in fact be used uninitialized in this function, but the compiler was warning that it might be when doing a release build. llvm-svn: 122595	2010-12-28 10:07:15 +00:00
Duncan Sands	3784c5ea3c	Fix warning about size potentially being used uninitialized when doing a release build. llvm-svn: 122594	2010-12-28 10:02:42 +00:00
Duncan Sands	5cf10e691b	Silence gcc warning about an unused variable when doing a release build. llvm-svn: 122593	2010-12-28 09:41:15 +00:00
Rafael Espindola	736a35d9ab	Relax address updates in the eh_frame section. llvm-svn: 122591	2010-12-28 05:39:27 +00:00
Rafael Espindola	a75b87b55a	Start adding basic support for emitting the call frame instructions. llvm-svn: 122590	2010-12-28 04:15:37 +00:00
Michael J. Spencer	c581fe7e81	Support/Path: Deprecate Path::hasMagicNumber and replace all uses with fs::has_magic. llvm-svn: 122589	2010-12-28 01:49:21 +00:00
Michael J. Spencer	848f46bb84	Support/PathV2: Implement has_magic. llvm-svn: 122587	2010-12-28 01:49:01 +00:00
Cameron Zwarich	5e5cfbe871	Change an assertion to assert what the code actually relies upon. llvm-svn: 122586	2010-12-27 22:08:42 +00:00
Chris Lattner	cb18bfa3d2	fix some issues Frits noticed, add AliasAnalysis as a dependency llvm-svn: 122585	2010-12-27 18:39:08 +00:00
Rafael Espindola	1de2dd0e5e	Add support for .cfi_lsda. llvm-svn: 122584	2010-12-27 15:56:22 +00:00
Daniel Dunbar	a895c69431	MC/Mach-O/Thumb: Select appropriate relocation types for Thumb. llvm-svn: 122583	2010-12-27 14:49:49 +00:00
Cameron Zwarich	25d046ce68	Land a first cut at StrongPHIElimination. There are only 5 new test failures when running without the verifier, and I have not yet checked them to see if the new results are still correct. There are more verifier failures, but they all seem to be additional occurrences of verifier failures that occur with the existing PHIElimination pass. There are a few obvious issues with the code: 1) It doesn't properly update the register equivalence classes during copy insertion, and instead recomputes them before merging live intervals and renaming registers. I wanted to keep this first patch simple for debugging purposes, but it shouldn't be very hard to do this. 2) It doesn't mix the renaming and live interval merging with the copy insertion process, which leads to a lot of virtual register churn. Virtual registers and live intervals are created, only to later be merged into others. The code should be smarter and only create a new virtual register if there is no existing register in the same congruence class. 3) In one place the code uses a DenseMap per basic block, which is unnecessary heap allocation. There should be an inline storage version of DenseMap. I did a quick compile-time test of running llc on 403.gcc with and without StrongPHIElimination. It is slightly slower with StrongPHIElimination, because the small decrease in the coalescer runtime can't beat the increase in phi elimination runtime. Perhaps fixing the above performance issues will narrow the gap. I also haven't yet run any tests of the quality of the generated code. llvm-svn: 122582	2010-12-27 10:08:19 +00:00
Cameron Zwarich	b95bfe1667	Add knowledge of phi-def and phi-kill valnos to MachineVerifier's predecessor valno verification. The "Different value live out of predecessor" check is incorrect in the case of phi-def valnos, so just skip that check for phi-def valnos and instead check that all of the valnos for predecessors have phi-kill. Fixes PR8863. llvm-svn: 122581	2010-12-27 05:17:23 +00:00
Michael J. Spencer	9e590024f6	Support/PathV1: Deprecate GetRootDirectory. llvm-svn: 122580	2010-12-27 03:21:41 +00:00
Rafael Espindola	8fc59a682f	Handle reloc_riprel_4byte_movq_load. Should make the bots happy. llvm-svn: 122579	2010-12-27 02:03:24 +00:00
Rafael Espindola	2ac8355ecd	Add support for the same encodings of the personality function that gnu as supports. llvm-svn: 122577	2010-12-27 00:36:05 +00:00
Benjamin Kramer	84bd73c527	BuildLibCalls: Nuke EmitMemCpy, EmitMemMove and EmitMemSet. They are dead and superseded by IRBuilder. llvm-svn: 122576	2010-12-27 00:25:32 +00:00
Benjamin Kramer	7cba269dfb	SimplifyLibCalls: Use IRBuilder to simplify code. llvm-svn: 122575	2010-12-27 00:16:46 +00:00
Chris Lattner	b9fe685b9a	have loop-idiom nuke instructions that feed stores that get removed. llvm-svn: 122574	2010-12-27 00:03:23 +00:00
Chris Lattner	29e14edc8d	implement enough of the memset inference algorithm to recognize and insert memsets. This is still missing one important validity check, but this is enough to compile stuff like this: void test0(std::vector<char> &X) { for (std::vector<char>::iterator I = X.begin(), E = X.end(); I != E; ++I) *I = 0; } void test1(std::vector<int> &X) { for (long i = 0, e = X.size(); i != e; ++i) X[i] = 0x01010101; } With: $ clang t.cpp -S -o - -O2 -emit-llvm \| opt -loop-idiom \| opt -O3 \| llc to: __Z5test0RSt6vectorIcSaIcEE: ## @_Z5test0RSt6vectorIcSaIcEE ## BB#0: ## %entry subq $8, %rsp movq (%rdi), %rax movq 8(%rdi), %rsi cmpq %rsi, %rax je LBB0_2 ## BB#1: ## %bb.nph subq %rax, %rsi movq %rax, %rdi callq ___bzero LBB0_2: ## %for.end addq $8, %rsp ret ... __Z5test1RSt6vectorIiSaIiEE: ## @_Z5test1RSt6vectorIiSaIiEE ## BB#0: ## %entry subq $8, %rsp movq (%rdi), %rax movq 8(%rdi), %rdx subq %rax, %rdx cmpq $4, %rdx jb LBB1_2 ## BB#1: ## %for.body.preheader andq $-4, %rdx movl $1, %esi movq %rax, %rdi callq _memset LBB1_2: ## %for.end addq $8, %rsp ret llvm-svn: 122573	2010-12-26 23:42:51 +00:00
Chris Lattner	6cf8d6cc6e	start using irbuilder to make mem intrinsics in a few passes. llvm-svn: 122572	2010-12-26 22:57:41 +00:00
Chris Lattner	143a07cfee	add methods to IRBuilder to create memcpy/memset/memmove. llvm-svn: 122571	2010-12-26 22:49:25 +00:00
Rafael Espindola	54462cd730	Fix .cfi_personality on 32 bit systems. llvm-svn: 122570	2010-12-26 22:47:37 +00:00
Rafael Espindola	9ae2d05d45	Add support for @note. Patch by Jörg Sonnenberger. llvm-svn: 122568	2010-12-26 21:30:59 +00:00
Chris Lattner	7c5f9c35d1	sketch more of this out. llvm-svn: 122567	2010-12-26 20:45:45 +00:00
Rafael Espindola	9141b611ad	Add basic support for .cfi_personality. llvm-svn: 122566	2010-12-26 20:20:31 +00:00
Chris Lattner	9cb1035f94	move isBytewiseValue out to ValueTracking.h/cpp llvm-svn: 122565	2010-12-26 20:15:01 +00:00
Chris Lattner	81ae3f299a	actually add the file... llvm-svn: 122563	2010-12-26 19:39:38 +00:00
Chris Lattner	2ef535a4e4	Start of a pass for recognizing memset and memcpy idioms. No functionality yet. llvm-svn: 122562	2010-12-26 19:32:44 +00:00
Benjamin Kramer	30342fb1fd	Simplify code. llvm-svn: 122561	2010-12-26 15:23:45 +00:00
Chris Lattner	f9e0a56b94	fix some sort of weird pasto llvm-svn: 122560	2010-12-26 12:05:11 +00:00
Chris Lattner	424de3498b	add a note llvm-svn: 122559	2010-12-26 03:53:31 +00:00
Chris Lattner	03a102bff3	Generalize a previous change, fixing PR8855 - an valid large immediate rejected by the mc assembler. llvm-svn: 122557	2010-12-25 21:36:35 +00:00
Chris Lattner	d729d0dcdb	don't lose TD info llvm-svn: 122556	2010-12-25 20:52:04 +00:00
Chris Lattner	20fca48341	switch the inliner alignment enforcement stuff to use the getOrEnforceKnownAlignment function, which simplifies the code and makes it stronger. llvm-svn: 122555	2010-12-25 20:42:38 +00:00
Chris Lattner	6fcd32e7d7	Move getOrEnforceKnownAlignment out of instcombine into Transforms/Utils. llvm-svn: 122554	2010-12-25 20:37:57 +00:00
Michael J. Spencer	e13f1ead9b	Support/PathV1: Deprecate makeAbsolute and remove Unix impl because it annoys people. llvm-svn: 122553	2010-12-25 20:10:11 +00:00
Benjamin Kramer	b90b2f0635	Fix a thinko pointed out by Frits van Bommel: looking through global variables in isBytewiseValue is not safe. llvm-svn: 122550	2010-12-24 22:23:59 +00:00
Rafael Espindola	0f8abeba1d	Merge IsFixupFullyResolved and IsSymbolRefDifferenceFullyResolved. We now have a single point where targets test if a relocation is needed. llvm-svn: 122549	2010-12-24 21:22:02 +00:00
Benjamin Kramer	ea9152e551	MemCpyOpt: Turn memcpys from a constant into a memset if possible. This allows us to compile "int cst[] = {-1, -1, -1};" into movl $-1, 16(%rsp) movq $-1, 8(%rsp) instead of movl _cst+8(%rip), %eax movl %eax, 16(%rsp) movq _cst(%rip), %rax movq %rax, 8(%rsp) llvm-svn: 122548	2010-12-24 21:17:12 +00:00
Daniel Dunbar	7f2bb4dcae	MC/Mach-O/ARM: Start handling some Thumb branches. llvm-svn: 122547	2010-12-24 16:41:46 +00:00
Andrew Trick	5ce945ca3a	Minor cleanup related to my latest scheduler changes. llvm-svn: 122545	2010-12-24 07:10:19 +00:00
Andrew Trick	c94056692a	Fix a few cases where the scheduler is not checking for phys reg copies. The scheduling node may have a NULL DAG node, yuck. llvm-svn: 122544	2010-12-24 06:46:50 +00:00
Andrew Trick	10ffc2b6c2	Various bits of framework needed for precise machine-level selection DAG scheduling during isel. Most new functionality is currently guarded by -enable-sched-cycles and -enable-sched-hazard. Added InstrItineraryData::IssueWidth field, currently derived from ARM itineraries, but could be initialized differently on other targets. Added ScheduleHazardRecognizer::MaxLookAhead to indicate whether it is active, and if so how many cycles of state it holds. Added SchedulingPriorityQueue::HasReadyFilter to allowing gating entry into the scheduler's available queue. ScoreboardHazardRecognizer now accesses the ScheduleDAG in order to get information about it's SUnits, provides RecedeCycle for bottom-up scheduling, correctly computes scoreboard depth, tracks IssueCount, and considers potential stall cycles when checking for hazards. ScheduleDAGRRList now models machine cycles and hazards (under flags). It tracks MinAvailableCycle, drives the hazard recognizer and priority queue's ready filter, manages a new PendingQueue, properly accounts for stall cycles, etc. llvm-svn: 122541	2010-12-24 05:03:26 +00:00
Andrew Trick	c416ba612b	whitespace llvm-svn: 122539	2010-12-24 04:28:06 +00:00
Cameron Zwarich	ab434079d3	Simplify a check for implicit defs and remove a FIXME. llvm-svn: 122537	2010-12-24 03:09:36 +00:00
Kevin Enderby	fa3c6f148c	In llvm-mc parse a Hash token as a full line comment. Allows handling of preprocessed .s files and matches darwin gas. rdar://8798690 Also fix a comment on the next line of AsmParser.cpp after this new code. llvm-svn: 122531	2010-12-24 00:12:02 +00:00
Jim Grosbach	ffaea0f017	Use a StringSwitch<> instead of a manually constructed string matcher. llvm-svn: 122530	2010-12-24 00:03:39 +00:00
Owen Anderson	226ac14afb	When determining if we can fold (x >> C1) << C2, the bits that we need to verify are zero are not the low bits of x, but the bits that WILL be the low bits after the operation completes. llvm-svn: 122529	2010-12-23 23:56:24 +00:00
Evan Cheng	62de0fa671	Code clean up. No functionality change. llvm-svn: 122528	2010-12-23 23:54:17 +00:00
Jim Grosbach	545858d209	Remove dead patterns. llvm-svn: 122524	2010-12-23 23:20:13 +00:00
Jim Grosbach	bcfa4a945a	Recognize a few more documented register name aliases for ARM in the asm lexer. llvm-svn: 122523	2010-12-23 23:19:54 +00:00
Bob Wilson	36be00ceb3	Radar 8803471: Fix expansion of ARM BCCi64 pseudo instructions. If the basic block containing the BCCi64 (or BCCZi64) instruction ends with an unconditional branch, that branch needs to be deleted before appending the expansion of the BCCi64 to the end of the block. llvm-svn: 122521	2010-12-23 22:45:49 +00:00
Owen Anderson	5d690d4168	It is possible for SimplifyCFG to cause PHI nodes to become redundant too late in the optimization pipeline to be caught by instcombine, and it's not feasible to catch them in SimplifyCFG because the use-lists are in an inconsistent state at the point where it could know that it need to simplify them. Instead, have CodeGenPrepare look for trivially redundant PHIs as part of its general cleanup effort. llvm-svn: 122516	2010-12-23 20:57:35 +00:00
Chris Lattner	2a0a3b43d7	Flag -> Glue, the ongoing saga llvm-svn: 122513	2010-12-23 18:28:41 +00:00
Chris Lattner	11a33811b6	flags -> glue for selectiondag llvm-svn: 122509	2010-12-23 17:24:32 +00:00
Chris Lattner	f647e95b9a	sdisel flag -> glue. llvm-svn: 122507	2010-12-23 17:13:18 +00:00
Benjamin Kramer	dfa40f8f19	Remove/fix invalid README entries. The well thought out strcpy function doesn't return a pointer to the end of the string. llvm-svn: 122496	2010-12-23 15:32:07 +00:00
Benjamin Kramer	b37ae33125	Remove some obsolete README items, add a new one off the top of my head. llvm-svn: 122495	2010-12-23 15:07:02 +00:00
Andrew Trick	528fad91d2	Reorganize ListScheduleBottomUp in preparation for modeling machine cycles and instruction issue. llvm-svn: 122491	2010-12-23 05:42:20 +00:00
Andrew Trick	a52f325c35	Converted LiveRegCycles to LiveRegGens. It's easier to work with and allows multiple nodes per cycle. llvm-svn: 122474	2010-12-23 04:16:14 +00:00
Andrew Trick	12acde11cb	In CheckForLiveRegDef use TRI->getOverlaps. llvm-svn: 122473	2010-12-23 03:43:21 +00:00
Andrew Trick	033efdf4d7	Fixes PR8823: add-with-overflow-128.ll In the bottom-up selection DAG scheduling, handle two-address instructions that read/write unspillable registers. Treat the entire chain of two-address nodes as a single live range. llvm-svn: 122472	2010-12-23 03:15:51 +00:00
Mon P Wang	18b762a946	Preserve the address space when generating bitcasts for MemTransferInst in ConvertToScalarInfo llvm-svn: 122462	2010-12-23 01:41:32 +00:00
Jeffrey Yasskin	9b43f33620	Change all self assignments X=X to (void)X, so that we can turn on a new gcc warning that complains on self-assignments and self-initializations. llvm-svn: 122458	2010-12-23 00:58:24 +00:00
Jim Grosbach	8c48503baa	Trailing whitespace. llvm-svn: 122456	2010-12-22 23:26:02 +00:00
Benjamin Kramer	1f4dfbbcb0	DAGCombine add (sext i1), X into sub X, (zext i1) if sext from i1 is illegal. The latter usually compiles into smaller code. example code: unsigned foo(unsigned x, unsigned y) { if (x != 0) y--; return y; } before: _foo: ## @foo cmpl $1, 4(%esp) ## encoding: [0x83,0x7c,0x24,0x04,0x01] sbbl %eax, %eax ## encoding: [0x19,0xc0] notl %eax ## encoding: [0xf7,0xd0] addl 8(%esp), %eax ## encoding: [0x03,0x44,0x24,0x08] ret ## encoding: [0xc3] after: _foo: ## @foo cmpl $1, 4(%esp) ## encoding: [0x83,0x7c,0x24,0x04,0x01] movl 8(%esp), %eax ## encoding: [0x8b,0x44,0x24,0x08] adcl $-1, %eax ## encoding: [0x83,0xd0,0xff] ret ## encoding: [0xc3] llvm-svn: 122455	2010-12-22 23:17:45 +00:00
Benjamin Kramer	8ef5001b27	InstCombine: creating selects from -1 and 0 is fine, they combine into a sext from i1. llvm-svn: 122453	2010-12-22 23:12:15 +00:00
Benjamin Kramer	6020ed9d99	X86: Lower a select directly to a setcc_carry if possible. int test(unsigned long a, unsigned long b) { return -(a < b); } compiles to _test: ## @test cmpq %rsi, %rdi ## encoding: [0x48,0x39,0xf7] sbbl %eax, %eax ## encoding: [0x19,0xc0] ret ## encoding: [0xc3] instead of _test: ## @test xorl %ecx, %ecx ## encoding: [0x31,0xc9] cmpq %rsi, %rdi ## encoding: [0x48,0x39,0xf7] movl $-1, %eax ## encoding: [0xb8,0xff,0xff,0xff,0xff] cmovael %ecx, %eax ## encoding: [0x0f,0x43,0xc1] ret ## encoding: [0xc3] llvm-svn: 122451	2010-12-22 23:09:28 +00:00
Rafael Espindola	4124ab1255	Add r122359 back now that the bug in MCDwarfLineAddrFragment fragment has been fixed. llvm-svn: 122448	2010-12-22 22:16:24 +00:00
Rafael Espindola	d91e11932c	Assert that the AddrDelta expression is really constant and wrap it in a set if we have a lame assembler. llvm-svn: 122446	2010-12-22 22:04:28 +00:00
Jakob Stoklund Olesen	0acb69d53c	When RegAllocGreedy decides to spill the interferences of the current register, pick the victim with the lowest total spill weight. llvm-svn: 122445	2010-12-22 22:01:30 +00:00
Jakob Stoklund Olesen	29836e6572	Include a shadow of the original CFG edges in the edge bundle graph. llvm-svn: 122444	2010-12-22 22:01:28 +00:00
Rafael Espindola	a6cd2d821a	Rename NeedsSetToChangeDiffSize to HasAggressiveSymbolFolding which is a much better name and matches what is used in the MachO writer. llvm-svn: 122443	2010-12-22 21:51:29 +00:00
Daniel Dunbar	ac8173cf71	MC/Mach-O/ARM: Don't try to use scattered relocs for BR24 fixups. llvm-svn: 122441	2010-12-22 21:26:43 +00:00
Rafael Espindola	0e14b61c0b	Revert r122359 while I debug PR8845. llvm-svn: 122427	2010-12-22 19:05:49 +00:00
Matt Beaumont-Gay	6b5d2519b3	Fix another conditional expression mismatched enum type warning. llvm-svn: 122419	2010-12-22 18:25:55 +00:00
Duncan Sands	a45cfbd405	When determining whether the new instruction was already present in the original instruction, half the cases were missed (making it not wrong but suboptimal). Also correct a typo (A <-> B) in the second chunk. llvm-svn: 122414	2010-12-22 17:15:25 +00:00
Daniel Dunbar	45140d2efc	MC/Mach-O/ARM: We always use the SECTDIFF reloc type on ARM, which is esp. important given that the LOCAL_SECTDIFF enumeration got redefined. llvm-svn: 122412	2010-12-22 16:52:19 +00:00
Daniel Dunbar	11617c8666	MC/Mach-O/ARM: Clone off an ARM version of RecordScatteredRelocation until I figure out how it is supposed to work. llvm-svn: 122410	2010-12-22 16:45:29 +00:00
Daniel Dunbar	dde72ef0c1	MC/Mach-O: Return to reporting errors if we see unexpected fixup kinds. llvm-svn: 122409	2010-12-22 16:32:41 +00:00
Daniel Dunbar	a191cfd97e	MC/Mach-O/ARM: Recognize generic _Data_N fixup kinds. llvm-svn: 122408	2010-12-22 16:32:37 +00:00
Daniel Dunbar	9ccf843a61	MC/Mach-O/ARM: Add enough relocation logic to get BR24 relocations. llvm-svn: 122407	2010-12-22 16:19:24 +00:00
Daniel Dunbar	56f13732e2	MC/Mach-O/ARM: Fix thinko. llvm-svn: 122406	2010-12-22 16:19:20 +00:00
Rafael Espindola	50ce2f06de	Use references and simplify. llvm-svn: 122405	2010-12-22 16:11:57 +00:00
Rafael Espindola	73c0ae77ca	Simplify the handling of .size expressions. llvm-svn: 122404	2010-12-22 16:03:00 +00:00
Daniel Dunbar	083132e131	MC/Mach-O/ARM: Stub out RecordARMRelocation, which is mostly a copy of RecordRelocation with lots of FIXMEs. llvm-svn: 122402	2010-12-22 13:50:05 +00:00
Daniel Dunbar	a63db77f2e	Simplify. llvm-svn: 122401	2010-12-22 13:49:56 +00:00
Daniel Dunbar	1e5be3653c	MC/Mach-O: Split out RecordARMRelocation for now, it is weird enough it isn't clear how to keep in the generic path (yet). - Will revisit when it actually works. llvm-svn: 122400	2010-12-22 13:49:43 +00:00
Duncan Sands	fbb9ac3cca	Add a generic expansion transform: A op (B op' C) -> (A op B) op' (A op C) if both A op B and A op C simplify. This fires fairly often but doesn't make that much difference. On gcc-as-one-file it removes two "and"s and turns one branch into a select. llvm-svn: 122399	2010-12-22 13:36:08 +00:00
Che-Liang Chiou	aaedf8be1c	ptx: add ld instruction and test llvm-svn: 122398	2010-12-22 10:38:51 +00:00
Duncan Sands	3547d2ebd8	Add some statistics, good for understanding how much more powerful instcombine is compared to instsimplify. llvm-svn: 122397	2010-12-22 09:40:51 +00:00
Chris Lattner	cafc1e60bb	Fix a bug in ReduceLoadWidth that wasn't handling extending loads properly. We miscompiled the testcase into: _test: ## @test movl $128, (%rdi) movzbl 1(%rdi), %eax ret Now we get a proper: _test: ## @test movl $128, (%rdi) movsbl (%rdi), %eax movzbl %ah, %eax ret This fixes PR8757. llvm-svn: 122392	2010-12-22 08:02:57 +00:00
Chris Lattner	9a499e96eb	more cleanups, move a check for "roundedness" earlier to reject unhanded cases faster and simplify code. llvm-svn: 122391	2010-12-22 08:01:44 +00:00
Chris Lattner	222374d886	reduce indentation and improve comments, no functionality change. llvm-svn: 122389	2010-12-22 07:36:50 +00:00
Wesley Peck	ff2cd2ea6c	Don't generate carry bit when loading immediate values on the Microblaze. llvm-svn: 122385	2010-12-22 01:29:32 +00:00
Wesley Peck	3a3a5795c5	Add support for some of the LLVM atomic operations to the MBlaze backend. llvm-svn: 122384	2010-12-22 01:15:01 +00:00
Wesley Peck	1cd1554ca0	Modeling the carry bit in the MSR register of the MicroBlaze. llvm-svn: 122381	2010-12-22 00:53:07 +00:00
Wesley Peck	8143c61b91	Fix a regression introduced into the MBlaze delay slot filler. llvm-svn: 122379	2010-12-22 00:22:59 +00:00
Owen Anderson	5ab8d4b5e5	Give GVN back the ability to perform simple conditional propagation on conditional branch values. I still think that LVI should be handling this, but that capability is some ways off in the future, and this matters for some significant benchmarks. llvm-svn: 122378	2010-12-21 23:54:34 +00:00
Matt Beaumont-Gay	890cb2d506	GCC objects to the two sides of a conditional expression having different enum types, but they're just getting converted to unsigned anyway, so cast first (and ask questions later). llvm-svn: 122377	2010-12-21 23:43:23 +00:00
Owen Anderson	12470778d7	Remove dead code. llvm-svn: 122371	2010-12-21 22:31:24 +00:00
Andrew Trick	fbb3ed8774	In DelayForLiveRegsBottomUp, handle instructions that read and write the same physical register. Simplifies the fix from the previous checkin r122211. llvm-svn: 122370	2010-12-21 22:27:44 +00:00
Andrew Trick	2085a96513	whitespace llvm-svn: 122368	2010-12-21 22:25:04 +00:00
Dale Johannesen	a94e36bbee	Reapply 122353-122355 with fixes. 122354 was wrong; the shift type was needed one place, the shift count type another. The transform in 123555 had the same problem. llvm-svn: 122366	2010-12-21 21:55:50 +00:00
Benjamin Kramer	f6ddc4a1de	Add some x86 specific dagcombines for conditional increments. (add Y, (sete X, 0)) -> cmp X, 1; adc 0, Y (add Y, (setne X, 0)) -> cmp X, 1; sbb -1, Y (sub (sete X, 0), Y) -> cmp X, 1; sbb 0, Y (sub (setne X, 0), Y) -> cmp X, 1; adc -1, Y for unsigned foo(unsigned a, unsigned b) { if (a == 0) b++; return b; } we now get: foo: cmpl $1, %edi movl %esi, %eax adcl $0, %eax ret instead of: foo: testl %edi, %edi sete %al movzbl %al, %eax addl %esi, %eax ret llvm-svn: 122364	2010-12-21 21:41:44 +00:00
Benjamin Kramer	43493c089f	GVN's Expression is not POD-like (it contains a SmallVector). Simplify code while at it. llvm-svn: 122362	2010-12-21 21:30:19 +00:00
Dale Johannesen	87c47499c6	Revert 122353-122355 for the moment, they broke stuff. llvm-svn: 122360	2010-12-21 21:22:27 +00:00
Rafael Espindola	a468ae02cb	Simplify EvaluateAsAbsolute now that EvaluateAsRelocatableImpl does all the folding it can. llvm-svn: 122359	2010-12-21 20:51:42 +00:00
Rafael Espindola	6bdb49ded8	Don't relax org or align. They change size as the relaxation happens, but they are not actually relaxed. For example, a section with only alignments will never needs relaxation. llvm-svn: 122356	2010-12-21 20:35:18 +00:00
Dale Johannesen	caf42aa6a4	Add a new transform to DAGCombiner. llvm-svn: 122355	2010-12-21 20:10:51 +00:00
Dale Johannesen	fa5dc82fda	Get the type of a shift from the shift, not from its shift count operand. These should be the same but apparently are not always, and this is cleaner anyway. This improves the code in an existing test. llvm-svn: 122354	2010-12-21 20:06:19 +00:00
Dale Johannesen	d64931df77	Shift by the word size is invalid IR; don't create it. llvm-svn: 122353	2010-12-21 20:00:06 +00:00
Chris Lattner	2a7ff99979	fix some typos llvm-svn: 122349	2010-12-21 18:05:22 +00:00
Stuart Hastings	83cce8e7ab	Fix indentation, add comment. llvm-svn: 122345	2010-12-21 17:16:58 +00:00
Duncan Sands	3b8af41a3e	Visit instructions deterministically. Use a FIFO so as to approximately visit instructions before their uses, since InstructionSimplify does a better job in that case. All this prompted by Frits van Bommel. llvm-svn: 122343	2010-12-21 17:08:55 +00:00
Stuart Hastings	8c5bfcaa29	Missing logic for nested CALLSEQ_START/END. llvm-svn: 122342	2010-12-21 17:07:24 +00:00
Duncan Sands	e7cbb64ec0	If an instruction simplifies, try again to simplify any uses of it. This is not very important since the pass is only used for testing, but it does make it more realistic. Suggested by Frits van Bommel. llvm-svn: 122336	2010-12-21 16:12:03 +00:00
Daniel Dunbar	ff78eda678	MC/Mach-O: Shuffle enums a bit to make it harder to inadvertently use the wrong type. llvm-svn: 122334	2010-12-21 15:26:45 +00:00
Duncan Sands	fecc642224	While I don't think any later transforms can fire, it seems cleaner to not assume this (for example in case more transforms get added below it). Suggested by Frits van Bommel. llvm-svn: 122332	2010-12-21 15:03:43 +00:00
Duncan Sands	5def0d6791	Fix inverted condition noticed by Frits van Bommel. llvm-svn: 122331	2010-12-21 14:48:48 +00:00
Duncan Sands	d0eb6d39f8	Pull a few more simplifications out of instcombine (there are still plenty left though!), in particular for multiplication. llvm-svn: 122330	2010-12-21 14:00:22 +00:00
Duncan Sands	ee3ec6eb94	Teach InstructionSimplify about distributive laws. These transforms fire quite often, but don't make much difference in practice presumably because instcombine also knows them and more. llvm-svn: 122328	2010-12-21 13:32:22 +00:00
Duncan Sands	f64e690c4f	Move checking of the recursion limit into the various Thread methods. No functionality change. llvm-svn: 122327	2010-12-21 09:09:15 +00:00
Duncan Sands	6c7a52cf80	Add generic simplification of associative operations, generalizing a couple of existing transforms. This fires surprisingly often, for example when compiling gcc "(X+(-1))+1->X" fires quite a lot as well as various "and" simplifications (usually with a phi node operand). Most of the time this doesn't make a real difference since the same thing would have been done elsewhere anyway, eg: by instcombine, but there are a few places where this results in simplifications that we were not doing before. llvm-svn: 122326	2010-12-21 08:49:00 +00:00
Cameron Zwarich	79ebc7186e	Incremental progress towards a new implementation of StrongPHIElimination. Most of the problems with my last attempt were in the updating of LiveIntervals rather than the coalescing itself. Therefore, I decided to get that right first by essentially reimplementing the existing PHIElimination using LiveIntervals. It works correctly, with only a few tests failing (which may not be legitimate failures) and no new verifier failures (at least as far as I can tell, I didn't count the number per file). llvm-svn: 122321	2010-12-21 06:54:43 +00:00
Bob Wilson	1a20c2aedd	Add ARM-specific DAG combining to cast i64 vector element load/stores to f64. Type legalization splits up i64 values into pairs of i32 values, which leads to poor quality code when inserting or extracting i64 vector elements. If the vector element is loaded or stored, it can be treated as an f64 value and loaded or stored directly from a VPR register. Use the pre-legalization DAG combiner to cast those vector elements to f64 types so that the type legalizer won't mess them up. Radar 8755338. llvm-svn: 122319	2010-12-21 06:43:19 +00:00
Rafael Espindola	98d93c5294	Layout one section until no relaxations are done and then move to the next section. This helps because in practice sections form a dag with debug sections pointing to text sections. Finishing up the text sections first makes the debug section relaxation trivial. llvm-svn: 122314	2010-12-21 04:22:09 +00:00
Eric Christopher	c874f6c9ff	Arm and thumb call instructions are also in different orders. Fixes rdar://8782223 llvm-svn: 122313	2010-12-21 03:50:43 +00:00
Chris Lattner	3e5fbd74ed	rename MVT::Flag to MVT::Glue. "Flag" is a terrible name for something that just glues two nodes together, even if it is sometimes used for flags. llvm-svn: 122310	2010-12-21 02:38:05 +00:00
Eric Christopher	6df5ff8ce6	If we're not using reg+reg offset we're using reg+imm, set the opcode to be the one we want to use. bugpoint reduced testcase is a little large, I'll see if I can simplify it down more. Fixes part of rdar://8782207 llvm-svn: 122307	2010-12-21 02:12:07 +00:00
Chris Lattner	17f906be96	improve "cannot yet select" errors a trivial amount: now they are just as useless, but at least a bit more gramatical llvm-svn: 122305	2010-12-21 02:07:03 +00:00
Bill Wendling	cdcc4fc048	Fix a copy-pasto. When the tBR_JTr instruction was converted to using the tPseudoInst class, its size was changed from "special" to "2 bytes". This is incorrect because the jump table will no longer be taken into account when calculating branch offsets. <rdar://problem/8782216> llvm-svn: 122303	2010-12-21 01:57:15 +00:00
Bill Wendling	18581a4ac0	Comment cleanups. llvm-svn: 122302	2010-12-21 01:54:40 +00:00
Jakob Stoklund Olesen	2530cd2a4c	Add EdgeBundles to SplitKit. Edge bundles is an annotation on the CFG that turns it into a bipartite directed graph where each basic block is connected to an outgoing and an ingoing bundle. These bundles are useful for identifying regions of the CFG for live range splitting. llvm-svn: 122301	2010-12-21 01:50:21 +00:00
Jakob Stoklund Olesen	4c278f82c8	Use IntEqClasses to compute connected components of live intervals. llvm-svn: 122296	2010-12-21 00:48:17 +00:00
Jakob Stoklund Olesen	baee655c5e	Add ADT/IntEqClasses.h as a light-weight implementation of EquivalenceClasses.h. This implementation already exists as ConnectedVNInfoEqClasses in LiveInterval.cpp, and it seems to be generally useful to have a light-weight way of forming equivalence classes of small integers. IntEqClasses doesn't allow enumeration of the elements in a class. llvm-svn: 122293	2010-12-21 00:04:46 +00:00
Owen Anderson	c6beda80ff	Speculatively revert the use of DenseMap in LazyValueInfo, which may be causing Linux self-host failures. llvm-svn: 122291	2010-12-20 23:53:19 +00:00
Owen Anderson	9be3ec6264	Attempt to appease the DragonEgg buildbots. llvm-svn: 122288	2010-12-20 23:23:18 +00:00
Owen Anderson	aa81496c76	Revert r122114 (CallbackVH observing use-list changes) because it caused severe slowdowns on the Linux self-host configuration. llvm-svn: 122279	2010-12-20 22:28:03 +00:00
Nate Begeman	4b9db07b02	Implement feedback from Bruno on making pblendvb an x86-specific ISD node in addition to being an intrinsic, and convert lowering to use it. Hopefully the pattern fragment is doing the right thing with XMM0, looks correct in testing. llvm-svn: 122277	2010-12-20 22:04:24 +00:00
Owen Anderson	813a2c45a8	Convert one of LVI's primary maps to a DenseMap, now that we know are more assured of iterator stability. llvm-svn: 122273	2010-12-20 21:30:54 +00:00
Wesley Peck	ae58e7b179	Teach the MBlaze disassembler to disassemble special purpose registers. llvm-svn: 122269	2010-12-20 21:18:04 +00:00
Roman Divacky	55184ddd35	Set the value of absolute symbols. llvm-svn: 122268	2010-12-20 21:14:39 +00:00
Duncan Sands	eaff500c7b	Oops, forgot to add the pass itself! llvm-svn: 122265	2010-12-20 21:07:42 +00:00
Duncan Sands	a436cbe4bf	Add a new convenience pass for testing InstructionSimplify. Previously it could only be tested indirectly, via instcombine, gvn or some other pass that makes use of InstructionSimplify, which means that testcases had to be carefully contrived to dance around any other transformations that that pass did. llvm-svn: 122264	2010-12-20 20:54:37 +00:00
Wesley Peck	3307d7cbad	Teach the MBlaze asm parser how to parse special purpose register names. llvm-svn: 122261	2010-12-20 20:43:24 +00:00
Dale Johannesen	0a291a36f2	Cosmetic changes. llvm-svn: 122259	2010-12-20 20:10:50 +00:00
Benjamin Kramer	f7957d0463	Add a check missing from my last commit and avoid a potential overflow situation. llvm-svn: 122258	2010-12-20 20:00:31 +00:00
Owen Anderson	d83f98a51e	More LVI cleanups, including trying to simplify the process of maintaining the OverDefinedCache. llvm-svn: 122256	2010-12-20 19:33:41 +00:00
Owen Anderson	64c2c5798a	Reuse the reference into the LVI cache throughout the solver subsystem. This is much easier to verify as being safe thanks its recent de-recursivization. llvm-svn: 122254	2010-12-20 18:18:16 +00:00
Benjamin Kramer	2bca3a67b3	Reduce indentation. llvm-svn: 122249	2010-12-20 16:21:59 +00:00
Benjamin Kramer	68531baea9	Teach InstCombine to merge (icmp ult (X + CA), C1) \| (icmp eq X, C2) into (icmp ult (X + CA), C1 + 1) if C2 + CA == C1. InstCombine creates these so now we compile x == 23 \|\| x == 24 \|\| x == 25 to %x.off = add i32 %x, -23 %1 = icmp ult i32 %x.off, 3 instead of %x.off = add i32 %x, -23 %1 = icmp ult i32 %x.off, 2 %cmp3 = icmp eq i32 %x, 25 %ret2 = or i1 %1, %cmp3 llvm-svn: 122248	2010-12-20 16:18:51 +00:00
Daniel Dunbar	ca2511d849	Add header... llvm-svn: 122247	2010-12-20 15:45:51 +00:00
Daniel Dunbar	7da045e59f	X86/MC/Mach-O: Split out createX86MachObjectWriter(). llvm-svn: 122246	2010-12-20 15:07:39 +00:00
Duncan Sands	ed6d6c33dd	Have SimplifyBinOp dispatch Xor, Add and Sub to the corresponding methods (they had just been forgotten before). Adding Xor causes "main" in the existing testcase 2010-11-01-lshr-mask.ll to be hugely more simplified. llvm-svn: 122245	2010-12-20 14:47:04 +00:00
Michael J. Spencer	6d4b7e7e83	Support/PathV2: Add missing has_relative_path impl. llvm-svn: 122243	2010-12-20 13:30:28 +00:00
Duncan Sands	70db5e7cb2	There is no need for isAssociative to take the type as an argument anymore. llvm-svn: 122242	2010-12-20 13:10:23 +00:00
Chris Lattner	27ca8ebd4b	fix PR8807 by making transformConstExprCastCall aware of byval arguments. llvm-svn: 122238	2010-12-20 08:36:38 +00:00
Chris Lattner	7398965b67	various cleanups for transformConstExprCastCall llvm-svn: 122237	2010-12-20 08:25:06 +00:00
Chris Lattner	0f11495289	when eliding a byval copy due to inlining a readonly function, we have to make sure that the reused alloca has sufficient alignment. llvm-svn: 122236	2010-12-20 08:10:40 +00:00
Chris Lattner	0099744506	pull byval processing out to its own helper function. llvm-svn: 122235	2010-12-20 07:57:41 +00:00
Chris Lattner	7394680a00	fix PR8769, a miscompilation by inliner when inlining a function with a byval argument. The generated alloca has to have at least the alignment of the byval, if not, the client may be making assumptions that the new alloca won't satisfy. llvm-svn: 122234	2010-12-20 07:45:28 +00:00
Cameron Zwarich	4ffda706d0	MachineVerifier should count landing pad successors as basic blocks rather than out-edges. Fixes PR8824. llvm-svn: 122228	2010-12-20 04:19:48 +00:00
Cameron Zwarich	660bce67f3	Teach MachineVerifier that early clobber defs begin at USE slots and other defs begin at DEF slots. Fixes the second half of PR8813. llvm-svn: 122225	2010-12-20 03:15:20 +00:00
Cameron Zwarich	bc2461c5f9	Add a missing check from r122218. llvm-svn: 122224	2010-12-20 02:59:51 +00:00
Chris Lattner	0b3ca50ebb	implement type legalization promotion support for SMULO and UMULO, giving ARM (and other 32-bit-only) targets support for i8 and i16 overflow multiplies. The generated code isn't great, but this at least fixes CodeGen/Generic/overflow.ll when running on ARM hosts. llvm-svn: 122221	2010-12-20 02:05:39 +00:00
Chris Lattner	5c00d41688	now that addc/adde are gone, "ADDC" in the X86 backend uses EFLAGS results, the same as setcc. Optimize ADDC(0,0,FLAGS) -> SET_CARRY(FLAGS). This is a step towards finishing off PR5443. In the testcase in that bug we now get: movq %rdi, %rax addq %rsi, %rax sbbq %rcx, %rcx testb $1, %cl setne %dl ret instead of: movq %rdi, %rax addq %rsi, %rax movl $0, %ecx adcq $0, %rcx testq %rcx, %rcx setne %dl ret llvm-svn: 122219	2010-12-20 01:37:09 +00:00
Cameron Zwarich	fc0c6b1ea9	Don't assume that an instruction ending a register's live range always reads the register; it may be a dead def instead. Fixes PR8820. llvm-svn: 122218	2010-12-20 01:22:37 +00:00
Chris Lattner	46b9efcad7	We lower setb to sbb with the hope that the and will go away, when it doesn't, match it back to setb. On a 64-bit version of the testcase before we'd get: movq %rdi, %rax addq %rsi, %rax sbbb %dl, %dl andb $1, %dl ret now we get: movq %rdi, %rax addq %rsi, %rax setb %dl ret llvm-svn: 122217	2010-12-20 01:16:03 +00:00
Mon P Wang	1991c47ec1	Avoid dropping the address space when InstCombine optimizes memset llvm-svn: 122215	2010-12-20 01:05:30 +00:00
Chris Lattner	9c26d2711b	use for loop over types. llvm-svn: 122214	2010-12-20 01:03:27 +00:00
Chris Lattner	846c20d4e6	Change the X86 backend to stop using the evil ADDC/ADDE/SUBC/SUBE nodes (which their carry depenedencies with MVT::Flag operands) and use clean and beautiful EFLAGS dependences instead. We do this by changing the modelling of SBB/ADC to have EFLAGS input and outputs (which is what requires the previous scheduler change) and change X86 ISelLowering to custom lower ADDC and friends down to X86ISD::ADD/ADC/SUB/SBB nodes. With the previous series of changes, this causes no changes in the testsuite, woo. llvm-svn: 122213	2010-12-20 00:59:46 +00:00
Chris Lattner	981afd206b	Fix a bug in the scheduler's handling of "unspillable" vregs. Imagine we see: EFLAGS = inst1 EFLAGS = inst2 FLAGS gpr = inst3 EFLAGS Previously, we would refuse to schedule inst2 because it clobbers the EFLAGS of the predecessor. However, it also uses the EFLAGS of the predecessor, so it is safe to emit. SDep edges ensure that the right order happens already anyway. This fixes 2 testsuite crashes with the X86 patch I'm going to commit next. llvm-svn: 122211	2010-12-20 00:55:43 +00:00
Chris Lattner	0cfe884874	the result of CheckForLiveRegDef is dead, remove it. llvm-svn: 122209	2010-12-20 00:51:56 +00:00

... 5 6 7 8 9 ...

44946 Commits