llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	8bcc971174	Make MemoryBuiltins aware of TargetLibraryInfo. This disables malloc-specific optimization when -fno-builtin (or -ffreestanding) is specified. This has been a problem for a long time but became more severe with the recent memory builtin improvements. Since the memory builtin functions are used everywhere, this required passing TLI in many places. This means that functions that now have an optional TLI argument, like RecursivelyDeleteTriviallyDeadFunctions, won't remove dead mallocs anymore if the TLI argument is missing. I've updated most passes to do the right thing. Fixes PR13694 and probably others. llvm-svn: 162841	2012-08-29 15:32:21 +00:00
Benjamin Kramer	dd62d6b6c8	GVN: Fix quadratic runtime on the number of switch cases. No intended behavior change. This was introduced in r162023. With the fixed algorithm a Release build of ARMInstPrinter.cpp goes from 16s to 10s on a 2011 MBP. llvm-svn: 162559	2012-08-24 15:06:28 +00:00
Rafael Espindola	cc80cdebb9	Teach GVN to reason about edges dominating uses. This allows it to handle cases where some fact lake a=b dominates a use in a phi, but doesn't dominate the basic block itself. This feature could also be implemented by splitting critical edges, but at least with the current algorithm reasoning about the dominance directly is faster. The time for running "opt -O2" in the testcase in pr10584 is 1.003 times slower and on gcc as a single file it is 1.0007 times faster. llvm-svn: 162023	2012-08-16 15:09:43 +00:00
Rafael Espindola	64e7b5703e	Constify some basic blocks, no functionality change. llvm-svn: 161668	2012-08-10 15:55:25 +00:00
Nadav Rotem	465834c85f	Clean whitespaces. llvm-svn: 160668	2012-07-24 10:51:42 +00:00
Chandler Carruth	aafe0918bc	Move llvm/Support/IRBuilder.h -> llvm/IRBuilder.h This was always part of the VMCore library out of necessity -- it deals entirely in the IR. The .cpp file in fact was already part of the VMCore library. This is just a mechanical move. I've tried to go through and re-apply the coding standard's preferred header sort, but at 40-ish files, I may have gotten some wrong. Please let me know if so. I'll be committing the corresponding updates to Clang and Polly, and Duncan has DragonEgg. Thanks to Bill and Eric for giving the green light for this bit of cleanup. llvm-svn: 159421	2012-06-29 12:38:19 +00:00
Nuno Lopes	55fff83422	refactor the MemoryBuiltin analysis: - provide more extensive set of functions to detect library allocation functions (e.g., malloc, calloc, strdup, etc) - provide an API to compute the size and offset of an object pointed by Move a few clients (GVN, AA, instcombine, ...) to the new API. This implementation is a lot more aggressive than each of the custom implementations being replaced. Patch reviewed by Nick Lewycky and Chandler Carruth, thanks. llvm-svn: 158919	2012-06-21 15:45:28 +00:00
Hal Finkel	16ddd4b66b	Move the Metadata merging methods from GVN and make them public in MDNode. There are other passes, BBVectorize specifically, that also need some of this functionality. llvm-svn: 158605	2012-06-16 20:33:37 +00:00
Rafael Espindola	47d988c54c	When gvn decides to replace an instruction with another, we have to patch the replacement to make it at least as generic as the instruction being replaced. This includes: * dropping nsw/nuw flags * getting the least restrictive tbaa and fpmath metadata * merging ranges Fixes PR12979. llvm-svn: 157958	2012-06-04 22:44:21 +00:00
Duncan Sands	4df5e96d3a	Fix PR12858, a crash due to GVN's PRE not fully removing an instruction from the leader table. That's because it wasn't expecting instructions to turn up as leader for a value number that is not its own, but equality propagation could create this situation. One solution is to have the leader table use a WeakVH but this slows down GVN by about 5%. Instead just have equality propagation not add instructions to the leader table, only constants and arguments. In theory this might cause GVN to run more (each time it changes something it runs again) but it doesn't seem to occur enough to cause a slow down. llvm-svn: 157251	2012-05-22 14:17:53 +00:00
David Blaikie	84e4b39995	Change recurse depth limit to uint32 to fix warning. llvm-svn: 155727	2012-04-27 19:30:32 +00:00
Mon P Wang	6120cfb8cd	Add an early bailout to IsValueFullyAvailableInBlock from deeply nested blocks. The limit is set to an arbitrary 1000 recursion depth to avoid stack overflow issues. <rdar://problem/11286839>. llvm-svn: 155722	2012-04-27 18:09:28 +00:00
Duncan Sands	d12b18f820	Make GVN's propagateEquality non-recursive. No intended functionality change. The modifications are a lot more trivial than they appear to be in the diff! llvm-svn: 154174	2012-04-06 15:31:09 +00:00
Jakob Stoklund Olesen	4e55044ff5	Don't PRE compares. CodeGenPrepare sinks compare instructions down to their uses to prevent live flags and predicate registers across basic blocks. PRE of a compare instruction prevents that, forcing the i1 compare result into a general purpose register. That is usually more expensive than the redundant compare PRE was trying to eliminate in the first place. llvm-svn: 153657	2012-03-29 17:22:39 +00:00
Duncan Sands	a11ef6e4ea	When propagating equalities, eg replacing A with B in every basic block dominated by Root, check that B is available throughout the scope. This is obviously true (famous last words?) given the current logic, but the check may be helpful if more complicated reasoning is added one day. llvm-svn: 153323	2012-03-23 08:45:52 +00:00
Stepan Dyatkovskiy	97b02fc1b3	llvm::SwitchInst Renamed methods caseBegin, caseEnd and caseDefault with case_begin, case_end, and case_default. Added some notes relative to case iterators. llvm-svn: 152532	2012-03-11 06:09:17 +00:00
Stepan Dyatkovskiy	5b648afb4d	Taken into account Duncan's comments for r149481 dated by 2nd Feb 2012: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20120130/136146.html Implemented CaseIterator and it solves almost all described issues: we don't need to mix operand/case/successor indexing anymore. Base iterator class is implemented as a template since it may be initialized either from "const SwitchInst" or from "SwitchInst". ConstCaseIt is just a read-only iterator. CaseIt is read-write iterator; it allows to change case successor and case value. Usage of iterator allows totally remove resolveXXXX methods. All indexing convertions done automatically inside the iterator's getters. Main way of iterator usage looks like this: SwitchInst SI = ... // intialize it somehow for (SwitchInst::CaseIt i = SI->caseBegin(), e = SI->caseEnd(); i != e; ++i) { BasicBlock BB = i.getCaseSuccessor(); ConstantInt *V = i.getCaseValue(); // Do something. } If you want to convert case number to TerminatorInst successor index, just use getSuccessorIndex iterator's method. If you want initialize iterator from TerminatorInst successor index, use CaseIt::fromSuccessorIndex(...) method. There are also related changes in llvm-clients: klee and clang. llvm-svn: 152297	2012-03-08 07:06:20 +00:00
Duncan Sands	3eb328574e	This is not a common case, in fact it never happens! llvm-svn: 152027	2012-03-05 12:23:00 +00:00
Chandler Carruth	e134d1a336	Replace the ad-hoc hashing in GVN with the new hashing infrastructure. This implicitly fixes a nasty bug in the GVN hashing (that thankfully could only manifest as a performance bug): actually include the opcode in the hash. The old code started the hash off with the opcode, but then overwrote it with the type pointer. Since this is likely to be pretty hot (GVN being already pretty expensive) I've included a micro-optimization to just not bother with the varargs hashing if they aren't present. I can't measure any change in GVN performance due to this, even with a big test case like Duncan's sqlite one. Everything I see is in the noise floor. That said, this closes a loop hole for a potential scaling problem due to collisions if the opcode were the differentiating aspect of the expression. llvm-svn: 152025	2012-03-05 11:29:54 +00:00
Duncan Sands	4d928e7dff	Nick pointed out on IRC that GVN's propagateEquality wasn't propagating equalities into phi node operands for which the equality is known to hold in the incoming basic block. That's because replaceAllDominatedUsesWith wasn't handling phi nodes correctly in general (that this didn't give wrong results was just luck: the specific way GVN uses replaceAllDominatedUsesWith precluded wrong changes to phi nodes). llvm-svn: 152006	2012-03-04 13:25:19 +00:00
Duncan Sands	bb2fe65542	Have GVN also do condition propagation when the right-hand side is not a constant. This fixes PR1768. llvm-svn: 151713	2012-02-29 11:12:03 +00:00
Duncan Sands	9edea84420	Micro-optimization, no functionality change. llvm-svn: 151524	2012-02-27 12:11:41 +00:00
Duncan Sands	1be25a78f7	The value numbering function is recursive, so it is possible for multiple new value numbers to be assigned when calculating any particular value number. Enhance the logic that detects new value numbers to take this into account, for a tiny compile time speedup. Fix a comment typo while there. llvm-svn: 151522	2012-02-27 09:54:35 +00:00
Duncan Sands	27f459519d	When performing a conditional branch depending on the value of a comparison %cmp (eg: A==B) we already replace %cmp with "true" under the true edge, and with "false" under the false edge. This change enhances this to replace the negated compare (A!=B) with "false" under the true edge and "true" under the false edge. Reported to improve perlbench results by 1%. llvm-svn: 151517	2012-02-27 08:14:30 +00:00
Duncan Sands	926d101640	Teach GVN that x+y is the same as y+x and that x<y is the same as y>x. llvm-svn: 151365	2012-02-24 15:16:31 +00:00
Duncan Sands	0920308a7e	Use Use::set rather than finding the operand number of the use and setting that. llvm-svn: 150074	2012-02-08 14:10:53 +00:00
Duncan Sands	9066fb5c43	Neaten up this method. Check that if there is only one predecessor then it's Src. llvm-svn: 149843	2012-02-05 19:43:37 +00:00
Duncan Sands	12efb16b01	Fix a thinko pointed out by Eli and the buildbots. llvm-svn: 149839	2012-02-05 18:56:50 +00:00
Duncan Sands	4b613497f0	Reduce the number of dom queries made by GVN's conditional propagation logic by half: isOnlyReachableViaThisEdge was trying to be clever and handle the case of a branch to a basic block which is contained in a loop. This costs a domtree lookup and is completely useless due to GVN's position in the pass pipeline: all loops have preheaders at this point, which means it is enough for isOnlyReachableViaThisEdge to check that Dst has only one predecessor. (I checked this theoretical argument by running over the entire nightly testsuite, and indeed it is so!). llvm-svn: 149838	2012-02-05 18:25:50 +00:00
Duncan Sands	268903955c	Reduce the number of non-trivial domtree queries by about 1% when compiling sqlite3, by only doing dom queries after the cheap check rather than interleaved with it. llvm-svn: 149836	2012-02-05 15:50:43 +00:00
Stepan Dyatkovskiy	513aaa5691	SwitchInst refactoring. The purpose of refactoring is to hide operand roles from SwitchInst user (programmer). If you want to play with operands directly, probably you will need lower level methods than SwitchInst ones (TerminatorInst or may be User). After this patch we can reorganize SwitchInst operands and successors as we want. What was done: 1. Changed semantics of index inside the getCaseValue method: getCaseValue(0) means "get first case", not a condition. Use getCondition() if you want to resolve the condition. I propose don't mix SwitchInst case indexing with low level indexing (TI successors indexing, User's operands indexing), since it may be dangerous. 2. By the same reason findCaseValue(ConstantInt*) returns actual number of case value. 0 means first case, not default. If there is no case with given value, ErrorIndex will returned. 3. Added getCaseSuccessor method. I propose to avoid usage of TerminatorInst::getSuccessor if you want to resolve case successor BB. Use getCaseSuccessor instead, since internal SwitchInst organization of operands/successors is hidden and may be changed in any moment. 4. Added resolveSuccessorIndex and resolveCaseIndex. The main purpose of these methods is to see how case successors are really mapped in TerminatorInst. 4.1 "resolveSuccessorIndex" was created if you need to level down from SwitchInst to TerminatorInst. It returns TerminatorInst's successor index for given case successor. 4.2 "resolveCaseIndex" converts low level successors index to case index that curresponds to the given successor. Note: There are also related compatability fix patches for dragonegg, klee, llvm-gcc-4.0, llvm-gcc-4.2, safecode, clang. llvm-svn: 149481	2012-02-01 07:49:51 +00:00
Bill Wendling	e5f4a6d904	Increase the initial vector size to be equivalent to the size of the Deps vector. This potentially saves a resizing. llvm-svn: 149369	2012-01-31 07:04:52 +00:00
Bill Wendling	8a33312948	Cache the size of the vector instead of calling .size() all over the place. llvm-svn: 149368	2012-01-31 06:57:53 +00:00
Chad Rosier	6a0baa8f09	Typo. llvm-svn: 149289	2012-01-30 22:44:13 +00:00
Chad Rosier	41003f819c	Typo. llvm-svn: 149275	2012-01-30 21:13:22 +00:00
Chad Rosier	c24b86ffbe	Propagate TargetLibraryInfo throughout ConstantFolding.cpp and InstructionSimplify.cpp. Other fixups as needed. Part of rdar://10500969 llvm-svn: 145559	2011-12-01 03:08:23 +00:00
Duncan Sands	f537a6edd4	Don't replace all dominated uses if there is only one use, since that use can't be dominated, saving one domtree lookup. llvm-svn: 142066	2011-10-15 11:13:42 +00:00
Eli Friedman	c1702c8f22	Enhance the memdep interface so that users can tell the difference between a dependency which cannot be calculated and a path reaching the entry point of the function. This patch introduces isNonFuncLocal, which replaces isUnknown in some cases. Patch by Xiaoyi Guo. llvm-svn: 141896	2011-10-13 22:14:57 +00:00
Duncan Sands	c52af46484	Teach GVN to also propagate switch cases. For example, in this code switch (n) { case 27: do_something(x); ... } the call do_something(x) will be replaced with do_something(27). In gcc-as-one-big-file this results in the removal of about 500 lines of bitcode (about 0.02%), so has about 1/10 of the effect of propagating branch conditions. llvm-svn: 141360	2011-10-07 08:29:06 +00:00
Duncan Sands	f4f47ccd12	GVN does simple propagation of conditions: when it sees a conditional branch "br i1 %x, label %if_true, label %if_false" then it replaces "%x" with "true" in places only reachable via the %if_true arm, and with "false" in places only reachable via the %if_false arm. Except that actually it doesn't: if value numbering shows that %y is equal to %x then, yes, %y will be turned into true/false in this way, but any occurrences of %x itself are not transformed. Fix this. What's more, it's often the case that %x is an equality comparison such as "%x = icmp eq %A, 0", in which case every occurrence of %A that is only reachable via the %if_true arm can be replaced with 0. Implement this and a few other variations on this theme. This reduces the number of lines of LLVM IR in "GCC as one big file" by 0.2%. It has a bigger impact on Ada code, typically reducing the number of lines of bitcode by around 0.4% by removing repeated compiler generated checks. Passes the LLVM nightly testsuite and the Ada ACATS testsuite. llvm-svn: 141177	2011-10-05 14:28:49 +00:00
Duncan Sands	e90dd0587e	Generalize GVN's conditional propagation logic slightly: it's OK for the false/true destination to have multiple predecessors as long as the extra ones are dominated by the branch destination. llvm-svn: 141176	2011-10-05 14:17:01 +00:00
Benjamin Kramer	547b6c5ecd	Stop emitting instructions with the name "tmp" they eat up memory and have to be uniqued, without any benefit. If someone prefers %tmp42 to %42, run instnamer. llvm-svn: 140634	2011-09-27 20:39:19 +00:00
Jakub Staszak	7470fb01d0	Compare type size instead of type _store_ size to make sure that BitCastInst will be valid. This fixes PR10820. llvm-svn: 139005	2011-09-02 14:57:37 +00:00
Eli Friedman	9a468153e1	Atomic load/store handling for the passes using memdep (GVN, DSE, memcpyopt). llvm-svn: 137888	2011-08-17 22:22:24 +00:00
Bill Wendling	8bbcbedeaf	Disable PRE for landing pads. PRE needs the landing pads to have their critical edges split. Doing this for a landing pad is non-trivial. Abandon the attempt to perform PRE when we come across a landing pad. (Reviewed by Owen!) llvm-svn: 137876	2011-08-17 21:32:02 +00:00
Jay Foad	ed8db7d9df	Convert ConstantExpr::getGetElementPtr and ConstantExpr::getInBoundsGetElementPtr to use ArrayRef. llvm-svn: 135673	2011-07-21 14:31:17 +00:00
Chris Lattner	229907cd11	land David Blaikie's patch to de-constify Type, with a few tweaks. llvm-svn: 135375	2011-07-18 04:54:35 +00:00
Lang Hames	266dab7bab	Added recognition for signed add/sub/mul with overflow intrinsics to GVN as per Chris and Frits suggestion. llvm-svn: 134777	2011-07-09 00:25:11 +00:00
Lang Hames	29cd98fd52	Make GVN look through extractvalues for recognised intrinsics. GVN can then CSE ops that match values produced by the intrinsics. llvm-svn: 134677	2011-07-08 01:50:54 +00:00
Jay Foad	372ad64b4d	Make better use of the PHINode API. Change various bits of code to make better use of the existing PHINode API, to insulate them from forthcoming changes in how PHINodes store their operands. llvm-svn: 133434	2011-06-20 14:18:48 +00:00
Eli Friedman	7d58bc7bc0	Add "unknown" results for memdep, which mean "I don't know whether a dependence for the given instruction exists in the given block". This cleans up all the existing hacks in memdep which represent this concept by returning clobber with various unrelated instructions. llvm-svn: 133031	2011-06-15 00:47:34 +00:00
Chris Lattner	f0d59072de	fix PR9841 by having GVN not process dead loads. This was causing it to get into infinite loops when it would widen a load (which can necessarily leave around dead loads). llvm-svn: 131847	2011-05-22 07:03:34 +00:00
Devang Patel	341b38c22a	Preserve line number information. llvm-svn: 131482	2011-05-17 20:00:02 +00:00
Devang Patel	c5933f2418	Set debug loc for new load instruction. llvm-svn: 131481	2011-05-17 19:43:38 +00:00
Devang Patel	ffb798c1c6	Set debug loc for new instructions. llvm-svn: 130895	2011-05-04 23:58:50 +00:00
Chris Lattner	a5452c0d67	improve comment. llvm-svn: 130426	2011-04-28 20:02:57 +00:00
Chris Lattner	1777601a74	final step needed to resolve PR6627, which allows us to flatten the code down to a nice and tidy: %x1 = load i32* %0, align 4 %1 = icmp eq i32 %x1, 1179403647 br i1 %1, label %if.then, label %if.end instead of doing lots of loads and branches. May the FreeBSD bootloader long fit in its allocated space. llvm-svn: 130416	2011-04-28 18:15:47 +00:00
Chris Lattner	45e393fc9c	code cleanups only. llvm-svn: 130414	2011-04-28 18:08:21 +00:00
Chris Lattner	f81f789b6c	centralize "marking for deletion" into a helper function. Pass GVN around to static functions instead of passing around tons of random ivars. llvm-svn: 130403	2011-04-28 16:36:48 +00:00
Chris Lattner	6cec6ab275	Promote toErase to be an ivar of the GVN class. llvm-svn: 130401	2011-04-28 16:18:52 +00:00
Chris Lattner	827a270a2a	teach GVN to widen integer loads when they are overaligned, when doing an wider load would allow elimination of subsequent loads, and when the wider load is still a native integer type. This eliminates a ton of loads on various benchmarks involving struct fields, though it is somewhat hobbled by clang not being very aggressive about field alignment. This is yet another step along the way towards resolving PR6627. llvm-svn: 130390	2011-04-28 07:29:08 +00:00
Chris Lattner	eb045f9c02	Improve the bail-out predicate to really only kick in when phi translation fails. We were bailing out in some cases that would cause us to miss GVN'ing some non-local cases away. llvm-svn: 130206	2011-04-26 17:41:02 +00:00
Chris Lattner	6f83d06ffa	Enhance MemDep: When alias analysis returns a partial alias result, return it as a clobber. This allows GVN to do smart things. Enhance GVN to be smart about the case when a small load is clobbered by a larger overlapping load. In this case, forward the value. This allows us to compile stuff like this: int test(void P) { int tmp = (unsigned int)P; return tmp+((unsigned char*)P+1); } into: _test: ## @test movl (%rdi), %ecx movzbl %ch, %eax addl %ecx, %eax ret which has one load. We already handled the case where the smaller load was from a must-aliased base pointer. llvm-svn: 130180	2011-04-26 01:21:15 +00:00
Jay Foad	52131344a2	Remove PHINode::reserveOperandSpace(). Instead, add a parameter to PHINode::Create() giving the (known or expected) number of operands. llvm-svn: 128537	2011-03-30 11:28:46 +00:00
Jay Foad	e0938d8a87	(Almost) always call reserveOperandSpace() on newly created PHINodes. llvm-svn: 128535	2011-03-30 11:19:20 +00:00
Dan Gohman	0f124e1987	Give GetUnderlyingObject a TargetData, to keep it in sync with BasicAA's DecomposeGEPExpression, which recently began using a TargetData. This fixes PR8968, though the testcase is awkward to reduce. Also, update several off GetUnderlyingObject's users which happen to have a TargetData handy to pass it in. llvm-svn: 124134	2011-01-24 18:53:32 +00:00
Jakob Stoklund Olesen	12cc296bd4	Remove the PR8954 workaround. llvm-svn: 123288	2011-01-11 22:56:41 +00:00
Cameron Zwarich	cb9c4f85ec	Dial back the speculative fix for PR8954 a bit, so that we only recompute dominators once at the beginning of GVN instead of once per iteration. llvm-svn: 123278	2011-01-11 22:14:42 +00:00
Cameron Zwarich	51eb403907	Attempt to fix the bootstrap buildbot. Rafael says this works for him on x86-64 Linux. llvm-svn: 123270	2011-01-11 20:23:34 +00:00
Chris Lattner	193ce7c4d1	update memdep when an instruction is deleted. This code isn't actually reached in the testcase in PR8954, but it's safe and good practice. llvm-svn: 123224	2011-01-11 08:19:16 +00:00
Chris Lattner	f6ae904e34	Fix FoldSingleEntryPHINodes to update memdep and AA when it deletes phi nodes. It is called from MergeBlockIntoPredecessor which is called from GVN, which claims to preserve these. I'm skeptical that this is the actual problem behind PR8954, but this is a stab in the right direction. llvm-svn: 123222	2011-01-11 08:13:40 +00:00
Owen Anderson	7b25ff04bd	Don't bother value numbering instructions with void types in GVN. In theory this should allow us to insert fewer things into the value numbering maps, but any speedup is beneath the noise threshold on my machine on 403.gcc. llvm-svn: 122844	2011-01-04 22:15:21 +00:00
Owen Anderson	e39cb57b09	Complete the NumberTable --> LeaderTable rename. llvm-svn: 122828	2011-01-04 19:29:46 +00:00
Owen Anderson	d7d06d3aaf	Fix typo in a comment. llvm-svn: 122827	2011-01-04 19:25:18 +00:00
Owen Anderson	51489b3b28	Prune #include's. llvm-svn: 122826	2011-01-04 19:24:57 +00:00
Owen Anderson	c7c3bc63f7	Clarify terminology, settling on referring to what was the "number table" as the "leader table", and rename methods to make it much more clear what they're doing. llvm-svn: 122823	2011-01-04 19:13:25 +00:00
Owen Anderson	83546f2fe0	When removing a value from GVN's leaders list, don't drop the Next pointer in a corner case. llvm-svn: 122822	2011-01-04 19:10:54 +00:00
Owen Anderson	41a1550ef5	Branch instructions don't produce values, so there's no need to generate a value number for them. This avoids adding them to the various value numbering tables, resulting in a minor (~3%) speedup for GVN on 40.gcc. llvm-svn: 122819	2011-01-04 18:54:18 +00:00
Owen Anderson	22c53e277a	Remove commented out code. llvm-svn: 122817	2011-01-04 18:22:08 +00:00
Owen Anderson	d62d37225a	Use the new addEscapingValue callback to update GlobalsModRef when GVN adds PHIs of GEPs. For the moment, have GlobalsModRef handle this conservatively by simply removing the value from its maps. llvm-svn: 122787	2011-01-03 23:51:43 +00:00
Owen Anderson	3a33d0cc4a	Simplify GVN's value expression structure, allowing the elimination of a lot of almost-but-not-quite-identical code. No intended functionality change. llvm-svn: 122760	2011-01-03 19:00:11 +00:00
Chris Lattner	bf0aa927cc	split dom frontier handling stuff out to its own DominanceFrontier header, so that Dominators.h is just domtree. Also prune #includes a bit. llvm-svn: 122714	2011-01-02 22:09:33 +00:00
Owen Anderson	5ab8d4b5e5	Give GVN back the ability to perform simple conditional propagation on conditional branch values. I still think that LVI should be handling this, but that capability is some ways off in the future, and this matters for some significant benchmarks. llvm-svn: 122378	2010-12-21 23:54:34 +00:00
Owen Anderson	12470778d7	Remove dead code. llvm-svn: 122371	2010-12-21 22:31:24 +00:00
Benjamin Kramer	43493c089f	GVN's Expression is not POD-like (it contains a SmallVector). Simplify code while at it. llvm-svn: 122362	2010-12-21 21:30:19 +00:00
Chris Lattner	b6252a376a	tidy up llvm-svn: 122190	2010-12-19 20:24:28 +00:00
Dan Gohman	4467aa5294	Preserve TBAA tags when doing load PRE. llvm-svn: 121921	2010-12-15 23:53:55 +00:00
Dan Gohman	a4fcd2418d	Move Value::getUnderlyingObject to be a standalone function so that it can live in Analysis instead of VMCore. llvm-svn: 121885	2010-12-15 20:02:24 +00:00
Chris Lattner	e28618de59	move GetPointerBaseWithConstantOffset out of GVN into ValueTracking.h llvm-svn: 120476	2010-11-30 22:25:26 +00:00
Chris Lattner	50162e3c2a	remove a fixed fixme llvm-svn: 120474	2010-11-30 22:18:11 +00:00
Owen Anderson	ea326db47b	Document the new GVN number table structure. llvm-svn: 119865	2010-11-19 22:48:40 +00:00
Owen Anderson	c21c100f3d	Completely rework the datastructure GVN uses to represent the value number to leader mapping. Previously, this was a tree of hashtables, and a query recursed into the table for the immediate dominator ad infinitum if the initial lookup failed. This led to really bad performance on tall, narrow CFGs. We can instead replace it with what is conceptually a multimap of value numbers to leaders (actually represented by a hashtable with a list of Value*'s as the value type), and then determine which leader from that set to use very cheaply thanks to the DFS numberings maintained by DominatorTree. Because there are typically few duplicates of a given value, this scan tends to be quite fast. Additionally, we use a custom linked list and BumpPtr allocation to avoid any unnecessary allocation in representing the value-side of the multimap. This change brings with it a 15% (!) improvement in the total running time of GVN on 403.gcc, which I think is pretty good considering that includes all the "real work" being done by MemDep as well. The one downside to this approach is that we can no longer use GVN to perform simple conditional progation, but that seems like an acceptable loss since we now have LVI and CorrelatedValuePropagation to pick up the slack. If you see conditional propagation that's not happening, please file bugs against LVI or CVP. llvm-svn: 119714	2010-11-18 18:32:40 +00:00
Duncan Sands	72313843d5	Remove dead code in GVN: now that SimplifyInstruction is called systematically, CollapsePhi will always return null here. Note that CollapsePhi did an extra check, isSafeReplacement, which the SimplifyInstruction logic does not do. I think that check was bogus - I guess we will soon find out! (It was originally added in commit 41998 without a testcase). llvm-svn: 119456	2010-11-17 04:05:21 +00:00
Duncan Sands	b99f39b9f6	If dom tree information is available, make it possible to pass it to get better phi node simplification. llvm-svn: 119055	2010-11-14 18:36:10 +00:00
Duncan Sands	246b71c596	Have GVN simplify instructions as it goes. For example, consider "%z = %x and %y". If GVN can prove that %y equals %x, then it turns this into "%z = %x and %x". With the new code, %z will be replaced with %x everywhere (and then deleted). Previously %z would be value numbered too, which is a waste of time. Also, while a clever value numbering algorithm would give %z the same value number as %x, our current one doesn't do so (at least I don't think it does). The new logic has an essentially equivalent effect to what you would get if %z was given the same value number as %x, i.e. it should make value numbering smarter. While there, get hold of target data once at the start rather than a gazillion times all over the place. llvm-svn: 118923	2010-11-12 21:10:24 +00:00
Dan Gohman	65316d6749	Add helper functions for computing the Location of load, store, and vaarg instructions. llvm-svn: 118845	2010-11-11 21:50:19 +00:00
Dan Gohman	0a6021a54d	Enhance GVN to do more precise alias queries for non-local memory references. For example, this allows gvn to eliminate the load in this example: void foo(int n, int* p, int q) { p[0] = 0; p[1] = 1; if (n) { q = p[0]; } } llvm-svn: 118714	2010-11-10 20:37:15 +00:00
Dan Gohman	d209911642	Use getValueOperand() and getPointerOperand() on load and store instructions instead of hard-coding operand numbers. llvm-svn: 118698	2010-11-10 19:03:33 +00:00
Owen Anderson	6c18d1aac0	Get rid of static constructors for pass registration. Instead, every pass exposes an initializeMyPassFunction(), which must be called in the pass's constructor. This function uses static dependency declarations to recursively initialize the pass's dependencies. Clients that only create passes through the createFooPass() APIs will require no changes. Clients that want to use the CommandLine options for passes will need to manually call the appropriate initialization functions in PassInitialization.h before parsing commandline arguments. I have tested this with all standard configurations of clang and llvm-gcc on Darwin. It is possible that there are problems with the static dependencies that will only be visible with non-standard options. If you encounter any crash in pass registration/creation, please send the testcase to me directly. llvm-svn: 116820	2010-10-19 17:21:58 +00:00
Owen Anderson	8ac477ffb5	Begin adding static dependence information to passes, which will allow us to perform initialization without static constructors AND without explicit initialization by the client. For the moment, passes are required to initialize both their (potential) dependencies and any passes they preserve. I hope to be able to relax the latter requirement in the future. llvm-svn: 116334	2010-10-12 19:48:12 +00:00
Owen Anderson	df7a4f2515	Now with fewer extraneous semicolons! llvm-svn: 115996	2010-10-07 22:25:06 +00:00
Owen Anderson	13a642da0b	Now that the profitable bits of EnableFullLoadPRE have been enabled by default, rip out the remainder. Anyone interested in more general PRE would be better served by implementing it separately, to get real anticipation calculation, etc. llvm-svn: 115337	2010-10-01 20:02:55 +00:00
Owen Anderson	3170a25a84	We do want to allow LoadPRE to perform LICM-like transformations: we already consider PHI nodes to be negligible for code size (making this transform code size neutral), and it allows us to hoist values out of loops, which is always a good thing. llvm-svn: 115205	2010-09-30 20:53:04 +00:00
Owen Anderson	b590a927cd	LoadPRE was not properly checking that the load it was PRE'ing post-dominated the block it was being hoisted to. Splitting critical edges at the merge point only addressed part of the issue; it is also possible for non-post-domination to occur when the path from the load to the merge has branches in it. Unfortunately, full anticipation analysis is time-consuming, so for now approximate it. This is strictly more conservative than real anticipation, so we will miss some cases that real PRE would allow, but we also no longer insert loads into paths where they didn't exist before. :-) This is a very slight net positive on SPEC for me (0.5% on average). Most of the benchmarks are largely unaffected, but when it pays off it pays off decently: 181.mcf improves by 4.5% on my machine. llvm-svn: 114785	2010-09-25 05:26:18 +00:00
Chris Lattner	65b48b5dfc	zap dead code. llvm-svn: 113073	2010-09-04 18:12:00 +00:00
Duncan Sands	6778149f7e	Reapply commit 112699, speculatively reverted by echristo, since I'm sure it is harmless. Original commit message: If PrototypeValue is erased in the middle of using the SSAUpdator then the SSAUpdator may access freed memory. Instead, simply pass in the type and name explicitly, which is all that was used anyway. llvm-svn: 112810	2010-09-02 08:14:03 +00:00
Eric Christopher	a5d315c665	Speculatively revert 112699 and 112702, they seem to be causing self host errors on clang-x86-64. llvm-svn: 112719	2010-09-01 17:29:10 +00:00
Duncan Sands	f7b18437b5	If PrototypeValue is erased in the middle of using the SSAUpdator then the SSAUpdator may access freed memory. Instead, simply pass in the type and name explicitly, which is all that was used anyway. llvm-svn: 112699	2010-09-01 10:29:33 +00:00
Chris Lattner	b50407f104	remove dead proto llvm-svn: 112408	2010-08-29 04:53:24 +00:00
Owen Anderson	0398607714	Don't attempt the PRE inline asm calls, since we don't value number them yet. Fixes PR7835. llvm-svn: 110489	2010-08-07 00:20:35 +00:00
Owen Anderson	a7aed18624	Reapply r110396, with fixes to appease the Linux buildbot gods. llvm-svn: 110460	2010-08-06 18:33:48 +00:00
Owen Anderson	bda59bd247	Revert r110396 to fix buildbots. llvm-svn: 110410	2010-08-06 00:23:35 +00:00
Owen Anderson	755aceb5d0	Don't use PassInfo* as a type identifier for passes. Instead, use the address of the static ID member as the sole unique type identifier. Clean up APIs related to this change. llvm-svn: 110396	2010-08-05 23:42:04 +00:00
Gabor Greif	dde79d8f1a	mass elimination of reliance on automatic iterator dereferencing llvm-svn: 109103	2010-07-22 13:36:47 +00:00
Owen Anderson	a57b97e7e7	Fix batch of converting RegisterPass<> to INTIALIZE_PASS(). llvm-svn: 109045	2010-07-21 22:09:45 +00:00
Gabor Greif	d323f5e161	cache result of operator* (found by inspection) llvm-svn: 107971	2010-07-09 14:48:08 +00:00
Gabor Greif	b0d56ffc85	cache result of operator* llvm-svn: 107969	2010-07-09 14:36:49 +00:00
Gabor Greif	f628ecd15f	use getNumArgOperands instead of getNumOperands llvm-svn: 107272	2010-06-30 09:17:53 +00:00
Gabor Greif	2d958d4db5	use ArgOperand API llvm-svn: 106730	2010-06-24 10:17:17 +00:00
Gabor Greif	5bcaa55761	use callsite to obtain all arguments llvm-svn: 106729	2010-06-24 10:04:07 +00:00
Dan Gohman	d2d1ae105d	Use pre-increment instead of post-increment when the result is not used. llvm-svn: 106542	2010-06-22 15:08:57 +00:00
Dan Gohman	826bdf8c10	Move FindAvailableLoadedValue isSafeToLoadUnconditionally out of lib/Transforms/Utils and into lib/Analysis so that Analysis passes can use them. llvm-svn: 104949	2010-05-28 16:19:17 +00:00
Chris Lattner	5a62d6e578	Fix PR7052, patch by Jakub Staszak! llvm-svn: 103347	2010-05-08 20:01:44 +00:00
Bob Wilson	0c8b29bcdb	Use the right version of "append" to combine two SmallVectors. This fixes the compile-time regressions seen in last night's tests. llvm-svn: 103118	2010-05-05 20:44:15 +00:00
Bob Wilson	a2fda8b648	Defer adding critical edges to the "toSplit" list until after checking for indirect branches in all the predecessors. This avoids unnecessarily splitting edges in cases where load PRE is not possible anyway. Thanks to Jakub Staszak for pointing this out. llvm-svn: 103034	2010-05-04 20:03:21 +00:00
Eric Christopher	7258dcd77f	Revert 101465, it broke internal OpenGL testing. Probably the best way to know that all getOperand() calls have been handled is to replace that API instead of updating. llvm-svn: 101579	2010-04-16 23:37:20 +00:00
Gabor Greif	f375520f7b	reapply r101434 with a fix for self-hosting rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101465	2010-04-16 15:33:14 +00:00
Gabor Greif	403e9694f9	back out r101423 and r101397, they break llvm-gcc self-host on darwin10 llvm-svn: 101434	2010-04-16 01:16:20 +00:00
Gabor Greif	33ae80bff7	reapply r101364, which has been backed out in r101368 with a fix rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101397	2010-04-15 20:51:13 +00:00
Dan Gohman	b29cda9b3c	Fix a bunch of namespace polution. llvm-svn: 101376	2010-04-15 17:08:50 +00:00
Gabor Greif	9fd00c7d25	back out r101364, as it trips the linux nightlybot on some clang C++ tests llvm-svn: 101368	2010-04-15 12:46:56 +00:00
Gabor Greif	aafd209632	rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101364	2010-04-15 10:49:53 +00:00
Gabor Greif	ce6dd889ec	const-ize a predicate llvm-svn: 100856	2010-04-09 10:57:00 +00:00
Chris Lattner	0563804982	fix PR6642, GVN forwarding from memset to load of the base of the memset. llvm-svn: 99488	2010-03-25 05:58:19 +00:00
Bob Wilson	0fd415820b	Don't attempt load PRE when there is no real redundancy (i.e., the load is in a loop and is itself the only dependency). llvm-svn: 97526	2010-03-02 00:09:29 +00:00
Bob Wilson	892432b7ef	When GVN needs to split critical edges for load PRE, check all of the predecessors before returning. Otherwise, if multiple predecessor edges need splitting, we only get one of them per iteration. This makes a small but measurable compile time improvement with -enable-full-load-pre. llvm-svn: 97521	2010-03-01 23:37:32 +00:00
Evan Cheng	7263cf8431	MemoryDepAnalysis is not used if redundant load processing is disabled. llvm-svn: 97512	2010-03-01 22:23:12 +00:00
Bob Wilson	1136166ee9	Revert r97245 which seems to be causing performance problems. llvm-svn: 97366	2010-02-28 05:34:05 +00:00
Bob Wilson	ed1b0c31a7	Move the EnableFullLoadPRE flag from a separate command-line option to an argument of createGVNPass and set it automatically for -O3. llvm-svn: 97245	2010-02-26 19:09:47 +00:00
Bob Wilson	d4655991c3	Remove unused "NoPRE" parameter in GVN and createGVNPass(). llvm-svn: 97235	2010-02-26 18:35:19 +00:00
Daniel Dunbar	693ea89214	Reapply r97010, the speculative revert failed. llvm-svn: 97036	2010-02-24 08:48:04 +00:00
Daniel Dunbar	0a2031e5b6	Speculatively revert r97010, "Add an argument to PHITranslateValue to specify the DominatorTree. ...", in hopes of restoring poor old PPC bootstrap. llvm-svn: 97027	2010-02-24 06:55:22 +00:00
Bob Wilson	66e58ac742	Add an argument to PHITranslateValue to specify the DominatorTree. If this argument is non-null, pass it along to PHITranslateSubExpr so that it can prefer using existing values that dominate the PredBB, instead of just blindly picking the first equivalent value that it finds on a uselist. Also when the DominatorTree is specified, have PHITranslateValue filter out any result that does not dominate the PredBB. This is basically just refactoring the check that used to be in GetAvailablePHITranslatedSubExpr and also in GVN. Despite my initial expectations, this change does not affect the results of GVN for any testcases that I could find, but it should help compile time. Before this change, if PHITranslateSubExpr picked a value that does not dominate, PHITranslateWithInsertion would then insert a new value, which GVN would later determine to be redundant and would replace. By picking a good value to begin with, we save GVN the extra work of inserting and then replacing a new value. llvm-svn: 97010	2010-02-24 01:39:00 +00:00
Bob Wilson	923261bbe9	Update memdep when load PRE inserts a new load, and add some debug output. I don't have a small testcase for this. llvm-svn: 96890	2010-02-23 05:55:00 +00:00
Bob Wilson	1da9041913	Erase deleted instructions from GVN's ValueTable. This fixes assertion failures from ValueTable::verifyRemoved() when using -debug. llvm-svn: 96805	2010-02-22 21:39:41 +00:00
Bob Wilson	aff96b2132	Rename SuccessorNumber to GetSuccessorNumber. llvm-svn: 96387	2010-02-16 21:06:42 +00:00
Bob Wilson	92cdb6eec5	Split critical edges as needed for load PRE. llvm-svn: 96378	2010-02-16 19:51:59 +00:00
Bob Wilson	3de492ec35	Refactor to share code to find the position of a basic block successor in the terminator's list of successors. llvm-svn: 96377	2010-02-16 19:49:17 +00:00
Duncan Sands	19d0b47b1f	There are two ways of checking for a given type, for example isa<PointerType>(T) and T->isPointerTy(). Convert most instances of the first form to the second form. Requested by Chris. llvm-svn: 96344	2010-02-16 11:11:14 +00:00
Devang Patel	03936a1880	Ignore dbg info intrinsics. llvm-svn: 95828	2010-02-11 00:20:49 +00:00
Bob Wilson	76e8c59509	Fix some comment typos. llvm-svn: 95170	2010-02-03 00:33:21 +00:00
Bob Wilson	d517b52012	Add an option to GVN to remove all partially redundant loads. This is currently disabled by default. This divides the existing load PRE code into 2 phases: first it checks that it is safe to move the load to each of the predecessors where it is unavailable, and then if it is safe, the code is changed to move the load. Radar 7571861. llvm-svn: 95007	2010-02-01 21:17:14 +00:00
Bob Wilson	56600a15ad	Check alignment of loads when deciding whether it is safe to execute them unconditionally. Besides checking the offset, also check that the underlying object is aligned as much as the load itself. llvm-svn: 94875	2010-01-30 04:42:39 +00:00
Bob Wilson	7c42b9d51e	Improve isSafeToLoadUnconditionally to recognize that GEPs with constant indices are safe if the result is known to be within the bounds of the underlying object. llvm-svn: 94829	2010-01-29 19:19:08 +00:00
Bob Wilson	70c8fe5e4e	Remove check for an impossible condition: the condition of the while loop has already checked that TmpBB->getSinglePredecessor() is non-null. llvm-svn: 94451	2010-01-25 21:28:05 +00:00
Owen Anderson	cdea3572fa	Convert some of the dynamic opcode lookups into static ones. llvm-svn: 93693	2010-01-17 19:33:27 +00:00
Dan Gohman	28943873e6	Use do+while instead of while for loops which obviously have a non-zero trip count. Use SmallVector's pop_back_val(). llvm-svn: 92734	2010-01-05 16:27:25 +00:00
David Greene	2e6efc441f	Change errs() to dbgs(). llvm-svn: 92613	2010-01-05 01:27:17 +00:00
Chris Lattner	17079fc0fa	split code that doesn't need to be templated out of IRBuilder into a new non-templated IRBuilderBase class. Move that large CreateGlobalString out of line, eliminating the need to #include GlobalVariable.h in IRBuilder.h llvm-svn: 92227	2009-12-28 21:28:46 +00:00
Chris Lattner	9b7d99eb76	The phi translated pointer can be computed when returning a partially cached result instead of stored. This reduces memdep memory usage, and also eliminates a bunch of weakvh's. This speeds up gvn on gcc.c-torture/20001226-1.c from 23.9s to 8.45s (2.8x) on a different machine than earlier. llvm-svn: 91885	2009-12-22 04:25:02 +00:00
Chris Lattner	bf20018423	Add a fastpath to Load GVN to special case when we have exactly one dominating load to avoid even messing around with SSAUpdate at all. In this case (which is very common, we can just use the input value directly). This speeds up GVN time on gcc.c-torture/20001226-1.c from 36.4s to 16.3s, which still isn't great, but substantially better and this is a simple speedup that applies to lots of different cases. llvm-svn: 91851	2009-12-21 23:15:48 +00:00
Chris Lattner	927b0ac4b2	refactor some code out to a new helper method. llvm-svn: 91849	2009-12-21 23:04:33 +00:00
Dan Gohman	57e808628c	Eliminte unnecessary uses of <cstdio>. llvm-svn: 91666	2009-12-18 03:25:51 +00:00
Chris Lattner	45d040bd85	Remove isPod() from DenseMapInfo, splitting it out to its own isPodLike type trait. This is a generally useful type trait for more than just DenseMap, and we really care about whether something acts like a pod, not whether it really is a pod. llvm-svn: 91421	2009-12-15 07:26:43 +00:00
Chris Lattner	9ccc879006	Fix PR5744, a case where we were getting the pointer size instead of the value size. This only manifested when memdep inprecisely returns clobber, which is do to a caching issue in the PR5744 testcase. We can 'efficiently emulate' this by using '-no-aa' llvm-svn: 91004	2009-12-10 00:11:45 +00:00
Chris Lattner	3ddf804f78	allow this to build when the #if 0's are enabled. No functionality change. llvm-svn: 90999	2009-12-10 00:04:46 +00:00
Chris Lattner	ca5f9cb18b	fix hte last remaining known (by me) phi translation bug. When we reanalyze clobbers to forward pieces of large stores to small loads, we need to consider the properly phi translated pointer in the store block. llvm-svn: 90978	2009-12-09 18:21:46 +00:00
Chris Lattner	f8ba1253f1	change GetStoreValueForLoad to use IRBuilder, which is cleaner and implicitly constant folds. llvm-svn: 90977	2009-12-09 18:13:28 +00:00
Chris Lattner	07df9efb35	change AnalyzeLoadFromClobberingMemInst/AnalyzeLoadFromClobberingStore to require the load ty/ptr to be passed in, no functionality change. llvm-svn: 90960	2009-12-09 07:37:07 +00:00
Chris Lattner	0def861ee9	change AnalyzeLoadFromClobberingWrite and clients to pass in type and pointer instead of the load. No functionality change. llvm-svn: 90959	2009-12-09 07:34:10 +00:00
Chris Lattner	0c31547168	change NonLocalDepEntry from being a typedef for an std::pair to be its own small class. No functionality change. llvm-svn: 90956	2009-12-09 07:08:01 +00:00
Chris Lattner	946b58dd90	add some aborts to #if 0's. llvm-svn: 90929	2009-12-09 02:41:54 +00:00
Chris Lattner	972e6d8d00	Switch GVN and memdep to use PHITransAddr, which correctly handles phi translation of complex expressions like &A[i+1]. This has the following benefits: 1. The phi translation logic is all contained in its own class with a strong interface and verification that it is self consistent. 2. The logic is more correct than before. Previously, if intermediate expressions got PHI translated, we'd miss the update and scan for the wrong pointers in predecessor blocks. @phi_trans2 is a testcase for this. 3. We have a lot less code in memdep. We can handle phi translation across blocks of things like @phi_trans3, which is pretty insane :). This patch should fix the miscompiles of 255.vortex, and I tested it with a bootstrap of llvm-gcc, llvm-test and dejagnu of course. llvm-svn: 90926	2009-12-09 01:59:31 +00:00
Chris Lattner	778cb92235	constant fold loads from memcpy's from global constants. This is important because clang lowers nontrivial automatic struct/array inits to memcpy from a global array. llvm-svn: 90698	2009-12-06 05:29:56 +00:00
Chris Lattner	93236ba327	add support for forwarding mem intrinsic values to non-local loads. llvm-svn: 90697	2009-12-06 04:54:31 +00:00
Chris Lattner	42376066eb	Handle forwarding local memsets to loads. For example, we optimize this: short x(short A) { memset(A, 1, sizeof(A)*100); return A[42]; } to 'return 257' instead of doing the load. llvm-svn: 90695	2009-12-06 01:57:02 +00:00
Owen Anderson	0b6e260066	Fix this crasher, and add a FIXME for a missed optimization. llvm-svn: 90408	2009-12-03 03:43:29 +00:00
Owen Anderson	b9878ee6b6	Cleanup/remove some parts of the lifetime region handling code in memdep and GVN, per Chris' comments. Adjust testcases to match. llvm-svn: 90304	2009-12-02 07:35:19 +00:00
Chris Lattner	c468025ac9	factor some code better. llvm-svn: 90299	2009-12-02 06:44:58 +00:00
Chris Lattner	32140312ca	reenable load address insertion in load pre. This allows us to handle cases like this: void test(int N, double* G) { long j; for (j = 1; j < N - 1; j++) G[j+1] = G[j] + G[j+1]; } where G[1] isn't live into the loop. llvm-svn: 90041	2009-11-28 16:08:18 +00:00
Chris Lattner	44da5bd837	Enhance InsertPHITranslatedPointer to be able to return a list of newly inserted instructions. No functionality change until someone starts using it. llvm-svn: 90039	2009-11-28 15:39:14 +00:00
Chris Lattner	cf0b198827	disable value insertion for now, I need to figure out how to inform GVN about the newly inserted values. This fixes PR5631. llvm-svn: 90022	2009-11-27 22:50:07 +00:00
Chris Lattner	2be52e72ae	Rework InsertPHITranslatedPointer to handle the recursive case, this fixes PR5630 and sets the stage for the next phase of goodness (testcase pending). llvm-svn: 90019	2009-11-27 22:05:15 +00:00
Chris Lattner	25be93dfed	teach GVN's load PRE to insert computations of the address in predecessors where it is not available. It's unclear how to get this inserted computation into GVN's scalar availability sets, Owen, help? :) llvm-svn: 89997	2009-11-27 08:25:10 +00:00
Chris Lattner	a9a76ccf56	Fix phi translation in load PRE to agree with the phi translation done by memdep, and reenable gep translation again. llvm-svn: 89992	2009-11-27 06:31:14 +00:00
Dan Gohman	81132465d3	Add an option for running GVN with redundant load processing disabled. llvm-svn: 88742	2009-11-14 02:27:51 +00:00
Jeffrey Yasskin	b40d3f76a0	Fix DenseMap iterator constness. This patch forbids implicit conversion of DenseMap::const_iterator to DenseMap::iterator which was possible because DenseMapIterator inherited (publicly) from DenseMapConstIterator. Conversion the other way around is now allowed as one may expect. The template DenseMapConstIterator is removed and the template parameter IsConst which specifies whether the iterator is constant is added to DenseMapIterator. Actually IsConst parameter is not necessary since the constness can be determined from KeyT but this is not relevant to the fix and can be addressed later. Patch by Victor Zverovich! llvm-svn: 86636	2009-11-10 01:02:17 +00:00
Chris Lattner	a546dcf418	Make sure PRE doesn't split crit edges from indirectbr. llvm-svn: 85692	2009-10-31 22:11:15 +00:00
Evan Cheng	5a6b9c40d6	Add option to createGVNPass to disable PRE. llvm-svn: 85609	2009-10-30 20:12:24 +00:00
Owen Anderson	2b2bd28973	Treat lifetime begin/end markers as allocations/frees respectively for the purposes for GVN/DSE. llvm-svn: 85383	2009-10-28 07:05:35 +00:00
Victor Hernandez	f390e04a47	Rename MallocFreeHelper as MemoryBuiltins llvm-svn: 85286	2009-10-27 20:05:49 +00:00
Victor Hernandez	762195bd01	Rename MallocHelper as MallocFreeHelper, since it now also identifies calls to free() llvm-svn: 85181	2009-10-26 23:58:56 +00:00
Victor Hernandez	8acf2956b8	Remove AllocationInst. Since MallocInst went away, AllocaInst is the only subclass of AllocationInst, so it no longer is necessary. llvm-svn: 84969	2009-10-23 21:09:37 +00:00
Chris Lattner	8ed7bef409	make GVN work better when TD is not around: "In the existing code, if the load and the value to replace it with are of different types and target data is available, it tries to use the target data to coerce the replacement value to the type of the load. Otherwise, it skips all effort to handle the type mismatch and just feeds the wrongly-typed replacement value to replaceAllUsesWith, which triggers an assertion. The patch replaces it with an outer if checking for type mismatch, and an inner if-else that checks whether target data is available and, if not, returns false rather than trying to replace the load." Patch by Kenneth Uildriks! llvm-svn: 84739	2009-10-21 04:11:19 +00:00
Owen Anderson	168ad6985e	Refactor lookup_or_add to contain _MUCH_ less duplicated code. Add support for numbering first class aggregate instructions while we're at it. llvm-svn: 84547	2009-10-19 22:14:22 +00:00
Owen Anderson	1059b5b32d	Simplify some code. llvm-svn: 84533	2009-10-19 21:14:57 +00:00
Devang Patel	92f8619923	Use isVoidTy() llvm-svn: 84118	2009-10-14 17:29:00 +00:00
Chris Lattner	b6c65faa64	switch GVN to use SSAUpdater. Besides removing a lot of complexity from GVN, this also speeds it up, inserts fewer PHI nodes (see the testcase) and allows it to remove more loads (due to fewer PHI nodes standing in the way). llvm-svn: 83746	2009-10-10 23:50:30 +00:00
Chris Lattner	89d2a5c4f3	remove dead code llvm-svn: 83742	2009-10-10 23:04:12 +00:00
Chris Lattner	f30a2b0c86	random tidying llvm-svn: 83701	2009-10-10 06:22:45 +00:00
Chris Lattner	4425660b1f	simplify some code. llvm-svn: 82936	2009-09-27 21:46:50 +00:00
Nick Lewycky	42fb7452df	Instruction::clone does not need to take an LLVMContext&. Remove that and update all the callers. llvm-svn: 82889	2009-09-27 07:38:41 +00:00
Chris Lattner	247053867e	big endian systems shift by bits too, hopefully this will fix the ppc bootstrap problems. llvm-svn: 82464	2009-09-21 17:55:47 +00:00
Chris Lattner	9045f235d2	fix PR5016, a crash I introduced in GVN handing first class arrays and structs, which cannot be bitcast to integers. llvm-svn: 82460	2009-09-21 17:24:04 +00:00
Chris Lattner	4d8af2f1ae	enable non-local analysis and PRE of large store -> little load. This doesn't kick in too much because of phi translation issues, but this can be resolved in the future. llvm-svn: 82447	2009-09-21 06:48:08 +00:00
Chris Lattner	0cdc17eb50	convert an std::pair to an explicit struct. llvm-svn: 82446	2009-09-21 06:30:24 +00:00
Chris Lattner	d28f90897a	move some functions, add a comment. llvm-svn: 82444	2009-09-21 06:24:16 +00:00
Chris Lattner	9d7fb29522	split HandleLoadFromClobberingStore in two pieces: one that does the analysis, one that does the xform. llvm-svn: 82443	2009-09-21 06:22:46 +00:00
Chris Lattner	0a9616d906	Improve GVN to be able to forward substitute a small load from a piece of a large store when both are in the same block. This allows clang to compile the testcase in PR4216 to this code: _test_bitfield: movl 4(%esp), %eax movl %eax, %ecx andl $-65536, %ecx orl $32962, %eax andl $40186, %eax orl %ecx, %eax ret This is not ideal, but is a whole lot better than the code produced by llvm-gcc: _test_bitfield: movw $-32574, %ax orw 4(%esp), %ax andw $-25350, %ax movw %ax, 4(%esp) movw 7(%esp), %cx shlw $8, %cx movzbl 6(%esp), %edx orw %cx, %dx movzwl %dx, %ecx shll $16, %ecx movzwl %ax, %eax orl %ecx, %eax ret and dramatically better than that produced by gcc 4.2: _test_bitfield: pushl %ebx call L3 "L00000000001$pb": L3: popl %ebx movl 8(%esp), %eax leal 0(,%eax,4), %edx sarb $7, %dl movl %eax, %ecx andl $7168, %ecx andl $-7201, %ebx movzbl %dl, %edx andl $1, %edx sall $5, %edx orl %ecx, %ebx orl %edx, %ebx andl $24, %eax andl $-58336, %ebx orl %eax, %ebx orl $32962, %ebx movl %ebx, %eax popl %ebx ret llvm-svn: 82439	2009-09-21 05:57:11 +00:00
Chris Lattner	1eefa9c427	formatting cleanups, no functionality change. llvm-svn: 82426	2009-09-21 02:42:51 +00:00
Chris Lattner	a0aa8fb6a6	Move CoerceAvailableValueToLoadType earlier in GVN.cpp. Hook it up so that nonlocal and partially redundant loads can use it as well. The testcase shows examples of craziness this can handle. This triggers many times in 176.gcc. llvm-svn: 82403	2009-09-20 20:09:34 +00:00
Chris Lattner	7c62d8a1a8	change the interface to CoerceAvailableValueToLoadType to be more generic. llvm-svn: 82402	2009-09-20 19:31:14 +00:00
Chris Lattner	1dd48c34e5	enhance GVN to forward substitute a stored value to a load (and load -> load) when the base pointers must alias but when they are different types. This occurs very very frequently in 176.gcc and other code that uses bitfields a lot. llvm-svn: 82399	2009-09-20 19:03:47 +00:00
Daniel Dunbar	7d6781b0fe	Tabs -> spaces, and remove trailing whitespace. llvm-svn: 82355	2009-09-20 02:20:51 +00:00
Victor Hernandez	5d034499ad	Enhance transform passes so that they apply the same tranforms to malloc calls as to MallocInst. Reviewed by Dan Gohman. llvm-svn: 82300	2009-09-18 22:35:49 +00:00
Dan Gohman	22571485b3	Change PHINode::hasConstantValue to have a DominatorTree argument instead of a bool argument, and to do the dominator check itself. This makes it eaiser to use when DominatorTree information is available. llvm-svn: 80920	2009-09-03 15:34:35 +00:00
Chris Lattner	2dd09dbdf7	eliminate VISIBILITY_HIDDEN from Transforms/Scalar. PR4861 llvm-svn: 80766	2009-09-02 06:11:42 +00:00
Owen Anderson	109ca5a14a	Make this into a static method. llvm-svn: 80170	2009-08-26 22:55:11 +00:00
Owen Anderson	55f1c09e31	Push LLVMContexts through the IntegerType APIs. llvm-svn: 78948	2009-08-13 21:58:54 +00:00
Dan Gohman	ef3ef7f645	Fix GVN's debug output, now that operator<< on Value* doesn't print a trailing newline. llvm-svn: 77719	2009-07-31 20:24:18 +00:00
Owen Anderson	23a204d91b	Move getTrue() and getFalse() to 2.5-like APIs. llvm-svn: 77685	2009-07-31 17:39:07 +00:00
Owen Anderson	b292b8ce70	Move more code back to 2.5 APIs. llvm-svn: 77635	2009-07-30 23:03:37 +00:00
Dan Gohman	1ddf98ad8e	Convert a few more things to use raw_ostream. llvm-svn: 77039	2009-07-25 01:43:01 +00:00
Daniel Dunbar	0dd5e1ed39	More migration to raw_ostream, the water has dried up around the iostream hole. - Some clients which used DOUT have moved to DEBUG. We are deprecating the "magic" DOUT behavior which avoided calling printing functions when the statement was disabled. In addition to being unnecessary magic, it had the downside of leaving code in -Asserts builds, and of hiding potentially unnecessary computations. llvm-svn: 77019	2009-07-25 00:23:56 +00:00
Owen Anderson	47db941fd3	Get rid of the Pass+Context magic. llvm-svn: 76702	2009-07-22 00:24:57 +00:00
Owen Anderson	c37bc69e91	Rename getConstantInt{True\|False} to get{True\|False} at Chris' behest. llvm-svn: 76598	2009-07-21 18:03:38 +00:00
Torok Edwin	fbcc663cbf	llvm_unreachable->llvm_unreachable(0), LLVM_UNREACHABLE->llvm_unreachable. This adds location info for all llvm_unreachable calls (which is a macro now) in !NDEBUG builds. In NDEBUG builds location info and the message is off (it only prints "UREACHABLE executed"). llvm-svn: 75640	2009-07-14 16:55:14 +00:00
Torok Edwin	56d0659726	assert(0) -> LLVM_UNREACHABLE. Make llvm_unreachable take an optional string, thus moving the cerr<< out of line. LLVM_UNREACHABLE is now a simple wrapper that makes the message go away for NDEBUG builds. llvm-svn: 75379	2009-07-11 20:10:48 +00:00
Owen Anderson	1e5f00e7a7	This started as a small change, I swear. Unfortunately, lots of things call the [I\|F]CmpInst constructors. Who knew!? llvm-svn: 75200	2009-07-09 23:48:35 +00:00
Nick Lewycky	a21d3daadc	Remove the vicmp and vfcmp instructions. Because we never had a release with these instructions, no autoupgrade or backwards compatibility support is provided. llvm-svn: 74991	2009-07-08 03:04:38 +00:00
Owen Anderson	b5618da226	Convert the first batch of passes to use LLVMContext. llvm-svn: 74748	2009-07-03 00:17:18 +00:00
Dale Johannesen	81b6463ed0	This fixes a bug introduced in 72661, which can move loads back past a check that the load address is valid, see new testcase. The test that went in with 72661 has exactly this case, except that the conditional it's moving past is checking something else; I've settled for changing that test to reference a global, not a pointer. It may be possible to scan all the tests you pass and make sure none of them are checking any component of the address, but it's not trivial and I'm not trying to do that here. llvm-svn: 73632	2009-06-17 20:48:23 +00:00
Torok Edwin	ba93ea7632	Add debug message about non-local loads being clobbered. llvm-svn: 73625	2009-06-17 18:48:18 +00:00
Dan Gohman	a8f8a85388	Make the EnableLoadPRE variable static. llvm-svn: 73398	2009-06-15 18:30:15 +00:00
Dan Gohman	a5b9645c4b	Split the Add, Sub, and Mul instruction opcodes into separate integer and floating-point opcodes, introducing FAdd, FSub, and FMul. For now, the AsmParser, BitcodeReader, and IRBuilder all preserve backwards compatability, and the Core LLVM APIs preserve backwards compatibility for IR producers. Most front-ends won't need to change immediately. This implements the first step of the plan outlined here: http://nondot.org/sabre/LLVMNotes/IntegerOverflow.txt llvm-svn: 72897	2009-06-04 22:49:04 +00:00
Owen Anderson	cc0c75c74d	Be more aggressive in doing LoadPRE by tracing backwards when a block only has a single predecessor. Patch by Jakub Staszak. llvm-svn: 72661	2009-05-31 09:03:40 +00:00
Bill Wendling	006459ecd4	Enable GVN Load PRE. llvm-svn: 72589	2009-05-29 20:38:16 +00:00
Torok Edwin	0b0ddb21fe	just show the instruction, its not that slow. llvm-svn: 72577	2009-05-29 16:58:36 +00:00
Torok Edwin	6a94624a1b	for instructions with void type we have no choice but print the instruction as is, otherwise we get a <badref>. llvm-svn: 72567	2009-05-29 10:28:44 +00:00
Torok Edwin	72070282eb	Add a DEBUG() output to GVN that prints the instruction clobbering a load. This is useful when trying to figure out why GVN didn't eliminate redundant loads. llvm-svn: 72565	2009-05-29 09:46:03 +00:00
Owen Anderson	04cfdd38a2	Fix an issue where phiMap was not being updated properly when doing load PRE. Diagnosis and patch thanks to Jakub Staszak. llvm-svn: 72562	2009-05-29 05:37:54 +00:00
Duncan Sands	1efabaaa2a	Allow readonly functions to unwind exceptions. Teach the optimizers about this. For example, a readonly function with no uses cannot be removed unless it is also marked nounwind. llvm-svn: 71071	2009-05-06 06:49:50 +00:00
Owen Anderson	98f912bf13	Reapply r68211, with the miscompilations it caused fixed. llvm-svn: 68262	2009-04-01 23:53:49 +00:00
Dan Gohman	c4971721ea	Revert r68172. It caused regressions in Applications/Burg/burg Applications/ClamAV/clamscan and many other tests. llvm-svn: 68211	2009-04-01 16:37:47 +00:00
Owen Anderson	ff5961b46c	Enhance GVN to propagate simple conditionals. This fixes PR3921. llvm-svn: 68172	2009-04-01 01:20:45 +00:00
John Criswell	073e4d16c5	Do not attempt to do parial redundancy elimination on void values. Also fixed a punctuation error in the header comment. This fixes PR3775. llvm-svn: 66542	2009-03-10 15:04:53 +00:00
Devang Patel	e8c6d3102d	Skip DbgInfoIntrinsic. llvm-svn: 66244	2009-03-06 02:59:27 +00:00
Chris Lattner	096f44de61	improve naming of values in GVN, patch by Jay Foad! llvm-svn: 64363	2009-02-12 07:00:35 +00:00
Bill Wendling	415515077b	Mistakenly turned this on. llvm-svn: 64065	2009-02-08 01:32:00 +00:00
Bill Wendling	5469ec1072	Revert r63999. It was breaking self-hosting builds. llvm-svn: 64062	2009-02-08 00:58:05 +00:00
Chris Lattner	73d7fe5a34	improve compatibility with cygwin, patch by Jay Foad! llvm-svn: 62535	2009-01-19 22:00:18 +00:00
Owen Anderson	164274eeb1	Don't forget to remove phi nodes from the value numbering table after we collapse them. llvm-svn: 61358	2008-12-23 00:49:51 +00:00
Bill Wendling	456e885382	Comment clean-ups. No functionality change. llvm-svn: 61354	2008-12-22 22:32:22 +00:00
Bill Wendling	e7f08e7250	Check that the instruction isn't in the value numbering scope. llvm-svn: 61353	2008-12-22 22:28:56 +00:00
Bill Wendling	86f01cb9f6	Simplification: Negate the operator== method instead of implementing a full operator!= method. llvm-svn: 61352	2008-12-22 22:16:31 +00:00
Bill Wendling	3c793441cb	Add verification that deleted instruction isn't hiding in the PHI map. llvm-svn: 61350	2008-12-22 22:14:07 +00:00
Bill Wendling	ebb6a543fa	Verify removed in a few more places. llvm-svn: 61349	2008-12-22 21:57:30 +00:00
Bill Wendling	6b18a3994b	Add verification functions to GVN which check to see that an instruction was truely deleted. These will be expanded with further checks of all of the data structures. llvm-svn: 61347	2008-12-22 21:36:08 +00:00
Bill Wendling	070de29fcf	Didn't mean to commit this. llvm-svn: 61222	2008-12-18 22:19:50 +00:00
Bill Wendling	4c13e77d49	Re-XFAIL this test until debug stuff settles down. llvm-svn: 61219	2008-12-18 22:13:31 +00:00
Chris Lattner	b6372933b5	reapply this hunk from Bill's reversion in r61169, it is conservative and safe and orthogonal from turning off load pre. llvm-svn: 61177	2008-12-18 00:51:32 +00:00
Bill Wendling	be4fb8a25f	Temporarily revert r61027. It was causing a bootstrap failure in "release" mode with everyone's favorite error messages: Comparing stages 2 and 3 warning: ./cc1-checksum.o differs warning: ./cc1plus-checksum.o differs Bootstrap comparison failure! ./c-decl.o differs ./cp/decl.o differs ./df-core.o differs ./gcc.o differs ./i386.o differs ./stor-layout.o differs ./tree-pretty-print.o differs ./tree.o differs make[2]: * [compare] Error 1 make[1]: * [stage3-bubble] Error 2 See PR3227. llvm-svn: 61169	2008-12-17 23:31:20 +00:00
Chris Lattner	0c68ae0603	Enable Load PRE. This teaches GVN to push partially redundant loads up the CFG when there is exactly one predecessor where the load is not available. This is designed to not increase code size but still eliminate partially redundant loads. This fires 1765 times on 403.gcc even though it doesn't do critical edge splitting yet (the most common reason for it to fail). llvm-svn: 61027	2008-12-15 05:28:29 +00:00
Owen Anderson	03aacbae90	Ifdef out some code that I didn't mean to enable by default yet. llvm-svn: 61024	2008-12-15 03:52:17 +00:00
Chris Lattner	69131fd872	make GVN try to rename inputs to the resultant replaced values, which cleans up the generated code a bit. This should have the added benefit of not randomly renaming functions/globals like my previous patch did. :) llvm-svn: 61023	2008-12-15 03:46:38 +00:00
Owen Anderson	bfe133e4ac	Add support for slow-path GVN with full phi construction for scalars. This is disabled for now, as it actually pessimizes code in the abscence of phi translation for load elimination. This slow down GVN a bit, by about 2% on 403.gcc. llvm-svn: 61021	2008-12-15 02:03:00 +00:00
Owen Anderson	e34c2399de	Generalize GVN's phi construciton routine to work for things other than loads. llvm-svn: 61009	2008-12-14 19:10:35 +00:00
Bill Wendling	293b9181e5	Temporarily revert r60973. It's inexplicably causing a failure when self-hosting LLVM: llvm[2]: Linking Release executable opt (without symbols) ... Undefined symbols: "llvm::APFloat::IEEEsingle", referenced from: __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(Constants.o) __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(AsmWriter.o) __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(ConstantFold.o) "llvm::APFloat::IEEEdouble", referenced from: __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(Constants.o) __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(AsmWriter.o) __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(ConstantFold.o) ld: symbol(s) not found This is in release mode. To replicate, compile llvm and llvm-gcc in optimized mode. Then build llvm, in optimized mode, with the newly created compiler. llvm-svn: 60977	2008-12-13 09:28:44 +00:00
Chris Lattner	1e29f7c97d	make RLE preserve the name of the load that it replaces. This is just a pretification of the IR. llvm-svn: 60973	2008-12-13 07:22:47 +00:00
Chris Lattner	fa9f99aa12	Teach GVN to invalidate some memdep information when it does an RAUW of a pointer. This allows is to catch more equivalencies. For example, the type_lists_compatible_p function used to require two iterations of the gvn pass (!) to delete its 18 redundant loads because the first pass would CSE all the addressing computation cruft, which would unblock the second memdep/gvn passes from recognizing them. This change allows memdep/gvn to catch all 18 when run just once on the function (as is typical :) instead of just 3. On all of 403.gcc, this bumps up the # reundandancies found from: 63 gvn - Number of instructions PRE'd 153991 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted to: 63 gvn - Number of instructions PRE'd 154137 gvn - Number of instructions deleted 50185 gvn - Number of loads deleted +120 loads deleted isn't bad. llvm-svn: 60799	2008-12-09 22:06:23 +00:00
Chris Lattner	254314e6bc	rename getNonLocalDependency -> getNonLocalCallDependency, and remove pointer stuff from it, simplifying the code a bit. llvm-svn: 60783	2008-12-09 19:38:05 +00:00
Chris Lattner	b6fc4b8d92	Switch GVN::processNonLocalLoad to using the new MemDep::getNonLocalPointerDependency method. There are some open issues with this (missed optimizations) and plenty of future work, but this does allow GVN to eliminate slightly more loads (49246 vs 49033). Switching over now allows simplification of the other code path in memdep. llvm-svn: 60780	2008-12-09 19:25:07 +00:00
Chris Lattner	0a5a8d54a9	random cleanups, no functionality change. llvm-svn: 60779	2008-12-09 19:21:47 +00:00
Chris Lattner	0e3d6337c6	Make a few major changes to memdep and its clients: 1. Merge the 'None' result into 'Normal', making loads and stores return their dependencies on allocations as Normal. 2. Split the 'Normal' result into 'Clobber' and 'Def' to distinguish between the cases when memdep knows the value is produced from when we just know if may be changed. 3. Move some of the logic for determining whether readonly calls are CSEs into memdep instead of it being in GVN. This still leaves verification that the arguments are hte same to GVN to let it know about value equivalences in different contexts. 4. Change memdep's call/call dependency analysis to use getModRefInfo(CallSite,CallSite) instead of doing something very weak. This only really matters for things like DSA, but someday maybe we'll have some other decent context sensitive analyses :) 5. This reimplements the guts of memdep to handle the new results. 6. This simplifies GVN significantly: a) readonly call CSE is slightly simpler b) I eliminated the "getDependencyFrom" chaining for load elimination and load CSE doesn't have to worry about volatile (they are always clobbers) anymore. c) GVN no longer does any 'lastLoad' caching, leaving it to memdep. 7. The logic in DSE is simplified a bit and sped up. A potentially unsafe case was eliminated. llvm-svn: 60607	2008-12-05 21:04:20 +00:00
Anton Korobeynikov	24600bf05a	Revert invalid r60393. It causes llvm-gcc bootstrap fails in release builds. See PR3160 for details llvm-svn: 60604	2008-12-05 19:38:49 +00:00
Chris Lattner	c100828026	Fix test/Transforms/GVN/pre-load.ll llvm-svn: 60594	2008-12-05 17:04:12 +00:00
Chris Lattner	d2a653af0c	Make IsValueFullyAvailableInBlock safe. llvm-svn: 60588	2008-12-05 07:49:08 +00:00
Chris Lattner	1db9bbe802	Implement PRE of loads in the GVN pass with a pretty cheap and straight-forward implementation. This does not require any extra alias analysis queries beyond what we already do for non-local loads. Some programs really really like load PRE. For example, SPASS triggers this ~1000 times, ~300 times in 255.vortex, and ~1500 times on 403.gcc. The biggest limitation to the implementation is that it does not split critical edges. This is a huge killer on many programs and should be addressed after the initial patch is enabled by default. The implementation of this should incidentally speed up rejection of non-local loads because it avoids creating the repl densemap in cases when it won't be used for fully redundant loads. This is currently disabled by default. Before I turn this on, I need to fix a couple of miscompilations in the testsuite, look at compile time performance numbers, and look at perf impact. This is pretty close to ready though. llvm-svn: 60408	2008-12-02 08:16:11 +00:00
Owen Anderson	d930420ccf	Fix an issue that Chris noticed, where local PRE was not properly instantiating a new value numbering set after splitting a critical edge. This increases the number of instances of PRE on 403.gcc from ~60 to ~570. llvm-svn: 60393	2008-12-02 04:09:22 +00:00
Chris Lattner	6f5bf6a718	Rename some variables, only increment BI once at the start of the loop instead of throughout it. llvm-svn: 60339	2008-12-01 07:35:54 +00:00
Chris Lattner	f00aae4968	pull the predMap densemap out of the inner loop of performPRE, so that it isn't reallocated all the time. This is a tiny speedup for GVN: 3.90->3.88s llvm-svn: 60338	2008-12-01 07:29:03 +00:00
Chris Lattner	9ce8995d24	Make GVN be more intelligent about redundant load elimination: when finding dependent load/stores, realize that they are the same if aliasing claims must alias instead of relying on the pointers to be exactly equal. This makes load elimination more aggressive. For example, on 403.gcc, we had: < 68 gvn - Number of instructions PRE'd < 152718 gvn - Number of instructions deleted < 49699 gvn - Number of loads deleted < 6153 memdep - Number of dirty cached non-local responses < 169336 memdep - Number of fully cached non-local responses < 162428 memdep - Number of uncached non-local responses now we have: > 64 gvn - Number of instructions PRE'd > 153623 gvn - Number of instructions deleted > 49856 gvn - Number of loads deleted > 5022 memdep - Number of dirty cached non-local responses > 159030 memdep - Number of fully cached non-local responses > 162443 memdep - Number of uncached non-local responses That's an extra 157 loads deleted and extra 905 other instructions nuked. This slows down GVN very slightly, from 3.91 to 3.96s. llvm-svn: 60314	2008-12-01 01:31:36 +00:00
Chris Lattner	7e61dafc95	Reimplement the non-local dependency data structure in terms of a sorted vector instead of a densemap. This shrinks the memory usage of this thing substantially (the high water mark) as well as making operations like scanning it faster. This speeds up memdep slightly, gvn goes from 3.9376 to 3.9118s on 403.gcc This also splits out the statistics for the cached non-local case to differentiate between the dirty and clean cached case. Here's the stats for 403.gcc: 6153 memdep - Number of dirty cached non-local responses 169336 memdep - Number of fully cached non-local responses 162428 memdep - Number of uncached non-local responses yay for caching :) llvm-svn: 60313	2008-12-01 01:15:42 +00:00
Chris Lattner	8541edec44	Cache analyses in ivars and add some useful DEBUG output. This speeds up GVN from 4.0386s to 3.9376s. llvm-svn: 60310	2008-12-01 00:40:32 +00:00
Chris Lattner	80c7d81e81	improve indentation, do cheap checks before expensive ones, remove some fixme's. This speeds up GVN very slightly on 403.gcc (4.06->4.03s) llvm-svn: 60309	2008-11-30 23:39:23 +00:00
Chris Lattner	3ff6d01586	Fix a fixme by making memdep's handling of allocations more logical. If we see that a load depends on the allocation of its memory with no intervening stores, we now return a 'None' depedency instead of "Normal". This tweaks GVN to do its optimization with the new result. llvm-svn: 60267	2008-11-30 01:39:32 +00:00
Chris Lattner	1c6b62eb4d	Change MemDep::getNonLocalDependency to return its results as a smallvector instead of a DenseMap. This speeds up GVN by 5% on 403.gcc. llvm-svn: 60255	2008-11-29 21:33:22 +00:00
Chris Lattner	f280b0c729	reimplement getNonLocalDependency with a simpler worklist formulation that is faster and doesn't require nonLazyHelper. Much less code. llvm-svn: 60253	2008-11-29 21:22:42 +00:00
Chris Lattner	51ba8d0630	Split getDependency into getDependency and getDependencyFrom, the former does caching, the later doesn't. This dramatically simplifies the logic in getDependency and getDependencyFrom. llvm-svn: 60234	2008-11-29 03:47:00 +00:00
Chris Lattner	7f9c8a0f05	Introduce and use a new MemDepResult class to hold the results of a memdep query. This makes it crystal clear what cases can escape from MemDep that the clients have to handle. This also gives the clients a nice simplified interface to it that is easy to poke at. This patch also makes DepResultTy and MemoryDependenceAnalysis::DepType private, yay. llvm-svn: 60231	2008-11-29 02:29:27 +00:00
Chris Lattner	de04e1173a	Reimplement the internal abstraction used by MemDep in terms of a pointer/int pair instead of a manually bitmangled pointer. This forces clients to think a little more about checking the appropriate pieces and will be useful for internal implementation improvements later. I'm not particularly happy with this. After going through this I don't think that the clients of memdep should be exposed to the internal type at all. I'll fix this in a subsequent commit. This has no functionality change. llvm-svn: 60230	2008-11-29 01:43:36 +00:00
Nuno Lopes	e3127f3f80	fix memleak by cleaning the global sets on pass exit llvm-svn: 57353	2008-10-10 16:25:50 +00:00
Duncan Sands	26ff6f9c54	Add <cstdio> include where needed by gcc-4.4. Patch by Samuel Tardieu. llvm-svn: 57291	2008-10-08 07:23:46 +00:00
Duncan Sands	d65a4daeea	Factorize code: remove variants of "strip off pointer bitcasts and GEP's", and centralize the logic in Value::getUnderlyingObject. The difference with stripPointerCasts is that stripPointerCasts only strips GEPs if all indices are zero, while getUnderlyingObject strips GEPs no matter what the indices are. llvm-svn: 56922	2008-10-01 15:25:41 +00:00
Dan Gohman	a79db30d28	Tidy up several unbeseeming casts from pointer to intptr_t. llvm-svn: 55779	2008-09-04 17:05:41 +00:00
Owen Anderson	2fbfb70530	Fix a bug that prevented PRE from applying in some cases. llvm-svn: 55744	2008-09-03 23:06:07 +00:00
Owen Anderson	b39e0decf8	Put a heuristic in place to prevent GVN from falling into bad cases with massively complicated CFGs. This speeds up a particular testcase from 12+ hours to 5 seconds with little perceptible loss of quality. llvm-svn: 55391	2008-08-26 22:07:42 +00:00
Chris Lattner	5fc8ab6d18	consolidate DenseMapInfo implementations, and add one for std::pair. Patch contributed by m-s. llvm-svn: 55167	2008-08-22 05:08:25 +00:00
Duncan Sands	2c741145a7	Supress a gcc-4.3 warning. llvm-svn: 53771	2008-07-18 21:06:02 +00:00
Owen Anderson	04a6e0ba8c	Make PRE actually handle critical edges (by splitting them). Confirmed that bootstrap passes with this change. llvm-svn: 53762	2008-07-18 18:03:38 +00:00

... 4 5 6 7 8 ...

661 Commits