llvm-project

Commit Graph

Author	SHA1	Message	Date
Nick Lewycky	0a1f25b927	Detabify. llvm-svn: 90085	2009-11-29 18:10:39 +00:00
Nick Lewycky	218a3393f4	Teach memdep to look for memory use intrinsics during dependency queries. Fixes PR5574. llvm-svn: 90045	2009-11-28 21:27:49 +00:00
Chris Lattner	44da5bd837	Enhance InsertPHITranslatedPointer to be able to return a list of newly inserted instructions. No functionality change until someone starts using it. llvm-svn: 90039	2009-11-28 15:39:14 +00:00
Chris Lattner	d5bd369a0f	enable code to handle un-phi-translatable cases more aggressively: if we don't have an address expression available in a predecessor, then model this as the value being clobbered at the end of the pred block instead of being modeled as a complete phi translation failure. This is important for PRE of loads because we want to see that the load is available in all but this predecessor, and complete phi translation failure results in not getting any information about predecessors. This doesn't do anything until I renable code insertion since PRE now sees that it is available in all but one predecessors, but can't insert the addressing in the predecessor that is missing it to eliminate the redundancy. llvm-svn: 90037	2009-11-28 14:54:10 +00:00
Chris Lattner	2be52e72ae	Rework InsertPHITranslatedPointer to handle the recursive case, this fixes PR5630 and sets the stage for the next phase of goodness (testcase pending). llvm-svn: 90019	2009-11-27 22:05:15 +00:00
Chris Lattner	4ee17e1482	recursively phi translate bitcast operands too, for consistency. llvm-svn: 90016	2009-11-27 20:25:30 +00:00
Chris Lattner	2f0354ecf0	add support for recursive phi translation and phi translation of add with immediate. This allows us to optimize this function: void test(int N, double* G) { long j; G[1] = 1; for (j = 1; j < N - 1; j++) G[j+1] = G[j] + G[j+1]; } to only do one load every iteration of the loop. llvm-svn: 90013	2009-11-27 19:11:31 +00:00
Chris Lattner	6d294de548	add comment. llvm-svn: 90002	2009-11-27 08:40:14 +00:00
Chris Lattner	ac323297e0	reduce nesting, no functionality change. llvm-svn: 90001	2009-11-27 08:37:22 +00:00
Chris Lattner	25be93dfed	teach GVN's load PRE to insert computations of the address in predecessors where it is not available. It's unclear how to get this inserted computation into GVN's scalar availability sets, Owen, help? :) llvm-svn: 89997	2009-11-27 08:25:10 +00:00
Chris Lattner	a9a76ccf56	Fix phi translation in load PRE to agree with the phi translation done by memdep, and reenable gep translation again. llvm-svn: 89992	2009-11-27 06:31:14 +00:00
Chris Lattner	b018bda665	redisable this, my bootstrap worked because it wasn't an optimized build, whoops. llvm-svn: 89991	2009-11-27 05:53:01 +00:00
Chris Lattner	fb8a718fc3	try again. llvm-svn: 89990	2009-11-27 05:19:56 +00:00
Chris Lattner	14444f5c1a	this is causing buildbot failures, disable for now. llvm-svn: 89985	2009-11-27 01:52:22 +00:00
Chris Lattner	5030c6ab21	teach phi translation of GEPs to simplify geps like 'gep x, 0'. This allows us to compile the example from PR5313 into: LBB1_2: ## %bb incl %ecx movb %al, (%rsi) movslq %ecx, %rax movb (%rdi,%rax), %al testb %al, %al jne LBB1_2 instead of: LBB1_2: ## %bb movslq %eax, %rcx incl %eax movb (%rdi,%rcx), %cl movb %cl, (%rsi) movslq %eax, %rcx cmpb $0, (%rdi,%rcx) jne LBB1_2 llvm-svn: 89981	2009-11-27 00:34:38 +00:00
Chris Lattner	4c88e814b8	teach memdep to do trivial PHI translation of GEPs. More to come. llvm-svn: 89979	2009-11-27 00:07:37 +00:00
Chris Lattner	9bd2136ca3	Teach memdep to phi translate bitcasts. This allows us to compile the example in GCC PR16799 to: LBB1_2: ## %bb1 movl %eax, %eax subq %rax, %rdi movq %rdi, (%rcx) movl (%rdi), %eax testl %eax, %eax je LBB1_2 instead of: LBB1_2: ## %bb1 movl (%rdi), %ecx subq %rcx, %rdi movq %rdi, (%rax) cmpl $0, (%rdi) je LBB1_2 llvm-svn: 89978	2009-11-26 23:41:07 +00:00
Chris Lattner	c49f5ac7d8	factor some code out into some helper functions. llvm-svn: 89975	2009-11-26 23:18:49 +00:00
Nick Lewycky	663e0a06b0	Remove dead code. While there, also turn a few 'T* ' into 'T *' to match the rest of the file. llvm-svn: 89577	2009-11-22 02:38:11 +00:00
Owen Anderson	2b2bd28973	Treat lifetime begin/end markers as allocations/frees respectively for the purposes for GVN/DSE. llvm-svn: 85383	2009-10-28 07:05:35 +00:00
Owen Anderson	fc16e5a98f	Be more careful about invariance reasoning on "store" queries. Stores still need to depend on Ref and ModRef calls within the invariant region. llvm-svn: 85380	2009-10-28 06:30:52 +00:00
Owen Anderson	d0e86d57c1	Add trivial support for the invariance intrinsics to memdep. This logic is purely local for now. llvm-svn: 85378	2009-10-28 06:18:42 +00:00
Victor Hernandez	f390e04a47	Rename MallocFreeHelper as MemoryBuiltins llvm-svn: 85286	2009-10-27 20:05:49 +00:00
Victor Hernandez	762195bd01	Rename MallocHelper as MallocFreeHelper, since it now also identifies calls to free() llvm-svn: 85181	2009-10-26 23:58:56 +00:00
Victor Hernandez	de5ad42aa1	Remove FreeInst. Remove LowerAllocations pass. Update some more passes to treate free calls just like they were treating FreeInst. llvm-svn: 85176	2009-10-26 23:43:48 +00:00
Victor Hernandez	e297149e26	Auto-upgrade free instructions to calls to the builtin free function. Update all analysis passes and transforms to treat free calls just like FreeInst. Remove RaiseAllocations and all its tests since FreeInst no longer needs to be raised. llvm-svn: 84987	2009-10-24 04:23:03 +00:00
Victor Hernandez	8acf2956b8	Remove AllocationInst. Since MallocInst went away, AllocaInst is the only subclass of AllocationInst, so it no longer is necessary. llvm-svn: 84969	2009-10-23 21:09:37 +00:00
Victor Hernandez	70e8505eb1	Memory dependence analysis was incorrectly stopping to scan for stores to a pointer at bitcast uses of a malloc call. It should continue scanning until the malloc call, and this patch fixes that. llvm-svn: 83931	2009-10-13 01:42:53 +00:00
Chris Lattner	7e6d56ebc5	Revert r82404, it is causing a bootstrap miscompile. This is very very scary, as it indicates a lurking bug. yay. llvm-svn: 82411	2009-09-20 22:44:26 +00:00
Chris Lattner	eea16a168a	improve memdep to eliminate bitcasts (and aliases, and noop geps) early for the stated reasons: this allows it to find more equivalences and depend less on code layout. llvm-svn: 82404	2009-09-20 21:00:18 +00:00
Victor Hernandez	537d8d99be	Enhance analysis passes so that they apply the same analysis to malloc calls as to MallocInst. Reviewed by Eli Friedman. llvm-svn: 82281	2009-09-18 21:34:51 +00:00
Dan Gohman	1ee6057b21	Make TargetData optional in MemoryDependenceAnalysis. llvm-svn: 77727	2009-07-31 20:53:12 +00:00
Dan Gohman	f3ee7eaac3	Remove an unnecessary header. llvm-svn: 77725	2009-07-31 20:47:45 +00:00
Chris Lattner	370aadabfc	factor the 'optimized sort' code out into a static helper function and use it from one more place. Patch by Jakub Staszak! llvm-svn: 75478	2009-07-13 17:20:05 +00:00
Chris Lattner	2f0c1c44d5	Move the re-sort of invalidated NonLocalPointerDeps cache earlier so that all code paths get it. PR4256 was about a case where the phi translation loop would find all preds in the Visited cache, so it could get by without re-sorting the NonLocalPointerDeps cache. Fix this by resorting it earlier, there is no reason not to do this. This patch inspired by Jakub Staszak's patch. llvm-svn: 75476	2009-07-13 17:14:23 +00:00
Chris Lattner	02274a7171	make memdep use the getModRefInfo method for stores instead of the low-level alias() method, allowing it to reason more aggressively about pointers into constant memory. PR4189 llvm-svn: 72403	2009-05-25 21:28:56 +00:00
Chris Lattner	8eda11bd9d	now that you can put a PointerIntPair in a SmallPtrSet, remove some hackish workarounds from memdep llvm-svn: 67971	2009-03-29 00:24:04 +00:00
Dale Johannesen	f61c8e81bd	Debug intriniscs should be skipped when looking for a dependency, not terminate the search. llvm-svn: 66709	2009-03-11 21:13:01 +00:00
Owen Anderson	f9a9cf96a1	Ignore debug intrinsics when computing dependences. llvm-svn: 66399	2009-03-09 05:12:38 +00:00
Zhou Sheng	c8e5085cd3	Remove this as dbginfo intrinsics has been defined as IntrNoMem. llvm-svn: 66256	2009-03-06 06:05:01 +00:00
Zhou Sheng	abe4192442	Ignore the debug info intrinsics when looking for dependency through basic block. llvm-svn: 66119	2009-03-05 01:45:43 +00:00
Chris Lattner	3f4591c89f	fix two more cases where we could let the NLPDI cache get unsorted. With this, sqlite3 now passes. llvm-svn: 62839	2009-01-23 07:12:16 +00:00
Chris Lattner	e3ea48c71e	Unconditionally reset 'cache' to zero, even if we don't need to resort it. This avoids using a dangling pointer. Reset NumSortedEntries after restoring Cache to avoid extraneous sorts. This fixes the reduced sqlite3 testcase, but apparently not the whole app. llvm-svn: 62838	2009-01-23 06:48:41 +00:00
Chris Lattner	706d40e662	a minor tweak to my previous patch, handle the invalidation case when there are multiple iterations of the loop. This fixes PR3375. llvm-svn: 62822	2009-01-23 00:27:03 +00:00
Chris Lattner	f09619d533	Fix PR3358, a really nasty bug where recursive phi translated analyses could be run without the caches properly sorted. This can fix all sorts of weirdness. Many thanks to Bill for coming up with the 'issorted' verification idea. llvm-svn: 62757	2009-01-22 07:04:01 +00:00
Chris Lattner	8b4be37275	fix PR3217: fully cached queries need to be verified against the visited set before they are used. If used, their blocks need to be added to the visited set so that subsequent queries don't use conflicting pointer values in the cache result blocks. llvm-svn: 61080	2008-12-16 07:10:09 +00:00
Chris Lattner	7ed5ccc517	if we have a phi translation failure of the start block, return just a clobber of the start block, not other random stuff as well. llvm-svn: 61026	2008-12-15 04:58:29 +00:00
Chris Lattner	ff9f3dba12	Implement initial support for PHI translation in memdep. This means that memdep keeps track of how PHIs affect the pointer in dep queries, which allows it to eliminate the load in cases like rle-phi-translate.ll, which basically end up being: BB1: X = load P br BB3 BB2: Y = load Q br BB3 BB3: R = phi [P] [Q] load R turning "load R" into a phi of X/Y. In addition to additional exposed opportunities, this makes memdep safe in many cases that it wasn't before (which is required for load PRE) and also makes it substantially more efficient. For example, consider: bb1: // has many predecessors. P = some_operator() load P In this example, previously memdep would scan all the predecessors of BB1 to see if they had something that would mustalias P. In some cases (e.g. test/Transforms/GVN/rle-must-alias.ll) it would actually find them and end up eliminating something. In many other cases though, it would scan and not find anything useful. MemDep now stops at a block if the pointer is defined in that block and cannot be phi translated to predecessors. This causes it to miss the (rare) cases like rle-must-alias.ll, but makes it faster by not scanning tons of stuff that is unlikely to be useful. For example, this speeds up GVN as a whole from 3.928s to 2.448s (60%)!. IMO, scalar GVN should be enhanced to simplify the rle-must-alias pointer base anyway, which would allow the loads to be eliminated. In the future, this should be enhanced to phi translate through geps and bitcasts as well (as indicated by FIXMEs) making memdep even more powerful. llvm-svn: 61022	2008-12-15 03:35:32 +00:00
Duncan Sands	c52b616ccf	Don't dereference the end() iterator. This was causing a bunch of failures when running "make ENABLE_EXPENSIVE_CHECKS=1 check". llvm-svn: 60832	2008-12-10 09:38:36 +00:00
Chris Lattner	0318b56f0e	loosen up an assertion that isn't valid when called from invalidateCachedPointerInfo. Thanks to Bill for sending me a testcase. llvm-svn: 60805	2008-12-09 22:45:32 +00:00
Chris Lattner	fa9f99aa12	Teach GVN to invalidate some memdep information when it does an RAUW of a pointer. This allows is to catch more equivalencies. For example, the type_lists_compatible_p function used to require two iterations of the gvn pass (!) to delete its 18 redundant loads because the first pass would CSE all the addressing computation cruft, which would unblock the second memdep/gvn passes from recognizing them. This change allows memdep/gvn to catch all 18 when run just once on the function (as is typical :) instead of just 3. On all of 403.gcc, this bumps up the # reundandancies found from: 63 gvn - Number of instructions PRE'd 153991 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted to: 63 gvn - Number of instructions PRE'd 154137 gvn - Number of instructions deleted 50185 gvn - Number of loads deleted +120 loads deleted isn't bad. llvm-svn: 60799	2008-12-09 22:06:23 +00:00
Chris Lattner	702e46ed54	Teach BasicAA::getModRefInfo(CallSite, CallSite) some tricks based on readnone/readonly functions. Teach memdep to look past readonly calls when analyzing deps for a readonly call. This allows elimination of a few more calls from 403.gcc: before: 63 gvn - Number of instructions PRE'd 153986 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted after: 63 gvn - Number of instructions PRE'd 153991 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted 5 calls isn't much, but this adds plumbing for the next change. llvm-svn: 60794	2008-12-09 21:19:42 +00:00
Chris Lattner	41efb68c44	Fix a fixme: allow memdep to see past read-only calls when doing load dependence queries. This allows GVN to eliminate a few more instructions on 403.gcc: 152598 gvn - Number of instructions deleted 49240 gvn - Number of loads deleted after: 153986 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted llvm-svn: 60786	2008-12-09 19:47:40 +00:00
Chris Lattner	254314e6bc	rename getNonLocalDependency -> getNonLocalCallDependency, and remove pointer stuff from it, simplifying the code a bit. llvm-svn: 60783	2008-12-09 19:38:05 +00:00
Chris Lattner	4f10733cf3	fix typos gabor noticed llvm-svn: 60754	2008-12-09 08:38:36 +00:00
Chris Lattner	75510d8d5c	restructure the top level non-local ptr dep query to handle the first block of a query specially. This makes the "complete query caching" subsystem more effective, avoiding predecessor queries. This speeds up GVN another 4%. llvm-svn: 60752	2008-12-09 07:52:59 +00:00
Chris Lattner	f903fe1df0	rename getNonLocalPointerDepInternal -> getNonLocalPointerDepFromBB and split its inner loop out into a new GetNonLocalInfoForBlock function. No functionality change. llvm-svn: 60751	2008-12-09 07:47:11 +00:00
Chris Lattner	aeaec0838b	if we have two elements, insert both, don't use std::sort. This speeds up the new GVN by another 3% llvm-svn: 60747	2008-12-09 07:05:45 +00:00
Chris Lattner	4d1281cdf2	If we're only adding one new element to 'Cache', insert it into its known position instead of using a full sort. This speeds up GVN by ~4% with the new memdep stuff. llvm-svn: 60746	2008-12-09 06:58:04 +00:00
Chris Lattner	e8113a70fa	convert a couple other places that use pred_iterator to use the caching pred iterator. llvm-svn: 60745	2008-12-09 06:44:17 +00:00
Chris Lattner	768e5bcafc	use hte new pred cache to speed up the new non-local memdep queries. This speeds up GVN using the new queries (not yet checked in) by just over 10%. llvm-svn: 60743	2008-12-09 06:28:49 +00:00
Chris Lattner	5ed409edfa	add another level of caching for non-local pointer queries, keeping track of whether the CachedNonLocalPointerInfo for a block is specific to a block. If so, just return it without any pred scanning. This is good for a 6% speedup on GVN (when it uses this lookup method, which it doesn't right now). llvm-svn: 60695	2008-12-08 07:31:50 +00:00
Chris Lattner	fdb8843133	add an assert. the cast<> below would catch this but a message is more useful. llvm-svn: 60674	2008-12-07 18:45:15 +00:00
Chris Lattner	82b7034753	factor some code better. llvm-svn: 60673	2008-12-07 18:42:51 +00:00
Chris Lattner	de4440c24b	factor some code, fixing some fixme's. llvm-svn: 60672	2008-12-07 18:39:13 +00:00
Chris Lattner	a28355de14	add support for caching pointer dependence queries. Nothing uses this yet so it "can't" break anything. That said, it does appear to work. llvm-svn: 60654	2008-12-07 08:50:20 +00:00
Chris Lattner	7564a3b81b	Some internal refactoring to make it easier to cache results. llvm-svn: 60650	2008-12-07 02:56:57 +00:00
Chris Lattner	2faa2c724a	Introduce a new MemDep::getNonLocalPointerDependency method. This will eventually take over load/store dep queries from getNonLocalDependency. For now it works fine, but is incredibly slow because it does no caching. Lets not switch GVN to use it until that is fixed :) llvm-svn: 60649	2008-12-07 02:15:47 +00:00
Chris Lattner	5a78604e39	push the "pointer case" up the analysis stack a bit. This causes duplication of logic (in 2 places) to determine what pointer a load/store touches. This will be addressed in a future commit. llvm-svn: 60648	2008-12-07 01:50:16 +00:00
Chris Lattner	ed494f791e	make clients have to know how to call getCallSiteDependencyFrom instead of making getDependencyFrom do it. llvm-svn: 60647	2008-12-07 01:21:14 +00:00
Chris Lattner	ccb9c3370a	rename some variables for consistency llvm-svn: 60644	2008-12-07 00:39:19 +00:00
Chris Lattner	e2069a6949	I love how using out of scope variables is not an error with GCC, no really I do. llvm-svn: 60643	2008-12-07 00:38:27 +00:00
Chris Lattner	056c090c67	Rename getCallSiteDependency -> getCallSiteDependencyFrom to emphasize the scanning and make it more similar to getDependencyFrom llvm-svn: 60642	2008-12-07 00:35:51 +00:00
Chris Lattner	d4d9588abc	a memdep query on a volatile load/store will always return clobber with the current implementation. Instead of returning a "precise clobber" just return a fuzzy one. This doesn't matter to any clients anyway and should speed up analysis time very very slightly. llvm-svn: 60641	2008-12-07 00:28:02 +00:00
Chris Lattner	f5891941b4	remove the ability to get memdep info for vaarg. I don't think the original impl was correct and noone actually makes the query anyway. llvm-svn: 60639	2008-12-07 00:21:18 +00:00
Chris Lattner	0e3d6337c6	Make a few major changes to memdep and its clients: 1. Merge the 'None' result into 'Normal', making loads and stores return their dependencies on allocations as Normal. 2. Split the 'Normal' result into 'Clobber' and 'Def' to distinguish between the cases when memdep knows the value is produced from when we just know if may be changed. 3. Move some of the logic for determining whether readonly calls are CSEs into memdep instead of it being in GVN. This still leaves verification that the arguments are hte same to GVN to let it know about value equivalences in different contexts. 4. Change memdep's call/call dependency analysis to use getModRefInfo(CallSite,CallSite) instead of doing something very weak. This only really matters for things like DSA, but someday maybe we'll have some other decent context sensitive analyses :) 5. This reimplements the guts of memdep to handle the new results. 6. This simplifies GVN significantly: a) readonly call CSE is slightly simpler b) I eliminated the "getDependencyFrom" chaining for load elimination and load CSE doesn't have to worry about volatile (they are always clobbers) anymore. c) GVN no longer does any 'lastLoad' caching, leaving it to memdep. 7. The logic in DSE is simplified a bit and sped up. A potentially unsafe case was eliminated. llvm-svn: 60607	2008-12-05 21:04:20 +00:00
Chris Lattner	eda6432beb	Make it illegal to call getDependency* on non-memory instructions like binary operators. llvm-svn: 60600	2008-12-05 18:46:19 +00:00
Chris Lattner	7e61dafc95	Reimplement the non-local dependency data structure in terms of a sorted vector instead of a densemap. This shrinks the memory usage of this thing substantially (the high water mark) as well as making operations like scanning it faster. This speeds up memdep slightly, gvn goes from 3.9376 to 3.9118s on 403.gcc This also splits out the statistics for the cached non-local case to differentiate between the dirty and clean cached case. Here's the stats for 403.gcc: 6153 memdep - Number of dirty cached non-local responses 169336 memdep - Number of fully cached non-local responses 162428 memdep - Number of uncached non-local responses yay for caching :) llvm-svn: 60313	2008-12-01 01:15:42 +00:00
Chris Lattner	47e81d0e90	Eliminate the DepResultTy abstraction. It is now completely redundant with MemDepResult, and MemDepResult has a nicer interface. llvm-svn: 60308	2008-11-30 23:17:19 +00:00
Chris Lattner	13cae612b9	Cache TargetData/AliasAnalysis in the pass instead of calling getAnalysis<>. getAnalysis<> is apparently extremely expensive. Doing this speeds up GVN on 403.gcc by 16%! llvm-svn: 60304	2008-11-30 19:24:31 +00:00
Chris Lattner	441042796d	Two changes: Make getDependency remove QueryInst for a dirty record's ReverseLocalDeps when we update it. This fixes a regression test failure from my last commit. Second, for each non-local cached information structure, keep a bit that indicates whether it is dirty or not. This saves us a scan over the whole thing in the common case when it isn't dirty. llvm-svn: 60274	2008-11-30 02:52:26 +00:00
Chris Lattner	fc678e2af5	introduce a typedef, no functionality change. llvm-svn: 60272	2008-11-30 02:30:50 +00:00
Chris Lattner	1b810bd5e6	Change NonLocalDeps to be a densemap of pointers to densemap instead of containing them by value. This increases the density (!) of NonLocalDeps as well as making the reallocation case faster. This speeds up gvn on 403.gcc by 2% and makes room for future improvements. I'm not super thrilled with having to explicitly manage the new/delete of the map, but it is necesary for the next change. llvm-svn: 60271	2008-11-30 02:28:25 +00:00
Chris Lattner	ff862c4e88	calls never depend on allocations. llvm-svn: 60268	2008-11-30 01:44:00 +00:00
Chris Lattner	3ff6d01586	Fix a fixme by making memdep's handling of allocations more logical. If we see that a load depends on the allocation of its memory with no intervening stores, we now return a 'None' depedency instead of "Normal". This tweaks GVN to do its optimization with the new result. llvm-svn: 60267	2008-11-30 01:39:32 +00:00
Chris Lattner	60444f8aa5	implement a fixme by introducing a new getDependencyFromInternal method that returns its result as a DepResultTy instead of as a MemDepResult. This reduces conversion back and forth. llvm-svn: 60266	2008-11-30 01:26:32 +00:00
Chris Lattner	2059753e66	Move the getNonLocalDependency method to a more logical place in the file, no functionality change. llvm-svn: 60265	2008-11-30 01:18:27 +00:00
Chris Lattner	3d5d5f2c6d	REmove an old fixme, resolve another fixme by adding liberal comments about what this class does. llvm-svn: 60264	2008-11-30 01:17:08 +00:00
Chris Lattner	ada1f87988	remove a bit of incorrect code that tried to be tricky about speeding up dependencies. The basic situation was this: consider if we had: store1 ... store2 ... store3 Where memdep thinks that store3 depends on store2 and store2 depends on store1. The problem happens when we delete store2: The code in question was updating dep info for store3 to be store1. This is a spiffy optimization, but is not safe at all, because aliasing isn't transitive. This bug isn't exposed today with DSE because DSE will only zap store2 if it is identifical to store 3, and in this case, it is safe to update it to depend on store1. However, memcpyopt is not so fortunate, which is presumably why the "dropInstruction" code used to exist. Since this doesn't actually provide a speedup in practice, just rip the code out. llvm-svn: 60263	2008-11-30 01:09:30 +00:00
Chris Lattner	63bd586d35	Eliminate the dropInstruction method, which is not needed any more. Fix a subtle iterator invalidation bug I introduced in the last commit. llvm-svn: 60258	2008-11-29 23:30:39 +00:00
Chris Lattner	e7d7e13bf7	implement some fixme's: when deleting an instruction with an entry in the nonlocal deps map, don't reset entries referencing that instruction to [dirty, null], instead, set them to [dirty,next] where next is the instruction after the deleted one. Use this information in the non-local deps code to avoid rescanning entire blocks. This speeds up GVN slightly by avoiding pointless work. On 403.gcc this makes GVN 1.5% faster. llvm-svn: 60256	2008-11-29 22:02:15 +00:00
Chris Lattner	1c6b62eb4d	Change MemDep::getNonLocalDependency to return its results as a smallvector instead of a DenseMap. This speeds up GVN by 5% on 403.gcc. llvm-svn: 60255	2008-11-29 21:33:22 +00:00
Chris Lattner	b8ec75bc35	move MemoryDependenceAnalysis::verifyRemoved to the end of the file, no functionality/code change. llvm-svn: 60254	2008-11-29 21:25:10 +00:00
Chris Lattner	f280b0c729	reimplement getNonLocalDependency with a simpler worklist formulation that is faster and doesn't require nonLazyHelper. Much less code. llvm-svn: 60253	2008-11-29 21:22:42 +00:00
Chris Lattner	9f1988ab6c	rename some maps. llvm-svn: 60242	2008-11-29 09:20:15 +00:00
Chris Lattner	5cd1cfad11	rename some variables. llvm-svn: 60241	2008-11-29 09:15:21 +00:00
Chris Lattner	80c081828f	eliminate a bunch of code in favor of using AliasAnalysis::getModRefInfo. Put a some code back to handle buggy behavior that GVN expects: it wants loads to depend on each other, and accesses to depend on their allocations. llvm-svn: 60240	2008-11-29 09:09:48 +00:00
Chris Lattner	81f19e9aa4	simplify some code and rename some variables. Reduce nesting. Use getTypeStoreSize instead of ABITypeSize for in-memory size in a couple places. llvm-svn: 60238	2008-11-29 08:51:16 +00:00
Chris Lattner	51ba8d0630	Split getDependency into getDependency and getDependencyFrom, the former does caching, the later doesn't. This dramatically simplifies the logic in getDependency and getDependencyFrom. llvm-svn: 60234	2008-11-29 03:47:00 +00:00
Chris Lattner	e4d32791ef	Now that DepType is private, we can start cleaning up some of its uses: Document the Dirty value more precisely, use it for the uninitialized DepResultTy value. Change reverse mappings to be from an instruction* instead of DepResultTy, and stop tracking other forms. This makes it more clear that we only care about the instruction cases. Eliminate a DepResultTy,bool pair by using Dirty in the local case as well, shrinking the map and simplifying the code. This speeds up GVN by ~3% on 403.gcc. llvm-svn: 60232	2008-11-29 03:22:12 +00:00
Chris Lattner	7f9c8a0f05	Introduce and use a new MemDepResult class to hold the results of a memdep query. This makes it crystal clear what cases can escape from MemDep that the clients have to handle. This also gives the clients a nice simplified interface to it that is easy to poke at. This patch also makes DepResultTy and MemoryDependenceAnalysis::DepType private, yay. llvm-svn: 60231	2008-11-29 02:29:27 +00:00
Chris Lattner	de04e1173a	Reimplement the internal abstraction used by MemDep in terms of a pointer/int pair instead of a manually bitmangled pointer. This forces clients to think a little more about checking the appropriate pieces and will be useful for internal implementation improvements later. I'm not particularly happy with this. After going through this I don't think that the clients of memdep should be exposed to the internal type at all. I'll fix this in a subsequent commit. This has no functionality change. llvm-svn: 60230	2008-11-29 01:43:36 +00:00
Chris Lattner	d3d9111ede	Fix PR3141 by ensuring that MemoryDependenceAnalysis::removeInstruction properly updates the reverse dependency map when it installs updated dependencies for instructions that depend on the removed instruction. llvm-svn: 60222	2008-11-28 22:51:08 +00:00
Chris Lattner	73c254593e	more cleanups for MemoryDependenceAnalysis::removeInstruction, no functionality change. llvm-svn: 60219	2008-11-28 22:28:27 +00:00
Chris Lattner	a25d3952c6	random cleanups, no functionality change. llvm-svn: 60218	2008-11-28 22:04:47 +00:00
Chris Lattner	554d1221aa	Run verifyRemoved from removeInstruction when -debug is specified. This shows the root problem behind PR3141. llvm-svn: 60216	2008-11-28 21:45:17 +00:00
Chris Lattner	e5fd5c29de	rename "ping" to "verifyRemoved". I don't know why 'ping' what chosen, but it doesn't make any sense at all. Also make the method const, private, and fit in 80 cols while we're at it. llvm-svn: 60215	2008-11-28 21:42:09 +00:00
Chris Lattner	dca2cd3562	remove mysterious escaped newlines. llvm-svn: 60211	2008-11-28 21:16:44 +00:00
Duncan Sands	0a6d01770f	Fix comment typo. llvm-svn: 56116	2008-09-11 19:41:10 +00:00
Owen Anderson	d70cf1d5ae	Fix a subtle bug when removing instructions from memdep. In very specific circumstances we could end up remapping a dependee to the same instruction that we're trying to remove. Handle this properly by just falling back to a conservative solution. llvm-svn: 54132	2008-07-28 16:00:58 +00:00
Owen Anderson	b22a640fe4	A better fix for PR2503 that doesn't pessimize GVN in the presence of unreachable blocks. llvm-svn: 53032	2008-07-02 17:20:16 +00:00
Owen Anderson	2a3a1127e2	Properly handle cases where a predecessor of the block being queried on is unreachable. This fixes PR2503, though we should also fix other passes not to emit this kind of code. llvm-svn: 52946	2008-07-01 00:40:58 +00:00
Owen Anderson	54ea37b9e9	Remember to update the reverse non-local cache when cleaning up dirty entries. This fixes PR2397. llvm-svn: 51846	2008-06-01 21:03:52 +00:00
Owen Anderson	b77103b7e4	Make ping more aggressive in finding nonlocal caching errors. llvm-svn: 51845	2008-06-01 20:51:41 +00:00
Owen Anderson	3ab976a21f	Fix memdep's handling of invokes when finding the dependency of another call instruction. This fixes some Ada miscompiles reported in PR2324. llvm-svn: 51069	2008-05-13 21:25:37 +00:00
Dan Gohman	d78c400b5b	Clean up the use of static and anonymous namespaces. This turned up several things that were neither in an anonymous namespace nor static but not intended to be global. llvm-svn: 51017	2008-05-13 00:00:25 +00:00
Dan Gohman	6a2da37c0e	Make several variable declarations static. llvm-svn: 50696	2008-05-06 01:53:16 +00:00
Owen Anderson	f9ae76d89c	Make GVN able to remove unnecessary calls to read-only functions again. llvm-svn: 49842	2008-04-17 05:36:50 +00:00
Dan Gohman	9b5ffc8408	Fix a typo in a comment. llvm-svn: 49504	2008-04-10 23:02:38 +00:00
Owen Anderson	53336d8055	Fix for PR2190. Memdep's non-local caching was checking dirtied blocks in the wrong order. llvm-svn: 49499	2008-04-10 22:13:32 +00:00
Dan Gohman	3717cdaf22	Set blockBegin to point to the beginning of the block, not the end. llvm-svn: 48999	2008-03-31 22:08:00 +00:00
Devang Patel	80e43fa744	Restore isCFGOnly property of various analysis passes. llvm-svn: 48579	2008-03-20 02:25:21 +00:00
Devang Patel	718da668ab	PassInfo keep tracks whether a pass is an analysis pass or not. llvm-svn: 48554	2008-03-19 21:56:59 +00:00
Owen Anderson	00dba4f734	Re-apply the patch to improve the optimizations of memcpy's, with several bugs fixed. This now passes PPC bootstrap. llvm-svn: 47026	2008-02-12 21:15:18 +00:00
Tanya Lattner	182a9fd39f	Throttle the non-local dependence analysis for basic blocks with more than 50 predecessors. Added command line option to play with this threshold. llvm-svn: 46790	2008-02-06 00:54:55 +00:00
Owen Anderson	1de5997119	Fix an obscure read-after-free bug that Duncan found. llvm-svn: 46738	2008-02-05 04:34:03 +00:00
Owen Anderson	b255ada55b	Fix an issue where, under very specific circumstances, memdep could end up dereferencing the end of one of its internal maps. llvm-svn: 46541	2008-01-30 01:24:05 +00:00
Chris Lattner	f3ebc3f3d2	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Owen Anderson	086b2c4537	Fix several cache coherence bugs in MemDep/GVN that were found. Also add some (disabled) debugging code to make such problems easier to diagnose in the future, written by Duncan Sands. llvm-svn: 44695	2007-12-08 01:37:09 +00:00
Duncan Sands	68b6f50938	Integrate the readonly/readnone logic more deeply into alias analysis. This meant updating the API which now has versions of the getModRefBehavior, doesNotAccessMemory and onlyReadsMemory methods which take a callsite parameter. These should be used unless the callsite is not known, since in general they can do a better job than the versions that take a function. Also, users should no longer call the version of getModRefBehavior that takes both a function and a callsite. To reduce the chance of misuse it is now protected. llvm-svn: 44487	2007-12-01 07:51:45 +00:00
Owen Anderson	7cad745d49	Fix a silly bug that Nicholas noticed. llvm-svn: 44324	2007-11-26 03:27:38 +00:00
Owen Anderson	4f833c7610	Allow GVN to eliminate read-only function calls when it can detect that they are redundant. llvm-svn: 44323	2007-11-26 02:26:36 +00:00
Duncan Sands	44b8721de8	Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. The meaning of getTypeSize was not clear - clarifying it is important now that we have x86 long double and arbitrary precision integers. The issue with long double is that it requires 80 bits, and this is not a multiple of its alignment. This gives a primitive type for which getTypeSize differed from getABITypeSize. For arbitrary precision integers it is even worse: there is the minimum number of bits needed to hold the type (eg: 36 for an i36), the maximum number of bits that will be overwriten when storing the type (40 bits for i36) and the ABI size (i.e. the storage size rounded up to a multiple of the alignment; 64 bits for i36). This patch removes getTypeSize (not really - it is still there but deprecated to allow for a gradual transition). Instead there is: (1) getTypeSizeInBits - a number of bits that suffices to hold all values of the type. For a primitive type, this is the minimum number of bits. For an i36 this is 36 bits. For x86 long double it is 80. This corresponds to gcc's TYPE_PRECISION. (2) getTypeStoreSizeInBits - the maximum number of bits that is written when storing the type (or read when reading it). For an i36 this is 40 bits, for an x86 long double it is 80 bits. This is the size alias analysis is interested in (getTypeStoreSize returns the number of bytes). There doesn't seem to be anything corresponding to this in gcc. (3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded up to a multiple of the alignment. For an i36 this is 64, for an x86 long double this is 96 or 128 depending on the OS. This is the spacing between consecutive elements when you form an array out of this type (getABITypeSize returns the number of bytes). This is TYPE_SIZE in gcc. Since successive elements in a SequentialType (arrays, pointers and vectors) need to be aligned, the spacing between them will be given by getABITypeSize. This means that the size of an array is the length times the getABITypeSize. It also means that GEP computations need to use getABITypeSize when computing offsets. Furthermore, if an alloca allocates several elements at once then these too need to be aligned, so the size of the alloca has to be the number of elements multiplied by getABITypeSize. Logically speaking this doesn't have to be the case when allocating just one element, but it is simpler to also use getABITypeSize in this case. So alloca's and mallocs should use getABITypeSize. Finally, since gcc's only notion of size is that given by getABITypeSize, if you want to output assembler etc the same as gcc then getABITypeSize is the size you want. Since a store will overwrite no more than getTypeStoreSize bytes, and a read will read no more than that many bytes, this is the notion of size appropriate for alias analysis calculations. In this patch I have corrected all type size uses except some of those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard cases). I will get around to auditing these too at some point, but I could do with some help. Finally, I made one change which I think wise but others might consider pointless and suboptimal: in an unpacked struct the amount of space allocated for a field is now given by the ABI size rather than getTypeStoreSize. I did this because every other place that reserves memory for a type (eg: alloca) now uses getABITypeSize, and I didn't want to make an exception for unpacked structs, i.e. I did it to make things more uniform. This only effects structs containing long doubles and arbitrary precision integers. If someone wants to pack these types more tightly they can always use a packed struct. llvm-svn: 43620	2007-11-01 20:53:16 +00:00
Owen Anderson	46da2a6262	Add partial caching of non-local memory dependence queries. This provides a modest speedup for GVN. llvm-svn: 42185	2007-09-21 03:53:52 +00:00
Owen Anderson	c201cbc802	Add a flag to mark a dirty cache entry. This is not yet used, but will eventually help non-local memdep caching. llvm-svn: 42137	2007-09-19 16:13:57 +00:00
Owen Anderson	f9203ab36a	Fix a typo in memdep, which was causing PR1648. llvm-svn: 41833	2007-09-11 04:31:00 +00:00
Owen Anderson	82e4fa1020	Remove an un-needed dependence query. This improves compile time marginally on 401.bzip2. llvm-svn: 41792	2007-09-09 21:43:49 +00:00
Owen Anderson	5f208bea91	Cache non-local memory dependence analysis. This is a significant compile time performance win in most cases. llvm-svn: 41126	2007-08-16 21:27:05 +00:00
Owen Anderson	9b1cc8cac0	Make NonLocal and None const in the right way. :-) llvm-svn: 40961	2007-08-09 04:42:44 +00:00
Owen Anderson	2b21c3c7a8	Add more comments to memdep. llvm-svn: 40953	2007-08-08 22:26:03 +00:00
Owen Anderson	fa788358d5	Make memdep fit in 80 cols. llvm-svn: 40950	2007-08-08 22:01:54 +00:00
Owen Anderson	b84d3b1c92	Change the None and NonLocal markers in memdep to be const. llvm-svn: 40946	2007-08-08 21:39:39 +00:00
Owen Anderson	68c6732d2c	Clean up a bunch of caching stuff in memdep. This reduces the time to run GVN on 403.gcc from ~15s to ~10s. llvm-svn: 40884	2007-08-07 00:33:45 +00:00
Owen Anderson	4898513d96	Improve the accuracy of memdep for determining the dependencies of loads. This brings GVN to parity with GCSE+LoadVN. llvm-svn: 40882	2007-08-06 23:26:03 +00:00
Owen Anderson	0ac1fc8ac1	Fix a bug that was causing several miscompilations on SPEC. llvm-svn: 40746	2007-08-02 17:56:05 +00:00
Owen Anderson	c321e5e272	Make non-local memdep not be recursive, and fix a bug on 403.gcc that this exposed. llvm-svn: 40692	2007-08-01 22:01:54 +00:00
David Greene	87801e8773	Fix GLIBCXX_DEBUG error owing to dereference of end iterator. There's no guarantee that an instruction returned by getDependency exists in the maps. llvm-svn: 40647	2007-07-31 20:01:27 +00:00
Owen Anderson	212d5c27f6	Use more caching when computing non-local dependence. This makes bzip2 not use up the entire 32-bit address space. llvm-svn: 40596	2007-07-30 17:29:24 +00:00
Owen Anderson	0f692f27a3	Fix a bug introduced in my last commit. llvm-svn: 40542	2007-07-26 18:57:04 +00:00
Owen Anderson	dbf23ccaa0	Fix a couple more bugs in the phi construction by pulling in code that does almost the same things from LCSSA. llvm-svn: 40540	2007-07-26 18:26:51 +00:00
Owen Anderson	9b796348bd	Fix a bug in non-local memdep that was causing an infinite loop on 175.vpr. llvm-svn: 40495	2007-07-25 21:26:36 +00:00
Owen Anderson	5e5599b7ce	Add basic support for performing whole-function RLE. Note: This has not yet been thoroughly tested. Use at your own risk. llvm-svn: 40489	2007-07-25 19:57:03 +00:00
Owen Anderson	d998be79cc	Add initial support for non-local memory dependence analysis. NOTE: This has only been cursorily tested. Expected improvements soon. llvm-svn: 40476	2007-07-24 21:52:37 +00:00
Owen Anderson	edb926bfe3	When removing instructions from the analysis, be sure to check the confirmed flag when determining what to do with dependencies. llvm-svn: 40079	2007-07-20 06:16:07 +00:00
Owen Anderson	7fcaaadf1c	Add support for walking up memory def chains, which enables finding many more dead stores on 400.perlbench. llvm-svn: 39929	2007-07-16 21:52:50 +00:00
Owen Anderson	1e1bace52b	Let MemoryDependenceAnalysis take care of updating AliasAnalysis. llvm-svn: 39769	2007-07-12 00:06:21 +00:00
Owen Anderson	c432490b4c	Calculate the size of a array allocation correctly. llvm-svn: 38511	2007-07-10 20:48:38 +00:00
Owen Anderson	faf9e42479	Fix a crasher when finding the dependency of a call. llvm-svn: 38510	2007-07-10 20:39:07 +00:00
Owen Anderson	1279eaf776	Make this pass registration static as well. llvm-svn: 38509	2007-07-10 20:21:08 +00:00
Owen Anderson	1fa6132e85	Handle vaarg instructions correctly. llvm-svn: 38504	2007-07-10 18:43:15 +00:00
Owen Anderson	94a21dd1e0	Volatile loads and stores depend on each other. llvm-svn: 38502	2007-07-10 18:11:42 +00:00
Owen Anderson	9c88457abe	Add support for finding the dependencies of call and invoke instructions. llvm-svn: 38497	2007-07-10 17:59:22 +00:00
Owen Anderson	2552a12e65	Fix the build, and fix the handling of pointer sizes. llvm-svn: 38494	2007-07-10 17:25:03 +00:00
Owen Anderson	47352db672	Fix a bunch of things from Chris' feedback llvm-svn: 38493	2007-07-10 17:08:11 +00:00
Owen Anderson	c0daf5fe53	A first stab at memory dependence analysis. This is an interface on top of alias analysis, adding caching and lazy computation of queries. This will be used in planned improvements to memory access optimizations. llvm-svn: 37958	2007-07-06 23:14:35 +00:00

... 2 3 4 5 6 ...

315 Commits