llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	0e3d6337c6	Make a few major changes to memdep and its clients: 1. Merge the 'None' result into 'Normal', making loads and stores return their dependencies on allocations as Normal. 2. Split the 'Normal' result into 'Clobber' and 'Def' to distinguish between the cases when memdep knows the value is produced from when we just know it may be changed. 3. Move some of the logic for determining whether readonly calls are CSEs into memdep instead of it being in GVN. This still leaves verification that the arguments are the same to GVN to let it know about value equivalences in different contexts. 4. Change memdep's call/call dependency analysis to use getModRefInfo(CallSite,CallSite) instead of doing something very weak. This only really matters for things like DSA, but someday maybe we'll have some other decent context sensitive analyses :) 5. This reimplements the guts of memdep to handle the new results. 6. This simplifies GVN significantly: a) readonly call CSE is slightly simpler b) I eliminated the "getDependencyFrom" chaining for load elimination and load CSE doesn't have to worry about volatile (they are always clobbers) anymore. c) GVN no longer does any 'lastLoad' caching, leaving it to memdep. 7. The logic in DSE is simplified a bit and sped up. A potentially unsafe case was eliminated. llvm-svn: 60607	2008-12-05 21:04:20 +00:00
Chris Lattner	eda6432beb	Make it illegal to call getDependency* on non-memory instructions like binary operators. llvm-svn: 60600	2008-12-05 18:46:19 +00:00
Chris Lattner	7e61dafc95	Reimplement the non-local dependency data structure in terms of a sorted vector instead of a densemap. This shrinks the memory usage of this thing substantially (the high water mark) as well as making operations like scanning it faster. This speeds up memdep slightly, gvn goes from 3.9376 to 3.9118s on 403.gcc This also splits out the statistics for the cached non-local case to differentiate between the dirty and clean cached case. Here's the stats for 403.gcc: 6153 memdep - Number of dirty cached non-local responses 169336 memdep - Number of fully cached non-local responses 162428 memdep - Number of uncached non-local responses yay for caching :) llvm-svn: 60313	2008-12-01 01:15:42 +00:00
Chris Lattner	47e81d0e90	Eliminate the DepResultTy abstraction. It is now completely redundant with MemDepResult, and MemDepResult has a nicer interface. llvm-svn: 60308	2008-11-30 23:17:19 +00:00
Chris Lattner	13cae612b9	Cache TargetData/AliasAnalysis in the pass instead of calling getAnalysis<>. getAnalysis<> is apparently extremely expensive. Doing this speeds up GVN on 403.gcc by 16%! llvm-svn: 60304	2008-11-30 19:24:31 +00:00
Chris Lattner	441042796d	Two changes: Make getDependency remove QueryInst for a dirty record's ReverseLocalDeps when we update it. This fixes a regression test failure from my last commit. Second, for each non-local cached information structure, keep a bit that indicates whether it is dirty or not. This saves us a scan over the whole thing in the common case when it isn't dirty. llvm-svn: 60274	2008-11-30 02:52:26 +00:00
Chris Lattner	fc678e2af5	introduce a typedef, no functionality change. llvm-svn: 60272	2008-11-30 02:30:50 +00:00
Chris Lattner	1b810bd5e6	Change NonLocalDeps to be a densemap of pointers to densemap instead of containing them by value. This increases the density (!) of NonLocalDeps as well as making the reallocation case faster. This speeds up gvn on 403.gcc by 2% and makes room for future improvements. I'm not super thrilled with having to explicitly manage the new/delete of the map, but it is necessary for the next change. llvm-svn: 60271	2008-11-30 02:28:25 +00:00
Chris Lattner	ff862c4e88	calls never depend on allocations. llvm-svn: 60268	2008-11-30 01:44:00 +00:00
Chris Lattner	3ff6d01586	Fix a fixme by making memdep's handling of allocations more logical. If we see that a load depends on the allocation of its memory with no intervening stores, we now return a 'None' dependency instead of "Normal". This tweaks GVN to do its optimization with the new result. llvm-svn: 60267	2008-11-30 01:39:32 +00:00
Chris Lattner	60444f8aa5	implement a fixme by introducing a new getDependencyFromInternal method that returns its result as a DepResultTy instead of as a MemDepResult. This reduces conversion back and forth. llvm-svn: 60266	2008-11-30 01:26:32 +00:00
Chris Lattner	2059753e66	Move the getNonLocalDependency method to a more logical place in the file, no functionality change. llvm-svn: 60265	2008-11-30 01:18:27 +00:00
Chris Lattner	3d5d5f2c6d	Remove an old fixme, resolve another fixme by adding liberal comments about what this class does. llvm-svn: 60264	2008-11-30 01:17:08 +00:00
Chris Lattner	ada1f87988	remove a bit of incorrect code that tried to be tricky about speeding up dependencies. The basic situation was this: consider if we had: store1 ... store2 ... store3 Where memdep thinks that store3 depends on store2 and store2 depends on store1. The problem happens when we delete store2: The code in question was updating dep info for store3 to be store1. This is a spiffy optimization, but is not safe at all, because aliasing isn't transitive. This bug isn't exposed today with DSE because DSE will only zap store2 if it is identical to store3, and in this case, it is safe to update it to depend on store1. However, memcpyopt is not so fortunate, which is presumably why the "dropInstruction" code used to exist. Since this doesn't actually provide a speedup in practice, just rip the code out. llvm-svn: 60263	2008-11-30 01:09:30 +00:00
Chris Lattner	63bd586d35	Eliminate the dropInstruction method, which is not needed any more. Fix a subtle iterator invalidation bug I introduced in the last commit. llvm-svn: 60258	2008-11-29 23:30:39 +00:00
Chris Lattner	e7d7e13bf7	implement some fixme's: when deleting an instruction with an entry in the nonlocal deps map, don't reset entries referencing that instruction to [dirty, null], instead, set them to [dirty,next] where next is the instruction after the deleted one. Use this information in the non-local deps code to avoid rescanning entire blocks. This speeds up GVN slightly by avoiding pointless work. On 403.gcc this makes GVN 1.5% faster. llvm-svn: 60256	2008-11-29 22:02:15 +00:00
Chris Lattner	1c6b62eb4d	Change MemDep::getNonLocalDependency to return its results as a smallvector instead of a DenseMap. This speeds up GVN by 5% on 403.gcc. llvm-svn: 60255	2008-11-29 21:33:22 +00:00
Chris Lattner	b8ec75bc35	move MemoryDependenceAnalysis::verifyRemoved to the end of the file, no functionality/code change. llvm-svn: 60254	2008-11-29 21:25:10 +00:00
Chris Lattner	f280b0c729	reimplement getNonLocalDependency with a simpler worklist formulation that is faster and doesn't require nonLazyHelper. Much less code. llvm-svn: 60253	2008-11-29 21:22:42 +00:00
Chris Lattner	9f1988ab6c	rename some maps. llvm-svn: 60242	2008-11-29 09:20:15 +00:00
Chris Lattner	5cd1cfad11	rename some variables. llvm-svn: 60241	2008-11-29 09:15:21 +00:00
Chris Lattner	80c081828f	eliminate a bunch of code in favor of using AliasAnalysis::getModRefInfo. Put some code back to handle buggy behavior that GVN expects: it wants loads to depend on each other, and accesses to depend on their allocations. llvm-svn: 60240	2008-11-29 09:09:48 +00:00
Chris Lattner	81f19e9aa4	simplify some code and rename some variables. Reduce nesting. Use getTypeStoreSize instead of ABITypeSize for in-memory size in a couple places. llvm-svn: 60238	2008-11-29 08:51:16 +00:00
Chris Lattner	51ba8d0630	Split getDependency into getDependency and getDependencyFrom, the former does caching, the latter doesn't. This dramatically simplifies the logic in getDependency and getDependencyFrom. llvm-svn: 60234	2008-11-29 03:47:00 +00:00
Chris Lattner	e4d32791ef	Now that DepType is private, we can start cleaning up some of its uses: Document the Dirty value more precisely, use it for the uninitialized DepResultTy value. Change reverse mappings to be from an instruction* instead of DepResultTy, and stop tracking other forms. This makes it more clear that we only care about the instruction cases. Eliminate a DepResultTy,bool pair by using Dirty in the local case as well, shrinking the map and simplifying the code. This speeds up GVN by ~3% on 403.gcc. llvm-svn: 60232	2008-11-29 03:22:12 +00:00
Chris Lattner	7f9c8a0f05	Introduce and use a new MemDepResult class to hold the results of a memdep query. This makes it crystal clear what cases can escape from MemDep that the clients have to handle. This also gives the clients a nice simplified interface to it that is easy to poke at. This patch also makes DepResultTy and MemoryDependenceAnalysis::DepType private, yay. llvm-svn: 60231	2008-11-29 02:29:27 +00:00
Chris Lattner	de04e1173a	Reimplement the internal abstraction used by MemDep in terms of a pointer/int pair instead of a manually bitmangled pointer. This forces clients to think a little more about checking the appropriate pieces and will be useful for internal implementation improvements later. I'm not particularly happy with this. After going through this I don't think that the clients of memdep should be exposed to the internal type at all. I'll fix this in a subsequent commit. This has no functionality change. llvm-svn: 60230	2008-11-29 01:43:36 +00:00
Chris Lattner	d3d9111ede	Fix PR3141 by ensuring that MemoryDependenceAnalysis::removeInstruction properly updates the reverse dependency map when it installs updated dependencies for instructions that depend on the removed instruction. llvm-svn: 60222	2008-11-28 22:51:08 +00:00
Chris Lattner	73c254593e	more cleanups for MemoryDependenceAnalysis::removeInstruction, no functionality change. llvm-svn: 60219	2008-11-28 22:28:27 +00:00
Chris Lattner	a25d3952c6	random cleanups, no functionality change. llvm-svn: 60218	2008-11-28 22:04:47 +00:00
Chris Lattner	554d1221aa	Run verifyRemoved from removeInstruction when -debug is specified. This shows the root problem behind PR3141. llvm-svn: 60216	2008-11-28 21:45:17 +00:00
Chris Lattner	e5fd5c29de	rename "ping" to "verifyRemoved". I don't know why 'ping' was chosen, but it doesn't make any sense at all. Also make the method const, private, and fit in 80 cols while we're at it. llvm-svn: 60215	2008-11-28 21:42:09 +00:00
Chris Lattner	dca2cd3562	remove mysterious escaped newlines. llvm-svn: 60211	2008-11-28 21:16:44 +00:00
Duncan Sands	0a6d01770f	Fix comment typo. llvm-svn: 56116	2008-09-11 19:41:10 +00:00
Owen Anderson	d70cf1d5ae	Fix a subtle bug when removing instructions from memdep. In very specific circumstances we could end up remapping a dependee to the same instruction that we're trying to remove. Handle this properly by just falling back to a conservative solution. llvm-svn: 54132	2008-07-28 16:00:58 +00:00
Owen Anderson	b22a640fe4	A better fix for PR2503 that doesn't pessimize GVN in the presence of unreachable blocks. llvm-svn: 53032	2008-07-02 17:20:16 +00:00
Owen Anderson	2a3a1127e2	Properly handle cases where a predecessor of the block being queried on is unreachable. This fixes PR2503, though we should also fix other passes not to emit this kind of code. llvm-svn: 52946	2008-07-01 00:40:58 +00:00
Owen Anderson	54ea37b9e9	Remember to update the reverse non-local cache when cleaning up dirty entries. This fixes PR2397. llvm-svn: 51846	2008-06-01 21:03:52 +00:00
Owen Anderson	b77103b7e4	Make ping more aggressive in finding nonlocal caching errors. llvm-svn: 51845	2008-06-01 20:51:41 +00:00
Owen Anderson	3ab976a21f	Fix memdep's handling of invokes when finding the dependency of another call instruction. This fixes some Ada miscompiles reported in PR2324. llvm-svn: 51069	2008-05-13 21:25:37 +00:00
Dan Gohman	d78c400b5b	Clean up the use of static and anonymous namespaces. This turned up several things that were neither in an anonymous namespace nor static but not intended to be global. llvm-svn: 51017	2008-05-13 00:00:25 +00:00
Dan Gohman	6a2da37c0e	Make several variable declarations static. llvm-svn: 50696	2008-05-06 01:53:16 +00:00
Owen Anderson	f9ae76d89c	Make GVN able to remove unnecessary calls to read-only functions again. llvm-svn: 49842	2008-04-17 05:36:50 +00:00
Dan Gohman	9b5ffc8408	Fix a typo in a comment. llvm-svn: 49504	2008-04-10 23:02:38 +00:00
Owen Anderson	53336d8055	Fix for PR2190. Memdep's non-local caching was checking dirtied blocks in the wrong order. llvm-svn: 49499	2008-04-10 22:13:32 +00:00
Dan Gohman	3717cdaf22	Set blockBegin to point to the beginning of the block, not the end. llvm-svn: 48999	2008-03-31 22:08:00 +00:00
Devang Patel	80e43fa744	Restore isCFGOnly property of various analysis passes. llvm-svn: 48579	2008-03-20 02:25:21 +00:00
Devang Patel	718da668ab	PassInfo keeps track of whether a pass is an analysis pass or not. llvm-svn: 48554	2008-03-19 21:56:59 +00:00
Owen Anderson	00dba4f734	Re-apply the patch to improve the optimizations of memcpy's, with several bugs fixed. This now passes PPC bootstrap. llvm-svn: 47026	2008-02-12 21:15:18 +00:00
Tanya Lattner	182a9fd39f	Throttle the non-local dependence analysis for basic blocks with more than 50 predecessors. Added command line option to play with this threshold. llvm-svn: 46790	2008-02-06 00:54:55 +00:00
Owen Anderson	1de5997119	Fix an obscure read-after-free bug that Duncan found. llvm-svn: 46738	2008-02-05 04:34:03 +00:00
Owen Anderson	b255ada55b	Fix an issue where, under very specific circumstances, memdep could end up dereferencing the end of one of its internal maps. llvm-svn: 46541	2008-01-30 01:24:05 +00:00
Chris Lattner	f3ebc3f3d2	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Owen Anderson	086b2c4537	Fix several cache coherence bugs in MemDep/GVN that were found. Also add some (disabled) debugging code to make such problems easier to diagnose in the future, written by Duncan Sands. llvm-svn: 44695	2007-12-08 01:37:09 +00:00
Duncan Sands	68b6f50938	Integrate the readonly/readnone logic more deeply into alias analysis. This meant updating the API which now has versions of the getModRefBehavior, doesNotAccessMemory and onlyReadsMemory methods which take a callsite parameter. These should be used unless the callsite is not known, since in general they can do a better job than the versions that take a function. Also, users should no longer call the version of getModRefBehavior that takes both a function and a callsite. To reduce the chance of misuse it is now protected. llvm-svn: 44487	2007-12-01 07:51:45 +00:00
Owen Anderson	7cad745d49	Fix a silly bug that Nicholas noticed. llvm-svn: 44324	2007-11-26 03:27:38 +00:00
Owen Anderson	4f833c7610	Allow GVN to eliminate read-only function calls when it can detect that they are redundant. llvm-svn: 44323	2007-11-26 02:26:36 +00:00
Duncan Sands	44b8721de8	Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. The meaning of getTypeSize was not clear - clarifying it is important now that we have x86 long double and arbitrary precision integers. The issue with long double is that it requires 80 bits, and this is not a multiple of its alignment. This gives a primitive type for which getTypeSize differed from getABITypeSize. For arbitrary precision integers it is even worse: there is the minimum number of bits needed to hold the type (eg: 36 for an i36), the maximum number of bits that will be overwritten when storing the type (40 bits for i36) and the ABI size (i.e. the storage size rounded up to a multiple of the alignment; 64 bits for i36). This patch removes getTypeSize (not really - it is still there but deprecated to allow for a gradual transition). Instead there is: (1) getTypeSizeInBits - a number of bits that suffices to hold all values of the type. For a primitive type, this is the minimum number of bits. For an i36 this is 36 bits. For x86 long double it is 80. This corresponds to gcc's TYPE_PRECISION. (2) getTypeStoreSizeInBits - the maximum number of bits that is written when storing the type (or read when reading it). For an i36 this is 40 bits, for an x86 long double it is 80 bits. This is the size alias analysis is interested in (getTypeStoreSize returns the number of bytes). There doesn't seem to be anything corresponding to this in gcc. (3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded up to a multiple of the alignment. For an i36 this is 64, for an x86 long double this is 96 or 128 depending on the OS. This is the spacing between consecutive elements when you form an array out of this type (getABITypeSize returns the number of bytes). This is TYPE_SIZE in gcc. Since successive elements in a SequentialType (arrays, pointers and vectors) need to be aligned, the spacing between them will be given by getABITypeSize. This means that the size of an array is the length times the getABITypeSize. It also means that GEP computations need to use getABITypeSize when computing offsets. Furthermore, if an alloca allocates several elements at once then these too need to be aligned, so the size of the alloca has to be the number of elements multiplied by getABITypeSize. Logically speaking this doesn't have to be the case when allocating just one element, but it is simpler to also use getABITypeSize in this case. So alloca's and mallocs should use getABITypeSize. Finally, since gcc's only notion of size is that given by getABITypeSize, if you want to output assembler etc the same as gcc then getABITypeSize is the size you want. Since a store will overwrite no more than getTypeStoreSize bytes, and a read will read no more than that many bytes, this is the notion of size appropriate for alias analysis calculations. In this patch I have corrected all type size uses except some of those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard cases). I will get around to auditing these too at some point, but I could do with some help. Finally, I made one change which I think wise but others might consider pointless and suboptimal: in an unpacked struct the amount of space allocated for a field is now given by the ABI size rather than getTypeStoreSize. I did this because every other place that reserves memory for a type (eg: alloca) now uses getABITypeSize, and I didn't want to make an exception for unpacked structs, i.e. I did it to make things more uniform. This only affects structs containing long doubles and arbitrary precision integers. If someone wants to pack these types more tightly they can always use a packed struct. llvm-svn: 43620	2007-11-01 20:53:16 +00:00
Owen Anderson	46da2a6262	Add partial caching of non-local memory dependence queries. This provides a modest speedup for GVN. llvm-svn: 42185	2007-09-21 03:53:52 +00:00
Owen Anderson	c201cbc802	Add a flag to mark a dirty cache entry. This is not yet used, but will eventually help non-local memdep caching. llvm-svn: 42137	2007-09-19 16:13:57 +00:00
Owen Anderson	f9203ab36a	Fix a typo in memdep, which was causing PR1648. llvm-svn: 41833	2007-09-11 04:31:00 +00:00
Owen Anderson	82e4fa1020	Remove an un-needed dependence query. This improves compile time marginally on 401.bzip2. llvm-svn: 41792	2007-09-09 21:43:49 +00:00
Owen Anderson	5f208bea91	Cache non-local memory dependence analysis. This is a significant compile time performance win in most cases. llvm-svn: 41126	2007-08-16 21:27:05 +00:00
Owen Anderson	9b1cc8cac0	Make NonLocal and None const in the right way. :-) llvm-svn: 40961	2007-08-09 04:42:44 +00:00
Owen Anderson	2b21c3c7a8	Add more comments to memdep. llvm-svn: 40953	2007-08-08 22:26:03 +00:00
Owen Anderson	fa788358d5	Make memdep fit in 80 cols. llvm-svn: 40950	2007-08-08 22:01:54 +00:00
Owen Anderson	b84d3b1c92	Change the None and NonLocal markers in memdep to be const. llvm-svn: 40946	2007-08-08 21:39:39 +00:00
Owen Anderson	68c6732d2c	Clean up a bunch of caching stuff in memdep. This reduces the time to run GVN on 403.gcc from ~15s to ~10s. llvm-svn: 40884	2007-08-07 00:33:45 +00:00
Owen Anderson	4898513d96	Improve the accuracy of memdep for determining the dependencies of loads. This brings GVN to parity with GCSE+LoadVN. llvm-svn: 40882	2007-08-06 23:26:03 +00:00
Owen Anderson	0ac1fc8ac1	Fix a bug that was causing several miscompilations on SPEC. llvm-svn: 40746	2007-08-02 17:56:05 +00:00
Owen Anderson	c321e5e272	Make non-local memdep not be recursive, and fix a bug on 403.gcc that this exposed. llvm-svn: 40692	2007-08-01 22:01:54 +00:00
David Greene	87801e8773	Fix GLIBCXX_DEBUG error owing to dereference of end iterator. There's no guarantee that an instruction returned by getDependency exists in the maps. llvm-svn: 40647	2007-07-31 20:01:27 +00:00
Owen Anderson	212d5c27f6	Use more caching when computing non-local dependence. This makes bzip2 not use up the entire 32-bit address space. llvm-svn: 40596	2007-07-30 17:29:24 +00:00
Owen Anderson	0f692f27a3	Fix a bug introduced in my last commit. llvm-svn: 40542	2007-07-26 18:57:04 +00:00
Owen Anderson	dbf23ccaa0	Fix a couple more bugs in the phi construction by pulling in code that does almost the same things from LCSSA. llvm-svn: 40540	2007-07-26 18:26:51 +00:00
Owen Anderson	9b796348bd	Fix a bug in non-local memdep that was causing an infinite loop on 175.vpr. llvm-svn: 40495	2007-07-25 21:26:36 +00:00
Owen Anderson	5e5599b7ce	Add basic support for performing whole-function RLE. Note: This has not yet been thoroughly tested. Use at your own risk. llvm-svn: 40489	2007-07-25 19:57:03 +00:00
Owen Anderson	d998be79cc	Add initial support for non-local memory dependence analysis. NOTE: This has only been cursorily tested. Expected improvements soon. llvm-svn: 40476	2007-07-24 21:52:37 +00:00
Owen Anderson	edb926bfe3	When removing instructions from the analysis, be sure to check the confirmed flag when determining what to do with dependencies. llvm-svn: 40079	2007-07-20 06:16:07 +00:00
Owen Anderson	7fcaaadf1c	Add support for walking up memory def chains, which enables finding many more dead stores on 400.perlbench. llvm-svn: 39929	2007-07-16 21:52:50 +00:00
Owen Anderson	1e1bace52b	Let MemoryDependenceAnalysis take care of updating AliasAnalysis. llvm-svn: 39769	2007-07-12 00:06:21 +00:00
Owen Anderson	c432490b4c	Calculate the size of an array allocation correctly. llvm-svn: 38511	2007-07-10 20:48:38 +00:00
Owen Anderson	faf9e42479	Fix a crasher when finding the dependency of a call. llvm-svn: 38510	2007-07-10 20:39:07 +00:00
Owen Anderson	1279eaf776	Make this pass registration static as well. llvm-svn: 38509	2007-07-10 20:21:08 +00:00
Owen Anderson	1fa6132e85	Handle vaarg instructions correctly. llvm-svn: 38504	2007-07-10 18:43:15 +00:00
Owen Anderson	94a21dd1e0	Volatile loads and stores depend on each other. llvm-svn: 38502	2007-07-10 18:11:42 +00:00
Owen Anderson	9c88457abe	Add support for finding the dependencies of call and invoke instructions. llvm-svn: 38497	2007-07-10 17:59:22 +00:00
Owen Anderson	2552a12e65	Fix the build, and fix the handling of pointer sizes. llvm-svn: 38494	2007-07-10 17:25:03 +00:00
Owen Anderson	47352db672	Fix a bunch of things from Chris' feedback llvm-svn: 38493	2007-07-10 17:08:11 +00:00
Owen Anderson	c0daf5fe53	A first stab at memory dependence analysis. This is an interface on top of alias analysis, adding caching and lazy computation of queries. This will be used in planned improvements to memory access optimizations. llvm-svn: 37958	2007-07-06 23:14:35 +00:00

240 Commits