llvm-project

Commit Graph

Author	SHA1	Message	Date
Nadav Rotem	a8f3562e8f	153465 was incorrect. In this code we wanted to check that the pointer operand is of pointer type (and not vector type). llvm-svn: 153468	2012-03-26 21:00:53 +00:00
Nadav Rotem	e63e59cc44	PR12357: The pointer was used before it was checked. llvm-svn: 153465	2012-03-26 20:39:18 +00:00
Andrew Trick	14779cc49e	LSR ivchain bug fix: corner case with ConstantExpr. Fixes PR11950. llvm-svn: 153463	2012-03-26 20:28:37 +00:00
Andrew Trick	356a896394	comment typo llvm-svn: 153462	2012-03-26 20:28:35 +00:00
Chris Lattner	b1e2e1e091	eliminate an unneeded branch, part of PR12357 llvm-svn: 153458	2012-03-26 19:13:57 +00:00
Eric Christopher	2b40fdf3ae	Tidy. llvm-svn: 153456	2012-03-26 19:09:40 +00:00
Eric Christopher	f16bee8682	Tidy. llvm-svn: 153455	2012-03-26 19:09:38 +00:00
Andrew Trick	e51feea79c	LSR cleanup: potential bug caught by PVS-Studio. Thanks Andrey. llvm-svn: 153451	2012-03-26 18:03:16 +00:00
Kostya Serebryany	6f8a776041	[tsan] treat vtable pointer updates in a special way (requires tbaa); fix a bug (forgot to return true after instrumenting); make sure the tsan tests are run llvm-svn: 153448	2012-03-26 17:35:03 +00:00
Craig Topper	6e80c28017	Prune some includes and forward declarations. llvm-svn: 153429	2012-03-26 06:58:25 +00:00
Chandler Carruth	ef82cf5b1e	Teach the function cloner (and thus the inliner) to simplify PHINodes aggressively. There are lots of dire warnings about this being expensive that seem to predate switching to the TrackingVH-based value remapper that is automatically updated on RAUW. This makes it easy to not just prune single-entry PHIs, but to fully simplify PHIs, and to recursively simplify the newly inlined code to propagate PHINode simplifications. This introduces a bit of a thorny problem though. We may end up simplifying a branch condition to a constant when we fold PHINodes, and we would like to nuke any dead blocks resulting from this so that time isn't wasted continually analyzing them, but this isn't easy. Deleting basic blocks after they are fully cloned and mapped into the new function currently requires manually updating the value map. The last piece of the simplification-during-inlining puzzle will require either switching to WeakVH mappings or some other piece of refactoring. I've left a FIXME in the testcase about this. llvm-svn: 153410	2012-03-25 10:34:54 +00:00
Chandler Carruth	2121199241	Move the instruction simplification of callsite arguments in the inliner to instead rely on much more generic and powerful instruction simplification in the function cloner (and thus inliner). This teaches the pruning function cloner to use instsimplify rather than just the constant folder to fold values during cloning. This can simplify a large number of things that constant folding alone cannot begin to touch. For example, it will realize that 'or' and 'and' instructions with certain constant operands actually become constants regardless of what their other operand is. It also can thread back through the caller to perform simplifications that are only possible by looking up a few levels. In particular, GEPs and pointer testing tend to fold much more heavily with this change. This should (in some cases) have a positive impact on compile times with optimizations on because the inliner itself will simply avoid cloning a great deal of code. It already attempted to prune proven-dead code, but now it will be use the stronger simplifications to prove more code dead. llvm-svn: 153403	2012-03-25 04:03:40 +00:00
Chandler Carruth	0c72e3f469	Add an asserting ValueHandle to the block simplification code which will fire if anything ever invalidates the assumption of a terminator instruction being unchanged throughout the routine. I've convinced myself that the current definition of simplification precludes such a transformation, so I think getting some asserts coverage that we don't violate this agreement is sufficient to make this code safe for the foreseeable future. Comments to the contrary or other suggestions are of course welcome. =] The bots are now happy with this code though, so it appears the bug here has indeed been fixed. llvm-svn: 153401	2012-03-25 03:29:25 +00:00
Chandler Carruth	17fc6ef234	Don't form a WeakVH around the sentinel node in the instructions BB list. This is a bad idea. ;] I'm hopeful this is the bug that's showing up with the MSVC bots, but we'll see. It is definitely unnecessary. InstSimplify won't do anything to a terminator instruction, we don't need to even include it in the iteration range. We can also skip the now dead terminator check, although I've made it an assert to help document that this is an important invariant. I'm still a bit queasy about this because there is an implicit assumption that the terminator instruction cannot be RAUW'ed by the simplification code. While that appears to be true at the moment, I see no guarantee that would ensure it remains true in the future. I'm looking at the cleanest way to solve that... llvm-svn: 153399	2012-03-24 23:03:27 +00:00
Chandler Carruth	cf1b585f60	Refactor the interface to recursively simplifying instructions to be tad bit simpler by handling a common case explicitly. Also, refactor the implementation to use a worklist based walk of the recursive users, rather than trying to use value handles to detect and recover from RAUWs during the recursive descent. This fixes a very subtle bug in the previous implementation where degenerate control flow structures could cause mutually recursive instructions (PHI nodes) to collapse in just such a way that From became equal to To after some amount of recursion. At that point, we hit the inf-loop that the assert at the top attempted to guard against. This problem is defined away when not using value handles in this manner. There are lots of comments claiming that the WeakVH will protect against just this sort of error, but they're not accurate about the actual implementation of WeakVHs, which do still track RAUWs. I don't have any test case for the bug this fixes because it requires running the recursive simplification on unreachable phi nodes. I've no way to either run this or easily write an input that triggers it. It was found when using instruction simplification inside the inliner when running over the nightly test-suite. llvm-svn: 153393	2012-03-24 21:11:24 +00:00
Francois Pichet	4b9ab74690	Fix the MSVC build. llvm-svn: 153366	2012-03-24 01:36:37 +00:00
Andrew Trick	25553ab5fe	More IndVarSimplify cleanup. llvm-svn: 153362	2012-03-24 00:51:17 +00:00
Kostya Serebryany	e505a5abe9	add EP_OptimizerLast extension point llvm-svn: 153353	2012-03-23 23:22:59 +00:00
Dan Gohman	e3ed2b0699	Don't convert objc_retainAutoreleasedReturnValue to objc_retain if it is retaining the return value of an invoke that it immediately follows. llvm-svn: 153344	2012-03-23 18:09:00 +00:00
Dan Gohman	5c70fadc17	It's not possible to insert code immediately after an invoke in the same basic block, and it's not safe to insert code in the successor blocks if the edges are critical edges. Splitting those edges is possible, but undesirable, especially on the unwind side. Instead, make the bottom-up code motion to consider invokes to be part of their successor blocks, rather than part of their parent blocks, so that it doesn't push code past them and onto the edges. This fixes PR12307. llvm-svn: 153343	2012-03-23 17:47:54 +00:00
Duncan Sands	a11ef6e4ea	When propagating equalities, eg replacing A with B in every basic block dominated by Root, check that B is available throughout the scope. This is obviously true (famous last words?) given the current logic, but the check may be helpful if more complicated reasoning is added one day. llvm-svn: 153323	2012-03-23 08:45:52 +00:00
Duncan Sands	8f897dc88b	Indentation. llvm-svn: 153322	2012-03-23 08:29:04 +00:00
Andrew Trick	e3502cb204	Remove -enable-lsr-retry in time for 3.1. llvm-svn: 153287	2012-03-22 22:42:51 +00:00
Andrew Trick	d97b83e320	Remove -enable-lsr-nested in time for 3.1. Tests cases have been removed but attached to open PR12330. llvm-svn: 153286	2012-03-22 22:42:45 +00:00
Dan Gohman	817a7c6fdf	Refactor the code for visiting instructions out into helper functions. llvm-svn: 153267	2012-03-22 18:24:56 +00:00
Andrew Trick	0654989062	Remove unused simplifyIVUsers llvm-svn: 153262	2012-03-22 17:47:30 +00:00
Andrew Trick	f47d0af551	Remove -enable-iv-rewrite, which has been unsupported since 3.0. llvm-svn: 153260	2012-03-22 17:10:11 +00:00
Chris Lattner	7d7dba3c92	don't use "signed", just something I noticed in patches flying by. llvm-svn: 153237	2012-03-22 03:46:58 +00:00
Kostya Serebryany	84a7f2e8e9	[asan] fix one more bug related to long double llvm-svn: 153189	2012-03-21 15:28:50 +00:00
Eric Christopher	7d522f161d	Zap some dead code pointed out by Chandler. llvm-svn: 153150	2012-03-20 23:28:58 +00:00
Andrew Trick	f7711010e1	LoopSimplify bug fix. Handle indirect loop back edges. Do not call SplitBlockPredecessors on a loop preheader when one of the predecessors is an indirectbr. Otherwise, you will hit this assert: !isa<IndirectBrInst>(Preds[i]->getTerminator()) && "Cannot split an edge from an IndirectBrInst" llvm-svn: 153134	2012-03-20 21:24:52 +00:00
Andrew Trick	bb01cbb312	whitespace llvm-svn: 153133	2012-03-20 21:24:47 +00:00
Kostya Serebryany	c58dc9fcd2	[asan] don't emit __asan_mapping_offset/__asan_mapping_scale by default -- they are currently used only for experiments llvm-svn: 153040	2012-03-19 16:40:35 +00:00
Bill Wendling	55b6b2b6a9	Revert r152907. llvm-svn: 152935	2012-03-16 18:20:54 +00:00
Bill Wendling	a2a26b546c	The alignment of the pointer part of the store instruction may have an alignment. If that's the case, then we want to make sure that we don't increase the alignment of the store instruction. Because if we increase it to be "more aligned" than the pointer, code-gen may use instructions which require a greater alignment than the pointer guarantees. <rdar://problem/11043589> llvm-svn: 152907	2012-03-16 07:40:08 +00:00
Chandler Carruth	b37fc13a36	Rip out support for 'llvm.noinline'. This thing has a strange history... It was added in 2007 as the first cut at supporting no-inline attributes, but we didn't have function attributes of any form at the time. However, it was added without any mention in the LangRef or other documentation. Later on, in 2008, Devang added function notes for 'inline=never' and then turned them into proper function attributes. From that point onward, as far as I can tell, the world moved on, and no one has touched 'llvm.noinline' in any meaningful way since. It's time has now come. We have had better mechanisms for doing this for a long time, all the frontends I'm aware of use them, and this is just holding back progress. Given that it was never a documented feature of the IR, I've provided no auto-upgrade support. If people know of real, in-the-wild bitcode that relies on this, yell at me and I'll add it, but I seriously doubt anyone cares. llvm-svn: 152904	2012-03-16 06:10:15 +00:00
Chandler Carruth	d7a5f2adb0	Start removing the use of an ad-hoc 'never inline' set and instead directly query the function information which this set was representing. This simplifies the interface of the inline cost analysis, and makes the always-inline pass significantly more efficient. Previously, always-inline would first make a single set of every function in the module except those marked with the always-inline attribute. It would then query this set at every call site to see if the function was a member of the set, and if so, refuse to inline it. This is quite wasteful. Instead, simply check the function attribute directly when looking at the callsite. The normal inliner also had similar redundancy. It added every function in the module with the noinline attribute to its set to ignore, even though inside the cost analysis function we already tested the noinline attribute and produced the same result. The only tricky part of removing this is that we have to be able to correctly remove only the functions inlined by the always-inline pass when finalizing, which requires a bit of a hack. Still, much less of a hack than the set of all non-always-inline functions was. While I was touching this function, I switched a heavy-weight set to a vector with sort+unique. The algorithm already had a two-phase insert and removal pattern, we were just needlessly paying the uniquing cost on every insert. This probably speeds up some compiles by a small amount (-O0 compiles with lots of always-inline, so potentially heavy libc++ users), but I've not tried to measure it. I believe there is no functional change here, but yell if you spot one. None are intended. Finally, the direction this is going in is to greatly simplify the inline cost query interface so that we can replace its implementation with a much more clever one. Along the way, all the APIs get simplified, so it seems incrementally good. llvm-svn: 152903	2012-03-16 06:10:13 +00:00
Andrew Trick	070e540a3e	LSR fix: Add isSimplifiedLoopNest to IVUsers analysis. Only record IVUsers that are dominated by simplified loop headers. Otherwise SCEVExpander will crash while looking for a preheader. I previously tried to work around this in LSR itself, but that was insufficient. This way, LSR can continue to run if some uses are not in simple loops, as long as we don't attempt to analyze those users. Fixes <rdar://problem/11049788> Segmentation fault: 11 in LoopStrengthReduce llvm-svn: 152892	2012-03-16 03:16:56 +00:00
Eli Friedman	e06535b2f6	In InstCombiner::visitOr, make sure we reverse the operand swap used for checking for or-of-xor operations after those checks; a later check expects that any constant will be in Op1. PR12234. llvm-svn: 152884	2012-03-16 00:52:42 +00:00
Rafael Espindola	f58927855b	Short term fix for pr12270 before we change dominates to handle unreachable code. While here, reduce indentation. llvm-svn: 152803	2012-03-15 15:52:59 +00:00
Bill Wendling	7fa1be77cc	Use an iterator instead of calling .size() on the worklist every time, which is wasteful. llvm-svn: 152794	2012-03-15 11:19:41 +00:00
Chandler Carruth	be2ccf01b7	Remove the basic inliner. This was added in 2007, and hasn't really changed since. No one was using it. It is yet another consumer of the InlineCost interface that I'd like to change. llvm-svn: 152769	2012-03-15 01:37:56 +00:00
Chandler Carruth	3904590ba8	This pass didn't want the inline cost per-se, it just wants generic code metrics. llvm-svn: 152760	2012-03-15 00:29:10 +00:00
Aaron Ballman	a733297fa6	Fixed a transform crash when setting a negative size value for memset. Fixes PR12202. llvm-svn: 152756	2012-03-15 00:05:31 +00:00
Kostya Serebryany	abad002d55	[tsan] use FunctionBlackList llvm-svn: 152755	2012-03-14 23:33:24 +00:00
Kostya Serebryany	01401cec00	[asan] rename class BlackList to FunctionBlackList and move it into a separate file -- we will need the same functionality in ThreadSanitizer llvm-svn: 152753	2012-03-14 23:22:10 +00:00
Dan Gohman	532fb8131b	When an invoke is marked with metadata indicating its unwind edge should be ignored by ARC optimization, don't insert new ARC runtime calls in the unwind destination. llvm-svn: 152748	2012-03-14 23:05:06 +00:00
Chandler Carruth	30b8416d2c	Change where we enable the heuristic that delays inlining into functions which are small enough to themselves be inlined. Delaying in this manner can be harmful if the function is inelligible for inlining in some (or many) contexts as it pessimizes the code of the function itself in the event that inlining does not eventually happen. Previously the check was written to only do this delaying of inlining for static functions in the hope that they could be entirely deleted and in the knowledge that all callers of static functions will have the opportunity to inline if it is in fact profitable. However, with C++ we get two other important sources of functions where the definition is always available for inlining: inline functions and templated functions. This patch generalizes the inliner to allow linkonce-ODR (the linkage such C++ routines receive) to also qualify for this delay-based inlining. Benchmarking across a range of large real-world applications shows roughly 2% size increase across the board, but an average speedup of about 0.5%. Some benhcmarks improved over 2%, and the 'clang' binary itself (when bootstrapped with this feature) shows a 1% -O0 performance improvement when run over all Sema, Lex, and Parse source code smashed into a single file. A clean re-build of Clang+LLVM with a bootstrapped Clang shows approximately 2% improvement, but that measurement is often noisy. llvm-svn: 152737	2012-03-14 20:16:41 +00:00
Pete Cooper	615fd897e0	Target override to allow CodeGenPrepare to sink address operands to intrinsics in the same way it current does for loads and stores llvm-svn: 152666	2012-03-13 20:59:56 +00:00
Chris Lattner	87fa77bd8a	enhance jump threading to preserve TBAA information when PRE'ing loads, fixing rdar://11039258, an issue that came up when inspecting clang's bootstrapped codegen. llvm-svn: 152635	2012-03-13 18:07:41 +00:00
Dan Gohman	eab06fa3c9	Teach globalopt how to evaluate an invoke with a non-void return type. llvm-svn: 152634	2012-03-13 18:01:37 +00:00
Chandler Carruth	595fda8466	When inlining a function and adding its inner call sites to the candidate set for subsequent inlining, try to simplify the arguments to the inner call site now that inlining has been performed. The goal here is to propagate and fold constants through deeply nested call chains. Without doing this, we loose the inliner bonus that should be applied because the arguments don't match the exact pattern the cost estimator uses. Reviewed on IRC by Benjamin Kramer. llvm-svn: 152556	2012-03-12 11:19:33 +00:00
Stepan Dyatkovskiy	97b02fc1b3	llvm::SwitchInst Renamed methods caseBegin, caseEnd and caseDefault with case_begin, case_end, and case_default. Added some notes relative to case iterators. llvm-svn: 152532	2012-03-11 06:09:17 +00:00
Duncan Sands	14eb175836	Add statistics on removed switch cases, and fix the phi statistic to count the number of phis changed, not the number visited. llvm-svn: 152425	2012-03-09 19:21:15 +00:00
Dan Gohman	500b598c5c	When identifying exit nodes for the reverse-CFG reverse-post-order traversal, consider nodes for which the only successors are backedges which the traversal is ignoring to be exit nodes. This fixes a problem where the bottom-up traversal was failing to visit split blocks along split loop backedges. This fixes rdar://10989035. llvm-svn: 152421	2012-03-09 18:50:52 +00:00
Duncan Sands	cca89124a2	Eliminate switch cases that can never match, for example removes all negative switch cases if the branch condition is known to be positive. Inspired by a recent improvement to GCC's VRP. llvm-svn: 152405	2012-03-09 13:45:18 +00:00
Stepan Dyatkovskiy	5b648afb4d	Taken into account Duncan's comments for r149481 dated by 2nd Feb 2012: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20120130/136146.html Implemented CaseIterator and it solves almost all described issues: we don't need to mix operand/case/successor indexing anymore. Base iterator class is implemented as a template since it may be initialized either from "const SwitchInst" or from "SwitchInst". ConstCaseIt is just a read-only iterator. CaseIt is read-write iterator; it allows to change case successor and case value. Usage of iterator allows totally remove resolveXXXX methods. All indexing convertions done automatically inside the iterator's getters. Main way of iterator usage looks like this: SwitchInst SI = ... // intialize it somehow for (SwitchInst::CaseIt i = SI->caseBegin(), e = SI->caseEnd(); i != e; ++i) { BasicBlock BB = i.getCaseSuccessor(); ConstantInt *V = i.getCaseValue(); // Do something. } If you want to convert case number to TerminatorInst successor index, just use getSuccessorIndex iterator's method. If you want initialize iterator from TerminatorInst successor index, use CaseIt::fromSuccessorIndex(...) method. There are also related changes in llvm-clients: klee and clang. llvm-svn: 152297	2012-03-08 07:06:20 +00:00
Sebastian Pop	5ce71b18cb	fix typos llvm-svn: 152035	2012-03-05 17:39:47 +00:00
Sebastian Pop	8844e224b8	remove spaces on empty lines llvm-svn: 152034	2012-03-05 17:39:45 +00:00
Duncan Sands	3eb328574e	This is not a common case, in fact it never happens! llvm-svn: 152027	2012-03-05 12:23:00 +00:00
Chandler Carruth	d95357a18e	Switch mem2reg to use the new hashing infrastructure. llvm-svn: 152026	2012-03-05 11:29:56 +00:00
Chandler Carruth	e134d1a336	Replace the ad-hoc hashing in GVN with the new hashing infrastructure. This implicitly fixes a nasty bug in the GVN hashing (that thankfully could only manifest as a performance bug): actually include the opcode in the hash. The old code started the hash off with the opcode, but then overwrote it with the type pointer. Since this is likely to be pretty hot (GVN being already pretty expensive) I've included a micro-optimization to just not bother with the varargs hashing if they aren't present. I can't measure any change in GVN performance due to this, even with a big test case like Duncan's sqlite one. Everything I see is in the noise floor. That said, this closes a loop hole for a potential scaling problem due to collisions if the opcode were the differentiating aspect of the expression. llvm-svn: 152025	2012-03-05 11:29:54 +00:00
Duncan Sands	4d928e7dff	Nick pointed out on IRC that GVN's propagateEquality wasn't propagating equalities into phi node operands for which the equality is known to hold in the incoming basic block. That's because replaceAllDominatedUsesWith wasn't handling phi nodes correctly in general (that this didn't give wrong results was just luck: the specific way GVN uses replaceAllDominatedUsesWith precluded wrong changes to phi nodes). llvm-svn: 152006	2012-03-04 13:25:19 +00:00
Bill Wendling	97b9359623	Do trivial CSE of dead BBs during codegen preparation. Some BBs can become dead after codegen preparation. If we delete them here, it could help enable tail-call optimizations later on. <rdar://problem/10256573> llvm-svn: 152002	2012-03-04 10:46:01 +00:00
Evgeniy Stepanov	d33e3d8c6e	ASan: use getTypeAllocSize instead of getTypeStoreSize. This change replaces getTypeStoreSize with getTypeAllocSize in AddressSanitizer instrumentation for stack allocations. One case where old behaviour produced undesired results is an optimization in InstCombine pass (PromoteCastOfAllocation), which can replace alloca(T) with alloca(S), where S has the same AllocSize, but a smaller StoreSize. Another case is memcpy(long double => long double), where ASan will poison bytes 10-15 of a stack-allocated long double (StoreSize 10, AllocSize 16, sizeof(long double) = 16). See http://llvm.org/bugs/show_bug.cgi?id=12047 for more context. llvm-svn: 151887	2012-03-02 10:41:08 +00:00
Dan Gohman	362eb69f24	Fix an iterator invalidation problem. operator[] on a DenseMap can insert a new element, invalidating iterators. Use find instead, and handle the case where the key is not found explicitly. llvm-svn: 151871	2012-03-02 01:26:46 +00:00
Dan Gohman	55b067427b	Misc micro-optimizations. llvm-svn: 151869	2012-03-02 01:13:53 +00:00
Duncan Sands	bb2fe65542	Have GVN also do condition propagation when the right-hand side is not a constant. This fixes PR1768. llvm-svn: 151713	2012-02-29 11:12:03 +00:00
Bill Wendling	f2c78f344e	Restrict this transformation to equality conditions. This transformation is not correct for not-equal conditions: (trunc x) != C1 & (and x, CA) != C2 -> (and x, CA\|CMAX) != C1\|C2 Let C1 == 0 C2 == 0 CA == 0xFF0000 CMAX == 0xFF and truncating to i8. The original truth table: x \| A: trunc x != 0 \| B: x & 0xFF0000 != 0 \| A & B != 0 -------------------------------------------------------------- 0x00000 \| 0 \| 0 \| 0 0x00001 \| 1 \| 0 \| 0 0x10000 \| 0 \| 1 \| 0 0x10001 \| 1 \| 1 \| 1 The truth table of the replacement: x \| x & 0xFF00FF != 0 ---------------------------- 0x00000 \| 0 0x00001 \| 1 0x10000 \| 1 0x10001 \| 1 So they are different. llvm-svn: 151691	2012-02-29 01:46:50 +00:00
Pete Cooper	39b5255df4	Reverted r152620 - DSE: Shorten memset when a later store overwrites the start of it. There were all sorts of buildbot issues llvm-svn: 151621	2012-02-28 05:06:24 +00:00
Pete Cooper	f3862f91de	DSE: Shorten memset when a later store overwrites the start of it llvm-svn: 151620	2012-02-28 04:27:10 +00:00
Benjamin Kramer	93887631d9	Plog a memleak in GlobalOpt. Found by valgrind. llvm-svn: 151525	2012-02-27 12:48:24 +00:00
Duncan Sands	9edea84420	Micro-optimization, no functionality change. llvm-svn: 151524	2012-02-27 12:11:41 +00:00
Duncan Sands	1be25a78f7	The value numbering function is recursive, so it is possible for multiple new value numbers to be assigned when calculating any particular value number. Enhance the logic that detects new value numbers to take this into account, for a tiny compile time speedup. Fix a comment typo while there. llvm-svn: 151522	2012-02-27 09:54:35 +00:00
Duncan Sands	27f459519d	When performing a conditional branch depending on the value of a comparison %cmp (eg: A==B) we already replace %cmp with "true" under the true edge, and with "false" under the false edge. This change enhances this to replace the negated compare (A!=B) with "false" under the true edge and "true" under the false edge. Reported to improve perlbench results by 1%. llvm-svn: 151517	2012-02-27 08:14:30 +00:00
Chad Rosier	50e0b81ea9	Add comment. llvm-svn: 151431	2012-02-25 03:07:57 +00:00
Chad Rosier	07d37bc1ed	Add support for disabling llvm.lifetime intrinsics in the AlwaysInliner. These are optimization hints, but at -O0 we're not optimizing. This becomes a problem when the alwaysinline attribute is abused. rdar://10921594 llvm-svn: 151429	2012-02-25 02:56:01 +00:00
Chad Rosier	e48e5d2945	Fix indentation. llvm-svn: 151420	2012-02-25 01:10:59 +00:00
Duncan Sands	926d101640	Teach GVN that x+y is the same as y+x and that x<y is the same as y>x. llvm-svn: 151365	2012-02-24 15:16:31 +00:00
Benjamin Kramer	077e55252a	Reflow code, no functionality change. llvm-svn: 151262	2012-02-23 17:42:19 +00:00
Duncan Sands	4730cb9c7c	GCC fails to understand that NextBB is always initialized if EvaluateBlock returns 'true' and emits a warning. Help it out. llvm-svn: 151242	2012-02-23 08:23:06 +00:00
Nick Lewycky	9d0da18597	Use the target-aware constant folder on expressions to improve the chance they'll be simple enough to simulate, and to reduce the chance we'll encounter equal but different simple pointer constants. This removes the symptoms from PR11352 but is not a full fix. A proper fix would either require a guarantee that two constant objects we simulate are folded when equal, or a different way of handling equal pointers (ie., trying a constantexpr icmp on them to see whether we know they're equal or non-equal or unsure). llvm-svn: 151093	2012-02-21 22:08:06 +00:00
Benjamin Kramer	c7a22fe76b	Fix unsigned off-by-one in comment. llvm-svn: 151056	2012-02-21 13:40:06 +00:00
Benjamin Kramer	6ee8690aa5	InstCombine: Don't transform a signed icmp of two GEPs into a signed compare of the indices. This transformation is not safe in some pathological cases (signed icmp of pointers should be an extremely rare thing, but it's valid IR!). Add an explanatory comment. Kudos to Duncan for pointing out this edge case (and not giving up explaining it until I finally got it). llvm-svn: 151055	2012-02-21 13:31:09 +00:00
Nick Lewycky	519561f418	Check for the correct size in the invariant marker. llvm-svn: 151003	2012-02-20 23:32:26 +00:00
Chad Rosier	47eeddde24	Fix 80-column violation. llvm-svn: 150998	2012-02-20 23:13:17 +00:00
Benjamin Kramer	ac8ecc4e7e	InstCombine: Removing the base from the address calculation is only safe when the GEPs are inbounds. llvm-svn: 150978	2012-02-20 18:45:10 +00:00
Benjamin Kramer	7adb189538	InstCombine: When comparing two GEPs that were derived from the same base pointer but use different types, expand the offset calculation and to the compare on the offset if profitable. This came up in SmallVector code. llvm-svn: 150962	2012-02-20 15:07:47 +00:00
Benjamin Kramer	7746eb62fb	InstCombine: Make OptimizePointerDifference more aggressive. - Ignore pointer casts. - Also expand GEPs that aren't constantexprs when they have one use or only constant indices. - We now compile "&foo[i] - &foo[j]" into "i - j". llvm-svn: 150961	2012-02-20 14:34:57 +00:00
Nick Lewycky	60829a587a	Rename class Evaluate to Evaluator and put it in an anonymous namespace. llvm-svn: 150947	2012-02-20 03:25:59 +00:00
Nick Lewycky	73be5e31a6	Move EvaluateFunction and EvaluateBlock into a class, and make the class store the information that they pass around between them. No functionality change! llvm-svn: 150939	2012-02-19 23:26:27 +00:00
Ahmed Charles	636a3d618c	Remove dead code. Improve llvm_unreachable text. Simplify some control flow. llvm-svn: 150918	2012-02-19 11:37:01 +00:00
Dan Gohman	0155f30a9c	Calls and invokes with the new clang.arc.no_objc_arc_exceptions metadata may still unwind, but only in ways that the ARC optimizer doesn't need to consider. This permits more aggressive optimization. llvm-svn: 150829	2012-02-17 18:59:53 +00:00
Nick Lewycky	68f9f9d9c8	Add support for invariant.start inside the static constructor evaluator. This is useful to represent a variable that is const in the source but can't be constant in the IR because of a non-trivial constructor. If globalopt evaluates the constructor, and there was an invariant.start with no matching invariant.end possible, it will mark the global constant afterwards. llvm-svn: 150794	2012-02-17 06:59:21 +00:00
Bill Wendling	aa9a3eae79	Remove redundant comment. Use a more efficient datatype. llvm-svn: 150780	2012-02-17 02:12:54 +00:00
Bill Wendling	0a8fec2762	Fix some grammar-os and formatting. llvm-svn: 150779	2012-02-17 02:09:28 +00:00
Eli Friedman	c458885c58	loop-rotate shouldn't hoist alloca instructions out of a loop. Patch by Patrik Hägglund, with slightly modified test. Issue reported by Patrik Hägglund on llvmdev. llvm-svn: 150642	2012-02-16 00:41:10 +00:00
Kostya Serebryany	a8531eeb64	[tsan] fix compiler warnings llvm-svn: 150449	2012-02-14 00:52:07 +00:00
Andrew Trick	10cc45336d	Add simplifyLoopLatch to LoopRotate pass. This folds a simple loop tail into a loop latch. It covers the common (in fortran) case of postincrement loops. It's a "free" way to expose this type of loop to downstream loop optimizations that bail out on non-canonical loops (getLoopLatch is a heavily used check). llvm-svn: 150439	2012-02-14 00:00:23 +00:00
Andrew Trick	a20f198747	whitespace llvm-svn: 150438	2012-02-14 00:00:19 +00:00
Devang Patel	698452bc7e	Check against umin while converting fcmp into an icmp. llvm-svn: 150425	2012-02-13 23:05:18 +00:00
Dan Gohman	eb6e01533a	Just like in regular escape analysis, loads and stores through (but not of) a block pointer do not cause the block pointer to escape. This fixes rdar://10803830. llvm-svn: 150424	2012-02-13 22:57:02 +00:00
Kostya Serebryany	e2a0e4163a	ThreadSanitizer, a race detector. First LLVM commit. Clang patch (flags) will follow shortly. The run-time library will also follow, but not immediately. llvm-svn: 150423	2012-02-13 22:50:51 +00:00
Ahmed Charles	32e983e4fc	Fix various issues (or do cleanups) found by enabling certain MSVC warnings. - Use unsigned literals when the desired result is unsigned. This mostly allows unsigned/signed mismatch warnings to be less noisy even if they aren't on by default. - Remove misplaced llvm_unreachable. - Add static to a declaration of a function on MSVC x86 only. - Change some instances of calling a static function through a variable to simply calling that function while removing the unused variable. llvm-svn: 150364	2012-02-13 06:30:56 +00:00
Nick Lewycky	c1572e4c90	Handle InvokeInst in EvaluateBlock. Don't try to support exceptions, it's just that no optz'ns have run yet to convert invokes to calls. llvm-svn: 150326	2012-02-12 05:09:35 +00:00
Nick Lewycky	f285256f72	false is totally null! llvm-svn: 150324	2012-02-12 02:17:18 +00:00
Nick Lewycky	4b273cb7ea	Remove redundant getAnalysis<> calls in GlobalOpt. Add a few Itanium ABI calls to TargetLibraryInfo and use one of them in GlobalOpt. llvm-svn: 150323	2012-02-12 02:15:20 +00:00
Nick Lewycky	cf6aae686d	Pass TargetData and TargetLibraryInfo through to the constant folder. Fixes a few fixme's when TLI was added. llvm-svn: 150322	2012-02-12 01:13:18 +00:00
Nick Lewycky	1480f1d3f9	Fix function name in comment to match actual name. Fix comments that are using doxy-style on local variables to not do so. Fix one 80-col violation. llvm-svn: 150320	2012-02-12 00:52:26 +00:00
Nick Lewycky	4231c41c64	Don't traverse the PHI nodes twice. No functionality change! llvm-svn: 150319	2012-02-12 00:47:24 +00:00
Hal Finkel	1bde3f86d1	Update BBVectorize to use aliasesUnknownInst. This allows BBVectorize to check the "unknown instruction" list in the alias sets. This is important to prevent instruction fusing from reordering function calls. Resolves PR11920. llvm-svn: 150250	2012-02-10 15:52:40 +00:00
Benjamin Kramer	1a4695a091	Tweak comment readability and grammar. llvm-svn: 150183	2012-02-09 16:28:15 +00:00
Benjamin Kramer	487a3962c7	GlobalOpt: Be more aggressive about elminating side-effect free static dtors. GlobalOpt runs early in the pipeline (before inlining) and complex class hierarchies often introduce bitcasts or GEPs which weren't optimized away. Teach it to ignore side-effect free instructions instead of depending on other passes to remove them. llvm-svn: 150174	2012-02-09 14:26:06 +00:00
Kostya Serebryany	154a54d972	[asan] unpoison the stack before every noreturn call. Fixes asan issue 37. llvm part llvm-svn: 150102	2012-02-08 21:36:17 +00:00
Duncan Sands	0920308a7e	Use Use::set rather than finding the operand number of the use and setting that. llvm-svn: 150074	2012-02-08 14:10:53 +00:00
Craig Topper	a2886c21d9	Convert assert(0) to llvm_unreachable llvm-svn: 149967	2012-02-07 05:05:23 +00:00
Chris Lattner	8213c8af29	Remove some dead code and tidy things up now that vectors use ConstantDataVector instead of always using ConstantVector. llvm-svn: 149912	2012-02-06 21:56:39 +00:00
Bill Wendling	0aef16afd5	[unwind removal] Remove all of the code for the dead 'unwind' instruction. There were no 'unwind' instructions being generated before this, so this is in effect a no-op. llvm-svn: 149906	2012-02-06 21:44:22 +00:00
Bill Wendling	d5d95b0b51	[unwind removal] We no longer have 'unwind' instructions being generated, so remove the code that handles them. llvm-svn: 149901	2012-02-06 21:16:41 +00:00
Benjamin Kramer	baba1aa001	Make helper static. llvm-svn: 149865	2012-02-06 11:28:19 +00:00
Nick Lewycky	239fdf0f61	Split part of EvaluateFunction into a new EvaluateBlock method. No functionality change. llvm-svn: 149861	2012-02-06 08:24:44 +00:00
Sebastian Pop	662beed828	fix indentation llvm-svn: 149857	2012-02-06 05:29:32 +00:00
Nick Lewycky	52da72b12a	Teach GlobalOpt to handle atomic accesses to globals. * Most of the transforms come through intact by having each transformed load or store copy the ordering and synchronization scope of the original. * The transform that turns a global only accessed in main() into an alloca (since main is non-recursive) with a store of the initial value uses an unordered store, since it's guaranteed to be the first thing to happen in main. (Threads may have started before main (!) but they can't have the address of a function local before the point in the entry block we insert our code.) * The heap-SRoA transforms are disabled in the face of atomic operations. This can probably be improved; it seems odd to have atomic accesses to an alloca that doesn't have its address taken. AnalyzeGlobal keeps track of the strongest ordering found in any use of the global. This is more information than we need right now, but it's cheap to compute and likely to be useful. llvm-svn: 149847	2012-02-05 19:56:38 +00:00
Nick Lewycky	bbd1156b95	Clean up some whitespace and comments. No functionality change. llvm-svn: 149845	2012-02-05 19:48:37 +00:00
Duncan Sands	9066fb5c43	Neaten up this method. Check that if there is only one predecessor then it's Src. llvm-svn: 149843	2012-02-05 19:43:37 +00:00
Duncan Sands	12efb16b01	Fix a thinko pointed out by Eli and the buildbots. llvm-svn: 149839	2012-02-05 18:56:50 +00:00
Duncan Sands	4b613497f0	Reduce the number of dom queries made by GVN's conditional propagation logic by half: isOnlyReachableViaThisEdge was trying to be clever and handle the case of a branch to a basic block which is contained in a loop. This costs a domtree lookup and is completely useless due to GVN's position in the pass pipeline: all loops have preheaders at this point, which means it is enough for isOnlyReachableViaThisEdge to check that Dst has only one predecessor. (I checked this theoretical argument by running over the entire nightly testsuite, and indeed it is so!). llvm-svn: 149838	2012-02-05 18:25:50 +00:00
Duncan Sands	268903955c	Reduce the number of non-trivial domtree queries by about 1% when compiling sqlite3, by only doing dom queries after the cheap check rather than interleaved with it. llvm-svn: 149836	2012-02-05 15:50:43 +00:00
David Blaikie	f9c1291fde	Simplify contains tests using 'count'. llvm-svn: 149813	2012-02-05 06:35:36 +00:00
NAKAMURA Takumi	32c48634db	BBVectorize.cpp: Get rid of comparision to bool to fix a warning. llvm-svn: 149810	2012-02-05 05:47:51 +00:00
Chris Lattner	cf9e8f6968	reapply the patches reverted in r149470 that reenable ConstantDataArray, but with a critical fix to the SelectionDAG code that optimizes copies from strings into immediate stores: the previous code was stopping reading string data at the first nul. Address this by adding a new argument to llvm::getConstantStringInfo, preserving the behavior before the patch. llvm-svn: 149800	2012-02-05 02:29:43 +00:00
Hal Finkel	135cac922c	Boost the effective chain depth of loads and stores. By default, boost the chain depth contribution of loads and stores. This will allow a load/store pair to vectorize even when it would not otherwise be long enough to satisfy the chain depth requirement. llvm-svn: 149761	2012-02-04 04:14:04 +00:00
Jim Grosbach	1df8cdc588	Narrow test further. Make bot and test happy. llvm-svn: 149650	2012-02-03 00:26:07 +00:00
Jim Grosbach	7815f56b22	Tidy up. Trailing whitespace. llvm-svn: 149649	2012-02-03 00:07:04 +00:00
Jim Grosbach	e84ae7bfa0	Restrict InstCombine from converting varargs to or from fixed args. More targetted fix replacing d0e277d272d517ca1cda368267d199f0da7cad95. llvm-svn: 149648	2012-02-03 00:00:55 +00:00
Jim Grosbach	0ab54184d7	Revert "Disable InstCombine unsafe folding bitcasts of calls w/ varargs." This reverts commit d0e277d272d517ca1cda368267d199f0da7cad95. llvm-svn: 149647	2012-02-03 00:00:50 +00:00
Benjamin Kramer	f61f60d97a	BBVectorize: Simplify code, no functionality change. Also silences warnings about bodyless for loops. llvm-svn: 149612	2012-02-02 18:52:15 +00:00
Hal Finkel	8cf51b871c	Minor changes from review. As suggested by Nick Lewycky, the tree traversal queues have been changed to SmallVectors and the associated loops have been rotated. Also, an 80-col violation was fixed. llvm-svn: 149607	2012-02-02 17:29:39 +00:00
Hal Finkel	0f3298e8d4	Vectorize long blocks in groups. Long basic blocks with many candidate pairs (such as in the SHA implementation in Perl 5.14; thanks to Roman Divacky for the example) used to take an unacceptably-long time to compile. Instead, break long blocks into groups so that no group has too many candidate pairs. llvm-svn: 149595	2012-02-02 06:14:56 +00:00
Stepan Dyatkovskiy	513aaa5691	SwitchInst refactoring. The purpose of refactoring is to hide operand roles from SwitchInst user (programmer). If you want to play with operands directly, probably you will need lower level methods than SwitchInst ones (TerminatorInst or may be User). After this patch we can reorganize SwitchInst operands and successors as we want. What was done: 1. Changed semantics of index inside the getCaseValue method: getCaseValue(0) means "get first case", not a condition. Use getCondition() if you want to resolve the condition. I propose don't mix SwitchInst case indexing with low level indexing (TI successors indexing, User's operands indexing), since it may be dangerous. 2. By the same reason findCaseValue(ConstantInt*) returns actual number of case value. 0 means first case, not default. If there is no case with given value, ErrorIndex will returned. 3. Added getCaseSuccessor method. I propose to avoid usage of TerminatorInst::getSuccessor if you want to resolve case successor BB. Use getCaseSuccessor instead, since internal SwitchInst organization of operands/successors is hidden and may be changed in any moment. 4. Added resolveSuccessorIndex and resolveCaseIndex. The main purpose of these methods is to see how case successors are really mapped in TerminatorInst. 4.1 "resolveSuccessorIndex" was created if you need to level down from SwitchInst to TerminatorInst. It returns TerminatorInst's successor index for given case successor. 4.2 "resolveCaseIndex" converts low level successors index to case index that curresponds to the given successor. Note: There are also related compatability fix patches for dragonegg, klee, llvm-gcc-4.0, llvm-gcc-4.2, safecode, clang. llvm-svn: 149481	2012-02-01 07:49:51 +00:00
NAKAMURA Takumi	e1d61f666b	BBVectorize.cpp: Try to fix MSVC build. map::iterator and multimap::iterator are incompatible. llvm-svn: 149475	2012-02-01 06:11:58 +00:00
Hal Finkel	8a3aebe5e0	A few of the changes suggested in code review (by Nick Lewycky) llvm-svn: 149472	2012-02-01 05:51:45 +00:00
Argyrios Kyrtzidis	17c981a45b	Revert Chris' commits up to r149348 that started causing VMCoreTests unit test to fail. These are: r149348 r149351 r149352 r149354 r149356 r149357 r149361 r149362 r149364 r149365 llvm-svn: 149470	2012-02-01 04:51:17 +00:00
Hal Finkel	c34e51132c	Add a basic-block autovectorization pass. This is the initial checkin of the basic-block autovectorization pass along with some supporting vectorization infrastructure. Special thanks to everyone who helped review this code over the last several months (especially Tobias Grosser). llvm-svn: 149468	2012-02-01 03:51:43 +00:00
Jim Grosbach	9fa0481569	Disable InstCombine unsafe folding bitcasts of calls w/ varargs. Changing arguments from being passed as fixed to varargs is unsafe, as the ABI may require they be handled differently (stack vs. register, for example). Remove two tests which rely on the bitcast being folded into the direct call, which is exactly the transformation that's unsafe. llvm-svn: 149457	2012-02-01 00:08:17 +00:00
Lenny Maiorani	8d670b8f93	bz11794 : EarlyCSE stack overflow on long functions. Make the EarlyCSE optimizer not use recursion to do a depth first iteration. llvm-svn: 149445	2012-01-31 23:14:41 +00:00
Bill Wendling	e5f4a6d904	Increase the initial vector size to be equivalent to the size of the Deps vector. This potentially saves a resizing. llvm-svn: 149369	2012-01-31 07:04:52 +00:00
Bill Wendling	8a33312948	Cache the size of the vector instead of calling .size() all over the place. llvm-svn: 149368	2012-01-31 06:57:53 +00:00
Chris Lattner	f1179025ae	eliminate the "string" form of ConstantArray::get, using ConstantDataArray::getString instead. llvm-svn: 149365	2012-01-31 06:18:43 +00:00
Chris Lattner	9e4b8726f8	eliminate the last uses of GetConstantStringInfo from this file, I didn't realize I was that close... llvm-svn: 149354	2012-01-31 04:54:27 +00:00
Chris Lattner	8193b06e44	start moving SimplifyLibcalls over to getConstantStringInfo, which is dramatically more efficient than GetConstantStringInfo. llvm-svn: 149352	2012-01-31 04:43:11 +00:00
Chris Lattner	fe741769dd	enhance logic to support ConstantDataArray. llvm-svn: 149340	2012-01-31 02:55:06 +00:00
Bill Wendling	3fd879dde2	s/getInnerUnwindDest/getInnerResumeDest/g llvm-svn: 149328	2012-01-31 01:48:40 +00:00
Bill Wendling	ea6e935e95	Remove ivar which is identical to another ivar. llvm-svn: 149323	2012-01-31 01:25:54 +00:00
Bill Wendling	0c2d82b942	Remove unused ivars and s/getOuterUnwindDest/getOuterResumeDest/g. llvm-svn: 149322	2012-01-31 01:22:03 +00:00
Bill Wendling	7778e6d818	Remove more dead functions. llvm-svn: 149318	2012-01-31 01:18:21 +00:00
Bill Wendling	803d6b1b0c	s/getInnerUnwindDestNewEH/getInnerUnwindDest/g llvm-svn: 149317	2012-01-31 01:15:59 +00:00
Bill Wendling	621699de22	Remove some unused, old-EH methods. llvm-svn: 149316	2012-01-31 01:14:49 +00:00
Bill Wendling	518a205d0a	Get rid of references to dead intrinsics. The eh.selector and eh.resume intrinsics aren't used anymore. Get rid of some calls to them. llvm-svn: 149314	2012-01-31 01:05:20 +00:00
Bill Wendling	ce0c229234	Formatting cleanups. No functionality change. llvm-svn: 149312	2012-01-31 01:01:16 +00:00
Bill Wendling	f3cae51490	Remove no-longer-useful dyn_casts and pals. llvm-svn: 149307	2012-01-31 00:56:53 +00:00
Kostya Serebryany	22ddcfd2df	[asan] fix the ObjC support (asan Issue #33 ) llvm-svn: 149300	2012-01-30 23:50:10 +00:00
Chad Rosier	6a0baa8f09	Typo. llvm-svn: 149289	2012-01-30 22:44:13 +00:00
Chad Rosier	41003f819c	Typo. llvm-svn: 149275	2012-01-30 21:13:22 +00:00
Alexander Potapenko	7a36f9d399	Fix compilation of ASan tests on OS X Lion (see http://code.google.com/p/address-sanitizer/issues/detail?id=32 ) The redzones emitted by AddressSanitizer for CFString instances confuse the linker and are of little use, so we shouldn't add them. llvm-svn: 149243	2012-01-30 10:40:22 +00:00
Nick Lewycky	1b3167edec	Fix typo. llvm-svn: 149185	2012-01-28 23:33:44 +00:00
Kostya Serebryany	7471d1303d	[asan] correctly use ConstantExpr::getGetElementPtr. Catch by NAKAMURA Takumi llvm-svn: 149172	2012-01-28 04:27:16 +00:00
Chris Lattner	0256be96f2	continue making the world safe for ConstantDataVector. At this point, we should (theoretically optimize and codegen ConstantDataVector as well as ConstantVector. llvm-svn: 149116	2012-01-27 03:08:05 +00:00
Chris Lattner	fa77500d96	Continue improving support for ConstantDataAggregate, and use the new methods recently added to (sometimes greatly!) simplify code. llvm-svn: 149024	2012-01-26 02:32:04 +00:00
Chris Lattner	8326bd8e10	some general cleanup, using new methods and tidying up old code. llvm-svn: 149006	2012-01-26 00:42:34 +00:00
Nick Lewycky	3c3feaf40c	Gracefully degrade precision in branch probability numbers. llvm-svn: 148946	2012-01-25 09:43:14 +00:00
Chris Lattner	6705883ad8	use Constant::getAggregateElement to simplify a bunch of code. llvm-svn: 148934	2012-01-25 06:48:06 +00:00
Chris Lattner	47a86bdbe2	use ConstantVector::getSplat in a few places. llvm-svn: 148929	2012-01-25 06:02:56 +00:00
Kostya Serebryany	c11d1dd133	[asan] enable asan only for the functions that have Attribute::AddressSafety llvm-svn: 148846	2012-01-24 19:34:43 +00:00
Chris Lattner	a0d01ff567	basic instcombine support for CDS. llvm-svn: 148806	2012-01-24 14:31:22 +00:00
Alexander Potapenko	c94cf8faf6	Implemented AddressSanitizer::getPassName() llvm-svn: 148697	2012-01-23 11:22:43 +00:00
David Blaikie	46a9f016c5	More dead code removal (using -Wunreachable-code) llvm-svn: 148578	2012-01-20 21:51:11 +00:00
Andrew Trick	b9c822ab0b	Handle a corner case with IV chain collection with bailout instead of assert. Fixes PR11783: bad cast to AddRecExpr. llvm-svn: 148572	2012-01-20 21:23:40 +00:00
Kostya Serebryany	a5054ad2f3	Extend Attributes to 64 bits Problem: LLVM needs more function attributes than currently available (32 bits). One such proposed attribute is "address_safety", which shows that a function is being checked for address safety (by AddressSanitizer, SAFECode, etc). Solution: - extend the Attributes from 32 bits to 64-bits - wrap the object into a class so that unsigned is never erroneously used instead - change "unsigned" to "Attributes" throughout the code, including one place in clang. - the class has no "operator uint64 ()", but it has "uint64_t Raw() " to support packing/unpacking. - the class has "safe operator bool()" to support the common idiom: if (Attributes attr = getAttrs()) useAttrs(attr); - The CTOR from uint64_t is marked explicit, so I had to add a few explicit CTOR calls - Add the new attribute "address_safety". Doing it in the same commit to check that attributes beyond first 32 bits actually work. - Some of the functions from the Attribute namespace are worth moving inside the class, but I'd prefer to have it as a separate commit. Tested: "make check" on Linux (32-bit and 64-bit) and Mac (10.6) built/run spec CPU 2006 on Linux with clang -O2. This change will break clang build in lib/CodeGen/CGCall.cpp. The following patch will fix it. llvm-svn: 148553	2012-01-20 17:56:17 +00:00
Andrew Trick	c908b43d9f	SCEVExpander fixes. Affects LSR and indvars. LSR has gradually been improved to more aggressively reuse existing code, particularly existing phi cycles. This exposed problems with the SCEVExpander's sloppy treatment of its insertion point. I applied some rigor to the insertion point problem that will hopefully avoid an endless bug cycle in this area. Changes: - Always used properlyDominates to check safe code hoisting. - The insertion point provided to SCEV is now considered a lower bound. This is usually a block terminator or the use itself. Under no cirumstance may SCEVExpander insert below this point. - LSR is reponsible for finding a "canonical" insertion point across expansion of different expressions. - Robust logic to determine whether IV increments are in "expanded" form and/or can be safely hoisted above some insertion point. Fixes PR11783: SCEVExpander assert. llvm-svn: 148535	2012-01-20 07:41:13 +00:00
Dan Gohman	8ee108bf98	Set the "tail" flag on pattern-matched objc_storeStrong calls. rdar://10531041. llvm-svn: 148490	2012-01-19 19:14:36 +00:00
Nick Lewycky	219e6bcb71	Actually, this code handles wrapped sets just fine. Noticed by inspection. llvm-svn: 148487	2012-01-19 18:19:42 +00:00
Dan Gohman	8f12faeb14	Add a depth limit to avoid runaway recursion. llvm-svn: 148419	2012-01-18 21:24:45 +00:00
Dan Gohman	82041c2e60	Use llvm.global_ctors to locate global constructors instead of recognizing them by name. llvm-svn: 148416	2012-01-18 21:19:38 +00:00
Jakub Staszak	632a355a01	Remove trailing spaces and unneeded includes. llvm-svn: 148415	2012-01-18 21:16:33 +00:00
Dan Gohman	e7a243fea5	Add a new ObjC ARC optimization pass to eliminate unneeded autorelease push+pop pairs. llvm-svn: 148330	2012-01-17 20:52:24 +00:00
Dan Gohman	b9936296d3	Add a new PassManagerBuilder customization point, EP_ModuleOptimizerEarly, to allow passes to be added before the main ModulePass optimizers. llvm-svn: 148329	2012-01-17 20:51:32 +00:00
Andrew Trick	12728f04ca	LSR fix: broaden the check for loop preheaders. It's becoming clear that LoopSimplify needs to unconditionally create loop preheaders. But that is a bigger fix. For now, continuing to hack LSR. Fixes rdar://10701050 "Cannot split an edge from an IndirectBrInst" assert. llvm-svn: 148288	2012-01-17 06:45:52 +00:00
David Blaikie	b48ed1a4cb	Remove unreachable code. (replace with llvm_unreachable to help GCC where necessary) llvm-svn: 148284	2012-01-17 04:43:56 +00:00
Stepan Dyatkovskiy	2931a59ec5	Fixed comment in loop-unswitch. llvm-svn: 148252	2012-01-16 20:48:04 +00:00
Stepan Dyatkovskiy	7ec12e431a	Cosmetic patch for r148215. llvm-svn: 148216	2012-01-15 09:45:11 +00:00
Stepan Dyatkovskiy	cb2adbacf8	Fixup for r148132. Type replacement for LoopsProperties: from DenseMap to std::map, since we need to keep a valid pointer to properties of current loop. Message for r148132: LoopUnswitch: All helper data that is collected during loop-unswitch iterations was moved to separated class (LUAnalysisCache). llvm-svn: 148215	2012-01-15 09:44:07 +00:00
Dan Gohman	4cf362acc1	Fix an unused variable warning that Chad noticed. llvm-svn: 148164	2012-01-14 00:47:44 +00:00
Eli Friedman	d476fdc392	Speculatively revert r148132+r148133 to try and fix a buildbot failure. llvm-svn: 148149	2012-01-13 22:34:39 +00:00
Stepan Dyatkovskiy	0a920fa210	Cosmetic patch for r148132. llvm-svn: 148133	2012-01-13 19:27:22 +00:00
Stepan Dyatkovskiy	cbcbdb237f	LoopUnswitch: All helper data that is collected during loop-unswitch iterations was moved to separated class (LUAnalysisCache). llvm-svn: 148132	2012-01-13 19:13:54 +00:00
Dan Gohman	728db4997a	Implement proper ObjC ARC objc_retainBlock "escape" analysis, so that the optimizer doesn't eliminate objc_retainBlock calls which are needed for their side effect of copying blocks onto the heap. This implements rdar://10361249. llvm-svn: 148076	2012-01-13 00:39:07 +00:00
Eli Friedman	b31c627be1	Re-fix the issue Bill fixed in r147899 in a slightly different way, which doesn't abuse the semantics of linker_private. We don't really want to merge any string constant with a weak_odr global. llvm-svn: 147971	2012-01-11 22:06:46 +00:00
Kostya Serebryany	687d078192	[asan] extend the workaround for http://llvm.org/bugs/show_bug.cgi?id=11395 : don't instrument the function at all on x86_32 if it has a large asm blob llvm-svn: 147953	2012-01-11 18:15:23 +00:00
Stepan Dyatkovskiy	8216569812	Improved compile time: 1. Size heuristics changed. Now we calculate number of unswitching branches only once per loop. 2. Some checks was moved from UnswitchIfProfitable to processCurrentLoop, since it is not changed during processCurrentLoop iteration. It allows decide to skip some loops at an early stage. Extended statistics: - Added total number of instructions analyzed. llvm-svn: 147935	2012-01-11 08:40:51 +00:00
Bill Wendling	c79155192d	If the global variable is removed by the linker, then don't constant merge it with other symbols. An object in the __cfstring section is suppoed to be filled with CFString objects, which have a pointer to ___CFConstantStringClassReference followed by a pointer to a __cstring. If we allow the object in the __cstring section to be merged with another global, then it could end up in any section. Because the linker is going to remove these symbols in the final executable, we shouldn't bother to merge them. <rdar://problem/10564621> llvm-svn: 147899	2012-01-11 00:13:08 +00:00
Andrew Trick	d5d2db9af9	Enable LSR IV Chains with sufficient heuristics. These heuristics are sufficient for enabling IV chains by default. Performance analysis has been done for i386, x86_64, and thumbv7. The optimization is rarely important, but can significantly speed up certain cases by eliminating spill code within the loop. Unrolled loops are prime candidates for IV chains. In many cases, the final code could still be improved with more target specific optimization following LSR. The goal of this feature is for LSR to make the best choice of induction variables. Instruction selection may not completely take advantage of this feature yet. As a result, there could be cases of slight code size increase. Code size can be worse on x86 because it doesn't support postincrement addressing. In fact, when chains are formed, you may see redundant address plus stride addition in the addressing mode. GenerateIVChains tries to compensate for the common cases. On ARM, code size increase can be mitigated by using postincrement addressing, but downstream codegen currently misses some opportunities. llvm-svn: 147826	2012-01-10 01:45:08 +00:00
Andrew Trick	248d410e3e	Adding IV chain generation to LSR. After collecting chains, check if any should be materialized. If so, hide the chained IV users from the LSR solver. LSR will only solve for the head of the chain. GenerateIVChains will then materialize the chained IV users by computing the IV relative to its previous value in the chain. In theory, chained IV users could be exposed to LSR's solver. This would be considerably complicated to implement and I'm not aware of a case where we need it. In practice it's more important to intelligently prune the search space of nontrivial loops before running the solver, otherwise the solver is often forced to prune the most optimal solutions. Hiding the chained users does this well, so that LSR is more likely to find the best IV for the chain as a whole. llvm-svn: 147801	2012-01-09 21:18:52 +00:00
Andrew Trick	29fe5f03d7	Adding collection of IV chains to LSR. This collects a set of IV uses within the loop whose values can be computed relative to each other in a sequence. Following checkins will make use of this information. llvm-svn: 147797	2012-01-09 19:50:34 +00:00
Andrew Trick	4dc3eff5ae	"Minor LSR debugging stuff" llvm-svn: 147785	2012-01-09 18:58:16 +00:00
Benjamin Kramer	f7fe24f40a	Move assert to the right place. llvm-svn: 147779	2012-01-09 17:36:29 +00:00
Benjamin Kramer	f9d0cc0160	InstCombine: Teach foldLogOpOfMaskedICmpsHelper that sign bit tests are bit tests. This subsumes several other transforms while enabling us to catch more cases. llvm-svn: 147777	2012-01-09 17:23:27 +00:00
Benjamin Kramer	6609f741b9	Tweak my last commit to be less conservative about uses. We still save an instruction when just the "and" part is replaced. Also change the code to match comments more closely. llvm-svn: 147753	2012-01-08 21:12:51 +00:00
Benjamin Kramer	da37e15345	InstCombine: If we have a bit test and a sign test anded/ored together, merge the sign bit into the bit test. This is common in bit field code, e.g. checking if the first or the last bit of a bit field is set. llvm-svn: 147749	2012-01-08 18:32:24 +00:00
Andrew Trick	06f6c05d08	Enable redundant phi elimination after LSR. This will be more important as we extend the LSR pass in ways that don't rely on the formula solver. In particular, we need it for constructing IV chains. llvm-svn: 147724	2012-01-07 07:08:17 +00:00
Andrew Trick	732ad80dbb	LSR: Don't optimize loops if an outer loop has no preheader. LoopSimplify may not run on some outer loops, e.g. because of indirect branches. SCEVExpander simply cannot handle outer loops with no preheaders. Fixes rdar://10655343 SCEVExpander segfault. llvm-svn: 147718	2012-01-07 03:16:50 +00:00
Andrew Trick	2ec61a896b	LSR: run DeleteDeadPhis before replaceCongruentPhis. llvm-svn: 147711	2012-01-07 01:36:44 +00:00
Andrew Trick	5adedf5d47	Extended replaceCongruentPhis to handle mixed phi types. llvm-svn: 147707	2012-01-07 01:12:09 +00:00
Kostya Serebryany	3411f2ea68	[asan] cleanup: remove the SIGILL-related code (compiler part) llvm-svn: 147667	2012-01-06 18:09:21 +00:00
Dan Gohman	5ab9c0a927	Fix SpeculativelyExecuteBB to either speculate all or none of the phis present in the bottom of the CFG triangle, as the transformation isn't ever valuable if the branch can't be eliminated. Also, unify some heuristics between SimplifyCFG's multiple if-converters, for consistency. This fixes rdar://10627242. llvm-svn: 147630	2012-01-05 23:58:56 +00:00
Eli Friedman	55fa49f32d	PR11705, part 2: globalopt shouldn't put inttoptr/ptrtoint operations into global initializers if there's an implied extension or truncation. llvm-svn: 147625	2012-01-05 23:03:32 +00:00
Dan Gohman	5267211899	Revert r56315. When the instruction to speculate is a load, this code can incorrectly move the load across a store. This never happens in practice today, but only because the current heuristics accidentally preclude it. llvm-svn: 147623	2012-01-05 22:54:35 +00:00
Nick Lewycky	f740db31e2	SCCCaptured is trivially false on entry to this loop and not modified inside it. Eliminate the dead test for it on each loop iteration. No functionality change. llvm-svn: 147616	2012-01-05 22:21:45 +00:00
Nick Lewycky	6d1d4bb6a1	Remove pointless asserts. llvm-svn: 147529	2012-01-04 09:42:30 +00:00
Nick Lewycky	0c48afa0ed	Teach instcombine all sorts of great stuff about shifts that have exact, nuw or nsw bits on them. llvm-svn: 147528	2012-01-04 09:28:29 +00:00
Nick Lewycky	b59008c694	Make use of the exact bit when optimizing '(X >>exact 3) << 1' to eliminate the 'and' that would zero out the trailing bits, and to produce an exact shift ourselves. llvm-svn: 147391	2011-12-31 21:30:22 +00:00
Nick Lewycky	4c378a4453	Change CaptureTracking to pass a Use* instead of a Value* when a value is captured. This allows the tracker to look at the specific use, which may be especially interesting for function calls. Use this to fix 'nocapture' deduction in FunctionAttrs. The existing one does not iterate until a fixpoint and does not guarantee that it produces the same result regardless of iteration order. The new implementation builds up a graph of how arguments are passed from function to function, and uses a bottom-up walk on the argument-SCCs to assign nocapture. This gets us nocapture more often, and does so rather efficiently and independent of iteration order. llvm-svn: 147327	2011-12-28 23:24:21 +00:00
Nick Lewycky	8640fdf0b7	Demystify this comment. llvm-svn: 147307	2011-12-28 06:57:32 +00:00
Nick Lewycky	398255e70c	Use false not zero, as a bool. llvm-svn: 147292	2011-12-27 18:27:22 +00:00
Nick Lewycky	a8e84fb56b	Turn cos(-x) into cos(x). Patch by Alexander Malyshev! llvm-svn: 147291	2011-12-27 18:25:50 +00:00
Nick Lewycky	c554a9b58e	Teach simplifycfg to recompute branch weights when merging some branches, and to discard weights when appropriate. Still more to do (and a new TODO), but it's a start! llvm-svn: 147286	2011-12-27 04:31:52 +00:00
Rafael Espindola	2b14b80b60	Fix warning. llvm-svn: 147284	2011-12-26 23:12:42 +00:00
Nick Lewycky	8d302df4a4	Update the branch weight metadata when reversing the order of a branch. llvm-svn: 147280	2011-12-26 20:54:14 +00:00
Nick Lewycky	e87d54c817	Sort includes, canonicalize whitespace, fix typos. No functionality change. llvm-svn: 147279	2011-12-26 20:37:40 +00:00
Benjamin Kramer	b16bd77bd2	InstCombine: Add a combine that turns (2^n)-1 ^ x back into (2^n)-1 - x iff x is smaller than 2^n and it fuses with a following add. This was intended to undo the sub canonicalization in cases where it's not profitable, but it also finds some cases on it's own. llvm-svn: 147256	2011-12-24 17:31:53 +00:00
Benjamin Kramer	010337c838	InstCombine: Canonicalize (2^n)-1 - x into (2^n)-1 ^ x iff x is known to be smaller than 2^n. This has the obvious advantage of being commutable and is always a win on x86 because const - x wastes a register there. On less weird architectures this may lead to a regression because other arithmetic doesn't fuse with it anymore. I'll address that problem in a followup. llvm-svn: 147254	2011-12-24 17:31:38 +00:00
Nick Lewycky	d9d1de4f69	Fix typo "infinte". llvm-svn: 147226	2011-12-23 23:49:25 +00:00
Mon P Wang	5d44a4332a	When not destroying the source, the linker is not remapping the types. Added support to CloneFunctionInto to allow remapping for this case. llvm-svn: 147217	2011-12-23 02:18:32 +00:00
Chad Rosier	3ba90a1655	Add the actual code for r147175. llvm-svn: 147176	2011-12-22 21:10:46 +00:00
Chad Rosier	1b7e2baf47	Speculatively revert r146578 to determine if it is the cause of a number of performance regressions (both execution-time and compile-time) on our nightly testers. Original commit message: Fix for bug #11429: Wrong behaviour for switches. Small improvement for code size heuristics. llvm-svn: 147131	2011-12-22 02:40:57 +00:00
Dan Gohman	51c81685a8	Fix a copy+pasto. No testcase, because the symptoms of dereferencing an invalid iterator aren't reproducible. rdar://10614085. llvm-svn: 147098	2011-12-21 21:43:50 +00:00
Nick Lewycky	b4039f633c	Make some intrinsics safe to speculatively execute. llvm-svn: 147036	2011-12-21 05:52:02 +00:00
David Blaikie	a379b18173	Unweaken vtables as per http://llvm.org/docs/CodingStandards.html#ll_virtual_anch llvm-svn: 146960	2011-12-20 02:50:00 +00:00
Jakub Staszak	1b1d523d9e	- Use getExitingBlock instead of getExitingBlocks. - Remove trailing spaces. llvm-svn: 146854	2011-12-18 21:52:30 +00:00
Kevin Enderby	8b3deabd2d	Revert r146822 at Pete Cooper's request as it broke clang self hosting. Hope I did this correctly :) llvm-svn: 146834	2011-12-17 19:48:52 +00:00
Pete Cooper	eadf124d2b	SimplifyCFG now predicts some conditional branches to true or false depending on previous branch on same comparison operands. For example, if (a == b) { if (a > b) // this is false Fixes some of the issues on <rdar://problem/10554090> llvm-svn: 146822	2011-12-17 06:32:38 +00:00
Pete Cooper	ebf98c1304	Refactor code used in InstCombine::FoldAndOfICmps to new file. This will be used by SimplifyCfg in a later commit. llvm-svn: 146803	2011-12-17 01:20:32 +00:00
Dan Gohman	518cda42b9	The powers that be have decided that LLVM IR should now support 16-bit "half precision" floating-point with a first-class type. This patch adds basic IR support (but not codegen support). llvm-svn: 146786	2011-12-17 00:04:22 +00:00
Andrew Trick	ca3417e932	Avoid a confusing assert for silly options: -unroll-runtime -unroll-count=1. No need for an explicit test case for an unsupported combination of options. llvm-svn: 146721	2011-12-16 02:03:48 +00:00
Kostya Serebryany	7a9eb49a47	[asan] add the name of the module to the description of a global variable. This improves the readability of global-buffer-overflow reports. llvm-svn: 146698	2011-12-15 22:55:55 +00:00
Kostya Serebryany	cd1aba8b4d	[asan] fix a bug (issue 19) where dlclose and the following mmap caused a false positive. compiler part. llvm-svn: 146688	2011-12-15 21:59:03 +00:00
Pete Cooper	b33c297f14	Added InstCombine for "select cond, ~cond, x" type patterns These can be reduced to "~cond & x" or "~cond \| x" llvm-svn: 146624	2011-12-15 00:56:45 +00:00
Eli Friedman	16ad2905a3	Make loop preheader insertion in LoopSimplify handle the case where the loop header is a landing pad correctly (by splitting the landingpad out of the loop header). Make some adjustments to the rest of LoopSimplify to make it clear that the rest of LoopSimplify isn't making bad assumptions about the presence of landing pads. PR11575. llvm-svn: 146621	2011-12-15 00:50:34 +00:00
Dan Gohman	75d7d5e988	Move Instruction::isSafeToSpeculativelyExecute out of VMCore and into Analysis as a standalone function, since there's no need for it to be in VMCore. Also, update it to use isKnownNonZero and other goodies available in Analysis, making it more precise, enabling more aggressive optimization. llvm-svn: 146610	2011-12-14 23:49:11 +00:00
Stepan Dyatkovskiy	d7b2bb3bdd	Fix for bug #11429 : Wrong behaviour for switches. Small improvement for code size heuristics. llvm-svn: 146578	2011-12-14 19:19:17 +00:00

... 3 4 5 6 7 ...

8950 Commits