llvm-project

Commit Graph

Author	SHA1	Message	Date
Devang Patel	eb1bb4e419	Until now all debug info MDNodes referred to a root MDNode, a compile unit. This simplified handling of these needs in dwarf writer. However, one side effect of this is that during link time optimization all these MDNodes are _not_ uniqued. In other words there will be N number of MDNodes describing "int", "char" and all other types, which would suddenly grow when each object file starts using libraries like STL. MDNodes graph structure such that compiler unit keeps track of important MDNodes and update dwarf writer to process mdnodes top-down instead of bottom up. llvm-svn: 137778	2011-08-16 22:09:43 +00:00
Bill Wendling	8ddfc09e7a	Use the getFirstInsertionPt() method instead of getFirstNonPHI + an 'isa<>' check for a LandingPadInst. llvm-svn: 137745	2011-08-16 20:45:24 +00:00
Bill Wendling	be33e8d58d	A few places where we want to skip the landingpad instruction for insertion. llvm-svn: 137712	2011-08-16 04:52:55 +00:00
Devang Patel	2b8acaf4f3	Add a finalize() hook, that'll let DIBuilder construct compile unit lazily. llvm-svn: 137673	2011-08-15 23:00:00 +00:00
Eli Friedman	4419cd2464	Add some comments here because the lack of a check for volatile/atomic here is a bit unusual. llvm-svn: 137662	2011-08-15 21:56:39 +00:00
Bill Wendling	e86965ee19	Duncan pointed out that the LandingPadInst might read memory. (It might also write to memory.) Marking it as such makes some checks for immobility go away. llvm-svn: 137655	2011-08-15 21:14:31 +00:00
Eli Friedman	5494adac67	Misc analysis passes that need to be aware of atomic load/store. llvm-svn: 137650	2011-08-15 20:54:19 +00:00
Eli Friedman	91386c7be4	Atomic load/store support in LICM. llvm-svn: 137648	2011-08-15 20:52:09 +00:00
Bill Wendling	9af5b22b76	The landingpad instruction isn't loop-invariant. llvm-svn: 137628	2011-08-15 18:22:49 +00:00
Devang Patel	dfd6ec3ce1	Refactor. Global variables are part of compile unit so let CompileUnit create new global variable. llvm-svn: 137621	2011-08-15 17:57:41 +00:00
Duncan Sands	a41634e307	Silence a bunch (but not all) "variable written but not read" warnings when building with assertions disabled. llvm-svn: 137460	2011-08-12 14:54:45 +00:00
Andrew Trick	2b6860f0a1	Allow loop unrolling to get known trip counts from ScalarEvolution. SCEV unrolling can unroll loops with arbitrary induction variables. It is a prerequisite for -disable-iv-rewrite performance. It is also easily handles loops of arbitrary structure including multiple exits and is generally more robust. This is under a temporary option to avoid affecting default behavior for the next couple of weeks. It is needed so that I can checkin unit tests for updateUnloop. llvm-svn: 137384	2011-08-11 23:36:16 +00:00
Andrew Trick	c12c30a670	Fix for LoopInfo::updateUnloop. Remove subloop blocks from former ancestor loops. I have a unit test that depends on scev-unroll, which unfortunately isn't checked in. But I will check it in when I can. llvm-svn: 137341	2011-08-11 20:27:32 +00:00
Andrew Trick	266ab10012	Cleanup. Another thorough review by Nick! llvm-svn: 137317	2011-08-11 17:54:58 +00:00
Andrew Trick	d3530b9117	Reapplying r136844. An algorithm for incrementally updating LoopInfo within a LoopPassManager. The incremental update should be extremely cheap in most cases and can be used in places where it's not feasible to regenerate the entire loop forest. - "Unloop" is a node in the loop tree whose last backedge has been removed. - Perform reverse dataflow on the block inside Unloop to propagate the nearest loop from the block's successors. - For reducible CFG, each block in unloop is visited exactly once. This is because unloop no longer has a backedge and blocks within subloops don't change parents. - Immediate subloops are summarized by the nearest loop reachable from their exits or exits within nested subloops. - At completion the unloop blocks each have a new parent loop, and each immediate subloop has a new parent. llvm-svn: 137276	2011-08-10 23:22:57 +00:00
Devang Patel	bb23a4a9a5	Distinguish between two copies of one inlined variable. Take 2. llvm-svn: 137253	2011-08-10 21:50:54 +00:00
Andrew Trick	78b40c3f3a	Cleanup. Added LoopBlocksDFS::perform for simple clients. llvm-svn: 137195	2011-08-10 01:59:05 +00:00
Devang Patel	3d6e38942d	Provide method to print variable's extended name which includes inline location. llvm-svn: 137095	2011-08-09 01:03:14 +00:00
Andrew Trick	6d45a01b67	Made SCEV's UDiv expressions more canonical. When dividing a recurrence, the initial values low bits can sometimes be ignored. To take advantage of this, added FoldIVUser to IndVarSimplify to fold an IV operand into a udiv/lshr if the operator doesn't affect the result. -indvars -disable-iv-rewrite now transforms i = phi i4 i1 = i0 + 1 idx = i1 >> (2 or more) i4 = i + 4 into i = phi i4 idx = i0 >> ... i4 = i + 4 llvm-svn: 137013	2011-08-06 07:00:37 +00:00
Chandler Carruth	81b7e11c89	Temporarily revert r135528 which distinguishes between two copies of one inlined variable, based on the discussion in PR10542. This explodes the runtime of several passes down the pipeline due to a large number of "copies" remaining live across a large function. This only shows up with both debug and opt, but when it does it creates a many-minute compile when self-hosting LLVM+Clang. There are several other cases that show these types of regressions. All of this is tracked in PR10542, and progress is being made on fixing the issue. Once its addressed, the re-instated, but until then this restores the performance for self-hosting and other opt+debug builds. Devang, let me know if this causes any trouble, or impedes fixing it in any way, and thanks for working on this! llvm-svn: 136953	2011-08-05 00:51:31 +00:00
Duncan Sands	020c1947b7	Fix what seems an obvious typo. Patch by Ivan Krasin. Problem reported at http://habrahabr.ru/blogs/compilers/125626/. llvm-svn: 136865	2011-08-04 10:02:21 +00:00
Andrew Trick	bc673fb5f2	Reverting r136884 updateUnloop, which crashed a linux builder. llvm-svn: 136857	2011-08-04 01:04:37 +00:00
Andrew Trick	468eadbbb2	An algorithm for incrementally updating LoopInfo within a LoopPassManager. The incremental update should be extremely cheap in most cases and can be used in places where it's not feasible to regenerate the entire loop forest. - "Unloop" is a node in the loop tree whose last backedge has been removed. - Perform reverse dataflow on the block inside Unloop to propagate the nearest loop from the block's successors. - For reducible CFG, each block in unloop is visited exactly once. This is because unloop no longer has a backedge and blocks within subloops don't change parents. - Immediate subloops are summarized by the nearest loop reachable from their exits or exits within nested subloops. - At completion the unloop blocks each have a new parent loop, and each immediate subloop has a new parent. llvm-svn: 136844	2011-08-03 23:50:25 +00:00
Andrew Trick	f898cbde5e	whitespace llvm-svn: 136843	2011-08-03 23:45:50 +00:00
Jakub Staszak	a60d130f26	Add more constantness in BlockFrequencyInfo. llvm-svn: 136816	2011-08-03 21:30:57 +00:00
Bill Wendling	035ea32870	Add this back in for now. There are still a few passes which create unwind instructions at the moment. llvm-svn: 136756	2011-08-03 01:07:57 +00:00
Bill Wendling	ae3380faff	Replace the 'UnwindInst' check with a check for 'ResumeInst', which also exits the function, because the UnwindInst is going away. llvm-svn: 136751	2011-08-03 00:30:19 +00:00
Andrew Trick	77c55428fa	Use consistent terminology for loop exit/exiting blocks. Name change only. llvm-svn: 136677	2011-08-02 04:23:35 +00:00
Jakub Staszak	8b13b59f60	Change SmallVector to SmallPtrSet in BranchProbabilityInfo. Handle cases where one than one successor goes to the same block. llvm-svn: 136638	2011-08-01 19:16:26 +00:00
Jakub Staszak	6651b33671	Do not handle cases with >= and <= predicates. llvm-svn: 136588	2011-07-31 05:54:04 +00:00
Jakub Staszak	e348afb612	Remove untrue comment. llvm-svn: 136587	2011-07-31 04:51:14 +00:00
Jakub Staszak	bfb1ae223b	Do not handle case where LHS is equal to zero, because InstCombiner always moves it to RHS anyway. llvm-svn: 136586	2011-07-31 04:47:20 +00:00
Jakub Staszak	17af66a62f	Add Zero Heurestics to BranchProbabilityInfo. If we compare value to zero we decide whether condition is likely to be true this way: x == 0 -> false x < 0 -> false x <= 0 -> false x != 0 -> true x > 0 -> true x >= 0 -> true llvm-svn: 136583	2011-07-31 03:27:24 +00:00
Jakub Staszak	efd94c8fea	Add more constantness in BranchProbabilityInfo. llvm-svn: 136502	2011-07-29 19:30:00 +00:00
Jakub Staszak	0978426843	Remove incEdgeWeight and decEdgeWeight. Set edge weight directly to avoid rounding errors. llvm-svn: 136456	2011-07-29 02:36:53 +00:00
Chandler Carruth	9d7feab3e0	Rewrite the CMake build to use explicit dependencies between libraries, specified in the same file that the library itself is created. This is more idiomatic for CMake builds, and also allows us to correctly specify dependencies that are missed due to bugs in the GenLibDeps perl script, or change from compiler to compiler. On Linux, this returns CMake to a place where it can relably rebuild several targets of LLVM. I have tried not to change the dependencies from the ones in the current auto-generated file. The only places I've really diverged are in places where I was seeing link failures, and added a dependency. The goal of this patch is not to start changing the dependencies, merely to move them into the correct location, and an explicit form that we can control and change when necessary. This also removes a serialization point in the build because we don't have to scan all the libraries before we begin building various tools. We no longer have a step of the build that regenerates a file inside the source tree. A few other associated cleanups fall out of this. This isn't really finished yet though. After talking to dgregor he urged switching to a single CMake macro to construct libraries with both sources and dependencies in the arguments. Migrating from the two macros to that style will be a follow-up patch. Also, llvm-config is still generated with GenLibDeps.pl, which means it still has slightly buggy dependencies. The internal CMake 'llvm-config-like' macro uses the correct explicitly specified dependencies however. A future patch will switch llvm-config generation (when using CMake) to be based on these deps as well. This may well break Windows. I'm getting a machine set up now to dig into any failures there. If anyone can chime in with problems they see or ideas of how to solve them for Windows, much appreciated. llvm-svn: 136433	2011-07-29 00:14:25 +00:00
Jakub Staszak	eec01ccbf9	Change LBH_TAKEN_WEIGHT to 124 (from 128). Right now, sum of LBH_TAKEN_WEIGHT + LBH_NONTAKEN_WEIGHT = 128 which in _most_ cases reduce number of rounding errors. llvm-svn: 136428	2011-07-28 23:42:08 +00:00
Jakub Staszak	d07b2e159a	Heuristics are in descending priority now. If we use one of them, skip the rest. llvm-svn: 136402	2011-07-28 21:45:07 +00:00
Jakub Staszak	bcb3c65bb4	Add InEdges (edges from header to the loop) in Loop Branch Heuristics, so there is no frequency difference whether condition is in the header or in the latch. llvm-svn: 136398	2011-07-28 21:33:46 +00:00
Jakub Staszak	da3df4302a	Use BlockFrequency instead of uint32_t in BlockFrequencyInfo. llvm-svn: 136278	2011-07-27 22:05:51 +00:00
Jeffrey Yasskin	6381c0100b	Explicitly cast narrowing conversions inside {}s that will become errors in C++0x. llvm-svn: 136211	2011-07-27 06:22:51 +00:00
Eli Friedman	8b5277c6cf	Minor simplification. llvm-svn: 136202	2011-07-27 01:02:25 +00:00
Eli Friedman	ae8161e774	Fix AliasSetTracker so that it doesn't make any assumptions about instructions it doesn't know about (like the atomic instructions I'm adding). llvm-svn: 136198	2011-07-27 00:46:46 +00:00
Andrew Trick	3ca3f98c2c	SCEV: Added a data structure for storing not-taken info per loop exit. Added an interfaces for querying either the loop's exact/max backedge taken count or a specific loop exit's not-taken count. llvm-svn: 136100	2011-07-26 17:19:55 +00:00
Duncan Sands	c1c92719a4	Add helper function for getting true/false constants in a uniform way for i1 and vector of i1 types. Use these to make some code more self-documenting. llvm-svn: 136079	2011-07-26 15:03:53 +00:00
Jakub Staszak	875ebd5f5d	Rename BlockFrequency to BlockFrequencyInfo and MachineBlockFrequency to MachineBlockFrequencyInfo. llvm-svn: 135937	2011-07-25 19:25:40 +00:00
Frits van Bommel	ede0dc6dda	Shorten some expressions by using ArrayRef::slice(). llvm-svn: 135910	2011-07-25 15:13:01 +00:00
Jay Foad	d1b7849d49	Convert GetElementPtrInst to use ArrayRef. llvm-svn: 135904	2011-07-25 09:48:08 +00:00
Jay Foad	040dd82f44	Convert IRBuilder::CreateGEP and IRBuilder::CreateInBoundsGEP to use ArrayRef. llvm-svn: 135761	2011-07-22 08:16:57 +00:00
Jakub Staszak	b82bbf40bb	Allow getBlockFreq to return 0. llvm-svn: 135742	2011-07-22 02:24:57 +00:00
Jay Foad	ed8db7d9df	Convert ConstantExpr::getGetElementPtr and ConstantExpr::getInBoundsGetElementPtr to use ArrayRef. llvm-svn: 135673	2011-07-21 14:31:17 +00:00
Devang Patel	8fb9fd6769	There are two ways to map a variable to its lexical scope. Lexical scope information is embedded in MDNode describing the variable. It is also available as a part of DebugLoc attached with DBG_VALUE instruction. DebugLoc attached with an instruction is less reliable in optimized code so use information embedded in the MDNode. llvm-svn: 135629	2011-07-20 22:18:50 +00:00
Devang Patel	a59b24b090	Distinguish between two copies of one inlined variable. llvm-svn: 135528	2011-07-19 22:31:15 +00:00
Devang Patel	cfa82a378d	Reapply r135457. This needs llvm-gcc change, that I forgot to check-in yesterday. llvm-svn: 135504	2011-07-19 19:41:54 +00:00
Bob Wilson	da30cf84c3	Revert "Make a provision to encode inline location in a variable. This will enable dwarf writer to easily distinguish between two instances of a inlined variable in one basic block." This reverts commit 9fec5e346efdf744b151ae6604f912908315fa7a. llvm-svn: 135486	2011-07-19 16:32:50 +00:00
Jay Foad	b992a635fb	Convert SimplifyGEPInst to use ArrayRef. llvm-svn: 135482	2011-07-19 15:07:52 +00:00
Jay Foad	bf904773bb	Convert TargetData::getIndexedOffset to use ArrayRef. llvm-svn: 135478	2011-07-19 14:01:37 +00:00
Jay Foad	f4b14a2b0d	Use ArrayRef in ConstantFoldInstOperands and ConstantFoldCall. llvm-svn: 135477	2011-07-19 13:32:40 +00:00
Devang Patel	ac532dedf1	Make a provision to encode inline location in a variable. This will enable dwarf writer to easily distinguish between two instances of a inlined variable in one basic block. llvm-svn: 135457	2011-07-19 01:03:32 +00:00
Frits van Bommel	717d7edd3e	Migrate LLVM and Clang to use the new makeArrayRef(...) functions where previously explicit non-default constructors were used. Mostly mechanical with some manual reformatting. llvm-svn: 135390	2011-07-18 12:00:32 +00:00
Chris Lattner	229907cd11	land David Blaikie's patch to de-constify Type, with a few tweaks. llvm-svn: 135375	2011-07-18 04:54:35 +00:00
Benjamin Kramer	a7606b993c	Silence compiler warnings. llvm-svn: 135358	2011-07-16 22:26:27 +00:00
Jakub Staszak	623e1971ce	Remove "LoopInfo.h" include from BranchProbabilityInfo.h. llvm-svn: 135353	2011-07-16 20:31:15 +00:00
Andrew Trick	244e2c3e82	Fix SCEVEXpander to handle arbitrary phi expansion. Includes two related bug fixes and corresponding assertions for uninitialized data and missing NULL check. Test cases will be included with the new LFTR. llvm-svn: 135333	2011-07-16 00:59:39 +00:00
Jakub Staszak	abb236fe9b	Fix pointer heuristic. Check whether predicator is ICMP_NE instead of if it is not isEquality(). llvm-svn: 135296	2011-07-15 20:51:06 +00:00
Jay Foad	5bd375a6cc	Convert CallInst and InvokeInst APIs to use ArrayRef. llvm-svn: 135265	2011-07-15 08:37:34 +00:00
Jay Foad	57aa636794	Convert InsertValueInst and ExtractValueInst APIs to use ArrayRef. llvm-svn: 135040	2011-07-13 10:26:04 +00:00
Chris Lattner	13879a7091	stop using WriteTypeSymbolic. llvm-svn: 134833	2011-07-09 18:02:13 +00:00
Devang Patel	c3239d3965	Preserve debug loc. llvm-svn: 134441	2011-07-05 21:48:22 +00:00
Dan Gohman	a293f24a0d	Teach IVUsers to stop at non-affine expressions unless they are both outside the loop and reducible. This more completely hides them from LSR, which isn't usually able to do anything meaningful with non-affine expressions anyway, and this consequently hides them from SCEVExpander, which is acutely unprepared for non-affine expressions. Replace test/CodeGen/X86/lsr-nonaffine.ll with a new test that tests the new behavior. This works around the bug in PR10117 / rdar://problem/9633149, and is generally an improvement besides. llvm-svn: 134268	2011-07-01 22:05:19 +00:00
Dan Gohman	54664ed714	Improve constant folding of undef for cmp and select operators. llvm-svn: 134223	2011-07-01 01:03:43 +00:00
Andrew Trick	154d78a661	Cleanup. Fix a stupid variable name. llvm-svn: 133995	2011-06-28 05:41:52 +00:00
Andrew Trick	411daa5e81	SCEVExpander: give new insts a name that identifies the reponsible pass. llvm-svn: 133992	2011-06-28 05:07:32 +00:00
Andrew Trick	56b315a9cf	indvars --disable-iv-rewrite: sever ties with IVUsers. llvm-svn: 133988	2011-06-28 03:01:46 +00:00
Nick Lewycky	3e334a42d7	Move onlyUsedByLifetimeMarkers to ValueTracking so that it can be used by other passes as well. llvm-svn: 133904	2011-06-27 04:20:45 +00:00
Devang Patel	503c3998f3	Fix struct member's scope. Patch by Xi Wang. llvm-svn: 133828	2011-06-24 22:00:39 +00:00
Jakub Staszak	1aae619933	Calculate backedge probability correctly. llvm-svn: 133776	2011-06-23 23:52:11 +00:00
Jakub Staszak	668c6fae76	Missing files for the BlockFrequency analysis added. llvm-svn: 133767	2011-06-23 21:56:59 +00:00
Jakub Staszak	be52acc98a	Introduce BlockFrequency analysis for BasicBlocks. llvm-svn: 133766	2011-06-23 21:45:20 +00:00
Rafael Espindola	e2456536b5	Revert "revert 133714" This reverts commit e8e00f5efb4a22238f2407bf813de4606f30c5aa. The cmake build on OS X is still broken. llvm-svn: 133718	2011-06-23 14:19:39 +00:00
Dylan Noblesmith	8a4f22d017	revert 133714 It broke the build worse. llvm-svn: 133716	2011-06-23 13:56:01 +00:00
Rafael Espindola	250360d4bd	133713 broke the build, revert it. llvm-svn: 133714	2011-06-23 13:37:38 +00:00
Dylan Noblesmith	3595357772	Support: make floating-exception header private It has only one user. This eliminates the last include of config.h from the public headers -- ideally, config.h shouldn't even be installed by `make install` anymore. llvm-svn: 133713	2011-06-23 12:45:54 +00:00
Devang Patel	ccf8dbf885	New binops need debug loc. llvm-svn: 133642	2011-06-22 20:56:56 +00:00
Andrew Trick	fc4ccb20c6	IVUsers no longer needs to record the phis. llvm-svn: 133518	2011-06-21 15:43:52 +00:00
Chris Lattner	cc19efaa97	Revamp the "ConstantStruct::get" methods. Previously, these were scattered all over the place in different styles and variants. Standardize on two preferred entrypoints: one that takes a StructType and ArrayRef, and one that takes StructType and varargs. In cases where there isn't a struct type convenient, we now add a ConstantStruct::getAnon method (whose name will make more sense after a few more patches land). It would be "really really nice" if the ConstantStruct::get and ConstantVector::get methods didn't make temporary std::vectors. llvm-svn: 133412	2011-06-20 04:01:31 +00:00
Chris Lattner	67733f6557	simplify some code. llvm-svn: 133362	2011-06-18 21:46:23 +00:00
Benjamin Kramer	9319e9c5d8	Simplify code. No functionality change. llvm-svn: 133351	2011-06-18 14:42:42 +00:00
Jakub Staszak	12a43bdde5	Introduce MachineBranchProbabilityInfo class, which has similar API to BranchProbabilityInfo (expect setEdgeWeight which is not available here). Branch Weights are kept in MachineBasicBlocks. To turn off this analysis set -use-mbpi=false. llvm-svn: 133184	2011-06-16 20:22:37 +00:00
Eli Friedman	8b098b0d57	Add a limit to the number of instructions memdep will scan in a single block. This prevents (at least in some cases) O(N^2) runtime in passes like DSE. The limit in this patch is probably too high, but it is enough to stop DSE from going completely insane on a testcase I have (which has a single block with around 50,000 non-aliasing stores in it). rdar://9471075 llvm-svn: 133111	2011-06-15 23:59:25 +00:00
Eli Friedman	7d58bc7bc0	Add "unknown" results for memdep, which mean "I don't know whether a dependence for the given instruction exists in the given block". This cleans up all the existing hacks in memdep which represent this concept by returning clobber with various unrelated instructions. llvm-svn: 133031	2011-06-15 00:47:34 +00:00
Benjamin Kramer	558d09d87e	Move class into an anonymous namespace. llvm-svn: 132925	2011-06-13 18:38:56 +00:00
Andrew Trick	3d4e64b082	Branch profiling: floating-point avoidance. Patch by: Jakub Staszak! Introduces BranchProbability. Changes unsigned to uint32_t all over and uint64_t only when overflow is expected. llvm-svn: 132867	2011-06-11 01:05:22 +00:00
Dan Gohman	cc59548793	Initialize BasicAA's AliasCache to set it to use fewer buckets by default, since it usually has very few elements. This speeds up alias queries in many cases, because AliasCache.clear() doesn't have to visit as many buckets. llvm-svn: 132862	2011-06-10 22:30:30 +00:00
John McCall	729c35b680	Teach the CallGraph to ignore calls to intrinsics. llvm-svn: 132797	2011-06-09 19:46:27 +00:00
Dan Gohman	adf80ae9e4	Reapply r131781, now that the GVN bug with partially-aliasing loads is disabled. llvm-svn: 132632	2011-06-04 06:50:18 +00:00
Dan Gohman	a471751c24	Disable the main feature of 130180, the elimination of loads that are redundant with partially-aliasing loads. When computing what portion of a clobbering load value is needed, it doesn't consider phi-translation which may have occurred between the clobbing load and the redundant load. llvm-svn: 132631	2011-06-04 06:48:50 +00:00
Dan Gohman	87fdceaf73	Revert r131781 again. Apparently there is more going on here. llvm-svn: 132625	2011-06-04 05:11:22 +00:00
Nick Lewycky	75b2053863	Fold assert-only-used variable into the assert. llvm-svn: 132620	2011-06-04 02:07:10 +00:00
Andrew Trick	c73aa1ee81	Missing include of climits in the new BranchProbability pass. llvm-svn: 132616	2011-06-04 01:30:52 +00:00
Andrew Trick	49371f3f33	New BranchProbabilityInfo analysis. Patch by Jakub Staszak! BranchProbabilityInfo provides an interface for IR passes to query the likelihood that control follows a CFG edge. This patch provides an initial implementation of static branch predication that will populate BranchProbabilityInfo for branches with no external profile information using very simple heuristics. It currently isn't hooked up to any external profile data, so static prediction does all the work. llvm-svn: 132613	2011-06-04 01:16:30 +00:00
Dan Gohman	27b82f2f91	Reapply r131781 (revert r131809), now that some BasicAA shortcomings it exposed are fixed. llvm-svn: 132611	2011-06-04 00:46:31 +00:00
Dan Gohman	fb02cec44e	Fix BasicAA's recursion detection so that it doesn't pessimize queries in the case of a DAG, where a query reaches a node visited earlier, but it's not on a cycle. This avoids MayAlias results in cases where BasicAA is expected to return MustAlias or PartialAlias in order to protect TBAA. llvm-svn: 132609	2011-06-04 00:31:50 +00:00
Dan Gohman	4e7e7958d7	When merging MustAlias and PartialAlias, chose PartialAlias instead of conservatively choosing MayAlias. llvm-svn: 132579	2011-06-03 20:17:36 +00:00
Hans Wennborg	060b994a29	Test commit. llvm-svn: 132558	2011-06-03 17:15:37 +00:00
Devang Patel	1d40024322	A typedef's context is not the same as type's context. It is the context of typedef decl itself. Use extra parameter to communicate this to DIBuilder. llvm-svn: 132556	2011-06-03 17:04:51 +00:00
Eli Friedman	b576b1675c	When marking a block as being unanalyzable, use "Clobber" on the terminator instead of the first instruction in the block. This is a bit of a hack; "Clobber" isn't really the right marking in the first place. memdep doesn't really have any way of properly expressing "unanalyzable" at the moment. Using it on the terminator is much less ambiguous than using it on an arbitrary instruction, though. In the given testcase, the "Clobber" was pointing to a load, and GVN was incorrectly assuming that meant that the "Clobber" load overlapped the load being analyzed (when they are actually unrelated). The included testcase tests both this commit and r132434. Part two of rdar://9429882. (r132434 was mislabeled.) llvm-svn: 132442	2011-06-02 00:08:52 +00:00
Eli Friedman	4b6eeb9ca2	In MemoryDependenceAnalysis::getNonLocalPointerDepFromBB, if a given block is is deemed unanalyzable (and we execute one of the "goto PredTranslationFailure" statements), make sure we don't put information about the predecessors of that block into the returned data structures; this can lead to, among other things, extraneous results (which will confuse passes using memdep). Fixes an assert in GVN compiling ruby. Part of rdar://problem/9521954 . Testcase coming up soon. llvm-svn: 132434	2011-06-01 23:16:53 +00:00
Andrew Trick	8ef3ad049d	SCEV: missing null check fix for r132360, dragonegg crash. llvm-svn: 132416	2011-06-01 19:14:56 +00:00
Andrew Trick	812276eed4	scev: Better sign-extend removal. Normalize postincrement recurrences so that their sign extended forms are congruent when no overflow occurs. llvm-svn: 132360	2011-05-31 21:17:47 +00:00
Eli Friedman	7a5fc693f9	llvm.memcpy.* has two distinct associated address spaces; the source address space, and the destination address space. Fix up the interface on MemIntrinsic and MemTransferInst to make this clear, and fix InstructionDereferencesPointer in LazyValueInfo.cpp to use the interface properly. llvm-svn: 132356	2011-05-31 20:40:16 +00:00
Dan Gohman	c6f2ddfc04	Update this comment. llvm-svn: 132202	2011-05-27 18:42:33 +00:00
Chad Rosier	b362884ca9	Renamed llvm.x86.sse42.crc32 intrinsics; crc64 doesn't exist. crc32.[8\|16\|32] have been renamed to .crc32.32.[8\|16\|32] and crc64.[8\|16\|32] have been renamed to .crc32.64.[8\|64]. llvm-svn: 132163	2011-05-26 23:13:19 +00:00
Eli Friedman	bacb17906a	Change condition for determining whether a function is small for inlining metrics so that very long functions with few basic blocks are not re-analyzed. llvm-svn: 131994	2011-05-24 20:22:24 +00:00
Dan Gohman	0573b55c2b	Make DecomposeGEPExpression check SimplifyInstruction only after checking for a GEP, so that it matches what GetUnderlyingObject does. This fixes an obscure bug turned up by bugpoint in the testcase for PR9931. llvm-svn: 131971	2011-05-24 18:24:08 +00:00
Chris Lattner	026f5e61f0	fix a really nasty basicaa mod/ref calculation bug that was causing miscompilation of UnitTests/ObjC/messages-2.m with the recent optimizer improvements. llvm-svn: 131897	2011-05-23 05:15:43 +00:00
Chris Lattner	83791ced7b	Teach valuetracking that byval arguments with a specified alignment are aligned. Teach memcpyopt to not give up all hope when confonted with an underaligned memcpy feeding an overaligned byval. If the source of the memcpy can be determined to be adequeately aligned, or if it can be forced to be, we can eliminate the memcpy. This addresses PR9794. We now compile the example into: define i32 @f(%struct.p* nocapture byval align 8 %q) nounwind ssp { entry: %call = call i32 @g(%struct.p* byval align 8 %q) nounwind ret i32 %call } in both x86-64 and x86-32 mode. We still don't get a tailcall though, because tailcalls apparently can't handle byval. llvm-svn: 131884	2011-05-23 00:03:39 +00:00
Chris Lattner	713d52364f	implement PR9315, constant folding exp2 in terms of pow (since hosts without C99 runtimes don't have exp2). llvm-svn: 131872	2011-05-22 22:22:35 +00:00
Evan Cheng	2a746bfe36	Teach ValueTracking about x86 crc32 intrinsics. llvm-svn: 131861	2011-05-22 18:25:30 +00:00
Duncan Sands	5ec65765e6	Revert commit 131781, to see if it fixes the x86-64 dragonegg buildbot. Original log message: When BasicAA can determine that two pointers have the same base but differ by a dynamic offset, return PartialAlias instead of MayAlias. See the comment in the code for details. This fixes PR9971. llvm-svn: 131809	2011-05-21 20:54:46 +00:00
Dan Gohman	8b20187c82	When BasicAA can determine that two pointers have the same base but differ by a dynamic offset, return PartialAlias instead of MayAlias. See the comment in the code for details. This fixes PR9971. llvm-svn: 131781	2011-05-21 01:05:08 +00:00
Andrew Trick	f44aadf0fd	indvars: Prototyping Sign/ZeroExtend elimination without canonical IVs. No functionality enabled by default. Use -disable-iv-rewrite. Extended IVUsers to keep track of the phi that represents the users' IV. Added the WidenIV transform to replace a narrow IV with a wide IV by doing a one-for-one replacement of IV users instead of expanding the SCEV expressions. [sz]exts are removed and truncs are inserted. llvm-svn: 131744	2011-05-20 18:25:42 +00:00
Owen Anderson	97f0cf32ea	@llvm.lifetime.begin acts as a load, not @llvm.lifetime.end. llvm-svn: 131437	2011-05-17 00:05:49 +00:00
Rafael Espindola	71f8b08a80	Extra refactoring noticed by Eli Friedman. llvm-svn: 131405	2011-05-16 15:48:45 +00:00
Julien Lerouge	7e11f9e26d	Fix a source of non determinism in FindUsedTypes, use a SetVector instead of a set. rdar://9423996 llvm-svn: 131283	2011-05-13 05:20:42 +00:00
Dan Gohman	0daf687e1d	Change a few std::maps to DenseMaps. llvm-svn: 131088	2011-05-09 18:44:09 +00:00
Duncan Sands	af32728a57	The comparision "max(x,y)==x" is equivalent to "x>=y". Since the max is often expressed as "x >= y ? x : y", there is a good chance we can extract the existing "x >= y" from it and use that as a replacement for "max(x,y)==x". llvm-svn: 131049	2011-05-07 16:56:49 +00:00
Eli Friedman	8a20e66926	PR9838: Fix transform introduced in r127064 to not trigger when only one side of the icmp is an exact shift. llvm-svn: 130954	2011-05-05 21:59:18 +00:00
Hongbin Zheng	cd5afc5feb	Minor change: Fix the typo in RegionPass.h and RegionPass.cpp. llvm-svn: 130920	2011-05-05 13:59:38 +00:00
Duncan Sands	a228785526	Add variations on: max(x,y) >= min(x,z) folds to true. This isn't that common, but according to my super-optimizer there are only two missed simplifications of -instsimplify kind when compiling bzip2, and this is one of them. It amuses me to have bzip2 be perfectly optimized as far as instsimplify goes! llvm-svn: 130840	2011-05-04 16:05:05 +00:00
Andrew Trick	1abe296cfd	indvars: Added DisableIVRewrite and WidenIVs. This adds functionality to remove size/zero extension during indvars without generating a canonical IV and rewriting all IV users. It's disabled by default so should have no effect on codegen. Work in progress. llvm-svn: 130829	2011-05-04 02:10:13 +00:00
Duncan Sands	0a9c1246d7	Implement some basic simplifications involving min/max, for example max(a,b) >= a -> true. According to my super-optimizer, these are by far the most common simplifications (of the -instsimplify kind) that occur in the testsuite and aren't caught by -std-compile-opts. llvm-svn: 130780	2011-05-03 19:53:10 +00:00
Devang Patel	09fa69e151	Use llvm.dbg.cu named metadata to collect compile units. llvm-svn: 130756	2011-05-03 16:18:28 +00:00
Duncan Sands	f91c5ab341	Fix PR9579: when simplifying a compare to "true" or "false", and it was a vector compare, generate a vector result rather than i1 (and crashing). llvm-svn: 130706	2011-05-02 18:51:41 +00:00
Duncan Sands	a3e3699c88	Move some rem transforms out of instcombine and into instsimplify. This automagically provides a transform noticed by my super-optimizer as occurring quite often: "rem x, (select cond, x, 1)" -> 0. llvm-svn: 130694	2011-05-02 16:27:02 +00:00
Chris Lattner	827a270a2a	teach GVN to widen integer loads when they are overaligned, when doing an wider load would allow elimination of subsequent loads, and when the wider load is still a native integer type. This eliminates a ton of loads on various benchmarks involving struct fields, though it is somewhat hobbled by clang not being very aggressive about field alignment. This is yet another step along the way towards resolving PR6627. llvm-svn: 130390	2011-04-28 07:29:08 +00:00
Dan Gohman	5394c70d1e	Teach BasicAA about arm.neon.vld1 and vst1. llvm-svn: 130327	2011-04-27 20:44:28 +00:00
Dan Gohman	39b3a1ef7f	When analyzing functions known to only access argument pointees, only check arguments with pointer types. Update the documentation of IntrReadArgMem reflect this. While here, add support for TBAA tags on intrinsic calls. llvm-svn: 130317	2011-04-27 18:39:03 +00:00
Andrew Trick	7d1eea86d9	Corrects an old, old typo in a case that doesn't seem to be reached in practice. llvm-svn: 130316	2011-04-27 18:17:36 +00:00
Andrew Trick	01eff820ae	Test case and comment for PR9633. llvm-svn: 130294	2011-04-27 05:42:17 +00:00
Andrew Trick	759ba0802d	Fix for PR9633 [indvars] Assertion `isa<X>(Val) && "cast<Ty>() argument of incompatible type!"' failed. Added a type check in ScalarEvolution::computeSCEVAtScope to handle the case in which operands of an AddRecExpr in the current scope are folded. llvm-svn: 130271	2011-04-27 01:21:25 +00:00
Chris Lattner	7aab2799ae	Enhance memdep to return clobber relation between noalias loads when an earlier load could be widened to encompass a later load. For example, if we see: X = load i8* P, align 4 Y = load i8* (P+3), align 1 and we have a 32-bit native integer type, we can widen the former load to i32 which then makes the second load redundant. GVN can't actually do anything with this load/load relation yet, so this isn't testable, but it is the next step to resolving PR6627, and a fairly general class of "merge neighboring loads" missed optimizations. llvm-svn: 130250	2011-04-26 22:42:01 +00:00
Chris Lattner	32dc9bd1bb	use AA::isMustAlias to simplify some calls. llvm-svn: 130248	2011-04-26 21:53:34 +00:00
Chris Lattner	6b96621a8a	remove support for llvm.invariant.end from memdep. It is a work-in-progress that is not progressing, and it has issues. llvm-svn: 130247	2011-04-26 21:50:51 +00:00
Devang Patel	b5ea255fb4	Fix an off by one error while accessing complex address element of a DIVariable. This worked untill now because stars are aligned (i.e. num of complex address elments are always 0 or 2+ and when it is 2+ at least two elements are access together) llvm-svn: 130225	2011-04-26 18:24:39 +00:00
Chris Lattner	6f83d06ffa	Enhance MemDep: When alias analysis returns a partial alias result, return it as a clobber. This allows GVN to do smart things. Enhance GVN to be smart about the case when a small load is clobbered by a larger overlapping load. In this case, forward the value. This allows us to compile stuff like this: int test(void P) { int tmp = (unsigned int)P; return tmp+((unsigned char*)P+1); } into: _test: ## @test movl (%rdi), %ecx movzbl %ch, %eax addl %ecx, %eax ret which has one load. We already handled the case where the smaller load was from a must-aliased base pointer. llvm-svn: 130180	2011-04-26 01:21:15 +00:00
Dan Gohman	6acd95b3c1	Fix an iterator invalidation bug. llvm-svn: 130166	2011-04-25 22:48:29 +00:00
Jay Foad	dbf81d8ddf	PR9214: Convert the DIBuilder API to use ArrayRef. llvm-svn: 130086	2011-04-24 10:11:03 +00:00
Jay Foad	1a180156b6	Remove unused STL header includes. llvm-svn: 130068	2011-04-23 19:53:52 +00:00
Devang Patel	1d6bbd41aa	Let front-end tie subprogram declaration with subprogram definition directly. llvm-svn: 130028	2011-04-22 23:10:17 +00:00
Jay Foad	5514afe6b2	PR9214: Convert Metadata API to use ArrayRef. llvm-svn: 129932	2011-04-21 19:59:31 +00:00
Devang Patel	0c7732499b	Use ArrayRef variants. llvm-svn: 129735	2011-04-18 23:51:03 +00:00
Chandler Carruth	2b1ba48f8d	Mark some functions as used which are used within debug-only code. This silences Clang's -Wunused-function when building in release mode. llvm-svn: 129709	2011-04-18 18:49:44 +00:00
Devang Patel	514b4006c2	Introduce support to encode Objective-C property information in debugging information generated for an interface. llvm-svn: 129624	2011-04-16 00:11:51 +00:00
Chris Lattner	0ab5e2cded	Fix a ton of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129558	2011-04-15 05:18:47 +00:00
Jay Foad	0091fe8ca1	PR9214: Convert ConstantExpr::getIndices() to return an ArrayRef, plus related tweaks to ExprMapKeyType. llvm-svn: 129443	2011-04-13 15:22:40 +00:00
Jay Foad	7c14a558fe	Don't include Operator.h from InstrTypes.h. llvm-svn: 129271	2011-04-11 09:35:34 +00:00
Eli Friedman	17822fcde9	PR9604; try to deal with RAUW updates correctly in the AST. I'm not convinced it's completely safe to cache the AST across LICM runs even with this fix, but this fix can't hurt. llvm-svn: 129198	2011-04-09 06:55:46 +00:00
Devang Patel	9f738849ab	Add support to encode function's template parameters. llvm-svn: 128947	2011-04-05 22:52:06 +00:00
Chris Lattner	57ee5a5db7	remove postdom frontiers, because it is dead. Forward dom frontiers are still used by RegionInfo :( llvm-svn: 128943	2011-04-05 21:57:17 +00:00
Tobias Grosser	8b304ff9ac	Region: Allow user control the printing style of the print function. Contributed by: etherzhhb@gmail.com llvm-svn: 128808	2011-04-04 07:19:18 +00:00
Eli Friedman	8baa2c7ad9	Don't assume something which might be a constant expression is an instruction. Based on PR9429, but no testcase because I can't figure out how to trigger it anymore given other changes to the relevant code. llvm-svn: 128781	2011-04-02 22:11:56 +00:00
Jay Foad	52131344a2	Remove PHINode::reserveOperandSpace(). Instead, add a parameter to PHINode::Create() giving the (known or expected) number of operands. llvm-svn: 128537	2011-03-30 11:28:46 +00:00
Jay Foad	e0938d8a87	(Almost) always call reserveOperandSpace() on newly created PHINodes. llvm-svn: 128535	2011-03-30 11:19:20 +00:00
Frits van Bommel	0bb2ad2cf7	Constant folding support for calls to umul.with.overflow(), basically identical to the smul.with.overflow() code. llvm-svn: 128379	2011-03-27 14:26:13 +00:00
Anders Carlsson	c4f0ab397c	Revert r128140 for now. llvm-svn: 128149	2011-03-23 15:51:12 +00:00
Anders Carlsson	9ed8d93f55	A global variable with internal linkage where all uses are in one function and whose address is never taken is a non-escaping local object and can't alias anything else. llvm-svn: 128140	2011-03-23 02:19:48 +00:00
Nick Lewycky	f0469af63e	Fix INT_MIN gotcha pointed out by Eli Friedman. llvm-svn: 128028	2011-03-21 21:40:32 +00:00
Andrew Trick	1c4b42d00f	Avoid creating canonical induction variables for non-native types. For example, on 32-bit architecture, don't promote all uses of the IV to 64-bits just because one use is a 64-bit cast. Alternate implementation of the patch by Arnaud de Grandmaison. llvm-svn: 127884	2011-03-18 16:50:32 +00:00
Andrew Trick	87716c93c2	Added isValidRewrite() to check the result of ScalarEvolutionExpander. SCEV may generate expressions composed of multiple pointers, which can lead to invalid GEP expansion. Until we can teach SCEV to follow strict pointer rules, make sure no bad GEPs creep into IR. Fixes rdar://problem/9038671. llvm-svn: 127839	2011-03-17 23:51:11 +00:00
Nick Lewycky	b4d763b37d	Add comments for the demanglings. Correct mangled form of operator delete! llvm-svn: 127801	2011-03-17 05:20:12 +00:00
Nick Lewycky	c1f8658368	Add C++ global operator {new,new[],delete,delete[]}(unsigned {int,long}) to the memory builtins as equivalent to malloc/free. This is different from any attribute we have. For example, you can delete the allocators when their result is unused, but you can't collapse two calls to the same function, even if no global/memory state has changed in between. The noalias return states that the result does not alias any other pointer, but instcombine optimizes malloc() as though the result is non-null for the purpose of eliminating unused pointers. llvm-svn: 127673	2011-03-15 07:31:32 +00:00
Andrew Trick	a34f1b1f10	Remove getMinusSCEVForExitTest(). This function performed acrobatics to prove no-self-wrap, which we now have for free. llvm-svn: 127643	2011-03-15 01:16:14 +00:00
Andrew Trick	f6b01ff422	Propagate SCEV no-wrap flags whenever possible. This needs review. llvm-svn: 127638	2011-03-15 00:37:00 +00:00
Andrew Trick	e92dcceab7	Negating a recurrence preserves no-self-wrap. llvm-svn: 127593	2011-03-14 17:38:54 +00:00
Andrew Trick	f1781db622	HowFarToZero can compute a trip count as long as the recurrence has no-self-wrap. llvm-svn: 127591	2011-03-14 17:28:02 +00:00
Andrew Trick	8b55b736b1	Added SCEV::NoWrapFlags to manage unsigned, signed, and self wrap properties. Added the self-wrap flag for SCEV::AddRecExpr. A slew of temporary FIXMEs indicate the intention of the no-self-wrap flag without changing behavior in this revision. llvm-svn: 127590	2011-03-14 16:50:06 +00:00
Benjamin Kramer	5acc751b6f	Teach ComputeMaskedBits about sub nsw. llvm-svn: 127548	2011-03-12 17:18:11 +00:00
Benjamin Kramer	391a946fa9	ComputeMaskedBits: sub falls through to add, and sub doesn't have the same overflow semantics as add. Should fix the selfhost failures that started with r127463. llvm-svn: 127465	2011-03-11 14:46:49 +00:00
Nick Lewycky	cc79973856	Teach ComputeMaskedBits about nsw on add. I don't think there's anything we can do with nuw here, but sub and mul should be given similar treatment. Fixes PR9343 #15! llvm-svn: 127463	2011-03-11 09:00:19 +00:00
Devang Patel	fa31d38aad	Introduce DebugInfoProbe. This is used to monitor how llvm optimizer is treating debugging information. It generates output that lools like 8 times line number info lost by Scalar Replacement of Aggregates (SSAUp) 1 times line number info lost by Simplify well-known library calls 12 times variable info lost by Jump Threading llvm-svn: 127381	2011-03-10 00:21:25 +00:00
Andrew Trick	2afa325811	When SCEV can determine the loop test is X < X, set ExactBECount=0. When ExactBECount is a constant, use it for MaxBECount. When MaxBECount cannot be computed, replace it with ExactBECount. Fixes PR9424. llvm-svn: 127342	2011-03-09 17:29:58 +00:00
Andrew Trick	2a3b71684a	whitespace llvm-svn: 127340	2011-03-09 17:23:39 +00:00
Nick Lewycky	774647d974	Fix two cases I forgot to update when doing a mental "getSwappedPredicate". Thanks Duncan Sands! llvm-svn: 127323	2011-03-09 08:20:06 +00:00
Nick Lewycky	980104d1d6	Add another micro-optimization. Apologies for the lack of refactoring, but I gave up when I realized I couldn't come up with a good name for what the refactored function would be, to describe what it does. This is PR9343 test12, which is test3 with arguments reordered. Whoops! llvm-svn: 127318	2011-03-09 06:26:03 +00:00
Duncan Sands	7dc3d47c34	Fix PR9331. Simplified version of a patch by Jakub Staszak. llvm-svn: 127243	2011-03-08 12:39:03 +00:00
Nick Lewycky	e467979d0a	Add more analysis of the sign bit of an srem instruction. If the LHS is negative then the result could go either way. If it's provably positive then so is the srem. Fixes PR9343 #7! llvm-svn: 127146	2011-03-07 01:50:10 +00:00
Nick Lewycky	9719a719c7	Thread comparisons over udiv/sdiv/ashr/lshr exact and lshr nuw/nsw whenever possible. This goes into instcombine and instsimplify because instsimplify doesn't need to check hasOneUse since it returns (almost exclusively) constants. This fixes PR9343 #4 #5 and #8! llvm-svn: 127064	2011-03-05 05:19:11 +00:00
Dan Gohman	aa036eedb8	When decling to reuse existing expressions that involve casts, ignore bitcasts, which are really no-ops here. This fixes slowdowns on MultiSource/Applications/aha and others. llvm-svn: 127031	2011-03-04 20:46:46 +00:00
Nick Lewycky	41c529bd09	Revert broken srem logic from r126991. llvm-svn: 127021	2011-03-04 19:26:08 +00:00
Nick Lewycky	8e3a79da9f	Fold "icmp pred (srem X, Y), Y" like we do for urem. Handle signed comparisons in the urem case, though not the other way around. This is enough to get #3 from PR9343! llvm-svn: 126991	2011-03-04 10:06:52 +00:00
Nick Lewycky	3cec6f5563	Teach instruction simplify to use constant ranges to solve problems of the form "icmp pred %X, CI" and a number of examples where "%X = binop %Y, CI2". Some of these cases (div and rem) used to make it through opt -O2, but the others are probably now making code elsewhere redundant (probably instcombine). llvm-svn: 126988	2011-03-04 07:00:57 +00:00
Duncan Sands	bf577d6a86	Remove DIFactory. Patch by Devang. llvm-svn: 126871	2011-03-02 20:30:37 +00:00
Dan Gohman	7290868a1b	Don't re-use existing addrec expansions if they contain casts. This fixes PR9259. llvm-svn: 126812	2011-03-02 01:34:10 +00:00
Devang Patel	40eee1e970	Today, the language front ends produces llvm.dbg.* intrinsics, used to encode arguments' debug info, in order any way, most of the times. However, if a front end mix-n-matches llvm.dbg.declare and llvm.dbg.value intrinsics to encode debug info for arguments then code generator needs a way to find argument order. Use 8 bits from line number field to keep track of argument ordering while encoding debug info for an argument. That leaves 24 bit for line no, DebugLoc also allocates 24 bit for line numbers. If a function has more than 255 arguments then rest of the arguments will be ordered by llvm.dbg.* intrinsics' ordering in IR. llvm-svn: 126793	2011-03-01 22:58:13 +00:00
Nick Lewycky	c9d20067cd	Optimize "icmp pred (urem X, Y), Y" --> true/false depending on pred. There's more work to do here, "icmp ult (urem X, 10), 11" doesn't optimize away yet. Fixes example 3 from PR9343! llvm-svn: 126741	2011-03-01 08:15:50 +00:00
Ted Kremenek	49d15b959e	Unbreak CMake build. llvm-svn: 126717	2011-03-01 00:02:51 +00:00
Dan Gohman	161058838c	Delete the LiveValues pass. I won't get get back to the project it was started for in the foreseeable future. llvm-svn: 126668	2011-02-28 19:37:59 +00:00
Nick Lewycky	afe4a3062d	Fix comment. llvm-svn: 126645	2011-02-28 09:18:11 +00:00
Nick Lewycky	66f4f22f7b	srem doesn't actually have the same resulting sign as its numerator, you could also have a zero when numerator = denominator. Reverts parts of r126635 and r126637. llvm-svn: 126644	2011-02-28 09:17:39 +00:00
Nick Lewycky	c9aab8567b	Teach value tracking to make use of flags in more situations. llvm-svn: 126642	2011-02-28 08:02:21 +00:00
Nick Lewycky	29dbbd12c1	Teach ValueTracking to look at the dividend when determining the sign bit of an srem instruction. llvm-svn: 126637	2011-02-28 06:52:12 +00:00
Tobias Grosser	98eecaf0a9	RegionPrinter: Ignore back edges when layouting the graph llvm-svn: 126564	2011-02-27 04:11:07 +00:00
Devang Patel	9b4127349c	Follow LLVM coding style. clang uses DBuilder, so it requries corresponding change. llvm-svn: 126231	2011-02-22 18:56:12 +00:00
Benjamin Kramer	5b7a4e0195	Move "A \| ~(A & ?) -> -1" from InstCombine to InstructionSimplify. llvm-svn: 126082	2011-02-20 15:20:01 +00:00
Chris Lattner	acf6b0776a	Stores of null pointers should turn into memset, we weren't recognizing them as splat values. llvm-svn: 126041	2011-02-19 19:35:49 +00:00
Oscar Fuentes	5ed962656c	Move library stuff out of the toplevel CMakeLists.txt file. llvm-svn: 125968	2011-02-18 22:06:14 +00:00
Devang Patel	4ab0852080	Move DbgInfoPrinter specific utlities inside DbgInfoPrinter.cpp llvm-svn: 125571	2011-02-15 17:36:11 +00:00
Devang Patel	27924da676	Print function info. Patch by Minjang Kim. llvm-svn: 125567	2011-02-15 17:24:56 +00:00
Chris Lattner	69229316aa	convert ConstantVector::get to use ArrayRef. llvm-svn: 125537	2011-02-15 00:14:00 +00:00
Chris Lattner	34442e6ebf	revert my ConstantVector patch, it seems to have made the llvm-gcc builders unhappy. llvm-svn: 125504	2011-02-14 18:15:46 +00:00
Chris Lattner	d9f5b88548	Switch ConstantVector::get to use ArrayRef instead of a pointer+size idiom. Change various clients to simplify their code. llvm-svn: 125487	2011-02-14 07:55:32 +00:00
Duncan Sands	b86070933f	Remove pointless blank line. llvm-svn: 125463	2011-02-13 18:11:05 +00:00
Duncan Sands	d114ab331c	Teach instsimplify that X+Y>=X+Z is the same as Y>=Z if neither side overflows, plus some variations of this. According to my auto-simplifier this occurs a lot but usually in combination with max/min idioms. Because max/min aren't handled yet this unfortunately doesn't have much effect in the testsuite. llvm-svn: 125462	2011-02-13 17:15:40 +00:00
Chris Lattner	4f23f2be15	teach SCEV that the scale and addition of an inbounds gep don't NSW. This fixes a FIXME in scev-aa.ll (allowing a new no-alias result) and generally makes things more precise. llvm-svn: 125449	2011-02-13 03:14:49 +00:00
Chris Lattner	7936a8a488	Per discussion with Dan G, inbounds geps certainly can have unsigned overflow (e.g. "gep P, -1"), and while they can have signed wrap in theoretical situations, modelling an AddRec as not having signed wrap is going enough for any case we can think of today. In the future if this isn't enough, we can revisit this. Modeling them as having NUW isn't causing any known problems either FWIW. llvm-svn: 125410	2011-02-11 21:43:33 +00:00
Nick Lewycky	ac0b62c277	Tolerate degenerate phi nodes that can occur in the middle of optimization passes. Fixes PR9112. Patch by Jakub Staszak! llvm-svn: 125319	2011-02-10 23:54:10 +00:00
Duncan Sands	8b4e283bfb	Formatting and comment tweaks. llvm-svn: 125200	2011-02-09 17:45:03 +00:00
Chris Lattner	9e4aa0259f	Teach instsimplify some tricks about exact/nuw/nsw shifts. improve interfaces to instsimplify to take this info. llvm-svn: 125196	2011-02-09 17:15:04 +00:00
Chris Lattner	b940091388	Rework InstrTypes.h so to reduce the repetition around the NSW/NUW/Exact versions of creation functions. Eventually, the "insertion point" versions of these should just be removed, we do have IRBuilder afterall. Do a massive rewrite of much of pattern match. It is now shorter and less redundant and has several other widgets I will be using in other patches. Among other changes, m_Div is renamed to m_IDiv (since it only matches integer divides) and m_Shift is gone (it used to match all binops!!) and we now have m_LogicalShift for the one client to use. Enhance IRBuilder to have "isExact" arguments to things like CreateUDiv and reduce redundancy within IRbuilder by having these methods chain to each other more instead of duplicating code. llvm-svn: 125194	2011-02-09 17:00:45 +00:00
Duncan Sands	867cb633b4	Add an m_Div pattern for matching either a udiv or an sdiv and use it to simplify the "(X/Y)*Y->X when the division is exact" transform. llvm-svn: 125004	2011-02-07 09:36:32 +00:00
Chris Lattner	6e57b15228	teach instsimplify to transform (X / Y) * Y to X when the div is an exact udiv. llvm-svn: 124994	2011-02-06 22:05:31 +00:00
Eric Christopher	b54605b8e2	Remove premature optimization that avoided calculating argument weights if we weren't going to inline the function. The rest of the code using this was removed. Fixes PR9154. llvm-svn: 124991	2011-02-06 21:27:46 +00:00
Anders Carlsson	ecf8e159e3	Simplify test, as suggested by Chris. llvm-svn: 124990	2011-02-06 20:22:49 +00:00
Anders Carlsson	d21b06a0db	When loading from a constant, fold inttoptr if the integer type and the resulting pointer type both have the same size. llvm-svn: 124987	2011-02-06 20:11:56 +00:00
Anders Carlsson	36c6d23074	Fix another warning. llvm-svn: 124961	2011-02-05 18:33:43 +00:00
Eric Christopher	ceb4671ddd	Fix cut and paste error spotted by Jakob. llvm-svn: 124930	2011-02-05 02:48:47 +00:00
Eric Christopher	2dfbd7e0c1	Rewrite how the indirect call bonus is handled. This now works by: a) Making it a per call site bonus for functions that we can move from indirect to direct calls. b) Reduces the bonus from 500 to 100 per call site. c) Subtracts the size of the possible newly inlineable call from the bonus to only add a bonus if we can inline a small function to devirtualize it. Also changes the bonus from a positive that's subtracted to a negative that's added. Fixes the remainder of rdar://8546196 by reducing the object file size after inlining by 84%. llvm-svn: 124916	2011-02-05 00:49:15 +00:00
Duncan Sands	06504025d2	Improve threading of comparisons over select instructions (spotted by my auto-simplifier). This has a big impact on Ada code, but not much else. Unfortunately the impact is mostly negative! This is due to PR9004 (aka SCCP failing to resolve conditional branch conditions in the destination blocks of the branch), in which simple correlated expressions are not resolved but complicated ones are, so simplifying has a bad effect! llvm-svn: 124788	2011-02-03 09:37:39 +00:00
Devang Patel	df0dd7dc69	Fix typo in comment. llvm-svn: 124759	2011-02-03 00:13:47 +00:00
Devang Patel	be933b470a	Add support to describe template value parameter in debug info. llvm-svn: 124755	2011-02-02 22:35:53 +00:00
Devang Patel	3a9e65efb6	Add support to describe template parameter type in debug info. llvm-svn: 124752	2011-02-02 21:38:25 +00:00
Duncan Sands	5747abab10	Reenable the transform "(X*Y)/Y->X" when the multiplication is known not to overflow (nsw flag), which was disabled because it breaks 254.gap. I have informed the GAP authors of the mistake in their code, and arranged for the testsuite to use -fwrapv when compiling this benchmark. llvm-svn: 124746	2011-02-02 20:52:00 +00:00
Duncan Sands	a29ea9aa4c	Add a m_Undef pattern for convenience. This is so that code that uses pattern matching can also pattern match undef, creating a more uniform style. llvm-svn: 124657	2011-02-01 09:06:20 +00:00
Duncan Sands	4b397fcdc2	Add a m_SignBit pattern for convenience. llvm-svn: 124656	2011-02-01 08:50:33 +00:00
Duncan Sands	cf0ff030a8	Have m_One also match constant vectors for which every element is 1. llvm-svn: 124655	2011-02-01 08:39:12 +00:00
Eric Christopher	46308e666a	Reapply 124275 since the Dragonegg failure was unreproducible. llvm-svn: 124641	2011-02-01 01:16:32 +00:00
Duncan Sands	2e5a58da8f	Commit 124487 broke 254.gap. See if disabling the part that might be triggered by PR9088 fixes things. llvm-svn: 124561	2011-01-30 18:24:20 +00:00
Duncan Sands	b67edc6a29	Transform (X/Y)*Y into X if the division is exact. Instcombine already knows how to do this and more, but would only do it if X/Y had only one use. Spotted as the most common missed simplification in SPEC by my auto-simplifier, now that it knows about nuw/nsw/exact flags. This removes a bunch of multiplications from 447.dealII and 483.xalancbmk. It also removes a lot from tramp3d-v4, which results in much more inlining. llvm-svn: 124560	2011-01-30 18:03:50 +00:00
Nick Lewycky	b89d9a4412	Fix comment. llvm-svn: 124544	2011-01-29 19:55:23 +00:00
Frits van Bommel	c2549661af	Move InstCombine's knowledge of fdiv to SimplifyInstruction(). llvm-svn: 124534	2011-01-29 15:26:31 +00:00
Duncan Sands	2e9e4f1be3	Fix typo: should have been testing that X was odd, not V. llvm-svn: 124533	2011-01-29 13:27:00 +00:00
Andrew Trick	24f5ff0f23	Implementation of path profiling. Modified patch by Adam Preuss. This builds on the existing framework for block tracing, edge profiling and optimal edge profiling. See -help-hidden for new flags. For documentation, see the technical report "Implementation of Path Profiling..." in llvm.org/pubs. llvm-svn: 124515	2011-01-29 01:09:53 +00:00
Duncan Sands	e4b4d0c16d	This dyn_cast should be a cast. Pointed out by Frits van Bommel. llvm-svn: 124497	2011-01-28 18:53:08 +00:00
Duncan Sands	65995fa2a0	Thread divisions over selects and phis. This doesn't fire much and has basically zero effect on the testsuite (it improves two Ada testcases). llvm-svn: 124496	2011-01-28 18:50:50 +00:00
Duncan Sands	771e82a863	My auto-simplifier noticed that ((X/Y)Y)/Y occurs several times in SPEC benchmarks, and that it can be simplified to X/Y. (In general you can only simplify (ZY)/Y to Z if the multiplication did not overflow; if Z has the form "X/Y" then this is the case). This patch implements that transform and moves some Div logic out of instcombine and into InstructionSimplify. Unfortunately instcombine gets in the way somewhat, since it likes to change (X/Y)Y into X-(X rem Y), so I had to teach instcombine about this too. Finally, thanks to the NSW/NUW flags, sometimes we know directly that "ZY" does not overflow, because the flag says so, so I added that logic too. This eliminates a bunch of divisions and subtractions in 447.dealII, and has good effects on some other benchmarks too. It seems to have quite an effect on tramp3d-v4 but it's hard to say if it's good or bad because inlining decisions changed, resulting in massive changes all over. llvm-svn: 124487	2011-01-28 16:51:11 +00:00
Eric Christopher	cd55a46c31	Temporarily revert 124275 to see if it brings the dragonegg buildbot back. llvm-svn: 124312	2011-01-26 19:40:31 +00:00
Duncan Sands	8a33733228	APInt has a method for determining whether a number is a power of 2 which is more efficient than countPopulation - use it. llvm-svn: 124283	2011-01-26 08:44:16 +00:00
Nick Lewycky	d9e6b4a8ff	Fix memory corruption. If one of the SCEV creation functions calls another but doesn't return immediately after then the insert position in UniqueSCEVs will be out of date. No test because this is a memory corruption issue. Fixes PR9051! llvm-svn: 124282	2011-01-26 08:40:22 +00:00
Eric Christopher	078159e310	Separate out the constant bonus from the size reduction metrics. Rework a few loops accordingly. Should be no functional change. This is a step for more accurate cost/benefit analysis of devirt/inlining bonuses. llvm-svn: 124275	2011-01-26 02:58:39 +00:00
Eric Christopher	58f157a677	Coding style formatting changes. llvm-svn: 124260	2011-01-26 01:09:59 +00:00
Duncan Sands	9e9d5b25e2	In which I discover that zero+zero is zero, d'oh! llvm-svn: 124188	2011-01-25 15:14:15 +00:00
Duncan Sands	fced7620f5	See if this fixes llvm-gcc bootstrap. llvm-svn: 124184	2011-01-25 12:15:09 +00:00
Duncan Sands	d395108394	According to my auto-simplifier the most common missed simplifications in optimized code are: (non-negative number)+(power-of-two) != 0 -> true and (x \| 1) != 0 -> true Instcombine knows about the second one of course, but only does it if X\|1 has only one use. These fire thousands of times in the testsuite. llvm-svn: 124183	2011-01-25 09:38:29 +00:00
Eric Christopher	cd087f2512	Reorganize this so that the early exit and special cases come early rather than interspersed. No functional change. llvm-svn: 124168	2011-01-25 01:34:31 +00:00
Dan Gohman	0f124e1987	Give GetUnderlyingObject a TargetData, to keep it in sync with BasicAA's DecomposeGEPExpression, which recently began using a TargetData. This fixes PR8968, though the testcase is awkward to reduce. Also, update several off GetUnderlyingObject's users which happen to have a TargetData handy to pass it in. llvm-svn: 124134	2011-01-24 18:53:32 +00:00
Chris Lattner	f277b5d434	fix PR8928 by clearing a stale map, patch by Jakub Staszak! llvm-svn: 124132	2011-01-24 18:36:51 +00:00
Dan Gohman	3ac8cd614f	Add a comment. llvm-svn: 124126	2011-01-24 17:54:18 +00:00
Nick Lewycky	d4192f71b5	Simplify some code with no functionality change. Make the test a lot more robust against smarter optimizations, using the power of FileCheck. llvm-svn: 124081	2011-01-23 20:06:05 +00:00
Ted Kremenek	3c4408ceb6	Null initialize a few variables flagged by clang's -Wuninitialized-experimental warning. While these don't look like real bugs, clang's -Wuninitialized-experimental analysis is stricter than GCC's, and these fixes have the benefit of being general nice cleanups. llvm-svn: 124073	2011-01-23 17:05:06 +00:00
Nick Lewycky	bc98f5b78e	Use value ranges to fold ext(trunc) in SCEV when possible. llvm-svn: 124062	2011-01-23 06:20:19 +00:00
Nick Lewycky	b32c8943e6	Have SCEV turn sext(x) into zext(x) when x is s>= 0. This applies many times in "make check" alone. llvm-svn: 124046	2011-01-22 22:06:21 +00:00
Eric Christopher	c70e037b73	Add a FIXME explaining the move to a single indirect call bonus per function that we can change from indirect to direct. llvm-svn: 124045	2011-01-22 21:56:53 +00:00
Eric Christopher	08e8b3b629	Only apply the devirtualization bonus once instead of per-call site in the target function. Fixes part of rdar://8546196 llvm-svn: 124044	2011-01-22 21:17:33 +00:00
Duncan Sands	8fb2c3827c	At -O123 the early-cse pass is run before instcombine has run. According to my auto-simplier the transform most missed by early-cse is (zext X) != 0 -> X != 0. This patch adds this transform and some related logic to InstructionSimplify and removes some of the logic from instcombine (unfortunately not all because there are several situations in which instcombine can improve things by making new instructions, whereas instsimplify is not allowed to do this). At -O2 this often results in more than 15% more simplifications by early-cse, and results in hundreds of lines of bitcode being eliminated from the testsuite. I did see some small negative effects in the testsuite, for example a few additional instructions in three programs. One program, 483.xalancbmk, got an additional 35 instructions, which seems to be due to a function getting an additional instruction and then being inlined all over the place. llvm-svn: 123911	2011-01-20 13:21:55 +00:00
Nick Lewycky	5c901f3489	Similarly, analyze truncate through multiply. llvm-svn: 123842	2011-01-19 18:56:00 +00:00
Nick Lewycky	5143f0f09b	Add a missed SCEV fold that is required to continue analyzing the IR produced by indvars through the scev expander. trunc(add x, y) --> add(trunc x, y). Currently SCEV largely folds the other way which is probably wrong, but preserved to minimize churn. Instcombine doesn't do this fold either, demonstrating a missed optz'n opportunity on code doing add+trunc+add. llvm-svn: 123838	2011-01-19 16:59:46 +00:00
Nick Lewycky	e9ea75e3fc	Add a missing SCEV simplification sext(zext x) --> zext x. llvm-svn: 123832	2011-01-19 15:56:12 +00:00
Dan Gohman	44da55b7be	Teach BasicAA to return PartialAlias in cases where both pointers are pointing to the same object, one pointer is accessing the entire object, and the other is access has a non-zero size. This prevents TBAA from kicking in and saying NoAlias in such cases. llvm-svn: 123775	2011-01-18 21:16:06 +00:00
Duncan Sands	99589d07e9	For completeness, generalize the (X + Y) - Y -> X transform and add X - (X + 1) -> -1. These were not recommended by my auto-simplifier since they don't fire often enough. However they do fire from time to time, for example they remove one subtraction from the final bitcode for 483.xalancbmk. llvm-svn: 123755	2011-01-18 11:50:19 +00:00
Duncan Sands	9b8e2bd8ef	Simplify (X<<1)-X into X. According to my auto-simplier this is the most common missed simplification in fully optimized code. It occurs sporadically in the testsuite, and many times in 403.gcc: the final bitcode has 131 fewer subtractions after this change. The reason that the multiplies are not eliminated is the same reason that instcombine did not catch this: they are used by other instructions (instcombine catches this with a more general transform which in general is only profitable if the operands have only one use). llvm-svn: 123754	2011-01-18 09:24:58 +00:00
Cameron Zwarich	6b0c4c9b6c	Move DominanceFrontier from VMCore to Analysis. llvm-svn: 123747	2011-01-18 06:06:27 +00:00
Chris Lattner	08f43456c9	fix PR8983, a broken assertion. llvm-svn: 123562	2011-01-16 03:43:53 +00:00
Nick Lewycky	367f98f000	Teach LazyValueInfo that allocas aren't NULL. Over all of llvm-test, this saves half a million non-local queries, each of which would otherwise have triggered a linear scan over a basic block. Also fix a fixme for memory intrinsics which dereference pointers. With this, we prove that a pointer is non-null because it was dereferenced by an intrinsic 112 times in llvm-test. llvm-svn: 123533	2011-01-15 09:16:12 +00:00
Duncan Sands	d6f1a9584d	Turn X-(X-Y) into Y. According to my auto-simplifier this is the most common simplification present in fully optimized code (I think instcombine fails to transform some of these when "X-Y" has more than one use). Fires here and there all over the test-suite, for example it eliminates 8 subtractions in the final IR for 445.gobmk, 2 subs in 447.dealII, 2 in paq8p etc. llvm-svn: 123442	2011-01-14 15:26:10 +00:00
Duncan Sands	571fd9a606	Factorize common code out of the InstructionSimplify shift logic. Add in threading of shifts over selects and phis while there. This fires here and there in the testsuite, to not much effect. For example when compiling spirit it fires 5 times, during early-cse, resulting in 6 more cse simplifications, and 3 more terminators being folded by jump threading, but the final bitcode doesn't change in any interesting way: other optimizations would have caught the opportunity anyway, only later. llvm-svn: 123441	2011-01-14 14:44:12 +00:00
Duncan Sands	7f60dc1eb0	Move some shift transforms out of instcombine and into InstructionSimplify. While there, I noticed that the transform "undef >>a X -> undef" was wrong. For example if X is 2 then the top two bits must be equal, so the result can not be anything. I fixed this in the constant folder as well. Also, I made the transform for "X << undef" stronger: it now folds to undef always, even though X might be zero. This is in accordance with the LangRef, but I must admit that it is fairly aggressive. Also, I added "i32 X << 32 -> undef" following the LangRef and the constant folder, likewise fairly aggressive. llvm-svn: 123417	2011-01-14 00:37:45 +00:00
Tobias Grosser	b1d11c19da	Add single entry / single exit accessors. Add methods for accessing the (single) entry / exit edge of a region. If no such edge exists, null is returned. Both accessors return the start block of the corresponding edge. The edge can finally be formed by utilizing Region::getEntry() or Region::getExit(); Contributed by: Andreas Simbuerger <simbuerg@fim.uni-passau.de> llvm-svn: 123410	2011-01-13 23:18:04 +00:00
Duncan Sands	ad000d8f16	Remove some wrong code which fortunately was never executed (as explained in the comment I added): an extern weak global may have a null address. llvm-svn: 123373	2011-01-13 10:43:08 +00:00
Duncan Sands	8d25a7c3a0	The most common simplification missed by instsimplify in unoptimized bitcode is "X != 0 -> X" when X is a boolean. This occurs a lot because of the way llvm-gcc converts gcc's conditional expressions. Add this, and a few other similar transforms for completeness. llvm-svn: 123372	2011-01-13 08:56:29 +00:00
Chris Lattner	d30de95520	some comment improvements. llvm-svn: 123243	2011-01-11 17:11:59 +00:00
Eric Christopher	23bf3bafb7	Temporarily revert 123133, it's causing some regressions and I'm trying to get a testcase. llvm-svn: 123225	2011-01-11 09:02:09 +00:00
Chris Lattner	23109cb319	the GEP faq says that only inbounds geps are guaranteed to not overflow. llvm-svn: 123218	2011-01-11 06:44:41 +00:00
Jakob Stoklund Olesen	087f207009	Revert r123207: "Turn on memdep's verifyRemoved() in an attempt to smoke out the cause of our gcc bootstrap miscompare." It didn't. llvm-svn: 123215	2011-01-11 04:05:39 +00:00
Jakob Stoklund Olesen	9b6853efd6	Turn on memdep's verifyRemoved() in an attempt to smoke out the cause of our gcc bootstrap miscompare. llvm-svn: 123207	2011-01-11 01:18:03 +00:00
Chandler Carruth	b1e7f557b7	Teach constant folding to perform conversions from constant floating point values to their integer representation through the SSE intrinsic calls. This is the last part of a README.txt entry for which I have real world examples. llvm-svn: 123206	2011-01-11 01:07:24 +00:00
Chandler Carruth	352d9b14b3	Cleanup some of the constant folding code to consistently test intrinsic IDs when available rather than using a mixture of IDs and textual name comparisons. llvm-svn: 123165	2011-01-10 09:02:58 +00:00
Chris Lattner	67f82314af	add a fixme: ir isn't expressive enough. llvm-svn: 123139	2011-01-09 23:02:10 +00:00
Chris Lattner	28f140a33e	Step #4 in improving trip count analysis: HowFarToZero can analyze NUW AddRec's much more aggressively. We now get a trip count for @test2 in nsw.ll llvm-svn: 123138	2011-01-09 22:58:47 +00:00
Chris Lattner	dff679f4b6	rearrange some code, no functionality change. llvm-svn: 123136	2011-01-09 22:39:48 +00:00
Chris Lattner	a44274cb4f	Step #3 to improving trip count analysis: If we fold a + {b,+,stride} into {a+b,+,stride} (because a is LIV), then the resultant AddRec is NUW/NSW if the client says it is. llvm-svn: 123133	2011-01-09 22:31:26 +00:00
Chris Lattner	fc87752d55	Step #2 to improve trip count analysis for loops like this: void f(int* begin, int* end) { std::fill(begin, end, 0); } which turns into a != exit expression where one pointer is strided and (thanks to step #1) known to not overflow, and the other is loop invariant. The observation here is that, though the IV is strided by 4 in this case, that the IV has to become equal to the end value. It cannot "miss" the end value by stepping over it, because if it did, the strided IV expression would eventually wrap around. Handle this by turning A != B into "A-B != 0" where the A-B part is known to be NUW. llvm-svn: 123131	2011-01-09 22:26:35 +00:00
Chris Lattner	10223a3fbf	teach SCEV analysis of PHI nodes that PHI recurences formed with GEP instructions are always NUW, because PHIs cannot wrap the end of the address space. llvm-svn: 123105	2011-01-09 02:28:48 +00:00
Chris Lattner	a337f5ec5c	reduce indentation. Print <nuw> and <nsw> when dumping SCEV AddRec's that have the bit set. llvm-svn: 123104	2011-01-09 02:16:18 +00:00
Chris Lattner	171608e738	use isNullValue() to simplify code, add an assert. llvm-svn: 122977	2011-01-06 22:24:29 +00:00
Chris Lattner	5858e091a6	implement constant folding support for an exotic constant expr: ret i64 ptrtoint (i8* getelementptr ([1000 x i8]* @X, i64 1, i64 sub (i64 0, i64 ptrtoint ([1000 x i8]* @X to i64))) to i64) to "ret i64 1000". This allows us to correctly compute the trip count on a loop in PR8883, which occurs with std::fill on a char array. This allows us to transform it into a memset with a constant size. llvm-svn: 122950	2011-01-06 06:19:46 +00:00
Owen Anderson	6f060afbbd	Reorder, rename, and document some members to make this easier to follow. llvm-svn: 122929	2011-01-05 23:26:22 +00:00
Owen Anderson	e86dacf449	When computing the value on an edge, in certain cases LVI would fail to compute the value range in the predecessor block, leading to an incorrect conclusion for the edge value. Found by inspection. llvm-svn: 122908	2011-01-05 21:37:18 +00:00
Owen Anderson	118ac80c81	Re-convert several of LazyValueInfo's internal maps to Dense{Map\|Set}, and fix the issue in hasBlockValue() that was causing iterator invalidations. Many thanks to Dimitry Andric for tracking down those invalidations! llvm-svn: 122906	2011-01-05 21:15:29 +00:00
Chris Lattner	c86e67e110	fix an off-by-one bug that caused a crash analyzing ashr's with huge shift amounts, PR8896 llvm-svn: 122814	2011-01-04 18:19:15 +00:00
Owen Anderson	d62d37225a	Use the new addEscapingValue callback to update GlobalsModRef when GVN adds PHIs of GEPs. For the moment, have GlobalsModRef handle this conservatively by simply removing the value from its maps. llvm-svn: 122787	2011-01-03 23:51:43 +00:00
Owen Anderson	b6e4ff0d85	Stub out a new updating interface to AliasAnalysis, allowing stateful analyses to be informed when a pointer value has potentially become escaping. Implementations can choose to either fall back to conservative responses for that value, or may recompute their analysis to accomodate the change. llvm-svn: 122777	2011-01-03 21:38:41 +00:00
Chris Lattner	16e42128c2	fix rdar://8813415 - a miscompilation of 164.gzip that loop-idiom exposed. It turns out to be a latent bug in basicaa, scary. llvm-svn: 122772	2011-01-03 21:03:33 +00:00
Nick Lewycky	0f87ca7733	Add spliceFunction to the CallGraph interface. This allows users to efficiently update a callGraph when performing the common operation of splicing the body to a new function and updating all callers (such as via RAUW). No users yet, though this is intended for DeadArgumentElimination as part of PR8887. llvm-svn: 122728	2011-01-03 03:19:35 +00:00
Chris Lattner	bf0aa927cc	split dom frontier handling stuff out to its own DominanceFrontier header, so that Dominators.h is just domtree. Also prune #includes a bit. llvm-svn: 122714	2011-01-02 22:09:33 +00:00
Duncan Sands	772749aea1	Revert commit 122654 at the request of Chris, who reckons that instsimplify is the wrong hammer for this nail, and is probably right. llvm-svn: 122661	2011-01-01 20:08:02 +00:00
Duncan Sands	e3c539581c	Fix a README item by having InstructionSimplify do a mild form of value numbering, in which it considers (for example) "%a = add i32 %x, %y" and "%b = add i32 %x, %y" to be equal because the operands are equal and the result of the instructions only depends on the values of the operands. This has almost no effect (it removes 4 instructions from gcc-as-one-file), and perhaps slows down compilation: I measured a 0.4% slowdown on the large gcc-as-one-file testcase, but it wasn't statistically significant. llvm-svn: 122654	2011-01-01 16:12:09 +00:00
Benjamin Kramer	b6d52b8b64	Cast away "comparison between signed and unsigned integer" warnings. llvm-svn: 122598	2010-12-28 13:52:52 +00:00
Chris Lattner	9cb1035f94	move isBytewiseValue out to ValueTracking.h/cpp llvm-svn: 122565	2010-12-26 20:15:01 +00:00
Jeffrey Yasskin	9b43f33620	Change all self assignments X=X to (void)X, so that we can turn on a new gcc warning that complains on self-assignments and self-initializations. llvm-svn: 122458	2010-12-23 00:58:24 +00:00
Duncan Sands	a45cfbd405	When determining whether the new instruction was already present in the original instruction, half the cases were missed (making it not wrong but suboptimal). Also correct a typo (A <-> B) in the second chunk. llvm-svn: 122414	2010-12-22 17:15:25 +00:00
Duncan Sands	3547d2ebd8	Add some statistics, good for understanding how much more powerful instcombine is compared to instsimplify. llvm-svn: 122397	2010-12-22 09:40:51 +00:00
Duncan Sands	fecc642224	While I don't think any later transforms can fire, it seems cleaner to not assume this (for example in case more transforms get added below it). Suggested by Frits van Bommel. llvm-svn: 122332	2010-12-21 15:03:43 +00:00
Duncan Sands	5def0d6791	Fix inverted condition noticed by Frits van Bommel. llvm-svn: 122331	2010-12-21 14:48:48 +00:00
Duncan Sands	d0eb6d39f8	Pull a few more simplifications out of instcombine (there are still plenty left though!), in particular for multiplication. llvm-svn: 122330	2010-12-21 14:00:22 +00:00
Duncan Sands	ee3ec6eb94	Teach InstructionSimplify about distributive laws. These transforms fire quite often, but don't make much difference in practice presumably because instcombine also knows them and more. llvm-svn: 122328	2010-12-21 13:32:22 +00:00
Duncan Sands	f64e690c4f	Move checking of the recursion limit into the various Thread methods. No functionality change. llvm-svn: 122327	2010-12-21 09:09:15 +00:00
Duncan Sands	6c7a52cf80	Add generic simplification of associative operations, generalizing a couple of existing transforms. This fires surprisingly often, for example when compiling gcc "(X+(-1))+1->X" fires quite a lot as well as various "and" simplifications (usually with a phi node operand). Most of the time this doesn't make a real difference since the same thing would have been done elsewhere anyway, eg: by instcombine, but there are a few places where this results in simplifications that we were not doing before. llvm-svn: 122326	2010-12-21 08:49:00 +00:00
Owen Anderson	c6beda80ff	Speculatively revert the use of DenseMap in LazyValueInfo, which may be causing Linux self-host failures. llvm-svn: 122291	2010-12-20 23:53:19 +00:00
Owen Anderson	9be3ec6264	Attempt to appease the DragonEgg buildbots. llvm-svn: 122288	2010-12-20 23:23:18 +00:00
Owen Anderson	813a2c45a8	Convert one of LVI's primary maps to a DenseMap, now that we know are more assured of iterator stability. llvm-svn: 122273	2010-12-20 21:30:54 +00:00
Owen Anderson	d83f98a51e	More LVI cleanups, including trying to simplify the process of maintaining the OverDefinedCache. llvm-svn: 122256	2010-12-20 19:33:41 +00:00
Owen Anderson	64c2c5798a	Reuse the reference into the LVI cache throughout the solver subsystem. This is much easier to verify as being safe thanks its recent de-recursivization. llvm-svn: 122254	2010-12-20 18:18:16 +00:00
Duncan Sands	ed6d6c33dd	Have SimplifyBinOp dispatch Xor, Add and Sub to the corresponding methods (they had just been forgotten before). Adding Xor causes "main" in the existing testcase 2010-11-01-lshr-mask.ll to be hugely more simplified. llvm-svn: 122245	2010-12-20 14:47:04 +00:00
Nick Lewycky	55a700b0cf	Make LazyValueInfo non-recursive. llvm-svn: 122120	2010-12-18 01:00:40 +00:00
Nate Begeman	7aa18bf46a	Add vector versions of some existing scalar transforms to aid codegen in matching psign & pblend operations to the IR produced by clang/gcc for their C idioms. llvm-svn: 122105	2010-12-17 23:12:19 +00:00
Dan Gohman	91ab4ffd96	Update a comment. llvm-svn: 121946	2010-12-16 02:55:10 +00:00
Dan Gohman	e1a17a3473	Make memcpyopt TBAA-aware. llvm-svn: 121944	2010-12-16 02:51:19 +00:00
Dan Gohman	2c9d342f04	Enable TBAA by default. llvm-svn: 121923	2010-12-15 23:58:44 +00:00
Dan Gohman	05b18f143f	Reapply r121886, and also update DecomposeGEPExpression to keep it in sync. llvm-svn: 121895	2010-12-15 20:49:55 +00:00
Dan Gohman	d02b65982e	Revert r121886. DecomposeGEPExpression needs to be kept in sync. llvm-svn: 121892	2010-12-15 20:39:25 +00:00
Dan Gohman	949ab7889c	Strengthen GetUnderlyingObject using InstructionSimplify. While LLVM's main design is that analysis code shouldn't go out of its way to understand code which hasn't been InstCombined, analysis utility routines like this can find themselves being called in the middle of transform passes when instcombine hasn't had a chance to run. llvm-svn: 121886	2010-12-15 20:10:26 +00:00
Dan Gohman	a4fcd2418d	Move Value::getUnderlyingObject to be a standalone function so that it can live in Analysis instead of VMCore. llvm-svn: 121885	2010-12-15 20:02:24 +00:00
Nick Lewycky	11678bd299	Clean up some of LVI: * mergeIn now uses constant folding for constants that are provably not-equal. * sink some sanity checks from the get() methods into the mark() methods, to ensure that we never have a constant/notconstant ConstantInt * some textual cleanups, whitespace changes, removing "else" after return, that sort of thing. llvm-svn: 121877	2010-12-15 18:57:18 +00:00
Duncan Sands	0a2c416894	Move Sub simplifications and additional Add simplifications out of instcombine and into InstructionSimplify. llvm-svn: 121861	2010-12-15 14:07:39 +00:00
Duncan Sands	019a418808	If we detect that the instruction we are simplifying is unreachable, arrange for it to be replaced by undef rather than not replaced at all, the idea being that this may reduce the amount of work done by whoever called InstructionSimplify. llvm-svn: 121860	2010-12-15 11:02:22 +00:00
Dan Gohman	3cb55a1d23	Update a comment. llvm-svn: 121727	2010-12-13 22:53:18 +00:00
Dan Gohman	c4bf5cac9f	Reapply r121520, PartialAlias implementation for BasicAA, now that memdep is updated to handle it. llvm-svn: 121725	2010-12-13 22:50:24 +00:00
Dan Gohman	ba5d0abe39	Update memdep to handle PartialAlias as MayAlias. llvm-svn: 121723	2010-12-13 22:47:57 +00:00
Tobias Grosser	f3e1ada522	Remove useless dynamic_cast<>(). Thanks Peter for pointing me to something that should have never been committed to the llvm code base. llvm-svn: 121648	2010-12-12 21:58:28 +00:00
Dan Gohman	39de62348f	Revert r121520, which may have introduced miscompilations. llvm-svn: 121573	2010-12-10 21:48:28 +00:00
Dan Gohman	041f74e762	Implement PartialAlias checking in BasicAA. llvm-svn: 121520	2010-12-10 20:47:03 +00:00
Dan Gohman	704e7c2332	Minimally update this code to handle PartialAlias. llvm-svn: 121518	2010-12-10 20:14:49 +00:00
Dan Gohman	201acdb6db	Use PartialAlias to do better noalias lint checking. llvm-svn: 121514	2010-12-10 20:04:06 +00:00
Dan Gohman	4431e31df0	Teach AliasAnalysisCounter about PartialAlias. llvm-svn: 121513	2010-12-10 19:53:05 +00:00
Dan Gohman	105d60a5ef	Teach AliasAnalysisEvaluator about PartialAlias. llvm-svn: 121512	2010-12-10 19:52:40 +00:00
Dan Gohman	fb0a3754f5	Update this code to handle PartialAlias as MayAlias. llvm-svn: 121508	2010-12-10 19:40:47 +00:00
Owen Anderson	c7ed4dc932	Take the first step towards making LVI non-recursive: get rid of the LVIQuery abstraction. llvm-svn: 121357	2010-12-09 06:14:58 +00:00
Devang Patel	8817135cb9	Use type's file info while describing inheritance relationship. llvm-svn: 121289	2010-12-08 21:46:37 +00:00
Devang Patel	b68c6231e9	Add support to create debug info for functions and methods. llvm-svn: 121281	2010-12-08 20:42:44 +00:00
Devang Patel	81c3c87717	Add support to create class type. llvm-svn: 121279	2010-12-08 20:18:20 +00:00
Devang Patel	89ea4f27a8	Add support to create vector, array, enums etc... llvm-svn: 121224	2010-12-08 01:50:15 +00:00
Devang Patel	dd261afdd9	Global variable does not need linkage name. llvm-svn: 121212	2010-12-08 00:06:22 +00:00
Devang Patel	63f83cd861	Add support to create local variable's debug info. llvm-svn: 121211	2010-12-07 23:58:00 +00:00
Devang Patel	746660fc7b	Add support to create variables, structs etc.. using DIBuilder. This is still work in progress. llvm-svn: 121205	2010-12-07 23:25:47 +00:00
Jay Foad	583abbc4df	PR5207: Change APInt methods trunc(), sext(), zext(), sextOrTrunc() and zextOrTrunc(), and APSInt methods extend(), extOrTrunc() and new method trunc(), to be const and to return a new value instead of modifying the object in place. llvm-svn: 121120	2010-12-07 08:25:19 +00:00
Jakob Stoklund Olesen	8bdfb0c166	Also inore '()' while creating mdnode name from ObjC symbol name. llvm-svn: 120856	2010-12-03 23:40:45 +00:00
Devang Patel	f0227ccf3f	Ignore '+' while creating mdnode name from ObjC symbol name. llvm-svn: 120853	2010-12-03 23:29:30 +00:00
Jay Foad	25a5e4ca1f	PR5207: Rename overloaded APInt methods set(), clear(), flip() to setAllBits(), setBit(unsigned), etc. llvm-svn: 120564	2010-12-01 08:53:58 +00:00
Chris Lattner	e28618de59	move GetPointerBaseWithConstantOffset out of GVN into ValueTracking.h llvm-svn: 120476	2010-11-30 22:25:26 +00:00
Jay Foad	15084f085d	PR5207: Make APInt::set(), APInt::clear() and APInt::flip() return void. llvm-svn: 120413	2010-11-30 09:02:01 +00:00
Chris Lattner	d540a5d842	strength reduce this. llvm-svn: 120381	2010-11-30 01:56:13 +00:00
Chris Lattner	afbc0c2b8c	getLocationForDest should work for memset as well. llvm-svn: 120380	2010-11-30 01:48:20 +00:00
Chris Lattner	90c4947df7	enhance basicaa to return "Mod" for a memcpy call when the queried location doesn't overlap the source, and add a testcase. llvm-svn: 120370	2010-11-30 00:43:16 +00:00
Chris Lattner	9a146372b5	Teach basicaa that memset's modref set is at worst "mod" and never contains "ref". Enhance DSE to use a modref query instead of a store-specific hack to generalize the "ignore may-alias stores" optimization to handle memset and memcpy. llvm-svn: 120368	2010-11-30 00:28:45 +00:00
Frits van Bommel	a98214de10	Teach ConstantFoldInstruction() how to fold insertvalue and extractvalue. llvm-svn: 120316	2010-11-29 20:36:52 +00:00
Michael J. Spencer	447762da85	Merge System into Support. llvm-svn: 120298	2010-11-29 18:16:10 +00:00
Chandler Carruth	abcab28f9b	Add some dead stores to pacify my least favorite GCC warning: may be uninitialized. The warning is terrible, has incorrect source locations, and has a huge false positive rate such as all of these. If anyone has a better solution, please let me know. Alternatively, I'll happily add -Wno-uninitialized to the -Werror build mode. Maybe I can even do it only when building with GCC instead of Clang. llvm-svn: 120281	2010-11-29 01:41:13 +00:00
Duncan Sands	a021988d64	Expand a little on the description of what InstructionSimplify does. llvm-svn: 120016	2010-11-23 10:50:08 +00:00
Duncan Sands	763dec0ab8	Clarify that constant folding of instructions applies when all operands are constant. There was in fact one exception to this (phi nodes) - so remove that exception (InstructionSimplify handles this so there should be no loss). llvm-svn: 120015	2010-11-23 10:16:18 +00:00
Duncan Sands	c133c54426	If a GEP index simply advances by multiples of a type of zero size, then replace the index with zero. llvm-svn: 119974	2010-11-22 16:32:50 +00:00
Duncan Sands	8a0f486e36	Move the "gep undef" -> "undef" transform from instcombine to InstructionSimplify. llvm-svn: 119970	2010-11-22 13:42:49 +00:00
Benjamin Kramer	585dfa2b3d	Initialize MemDep's TD member so buildbots don't trip over an uninitialized pointer (TD is passed to PHITransAddr). I wonder why this didn't explode earlier. llvm-svn: 119944	2010-11-21 15:21:46 +00:00
Duncan Sands	cf4bceba49	Add a rather pointless InstructionSimplify transform, inspired by recent constant folding improvements: if P points to a type of size zero, turn "gep P, N" into "P". More generally, if a gep index type has size zero, instcombine could replace the index with zero, but that is not done here. llvm-svn: 119942	2010-11-21 13:53:09 +00:00
Duncan Sands	1f86be9164	Fix spelling. llvm-svn: 119941	2010-11-21 12:43:13 +00:00
Chris Lattner	6ce038082b	apply Dan's fix for PR8268 which allows constant folding to handle indexes over zero sized elements. This allows us to compile: #include <string> void foo() { std::string s; } into an empty function. llvm-svn: 119933	2010-11-21 08:39:01 +00:00
Chris Lattner	663ba91cc6	add "getLocation" method to AliasAnalysis for getting the source and destination location of a memcpy/memmove. I'm not clear about whether TBAA works on these, so I'm leaving it out for now. Dan, please revisit this when convenient. llvm-svn: 119928	2010-11-21 07:51:27 +00:00
Chris Lattner	e48c31ce33	implement PR8576, deleting dead stores with intervening may-alias stores. llvm-svn: 119927	2010-11-21 07:34:32 +00:00
Benjamin Kramer	ddd1b7b801	Simplify code. No change in functionality. llvm-svn: 119908	2010-11-20 18:43:35 +00:00
Benjamin Kramer	c77ebcc9a5	Silence warning about an uninitialized variable. llvm-svn: 119800	2010-11-19 11:37:26 +00:00
Duncan Sands	b238de0415	Remove threading of Xor over selects and phis, with an explanation of why such threading is pointless. llvm-svn: 119798	2010-11-19 09:20:39 +00:00
Duncan Sands	aef146b890	Factor code for testing whether replacing one value with another preserves LCSSA form out of ScalarEvolution and into the LoopInfo class. Use it to check that SimplifyInstruction simplifications are not breaking LCSSA form. Fixes PR8622. llvm-svn: 119727	2010-11-18 19:59:41 +00:00
Dan Gohman	f1ebfc1544	Strip trailing whitespace. llvm-svn: 119706	2010-11-18 17:06:31 +00:00
Dan Gohman	0ab28b62b1	Use llvm_unreachable for "impossible" situations. llvm-svn: 119705	2010-11-18 17:05:57 +00:00
Dan Gohman	2e1fc849b2	Add support for PHI-translating sext, zext, and trunc instructions, enabling more PRE. PR8586. llvm-svn: 119704	2010-11-18 17:05:13 +00:00
Dan Gohman	8ea83d81e0	Introduce memoization for ScalarEvolution dominates and properlyDominates queries, and SCEVExpander getRelevantLoop queries. llvm-svn: 119595	2010-11-18 00:34:22 +00:00
Dan Gohman	7e6b393e66	Factor out the code for purging a SCEV from all the various memoization maps. Some of these maps may merge in the future, but for now it's convenient to have a utility function for them. llvm-svn: 119587	2010-11-17 23:28:48 +00:00
Dan Gohman	7ee1bbb76c	Merge the implementations of isLoopInvariant and hasComputableLoopEvolution, and memoize the results. This improves compile time in code which highly complex expressions which get queried many times. llvm-svn: 119584	2010-11-17 23:21:44 +00:00
Dan Gohman	534749bf70	Make SCEV::getType() and SCEV::print non-virtual. Move SCEV::hasOperand to ScalarEvolution. Delete SCEV::~SCEV. SCEV is no longer virtual. llvm-svn: 119578	2010-11-17 22:27:42 +00:00
Dan Gohman	20d9ce21ef	Move SCEV::dominates and properlyDominates to ScalarEvolution. llvm-svn: 119570	2010-11-17 21:41:58 +00:00
Dan Gohman	afd6db9932	Move SCEV::isLoopInvariant and hasComputableLoopEvolution to be member functions of ScalarEvolution, in preparation for memoization and other optimizations. llvm-svn: 119562	2010-11-17 21:23:15 +00:00
Duncan Sands	39d77131a1	Before replacing a phi node with a different value, it needs to be checked that this won't break LCSSA form. Change the existing checking method to a more direct one: rather than seeing if all predecessors belong to the loop, check that the replacing value is either not in any loop or is in a loop that contains the phi node. llvm-svn: 119556	2010-11-17 20:49:12 +00:00
Dan Gohman	d3a32ae4c8	Verify SCEVAddRecExpr's invariant in ScalarEvolution::getAddRecExpr instead of in SCEVAddRecExpr's constructor, in preparation for an upcoming change. llvm-svn: 119554	2010-11-17 20:48:38 +00:00
Dan Gohman	ed75631743	Fix ScalarEvolution's range memoization to avoid using a default ctor with ConstantRange. llvm-svn: 119550	2010-11-17 20:23:08 +00:00
Duncan Sands	c89ac07e7a	Move some those Xor simplifications which don't require creating new instructions out of InstCombine and into InstructionSimplify. While there, introduce an m_AllOnes pattern to simplify matching with integers and vectors with all bits equal to one. llvm-svn: 119536	2010-11-17 18:52:15 +00:00
Duncan Sands	ec7a6ecb92	Now that hasConstantValue has been made simpler, it may return the phi node itself if it occurs in an unreachable basic block. Protect against this. Hopefully this will fix some more buildbots. llvm-svn: 119493	2010-11-17 10:23:23 +00:00
Duncan Sands	64e41cf865	Previously SimplifyInstruction could report that an instruction simplified to itself (this can only happen in unreachable blocks). Change it to return null instead. Hopefully this will fix some buildbot failures. llvm-svn: 119490	2010-11-17 08:35:29 +00:00
Duncan Sands	7412f6e53d	Fix a layering violation: hasConstantValue, which is part of the PHINode class, uses DominatorTree which is an analysis. This change moves all of the tricky hasConstantValue logic to SimplifyInstruction, and replaces it with a very simple literal implementation. I already taught users of hasConstantValue that need tricky stuff to use SimplifyInstruction instead. I didn't update InlineFunction because the IR looks like it might be in a funky state at the point it calls hasConstantValue, which makes calling SimplifyInstruction dangerous since it can in theory do a lot of tricky reasoning. This may be a pessimization, for example in the case where all phi node operands are either undef or a fixed constant. llvm-svn: 119459	2010-11-17 04:30:22 +00:00
Duncan Sands	d06f50e2db	Have ScalarEvolution use SimplifyInstruction rather than hasConstantValue. While there, add a note about an inefficiency I noticed. llvm-svn: 119458	2010-11-17 04:18:45 +00:00
Dan Gohman	761065e3b7	Memoize results from ScalarEvolution's getUnsignedRange and getSignedRange. This fixes some extreme compile times on unrolled sha512 code. llvm-svn: 119455	2010-11-17 02:44:44 +00:00

... 6 7 8 9 10 ...

4274 Commits