llvm-project

Commit Graph

Author	SHA1	Message	Date
Nuno Lopes	4b47f82ac2	revert r171306, since we cannot compare APInts with different bitwidths llvm-svn: 171308	2012-12-31 18:01:36 +00:00
Nuno Lopes	69dcc7deec	use ValueTracking's GetPointerBaseWithConstantOffset() function instead of a local implementation llvm-svn: 171307	2012-12-31 17:42:11 +00:00
Nuno Lopes	556b7de2c0	minor code simplification llvm-svn: 171306	2012-12-31 17:25:24 +00:00
Nuno Lopes	e9d6dbf7a2	add support for GlobalAlias to ObjectSizeOffsetVisitor llvm-svn: 171303	2012-12-31 16:23:48 +00:00
Nuno Lopes	7ab7c02d23	add support for PHI nodes to ObjectSizeOffsetVisitor llvm-svn: 171298	2012-12-31 13:52:36 +00:00
Nuno Lopes	b6ad98224a	convert a bunch of callers from DataLayout::getIndexedOffset() to GEP::accumulateConstantOffset(). The latter API is nicer than the former, and is correct regarding wrap-around offsets (if anyone cares). There are a few more places left with duplicated code, which I'll remove soon. llvm-svn: 171259	2012-12-30 16:25:48 +00:00
Bill Wendling	698e84fc4f	Remove the Function::getFnAttributes method in favor of using the AttributeSet directly. This is in preparation for removing the use of the 'Attribute' class as a collection of attributes. That will shift to the AttributeSet class instead. llvm-svn: 171253	2012-12-30 10:32:01 +00:00
Chandler Carruth	405d681340	Nuke some dead code that snuck in somehow. I thought I had already deleted this, but apparently not. Charmingly, Clang didn't warn on it but GCC did. llvm-svn: 171197	2012-12-28 14:50:51 +00:00
Chandler Carruth	86ed53089f	Fix a stunning oversight in the inline cost analysis. It was never propagating one of the values it simplified to a constant across a myriad of instructions. Notably, ptrtoint instructions when we had a constant pointer (say, 0) didn't propagate that, blocking a massive number of down-stream optimizations. This was uncovered when investigating why we fail to inline and delete the boilerplate in: void f() { std::vector<int> v; v.push_back(1); } It turns out most of the efforts I've made thus far to improve the analysis weren't making it far purely because of this. After this is fixed, the store-to-load forwarding patch enables LLVM to optimize the above to an empty function. We still can't nuke a second push_back, but for different reasons. There is a very real chance this will cause somewhat noticeable changes in inlining behavior, so please let me know if you see regressions (or improvements!) because of this patch. llvm-svn: 171196	2012-12-28 14:43:42 +00:00
Chandler Carruth	753e21d057	Teach the inline cost analysis about calls that can be simplified and how to propagate constants through insert and extract value instructions. With the recent improvements to instsimplify, this allows inline cost analysis to constant fold through intrinsic functions, including notably the with.overflow intrinsic math routines which often show up inside of STL abstractions. This is yet another piece in the puzzle of breaking down the code for: void f() { std::vector<int> v; v.push_back(1); } But it still isn't enough. There are a pile of bugs in inline cost still blocking this. llvm-svn: 171195	2012-12-28 14:23:32 +00:00
Chandler Carruth	f6182155f6	Teach instsimplify to use the constant folder where appropriate for constant folding calls. Add the initial tests for this which show that now instsimplify can simplify blindingly obvious code patterns expressed with both intrinsics and library calls. llvm-svn: 171194	2012-12-28 14:23:29 +00:00
Chandler Carruth	9dc3558920	Add entry points to instsimplify for simplifying calls. The entry points are nice and decomposed so that we can simplify synthesized calls as easily as actual call instructions. The internal utility still has the same behavior, it just now operates on a more generic interface so that I can extend the set of call simplifications that instsimplify knows about. llvm-svn: 171189	2012-12-28 11:30:55 +00:00
Bob Wilson	4ed23578da	Add LLVMContext::emitWarning methods and use them. <rdar://problem/12867368> When the backend is used from clang, it should produce proper diagnostics instead of just printing messages to errs(). Other clients may also want to register their own error handlers with the LLVMContext, and the same handler should work for warnings in the same way as the existing emitError methods. llvm-svn: 171041	2012-12-24 18:15:21 +00:00
Nadav Rotem	99868e4f9d	Update the docs of the cost model. llvm-svn: 171016	2012-12-24 05:51:12 +00:00
Craig Topper	1bef2c859f	Remove trailing whitespace. llvm-svn: 170991	2012-12-22 19:15:35 +00:00
James Molloy	4f6fb953a7	Add a new attribute, 'noduplicate'. If a function contains a noduplicate call, the call cannot be duplicated - Jump threading, loop unrolling, loop unswitching, and loop rotation are inhibited if they would duplicate the call. Similarly inlining of the function is inhibited, if that would duplicate the call (in particular inlining is still allowed when there is only one callsite and the function has internal linkage). llvm-svn: 170704	2012-12-20 16:04:27 +00:00
Nadav Rotem	11350aafb4	Fix a bug that was found by building clang with -fsanitize. I introduced it in r166785. PR14291. If TD is unavailable use getScalarSizeInBits, but don't optimize pointers or vectors of pointers. llvm-svn: 170586	2012-12-19 20:47:04 +00:00
Bill Wendling	3d7b0b8ac7	Rename the 'Attributes' class to 'Attribute'. It's going to represent a single attribute in the future. llvm-svn: 170502	2012-12-19 07:18:57 +00:00
Nadav Rotem	aa3e2a907e	Fix a crash in ValueTracking on vectors of pointers. llvm-svn: 170240	2012-12-14 20:43:49 +00:00
Rafael Espindola	319f74cd11	Rename isPowerOfTwo to isKnownToBeAPowerOfTwo. In a previous thread it was pointed out that isPowerOfTwo is not a very precise name since it can return false for powers of two if it is unable to show that they are powers of two. llvm-svn: 170093	2012-12-13 03:37:24 +00:00
Rafael Espindola	e40238069e	The TargetData is not used for the isPowerOfTwo determination. It has never been used in the first place. It simply was passed to the function and to the recursive invocations. Simply drop the parameter and update the callers for the new signature. Patch by Saleem Abdulrasool! llvm-svn: 169988	2012-12-12 16:52:40 +00:00
Michael Ilseman	d2b05e59b5	Have SimplifyBinOp call the new FAdd/FSub/FMul helpers, with fast-math flags off llvm-svn: 169943	2012-12-12 00:29:16 +00:00
Michael Ilseman	bb6f691b01	Added a slew of SimplifyInstruction floating-point optimizations, many of which take advantage of fast-math flags. Test cases included. fsub X, +0 ==> X fsub X, -0 ==> X, when we know X is not -0 fsub +/-0.0, (fsub -0.0, X) ==> X fsub nsz +/-0.0, (fsub +/-0.0, X) ==> X fsub nnan ninf X, X ==> 0.0 fadd nsz X, 0 ==> X fadd [nnan ninf] X, (fsub [nnan ninf] 0, X) ==> 0 where nnan and ninf have to occur at least once somewhere in this expression fmul X, 1.0 ==> X llvm-svn: 169940	2012-12-12 00:27:46 +00:00
Chandler Carruth	7ec41c7827	Holding my nose and moving the accumulation routine to GEPOperator instead of the instruction. I've left a forwarding wrapper for the instruction so users with the instruction don't need to create a GEPOperator themselves. This lets us remove the copy of this code in instsimplify. I've looked at most of the other copies of similar code, and this is the only one I've found that is actually exactly the same. The one in InlineCost is very close, but it requires re-mapping non-constant indices through the cost analysis value simplification map. I could add direct support for this to the generic routine, but it seems overly specific. llvm-svn: 169853	2012-12-11 11:05:15 +00:00
Chandler Carruth	1e14053d84	Hoist the GEP constant address offset computation to a common home on the GEP instruction class. This is part of the continued refactoring and cleaning of the infrastructure used by SROA. This particular operation is also done in a few other places which I'll try to refactor to share this implementation. llvm-svn: 169852	2012-12-11 10:29:10 +00:00
Arnold Schwaighofer	edd62b14e5	Optimistically analyse Phi cycles Analyse Phis under the starting assumption that they are NoAlias. Recursively look at their inputs. If they MayAlias/MustAlias there must be an input that makes them so. Addresses bug 14351. llvm-svn: 169788	2012-12-10 23:02:41 +00:00
Chandler Carruth	e41e7b7901	Add a new visitor for walking the uses of a pointer value. This visitor provides infrastructure for recursively traversing the use-graph of a pointer-producing instruction like an alloca or a malloc. It maintains a worklist of uses to visit, so it can handle very deep recursions. It automatically looks through instructions which simply translate one pointer to another (bitcasts and GEPs). It tracks the offset relative to the original pointer as long as that offset remains constant and exposes it during the visit as an APInt offset. Finally, it performs conservative escape analysis. However, currently it has some limitations that should be addressed going forward: 1) It doesn't handle vectors of pointers. 2) It doesn't provide a cheaper visitor when the constant offset tracking isn't needed. 3) It doesn't support non-instruction pointer values. The current functionality is exactly what is required to implement the SROA pointer-use visitors in terms of this one, rather than in terms of their own ad-hoc base visitor, which was always very poorly specified. SROA has been converted to use this, and the code there deleted which this utility now provides. Technically speaking, using this new visitor allows SROA to handle a few more cases than it previously did. It is now more aggressive in ignoring chains of instructions which look like they would defeat SROA, but in fact do not because they never result in a read or write of memory. While this is "neat", it shouldn't be interesting for real programs as any such chains should have been removed by other passes long before we get to SROA. As a consequence, I've not added any tests for these features -- it shouldn't be part of SROA's contract to perform such heroics. The goal is to extend the functionality of this visitor going forward, and re-use it from passes like ASan that can benefit from doing a detailed walk of the uses of a pointer. Thanks to Ben Kramer for the code review rounds and lots of help reviewing and debugging this patch. llvm-svn: 169728	2012-12-10 08:28:39 +00:00
Michael Ilseman	65f1435a6f	Reorganize FastMathFlags to be a wrapper around unsigned, and streamline some interfaces. llvm-svn: 169712	2012-12-09 21:12:04 +00:00
Chandler Carruth	80d3e56c73	Add support to ValueTracking for determining that a pointer is non-null by virtue of inbounds GEPs that preclude a null pointer. This is a very common pattern in the code generated by std::vector and other standard library routines which use allocators that test for null pervasively. This is one step closer to teaching Clang+LLVM to be able to produce an empty function for: void f() { std::vector<int> v; v.push_back(1); v.push_back(2); v.push_back(3); v.push_back(4); } Which is related to getting them to completely fold SmallVector push_back sequences into constants when inlining and other optimizations make that a possibility. llvm-svn: 169573	2012-12-07 02:08:58 +00:00
Michael Ilseman	0f12837be0	Have CannotBeNegativeZero() be aware of the nsz fast-math flag llvm-svn: 169452	2012-12-06 00:07:09 +00:00
Nadav Rotem	ce5db0fa3f	constify the cost API llvm-svn: 169172	2012-12-03 22:47:12 +00:00
Chandler Carruth	ed0881b2a6	Use the new script to sort the includes of every file under lib. Sooooo many of these had incorrect or strange main module includes. I have manually inspected all of these, and fixed the main module include to be the nearest plausible thing I could find. If you own or care about any of these source files, I encourage you to take some time and check that these edits were sensible. I can't have broken anything (I strictly added headers, and reordered them, never removed), but they may not be the headers you'd really like to identify as containing the API being implemented. Many forward declarations and missing includes were added to header files to allow them to parse cleanly when included first. The main module rule does in fact have its merits. =] llvm-svn: 169131	2012-12-03 16:50:05 +00:00
Chandler Carruth	dbd6958183	Move the InstVisitor utility into VMCore where it belongs. It heavily depends on the IR infrastructure, there is no sense in it being off in Support land. This is in preparation to start working to expand InstVisitor into more special-purpose visitors that are still generic and can be re-used across different passes. The expansion will go into the Analysis tree though as nothing in VMCore needs it. llvm-svn: 168972	2012-11-30 03:08:41 +00:00
Preston Briggs	fd0b5c898a	Modified dump() to provide a little more information for dependences between instructions that don't share a common loop. Updated the test results appropriately. llvm-svn: 168965	2012-11-30 00:44:47 +00:00
Benjamin Kramer	ba11a9892c	Follow up to 168711: It's safe to base this analysis on the found compare, just return the value for the right predicate. Thanks to Andy for catching this. llvm-svn: 168921	2012-11-29 19:07:57 +00:00
Andrew Trick	fa59403bfd	Improve isImpliedCond comment a bit. llvm-svn: 168914	2012-11-29 18:35:13 +00:00
Preston Briggs	4eb7ee566a	Cleaned up a couple of comments. llvm-svn: 168854	2012-11-29 04:30:52 +00:00
Preston Briggs	5cb8cfae1e	Modified depends() to recognize that when all levels are "=" and there's no possible loop-independent dependence, then there's no dependence. Updated all test results appropriately. llvm-svn: 168719	2012-11-27 19:12:26 +00:00
Benjamin Kramer	e20e124280	SCEV: Even if the latch terminator is foldable we can't deduce the result of an unrelated condition with it. Fixes PR14432. llvm-svn: 168711	2012-11-27 18:16:32 +00:00
Preston Briggs	1084fa2ef2	Modify depends(Src, Dst, PossiblyLoopIndependent). If the Src and Dst are the same instruction, no loop-independent dependence is possible, so we force the PossiblyLoopIndependent flag to false. The test case results are updated appropriately. llvm-svn: 168678	2012-11-27 06:41:46 +00:00
Michael Ilseman	be9137a5c5	Fast-math optimization: fold multiply by zero Added in first optimization using fast-math flags to serve as an example for following optimizations. SimplifyInstruction will now try to optimize an fmul observing its FastMathFlags to see if it can fold multiply by zero when 'nnan' and 'nsz' flags are set. llvm-svn: 168648	2012-11-27 00:46:26 +00:00
Preston Briggs	3ad394931d	Corrects a problem where we rely exclusively on GEPs to drive analysis. Better is to look for cases with useful GEPs and use them when possible. When a pair of useful GEPs is not available, use the raw SCEVs directly. This approach supports better analysis of pointer dereferencing. In parallel, all the test cases are updated appropriately. Cases where we have a store to *B++ can now be analyzed! llvm-svn: 168474	2012-11-21 23:50:04 +00:00
Sebastian Pop	87ce43c5b5	removes a few "const" qualifiers so that I can (someday) call SE->getSCEV without complaint. No semantic change intended. Patch from Preston Briggs <preston.briggs@gmail.com>. llvm-svn: 168391	2012-11-20 22:28:04 +00:00
Bob Wilson	a5b0dc8884	Clean up handling of always-inline functions in the inliner. This patch moves the isInlineViable function from the InlineAlways pass into the InlineCostAnalyzer and then changes the InlineCost computation to use that simple check for always-inline functions. All the special-case checks for AlwaysInline in the CallAnalyzer can then go away. llvm-svn: 168300	2012-11-19 07:04:35 +00:00
Bob Wilson	266802d256	Some comment fixes. llvm-svn: 168299	2012-11-19 07:04:30 +00:00
Hal Finkel	a6f86fc6fa	Phi speculation improvement for BasicAA This is a partial solution to PR14351. It removes some of the special significance of the first incoming phi value in the phi aliasing checking logic in BasicAA. In the context of a loop, the old logic assumes that the first incoming value is the interesting one (meaning that it is the one that comes from outside the loop), but this is often not the case. With this change, we now test first the incoming value that comes from a block other than the parent of the phi being tested. llvm-svn: 168245	2012-11-17 02:33:15 +00:00
Duncan Sands	d7d8c09b93	Make this easier to understand, as suggested by Chandler. llvm-svn: 168196	2012-11-16 20:53:08 +00:00
Duncan Sands	c41076c07c	InstructionSimplify should be able to simplify A+B==B+A to 'true' but wasn't due to the same logic bug that caused PR14361. llvm-svn: 168186	2012-11-16 19:41:26 +00:00
Owen Anderson	1aa2751260	Add doInitialization and doFinalization methods to ModulePass's, to allow them to be re-initialized and reused on multiple Module's. Patch by Pedro Artigas. llvm-svn: 168008	2012-11-15 00:14:15 +00:00
Benjamin Kramer	3eb156306a	DependenceAnalysis: Print all dependency pairs when dumping. Update all testcases. Part of a patch by Preston Briggs. llvm-svn: 167827	2012-11-13 12:12:02 +00:00
NAKAMURA Takumi	43ab4ef9ba	llvm/ConstantFolding.cpp: Make ReadDataFromGlobal() and FoldReinterpretLoadFromConstPtr() Big-endian-aware. llvm-svn: 167595	2012-11-08 20:34:25 +00:00
Richard Osborne	a1fffcf73a	Don't infer whether a value is captured in the current function from the 'nocapture' attribute. The nocapture attribute only specifies that no copies are made that outlive the function. This isn't the same as there being no copies at all. This fixes PR14045. llvm-svn: 167381	2012-11-05 10:48:24 +00:00
NAKAMURA Takumi	dce899962b	ConstantFolding.cpp: Whitespace. llvm-svn: 167377	2012-11-05 00:11:11 +00:00
Duncan Sands	71c2070e2d	Apply the patch from PR14160. I failed to construct a testcase for this, but I'm applying it anyway since it seems to be obviously correct. llvm-svn: 167370	2012-11-04 09:02:45 +00:00
Nadav Rotem	13da94734c	CostModel: add support for Vector Insert and Extract. llvm-svn: 167329	2012-11-02 22:31:56 +00:00
Nadav Rotem	a6b91ac307	Add a cost model analysis that allows us to estimate the cost of IR-level instructions. llvm-svn: 167324	2012-11-02 21:48:17 +00:00
Chandler Carruth	5da3f0512e	Revert the majority of the next patch in the address space series: r165941: Resubmit the changes to llvm core to update the functions to support different pointer sizes on a per address space basis. Despite this commit log, this change primarily changed stuff outside of VMCore, and those changes do not carry any tests for correctness (or even plausibility), and we have consistently found questionable or flat out incorrect cases in these changes. Most of them are probably correct, but we need to devise a system that makes it more clear when we have handled the address space concerns correctly, and ideally each pass that gets updated would receive an accompanying test case that exercises that pass specifically w.r.t. alternate address spaces. However, from this commit, I have retained the new C API entry points. Those were an orthogonal change that probably should have been split apart, but they seem entirely good. In several places the changes were very obvious cleanups with no actual multiple address space code added; these I have not reverted when I spotted them. In a few other places there were merge conflicts due to a cleaner solution being implemented later, often not using address spaces at all. In those cases, I've preserved the new code which isn't address space dependent. This is part of my ongoing effort to clean out the partial address space code which carries high risk and low test coverage, and not likely to be finished before the 3.2 release looms closer. Duncan and I would both like to see the above issues addressed before we return to these changes. llvm-svn: 167222	2012-11-01 09:14:31 +00:00
Chandler Carruth	7ec5085e01	Revert the series of commits starting with r166578 which introduced the getIntPtrType support for multiple address spaces via a pointer type, and also introduced a crasher bug in the constant folder reported in PR14233. These commits also contained several problems that should really be addressed before they are re-committed. I have avoided reverting various cleanups to the DataLayout APIs that are reasonable to have moving forward in order to reduce the amount of churn, and minimize the number of commits that were reverted. I've also manually updated merge conflicts and manually arranged for the getIntPtrType function to stay in DataLayout and to be defined in a plausible way after this revert. Thanks to Duncan for working through this exact strategy with me, and Nick Lewycky for tracking down the really annoying crasher this triggered. (Test case to follow in its own commit.) After discussing with Duncan extensively, and based on a note from Micah, I'm going to continue to back out some more of the more problematic patches in this series in order to ensure we go into the LLVM 3.2 branch with a reasonable story here. I'll send a note to llvmdev explaining what's going on and why. Summary of reverted revisions: r166634: Fix a compiler warning with an unused variable. r166607: Add some cleanup to the DataLayout changes requested by Chandler. r166596: Revert "Back out r166591, not sure why this made it through since I cancelled the command. Bleh, sorry about this! r166591: Delete a directory that wasn't supposed to be checked in yet. r166578: Add in support for getIntPtrType to get the pointer type based on the address space. llvm-svn: 167221	2012-11-01 08:07:29 +00:00
Benjamin Kramer	c914ab6e3c	Fix a couple of comment typos. llvm-svn: 167113	2012-10-31 11:25:32 +00:00
Benjamin Kramer	24c643b6de	DependenceAnalysis: Don't crash if there is no constant operand. This makes the code match the comments. Resolves a crash in loop idiom (PR14219). llvm-svn: 167110	2012-10-31 09:20:38 +00:00
Bob Wilson	09d16aa87e	Remove code to saturate profile counts. We may need to change the way profile counter values are stored, but saturation is the wrong thing to do. Just remove it for now. Patch by Alastair Murray! llvm-svn: 166938	2012-10-29 17:27:39 +00:00
Benjamin Kramer	5bc077aa88	SCEV validator: Ignore CouldNotCompute/undef on both sides. This is mostly noise and blocks finding more severe bugs. llvm-svn: 166873	2012-10-27 11:36:07 +00:00
Benjamin Kramer	24d270db57	SCEV validator: Add workarounds for some common false positives due to the way it handles strings. llvm-svn: 166872	2012-10-27 10:45:01 +00:00
Benjamin Kramer	6dc1e2f287	Remove LoopDependenceAnalysis. It was unmaintained and not much more than a stub. The new DependenceAnalysis pass is both more general and complete. llvm-svn: 166810	2012-10-26 20:25:01 +00:00
Benjamin Kramer	214935ee70	Add a basic verifier for SCEV's backedge taken counts. Enabled with -verify-scev. This could be extended significantly but hopefully catches the common cases now. Note that it's not enabled by default in any configuration because the way it tries to distinguish SCEVs is still fragile and may produce false positives. Also the test-suite isn't clean yet, one example is that it fails if a pass drops an NSW bit but it's still present in SCEV's cached. Cleaning up all those cases will take some time. llvm-svn: 166786	2012-10-26 17:31:32 +00:00
Nadav Rotem	15198e94d2	Fix a crash in SimplifyDemandedBits of vectors of pointers. PR14183. llvm-svn: 166785	2012-10-26 17:17:05 +00:00
Nick Lewycky	c86037ff01	Hoist out some work done inside a loop doing a linear scan over all instructions in a block. GetUnderlyingObject is more expensive than it looks as it can, for instance, call SimplifyInstruction. This might have some behavioural changes in odd corner cases, but only because of some strange artefacts of the original implementation. If you were relying on those, we can fix that by replacing this with a smarter algorithm. Change passes the existing tests. llvm-svn: 166754	2012-10-26 04:43:47 +00:00
Nadav Rotem	8255ceb2cf	Revert 166726 because it may have broken a number of SPEC tests. PR14183. llvm-svn: 166739	2012-10-25 23:51:48 +00:00
Nadav Rotem	bb4cfb5ee1	Fix a crash in ValueTracking. Add support for vectors of pointers. llvm-svn: 166726	2012-10-25 21:52:52 +00:00
Benjamin Kramer	71a3512d60	DependenceAnalysis: Push #includes down into the implementation. llvm-svn: 166688	2012-10-25 16:15:22 +00:00
Hal Finkel	30bd9346a0	getSmallConstantTripMultiple should never return zero. When the trip count is -1, getSmallConstantTripMultiple could return zero, and this would cause runtime loop unrolling to assert. Instead of returning zero, one is now returned (consistent with the existing overflow cases). Fixes PR14167. llvm-svn: 166612	2012-10-24 19:46:44 +00:00
Micah Villmow	bf3eeb2dfc	Add some cleanup to the DataLayout changes requested by Chandler. llvm-svn: 166607	2012-10-24 18:36:13 +00:00
Micah Villmow	12d9127833	Add in support for getIntPtrType to get the pointer type based on the address space. This checkin also adds in some tests that utilize these paths and updates some of the clients. llvm-svn: 166578	2012-10-24 15:52:52 +00:00
Bill Wendling	5858b56ce3	Ignore unreachable blocks when doing memory dependence analysis on non-local loads. It's not really profitable and may result in GVN going into an infinite loop when it hits constructs like this: %x = gep %some.type %x, ... Found via an LTO build of LLVM. llvm-svn: 166490	2012-10-23 18:37:11 +00:00
Nadav Rotem	4dc976fbcb	revert r166264 because the LTO build is still failing llvm-svn: 166340	2012-10-19 21:28:43 +00:00
Benjamin Kramer	a225ed8d2b	SCEVExpander: Don't crash when trying to merge two constant phis. Just constant fold them so they can't cause any trouble. Fixes PR12627. llvm-svn: 166286	2012-10-19 16:37:30 +00:00
Nadav Rotem	4985ddc5e0	recommit the patch that makes LSR and LowerInvoke use the TargetTransform interface. llvm-svn: 166264	2012-10-19 04:27:49 +00:00
Bob Wilson	d6d9ccca38	Temporarily revert the TargetTransform changes. The TargetTransform changes are breaking LTO bootstraps of clang. I am working with Nadav to figure out the problem, but I am reverting it for now to get our buildbots working. This reverts svn commits: 165665 165669 165670 165786 165787 165997 and I have also reverted clang svn 165741 llvm-svn: 166168	2012-10-18 05:43:52 +00:00
Micah Villmow	4bb926d91d	Resubmit the changes to llvm core to update the functions to support different pointer sizes on a per address space basis. llvm-svn: 165941	2012-10-15 16:24:29 +00:00
Sebastian Pop	e9623261ad	fix warning DependenceAnalysis.cpp:1164:32: warning: implicit truncation from 'int' to bitfield changes value from -5 to 3 [-Wconstant-conversion] Result.DV[Level].Direction &= ~Dependence::DVEntry::GT; ^ ~~~~~~~~~~~~~~~~~~~~~~~~ Patch from Preston Briggs <preston.briggs@gmail.com>. llvm-svn: 165784	2012-10-12 02:04:32 +00:00
Micah Villmow	0c61134d8d	Revert 165732 for further review. llvm-svn: 165747	2012-10-11 21:27:41 +00:00
Micah Villmow	083189730e	Add in the first iteration of support for llvm/clang/lldb to allow variable per address space pointer sizes to be optimized correctly. llvm-svn: 165726	2012-10-11 17:21:41 +00:00
Sebastian Pop	59b61b9e2c	dependence analysis Patch from Preston Briggs <preston.briggs@gmail.com>. This is an updated version of the dependence-analysis patch, including an MIV test based on Banerjee's inequalities. It's a fairly complete implementation of the paper Practical Dependence Testing Gina Goff, Ken Kennedy, and Chau-Wen Tseng PLDI 1991 It cannot yet propagate constraints between coupled RDIV subscripts (discussed in Section 5.3.2 of the paper). It's organized as a FunctionPass with a single entry point that supports testing for dependence between two instructions in a function. If there's no dependence, it returns null. If there's a dependence, it returns a pointer to a Dependence which can be queried about details (what kind of dependence, is it loop independent, direction and distance vector entries, etc). I haven't included every imaginable feature, but there's a good selection that should be adequate for supporting many loop transformations. Of course, it can be extended as necessary. Included in the patch file are many test cases, commented with C code showing the loops and array references. llvm-svn: 165708	2012-10-11 07:32:34 +00:00
Nadav Rotem	e10328737d	Add a new interface to allow IR-level passes to access codegen-specific information. llvm-svn: 165665	2012-10-10 22:04:55 +00:00
Bill Wendling	ff758fbd45	Use the attribute enums to query if a function has an attribute. llvm-svn: 165551	2012-10-09 21:49:51 +00:00
Bill Wendling	8ccd6ca199	Use the attribute enums to query if a parameter has an attribute. llvm-svn: 165550	2012-10-09 21:38:14 +00:00
Bill Wendling	c9b22d735a	Create enums for the different attributes. We use the enums to query whether an Attributes object has that attribute. The opaque layer is responsible for knowing where that specific attribute is stored. llvm-svn: 165488	2012-10-09 07:45:08 +00:00
Bill Wendling	375eb1f980	Remove more uses of the attribute enums by supplying appropriate query methods for them. No functionality change intended. llvm-svn: 165466	2012-10-09 00:28:54 +00:00
Nick Lewycky	7c3b5d9444	Give CaptureTracker::shouldExplore a base implementation. Most users want to do the same thing. No functionality change. llvm-svn: 165435	2012-10-08 22:12:48 +00:00
Micah Villmow	cdfe20b97f	Move TargetData to DataLayout. llvm-svn: 165402	2012-10-08 16:38:25 +00:00
Bob Wilson	e0b1dea267	Make sure always-inline functions get inlined. <rdar://problem/12423986> Without this change, when the estimated cost for inlining a function with an "alwaysinline" attribute was lower than the inlining threshold, the getInlineCost function was returning that estimated cost rather than the special InlineCost::AlwaysInlineCost value. That is fine in the normal inlining case, but it can fail when the inliner considers the opportunity cost of inlining into an internal or linkonce-odr function. It may decide not to inline the always-inline function in that case. The fix here is just to make getInlineCost always return the special value for always-inline functions. I ran into this building clang with libc++. Tablegen failed to link because of an always-inline function that was not inlined. I have been unable to reduce the testcase down to a reasonable size. llvm-svn: 165367	2012-10-07 01:11:19 +00:00
Duncan Sands	271ea6cdc5	The alignment of an sret parameter is known: it must be at least the alignment of the return type. Teach the optimizers this. llvm-svn: 165226	2012-10-04 13:36:31 +00:00
Bill Wendling	5d637b7d5b	Use method to query for NoAlias attribute. llvm-svn: 165211	2012-10-04 07:17:46 +00:00
Duncan Sands	5e561bbd5d	Ignore apparent buffer overruns on external or weak globals. This is a major source of false positives due to globals being declared in a header with some kind of incomplete (small) type, but the actual definition being bigger. llvm-svn: 164912	2012-09-30 07:30:10 +00:00
Sylvestre Ledru	91ce36c986	Revert 'Fix a typo 'iff' => 'if''. iff is an abbreviation of if and only if. See: http://en.wikipedia.org/wiki/If_and_only_if Commit 164767 llvm-svn: 164768	2012-09-27 10:14:43 +00:00
Sylvestre Ledru	721cffd53a	Fix a typo 'iff' => 'if' llvm-svn: 164767	2012-09-27 09:59:43 +00:00
Bill Wendling	863bab689a	Remove the `hasFnAttr' method from Function. The hasFnAttr method has been replaced by querying the Attributes explicitly. No intended functionality change. llvm-svn: 164725	2012-09-26 21:48:26 +00:00
Duncan Sands	8598a0ec80	Now that invoke of an intrinsic is possible (for the llvm.do.nothing intrinsic) teach the callgraph logic to not create callgraph edges to intrinsics for invoke instructions; it already skips this for call instructions. Fixes PR13903. llvm-svn: 164707	2012-09-26 17:16:01 +00:00
Duncan Sands	a221eea7db	Teach the 'lint' sanity checking pass to detect simple buffer overflows. llvm-svn: 164671	2012-09-26 07:45:36 +00:00
Duncan Sands	3f4d0b1724	Change the way the lint sanity checking pass detects misaligned memory accesses. Previously it was only able to detect problems if the pointer was a numerical value (eg inttoptr i32 1 to i32*), but not if it was an alloca or global. The reason was the use of ComputeMaskedBits: imagine you have "alloca i8, align 2", and ask ComputeMaskedBits what it knows about the bits of the alloca pointer. It can tell you that the bottom bit is known zero (due to align 2) but it can't tell you that bit 1 is known one. That's because the address could be an even multiple of 2 rather than an odd multiple, eg it might be a multiple of 4. Thus trying to use KnownOne is ineffective in the case of an alloca as it will never have any bits set. Instead look explicitly for constant offsets from allocas and globals. llvm-svn: 164595	2012-09-25 10:00:49 +00:00
Duncan Sands	aef83e5f03	GCC doesn't understand that OrigAliasResult having a value is correlated with ArePhisAssumedNoAlias, and warns that OrigAliasResult may be used uninitialized. Pacify GCC. llvm-svn: 164229	2012-09-19 15:43:44 +00:00
Nadav Rotem	4eb3d4b2cf	Prevent inlining of callees which allocate lots of memory into a recursive caller. Example: void foo() { ... foo(); // I'm recursive! bar(); } bar() { int a[1000]; // large stack size } rdar://10853263 llvm-svn: 164207	2012-09-19 08:08:04 +00:00
Manman Ren	49d684e1e2	Release build: guard dump functions with "#if !defined(NDEBUG) || defined(LLVM_ENABLE_DUMP)" No functional change. Update r163344. llvm-svn: 163679	2012-09-12 05:06:18 +00:00
Manman Ren	c3366ccecb	Release build: guard dump functions with "ifndef NDEBUG" No functional change. llvm-svn: 163344	2012-09-06 19:55:56 +00:00
Roman Divacky	4717a8d654	Dont cast away const needlessly. Found by gcc48 -Wcast-qual. llvm-svn: 163324	2012-09-06 15:42:13 +00:00
Arnold Schwaighofer	8dc34cfb99	BasicAA: Recognize cyclic NoAlias phis Enhances basic alias analysis to recognize phis whose first incoming values are NoAlias and whose other incoming values are just the phi node itself through some amount of recursion. Example: With this change basicaa reports that ptr_phi and ptr2_phi do not alias each other. bb: ptr = ptr2 + 1 loop: ptr_phi = phi [bb, ptr], [loop, ptr_plus_one] ptr2_phi = phi [bb, ptr2], [loop, ptr2_plus_one] ... ptr_plus_one = gep ptr_phi, 1 ptr2_plus_one = gep ptr2_phi, 1 This enables the elimination of one load in code like the following: extern int *foo; int test_noalias(int *ptr, int num, int *coeff) { int *ptr2 = ptr; int result = (*ptr++) * (*coeff--); while (num--) { *ptr2++ = *ptr; result += (*coeff--) * (*ptr++); } *ptr = *foo; return result; } Part 2/2 of fix for PR13564. llvm-svn: 163319	2012-09-06 14:41:53 +00:00
Arnold Schwaighofer	76dca58c66	BasicAA: GEPs of NoAlias'ing base ptr with equivalent indices are NoAlias If we can show that the base pointers of two GEPs don't alias each other using precise analysis and the indices and base offset are equal then the two GEPs also don't alias each other. This is primarily needed for the follow up patch that analyses NoAlias'ing PHI nodes. Part 1/2 of fix for PR13564. llvm-svn: 163317	2012-09-06 14:31:51 +00:00
Manman Ren	f3fedb6935	JumpThreading: when default destination is the destination of some cases in a switch, make sure we include the value for the cases when calculating edge value from switch to the default destination. rdar://12241132 llvm-svn: 163270	2012-09-05 23:45:58 +00:00
Roman Divacky	ad06cee239	Stop casting away const qualifier needlessly. llvm-svn: 163258	2012-09-05 22:26:57 +00:00
Benjamin Kramer	6c2649ca4e	Switch BasicAliasAnalysis' cache to SmallDenseMap. It relies on clear() being fast and the cache rarely has more than 1 or 2 elements, so give it an inline capacity and always shrink it back down in case it grows. DenseMap will grow to 64 buckets which makes clear() a lot slower. llvm-svn: 163215	2012-09-05 16:49:37 +00:00
Bob Wilson	01cfbfe9d0	Be conservative about allocations that may alias the accessed pointer. If an allocation has a must-alias relation to the access pointer, we treat it as a Def. Otherwise, without this check, the code here was just skipping over the allocation call and ignoring it. I noticed this by inspection and don't have a specific testcase that it breaks, but it seems like we need to treat a may-alias allocation as a Clobber. llvm-svn: 163127	2012-09-04 03:30:13 +00:00
Bob Wilson	dcc54decd5	Fix more fallout from r158919, similar to PR13547. This code used to only handle malloc-like calls, which do not read memory. r158919 changed it to check isNoAliasFn(), which includes strdup-like and realloc-like calls, but it was not checking for dependencies on the memory read by those calls. llvm-svn: 163106	2012-09-03 05:15:15 +00:00
Benjamin Kramer	e7e5235726	Clean up ProfileDataLoader a bit. - Overloading operator<< for raw_ostream and pointers is dangerous, it alters the behavior of code that includes the header. - Remove unused ID. - Use LLVM's byte swapping helpers instead of a hand-coded. - Make ReadProfilingData work directly on a pointer. No functionality change. llvm-svn: 162992	2012-08-31 12:43:07 +00:00
Bill Wendling	5aed004cf1	Cleanups due to feedback. No functionality change. Patch by Alistair. llvm-svn: 162979	2012-08-31 05:18:31 +00:00
Benjamin Kramer	8bcc971174	Make MemoryBuiltins aware of TargetLibraryInfo. This disables malloc-specific optimization when -fno-builtin (or -ffreestanding) is specified. This has been a problem for a long time but became more severe with the recent memory builtin improvements. Since the memory builtin functions are used everywhere, this required passing TLI in many places. This means that functions that now have an optional TLI argument, like RecursivelyDeleteTriviallyDeadFunctions, won't remove dead mallocs anymore if the TLI argument is missing. I've updated most passes to do the right thing. Fixes PR13694 and probably others. llvm-svn: 162841	2012-08-29 15:32:21 +00:00
Manman Ren	abbb01abea	Profile: set branch weight metadata with data generated from profiling. This patch implements ProfileDataLoader which loads profile data generated by -insert-edge-profiling and updates branch weight metadata accordingly. Patch by Alastair Murray. llvm-svn: 162799	2012-08-28 22:21:25 +00:00
Hongbin Zheng	14c05c409a	Remove the block_node_iterator of Region, replace it by the block_iterator. llvm-svn: 162672	2012-08-27 13:49:24 +00:00
Richard Smith	228e6d4cf3	Fix integer undefined behavior due to signed left shift overflow in LLVM. Reviewed offline by chandlerc. llvm-svn: 162623	2012-08-24 23:29:28 +00:00
Manman Ren	cf10446ffa	BranchProb: modify the definition of an edge in BranchProbabilityInfo to handle the case of multiple edges from one block to another. A simple example is a switch statement with multiple values to the same destination. The definition of an edge is modified from a pair of blocks to a pair of PredBlock and an index into the successors. Also set the weight correctly when building SelectionDAG from LLVM IR, especially when converting a Switch. IntegersSubsetMapping is updated to calculate the weight for each cluster. llvm-svn: 162572	2012-08-24 18:14:27 +00:00
Richard Smith	c621af1f60	Fix floating-point divide by zero, in a case where the value was not going to be used anyway. llvm-svn: 162518	2012-08-24 00:31:45 +00:00
Benjamin Kramer	f29db275b2	Reduce duplicated hash map lookups. llvm-svn: 162362	2012-08-22 15:37:57 +00:00
Benjamin Kramer	34764fe2e4	MemoryBuiltins: Properly guard ObjectSizeOffsetVisitor against cycles in the IR. The previous fix only checked for simple cycles, use a set to catch longer cycles too. Drop the broken check from the ObjectSizeOffsetEvaluator. The BoundsChecking pass doesn't have to deal with invalid IR like InstCombine does. llvm-svn: 162120	2012-08-17 19:26:41 +00:00
Benjamin Kramer	4901f0d2a2	Guard MemoryBuiltins against self-looping GEPs, which can occur in unreachable code due to constant propagation. Fixes PR13621. llvm-svn: 162098	2012-08-17 14:16:37 +00:00
Bill Wendling	e1c54262f4	Set the branch probability of branching to the 'normal' destination of an invoke instruction to something absurdly high, while setting the probability of branching to the 'unwind' destination to the bare minimum. This should cause the normal destination's invoke blocks to be moved closer to the invoke. PR13612 llvm-svn: 161944	2012-08-15 12:22:35 +00:00
Nadav Rotem	5d4e205874	MemoryDependenceAnalysis attempts to find the first memory dependency for function calls. Currently, if GetLocation reports that it did not find a valid pointer (this is the case for volatile load/stores), we ignore the result. This patch adds code to handle the cases where we did not obtain a valid pointer. rdar://11872864 PR12899 llvm-svn: 161802	2012-08-13 23:03:43 +00:00
Benjamin Kramer	c99d0e9186	PR13095: Give an inline cost bonus to functions using byval arguments. We give a bonus for every argument because the argument setup is not needed anymore when the function is inlined. With this patch we interpret byval arguments as a compact representation of many arguments. The byval argument setup is implemented in the backend as an inline memcpy, so to model the cost as accurately as possible we take the number of pointer-sized elements in the byval argument and give a bonus of 2 instructions for every one of those. The bonus is capped at 8 elements, which is the number of stores at which the x86 backend switches from an expanded inline memcpy to a real memcpy. It would be better to use the real memcpy threshold from the backend, but it's not available via TargetData. This change brings the performance of c-ray in line with gcc 4.7. The included test case tries to reproduce the c-ray problem to catch regressions for this benchmark early, its performance is dominated by the inline decision of a specific call. This only has a small impact on most code, more on x86 and arm than on x86_64 due to the way the ABI works. When building LLVM for x86 it gives a small inline cost boost to virtually any function using StringRef or STL allocators, but only a 0.01% increase in overall binary size. The size of gcc compiled by clang actually shrunk by a couple bytes with this patch applied, but not significantly. llvm-svn: 161413	2012-08-07 11:13:19 +00:00
Chandler Carruth	2f6cf4884c	Fix PR13412, a nasty miscompile due to the interleaved instsimplify+inline strategy. The crux of the problem is that instsimplify was reasonably relying on an invariant that is true within any single function, but is no longer true mid-inline the way we use it. This invariant is that an argument pointer != a local (alloca) pointer. The fix is really lightweight though, and allows instsimplify to be resilient to these situations: when checking the relationships to function arguments, ensure that the arguments come from the same function. If they come from different functions, then none of these assumptions hold. All credit to Benjamin Kramer for coming up with this clever solution to the problem. llvm-svn: 161410	2012-08-07 10:59:59 +00:00
Hongbin Zheng	bb1d209210	Implement the block_iterator of Region based on df_iterator. llvm-svn: 161177	2012-08-02 14:20:02 +00:00
Nick Lewycky	fb78083b1c	Stay rational; don't assert trying to take the square root of a negative value. If it's negative, the loop is already proven to be infinite. Fixes PR13489! llvm-svn: 161107	2012-08-01 09:14:36 +00:00
Nadav Rotem	77f1b9c477	When constant folding GEP expressions, keep the address space information of pointers. Together with Ran Chachick <ran.chachick@intel.com> llvm-svn: 160954	2012-07-30 07:25:20 +00:00
Nuno Lopes	85591f899d	fix PR13390: do not loop forever with self-referencing self instructions llvm-svn: 160876	2012-07-27 18:21:15 +00:00
Nuno Lopes	f0626f2205	revert r160742: it's breaking CMake build original commit msg: MemoryBuiltins: add support to determine the size of strdup'ed non-constant strings llvm-svn: 160751	2012-07-25 18:49:28 +00:00
Nuno Lopes	f0441e04bd	MemoryBuiltins: add support to determine the size of strdup'ed non-constant strings llvm-svn: 160742	2012-07-25 17:29:22 +00:00
Duncan Sands	0b875a0c29	When folding a load from a global constant, if the load started in the middle of an array element (rather than at the beginning of the element) and extended into the next element, then the load from the second element was being handled wrong due to incorrect updating of the notion of which byte to load next. This fixes PR13442. Thanks to Chris Smowton for reporting the problem, analyzing it and providing a fix. llvm-svn: 160711	2012-07-25 09:14:54 +00:00
Nuno Lopes	2a4b09c9de	teach objectsize about strdup() and strndup() llvm-svn: 160676	2012-07-24 16:28:13 +00:00
Sylvestre Ledru	35521e2310	Fix a typo (the the => the) llvm-svn: 160621	2012-07-23 08:51:15 +00:00
Nuno Lopes	705141d4df	baby steps toward fixing some problems with inbound GEPs that overflow, as discussed 2 months ago or so. Make sure we do not emit index computations with NSW flags so that we dont get an undef value if the GEP overflows llvm-svn: 160589	2012-07-20 23:07:40 +00:00
Benjamin Kramer	5be8f60126	Remove unused private member variables uncovered by the recent changes to clang's -Wunused-private-field. llvm-svn: 160583	2012-07-20 22:05:57 +00:00
Chandler Carruth	36e2ecf528	Move llvm/Support/TypeBuilder.h -> llvm/TypeBuilder.h. This completes the move of *Builder classes into the Core library. No uses of this builder in Clang or DragonEgg I could find. If there is a desire to have an IR-building-support library that contains all of these builders, that can be easily added, but currently it seems likely that these add no real overhead to VMCore. llvm-svn: 160243	2012-07-15 23:45:24 +00:00
Andrew Trick	653513b8dd	LSR Fix: check SCEV expression safety before expansion. All SCEV expressions used by LSR formulae must be safe to expand. i.e. they may not contain UDiv unless we can prove nonzero denominator. Fixes PR11356: LSR hoists UDiv. llvm-svn: 160205	2012-07-13 23:33:10 +00:00
Andrew Trick	ee76065b7a	IVUsers should only generate SCEV's for values that are safe to speculate. This allows SCEVExpander to run on the IV expressions. This codifies an assumption made by LSR to complete the fix for PR11356, but I haven't been able to generate a separate unit test for this part. I'm adding it as an extra safety check. llvm-svn: 160204	2012-07-13 23:33:05 +00:00
Andrew Trick	365e31c36c	Factor SCEV traversal code so I can use it elsewhere. No functionality. llvm-svn: 160203	2012-07-13 23:33:03 +00:00
Dan Gohman	3d1512384f	Delete code for folding undefs in ScalarEvolution. It's invalid in obscure ways, and it isn't actually important in the real world. llvm-svn: 159969	2012-07-09 23:51:20 +00:00
Nuno Lopes	0d44a50426	PHINode::hasConstantValue(): return undef if the PHI is fully recursive. Thanks Duncan for the idea llvm-svn: 159687	2012-07-03 21:15:40 +00:00
Nuno Lopes	9291ff4078	fold PHI nodes in SizeOffsetEvaluator whenever possible. Unfortunately this change requires the cache map to hold WeakVHs instead llvm-svn: 159667	2012-07-03 17:13:25 +00:00
Benjamin Kramer	e2ef47c145	Reduce use list thrashing by using DenseMap's find_as for maps with ValueHandle keys. No functionality change. llvm-svn: 159497	2012-06-30 22:37:15 +00:00
Nuno Lopes	674acc12d0	RefreshCallGraph: ignore 'invoke intrinsic'. IntrinsicInst does not recognize invoke, and shouldn't at this point, since the rest of LLVM codebase doesn't expect invoke of intrinsics llvm-svn: 159441	2012-06-29 17:49:32 +00:00
Bill Wendling	098d906dbb	Update the CMake files. llvm-svn: 159417	2012-06-29 09:01:47 +00:00
Bill Wendling	f799efdedc	The DIBuilder class is just a wrapper around debug info creation (a.k.a. MDNodes). The module doesn't belong in Analysis. Move it to the VMCore instead. llvm-svn: 159414	2012-06-29 08:32:07 +00:00
Nick Lewycky	474112d82c	If the step value is a constant zero, the loop isn't going to terminate. Fixes the assert reported in PR13228! llvm-svn: 159393	2012-06-28 23:44:57 +00:00

4543 Commits