llvm-project

Commit Graph

Author	SHA1	Message	Date
Philip Reames	70efccd7dd	Fix an unused variable warning which broke the clang-cmake-mips builder llvm-svn: 251614	2015-10-29 04:21:49 +00:00
Philip Reames	eb3e9dad7f	[LVI/CVP] Teach LVI about range metadata Somewhat shockingly for an analysis pass which is computing constant ranges, LVI did not understand the ranges provided by range metadata. As part of this change, I included a change to CVP primarily because doing so made it much easier to write small self contained test cases. CVP was previously only handling the non-local operand case, but given that LVI can sometimes figure out information about instructions standalone, I don't see any reason to restrict this. There could possibly be a compile time impact from this, but I suspect it should be minimal. If anyone has an example which substaintially regresses, please let me know. I could restrict the block local handling to ICmps feeding Terminator instructions if needed. Note that this patch continues a somewhat bad practice in LVI. In many cases, we know facts about values, and separate context sensitive facts about values. LVI makes no effort to distinguish and will frequently cache the same value fact repeatedly for different contexts. I would like to change this, but that's a large enough change that I want it to go in separately with clear documentation of what's changing. Other examples of this include the non-null handling, and arguments. As a meta comment: the entire motivation of this change was being able to write smaller (aka reasonable sized) test cases for a future patch teaching LVI about select instructions. Differential Revision: http://reviews.llvm.org/D13543 llvm-svn: 251606	2015-10-29 03:57:17 +00:00
Philip Reames	dbbd77921d	[InstSimplify] sgt on i1s also encodes implication Follow on to http://reviews.llvm.org/D13074, implementing something pointed out by Sanjoy. His truth table from his comment on that bug summarizes things well: LHS \| RHS \| LHS >=s RHS \| LHS implies RHS 0 \| 0 \| 1 (0 >= 0) \| 1 0 \| 1 \| 1 (0 >= -1) \| 1 1 \| 0 \| 0 (-1 >= 0) \| 0 1 \| 1 \| 1 (-1 >= -1) \| 1 The key point is that an "i1 1" is the value "-1", not "1". Differential Revision: http://reviews.llvm.org/D13756 llvm-svn: 251597	2015-10-29 03:19:10 +00:00
Tim Northover	8b40366b54	ARM: teach backend about WatchOS and TvOS libcalls. The most substantial changes are again for watchOS: libcalls are hard-float if needed and sincos has a different calling convention. llvm-svn: 251571	2015-10-28 22:51:16 +00:00
Hal Finkel	1140e1704b	Revert "r251451 - [AliasSetTracker] Use mod/ref information for UnknownInstr" It looks like this broke the stage 2 builder: http://lab.llvm.org:8080/green/job/clang-stage2-configure-Rlto/6989/ Original commit message: AliasSetTracker does not need to convert the access mode to ModRefAccess if the new visited UnknownInst has only 'REF' modrefinfo to existing pointers in the sets. Patch by Andrew Zhogin! llvm-svn: 251562	2015-10-28 22:13:41 +00:00
Sanjoy Das	c88f5d3c2c	[SCEV] Compute max backedge count for loops with "shift ivs" This teaches SCEV to compute //max// backedge taken counts for loops like for (int i = k; i != 0; i >>>= 1) whatever(); SCEV yet cannot represent the exact backedge count for these loops, and this patch does not change that. This is really geared towards teaching SCEV that loops like the above are not infinite. llvm-svn: 251558	2015-10-28 21:27:14 +00:00
Igor Laevsky	559d170021	[AliasAnalysis] Take into account readnone attribute for the function arguments Differential Revision: http://reviews.llvm.org/D13992 llvm-svn: 251535	2015-10-28 17:54:48 +00:00
JF Bastien	ddaa1c7eb1	WebAssembly: disable some loop-idiom recognition memset/memcpy aren't fully supported yet. We should invert this test once they are supported. llvm-svn: 251534	2015-10-28 17:50:23 +00:00
Igor Laevsky	36e84c0fc7	[AliasAnalysis] Take into account readonly attribute for the function arguments In getArgModRefInfo we consider all arguments as having MRI_ModRef. However for arguments marked with readonly attribute we can return more precise answer - MRI_Ref. Differential Revision: http://reviews.llvm.org/D13992 llvm-svn: 251525	2015-10-28 16:42:00 +00:00
Benjamin Kramer	039b10423a	Put global classes into the appropriate namespace. Most of the cases belong into an anonymous namespace. No functionality change intended. llvm-svn: 251515	2015-10-28 13:54:36 +00:00
James Molloy	c67dec690f	[GlobalsAA] An indirect global that is initialized is not fair game When checking if an indirect global (a global with pointer type) is only assigned by allocation functions, first check if the global is itself initialized. If it is, it's not only assigned by allocation functions. This fixes PR25309. Thanks to David Majnemer for reducing the test case! llvm-svn: 251508	2015-10-28 10:41:29 +00:00
Sanjoy Das	3ef1e689c9	[ValueTracking] Expose `implies` via ValueTracking, NFC Summary: This will allow a later patch to `JumpThreading` use this functionality. Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13971 llvm-svn: 251488	2015-10-28 03:20:19 +00:00
Sanjoy Das	1d1929aace	[ValueTracking] Use !range metadata more aggressively in KnownBits Summary: Teach `computeKnownBitsFromRangeMetadata` to use `!range` metadata more aggressively. Reviewers: majnemer, nlewycky, jingyue Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14100 llvm-svn: 251487	2015-10-28 03:20:15 +00:00
Hal Finkel	b1bb739166	[AliasSetTracker] Use mod/ref information for UnknownInstr AliasSetTracker does not need to convert the access mode to ModRefAccess if the new visited UnknownInst has only 'REF' modrefinfo to existing pointers in the sets. Patch by Andrew Zhogin! llvm-svn: 251451	2015-10-27 20:37:04 +00:00
David Majnemer	235acde953	[ScalarEvolutionExpander] PHI on a catchpad can be used on both edges A PHI on a catchpad might be used by both edges out of the catchpad, feeding back into a loop. In this case, just use the insertion point. Anything more clever would require new basic blocks or PHI placement. llvm-svn: 251442	2015-10-27 19:48:28 +00:00
David Majnemer	dd9a815746	[ScalarEvolutionExpander] Properly insert no-op casts + EH Pads We want to insert no-op casts as close as possible to the def. This is tricky when the cast is of a PHI node and the BasicBlocks between the def and the use cannot hold any instructions. Iteratively walk EH pads until we hit a non-EH pad. This fixes PR25326. llvm-svn: 251393	2015-10-27 07:36:42 +00:00
Sanjoy Das	63d2b77961	[ValueTracking] Don't special case wrapped ConstantRanges; NFCI Use `getUnsignedMax` directly instead of special casing a wrapped ConstantRange. The previous code would have been "buggy" (and this would have been a semantic change) if LLVM allowed !range metadata to denote full ranges. E.g. in %val = load i1, i1* %ptr, !range !{i1 1, i1 1} ;; == full set ValueTracking would conclude that the high bit (IOW the only bit) in %val was zero. Since !range metadata does not allow empty or full ranges, this change is just a minor stylistic improvement. llvm-svn: 251380	2015-10-27 01:36:06 +00:00
Sanjoy Das	49edd3b3a8	[SCEV] Refactor out ScalarEvolution::getDataLayout; NFC llvm-svn: 251375	2015-10-27 00:52:09 +00:00
Keno Fischer	277bfaefaf	Initialize BasicAAWrapperPass in it's constructor Summary: This idiom is used elsewhere in LLVM, but was overlooked here. Reviewers: chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13628 llvm-svn: 251348	2015-10-26 21:22:58 +00:00
Cong Hou	fff8ccf579	Check the case that the numerator and denominator are both zeros when getting edge probabilities in BPI and return 100% in this case. This issue is triggered in PGO mode when bootstrapping LLVM. It seems that it is not guaranteed that edge weights are always greater than zero which are read from profile data. llvm-svn: 251317	2015-10-26 18:00:17 +00:00
James Molloy	493e57de01	[ValueTracking] Extend r251146 to catch a fairly common case Even though we may not know the value of the shifter operand, it's possible we know the shifter operand is non-zero. This can allow us to infer more known bits - for example: %1 = load %p !range {1, 5} %2 = shl %q, %1 We don't know %1, but we do know that it is nonzero so %2[0] is known zero, and importantly %2 is known non-zero. Calling isKnownNonZero is nontrivially expensive so use an Optional to run it lazily and cache its result. llvm-svn: 251294	2015-10-26 14:10:46 +00:00
Davide Italiano	f04d89bdb4	[ScalarEvolution] Throw away dead code. llvm-svn: 251256	2015-10-25 20:00:49 +00:00
Davide Italiano	2071f4cc5a	[ScalarEvolution] Get rid of NDEBUG in header (correctly this time). llvm-svn: 251255	2015-10-25 19:55:24 +00:00
Davide Italiano	0c34243ac1	[ScalarEvolution] Get rid of NDEBUG in header. llvm-svn: 251249	2015-10-25 19:13:36 +00:00
Elena Demikhovsky	092858588a	Scalarizer for masked.gather and masked.scatter intrinsics. When the target does not support these intrinsics they should be converted to a chain of scalar load or store operations. If the mask is not constant, the scalarizer will build a chain of conditional basic blocks. I added isLegalMaskedGather() isLegalMaskedScatter() APIs. Differential Revision: http://reviews.llvm.org/D13722 llvm-svn: 251237	2015-10-25 15:37:55 +00:00
Benjamin Kramer	5611561e99	Use all_of to simplify control flow. NFC. llvm-svn: 251202	2015-10-24 19:30:37 +00:00
Benjamin Kramer	74b6d3b967	Use find_if to simplify control flow. NFC. llvm-svn: 251200	2015-10-24 19:03:15 +00:00
Benjamin Kramer	557b601b08	[BasicAliasAnalysis] Simplify expression, no functional change. (-1) - x + 1 is the same as -x. llvm-svn: 251185	2015-10-24 11:38:01 +00:00
Sanjoy Das	a7e13782f1	Extract out getConstantRangeFromMetadata; NFC The loop idiom creating a ConstantRange is repeated twice in the codebase, time to give it a name and a home. The loop is also repeated in `rangeMetadataExcludesValue`, but using `getConstantRangeFromMetadata` there would not be an NFC -- the range returned by `getConstantRangeFromMetadata` may contain a value that none of the subranges did. llvm-svn: 251180	2015-10-24 05:37:35 +00:00
Sanjoy Das	bb5ffc50b7	Fix whitespace issues in two places; NFC llvm-svn: 251179	2015-10-24 05:37:28 +00:00
Hal Finkel	f2199b2178	Handle non-constant shifts in computeKnownBits, and use computeKnownBits for constant folding in InstCombine/Simplify First, the motivation: LLVM currently does not realize that: ((2072 >> (L == 0)) >> 7) & 1 == 0 where L is some arbitrary value. Whether you right-shift 2072 by 7 or by 8, the lowest-order bit is always zero. There are obviously several ways to go about fixing this, but the generic solution pursued in this patch is to teach computeKnownBits something about shifts by a non-constant amount. Previously, we would give up completely on these. Instead, in cases where we know something about the low-order bits of the shift-amount operand, we can combine (and together) the associated restrictions for all shift amounts consistent with that knowledge. As a further generalization, I refactored all of the logic for all three kinds of shifts to have this capability. This works well in the above case, for example, because the dynamic shift amount can only be 0 or 1, and thus we can say a lot about the known bits of the result. This brings us to the second part of this change: Even when we know all of the bits of a value via computeKnownBits, nothing used to constant-fold the result. This introduces the necessary code into InstCombine and InstSimplify. I've added it into both because: 1. InstCombine won't automatically pick up the associated logic in InstSimplify (InstCombine uses InstSimplify, but not via the API that passes in the original instruction). 2. Putting the logic in InstCombine allows the resulting simplifications to become part of the iterative worklist 3. Putting the logic in InstSimplify allows the resulting simplifications to be used by everywhere else that calls SimplifyInstruction (inlining, unrolling, and many others). And this requires a small change to our definition of an ephemeral value so that we don't break the rest case from r246696 (where the icmp feeding the @llvm.assume, is also feeding a br). Under the old definition, the icmp would not be considered ephemeral (because it is used by the br), but this causes the assume to remove itself (in addition to simplifying the branch structure), and it seems more-useful to prevent that from happening. llvm-svn: 251146	2015-10-23 20:37:08 +00:00
Sanjoy Das	52f7b08b4a	[SCEV] Fix stylistic issue in MatchBinaryAddToConst; NFCI Instead of checking `(FlagsPresent & ExpectedFlags) != 0`, check `(FlagsPresent & ExpectedFlags) == ExpectedFlags`. Right now they're equivalent since `ExpectedFlags` can only be either `FlagNUW` or `FlagNSW`, but if we ever pass in `ExpectedFlags` as `FlagNUW \| FlagNSW` then checking `(FlagsPresent & ExpectedFlags) != 0` would be wrong. llvm-svn: 251142	2015-10-23 20:09:57 +00:00
James Molloy	05a896a8d1	[BasicAA] Bugfix for r251016 If the loaded type sizes don't match the element type of the sequential type, all bets are off and the addresses may, indeed, overlap. Surprisingly, this just got caught in one test, on one builder, out of the 30+ builders testing this change. Congratulations go to http://lab.llvm.org:8011/builders/clang-aarch64-lnt/builds/5205. llvm-svn: 251112	2015-10-23 14:17:03 +00:00
Sanjoy Das	42801100e1	[SCEV] Get rid of an unnecessary lambda; NFC llvm-svn: 251099	2015-10-23 06:57:21 +00:00
Sanjoy Das	0714e3e245	[SCEV] Fix a latent bug in `getPreStartForExtend` I could not come up a way to test this -- I think this bug is latent today, and will not actually result in a miscompile. In `getPreStartForExtend`, SCEV constructs `PreStart` as a sum of all of `SA`'s operands except `Op`. It also uses `SA`'s no-wrap flags, and this is problematic because removing an element from an add expression can make it signed-wrap. E.g. if `SA` was `(127 + 1 + -1)`, then it could safely be `<nsw>` (since `sext(127) + sext(1) + sext(-1)` == `sext(127 + 1 + -1)`), but `(127 + 1)` (== `PreStart` if `Op` is `-1`) is not `<nsw>`. Transferring `<nuw>` from `SA` to `PreStart` is safe, as far as I can tell. llvm-svn: 251097	2015-10-23 06:33:47 +00:00
Justin Bogner	f98df7a0d1	LoopPass: Remove redoLoop, it isn't used. NFC In r251064 I removed a logically unreachable call to `redoLoop`, and now there aren't any callers of this API at all. Remove the needless complexity. llvm-svn: 251067	2015-10-22 21:31:34 +00:00
Justin Bogner	35e46cdd04	LoopPass: Simplify the API for adding a new loop. NFC The insertLoop() API is only used to add new loops, and has confusing ownership semantics. Simplify it by replacing it with addLoop(). llvm-svn: 251064	2015-10-22 21:21:32 +00:00
Sanjoy Das	eeca9f6fd4	[SCEV] Commute zero extends through <nuw> additions llvm-svn: 251052	2015-10-22 19:57:38 +00:00
Sanjoy Das	6e78b17b43	[SCEV] Opportunistically interpret unsigned constraints as signed Summary: An unsigned comparision is equivalent to is corresponding signed version if both the operands being compared are positive. Teach SCEV to use this fact when profitable. Reviewers: atrick, hfinkel, reames, nlewycky Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13687 llvm-svn: 251051	2015-10-22 19:57:34 +00:00
Sanjoy Das	1123148d40	[SCEV] Teach SCEV some axioms about non-wrapping arithmetic Summary: - A s< (A + C)<nsw> if C > 0 - A s<= (A + C)<nsw> if C >= 0 - (A + C)<nsw> s< A if C < 0 - (A + C)<nsw> s<= A if C <= 0 Right now `C` needs to be a constant, but we can later generalize it to be a non-constant if needed. Reviewers: atrick, hfinkel, reames, nlewycky Subscribers: sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D13686 llvm-svn: 251050	2015-10-22 19:57:29 +00:00
Sanjoy Das	a060e602fd	[SCEV] Commute sign extends through nsw additions Summary: Depends on D13613. Reviewers: atrick, hfinkel, reames, nlewycky Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13685 llvm-svn: 251049	2015-10-22 19:57:25 +00:00
Sanjoy Das	8f27415c05	[SCEV] Mark AddExprs as nsw or nuw if legal Summary: This uses `ScalarEvolution::getRange` and not potentially control dependent `nsw` and `nuw` bits on the arithmetic instruction. Reviewers: atrick, hfinkel, nlewycky Subscribers: llvm-commits, sanjoy Differential Revision: http://reviews.llvm.org/D13613 llvm-svn: 251048	2015-10-22 19:57:19 +00:00
James Molloy	5b2a732fac	[GlobalsAA] Loosen an overly conservative bailout Instead of bailing out when we see loads, analyze them. If we can prove that the loaded-from address must escape, then we can conclude that a load from that address must escape too and therefore cannot alias a non-addr-taken global. When checking if a Value can alias a non-addr-taken global, if the Value is a LoadInst of a non-global, recurse instead of bailing. If we can follow a trail of loads up to some base that is captured, we know by inference that all the loads we followed are also captured. llvm-svn: 251017	2015-10-22 13:44:26 +00:00
James Molloy	5a4d8cd519	[BasicAA] Non-equal indices in a GEP of a SequentialType don't overlap If the final indices of two GEPs can be proven to not be equal, and the GEP is of a SequentialType (not a StructType), then the two GEPs do not alias. llvm-svn: 251016	2015-10-22 13:28:18 +00:00
James Molloy	1d88d6f289	[ValueTracking] Add a new predicate: isKnownNonEqual() isKnownNonEqual(A, B) returns true if it can be determined that A != B. At the moment it only knows two facts, that a non-wrapping add of nonzero to a value cannot be that value: A + B != A [where B != 0, addition is nsw or nuw] and that contradictory known bits imply two values are not equal. This patch also hooks this up to InstSimplify; InstSimplify had a peephole for the first fact but not the second so this teaches InstSimplify a new trick too (alas no measured performance impact!) llvm-svn: 251012	2015-10-22 13:18:42 +00:00
Chandler Carruth	2be10754a9	[AA] Enhance the new AliasAnalysis infrastructure with an optional "external" AA wrapper pass. This is a generic hook that can be used to thread custom code into the primary AAResultsWrapperPass for the legacy pass manager in order to allow it to merge external AA results into the AA results it is building. It does this by threading in a raw callback and so it is very powerful and should serve almost any use case I have come up with for extending the set of alias analyses used. The only thing not well supported here is using a different order of alias analyses. That form of extension is supportable with the new pass manager, and I can make the callback structure here more elaborate to support it in the legacy pass manager if this is a critical use case that people are already depending on, but the only use cases I have heard of thus far should be reasonably satisfied by this simpler extension mechanism. It is hard to test this using normal facilities (the built-in AAs don't use this for obvious reasons) so I've written a fairly extensive set of custom passes in the alias analysis unit test that should be an excellent test case because it models the out-of-tree users: it adds a totally custom AA to the system. This should also serve as a reasonably good example and guide for out-of-tree users to follow in order to rig up their existing alias analyses. No support in opt for commandline control is provided here however. I'm really unhappy with the kind of contortions that would be required to support that. It would fully re-introduce the analysis group self-recursion kind of patterns. =/ I've heard from out-of-tree users that this will unblock their use cases with extending AAs on top of the new infrastructure and let us retain the new analysis-group-free-world. Differential Revision: http://reviews.llvm.org/D13418 llvm-svn: 250894	2015-10-21 12:15:19 +00:00
James Molloy	17379c4ea1	[GlobalsAA] Fix a really horrible iterator invalidation bug We were keeping a reference to an object in a DenseMap then mutating it. At the end of the function we were attempting to clone that reference into other keys in the DenseMap, but DenseMap may well decide to resize its hashtable which would invalidate the reference! It took an extremely complex testcase to catch this - many thanks to Zhendong Su for catching it in PR25225. This fixes PR25225. llvm-svn: 250692	2015-10-19 08:54:59 +00:00
Elena Demikhovsky	20662e39f1	Removed parameter "Consecutive" from isLegalMaskedLoad() / isLegalMaskedStore(). Originally I planned to use the same interface for masked gather/scatter and set isConsecutive to "false" in this case. Now I'm implementing masked gather/scatter and see that the interface is inconvenient. I want to add interfaces isLegalMaskedGather() / isLegalMaskedScatter() instead of using the "Consecutive" parameter in the existing interfaces. Differential Revision: http://reviews.llvm.org/D13850 llvm-svn: 250686	2015-10-19 07:43:38 +00:00
Sanjoy Das	d295f2c7ca	[SCEV] Fix whitespace issues and remove extra braces; NFC llvm-svn: 250636	2015-10-18 00:29:27 +00:00
Sanjoy Das	f07d2a7143	[SCEV] Use std::all_of and std::any_of; NFC llvm-svn: 250635	2015-10-18 00:29:23 +00:00
Sanjoy Das	6391459069	[SCEV] Use auto where it helps remove line breaks; NFC llvm-svn: 250634	2015-10-18 00:29:20 +00:00
Sanjoy Das	d9f6d33a7f	[SCEV] Use range for loops; NFC llvm-svn: 250633	2015-10-18 00:29:16 +00:00
Manman Ren	72d44b1b09	Recommit r250345, it was reverted in r250366 to investigate a bot failure. Our internal bot is still red after r250366. llvm-svn: 250415	2015-10-15 14:59:40 +00:00
Aaron Ballman	58f413c518	Silencing a -Wtype-limits warning; an unsigned value will always be >= 0; NFC. llvm-svn: 250404	2015-10-15 13:55:43 +00:00
Manman Ren	f5499fd9d5	Temporarily revert r250345 to sort out bot failure. With r250345 and r250343, we start to observe the following failure when bootstrap clang with lto and pgo: PHI node entries do not match predecessors! %.sroa.029.3.i = phi %"class.llvm::SDNode.13298"* [ null, %30953 ], [ null, %31017 ], [ null, %30998 ], [ null, %_ZN4llvm8dyn_castINS_14ConstantSDNodeENS_7SDValueEEENS_10cast_rettyIT_T0_E8ret_typeERS5_.exit.i.1804 ], [ null, %30975 ], [ null, %30991 ], [ null, %_ZNK4llvm3EVT13getScalarTypeEv.exit.i.1812 ], [ %..sroa.029.0.i, %_ZN4llvm11SmallVectorIiLj8EED1Ev.exit.i.1826 ], !dbg !451895 label %30998 label %_ZNK4llvm3EVTeqES0_.exit19.thread.i LLVM ERROR: Broken function found, compilation aborted! I will re-commit this if the bot does not recover. llvm-svn: 250366	2015-10-15 04:58:24 +00:00
Cong Hou	b74d3b3b86	Update the branch weight metadata in JumpThreading pass. Currently in JumpThreading pass, the branch weight metadata is not updated after CFG modification. Consider the jump threading on PredBB, BB, and SuccBB. After jump threading, the weight on BB->SuccBB should be adjusted as some of it is contributed by the edge PredBB->BB, which doesn't exist anymore. This patch tries to update the edge weight in metadata on BB->SuccBB by scaling it by 1 - Freq(PredBB->BB) / Freq(BB->SuccBB). This is the third attempt to submit this patch, while the first two led to failures in some FDO tests. After investigation, it is the edge weight normalization that caused those failures. In this patch the edge weight normalization is fixed so that there is no zero weight in the output and the sum of all weights can fit in 32-bit integer. Several unit tests are added. Differential revision: http://reviews.llvm.org/D10979 llvm-svn: 250345	2015-10-14 23:14:17 +00:00
Philip Reames	ddcf6b35a2	Tighten known bits for ctpop based on zero input bits This is a cleaned up patch from the one written by John Regehr based on the findings of the Souper superoptimizer. The basic idea here is that input bits that are known zero reduce the maximum count that the intrinsic could return. We know that the number of bits required to represent a particular count is at most log2(N)+1. Differential Revision: http://reviews.llvm.org/D13253 llvm-svn: 250338	2015-10-14 22:42:12 +00:00
Manman Ren	2c8e16d507	Revert r250204 and r250240 due to bot failure. We failed to build PGO-ed clang. llvm-svn: 250264	2015-10-14 03:04:03 +00:00
Kostya Serebryany	5cb86d5a40	[asan] Disabling speculative loads under asan. Patch by Mike Aizatsky llvm-svn: 250259	2015-10-14 00:21:05 +00:00
Sanjoy Das	16e7ff171b	[SCEV] Use `SCEV::isAllOnesValue` directly; NFC. Instead of `dyn_cast` ing to `SCEVConstant` and checking the contained `ConstantInteger. llvm-svn: 250251	2015-10-13 23:28:31 +00:00
Cong Hou	7ab123a5cf	Update the branch weight metadata in JumpThreading pass. Currently in JumpThreading pass, the branch weight metadata is not updated after CFG modification. Consider the jump threading on PredBB, BB, and SuccBB. After jump threading, the weight on BB->SuccBB should be adjusted as some of it is contributed by the edge PredBB->BB, which doesn't exist anymore. This patch tries to update the edge weight in metadata on BB->SuccBB by scaling it by 1 - Freq(PredBB->BB) / Freq(BB->SuccBB). Differential revision: http://reviews.llvm.org/D10979 llvm-svn: 250204	2015-10-13 18:43:10 +00:00
James Molloy	860507f838	[GlobalsAA] Don't assume anything about functions that may be overridden Weak linkage and friends allow a symbol to be overriden outside the code generator's model, so GlobalsAA shouldn't assume that anything it can compute about such a symbol is valid. llvm-svn: 250156	2015-10-13 10:43:33 +00:00
Manman Ren	9f824dab1d	Revert 250089 due to bot failure. It failed when building clang itself with PGO. llvm-svn: 250145	2015-10-13 03:38:02 +00:00
Sanjoy Das	1ed6910338	[SCEV] Put some utilites in the ScalarEvolution class In a later commit, `SplitBinaryAdd` will be used outside `IsConstDiff`, so lift that out. And lift out `IsConstDiff` as `computeConstantDifference` to keep things clean and to avoid playing C++ access specifier games. NFC. llvm-svn: 250143	2015-10-13 02:53:27 +00:00
Cong Hou	3320bcd815	Update the branch weight metadata in JumpThreading pass. In JumpThreading pass, the branch weight metadata is not updated after CFG modification. Consider the jump threading on PredBB, BB, and SuccBB. After jump threading, the weight on BB->SuccBB should be adjusted as some of it is contributed by the edge PredBB->BB, which doesn't exist anymore. This patch tries to update the edge weight in metadata on BB->SuccBB by scaling it by 1 - Freq(PredBB->BB) / Freq(BB->SuccBB). Differential revision: http://reviews.llvm.org/D10979 llvm-svn: 250089	2015-10-12 19:44:08 +00:00
James Molloy	55d633bd60	[LoopVectorize] Shrink integer operations into the smallest type possible C semantics force sub-int-sized values (e.g. i8, i16) to be promoted to int type (e.g. i32) whenever arithmetic is performed on them. For targets with native i8 or i16 operations, usually InstCombine can shrink the arithmetic type down again. However InstCombine refuses to create illegal types, so for targets without i8 or i16 registers, the lengthening and shrinking remains. Most SIMD ISAs (e.g. NEON) however support vectors of i8 or i16 even when their scalar equivalents do not, so during vectorization it is important to remove these lengthens and truncates when deciding the profitability of vectorization. The algorithm this uses starts at truncs and icmps, trawling their use-def chains until they terminate or instructions outside the loop are found (or unsafe instructions like inttoptr casts are found). If the use-def chains starting from different root instructions (truncs/icmps) meet, they are unioned. The demanded bits of each node in the graph are ORed together to form an overall mask of the demanded bits in the entire graph. The minimum bitwidth that graph can be truncated to is the bitwidth minus the number of leading zeroes in the overall mask. The intention is that this algorithm should "first do no harm", so it will never insert extra cast instructions. This is why the use-def graphs are unioned, so that subgraphs with different minimum bitwidths do not need casts inserted between them. This algorithm works hard to reduce compile time impact. DemandedBits are only queried if there are extends of illegal types and if a truncate to an illegal type is seen. In the general case, this results in a simple linear scan of the instructions in the loop. No non-noise compile time impact was seen on a clang bootstrap build. llvm-svn: 250032	2015-10-12 12:34:45 +00:00
Tobias Grosser	374bce0c22	SCEV: Allow simple AddRec * Parameter products in delinearization This patch also allows the -delinearize pass to delinearize expressions that do not have an outermost SCEVAddRec expression. The SCEV::delinearize infrastructure allowed this since r240952, but the -delinearize pass was not updated yet. llvm-svn: 250018	2015-10-12 08:02:00 +00:00
Duncan P. N. Exon Smith	5a82c916b0	Analysis: Remove implicit ilist iterator conversions Remove implicit ilist iterator conversions from LLVMAnalysis. I came across something really scary in `llvm::isKnownNotFullPoison()` which relied on `Instruction::getNextNode()` being completely broken (not surprising, but scary nevertheless). This function is documented (and coded to) return `nullptr` when it gets to the sentinel, but with an `ilist_half_node` as a sentinel, the sentinel check looks into some other memory and we don't recognize we've hit the end. Rooting out these scary cases is the reason I'm removing the implicit conversions before doing anything else with `ilist`; I'm not at all surprised that clients rely on badness. I found another scary case -- this time, not relying on badness, just bad (but I guess getting lucky so far) -- in `ObjectSizeOffsetEvaluator::compute_()`. Here, we save out the insertion point, do some things, and then restore it. Previously, we let the iterator auto-convert to `Instruction`, and then set it back using the `Instruction` version: Instruction PrevInsertPoint = Builder.GetInsertPoint(); / Logic that may change insert point */ if (PrevInsertPoint) Builder.SetInsertPoint(PrevInsertPoint); The check for `PrevInsertPoint` doesn't protect correctly against bad accesses. If the insertion point has been set to the end of a basic block (i.e., `SetInsertPoint(SomeBB)`), then `GetInsertPoint()` returns an iterator pointing at the list sentinel. The version of `SetInsertPoint()` that's getting called will then call `PrevInsertPoint->getParent()`, which explodes horribly. The only reason this hasn't blown up is that it's fairly unlikely the builder is adding to the end of the block; usually, we're adding instructions somewhere before the terminator. llvm-svn: 249925	2015-10-10 00:53:03 +00:00
Reid Kleckner	14e773500e	[WinEH] Delete the old landingpad implementation of Windows EH The new implementation works at least as well as the old implementation did. Also delete the associated preparation tests. They don't exercise interesting corner cases of the new implementation. All the codegen tests of the EH tables have already been ported. llvm-svn: 249918	2015-10-09 23:34:53 +00:00
Artur Pilipenko	ffd132878a	ValueTracking: use getAlignment in isAligned Reviewed By: reames Differential Revision: http://reviews.llvm.org/D13517 llvm-svn: 249841	2015-10-09 15:58:26 +00:00
Sanjoy Das	648956118b	[SCEV] Call `StrengthenNoWrapFlags` after `GroupByComplexity`; NFCI The current implementation of `StrengthenNoWrapFlags` is agnostic to the order of `Ops`, so this commit should not change anything semantic. An upcoming change will make `StrengthenNoWrapFlags` sensitive to the order of `Ops`. llvm-svn: 249802	2015-10-09 02:44:45 +00:00
Sanjoy Das	413dbbb1c2	[SCEV] Bring some methods up to coding style; NFC - Start methods with lower case - Reflow a comment - Delete header comment repeated in .cpp file llvm-svn: 249716	2015-10-08 18:46:59 +00:00
Sanjoy Das	3bf22b1883	[SCEV] Remove comment repeated in cpp file; NFC llvm-svn: 249713	2015-10-08 18:28:42 +00:00
Sanjoy Das	dd70996a5c	[SCEV] Pick backedge values for phi nodes correctly Summary: `getConstantEvolutionLoopExitValue` and `ComputeExitCountExhaustively` assumed all phi nodes in the loop header have the same order of incoming values. This is not correct, and this commit changes `getConstantEvolutionLoopExitValue` and `ComputeExitCountExhaustively` to lookup the backedge value of a phi node using the loop's latch block. Unfortunately, there is still some code duplication `getConstantEvolutionLoopExitValue` and `ComputeExitCountExhaustively`. At some point in the future we should extract out a helper class / method that can evolve constant evolution phi nodes across iterations. Fixes 25060. Thanks to Mattias Eriksson for the spot-on analysis! Depends on D13457. Reviewers: atrick, hfinkel Subscribers: materi, llvm-commits Differential Revision: http://reviews.llvm.org/D13458 llvm-svn: 249712	2015-10-08 18:28:36 +00:00
Sanjay Patel	9115cf8c9d	[ValueTracking] teach computeKnownBits that a fabs() clears sign bits This was requested in D13076: if we're going to canonicalize to fabs(), ValueTracking should know that fabs() clears sign bits. In this patch (as in D13076), we're not handling vectors yet even though computeKnownBits' fabs() case itself should be vector-ready via the splat in this patch. Fixing this will require follow-on patches to correct other logic that uses 'getScalarType'. Differential Revision: http://reviews.llvm.org/D13222 llvm-svn: 249701	2015-10-08 16:56:55 +00:00
James Molloy	e9d50dc9f7	Compute demanded bits for icmp instructions Instead of bailing out when we see an icmp, we can instead at least say that if the upper bits of both operands are known zero, they are not demanded. This doesn't help with signed comparisons, but it's at least better than bailing out. llvm-svn: 249687	2015-10-08 12:40:06 +00:00
James Molloy	bcd7f0ac98	Treat Mul just like Add and Subtract Like adds and subtracts, muls ripple only to the left so we can use the same logic. While we're here, add a print method to DemandedBits so it can be used with -analyze, which we'll use in the testcase. llvm-svn: 249686	2015-10-08 12:39:59 +00:00
James Molloy	ab9fdb9226	Make demanded bits lazy The algorithm itself is still eager, but it doesn't get run until a query function is called. This greatly reduces the compile-time impact of requiring DemandedBits when at runtime it is not often used. NFCI. llvm-svn: 249685	2015-10-08 12:39:50 +00:00
Sanjoy Das	10dffcb36b	[SCEV] Check `Pred` first in isKnownPredicateViaSplitting Comparing `Pred` with `ICmpInst::ICMP_ULT` is cheaper that memory access -- do that check before loading / storing `ProvingSplitPredicate`. llvm-svn: 249654	2015-10-08 03:46:00 +00:00
Sanjoy Das	1195dbee66	[SCEV] Use `auto *` instead of `auto`; NFCI (As prescribed by the coding style document) llvm-svn: 249653	2015-10-08 03:45:58 +00:00
Mehdi Amini	044cb34bdc	Revert "Revert "This patch builds on top of D13378 to handle constant condition."" This reverts commit r249528 and reapply r249431. The fix for the fallout has been commited in r249575. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 249581	2015-10-07 18:14:25 +00:00
Sanjoy Das	4493b40002	[SCEV] Use some C++11'ism, NFC Summary: Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13457 llvm-svn: 249574	2015-10-07 17:38:25 +00:00
Artur Pilipenko	d94903c9f8	Teach computeKnownBits to use new align attribute/metadata Reviewed By: reames Differential Revision: http://reviews.llvm.org/D13470 llvm-svn: 249557	2015-10-07 16:01:18 +00:00
James Molloy	47efaeb36e	Revert "This patch builds on top of D13378 to handle constant condition." This reverts commit r249431. This caused failures in sqlite3: http://lab.llvm.org:8011/builders/clang-native-arm-lnt/builds/14453 llvm-svn: 249528	2015-10-07 09:03:34 +00:00
Hans Wennborg	083ca9bb32	Fix Clang-tidy modernize-use-nullptr warnings in source directories and generated files; other minor cleanups. Patch by Eugene Zelenko! Differential Revision: http://reviews.llvm.org/D13321 llvm-svn: 249482	2015-10-06 23:24:35 +00:00
Joseph Tremoulet	2afea5438f	[WinEH] Recognize CoreCLR personality function Summary: - Add CoreCLR to if/else ladders and switches as appropriate. - Rename isMSVCEHPersonality to isFuncletEHPersonality to better reflect what it captures. Reviewers: majnemer, andrew.w.kaylor, rnk Subscribers: pgavlin, AndyAyers, llvm-commits Differential Revision: http://reviews.llvm.org/D13449 llvm-svn: 249455	2015-10-06 20:28:16 +00:00
Philip Reames	675418ebc0	Extend known bits to understand @llvm.bswap This is a cleaned up patch from the one written by John Regehr based on the findings of the Souper superoptimizer. When writing tests, I was surprised to find that instsimplify apparently doesn't know how to collapse bit test sequences based purely on known bits. This required me to split my tests across both instsimplify and instcombine. Differential Revision: http://reviews.llvm.org/D13250 llvm-svn: 249453	2015-10-06 20:20:45 +00:00
Philip Reames	600a91580f	Fix pr25040 - Handle vectors of i1s in recently added implication code As mentioned in the bug, I'd missed the presence of a getScalarType in the caller of the new implies method. As a result, when we ended up with a implication over two vectors, we'd trip an assert and crash. Differential Revision: http://reviews.llvm.org/D13441 llvm-svn: 249442	2015-10-06 19:00:02 +00:00
Mehdi Amini	cf2513b352	This patch builds on top of D13378 to handle constant condition. With this patch, clang -O3 optimizes correctly providing > 1000x speedup on this artificial benchmark): for (a=0; a<n; a++) for (b=0; b<n; b++) for (c=0; c<n; c++) for (d=0; d<n; d++) for (e=0; e<n; e++) for (f=0; f<n; f++) x++; From test-suite/SingleSource/Benchmarks/Shootout/nestedloop.c Reviewers: sanjoyd Differential Revision: http://reviews.llvm.org/D13390 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 249431	2015-10-06 17:19:20 +00:00
Sanjoy Das	1cd930b05f	Try to appease MSVC, NFCI. This time by lifting the lambda's in `createNodeFromSelectLikePHI` to the file scope. Looks like there are differences in capture rules between clang and MSVC? llvm-svn: 249222	2015-10-03 00:34:19 +00:00
Sanjoy Das	21ea9bdc46	Try to appease the MSVC bots, NFCI. llvm-svn: 249219	2015-10-03 00:03:15 +00:00
Sanjoy Das	5b92acea2b	Try to appease the MSVC bots, NFC. llvm-svn: 249216	2015-10-02 23:43:32 +00:00
Sanjoy Das	55015d210f	[SCEV] Recognize simple br-phi patterns Summary: Teach SCEV to match patterns like ``` br %cond, label %left, label %right left: br label %merge right: br label %merge merge: V = phi [ %x, %left ], [ %y, %right ] ``` as "select %cond, %x, %y". Before this SCEV would match PHI nodes exclusively to add recurrences. This addresses PR25005. Reviewers: joker.eph, joker-eph, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13378 llvm-svn: 249211	2015-10-02 23:09:44 +00:00
Piotr Padlewski	dc9b2cfc50	inariant.group handling in GVN The most important part required to make clang devirtualization works ( ͡°͜ʖ ͡°). The code is able to find non local dependencies, but unfortunatelly because the caller can only handle local dependencies, I had to add some restrictions to look for dependencies only in the same BB. http://reviews.llvm.org/D12992 llvm-svn: 249196	2015-10-02 22:12:22 +00:00
Sanjoy Das	d0671346ae	[SCEV] Refactor out a createNodeForSelect Summary: We will shortly re-use this for select-like br-phi pairs. Reviewers: atrick, joker-eph, joker.eph Subscribers: sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D13377 llvm-svn: 249177	2015-10-02 19:39:59 +00:00
Sanjoy Das	7d910f2b11	[SCEV] Try to prove predicates by splitting them Summary: This change teaches SCEV that to prove `A u< B` it is sufficient to prove each of these facts individually: - B >= 0 - A s< B - A >= 0 In practice, SCEV sometimes finds it easier to prove these facts individually than to prove `A u< B` as one atomic step. Reviewers: reames, atrick, nlewycky, hfinkel Subscribers: sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D13042 llvm-svn: 249168	2015-10-02 18:50:30 +00:00
Artur Pilipenko	029d8531e6	Refactor computeKnownBits alignment handling code Reviewed By: reames, hfinkel Differential Revision: http://reviews.llvm.org/D12958 llvm-svn: 248892	2015-09-30 11:55:45 +00:00
Igor Laevsky	cea9ede74e	[ValueTracking] Lower dom-conditions-dom-blocks and dom-conditions-max-uses thresholds On some of our benchmarks this change shows about 50% compile time improvement without any noticeable performance difference. Differential Revision: http://reviews.llvm.org/D13248 llvm-svn: 248801	2015-09-29 14:57:52 +00:00
James Molloy	897048bee3	[ValueTracking] Teach isKnownNonZero about monotonically increasing PHIs If a PHI starts at a non-negative constant, monotonically increases (only adds of a constant are supported at the moment) and that add does not wrap, then the PHI is known never to be zero. llvm-svn: 248796	2015-09-29 14:08:45 +00:00
Sanjoy Das	4f1c45952c	[SCEV] Don't crash on pointer comparisons `ScalarEvolution::isImpliedCondOperandsViaNoOverflow` tries to cast the operand type of the comparison it is given to an `IntegerType`. This is incorrect because it could actually be simplifying a comparison between two pointers. Switch it to using `getTypeSizeInBits` instead, which does the right thing for both pointers and integers. Fixed PR24956. llvm-svn: 248743	2015-09-28 21:14:32 +00:00

1 2 3 4 5 ...

5869 Commits