llvm-project

Commit Graph

Author	SHA1	Message	Date
Teresa Johnson	2b60384581	[ThinLTO] Add parentheses as per build warning Fixes a warning about "||" and "&&" due to r291108. llvm-svn: 291119	2017-01-05 15:10:10 +00:00
Chad Rosier	3ccd1dffff	[AArch64] Remove mcpu option as this test is not target specific. NFC. llvm-svn: 291117	2017-01-05 15:05:03 +00:00
Tony Jiang	3a2f00b024	[PowerPC] Implement missing ISA 2.06 instructions. Instructions: fctidu[.], fctiwu[.], ftdiv, ftsqrt are not implemented. Implement them and add corresponding test cases in this patch. llvm-svn: 291116	2017-01-05 15:00:45 +00:00
Teresa Johnson	e27b058de3	[ThinLTO] Use DenseSet instead of SmallPtrSet for holding GUIDs Should fix some more bot failures from r291108. This should have been a DenseSet, since GUID is not a pointer type. It caused some bots to fail, but for some reason I wasn't getting a build failure. llvm-svn: 291115	2017-01-05 14:59:56 +00:00
Simon Pilgrim	fd93a54fc8	Wdocumentation fix llvm-svn: 291114	2017-01-05 14:58:54 +00:00
Chad Rosier	e1dc73d9a7	[AArch64] Remove unused arguments from tests. NFC. llvm-svn: 291112	2017-01-05 14:48:53 +00:00
Teresa Johnson	01e7236748	[ThinLTO] Update new ModuleSummaryIndexYAML.h for r291108 Should fix bot failures due to r291108 which happened due to a change required in ModuleSummaryIndexYAML.h which was just added in r291069. llvm-svn: 291111	2017-01-05 14:40:15 +00:00
Simon Pilgrim	a62395a4bd	[CostModel][X86] Pulled out common type legalization code llvm-svn: 291109	2017-01-05 14:33:32 +00:00
Teresa Johnson	519465b993	[ThinLTO] Subsume all importing checks into a single flag Summary: This adds a new summary flag NotEligibleToImport that subsumes several existing flags (NoRename, HasInlineAsmMaybeReferencingInternal and IsNotViableToInline). It also subsumes the checking of references on the summary that was being done during the thin link by eligibleForImport() for each candidate. It is much more efficient to do that checking once during the per-module summary build and record it in the summary. Reviewers: mehdi_amini Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28169 llvm-svn: 291108	2017-01-05 14:32:16 +00:00
Mohammed Agabaria	23599ba794	Currently isLikelyComplexAddressComputation tries to figure out if the given stride seems to be 'complex' and needs some extra cost for address computation handling. This code seems to be target dependent which may not be the same for all targets. Passed the decision whether the given stride is complex or not to the target by sending stride information via SCEV to getAddressComputationCost instead of 'IsComplex'. Specifically at X86 targets we don't see any significant address computation cost in case of the strided access in general. Differential Revision: https://reviews.llvm.org/D27518 llvm-svn: 291106	2017-01-05 14:03:41 +00:00
Kristof Beyls	a983e7c4a4	[GlobalISel] Add support for address-taken basic blocks To make this work, pointers from the MachineBasicBlock to the LLVM-IR-level basic blocks need to be initialized, as the AsmPrinter uses this link to be able to print out labels for the basic blocks that are address-taken. Most of the changes in this commit are about adapting existing tests to include the basic block name that is now printed out in the MIR format, now that the name becomes available as the link to the LLVM-IR basic block is initialized. The relevant test change for the functionality added in this patch are the added "(address-taken)" strings in test/CodeGen/AArch64/GlobalISel/arm64-irtranslator.ll. Differential Revision: https://reviews.llvm.org/D28123 llvm-svn: 291105	2017-01-05 13:27:52 +00:00
Anmol P. Paralkar	3480e83118	[doc] Fix minor grammatical error in Phabricator.rst Summary: Test commit, fix minor grammatical error in Phabricator.rst Reviewers: delcypher Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28214 llvm-svn: 291101	2017-01-05 13:08:14 +00:00
Kristof Beyls	eced071e88	[GlobalISel] Add support for switch statements This commit does this using a trivial chain of conditional branches. In the future, we probably want to reuse the optimized switch lowering used in SelectionDAG. Differential Revision: https://reviews.llvm.org/D28176 llvm-svn: 291099	2017-01-05 11:28:51 +00:00
Kristof Beyls	2252440b81	[GlobalISel] Fix AArch64 ICMP instruction selection Differential Revision: https://reviews.llvm.org/D28175 llvm-svn: 291097	2017-01-05 10:16:08 +00:00
Mohammed Agabaria	189e2d29ba	[Test Commit] fixing some format issue in X86TTI to match clang-format output. llvm-svn: 291095	2017-01-05 09:51:02 +00:00
Elena Demikhovsky	143cbc425b	AVX-512: Optimized pattern for truncate with unsigned saturation. DAG patterns optimization: truncate + unsigned saturation supported by VPMOVUS* instructions in AVX-512. Differential revision: https://reviews.llvm.org/D28216 llvm-svn: 291092	2017-01-05 08:21:09 +00:00
Saleem Abdulrasool	9b9e86b4bd	test: remove unnecessary triple argument This test is entirely target agnostic. Avoid the triple to repair the build bots. llvm-svn: 291088	2017-01-05 06:30:12 +00:00
Craig Topper	33c544bdb0	[X86] Add Intel Kaby Lake model numbers to getHostCPUName aliased to "skylake" since there are no feature differences. Model numbers found here http://www.sandpile.org/x86/cpuid.htm llvm-svn: 291086	2017-01-05 05:57:27 +00:00
Saleem Abdulrasool	6252bd8eac	MC: support passing search paths to the IAS This is needed to support inclusion in inline assembly via the `.include` directive. llvm-svn: 291085	2017-01-05 05:56:39 +00:00
Craig Topper	1ab35fa7a8	[X86] Change getHostCPUName to report Intel model 0x4e as "skylake" instead of "skylake-avx512". Add the proper 0x55 model for "skylake-avx512". Summary: Intel's i5-6300U CPU is reporting to have a model id of 78 (4e). The Host detection assumes that to be Skylake Xeon (with AVX512 support), instead of a normal Skylake machine. Patch by: Valentin Churavy Reviewers: nalimilan, craig.topper Subscribers: hfinkel, tkelman, craig.topper, nalimilan, llvm-commits Differential Revision: https://reviews.llvm.org/D28221 llvm-svn: 291084	2017-01-05 05:47:29 +00:00
Peter Collingbourne	192f0b66d2	Tentative fix for modules build. llvm-svn: 291079	2017-01-05 04:40:09 +00:00
Kostya Serebryany	2648243ebd	[libFuzzer] use /tmp (or $TMPDIR, if present) to store temp files during merge llvm-svn: 291078	2017-01-05 04:32:19 +00:00
Peter Collingbourne	bbd3490bcc	Fix build bots. llvm-svn: 291073	2017-01-05 04:00:09 +00:00
Peter Collingbourne	b2ce2b6805	IR: Module summary representation for type identifiers; summary test scaffolding for lowertypetests. Set up basic YAML I/O support for module summaries, plumb the summary into the pass and add a few command line flags to test YAML I/O support. Bitcode support to come separately, as will the code in LowerTypeTests that actually uses the summary. Also add a couple of tests that pass by virtue of the pass doing nothing with the summary (which happens to be the correct thing to do for those tests). Differential Revision: https://reviews.llvm.org/D28041 llvm-svn: 291069	2017-01-05 03:39:00 +00:00
Richard Smith	d4d575b955	Revert r291025 ("AMDGPU: Remove unneccessary intermediate vector") This caused buildbot failures due to returning ArrayRefs referencing local (temporary) objects. llvm-svn: 291067	2017-01-05 03:13:10 +00:00
Chandler Carruth	b2f3a81a92	[PM] Fix a typo in a comment that Davide spotted in another code review. llvm-svn: 291066	2017-01-05 03:10:26 +00:00
Chandler Carruth	4a23563c58	[gtest] Work around broken installs of libc++ where we don't have a cxxabi.h in the include search paths. This comes up when libc++ is installed with some other abi library. At some points in time in history we have had CMake hackery to try and get a cxxabi.h installed that would work, but there are lots of examples lacking this. Also, the just-built tree with libc++ seems to not quite get this right. To let folks make progress, we can easily work around this by detecting that the header is missing and disabling the relevant parts of gtest. This should fix the last remaining build bot failures. While these failures are typically indicative of a questionable install, I don't think gtest should be the thing that surfaces those issues and I don't want folks blocked on this. llvm-svn: 291063	2017-01-05 01:41:49 +00:00
Craig Topper	eea52429cd	[AVX-512] Update vextract64x4 intrinsic upgrade test cases to use a legal immediate so they test the instruction selection correctly. llvm-svn: 291061	2017-01-05 01:34:55 +00:00
Mehdi Amini	87ea8c60a6	Mark test that is testing statistics output as requiring Assertions We only enable statistic in an assert build by default. llvm-svn: 291044	2017-01-05 01:08:01 +00:00
Sanjay Patel	95faecb766	[InstSimplify] add tests to show missing select simplifications; NFC llvm-svn: 291043	2017-01-05 00:40:52 +00:00
Justin Lebar	7d754c9054	[PM] Edit comments in PassManager.h. Summary: This covers most of PassManager.h, up to the introduction of inner/outer analysis proxies. If there's a theme to these changes, it's simplifying the language. For example: * PreservedAnalyses is a "set of analyses", not an "abstract set". "Abstract" doesn't have any particular meaning here. * "Build types for the concept types" becomes "define the concept types". * Instead of "data structures optimized for pointer-like types using the alignment-provided low bits", say "data structures that use the low bits of pointers." * "Clear the map pointing into the results list" becomes "Delete the map entries that point into the results list." This patch also fixes a few places where we referred to "function" and "module" pass/analysis managers, instead of the more abstract "IRUnitT" PM/AMs we have now. Subscribers: mehdi_amini Differential Revision: https://reviews.llvm.org/D27367 llvm-svn: 291040	2017-01-05 00:12:51 +00:00
Reid Kleckner	3dcb61f0bb	Patch gtest to move GTEST_IS_THREADSAFE out of unrelated GTEST_HAS_SEH ifdef Fixes the sanitizer Windows build, which happens to set -DGTEST_HAS_SEH=0. llvm-svn: 291038	2017-01-05 00:00:05 +00:00
Wolfgang Pieb	ce13e716c5	[DWARF] Null out the debug locs of load instructions that have been moved by GVN performing partial redundancy elimination (PRE). Not doing so can cause jumpy line tables and confusing (though correct) source attributions. Differential Revision: https://reviews.llvm.org/D27857 llvm-svn: 291037	2017-01-04 23:58:26 +00:00
Chandler Carruth	dd9c27b3bf	[gtest] Fix the way we disable a warning for unittests. I somehow wrote this fix and then lost it prior to commit. Really sorry about the noise. This should fix some issues with hacking add_definition to do things with warning flags. llvm-svn: 291033	2017-01-04 23:40:06 +00:00
Chandler Carruth	a977582dea	[gtest] Upgrade googletest to version 1.8.0, minimizing local changes. This required re-working the streaming support and lit's support for '--gtest_list_tests' but otherwise seems to be a clean upgrade. Differential Revision: https://reviews.llvm.org/D28154 llvm-svn: 291029	2017-01-04 23:06:03 +00:00
Mehdi Amini	19ef4fad91	Use lazy-loading of Metadata in MetadataLoader when importing is enabled (NFC) Summary: This is a relatively simple scheme: we use the index emitted in the bitcode to avoid loading all the global metadata. Instead we load the index with their position in the bitcode so that we can load each of them individually. Materializing the global metadata block in this condition only triggers loading the named metadata, and the ones referenced from there (transitively). When materializing a function, metadata from the global block are loaded lazily as they are referenced. Two main current limitations are: 1) Global values other than functions are not materialized on demand, so we need to eagerly load METADATA_GLOBAL_DECL_ATTACHMENT records (and their transitive dependencies). 2) When we load a single metadata, we don't recurse on the operands, instead we use a placeholder or a temporary metadata. Unfortunately temporary nodes are very expensive. This is why we don't have it always enabled and only for importing. These two limitations can be lifted in a subsequent improvement if needed. With this change, the total link time of opt with ThinLTO and Debug Info enabled is going down from 282s to 224s (~20%). Reviewers: pcc, tejohnson, dexonsmith Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28113 llvm-svn: 291027	2017-01-04 22:54:33 +00:00
Mehdi Amini	867aad1359	Change BitstreamCursor::skipRecord to return the record code (NFC) llvm-svn: 291026	2017-01-04 22:54:14 +00:00
Matt Arsenault	6796d7ea8b	AMDGPU: Remove unneccessary intermediate vector llvm-svn: 291025	2017-01-04 22:54:10 +00:00
David Blaikie	4dc96fceeb	Fixup some header includes from recent IntrusiveRefCntPtr cleanup. llvm-svn: 291024	2017-01-04 22:52:00 +00:00
Justin Lebar	57184446f9	[ADT] Attempt to fix GCC warning in IntrusiveRefCntPtrTest. Our copy constructor doesn't explicitly invoke the base class's constructor, and GCC is (rightly) concerned. llvm-svn: 291023	2017-01-04 22:49:55 +00:00
Matt Arsenault	3bdd75d01e	InstCombine: Fold cos(-x) -> cos(x) Also cos(fabs(x)) -> cos(x) llvm-svn: 291022	2017-01-04 22:49:03 +00:00
David Blaikie	7ad9dc11db	Reapply "Make BitCodeAbbrev ownership explicit using shared_ptr rather than IntrusiveRefCntPtr"" If this is a problem for anyone (shared_ptr is two pointers in size, whereas IntrusiveRefCntPtr is 1 - and the ref count control block that make_shared adds is probably larger than the one int in RefCountedBase) I'd prefer to address this by adding a lower-overhead version of shared_ptr (possibly refactoring IntrusiveRefCntPtr into such a thing) to avoid the intrusiveness - this allows memory ownership to remain orthogonal to types and at least to me, seems to make code easier to understand (since no implicit ownership acquisition can happen). This recommits 291006, reverted in r291007. llvm-svn: 291016	2017-01-04 22:36:33 +00:00
Tim Shen	5480eb8445	[Legalizer] Fix fp-to-uint to fp-to-sint promotion assertion. Summary: When promoting fp-to-uint16 to fp-to-sint32, the result is actually zero extended. For example, given double 65534.0, without legalization: fp-to-uint16: 65534.0 -> 0xfffe With the legalization: fp-to-sint32: 65534.0 -> 0x0000fffe Without this patch, legalization wrongly emits a signed extend assertion, which is consumed by a later icmp instruction and causes a miscompile. Note that the floating point value must be in [0, 65535), otherwise the behavior is undefined. This patch reverts r279223 behavior and adds more tests and documentation. In PR29041's context, James Molloy mentioned that: We don't need to mask because conversion from float->uint8_t is undefined if the integer part of the float value is not representable in uint8_t. Therefore we can assume this doesn't happen! which is totally true and good, because fptoui is documented clearly to have undefined behavior when overflow/underflow happens. We should take advantage of this behavior so that we can save unnecessary mask instructions. Reviewers: jmolloy, nadav, echristo, kbarton Subscribers: mehdi_amini, nemanjai, llvm-commits Differential Revision: https://reviews.llvm.org/D28284 llvm-svn: 291015	2017-01-04 22:11:42 +00:00
David Blaikie	e950602d05	Fix some buildbot issues with const objects with default ctors llvm-svn: 291013	2017-01-04 21:59:22 +00:00
Evgeny Stupachenko	c88697dc16	The patch fixes (base, index, offset) match. Summary: Instead of matching: (a + i) + 1 -> (a + i, undef, 1) Now it matches: (a + i) + 1 -> (a, i, 1) Reviewers: rengolin Differential Revision: http://reviews.llvm.org/D26367 From: Evgeny Stupachenko <evstupac@gmail.com> llvm-svn: 291012	2017-01-04 21:43:39 +00:00
Chad Rosier	63687e40bc	[AArch64] Update the feature set for Qualcomm's Falkor CPU. llvm-svn: 291010	2017-01-04 21:26:23 +00:00
Michael Kuperstein	f381f35977	Add positive test for sqrt "partial inlining". NFC. llvm-svn: 291009	2017-01-04 21:24:56 +00:00
Nirav Dave	0f9d111f97	[AArch64] Fix over-eager early-exit in load-store combiner Fix early-exit analysis for memory operation pairing when operations are not emitted in ascending order. Reviewers: mcrosier, t.p.northover Subscribers: aemerson, rengolin, llvm-commits Differential Revision: https://reviews.llvm.org/D28251 llvm-svn: 291008	2017-01-04 21:21:46 +00:00
David Blaikie	6e2207a134	Revert "Make BitCodeAbbrev ownership explicit using shared_ptr rather than IntrusiveRefCntPtr" Breaks Clang's use of bitcode. Reverting until I have a fix to go with it there. This reverts commit r291006. llvm-svn: 291007	2017-01-04 21:19:28 +00:00
David Blaikie	daff78cd87	Make BitCodeAbbrev ownership explicit using shared_ptr rather than IntrusiveRefCntPtr If this is a problem for anyone (shared_ptr is two pointers in size, whereas IntrusiveRefCntPtr is 1 - and the ref count control block that make_shared adds is probably larger than the one int in RefCountedBase) I'd prefer to address this by adding a lower-overhead version of shared_ptr (possibly refactoring IntrusiveRefCntPtr into such a thing) to avoid the intrusiveness - this allows memory ownership to remain orthogonal to types and at least to me, seems to make code easier to understand (since no implicit ownership acquisition can happen). llvm-svn: 291006	2017-01-04 21:13:35 +00:00
David Blaikie	2ff18584a9	Remove unnecessary intrusive ref counting in favor of std::shared_ptr/make_shared The intrusive nature of the reference counting is not required/used here, so simplify the ownership model to make the code easier to understand. llvm-svn: 291005	2017-01-04 21:13:28 +00:00
Michael Kuperstein	020af9c258	Remove accidentally target-dependent test and pacify bots. llvm-svn: 291004	2017-01-04 21:08:53 +00:00
Hal Finkel	b2f951d87a	[PowerPC] Fix logic dealing with nop after calls (and tail-call eligibility) This change aims to unify and correct our logic for when we need to allow for the possibility of the linker adding a TOC restoration instruction after a call. This comes up in two contexts: 1. When determining tail-call eligibility. If we make a tail call (i.e. directly branch to a function) then there is no place for the linker to add a TOC restoration. 2. When determining when we need to add a nop instruction after a call. Likewise, if there is a possibility that the linker might need to add a TOC restoration after a call, then we need to put a nop after the call (the bl instruction). First problem: We were using similar, but different, logic to decide (1) and (2). This is just wrong. Both the resideInSameModule function (used when determining tail-call eligibility) and the isLocalCall function (used when deciding if the post-call nop is needed) were supposed to be determining the same underlying fact (i.e. might a TOC restoration be needed after the call). The same logic should be used in both places. Second problem: The logic in both places was wrong. We only know that two functions will share the same TOC when both functions come from the same section of the same object. Otherwise the linker might cause the functions to use different TOC base addresses (unless the multi-TOC linker option is disabled, in which case only shared-library boundaries are relevant). There are a number of factors that can cause functions to be placed in different sections or come from different objects (-ffunction-sections, explicitly-specified section names, COMDAT, weak linkage, etc.). All of these need to be checked. The existing logic only checked properties of the callee, but the properties of the caller must also be checked (for example, calling from a function in a COMDAT section means calling between sections). 
There was a conceptual error in the resideInSameModule function in that it allowed tail calls to functions with weak linkage and protected/hidden visibility. While protected/hidden visibility does prevent the function implementation from being replaced at runtime (via interposition), it does not prevent the linker from using an alternate implementation at link time (i.e. using some strong definition to replace the provided weak one during linking). If this happens, then we're still potentially looking at a required TOC restoration upon return. Otherwise, in general, the post-call nop is needed wherever ELF interposition needs to be supported. We don't currently support ELF interposition at the IR level (see http://lists.llvm.org/pipermail/llvm-dev/2016-November/107625.html for more information), and I don't think we should try to make it appear to work in the backend in spite of that fact. Unfortunately, because of the way that the ABI works, we need to generate code as if we supported interposition whenever the linker might insert stubs for the purpose of supporting it. Differential Revision: https://reviews.llvm.org/D27231 llvm-svn: 291003	2017-01-04 21:05:13 +00:00
Daniel Berlin	6cc5e44068	NewGVN: Track the maximum number of iterations GVN takes on any function, so we can pinpoint performance issues. llvm-svn: 291002	2017-01-04 21:01:02 +00:00
Michael Kuperstein	fc74da13a9	Add positive test for sqrt "partial inlining". NFC. llvm-svn: 291001	2017-01-04 20:48:30 +00:00
Davide Italiano	6309895770	[lib/LTO] Simplify logic removing set but unused variable. NFCI. Reported by David Binderman and ack'ed by Teresa on IRC. PR: 31527 llvm-svn: 291000	2017-01-04 20:37:57 +00:00
Peter Collingbourne	efdff71b05	YAML: Remove Input::MapHNode::isValidKey(), use llvm::is_contained() instead. NFC. llvm-svn: 290999	2017-01-04 20:10:43 +00:00
Eric Christopher	568c113ac0	Remove dead and unused variable NumSentinelElements. Fixes PR31529. llvm-svn: 290998	2017-01-04 20:05:18 +00:00
Eric Christopher	0192e97911	Remove dead variable Len. Fixes PR31528 llvm-svn: 290995	2017-01-04 19:47:10 +00:00
Tobias Grosser	9d88b858c8	Add missing CHECK: line to test case added in 29097 Without this CHECK line, we may not detect incorrectly detected additional regions at the end of the region tree. llvm-svn: 290994	2017-01-04 19:35:38 +00:00
David Blaikie	e988e7f22a	ADT: IntrusiveRefCntPtr: Broaden the definition of correct usage of RefCountedBase This roughly matches the semantics of std::enable_shared_from_this - that it does not dictate the ownership model of all users, but constrains those users taking advantage of the intrusive nature to do so only when there's a guarantee that that's the ownership model being used for the object being passed. Reviewers: jlebar Differential Revision: https://reviews.llvm.org/D28245 llvm-svn: 290987	2017-01-04 18:57:31 +00:00
Sanjay Patel	c03f70fcf6	fix comment formatting; NFC llvm-svn: 290980	2017-01-04 18:16:43 +00:00
Jan Vesely	d48445d513	AMDGPU/SI: Implement sendmsghalt intrinsic v2: expose using amdgcn prefix Differential Revision: https://reviews.llvm.org/D23511 llvm-svn: 290977	2017-01-04 18:06:55 +00:00
Tobias Grosser	8ab80ba3a2	RegionInfo: add new test case This test case has been reduced from test/Analysis/RegionInfo/mix_1.ll and provides us with a minimal example of a test case which caused problems while working on an improved version of the RegionInfo analysis. We upstream this test case, as it certainly can be helpful in future debugging and optimization tests. Test case reduced by Pratik Bhatu <cs12b1010@iith.ac.in> llvm-svn: 290974	2017-01-04 17:50:15 +00:00
Robert Lougher	5bf0416f45	Reapply "[SimplifyCFG] In sinkLastInstruction correctly set debugloc of common inst" This reapplies r289828 (reverted in r289833 as it broke the address sanitizer). The debugloc is now only set when the instruction is not a call, as this causes the verifier to assert (the inliner requires an inlinable callsite to have a debug loc if the caller and callee have debug info). Original commit message: Simplify CFG will try to sink the last instruction in a series of basic blocks, creating a "common" instruction in the successor block (sinkLastInstruction). When it does this, the debug location of the single instruction should be the merged debug locations of the commoned instructions. Original review: https://reviews.llvm.org/D27590 llvm-svn: 290973	2017-01-04 17:40:32 +00:00
Simon Pilgrim	6cfb5caf05	Revert r290970 [SLPVectorizer] Regenerate test. The check script will use var names before they are declared, which filecheck doesn't like. llvm-svn: 290971	2017-01-04 16:12:07 +00:00
Simon Pilgrim	4629b46bba	[SLPVectorizer] Regenerate test. Missed var name llvm-svn: 290970	2017-01-04 16:01:55 +00:00
Simon Pilgrim	1d5b0377af	Regenerate test. llvm-svn: 290969	2017-01-04 15:52:41 +00:00
Asiri Rathnayake	9670051657	Fix x86 gold tests on non-x86 targets. These tests are missing a target triple and the -m elf_x86_64 gold option, which makes them fail on non-x86 targets. Differential revision: https://reviews.llvm.org/D28285 Reviewers: tejohnson llvm-svn: 290965	2017-01-04 14:43:51 +00:00
Teresa Johnson	0fca905cb3	[ThinLTO] Rework llvm-link to use the FunctionImporter Summary: Change llvm-link to use the FunctionImporter handling, instead of manually invoking the Linker. We still need to load the module in llvm-link to do the desired testing for invalid import requests (weak functions), and to get the GUID (in case the function is local). Also change the drop-debug-info test to use llvm-link so that importing is forced (in order to test debug info handling) and independent of import logic changes. Reviewers: mehdi_amini Subscribers: mgorny, llvm-commits, aprantl Differential Revision: https://reviews.llvm.org/D28277 llvm-svn: 290964	2017-01-04 14:27:31 +00:00
Davide Italiano	db00939403	[SPARC] Fix test so that it checks the correct label. Before it wasn't checking anything. llvm-svn: 290963	2017-01-04 14:01:58 +00:00
Simon Pilgrim	bb895f3e9c	[CostModel][X86] Updated vXi8 and vXi16 Reverse/Alternate shuffle costs Actual codegen is much better than the extract+insert patterns that were assumed. llvm-svn: 290962	2017-01-04 14:01:33 +00:00
Nemanja Ivanovic	c08b90d08f	[PowerPC] Add identification for POWER8NVL This CPU type was not previously recognized by LLVM which led to emitting poor (and sometimes incorrect) code in some JIT workloads on such a machine. llvm-svn: 290961	2017-01-04 13:58:09 +00:00
Davide Italiano	039368e2d2	[MC/COFF] Fix a test to actually check the relocation. Inspired by r290953 + grep -R 'CHCEK'. llvm-svn: 290958	2017-01-04 13:12:00 +00:00
Simon Pilgrim	939b8cd708	[X86] Merged Reverse/Alternate shuffle cost tables. NFCI. As discussed on D27811, merged the shuffle cost LUTs and use the shuffle kind to perform the lookup instead of the ISD opcode. llvm-svn: 290956	2017-01-04 12:08:41 +00:00
Florian Hahn	5815f6c53c	[framelowering] Skip dbg values when getting next/previous instruction. Summary: In mergeSPUpdates, debug values need to be ignored when getting the previous element, otherwise debug data could have an impact on codegen. In eliminateCallFramePseudoInstr, debug values after the erased element could have an impact on codegen and should be skipped. Closes PR31319 (https://llvm.org/bugs/show_bug.cgi?id=31319) Reviewers: aprantl, MatzeB, mkuper Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D27688 llvm-svn: 290955	2017-01-04 12:08:35 +00:00
Chandler Carruth	2eb065035b	[ADT] Speculative attempt to fix build bot issues with r290952. This just removes the usage of llvm::reverse and llvm::seq. That makes it harder to handle the empty case correctly and so I've also added a test there. This is just a shot in the dark at what might be behind the buildbot failures. I can't reproduce any issues locally including with ASan... I feel like I'm missing something... llvm-svn: 290954	2017-01-04 11:40:18 +00:00
Chandler Carruth	96809ae7ea	[Inliner] Fix a test where I typo'ed 'CHECK' as 'CHCEK' when converting to FileCheck. Fortunately, it passes. =] Spotted in review by Bob Wilson! llvm-svn: 290953	2017-01-04 11:15:01 +00:00
Chandler Carruth	ac458ba9af	[ADT] Enhance the PriorityWorklist to support bulk insertion. This is both convenient and more efficient as we can skip any intermediate reallocation of the vector. This usage pattern came up in a subsequent patch on the pass manager, but it seems generically useful so I factored it out and added unittests here. llvm-svn: 290952	2017-01-04 11:13:11 +00:00
Bjorn Pettersson	3c6ce733f5	Fix for InlineSpiller accessing not updated dom tree base information. Summary: The InlineSpiller was accessing the DominatorTreeBase directly through the public data member DT in the MachineDominatorTree. This is not a good idea as the "cached" information in SplitCriticalEdges is not applied before the access. The DominatorTreeBase must be accessed through the member function getBase() in MachineDominatorTree. The fault was introduced in r266162. I think the public data member DT in the MachineDominatorTree should have been made private in the original code (r215576) that introduced the concept of lazily updating the MachineDominatorTree information from MachineBasicBlock::SplitCriticalEdge(). Patch by Karl-Johan Karlsson <karl-johan.karlsson@ericsson.com> Reviewers: wmi, qcolombet Subscribers: llvm-commits, bjope, uabelho Differential Revision: https://reviews.llvm.org/D27983 llvm-svn: 290950	2017-01-04 09:41:56 +00:00
Nitesh Jain	b0bc573ca8	[LLC][MIPS] Fix crash after enabling LLVM_ENABLE_EXPENSIVE_CHECKS Reviewers: sdardis, vkalintiris Subscribers: jaydeep, slthakur, RKSimon, llvm-commits Differential Revision: https://reviews.llvm.org/D27841 llvm-svn: 290949	2017-01-04 09:34:37 +00:00
Ayman Musa	02f9533823	[X86][AVX512] Passing the appropriate memory operand class to INT_{U}COMIS{S|D} instructions Replacing the memory operand in the intrinsic versions of the comis/ucomis instructions from f128mem to ssmem/sdmem accordingly. Differential Revision: https://reviews.llvm.org/D28138 llvm-svn: 290948	2017-01-04 08:21:54 +00:00
Simon Pilgrim	c76ea4b638	[X86] Attempt to pre-truncate arithmetic operations if useful In some cases it's more efficient to combine TRUNC( BINOP( X, Y ) ) --> BINOP( TRUNC( X ), TRUNC( Y ) ) if the binop is legal for the truncated types. This is true for vector integer multiplication (especially vXi64), as well as ADD/AND/XOR/OR in cases where we only need to truncate one of the inputs at runtime (e.g. a duplicated input or a one-use constant we can fold). Further work could be done here - scalar cases (especially i64) could often benefit (if we avoid partial registers etc.), other opcodes, and better analysis of when truncating the inputs reduces costs. I have considered implementing this for all targets within the DAGCombiner but wasn't sure we could devise a suitable cost model system that would give us the range we need. Differential Revision: https://reviews.llvm.org/D28219 llvm-svn: 290947	2017-01-04 08:05:42 +00:00
Craig Topper	d0aa53b9ae	[AVX-512] Add support for detecting 512-bit shuffles that contain a 128-bit subvector insertion from the lowest subvector of one of the sources. These are best handled with a vinsert32x4 or vinsert64x2 instruction. llvm-svn: 290946	2017-01-04 07:32:03 +00:00
Craig Topper	a3b9a4edd5	[AVX-512] Add more test cases for shuffles that should be handled with subvector insert instructions. llvm-svn: 290945	2017-01-04 07:31:59 +00:00
Craig Topper	9e065c5b5c	[AVX-512] Fix a typo in a couple case names to match their behavior. llvm-svn: 290944	2017-01-04 07:31:57 +00:00
Craig Topper	42e8e33ccd	[AVX-512] Add avx512dq to the vector-shuffle-512-v16.ll test command lines in preparation for a future change that needs these features. llvm-svn: 290943	2017-01-04 07:31:54 +00:00
Craig Topper	83115a809f	[AVX-512] Simplify code for creating 512-bit SHUF128 operations. We don't need two loops and we can safely assume and hardcode the size of the widened mask. llvm-svn: 290942	2017-01-04 07:31:51 +00:00
Peter Collingbourne	87dd2ab000	Support: Add YAML I/O support for custom mappings. This will be used to YAMLify parts of the module summary. Differential Revision: https://reviews.llvm.org/D28014 llvm-svn: 290935	2017-01-04 03:51:36 +00:00
Eric Christopher	46b6597296	On a 64-bit system, the DWARFDebugLine::Row struct is 32 bytes. Each field has the following byte offsets: 0-7: Address 8-11: Line 12-13: Column 14-15: File 16-19: Isa 20-23: Discriminator 24+: bit fields The packing is fine until the "Isa" field, which is an 8-bit int that occupies 4 bytes. We can instead move Discriminator into the 16-19 slot, and pack Isa into the 20-23 range along with the bit fields: 0-7: Address 8-11: Line 12-13: Column 14-15: File 16-19: Discriminator 20-23: Isa + bit fields This layout is only 24 bytes. This 25% reduction in size may seem small but a large binary can have line tables with thousands of rows stored in a vector. Patch by Simon Que! Differential Revision: https://reviews.llvm.org/D27961 llvm-svn: 290931	2017-01-04 02:34:29 +00:00
David Majnemer	b5e365c970	[InstCombine] Add a test for r290733 llvm-svn: 290929	2017-01-04 02:21:37 +00:00
David Majnemer	cb892e9066	[InstCombine] Move casts around shift operations It is possible to perform a left shift before zero extending if the shift would only shift out zeros. llvm-svn: 290928	2017-01-04 02:21:34 +00:00
David Majnemer	022d2a563b	[InstCombine] Combine adds across a zext We can perform the following: (add (zext (add nuw X, C1)), C2) -> (zext (add nuw X, C1+C2)) This is only possible if C2 is negative and C2 is greater than or equal to negative C1. llvm-svn: 290927	2017-01-04 02:21:31 +00:00
Eugene Zelenko	b2ca1b3f37	[Hexagon, TableGen] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 290925	2017-01-04 02:02:05 +00:00
Greg Clayton	a9ef7eec3d	Correct the parent testing to avoid the special case where a DIE has a depth of 1 This test was testing that we could correctly find the parent of a DIE, but it was actually just testing the special case where a DIE's depth was 1. This corrects that error by adding an extra level into the DWARF to ensure that we correctly get the parent by looking for the parent with a depth that is 1 less than the current depth. Differential Revision: https://reviews.llvm.org/D28261 llvm-svn: 290918	2017-01-04 00:10:50 +00:00
Teresa Johnson	5a8dba5bda	[ThinLTO] Import type as decl only when non-null Identifier As per post-commit review for r289993 (D27775), we can only safely import a type as a decl if it has an Identifier, as the Name alone is not enough to be unique across modules. llvm-svn: 290915	2017-01-03 23:19:29 +00:00
Zachary Turner	491fe5bec0	Fix the MSVC version check. I'm not sure what determines the minor version, but it appears that a fully updated, release version of VS2015 with Update 3 can go (at least) as low as 19.00.24213.1. Updating the compiler version check to account for this so we don't generate superfluous warnings. llvm-svn: 290914	2017-01-03 23:12:36 +00:00
Matt Arsenault	56ff4839ae	InstCombine: Fold fabs on select of constants llvm-svn: 290913	2017-01-03 22:40:34 +00:00
Sanjay Patel	f0d1e77373	[InstCombine] use 'match' to reduce code bloat; NFCI I wrote this patch before seeing the comment in: https://reviews.llvm.org/D27114 ...that suggests we should actually be canonicalizing the other way. So just in case we decide this is the right way, we might as well have a cleaner implementation. llvm-svn: 290912	2017-01-03 22:25:31 +00:00
Ahmed Bougacha	8a41319d8d	[CodeGen] Further simplify returned call operand logic. NFC. As Pete points out in r290905, CallSite lets us avoid duplicating this! llvm-svn: 290909	2017-01-03 21:42:43 +00:00
Lang Hames	b198e5585e	[ExecutionEngine] Fix compile errors in OProfileJITEventListener. Allows LLVM to build with LLVM_USE_OPROFILE=True. Patch by Mark Dewing. Thanks Mark! llvm-svn: 290908	2017-01-03 21:39:43 +00:00
Ahmed Bougacha	6aff744e7c	[CodeGen] Simplify logic that looks for returned call operands. NFC-ish. Use getReturnedArgOperand() instead of rolling our own. Note that it's equivalent because there can only be one 'returned' operand. The existing code was also incorrect: there already was awkward logic to ignore callee/EH blocks, but operands can now also be operand bundles, in which case we'll look for non-existent parameter attributes. Unfortunately, this isn't observable in-tree, as it only crashes when exercising the regular call lowering logic with operand bundles. Still, this is a nice small cleanup anyway. llvm-svn: 290905	2017-01-03 20:33:22 +00:00
Sanjay Patel	ada846aff0	[InstCombine] tighten checks for tests of assume -> metadata transform; NFC llvm-svn: 290903	2017-01-03 19:32:11 +00:00
Simon Pilgrim	1145989a71	[X86][SSE] Add extra truncated arithmetic tests for D28219 llvm-svn: 290902	2017-01-03 19:18:07 +00:00
Adrian Prantl	36daf63b2b	Add llvm-bcanalyzer support for new metadata node types. Also sort the existing list by value. llvm-svn: 290901	2017-01-03 19:17:49 +00:00
Xin Tong	883dd1b6c4	Enable disabled loopidiom test. Apparently we handle it now Summary: Enable disabled loopidiom test. Apparently we handle it now. Maybe due to improvements to AA. Reviewers: atrick, danielcdh, hfinkel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28171 llvm-svn: 290900	2017-01-03 19:08:05 +00:00
Kostya Serebryany	4986e819dc	[libFuzzer] disable -print_pcs by default (was enabled by mistake) llvm-svn: 290899	2017-01-03 18:51:28 +00:00
Michal Gorny	21c12044d2	[ADT] APFloatBase: Prevent collapsing semPPCDoubleDouble and semBogus Provide distinct contents for semBogus and semPPCDoubleDouble in order to prevent compilers from collapsing them to a single memory address, since we heavily rely on every semantic having a distinct address. This happens if insecure optimization collapsing identical values is enabled. As a result, APFloats of semBogus are indistinguishable from semPPCDoubleDouble -- and whenever the move constructor is used, the old value begins being incorrectly recognized as a semPPCDoubleDouble. Since the values in semPPCDoubleDouble are not used anywhere, we can easily solve this issue by altering the value of one of the fields, thereby ensuring that the collapse cannot occur. Differential Revision: https://reviews.llvm.org/D28112 llvm-svn: 290896	2017-01-03 16:33:50 +00:00
Craig Topper	48d232d3e7	[X86] Move 128-bit shuffle mask widening check into lowerV2X128VectorShuffle to reduce code duplication. Use the now available widened mask to simplify some code inside lowerV2X128VectorShuffle. llvm-svn: 290872	2017-01-03 07:36:41 +00:00
Craig Topper	785e58fdc9	[AVX-512] Simplify the code added in r290870 to recognize 256-bit subvector inserts and avoid calling isShuffleEquivalent on a widened mask. llvm-svn: 290871	2017-01-03 07:36:39 +00:00
Craig Topper	9496e3f916	[AVX-512] Teach shuffle lowering to use vinsert instructions for shuffles corresponding to 256-bit subvector inserts. llvm-svn: 290870	2017-01-03 07:00:40 +00:00
Craig Topper	fa875a1d3d	[AVX-512] Teach EVEX to VEX conversion pass to handle VINSERT and VEXTRACT instructions. llvm-svn: 290869	2017-01-03 05:46:18 +00:00
Craig Topper	15d116ab41	[AVX-512] Re-generate tests that were updated for r290663 without using update_llc_test_checks.py so duplicate check lines weren't merged. llvm-svn: 290868	2017-01-03 05:46:10 +00:00
Craig Topper	be9ef55152	[X86] Remove trailing whitespace and an unnecessary line wrap. NFC llvm-svn: 290867	2017-01-03 05:46:06 +00:00
Craig Topper	06bae884bd	[X86] Fix header comment. NFC llvm-svn: 290866	2017-01-03 05:46:05 +00:00
Craig Topper	c849172105	[AVX-512] Add support for pushing bitcasts through INSERT_SUBVEC in order to select a masked operation. llvm-svn: 290865	2017-01-03 05:46:02 +00:00
Craig Topper	0cda8bbf74	[AVX-512] Remove vinsert intrinsics and autoupgrade to native shufflevectors. There are some codegen problems here that I'll try to fix in future commits. llvm-svn: 290864	2017-01-03 05:45:57 +00:00
Craig Topper	4d47c6ae57	[AVX-512] Remove vextract intrinsics and autoupgrade to native shufflevectors. This unfortunately generates some really terrible code without VLX support due to v2i1 and v4i1 not being legal. Hopefully we can improve that in future patches. llvm-svn: 290863	2017-01-03 05:45:46 +00:00
Matt Arsenault	b264c94963	InstCombine: Add fma with constant transforms DAGCombine already does these. llvm-svn: 290860	2017-01-03 04:32:35 +00:00
Matt Arsenault	1cc294c85d	InstCombine: Add fma + fabs/fneg transforms fma (fneg x), (fneg y), z -> fma x, y, z fma (fabs x), (fabs x), z -> fma x, x, z llvm-svn: 290859	2017-01-03 04:32:31 +00:00
Dean Michael Berris	f7e7b938ea	[XRay] Merge instrumentation point table emission code into AsmPrinter. Summary: No need to have this per-architecture. While there, unify 32-bit ARM's behaviour with what changed elsewhere and start function names lowercase as per the coding standards. Individual entry emission code goes to the entry's own class. Fully tested on amd64, cross-builds on both ARMs and PowerPC. Reviewers: dberris Subscribers: aemerson, llvm-commits Differential Revision: https://reviews.llvm.org/D28209 llvm-svn: 290858	2017-01-03 04:30:21 +00:00
Sanjay Patel	1c9867d009	[EarlyCSE] less else, more auto; NFC llvm-svn: 290848	2017-01-03 00:16:24 +00:00
Sanjay Patel	b38ad88e9f	[InstCombine] use combineMetadataForCSE instead of copying it; NFCI llvm-svn: 290844	2017-01-02 23:25:28 +00:00
Chris Bieneman	e205d766f0	[CMake] Set HAVE_${runtime} before including any subdirectories This should allow us to avoid most order dependence in the runtime library configurations. llvm-svn: 290834	2017-01-02 20:33:33 +00:00
Xin Tong	2940231ff0	Make sure total loop body weight is preserved in loop peeling Summary: Regardless of how the loop body weight is distributed, we should preserve the total loop body weight, i.e. the same weight should reach the body of the loop or its duplicates in the peeled and unpeeled cases. Reviewers: mkuper, davidxl, anemet Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28179 llvm-svn: 290833	2017-01-02 20:27:23 +00:00
Michal Gorny	f423390156	[cmake] Normalize LLVM_ENABLE_DIA_SDK to fix Windows tests Attempts to fix Windows build breakage caused by r290818. llvm-svn: 290832	2017-01-02 20:22:45 +00:00
Daniel Berlin	aa0ec1e992	NewGVN: Add a test case for equivalent phis. llvm-svn: 290830	2017-01-02 19:55:13 +00:00
Daniel Berlin	43a5f998df	NewGVN: Add forgotten testcase for PR 31483 llvm-svn: 290829	2017-01-02 19:49:20 +00:00
Daniel Berlin	de43ef9601	NewGVN: Clean up after removing possibility of null expressions. llvm-svn: 290828	2017-01-02 19:49:17 +00:00
Sanjay Patel	65d533ca42	fix typo; NFC llvm-svn: 290827	2017-01-02 19:05:11 +00:00
Sanjay Patel	4382997a13	[ValueTracking] remove stale comments; NFC The checks were improved with: https://reviews.llvm.org/rL290194 llvm-svn: 290826	2017-01-02 19:04:07 +00:00
Davide Italiano	67ada75d84	[NewGVN] Fold single-use variable inside the assertion. It placates some bots which complain because they compile the assertion out and think the variable is unused. llvm-svn: 290825	2017-01-02 19:03:16 +00:00
Davide Italiano	841261624d	[NewGVN] Restore old code to placate buildbots. Apparently my suggestion of using ternary doesn't really work as clang complains about incompatible types on LHS and RHS. Some GCC versions happen to accept the code but clang behaviour is correct here. llvm-svn: 290822	2017-01-02 18:41:34 +00:00
Daniel Berlin	25f05b0ab7	NewGVN: Fix some formatting and comment issues llvm-svn: 290820	2017-01-02 18:22:38 +00:00
Michal Gorny	89b6f16b3e	[cmake] Add LLVM_ENABLE_DIA_SDK option, and expose it in LLVMConfig Add an explicit LLVM_ENABLE_DIA_SDK option to control building support for DIA SDK-based debugging. Control its value to match whether DIA SDK support was found and expose it in LLVMConfig (alike LLVM_ENABLE_ZLIB). Its value is needed for LLDB to determine whether to run tests requiring DIA support. Currently it is obtained from llvm/Config/config.h; however, this file is not available for standalone builds. Following this change, LLDB will be modified to use the value from LLVMConfig. Differential Revision: https://reviews.llvm.org/D26255 llvm-svn: 290818	2017-01-02 18:19:35 +00:00
Joerg Sonnenberger	7b83732a40	Emit .cfi_sections before the first .cfi_startproc GNU as rejects input where .cfi_sections is used after .cfi_startproc, if the new section differs from the old. Adjust our output to always emit .cfi_sections before the first .cfi_startproc to minimize necessary code. Differential Revision: https://reviews.llvm.org/D28011 llvm-svn: 290817	2017-01-02 18:05:27 +00:00
Daniel Berlin	02c6b176e7	NewGVN: Add UnknownExpression and create them for things we can't symbolize. Kill fragile machinery for handling null expressions. Summary: This avoids the very fragile code for null expressions. We could also use a denseset that tracks which things have null expressions instead, but that seems pretty fragile and premature optimization. This resolves a number of infinite loop cases, test reductions coming. Reviewers: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28193 llvm-svn: 290816	2017-01-02 18:00:53 +00:00
Daniel Berlin	589cecc6e9	NewGVN: Fix PR31480, PR31483, PR31499, by rewriting how memory congruence handling works. Summary: Previously, we tried to fix up the equivalences during symbolic evaluation. This does not work. Now, we change the equivalences during congruence finding, where it belongs. We also initialize the equivalence table to give a maximal answer. Reviewers: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28192 llvm-svn: 290815	2017-01-02 18:00:46 +00:00
Davide Italiano	b672537cbf	[PMBuilder] Remove RunFloat2Int cl::opt. The pass has been on by default for a long time without problems. llvm-svn: 290814	2017-01-02 17:49:18 +00:00
Elena Demikhovsky	d96200d60a	Fixed shuffle-reverse cost on AVX-512. (This change was approved in https://reviews.llvm.org/D28118, but Simon asked to submit it separately). llvm-svn: 290812	2017-01-02 11:44:10 +00:00
Elena Demikhovsky	21706cbd24	AVX-512 Loop Vectorizer: Cost calculation for interleave load/store patterns. X86 target does not provide any target specific cost calculation for interleave patterns. It uses the common target-independent calculation, which gives very high numbers. As a result, the scalar version is chosen in many cases. The situation on AVX-512 is even worse, since we have 3-src shuffles that significantly reduce the cost. In this patch I calculate the cost on AVX-512. This makes it possible to compare the interleave pattern with gather/scatter and choose the better solution (PR31426). * Shuffle-broadcast cost will be changed in Simon's upcoming patch. Differential Revision: https://reviews.llvm.org/D28118 llvm-svn: 290810	2017-01-02 10:37:52 +00:00
Keno Fischer	f7d84ee6ff	Reapply "[CodeGen] Fix invalid DWARF info on Win64" This reapplies rL289013 (reverted in rL289014) with the fixes identified in D21731. Should hopefully pass the buildbots this time. llvm-svn: 290809	2017-01-02 03:00:19 +00:00
Sanjay Patel	0e3ae439cf	[InstCombine] add explanatory comment to test; NFC The test was added at r290797, and a patch to enable the transform is proposed in D28204. llvm-svn: 290798	2017-01-01 18:20:49 +00:00
Sanjay Patel	07537c2b6e	[InstCombine] add test to show potential nonnull attribute propagation; NFC This will change with the current draft of: https://reviews.llvm.org/D28204 llvm-svn: 290797	2017-01-01 17:18:00 +00:00
Florian Hahn	f872d230ad	[selectiondag] Check PromotedFloats map during expensive checks. Summary: `PromotedFloats` needs to be checked in `DAGTypeLegalizer::PerformExpensiveChecks`. This patch fixes a few type legalization failures with expensive checks for ARM fp16 tests. Reviewers: baldrick, bogner, arsenm Subscribers: arsenm, aemerson, llvm-commits Differential Revision: https://reviews.llvm.org/D28187 llvm-svn: 290796	2017-01-01 13:58:27 +00:00
Sanjoy Das	3bb2dbd665	Fix an issue with isGuaranteedToTransferExecutionToSuccessor I'm not sure if this was intentional, but today isGuaranteedToTransferExecutionToSuccessor returns true for readonly and argmemonly calls that may throw. This commit changes the function to not implicitly infer nounwind this way. Even if we eventually specify readonly calls as not throwing, isGuaranteedToTransferExecutionToSuccessor is not the best place to infer that. We should instead teach FunctionAttrs or some other such pass to tag readonly functions / calls as nounwind instead. llvm-svn: 290794	2016-12-31 22:12:34 +00:00
Sanjoy Das	0945530d4d	Avoid const_cast; NFC llvm-svn: 290793	2016-12-31 22:12:31 +00:00
Sanjay Patel	5865d12e9f	[ValueTracking] add tests for known-nonnull-at; NFC llvm-svn: 290790	2016-12-31 19:23:26 +00:00
Sanjay Patel	aea60846c4	[Inliner] remove unnecessary null checks from AddAlignmentAssumptions(); NFCI We bail out on the 1st line if the assumption cache is not set, so there's no need to check it after that. llvm-svn: 290787	2016-12-31 17:54:05 +00:00
Sanjay Patel	7fd779f09f	[ValueTracking] make dominator tree requirement explicit for isKnownNonNullFromDominatingCondition(); NFCI I don't think this hole is currently exposed, but I crashed regression tests for jump-threading and loop-vectorize after I added calls to isKnownNonNullAt() in InstSimplify as part of trying to solve PR28430: https://llvm.org/bugs/show_bug.cgi?id=28430 That's because they call into value tracking with a context instruction, but no other parts of the query structure filled in. For more background, see the discussion in: https://reviews.llvm.org/D27855 llvm-svn: 290786	2016-12-31 17:37:01 +00:00
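
Commit 46b6597296 above describes shrinking DWARFDebugLine::Row from 32 to 24 bytes by reordering fields. The sketch below illustrates that reasoning with two stand-in structs; `RowBefore`, `RowAfter`, and `bytesSavedPerRow` are hypothetical names, the field types are assumptions inferred from the byte offsets in the commit message, and the sizes hold on typical 64-bit ABIs rather than being guaranteed by the C++ standard.

```cpp
#include <cstddef>
#include <cstdint>

// Assumed layout before the change: the 8-bit Isa field is followed by a
// 32-bit field, so the compiler pads Isa out to 4 bytes.
struct RowBefore {
  uint64_t Address;       // bytes 0-7
  uint32_t Line;          // bytes 8-11
  uint16_t Column;        // bytes 12-13
  uint16_t File;          // bytes 14-15
  uint8_t Isa;            // byte 16, then 3 bytes of padding
  uint32_t Discriminator; // bytes 20-23
  uint8_t IsStmt : 1;     // bit fields start at byte 24
  uint8_t BasicBlock : 1;
  uint8_t EndSequence : 1;
  uint8_t PrologueEnd : 1;
  uint8_t EpilogueBegin : 1;
};

// Assumed layout after the change: Discriminator moves up, and Isa shares
// the final 4-byte slot with the bit fields.
struct RowAfter {
  uint64_t Address;       // bytes 0-7
  uint32_t Line;          // bytes 8-11
  uint16_t Column;        // bytes 12-13
  uint16_t File;          // bytes 14-15
  uint32_t Discriminator; // bytes 16-19
  uint8_t Isa;            // byte 20
  uint8_t IsStmt : 1;     // bit fields packed into the same 20-23 slot
  uint8_t BasicBlock : 1;
  uint8_t EndSequence : 1;
  uint8_t PrologueEnd : 1;
  uint8_t EpilogueBegin : 1;
};

// Bytes saved per row by the reordering on the current target.
constexpr std::size_t bytesSavedPerRow() {
  return sizeof(RowBefore) - sizeof(RowAfter);
}
```

Comparing `sizeof(RowBefore)` with `sizeof(RowAfter)` on the target of interest shows whether the reordering pays off there; the saving comes only from alignment padding, so no field loses range.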

142829 Commits
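
Commit 022d2a563b above states that (add (zext (add nuw X, C1)), C2) can become (zext (add nuw X, C1+C2)) when C2 is negative and C2 >= -C1. The sketch below is not LLVM code; it is a brute-force check of that identity for an assumed i8-to-i16 zero extension, using the hypothetical helper names `foldMatches` and `zextAddFoldHolds`.

```cpp
#include <cstdint>

// Compares the original and folded expressions for one (X, C1, C2) triple.
// X and C1 are treated as i8 values, C2 as an i16 constant.
bool foldMatches(int X, int C1, int C2) {
  // Original: (add (zext (add nuw X, C1)), C2), evaluated in 16 bits.
  uint8_t Inner = static_cast<uint8_t>(X + C1);
  uint16_t Before = static_cast<uint16_t>(static_cast<uint16_t>(Inner) +
                                          static_cast<uint16_t>(C2));
  // Folded: (zext (add nuw X, C1 + C2)), with the add done in 8 bits.
  uint8_t Folded = static_cast<uint8_t>(X + (C1 + C2));
  uint16_t After = Folded;
  return Before == After;
}

// Exhaustively checks every triple that satisfies the stated preconditions:
// the inner add has no unsigned wrap, C2 < 0, and C2 >= -C1.
bool zextAddFoldHolds() {
  for (int X = 0; X <= 255; ++X)
    for (int C1 = 0; X + C1 <= 255; ++C1) // nuw: X + C1 fits in 8 bits
      for (int C2 = -C1; C2 < 0; ++C2)    // negative and >= -C1
        if (!foldMatches(X, C1, C2))
          return false;
  return true;
}
```

Calling `foldMatches(0, 1, -2)` violates the C2 >= -C1 condition and the two sides disagree, which shows why the bound on C2 is needed: without it, C1+C2 goes negative and the narrow add wraps.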