llvm-project

Commit Graph

Author	SHA1	Message	Date
Sean Fertile	39770ca0a1	Revert "[LTO][ThinLTO] Use the linker resolutions to mark global values ..." Changes more tests then expected on one of the build bots. reverting to investigate. This reverts https://llvm.org/svn/llvm-project/llvm/trunk@317374 llvm-svn: 317395	2017-11-04 01:54:20 +00:00
Davide Italiano	c7c05ae4be	[CallSiteSplitting] clang-format my last commit. NFCI. Thanks to Rui for pointing out. llvm-svn: 317393	2017-11-04 00:44:01 +00:00
Davide Italiano	91b4790b33	[CallSiteSplitting] Silence GCC's -Wparentheses. NFCI. llvm-svn: 317385	2017-11-03 23:03:38 +00:00
Craig Topper	d21a53f246	[X86] Give unary PERMI priority over SHUF128 in lowerV8I64VectorShuffle to make it possible to fold a load. llvm-svn: 317382	2017-11-03 22:48:13 +00:00
David Blaikie	1be62f0327	Move TargetFrameLowering.h to CodeGen where it's implemented This header already includes a CodeGen header and is implemented in lib/CodeGen, so move the header there to match. This fixes a link error with modular codegeneration builds - where a header and its implementation are circularly dependent and so need to be in the same library, not split between two like this. llvm-svn: 317379	2017-11-03 22:32:11 +00:00
Adrian Prantl	261ac8b23c	Invoke salvageDebugInfo from CodeGenPrepare's SinkCast() This preserves the debug info for the cast operation in the original location. rdar://problem/33460652 Reapplied r317340 with the test moved into an ARM-specific directory. llvm-svn: 317375	2017-11-03 21:55:03 +00:00
Sean Fertile	36528c2a9b	[LTO][ThinLTO] Use the linker resolutions to mark global values as dso_local. Now that we have a way to mark GlobalValues as local we can use the symbol resolutions that the linker plugin provides as part of lto/thinlto link step to refine the compilers view on what symbols will end up being local. Differential Revision: https://reviews.llvm.org/D35702 llvm-svn: 317374	2017-11-03 21:45:55 +00:00
Kevin Enderby	3fc9188fa8	Fix a crash in llvm-objdump when printing a bad x86_64 relocation in a Mach-O file with a bad section number. rdar://35207539 llvm-svn: 317373	2017-11-03 21:32:44 +00:00
Peter Collingbourne	c2935db629	Revert r317046, "Object: Move some code from ELF.h into ELF.cpp." This change resulted in a measured 1.5-2% perf regression linking chrome. llvm-svn: 317371	2017-11-03 21:30:06 +00:00
Craig Topper	12463779d3	[SimplifyCFG] When merging conditional stores, don't count the store we're merging against the PHINodeFoldingThreshold Merging conditional stores tries to check to see if the code is if convertible after the store is moved. But the store hasn't been moved yet so its being counted against the threshold. The patch adds 1 to the threshold comparison to make sure we don't count the store. I've adjusted a test to use a lower threshold to ensure we still do that conversion with the lower threshold. Differential Revision: https://reviews.llvm.org/D39570 llvm-svn: 317368	2017-11-03 21:08:13 +00:00
David Blaikie	34eb96b03f	GCOV: Move GCOV from IR & Support into ProfileData to fix layering This class was split between libIR and libSupport, which breaks under modular code generation. Move it into the one library that uses it, ProfileData, to resolve this issue. llvm-svn: 317366	2017-11-03 20:57:10 +00:00
David Blaikie	998ff81f7c	llvm-objdump: Fix unused-lambda-capture warning by removing unused lambda capture llvm-svn: 317365	2017-11-03 20:57:09 +00:00
Mitch Phillips	c15bdf5598	[cfi-verify] Add blacklist parsing for result filtering. Adds blacklist parsing behaviour for filtering results into four categories: - Expected Protected: Things that are not in the blacklist and are protected. - Unexpected Protected: Things that are in the blacklist and are protected. - Expected Unprotected: Things that are in the blacklist and are unprotected. - Unexpected Unprotected: Things that are not in the blacklist and are unprotected. now can optionally be invoked with a second command line argument, which specifies the blacklist file that the binary was built with. Current statistics for chromium: Reviewers: vlad.tsyrklevich Subscribers: mgorny, llvm-commits, pcc, kcc Differential Revision: https://reviews.llvm.org/D39525 llvm-svn: 317364	2017-11-03 20:54:26 +00:00
Jun Bum Lim	0c99007db1	Recommit r317351 : Add CallSiteSplitting pass This recommit r317351 after fixing a buildbot failure. Original commit message: Summary: This change add a pass which tries to split a call-site to pass more constrained arguments if its argument is predicated in the control flow so that we can expose better context to the later passes (e.g, inliner, jump threading, or IPA-CP based function cloning, etc.). As of now we support two cases : 1) If a call site is dominated by an OR condition and if any of its arguments are predicated on this OR condition, try to split the condition with more constrained arguments. For example, in the code below, we try to split the call site since we can predicate the argument (ptr) based on the OR condition. Split from : if (!ptr \|\| c) callee(ptr); to : if (!ptr) callee(null ptr) // set the known constant value else if (c) callee(nonnull ptr) // set non-null attribute in the argument 2) We can also split a call-site based on constant incoming values of a PHI For example, from : BB0: %c = icmp eq i32 %i1, %i2 br i1 %c, label %BB2, label %BB1 BB1: br label %BB2 BB2: %p = phi i32 [ 0, %BB0 ], [ 1, %BB1 ] call void @bar(i32 %p) to BB0: %c = icmp eq i32 %i1, %i2 br i1 %c, label %BB2-split0, label %BB1 BB1: br label %BB2-split1 BB2-split0: call void @bar(i32 0) br label %BB2 BB2-split1: call void @bar(i32 1) br label %BB2 BB2: %p = phi i32 [ 0, %BB2-split0 ], [ 1, %BB2-split1 ] llvm-svn: 317362	2017-11-03 20:41:16 +00:00
David Blaikie	526f30b8aa	Modularize: Include some required headers DenseMaps require the definition of a type to be available when using a pointer to that type as a key to know how many bits are available for tombstone/etc. llvm-svn: 317360	2017-11-03 20:24:19 +00:00
Martin Storsjo	1a9593b251	[llvm-ar] Support an options string that start with a dash Some projects call $AR like "$AR -crs output input1 input2". Differential Revision: https://reviews.llvm.org/D39538 llvm-svn: 317358	2017-11-03 20:09:10 +00:00
Aaron Ballman	639ea374d6	Correcting some CRLFs that snuck in with my previous commit; NFC. llvm-svn: 317357	2017-11-03 20:05:51 +00:00
Aaron Ballman	ecf0e95267	Add llvm::for_each as a range-based extensions to <algorithm> and make use of it in some cases where it is a more clear alternative to std::for_each. llvm-svn: 317356	2017-11-03 20:01:25 +00:00
Mitch Phillips	189ebb6976	[cfi-verify] Add an interesting unit test where undef search length changes result. Add an interesting unit test, found by changing --search-length-undef from the default. Program handles it correctly but good for ensuring correctness on further changes :) Reviewers: pcc Subscribers: mgorny, llvm-commits, kcc, vlad.tsyrklevich Differential Revision: https://reviews.llvm.org/D38658 llvm-svn: 317355	2017-11-03 20:00:05 +00:00
Craig Topper	8b6600363a	[X86] Promote athlon, athlon-xp, k8, and k8-sse3 to types instead of subtypes in getHostCPUName. NFCI This removes the athlon type and simplifies the string decoding. We only really need these type/subtype breaks where we need to match libgcc/compiler-rt and these CPUs aren't part of that. I'm looking into moving some of this information to a .def file to share with clang's __builtin_cpu_is handling. And while these CPUs aren't part of that the less lines I have to deal with in the .def file the better. llvm-svn: 317354	2017-11-03 19:37:41 +00:00
Jun Bum Lim	0eb1c2d63a	Revert "Add CallSiteSplitting pass" Revert due to Buildbot failure. This reverts commit r317351. llvm-svn: 317353	2017-11-03 19:17:11 +00:00
Jake Ehrlich	c3a89eefd6	Reland "Add support for writing 64-bit symbol tables for archives when offsets become too large for 32-bit" Tests were failing because some bots were running out of address space and memory. Additionally the test was very slow. These issues were solved by changing the test to take advantage of sparse filse and restricting the test to run only on 64-bit systems. This should fix https://bugs.llvm.org//show_bug.cgi?id=34189 This change makes it so that if writing a K_GNU style archive, you need to output a > 32-bit offset it should output in K_GNU64 style instead. Differential Revision: https://reviews.llvm.org/D36812 llvm-svn: 317352	2017-11-03 19:15:06 +00:00
Jun Bum Lim	2a58933519	Add CallSiteSplitting pass Summary: This change add a pass which tries to split a call-site to pass more constrained arguments if its argument is predicated in the control flow so that we can expose better context to the later passes (e.g, inliner, jump threading, or IPA-CP based function cloning, etc.). As of now we support two cases : 1) If a call site is dominated by an OR condition and if any of its arguments are predicated on this OR condition, try to split the condition with more constrained arguments. For example, in the code below, we try to split the call site since we can predicate the argument (ptr) based on the OR condition. Split from : if (!ptr \|\| c) callee(ptr); to : if (!ptr) callee(null ptr) // set the known constant value else if (c) callee(nonnull ptr) // set non-null attribute in the argument 2) We can also split a call-site based on constant incoming values of a PHI For example, from : BB0: %c = icmp eq i32 %i1, %i2 br i1 %c, label %BB2, label %BB1 BB1: br label %BB2 BB2: %p = phi i32 [ 0, %BB0 ], [ 1, %BB1 ] call void @bar(i32 %p) to BB0: %c = icmp eq i32 %i1, %i2 br i1 %c, label %BB2-split0, label %BB1 BB1: br label %BB2-split1 BB2-split0: call void @bar(i32 0) br label %BB2 BB2-split1: call void @bar(i32 1) br label %BB2 BB2: %p = phi i32 [ 0, %BB2-split0 ], [ 1, %BB2-split1 ] Reviewers: davidxl, huntergr, chandlerc, mcrosier, eraman, davide Reviewed By: davidxl Subscribers: sdesmalen, ashutosh.nema, fhahn, mssimpso, aemerson, mgorny, mehdi_amini, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D39137 llvm-svn: 317351	2017-11-03 19:01:57 +00:00
Jake Ehrlich	5de70d996c	[llvm-objcopy] Add support for dwarf fission This change adds support for dwarf fission. Differential Revision: https://reviews.llvm.org/D39207 llvm-svn: 317350	2017-11-03 18:58:41 +00:00
Evandro Menezes	9dcf099944	[AArch64] Fix the number of iterations for the Newton series The number of iterations was incorrectly determined for DP FP vector types and the tests were insufficient to flag this issue. Differential revision: https://reviews.llvm.org/D39507 llvm-svn: 317349	2017-11-03 18:56:36 +00:00
Evgeny Stupachenko	d699de2b50	The patch fixes PR35131 Summary: Fix a misprint which led to false CTLZ recognition. Reviewers: craig.topper Differential Revision: https://reviews.llvm.org/D39585 From: Evgeny Stupachenko <evstupac@gmail.com> llvm-svn: 317348	2017-11-03 18:50:03 +00:00
Adrian Prantl	8fe9fb0ae5	Revert "Invoke salvageDebugInfo from CodeGenPrepare's SinkCast()" This reverts commit 317342 while investigating bot breakage. llvm-svn: 317345	2017-11-03 18:26:36 +00:00
Craig Topper	666e23b513	[CodeGen] Remove unnecessary semicolons to fix a warning. NFC llvm-svn: 317342	2017-11-03 18:02:46 +00:00
Craig Topper	741e7e6a71	[X86] Initialize Type and Subtype in getHostCPUName to 0. llvm-svn: 317341	2017-11-03 18:02:44 +00:00
Adrian Prantl	58e9a0bb16	Invoke salvageDebugInfo from CodeGenPrepare's SinkCast() This preserves the debug info for the cast operation in the original location. rdar://problem/33460652 llvm-svn: 317340	2017-11-03 18:00:02 +00:00
Jun Bum Lim	f5fb3d745d	[LICM] sink through non-trivially replicable PHI Summary: The current LICM allows sinking an instruction only when it is exposed to exit blocks through a trivially replacable PHI of which all incoming values are the same instruction. This change enhance LICM to sink a sinkable instruction through non-trivially replacable PHIs by spliting predecessors of loop exits. Reviewers: hfinkel, majnemer, davidxl, bmakam, mcrosier, danielcdh, efriedma, jtony Reviewed By: efriedma Subscribers: nemanjai, dberlin, llvm-commits Differential Revision: https://reviews.llvm.org/D37163 llvm-svn: 317335	2017-11-03 16:24:53 +00:00
Alexey Bataev	c455f9af16	[SLP] Test for PR23510, NFC. llvm-svn: 317334	2017-11-03 16:17:13 +00:00
Simon Dardis	d3b9f61c52	[mips] Match 'ins' and its' variants with C++ code Change the ISel matching of 'ins', 'dins[mu]' from tablegen code to C++ code. This resolves an issue where ISel would select 'dins' instead of 'dinsm' when the instructions size and position were individually in range but their sum was out of range according to the ISA specification. Reviewers: atanasyan Differential Revision: https://reviews.llvm.org/D39117 llvm-svn: 317331	2017-11-03 15:35:13 +00:00
Andrew V. Tischenko	0916c6b654	Fix for Bug 34475 - LOCK/REP/REPNE prefixes emitted as instruction on their own. Differential Revision: https://reviews.llvm.org/D39546 llvm-svn: 317330	2017-11-03 15:25:13 +00:00
Anna Thomas	6879721453	[LoopPredication] NFC: Refactored code to separate out functions being reused Summary: Refactored the code to separate out common functions that are being reused. This is to reduce the changes for changes coming up wrt loop predication with reverse loops. This refactoring is what we have in our downstream code. llvm-svn: 317324	2017-11-03 14:25:39 +00:00
Mikael Holmen	6018104d5e	[ADCE] Use MapVector for BlockInfo to make iteration order deterministic Summary: Also added a reserve() method to MapVector since we want to use that from ADCE. DenseMap does not provide deterministic iteration order so with that we will handle the members of BlockInfo in random order, eventually leading to random order of the blocks in the predecessor lists. Without this change, I get the same predecessor order in about 90% of the time when I compile a certain reproducer and in 10% I get a different one. No idea how to make a proper test case for this. Reviewers: kuhar, david2050 Reviewed By: kuhar Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39593 llvm-svn: 317323	2017-11-03 14:15:08 +00:00
Clement Courbet	063bed9baf	re-land [ExpandMemCmp] Split ExpandMemCmp from CodeGen into its own pass." Fix undefined references: ExpandMemCmp belongs to CodeGen/, not Scalar/. llvm-svn: 317318	2017-11-03 12:12:27 +00:00
Simon Pilgrim	ae1f013495	[X86][SSE] Add PACKUS support to combineVectorTruncation Similar to the existing code to lower to PACKSS, we can use PACKUS if the input vector's leading zero bits extend all the way to the packed/truncated value. We have to account for pre-SSE41 targets not supporting PACKUSDW llvm-svn: 317315	2017-11-03 11:33:48 +00:00
Florian Hahn	41e32bfd68	[PartialInliner] Skip call sites where inlining fails. Summary: InlineFunction can fail, for example when trying to inline vararg fuctions. In those cases, we do not want to bump partial inlining counters or set AnyInlined to true, because this could leave an unused function hanging around. Reviewers: davidxl, davide, gyiu Reviewed By: davide Subscribers: llvm-commits, eraman Differential Revision: https://reviews.llvm.org/D39581 llvm-svn: 317314	2017-11-03 11:29:00 +00:00
Diana Picus	d1b618177a	[globalisel][tablegen] Skip src child predicates The GlobalISel TableGen backend didn't check for predicates on the source children. This caused it to generate code for ARM patterns such as SMLABB or similar, but without properly checking for the sext_16_node part of the operands. This in turn meant that we would select SMLABB instead of MLA for simple sequences such as s32 + s32 * s32, which is wrong (we want a MLA on the full operands, not just their bottom 16 bits). This patch forces TableGen to skip patterns with predicates on the src children, so it doesn't generate code for SMLABB and other similar ARM instructions at all anymore. AArch64 and X86 are not affected. Differential Revision: https://reviews.llvm.org/D39554 llvm-svn: 317313	2017-11-03 10:30:19 +00:00
Diana Picus	acf4bf21ab	[ARM GlobalISel] Move the check for Thumb higher up We're currently bailing out for Thumb targets while lowering formal parameters, but there used to be some other checks before it, which could've caused some functions (e.g. those without formal parameters) to sneak through unnoticed. llvm-svn: 317312	2017-11-03 10:30:12 +00:00
Ivan A. Kosarev	4b77f463d0	[Analysis] Refine matching and merging of TBAA tags This patch combines the code that matches and merges TBAA access tags. The aim is to simplify future changes and making sure that these operations produce consistent results. Differential Revision: https://reviews.llvm.org/D39463 llvm-svn: 317311	2017-11-03 10:26:25 +00:00
Francis Visoiu Mistrih	d96395fc92	[PEI] Simplify handling of targets with no phys regs. NFC Make doSpillCalleeSavedRegs a member function, instead of passing most of the members of PEI as arguments. Differential Review: https://reviews.llvm.org/D35642 llvm-svn: 317309	2017-11-03 09:46:36 +00:00
Martin Storsjo	9befcd7d8d	[AArch64] Use dwarf exception handling on MinGW Ideally we should probably produce WinEH here as well, but until then, we can use dwarf exceptions, without any further changes required in clang, libunwind or libcxxabi. Differential Revision: https://reviews.llvm.org/D39535 llvm-svn: 317304	2017-11-03 07:33:20 +00:00
Max Kazantsev	f5e72b54ac	[NFC] Get rid of hard-coded value ID in test llvm-svn: 317303	2017-11-03 07:30:45 +00:00
Martin Storsjo	c632086ab8	[llvm-nm] Don't error out on multiple occurrances of the -g/--external-only flag GNU binutils nm doesn't error out on this, and some projects' build systems can end up doing that in some cases. Allowing that seems like a better target than trying to avoid user projects passing multiple -g parameters to $NM. Differential Revision: https://reviews.llvm.org/D39539 llvm-svn: 317301	2017-11-03 07:18:21 +00:00
Martin Storsjo	1401524e20	[llvm-nm] Print 'I' for import table data in COFF The character gets uppercased into 'I' when it's a global symbol. In GNU binutils, nm prints 'I' for symbols classified by bfd_is_ind_section - which probably isn't exactly/only import tables. When building for win32, (some incarnations of?) libtool has got rules that try to inspect linked libraries, and in order to be sure that it is linking to a DLL import library as opposed to a static library, it expects to find the string " I " in the output of $NM when run on such an import library. GNU binutils nm also flags all of the .idata$X chunks as 'i' (while this patch only makes it set on .idata$2 and .idata$6) and also flags __imp__function as 'I'. Differential Revision: https://reviews.llvm.org/D39540 llvm-svn: 317300	2017-11-03 07:18:14 +00:00
Craig Topper	333897ec31	[X86] Remove PALIGNR/VALIGN handling from combineBitcastForMaskedOp and move to isel patterns instead. Prefer 128-bit VALIGND/VALIGNQ over PALIGNR during lowering when possible. llvm-svn: 317299	2017-11-03 06:48:02 +00:00
Craig Topper	2fda36a18e	[TableGen] Add an extra blank line to DAGISel output file to separate functions. llvm-svn: 317298	2017-11-03 05:19:34 +00:00
Vedant Kumar	9196ed1be1	[LSR] Clarify a comment. NFC. llvm-svn: 317295	2017-11-03 01:01:28 +00:00
Sriraman Tallam	7cdb10f1aa	Avoid PLT for external calls when attribute nonlazybind is used. Differential Revision: https://reviews.llvm.org/D39065 llvm-svn: 317292	2017-11-03 00:10:19 +00:00
Jake Ehrlich	19bbd8a92d	Reland "Add feature to determine if host architecture is 64-bit in llvm-lit" A member of config was removed in this patch which resulted in errors I didn't expect. Removing config.host_arch will take more work some I'm readding that field. Differential Revision: https://reviews.llvm.org/D39465 llvm-svn: 317289	2017-11-02 23:45:51 +00:00
Vedant Kumar	fb15180054	[Verifier] Remove the -verify-debug-info cl::opt This cl::opt has been dead for a while. It's no longer possible to run the verifier without also verifying debug info. llvm-svn: 317288	2017-11-02 23:44:20 +00:00
Quentin Colombet	b6afac1f9a	[AArch64][RegisterBankInfo] Add mapping for G_FPEXT. This fixes http://llvm.org/PR32560. We were missing a description for half floating point type and as a result were using the FPR 32 mapping. Because of the size mismatch the generic code was complaining that the default mapping is not appropriate. Fix the mapping description so that the default mapping can be properly applied. llvm-svn: 317287	2017-11-02 23:38:19 +00:00
Quentin Colombet	619d649878	[AArch64][RegisterBankInfo] Add FPR16 support in value mapping. NFC. llvm-svn: 317286	2017-11-02 23:38:13 +00:00
Puyan Lotfi	a521c4ac55	mir-canon: First commit. mir-canon (MIRCanonicalizerPass) is a pass designed to reorder instructions and rename operands so that two similar programs will diff more cleanly after being run through mir-canon than they would otherwise. This project is still a work in progress and there are ideas still being discussed for improving diff quality. M include/llvm/InitializePasses.h M lib/CodeGen/CMakeLists.txt M lib/CodeGen/CodeGen.cpp A lib/CodeGen/MIRCanonicalizerPass.cpp llvm-svn: 317285	2017-11-02 23:37:32 +00:00
Jake Ehrlich	13153eef56	[llvm-objcopy] Fix bug in how segment alignment was being handled Just aligning segment offsets to segment alignment is incorrect and also wastes more space than is needed. The requirement is that p_offset == p_addr modulo p_align not that p_offset == 0 modulo p_align. Generally speaking we've been using p_addr == 0 modulo p_align. In fact yaml2obj can't even produce a valid situation which causes llvm-objcopy to produce incorrect results because alignment and offset were both inherited from the sections the program header covers. This change fixes this bad behavior in llvm-objcopy. Differential Revision: https://reviews.llvm.org/D39132 llvm-svn: 317284	2017-11-02 23:24:04 +00:00
Craig Topper	086c04c8a7	[X86] Give AVX512VL instructions priority over their AVX equivalents. I thought we had gotten all these priority bugs worked out, but I guess not. llvm-svn: 317283	2017-11-02 23:23:37 +00:00
Adrian Prantl	fbb6fbf709	IndVarSimplify: preserve debug information attached to widened PHI nodes. This fixes PR35015. https://bugs.llvm.org/show_bug.cgi?id=35015 Differential Revision: https://reviews.llvm.org/D39345 llvm-svn: 317282	2017-11-02 23:17:06 +00:00
Jake Ehrlich	6fe84be9a3	Add feature to determine if host architecture is 64-bit in llvm-lit I have a test that I'd like to add to llvm that demands using more than 32-bits worth of address space. This test can't be run on 32-bit systems because they don't have enough address space. The host triple should be used to determine this instead of config.host_arch because on Debian systems config.host_arch is not correct. This change adds the "host-arch-is-64bit" feature to allow tests to restrict themselves to the 64-bit case. Differential Revision: https://reviews.llvm.org/D39465 llvm-svn: 317281	2017-11-02 23:14:55 +00:00
Konstantin Zhuravlyov	275a4f76c4	AMDGPU: Fix warning discovered by r317266 [-Wunused-private-field] llvm-svn: 317280	2017-11-02 22:35:22 +00:00
Hiroshi Yamauchi	dce9def3dd	Irreducible loop metadata for more accurate block frequency under PGO. Summary: Currently the block frequency analysis is an approximation for irreducible loops. The new irreducible loop metadata is used to annotate the irreducible loop headers with their header weights based on the PGO profile (currently this is approximated to be evenly weighted) and to help improve the accuracy of the block frequency analysis for irreducible loops. This patch is a basic support for this. Reviewers: davidxl Reviewed By: davidxl Subscribers: mehdi_amini, llvm-commits, eraman Differential Revision: https://reviews.llvm.org/D39028 llvm-svn: 317278	2017-11-02 22:26:51 +00:00
Krzysztof Parzyszek	058014fca5	[Hexagon] Prefer L2_loadrub_io over L4_loadrub_rr If the offset is an immediate, avoid putting it in a register to get Rs+Rt<<#0. llvm-svn: 317275	2017-11-02 21:56:59 +00:00
Shoaib Meenai	08bb38f7e7	[tools] Add option to install binutils symlinks The LLVM tools can be used as a replacement for binutils, in which case it's convenient to create symlinks with the binutils names. Add support for these symlinks in the build system. As with any other llvm tool symlinks, the user can limit the installed symlinks by only adding the desired ones to `LLVM_TOOLCHAIN_TOOLS`. Differential Revision: https://reviews.llvm.org/D39530 llvm-svn: 317272	2017-11-02 21:43:32 +00:00
Adrian Prantl	07eaa9b46e	Clean up comments in include/llvm-c/DebugInfo.h Patch by Harlan Haskins! Differential Revision: https://reviews.llvm.org/D39568 llvm-svn: 317271	2017-11-02 21:35:37 +00:00
Anna Thomas	1d02b13eb7	[LoopPredication] Enable predication when latchCheckIV is wider than rangeCheck Summary: This patch allows us to predicate range checks that have a type narrower than the latch check type. We leverage SCEV analysis to identify a truncate for the latchLimit and latchStart. There is also safety checks in place which requires the start and limit to be known at compile time. We require this to make sure that the SCEV truncate expr for the IV corresponding to the latch does not cause us to lose information about the IV range. Added tests show the loop predication over range checks that are of various types and are narrower than the latch type. This enhancement has been in our downstream tree for a while. Reviewers: apilipenko, sanjoy, mkazantsev Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39500 llvm-svn: 317269	2017-11-02 21:21:02 +00:00
Adrian Prantl	f2593d028f	Add missing header guards. llvm-svn: 317267	2017-11-02 20:58:58 +00:00
Konstantin Zhuravlyov	b695cd41b3	AMDGPU: Remove outdated fixme (it was already fixed) llvm-svn: 317266	2017-11-02 20:48:06 +00:00
Shoaib Meenai	c542c3e0ec	[cmake] Remove policy conditionals LLVM now requires a minimum of cmake 3.4.3, and all the policies currently being set are present in that cmake version, so the conditionals will always be true and are therefore unnecessary. The movation is that the conditionals can give the false impression that the policy settings are optional, whereas for example it's necessary to set CMP0056 in order for `check_linker_flags` to operate correctly after r316972. Inline the project version and language setting in the process. Differential Revision: https://reviews.llvm.org/D39442 llvm-svn: 317264	2017-11-02 20:33:36 +00:00
Hans Wennborg	f55435ab61	Fix llvm-dsymutil test in -DLLVM_ENABLE_THREADS=OFF mode After r316999, tools/dsymutil/X86/alias.test started failing in builds that have threading disabled. llvm-svn: 317263	2017-11-02 20:22:03 +00:00
Martin Storsjo	20910f46b6	[test] Move llvm-lib tests into tools/llvm-lib. NFC. Similarly to SVN r317189 for llvm-dlltool, these are probably easier to find in a tools subdirectory with a name identical to the tool, than in a toplevel directory with a different name. This matches the move of LibDriver itself in SVN r302995. Differential Revision: https://reviews.llvm.org/D39531 llvm-svn: 317262	2017-11-02 20:05:20 +00:00
Craig Topper	1494915a3a	[X86] Simplify the pentium4 code in getHostCPUName to be based on feature flags. Don't use 'x86-64' ever. 'x86-64' has started to reflect a sort of generic tuning flag for more modern 64-bit CPUs. We probably shouldn't be using it as the name of an unidentifiable pentium4. So use nocona for all 64-bit pentium4s instead. llvm-svn: 317230	2017-11-02 19:13:34 +00:00
Craig Topper	a233e16cd8	[X86] Change getHostCPUName fallback code to not select 'x86-64' for unknown CPUs in family 6 that has 64-bit support but not any newer SSE features. Use 'core2' instead We know that's the earliest CPU with 64-bit support. x86-64 has taken on a role of representing a more modern 64-bit CPU so we probably shouldn't be using that when we can't identify things. llvm-svn: 317229	2017-11-02 19:13:32 +00:00
Jonas Devlieghere	fb7bf1d7f2	[dsymutil][doc] Improve wording in manpage and rename file. - Improve wording - Rename llvm-dsymutil to dsymutil - Name -arch=<arch> argument Differential revision: https://reviews.llvm.org/D39561 llvm-svn: 317226	2017-11-02 18:44:54 +00:00
Anna Thomas	729dafc16b	Strip off invariant.start because memory locations arent invariant The original change was reverted in rL317217 because of the failure in the RS4GC testcase. I couldn't reproduce the failure on my local machine (macbook) but could reproduce it on a linux box. The failure was around removing the uses of invariant.start. The fix here is to just RAUW undef (which was the first implementation in D39388). This is perfectly valid IR as discussed in the review. llvm-svn: 317225	2017-11-02 18:24:04 +00:00
Mitch Phillips	6d2590baec	Fixed line length style issue. Reviewers: zturner Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39395 llvm-svn: 317223	2017-11-02 18:04:44 +00:00
Chad Rosier	cd1b93c4e4	[TargetParser][AArch64] Reorder enum to preserve 5.0.0 libLLVM ABI. This is required for backporting r311659 to the 5.0.1 release. PR35060 Differential Revision: https://reviews.llvm.org/D39558 llvm-svn: 317222	2017-11-02 17:52:27 +00:00
Jonas Devlieghere	0203ac9703	[dsymutil] Add a manpage for dsymutil llvm-svn: 317221	2017-11-02 17:12:34 +00:00
Anna Thomas	ebe429d99f	Revert "[RS4GC] Strip off invariant.start because memory locations arent invariant" This reverts commit r317215, investigating the test failure. llvm-svn: 317217	2017-11-02 16:45:51 +00:00
Anna Thomas	486a7aaa31	[RS4GC] Strip off invariant.start because memory locations arent invariant Summary: Invariant.start on memory locations has the property that the memory location is unchanging. However, this is not true in the face of rewriting statepoints for GC. Teach RS4GC about removing invariant.start so that optimizations after RS4GC does not incorrect sink a load from the memory location past a statepoint. Added test showcasing the issue. Reviewers: reames, apilipenko, dneilson Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39388 llvm-svn: 317215	2017-11-02 16:23:31 +00:00
Clement Courbet	82bade615b	Revert "[ExpandMemCmp] Split ExpandMemCmp from CodeGen into its own pass." undefined reference to `llvm::TargetPassConfig::ID' on clang-ppc64le-linux-multistage This reverts commit eea333c33fa73ad225ef28607795984829f65688. llvm-svn: 317213	2017-11-02 15:53:10 +00:00
Clement Courbet	1dc37b9c3b	[ExpandMemCmp] Split ExpandMemCmp from CodeGen into its own pass. Summary: This is mostly a noop (most of the test diffs are renamed blocks). There are a few temporary register renames (eax<->ecx) and a few blocks are shuffled around. See the discussion in PR33325 for more details. Reviewers: spatel Subscribers: mgorny Differential Revision: https://reviews.llvm.org/D39456 llvm-svn: 317211	2017-11-02 15:02:51 +00:00
Ayman Musa	a37d1130d7	[X86] Fix bug in legalize vector types - Split large loads When splitting a large load to smaller legally-typed loads, the last load should be padded to reach the size of the previous one so a CONCAT_VECTORS node could reunite them again. The code currently pads the last load to reach the size of the first load (instead of the previous). Differential Revision: https://reviews.llvm.org/D38495 Change-Id: Ib60b55ed26ce901fabf68108daf52683fbd5013f llvm-svn: 317206	2017-11-02 13:07:06 +00:00
Simon Dardis	725acb2d91	[mips] Use register scavenging with MSA. MSA stores and loads to the stack are more likely to require an emergency GPR spill slot due to the smaller offsets available with those instructions. Handle this by overestimating the size of the stack by determining the largest offset presuming that all callee save registers are spilled and accounting of incoming arguments when determining whether an emergency spill slot is required. Reviewers: atanasyan Differential Revision: https://reviews.llvm.org/D39056 llvm-svn: 317204	2017-11-02 12:47:22 +00:00
Sam McCall	0e142499a9	Temporary workaround for msan false positive. llvm-svn: 317203	2017-11-02 12:29:47 +00:00
Michael Zuckerman	0c20b690a2	Adding test for extraxt sub vector load and store avx512 Change-Id: Iefcb0ec6b6aa1b530ce5358081f02e6e522a8e50 llvm-svn: 317202	2017-11-02 12:19:36 +00:00
Yichao Yu	6fefc0d65e	Allow inaccessiblememonly and inaccessiblemem_or_argmemonly to be overwriten on call site with operand bundle Summary: Similar to argmemonly, readonly and readnone. Fix PR35128 Reviewers: andrew.w.kaylor, chandlerc, hfinkel Reviewed By: hfinkel Subscribers: hfinkel, llvm-commits Differential Revision: https://reviews.llvm.org/D39434 llvm-svn: 317201	2017-11-02 12:18:33 +00:00
Francis Visoiu Mistrih	66d2c269dc	[AsmPrinterDwarf] Add support for .cfi_restore directive As of today we only use .cfi_offset to specify the offset of a CSR, but we never use .cfi_restore when the CSR is restored. If we want to perform a more advanced type of shrink-wrapping, we need to use .cfi_restore in order to switch the CFI state between blocks. This patch only aims at adding support for the directive. Differential Revision: https://reviews.llvm.org/D36114 llvm-svn: 317199	2017-11-02 12:00:58 +00:00
Bjorn Pettersson	e73b85d1ab	[SimplifyCFG] Discard speculated dbg intrinsics Summary: SpeculativelyExecuteBB can flatten the CFG by doing speculative execution followed by a select instruction. When the speculatively executed BB contained dbg intrinsics the result could be a little bit weird, since those dbg intrinsics were inserted before the select in the flattened CFG. So when single stepping in the debugger, printing the value of the variable referenced in the dbg intrinsic, it could happen that it looked like the variable had values that never actually were assigned to the variable. This patch simply discards all dbg intrinsics that were found in the speculatively executed BB. Reviewers: aprantl, chandlerc, craig.topper Reviewed By: aprantl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39494 llvm-svn: 317198	2017-11-02 11:55:14 +00:00
Sam Parker	242052c6b4	[ARM] and, or, xor and add with shl combine The generic dag combiner will fold: (shl (add x, c1), c2) -> (add (shl x, c2), c1 << c2) (shl (or x, c1), c2) -> (or (shl x, c2), c1 << c2) This can create constants which are too large to use as an immediate. Many ALU operations are also able of performing the shl, so we can unfold the transformation to prevent a mov imm instruction from being generated. Other patterns, such as b + ((a << 1) \| 510), can also be simplified in the same manner. Differential Revision: https://reviews.llvm.org/D38084 llvm-svn: 317197	2017-11-02 10:43:10 +00:00
Andrew V. Tischenko	3c8bf5ec37	The patch updates sched numbers for YMM AVX instrs such as VMOVx, VORx, VXOR, VPERMILx, VBROADCASTx, etc. PR32857 should be closed. Differential Revision: https://reviews.llvm.org/D39227 llvm-svn: 317196	2017-11-02 10:33:41 +00:00
Sam McCall	8b4a9fa0b6	Update go bindings to use new functions from rL317135. This fixes duplicate symbol problems. llvm-svn: 317195	2017-11-02 10:22:26 +00:00
NAKAMURA Takumi	965602ac82	llvm-c/DebugInfo.h: Fix warning. [-Wdocumentation] llvm-svn: 317191	2017-11-02 08:03:12 +00:00
Martin Storsjo	53d7ba76e4	[test] Move llvm-dlltool tests into tools/llvm-dlltool. NFC. A toplevel test directory DllTool isn't consistent with other similar tools. Differential Revision: https://reviews.llvm.org/D39513 llvm-svn: 317189	2017-11-02 07:57:32 +00:00
Craig Topper	3bbe24c3ca	[X86] Remove the model checks from the 486 detection code in Host.cpp This just provided a bunch of comments to read and not much else. llvm-svn: 317185	2017-11-02 03:32:50 +00:00
Craig Topper	094d7914ae	[X86] Simplify the detection of pentium-mmx in Host.cpp. Rather than looking at model numbers just check for the mmx feature flag. While there promote INTEL_PENTIUM_MMX to a CPU type instead of a subtype so that we don't have weird type with only one subtype. llvm-svn: 317184	2017-11-02 03:32:49 +00:00
Eric Christopher	826a02328f	Revert "Remove some of the go specific C bindings for debug info now that they've been migrated into the main C API." This reverts commits r317151 and 317152 llvm-svn: 317154	2017-11-02 01:46:49 +00:00
Steven Wu	d0f59f0a5b	[X86] Fix fast-isel-int-float-conversion test Test is failing due to the revert in r317136. Fix the test to make all the bots happy. llvm-svn: 317153	2017-11-02 01:34:15 +00:00
Eric Christopher	295bfa9023	Fix for go bindings header to match previous commit. llvm-svn: 317152	2017-11-02 01:25:00 +00:00
Eric Christopher	18351b62db	Remove some of the go specific C bindings for debug info now that they've been migrated into the main C API. Fixes a go bindings breakage after r317135. llvm-svn: 317151	2017-11-02 01:24:12 +00:00
Shoaib Meenai	909f5c9b7c	[cmake] Switch FATAL_ERROR to SEND_ERROR It's possible for multiple distribution components to have missing targets, and it's a lot more convenient to get all those errors in one shot rather than having to fix them individually. llvm-svn: 317148	2017-11-02 01:07:37 +00:00
Mitch Phillips	4ab6fc0cd6	Update cl::opt<uint64_t> instances to cl::opt<unsigned long long> cl::opt<uint64_t> fails when parsing command line arguments. See https://bugs.llvm.org/show_bug.cgi?id=19665. Reviewers: pcc Subscribers: mgorny, llvm-commits, kcc Differential Revision: https://reviews.llvm.org/D38657 llvm-svn: 317141	2017-11-01 23:39:41 +00:00
Jake Ehrlich	03aeeb09c5	[yaml2obj][ELF] Add support for setting alignment in program headers Sometimes program headers have larger alignments than any of the sections they contain. Currently yaml2obj can't produce such files. A bug recently appeared in llvm-objcopy that failed in such a case. I'd like to be able to add tests to llvm-objcopy for such cases. This change adds an optional alignment parameter to program headers that will be used instead of calculating the alignment. Differential Revision: https://reviews.llvm.org/D39130 llvm-svn: 317139	2017-11-01 23:14:48 +00:00
Adrian Prantl	bfa77c4c85	loop-unroll: teach remapInstruction to update dbg.value intrinsics. Fixes PR35112. https://bugs.llvm.org/show_bug.cgi?id=35112 llvm-svn: 317138	2017-11-01 23:12:35 +00:00
Petar Jovanovic	bb5c84fb57	Revert "Correct dwarf unwind information in function epilogue for X86" This reverts r317100 as it introduced sanitizer-x86_64-linux-autoconf buildbot failure (build #15606). llvm-svn: 317136	2017-11-01 23:05:52 +00:00
whitequark	789164d426	[LLVM-C] Expose functions to create debug locations via DIBuilder. These include: * Several functions for creating an LLVMDIBuilder, * LLVMDIBuilderCreateCompileUnit, * LLVMDIBuilderCreateFile, * LLVMDIBuilderCreateDebugLocation. Patch by Harlan Haskins. Differential Revision: https://reviews.llvm.org/D32368 llvm-svn: 317135	2017-11-01 22:18:52 +00:00
Craig Topper	3837322a6b	[X86] Use foreach in X86.td to combine some of the CPU names that are obviously aliases. NFC llvm-svn: 317134	2017-11-01 22:15:49 +00:00
Craig Topper	7a754c4622	[X86] Add CMOV feature to 'i686' processor, making it a proper alias of pentiumpro which I believe it should be. This is consistent with current gcc behavior. llvm-svn: 317133	2017-11-01 22:15:40 +00:00
Daniel Sanders	466fe399b8	[globalisel][regbank] Warn about MIR ambiguities when register bank/class names clash. llvm-svn: 317132	2017-11-01 22:13:05 +00:00
Simon Pilgrim	e152c2c447	[X86][SSE] Add PACKUS support to LowerTruncate Similar to the existing code to lower to PACKSS, we can use PACKUS if the input vector's leading zero bits extend all the way to the packed/truncated value. We have to account for pre-SSE41 targets not supporting PACKUSDW llvm-svn: 317128	2017-11-01 21:52:29 +00:00
Rui Ueyama	a16fe65b72	Rewrite FileOutputBuffer as two separate classes. This patch is to rewrite FileOutputBuffer as two separate classes; one for file-backed output buffer and the other for memory-backed output buffer. I think the new code is easier to follow because two different implementations are now actually separated as different classes. Unlike the previous implementation, the class that does not replace the final output file using rename(2) does not create a temporary file at all. Instead, it allocates memory using mmap(2) and use it. I think this is an improvement because it is now guaranteed that the temporary memory region doesn't trigger any I/O and there's now zero chance to leave a temporary file behind. Also, it shouldn't impose new restrictions because were using mmap IO too. Differential Revision: https://reviews.llvm.org/D39449 llvm-svn: 317127	2017-11-01 21:38:14 +00:00
Eugene Zelenko	0ad18f888e	[dsymutil, llvm-objcopy] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 317123	2017-11-01 21:16:06 +00:00
Craig Topper	4e56ba271e	[X86] Add custom code to EVEX to VEX pass to turn unmasked 128-bit VPALIGND/Q into VPALIGNR if the extended registers aren't being used. This will enable us to prefer VALIGND/Q during shuffle lowering in order to get the extended register encoding space when BWI isn't available. But if we end up not using the extended registers we can switch VPALIGNR for the shorter VEX encoding. Differential Revision: https://reviews.llvm.org/D39401 llvm-svn: 317122	2017-11-01 21:00:59 +00:00
Adrian Prantl	98c6549e4a	loop-rotate: avoid duplicating dbg.value intrinsics in the entry block. This fixes the second half of PR35113. This reapplies r317106 without modifications. llvm-svn: 317121	2017-11-01 20:53:22 +00:00
Adrian Prantl	d60f34c20a	loop-rotate: eliminate duplicate debug intrinsics after splicing. Fixes part of PR35113. This reapplies r317105 with an additional check for isa<Instruction> as found by the bots. llvm-svn: 317120	2017-11-01 20:43:30 +00:00
Dehao Chen	c6c051f2ea	Include GUIDs from the same module when computing GUIDs that needs to be imported. Summary: In the compile phase of SamplePGO+ThinLTO, ICP is not invoked. Instead, indirect call targets will be included as function metadata for ThinIndex to buidl the call graph. This should not only include functions defined in other modules, but also functions defined in the same module, otherwise ThinIndex may find the callee dead and eliminate it, while ICP in backend will revive the symbol, which leads to undefined symbol. Reviewers: tejohnson Reviewed By: tejohnson Subscribers: sanjoy, llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D39480 llvm-svn: 317118	2017-11-01 20:26:47 +00:00
Daniel Sanders	9cbe7c7f93	[globalisel][tablegen] Add support for multi-insn emission The importer will now accept nested instructions in the result pattern such as (ADDWrr $a, (SUBWrr $b, $c)). This is only valid when the nested instruction def's a single vreg and the parent instruction consumes a single vreg where a nested instruction is specified. The importer will automatically create a vreg to connect the two using the type information from the pattern. This vreg will be constrained to the register classes given in the instruction definitions. REG_SEQUENCE is explicitly rejected because of this. The definition doesn't constrain to a register class and it therefore needs special handling. llvm-svn: 317117	2017-11-01 19:57:57 +00:00
Philip Reames	7b861f08cd	Revert 317016 and 317048 The former appears to have introduced a miscompile in a stage2 clang build. Revert so I can investigate offline. llvm-svn: 317116	2017-11-01 19:49:20 +00:00
Konstantin Zhuravlyov	435151ad75	AMDGPU: Fix set but not used warnings related to AMDGPUAS Differential Revision: https://reviews.llvm.org/D39499 llvm-svn: 317114	2017-11-01 19:12:38 +00:00
Craig Topper	ca1aa83cbe	[X86] Prevent fast isel from folding loads into the instructions listed in hasPartialRegUpdate. This patch moves the check for opt size and hasPartialRegUpdate into the lower level implementation of foldMemoryOperandImpl to catch the entry point that fast isel uses. We're still folding undef register instructions in AVX that we should also probably disable, but that's a problem for another patch. Unfortunately, this requires reordering a bunch of functions which is why the diff is so large. I can do the function reordering separately if we want. Differential Revision: https://reviews.llvm.org/D39402 llvm-svn: 317112	2017-11-01 18:10:06 +00:00
Graham Yiu	671526148c	Adds code to PPC ISEL lowering to recognize half-word inserts from vector_shuffles, and use P9 shift and vector insert instructions instead of vperm. Differential Revision: https://reviews.llvm.org/D34160 llvm-svn: 317111	2017-11-01 18:06:56 +00:00
Adrian Prantl	c8516346e4	Revert r317105 to investigate bot breakage. llvm-svn: 317110	2017-11-01 18:06:38 +00:00
Adrian Prantl	40a0ea5f29	Revert r317106 to facilitate reverting r317105. llvm-svn: 317109	2017-11-01 18:06:35 +00:00
Peter Collingbourne	9fb6e1a037	LTO: Apply global DCE to ThinLTO modules at LTO opt level 0. This is necessary because DCE is applied to full LTO modules. Without this change, a reference from a dead ThinLTO global to a dead full LTO global will result in an undefined reference at link time. This problem is only observable when --gc-sections is disabled, or when targeting COFF, as the COFF port of lld requires all symbols to have a definition even if all references are dead (this is consistent with link.exe). This change also adds an EliminateAvailableExternally pass at -O0. This is necessary to handle the situation on Windows where a non-prevailing copy of a linkonce_odr function has an SEH filter function; any such filters must be DCE'd because they will contain a call to the llvm.localrecover intrinsic, passing as an argument the address of the function that the filter belongs to, and llvm.localrecover requires this function to be defined locally. Fixes PR35142. Differential Revision: https://reviews.llvm.org/D39484 llvm-svn: 317108	2017-11-01 17:58:39 +00:00
Craig Topper	56db9d6bec	[X86] Regnerate test to attempt to fix build bot failure. llvm-svn: 317107	2017-11-01 17:44:12 +00:00
Adrian Prantl	9259f21604	loop-rotate: avoid duplicating dbg.value intrinsics in the entry block. This fixes the second half of PR35113. llvm-svn: 317106	2017-11-01 17:28:50 +00:00
Adrian Prantl	b627acd0ce	loop-rotate: eliminate duplicate debug intrinsics after splicing. Fixes part of PR35113. llvm-svn: 317105	2017-11-01 17:28:47 +00:00
Jonas Devlieghere	369a7ecc56	[dsymutil][NFC} Rename thread related command line options This makes the command line options consistent with llvm-cov and llvm-profdata, which both use `-num-threads` and `-j`. This also addresses the conflict reported after landing D39355. Differential revision: https://reviews.llvm.org/D39496 llvm-svn: 317104	2017-11-01 17:15:29 +00:00
Craig Topper	5ae677e102	[X86] Add 64-bit int to float/double conversion with AVX to X86FastISel::X86SelectSIToFP Summary: [X86] Teach fast isel to handle i64 sitofp with AVX. For some reason we only handled i32 sitofp with AVX. But with SSE only we support i64 so we should do the same with AVX. Also add i686 command lines for the 32-bit tests. 64-bit tests are in a separate file to avoid a fast-isel abort failure in 32-bit mode. Reviewers: RKSimon, zvi Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39450 llvm-svn: 317102	2017-11-01 16:23:06 +00:00
Andrew V. Tischenko	3d971e39f8	Update VCVTx, VMOVNTPx and VROUNDYPx instructions scheduling on btver2. Differential Revision: https://reviews.llvm.org/D39059 llvm-svn: 317101	2017-11-01 16:10:20 +00:00
Petar Jovanovic	f2faee92aa	Correct dwarf unwind information in function epilogue for X86 This patch aims to provide correct dwarf unwind information in function epilogue for X86. It consists of two parts. The first part inserts CFI instructions that set appropriate cfa offset and cfa register in emitEpilogue() in X86FrameLowering. This part is X86 specific. The second part is platform independent and ensures that: - CFI instructions do not affect code generation - Unwind information remains correct when a function is modified by different passes. This is done in a late pass by analyzing information about cfa offset and cfa register in BBs and inserting additional CFI directives where necessary. Changed CFI instructions so that they: - are duplicable - are not counted as instructions when tail duplicating or tail merging - can be compared as equal Added CFIInstrInserter pass: - analyzes each basic block to determine cfa offset and register valid at its entry and exit - verifies that outgoing cfa offset and register of predecessor blocks match incoming values of their successors - inserts additional CFI directives at basic block beginning to correct the rule for calculating CFA Having CFI instructions in function epilogue can cause incorrect CFA calculation rule for some basic blocks. This can happen if, due to basic block reordering, or the existence of multiple epilogue blocks, some of the blocks have wrong cfa offset and register values set by the epilogue block above them. CFIInstrInserter is currently run only on X86, but can be used by any target that implements support for adding CFI instructions in epilogue. Patch by Violeta Vukobrat. Differential Revision: https://reviews.llvm.org/D35844 llvm-svn: 317100	2017-11-01 16:04:11 +00:00
Simon Pilgrim	778810eb42	[X86][SSE] Begun generalizing truncateVectorWithPACKSS to work with PACKSS/PACKUS functions Renamed to truncateVectorWithPACK llvm-svn: 317098	2017-11-01 15:31:51 +00:00
Simon Pilgrim	2b09f39b4d	Regenerate PACKUS/TRUNCS test (PR31773) llvm-svn: 317096	2017-11-01 15:27:23 +00:00
Geoff Berry	eed6531ea2	[BranchProbabilityInfo] Handle irreducible loops. Summary: Compute the strongly connected components of the CFG and fall back to use these for blocks that are in loops that are not detected by LoopInfo when computing loop back-edge and exit branch probabilities. Reviewers: dexonsmith, davidxl Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D39385 llvm-svn: 317094	2017-11-01 15:16:50 +00:00
Roger Ferrer Ibanez	9dfbc10522	Revert r313618 "[ARM] Use ADDCARRY / SUBCARRY" That change causes PR35103, so reverting until I figure it out. llvm-svn: 317092	2017-11-01 14:06:57 +00:00
NAKAMURA Takumi	1657f2ad99	Fix warnings discovered by rL317076. [-Wunused-private-field] llvm-svn: 317091	2017-11-01 13:47:55 +00:00
NAKAMURA Takumi	f7d7a59b9e	Suppress a warning discovered by rL317076. [-Wunused-private-field] llvm-svn: 317090	2017-11-01 13:47:51 +00:00
Max Kazantsev	6f5229d7da	Revert rL311205 "[IRCE] Fix buggy behavior in Clamp" This patch reverts rL311205 that was initially a wrong fix. The real problem was in intersection of signed and unsigned ranges (see rL316552), and the patch being reverted masked the problem instead of fixing it. By now, the test against which rL311205 was made works OK even without this code. This revert patch also contains a test case that demonstrates incorrect behavior caused by rL311205: it is caused by incorrect choise of signed max instead of unsigned. llvm-svn: 317088	2017-11-01 13:21:56 +00:00
Simon Pilgrim	687982c181	[SelectionDAG] computeKnownBits - use ashrInPlace on known bits of ISD::SRA input. NFCI. llvm-svn: 317087	2017-11-01 13:16:48 +00:00
Simon Pilgrim	f657ba0cb6	[X86][SSE] Truncate with PACKSS any input with sufficient sign-bits So far we've only been using PACKSS truncations with 'all-bits or zero-bits' patterns (vector comparison results etc.). When really we can safely use it for any case as long as the number of sign bits reach down to the last 16-bits (or 8-bits if we're truncating to bytes). The next steps after this is add the equivalent support for PACKUS and to support packing to sub-128 bit vectors for truncating stores etc. Differential Revision: https://reviews.llvm.org/D39476 llvm-svn: 317086	2017-11-01 11:47:44 +00:00
Florian Hahn	b93c06331e	[CodeExtractor] Fix iterator invalidation in findOrCreateBlockForHoisting. Summary: By replacing branches to CommonExitBlock, we remove the node from CommonExitBlock's predecessors, invalidating the iterator. The problem is exposed when the common exit block has multiple predecessors and needs to sink lifetime info. The modification in the test case trigger the issue. Reviewers: davidxl, davide, wmi Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39112 llvm-svn: 317084	2017-11-01 09:48:12 +00:00
Serguei Katkov	f2c2851efe	Fix APFloat mod sign fmod specification requires the sign of the remainder is the same as numerator in case remainder is zero. Reviewers: gottesmm, scanon, arsenm, davide, craig.topper Reviewed By: scanon Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D39225 llvm-svn: 317081	2017-11-01 07:56:55 +00:00
Craig Topper	688f0ca6a7	[X86] Add more type qualifiers to INSERT_SUBREG operations in rotate patterns so they don't get created with a v64i8 type. Not sure why tablegen didn't error on this. Fixes PR35158. llvm-svn: 317079	2017-11-01 07:11:32 +00:00
NAKAMURA Takumi	53fc7e1763	Reformat. llvm-svn: 317078	2017-11-01 05:14:35 +00:00
NAKAMURA Takumi	7c1ef4a88c	Revert rL317019, "[ADT] Split optional to only include copy mechanics and dtor for non-trivial types." Seems g++-4.8 (eg. Ubuntu 14.04) doesn't like this. llvm-svn: 317077	2017-11-01 05:14:31 +00:00
Craig Topper	c51aac675d	[DAGCombiner] Fix typos in comments. NFC llvm-svn: 317072	2017-11-01 03:30:52 +00:00
Mitch Phillips	41450d583b	Add test dependency on llvm-cfi-verify to fix up the build breakages on sanitizers. llvm-svn: 317060	2017-11-01 00:49:45 +00:00
Craig Topper	a827f84dcc	[X86] Add AVX512 support to X86FastISel::fastMaterializeFloatZero. llvm-svn: 317059	2017-11-01 00:47:45 +00:00
Daniel Sanders	198447a447	[globalisel][tablegen] Stop hard-coding the emitted instruction ID to 0. NFC The next commit will add support for multi-instruction emission so we need to start allocating instruction ID's instead of hard-coding them to 0. llvm-svn: 317057	2017-11-01 00:29:47 +00:00
Jake Ehrlich	cde781015f	Add system-linux to allow tests run with llvm-lit to restrict themselves to linux I need a test that only runs in a reasonable amount of time on systems that have sparse files. The broadest class of systems that support sparse files are linux systems. So restricting my test to linux systems should suffice. This change adds the system-linux feature to llvm-lit so that it can be required. Differential Revision: https://reviews.llvm.org/D39482 llvm-svn: 317055	2017-11-01 00:18:51 +00:00

1 2 3 4 5 ...

156346 Commits