llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	42010dc810	[X86][SSE] Don't merge known undef/zero elements into target shuffle masks. Replaces setTargetShuffleZeroElements with getTargetShuffleAndZeroables which reports the Zeroable elements but doesn't merge them into the decoded target shuffle mask (the merging has been moved up into getTargetShuffleInputs until we can get rid of it entirely). This is part of the work to fix PR43024 and allow us to use SimplifyDemandedElts to simplify shuffle chains - we need to get to a point where the target shuffle mask isn't adjusted by its source inputs but instead we cache them in a parallel Zeroable mask. llvm-svn: 373867	2019-10-06 19:06:45 +00:00
Richard Smith	344df110e5	Implements CWG 1601 in [over.ics.rank/4.2] Summary: The overload resolution for enums with a fixed underlying type has changed in the C++14 standard. This patch implements the new rule. Patch by Mark de Wever! Reviewers: rsmith Reviewed By: rsmith Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D65695 llvm-svn: 373866	2019-10-06 18:50:40 +00:00
Craig Topper	570ae49d03	[X86] Add custom type legalization for v16i64->v16i8 truncate and v8i64->v8i8 truncate when v8i64 isn't legal Summary: The default legalization for v16i64->v16i8 tries to create a multiple stage truncate concatenating after each stage and truncating again. But avx512 implements truncates with multiple uops. So it should be better to truncate all the way to the desired element size and then concatenate the pieces using unpckl instructions. This minimizes the number of 2 uop truncates. The unpcks are all single uop instructions. I tried to handle this by just custom splitting the v16i64->v16i8 shuffle. And hoped that the DAG combiner would leave the two halves in the state needed to make D68374 do the job for each half. This worked for the first half, but the second half got messed up. So I've implemented custom handling for v8i64->v8i8 when v8i64 needs to be split to produce the VTRUNCs directly. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68428 llvm-svn: 373864	2019-10-06 18:43:08 +00:00
Craig Topper	842dde6be4	[LegalizeTypes][X86] When splitting a vselect for type legalization, don't split a setcc condition if the setcc input is legal and vXi1 conditions are supported Summary: The VSELECT splitting code tries to split a setcc input as well. But on avx512 where mask registers are well supported it should be better to just split the mask and use a single compare. Reviewers: RKSimon, spatel, efriedma Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68359 llvm-svn: 373863	2019-10-06 18:43:03 +00:00
Richard Smith	1e9c0cecab	[Sema] Avoids an assertion failure when an invalid conversion declaration is used Summary: When using a user-defined conversion function template with a deduced return type the compiler gives a set of warnings: ``` bug.cc:252:44: error: cannot specify any part of a return type in the declaration of a conversion function; use an alias template to declare a conversion to 'auto (Ts &&...) const' template <typename... Ts> operator auto()(Ts &&... xs) const; ^~~~~~~~~~~~~~~~~~~ bug.cc:252:29: error: conversion function cannot convert to a function type template <typename... Ts> operator auto()(Ts &&... xs) const; ^ error: pointer to function type cannot have 'const' qualifier ``` after which it triggers an assertion failure. It seems the last error is incorrect and doesn't have any location information. This patch stops the compilation after the second warning. Fixes bug 31422. Patch by Mark de Wever! Reviewers: rsmith Reviewed By: rsmith Subscribers: bbannier, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D64820 llvm-svn: 373862	2019-10-06 18:40:59 +00:00
Nico Weber	598e7a3a68	gn build: make windows build less broken llvm-svn: 373858	2019-10-06 18:11:53 +00:00
Whitney Tsang	dcb75bf843	[LOOPGUARD] Remove asserts in getLoopGuardBranch Summary: The assertion in getLoopGuardBranch can be a 'return nullptr' under if condition. Authored By: DTharun Reviewer: Whitney, fhahn Reviewed By: Whitney, fhahn Subscribers: fhahn, llvm-commits Tag: LLVM Differential Revision: https://reviews.llvm.org/D66084 llvm-svn: 373857	2019-10-06 16:39:43 +00:00
DeForest Richards	de0e3aac2a	[Docs] Removes Programming Documentation page Removes Programming Documentation page. Also moves existing topics on Programming Documentation page to User Guides and Reference pages. llvm-svn: 373856	2019-10-06 16:10:11 +00:00
Simon Pilgrim	5c876303ec	[X86][SSE] resolveTargetShuffleInputs - call getTargetShuffleInputs instead of using setTargetShuffleZeroElements directly. NFCI. llvm-svn: 373855	2019-10-06 15:42:25 +00:00
DeForest Richards	6d19651410	[Docs] Adds new Getting Started/Tutorials page Adds a new page for Getting Started/Tutorials topics. Also updates existing topic categories on the User Guides and Reference pages. llvm-svn: 373854	2019-10-06 15:36:37 +00:00
Sanjay Patel	f643fabb52	Revert [DAGCombine] Match more patterns for half word bswap This reverts r373850 (git commit `25ba49824d`) This patch appears to cause multiple codegen regression test failures - http://lab.llvm.org:8011/builders/clang-cmake-armv7-quick/builds/10680 llvm-svn: 373853	2019-10-06 15:27:34 +00:00
Xiangling Liao	ee68f1ec67	[NFC] Replace 'isDarwin' with 'IsDarwin' Summary: Replace 'isDarwin' with 'IsDarwin' based on LLVM naming convention. Differential Revision: https://reviews.llvm.org/D68336 llvm-svn: 373852	2019-10-06 14:44:22 +00:00
Sanjay Patel	aab8b3ab9c	[InstCombine] fold fneg disguised as select+fmul (PR43497) Extends rL373230 and solves the motivating bug (although in a narrow way): https://bugs.llvm.org/show_bug.cgi?id=43497 llvm-svn: 373851	2019-10-06 14:15:48 +00:00
Amaury Sechet	25ba49824d	[DAGCombine] Match more patterns for half word bswap Summary: It ensures that the bswap is generated even when a part of the subtree already matches a bswap transform. Reviewers: craig.topper, efriedma, RKSimon, lebedev.ri Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68250 llvm-svn: 373850	2019-10-06 14:14:55 +00:00
Simon Pilgrim	2dee7e5561	[X86][AVX] combineExtractSubvector - merge duplicate variables. NFCI. llvm-svn: 373849	2019-10-06 13:25:10 +00:00
Sanjay Patel	61c22a83de	[InstCombine] add fast-math-flags for better test coverage; NFC llvm-svn: 373848	2019-10-06 13:19:05 +00:00
Sanjay Patel	c38881a6b7	[InstCombine] don't assume 'inbounds' for bitcast pointer to GEP transform (PR43501) https://bugs.llvm.org/show_bug.cgi?id=43501 We can't declare a GEP 'inbounds' in general. But we may salvage that information if we have known dereferenceable bytes on the source pointer. Differential Revision: https://reviews.llvm.org/D68244 llvm-svn: 373847	2019-10-06 13:08:08 +00:00
Simon Pilgrim	032dd9b086	[X86][SSE] matchVectorShuffleAsBlend - use Zeroable element mask directly. We can make use of the Zeroable mask to indicate which elements we can safely set to zero instead of creating a target shuffle mask on the fly. This allows us to remove createTargetShuffleMask. This is part of the work to fix PR43024 and allow us to use SimplifyDemandedElts to simplify shuffle chains - we need to get to a point where the target shuffle mask isn't adjusted by its source inputs in setTargetShuffleZeroElements but instead we cache them in a parallel Zeroable mask. llvm-svn: 373846	2019-10-06 12:38:38 +00:00
David Zarzycki	7653ff398d	[X86] Enable AVX512BW for memcmp() llvm-svn: 373845	2019-10-06 10:25:52 +00:00
Paul Hoad	c209598268	[clang-format][docs] Fix the Google C++ and Chromium style guide URLs Summary: The Google C++ and Chromium style guides are broken in the clang-format docs. This patch updates them. Reviewers: djasper, MyDeveloperDay Reviewed By: MyDeveloperDay Subscribers: cfe-commits Tags: #clang Patch by: m4tx Differential Revision: https://reviews.llvm.org/D61256 llvm-svn: 373844	2019-10-06 09:37:58 +00:00
Matt Arsenault	e59296a051	AMDGPU/GlobalISel: Fall back on weird G_EXTRACT offsets llvm-svn: 373842	2019-10-06 01:41:22 +00:00
Matt Arsenault	786a3953ba	AMDGPU/GlobalISel: RegBankSelect mul24 intrinsics llvm-svn: 373841	2019-10-06 01:37:39 +00:00
Matt Arsenault	c0ec72d4f8	AMDGPU/GlobalISel: RegBankSelect DS GWS intrinsics llvm-svn: 373840	2019-10-06 01:37:38 +00:00
Matt Arsenault	bcd6b1d209	AMDGPU/GlobalISel: Lower G_ATOMIC_CMPXCHG_WITH_SUCCESS llvm-svn: 373839	2019-10-06 01:37:37 +00:00
Matt Arsenault	a5b9c75674	GlobalISel: Partially implement lower for G_EXTRACT Turn into shift and truncate. Doesn't yet handle pointers. llvm-svn: 373838	2019-10-06 01:37:35 +00:00
Matt Arsenault	69c65a8609	AMDGPU/GlobalISel: Fix RegBankSelect for sendmsg intrinsics This wasn't updated for the immarg handling change. llvm-svn: 373837	2019-10-06 01:37:34 +00:00
Craig Topper	2decdf42b9	[FastISel] Copy the inline assembly dialect to the INLINEASM instruction. Fixes PR43575. llvm-svn: 373836	2019-10-05 23:21:17 +00:00
Simon Pilgrim	8815be04ec	[X86][AVX] Push sign extensions of comparison bool results through bitops (PR42025) As discussed on PR42025, with more complex boolean math we can end up with many truncations/extensions of the comparison results through each bitop. This patch handles the cases introduced in combineBitcastvxi1 by pushing the sign extension through the AND/OR/XOR ops so it's just the original SETCC ops that get extended. Differential Revision: https://reviews.llvm.org/D68226 llvm-svn: 373834	2019-10-05 20:49:34 +00:00
Sanjay Patel	e2321bb448	[SLP] avoid reduction transform on patterns that the backend can load-combine I don't see an ideal solution to these 2 related, potentially large, perf regressions: https://bugs.llvm.org/show_bug.cgi?id=42708 https://bugs.llvm.org/show_bug.cgi?id=43146 We decided that load combining was unsuitable for IR because it could obscure other optimizations in IR. So we removed the LoadCombiner pass and deferred to the backend. Therefore, preventing SLP from destroying load combine opportunities requires that it recognizes patterns that could be combined later, but not do the optimization itself ( it's not a vector combine anyway, so it's probably out-of-scope for SLP). Here, we add a scalar cost model adjustment with a conservative pattern match and cost summation for a multi-instruction sequence that can probably be reduced later. This should prevent SLP from creating a vector reduction unless that sequence is extremely cheap. In the x86 tests shown (and discussed in more detail in the bug reports), SDAG combining will produce a single instruction on these tests like: movbe rax, qword ptr [rdi] or: mov rax, qword ptr [rdi] Not some (half) vector monstrosity as we currently do using SLP: vpmovzxbq ymm0, dword ptr [rdi + 1] # ymm0 = mem[0],zero,zero,.. vpsllvq ymm0, ymm0, ymmword ptr [rip + .LCPI0_0] movzx eax, byte ptr [rdi] movzx ecx, byte ptr [rdi + 5] shl rcx, 40 movzx edx, byte ptr [rdi + 6] shl rdx, 48 or rdx, rcx movzx ecx, byte ptr [rdi + 7] shl rcx, 56 or rcx, rdx or rcx, rax vextracti128 xmm1, ymm0, 1 vpor xmm0, xmm0, xmm1 vpshufd xmm1, xmm0, 78 # xmm1 = xmm0[2,3,0,1] vpor xmm0, xmm0, xmm1 vmovq rax, xmm0 or rax, rcx vzeroupper ret Differential Revision: https://reviews.llvm.org/D67841 llvm-svn: 373833	2019-10-05 18:03:58 +00:00
Simon Pilgrim	9ecacb0d54	[X86] lowerShuffleAsLanePermuteAndRepeatedMask - variable renames. NFCI. Rename some variables to match lowerShuffleAsRepeatedMaskAndLanePermute - prep work toward adding some equivalent sublane functionality. llvm-svn: 373832	2019-10-05 16:08:30 +00:00
Simon Pilgrim	68f21b360b	Try to fix sphinx indentation error llvm-svn: 373831	2019-10-05 16:08:17 +00:00
David Bolvansky	41c934acaf	[SelectionDAG] Add tests for LKK algorithm Added some tests testing urem and srem operations with a constant divisor. Patch by TG908 (Tim Gymnich) Differential Revision: https://reviews.llvm.org/D68421 llvm-svn: 373830	2019-10-05 14:29:25 +00:00
Simon Pilgrim	43bbca9228	RewriteObjC - silence static analyzer getAs<> null dereference warnings. NFCI. The static analyzer is warning about potential null dereferences, but we should be able to use castAs<> directly and if not assert will fire for us. llvm-svn: 373829	2019-10-05 13:42:14 +00:00
David Bolvansky	83b81c1f6e	[Diagnostics] Highlight expr's source range for -Wbool-operation Warning message looks better; and GCC adds it too. llvm-svn: 373828	2019-10-05 13:28:15 +00:00
Simon Pilgrim	f4cc3b3e10	SemaTemplate - silence static analyzer getAs<> null dereference warnings. NFCI. The static analyzer is warning about potential null dereferences, but we should be able to use castAs<> directly and if not assert will fire for us. llvm-svn: 373827	2019-10-05 13:21:08 +00:00
Simon Pilgrim	22b6873195	TreeTransform - silence static analyzer getAs<> null dereference warnings. NFCI. The static analyzer is warning about potential null dereferences, but we should be able to use castAs<> directly and if not assert will fire for us. llvm-svn: 373826	2019-10-05 13:20:59 +00:00
Simon Pilgrim	0e82722f9a	Remove redundant !HasDependentValue check. NFCI. Fixes cppcheck warning. llvm-svn: 373825	2019-10-05 13:20:51 +00:00
Simon Pilgrim	20692a0d3d	SemaStmt - silence static analyzer getAs<> null dereference warnings. NFCI. The static analyzer is warning about potential null dereferences, but we should be able to use castAs<> directly and if not assert will fire for us. llvm-svn: 373824	2019-10-05 13:20:42 +00:00
Simon Pilgrim	f609c0a303	BranchFolding - IsBetterFallthrough - assert non-null pointers. NFCI. Silences static analyzer null dereference warnings. llvm-svn: 373823	2019-10-05 13:20:30 +00:00
Paul Hoad	375a84bb75	[clang-format] SpacesInSquareBrackets should affect lambdas with parameters too Summary: This patch makes the `SpacesInSquareBrackets` setting also apply to C++ lambdas with parameters. Looking through the revision history, it appears support for only array brackets was added, and lambda brackets were ignored. Therefore, I am inclined to think it was simply an omission, rather than a deliberate choice. See https://bugs.llvm.org/show_bug.cgi?id=17887 and https://reviews.llvm.org/D4944. Reviewers: MyDeveloperDay, reuk, owenpan Reviewed By: MyDeveloperDay Subscribers: cfe-commits Patch by: mitchell-stellar Tags: #clang-format, #clang Differential Revision: https://reviews.llvm.org/D68473 llvm-svn: 373821	2019-10-05 09:55:23 +00:00
James Molloy	b1f0183e57	[UnitTests] Try and pacify gcc-5 This looks like a defect in gcc-5 where it chooses a constexpr constructor from the initializer-list that it considers to be explicit. I've tried to reproduce but I can't install anything prior to gcc-6 easily on my system, and that doesn't have the error. So this is speculative pacification. Reported by Steven Wan. llvm-svn: 373820	2019-10-05 08:57:17 +00:00
Sylvestre Ledru	68f58a75a7	Do not install lit-cpuid Summary: AFAIK, lit-cpuid is used by the tests. Installing it causes LLVMExports*.cmake files to depend on this program. It causes some serious packaging issues as it would mean that llvm-dev depends on lldb. See: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=941082 https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=941306 See also https://bugs.llvm.org/show_bug.cgi?id=43035 for a similar issue caused by https://reviews.llvm.org/D56606 Reviewers: mgorny Reviewed By: mgorny Subscribers: delcypher, lldb-commits Differential Revision: https://reviews.llvm.org/D68537 llvm-svn: 373819	2019-10-05 08:22:40 +00:00
David Bolvansky	3acc649b86	[NFCI] Slightly improve warning message llvm-svn: 373818	2019-10-05 08:09:06 +00:00
David Bolvansky	559265c8da	[Diagnostics] Use Expr::isKnownToHaveBooleanValue() to check bitwise negation of bool in languages without a bool type Thanks for this advice, Richard Trieu! llvm-svn: 373817	2019-10-05 08:02:11 +00:00
Mehdi Amini	482f4d9aa9	Expose ProvidePositionalOption as a public API The motivation is to reuse the key value parsing logic here to parse instance specific pass options within the context of MLIR. The primary functionality exposed is the "," splitting for arrays and the logic for properly handling duplicate definitions of a single flag. Patch by: Parker Schuh <parkers@google.com> Differential Revision: https://reviews.llvm.org/D68294 llvm-svn: 373815	2019-10-05 01:37:04 +00:00
Philip Reames	d5a4dad206	Fix a nasty miscompile in experimental unordered atomic lowering This is an omission in rL371441. Loads which happened to be unordered weren't being added to the PendingLoad set, and thus weren't being ordered w/respect to side effects which followed before the end of the block. Included test case is how I spotted this. We had an atomic load being folded into a using instruction after a fence that load was supposed to be ordered with. I'm sure it showed up a bunch of other ways as well. Spotted via manual inspecting of assembly differences in a corpus w/and w/o the new experimental mode. Finding this with testing would have been "unpleasant". llvm-svn: 373814	2019-10-05 00:32:10 +00:00
Philip Reames	9fe5d730c7	[Test] Add a test case for a missed opportunity in implicit null checking llvm-svn: 373813	2019-10-04 23:46:26 +00:00
Ana Pazos	ea835f5ce8	[RISCV] Added missing ImmLeaf predicates simm9_lsb0 and simm12_lsb0 operand types were missing predicates. llvm-svn: 373812	2019-10-04 23:42:07 +00:00
Aditya Kumar	50afaa9d34	Add a unittest to verify for assumption cache Reviewers: vsk, tejohnson Reviewed By: vsk Differential Revision: https://reviews.llvm.org/D68095 llvm-svn: 373811	2019-10-04 23:36:59 +00:00
Jonas Devlieghere	4e5d9e120b	[MachO] Reduce indentation further in ParseSymtab (NFC) llvm-svn: 373810	2019-10-04 23:09:55 +00:00
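
Note on e2321bb448 ([SLP] load-combine patterns): the message describes scalar sequences of byte loads, shifts and ORs that the backend can later fold into a single `mov` or `movbe`. The sketch below is only an illustration of why that fold is valid, not LLVM code; the function names and the sample buffer are hypothetical, and Python integers stand in for 64-bit registers.

```python
# Illustration only: models the byte-load + shift + OR pattern from the
# commit message. All names here are hypothetical, not LLVM APIs.

def load_u64_le_bytewise(buf, offset=0):
    """Assemble 8 bytes with shifts and ORs, lowest address = lowest byte."""
    value = 0
    for i in range(8):
        value |= buf[offset + i] << (8 * i)
    return value


def load_u64_be_bytewise(buf, offset=0):
    """Same pattern with reversed shift amounts (byte-swapped load)."""
    value = 0
    for i in range(8):
        value |= buf[offset + i] << (8 * (7 - i))
    return value


def load_u64(buf, offset=0, byteorder="little"):
    """The 'combined' form: one 8-byte load in the given byte order."""
    return int.from_bytes(buf[offset:offset + 8], byteorder)


data = bytes([0x11, 0x22, 0x33, 0x44, 0x55, 0x66, 0x77, 0x88, 0x99])

# The multi-instruction scalar sequence equals a single wide load.
assert load_u64_le_bytewise(data) == load_u64(data, 0, "little")
# With reversed shifts it equals a byte-swapped load (the movbe case).
assert load_u64_be_bytewise(data) == load_u64(data, 0, "big")
```

On a little-endian target the first form corresponds to the plain `mov` shown in the message and the second to `movbe`; the commit's cost-model change is about SLP leaving such sequences alone so the backend can still match them.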

328875 Commits