llvm-project

Commit Graph

Author	SHA1	Message	Date
LLVM GN Syncbot	1e9746d229	[gn build] Port `7b6f760fcd`	2021-03-28 18:35:33 +00:00
David Green	7b6f760fcd	[ARM] MVE vector lane interleaving MVE does not have a single sext/zext or trunc instruction that takes the bottom half of a vector and extends to a full width, like NEON has with MOVL. Instead it is expected that this happens through top/bottom instructions. So the MVE equivalent VMOVLT/B instructions take either the even or odd elements of the input and extend them to the larger type, producing a vector with half the number of elements each of double the bitwidth. As there is no simple instruction for a normal extend, we often have to expand sext/zext/trunc into a series of lane moves (or stack loads/stores, which we do not do yet). This pass takes vector code that starts at truncs, looks for interconnected blobs of operations that end with sext/zext and transforms them by adding shuffles so that the lanes are interleaved and the MVE VMOVL/VMOVN instructions can be used. This is done pre-ISel so that it can work across basic blocks. This initial version of the pass just handles a limited set of instructions, not handling constants or splats or FP, which can all come as extensions to this base. Differential Revision: https://reviews.llvm.org/D95804	2021-03-28 19:34:58 +01:00
Fangrui Song	53c98d85a8	[Driver] Suppress libstdc++/libc++ path with -nostdinc This follows GCC. Having libstdc++/libc++ include paths is not useful anyway because libstdc++/libc++ header files cannot find features.h. While here, suppress -stdlib++-isystem with -nostdlibinc.	2021-03-28 11:30:27 -07:00
Craig Topper	3fb40ce167	[X86] Don't define vpclmulqdq or vaes intrinsics in the headers unless avx512fintrin.h has been included. The intrinsics won't compile unless avx512fintrin.h has declared the 512 bit types.	2021-03-28 11:26:30 -07:00
Craig Topper	7b35932b51	[RISCV] Add test case for mulhsu. We don't yet use mulhsu, but we should.	2021-03-28 11:03:39 -07:00
Matt Arsenault	fc9df30991	Reapply "OpaquePtr: Turn inalloca into a type attribute" This reverts commit `20d5c42e0e`.	2021-03-28 13:35:21 -04:00
Sanjay Patel	01ae6e5ead	[InstCombine] sink min/max intrinsics with common op after select This is another step towards parity with cmp+select min/max idioms. See D98152.	2021-03-28 13:13:04 -04:00
Sanjay Patel	4f349739ef	[InstCombine] add tests for select of min/max intrinsics; NFC	2021-03-28 13:13:04 -04:00
Nico Weber	20d5c42e0e	Revert "OpaquePtr: Turn inalloca into a type attribute" This reverts commit `4fefed6563`. Broke check-clang everywhere.	2021-03-28 13:02:52 -04:00
Zakk Chen	821547cabb	[RISCV][Clang] Update new overloading rules for RVV intrinsics. RVV intrinsics have a new overloading rule; please see `82aac7dad4`. Changed: 1. Rename `generic` to `overloaded` because the new rule does not use C11 generic. 2. Change HasGeneric to HasNoMaskedOverloaded because all masked operations support the overloading API. 3. Add more overloaded tests because the overloading rule changed. Differential Revision: https://reviews.llvm.org/D99189	2021-03-28 09:04:35 -07:00
Stefan Gränitz	7b9df09e20	[Orc][examples] Add missing dependency to OrcShared in LLJITWithRemoteDebugging	2021-03-28 17:48:28 +02:00
Stefan Gränitz	258f055ed9	[Orc][examples] Add LLJITWithRemoteDebugging example	2021-03-28 17:25:09 +02:00
Matt Arsenault	2f779e79d5	AArch64/GlobalISel: Remove IR section from test	2021-03-28 11:12:59 -04:00
Matt Arsenault	4fefed6563	OpaquePtr: Turn inalloca into a type attribute I think byval/sret and the others are close to being able to rip out the code to support the missing type case. A lot of this code is shared with inalloca, so catch this up to the others so that can happen.	2021-03-28 11:12:23 -04:00
Björn Schäpers	c5243c63cd	[clang-format] Fix aligning with linebreaks Breaking a string literal or a function calls arguments with AlignConsecutiveDeclarations or AlignConsecutiveAssignments did misalign the continued line. E.g.: void foo() { int myVar = 5; double x = 3.14; auto str = "Hello" "World"; } or void foo() { int myVar = 5; double x = 3.14; auto str = "Hello" "World"; } Differential Revision: https://reviews.llvm.org/D98214	2021-03-28 16:26:27 +02:00
Florian Hahn	8c6c357897	[LV] Mark a few more cost-model members as const (NFC).	2021-03-28 14:59:48 +01:00
Aaron Ballman	581b429f7d	Update the documentation for recent changes to statement attributes. Adds more information about automated diagnostic reporting for statement attributes and adds a bit more documentation about statement attributes in general.	2021-03-28 09:54:36 -04:00
Nikita Popov	3df3f3df45	[BasicAA] Handle gep with unknown sizes earlier (NFCI) If the sizes of both memory locations are unknown, we can only perform a check on the underlying objects. There's no point in going through GEP decomposition in this case.	2021-03-28 15:48:49 +02:00
Florian Hahn	eb3d9f2eb6	[SelDag] Add isIntOrFPConstant helper function. This patch adds a new isIntOrFPConstant helper function to check whether an SDValue is an integer or FP constant. This pattern is used in various places. There are also places that incorrectly check only for integer constants, e.g. D99384, so hopefully this helper will help people avoid that issue. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D99428	2021-03-28 12:48:58 +01:00
Stephen Kelly	ea2225a10b	[clang-tidy] Simplify readability checks to not need ignoring* matchers Differential Revision: https://reviews.llvm.org/D98296	2021-03-28 11:25:41 +01:00
Fangrui Song	8e2f5f95b5	[Driver] Simplify mips multilib path and fix comments. NFC	2021-03-28 00:30:38 -07:00
Jonas Devlieghere	7f76c70d85	[lldb] Fix capitalization in CMake status message s/LLDB Tests/LLDB tests/	2021-03-27 21:39:39 -07:00
Hsiangkai Wang	bc82e9bf25	[RISCV] Add vfabs.v pseudo instruction. Differential Revision: https://reviews.llvm.org/D99454	2021-03-28 10:24:05 +08:00
Vaivaswatha Nagaraj	11f59c5457	[OCaml][Test] Fix and enable debuginfo.ml test `get_or_create_type_array` was used on a non-type MDNode. Add interface for `get_or_create_array` and use that instead. Differential Revision: https://reviews.llvm.org/D99450	2021-03-28 06:25:39 +05:30
Fangrui Song	dcaa0293c1	[test] Add UNSUPPORTED: system-windows to linux-ld.c We should have a test verifying / \ for Windows, but having such a long test specifically for Linux cross compilation suffer from Windows \ is too troublesome.	2021-03-27 16:46:30 -07:00
Craig Topper	dced4649af	[X86] Regenerate a bunch of tests to pick up @PLT I'm prepping another patch to the same tests and this just adds noise to my diff.	2021-03-27 16:41:35 -07:00
Fangrui Song	87a9f42fc1	[Driver] Remove an incorrect library path for multilib This is incorrect (adding a path with unrelated libraries) but benign in practice because previous paths take precedence.	2021-03-27 16:36:21 -07:00
Fangrui Song	19e45696f5	[Driver] Remove an unneeded multiarch library path which ends with ../../.. Neither vanilla nor Debian GCC has the patch, which usually duplicates $sysroot/usr/lib.	2021-03-27 15:46:06 -07:00
Craig Topper	5692fc38e0	[RISCV] Add a pattern for (sext_inreg (mul (and X, 0xffffffff), (and Y, 0xffffffff)), i32) to suppress MULW formation We have a special pattern for (mul (and X, 0xffffffff), (and Y, 0xffffffff)), to optimize the ANDs to shifts. But if a sext_inreg comes first, we'll form a MULW and limit the effectiveness of the special match. So this patch adds a larger pattern to suppress the MULW formation by emitting a sext.w and then the same output we use for the (mul (and X, 0xffffffff), (and Y, 0xffffffff)). This should all get CSEd. This is the issue I was trying to fix with D99029, but that affected many more tests.	2021-03-27 15:37:18 -07:00
Nikita Popov	9075864b73	[BasicAA] Refactor linear expression decomposition The current linear expression decomposition handles zext/sext by decomposing the casted operand, and then checking NUW/NSW flags to determine whether the extension can be distributed. This has some disadvantages: First, it is not possible to perform a partial decomposition. If we have zext((x + C1) +<nuw> C2) then we will fail to decompose the expression entirely, even though it would be safe and profitable to decompose it to zext(x + C1) +<nuw> zext(C2) Second, we may end up performing unnecessary decompositions, which will later be discarded because they lack nowrap flags necessary for extensions. Third, correctness of the code is not entirely obvious: At a high level, we encounter zext(x -<nuw> C) in the form of a zext on the linear expression x + (-C) with nuw flag set. Notably, this case must be treated as zext(x) + -zext(C) rather than zext(x) + zext(-C). The code handles this correctly by speculatively zexting constants to the final bitwidth, and performing additional fixup if the actual extension turns out to be an sext. This was not immediately obvious to me. This patch inverts the approach: An ExtendedValue represents a zext(sext(V)), and linear expression decomposition will try to decompose V further, either by absorbing another sext/zext into the ExtendedValue, or by distributing zext(sext(x op C)) over a binary operator with appropriate nsw/nuw flags. At each step we can determine whether distribution is legal and abort with a partial decomposition if not. We also know which extensions we need to apply to constants, and don't need to speculate or fixup.	2021-03-27 23:31:58 +01:00
Christopher Di Bella	24dd2d2f9e	[libcxx] rearranges all concept tests moves tests into directories matching their stable names so that the tests can reflect the concept name Differential Revision: https://reviews.llvm.org/D99104	2021-03-27 22:13:58 +00:00
Aaron Puchert	c61ae6e6d5	Deduplicate branches and adjust comment [NFC] Currently we want to allow calling non-const methods even when only a shared lock is held, because -Wthread-safety-reference is already quite sensitive and not all code is const-correct. Even if it is, this might require users to add std::as_const around the implicit object argument. See D52395 for a discussion. Fixes PR46963.	2021-03-27 23:08:43 +01:00
Florian Hahn	d2855eba81	[LV] Fix formatting from `2f9d68c3f1`.	2021-03-27 21:29:56 +00:00
Florian Hahn	2f9d68c3f1	[LV] Mark some methods as const (NFC). Mark a few methods as const, as they do not modify any state.	2021-03-27 21:27:53 +00:00
Alex Reinking	3001d080c8	[CMake] Use write_basic_package_version_file for LLVM Use the CMake 3.13 features of CMakeConfigPackageHelpers to generate LLVMConfigVersion.cmake with proper architecture detection, major+minor version matching, etc. Differential Revision: https://reviews.llvm.org/D99451	2021-03-27 21:02:20 +00:00
Fangrui Song	d3e7ee36f6	[sanitizer] Define MAP_NORESERVE to 0 and hide mremap for FreeBSD	2021-03-27 12:18:58 -07:00
KareemErgawy-TomTom	e5f2898bc7	[MLIR][STD] Fold trunci (zexti). This patch folds the following pattern: ``` %arg0 = ... %0 = zexti %arg0 : i1 to i8 %1 = trunci %0 : i8 to i1 ``` into just `%arg0`. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D99453	2021-03-27 19:40:10 +01:00
Nico Weber	ab158d35b5	[gn build] rewrap a comment to 80 cols	2021-03-27 12:50:33 -04:00
Jan Svoboda	bb88a5aeee	[clang][cli] Round-trip cc1 arguments in assert builds This patch enables cc1 argument round-trip for assert builds. It can be disabled by building clang with `-DCLANG_ROUND_TRIP_CC1_ARGS=OFF`. This will be committed only if we reach consensus in https://lists.llvm.org/pipermail/cfe-dev/2021-February/067714.html. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D97462	2021-03-27 17:24:03 +01:00
Simon Pilgrim	2a0d5da917	[X86][SSE] foldShuffleOfHorizOp - remove broadcast handling. Remove VBROADCAST/MOVDDUP/splat-shuffle handling from foldShuffleOfHorizOp. This can all be handled by canonicalizeShuffleMaskWithHorizOp as long as we check that the HADD/SUB are only used once (to prevent infinite loops on slow-horizop targets which will try to reuse the nodes again followed by a post-hop shuffle).	2021-03-27 15:09:23 +00:00
Joel E. Denny	43279d1df9	[FileCheck] Try to fix buildbot failures caused by `c7c542e8f3` For example, <https://lab.llvm.org/buildbot/#/builders/132/builds/3929> has this diagnostic: ``` /opt/gcc/9.3.0/snos/include/g++/bits/stl_tree.h:780:8: error: static assertion failed: comparison object must be invocable as const 780 \| is_invocable_v<const _Compare&, const _Key&, const _Key&>, \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ```	2021-03-27 11:03:10 -04:00
Joel E. Denny	c7c542e8f3	[FileCheck] Fix -dump-input per-pattern diagnostic indexing In input dump annotations, `check:2'1` indicates diagnostic 1 for the `CHECK` directive on check file line 2. Without this patch, `-dump-input` computes the diagnostic index with the assumption that FileCheck consecutively produces all diagnostics for the same pattern. Already, that can be a false assumption, as in the examples below. Moreover, it seems like a brittle assumption as FileCheck evolves. Finally, it actually complicates the implementation even if it makes it slightly more efficient. This patch avoids that assumption. Examples below show results after applying this patch. Before applying this patch, `'N` is omitted throughout these examples because the implementation doesn't notice there's more than one diagnostic per pattern. First, `CHECK-LABEL` violates the assumption because `CHECK-LABEL` tries to match twice, and other directives can match in between: ``` $ cat check CHECK: foobar CHECK-LABEL: foobar $ FileCheck -vv check < input \|& tail -8 <<<<<< 1: text 2: foobar label:2'0 ^~~~~~ check:1 ^~~~~~ label:2'1 X error: no match found 3: text >>>>>> ``` Second, `--implicit-check-not` is obviously processed many times among other directives: ``` $ cat check CHECK: foo CHECK: foo $ FileCheck -vv -dump-input=always -implicit-check-not=foo \ check < input \|& tail -16 <<<<<< 1: text not:imp1'0 X~~~~ 2: foo check:1 ^~~ not:imp1'1 X 3: text not:imp1'1 ~~~~~ 4: foo check:2 ^~~ not:imp1'2 X 5: text not:imp1'2 ~~~~~ 6: eof:2 ^ >>>>>> ``` Reviewed By: thopre, jhenderson Differential Revision: https://reviews.llvm.org/D97813	2021-03-27 10:36:21 -04:00
Nikita Popov	b981bc30bf	[BasicAA] Correctly handle implicit sext in decomposition While explicit sext instructions were handled correctly, the implicit sext that occurs if the offset is smaller than the pointer size blindly assumed that sext(X * Scale + Offset) is the same as sext(X) * Scale + Offset, which is obviously not correct. Fix this by extracting the code that handles linear expression extension and reusing it for the implicit sext as well.	2021-03-27 15:15:47 +01:00
Nikita Popov	60f3e8fbe4	[BasicAA] Clarify entry values of GetLinearExpression() (NFC) A number of variables need to be correctly initialized on entry to GetLinearExpression() for the implementation to behave reasonably. The fact that SExtBits can currently be non-zero on entry is a bug, as demonstrated by the added test: For implicit sexts by the GEP, we do currently skip legality checks.	2021-03-27 14:50:09 +01:00
Nikita Popov	ad9dad93ff	[BasicAA] Bail out earlier for invalid shift amount Currently, we'd produce an incorrect decomposition, because we already recursively called GetLinearExpression(), so the Scale=1, Offset=0 will not necessarily be relative to the shl itself. Now, this doesn't actually matter for functional correctness, because such a shift is poison anyway, so it's okay to return an incorrect decomposition. It's still unnecessarily confusing though, and we can easily avoid this by checking the bitwidth earlier.	2021-03-27 12:41:16 +01:00
Nikita Popov	5a5a8088cc	[BasicAA] Retain shl nowrap flags in GetLinearExpression() Nowrap flags between mul and shl differ in that mul nsw allows multiplication of 1 * INT_MIN, while shl nsw does not. This means that it is always fine to transfer shl nowrap flags to muls, but not necessarily the other way around. In this case the NUW/NSW results refer to mul/add operations, so it's fine to retain the flags from the shl.	2021-03-27 12:26:22 +01:00
Simon Pilgrim	41146bfe82	[X86][SSE] combineX86ShuffleChain - attempt to recognise 'hidden' identity shuffles See if the combined shuffle mask is equivalent to an identity shuffle, typically this is due to repeated LHS/RHS ops in horiz-ops, but isTargetShuffleEquivalent might see other patterns as well. This is another small step towards getting rid of foldShuffleOfHorizOp and relying on canonicalizeShuffleMaskWithHorizOp and generic shuffle combining.	2021-03-27 11:09:30 +00:00
Juneyoung Lee	05884d3b52	Make FoldBranchToCommonDest poison-safe by default This is a small patch to make FoldBranchToCommonDest poison-safe by default. After `fc3f0c9c`, only two syntactic changes are needed to fix unit tests. This does not cause any assembly difference in testsuite as well (-O3, X86-64 Manjaro). Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D99452	2021-03-27 19:05:12 +09:00
Sanjay Patel	a283d72583	[x86] prevent crashing while matching pmaddwd This could crash in 2 ways: either one or both of the input vectors could be a different size than the math ops. https://llvm.org/PR49716	2021-03-27 05:27:14 -04:00
Alex Zinenko	d68ba1fe50	[mlir] Register Linalg passes in C API and Python Bindings Provide a registration mechanism for Linalg dialect-specific passes in C API and Python bindings. These are being built into the dialect library but exposed in separate headers (C) or modules (Python). Differential Revision: https://reviews.llvm.org/D99431	2021-03-27 09:57:56 +01:00
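
The claim in `b981bc30bf` that sext(X * Scale + Offset) is not the same as sext(X) * Scale + Offset can be checked with a small fixed-width model. The sketch below is an illustration only, not LLVM code: the helper names `wrap_signed` and `sext` are hypothetical, and the 8-bit and 16-bit widths are chosen arbitrarily to keep the numbers small.

```python
def wrap_signed(value, bits):
    """Reduce value modulo 2**bits and reinterpret it as a signed integer."""
    value &= (1 << bits) - 1
    return value - (1 << bits) if value >> (bits - 1) else value


def sext(value, from_bits, to_bits):
    """Sign-extend a from_bits-wide signed value to to_bits."""
    return wrap_signed(wrap_signed(value, from_bits), to_bits)


# Hypothetical operands: an i8 index X with Scale and Offset.
x, scale, offset = 100, 2, 0

# Compute in 8 bits first, then extend: 200 wraps to -56 before the sext.
narrow_then_extend = sext(wrap_signed(x * scale + offset, 8), 8, 16)

# Extend first, then compute in 16 bits: no wrap occurs.
extend_then_compute = wrap_signed(sext(x, 8, 16) * scale + offset, 16)

print(narrow_then_extend, extend_then_compute)
```

The two orders disagree as soon as the narrow multiplication overflows, which is why distributing the extension needs a legality check on the nowrap flags.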

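The asymmetry described in `5a5a8088cc`, where mul nsw allows 1 * INT_MIN but shl nsw does not, can be seen in a simplified model. This sketch assumes that an nsw operation is well defined exactly when the mathematical result fits in the signed range, and that shl by n behaves like multiplication by 2**n; the function names are hypothetical and the 8-bit width is chosen for illustration.

```python
def fits_signed(value, bits):
    """True if value is representable as a signed integer of this width."""
    return -(1 << (bits - 1)) <= value < (1 << (bits - 1))


def mul_nsw_ok(a, b, bits):
    """Model of mul nsw: no signed wrap if the exact product fits."""
    return fits_signed(a * b, bits)


def shl_nsw_ok(a, shift, bits):
    """Model of shl nsw: treat the shift as multiplying by the positive 2**shift."""
    return fits_signed(a * (1 << shift), bits)


BITS = 8
INT_MIN = -(1 << (BITS - 1))  # -128 for i8

# mul nsw 1, INT_MIN gives -128, which fits.
mul_ok = mul_nsw_ok(1, INT_MIN, BITS)

# shl nsw 1, 7 would give +128, which does not fit in i8.
shl_ok = shl_nsw_ok(1, BITS - 1, BITS)

print(mul_ok, shl_ok)
```

Under this model every shift that passes the shl check also passes the mul check, but not the other way around, which matches the commit's statement that flags can be carried from shl to mul but not necessarily back.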


383991 Commits