llvm-project

Commit Graph

Author	SHA1	Message	Date
David Green	61225c0818	[ValueTracking][InstCombine] Introduce and use ComputeMinSignedBits This introduces a new ComputeMinSignedBits method for ValueTracking that returns the BitWidth - SignBits + 1 from ComputeSignBits, and represents the minimum bit size for the value as a signed integer. Similar to the existing APInt::getMinSignedBits method, this can make some of the reasoning around ComputeSignBits more natural. See https://reviews.llvm.org/D112298	2021-11-05 14:41:37 +00:00
Simon Pilgrim	9e6506299a	[DAG] FoldConstantVectorArithmetic - remove SDNodeFlags argument Another minor step towards merging FoldConstantVectorArithmetic into FoldConstantArithmetic. We don't use SDNodeFlags in any constant folding inside DAG, so passing the Flags argument is a waste of time - an alternative would be to wire up FoldConstantArithmetic to take SDNodeFlags just-in-case we someday start using it, but we don't have any way to test it and I'd prefer to avoid dead code. Differential Revision: https://reviews.llvm.org/D113276	2021-11-05 14:36:17 +00:00
Roman Lebedev	ad617183bb	[X86] `X86TTIImpl::getInterleavedMemoryOpCostAVX512()`: mask is i8 not i1 Even though AVX512's masked mem ops (unlike AVX1/2) have a mask that is a `VF x i1`, replication of said masks happens after promotion of it to `VF x i8`, so we should use `i8`, not `i1`, when calculating the cost of mask replication.	2021-11-05 17:27:02 +03:00
Sanjay Patel	4fc1fc4005	[DAGCombiner] add fold for vselect based on mask of signbit (X s< 0) ? Y : 0 --> (X s>> BW-1) & Y We canonicalize to the icmp+select form in IR, and we already have this fold for scalar select in SDAG, so I think it's an oversight that we don't have the fold for vectors. It seems neutral for AArch64 and saves some instructions on x86. Whether we should also have the sibling folds for the inverse condition or all-ones true value may depend on target-specific factors such as whether there's an "and-not" instruction. Differential Revision: https://reviews.llvm.org/D113212	2021-11-05 10:06:16 -04:00
Sanjay Patel	1e7afa2a0d	[AArch64] add tests for vector select; NFC	2021-11-05 10:06:16 -04:00
Sanjay Patel	8918814032	[x86] add tests for vector select; NFC	2021-11-05 10:06:15 -04:00
Sanjay Patel	05f64b5ac9	[InstCombine] add signbit tests for icmp with trunc; NFC	2021-11-05 10:06:15 -04:00
LLVM GN Syncbot	6cd309bd02	[gn build] Port `7a98761d74`	2021-11-05 13:54:25 +00:00
Roman Lebedev	01d8759ac9	[IR][ShuffleVector] Introduce `isReplicationMask()` matcher Avid readers of this saga may recall from previous installments, that replication mask replicates (lol) each of the `VF` elements in a vector `ReplicationFactor` times. For example, the mask for `ReplicationFactor=3` and `VF=4` is: `<0,0,0,1,1,1,2,2,2,3,3,3>`. More importantly, replication mask is used by LoopVectorizer when using masked interleaved memory operations. As discussed in previous installments, while it is used by LV, and we seem to support masked interleaved memory operations on X86, its support in cost model leaves a lot to be desired: until basically yesterday even for AVX512 we had no cost model for it. As it has been witnessed in the recent AVX2 `X86TTIImpl::getInterleavedMemoryOpCost()` costmodel patches, while it is hard-enough to query the cost of a particular assembly sequence [from llvm-mca], afterwards the check lines of the LV costmodel tests must be updated manually. This is, at the very least, boring. Okay, now we have decent costmodel coverage for interleaving shuffles, but now basically the same mind-killing sequence has to be performed for replication mask. I think we can improve at least the second half of the problem, by teaching the `TargetTransformInfoImplCRTPBase::getUserCost()` to recognize `Instruction::ShuffleVector` that are replication masks, adding exhaustive test coverage using `-cost-model -analyze` + `utils/update_analyze_test_checks.py` This way we can have good exhaustive coverage for cost model, and only basic coverage for the LV costmodel. This patch adds precise undef-aware `isReplicationMask()`, with exhaustive test coverage. * `InstructionsTest.ShuffleMaskIsReplicationMask` shows that it correctly detects all the known masks. * `InstructionsTest.ShuffleMaskIsReplicationMask_undef` shows that replacing some mask elements in a known replication mask with undef still allows us to recognize it as a replication mask. Note, with enough undef elts, we may detect a different tuple. * `InstructionsTest.ShuffleMaskIsReplicationMask_Exhaustive_Correctness` shows that if we detected the replication mask with given params, then if we actually generate a true replication mask with said params, it matches element-wise ignoring undef mask elements. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D113214	2021-11-05 16:53:47 +03:00
Roman Lebedev	7a98761d74	[NFC] Move CombinationGenerator from Exegesis to ADT Reviewed By: courbet Differential Revision: https://reviews.llvm.org/D113213	2021-11-05 16:53:46 +03:00
David Sherwood	657a1dcd0d	[AArch64] Add target DAG combine for UUNPKHI/LO When creating a UUNPKLO/HI node with an undef input, the output should also be undef. I've added a target DAG combine function to ensure we avoid creating an unnecessary uunpklo/hi instruction. Differential Revision: https://reviews.llvm.org/D113266	2021-11-05 13:50:59 +00:00
Quinn Pham	c71fbdd87b	[NFC] Inclusive language: Remove instances of master in URLs [NFC] This patch fixes URLs containing "master". Old URLs were either broken or redirecting to the new URL. Reviewed By: #libc, ldionne, mehdi_amini Differential Revision: https://reviews.llvm.org/D113186	2021-11-05 08:48:41 -05:00
Simon Pilgrim	f2703c3c33	[DAG] FoldConstantArithmetic - rename NumOps -> NumElts. NFC. NumOps represents the number of elements for vector constant folding; rename this to NumElts so that in future we can consistently use NumOps to represent the number of operands of the opcode. Minor cleanup before trying to begin generalizing FoldConstantArithmetic to support opcodes other than binops.	2021-11-05 13:32:34 +00:00
Nico Weber	a160aba95f	[gn build] (manually) port `df0ba47c36`	2021-11-05 09:17:59 -04:00
Jingu Kang	a7b1872593	[AArch64] Fix a bug from a pattern for uaddv(uaddlp(x)) ==> uaddlv A pattern selected the wrong uaddlv MI. It should be as below. uaddv(uaddlp(v8i8)) ==> uaddlv(v8i8) Differential Revision: https://reviews.llvm.org/D113263	2021-11-05 12:48:18 +00:00
Alfredo Dal'Ava Junior	1cb9f37a17	[FreeBSD] Do not mark __stack_chk_guard as dso_local This symbol is defined in libc.so so it is definitely not DSO-Local. Marking it as such causes problems on some platforms (such as PowerPC). Differential revision: https://reviews.llvm.org/D109090	2021-11-05 07:29:50 -05:00
Martin Liska	13a442ca49	Enable -Wformat-pedantic and fix fallout. Differential Revision: https://reviews.llvm.org/D113172	2021-11-05 13:12:35 +01:00
Simon Pilgrim	c1e7911c3b	[DAG] FoldConstantArithmetic - fold bitlogic(bitcast(x),bitcast(y)) -> bitcast(bitlogic(x,y)) To constant fold bitwise logic ops where we've legalized constant build vectors to a different type (e.g. v2i64 -> v4i32), this patch adds a basic ability to peek through the bitcasts and perform the constant fold on the inner operands. The MVE predicate v2i64 regressions will be addressed by future support for basic v2i64 type support. One of the yak shaving fixes for D113192.... Differential Revision: https://reviews.llvm.org/D113202	2021-11-05 12:00:59 +00:00
David Green	cd8cb5377a	[InstCombine] Add additional tests for converting to sadd.sat with sign bits. NFC	2021-11-05 12:00:03 +00:00
Valentin Clement	8c23990949	[fir] Add fir.select and fir.select_rank FIR to LLVM IR conversion patterns The `fir.select` and `fir.select_rank` are lowered to llvm.switch. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D113089 Co-authored-by: Jean Perier <jperier@nvidia.com> Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>	2021-11-05 12:54:51 +01:00
Fraser Cormack	3a11fb572c	[LangRef][VP] Document vp.gather and vp.scatter intrinsics This patch fleshes out the missing documentation for the final two VP intrinsics introduced in D99355: `llvm.vp.gather` and `llvm.vp.scatter`. It does so mostly by deferring to the `llvm.masked.gather` and `llvm.masked.scatter` intrinsics, respectively. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D112997	2021-11-05 11:36:03 +00:00
Alex Zinenko	6981e5ec91	[mlir][python] fix constructor generation for optional operands in presence of segment attribute The ODS-based Python op bindings generator has been generating incorrect specification of the operand segment in presence of both optional and variadic operand groups: optional groups were treated as variadic whereas they require separate treatment. Make sure it is the case. Also harden the tests around generated op constructors as they could hitherto accept the code for both optional and variadic arguments. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D113259	2021-11-05 12:40:27 +01:00
Simon Pilgrim	5e9ac7c0a5	[X86] Enable v32i16 rotate lowering on non-BWI targets Fixes one of the regressions in D113192	2021-11-05 11:00:31 +00:00
David Green	cb62c3761f	[ARM] Extra MVE constant select test. NFC	2021-11-05 10:57:38 +00:00
Fraser Cormack	93e1802af3	[LangRef][VP] Document vp.load and vp.store intrinsics This patch fleshes out the missing documentation for two of the VP intrinsics introduced in D99355: `llvm.vp.load` and `llvm.vp.store`. It does so mostly by deferring to the `llvm.masked.load` and `llvm.masked.store` intrinsics, respectively. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D112930	2021-11-05 10:39:34 +00:00
Clement Courbet	737f540abd	[Sema][NFC] Add tests for builtin spaceship operator. In preparation for D112453.	2021-11-05 11:44:19 +01:00
Riccardo Mori	44596fe6a9	[Polly][Isl] Use the function unsignedFromIslSize to manage a isl::size object. NFCI This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in lib/External/isl/include/isl/isl-noexceptions.h and the official isl C++ interface. In the official interface the type `isl::size` cannot be cast to an unsigned without previously having checked if it contains a valid value with the function `isl::size::is_error()`. For this reason two helper functions have been added: - `IslAssert`: assert that no errors are present in debug builds and just disables the mandatory error check in non-debug builds - `unsignedFromIslSize`: cast the `isl::size` object to `unsigned` Changes made: - Add the functions `IslAssert` and `unsignedFromIslSize` - Add the utility function `rangeIslSize()` - Retype `MaxDisjunctsInDomain` from `int` to `unsigned` - Retype `RunTimeChecksMaxAccessDisjuncts` from `int` to `unsigned` - Retype `MaxDimensionsInAccessRange` from `int` to `unsigned` - Replaced some usages of `isl_size` with `unsigned` since we aim not to use `isl_size` anymore - `isl-noexceptions.h` has been generated by `e704f73c88` No functional change intended. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D113101	2021-11-05 11:15:22 +01:00
Chen Zheng	fed2889f07	[PowerPC] use correct selection for v16i8/v8i16 splat load Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D113236	2021-11-05 10:04:03 +00:00
Jay Foad	0321bd64e6	Revert "[TwoAddressInstructionPass] Update existing physreg live intervals" This reverts commit `ec0e1e88d2`. It was pushed by mistake.	2021-11-05 09:54:26 +00:00
Jay Foad	c93bf53a3e	[AMDGPU] NFC formatting fixes in SIMemoryLegalizer	2021-11-05 09:10:24 +00:00
Jay Foad	ec0e1e88d2	[TwoAddressInstructionPass] Update existing physreg live intervals In TwoAddressInstructionPass::processTiedPairs with -early-live-intervals, update any preexisting physreg live intervals, as well as virtreg live intervals. By default (without -precompute-phys-liveness) physreg live intervals only exist for registers that are live-in to some basic block. Differential Revision: https://reviews.llvm.org/D113191	2021-11-05 09:10:24 +00:00
Matthias Springer	020ca1747d	[mlir][linalg][bufferize] Move bufferizesToAliasOnly to extraClassDecls By doing so, the method can no longer be reimplemented. Differential Revision: https://reviews.llvm.org/D113248	2021-11-05 18:08:43 +09:00
Christian Sigg	fce529fc6e	Fix `insertFunctionArguments()` block argument order. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D113171	2021-11-05 10:08:20 +01:00
Tres Popp	7d323dc773	Add Bazel support for LLVM_WINDOWS_PREFER_FORWARD_SLASH This was added in `df0ba47c36`	2021-11-05 10:04:52 +01:00
Qiu Chaofan	5fd406e254	[PowerPC] Add intrinsic to convert between ppc_fp128 and fp128 ppc_fp128 and fp128 are both 128-bit floating point types. However, we can't do conversion between them now, since trunc/ext are not allowed for same-size fp types. This patch adds two new intrinsics: llvm.ppc.convert.f128.to.ppcf128 and llvm.ppc.convert.ppcf128.to.f128, to support such conversion. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D109421	2021-11-05 16:58:38 +08:00
Martin Storsjö	df0ba47c36	[Support] Allow configuring the preferred type of slashes on Windows Default to preferring forward slashes when built for MinGW, as many use cases, e.g. when Clang is used as a drop-in replacement for GCC, require the compiler to output paths with forward slashes. Not all tests pass yet when configured to prefer forward slashes, though. Differential Revision: https://reviews.llvm.org/D112787	2021-11-05 10:42:02 +02:00
Martin Storsjö	f4d83c56c9	[Support] [Windows] Convert paths to the preferred form This normalizes most paths (except ones input from the user as command line arguments) into the preferred form, if `real_style()` evaluates to `windows_forward`. Differential Revision: https://reviews.llvm.org/D111880	2021-11-05 10:41:51 +02:00
Martin Storsjö	a8b54834a1	[Support] Add a new path style for Windows with forward slashes This behaves just like the regular Windows style, with both separator forms accepted, but with get_separator() returning forward slashes. Add a more descriptive name for the existing style, keeping the old name around as an alias initially. Add a new function `make_preferred()` (like the C++17 `std::filesystem::path` function with the same name), which converts windows paths to the preferred separator form (while this one works on any platform and takes a `path::Style` argument). Contrary to `native()` (just like `make_preferred()` in `std::filesystem`), this doesn't do anything at all on Posix, it doesn't try to reinterpret backslashes into forward slashes there. Differential Revision: https://reviews.llvm.org/D111879	2021-11-05 10:41:51 +02:00
Martin Storsjö	f95bd18b5f	Revert "[Attr] support btf_type_tag attribute" This reverts commits `737e4216c5` and `ce7ac9e66a`. After those commits, the compiler can crash with a reduced testcase like this: $ cat reduced.c void a(*); void a() {} $ clang -c reduced.c -O2 -g	2021-11-05 10:36:40 +02:00
Martin Storsjö	7af584ed87	[libunwind] Try to add --unwindlib=none while configuring and building libunwind If Clang is set up to link directly against libunwind (via the --unwindlib option, or the corresponding builtin default option), configuring libunwind will fail while bootstrapping (before the initial libunwind is built), because every cmake test will fail due to -lunwind not being found, and linking the shared library will fail similarly. Check if --unwindlib=none is supported, and add it in that case. Using check_c_compiler_flag on its own doesn't work, because that only adds the tested flag to the compilation command, and if -lunwind is missing, the linking step would still fail - instead try adding it to CMAKE_REQUIRED_FLAGS and restore the variable if it doesn't work. This avoids having to pass --unwindlib=none while building libunwind. Differential Revision: https://reviews.llvm.org/D112126	2021-11-05 10:10:19 +02:00
Markus Lavin	7e34d5ead1	[NPM] Fix bug in llvm/utils/reduce_pipeline.py Last minute changes in https://reviews.llvm.org/D110908 unfortunately introduced a bug wrt automatic pipeline expansion. This patch fixes that as well as gets rid of a few redundant variables. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D113177	2021-11-05 08:41:39 +01:00
Vitaly Buka	bbc213afd1	[asan] compiler-rt version of D113143	2021-11-04 23:51:16 -07:00
David Blaikie	def232915f	Fix some issues with the gdb pretty printers for llvm::Twine Still some pending bugs, but at least ironed some things out.	2021-11-04 22:50:50 -07:00
Michael Kruse	1606022fab	[Preprocessor] Fix newline before/after _Pragma. The PragmaAssumeNonNullHandler (and maybe others) passes an invalid SourceLocation to its callback, hence PrintPreprocessedOutput does not know how many lines to insert between the previous token and the pragma and does nothing. With this patch we instead assume that the unknown token is on the same line as the previous such that we can call the procedure that also emits semantically significant whitespace. Fixes bug reported here: https://reviews.llvm.org/D104601#3105044	2021-11-05 00:43:40 -05:00
Michael Kruse	8f099d17a1	[Preprocessor] Fix warning: left and right subexpressions are identical. NFCI. This is reported by msvc as warning C6287: redundant code: the left and right subexpressions are identical EmittedDirectiveOnThisLine implies EmittedTokensOnThisLine making this an NFC change. To be on the safe side and because both of them are checked at other places as well, we continue to check both. Compiler warning reported here: https://reviews.llvm.org/D104601#2957333	2021-11-05 00:43:40 -05:00
Chen Zheng	9695027066	[PowerPC] address post-commit comments for D106555; NFC Address nemanjai's post-commit comments.	2021-11-05 05:30:53 +00:00
Keith Smiley	a7a2959901	[lld-macho] Replace LC_LINKER_OPTION parsing This removes the tablegen based parsing of LC_LINKER_OPTION since it can only actually contain a very small number of potential arguments. In our project with tablegen this took 5 seconds before. This replaces https://reviews.llvm.org/D113075 Differential Revision: https://reviews.llvm.org/D113235	2021-11-04 22:03:40 -07:00
Matthias Springer	95e62eb430	[mlir][linalg][bufferize] Separate pass from ComprehensiveBufferize This commit separates the bufferization from the bufferization pass in Linalg. This allows other dialects to use ComprehensiveBufferize more easily. This commit mainly moves files to a new directory and adds a new build target. Differential Revision: https://reviews.llvm.org/D112989	2021-11-05 13:56:39 +09:00
Fangrui Song	005456e5fc	[lld-macho] Fix an assertion failure when -u specifies an undefined section$start symbol This matches ld64. Also improve the test for `-dead_strip`. Reviewed By: #lld-macho, Jez Ng Differential Revision: https://reviews.llvm.org/D113147	2021-11-04 21:28:33 -07:00
Shengchen Kan	6d03227c16	[X86][MS-InlineAsm][test] Add triple in ms-inline-asm-array.ll Fix the LIT test failure on Mac, which was reported in D113096.	2021-11-05 11:05:37 +08:00
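
The replication mask described in commit `01d8759ac9` can be illustrated with a small sketch. This is not LLVM's `isReplicationMask()` implementation; the function names below are hypothetical, and modelling undef mask elements as `-1` is an assumption made for the example. It only shows how a true mask is built for a given `ReplicationFactor` and `VF`, and how a candidate mask can be compared against it element-wise while ignoring undef elements.

```python
from typing import List

UNDEF = -1  # assumption: undef mask elements are modelled as -1


def make_replication_mask(replication_factor: int, vf: int) -> List[int]:
    """Build the mask that repeats each of the vf elements replication_factor times."""
    return [elt for elt in range(vf) for _ in range(replication_factor)]


def matches_replication_mask(mask: List[int], replication_factor: int, vf: int) -> bool:
    """Compare mask element-wise against the true mask, ignoring undef elements."""
    if len(mask) != replication_factor * vf:
        return False
    expected = make_replication_mask(replication_factor, vf)
    return all(m == UNDEF or m == e for m, e in zip(mask, expected))


if __name__ == "__main__":
    # The example from the commit message: ReplicationFactor=3, VF=4.
    print(make_replication_mask(3, 4))
    # A mask with some elements replaced by undef still matches.
    print(matches_replication_mask([0, UNDEF, 0, 1, 1, UNDEF, 2, 2, 2, UNDEF, 3, 3], 3, 4))
```

A mask with many undef elements, such as `[0, UNDEF, UNDEF, UNDEF]`, is consistent with more than one `(ReplicationFactor, VF)` pair under this check, which is the situation the commit message refers to when it says a different tuple may be detected.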

404020 Commits