llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	01188f996d	[X86] insertps-combine.ll - show address math in checks	2022-05-09 16:01:42 +01:00
Alexey Bataev	9c3a75eabf	[SLP]Fix a crash when preparing a mask for external scalars. Need to use actual index instead of the tree entry position, since the insert index may be different than 0. It mean, that we vectorized part of the buildvector starting from not initial insertelement instruction beause of some reason.	2022-05-09 07:59:34 -07:00
Nikita Popov	68e1ba8188	[SCEV] Fold umin_seq using known predicate Fold %x umin_seq %y to %x if %x ule %y. This also subsumes the special handling for constant operands, as if %y is constant this folds to umin via implied poison reasoning, and if %x is constant then either %x is not zero and it folds to umin, or it is known zero, in which case it is ule anything.	2022-05-09 16:35:08 +02:00
Nikita Popov	7dddf12f44	[SCEV] Add more tests for umin_seq with known predicate (NFC)	2022-05-09 16:18:09 +02:00
Micah Weston	882915df61	Enum conversion warning when one signed and other unsigned. Ensures an -Wenum-conversion warning happens when one of the enums is signed and the other is unsigned. Also adds a test file to verify these warnings. This warning would not happen since the -Wsign-conversion would make a diagnostic then return, never allowing the -Wenum-conversion checks. For example: C enum PE { P = -1 }; enum NE { N }; enum NE conv(enum PE E) { return E; } Before this would only create a diagnostic with -Wsign-conversion and never on -Wenum-conversion. Now it will create a diagnostic for both -Wsign-conversion and -Wenum-conversion. I could change it to just warn on -Wenum-conversion as that was what I initially did. Seeing PR35200 (or GitHub Issue 316268), I let both diagnostics check so that the sign conversion could generate a warning.	2022-05-09 10:16:19 -04:00
Krzysztof Parzyszek	d9e6b5df74	[clang] Recognize scope of thread local variables in CFGBuilder Differential Revision: https://reviews.llvm.org/D125177	2022-05-09 07:11:56 -07:00
Sam McCall	0195163dba	[Frontend] when attaching a preamble, don't generate the long predefines buffer. We know we're going to overwrite it anyway. It'd be a bit of work to coordinate not generating it at all, but setting this flag avoids generating ~10k of the 13k string. Differential Revision: https://reviews.llvm.org/D125180	2022-05-09 15:55:32 +02:00
Fred Tingaud	1ec1cdcfb4	[analyzer] Inline operator delete when MayInlineCXXAllocator is set. This patch restores the symmetry between how operator new and operator delete are handled by also inlining the content of operator delete when possible. Patch by Fred Tingaud. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D124845	2022-05-09 15:44:33 +02:00
Philip Reames	70ad96ca5e	[riscv, InsertVSETVLI] Rename InstrInfo to Require to more clearly indicate purpose [nfc]	2022-05-09 06:40:33 -07:00
Aaron Puchert	44ae49e1a7	Thread safety analysis: Handle compound assignment and ->* overloads Like regular assignment, compound assignment operators can be assumed to write to their left-hand side operand. So we strengthen the requirements there. (Previously only the default read access had been required.) Just like operator->, operator->* can also be assumed to dereference the left-hand side argument, so we require read access to the pointee. This will generate new warnings if the left-hand side has a pt_guarded_by attribute. This overload is rarely used, but it was trivial to add, so why not. (Supporting the builtin operator requires changes to the TIL.) Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D124966	2022-05-09 15:35:43 +02:00
Sam McCall	f1a9c4b717	[clangd] Skip (most) predefined macros when scanning for preamble patching. This is unneccesary work. With this change, we skip generating and lexing ~10k of predefines twice. A dumb benchmark of building a preamble for an empty file in a loop shows: - before: 1.90ms/run - after: 1.36ms/run So this should be worth 0.5ms for each AST build and code completion. There can be a functional difference, but it's very minor. If the preamble contains e.g. `#ifndef __llvm__ ... #endif` then before we would not take it. After this change we will take the branch (single-file mode takes all branches with unknown conditions) and so gather different directives. However I think this is negligible: - this is already true of non-builtin macros (from included headers). We've had no complaints. - this affects the baseline and modified in the same way, so only makes a difference transiently when code guarded by such an #ifdef is being edited Differential Revision: https://reviews.llvm.org/D125179	2022-05-09 15:33:31 +02:00
Erich Keane	a425cac31e	"Re-apply 4b6c2cd642 "Deferred Concept Instantiation Implementation"""" This includes a fix for the libc++ issue I ran across with friend declarations not properly being identified as overloads. This reverts commit `45c07db31c`.	2022-05-09 06:29:47 -07:00
Jean Perier	d38915ffeb	[flang] Fix windows bot after D125140 The ifdef is not required in the header, common::int128_t is always defined. The function declaration must be available in lowering regardless of the host int128_t support. Differential Revision: https://reviews.llvm.org/D125211	2022-05-09 15:24:14 +02:00
Philip Reames	7ed16e7c51	[riscv] Fix state tracking bug on vsetvli (phi of vsetvli) peephole This fixes the first of several cases where the state computed in phase 1 and 2 of the algorithm differs from the state computed during phase 3. Note that such differences can cause miscompiles by creating disagreements about contents of the VL and VTYPE registers at block boundaries. In this particular case, we recognize that for the first vsetvli in a block, that if the AVL is a phi of GPR results from previous vsetvlis and the VTYPE field matches, we can avoid emitting a vsetvli as the register contents don't change. Unfortunately, the abstract state does change and that update was lost. As noted in the test change, this can actually improve results by preserving information until later state transitions in the block. However, this minor codegen improvement is not the motivation for the patch. The motivation is to avoid cases a case where we break a key internal correctness invariant. Differential Revision: https://reviews.llvm.org/D125133	2022-05-09 06:21:45 -07:00
Nathan Sidwell	bc150a07f1	[demangler] No need to space adjacent template closings With the demangler parenthesizing 'a >> b' inside template parameters, because C++11 parsing of >> there, we don't really need to add spaces between adjacent template arg closing '>' chars. In 2022, that just looks odd. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D123134	2022-05-09 06:14:44 -07:00
David Green	2cfb243bcd	[DAG] Use isAnyConstantBuildVector. NFC As suggested from `02f8519502`, this uses the isAnyConstantBuildVector method in lieu of separate isBuildVectorOfConstantSDNodes calls. It should otherwise be an NFC.	2022-05-09 14:13:03 +01:00
Nikita Popov	18eaff1510	[ScalarEvolution] Fold %x umin_seq %y if %x cannot be zero Fold %x umin_seq %y to %x umin %y if %x cannot be zero. They only differ in semantics for %x==0. More generally %x _seq %y folds to %x %y if %x cannot be the saturation fold (though currently we only have umin_seq).	2022-05-09 15:11:05 +02:00
Simon Pilgrim	ec6024d081	[X86] Replace avx512f integer mul reduction builtins with generic builtin D117829 added the generic "__builtin_reduce_mul" which we can use to replace the x86 specific integer mul reduction builtins - internally these were mapping to the same intrinsic already so there are no test changes required. Differential Revision: https://reviews.llvm.org/D125222	2022-05-09 14:10:28 +01:00
Nikita Popov	33f02de5df	[ScalarEvolution] Add tests for umin_seq with non-zero operand (NFC)	2022-05-09 15:03:12 +02:00
Rosie Sumpter	1a2665902f	[AArch64][SVE] Improve codegen when extracting first lane of active lane mask When extracting the first lane of a predicate created using the llvm.get.active.lane.mask intrinsic, it should give the same codegen as when the predicate is created using the llvm.aarch64.sve.whilelo intrinsic, since get.active.lane.mask is lowered to whilelo. This patch ensures the codegen is the same by recognizing llvm.get.active.lane.mask as a flag-setting operation in this case. Differential Revision: https://reviews.llvm.org/D125215	2022-05-09 13:56:04 +01:00
Sam McCall	a316a9815a	[clangd] Rewrite TweakTesting helpers to avoid reparsing the same code. NFC Previously the EXPECT_AVAILABLE macros would rebuild the code at each marked point, by expanding the cases textually. There were often lots, and it's nice to have lots! This reduces total unittest time by ~10% on my machine. I did have to sacrifice a little apply() coverage in AddUsingTests (was calling expandCases directly, which was otherwise unused), but we have EXPECT_AVAILABLE tests covering that, I don't think there's real risk here. Differential Revision: https://reviews.llvm.org/D125109	2022-05-09 14:53:00 +02:00
Florian Hahn	41e142fdc7	Recommit "[SimpleLoopUnswitch] Collect either logical ANDs/ORs but not both." This reverts commit `7211d5ce07`. This version fixes a crash that caused buildbot failures with the first version.	2022-05-09 13:49:12 +01:00
Florian Hahn	4c569ceeaa	[SimpleLoopUnswitch] Add test case for crash with `db7a87ed4f`.	2022-05-09 13:48:56 +01:00
Sam McCall	bb53eb1ef4	[clangd] Skip extra round-trip in parsing args in debug builds. NFC This is a clever cross-cutting sanity test for clang's arg parsing I suppose. But clangd creates thousands of invocations, ~all with identical trivial arguments, and problems with these would be caught by clang's tests. This overhead accounts for 10% of total unittest time! Differential Revision: https://reviews.llvm.org/D125169	2022-05-09 14:45:35 +02:00
Sam McCall	bf9921adb9	[clangd] Disable predefined macros in tests. NFC These aren't needed. With them the generated predefines buffer is 13KB. For every TestTU, we must: - generate the buffer (3 times: parsing preamble, scanning preamble, main file) - parse the buffer (again 3 times) - serialize all the macros it defines in the PCH - compress the buffer itself to write it into the PCH - decompress it from the PCH Avoiding this reduces unit test time by ~25%. Differential Revision: https://reviews.llvm.org/D125172	2022-05-09 14:44:51 +02:00
David Sherwood	45f2e92d97	[NFC][LoopVectorize] Add SVE test for tail-folding combined with interleaving Differential Revision: https://reviews.llvm.org/D125001	2022-05-09 13:08:25 +01:00
Nathan Sidwell	e48cd7088b	[demangler] Buffer peeking needs buffer The output buffer has a 'back' member, which returns NUL when you try it with an empty buffer. But there are no use cases that need that additional functionality. This makes the 'back' member behave more like STL containers' back members. (It still returns a value, not a reference.) Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D123201	2022-05-09 04:17:22 -07:00
Simon Pilgrim	8a92c45e07	[Clang] Add integer mul reduction builtin Similar to the existing bitwise reduction builtins, this lowers to a llvm.vector.reduce.mul intrinsic call. For other reductions, we've tried to share builtins for float/integer vectors, but the fmul reduction intrinsic also take a starting value argument and can either do unordered or serialized, but not reduction-trees as specified for the builtins. However we address fmul support this shouldn't affect the integer case. Differential Revision: https://reviews.llvm.org/D117829	2022-05-09 12:12:53 +01:00
Nathan James	12cb540529	[clang-tidy][NFC] Replace many instances of std::string where a StringRef would suffice. There's many instances in clang tidy checks where owning strings are used when we already have a stable string from the options, so using a StringRef makes much more sense. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D124341	2022-05-09 12:01:46 +01:00
Sigurur sgeirsson	fc440f27cd	Filter non-external static members from SBType::GetFieldAtIndex. See [[ https://github.com/llvm/llvm-project/issues/55040 \| issue 55040 ]] where static members of classes declared in the anonymous namespace are incorrectly returned as member fields from lldb::SBType::GetFieldAtIndex(). It appears that attrs.member_byte_offset contains a sentinel value for members that don't have a DW_AT_data_member_location. Reviewed By: labath Differential Revision: https://reviews.llvm.org/D124409	2022-05-09 12:34:13 +02:00
Alban Bridonneau	fef81131d9	[SVE] Optimize new cases for lowerConvertToSVBool Converts to SVBool are already considered as a nop, if they are converting an operand from a ptrue or a cmp, because they zero the extra predicate lanes by construction. This patch adds 2 similar cases: - The wide cmp, which were not directly recognized by the test for other forms of cmp - Splats of 1, which will be generated as ptrue, and as such will also zero the extra predicate lines. Reviewed By: paulwalker-arm, peterwaller-arm Differential Revision: https://reviews.llvm.org/D124908	2022-05-09 10:17:57 +00:00
Benjamin Kramer	a48adc5658	[mlir][math] Promote (b)f16 to f32 when lowering to libm calls libm doesn't have overloads for the small types, so promote them to a bigger type and use the f32 function. Differential Revision: https://reviews.llvm.org/D125093	2022-05-09 11:59:55 +02:00
Pavel Labath	ae7fe65cf6	[lldb/DWARF] Fix linking direction in CopyUniqueClassMethodTypes IIUC, the purpose of CopyUniqueClassMethodTypes is to link together class definitions in two compile units so that we only have a single definition of a class. It does this by adding entries to the die_to_type and die_to_decl_ctx maps. However, the direction of the linking seems to be reversed. It is taking entries from the class that has not yet been parsed, and copying them to the class which has been parsed already -- i.e., it is a very complicated no-op. Changing the linking order allows us to revert the changes in D13224 (while keeping the associated test case passing), and is sufficient to fix PR54761, which was caused by an undesired interaction with that patch. Differential Revision: https://reviews.llvm.org/D124370	2022-05-09 11:47:55 +02:00
Martin Storsjö	61f9ec5e61	[libcxx] [test] Fix the nasty_macros test on Windows on ARM/ARM64 This isn't a configuration that we unfortunately can add to the CI practically at the moment, but I do run the tests sporadically offline in this configuration. Differential Revision: https://reviews.llvm.org/D124993	2022-05-09 12:46:41 +03:00
Marek Kurdej	85ec8a9ac1	[clang-format] Correctly handle SpaceBeforeParens for builtins. That's a partial fix for https://github.com/llvm/llvm-project/issues/55292. Before, known builtins behaved differently from other identifiers: ``` void f () { return F (__builtin_LINE() + __builtin_FOO ()); } ``` After: ``` void f () { return F (__builtin_LINE () + __builtin_FOO ()); } ``` Reviewed By: owenpan Differential Revision: https://reviews.llvm.org/D125085	2022-05-09 11:42:41 +02:00
Philipp Tomsich	91b24b0180	[AArch64] Ampere1 does not support MTE The initial support for the Ampere1 mistakenly signalled support for the MTE feature. However, the core does not include the optional MTE functionality. Update the target parser to not include MTE for Ampere1. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D125191	2022-05-09 11:29:42 +02:00
Rahul Anand R	7dcd0ea683	[AArch64] Generate AND in place of CSEL for predicated CTTZ This patch implements a for a target specific optimization that replaces the cmp and csel from cttz with an and mask. Differential Revision: https://reviews.llvm.org/D123782	2022-05-09 10:28:20 +01:00
Pavel Labath	fa593b079b	Revert "[lldb] parallelize calling of Module::PreloadSymbols()" This reverts commit `b7d807dbcf` -- it breaks TestMultipleDebuggers.py.	2022-05-09 11:11:01 +02:00
Florian Hahn	61bb2e4ea8	[ConstraintElimination] Add initial ssub.with.overflow tests.	2022-05-09 10:02:59 +01:00
Marek Kurdej	50cd52d935	[clang-format] Fix WhitespaceSensitiveMacros not being honoured when macro closing parenthesis is followed by a newline. Fixes https://github.com/llvm/llvm-project/issues/54522. This fixes regression introduced in `5e5efd8a91`. Before the culprit commit, macros in WhitespaceSensitiveMacros were correctly formatted even if their closing parenthesis weren't followed by semicolon (or, to be precise, when they were followed by a newline). That commit changed the type of the macro token type from TT_UntouchableMacroFunc to TT_FunctionLikeOrFreestandingMacro. Correct formatting (with `WhitespaceSensitiveMacros = ['FOO']`): ``` FOO(1+2) FOO(1+2); ``` Regressed formatting: ``` FOO(1 + 2) FOO(1+2); ``` Reviewed By: HazardyKnusperkeks, owenpan, ksyx Differential Revision: https://reviews.llvm.org/D123676	2022-05-09 10:59:33 +02:00
David Green	02f8519502	[DAG] Prevent infinite loop combining bitcast shuffle This prevents an infinite loop from D123801, where code trying to reduce the total number of bitcasts, but also handling constants, could create the opposite transform. Prevent the transform in these case to let the bitcast of a constant transform naturally. Fixes #55345	2022-05-09 09:36:22 +01:00
Ben Shi	d2c4ac979b	[AVR] Add PrintMethod for operand memspi Reviewed By: Patryk27 Differential Revision: https://reviews.llvm.org/D124913	2022-05-09 08:31:49 +00:00
Abinav Puthan Purayil	7f6489d0e3	[AMDGPU] Regenerate checks in a mir test	2022-05-09 13:28:09 +05:30
Jean Perier	ed0341788a	[flang] retain binding label of entry subprograms When processing an entry-stmt in name resolution, attrs_ was reset before SetBindNameOn was called, causing the symbol to lose the binding label information. Differential Revision: https://reviews.llvm.org/D125097	2022-05-09 09:50:17 +02:00
Hongtao Yu	a4190037fa	[CSSPGO][Preinliner] Use linear threshold to drive inline decision. The per-callsite size threshold used today to drive preinline decision is based on hotness/coldness cutoff. The default setup is for callsites with a sample count above the hotness cutoff (99%), a 1500 size threshold is used. Any callsite below 99.99% coldness cutoff uses a zero threshold. This has a couple issues: 1. While both cutoffs and size thoresholds are configurable, different applications may need different setups, making a universal setup impractical. 2. The callsites between hotness cutoff and coldness cutoff are not considered as inline candidates, which could be a missing opportunity. 3. Hot callsites always use the same threshold. In reality we may want a bigger threshold for hotter callsites. In this change we are introducing a linear threshold regardless of hot/cold cutoffs. Given a sample space, a threshold is computed for a callsite based on the position of that callsite sample in the whole space. With that we no longer need to define what's hot or cold. Callsites with different hotness will get a different threshold. This should overcome the above three issues. I have seen good results with a universal default setup for two of our internal services. For one service, 0.2% to 0.5% perf improvement over a baseline with a previous default setup, on-par code size. For the second service, 0.5% to 0.8% perf improvement over a baseline with a previous default setup, 0.2% code size increase; on-par performance and code size with a baseline that is with a carefully tuned cutoff to cover enough hot functions. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D125023	2022-05-08 22:07:58 -07:00
Christopher Bate	9879807393	[mlir][NvGpu] Fix nvgpu.mma.sync lowering to NVVM for f32, tf32 types Adds missing logic in the lowering from NvGPU to NVVM to support fp32 (in an accumulator operand) and tf32 (in multiplicand operand) types. Fixes logic in one of the helper functions for converting the result of a mma.sync operation with multiple 8x256bit output tiles, which is the case for f32 outputs. Differential Revision: https://reviews.llvm.org/D124533	2022-05-08 21:49:42 -06:00
Peixin-Qiao	c207e36025	[flang] Enforce a program not including more than one main program As Fortran 2018 5.2.2 states, a program shall consist of exactly one main program. Add this semantic check. Reviewed By: klausler Differential Revision: https://reviews.llvm.org/D125186	2022-05-09 10:48:06 +08:00
Xiaodong Liu	36d4f42c36	[lld] Fix typo for processAux; NFC Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D125163	2022-05-09 10:21:47 +08:00
Alexander Yermolovich	3abb68a626	[BOLT][DWARF] Fix assert for split dwarf. Fixing a small bug where it would assert if CU does not modify .debug_addr section. Differential Revision: https://reviews.llvm.org/D125181	2022-05-08 19:18:17 -07:00
Simon Pilgrim	9a12138b5f	[SLP][X86] Add test coverage for PR50392 / Issue #49736	2022-05-08 19:40:04 +01:00

1 2 3 4 5 ...

423266 Commits All Branches Search

423266 Commits

All Branches