llvm-project

Commit Graph

Author	SHA1	Message	Date
Jay Foad	30b27ecfc2	[AMDGPU] Use new opcode for indexed vgpr reads Introduce V_MOV_B32_indirect_read for indexed vgpr reads (and rename the old V_MOV_B32_indirect to V_MOV_B32_indirect_write) so they can be unambiguously distinguished from regular V_MOV_B32_e32. Previously they were distinguished by looking for extra implicit operands but this is fragile because regular moves sometimes have extra implicit operands too: - either by accident, when instructions end up with duplicate implicit operands (see e.g. D100939) - or by design, when SIInstrInfo::copyPhysReg breaks a multi-dword copy into individual subreg mov instructions and adds implicit operands for the super-register. The effect of this is that SIInstrInfo::isFoldableCopy can be simplified and identifies more foldable copies. The test diffs show that more immediate 0 values have been folded as inline operands. SIInstrInfo::isReallyTriviallyReMaterializable could probably be simplified too but that is not part of this patch. Differential Revision: https://reviews.llvm.org/D114230	2021-11-19 13:08:11 +00:00
Roman Lebedev	049799c311	[X86][Costmodel] `getReplicationShuffleCost()`: promote 1 bit-wide elements to 8 bit when have AVX512BW+AVX512VBMI If in addition to AVX512BW (that provides `{k}<->{i8,i16}` casts and i16 shuffles), we have AVX512VBMI, which provides i8 shuffles, we are in an optimal situation. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D114071	2021-11-19 15:58:10 +03:00
Roman Lebedev	a751084bb4	[X86][Costmodel] `trunc v16i8 to v8i1` can appear after legalization, cost is same as for `trunc v8i8 to v8i1` Note that there are many other missing costs, i'm only adding the ones that are queried from `getReplicationShuffleCost()` for the existing (quite exhaustive) test coverage. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D114070	2021-11-19 15:57:32 +03:00
Roman Lebedev	a50fdd3fc9	[X86][Costmodel] `getReplicationShuffleCost()`: promote 1 bit-wide elements to 16 bit when have AVX512BW Here we get pretty lucky. AVX512F does not provide any instructions to convert between a `k` vector mask and a vector, but AVX512BW adds `{k}<->nX{i8,i16}`conversions, and just as it happens, with AVX512BW we have a i16 shuffle. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D113915	2021-11-19 15:55:41 +03:00
Fraser Cormack	92d279fd6d	[LangRef][VP] Correct operands' types in vp.select documentation The types of llvm.vp.select's operands much match the return type.	2021-11-19 12:08:34 +00:00
Simon Pilgrim	0f652d8f52	[X86] LowerRotate - recognise hidden ROTR patterns for better vXi8 codegen Check for a hidden ISD::ROTR (rotl(sub(0,x))) - vXi8 lowering can handle both (its always beneficial for splats, but otherwise only if we have VPTERNLOG). We currently hit infinite loops in TargetLowering::expandROT if we set ISD::ROTR to custom, which needs addressing before we extend this much further.	2021-11-19 11:49:15 +00:00
Andrew Ng	47eb3f155f	[ELF] Ensure output section is not discarded in addStartEndSymbols() Fixes https://bugs.llvm.org/show_bug.cgi?id=52534. Differential Revision: https://reviews.llvm.org/D114179	2021-11-19 11:45:58 +00:00
Simon Pilgrim	812e64ef0c	[DAG] MatchRotate - support rotate-by-constant of illegal types Patch to fix some of the regressions in D77804. By folding to rotate/funnel-shift by constant amounts for illegal types, we prevent SimplifyDemandedBits from destroying the patterns prematurely, allowing us to use the rotate/funnel-shift legalization that was added in D112443. Differential Revision: https://reviews.llvm.org/D113192	2021-11-19 11:12:04 +00:00
Balazs Benics	bf55b9f0d0	[analyzer][docs] Ellaborate the docs of cplusplus.StringChecker Let's describe accurately what the users can expect from the checker in a direct way. Also, add an example warning message. Reviewed By: martong, Szelethus Differential Revision: https://reviews.llvm.org/D113401	2021-11-19 11:59:46 +01:00
Valentin Clement	b8207db700	[fir] Add fir.no_reassoc conversion Currently `fir.no_reassoc` is just removed in the conversion. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D114154 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>	2021-11-19 11:38:24 +01:00
Balazs Benics	f3753ad774	[ASTImporter][NFC] Dump decl name at assertion violation Sometimes it would be useful to see which Decl kind caused some issue, along with the name of the concrete instance of the Decl in the source code. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D113668	2021-11-19 11:31:01 +01:00
Konstantin Schwarz	8c18719bae	[ELF] Expand LMA region if output section alignment introduces padding When aligning the start address of an output section introduces a gap between the current dot pointer and the new aligned address, we were already properly expanding the memory region, if available. D74286 introduced a new behavior to also align the LMA address if an LMA region is specified. However, this did not expand the corresponding LMA region. Now, we also expand the LMA region if it is set. This fixes PR52510. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D114166	2021-11-19 11:27:21 +01:00
Fabian Wolff	6259016361	[clang-tidy] Fix false positive in readability-identifier-naming check involving override attribute Overriding methods should not get a readability-identifier-naming warning because the issue can only be fixed in the base class; but the current check for whether a method is overriding does not take the override attribute into account. Differential Revision: https://reviews.llvm.org/D113830	2021-11-19 22:31:11 +13:00
Diana Picus	ca3795541f	[flang] Fix printing of constc and parsing of #fir.real Printing and parsing of constc didn't agree with each other. This patch treats the parsing of constc as the final word and fixes the printing accordingly. More concretely, this patch prints the RealAttrs that make up the ConstcOp directly instead of casting to mlir::FloatAttr (which blows up). It also fixes parseFirRealAttr to invoke APFloat's method for getting the size of a floating point type instead of computing it as 8 * kind (which blows up for BFloat, with kind == 3 and size == 16). Kudos to Kiran Chandramohan <kiran.chandramohan@arm.com> for noticing that we were missing tests for constc in fir-ops.fir. Differential Revision: https://reviews.llvm.org/D114081	2021-11-19 09:28:02 +00:00
Matt Beardsley	85e03cb7eb	[clang-tidy] fix debug-only test failure The clang-tidy/infrastructure/pr37091.cpp test inherits the top-level .clang-tidy configuration because it doesn't specify its own checks. It'd be a more stable test if it operates independently of the top-level .clang-tidy settings. I've made the clang-tidy/infrastructure/pr37091.cpp test independent of the top-level .clang-tidy (picked an arbitrary check that I saw another clang-tidy/infrastructure test was also using: clang-tidy/infrastructure/temporaries.cpp) Reviewed By: kbobyrev Differential Revision: https://reviews.llvm.org/D114034	2021-11-19 10:19:07 +01:00
Valentin Clement	78d69182b7	[mlir] Expose region utils functions As discussed in D109579, this patch exposes `runRegionDCE` and `eraseUnreachableBlocks` so they can be used as separate utilities in other passes. Reviewed By: rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D114160	2021-11-19 09:24:39 +01:00
Yonghong Song	8fb3f84484	BPF: Workaround InstCombine trunc+icmp => mask+icmp Optimization Patch [1] added further InstCombine trunc+icmp => mask+icmp optimization and this caused a couple of bpf selftest failure. Previous llvm BPF backend patch [2] introduced llvm.bpf.compare builtin to handle such situations. This patch further added support ">" and ">=" icmp opcodes. Tested with bpf selftests and all tests are passed including two previously failed ones. Note Patch [1] also added optimization if the to-be-compared constant is negative-power-of-2 (-C) or not-of-power-of-2 (~C). This patch didn't implement these two cases as typical bpf program compares a scalar to a positive length or boundary value, and this scalar later is used as a index into an array buffer or packet buffer. [1] https://reviews.llvm.org/D112634 [2] https://reviews.llvm.org/D112938 Differential Revision: https://reviews.llvm.org/D114215	2021-11-18 20:25:28 -08:00
Kent Ross	870dfa6b08	[libc++][doc] Mark project for [cmp.concept] done Mark [cmp.concept] implementation as completed in our documentation. Reviewed By: ldionne, #libc Differential Revision: https://reviews.llvm.org/D114203	2021-11-18 20:15:54 -08:00
Serguei Katkov	3557f49353	[AARCH64] Teach AArch64FrameLowering::getFrameIndexReferencePreferSP really prefer SP. Do more efforts to use sp if it is possible to lower a frame index. Reviewers: reames, loicottet, ostannard, t.p.northover Reviewed By: reames Subscribers: arphaman, danilaml, hiraditya, kristof.beyls, llvm-commits, Matt, yrouban Differential Revision: https://reviews.llvm.org/D111133	2021-11-19 11:14:02 +07:00
Vitaly Buka	d591a46d17	[NFC][sanitizer] Fix naming in StackStore	2021-11-18 19:39:20 -08:00
Vitaly Buka	8210948a46	[NFC][sanitizer] constexpr StackStore::StackStore()	2021-11-18 19:24:29 -08:00
Vitaly Buka	bb50e97103	[NFC][sanitizer] Change StackStore API to use StackTrace	2021-11-18 18:53:51 -08:00
Vitaly Buka	6bf71be9f9	[NFC][sanitizer] Move inline implementation of StackStore into cpp	2021-11-18 18:38:06 -08:00
Vitaly Buka	368a3b5221	[NFC][sanitizer] Rename PersistentAllocator to StackStore	2021-11-18 18:37:51 -08:00
Vitaly Buka	b6169e231e	[nfc][dfsan] Remove obsolete comment	2021-11-18 18:37:13 -08:00
Vitaly Buka	930ccf0191	[NFC][sanitizer] Rename persistent_allocator to stack_store	2021-11-18 18:36:58 -08:00
Matheus Izvekov	85914b7570	[clang] fix regression deducing pack expansion arguments introduced by D110216 This test case had been missing when the original code was introduced by `2fcb863b2b`. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Differential Revision: https://reviews.llvm.org/D114207	2021-11-19 03:36:20 +01:00
Senran Zhang	0425ea4621	[NFC][OpaquePtr][Evaluator] Remove call to PointerType::getElementType There are still another 2 uses of PointerType::getElementType in Evaluator when evaluating BitCast's on pointers. BitCast's on pointers should be removed when opaque ptr is ready, so I just keep them as is. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D114131	2021-11-19 10:32:55 +08:00
Vitaly Buka	b0ea7ef0d6	[msan] Clang-format the test	2021-11-18 18:30:49 -08:00
Manoj Gupta	2782cb8da0	libfuzzer: All building libfuzzer for ARM32 We need libfuzzer libraries on Arm32 so that we can fuzz Arm32 binaries on Linux (Chrome OS). Android already allows Arm32 for libfuzzer. Reviewed By: morehouse Differential Revision: https://reviews.llvm.org/D112091	2021-11-18 17:30:24 -08:00
Philip Reames	734abbad79	[SCEV] Defer all work from `ea12c2cb` as late as possible This is a second speculative compile time optimization to address a reported regression. My actual suspicion is that availability of no-self-wrap is making some other bit of code trigger, but let's rule this out.	2021-11-18 17:19:52 -08:00
Stanislav Mekhanoshin	f8e615462b	[AMDGPU] Fix SIPostRABundler crash on null register used by dbg value Recently we started generate DBG_VALUEs with $noreg operands. This crashes SIPostRABundler, and it should not iterate these registers anyway. Fixes: SWDEV-311733 Differential Revision: https://reviews.llvm.org/D114202	2021-11-18 17:01:19 -08:00
Rong Xu	f2b5d81f62	[SampleFDO] Add pass name strings for the passes (NFC)	2021-11-18 16:10:04 -08:00
Jim Ingham	d9f56feda8	Remove unused variable.	2021-11-18 15:58:20 -08:00
Victor Huang	40c65655af	[PowerPC] Remove the redundant terminator instruction when optimizing conditional trap This patch is a follow up patch for `ae27ca9a67` to the remove redundant terminator when optimizing conditional trap. Peer reviewed by: nemanjai	2021-11-18 17:52:26 -06:00
Stanislav Mekhanoshin	24cc79b9ef	[AMDGPU] Regenerate postra-bundle-memops.mir checks. NFC.	2021-11-18 15:51:10 -08:00
Ahmed Bougacha	e3a7f0e2f9	[AArch64][PAC] Select llvm.ptrauth.sign/sign.generic to PAC*. The @llvm.ptrauth.sign/sign.generic intrinsics map cleanly to the various AArch64 PAC[IDG][Z][AB] instructions. Select them. Differential Revision: https://reviews.llvm.org/D91087	2021-11-18 15:21:30 -08:00
Konstantin Varlamov	3b463c4528	[libc++][NFC] Assign some Ranges tasks to varconst. Specifically: - [special.mem.concepts]; - [specialized.algorithms].	2021-11-18 15:15:21 -08:00
Keith Smiley	0c4464a5bd	[lldb] Fix formatted log statement Previously this would output literally without replacements Differential Revision: https://reviews.llvm.org/D114178	2021-11-18 15:09:38 -08:00
Adrian Prantl	cbb8b200a6	Revert "Increase gdbremote timeout." This reverts commit `6424dc21bf`.	2021-11-18 15:07:30 -08:00
Mogball	7c5ecc8b7e	[mlir][vector] Insert/extract element can accept index `vector::InsertElementOp` and `vector::ExtractElementOp` have had their `position` operand changed to accept `AnySignlessIntegerOrIndex` for better operability with operations that use `index`, such as affine loops. LLVM's `extractelement` and `insertelement` can also accept `i64`, so lowering directly to these operations without explicitly inserting casts is allowed. SPIRV's equivalent ops can also accept `i64`. Reviewed By: nicolasvasilache, jpienaar Differential Revision: https://reviews.llvm.org/D114139	2021-11-18 22:40:29 +00:00
Arjun P	3b7b4a8041	[MLIR][NFC] Simplex::markRowRedundant: assert that row is not already marked redundant	2021-11-19 03:43:25 +05:30
David Blaikie	3cbc4c487a	llvm-dwarfdump: Rebuild type names in dwo type units	2021-11-18 14:12:48 -08:00
Fabian Wolff	dc1c27149f	[libc++] Cast to the right `difference_type` in various algorithms Differential Revision: https://reviews.llvm.org/D113868	2021-11-18 17:07:36 -05:00
Adrian Prantl	721bb73676	make testcase more robust	2021-11-18 14:00:53 -08:00
Bardia Mahjour	f27ff7ff0e	[docs] Add Loop Optimization WG to the online sync-up page	2021-11-18 16:49:11 -05:00
Philip Reames	1a5666acb2	[SCEV] Defer loop property checks from `ea12c2cb` as late as possible This is a speculative compile time optimization to address a reported regression. It's the only thing which vaguely makes sense.	2021-11-18 13:47:45 -08:00
MaheshRavishankar	d26beb0be2	[mlir][Linalg] Add method to check if LinalgTransformationFilter has been applied. Differential Revision: https://reviews.llvm.org/D114170	2021-11-18 13:45:30 -08:00
Muiez Ahmed	27ea67136e	[SystemZ][z/OS] Fix filesystem testing This patch resolves many of the failures in the `filesystems/` buckets in the libc++ tests. It adds the correct flag to `fopen` and marks a test case as unsupported. In particular, that test assumes time is stored as a 64 bit value when on MVS it is stored as 32 bit. Differential Revision: https://reviews.llvm.org/D113298	2021-11-18 15:57:50 -05:00
Muiez Ahmed	049f6c29a6	[libc++] Resolve missing table_size symbol The aim of this patch is to resolve the missing `table_size` symbol (see reduced test case). That const variable is declared and defined in //libcxx/include/locale//; however, the test case suggests that the symbol is missing. This is due to a C++ pitfall (highlighted [[ https://quuxplusone.github.io/blog/2020/09/19/value-or-pitfall/ \| here ]]). In summary, assigning the reference of `table_size` doesn't enforce the const-ness and expects to find `table_size` in the DLL. The fix is to use `constexpr` or have an out-of-line definition in the src (for consistency). Differential Revision: https://reviews.llvm.org/D110647	2021-11-18 15:49:45 -05:00

1 2 3 4 5 ...

405213 Commits All Branches Search

405213 Commits

All Branches