llvm-project

Commit Graph

Author	SHA1	Message	Date
Aaron Puchert	2c3cf62d4a	Make TableGenGlobalISel an object library That's how it was originally intended but that wasn't possible because we still needed to support older CMake versions. The problem here is that the sources in TableGenGlobalISel are meant to be linked into both llvm-tblgen and TableGenTests (a unit test), but not be part of LLVM proper. So they shouldn't be an ordinary LLVM component. Because they are used in llvm-tblgen, they can't draw in the LLVM dylib dependency, but then we'd have to do the same thing in TableGenTests to make sure we don't link both a static Support library and another copy through the LLVM dylib. With an object library we're just reusing the object files and don't have to care about dependencies at all. Reviewed By: beanz Differential Revision: https://reviews.llvm.org/D74588	2021-03-31 22:20:56 +02:00
Alexey Bataev	a28e835e94	[OPENMP]Fix PR48885: Crash in passing firstprivate args to tasks on Apple M1. Need to bitcast the function pointer passed as a parameter to the real type to avoid possible problem with calling conventions. Differential Revision: https://reviews.llvm.org/D99521	2021-03-31 13:00:58 -07:00
Alexey Bataev	66da4f6fc9	[OPENMP]Fix PR48658: [OpenMP 5.0] Compiler crash when OpenMP atomic sync hints used. No need to consider hint clause kind as the main atomic clause kind at the codegen. Differential Revision: https://reviews.llvm.org/D99611	2021-03-31 12:58:24 -07:00
Philip Reames	98f08e7d81	[tests] Exercise cases where SCEV can use trip counts to refine ashr/lshr recurrences	2021-03-31 12:48:50 -07:00
Jez Ng	9b6dde8af8	[lld-macho] Parallelize UUID hash computation This reuses the approach (and some code) from LLD-ELF. It's a decent win when linking chromium_framework on a Mac Pro (3.2 GHz 16-Core Intel Xeon W): N Min Max Median Avg Stddev x 20 4.58 4.83 4.66 4.6685 0.066591844 + 20 4.42 4.61 4.5 4.505 0.04751731 Difference at 95.0% confidence -0.1635 +/- 0.0370242 -3.5022% +/- 0.793064% (Student's t, pooled s = 0.0578462) The output binary is 381MB. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D99279	2021-03-31 15:48:36 -04:00
Jez Ng	09aed384ba	[lld-macho][nfc] Test that -ObjC will import bitcode with category sections The functionality was originally added in {D95265}, but the test in that diff only checked if `-ObjC` would cause bitcode containing ObjC class symbols to be loaded. It neglected to test for bitcode containing categories but no class symbols. This diff also changes the lto-archive.ll test to use `-why_load` instead of inspecting the output binary's symbol table. This is motivated by the stacked diff {D99105}, which will hide irrelevant bitcode symbols. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D99215	2021-03-31 15:48:36 -04:00
Alexey Bataev	4ced958dc2	[SLP]Update test checks, NFC	2021-03-31 12:35:58 -07:00
Craig Topper	9e00b6660d	[SelectionDAG] Remove unneeded vector resize from the end of FoldConstantArithmetic. NFC There's an assert right before that makes sure the size already matches. Earlier in this function's life, scalars and vectors shared more code.	2021-03-31 12:33:10 -07:00
Andrew Young	9c61c76b12	[mlir][cse] do not replace operands in previously simplified operations If an operation has been inserted as a key into the known values hashtable, then it cannot be modified in a way which changes its hash. This change avoids modifying the operands of any previously recorded operation, which prevents their hash from changing. In an SSACFG region, it is impossible to visit an operation before visiting its operands, so this is not a problem. This situation can only happen in regions without strict dominance, such as graph regions. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D99486	2021-03-31 12:20:34 -07:00
George Mitenkov	807b019ca2	[ConstantFolding] Fixing addo/subo with undef When folding addo/subo with undef, the current convention is to use { -1, false } for addo and { 0, false } for subo. This was fixed for InstSimplify in https://reviews.llvm.org/rGf094d65beaa492e845b03561eddd75b5be653a01, but not in ConstantFolding. Reviewed By: nikic, lebedev.ri Differential Revision: https://reviews.llvm.org/D99564	2021-03-31 21:47:29 +03:00
Alexey Bataev	10847f6217	[SLP]Add a test for the bug in `getVectorElementSize()`, NFC.	2021-03-31 11:40:44 -07:00
peter klausler	7f8da0791c	[flang] Refine checks for pointer initialization targets f18 was emitting a bogus error message about the lack of a TARGET attribute when a pointer was initialized with a component of a variable that was a legitimate TARGET. Differential Revision: https://reviews.llvm.org/D99665	2021-03-31 11:32:12 -07:00
Huihui Zhang	fe5c4a06a4	[LoopVectorize] Use SetVector to track uniform uses to prevent non-determinism. Use SetVector instead of SmallPtrSet to track values with uniform use. Doing this can help avoid non-determinism caused by iterating over unordered containers. This bug was found with reverse iteration turning on, --extra-llvm-cmake-variables="-DLLVM_REVERSE_ITERATION=ON". Failing LLVM test consecutive-ptr-uniforms.ll . Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D99549	2021-03-31 11:21:07 -07:00
Suraj Sudhir	888c5067b4	Move non-spec TOSA operators into TosaUtilOps.td Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D99628	2021-03-31 11:01:01 -07:00
Petr Hosek	fcf6800506	[Driver] Move detectLibcxxIncludePath to ToolChain This helper method is useful even outside of Gnu toolchains, so move it to ToolChain so it can be reused in other toolchains such as Fuchsia. Differential Revision: https://reviews.llvm.org/D88452	2021-03-31 10:50:44 -07:00
Thomas Lively	45783d0e8a	[WebAssembly] Implement i64x2 comparisons Removes the prototype builtin and intrinsic for i64x2.eq and implements that instruction as well as the other i64x2 comparison instructions in the final SIMD spec. Unsigned comparisons were not included in the final spec, so they still need to be scalarized via a custom lowering. Differential Revision: https://reviews.llvm.org/D99623	2021-03-31 10:46:17 -07:00
Juneyoung Lee	df0b97dab0	[ValueTracking] Add with.overflow intrinsics to poison analysis functions This is a patch teaching ValueTracking that `s/u*.with.overflow` intrinsics do not create undef/poison and they propagate poison. I couldn't write a nice example like the one with ctpop; ValueTrackingTest.cpp was simply updated to check these instead. This patch helps reduce regressions while fixing https://llvm.org/pr49688 . Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D99671	2021-04-01 02:41:38 +09:00
Philip Reames	ae7b1e8823	[SCEV] Handle unreachable binop when matching shift recurrence This fixes an issue introduced with my change d4648e, and reported in pr49768. The root problem is that dominance collapses in unreachable code, and that LoopInfo explicitly only models reachable code. Since the recurrence matcher doesn't filter by reachability (and can't easily because not all consumers have domtree), we need to bailout before assuming that finding a recurrence implies we found a loop.	2021-03-31 10:33:34 -07:00
Craig Topper	437958d9fd	[X86] Improve SMULO/UMULO codegen for vXi8 vectors. The default expansion creates a MUL and either a MULHS/MULHU. Each of those separately expand to sequences that use one or more PMULLW instructions as well as additional instructions to extend the types to vXi16. The MULHS/MULHU expansion computes the whole 16-bit product, but only keeps the high part. We can improve the lowering of SMULO/UMULO for some cases by using the MULHS/MULHU expansion, but keep both the high and low parts. And we can use those parts to calculate the overflow. For AVX512 we might have vXi1 overflow outputs. We can improve those by using vpcmpeqw to produce a k register if AVX512BW is enabled. This is a little better than truncating the high result to use vpcmpeqb. If we don't have avx512bw we can extend up to v16i32 to use vpcmpeqd to produce a k register. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D97624	2021-03-31 10:13:50 -07:00
Shimin Cui	00c0c8c87d	[PowerPC] [MLICM] Enable hoisting of caller preserved registers on AIX On ppc64 linux , MachineLICM will hoist caller preserved registers, including TOC loads of the global variable address, out of loops. This is to enable this on AIX for both ppc64 and ppc32. Differential Revision: https://reviews.llvm.org/D99076	2021-03-31 12:46:25 -04:00
Craig Topper	50b8634a99	[X86] Improve optimizeCompareInstr for signed comparisons after BMI/TBM instructions We previously couldn't optimize out a TEST if the branch/setcc/cmov used the overflow flag. This patch allows the TEST to be removed if the flag producing instruction is known to clear the OF flag. That's what the TEST instruction would have done so that should be equivalent. Need to add test cases. I'll try to get back to this if I have bandwidth. Fixes PR48768. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D94856	2021-03-31 09:45:29 -07:00
Wael Yehia	563cdeaafd	[LTO][Legacy] Decouple option parsing from LTOCodeGenerator In this patch we add a new libLTO API to specify debug options independent of an lto_code_gen_t. This allows clients to pass codegen flags (through libLTO) which otherwise today are ignored. Reviewed By: steven_wu Differential Revision: https://reviews.llvm.org/D92611	2021-03-31 16:43:26 +00:00
Craig Topper	2a8b7cab6a	[RISCV] Add RISCVISD opcodes for CLZW and CTZW. Our CLZW isel pattern is quite easily broken by surrounding code preventing it from matching sometimes. This usually results in failing to remove the and X, 0xffffffff inserted by type legalization. The add with -32 that type legalization also inserts will often get combined into other add/sub nodes. That doesn't usually result in extra code when we don't use clzw. CTTZ seems to be less fragile, but I wanted to keep it consistent with CTLZ. Reviewed By: asb, HsiangKai Differential Revision: https://reviews.llvm.org/D99317	2021-03-31 09:40:07 -07:00
Jay Foad	b138cf115e	[AMDGPU] Add some image tests with enable-prt-strict-null disabled. NFC.	2021-03-31 17:27:20 +01:00
Jay Foad	a991ee330b	[AMDGPU] Use a common check prefix for some image tests. NFC.	2021-03-31 17:27:20 +01:00
Craig Topper	04f10ab367	[RISCV] Add isel patterns to select vsub_vx intrinsic to vadd.vi if it uses a small enough immediate Also modify the simm5_plus1 check because Imm-1 is UB if Imm happens to be INT64_MIN. I don't think the compiler would optimize based on that in this usage, but it could fail UBSan or -ftrapv. Reviewed By: HsiangKai, frasercrmck Differential Revision: https://reviews.llvm.org/D99637	2021-03-31 09:26:41 -07:00
Arthur Eubanks	09b2419360	[llvm-jitlink] Fix -Wunused-function on Windows Reviewed By: sgraenitz Differential Revision: https://reviews.llvm.org/D99604	2021-03-31 09:26:09 -07:00
Heejin Ahn	f38a9d6340	[WebAssembly] Raname a test and fix comments D99627 fixed a decoding bug, not an encoding bug. This renames the test to correct it and fix comments. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D99644	2021-03-31 09:13:08 -07:00
Sanjay Patel	1462bdf1b9	[InstCombine] fold abs(srem X, 2) This is a missing optimization based on an example in: https://llvm.org/PR49763 As noted there and the test here, we could add a more general fold if that is shown useful. https://alive2.llvm.org/ce/z/xEHdTv https://alive2.llvm.org/ce/z/97dcY5	2021-03-31 11:29:20 -04:00
Sanjay Patel	07a6d07c48	[InstCombine] add tests for srem+abs; NFC	2021-03-31 11:29:20 -04:00
Bradley Smith	4e52daa254	[AArch64][SVE] Add tests for UREM/SREM using fixed SVE types Differential Revision: https://reviews.llvm.org/D99265	2021-03-31 16:09:55 +01:00
Timm Bäder	5018e15fdf	[clang][parser] Allow GNU-style attributes in explicit template... ... instantiations They are currently not being diagnosed because ProhibitAttributes() does not handle attribute lists with an invalid source range. But once it does, we need to allow GNU attributes in this place. Additionally, start optionally diagnosing empty attr lists in ProhibitCXX11Attributes(), since ProhibitAttribute() does it. Differential Revision: https://reviews.llvm.org/D97362	2021-03-31 16:44:19 +02:00
Arthur O'Dwyer	3bdd674fbf	[libc++] Mark convert_copy.pass.cpp as UNSUPPORTED on clang-13 (i.e. trunk). Because the constexpr-time codepath triggers a Clang bug. It seems that Clang compiles it okay in release mode, but when Clang itself is compiled in debug mode (with assertions turned on), this input triggers an assertion failure in Clang itself. See comments on D96385 and Clang bug report https://bugs.llvm.org/show_bug.cgi?id=45879 This commit should get the debug-mode buildbots back to green.	2021-03-31 10:22:11 -04:00
Luís Marques	a8cf32baf5	[RISCV] Add XFAIL riscv32 for known issue with the old pass manager See D80668, rG7b4832648a63 and https://bugs.llvm.org/show_bug.cgi?id=46117 for details of the issue. Differential Revision: https://reviews.llvm.org/D99108	2021-03-31 15:18:32 +01:00
Sander de Smalen	7108b2dec1	[SVE] Fix LoopVectorizer test scalalable-call.ll This marks FSIN and other operations to EXPAND for scalable vectors, so that they are not assumed to be legal by the cost-model. Depends on D97470 Reviewed By: dmgreen, paulwalker-arm Differential Revision: https://reviews.llvm.org/D97471	2021-03-31 14:52:49 +01:00
Sander de Smalen	b6d0529780	[CostModel] Align the cost model for intrinsics for scalable/fixed-width vectors. Let getIntrinsicInstrCost call getTypeBasedIntrinsicInstrCost for scalable vectors, similar to how this is done for fixed-width vectors, instead of falling back on BaseT::getIntrinsicInstrCost(). If the intrinsic cannot be costed (or is not overloaded by the target), it will return InstructionCost::getInvalid() instead. Depends on D97469 Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D97470	2021-03-31 14:52:49 +01:00
Anton Bikineev	dc7ebd2cb0	[C++2b] Support size_t literals This adds support for C++2b's z/uz suffixes for size_t literals (P0330).	2021-03-31 13:36:23 +00:00
Joerg Sonnenberger	9f4022ffeb	[libc++] Avoid <climits> dependency in <thread> The standard guarantees sleep durations of 2^63-1 nanoseconds to work. Instead of depending on INT64_MAX or ULONGLONG_MAX to exist via the header pollution, fold the constant directly. That has the additional positive side effect that it avoids long double arithmetic bugs in GCC. Differential Revision: https://reviews.llvm.org/D99516	2021-03-31 15:28:16 +02:00
Balázs Kéri	ffcb4b43b7	Revert "[clang][Checkers] Extend PthreadLockChecker state dump (NFC)." This reverts commit `49c0ab6d76`. Test failures showed up because of non-deterministic output.	2021-03-31 15:28:53 +02:00
Sander de Smalen	2f6f249a49	NFC: Change getIntrinsicInstrCost to return InstructionCost This patch migrates the TTI cost interfaces to return an InstructionCost. See this patch for the introduction of the type: https://reviews.llvm.org/D91174 See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2020-November/146408.html Depends on D97468 Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D97469	2021-03-31 14:04:41 +01:00
Sander de Smalen	2f56e1c6b1	NFC: Change getTypeBasedIntrinsicCost to return InstructionCost This patch migrates the TTI cost interfaces to return an InstructionCost. See this patch for the introduction of the type: https://reviews.llvm.org/D91174 See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2020-November/146408.html Depends on D97466 Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D97468	2021-03-31 14:04:41 +01:00
Max Kazantsev	8396aeb07c	[Test] Auto-update test checks	2021-03-31 19:15:28 +07:00
Muhammad Omair Javaid	71b648f715	Revert "[LLDB] Arm64/Linux Add MTE and Pointer Authentication registers" This reverts commit `1164b4e295`. Reason: LLDB AArch64 Linux buildbot failure	2021-03-31 17:12:14 +05:00
Muhammad Omair Javaid	feb6f2c78f	Revert "[LLDB] Arm64/Linux test case for MTE and Pointer Authentication regset" This reverts commit `9ab6771800`. Reason: LLDB AArch64/Linux buildbot failure.	2021-03-31 17:12:14 +05:00
Liqiang Tao	d2d6720a93	[InlineCost] Remove TODO comment that consider other forms of savings in the cost-benefit analysis Attempts to compute savings more accurately cannot impact the set of critically important call sites. Reviewed By: kazu Differential Revision: https://reviews.llvm.org/D98577	2021-03-31 20:11:32 +08:00
Roman Lebedev	ce548aa236	[X86] AMD Zen 3 has macro fusion This is an improvement over Zen 2, where only branch fusion is supported, as per Agner, 21.4 Instruction fusion. AMD SOG 17h has no mention of fusion. AMD SOG 19h, 2.9.3 Branch Fusion The following flag writing instructions support branch fusion with their reg/reg, reg/imm and reg/mem forms * CMP * TEST * SUB * ADD * INC (no fusion with branches dependent on CF) * DEC (no fusion with branches dependent on CF) * OR * AND * XOR Agner, 22.4 Instruction fusion <...> This applies to CMP, TEST, ADD, SUB, AND, OR, XOR, INC, DEC and all conditional jumps, except if the arithmetic or logic instruction has a rip-relative address or both an address displacement and an immediate operand.	2021-03-31 14:31:50 +03:00
Balazs Benics	9d474be11d	[ASTImporter][NFC] Fix duplicated symbols in "Improve test coverage" D99576 introduced a duplicate symbol, now I'm removing it. Differential Revision: https://reviews.llvm.org/D99576	2021-03-31 12:47:50 +02:00
Fraser Cormack	10fc6e4358	[RISCV] Add support for the stepvector intrinsic This adds almost everything required for supporting the new stepvector intrinsic on RVV. It is lowered to the existing VID_VL SDNode. The only exception is a limitation that RV32 cannot yet lower the intrinsic on i64 vectors. This is because the step operand is (currently) required to be at least as large as the vector element type. I will look into patching that out and loosening the requirement to only an integer pointer type. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D99594	2021-03-31 11:41:17 +01:00
Muhammad Omair Javaid	98d070396d	Revert "[LLDB] Skip TestVSCode_disconnect.test_launch arm/linux" This reverts commit `73cf85e527`.	2021-03-31 15:22:49 +05:00
Jay Foad	5d0e9ddfa5	[AMDGPU][GlobalISel] Add support for global atomicrmw fadd This includes gfx908 which only has a no-return version of the global_atomic_add_f32 instruction, using the same hack that was previously implemented for selecting from the llvm.amdgcn.global.atomic.fadd intrinsic. Differential Revision: https://reviews.llvm.org/D97767	2021-03-31 11:13:00 +01:00
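
Note on 1462bdf1b9 ([InstCombine] fold abs(srem X, 2)): the commit message does not spell out the folded form, so the sketch below assumes the result is equivalent to `and X, 1`. It restates both forms in plain C++ and compares them over a range of inputs. The function names (`absSRem2`, `lowBit`, `formsAgree`, `agreeAtExtremes`) are hypothetical and are not LLVM APIs; the Alive2 links in the commit remain the authoritative proof.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Original form: abs(srem X, 2). C++'s % truncates toward zero,
// so the remainder is -1, 0, or 1, matching LLVM's srem.
constexpr std::int32_t absSRem2(std::int32_t x) {
  std::int32_t r = x % 2;
  return r < 0 ? -r : r;
}

// Assumed folded form: and X, 1 (relies on two's complement,
// which C++20 guarantees and mainstream targets already use).
constexpr std::int32_t lowBit(std::int32_t x) { return x & 1; }

// Returns true if both forms agree for every value in [lo, hi].
constexpr bool formsAgree(std::int32_t lo, std::int32_t hi) {
  for (std::int64_t v = lo; v <= hi; ++v) {
    auto x = static_cast<std::int32_t>(v);
    if (absSRem2(x) != lowBit(x))
      return false;
  }
  return true;
}

// Checks the boundary values, where abs() of a wider result could
// overflow in general but cannot here (the remainder is at most 1).
constexpr bool agreeAtExtremes() {
  constexpr auto lo = std::numeric_limits<std::int32_t>::min();
  constexpr auto hi = std::numeric_limits<std::int32_t>::max();
  return absSRem2(lo) == lowBit(lo) && absSRem2(hi) == lowBit(hi);
}
```

Calling `formsAgree(-1000, 1000)` and `agreeAtExtremes()` from a test is a quick sanity check of the equivalence; it is not a substitute for the formal verification linked in the commit.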

384394 Commits (All Branches)