llvm-project

Commit Graph

Author	SHA1	Message	Date
Mingming Liu	fc58d7a326	[Peephole-opt][X86] Enhance peephole opt to see through SUBREG_TO_REG (following AND) and eliminates redundant TEST instruction. Differential Revision: https://reviews.llvm.org/D124118	2022-05-10 15:56:20 -07:00
Chia-hung Duan	96e642652b	[mlir] Print some message for op-printing verification Before dump, Insetad of switching to generic form silently after verification failure. Print some debug logs to help identify why an op may be printed in a different way. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D125136	2022-05-10 22:48:47 +00:00
Thomas Raoux	15bcc36eed	[mlir][gpu] Move async copy ops to NVGPU and add caching hints Move async copy operations to NVGPU as they only exist on NV target and are designed to match ptx semantic. This allows us to also add more fine grain caching hint attribute to the op. Add hint to bypass L1 and hook it up to NVVM op. Differential Revision: https://reviews.llvm.org/D125244	2022-05-10 22:30:24 +00:00
Vasileios Porpodas	71bcead98b	[SLP] Make reordering aware of external vectorizable scalar stores. The current reordering scheme only checks the ordering of in-tree operands. There are some cases, however, where we need to adjust the ordering based on the ordering of a future SLP-tree who's instructions are not part of the current tree, but are external users. This patch is a simple implementation of this. We keep track of scalar stores that are users of TreeEntries and if they look profitable to vectorize, then we keep track of their ordering. During the reordering step we take this new index order into account. This can remove some shuffles in cases like in the lit test. Differential Revision: https://reviews.llvm.org/D125111	2022-05-10 15:25:35 -07:00
Philip Reames	7731935ffc	[riscv] Consolidate logic for SEW/VL operand offset calculations [nfc]	2022-05-10 15:06:26 -07:00
Philip Reames	413052310a	[riscv] Minor style cleanup so that code more obviously matches comments [nfc]	2022-05-10 14:20:26 -07:00
Mike Rice	0dbaef61b5	[OpenMP] Fix mangling for linear modifiers with variable stride This adds support for variable stride with the val, uval, and ref linear modifiers. Previously only the no modifer type ls<argno> was supported. val -> Ls<argno> uval -> Us<argno> ref -> Rs<argno> Differential Revision: https://reviews.llvm.org/D125330	2022-05-10 14:12:44 -07:00
LLVM GN Syncbot	a0f3ef42b0	[gn build] Port `f822db7670`	2022-05-10 21:06:25 +00:00
Mehdi Amini	3ffb08844c	Remove unused variable (fix -Werror build on MSVC)	2022-05-10 21:04:52 +00:00
Jan Korous	0376c0f271	Revert "[utils] Avoid hardcoding metadata ids in update_cc_test_checks" This reverts commit `ce583b14b2`.	2022-05-10 14:04:19 -07:00
Mingming Liu	1555c41abb	Revert "Enhance peephole optimization." This reverts commit `d84ca05ef7`. Will revert, update commit message and re-commit.	2022-05-10 13:59:05 -07:00
Vasileios Porpodas	035aee725c	[SLP][NFC] Precommit a lit test for a followup patch that improves tree reordering for external users. Differential Revision: https://reviews.llvm.org/D125110	2022-05-10 13:47:17 -07:00
Erich Keane	eadeabbe10	[NFC] Replace not-null and not-isa check with a not-isa_and_nonnull	2022-05-10 13:34:07 -07:00
Jim Ingham	63865e1fce	Add the "sent break" message to the "gdb-remote packets" channel It was originally only in "gdb-remote process" but it is convenient to also have it come as part of gdb-remote packets.	2022-05-10 13:28:50 -07:00
Matthias Braun	3bf643eb12	Update test for changes in `f0ea9c9cec` / D124552	2022-05-10 13:25:38 -07:00
Nathan James	a308a55720	[clang-tidy] Fix unintended change left in `12cb540529`	2022-05-10 21:07:31 +01:00
jeff	f822db7670	[AMDGPU] Allow for MFMA Inst Clustering This patch adds cluster edges between independent MFMA instructions. Additionally, it propogates all predecessors of cluster insts to the root of the cluster(s), and all successors to the leaf(ves) of the cluster(s) -- this is done to remove the possibility that those insts will be interspersed within the cluster. Reviewed By: kerbowa Differential Revision: https://reviews.llvm.org/D124678	2022-05-10 12:57:40 -07:00
Erich Keane	b6572ad504	[NFC] Add missing 'break' in a switch case	2022-05-10 12:48:08 -07:00
Mingming Liu	d84ca05ef7	Enhance peephole optimization. Differential Revision: https://reviews.llvm.org/D124118	2022-05-10 12:35:35 -07:00
Erich Keane	508d2b4e13	[NFC]Add Missing Break in switch that we didn't notice because it was last.	2022-05-10 12:27:45 -07:00
jeff	3ff8ee2447	[NFC] Fix typo Reviewed By: kerbowa Differential Revision: https://reviews.llvm.org/D124647	2022-05-10 12:11:21 -07:00
Arthur Eubanks	7e0802aeb5	[BasicAA] Fix order in which we pass MemoryLocations to alias() D98718 caused the order of Values/MemoryLocations we pass to alias() to be significant due to storing the offset in the PartialAlias case. But some callers weren't audited and were still passing swapped arguments, causing the returned PartialAlias offset to be negative in some cases. For example, the newly added unittests would return -1 instead of 1. Fixes #55343, a miscompile. Reviewed By: asbirlea, nikic Differential Revision: https://reviews.llvm.org/D125328	2022-05-10 12:05:38 -07:00
Florian Hahn	17a73992dd	[AArch64] Remove redundant f{min,max}nm intrinsics. The patch extends AArch64TTIImpl::instCombineIntrinsic to simplify llvm.aarch64.neon.f{min,max}nm(a, a) -> a. This helps with simplifying code written using the ACLE, e.g. see https://godbolt.org/z/jYxsoc89c Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D125234	2022-05-10 19:57:43 +01:00
Yaxun (Sam) Liu	180a8536ce	Fix indentation in ReleaseNotes.rst	2022-05-10 14:56:28 -04:00
Nicolas Vasilache	1f23211cb1	[mlir][SCF] Retire `cloneWithNewYields` helper function. This is now subsumed by `replaceLoopWithNewYields`. Differential Revision: https://reviews.llvm.org/D125309	2022-05-10 18:44:11 +00:00
Mahesh Ravishankar	567fd523bf	[mlir][SCF] Add utility method to add new yield values to a loop. The current implementation of `cloneWithNewYields` has a few issues - It clones the loop body of the original loop to create a new loop. This is very expensive. - It performs `erase` operations which are incompatible when this method is called from within a pattern rewrite. All erases need to go through `PatternRewriter`. To address these a new utility method `replaceLoopWithNewYields` is added which - moves the operations from the original loop into the new loop. - replaces all uses of the original loop with the corresponding results of the new loop - use a call back to allow caller to generate the new yield values. - the original loop is modified to just yield the basic block arguments corresponding to the iter_args of the loop. This represents a no-op loop. The loop itself is dead (since all its uses are replaced), but is not removed. The caller is expected to erase the op. Consequently, this method can be called from within a `matchAndRewrite` method of a `PatternRewriter`. The `cloneWithNewYields` could be replaces with `replaceLoopWithNewYields`, but that seems to trigger a failure during walks, potentially due to the operations being moved. That is left as a TODO. Differential Revision: https://reviews.llvm.org/D125147	2022-05-10 18:44:11 +00:00
Alan Zhao	82c5e302f9	[llvm-ml] Implement support for MASM's extern directive The EXTERN keyword defines external symbols in MASM. Credit goes to epastor@ for implementing most of the logic; I (ayzhao@) added some bugfixes and tests. [0]: https://docs.microsoft.com/en-us/cpp/assembler/masm/extern-masm?view=msvc-170 Reviewed By: epastor Submitted By: epastor Differential Revision: https://reviews.llvm.org/D125273	2022-05-10 14:36:06 -04:00
Yaxun (Sam) Liu	afc9d674fe	[CUDA][HIP] support __noinline__ as keyword CUDA/HIP programs use __noinline__ like a keyword e.g. __noinline__ void foo() {} since __noinline__ is defined as a macro __attribute__((noinline)) in CUDA/HIP runtime header files. However, gcc and clang supports __attribute__((__noinline__)) the same as __attribute__((noinline)). Some C++ libraries use __attribute__((__noinline__)) in their header files. When CUDA/HIP programs include such header files, clang will emit error about invalid attributes. This patch fixes this issue by supporting __noinline__ as a keyword, so that CUDA/HIP runtime could remove the macro definition. Reviewed by: Aaron Ballman, Artem Belevich Differential Revision: https://reviews.llvm.org/D124866	2022-05-10 14:32:27 -04:00
Sanjay Patel	0353c2c996	[InstCombine] fold shuffles with FP<->Int cast operands shuffle (cast X), (cast Y), Mask --> cast (shuffle X, Y, Mask) This is similar to a recent transform with fneg ( `b331a7ebc1` ), but this is intentionally the most conservative first step to try to avoid regressions in codegen. There are several restrictions that could be removed as follow-up enhancements. Note that a cast with a unary shuffle is currently canonicalized in the other direction (shuffle after cast - D103038 ). We might want to invert that to be consistent with this patch.	2022-05-10 14:20:43 -04:00
Sanjay Patel	aca33294de	[InstCombine] add tests for shuffles with FP<->int cast operands; NFC	2022-05-10 13:45:32 -04:00
Joseph Huber	045665423e	[OpenMP] Fix embedding offload code when there is no offloading toolchain Summary: We use the `--offload-new-driver` option to enable offload code embedding. The check for when to do this was flawed and was enabling it too early in the case of OpenMP, causing a segfault when dereferencing the offloading toolchain.	2022-05-10 13:33:20 -04:00
Jan Korous	ce583b14b2	[utils] Avoid hardcoding metadata ids in update_cc_test_checks Specifically for: !tbaa, !tbaa.struct, !annotation, !srcloc, !nosanitize. The goal is to avoid test brittleness caused by hardcoded values. Differential Revision: https://reviews.llvm.org/D123273	2022-05-10 10:17:45 -07:00
Matthias Braun	f0ea9c9cec	CodeGenPrepare: Replace constant PHI arguments with switch condition value We often see code like the following after running SCCP: switch (x) { case 42: phi(42, ...); } This tends to produce bad code as we currently materialize the constant phi-argument in the switch-block. This increases register pressure and if the pattern repeats for `n` case statements, we end up generating `n` constant values. This changes CodeGenPrepare to catch this pattern and revert it back to: switch (x) { case 42: phi(x, ...); } Differential Revision: https://reviews.llvm.org/D124552	2022-05-10 10:00:10 -07:00
Matthias Braun	cd19af74c0	Avoid 8 and 16bit switch conditions on x86 This adds a `TargetLoweringBase::getSwitchConditionType` callback to give targets a chance to control the type used in `CodeGenPrepare::optimizeSwitchInst`. Implement callback for X86 to avoid i8 and i16 types where possible as they often incur extra zero-extensions. This is NFC for non-X86 targets. Differential Revision: https://reviews.llvm.org/D124894	2022-05-10 10:00:10 -07:00
Matthias Braun	75e50f70c6	Use update_llc_test_checks for the switch.ll test; add new test - Change `switch.ll` test to a style suitable for `tools/update_llc_test_checks.py`. - Precommit test for upcoming changes: - Add `switch_i8` to `test/CodeGen/X86/switch.ll`. - Add `test/CodeGen/X86/switch-phi-const.ll`. Differential Revision: https://reviews.llvm.org/D124893	2022-05-10 10:00:10 -07:00
Kadir Cetinkaya	3137ca80b9	[clangd] Support for standard inlayHint protocol - Make clangd's internal representation more aligned with the standard. We keep range and extra inlayhint kinds around, but don't serialize them on standard version. - Have custom serialization for extension (ugly, but going to go away). - Support both versions until clangd-17. - Don't advertise extension if client has support for standard implementation. - Log a warning at startup about extension being deprecated, if client doesn't have support. Differential Revision: https://reviews.llvm.org/D125228	2022-05-10 18:59:15 +02:00
Mike Rice	1a02519bc5	[OpenMP] Add mangling support for linear modifiers (ref,uval,val) Add mangling for linear parameters specified with ref, uval, and val for 'omp declare simd' vector functions. Add missing stride for linear this parameters. Differential Revision: https://reviews.llvm.org/D125269	2022-05-10 09:56:55 -07:00
Tsukasa OI	7fe0630fcb	[RISCV] 'K'-extension ordering This commit adds 'K' to supported extension list (before 'J'). It makes "Zk" extensions correctly placed before "Zv" extensions. Multi-letter "Z" extensions are first ordered with the most closely related alphabetical extension category ("IMAF..."). This is represented in LLVM as `AllStdExts' variable in `llvm/lib/Support/RISCVISAInfo.cpp'. However, it did not have 'k' making "Zk" extensions not correctly ordered. Reviewed By: kito-cheng Differential Revision: https://reviews.llvm.org/D124340	2022-05-11 00:52:00 +08:00
Krzysztof Drewniak	814b605095	[mlir][AMDGPU] Add AMDGPU conversion patterns to ConvertGPUToROCDL This ensures that attributes such as the index bitwidth propagate correctly to the AMDGPUToROCDL patterns. Differential Revision: https://reviews.llvm.org/D125320	2022-05-10 16:49:11 +00:00
Konstantin Varlamov	8200e1253f	[libc++][ranges] Implement `views::drop`. The view itself has been implemented previously -- this patch only adds the ability to pipe it. Also finishes the implementation of [P1739](https://wg21.link/p1739) and [LWG3407](https://wg21.link/lwg3407). Differential Revision: https://reviews.llvm.org/D125156	2022-05-10 09:31:05 -07:00
David Green	442c351b2b	Revert "[AArch64] Generate AND in place of CSEL for predicated CTTZ" This reverts commit `7dcd0ea683` due to issues reported postcommit with the correctness of truncated cttzs.	2022-05-10 17:17:03 +01:00
Craig Topper	4b36d9bde7	[CVP] Preserve exact name when converting sext->zext and ashr->lshr. Previously we took the old name and always appended a numberic suffix. Since we're doing a 1:1 replacement, it's clearer to keep the original name exactly. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D125281	2022-05-10 09:13:59 -07:00
Craig Topper	7b362ddda9	[SCCP] Preserve Name when converting SExt->ZExt. This makes the output IR more readable since we're doing a one to one replacement. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D125280	2022-05-10 09:13:59 -07:00
Peter Klausler	940871dd28	[flang] Enforce limit on rank + corank Fortran 2018 requires that a compiler allow objects whose rank + corank is 15, and that's our maximum; detect and diagnose violations. Differential Revision: https://reviews.llvm.org/D125153	2022-05-10 08:53:08 -07:00
Nikita Popov	c813516ef8	[InstCombine] Add additional freeze tests (NFC)	2022-05-10 17:43:48 +02:00
Ivan Kosarev	88f04bdbd8	[AMDGPU][GFX10] Support base+soffset+offset SMEM loads. Also makes a step towards resolving https://github.com/llvm/llvm-project/issues/38652 Reviewed By: foad, dp Differential Revision: https://reviews.llvm.org/D125117	2022-05-10 16:17:14 +01:00
Aaron Ballman	ca75ac5f04	Diagnose unreachable generic selection associations The controlling expression of a _Generic selection expression undergoes lvalue conversion, array conversion, and function conversion before picking the association. This means that array types, function types, and qualified types are all unreachable code if they're used as an association. I've been caught by this twice in the past few months and I figure that if a WG14 member can't seem to remember this rule, users are also likely to struggle with it. So this adds an on-by-default unreachable code diagnostic for generic selection expression associations. Note, we don't have to worry about function types as those are already a constraint violation which generates an error. Differential Revision: https://reviews.llvm.org/D125259	2022-05-10 11:15:56 -04:00
Peter Klausler	71728360ad	[flang] Fold real-valued MODULO() and MOD() Evaluate real-valued references to the intrinsic functions MODULO and MOD at compilation time without recourse to an external math library. Differential Revision: https://reviews.llvm.org/D125151	2022-05-10 08:15:29 -07:00
Louis Dionne	671afac89d	[libc++abi][NFC] Fix typo in comment	2022-05-10 11:15:16 -04:00
Ashay Rane	53ff0daa7e	[mlir] Fail early if AnalysisState::getBuffer() returns failure This patch updates calls to AnalysisState::getBuffer() so that we return early with a failure if the call does not succeed. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D125251	2022-05-10 08:08:38 -07:00

... 2 3 4 5 6 ...

423581 Commits All Branches Search

423581 Commits

All Branches