llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	b9ad55c6d4	[bazel] Fix the build after `18b92c66fe`	2022-07-19 17:34:39 +02:00
Benjamin Kramer	9235fafd6e	[bazel] Remove libraries that don't build anymore after `5e83a5b475` I don't know who uses these python extensions, probably nobody.	2022-07-19 17:13:23 +02:00
Louis Dionne	8711fcae27	[libc++] Treat incomplete features just like other experimental features In particular remove the ability to expel incomplete features from the library at configure-time, since this can now be done through the _LIBCPP_ENABLE_EXPERIMENTAL macro. Also, never provide symbols related to incomplete features inside the dylib, instead provide them in c++experimental.a (this changes the symbols list, but not for any configuration that should have shipped). Differential Revision: https://reviews.llvm.org/D128928	2022-07-19 10:50:20 -04:00
Louis Dionne	7300a651f5	[libc++] Re-apply "Always build c++experimental.a" This re-applies `bb939931a1`, which had been reverted by `09cebfb978` because it broke Chromium. The issues seen by Chromium should be addressed by `1d0f79558c`. Differential Revision: https://reviews.llvm.org/D128927	2022-07-19 10:44:19 -04:00
Louis Dionne	1d0f79558c	[libc++] Make sure cxx_experimental links against libc++ headers This should fix builds where we build neither the static nor the shared library.	2022-07-19 10:41:36 -04:00
Nicolai Hähnle	3d2256cc75	Revert "Update some more tests with update_cc_test_checks.py" This reverts commit `9fb33d52b0`. Buildbots are showing a number of regressions that don't reproduce locally. Needs more investigating.	2022-07-19 16:39:57 +02:00
Arnold Schwaighofer	bc4870f09e	[coro async] Add missing llvm.coro.id.async intrinsic to declaresCoroCleanupIntrinsics rdar://97214593 Differential Revision: https://reviews.llvm.org/D130038	2022-07-19 07:25:04 -07:00
Daniil Dudkin	ea1cdb58cc	[flang][NFC] Drop `AbstractResultOptions` structure `AbstractResultOptions` is an obsolete structure because `newArg` is used only in `ReturnOpConversion`. This change removes this struct, making dependencies of conversions more straightforward. Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D129485	2022-07-19 17:23:09 +03:00
Nicolai Hähnle	9fb33d52b0	Update some more tests with update_cc_test_checks.py	2022-07-19 16:18:35 +02:00
Joe Nash	b28bb8cc9c	[AMDGPU] Remove old operand from VOPC DPP For most DPP instructions, the old operand stores the value that was in the current lane before the DPP operation, and is tied to the destination. For VOPC DPP, this is unnecessary and incorrect. There appears to have been a latent bug related to D122737 with SIInstrInfo::isOperandLegal. If you checked whether a register operand was legal when the InstructionDesc expected an immediate, it reported that it was valid. The fix for that bug is necessary for, and tested in, this patch. Reviewed By: foad, rampitec Differential Revision: https://reviews.llvm.org/D130040	2022-07-19 09:35:05 -04:00
Andrew Turner	b850762b62	Add the FreeBSD AArch64 memory layout Use the FreeBSD AArch64 memory layout values when building for it. These are based on the x86_64 values, scaled to take into account the larger address space on AArch64. Reviewed by: vitalybuka Differential Revision: https://reviews.llvm.org/D125883	2022-07-19 09:58:07 -04:00
Andrew Turner	e13bd2644e	Add the FreeBSD AArch64 shadow offset to llvm AArch64 has a larger address space than 64-bit x86. Use the larger shadow offset on FreeBSD AArch64. Reviewed by: vitalybuka Differential Revision: https://reviews.llvm.org/D125873	2022-07-19 09:58:07 -04:00
Andrew Turner	1b2fa907d7	Add the FreeBSD AArch64 memory layout Use the FreeBSD AArch64 memory layout values when building for it. These are based on the x86_64 values, scaled to take into account the larger address space on AArch64. Reviewed by: vitalybuka Differential Revision: https://reviews.llvm.org/D125758	2022-07-19 09:58:04 -04:00
Dmitry Vyukov	6d1f86095d	tsan: optimize DenseSlabAlloc If lots of threads do lots of malloc/free and they overflow per-pthread DenseSlabAlloc cache, it causes lots of contention: 31.97% race.old race.old [.] __sanitizer::StaticSpinMutex::LockSlow 17.61% race.old race.old [.] __tsan_read4 10.77% race.old race.old [.] __tsan::SlotLock Optimize DenseSlabAlloc to use a lock-free stack of batches of nodes. This way we don't take any locks in steady state at all and do only 1 push/pop per Refill/Drain. Effect on the added benchmark: $ TIME="%e %U %S %M" time ./test.old 36 5 2000000 34.51 978.22 175.67 5833592 32.53 891.73 167.03 5790036 36.17 1005.54 201.24 5802828 36.94 1004.76 226.58 5803188 $ TIME="%e %U %S %M" time ./test.new 36 5 2000000 26.44 720.99 13.45 5750704 25.92 721.98 13.58 5767764 26.33 725.15 13.41 5777936 25.93 713.49 13.41 5791796 Reviewed By: melver Differential Revision: https://reviews.llvm.org/D130002	2022-07-19 15:42:57 +02:00
Simon Pilgrim	71c502cbca	[DAG] Call SimplifyDemandedBits from ISD::MUL nodes Noticed while triaging D129765.	2022-07-19 14:11:04 +01:00
William Schmidt	bccc9aa81c	Don't vectorize PHIs in catchswitch blocks We currently assert in vectorizeTree(TreeEntry*) when processing a PHI bundle in a block containing a catchswitch. We attempt to set the IRBuilder insertion point following the catchswitch, which is invalid. This is done so that ShuffleBuilder.finalize() knows where to insert a shuffle if one is needed. To avoid this occurring, watch out for catchswitch blocks during buildTree_rec() processing, and avoid adding PHIs in such blocks to the vectorizable tree. It is unlikely that constraining vectorization over an exception path will cause a noticeable performance loss, so this seems preferable to trying to anticipate when a shuffle will and will not be required.	2022-07-19 06:10:17 -07:00
Nikita Popov	08860f525a	[Local] Allow creating callbr with duplicate successors Since D129288, callbr is allowed to have duplicate successors. This patch removes a limitation which prevents optimizations from actually producing such callbrs. Differential Revision: https://reviews.llvm.org/D129997	2022-07-19 14:28:22 +02:00
Alexey Lapshin	4539b44148	[Reland][Debuginfo][llvm-dwarfutil] llvm-dwarfutil dsymutil-like tool for ELF. This patch implements proposal https://lists.llvm.org/pipermail/llvm-dev/2020-August/144579.html llvm-dwarfutil - is a tool that is used for processing debug info(DWARF) located in built binary files to improve debug info quality, reduce debug info size. The patch currently implements smaller set of command-line options(comparing to the proposal): ``` ./llvm-dwarfutil [options] <input file> <output file> --garbage-collection Do garbage collection for debug info(default) -j <value> Alias for --num-threads --no-garbage-collection Don`t do garbage collection for debug info --no-odr-deduplication Don`t do ODR deduplication for debug types --no-odr Alias for --no-odr-deduplication --no-separate-debug-file Create single output file, containing debug tables(default) --num-threads <threads> Number of available threads for multi-threaded execution. Defaults to the number of cores on the current machine --odr-deduplication Do ODR deduplication for debug types(default) --odr Alias for --odr-deduplication --separate-debug-file Create two output files: file w/o debug tables and file with debug tables --tombstone [bfd,maxpc,exec,universal] Tombstone value used as a marker of invalid address(default: universal) =bfd - Zero for all addresses and [1,1] for DWARF v4 (or less) address ranges and exec =maxpc - Minus 1 for all addresses and minus 2 for DWARF v4 (or less) address ranges =exec - Match with address ranges of executable sections =universal - Both: bfd and maxpc ``` Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D86539	2022-07-19 15:11:36 +03:00
serge-sans-paille	90af9b4111	[flang] Fix flang-to-external-fc --version Substitution of @FLANG_VERSION@ wasn't correctly performed. Differential Revision: https://reviews.llvm.org/D130074	2022-07-19 14:10:46 +02:00
Evgeniy Brevnov	8f90edeb55	Additional regression test for a crash during reorder masked gather nodes	2022-07-19 19:03:53 +07:00
Nicolas Vasilache	18b92c66fe	[mlir][Linalg] Add a TileToForeachThread transform. This revision adds a new transformation to tile a TilingInterface `op` to a tiled `scf.foreach_thread`, applying tiling by `num_threads`. If non-empty, the `threadDimMapping` is added as an attribute to the resulting `scf.foreach_thread`. 0-tile sizes (i.e. tile by the full size of the data) are used to encode that a dimension is not tiled. Differential Revision: https://reviews.llvm.org/D129577	2022-07-19 04:56:11 -07:00
Benjamin Kramer	8aff88fd3a	[LegalizeDAG] Propagate alignment in ExpandExtractFromVectorThroughStack Unlike the name suggests this can reuse any store as a base for a memory-based vector extract. If that store is underaligned the loads created to extract will have an invalid alignment. Since most CPUs are forgiving wrt alignment this is almost never an issue, on x86 this is only reproducible by extracting a 128 bit vector out of a wider vector. I tried making a test case in the context of https://reviews.llvm.org/D127982 but it's really really fragile, as the output pretty much looks like a missed optimization.	2022-07-19 13:13:55 +02:00
David Green	6cb9529001	[ARM] Remove VBICimm if no cleared bits are demanded If none of the bits of a VBICimm are demanded, we can remove the node entirely using the input operand instead. Differential Revision: https://reviews.llvm.org/D129966	2022-07-19 11:53:47 +01:00
Florian Hahn	a75760a269	[LV] Remove unnecessary cast in widenCallInstruction. (NFC)	2022-07-19 11:23:24 +01:00
Simon Pilgrim	2c488a6b35	Fix signed/unsigned comparison mismatch warning	2022-07-19 11:13:31 +01:00
Simon Pilgrim	0f6b0461b0	[DAG] SimplifyDemandedBits - relax "xor (X >> ShiftC), XorC --> (not X) >> ShiftC" to match only demanded bits The "xor (X >> ShiftC), XorC --> (not X) >> ShiftC" fold is currently limited to the XOR mask being a shifted all-bits mask, but we can relax this to only need to match under the demanded bits. This helps expose more bit extraction/clearing patterns and fixes the PowerPC testCompares*.ll regressions from D127115 Alive2: https://alive2.llvm.org/ce/z/fl7T7K Differential Revision: https://reviews.llvm.org/D129933	2022-07-19 10:59:07 +01:00
Abinav Puthan Purayil	9fa425c1ab	[AMDGPU] Set amdgpu-memory-bound if a basic block has dense global memory access AMDGPUPerfHintAnalysis doesn't set the memory bound attribute if FuncInfo::InstCost outweighs MemInstCost even if we have a basic block with relatively high global memory access. GCNSchedStrategy could revert optimal scheduling in favour of occupancy which seems to degrade performance for some kernels. This change introduces the HasDenseGlobalMemAcc metric in the heuristic that makes the analysis more conservative in these cases. This fixes SWDEV-334259/SWDEV-343932 Differential Revision: https://reviews.llvm.org/D129759	2022-07-19 15:16:28 +05:30
Abinav Puthan Purayil	4baf8f092b	[AMDGPU] Pre-commit tests for D129759 Differential Revision: https://reviews.llvm.org/D129760	2022-07-19 15:16:27 +05:30
David Spickett	5d14873249	[llvm][AArch64] Add missing FPCR, H and B registers to Codeview mapping Fixes https://github.com/llvm/llvm-project/issues/56484 H registers are 16 bit views of AArch64's Neon registers and B are the 8 bit views. msvc does not support 16 bit float (some mention in DirectX but I couldn't find a way to get to it) so for lack of a better reference I'm using: `85c9b41b33/server/references/dia/include/cvconst.h` (the other microsoft-pdb repo is no longer up to date) Luckily clang does support fp16 so a test is added for that. There is no 8 bit float type so I had to get creative with the test case. We're not testing for correct debug info here just that we can select the B register and not crash in the process. For FPCR it's never going to be passed as an argument so I've not added a test for it. It is included to keep our list looking the same as the reference. Reviewed By: majnemer Differential Revision: https://reviews.llvm.org/D129774	2022-07-19 09:33:13 +00:00
Alexey Lapshin	e717f91c96	Revert "[Debuginfo][llvm-dwarfutil] llvm-dwarfutil dsymutil-like tool for ELF." This reverts commit `e2147c26bd`.	2022-07-19 12:17:47 +03:00
Markus Böck	61394636f0	[mlir] Ignore effects on allocated results when checking whether the op is trivially dead. In the current state, this is only special cased for Allocation effects, but any effects on results allocated by the operation may be ignored when checking whether the op may be removed, as none of them are possible to be observed if the result is unused. A use case for this is for IRs for languages which always initialize on allocation. To correctly model such operations, a Write as well as an Allocation effect should be placed on the result. This would prevent the Op from being deleted if unused however. This patch fixes that issue. Differential Revision: https://reviews.llvm.org/D129854	2022-07-19 10:58:25 +02:00
Max Kazantsev	82309831c3	[LoopSimplifyCFG] Prevent use-def dominance breach by handling dead exits. PR56243 One of the transforms in LoopSimplifyCFG demands that the LCSSA form is truly maintained for all values, tokens included, otherwise it may end up creating a use that is not dominated by def (and Phi creation for tokens is impossible). Detect this situation and prevent transform for it early. Differential Revision: https://reviews.llvm.org/D129984 Reviewed By: efriedma	2022-07-19 15:54:12 +07:00
Jason Molenda	2149455cdc	Update docs to note lzfse open source implementation	2022-07-19 01:40:40 -07:00
Alexey Lapshin	e2147c26bd	[Debuginfo][llvm-dwarfutil] llvm-dwarfutil dsymutil-like tool for ELF. This patch implements proposal https://lists.llvm.org/pipermail/llvm-dev/2020-August/144579.html llvm-dwarfutil - is a tool that is used for processing debug info(DWARF) located in built binary files to improve debug info quality, reduce debug info size. The patch currently implements smaller set of command-line options(comparing to the proposal): ``` ./llvm-dwarfutil [options] <input file> <output file> --garbage-collection Do garbage collection for debug info(default) -j <value> Alias for --num-threads --no-garbage-collection Don`t do garbage collection for debug info --no-odr-deduplication Don`t do ODR deduplication for debug types --no-odr Alias for --no-odr-deduplication --no-separate-debug-file Create single output file, containing debug tables(default) --num-threads <threads> Number of available threads for multi-threaded execution. Defaults to the number of cores on the current machine --odr-deduplication Do ODR deduplication for debug types(default) --odr Alias for --odr-deduplication --separate-debug-file Create two output files: file w/o debug tables and file with debug tables --tombstone [bfd,maxpc,exec,universal] Tombstone value used as a marker of invalid address(default: universal) =bfd - Zero for all addresses and [1,1] for DWARF v4 (or less) address ranges and exec =maxpc - Minus 1 for all addresses and minus 2 for DWARF v4 (or less) address ranges =exec - Match with address ranges of executable sections =universal - Both: bfd and maxpc ``` Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D86539	2022-07-19 11:18:36 +03:00
Cullen Rhodes	f7b2d4aac6	[AArch64] Add patterns to fold zext(cmpeq(x, splat(0))) Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D129626	2022-07-19 08:14:38 +00:00
Xiang1 Zhang	4bb19de4b6	[X86] Add 64 bit implement for __SSC_MARK Reviewed By: craig.topper, pengfei.wang, jinsong Differential Revision: https://reviews.llvm.org/D129826	2022-07-19 16:13:41 +08:00
Nikita Popov	534b9246a2	[LoopInfo] Allow cloning of callbr After D129288, callbr is safe to clone without special handling. This permits optimizations like loop unroll and loop unswitch on loops containing callbrs. Fixes https://github.com/llvm/llvm-project/issues/41834. Differential Revision: https://reviews.llvm.org/D129993	2022-07-19 09:57:28 +02:00
Haojian Wu	d489b3807f	[pseudo] Implement a guard to determine function declarator. This eliminates some simple-declaration/function-definition false parses. - implement a function to determine whether a declarator ForestNode is a function declarator; - extend the standard declarator to two guarded function-declarator and non-function-declarator nonterminals; Differential Revision: https://reviews.llvm.org/D129222	2022-07-19 09:44:45 +02:00
Rosie Sumpter	05d424d165	[AArch64][SVE] Fold fadda(ptrue, x, select(mask, y, -0.0)) into fadda(mask, x, y) This patch adds an SVE pattern to recognize the use of a select with an fadda in the form fadda(ptrue, x, select(mask, y, -0.0)). In this case the select can be folded away, with the select mask used as the predicate for fadda. This improves the codegen when vectorizing loops with ordered fp reductions. Differential Revision: https://reviews.llvm.org/D129623	2022-07-19 08:31:51 +01:00
Matthias Springer	106d695287	[mlir][sparse][NFC] Update remaining test cases No more to_memref, memref.alloc or memref.dealloc when possible. Differential Revision: https://reviews.llvm.org/D130023	2022-07-19 09:21:10 +02:00
Matthias Springer	27a431f5e9	[mlir][bufferization][NFC] Move sparse_tensor.release to bufferization dialect This op used to belong to the sparse dialect, but there are use cases for dense bufferization as well. (E.g., when a tensor alloc is returned from a function and should be deallocated at the call site.) This change moves the op to the bufferization dialect, which now has an `alloc_tensor` and a `dealloc_tensor` op. Differential Revision: https://reviews.llvm.org/D129985	2022-07-19 09:18:19 +02:00
Nicolai Hähnle	5fc6213551	Revert change to clang/test/CodeGen/arm_acle.c For some reason, update_cc_test_checks.py produced a failing test. Partial revert of `301011fa60`	2022-07-19 09:11:23 +02:00
serge-sans-paille	eb0e3319bf	[sanitizer] Don't call dlerror() after swift_demangle lookup through dlsym Because the call to `dlerror()` may actually want to print something, which turns into a deadlock as showcased in #49223. Instead rely on further call to dlsym to clear `dlerror` internal state if they need to check the return status. Differential Revision: https://reviews.llvm.org/D128992	2022-07-19 09:07:30 +02:00
serge-sans-paille	a2ac383b44	[llvm] Fix forward declaration in Support/JSON.h Some methods of json::Array require json::Value to be completely defined, so they can't be defined in-class. Fix that by defining them out of class. Fix #55780	2022-07-19 09:07:29 +02:00
Bing1 Yu	af09127c94	[X86][NFC] avx512-f16c-v16f16-fadd.ll avx512-skx-v32f16-fadd.ll - add nounwind to prevent cfi noise on tests	2022-07-19 15:00:47 +08:00
Nicolai Hähnle	301011fa60	Rerun ./utils/update_cc_test.py on a bunch of tests Due to update script changes; this reduces the size of a later "real" diff.	2022-07-19 08:53:05 +02:00
Max Kazantsev	51f837a680	[NFC] Introduce API to detect tokens penetrating LCSSA form Following discussion in PR56243, we need to somehow detect the situation when token values penetrate LCSSA form for transforms that require that it is maintained by all values (for example, to sustain use-def dominance invariants). This patch introduces a parameter to LCSSA checkers to control their ignorance about tokens. Differential Revision: https://reviews.llvm.org/D129983 Reviewed By: efriedma	2022-07-19 13:52:30 +07:00
LLVM GN Syncbot	5114e2c50a	[gn build] Port `8ed702b83f`	2022-07-19 06:42:58 +00:00
Max Kazantsev	69b284aaf6	Revert "[DAGCombiner] Teach scalarizeBinOpOfSplats handle scalable splat." This reverts commit `58dfaaaace`. Massive AARCH test failures in buildbot.	2022-07-19 13:41:52 +07:00
Bing1 Yu	e01bf5a3e2	[X86] Promote v32f16's fadd into v32f32's fadd when it is avx512 without avx512fp16 Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D130059	2022-07-19 14:37:50 +08:00

430548 Commits
