llvm-project/llvm/test
Philip Reames 4d87591028 [RISCV] Use VScaleForTuning in costing of operations whose cost depends on VL
On known hardware, reductions, gather, and scatter operations have execution latencies which correlated with the vector length (VL) of the operation. Most other operations (e.g. simply arithmetic) don't correlated in this way, and instead essentially fixed cost as VL varies.

When I'd implemented initial scalable cost model support for reductions, gather, and scatter operations, I had used an upper bound on the statically unknown VL. The argument at the time was that this prevented falsely low costs, and biased the vectorizer away from generating bad (on some hardware) code. Unfortunately, practical experience shows we were a bit too effective at that goal, and the high costs defacto prevents vectorization using these constructs at all.

This patch reverses course, and ties the returned cost not to the maximum possible VL, but the VL which would correspond to VScaleForTuning. This parameter is the same one the vectorizer uses when normalizing loop costs, so the term effectively cancels out. The result is that the vectorizer now sees these constructs as comparable in cost to their fixed length variants.

This does introduce the possibility of the cost for these operations being a significant under estimate on platforms where actual VLEN is far from that implied by VScaleForTuning. On such platforms, we might make poor heuristic choices. Probably not in LV itself (due to the cancellation mentioned above), but possibly during e.g. lowering. I'm not currently aware of any concrete examples of this, but this patch does open a concern which did not previously exist.

Previously, we had the problem of overestimating costs causing the same problem on machines much closer to default values for vscale for tuning. With this patch, we still have that problem potentially if vscale for tuning is set high (manually), and then the code is run on a narrow VLEN machine.

Differential Revision: https://reviews.llvm.org/D131519
2022-08-18 13:10:03 -07:00
..
Analysis [RISCV] Use VScaleForTuning in costing of operations whose cost depends on VL 2022-08-18 13:10:03 -07:00
Assembler
Bindings
Bitcode [InstrProf] Add the skipprofile attribute 2022-08-04 08:45:27 -07:00
BugPoint
CodeGen [SVE] Extend findMoreOptimalIndexType so BUILD_VECTORs do not force 64bit indices. 2022-08-18 18:00:53 +01:00
DebugInfo Untangle the mess which is MachineBasicBlock::hasAddressTaken(). 2022-08-16 16:15:44 -07:00
Demangle
Examples [Examples] Fix SimplifyCFG example 2022-08-01 07:22:35 +02:00
ExecutionEngine [JITLink][MachO] Add support for non-subsections-via-symbols objects. 2022-08-17 15:55:42 -07:00
Feature [WinEH] Apply funclet operand bundles to nounwind intrinsics that lower to function calls in the course of IR transforms 2022-07-26 17:52:43 +02:00
FileCheck
Instrumentation [MSAN] add flag to suppress storage of stack variable names with -sanitize-memory-track-origins 2022-08-12 11:59:53 -07:00
Integer
JitListener
LTO [libLTO] Set data-sections by default in libLTO. 2022-07-27 09:39:39 -05:00
Linker
MC [AMDGPU][MC][GFX11][NFC] Add missing tests for DS opcodes 2022-08-18 12:52:44 +03:00
MachineVerifier
Object [llvm-objdump --macho] Rename --dyld_info to --dyld-info 2022-08-17 12:58:29 -04:00
ObjectYAML
Other [SimpleLoopUnswitch] Skip non-trivial unswitching of cold loops 2022-08-08 18:12:04 +00:00
SafepointIRVerifier
Support
SymbolRewriter
TableGen [TableGen] Add a location for a class definition that was forward-declared 2022-07-20 15:56:17 +02:00
ThinLTO/X86 Add switch to use "source_filename" instead of a hash ID for globally promoted local 2022-08-03 16:41:56 -07:00
Transforms [RISCV] Use VScaleForTuning in costing of operations whose cost depends on VL 2022-08-18 13:10:03 -07:00
Unit [test] Propagate HWASAN_OPTIONS 2022-08-17 18:59:49 -07:00
Verifier [NFC] Combine test cases of verifier 2022-08-14 10:12:16 +08:00
YAMLParser
tools [NFC][AArch64] precommit sched model for tsv110 2022-08-18 17:33:45 +08:00
.clang-format
CMakeLists.txt
TestRunner.sh
lit.cfg.py [llvm-ar] Add object mode option -X for AIX 2022-07-22 09:55:21 -04:00
lit.site.cfg.py.in [llvm] add zstd to `llvm::compression` namespace 2022-07-19 10:54:36 -07:00