llvm-project

History

Philip Reames b06a2ad94f [LoopVectorizer] Lower uniform loads as a single load (instead of relying on CSE) A uniform load is one which loads from a uniform address across all lanes. As currently implemented, we cost model such loads as if we did a single scalar load + a broadcast, but the actual lowering replicates the load once per lane. This change tweaks the lowering to use the REPLICATE strategy by marking such loads (and the computation leading to their memory operand) as uniform after vectorization. This is a useful change in itself, but it's real purpose is to pave the way for a following change which will generalize our uniformity logic. In review discussion, there was an issue raised with coupling cost modeling with the lowering strategy for uniform inputs. The discussion on that item remains unsettled and is pending larger architectural discussion. We decided to move forward with this patch as is, and revise as warranted once the bigger picture design questions are settled. Differential Revision: https://reviews.llvm.org/D91398		2020-11-23 15:32:17 -08:00
..
Analysis	Port -print-memderefs to NPM	2020-11-23 11:56:22 -08:00
Assembler	OpaquePtr: Make byval/sret types mandatory	2020-11-20 21:23:33 -05:00
Bindings	C API: support scalable vectors	2020-10-28 18:19:34 -04:00
Bitcode	OpaquePtr: Bulk update tests to use typed sret	2020-11-20 17:58:26 -05:00
BugPoint	…
CodeGen	Reapply "[CodeGen] [WinException] Only produce handler data at the end of the function if needed"	2020-11-23 23:17:03 +02:00
DebugInfo	[DebugInfo] Refactor code for emitting DWARF expressions for FP constants	2020-11-23 09:59:07 +01:00
Demangle	…
Examples	…
ExecutionEngine	[JITLink][ELF] Omit temporary labels in tests	2020-11-04 10:03:15 +00:00
Feature	OpaquePtr: Bulk update tests to use typed sret	2020-11-20 17:58:26 -05:00
FileCheck	[FileCheck] Use %ProtectFileCheckOutput in allow-unused-prefixes.txt	2020-11-05 07:08:20 -08:00
Instrumentation	OpaquePtr: Bulk update tests to use typed byval	2020-11-20 14:00:46 -05:00
Integer	…
JitListener	[MCJIT] Profile the code generated by MCJIT engine using Intel VTune profiler	2020-11-16 19:28:14 +11:00
LTO	…
Linker	OpaquePtr: Bulk update tests to use typed sret	2020-11-20 17:58:26 -05:00
MC	[AMDGPU][MC] Improved diagnostic messages	2020-11-23 16:15:05 +03:00
MachineVerifier	…
Object	[lib/Object] - Generalize the RelocationResolver API.	2020-11-20 10:32:49 +03:00
ObjectYAML	Reland "[lib/Support/YAMLTraits] - Don't print leading zeroes when dumping Hex8/Hex16/Hex32 types." (https://reviews.llvm.org/D90930 ).	2020-11-18 13:08:46 +03:00
Other	[test] Pin tests using -dot-callgraph to legacy PM	2020-11-23 11:48:59 -08:00
Reduce	[llvm-reduce] Add reduction for special globals like llvm.used.	2020-11-11 11:25:05 +00:00
SafepointIRVerifier	…
Support	…
SymbolRewriter	…
TableGen	[TableGen] Enhance the six comparison bang operators.	2020-11-13 09:57:27 -05:00
ThinLTO/X86	[test] Fix unused FileCheck prefix in ThinLTO test	2020-11-02 09:06:36 -08:00
Transforms	[LoopVectorizer] Lower uniform loads as a single load (instead of relying on CSE)	2020-11-23 15:32:17 -08:00
Unit	…
Verifier	Verifier: Fix assert when verifying non-pointer byval or preallocated	2020-11-20 20:08:43 -05:00
YAMLParser	…
tools	[llvm-elfabi] Emit ELF header and string table sections	2020-11-23 12:18:58 -08:00
.clang-format	…
CMakeLists.txt	…
TestRunner.sh	…
lit.cfg.py	Make test/tools/llvm-dlltool/tool-name.test pass, and make it run	2020-11-03 11:59:15 -05:00
lit.site.cfg.py.in	…