llvm-project

History

Sanjay Patel df14bd315d [SLP] respect target register width for GEP vectorization (PR43578) We failed to account for the target register width (max vector factor) when vectorizing starting from GEPs. This causes vectorization to proceed to obviously illegal widths as in: https://bugs.llvm.org/show_bug.cgi?id=43578 For x86, this also means that SLP can produce rogue AVX or AVX512 code even when the user specifies a narrower vector width. The AArch64 test in ext-trunc.ll appears to be better using the narrower width. I'm not exactly sure what getelementptr.ll is trying to do, but it's testing with "-slp-threshold=-18", so I'm not worried about those diffs. The x86 test is an over-reduction from SPEC h264; this patch appears to restore the perf loss caused by SLP when using -march=haswell. Differential Revision: https://reviews.llvm.org/D68667 llvm-svn: 374183		2019-10-09 16:32:49 +00:00
..
AArch64	[SLP] respect target register width for GEP vectorization (PR43578)	2019-10-09 16:32:49 +00:00
AMDGPU	[LAA] Re-check bit-width of pointers after stripping.	2019-07-18 17:30:27 +00:00
ARM	Revert "Temporarily Revert "Add basic loop fusion pass.""	2019-04-17 04:52:47 +00:00
NVPTX	Revert "Temporarily Revert "Add basic loop fusion pass.""	2019-04-17 04:52:47 +00:00
PowerPC	Revert "Temporarily Revert "Add basic loop fusion pass.""	2019-04-17 04:52:47 +00:00
SystemZ	[lit] Delete empty lines at the end of lit.local.cfg NFC	2019-06-17 09:51:07 +00:00
X86	[SLP] respect target register width for GEP vectorization (PR43578)	2019-10-09 16:32:49 +00:00
XCore	Revert "Temporarily Revert "Add basic loop fusion pass.""	2019-04-17 04:52:47 +00:00
int_sideeffect.ll	Revert "Temporarily Revert "Add basic loop fusion pass.""	2019-04-17 04:52:47 +00:00