llvm-project

History

Simon Pilgrim 5965680d53 [X86][SSE] Vectorized i8 and i16 shift operators This patch ensures that SHL/SRL/SRA shifts for i8 and i16 vectors avoid scalarization. It builds on the existing i8 SHL vectorized implementation of moving the shift bits up to the sign bit position and separating the 4, 2 & 1 bit shifts with several improvements: 1 - SSE41 targets can use (v)pblendvb directly with the sign bit instead of performing a comparison to feed into a VSELECT node. 2 - pre-SSE41 targets were masking + comparing with an 0x80 constant - we avoid this by using the fact that a set sign bit means a negative integer which can be compared against zero to then feed into VSELECT, avoiding the need for a constant mask (zero generation is much cheaper). 3 - SRA i8 needs to be unpacked to the upper byte of a i16 so that the i16 psraw instruction can be correctly used for sign extension - we have to do more work than for SHL/SRL but perf tests indicate that this is still beneficial. The i16 implementation is similar but simpler than for i8 - we have to do 8, 4, 2 & 1 bit shifts but less shift masking is involved. SSE41 use of (v)pblendvb requires that the i16 shift amount is splatted to both bytes however. Tested on SSE2, SSE41 and AVX machines. Differential Revision: http://reviews.llvm.org/D9474 llvm-svn: 239509		2015-06-11 07:46:37 +00:00
..
AssumptionCache	[PM] Actually add the new pass manager support for the assumption cache.	2015-01-22 21:53:09 +00:00
BasicAA	Revert r236894 "[BasicAA] Fix zext & sext handling"	2015-05-22 01:27:37 +00:00
BlockFrequencyInfo	[opaque pointer type] Add textual IR support for explicit type parameter to the call instruction	2015-04-16 23:24:18 +00:00
BranchProbabilityInfo	Fix information loss in branch probability computation.	2015-05-07 17:22:06 +00:00
CFLAliasAnalysis	[opaque pointer type] Add textual IR support for explicit type parameter to gep operator	2015-03-13 18:20:45 +00:00
CallGraph	[opaque pointer type] Add textual IR support for explicit type parameter to the call instruction	2015-04-16 23:24:18 +00:00
CostModel	[X86][SSE] Vectorized i8 and i16 shift operators	2015-06-11 07:46:37 +00:00
Delinearization	Fix a type mismatch assert in SCEV division	2015-04-22 15:06:40 +00:00
DependenceAnalysis	[DependenceAnalysis] Extend unifySubscriptType for handling coupled subscript groups.	2015-05-29 16:58:08 +00:00
DivergenceAnalysis/NVPTX	Divergence analysis for GPU programs	2015-04-10 05:03:50 +00:00
Dominators	[opaque pointer type] Add textual IR support for explicit type parameter to getelementptr instruction	2015-02-27 19:29:02 +00:00
GlobalsModRef	[opaque pointer type] Add textual IR support for explicit type parameter to the call instruction	2015-04-16 23:24:18 +00:00
LazyCallGraph	[opaque pointer type] Add textual IR support for explicit type parameter to the call instruction	2015-04-16 23:24:18 +00:00
Lint	Make llvm.eh.begincatch use an outparam	2015-03-03 17:41:09 +00:00
LoopAccessAnalysis	[LAA] Fix estimation of number of memchecks	2015-06-08 10:27:06 +00:00
LoopInfo	[PM] Port LoopInfo to the new pass manager, adding both a LoopAnalysis	2015-01-20 10:58:50 +00:00
MemoryDependenceAnalysis	[opaque pointer type] Add textual IR support for explicit type parameter to load instruction	2015-02-27 21:17:42 +00:00
PostDominators	FileCheck-ize tests.	2013-08-22 00:51:19 +00:00
RegionInfo	[tests] Cleanup initialization of test suffixes.	2013-08-16 00:37:11 +00:00
ScalarEvolution	[opaque pointer type] Add textual IR support for explicit type parameter to the call instruction	2015-04-16 23:24:18 +00:00
ScopedNoAliasAA	[Testsuite] Renumber metadata in ScopedNoAliasAA test to match CHECK lines	2015-05-11 09:10:14 +00:00
TypeBasedAliasAnalysis	[opaque pointer type] Add textual IR support for explicit type parameter to gep operator	2015-03-13 18:20:45 +00:00
ValueTracking	Minor refactoring of GEP handling in isDereferenceablePointer	2015-06-08 11:58:13 +00:00