forked from OSchip/llvm-project
95adf5258f
This optimization is unstable at this moment; it 1) block us on a very important application 2) PR15200 3) test6 and test7 in test/Transforms/ScalarRepl/dynamic-vector-gep.ll (the CHECK command compare the output against wrong result) I personally believe this optimization should not have any impact on the autovectorized code, as auto-vectorizer is supposed to put gather/scatter in a "right" way. Although in theory downstream optimizaters might reveal some gather/scatter optimization opportunities, the chance is quite slim. For the hand-crafted vectorizing code, in term of redundancy elimination, load-CSE, copy-propagation and DSE can collectively achieve the same result, but in much simpler way. On the other hand, these optimizers are able to improve the code in a incremental way; in contrast, SROA is sort of all-or-none approach. However, SROA might slighly win in stack size, as it tries to figure out a stretch of memory tightenly cover the area accessed by the dynamic index. rdar://13174884 PR15200 llvm-svn: 178912 |
||
---|---|---|
.. | ||
2003-05-29-ArrayFail.ll | ||
2003-09-12-IncorrectPromote.ll | ||
2003-10-29-ArrayProblem.ll | ||
2006-11-07-InvalidArrayPromote.ll | ||
2007-05-29-MemcpyPreserve.ll | ||
2007-11-03-bigendian_apint.ll | ||
2008-01-29-PromoteBug.ll | ||
2008-02-28-SubElementExtractCrash.ll | ||
2008-06-05-loadstore-agg.ll | ||
2008-06-22-LargeArray.ll | ||
2008-08-22-out-of-range-array-promote.ll | ||
2008-09-22-vector-gep.ll | ||
2009-02-02-ScalarPromoteOutOfRange.ll | ||
2009-02-05-LoadFCA.ll | ||
2009-03-04-MemCpyAlign.ll | ||
2009-12-11-NeonTypes.ll | ||
2010-01-18-SelfCopy.ll | ||
2011-05-06-CapturedAlloca.ll | ||
2011-06-08-VectorExtractValue.ll | ||
2011-06-17-VectorPartialMemset.ll | ||
2011-09-22-PHISpeculateInvoke.ll | ||
2011-10-11-VectorMemset.ll | ||
2011-10-22-VectorCrash.ll | ||
2011-11-11-EmptyStruct.ll | ||
AggregatePromote.ll | ||
DifferingTypes.ll | ||
address-space.ll | ||
arraytest.ll | ||
badarray.ll | ||
basictest.ll | ||
bitfield-sroa.ll | ||
copy-aggregate.ll | ||
crash.ll | ||
debuginfo-preserved.ll | ||
inline-vector.ll | ||
lifetime.ll | ||
lit.local.cfg | ||
load-store-aggregate.ll | ||
memcpy-align.ll | ||
memset-aggregate-byte-leader.ll | ||
memset-aggregate.ll | ||
negative-memset.ll | ||
nonzero-first-index.ll | ||
not-a-vector.ll | ||
only-memcpy-uses.ll | ||
phi-cycle.ll | ||
phi-select.ll | ||
phinodepromote.ll | ||
select_promote.ll | ||
sroa-fca.ll | ||
sroa_two.ll | ||
union-fp-int.ll | ||
union-packed.ll | ||
union-pointer.ll | ||
vector_memcpy.ll | ||
vector_promote.ll | ||
vectors-with-mismatched-elements.ll | ||
volatile.ll |