llvm-project/llvm/test/Transforms/InstCombine/X86
Craig Topper 4dccffc84a [X86] Change signatures of avx512 packed fp compare intrinsics to return a vXi1 mask type to be closer to an fcmp.
Summary:
This patch changes the signature of the avx512 packed fp compare intrinsics to return a vXi1 vector and no longer take a mask as input. The casts to scalar type will now need to be explicit in the IR. The masking node will now be an explicit and in the IR.

This makes the intrinsic look much more similar to an fcmp instruction that we wish we could use for these but can't. We already use icmp instructions for integer compares.

Previously the lowering step of isel would turn the intrinsic into an X86 specific ISD node and a emit the masking nodes as well as some bitcasts. This means DAG combines can't see the vXi1 type until somewhat late, making it more difficult to combine out gpr<->mask transition sequences. By exposing the vXi1 type explicitly in the IR and initial SelectionDAG we give earlier DAG combines and even InstCombine the chance to see it and optimize it.

This should make any issues with gpr<->mask sequences the same between integer and fp. Meaning we only have to fix them once.

Reviewers: spatel, delena, RKSimon, zvi

Reviewed By: RKSimon

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D43137

llvm-svn: 324827
2018-02-10 23:33:55 +00:00
..
X86FsubCmpCombine.ll [X86] Change signatures of avx512 packed fp compare intrinsics to return a vXi1 mask type to be closer to an fcmp. 2018-02-10 23:33:55 +00:00
blend_x86.ll
lit.local.cfg
pr2645-1.ll
shufflemask-undef.ll
x86-avx2.ll
x86-avx512.ll
x86-bmi-tbm.ll [X86][InstCombine] Add some simplifications for BZHI intrinsics 2017-07-31 18:52:15 +00:00
x86-crc32-demanded.ll
x86-f16c.ll
x86-fma.ll
x86-insertps.ll
x86-masked-memops.ll
x86-movmsk.ll
x86-muldq.ll
x86-pack.ll
x86-pshufb.ll [InstCombine] improve demanded vector elements analysis of insertelement 2017-08-31 15:57:17 +00:00
x86-sse.ll
x86-sse2.ll
x86-sse4a.ll
x86-sse41.ll
x86-vec_demanded_elts.ll
x86-vector-shifts.ll
x86-vpermil.ll
x86-xop.ll