; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx | FileCheck %s
define <4 x i32> @trunc_64_32(<4 x i64> %A) nounwind uwtable readnone ssp {
; CHECK-LABEL: trunc_64_32:
; CHECK: # %bb.0:
; CHECK-NEXT: vextractf128 $1, %ymm0, %xmm1
; The vshufps check below comes from the commit "[x86] use a single shufps
; when it can save instructions", a tiny patch with a big pile of test
; changes. It partially fixes PR27885:
; https://llvm.org/bugs/show_bug.cgi?id=27885
; The motivating case replaces two integer shuffles and a blend with a single
; FP shuffle:
;   - vpshufd  {{.*#+}} xmm1 = xmm1[0,1,0,2]
;   - vpshufd  {{.*#+}} xmm0 = xmm0[0,2,2,3]
;   - vpblendw {{.*#+}} xmm0 = xmm0[0,1,2,3],xmm1[4,5,6,7]
;   + vshufps  {{.*#+}} xmm0 = xmm0[0,2],xmm1[0,2]
; This pattern appears several times in the diffs. On chips with
; domain-crossing penalties, the instruction count and size reduction should
; usually outweigh any penalty for using an FP op in a sequence of integer
; ops; on recent Intel big cores and Atom, shufps has no domain-crossing
; penalty at all, so using it is a pure win. The test diffs all appear to be
; improvements except one test in vector-shuffle-combining.ll, where we miss
; an opportunity to use a shift to generate zero elements, and one in
; combine-sra.ll, where multiple uses prevent the expected shuffle combining.
; Differential Revision: https://reviews.llvm.org/D27692
; llvm-svn: 289837
; CHECK-NEXT: vshufps {{.*#+}} xmm0 = xmm0[0,2],xmm1[0,2]
; CHECK-NEXT: vzeroupper
; CHECK-NEXT: retq
%B = trunc <4 x i64> %A to <4 x i32>
ret <4 x i32> %B
}
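
; A minimal sketch (not in the original test; the function name and CHECK line
; are illustrative assumptions) of the generic shuffle the commit message above
; describes: taking the even elements of two <4 x i32> vectors (shufflevector
; mask <0,2,4,6>), which the x86 backend can now lower to a single shufps
; instead of two pshufds plus a pblendw.
define <4 x i32> @shufps_even_elts(<4 x i32> %a, <4 x i32> %b) nounwind readnone {
; CHECK-LABEL: shufps_even_elts:
; CHECK: vshufps {{.*#+}} xmm0 = xmm0[0,2],xmm1[0,2]
  %r = shufflevector <4 x i32> %a, <4 x i32> %b, <4 x i32> <i32 0, i32 2, i32 4, i32 6>
  ret <4 x i32> %r
}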
define <8 x i16> @trunc_32_16(<8 x i32> %A) nounwind uwtable readnone ssp {
; CHECK-LABEL: trunc_32_16:
; CHECK: # %bb.0:
; CHECK-NEXT: vextractf128 $1, %ymm0, %xmm1
; CHECK-NEXT: vmovdqa {{.*#+}} xmm2 = [0,1,4,5,8,9,12,13,8,9,12,13,12,13,14,15]
; CHECK-NEXT: vpshufb %xmm2, %xmm1, %xmm1
; CHECK-NEXT: vpshufb %xmm2, %xmm0, %xmm0
; CHECK-NEXT: vpunpcklqdq {{.*#+}} xmm0 = xmm0[0],xmm1[0]
; CHECK-NEXT: vzeroupper
; CHECK-NEXT: retq
%B = trunc <8 x i32> %A to <8 x i16>
ret <8 x i16> %B
}
define <16 x i8> @trunc_16_8(<16 x i16> %A) nounwind uwtable readnone ssp {
; CHECK-LABEL: trunc_16_8:
; CHECK: # %bb.0:
; CHECK-NEXT: vextractf128 $1, %ymm0, %xmm1
; CHECK-NEXT: vmovdqa {{.*#+}} xmm2 = <0,2,4,6,8,10,12,14,u,u,u,u,u,u,u,u>
; CHECK-NEXT: vpshufb %xmm2, %xmm1, %xmm1
; CHECK-NEXT: vpshufb %xmm2, %xmm0, %xmm0
; CHECK-NEXT: vpunpcklqdq {{.*#+}} xmm0 = xmm0[0],xmm1[0]
; CHECK-NEXT: vzeroupper
; CHECK-NEXT: retq
%B = trunc <16 x i16> %A to <16 x i8>
ret <16 x i8> %B
}