llvm-project/llvm/test/CodeGen/ARM/half.ll

; RUN: llc < %s -mtriple=thumbv7-apple-ios7.0 | FileCheck %s --check-prefix=CHECK --check-prefix=CHECK-OLD
; RUN: llc < %s -mtriple=thumbv7s-apple-ios7.0 | FileCheck %s --check-prefix=CHECK --check-prefix=CHECK-F16
; RUN: llc < %s -mtriple=thumbv8-apple-ios7.0 | FileCheck %s --check-prefix=CHECK  --check-prefix=CHECK-V8
; RUN: llc < %s -mtriple=armv8r-none-none-eabi | FileCheck %s --check-prefix=CHECK  --check-prefix=CHECK-V8
; RUN: llc < %s -mtriple=armv8r-none-none-eabi -mattr=-fp64 | FileCheck %s --check-prefix=CHECK  --check-prefix=CHECK-V8-SP
; RUN: llc < %s -mtriple=armv8.1m-none-none-eabi -mattr=+fp-armv8 | FileCheck %s --check-prefix=CHECK --check-prefix=CHECK-V8
; RUN: llc < %s -mtriple=armv8.1m-none-none-eabi -mattr=+fp-armv8,-fp64 | FileCheck %s --check-prefix=CHECK --check-prefix=CHECK-V8-SP
; RUN: llc < %s -mtriple=armv8.1m-none-none-eabi -mattr=+mve.fp,+fp64 | FileCheck %s --check-prefix=CHECK-V8
; RUN: llc < %s -mtriple=armv8.1m-none-none-eabi -mattr=+mve.fp | FileCheck %s --check-prefix=CHECK-V8-SP

define void @test_load_store(half* %in, half* %out) {
; CHECK-LABEL: test_load_store:
; CHECK: ldrh [[TMP:r[0-9]+]], [r0]
; CHECK: strh [[TMP]], [r1]
  %val = load half, half* %in
  store half %val, half* %out
  ret void
}

define i16 @test_bitcast_from_half(half* %addr) {
; CHECK-LABEL: test_bitcast_from_half:
; CHECK: ldrh r0, [r0]
  %val = load half, half* %addr
  %val_int = bitcast half %val to i16
  ret i16 %val_int
}

define void @test_bitcast_to_half(half* %addr, i16 %in) {
; CHECK-LABEL: test_bitcast_to_half:
; CHECK: strh r1, [r0]
  %val_fp = bitcast i16 %in to half
  store half %val_fp, half* %addr
  ret void
}

define float @test_extend32(half* %addr) {
; CHECK-LABEL: test_extend32:

; CHECK-OLD: b.w ___extendhfsf2
; CHECK-F16: vcvtb.f32.f16
; CHECK-V8: vcvtb.f32.f16
; CHECK-V8-SP: vcvtb.f32.f16
  %val16 = load half, half* %addr
  %val32 = fpext half %val16 to float
  ret float %val32
}

define double @test_extend64(half* %addr) {
; CHECK-LABEL: test_extend64:

; CHECK-OLD: bl ___extendhfsf2
; CHECK-OLD: vcvt.f64.f32
; CHECK-F16: vcvtb.f32.f16
; CHECK-F16: vcvt.f64.f32
; CHECK-V8: vcvtb.f64.f16
; CHECK-V8-SP: vcvtb.f32.f16
; CHECK-V8-SP: bl __aeabi_f2d
  %val16 = load half, half* %addr
  %val32 = fpext half %val16 to double
  ret double %val32
}

define void @test_trunc32(float %in, half* %addr) {
; CHECK-LABEL: test_trunc32:

; CHECK-OLD: bl ___truncsfhf2
; CHECK-F16: vcvtb.f16.f32
; CHECK-V8: vcvtb.f16.f32
; CHECK-V8-SP: vcvtb.f16.f32
  %val16 = fptrunc float %in to half
  store half %val16, half* %addr
  ret void
}

define void @test_trunc64(double %in, half* %addr) {
; CHECK-LABEL: test_trunc64:

; CHECK-OLD: bl ___truncdfhf2
; CHECK-F16: bl ___truncdfhf2
; CHECK-V8: vcvtb.f16.f64
; CHECK-V8-SP: bl __aeabi_d2h
  %val16 = fptrunc double %in to half
  store half %val16, half* %addr
  ret void
}
ARM: support legalisation of "fptrunc ... to half" operations. llvm-svn: 213373 2014-07-18 21:01:19 +08:00			`; RUN: llc < %s -mtriple=thumbv7-apple-ios7.0 \| FileCheck %s --check-prefix=CHECK --check-prefix=CHECK-OLD`
			`; RUN: llc < %s -mtriple=thumbv7s-apple-ios7.0 \| FileCheck %s --check-prefix=CHECK --check-prefix=CHECK-F16`
			`; RUN: llc < %s -mtriple=thumbv8-apple-ios7.0 \| FileCheck %s --check-prefix=CHECK --check-prefix=CHECK-V8`
[ARM] Tighten f64<->f16 conversion requirements Fix missing Requires fields. Patch by Bernard Ogden (bogden) Reviewers: SjoerdMeijer, javed.absar, t.p.northover Reviewed By: t.p.northover Differential Revision: https://reviews.llvm.org/D51631 llvm-svn: 342061 2018-09-13 00:24:43 +08:00			`; RUN: llc < %s -mtriple=armv8r-none-none-eabi \| FileCheck %s --check-prefix=CHECK --check-prefix=CHECK-V8`
[ARM] Replace fp-only-sp and d16 with fp64 and d32. Those two subtarget features were awkward because their semantics are reversed: each one indicates the _lack_ of support for something in the architecture, rather than the presence. As a consequence, you don't get the behavior you want if you combine two sets of feature bits. Each SubtargetFeature for an FP architecture version now comes in four versions, one for each combination of those options. So you can still say (for example) '+vfp2' in a feature string and it will mean what it's always meant, but there's a new string '+vfp2d16sp' meaning the version without those extra options. A lot of this change is just mechanically replacing positive checks for the old features with negative checks for the new ones. But one more interesting change is that I've rearranged getFPUFeatures() so that the main FPU feature is appended to the output list before rather than after the features derived from the Restriction field, so that -fp64 and -d32 can override defaults added by the main feature. Reviewers: dmgreen, samparker, SjoerdMeijer Subscribers: srhines, javed.absar, eraman, kristof.beyls, hiraditya, zzheng, Petar.Avramovic, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D60691 llvm-svn: 361845 2019-05-29 00:13:20 +08:00			`; RUN: llc < %s -mtriple=armv8r-none-none-eabi -mattr=-fp64 \| FileCheck %s --check-prefix=CHECK --check-prefix=CHECK-V8-SP`
[ARM] Explicit lowering of half <-> double conversions. If an FP_EXTEND or FP_ROUND isel dag node converts directly between f16 and f32 when the target CPU has no instruction to do it in one go, it has to be done in two steps instead, going via f32. Previously, this was done implicitly, because all such CPUs had the storage-only implementation of f16 (i.e. the only thing you can do with one at all is to convert it to/from f32). So isel would legalize the f16 into an f32 as soon as it saw it, by inserting an fp16_to_fp node (or vice versa), and then the fp_extend would already be f32->f64 rather than f16->f64. But that technique can't support a target CPU which has full f16 support but _not_ f64, such as some variants of Arm v8.1-M. So now we provide custom lowering for FP_EXTEND and FP_ROUND, which checks support for f16 and f64 and decides on the best thing to do given the combination of flags it gets back. Reviewers: dmgreen, samparker, SjoerdMeijer Subscribers: javed.absar, kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60692 llvm-svn: 364294 2019-06-25 19:24:50 +08:00			`; RUN: llc < %s -mtriple=armv8.1m-none-none-eabi -mattr=+fp-armv8 \| FileCheck %s --check-prefix=CHECK --check-prefix=CHECK-V8`
			`; RUN: llc < %s -mtriple=armv8.1m-none-none-eabi -mattr=+fp-armv8,-fp64 \| FileCheck %s --check-prefix=CHECK --check-prefix=CHECK-V8-SP`
			`; RUN: llc < %s -mtriple=armv8.1m-none-none-eabi -mattr=+mve.fp,+fp64 \| FileCheck %s --check-prefix=CHECK-V8`
			`; RUN: llc < %s -mtriple=armv8.1m-none-none-eabi -mattr=+mve.fp \| FileCheck %s --check-prefix=CHECK-V8-SP`
CodeGen: soften f16 type by default instead of marking legal. Actual support for softening f16 operations is still limited, and can be added when it's needed. But Soften is much closer to being a useful thing to try than keeping it Legal when no registers can actually hold such values. Longer term, we probably want something between Soften and Promote semantics for most targets, it'll be more efficient to promote the 4 basic operations to f32 than libcall them. llvm-svn: 213372 2014-07-18 20:41:46 +08:00
			`define void @test_load_store(half* %in, half* %out) {`
			`; CHECK-LABEL: test_load_store:`
			`; CHECK: ldrh [[TMP:r[0-9]+]], [r0]`
			`; CHECK: strh [[TMP]], [r1]`
[opaque pointer type] Add textual IR support for explicit type parameter to load instruction Essentially the same as the GEP change in r230786. A similar migration script can be used to update test cases, though a few more test case improvements/changes were required this time around: (r229269-r229278) import fileinput import sys import re pat = re.compile(r"((?:=\|:\|^)\sload (?:atomic )?(?:volatile )?(.?))(\| addrspace\(\d+\) )\($\| (?:%\|@\|null\|undef\|blockaddress\|getelementptr\|addrspacecast\|bitcast\|inttoptr\|\[\[[a-zA-Z]\|\{\{).$)") for line in sys.stdin: sys.stdout.write(re.sub(pat, r"\1, \2\3*\4", line)) Reviewers: rafael, dexonsmith, grosser Differential Revision: http://reviews.llvm.org/D7649 llvm-svn: 230794 2015-02-28 05:17:42 +08:00			`%val = load half, half* %in`
CodeGen: soften f16 type by default instead of marking legal. Actual support for softening f16 operations is still limited, and can be added when it's needed. But Soften is much closer to being a useful thing to try than keeping it Legal when no registers can actually hold such values. Longer term, we probably want something between Soften and Promote semantics for most targets, it'll be more efficient to promote the 4 basic operations to f32 than libcall them. llvm-svn: 213372 2014-07-18 20:41:46 +08:00			`store half %val, half* %out`
			`ret void`
			`}`

			`define i16 @test_bitcast_from_half(half* %addr) {`
			`; CHECK-LABEL: test_bitcast_from_half:`
			`; CHECK: ldrh r0, [r0]`
[opaque pointer type] Add textual IR support for explicit type parameter to load instruction Essentially the same as the GEP change in r230786. A similar migration script can be used to update test cases, though a few more test case improvements/changes were required this time around: (r229269-r229278) import fileinput import sys import re pat = re.compile(r"((?:=\|:\|^)\sload (?:atomic )?(?:volatile )?(.?))(\| addrspace\(\d+\) )\($\| (?:%\|@\|null\|undef\|blockaddress\|getelementptr\|addrspacecast\|bitcast\|inttoptr\|\[\[[a-zA-Z]\|\{\{).$)") for line in sys.stdin: sys.stdout.write(re.sub(pat, r"\1, \2\3*\4", line)) Reviewers: rafael, dexonsmith, grosser Differential Revision: http://reviews.llvm.org/D7649 llvm-svn: 230794 2015-02-28 05:17:42 +08:00			`%val = load half, half* %addr`
CodeGen: soften f16 type by default instead of marking legal. Actual support for softening f16 operations is still limited, and can be added when it's needed. But Soften is much closer to being a useful thing to try than keeping it Legal when no registers can actually hold such values. Longer term, we probably want something between Soften and Promote semantics for most targets, it'll be more efficient to promote the 4 basic operations to f32 than libcall them. llvm-svn: 213372 2014-07-18 20:41:46 +08:00			`%val_int = bitcast half %val to i16`
			`ret i16 %val_int`
			`}`

			`define void @test_bitcast_to_half(half* %addr, i16 %in) {`
			`; CHECK-LABEL: test_bitcast_to_half:`
			`; CHECK: strh r1, [r0]`
			`%val_fp = bitcast i16 %in to half`
			`store half %val_fp, half* %addr`
			`ret void`
			`}`
ARM: support legalisation of "fptrunc ... to half" operations. llvm-svn: 213373 2014-07-18 21:01:19 +08:00
			`define float @test_extend32(half* %addr) {`
			`; CHECK-LABEL: test_extend32:`

[CodeGen] Use standard -not gnueabi- naming for f16 libcalls on Darwin. Other targets probably should as well. Since r237161, compiler-rt has both, but I don't see why anything other than gnueabi would use a gnueabi naming scheme. llvm-svn: 237324 2015-05-14 09:00:51 +08:00			`; CHECK-OLD: b.w ___extendhfsf2`
ARM: support legalisation of "fptrunc ... to half" operations. llvm-svn: 213373 2014-07-18 21:01:19 +08:00			`; CHECK-F16: vcvtb.f32.f16`
			`; CHECK-V8: vcvtb.f32.f16`
[ARM] Tighten f64<->f16 conversion requirements Fix missing Requires fields. Patch by Bernard Ogden (bogden) Reviewers: SjoerdMeijer, javed.absar, t.p.northover Reviewed By: t.p.northover Differential Revision: https://reviews.llvm.org/D51631 llvm-svn: 342061 2018-09-13 00:24:43 +08:00			`; CHECK-V8-SP: vcvtb.f32.f16`
[opaque pointer type] Add textual IR support for explicit type parameter to load instruction Essentially the same as the GEP change in r230786. A similar migration script can be used to update test cases, though a few more test case improvements/changes were required this time around: (r229269-r229278) import fileinput import sys import re pat = re.compile(r"((?:=\|:\|^)\sload (?:atomic )?(?:volatile )?(.?))(\| addrspace\(\d+\) )\($\| (?:%\|@\|null\|undef\|blockaddress\|getelementptr\|addrspacecast\|bitcast\|inttoptr\|\[\[[a-zA-Z]\|\{\{).$)") for line in sys.stdin: sys.stdout.write(re.sub(pat, r"\1, \2\3*\4", line)) Reviewers: rafael, dexonsmith, grosser Differential Revision: http://reviews.llvm.org/D7649 llvm-svn: 230794 2015-02-28 05:17:42 +08:00			`%val16 = load half, half* %addr`
ARM: support legalisation of "fptrunc ... to half" operations. llvm-svn: 213373 2014-07-18 21:01:19 +08:00			`%val32 = fpext half %val16 to float`
			`ret float %val32`
			`}`

			`define double @test_extend64(half* %addr) {`
			`; CHECK-LABEL: test_extend64:`

ARM: stop emitting blx instructions for most calls on MachO. I'm really not sure why we were in the first place, it's the linker's job to convert between BL/BLX as necessary. Even worse, using BLX left Thumb calls that could be locally resolved completely unencodable since all offsets to BLX are multiples of 4. rdar://26182344 llvm-svn: 269101 2016-05-11 03:17:47 +08:00			`; CHECK-OLD: bl ___extendhfsf2`
ARM: support legalisation of "fptrunc ... to half" operations. llvm-svn: 213373 2014-07-18 21:01:19 +08:00			`; CHECK-OLD: vcvt.f64.f32`
			`; CHECK-F16: vcvtb.f32.f16`
			`; CHECK-F16: vcvt.f64.f32`
			`; CHECK-V8: vcvtb.f64.f16`
[ARM] Tighten f64<->f16 conversion requirements Fix missing Requires fields. Patch by Bernard Ogden (bogden) Reviewers: SjoerdMeijer, javed.absar, t.p.northover Reviewed By: t.p.northover Differential Revision: https://reviews.llvm.org/D51631 llvm-svn: 342061 2018-09-13 00:24:43 +08:00			`; CHECK-V8-SP: vcvtb.f32.f16`
			`; CHECK-V8-SP: bl __aeabi_f2d`
[opaque pointer type] Add textual IR support for explicit type parameter to load instruction Essentially the same as the GEP change in r230786. A similar migration script can be used to update test cases, though a few more test case improvements/changes were required this time around: (r229269-r229278) import fileinput import sys import re pat = re.compile(r"((?:=\|:\|^)\sload (?:atomic )?(?:volatile )?(.?))(\| addrspace\(\d+\) )\($\| (?:%\|@\|null\|undef\|blockaddress\|getelementptr\|addrspacecast\|bitcast\|inttoptr\|\[\[[a-zA-Z]\|\{\{).$)") for line in sys.stdin: sys.stdout.write(re.sub(pat, r"\1, \2\3*\4", line)) Reviewers: rafael, dexonsmith, grosser Differential Revision: http://reviews.llvm.org/D7649 llvm-svn: 230794 2015-02-28 05:17:42 +08:00			`%val16 = load half, half* %addr`
ARM: support legalisation of "fptrunc ... to half" operations. llvm-svn: 213373 2014-07-18 21:01:19 +08:00			`%val32 = fpext half %val16 to double`
			`ret double %val32`
			`}`

			`define void @test_trunc32(float %in, half* %addr) {`
			`; CHECK-LABEL: test_trunc32:`

ARM: stop emitting blx instructions for most calls on MachO. I'm really not sure why we were in the first place, it's the linker's job to convert between BL/BLX as necessary. Even worse, using BLX left Thumb calls that could be locally resolved completely unencodable since all offsets to BLX are multiples of 4. rdar://26182344 llvm-svn: 269101 2016-05-11 03:17:47 +08:00			`; CHECK-OLD: bl ___truncsfhf2`
ARM: support legalisation of "fptrunc ... to half" operations. llvm-svn: 213373 2014-07-18 21:01:19 +08:00			`; CHECK-F16: vcvtb.f16.f32`
			`; CHECK-V8: vcvtb.f16.f32`
[ARM] Tighten f64<->f16 conversion requirements Fix missing Requires fields. Patch by Bernard Ogden (bogden) Reviewers: SjoerdMeijer, javed.absar, t.p.northover Reviewed By: t.p.northover Differential Revision: https://reviews.llvm.org/D51631 llvm-svn: 342061 2018-09-13 00:24:43 +08:00			`; CHECK-V8-SP: vcvtb.f16.f32`
ARM: support legalisation of "fptrunc ... to half" operations. llvm-svn: 213373 2014-07-18 21:01:19 +08:00			`%val16 = fptrunc float %in to half`
			`store half %val16, half* %addr`
			`ret void`
			`}`

			`define void @test_trunc64(double %in, half* %addr) {`
			`; CHECK-LABEL: test_trunc64:`

ARM: stop emitting blx instructions for most calls on MachO. I'm really not sure why we were in the first place, it's the linker's job to convert between BL/BLX as necessary. Even worse, using BLX left Thumb calls that could be locally resolved completely unencodable since all offsets to BLX are multiples of 4. rdar://26182344 llvm-svn: 269101 2016-05-11 03:17:47 +08:00			`; CHECK-OLD: bl ___truncdfhf2`
			`; CHECK-F16: bl ___truncdfhf2`
ARM: support legalisation of "fptrunc ... to half" operations. llvm-svn: 213373 2014-07-18 21:01:19 +08:00			`; CHECK-V8: vcvtb.f16.f64`
[ARM] Tighten f64<->f16 conversion requirements Fix missing Requires fields. Patch by Bernard Ogden (bogden) Reviewers: SjoerdMeijer, javed.absar, t.p.northover Reviewed By: t.p.northover Differential Revision: https://reviews.llvm.org/D51631 llvm-svn: 342061 2018-09-13 00:24:43 +08:00			`; CHECK-V8-SP: bl __aeabi_d2h`
ARM: support legalisation of "fptrunc ... to half" operations. llvm-svn: 213373 2014-07-18 21:01:19 +08:00			`%val16 = fptrunc double %in to half`
			`store half %val16, half* %addr`
			`ret void`
			`}`