Matt Arsenault
269ffdac4e
AMDGPU: Fix crash on i16 constant expression
...
llvm-svn: 288861
2016-12-06 23:18:06 +00:00
Konstantin Zhuravlyov
c09e2d7e46
[AMDGPU] AMDGPUCodeGenPrepare: remove extra ';'
...
llvm-svn: 283558
2016-10-07 14:39:53 +00:00
Konstantin Zhuravlyov
f74fc60a7d
[AMDGPU] Promote uniform (i1, i16] operations to i32
...
Differential Revision: https://reviews.llvm.org/D25302
llvm-svn: 283555
2016-10-07 14:22:58 +00:00
Konstantin Zhuravlyov
b4eb5d5049
[AMDGPU] Promote uniform i16 bitreverse intrinsic to i32
...
Differential Revision: https://reviews.llvm.org/D25121
llvm-svn: 283415
2016-10-06 02:20:46 +00:00
Konstantin Zhuravlyov
691e2e020b
[AMDGPU] Sign extend AShr when promoting (instead of zero extending)
...
llvm-svn: 283130
2016-10-03 18:29:01 +00:00
Mehdi Amini
117296c0a0
Use StringRef in Pass/PassManager APIs (NFC)
...
llvm-svn: 283004
2016-10-01 02:56:57 +00:00
Konstantin Zhuravlyov
e14df4b236
[AMDGPU] Promote uniform i16 ops to i32 ops for targets that have 16 bit instructions
...
Differential Revision: https://reviews.llvm.org/D24125
llvm-svn: 282624
2016-09-28 20:05:39 +00:00
Matt Arsenault
e3862cdc93
AMDGPU: Use rcp for fdiv 1, x with fpmath metadata
...
Using rcp should be OK for safe math usually, so this
should not be replacing the original fdiv.
llvm-svn: 276823
2016-07-26 23:25:44 +00:00
Matt Arsenault
a1fe17c9ad
AMDGPU: Change fdiv lowering based on !fpmath metadata
...
If 2.5 ulp is acceptable, denormals are not required, and
isn't a reciprocal which will already be handled, replace
with a faster fdiv.
Simplify the lowering tests by using per function
subtarget features.
llvm-svn: 276051
2016-07-19 23:16:53 +00:00
Matt Arsenault
86de486d31
AMDGPU: Add stub custom CodeGenPrepare pass
...
This will do various things including ones
CodeGenPrepare does, but with knowledge of uniform
values.
llvm-svn: 273657
2016-06-24 07:07:55 +00:00