forked from OSchip/llvm-project
0bc4b2d337
Currently Clang use int32 to represent sampler_t, which have been a source of issue for some backends, because in some backends sampler_t cannot be represented by int32. They have to depend on kernel argument metadata and use IPA to find the sampler arguments and global variables and transform them to target specific sampler type.
This patch uses opaque pointer type opencl.sampler_t* for sampler_t. For each use of file-scope sampler variable, it generates a function call of __translate_sampler_initializer. For each initialization of function-scope sampler variable, it generates a function call of __translate_sampler_initializer.
Each builtin library can implement its own __translate_sampler_initializer(). Since the real sampler type tends to be architecture dependent, allowing it to be initialized by a library function simplifies backend design. A typical implementation of __translate_sampler_initializer could be a table lookup of real sampler literal values. Since its argument is always a literal, the returned pointer is known at compile time and easily optimized to finally become some literal values directly put into image read instructions.
This patch is partially based on Alexey Sotkin's work in Khronos Clang (
|
||
---|---|---|
.. | ||
2011-04-15-vec-init-from-vec.cl | ||
addr-space-struct-arg.cl | ||
address-space-constant-initializers.cl | ||
address-spaces-conversions.cl | ||
address-spaces-mangling.cl | ||
address-spaces.cl | ||
amdgcn-flat-scratch-name.cl | ||
amdgpu-call-kernel.cl | ||
amdgpu-calling-conv.cl | ||
amdgpu-num-gpr-attr.cl | ||
as_type.cl | ||
bool_cast.cl | ||
builtins-amdgcn-error.cl | ||
builtins-amdgcn-vi.cl | ||
builtins-amdgcn.cl | ||
builtins-generic-amdgcn.cl | ||
builtins-r600.cl | ||
cl-strict-aliasing.cl | ||
cl20-device-side-enqueue.cl | ||
const-str-array-decay.cl | ||
constant-addr-space-globals.cl | ||
denorms-are-zero.cl | ||
event_t.cl | ||
ext-vector-shuffle.cl | ||
fpmath.cl | ||
half.cl | ||
images.cl | ||
kernel-arg-info.cl | ||
kernel-attributes.cl | ||
kernel-metadata.cl | ||
local-initializer-undef.cl | ||
local.cl | ||
logical-ops.cl | ||
memcpy.cl | ||
no-signed-zeros.cl | ||
opencl_types.cl | ||
pipe_builtin.cl | ||
pipe_types.cl | ||
ptx-calls.cl | ||
ptx-kernels.cl | ||
relaxed-fpmath.cl | ||
sampler.cl | ||
shifts.cl | ||
single-precision-constant.cl | ||
spir-calling-conv.cl | ||
spir32_target.cl | ||
spir64_target.cl | ||
spir_version.cl | ||
str_literals.cl | ||
to_addr_builtin.cl | ||
unroll-hint.cl | ||
vectorLoadStore.cl | ||
vector_literals_nested.cl | ||
vector_literals_valid.cl | ||
vector_logops.cl | ||
vector_odd.cl | ||
vector_shufflevector_valid.cl | ||
vla.cl |