forked from OSchip/llvm-project
aaabbbf886
Our alias checks precisely check that the minimal and maximal accessed elements do not overlap in a kernel. Hence, we must ensure that our host <-> device transfers do not touch additional memory locations that are not covered in the alias check. To ensure this, we make sure that the data we copy for a given array is only the data from the smallest element accessed to the largest element accessed. We also adjust the size of the array according to the offset at which the array is actually accessed. An interesting result of this is: In case array are accessed with negative subscripts ,e.g., A[-100], we automatically allocate and transfer _more_ data to cover the full array. This is important as such code indeed exists in the wild. llvm-svn: 281611 |
||
---|---|---|
.. | ||
cuda-annotations.ll | ||
double-parallel-loop.ll | ||
host-control-flow.ll | ||
host-statement.ll | ||
invalid-kernel.ll | ||
kernel-params-only-some-arrays.ll | ||
kernel-params-scop-parameter.ll | ||
non-zero-array-offset.ll | ||
only-part-of-array-modified.ll | ||
parametric-loop-bound.ll | ||
phi-nodes-in-kernel.ll | ||
private-memory.ll | ||
region-stmt.ll | ||
remove-dead-instructions-in-stmt-2.ll | ||
remove-dead-instructions-in-stmt.ll | ||
run-time-check.ll | ||
scalar-param-and-value-32-bit.ll | ||
scalar-param-and-value-use.ll | ||
scalar-parameter-fp128.ll | ||
scalar-parameter-i80.ll | ||
scalar-parameter-i120.ll | ||
scalar-parameter-i128.ll | ||
scalar-parameter-i3000.ll | ||
scalar-parameter-ppc_fp128.ll | ||
scalar-parameter-x86_fp80.ll | ||
scalar-parameter.ll | ||
scheduler-timeout.ll | ||
shared-memory-scalar.ll | ||
shared-memory-two-dimensional.ll | ||
shared-memory.ll | ||
size-cast.ll | ||
untouched-arrays.ll |