llvm-project/parallel-libs/streamexecutor/examples
Jason Henline b38d8a3a3b [SE] Pack global dev handle addresses
Summary:
We were packing global device memory handles in
`PackedKernelArgumentArray`, but as I was implementing the CUDA
platform, I realized that CUDA wants the address of the handle, not the
handle itself. So this patch switches to packing the address of the
handle.

Reviewers: jlebar

Subscribers: jprice, jlebar, parallel_libs-commits

Differential Revision: https://reviews.llvm.org/D24528

llvm-svn: 281424
2016-09-13 23:59:10 +00:00
..
CMakeLists.txt [SE] Host platform implementation 2016-09-13 19:28:02 +00:00
CUDASaxpy.cpp [SE] Platforms return Device values 2016-09-13 23:56:46 +00:00
HostSaxpy.cpp [SE] Pack global dev handle addresses 2016-09-13 23:59:10 +00:00