llvm-project/parallel-libs/streamexecutor
Jason Henline c16fb8748d [SE] Clean up device and host memory slices
Summary:
* Add LLVM_ATTRIBUTE_UNUSED_RESULT to slicing methods in order to
  emphasize that the slicing is not done in place.
* Change device memory slice function name from `drop_front` to `slice`
  in order to match the naming convention of `llvm::ArrayRef` and host
  memory slice.
* Change the parameter names of host memory slice functions to
  `DropCount` and `TakeCount` to match device memory slice declarations.
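
The slicing semantics described above can be sketched as follows. This is a
minimal, hypothetical stand-in and not the StreamExecutor classes themselves:
the `Slice` template, its constructor, and its exact overload set are
assumptions made for illustration. Only the parameter names `DropCount` and
`TakeCount` and the `slice` method name come from the summary. The standard
`[[nodiscard]]` attribute is used here in place of
LLVM_ATTRIBUTE_UNUSED_RESULT so the sketch needs no LLVM headers.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical non-owning view over a contiguous range of T.
// Illustrative only; not the StreamExecutor device or host memory slice.
template <typename T> class Slice {
public:
  Slice(T *Data, std::size_t Count) : Data(Data), Count(Count) {}

  std::size_t size() const { return Count; }

  T &operator[](std::size_t I) const {
    assert(I < Count && "index out of range");
    return Data[I];
  }

  // Returns a NEW slice that skips the first DropCount elements and keeps
  // the next TakeCount elements. *this is left unchanged, so discarding
  // the result is almost certainly a bug; [[nodiscard]] asks the compiler
  // to warn about that.
  [[nodiscard]] Slice slice(std::size_t DropCount,
                            std::size_t TakeCount) const {
    assert(DropCount <= Count && TakeCount <= Count - DropCount);
    return Slice(Data + DropCount, TakeCount);
  }

  // Returns a NEW slice that skips the first DropCount elements and keeps
  // everything after them.
  [[nodiscard]] Slice slice(std::size_t DropCount) const {
    assert(DropCount <= Count);
    return Slice(Data + DropCount, Count - DropCount);
  }

private:
  T *Data;
  std::size_t Count;
};
```

With a five-element array wrapped as `Slice<int> Whole(Values, 5)`, the call
`Whole.slice(1, 3)` yields a three-element view starting at the second
element, while `Whole` still reports a size of five. Writing `Whole.slice(1);`
as a bare statement would leave `Whole` untouched, which is the mistake the
attribute is meant to flag.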

Reviewers: jlebar

Subscribers: jprice, parallel_libs-commits

Differential Revision: https://reviews.llvm.org/D24464

llvm-svn: 281239
2016-09-12 17:20:43 +00:00
examples [SE] RegisteredHostMemory for async device copies 2016-09-12 16:09:41 +00:00
include/streamexecutor [SE] Clean up device and host memory slices 2016-09-12 17:20:43 +00:00
lib [SE] RegisteredHostMemory for async device copies 2016-09-12 16:09:41 +00:00
tools/streamexecutor-config Add streamexecutor-config 2016-09-08 16:12:33 +00:00
unittests [SE] Clean up device and host memory slices 2016-09-12 17:20:43 +00:00
CMakeLists.txt [StreamExecutor] Make SE work with an in-tree LLVM build. 2016-09-09 21:01:02 +00:00
Doxyfile.in Add streamexecutor-config 2016-09-08 16:12:33 +00:00
README.txt
customdoxygen.css [SE] Doc tweaks 2016-09-02 17:59:12 +00:00

README.txt

StreamExecutor
==============

StreamExecutor is a wrapper around the host-side programming models (runtimes)
of CUDA and OpenCL. This abstraction lets host code target either CUDA or
OpenCL devices with identically functioning data-parallel kernels. It manages
the execution of concurrent work targeting the accelerator, similar to a
host-side Executor.

This version of StreamExecutor can be built either as a sub-project of the LLVM
project or as a standalone project depending on LLVM as an external package.