forked from OSchip/llvm-project
c16fb8748d
Summary: * Add LLVM_ATTRIBUTE_UNUSED_RESULT used to slicing methods in order to emphasize that the slicing is not done in place. * Change device memory slice function name from `drop_front` to `slice` in order to match the naming convention of `llvm::ArrayRef` and host memory slice. * Change the parameter names of host memory slice functions to `DropCount` and `TakeCount` to match device memory slice declarations. Reviewers: jlebar Subscribers: jprice, parallel_libs-commits Differential Revision: https://reviews.llvm.org/D24464 llvm-svn: 281239 |
||
---|---|---|
.. | ||
examples | ||
include/streamexecutor | ||
lib | ||
tools/streamexecutor-config | ||
unittests | ||
CMakeLists.txt | ||
Doxyfile.in | ||
README.txt | ||
customdoxygen.css |
README.txt
StreamExecutor ============== StreamExecutor is a wrapper around CUDA and OpenCL (host-side) programming models (runtimes). This abstraction cleanly permits host code to target either CUDA or OpenCL devices with identically-functioning data parallel kernels. It manages the execution of concurrent work targeting the accelerator, similar to a host-side Executor. This version of StreamExecutor can be built either as a sub-project of the LLVM project or as a standalone project depending on LLVM as an external package.