2017-02-01 08:14:41 +08:00
|
|
|
VERS1.0 {
|
|
|
|
global:
|
|
|
|
__tgt_rtl_is_valid_binary;
|
2020-06-05 04:58:37 +08:00
|
|
|
__tgt_rtl_is_data_exchangable;
|
2017-02-01 08:14:41 +08:00
|
|
|
__tgt_rtl_number_of_devices;
|
2019-06-04 23:05:53 +08:00
|
|
|
__tgt_rtl_init_requires;
|
2017-02-01 08:14:41 +08:00
|
|
|
__tgt_rtl_init_device;
|
|
|
|
__tgt_rtl_load_binary;
|
|
|
|
__tgt_rtl_data_alloc;
|
|
|
|
__tgt_rtl_data_submit;
|
2020-04-10 10:40:30 +08:00
|
|
|
__tgt_rtl_data_submit_async;
|
2017-02-01 08:14:41 +08:00
|
|
|
__tgt_rtl_data_retrieve;
|
2020-04-10 10:40:30 +08:00
|
|
|
__tgt_rtl_data_retrieve_async;
|
2020-06-05 04:58:37 +08:00
|
|
|
__tgt_rtl_data_exchange;
|
|
|
|
__tgt_rtl_data_exchange_async;
|
2017-02-01 08:14:41 +08:00
|
|
|
__tgt_rtl_data_delete;
|
|
|
|
__tgt_rtl_run_target_team_region;
|
2020-04-10 10:40:30 +08:00
|
|
|
__tgt_rtl_run_target_team_region_async;
|
2017-02-01 08:14:41 +08:00
|
|
|
__tgt_rtl_run_target_region;
|
2020-04-10 10:40:30 +08:00
|
|
|
__tgt_rtl_run_target_region_async;
|
[OpenMP] Optimized stream selection by scheduling data mapping for the same target region into a same stream
Summary:
This patch introduces two things for offloading:
1. Asynchronous data transferring: those functions are suffix with `_async`. They have one more argument compared with their synchronous counterparts: `__tgt_async_info*`, which is a new struct that only has one field, `void *Identifier`. This struct is for information exchange between different asynchronous operations. It can be used for stream selection, like in this case, or operation synchronization, which is also used. We may expect more usages in the future.
2. Optimization of stream selection for data mapping. Previous implementation was using asynchronous device memory transfer but synchronizing after each memory transfer. Actually, if we say kernel A needs four memory copy to device and two memory copy back to host, then we can schedule these seven operations (four H2D, two D2H, and one kernel launch) into a same stream and just need synchronization after memory copy from device to host. In this way, we can save a huge overhead compared with synchronization after each operation.
Reviewers: jdoerfert, ye-luo
Reviewed By: jdoerfert
Subscribers: yaxunl, lildmh, guansong, openmp-commits
Tags: #openmp
Differential Revision: https://reviews.llvm.org/D77005
2020-04-08 02:51:56 +08:00
|
|
|
__tgt_rtl_synchronize;
|
2017-02-01 08:14:41 +08:00
|
|
|
local:
|
|
|
|
*;
|
|
|
|
};
|