2018-01-09 03:02:51 +08:00
|
|
|
.. raw:: html
|
|
|
|
|
|
|
|
<style type="text/css">
|
|
|
|
.none { background-color: #FFCCCC }
|
2019-09-05 01:15:37 +08:00
|
|
|
.part { background-color: #FFFF99 }
|
2018-01-09 03:02:51 +08:00
|
|
|
.good { background-color: #CCFF99 }
|
|
|
|
</style>
|
|
|
|
|
|
|
|
.. role:: none
|
2019-09-05 01:15:37 +08:00
|
|
|
.. role:: part
|
2018-01-09 03:02:51 +08:00
|
|
|
.. role:: good
|
|
|
|
|
2018-07-27 01:53:45 +08:00
|
|
|
.. contents::
|
|
|
|
:local:
|
|
|
|
|
2018-01-09 03:02:51 +08:00
|
|
|
OpenMP Support
|
2019-11-07 02:25:16 +08:00
|
|
|
==============
|
2018-01-09 03:02:51 +08:00
|
|
|
|
2019-01-19 03:57:37 +08:00
|
|
|
Clang fully supports OpenMP 4.5. Clang supports offloading to X86_64, AArch64,
|
|
|
|
PPC64[LE] and has `basic support for Cuda devices`_.
|
2018-01-09 03:02:51 +08:00
|
|
|
|
2019-09-05 01:15:37 +08:00
|
|
|
* #pragma omp declare simd: :part:`Partial`. We support parsing/semantic
|
2019-01-19 03:57:37 +08:00
|
|
|
analysis + generation of special attributes for X86 target, but still
|
|
|
|
missing the LLVM pass for vectorization.
|
2018-07-27 01:53:45 +08:00
|
|
|
|
|
|
|
In addition, the LLVM OpenMP runtime `libomp` supports the OpenMP Tools
|
2019-01-19 03:57:37 +08:00
|
|
|
Interface (OMPT) on x86, x86_64, AArch64, and PPC64 on Linux, Windows, and macOS.
|
|
|
|
|
2019-10-30 22:38:11 +08:00
|
|
|
For the list of supported features from OpenMP 5.0 see `OpenMP implementation details`_.
|
|
|
|
|
2019-01-19 03:57:37 +08:00
|
|
|
General improvements
|
2019-11-07 02:25:16 +08:00
|
|
|
====================
|
2019-01-19 03:57:37 +08:00
|
|
|
- New collapse clause scheme to avoid expensive remainder operations.
|
|
|
|
Compute loop index variables after collapsing a loop nest via the
|
|
|
|
collapse clause by replacing the expensive remainder operation with
|
|
|
|
multiplications and additions.
|
|
|
|
|
|
|
|
- The default schedules for the `distribute` and `for` constructs in a
|
|
|
|
parallel region and in SPMD mode have changed to ensure coalesced
|
|
|
|
accesses. For the `distribute` construct, a static schedule is used
|
|
|
|
with a chunk size equal to the number of threads per team (default
|
|
|
|
value of threads or as specified by the `thread_limit` clause if
|
|
|
|
present). For the `for` construct, the schedule is static with chunk
|
|
|
|
size of one.
|
2019-10-30 22:32:08 +08:00
|
|
|
|
2019-01-19 03:57:37 +08:00
|
|
|
- Simplified SPMD code generation for `distribute parallel for` when
|
|
|
|
the new default schedules are applicable.
|
2018-07-27 01:53:45 +08:00
|
|
|
|
2019-11-07 02:25:16 +08:00
|
|
|
- When using the collapse clause on a loop nest the default behavior
|
|
|
|
is to automatically extend the representation of the loop counter to
|
|
|
|
64 bits for the cases where the sizes of the collapsed loops are not
|
|
|
|
known at compile time. To prevent this conservative choice and use
|
|
|
|
at most 32 bits, compile your program with the
|
|
|
|
`-fopenmp-optimistic-collapse`.
|
|
|
|
|
2018-07-27 01:53:45 +08:00
|
|
|
.. _basic support for Cuda devices:
|
|
|
|
|
|
|
|
Cuda devices support
|
|
|
|
====================
|
|
|
|
|
|
|
|
Directives execution modes
|
|
|
|
--------------------------
|
|
|
|
|
|
|
|
Clang code generation for target regions supports two modes: the SPMD and
|
|
|
|
non-SPMD modes. Clang chooses one of these two modes automatically based on the
|
|
|
|
way directives and clauses on those directives are used. The SPMD mode uses a
|
|
|
|
simplified set of runtime functions thus increasing performance at the cost of
|
|
|
|
supporting some OpenMP features. The non-SPMD mode is the most generic mode and
|
|
|
|
supports all currently available OpenMP features. The compiler will always
|
|
|
|
attempt to use the SPMD mode wherever possible. SPMD mode will not be used if:
|
|
|
|
|
|
|
|
- The target region contains user code (other than OpenMP-specific
|
|
|
|
directives) in between the `target` and the `parallel` directives.
|
|
|
|
|
|
|
|
Data-sharing modes
|
|
|
|
------------------
|
|
|
|
|
|
|
|
Clang supports two data-sharing models for Cuda devices: `Generic` and `Cuda`
|
|
|
|
modes. The default mode is `Generic`. `Cuda` mode can give an additional
|
|
|
|
performance and can be activated using the `-fopenmp-cuda-mode` flag. In
|
|
|
|
`Generic` mode all local variables that can be shared in the parallel regions
|
|
|
|
are stored in the global memory. In `Cuda` mode local variables are not shared
|
|
|
|
between the threads and it is user responsibility to share the required data
|
|
|
|
between the threads in the parallel regions.
|
|
|
|
|
2019-01-10 04:38:35 +08:00
|
|
|
|
2018-07-27 01:53:45 +08:00
|
|
|
Features not supported or with limited support for Cuda devices
|
|
|
|
---------------------------------------------------------------
|
|
|
|
|
|
|
|
- Cancellation constructs are not supported.
|
|
|
|
|
|
|
|
- Doacross loop nest is not supported.
|
|
|
|
|
|
|
|
- User-defined reductions are supported only for trivial types.
|
|
|
|
|
|
|
|
- Nested parallelism: inner parallel regions are executed sequentially.
|
|
|
|
|
|
|
|
- Static linking of libraries containing device code is not supported yet.
|
|
|
|
|
|
|
|
- Automatic translation of math functions in target regions to device-specific
|
|
|
|
math functions is not implemented yet.
|
|
|
|
|
2019-02-06 04:38:36 +08:00
|
|
|
- Debug information for OpenMP target regions is supported, but sometimes it may
|
|
|
|
be required to manually specify the address class of the inspected variables.
|
|
|
|
In some cases the local variables are actually allocated in the global memory,
|
|
|
|
but the debug info may be not aware of it.
|
2018-07-27 01:53:45 +08:00
|
|
|
|
2019-09-05 01:15:37 +08:00
|
|
|
|
|
|
|
.. _OpenMP implementation details:
|
|
|
|
|
|
|
|
OpenMP 5.0 Implementation Details
|
2019-11-07 02:25:16 +08:00
|
|
|
=================================
|
2019-09-05 01:15:37 +08:00
|
|
|
|
|
|
|
The following table provides a quick overview over various OpenMP 5.0 features
|
|
|
|
and their implementation status. Please contact *openmp-dev* at
|
|
|
|
*lists.llvm.org* for more information or if you want to help with the
|
|
|
|
implementation.
|
|
|
|
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
|Category | Feature | Status | Reviews |
|
|
|
|
+==============================+==============================================================+==========================+=======================================================================+
|
|
|
|
| loop extension | support != in the canonical loop form | :good:`done` | D54441 |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2019-12-12 13:59:46 +08:00
|
|
|
| loop extension | #pragma omp loop (directive) | :part:`worked on` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2019-11-05 23:13:16 +08:00
|
|
|
| loop extension | collapse imperfectly nested loop | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2019-09-11 22:44:30 +08:00
|
|
|
| loop extension | collapse non-rectangular nested loop | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2019-11-05 23:13:16 +08:00
|
|
|
| loop extension | C++ range-base for loop | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2019-12-17 02:25:58 +08:00
|
|
|
| loop extension | clause: if for SIMD directives | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-06-23 20:53:58 +08:00
|
|
|
| loop extension | inclusive scan extension (matching C++17 PSTL) | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2021-08-22 03:17:58 +08:00
|
|
|
| memory management | memory allocators | :good:`done` | r341687,r357929 |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2021-08-22 03:17:58 +08:00
|
|
|
| memory management | allocate directive and allocate clause | :good:`done` | r355614,r335952 |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| OMPD | OMPD interfaces | :part:`not upstream` | https://github.com/OpenMPToolsInterface/LLVM-openmp/tree/ompd-tests |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| OMPT | OMPT interfaces | :part:`mostly done` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| thread affinity extension | thread affinity extension | :good:`done` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| task extension | taskloop reduction | :good:`done` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2021-09-08 23:12:31 +08:00
|
|
|
| task extension | task affinity | :part:`not upstream` | https://github.com/jklinkenberg/openmp/tree/task-affinity |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| task extension | clause: depend on the taskwait construct | :part:`worked on` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-05-20 22:54:53 +08:00
|
|
|
| task extension | depend objects and detachable tasks | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| task extension | mutexinoutset dependence-type for tasks | :good:`done` | D53380,D57576 |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2019-12-05 04:11:19 +08:00
|
|
|
| task extension | combined taskloop constructs | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2019-10-11 04:15:54 +08:00
|
|
|
| task extension | master taskloop | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2019-11-05 23:13:16 +08:00
|
|
|
| task extension | parallel master taskloop | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2019-11-05 23:13:16 +08:00
|
|
|
| task extension | master taskloop simd | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2019-11-05 23:13:16 +08:00
|
|
|
| task extension | parallel master taskloop simd | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2019-12-05 23:44:07 +08:00
|
|
|
| SIMD extension | atomic and simd constructs inside SIMD code | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2019-12-25 01:39:59 +08:00
|
|
|
| SIMD extension | SIMD nontemporal | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | infer target functions from initializers | :part:`worked on` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | infer target variables from initializers | :part:`worked on` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | OMP_TARGET_OFFLOAD environment variable | :good:`done` | D50522 |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2019-12-05 04:11:19 +08:00
|
|
|
| device extension | support full 'defaultmap' functionality | :good:`done` | D69204 |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2019-12-05 04:11:19 +08:00
|
|
|
| device extension | device specific functions | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | clause: device_type | :good:`done` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-03-21 23:57:19 +08:00
|
|
|
| device extension | clause: extended device | :good:`done` | |
|
2020-03-19 22:16:08 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-05-20 22:54:53 +08:00
|
|
|
| device extension | clause: uses_allocators clause | :good:`done` | |
|
2020-03-19 22:16:08 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2019-11-08 00:07:56 +08:00
|
|
|
| device extension | clause: in_reduction | :part:`worked on` | r308768 |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | omp_get_device_num() | :part:`worked on` | D54342 |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | structure mapping of references | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | nested target declare | :good:`done` | D51378 |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | implicitly map 'this' (this[:1]) | :good:`done` | D55982 |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2021-04-16 06:17:26 +08:00
|
|
|
| device extension | allow access to the reference count (omp_target_is_present) | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-01-17 15:56:08 +08:00
|
|
|
| device extension | requires directive | :part:`partial` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-01-17 15:56:08 +08:00
|
|
|
| device extension | clause: unified_shared_memory | :good:`done` | D52625,D52359 |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | clause: unified_address | :part:`partial` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | clause: reverse_offload | :none:`unclaimed parts` | D52780 |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-02-14 04:42:39 +08:00
|
|
|
| device extension | clause: atomic_default_mem_order | :good:`done` | D53513 |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-05-20 22:54:53 +08:00
|
|
|
| device extension | clause: dynamic_allocators | :part:`unclaimed parts` | D53079 |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | user-defined mappers | :part:`worked on` | D56326,D58638,D58523,D58074,D60972,D59474 |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | mapping lambda expression | :good:`done` | D51107 |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-06-17 00:24:09 +08:00
|
|
|
| device extension | clause: use_device_addr for target data | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-01-17 15:56:08 +08:00
|
|
|
| device extension | support close modifier on map clause | :good:`done` | D55719,D55892 |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2021-01-07 03:16:25 +08:00
|
|
|
| device extension | teams construct on the host device | :part:`done` | r371553 |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-11-25 06:07:39 +08:00
|
|
|
| device extension | support non-contiguous array sections for target update | :good:`done` | |
|
2019-11-08 00:07:56 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-01-17 15:56:08 +08:00
|
|
|
| device extension | pointer attachment | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-08-05 20:53:58 +08:00
|
|
|
| device extension | map clause reordering based on map types | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-02-14 04:02:11 +08:00
|
|
|
| atomic extension | hints for the atomic construct | :good:`done` | D51233 |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-05-20 22:54:53 +08:00
|
|
|
| base language | C11 support | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-05-20 22:54:53 +08:00
|
|
|
| base language | C++11/14/17 support | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| base language | lambda support | :good:`done` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-04-01 07:12:52 +08:00
|
|
|
| misc extension | array shaping | :good:`done` | D74144 |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| misc extension | library shutdown (omp_pause_resource[_all]) | :none:`unclaimed parts` | D55078 |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2021-09-22 00:26:50 +08:00
|
|
|
| misc extension | metadirectives | :part:`worked on` | D91944 |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-02-04 01:43:40 +08:00
|
|
|
| misc extension | conditional modifier for lastprivate clause | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-04-08 03:44:15 +08:00
|
|
|
| misc extension | iterator and multidependences | :good:`done` | |
|
2020-01-17 15:56:08 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-03-12 01:26:01 +08:00
|
|
|
| misc extension | depobj directive and depobj dependency kind | :good:`done` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2021-09-22 00:26:50 +08:00
|
|
|
| misc extension | user-defined function variants | :part:`worked on` | D67294, D64095, D71847, D71830, D109635 |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-01-17 15:56:08 +08:00
|
|
|
| misc extension | pointer/reference to pointer based array reductions | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-06-19 01:29:33 +08:00
|
|
|
| misc extension | prevent new type definitions in clauses | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-02-14 04:02:11 +08:00
|
|
|
| memory model extension | memory model update (seq_cst, acq_rel, release, acquire,...) | :good:`done` | |
|
2019-09-05 01:15:37 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2019-12-12 13:59:46 +08:00
|
|
|
|
|
|
|
|
|
|
|
OpenMP 5.1 Implementation Details
|
|
|
|
=================================
|
|
|
|
|
|
|
|
The following table provides a quick overview over various OpenMP 5.1 features
|
|
|
|
and their implementation status, as defined in the technical report 8 (TR8).
|
|
|
|
Please contact *openmp-dev* at *lists.llvm.org* for more information or if you
|
|
|
|
want to help with the implementation.
|
|
|
|
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
|Category | Feature | Status | Reviews |
|
|
|
|
+==============================+==============================================================+==========================+=======================================================================+
|
2021-04-15 23:10:07 +08:00
|
|
|
| atomic extension | 'compare' clause on atomic construct | :good:`worked on` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| atomic extension | 'fail' clause on atomic construct | :none:`unclaimed` | |
|
2019-12-12 13:59:46 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2021-07-12 18:51:19 +08:00
|
|
|
| base language | C++ attribute specifier syntax | :good:`done` | D105648 |
|
2019-12-14 08:40:54 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-08-05 20:55:22 +08:00
|
|
|
| device extension | 'present' map type modifier | :good:`done` | D83061, D83062, D84422 |
|
2020-07-28 07:22:05 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-07-31 00:20:46 +08:00
|
|
|
| device extension | 'present' motion modifier | :good:`done` | D84711, D84712 |
|
2020-06-25 02:18:08 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2021-04-02 00:02:23 +08:00
|
|
|
| device extension | 'present' in defaultmap clause | :good:`done` | D92427 |
|
2020-11-19 00:43:43 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-08-05 20:53:58 +08:00
|
|
|
| device extension | map clause reordering reordering based on 'present' modifier | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2020-11-19 00:43:43 +08:00
|
|
|
| device extension | device-specific environment variables | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | omp_target_is_accessible routine | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | omp_get_mapped_ptr routine | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | new async target memory copy routines | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | thread_limit clause on target construct | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | has_device_addr clause on target construct | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | iterators in map clause or motion clauses | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | indirect clause on declare target directive | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | allow virtual functions calls for mapped object on device | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2021-04-03 03:28:33 +08:00
|
|
|
| device extension | interop construct | :part:`partial` | parsing/sema done: D98558, D98834, D98815 |
|
2020-11-19 00:43:43 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| device extension | assorted routines for querying interoperable properties | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2021-03-06 07:22:42 +08:00
|
|
|
| loop extension | Loop tiling transformation | :good:`done` | D76342 |
|
2020-11-19 00:43:43 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2021-08-04 07:07:59 +08:00
|
|
|
| loop extension | Loop unrolling transformation | :good:`done` | D99459 |
|
2020-11-19 00:43:43 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| loop extension | 'reproducible'/'unconstrained' modifiers in 'order' clause | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2021-04-03 03:28:33 +08:00
|
|
|
| memory management | alignment extensions for allocate directive and clause | :part:`worked on` | |
|
2020-11-19 00:43:43 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| memory management | new memory management routines | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| memory management | changes to omp_alloctrait_key enum | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| memory model extension | seq_cst clause on flush construct | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| misc extension | 'omp_all_memory' keyword and use in 'depend' clause | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| misc extension | error directive | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| misc extension | scope construct | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| misc extension | routines for controlling and querying team regions | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| misc extension | changes to ompt_scope_endpoint_t enum | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| misc extension | omp_display_env routine | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| misc extension | extended OMP_PLACES syntax | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| misc extension | OMP_NUM_TEAMS and OMP_TEAMS_THREAD_LIMIT env vars | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| misc extension | 'target_device' selector in context specifier | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| misc extension | begin/end declare variant | :good:`done` | D71179 |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2021-04-03 03:28:33 +08:00
|
|
|
| misc extension | dispatch construct and function variant argument adjustment | :part:`worked on` | D99537, D99679 |
|
2020-11-19 00:43:43 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| misc extension | assume and assumes directives | :part:`worked on` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| misc extension | nothing directive | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
2021-04-16 06:17:26 +08:00
|
|
|
| misc extension | masked construct and related combined constructs | :part:`worked on` | D99995, D100514 |
|
2020-11-19 00:43:43 +08:00
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| misc extension | default(firstprivate) & default(private) | :part:`partial` | firstprivate done: D75591 |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| other | deprecating master construct | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| OMPT | new barrier types added to ompt_sync_region_t enum | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| OMPT | async data transfers added to ompt_target_data_op_t enum | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| OMPT | new barrier state values added to ompt_state_t enum | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| OMPT | new 'emi' callbacks for external monitoring interfaces | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| task extension | 'strict' modifier for taskloop construct | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| task extension | inoutset in depend clause | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
| task extension | nowait clause on taskwait | :none:`unclaimed` | |
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
[OpenMP][OpenACC] Implement `ompx_hold` map type modifier extension in Clang (1/2)
This patch implements Clang support for an original OpenMP extension
we have developed to support OpenACC: the `ompx_hold` map type
modifier. The next patch in this series, D106510, implements OpenMP
runtime support.
Consider the following example:
```
#pragma omp target data map(ompx_hold, tofrom: x) // holds onto mapping of x
{
foo(); // might have map(delete: x)
#pragma omp target map(present, alloc: x) // x is guaranteed to be present
printf("%d\n", x);
}
```
The `ompx_hold` map type modifier above specifies that the `target
data` directive holds onto the mapping for `x` throughout the
associated region regardless of any `target exit data` directives
executed during the call to `foo`. Thus, the presence assertion for
`x` at the enclosed `target` construct cannot fail. (As usual, the
standard OpenMP reference count for `x` must also reach zero before
the data is unmapped.)
Justification for inclusion in Clang and LLVM's OpenMP runtime:
* The `ompx_hold` modifier supports OpenACC functionality (structured
reference count) that cannot be achieved in standard OpenMP, as of
5.1.
* The runtime implementation for `ompx_hold` (next patch) will thus be
used by Flang's OpenACC support.
* The Clang implementation for `ompx_hold` (this patch) as well as the
runtime implementation are required for the Clang OpenACC support
being developed as part of the ECP Clacc project, which translates
OpenACC to OpenMP at the directive AST level. These patches are the
first step in upstreaming OpenACC functionality from Clacc.
* The Clang implementation for `ompx_hold` is also used by the tests
in the runtime implementation. That syntactic support makes the
tests more readable than low-level runtime calls can. Moreover,
upstream Flang and Clang do not yet support OpenACC syntax
sufficiently for writing the tests.
* More generally, the Clang implementation enables a clean separation
of concerns between OpenACC and OpenMP development in LLVM. That
is, LLVM's OpenMP developers can discuss, modify, and debug LLVM's
extended OpenMP implementation and test suite without directly
considering OpenACC's language and execution model, which can be
handled by LLVM's OpenACC developers.
* OpenMP users might find the `ompx_hold` modifier useful, as in the
above example.
See new documentation introduced by this patch in `openmp/docs` for
more detail on the functionality of this extension and its
relationship with OpenACC. For example, it explains how the runtime
must support two reference counts, as specified by OpenACC.
Clang recognizes `ompx_hold` unless `-fno-openmp-extensions`, a new
command-line option introduced by this patch, is specified.
Reviewed By: ABataev, jdoerfert, protze.joachim, grokos
Differential Revision: https://reviews.llvm.org/D106509
2021-09-01 03:17:07 +08:00
|
|
|
|
|
|
|
OpenMP Extensions
|
|
|
|
=================
|
|
|
|
|
|
|
|
The following table provides a quick overview over various OpenMP
|
|
|
|
extensions and their implementation status. These extensions are not
|
|
|
|
currently defined by any standard, so links to associated LLVM
|
|
|
|
documentation are provided. As these extensions mature, they will be
|
|
|
|
considered for standardization. Please contact *openmp-dev* at
|
|
|
|
*lists.llvm.org* to provide feedback.
|
|
|
|
|
|
|
|
+------------------------------+---------------------------------------------------------------------------+--------------------------+--------------------------------------------------------+
|
|
|
|
|Category | Feature | Status | Reviews |
|
|
|
|
+==============================+===========================================================================+==========================+========================================================+
|
|
|
|
| device extension | `'ompx_hold' map type modifier | :good:`prototyped` | D106509, D106510 |
|
|
|
|
| | <https://openmp.llvm.org/docs/openacc/OpenMPExtensions.html#ompx-hold>`_ | | |
|
|
|
|
+------------------------------+---------------------------------------------------------------------------+--------------------------+--------------------------------------------------------+
|