forked from OSchip/llvm-project
381 lines
51 KiB
ReStructuredText
381 lines
51 KiB
ReStructuredText
.. raw:: html
|
|
|
|
<style type="text/css">
|
|
.none { background-color: #FFCCCC }
|
|
.part { background-color: #FFFF99 }
|
|
.good { background-color: #CCFF99 }
|
|
</style>
|
|
|
|
.. role:: none
|
|
.. role:: part
|
|
.. role:: good
|
|
|
|
.. contents::
|
|
:local:
|
|
|
|
OpenMP Support
|
|
==============
|
|
|
|
Clang fully supports OpenMP 4.5. Clang supports offloading to X86_64, AArch64,
|
|
PPC64[LE] and has `basic support for Cuda devices`_.
|
|
|
|
* #pragma omp declare simd: :part:`Partial`. We support parsing/semantic
|
|
analysis + generation of special attributes for X86 target, but still
|
|
missing the LLVM pass for vectorization.
|
|
|
|
In addition, the LLVM OpenMP runtime `libomp` supports the OpenMP Tools
|
|
Interface (OMPT) on x86, x86_64, AArch64, and PPC64 on Linux, Windows, and macOS.
|
|
|
|
For the list of supported features from OpenMP 5.0 see `OpenMP implementation details`_.
|
|
|
|
General improvements
|
|
====================
|
|
- New collapse clause scheme to avoid expensive remainder operations.
|
|
Compute loop index variables after collapsing a loop nest via the
|
|
collapse clause by replacing the expensive remainder operation with
|
|
multiplications and additions.
|
|
|
|
- The default schedules for the `distribute` and `for` constructs in a
|
|
parallel region and in SPMD mode have changed to ensure coalesced
|
|
accesses. For the `distribute` construct, a static schedule is used
|
|
with a chunk size equal to the number of threads per team (default
|
|
value of threads or as specified by the `thread_limit` clause if
|
|
present). For the `for` construct, the schedule is static with chunk
|
|
size of one.
|
|
|
|
- Simplified SPMD code generation for `distribute parallel for` when
|
|
the new default schedules are applicable.
|
|
|
|
- When using the collapse clause on a loop nest the default behavior
|
|
is to automatically extend the representation of the loop counter to
|
|
64 bits for the cases where the sizes of the collapsed loops are not
|
|
known at compile time. To prevent this conservative choice and use
|
|
at most 32 bits, compile your program with the
|
|
`-fopenmp-optimistic-collapse`.
|
|
|
|
.. _basic support for Cuda devices:
|
|
|
|
Cuda devices support
|
|
====================
|
|
|
|
Directives execution modes
|
|
--------------------------
|
|
|
|
Clang code generation for target regions supports two modes: the SPMD and
|
|
non-SPMD modes. Clang chooses one of these two modes automatically based on the
|
|
way directives and clauses on those directives are used. The SPMD mode uses a
|
|
simplified set of runtime functions thus increasing performance at the cost of
|
|
supporting some OpenMP features. The non-SPMD mode is the most generic mode and
|
|
supports all currently available OpenMP features. The compiler will always
|
|
attempt to use the SPMD mode wherever possible. SPMD mode will not be used if:
|
|
|
|
- The target region contains user code (other than OpenMP-specific
|
|
directives) in between the `target` and the `parallel` directives.
|
|
|
|
Data-sharing modes
|
|
------------------
|
|
|
|
Clang supports two data-sharing models for Cuda devices: `Generic` and `Cuda`
|
|
modes. The default mode is `Generic`. `Cuda` mode can give an additional
|
|
performance and can be activated using the `-fopenmp-cuda-mode` flag. In
|
|
`Generic` mode all local variables that can be shared in the parallel regions
|
|
are stored in the global memory. In `Cuda` mode local variables are not shared
|
|
between the threads and it is user responsibility to share the required data
|
|
between the threads in the parallel regions.
|
|
|
|
|
|
Features not supported or with limited support for Cuda devices
|
|
---------------------------------------------------------------
|
|
|
|
- Cancellation constructs are not supported.
|
|
|
|
- Doacross loop nest is not supported.
|
|
|
|
- User-defined reductions are supported only for trivial types.
|
|
|
|
- Nested parallelism: inner parallel regions are executed sequentially.
|
|
|
|
- Static linking of libraries containing device code is not supported without
|
|
explicitly using ``-fopenmp-new-driver``.
|
|
|
|
- Automatic translation of math functions in target regions to device-specific
|
|
math functions is not implemented yet.
|
|
|
|
- Debug information for OpenMP target regions is supported, but sometimes it may
|
|
be required to manually specify the address class of the inspected variables.
|
|
In some cases the local variables are actually allocated in the global memory,
|
|
but the debug info may be not aware of it.
|
|
|
|
|
|
.. _OpenMP implementation details:
|
|
|
|
OpenMP 5.0 Implementation Details
|
|
=================================
|
|
|
|
The following table provides a quick overview over various OpenMP 5.0 features
|
|
and their implementation status. Please contact *openmp-dev* at
|
|
*lists.llvm.org* for more information or if you want to help with the
|
|
implementation.
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|Category | Feature | Status | Reviews |
|
|
+==============================+==============================================================+==========================+=======================================================================+
|
|
| loop extension | support != in the canonical loop form | :good:`done` | D54441 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| loop extension | #pragma omp loop (directive) | :part:`worked on` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| loop extension | collapse imperfectly nested loop | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| loop extension | collapse non-rectangular nested loop | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| loop extension | C++ range-base for loop | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| loop extension | clause: if for SIMD directives | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| loop extension | inclusive scan extension (matching C++17 PSTL) | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| memory management | memory allocators | :good:`done` | r341687,r357929 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| memory management | allocate directive and allocate clause | :good:`done` | r355614,r335952 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| OMPD | OMPD interfaces | :part:`not upstream` | https://github.com/OpenMPToolsInterface/LLVM-openmp/tree/ompd-tests |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| OMPT | OMPT interfaces | :part:`mostly done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| thread affinity extension | thread affinity extension | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| task extension | taskloop reduction | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| task extension | task affinity | :part:`not upstream` | https://github.com/jklinkenberg/openmp/tree/task-affinity |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| task extension | clause: depend on the taskwait construct | :part:`mostly done` | D113540 (regular codegen only) |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| task extension | depend objects and detachable tasks | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| task extension | mutexinoutset dependence-type for tasks | :good:`done` | D53380,D57576 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| task extension | combined taskloop constructs | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| task extension | master taskloop | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| task extension | parallel master taskloop | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| task extension | master taskloop simd | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| task extension | parallel master taskloop simd | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| SIMD extension | atomic and simd constructs inside SIMD code | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| SIMD extension | SIMD nontemporal | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | infer target functions from initializers | :part:`worked on` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | infer target variables from initializers | :part:`worked on` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | OMP_TARGET_OFFLOAD environment variable | :good:`done` | D50522 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | support full 'defaultmap' functionality | :good:`done` | D69204 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | device specific functions | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | clause: device_type | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | clause: extended device | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | clause: uses_allocators clause | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | clause: in_reduction | :part:`worked on` | r308768 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | omp_get_device_num() | :part:`worked on` | D54342 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | structure mapping of references | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | nested target declare | :good:`done` | D51378 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | implicitly map 'this' (this[:1]) | :good:`done` | D55982 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | allow access to the reference count (omp_target_is_present) | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | requires directive | :part:`partial` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | clause: unified_shared_memory | :good:`done` | D52625,D52359 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | clause: unified_address | :part:`partial` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | clause: reverse_offload | :none:`unclaimed parts` | D52780 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | clause: atomic_default_mem_order | :good:`done` | D53513 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | clause: dynamic_allocators | :part:`unclaimed parts` | D53079 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | user-defined mappers | :part:`worked on` | D56326,D58638,D58523,D58074,D60972,D59474 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | mapping lambda expression | :good:`done` | D51107 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | clause: use_device_addr for target data | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | support close modifier on map clause | :good:`done` | D55719,D55892 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | teams construct on the host device | :part:`done` | r371553 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | support non-contiguous array sections for target update | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | pointer attachment | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | map clause reordering based on map types | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| atomic extension | hints for the atomic construct | :good:`done` | D51233 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| base language | C11 support | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| base language | C++11/14/17 support | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| base language | lambda support | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | array shaping | :good:`done` | D74144 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | library shutdown (omp_pause_resource[_all]) | :none:`unclaimed parts` | D55078 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | metadirectives | :part:`worked on` | D91944 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | conditional modifier for lastprivate clause | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | iterator and multidependences | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | depobj directive and depobj dependency kind | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | user-defined function variants | :part:`worked on` | D67294, D64095, D71847, D71830, D109635 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | pointer/reference to pointer based array reductions | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | prevent new type definitions in clauses | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| memory model extension | memory model update (seq_cst, acq_rel, release, acquire,...) | :good:`done` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
|
|
OpenMP 5.1 Implementation Details
|
|
=================================
|
|
|
|
The following table provides a quick overview over various OpenMP 5.1 features
|
|
and their implementation status, as defined in the technical report 8 (TR8).
|
|
Please contact *openmp-dev* at *lists.llvm.org* for more information or if you
|
|
want to help with the implementation.
|
|
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|Category | Feature | Status | Reviews |
|
|
+==============================+==============================================================+==========================+=======================================================================+
|
|
| atomic extension | 'compare' clause on atomic construct | :good:`worked on` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| atomic extension | 'fail' clause on atomic construct | :part:`worked on` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| base language | C++ attribute specifier syntax | :good:`done` | D105648 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | 'present' map type modifier | :good:`done` | D83061, D83062, D84422 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | 'present' motion modifier | :good:`done` | D84711, D84712 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | 'present' in defaultmap clause | :good:`done` | D92427 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | map clause reordering reordering based on 'present' modifier | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | device-specific environment variables | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | omp_target_is_accessible routine | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | omp_get_mapped_ptr routine | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | new async target memory copy routines | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | thread_limit clause on target construct | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | has_device_addr clause on target construct | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | iterators in map clause or motion clauses | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | indirect clause on declare target directive | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | allow virtual functions calls for mapped object on device | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | interop construct | :part:`partial` | parsing/sema done: D98558, D98834, D98815 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| device extension | assorted routines for querying interoperable properties | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| loop extension | Loop tiling transformation | :good:`done` | D76342 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| loop extension | Loop unrolling transformation | :good:`done` | D99459 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| loop extension | 'reproducible'/'unconstrained' modifiers in 'order' clause | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| memory management | alignment extensions for allocate directive and clause | :part:`worked on` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| memory management | new memory management routines | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| memory management | changes to omp_alloctrait_key enum | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| memory model extension | seq_cst clause on flush construct | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | 'omp_all_memory' keyword and use in 'depend' clause | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | error directive | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | scope construct | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | routines for controlling and querying team regions | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | changes to ompt_scope_endpoint_t enum | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | omp_display_env routine | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | extended OMP_PLACES syntax | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | OMP_NUM_TEAMS and OMP_TEAMS_THREAD_LIMIT env vars | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | 'target_device' selector in context specifier | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | begin/end declare variant | :good:`done` | D71179 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | dispatch construct and function variant argument adjustment | :part:`worked on` | D99537, D99679 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | assume and assumes directives | :part:`worked on` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | nothing directive | :part:`worked on` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | masked construct and related combined constructs | :part:`worked on` | D99995, D100514 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| misc extension | default(firstprivate) & default(private) | :part:`partial` | firstprivate done: D75591 |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| other | deprecating master construct | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| OMPT | new barrier types added to ompt_sync_region_t enum | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| OMPT | async data transfers added to ompt_target_data_op_t enum | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| OMPT | new barrier state values added to ompt_state_t enum | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| OMPT | new 'emi' callbacks for external monitoring interfaces | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| task extension | 'strict' modifier for taskloop construct | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| task extension | inoutset in depend clause | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
| task extension | nowait clause on taskwait | :none:`unclaimed` | |
|
|
+------------------------------+--------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------+
|
|
|
|
OpenMP Extensions
|
|
=================
|
|
|
|
The following table provides a quick overview over various OpenMP
|
|
extensions and their implementation status. These extensions are not
|
|
currently defined by any standard, so links to associated LLVM
|
|
documentation are provided. As these extensions mature, they will be
|
|
considered for standardization. Please contact *openmp-dev* at
|
|
*lists.llvm.org* to provide feedback.
|
|
|
|
+------------------------------+---------------------------------------------------------------------------+--------------------------+--------------------------------------------------------+
|
|
|Category | Feature | Status | Reviews |
|
|
+==============================+===========================================================================+==========================+========================================================+
|
|
| device extension | `'ompx_hold' map type modifier | :good:`prototyped` | D106509, D106510 |
|
|
| | <https://openmp.llvm.org/docs/openacc/OpenMPExtensions.html#ompx-hold>`_ | | |
|
|
+------------------------------+---------------------------------------------------------------------------+--------------------------+--------------------------------------------------------+
|