llvm-project

Commit Graph

Author	SHA1	Message	Date
Shilei Tian	458db51c10	[OpenMP] Add missing `tt_hidden_helper_task_encountered` along with `tt_found_proxy_tasks` In most cases, hidden helper task behave similar as detached tasks. That means, for example, if we have to wait for detached tasks, we have to do the same thing for hidden helper tasks as well. This patch adds the missing condition for hidden helper task accordingly along with detached task. Reviewed By: AndreyChurbanov Differential Revision: https://reviews.llvm.org/D107316	2021-12-29 23:22:53 -05:00
Jonathan Peyton	6a556ecaf4	[OpenMP][libomp] Add use-all syntax to KMP_HW_SUBSET This patch allows the user to request all resources of a particular layer (or core-attribute). The syntax of KMP_HW_SUBSET is modified so the number of units requested is optional or can be replaced with an '' character. e.g., KMP_HW_SUBSET=c:intel_atom@3 will use all the cores after offset 3 e.g., KMP_HW_SUBSET=c:intel_core will use all the big cores e.g., KMP_HW_SUBSET=s,c,1t will use all the sockets, all cores per each socket and 1 thread per core. Differential Revision: https://reviews.llvm.org/D115826	2021-12-20 13:45:21 -06:00
Jonathan Peyton	9769340905	[OpenMP][libomp] Fix compile errors with new KMP_HW_SUBSET changes Add missing guards around x86-specific code. Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D115664	2021-12-14 08:33:05 +01:00
John Ericson	ddcc02dbcc	Quote some more destination paths with variables Just defensive CMake-ing. I pulled this from D115544 and D99484 which are blocked on some lldb CI failures I don't yet understand. Hoping to land something smaller in the meantime. Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D115566	2021-12-13 17:29:08 +00:00
Med Ismail Bennani	30fc88bf1d	Revert "Revert "Revert "Use `GNUInstallDirs` to support custom installation dirs. -- LLVM""" This reverts commit `492de35df4`. I tried to apply John's changes in `8d897ec915` that were expected to fix his patch but that didn't work unfortunately. Reverting this again to fix the macOS bots and leave him more time to investigate the issue.	2021-12-10 17:33:54 -08:00
John Ericson	492de35df4	Revert "Revert "Use `GNUInstallDirs` to support custom installation dirs. -- LLVM"" This reverts commit `797b50d4be`. See the original D99484. @mib who noticed the original problem could not longer reproduce it, after I tried and also failed. We are threfore hoping it went away on its own! Reviewed By: mib Differential Revision: https://reviews.llvm.org/D115544	2021-12-10 20:59:43 +00:00
Jonathan Peyton	df20599597	[OpenMP][libomp] Add core attributes to KMP_HW_SUBSET Allow filtering of resources based on core attributes. There are two new attributes added: 1) Core Type (intel_atom, intel_core) 2) Core Efficiency (integer) where the higher the efficiency, the more performant the core On hybrid architectures , e.g., Alder Lake, users can specify KMP_HW_SUBSET=4c:intel_atom,4c:intel_core to select the first four Atom and first four Big cores. The can also use the efficiency syntax. e.g., KMP_HW_SUBSET=2c:eff0,2c:eff1 Differential Revision: https://reviews.llvm.org/D114901	2021-12-10 14:34:33 -06:00
AndreyChurbanov	1031e43052	[OpenMP] libomp: fix Fortran header: lines exceeded 72-char length Added line continuation to two long lines in Fortran header. Differential Revision: https://reviews.llvm.org/D114537	2021-12-10 16:23:21 +03:00
AndreyChurbanov	4dd8fccb71	[OpenMP] libomp: Fix crash if application send us negative thread_limit value Regardless that specification requires thread_limit to be positive, it is better to warn user instead of crash in case the value is negative. Differential Revision: https://reviews.llvm.org/D115340	2021-12-08 19:02:57 +03:00
Kazushi (Jam) Marukawa	5e2358c781	[runtimes][openmp] Change to not treat ARCH-unknown-linux-gnu as errors When OpenMP is compiled as a part runtimes for multiple targets, openmp is compiled under build/runtimes/runtimes-arch-unknown-linux-gnu-bins directory. Old implementation treats this directory name as errors. This patch adds a guard like "[Uu]known[^-]". Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D114346	2021-12-01 08:33:37 +09:00
Alexey Bataev	80256605f8	[OpenMP] support depend clause for taskwait directive, by Deepak Eachempati. This patch adds clang (parsing, sema, serialization, codegen) support for the 'depend' clause on the 'taskwait' directive. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D113540	2021-11-19 06:30:17 -08:00
Peyton, Jonathan L	a733b18bdb	[OpenMP][libomp] Enable HWLOC topology detection of multiple CPU kinds Teach the HWLOC topology method how to detect Atom and Core types so hybrid CPUs are properly detected and represented when using the HWLOC topology method. Differential Revision: https://reviews.llvm.org/D112270	2021-11-17 16:30:18 -06:00
Peyton, Jonathan L	286094af9b	[OpenMP][libomp] Improve Windows Processor Group handling within topology The current implementation of Windows Processor Groups has a separate topology method to handle them. This patch deprecates that specific method and uses the regular CPUID topology method by default and inserts the Windows Processor Group objects in the topology manually. Notes: * The preference for processor groups is lowered to a value less than socket so that the user will see sockets in the KMP_AFFINITY=verbose output instead of processor groups when sockets=processor groups. * The topology's capacity is modified to handle additional topology layers without the need for reallocation. * If a user asks for a granularity setting that is "above" the processor group layer, then the granularity is adjusted "down" to the processor group since this is the coarsest layer available for threads. Differential Revision: https://reviews.llvm.org/D112273	2021-11-17 16:29:01 -06:00
Peyton, Jonathan L	1dd797168e	[OpenMP][libomp] Add support for offline CPUs in Linux If some CPUs are offline, then make sure they are not included in the fullMask even if norespect is given to KMP_AFFINITY. Differential Revision: https://reviews.llvm.org/D112274	2021-11-17 16:28:01 -06:00
Peyton, Jonathan L	a0afb9d0fc	[OpenMP][libomp] Allow users to specify KMP_HW_SUBSET in any order Remove restriction forcing users to specify the KMP_HW_SUBSET value in topology order. This patch sorts the user KMP_HW_SUBSET value before trying to apply it. For example: 1s,4c,2t is equivalent to 2t,1s,4c Differential Revision: https://reviews.llvm.org/D112027	2021-11-17 15:27:37 -06:00
Jonathan Peyton	c46becf500	[OpenMP][libomp][NFC] Remove non-ASCII apostrophe in comment	2021-11-17 14:46:40 -06:00
Martin Storsjö	9b2b549837	[OpenMP] Silence build warnings when built with MinGW There's an attempt to upstream this change in https://github.com/intel/ittapi/pull/25 too. Differential Revision: https://reviews.llvm.org/D114069	2021-11-17 18:51:18 +02:00
Nawrin Sultana	7a5680233e	[OpenMP] Set default blocktime to 0 for hybrid cpu Differential Revision:https://reviews.llvm.org/D113012	2021-11-12 12:05:35 -06:00
Bran Hagger	9f15cacc2e	[OpenMP] Allow building libomp using Microsoft Visual C++ naming scheme Differential Revision: https://reviews.llvm.org/D110354	2021-11-11 13:11:56 -06:00
Joachim Protze	52da6f562e	Revert "[openmp] Add OMPT initialization in libomptarget" Reverting initial OMPT for target implementation in favor of a different implementation. This reverts commit `3bc8ce5dd7`.	2021-11-10 12:44:25 +01:00
Jonathan Peyton	48b67dca2c	[OpenMP][libomp][CMake] use uppercase_CMAKE_BUILD_TYPE Have standalone builds define uppercase_CMAKE_BUILD_TYPE and use it. llvm/CMakeLists.txt defines uppercase_CMAKE_BUILD_TYPE for regular LLVM builds with OpenMP enabled. Differential Revision: https://reviews.llvm.org/D112951	2021-11-09 11:03:04 -06:00
Quinn Pham	c3b15b71ce	[NFC] Inclusive Language: change master to main for .chm files [NFC] As part of using inclusive language within the llvm project, this patch replaces master with main when referring to `.chm` files. Reviewed By: teemperor Differential Revision: https://reviews.llvm.org/D113299	2021-11-08 08:23:04 -06:00
@t-msn	0808d956c4	[OpenMP] libomp: Fix handling of barrier pattern environment variables It is better to set all barrier patterns to use "dist" when at least one environment variable specifies "dist". Otherwise if only one environment is set to "dist" and others left blank inadvertently, it would result in mixing dist barrier with default hyper barrier pattern. Differential Revision: https://reviews.llvm.org/D112597	2021-11-08 15:01:26 +03:00
Med Ismail Bennani	797b50d4be	Revert "Use `GNUInstallDirs` to support custom installation dirs. -- LLVM" This reverts commit `6fd2db04d0` since it broke GreenDragon LLDB-Incremental bot: https://green.lab.llvm.org/green/job/lldb-cmake/37560/console Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2021-11-02 19:11:44 +01:00
John Ericson	6fd2db04d0	Use `GNUInstallDirs` to support custom installation dirs. -- LLVM This is a new draft of D28234. I previously did the unorthodox thing of pushing to it when I wasn't the original author, but since this version - Uses `GNUInstallDirs`, rather than mimics it, as the original author was hesitant to do but others requested. - Is much broader, effecting many more projects than LLVM itself. I figured it was time to make a new revision. I am using this patch (and many back-ports) as the basis of https://github.com/NixOS/nixpkgs/pull/111487 for my distro (NixOS). It looked like people were generally on board in D28234, but I make note of this here in case extra motivation is useful. --- As pointed out in the original issue, a central tension is that LLVM already has some partial support for these sorts of things. For example `LLVM_LIBDIR_SUFFIX`, or `COMPILER_RT_INSTALL_PATH`. Because it's not quite clear yet what to do about those, we are holding off on changing libdirs and `compiler-rt`. for this initial PR. --- On the advice of @lebedev.ri, I am splitting this up a bit per subproject, starting with LLVM. To allow it to be more easily reviewed. This and the subsequent patch must be landed together, as this will not build alone. But the rest can be landed on their own. Reviewed By: compnerd Differential Revision: https://reviews.llvm.org/D100810	2021-11-02 10:23:30 -04:00
AndreyChurbanov	a64797b5b8	[OpenMP][NFC] disable test on power because of -mlong-double-80 option	2021-10-27 16:54:44 +03:00
AndreyChurbanov	c704b25b44	[OpenMP] libomp: Fix possible NULL dereference. According to dlsym description, the value of symbol could be NULL, and there is no error in this case. Thus dlerror will also return NULL in this case. We need to check the value returned by dlerror before printing it. Differential Revision: https://reviews.llvm.org/D112174	2021-10-27 16:54:44 +03:00
AndreyChurbanov	e38a1deb66	[OpenMP] libomp: disable definitions of 5.1 atomics for non-x86 arch. Declarations of 5.1 atomic entries were added under "#if KMP_ARCH_X86 \|\| KMP_ARCH_X86_64" in kmp_atomic.h, but definitions of the functions missed architecture guard in kmp_atomic.cpp. As a result mangled symbols were available on non-x86 architecture. The patch eliminates these unexpected symbols from the library. Differential Revision: https://reviews.llvm.org/D112261	2021-10-25 21:17:26 +03:00
Vladimir Inđić	f41d08540b	[OpenMP][OMPT] thread_num determination during execution of nested serialized parallel regions __ompt_get_task_info_internal function is adapted to support thread_num determination during the execution of multiple nested serialized parallel regions enclosed by a regular parallel region. Consider the following program that contains parallel region R1 executed by two threads. Let the worker thread T of region R1 executes serialized parallel regions R2 that encloses another serialized parallel region R3. Note that the thread T is the master thread of both R2 and R3 regions. Assume that __ompt_get_task_info_internal function is called with the argument "ancestor_level == 1" during the execution of region R3. The function should determine the "thread_num" of the thread T inside the team of region R2, whose implicit task is at level 1 inside the hierarchy of active tasks. Since the thread T is the master thread of region R2, one should expected that "thread_num" takes a value 0. After the while loop finishes, the following stands: "lwt != NULL", "prev_lwt == NULL", "prev_team" represents the team information about the innermost serialized parallel region R3. This results in executing the assignment "thread_num = prev_team->t.t_master_tid". Note that "prev_team->t.t_master_tid" was initialized at the moment of R2’s creation and represents the "thread_num" of the thread T inside the region R1 which encloses R2. Since the thread T is the worker thread of the region R1, "the thread_num" takes value 1, which is a contradiction. This patch proposes to use "lwt" instead of "prev_lwt" when determining the "thread_num". If "lwt" exists, the task at the requested level belongs to the serialized parallel region. Since the serialized parallel region is executed by one thread only, the "thread_num" takes value 0. Similarly, assume that __ompt_get_task_info_internal function is called with the argument "ancestor_level == 2" during the execution of region R3. The function should determine the "thread_num" of the thread T inside the team of region R1. Since the thread is the worker inside the region R1, one should expected that "thread_num" takes value 1. After the loop finishes, the following stands: "lwt == NULL", "prev_lwt != NULL", "prev_team" represents the team information about the innermost serialized parallel region R3. This leads to execution of the assignment "thread_num = 0", which causes a contradiction. Ignoring the "prev_lwt" leads to executing the assignment "thread_num = prev_team->t.t_master_tid" instead. From the previous explanation, it is obvious that "thread_num" takes value 1. Note that the "prev_lwt" variable is marked as unnecessary and thus removed. This patch introduces the test case which represents the OpenMP program described earlier in the summary. Differential Revision: https://reviews.llvm.org/D110699	2021-10-25 18:21:20 +02:00
Vladimir Inđić	f2410bfb1c	[OpenMP][OMPT][clang] task frame support fixed in __kmpc_fork_call __kmp_fork_call sets the enter_frame of the active task (th_curren_task) before new parallel region begins. After the region is finished, the enter_frame is cleared. The old implementation of __kmpc_fork_call didn’t clear the enter_frame of active task. Also, the way of initializing the enter_frame of the active task was wrong. Consider the following two OpenMP programs. The first program: Let R1 be the serialized parallel region that encloses another serialized parallel region R2. Assume that thread that executes R2 is going to create a new serialized parallel region R3 by executing __kmpc_fork_call. This thread is responsible to set enter_frame of R2's implicit task. Note that the information about R2's implicit task is present inside master_th->th.th_current_task at this moment, while lwt represents the information about R1's implicit task. The old implementation uses lwt and resets enter_frame of R1's implicit task instead of R2's implicit task. The new implementation uses master_th->th.th_current_task instead. The second program: Consider the OpenMP program that contains parallel region R1 which encloses an explicit task T. Assume that thread should create another parallel region R2 during the execution of the T. The __kmpc_fork_call is responsible to create R2 and set enter frame of T whose information is present inside the master_th->th.th_current_task. Old implementation tries to set the frame of parent_team->t.t_implicit_task_taskdata[tid] which corresponds to the implicit task of the R1, instead of T. Differential Revision: https://reviews.llvm.org/D112419	2021-10-25 18:21:19 +02:00
Joachim Protze	7368227965	[OpenMP][Tests] Test omp_get_wtime for invariants As discussed in D108488, testing for invariants of omp_get_wtime would be more reliable than testing for duration of sleep, as return from sleep might be delayed due to system load. Alternatively/in addition, we could compare the time measured by omp_get_wtime to time measured with C++11 chrono (for portability?). Differential Revision: https://reviews.llvm.org/D112458	2021-10-25 18:20:59 +02:00
Joachim Protze	3f229f42b7	[OpenMP][Tests][NFC] Actually check for test outcome The CHECK: line in the test had no effect, because the test does not pipe to FileCheck. Since the test only checks for a single value, encode the result in the return value of the test.	2021-10-25 18:20:12 +02:00
Joachim Protze	047890bc3f	[OpenMP][Tests][NFC] Mark tests trying to link COI as unsupported For some tests with target-related functionality icc 18/19 tries to link libioffload_target.so.5, which fails for missing COI symbols.	2021-10-25 18:20:12 +02:00
Joachim Protze	d7fdd236d5	[OpenMP][Tests][NFC] Replace atomic increment by reduction Also mark the test as unsupported by intel-21, because the test does not terminate	2021-10-25 18:20:12 +02:00
Joachim Protze	38f78dd2e2	[OpenMP][Tools][NFC] Fix C99-style declaration of iteration variables Where possible change to declare the variable before the loop. Where not possible, specifically request -std=c99 (could be limited to specific compilers like icc).	2021-10-25 18:20:12 +02:00
Vladimir Inđić	ba02586fbe	[OpenMP][OMPT][GOMP] task frame support in KMP_API_NAME_GOMP_PARALLEL_SECTIONS KMP_API_NAME_GOMP_PARALLEL_SECTIONS function was missing the task frame support. This patch introduced a fix responsible to set properly the exit_frame of the innermost implicit task that corresponds to the parallel section construct, as well as the enter_frame of the task that encloses the mentioned implicit task. This patch also introduced a simple test case sections_serialized.c that contains serialized parallel section construct and validates whether the mentioned task frames are set correctly. Differential Revision: https://reviews.llvm.org/D112205	2021-10-22 11:01:10 -05:00
AndreyChurbanov	52f4922ebb	[OpenMP][NFC] skip atomic tests for non-x86 arch	2021-10-21 21:51:33 +03:00
Nawrin Sultana	99d1ce4a62	[OpenMP] Add GOMP allocator functions This patch adds GOMP_alloc and GOMP_free functions of LIBGOMP. Differential revision: https://reviews.llvm.org/D111673	2021-10-20 11:37:29 -05:00
AndreyChurbanov	63f8099e23	[OpenMP] libomp: add check of task function pointer for NULL. This patch allows to simplify compiler implementation on "taskwait nowait" construct. The "taskwait nowait" is semantically equivalent to the empty task. Instead of creating an empty routine as a task entry, compiler can just send NULL pointer to the runtime. Then the runtime will make all the work with dependences and return because of the absent task routine. Differential Revision: https://reviews.llvm.org/D112015	2021-10-18 19:48:30 +03:00
@vladaindjic	59a994e8da	[OpenMP][OMPT] thread_num determination for programs with explicit tasks __ompt_get_task_info_internal is now able to determine the right value of the “thread_num” argument during the execution of an explicit task. During the execution of a while loop that iterates over the ancestor tasks hierarchy, the “prev_team” variable was always set to “team” variable at the beginning of each loop iteration. Assume that the program contains a parallel region which encloses an explicit task executed by the worker thread of the region. Also assume that the tool inquires the “thread_num” of a worker thread for the implicit task that corresponds to the region (task at “ancestor_level == 1”) and expects to receive the value of “thread_num > 0”. After the loop finishes, both “team” and “prev_team” variables are equal and point to the team information of the parallel region. The “thread_num” is set to “prev_team->t.t_master_tid”, that is equal to “team->t.t_master_tid”. In this case, “team->t.t_master_tid” is 0, since the master thread of the region is the initial master thread of the program. This leads to a contradiction. To prevent this, “prev_team” variable is set to “team” variable only at the time when the loop that has already encountered the implicit task (“taskdata” variable contains the information about an implicit task) continues iterating over the implicit task’s ancestors, if any. After the mentioned loop finishes, the “prev_team” variable might be equal to NULL. This means that the task at requested “ancestor_level” belongs to the innermost parallel region, so the “thread_num” will be determined by calling the “__kmp_get_tid”. To prove that this patch works, the test case “explicit_task_thread_num.c” is provided. It contains the example of the program explained earlier in the summary. Differential Revision: https://reviews.llvm.org/D110473	2021-10-18 13:54:22 +02:00
Joachim Protze	c93fb143b9	[OpenMP][Tests][NFC] Work around ICC bug Older intel compilers miss the privatization of nested loop variables for doacross loops. Declaring the variable in the loop makes the test more robust.	2021-10-18 13:54:15 +02:00
Joachim Protze	5918688248	[OpenMP][Tests][NFC] Flagging OMPT tests as XFAIL for Intel compilers With Intel 19 compiler the teams tests fail to link while trying to link liboffload.	2021-10-18 13:50:03 +02:00
Peyton, Jonathan L	acb3b187c4	[OpenMP][host runtime] Add initial hybrid CPU support Detect, through CPUID.1A, and show user different core types through KMP_AFFINITY=verbose mechanism. Offer future runtime optimizations __kmp_is_hybrid_cpu() to know whether running on a hybrid system or not. Differential Revision: https://reviews.llvm.org/D110435	2021-10-14 16:49:42 -05:00
Peyton, Jonathan L	b840d3ab0d	[OpenMP][host runtime] small fixup of RTM CPUID bit check	2021-10-14 16:49:42 -05:00
Peyton, Jonathan L	50b68a3d03	[OpenMP][host runtime] Add support for teams affinity This patch implements teams affinity on the host. The default is spread. A user can specify either spread, close, or primary using KMP_TEAMS_PROC_BIND environment variable. Unlike OMP_PROC_BIND, KMP_TEAMS_PROC_BIND is only a single value and is not a list of values. The values follow the same semantics under the OpenMP specification for parallel regions except T is the number of teams in a league instead of the number of threads in a parallel region. Differential Revision: https://reviews.llvm.org/D109921	2021-10-14 16:30:28 -05:00
AndreyChurbanov	621d7a75b1	[OpenMP] libomp: add atomic functions for new OpenMP 5.1 atomics. Added functions those implement "atomic compare". Though clang does not use library interfaces to implement OpenMP atomics, the functions added for consistency. Also added missed functions for 80-bit floating min/max atomics. Differential Revision: https://reviews.llvm.org/D110109	2021-10-13 21:02:18 +03:00
AndreyChurbanov	6e98ec9b20	[OpenMP] libomp: fix ittnotify usage. Replaced storing of ittnotify domain array index into location info structure (which is now read-only) with storing of (location info address + ittnotify domain + team size) into hash map. Replaced __kmp_itt_barrier_domains and __kmp_itt_imbalance_domains arrays with __kmp_itt_barrier_domains hash map; __kmp_itt_region_domains and __kmp_itt_region_team_size arrays with __kmp_itt_region_domains hash map. Basic functionality did not change (at least tried to not change). The patch fixes https://bugs.llvm.org/show_bug.cgi?id=48644. Differential Revision: https://reviews.llvm.org/D111580	2021-10-13 20:49:05 +03:00
AndreyChurbanov	5e58b63b28	[OpenMP] libomp: fix warning on comparison of integer expressions of different signedness Replaced macro with global variable of correspondent type. Differential Revision: https://reviews.llvm.org/D111562	2021-10-13 20:11:47 +03:00
AndreyChurbanov	f5c0c9179f	[OpenMP] libomp: add OpenMP 5.1 memory allocation routines. Aligned allocation routines added. Fortran interfaces added for all allocation routines. Differential Revision: https://reviews.llvm.org/D110923	2021-10-11 19:25:00 +03:00
Martin Storsjö	dec2257f35	[openmp] Fix a typo in a test REQUIRES line Differential Revision: https://reviews.llvm.org/D110963	2021-10-03 23:51:11 +03:00

1 2 3 4 5 ...

1183 Commits