llvm-project

Commit Graph

Author	SHA1	Message	Date
Jonas Hahnfeld	cc6d29d72c	[OMPT][test] Correct warning about added wrapper functions This affects all outlined functions, not just tasks! Only show warning when using Clang 5.0 or later. Differential Revision: https://reviews.llvm.org/D43190 llvm-svn: 325131	2018-02-14 15:15:24 +00:00
Joachim Protze	cfc98c2493	[OMPT] Add tool_available_search testcase Tests the search for tools as defined in the spec. The OMP_TOOL_LIBRARIES environment variable contains paths to the following files(in that order) -to a nonexisting file -to a shared library that does not have a ompt_start_tool function -to a shared library that has an ompt_start_tool implementation returning NULL -to a shared library that has an ompt_start_tool implementation returning a pointer to a valid instance of ompt_start_tool_result_t The expected result is that the last tool gets active and can print in the thread-begin callback. Differential Revision: https://reviews.llvm.org/D42166 llvm-svn: 324588	2018-02-08 10:04:33 +00:00
Joachim Protze	9440c0ee3c	[OMPT] Add tool_not_available testcase Add a testcase that checks wheter the runtime can handle an ompt_start_tool method that returns NULL indicating that no tool shall be loaded. All tool_available testcases need a separate folder to avoid file conflicts for the generated tools. Differential Revision: https://reviews.llvm.org/D41904 llvm-svn: 324587	2018-02-08 10:04:28 +00:00
Jonas Hahnfeld	723560d123	[OMPT] Use fuzzy return addresses in lock testcases Use fuzzy return addresses in lock testcases so that these testcases can also be run using the Intel Compiler. Patch by Simon Convent! Differential Revision: https://reviews.llvm.org/D41896 llvm-svn: 323529	2018-01-26 14:19:02 +00:00
Joachim Protze	0c9516b36c	[OMPT] Add Workaround for Intel Compiler Bug Add Workaround for Intel Compiler Bug with Case#: 03138964 A critical region within a nested task causes a segfault in icc 14-18: int main() { #pragma omp parallel num_threads(2) #pragma omp master #pragma omp task #pragma omp task #pragma omp critical printf("test\n"); } When the critical region is in a separate function, the segault does not occur. So we add noinline to make sure that the function call stays there. Differential Revision: https://reviews.llvm.org/D41182 llvm-svn: 322622	2018-01-17 10:06:06 +00:00
Joachim Protze	1dc2afdcaf	[OMPT] Return appropiate values for ompt runtime entry points for non-OpenMP threads When the current thread is not an (initialized) OpenMP thread, the runtime entry points return values that correspond to "not available" or similar Differential Revision: https://reviews.llvm.org/D41167 llvm-svn: 322620	2018-01-17 10:05:55 +00:00
Joachim Protze	1014a6b6c6	Missed to add new test case in previous commit llvm-svn: 322179	2018-01-10 12:52:34 +00:00
Jonas Hahnfeld	f34d65a164	[OMPT] Fix cast and printf of wait_id in lock test This didn't work on 32 bit platforms. Differential Revision: https://reviews.llvm.org/D41853 llvm-svn: 322160	2018-01-10 08:10:23 +00:00
Joachim Protze	265fb584a5	[OMPT] Set and reset frame address when creating a task with dependences As for normal task creation, the task frame addresses need to be stored for the encountering task. Differential Revision: https://reviews.llvm.org/D41165 llvm-svn: 321421	2017-12-24 07:30:23 +00:00
Paul Osmialowski	6b8141acdd	[OMPT] Add missing initialization in nested_lwt.c test case Without this initialization this test case tend to fail. Differential Revision: https://reviews.llvm.org/D41542 llvm-svn: 321379	2017-12-22 19:24:06 +00:00
Joachim Protze	9c9b61df7e	[OMPT] Fix failing test cases for gcc on Ubuntu The compiler warns that _BSD_SOURCE is deprecated and _DEFAULT_SOURCE should be used instead. We keep _BSD_SOURCE for older compilers, that don't know about _DEFAULT_SOURCE. The linker drops the tool when linking, since there is no visible need for the library. So we need to tell the linker, that the tool should be linked anyway. Differential Revision: https://reviews.llvm.org/D41499 llvm-svn: 321362	2017-12-22 16:40:32 +00:00
Joachim Protze	e8d84a67c2	Add missing test case from D41171 commit llvm-svn: 321270	2017-12-21 14:36:36 +00:00
Joachim Protze	f375f4b49a	[OMPT] Add missing ompt_get_num_procs function This function is defined in OpenMP-TR6 section 4.1.5.1.6 The functions was not implemented yet. Since ompt-functions can only be called after the runtime was initialized and has loaded a tool, it can assume the runtime to be initialized. In contrast to omp_get_num_procs which needs to check whether the runtime is initialized. Differential Revision: https://reviews.llvm.org/D40949 llvm-svn: 321269	2017-12-21 14:36:30 +00:00
Joachim Protze	f8d22f9db8	[OMPT] Fix return address handling in a few GOMP interface methods This revision fixes failing testcases with parallel for loops and the gomp interface. The return address needs to be stored at entry to runtime. The storage is cleared on usage, so we need to update the storage before calling again internal functions, that will trigger event callbacks. Differential Revision: https://reviews.llvm.org/D41181 llvm-svn: 321265	2017-12-21 13:55:39 +00:00
Joachim Protze	0e2a2571ca	[OMPT] Use frames at different level when using clang version 5 or higher with debug flag Clang 5 or higher adds an intermediate function call in certain cases when compiling with debug flag. This revision updates the testcases to work correctly. Differential Revision: https://reviews.llvm.org/D40595 llvm-svn: 321263	2017-12-21 13:55:29 +00:00
Joachim Protze	633bc4ca99	[OMPT] Add annotations to testcases that are expected to fail when using certain compilers Reasons for expected failures are mainly bugs when using lables in OpenMP regions or missing support of some OpenMP features. For some worksharing clauses, support to distinguish the kind of workshare was added just recently. If an issue was fixed in a minor release version of a compiler, we flag the test as unsupported for this compiler version to avoid false positives. Same for fixes that where backported to older compiler versions. Differential Revision: https://reviews.llvm.org/D40384 llvm-svn: 321262	2017-12-21 13:55:16 +00:00
Paul Osmialowski	17fb580c12	[AArch64] add required arch specific code for running OMPT test cases Differential Revision: https://reviews.llvm.org/D41482 llvm-svn: 321258	2017-12-21 12:33:31 +00:00
Jonas Hahnfeld	86c307821c	Add missing memory barrier for queuing locks Otherwise I see hangs in the omp_single_copyprivate test when compiling in release mode. With the debug assertions, I get a failure `head > 0 && tail > 0`. Differential Revision: https://reviews.llvm.org/D40722 llvm-svn: 320150	2017-12-08 15:07:02 +00:00
Jonas Hahnfeld	241d1d9e17	Fix alignment in teams-reduction.c test The runtime will use the global kmp_critical_name as a lock and tries to atomically store a pointer in there. This will fail if the global is only aligned by 4 bytes, the size of one int32_t element. Use a union to ensure the global is aligned to the size of a pointer on the current platform. llvm-svn: 319811	2017-12-05 18:45:21 +00:00
Jonas Hahnfeld	a4ca525c1b	Fix PR30890: Reduction across teams hangs __kmpc_reduce_nowait() correctly swapped the teams for reductions in a teams construct. Apply the same logic to __kmpc_reduce() and __kmpc_reduce_end(). Differential Revision: https://reviews.llvm.org/D40753 llvm-svn: 319788	2017-12-05 16:51:24 +00:00
Jonas Hahnfeld	fc473dee98	[CMake] Detect information about test compiler Perform a nested CMake invocation to avoid writing our own parser for compiler versions when we are not testing the in-tree compiler. Use the extracted information to mark a test as unsupported that hangs with Clang prior to version 4.0.1 and restrict tests for libomptarget to Clang version 6.0.0 and later. Differential Revision: https://reviews.llvm.org/D40083 llvm-svn: 319448	2017-11-30 17:08:31 +00:00
Jonas Hahnfeld	18bec60bc2	[CMake] Refactor testing infrastructure The code for the two OpenMP runtime libraries was very similar. Move to common CMake file that is included and provides a simple interface for adding testsuites. Also add a common check-openmp target that runs all testsuites that have been registered. Note that this renames all test options to the common OPENMP namespace, for example OPENMP_TEST_C_COMPILER instead of LIBOMP_TEST_COMPILER and so on. Differential Revision: https://reviews.llvm.org/D40082 llvm-svn: 319343	2017-11-29 19:31:52 +00:00
Jonas Hahnfeld	3e921d3c52	[CMake] Disallow direct configuration As a first step, this allows us to generalize the detection of standalone builds and make it fully compatible when building in llvm/runtimes/ which automatically sets OPENMP_STANDLONE_BUILD. Differential Revision: https://reviews.llvm.org/D40080 llvm-svn: 319341	2017-11-29 19:31:43 +00:00
Jonas Hahnfeld	221e7bb1fc	Fix for OMP doacross implementation on Power Power has a weak consistency model so we need memory barriers to make writes (both from runtime and from user code) available for all threads. Differential Revision: https://reviews.llvm.org/D40175 llvm-svn: 318848	2017-11-22 17:15:20 +00:00
Jonas Hahnfeld	0924094e34	[OMPT] Fix inaccuracies in worksharing tests These tests were failing rarely on my MacBook when there was some activity in the background. Read: one of a thousand executions? * sections.c missed the sorting based on thread ids. This worked as long as the master thread finished its section before the worker thread started the second one but failed if the master thread was put to sleep by the OS. * The checks in single.c assumed that the master thread executes the single region which works most of the time because it is usually faster than the newly spawned worker thread. Differential Revision: https://reviews.llvm.org/D39853 llvm-svn: 318527	2017-11-17 15:26:44 +00:00
Jonas Hahnfeld	d0ef19ef9b	[OMPT] Provide initialization for Mac OS X Traditionally, the library had a weak symbol for ompt_start_tool() that served as fallback and disabled OMPT if called. Tools could provide their own version and replace the default implementation to register callbacks and lookup functions. This mechanism has worked reasonably well on Linux systems where this interface was initially developed. On Darwin / Mac OS X the situation is a bit more complicated and the weak symbol doesn't work out-of-the-box. In my tests, the library with the tool needed to link against the OpenMP runtime to make the process work. This would effectively mean that a tool needed to choose a runtime library whereas one design goal of the interface was to allow tools that are agnostic of the runtime. The solution is to use dlsym() with the argument RTLD_DEFAULT so that static implementations of ompt_start_tool() are found in the main executable. This works because the linker on Mac OS X includes all symbols of an executable in the global symbol table by default. To use the same code path on Linux, the application would need to be built with -Wl,--export-dynamic. To avoid this restriction, we continue to use weak symbols on Linux systems as before. Finally this patch extends the existing test to cover all possible ways of initializing the tool as described by the standard. It also fixes ompt_finalize() to not call omp_get_thread_num() when the library is shut down which resulted in hangs on Darwin. The changes have been tested on Linux to make sure that it passes the current tests as well as the newly extended one. Differential Revision: https://reviews.llvm.org/D39801 llvm-svn: 317980	2017-11-11 13:59:48 +00:00
Jonas Hahnfeld	c60300333e	[OMPT] Fix test cancel_parallel.c If a parallel region is cancelled, execution resumes at the end of the structured block. That is why this test cannot use the "normal" macros that print right after inserting the label. Instead it previously printed the addresses before the pragma and swapped the checks compared to the other tests. However, this does not work because FileChecks '*' is greedy so that RETURN_ADDRESS always matched the second address. This makes the test fail when an "overflow" occurrs and the first address matches the value of codeptr_ra. I discovered this on my MacBook but I'm unable to reproduce the failure with the current version. Nevertheless we should fix this problem to avoid that this test fails later after an unrelated change. Differential Revision: https://reviews.llvm.org/D39708 llvm-svn: 317787	2017-11-09 14:26:14 +00:00
Jonas Hahnfeld	380346fce1	[OMPT] Add support for testing return addresses on POWER Return addresses are determined based on the address of a label that is inserted directly after a pragma / API call. In some cases the tests can assume a known number of instructions between the addresses. However, the instructions and their encoded lengths depend on the target that the test is compiled on. Firstly, this patch refactors the macro print_current_address() to allow such target dependent modifications and adds information for the observed instructions on POWER. Secondly, it adapts the related macro print_fuzzy_address() to reuse much of "hacky" code and fixes the used formatting strings in the printf() call. Finally, it also adds documentation about how these macros are intended to work. Differential Revision: https://reviews.llvm.org/D39699 llvm-svn: 317786	2017-11-09 14:26:12 +00:00
Jonas Hahnfeld	ba84ca9efb	[OMPT] Fix null pointer in parallel/no_thread_num_clause.c Looks like the implementation of printf on Darwin uses "0x0" instead of "(nil)" like glibc does. llvm-svn: 317515	2017-11-06 22:06:14 +00:00
Jonas Hahnfeld	dc5d849e2b	[OMPT] Fix callback.h for tests for changes in TR6 This was also lost in the last commit. llvm-svn: 317484	2017-11-06 15:13:06 +00:00
Joachim Protze	cab9cdc2ad	Updating implementation of OMPT as specified in OpenMP 5.0 Preview 2 (TR6) The TR6 document is expected to be publically released around November 15. This patch does not implement OMPT for libomptarget. Patch by Simon Convent and Joachim Protze Differential Revision: https://reviews.llvm.org/D39182 llvm-svn: 317436	2017-11-05 14:11:19 +00:00
Joachim Protze	c255ca70ce	Rename fields of ompt_frame_t This is part of the renaming of data types from OpenMP TR4 to TR6 Patch by Simon Convent Differential Revision: https://reviews.llvm.org/D39326 llvm-svn: 317435	2017-11-05 14:11:10 +00:00
Jonas Hahnfeld	b71424fda5	Revert "Rename fields of ompt_frame_t" This reverts commit r317338 which discarded some recent commits. llvm-svn: 317347	2017-11-03 18:28:25 +00:00
Jonas Hahnfeld	f0a1c65fb0	Revert "Updating implementation of OMPT as specified in OpenMP 5.0 Preview 2 (TR6)" This reverts commit r317339 which discarded some recent commits. llvm-svn: 317346	2017-11-03 18:28:19 +00:00
Joachim Protze	924cff0a39	Updating implementation of OMPT as specified in OpenMP 5.0 Preview 2 (TR6) The TR6 document is expected to be publically released around November 15. This patch does not implement OMPT for libomptarget. Patch by Simon Convent and Joachim Protze Differential Revision: https://reviews.llvm.org/D39182 llvm-svn: 317339	2017-11-03 17:09:00 +00:00
Joachim Protze	741572593f	Rename fields of ompt_frame_t This is part of the renaming of data types from OpenMP TR4 to TR6 Patch by Simon Convent Differential Revision: https://reviews.llvm.org/D39326 llvm-svn: 317338	2017-11-03 17:08:40 +00:00
Jonathan Peyton	3d18a37ca9	[OpenMP] Fix race condition in omp_init_lock This is a partial fix for bug 34050. This prevents callers of omp_set_lock (which does not hold __kmp_global_lock) from ever seeing an uninitialized version of __kmp_i_lock_table.table. It does not solve a use-after-free race condition if omp_set_lock obtains a pointer to __kmp_i_lock_table.table before it is updated and then attempts to dereference afterwards. That race is far less likely and can be handled in a separate patch. The unit test usually segfaults on the current trunk revision. It passes with the patch. Patch by Adam Azarchs Differential Revision: https://reviews.llvm.org/D39439 llvm-svn: 317115	2017-11-01 19:44:42 +00:00
Joachim Protze	82e94a5934	Update implementation of OMPT to the specification OpenMP 5.0 Preview 1 (TR4). The code is tested to work with latest clang, GNU and Intel compiler. The implementation is optimized for low overhead when no tool is attached shifting the cost to execution with tool attached. This patch does not implement OMPT for libomptarget. Patch by Simon Convent and Joachim Protze Differential Revision: https://reviews.llvm.org/D38185 llvm-svn: 317085	2017-11-01 10:08:30 +00:00
Jonathan Peyton	48db80cc6c	Add license envirable for testing Intel compilers Patch by Simon Convent Differential Revision: https://reviews.llvm.org/D38881 llvm-svn: 316232	2017-10-20 19:45:43 +00:00
Jonathan Peyton	16a05bca9c	Add C++ support for testcases Patch by Simon Convent Differential Revision: https://reviews.llvm.org/D38878 llvm-svn: 316230	2017-10-20 19:42:32 +00:00
Jonas Hahnfeld	5872f1e97f	[test] Fix uninitialized memory in omp_taskloop_grainsize.c result was never initialized to zero which sometimes failed the test. llvm-svn: 314513	2017-09-29 13:53:03 +00:00
Jonathan Peyton	f439246328	Fix implementation of OMP_THREAD_LIMIT This change fixes the implementation of OMP_THREAD_LIMIT. The implementation of this previously was not restricted to a contention group (but it should be, according to the spec), and this is fixed here. A field is added to root thread to store a counter of the threads in the contention group. An extra check is added when reserving threads for a parallel region that checks this variable and compares to threadlimit-var, which is implemented as a new global variable, kmp_cg_max_nth. Associated settings changes were also made, and clean up of comments that referred to OMP_THREAD_LIMIT, but should refer to the new KMP_DEVICE_THREAD_LIMIT (added in an earlier patch). Patch by Terry Wilmarth Differential Revision: https://reviews.llvm.org/D35912 llvm-svn: 309319	2017-07-27 20:58:41 +00:00
Jonathan Peyton	1c50ee64a2	Fix failing taskloop tests by omitting gcc We do not have GOMP interface support for taskloop yet. llvm-svn: 308351	2017-07-18 20:16:25 +00:00
Jonathan Peyton	93e17cfe6c	Add recursive task scheduling strategy to taskloop implementation Summary: Taskloop implementation is extended by using recursive task scheduling. Envirable KMP_TASKLOOP_MIN_TASKS added as a manual threshold for the user to switch from recursive to linear tasks scheduling. Details: * The calculations for the loop parameters are moved from __kmp_taskloop_linear upper level * Initial calculation is done in the __kmpc_taskloop, further range splitting is done in the __kmp_taskloop_recur. * Added threshold to switch from recursive to linear tasks scheduling; * One half of split range is scheduled as an internal task which just moves sub-range parameters to the stealing thread that continues recursive scheduling (if number of tasks still enough), the other half is processed recursively; * Internal task duplication routine fixed to assign parent task, that was not needed when all tasks were scheduled by same thread, but is needed now. Patch by Andrey Churbanov Differential Revision: https://reviews.llvm.org/D35273 llvm-svn: 308338	2017-07-18 18:50:13 +00:00
Hal Finkel	2bc3449d22	Make test/parallel/omp_nested.c not use so many threads I've found it very difficult to get test/parallel/omp_nested.c to pass consistently across my build environments. The problem is that it creates N^2 threads (it is testing nested parallel regions), and that often exceeds the thread limits on systems with many cores. We do raise the process limits in lit, and that often helps, but if running lit with a smaller number of threads or on a system where we're otherwise resource constrained, this particular test tends to fail (because the runtime cannot create a sufficient number of threads). This seems to work: if the maximum number of threads is more than some small number, then cap the number of threads used for the parallel region. The choice of 4 here is somewhat arbitrary. Differential Revision: https://reviews.llvm.org/D32033 llvm-svn: 306357	2017-06-27 03:04:25 +00:00
Andrey Churbanov	d454c73cc3	OpenMP 4.5: implemented support of schedule(simd:guided) and schedule(simd:runtime) - library part. Compiler generation should use newly introduced scheduling kinds kmp_sch_guided_simd = 46, kmp_sch_runtime_simd = 47, as parameters to __kmpc_dispatch_init_* entries. Differential Revision: https://reviews.llvm.org/D31602 llvm-svn: 304724	2017-06-05 17:17:33 +00:00
Jonathan Peyton	e3e2aaf68d	Fix for KMP_AFFINITY=disabled and KMP_TOPOLOGY_METHOD=hwloc With these settings, the create_hwloc_map() method was being called causing an assert(). After some consideration, it was determined that disabling affinity explicitly should just disable hwloc as well. i.e., KMP_AFFINITY overrides KMP_TOPOLOGY_METHOD. This lets the user know that the Hwloc mechanism is being ignored when KMP_AFFINITY=disabled. Differential Revision: https://reviews.llvm.org/D33208 llvm-svn: 304344	2017-05-31 20:35:22 +00:00
Olga Malysheva	80af9c081a	Test cancellation_for_sections.c expectedly fails on GCC llvm-svn: 299437	2017-04-04 14:39:52 +00:00
Olga Malysheva	dbdcfa127f	Reset cancellation status for 'parallel', 'sections' and 'for' constracts. Without this fix cancellation status for parallel, sections and for persists across construct boundaries. Differential Revision: https://reviews.llvm.org/D31419 llvm-svn: 299434	2017-04-04 13:56:50 +00:00
Andrey Churbanov	435b419d26	Fixed intermittent hang on tests with "target teams if(0)" construct with no parallel inside. Differential Revision: https://reviews.llvm.org/D29597 llvm-svn: 298373	2017-03-21 13:48:52 +00:00
Michal Gorny	018d13597a	[test] Try to link -latomic to provide atomics when available When using -rtlib=libgcc, the fallback implementation of __atomic_* builtins is provided via libatomic (included in GCC). However, neither GCC itself nor clang link libatomic implicitly, and it seems that GCC upstream expects projects to link it explicitly as necessary. Since compiler-rt provides __atomic_* builtins directly in the main library, check if they are provided by the default libraries first. If they are not, check if -latomic is available to provide them and add explicit -latomic for tests in this case. This fixes unresolved __atomic_load() references when running openmp tests on i386 with libgcc backend. Differential Revision: https://reviews.llvm.org/D30083 llvm-svn: 296183	2017-02-24 22:15:24 +00:00
Andrey Churbanov	72ba210916	Run-time library part of OpenMP 5.0 task reduction implementation. Added test kmp_task_reduction_nest.cpp which has an example of possible compiler codegen. Differential Revision: https://reviews.llvm.org/D29600 llvm-svn: 295343	2017-02-16 17:49:49 +00:00
Jonas Hahnfeld	479088eefa	Correct wrong comment in bug_nested_proxy_task.c The nested proxy task does not have dependencies. llvm-svn: 293472	2017-01-30 09:51:02 +00:00
Jonathan Peyton	7f976d556a	Fix memory error in case of reinit using kmp_set_defaults() for lock code. The lock tables were being reallocated if kmp_set_defaults() was called. In the env_init code it says that the user should be able to switch between different KMP_CONSISTENCY_CHECK values which is what this change enables. llvm-svn: 292349	2017-01-18 07:02:21 +00:00
Jonathan Peyton	a1234cf280	Enable omp_get_schedule() to return static steal type. As the code is now, calling omp_get_schedule() when OMP_SCHEDULE=static_steal will cause an assert. llvm-svn: 283576	2016-10-07 18:01:35 +00:00
Michal Gorny	3ccf825e22	[test] Support 'lit' executable name Support finding lit as plain 'lit', which is the name used by setup.py in LLVM's utils/lit. Differential Revision: https://reviews.llvm.org/D25072 llvm-svn: 282876	2016-09-30 16:56:16 +00:00
Michal Gorny	cd2bfb1e7c	Fix respecting LIBOMP_LLVM_LIT_EXECUTABLE as full path Fix lit search to correctly respect LIBOMP_LLVM_LIT_EXECUTABLE as full program path. The variable passed to find_program() is created by CMake as a cache variable, and therefore can be directly overriden by the user. Since this was the design of LIBOMP_LLVM_LIT_EXECUTABLE (as can be deduced from the error messages) and there is no other use of LIT_EXECUTABLE, remove the redundant variable and pass LIBOMP_LLVM_LIT_EXECUTABLE directly to find_program(). Furthermore, the previous code did not work since the HINTS argument specifies more search directories rather than expected full path. Quoting the CMake documentation: > 3. Search the paths specified by the HINTS option. These should be > paths computed by system introspection, such as a hint provided by > the location of another item already found. Hard-coded guesses should > be specified with the PATHS option. Differential Revision: https://reviews.llvm.org/D24710 llvm-svn: 281887	2016-09-19 06:55:56 +00:00
Jonas Hahnfeld	848d690697	[OMPT] fix task frame information for gomp interface Previous differencials D23305-D23310 changed task frame information management only for the kmp interface, but not for the whole gomp interface. This broke some testcases when building with gcc. This patch fixes the broken task frame information for the gomp interface. Patch by Joachim Protze! Differential Revision: https://reviews.llvm.org/D24502 llvm-svn: 281468	2016-09-14 13:59:39 +00:00
Jonas Hahnfeld	dd9a05d5d8	[OMPT] save exit address to lwt if available In case, the current team is a serialized team (lwt), the frame information should be written to this data structure. Before, nested serialized teams would overwrite the same task information. Patch by Joachim Protze! Differential Revision: https://reviews.llvm.org/D23310 llvm-svn: 281467	2016-09-14 13:59:31 +00:00
Jonas Hahnfeld	28ea24bba7	[OMPT] fix __ompt_get_teaminfo to consult lwt entries of parent teams The comment already states, that this function should work similarly as __ompt_get_taskinfo. The function only looked for lwt entries of the current team, but not when unrolling the parents. This fix aligns the implementation to __ompt_get_taskinfo. The new test case creates a single theaded team (->lwt) and then a nested active team. Before the innermost print_id(1) would deliver a different team then the outer print_id(0). Patch by Joachim Protze! Differential Revision: https://reviews.llvm.org/D23309 llvm-svn: 281466	2016-09-14 13:59:24 +00:00
Jonas Hahnfeld	8a27064e05	[OMPT] Reset task exit frame when execution is finished The exit address is set when execution of a task is started and should be reset as soon as the execution is finished. Especially for the asm implementation of __kmp_invoke_microtask, resetting in this call would be painfull, so reset just after the invokation. The testcase shows the effect of this patch: Before, the implicit barriers at the end of an implicit task would see an exit address for the implicit task. This barrier is a task scheduling point. Thus, any explicit task scheduled there would see an exit, but no reenter address for the implicit task. Patch by Joachim Protze! Differential Revision: https://reviews.llvm.org/D23307 llvm-svn: 281465	2016-09-14 13:59:19 +00:00
Jonas Hahnfeld	fd0614d830	[OMPT] Align implementation of reenter frame address to latest (frozen) version of OMPT spec The latest OMPT spec changed the semantic of a tasks reenter frame to be the application frame, that will be entered, when the runtime frame drops. Before it was the last frame in the runtime. This doesn't work for some gcc execution pathes or even clang generated code for : Since there is no runtime frame between the executed task and the encountering task. The test case compares exit and reenter addresses against addresses captured in application code Patch by Joachim Protze! Differential Revision: https://reviews.llvm.org/D23305 llvm-svn: 281464	2016-09-14 13:59:13 +00:00
Jonas Hahnfeld	464cdca9d3	[OMPT] extend ompt tests by checks for frame pointers OMPT tests can check for right frame information of tasks: * parent_task_frame was directly printed as a pointer, but actually points to a struct ompt_frame {void, void} * NULL is printed in the beginning of execution and loaded to FileChecker variable [[NULL]] * implicit tasks now also print their frame information * macro to print frame address from application * print task info for barrier begin Patch by Joachim Protze! Differential Revision: https://reviews.llvm.org/D23304 llvm-svn: 281463	2016-09-14 13:59:05 +00:00
Jonathan Peyton	0af717970c	Appease older gcc compilers for the many-microtask-args.c test Older gcc compilers error out with the C99 syntax of: for (int i =...) so this change just moves the int i; declaration up above. llvm-svn: 280138	2016-08-30 19:28:58 +00:00
Dimitry Andric	70ba8c506c	Fix linking of omp_foreign_thread_team_reuse test on FreeBSD Summary: On FreeBSD, linking the misc_bugs/omp_foreign_thread_team_reuse.c test case fails with: /usr/local/bin/ld: /tmp/omp_foreign_thread_team_reuse-c5e71b.o: undefined reference to symbol 'pthread_create@@FBSD_1.0' This is because the program is linked without `-lpthread`. Since the %libomp-compile-and-run macro does not allow that option to be added to the compile command line, split it up and add the required `-lpthread` between %libomp-compile and %libomp-run. Reviewers: jlpeyton, hfinkel, Hahnfeld Subscribers: Hahnfeld, emaste, openmp-commits Differential Revision: https://reviews.llvm.org/D23084 llvm-svn: 278036	2016-08-08 18:34:05 +00:00
Jonas Hahnfeld	ad0c42e3a9	kmp_gsupport: Fix library initialization with taskgroup Differential Revision: https://reviews.llvm.org/D23259 llvm-svn: 278003	2016-08-08 13:23:08 +00:00
Jonas Hahnfeld	ca32babfa7	Mark tests with task dependencies as unsupported with GCC llvm-svn: 277996	2016-08-08 11:52:49 +00:00
Jonas Hahnfeld	bedc371c9d	Do not block on explicit task depending on proxy task Consider the following code: int dep; #pragma omp target nowait depend(out: dep) { sleep(1); } #pragma omp task depend(in: dep) { printf("Task with dependency\n"); } printf("Doing some work...\n"); In its current state the runtime will block on the second task and not continue execution. Differential Revision: https://reviews.llvm.org/D23116 llvm-svn: 277992	2016-08-08 10:08:14 +00:00
Jonas Hahnfeld	69f8511f8f	__kmp_free_task: Fix for serial explicit tasks producing proxy tasks Consider the following code which may be executed by a serial team: int dep; #pragma omp target nowait depend(out: dep) { sleep(1); } #pragma omp task depend(in: dep) { #pragma omp target nowait { sleep(1); } } Here the explicit task may not be freed until the nested proxy task has finished. The current code hasn't considered this and called __kmp_free_task anyway which triggered an assert because of remaining incomplete children: KMP_DEBUG_ASSERT( TCR_4(taskdata->td_incomplete_child_tasks) == 0 ); Differential Revision: https://reviews.llvm.org/D23115 llvm-svn: 277991	2016-08-08 10:08:07 +00:00
Jonas Hahnfeld	d1f4b8f6e8	Add test case for nested creation of tasks For discussion in D23115 llvm-svn: 277730	2016-08-04 14:55:56 +00:00
Jonathan Peyton	741b70926f	Fix the nowait tests for omp for and omp single These tests are now modeled after the sections nowait test where threads wait to be released in the first construct (either for or single) and the last thread skips the last for/single construct and releases those threads. If the test fails, then it hangs because an unnecessary barrier is executed in between the constructs. llvm-svn: 274641	2016-07-06 17:26:12 +00:00
Jonathan Peyton	fdcca8cd55	Fix omp_sections_nowait.c test to address Bugzilla Bug 28336 This rewrite of the omp_sections_nowait.c test file causes it to hang if the nowait is not respected. If the nowait isn't respected, the lone thread which can escape the first sections construct will just sleep at a barrier which shouldn't exist. All reliance on timers is taken out. For good measure, the test makes sure that all eight sections are executed as well. The test should take no longer than a few seconds on any modern machine. Differential Revision: http://reviews.llvm.org/D21842 llvm-svn: 274151	2016-06-29 19:46:52 +00:00
Jonathan Peyton	ac7ba406ed	Fix bugs in TAS and futex lock * Incorrect lock value written in __kmp_test_futex_lock * Incorrect lock value check in tas/futex lock with USE_LOCK_PROFILE on Patch by Hansang Bae llvm-svn: 274053	2016-06-28 19:37:24 +00:00
Jonathan Peyton	e119e8e5b5	Remove redundant %libomp-compile step from test/lock/omp_lock.c llvm-svn: 273576	2016-06-23 16:18:59 +00:00
Jonathan Peyton	9d2412c9e5	Apply the KMP_USE_FUTEX feature macro everywhere llvm-svn: 273438	2016-06-22 16:35:12 +00:00
Jonathan Peyton	c76f9f0df8	Bug fix for hang when tasks used in nested parallel Bug fix for hang when omp task and nested parallelism used together. Still some problem remains with task state saving/restoring, but user's case works fine now. All tasking unit tests passed as well. Patch by Andrey Churbanov Differential Revision: http://reviews.llvm.org/D21558 llvm-svn: 273297	2016-06-21 19:12:07 +00:00
Jonathan Peyton	61fdddfd64	Revert accidental commit to lit.cfg llvm-svn: 272287	2016-06-09 18:29:36 +00:00
Jonathan Peyton	c4c722ac0d	Refactor __kmp_execute_tasks_template function Refactored __kmp_execute_tasks_template to shorten and remove code redundancy. The original code for __kmp_execute_tasks_template was very redundant with large sections of repeated code that needed to be kept consistent, and goto statements that made the control flow difficult to discern. This refactoring removes all gotos and redundancy. Patch by Terry Wilmarth Differential Revision: http://reviews.llvm.org/D20879 llvm-svn: 272286	2016-06-09 18:27:03 +00:00
Jonathan Peyton	067325f935	Offer API for setting number of loop dispatch buffers The problem is the lack of dispatch buffers when thousands of loops with nowait, about 10 iterations each, are executed by hundreds of threads. We only have built-in 7 dispatch buffers, but there is a need in dozens or hundreds of buffers. The problem can be fixed by setting KMP_MAX_DISP_BUF to bigger value. In order to give users same possibility I changed build-time control into run-time one, adding API just in case. This change adds an environment variable KMP_DISP_NUM_BUFFERS and a new API function kmp_set_disp_num_buffers(int num_buffers). The KMP_DISP_NUM_BUFFERS envirable works only before serial initialization, because during the serial initialization we already allocate buffers for the hot team, so it is too late to change the number of buffers later (or we need to reallocate buffers for all teams which sounds too complicated). The kmp_set_defaults() routine does not work for this envirable, because it calls serial initialization before reading the parameter string. So a new routine, kmp_set_disp_num_buffers(), is created so that it can set our internal global variable before the library initialization. If both the envirable and API used the envirable wins. Differential Revision: http://reviews.llvm.org/D20697 llvm-svn: 271318	2016-05-31 19:01:15 +00:00
Hal Finkel	0a665a83da	Add a test case for microtask dispatch with many arguments This is a cleaned-up version of the test case posted in the D19879 review. llvm-svn: 270867	2016-05-26 16:34:05 +00:00
Jonathan Peyton	1ab887d403	Allow unit testing on Windows These changes allow testing on Windows using clang.exe. There are two main changes: 1. Only link to -lm when it actually exists on the system 2. Create basic versions of pthread_create() and pthread_join() for windows. They are not POSIX compliant by any stretch but will allow any existing and future tests to use pthread_create() and pthread_join() for testing interactions of libomp with os threads. Differential Revision: http://reviews.llvm.org/D20391 llvm-svn: 270464	2016-05-23 17:50:32 +00:00
Jonathan Peyton	aa7d2d781b	Remove unnecessary unistd.h header from tests. llvm-svn: 269987	2016-05-18 21:36:34 +00:00
Jonathan Peyton	3731076997	Remove trailing whitespace from tests llvm-svn: 269841	2016-05-17 21:08:52 +00:00
Jonathan Peyton	0e8f053023	[OpenMP Testing] Have lit.py be a valid lit executable Users can use either llvm-lit (generated during llvm build) or lit.py which exists in llvm/utils/lit. llvm-svn: 269774	2016-05-17 15:12:11 +00:00
Jonathan Peyton	f83ae31caf	Adding new kmp_aligned_malloc() entry point This change adds a new entry point, kmp_aligned_malloc(size_t size, size_t alignment), an entry point corresponding to kmp_malloc() but with the capability to return aligned memory as well. Other allocator routines have been adjusted so that kmp_free() can be used for freeing memory blocks allocated by any kmp_*alloc() routine, including the new kmp_aligned_malloc() routine. Differential Revision: http://reviews.llvm.org/D19814 llvm-svn: 269365	2016-05-12 22:00:37 +00:00
Jonathan Peyton	2b749b33cc	Fix team reuse with foreign threads After hot teams were enabled by default, the library started using levels kept in the team structure. The levels are broken in case foreign thread exits and puts its team into the pool which is then re-used by another foreign thread. The broken behavior observed is when printing the levels for each new team, one gets 1, 2, 1, 2, 1, 2, etc. This makes the library believe that every other team is nested which is incorrect. What is wanted is for the levels to be 1, 1, 1, etc. Differential Revision: http://reviews.llvm.org/D19980 llvm-svn: 269363	2016-05-12 21:54:30 +00:00
Jonathan Peyton	5235a1b603	Fix trip count calculation for parallel loops in runtime The trip count calculation was incorrect for loops with large bounds. For example, for(int i=-2,000,000,000; i < 2,000,000,000; i+=50000000), the trip count calculation had overflow (trying to calculate 2,000,000,000 + 2,000,000,000 with signed integers) and wasn't giving the right value. This patch fixes this error in the runtime by using unsigned integers instead. There is still a bug in the clang compiler component because it warns that there is overflow in the test case file when there isn't. This error isn't there for the Intel Compiler. So for now, the test case is designated as XFAIL. Differential Revision: http://reviews.llvm.org/D19078 llvm-svn: 266677	2016-04-18 21:38:29 +00:00
Jonathan Peyton	377aa40d84	Exponential back off logic for test-and-set lock This change adds back off logic in the test and set lock for better contended lock performance. It uses a simple truncated binary exponential back off function. The default back off parameters are tuned for x86. The main back off logic has a two loop structure where each is controlled by a user-level parameter: max_backoff - limits the outer loop number of iterations. This parameter should be a power of 2. min_ticks - the inner spin wait loop number of "ticks" which is system dependent and should be tuned for your system if you so choose. The "ticks" on x86 correspond to the time stamp counter, but on other architectures ticks is a timestamp derived from gettimeofday(). The user can modify these via the environment variable: KMP_SPIN_BACKOFF_PARAMS=max_backoff[,min_ticks] Currently, since the default user lock is a queuing lock, one would have to also specify KMP_LOCK_KIND=tas to use the test-and-set locks. Differential Revision: http://reviews.llvm.org/D19020 llvm-svn: 266329	2016-04-14 16:00:37 +00:00
Jonathan Peyton	50e8f18b52	OMP_WAIT_POLICY changes This change has OMP_WAIT_POLICY=active to mean that threads will busy-wait in spin loops and virtually never go to sleep. OMP_WAIT_POLICY=passive now means that threads will immediately go to sleep inside a spin loop. KMP_BLOCKTIME was the previous mechanism to specify this behavior via KMP_BLOCKTIME=0 or KMP_BLOCKTIME=infinite, but the standard OpenMP environment variable should also be able to specify this behavior. Differential Revision: http://reviews.llvm.org/D18577 llvm-svn: 265339	2016-04-04 19:38:32 +00:00
Jonas Hahnfeld	e46a494a50	[OMPT] Fix parallel_id and task_id in loop_end with schedule static For serialized parallel regions, wrong ids were reported. Now the same code is used as in kmp_dispatch.cpp which emits the correct ids. Differential Revision: http://reviews.llvm.org/D18348 llvm-svn: 264266	2016-03-24 12:52:20 +00:00
Jonas Hahnfeld	801fe9bbe2	[OMPT] Test ids reported by ompt_get_{parallel,task}_id llvm-svn: 264265	2016-03-24 12:52:11 +00:00
Jonas Hahnfeld	1c1c71776a	[OMPT] Fix duplicate implicit_task_end events for master thread with GCC For non-serialized parallel regions the master thread issued two callbacks: The first one in kmp_gsupport.c and the second in __kmp_join_call. Therefore only trigger the callback in kmp_gsupport.c for serialized parallel regions. Differential Revision: http://reviews.llvm.org/D16716 llvm-svn: 264264	2016-03-24 12:52:04 +00:00
Jonas Hahnfeld	b1cad2954b	[OMPT] Make tests require OMPT_BLAME ompt_event_barrier_{begin,end} are optional blame events. In total it doesn't make any sense to test partially built OMPT support. llvm-svn: 264031	2016-03-22 08:23:24 +00:00
Jonas Hahnfeld	c804301113	[OMPT] Create infrastructure and add first tests for OMPT Some basic checks next to the implementation should futher lower the possibility to introduce regressions. (Note that this would have catched the ordering issue fixed in rL258866 and pointed to rL263940.) The tests are implementation dependent in one point because they assume that thread ids are assigned in ascending order. This is not defined by the standard but currently ensured in libomp. We have to think about another way of ordering the threads should this ever be subject to change... Note that this isn't aiming at replacing the implementation independent test-suite at https://github.com/OpenMPToolsInterface/ompt-test-suite! Differential Revision: http://reviews.llvm.org/D16715 llvm-svn: 264027	2016-03-22 07:22:49 +00:00
Jonathan Peyton	283a215c7a	Add new OpenMP 4.5 taskloop construct feature From the standard: The taskloop construct specifies that the iterations of one or more associated loops will be executed in parallel using OpenMP tasks. The iterations are distributed across tasks created by the construct and scheduled to be executed. This initial implementation uses a simple linear tasks distribution algorithm. Later we can add other algorithms to speedup generation of huge number of tasks (i.e., tree-like tasks generation should be faster). This needs to be put into the OpenMP runtime library in order for the compiler team to develop the compiler side of the implementation. Differential Revision: http://reviews.llvm.org/D17404 llvm-svn: 262535	2016-03-02 22:47:51 +00:00
Jonathan Peyton	a0d7a2cd3f	Forgot to add test files for doacross and task priority. llvm-svn: 262533	2016-03-02 22:43:14 +00:00
Jonathan Peyton	2851072d69	Add initial support for OpenMP 4.5 task priority feature The maximum task priority value is read from envirable: OMP_MAX_TASK_PRIORITY. But as of now, nothing is done with it. We just handle the environment variable and add the new api: omp_get_max_task_priority() which returns that value or zero if it is not set. Differential Revision: http://reviews.llvm.org/D17411 llvm-svn: 261908	2016-02-25 18:04:09 +00:00
Jonas Hahnfeld	66594990b1	[CMake] Introduce OPENMP_LLVM_TOOLS_DIR This will be used in a later patch to find additional LLVM tools for tests and enables reusability for libomptarget that is currently under review. Differential Revision: http://reviews.llvm.org/D16713 llvm-svn: 259876	2016-02-05 07:00:13 +00:00
Andrey Churbanov	24d4eba0f9	omp_barrier.c test fixed in order to reliably and faster run on any number of processors llvm-svn: 258695	2016-01-25 16:52:10 +00:00
Hans Wennborg	464307ffe7	lit.cfg: Pass -isysroot to the SDK on Darwin Newly-built Clangs don't automatically find the SDK, and newer versions of Mac OS X don't provide it under /usr/include etc. llvm-svn: 258169	2016-01-19 19:26:43 +00:00

1 2 3 4

159 Commits