llvm-project

Commit Graph

Author	SHA1	Message	Date
George Rokos	2467df6e4f	[OpenMP] Initial implementation of OpenMP offloading library - libomptarget. This is the patch upstreaming the device-agnostic part of libomptarget. Differential Revision: https://reviews.llvm.org/D14031 llvm-svn: 293094	2017-01-25 21:27:24 +00:00
Jonathan Peyton	3692fcf665	Use C++11 static_assert() for build asserts. llvm-svn: 292350	2017-01-18 07:49:30 +00:00
Jonathan Peyton	7f976d556a	Fix memory error in case of reinit using kmp_set_defaults() for lock code. The lock tables were being reallocated if kmp_set_defaults() was called. In the env_init code it says that the user should be able to switch between different KMP_CONSISTENCY_CHECK values which is what this change enables. llvm-svn: 292349	2017-01-18 07:02:21 +00:00
Jonathan Peyton	d0365a228c	Fix small memory leak regarding __kmp_nested_proc_bind There is no corresponding free() for this expandable array. The logic is added in __kmp_cleanup() next to the freeing of __kmp_nested_nth. llvm-svn: 292348	2017-01-18 06:40:19 +00:00
Jonas Hahnfeld	c9a8a6c030	kmp_affinity: Fix check if specific bit is set Clang 4.0 trunk warns: warning: logical not is only applied to the left hand side of this bitwise operator [-Wlogical-not-parentheses] This points to a potential bug if the code really wants to check if the single bit is not set: If for example (buf.edx >> 9) = 2 (has any bit set except the least significant one), 'logical not' will return 0 which stays 0 after the 'bitwise and'. To do this correctly we first need to evaluate the 'bitwise and'. In that case it returns 2 & 1 = 0 which after the 'logical not' evaluates to 1. Differential Revision: https://reviews.llvm.org/D28599 llvm-svn: 291764	2017-01-12 11:39:04 +00:00
Jonas Hahnfeld	49152b3f06	[CMake] Make openmp build under runtimes/ runtimes/CMakeLists.txt in LLVM passes OPENMP_STANDALONE_BUILD. Differential Revision: https://reviews.llvm.org/D28280 llvm-svn: 290978	2017-01-04 18:11:37 +00:00
Andrey Churbanov	76d4285460	Fix for the __kmpc_global_num_threads function to return the value of the __kmp_all_nth global var. Patch by Yonghong Yan. Differential Revision: https://reviews.llvm.org/D27975 llvm-svn: 290272	2016-12-21 21:20:20 +00:00
Oren Ben Simhon	c11addb506	Reverting last change. llvm-svn: 290245	2016-12-21 09:04:08 +00:00
Oren Ben Simhon	016f2af3c7	[X86] Vectorcall Calling Convention - Adding CodeGen Complete Support Fixing build issues. llvm-svn: 290242	2016-12-21 08:58:19 +00:00
Jonathan Peyton	de4749b748	Follow up to r289732: Update comments in source files to reference .cpp files Patch by Hansang Bae llvm-svn: 289739	2016-12-14 23:01:24 +00:00
Jonathan Peyton	7cc577a4ef	Change source files from .c to .cpp Patch by Hansang Bae Differential Revision: https://reviews.llvm.org/D26688 llvm-svn: 289732	2016-12-14 22:39:11 +00:00
Andrey Churbanov	5dee8c43da	Cleanup: debug print fixed and moved inside critical section. Patch by Victor Campos. Differential Revision: https://reviews.llvm.org/D27647 llvm-svn: 289640	2016-12-14 08:29:00 +00:00
Sylvestre Ledru	cd9d374337	Support of mips & mips64 for openmprtl Summary: Implemented by Dejan Latinovic See https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=790735 for more information Reviewers: AndreyChurbanov, jlpeyton Subscribers: openmp-commits, mgorny Differential Revision: https://reviews.llvm.org/D26576 llvm-svn: 289032	2016-12-08 09:22:24 +00:00
Andrey Churbanov	e0a2c3e99a	fixed type in Windows-specific code llvm-svn: 288368	2016-12-01 16:08:52 +00:00
Jonathan Peyton	a88e8358af	Fixed typo in kmp_process_deps trace output Patch by Victor Campos Differential Revision: https://reviews.llvm.org/D27172 llvm-svn: 288056	2016-11-28 20:10:32 +00:00
Andrey Churbanov	bcadbd6302	Cleanup: memory leaks on warnings printing fixed; some memory freeing cleaned; poor indents and one typo fixed. Patch by Victor Campos. Differential Revision: https://reviews.llvm.org/D26786 llvm-svn: 288054	2016-11-28 19:23:09 +00:00
Jonathan Peyton	96fe1aa380	Set task->td_dephash to NULL after free llvm-svn: 287552	2016-11-21 16:24:59 +00:00
Jonathan Peyton	7ca7ef0478	Fix for D25504 - segfault because of double free()-ing in shutdown code. Paul Osmialowski pointed out a double free bug in shutdown code. This patch moves the freeing of the implicit task to above the freeing of all fast memory to prevent the double-free issue. Differential Revision: https://reviews.llvm.org/D26860 llvm-svn: 287551	2016-11-21 16:18:57 +00:00
Jonathan Peyton	5375fe820c	Update stats-gathering code Have developer timers use partitioning scheme which also required that some redundant developer timers be removed in favor of the already existing normal timers. Move per thread stats initialization to just after global thread id assignment which is as early as possible. Also put all global stats initialization code in __kmp_stats_init() and all global stats destruction code in __kmp_stats_fini(). Differential Revision: https://reviews.llvm.org/D26361 llvm-svn: 286892	2016-11-14 21:13:44 +00:00
Jonathan Peyton	1cdd87adfd	Introduce dynamic affinity dispatch capabilities This set of changes enables the affinity interface (Either the preexisting native operating system or HWLOC) to be dynamically set at runtime initialization. The point of this change is that we were seeing performance degradations when using HWLOC. This allows the user to use the old affinity mechanisms which on large machines (>64 cores) makes a large difference in initialization time. These changes mostly move affinity code under a small class hierarchy: KMPAffinity class Mask {} KMPNativeAffinity : public KMPAffinity class Mask : public KMPAffinity::Mask KMPHwlocAffinity class Mask : public KMPAffinity::Mask Since all interface functions (for both affinity and the mask implementation) are virtual, the implementation can be chosen at runtime initialization. Differential Revision: https://reviews.llvm.org/D26356 llvm-svn: 286890	2016-11-14 21:08:35 +00:00
Andrey Churbanov	1fbb482928	Added check for malloc return. Patch by Victor Campos. Differential Revision: https://reviews.llvm.org/D26318 llvm-svn: 286441	2016-11-10 09:08:03 +00:00
Jonas Hahnfeld	50fed0475f	[OpenMP] Enable ThreadSanitizer to check OpenMP programs This patch allows ThreadSanitizer (Tsan) to verify OpenMP programs. It means that no false positive will be reported by Tsan when verifying an OpenMP program. This patch introduces annotations within the OpenMP runtime module to provide information about thread synchronization to the Tsan runtime. In order to enable the Tsan support when building the runtime, you must enable the TSAN_SUPPORT option by passing the following CMake option: -DLIBOMP_TSAN_SUPPORT=TRUE The annotations will be enabled in the main shared library (same mechanism of OMPT). Patch by Simone Atzeni and Joachim Protze! Differential Revision: https://reviews.llvm.org/D13072 llvm-svn: 286115	2016-11-07 15:58:36 +00:00
Andrey Churbanov	4d49312cad	fixed typo in comment llvm-svn: 285947	2016-11-03 17:48:46 +00:00
Andrey Churbanov	753fa0468c	Change task stealing to always get task from head of victim's deque. Differential Revision: https://reviews.llvm.org/D26187 llvm-svn: 285833	2016-11-02 16:45:25 +00:00
Andrey Churbanov	51107e0abc	Fixed problem introduced by part of https://reviews.llvm.org/D21196 . Check Task Scheduling Constraint (TSC) on stealing of untied task. This is needed because the untied task can produce tied children that can break TSC if the untied task is not a descendant of the current task. This can cause live lock on complex tasking tests (e.g. kastors/strassen-task-dep). Differential Revision: https://reviews.llvm.org/D26182 llvm-svn: 285703	2016-11-01 16:19:04 +00:00
Andrey Churbanov	dd313b0673	Add more conditions to check whether task waiting is necessary in kmp_omp_taskwait. Differential Revision: https://reviews.llvm.org/D26058 Patch by Victor Campos llvm-svn: 285678	2016-11-01 08:33:36 +00:00
Andrey Churbanov	df0d75edf6	Fixed a memory leak related to task dependencies. Differential Revision: http://reviews.llvm.org/D25504 Patch by Alex Duran. llvm-svn: 285283	2016-10-27 11:43:07 +00:00
Jonathan Peyton	3c4050d698	Fixing typos in __kmp_release_deps trace outputs Patch by Victor Campos Differential Revision: https://reviews.llvm.org/D25972 llvm-svn: 285244	2016-10-26 21:46:43 +00:00
Jonathan Peyton	762bc46224	Use getpagesize() instead of PAGE_SIZE macro when KMP_OS_LINUX is true Patch by Victor Campos Differential Revision: https://reviews.llvm.org/D26001 llvm-svn: 285243	2016-10-26 21:42:48 +00:00
Andrey Churbanov	2e68768d1e	Fixed memory leak mistakenly introduced by https://reviews.llvm.org/D23115 Differential Revision: http://reviews.llvm.org/D25510 llvm-svn: 284747	2016-10-20 17:14:17 +00:00
Samuel Antao	335151914a	[OpenMP] Fix issue with directives used in a macro. Summary: If directives are used in a macro, clang complains with: ``` src/projects/openmp/runtime/src/kmp_runtime.c:7486:2: error: embedding a directive within macro arguments has undefined behavior [-Werror,-Wembedded-directive] #if KMP_USE_MONITOR ``` This patch fixes two occurrences of the issue in `kmp_runtime.cpp`. Reviewers: tlwilmar, jlpeyton, AndreyChurbanov, Hahnfeld Subscribers: Hahnfeld, openmp-commits Differential Revision: https://reviews.llvm.org/D25823 llvm-svn: 284728	2016-10-20 13:20:17 +00:00
Jonathan Peyton	0ac7b75f7b	Fix OpenMP 4.0 library build Patch by Andrey Churbanov Differential Revision: https://reviews.llvm.org/D25505 llvm-svn: 284499	2016-10-18 17:39:06 +00:00
Michal Gorny	efc536ee9d	Fix a compile error on musl-libc due to strerror_r() prototype Function strerror_r() has different signatures in different implementations of libc: glibc's version returns a char*, while BSDs and musl return an int. libomp unconditionally assumes glibc on Linux and thus fails to compile against musl-libc. This patch addresses this issue. Differential Revision: https://reviews.llvm.org/D25071 llvm-svn: 284492	2016-10-18 16:38:44 +00:00
Jonathan Peyton	55466e9106	Mixed type atomic routines added for capture and update/capture reverse. New mixed type atomic routines added for regular capture operations as well as reverse update/capture operations. LHS - all integer and float types (no complex so far), RHS - float16. Patch by Olga Malysheva Differential Revision: https://reviews.llvm.org/D25275 llvm-svn: 284489	2016-10-18 16:20:55 +00:00
Jonathan Peyton	e1c7c13c3d	Code cleanup for the runtime without monitor thread This change removes/disables unnecessary code when monitor thread is not used. Patch by Hansang Bae Differential Revision: https://reviews.llvm.org/D25102 llvm-svn: 283577	2016-10-07 18:12:19 +00:00
Jonathan Peyton	a1234cf280	Enable omp_get_schedule() to return static steal type. As the code is now, calling omp_get_schedule() when OMP_SCHEDULE=static_steal will cause an assert. llvm-svn: 283576	2016-10-07 18:01:35 +00:00
Paul Osmialowski	7a9c29e4b8	[cmake] Fix for a bug https://llvm.org/bugs/show_bug.cgi?id=30489 "Cannot build with -DLIBOMP_FORTRAN_MODULES=True" Differential Revision: https://reviews.llvm.org/D24959 llvm-svn: 282965	2016-09-30 22:05:45 +00:00
Jonathan Peyton	66e212ce2b	Insert missing checks for KMP_AFFINITY_CAPABLE() in affinity API. If affinity is not capable, then these API functions will perform the stubs version. llvm-svn: 282947	2016-09-30 20:56:44 +00:00
Michal Gorny	3ccf825e22	[test] Support 'lit' executable name Support finding lit as plain 'lit', which is the name used by setup.py in LLVM's utils/lit. Differential Revision: https://reviews.llvm.org/D25072 llvm-svn: 282876	2016-09-30 16:56:16 +00:00
Jonathan Peyton	74f3ffce24	Fix incorrect OpenMP version in Fortran module. Add check for "45" version to use "201511" string for OpenMP 4.5, otherwise "200505" is used in Fortran module. Also, fix kmp_openmp_version variable (used for the debugger, e.g.) and kmp_version_omp_api that is used in KMP_VERSION=1 output. Patch by Olga Malysheva Differential Revision: https://reviews.llvm.org/D24761 llvm-svn: 282868	2016-09-30 15:50:14 +00:00
Jonathan Peyton	be31337e9d	Mixed type atomic routines for unsigned integers. New routines should be used for atomics like "<int>OP=<float>" when <int> is unsigned. Using functions __kmpc_atomic_fixed<bits>_<op>_fp produces incorrect results. Differential Revision: https://reviews.llvm.org/D24756 llvm-svn: 282509	2016-09-27 17:38:48 +00:00
Jonathan Peyton	b66d1aab25	Disable monitor thread creation by default. This change set disables creation of the monitor thread by default. The global counter maintained by the monitor thread was replaced by logic that uses system time directly, and cyclic yielding on Linux target was also removed since there was no clear benefit of using it. Turning on KMP_USE_MONITOR variable (=1) enables creation of monitor thread again if it is really necessary for some reasons. Differential Revision: https://reviews.llvm.org/D24739 llvm-svn: 282507	2016-09-27 17:11:17 +00:00
Michal Gorny	cd2bfb1e7c	Fix respecting LIBOMP_LLVM_LIT_EXECUTABLE as full path Fix lit search to correctly respect LIBOMP_LLVM_LIT_EXECUTABLE as full program path. The variable passed to find_program() is created by CMake as a cache variable, and therefore can be directly overridden by the user. Since this was the design of LIBOMP_LLVM_LIT_EXECUTABLE (as can be deduced from the error messages) and there is no other use of LIT_EXECUTABLE, remove the redundant variable and pass LIBOMP_LLVM_LIT_EXECUTABLE directly to find_program(). Furthermore, the previous code did not work since the HINTS argument specifies more search directories rather than the expected full path. Quoting the CMake documentation: > 3. Search the paths specified by the HINTS option. These should be > paths computed by system introspection, such as a hint provided by > the location of another item already found. Hard-coded guesses should > be specified with the PATHS option. Differential Revision: https://reviews.llvm.org/D24710 llvm-svn: 281887	2016-09-19 06:55:56 +00:00
Michal Gorny	23132ebb0e	[cmake] Make libgomp & libiomp5 alias install optional Introduce a new LIBOMP_INSTALL_ALIASES cache variable that can be used to disable creating libgomp and libiomp5 aliases on 'make install'. Those aliases are undesired e.g. on Gentoo systems where libomp is used purely by clang. Differential Revision: https://reviews.llvm.org/D24563 llvm-svn: 281512	2016-09-14 17:46:27 +00:00
Jonas Hahnfeld	848d690697	[OMPT] fix task frame information for gomp interface Previous differentials D23305-D23310 changed task frame information management only for the kmp interface, but not for the whole gomp interface. This broke some testcases when building with gcc. This patch fixes the broken task frame information for the gomp interface. Patch by Joachim Protze! Differential Revision: https://reviews.llvm.org/D24502 llvm-svn: 281468	2016-09-14 13:59:39 +00:00
Jonas Hahnfeld	dd9a05d5d8	[OMPT] save exit address to lwt if available In case, the current team is a serialized team (lwt), the frame information should be written to this data structure. Before, nested serialized teams would overwrite the same task information. Patch by Joachim Protze! Differential Revision: https://reviews.llvm.org/D23310 llvm-svn: 281467	2016-09-14 13:59:31 +00:00
Jonas Hahnfeld	28ea24bba7	[OMPT] fix __ompt_get_teaminfo to consult lwt entries of parent teams The comment already states that this function should work similarly to __ompt_get_taskinfo. The function only looked for lwt entries of the current team, but not when unrolling the parents. This fix aligns the implementation to __ompt_get_taskinfo. The new test case creates a single threaded team (->lwt) and then a nested active team. Before the innermost print_id(1) would deliver a different team than the outer print_id(0). Patch by Joachim Protze! Differential Revision: https://reviews.llvm.org/D23309 llvm-svn: 281466	2016-09-14 13:59:24 +00:00
Jonas Hahnfeld	8a27064e05	[OMPT] Reset task exit frame when execution is finished The exit address is set when execution of a task is started and should be reset as soon as the execution is finished. Especially for the asm implementation of __kmp_invoke_microtask, resetting in this call would be painful, so reset just after the invocation. The testcase shows the effect of this patch: Before, the implicit barriers at the end of an implicit task would see an exit address for the implicit task. This barrier is a task scheduling point. Thus, any explicit task scheduled there would see an exit, but no reenter address for the implicit task. Patch by Joachim Protze! Differential Revision: https://reviews.llvm.org/D23307 llvm-svn: 281465	2016-09-14 13:59:19 +00:00
Jonas Hahnfeld	fd0614d830	[OMPT] Align implementation of reenter frame address to latest (frozen) version of OMPT spec The latest OMPT spec changed the semantic of a task's reenter frame to be the application frame that will be entered when the runtime frame drops. Before it was the last frame in the runtime. This doesn't work for some gcc execution paths or even clang generated code for : Since there is no runtime frame between the executed task and the encountering task. The test case compares exit and reenter addresses against addresses captured in application code. Patch by Joachim Protze! Differential Revision: https://reviews.llvm.org/D23305 llvm-svn: 281464	2016-09-14 13:59:13 +00:00
Jonas Hahnfeld	464cdca9d3	[OMPT] extend ompt tests by checks for frame pointers OMPT tests can check for right frame information of tasks: * parent_task_frame was directly printed as a pointer, but actually points to a struct ompt_frame {void, void} * NULL is printed in the beginning of execution and loaded to FileChecker variable [[NULL]] * implicit tasks now also print their frame information * macro to print frame address from application * print task info for barrier begin Patch by Joachim Protze! Differential Revision: https://reviews.llvm.org/D23304 llvm-svn: 281463	2016-09-14 13:59:05 +00:00
Jonathan Peyton	7c465a5f41	Fix bitmask upper bounds check Rather than checking KMP_CPU_SETSIZE, which doesn't exist when using Hwloc, we use the get_max_proc() function which can vary based on the operating system. For example on Windows with multiple processor groups, it might be the case that the highest bit possible in the bitmask is not equal to the number of hardware threads on the machine but something higher than that. Differential Revision: https://reviews.llvm.org/D24206 llvm-svn: 281245	2016-09-12 19:02:53 +00:00
George Rokos	118de30b44	[OPENMP] ppc64le recognized as big-endian There is a bug in CMakeLists which causes powerpc64le systems to be recognized as big-endian. This patch fixes the issue. Differential Revision: https://reviews.llvm.org/D23626 llvm-svn: 281068	2016-09-09 18:04:23 +00:00
George Rokos	28f31b405e	[OPENMP] Implementation of omp_get_default_device and omp_set_default_device Implementation of missing OpenMP 4.0 API functions omp_get_default_device and omp_set_default_device. Also, added support for the environment variable OMP_DEFAULT_DEVICE. Differential Revision: https://reviews.llvm.org/D23587 llvm-svn: 281065	2016-09-09 17:55:26 +00:00
Jonathan Peyton	e6abe52905	Move function into cpp file under KMP_AFFINITY_SUPPORTED guard. When affinity isn't supported, __kmp_affinity_compact doesn't exist. The problem is that in kmp_affinity.h there is a function which uses it without the proper KMP_AFFINITY_SUPPORTED guard around it. The compiler was smart enough to ignore it and the function __kmp_affinity_cmp_Address_child_num which relies on it, but I think it is cleaner to have it under the proper guard. Since the function is only used in the kmp_affinity.cpp file and there aren't any plans to have it elsewhere, I have moved it there. llvm-svn: 280542	2016-09-02 20:54:58 +00:00
Jonathan Peyton	9e69696f5a	Decouple the kmp_affin_mask_t type from determining if affinity is capable the __kmp_affinity_determine_capable() functions are highly operating system specific. This change has the functions use the type they expect explicitly. llvm-svn: 280538	2016-09-02 20:35:47 +00:00
Jonathan Peyton	788c5d65e8	Replace a bad instance of __kmp_free() with KMP_CPU_FREE_ARRAY() macro. llvm-svn: 280530	2016-09-02 19:37:12 +00:00
Jonathan Peyton	5c32d5ef0d	Use 'critical' reduction method when 'atomic' is not available but requested. In case atomic reduction method is not available (the compiler can't generate it) the assertion failure occurred if KMP_FORCE_REDUCTION=atomic was specified. This change replaces the assertion with a warning and sets the reduction method to the default one - 'critical'. Patch by Olga Malysheva Differential Revision: https://reviews.llvm.org/D23990 llvm-svn: 280519	2016-09-02 18:29:45 +00:00
Jonathan Peyton	0af717970c	Appease older gcc compilers for the many-microtask-args.c test Older gcc compilers error out with the C99 syntax of: for (int i =...) so this change just moves the int i; declaration up above. llvm-svn: 280138	2016-08-30 19:28:58 +00:00
Andrey Churbanov	b35be69ff5	cleanup: fixed names of dummy arguments of Fortran interfaces declarations, no functional changes done llvm-svn: 278951	2016-08-17 18:18:21 +00:00
Andrey Churbanov	d6e1d7e521	Fixes for hierarchical barrier (possible hang if team size changed). Differential Revision: http://reviews.llvm.org/D23175 llvm-svn: 278332	2016-08-11 13:04:00 +00:00
Dimitry Andric	70ba8c506c	Fix linking of omp_foreign_thread_team_reuse test on FreeBSD Summary: On FreeBSD, linking the misc_bugs/omp_foreign_thread_team_reuse.c test case fails with: /usr/local/bin/ld: /tmp/omp_foreign_thread_team_reuse-c5e71b.o: undefined reference to symbol 'pthread_create@@FBSD_1.0' This is because the program is linked without `-lpthread`. Since the %libomp-compile-and-run macro does not allow that option to be added to the compile command line, split it up and add the required `-lpthread` between %libomp-compile and %libomp-run. Reviewers: jlpeyton, hfinkel, Hahnfeld Subscribers: Hahnfeld, emaste, openmp-commits Differential Revision: https://reviews.llvm.org/D23084 llvm-svn: 278036	2016-08-08 18:34:05 +00:00
Jonas Hahnfeld	ad0c42e3a9	kmp_gsupport: Fix library initialization with taskgroup Differential Revision: https://reviews.llvm.org/D23259 llvm-svn: 278003	2016-08-08 13:23:08 +00:00
Jonas Hahnfeld	ca32babfa7	Mark tests with task dependencies as unsupported with GCC llvm-svn: 277996	2016-08-08 11:52:49 +00:00
Jonas Hahnfeld	bedc371c9d	Do not block on explicit task depending on proxy task Consider the following code: int dep; #pragma omp target nowait depend(out: dep) { sleep(1); } #pragma omp task depend(in: dep) { printf("Task with dependency\n"); } printf("Doing some work...\n"); In its current state the runtime will block on the second task and not continue execution. Differential Revision: https://reviews.llvm.org/D23116 llvm-svn: 277992	2016-08-08 10:08:14 +00:00
Jonas Hahnfeld	69f8511f8f	__kmp_free_task: Fix for serial explicit tasks producing proxy tasks Consider the following code which may be executed by a serial team: int dep; #pragma omp target nowait depend(out: dep) { sleep(1); } #pragma omp task depend(in: dep) { #pragma omp target nowait { sleep(1); } } Here the explicit task may not be freed until the nested proxy task has finished. The current code hasn't considered this and called __kmp_free_task anyway which triggered an assert because of remaining incomplete children: KMP_DEBUG_ASSERT( TCR_4(taskdata->td_incomplete_child_tasks) == 0 ); Differential Revision: https://reviews.llvm.org/D23115 llvm-svn: 277991	2016-08-08 10:08:07 +00:00
Andrey Churbanov	5bf494e73d	Fixed x2APIC discovery for 256-processor architectures. Mask for value read from ebx register returned by CPUID expanded to 0xFFFF. Differential Revision: https://reviews.llvm.org/D23203 llvm-svn: 277825	2016-08-05 15:59:11 +00:00
Jonas Hahnfeld	d1f4b8f6e8	Add test case for nested creation of tasks For discussion in D23115 llvm-svn: 277730	2016-08-04 14:55:56 +00:00
Jonas Hahnfeld	20236611d4	kmp_taskdeps.cpp: Fix debugging output node->dn.task is only filled after the dependencies are already processed. This currently leads to unhelpful output from KA_TRACE or even a crash if one enables KMP_SUPPORT_GRAPH_OUTPUT. llvm-svn: 277717	2016-08-04 11:03:47 +00:00
Pirama Arumuga Nainar	0554d25eb3	Disable KMP_CANCEL_THREADS on Android Summary: Android does not have pthread_cancel. Disable KMP_CANCEL_THREADS if __ANDROID__ is defined. Subscribers: tberghammer, srhines, openmp-commits, danalbert Differential Revision: https://reviews.llvm.org/D23029 llvm-svn: 277618	2016-08-03 18:08:57 +00:00
Paul Osmialowski	ecbe2ea002	Make balanced affinity work on AArch64. This patch enables balanced affinity on machines that do not have hardware threads and have cores clustered into packages. In fact, the balancing algorithm could be generalized for any arrangement with at least two levels of hierarchy (depth > 1). Differential Revision: https://reviews.llvm.org/D22365 llvm-svn: 277212	2016-07-29 20:55:03 +00:00
Samuel Antao	71fef77dcb	Replace enum types in variadic functions by built-in types. Summary: When compiling the runtime library with clang we get warnings like: ``` error: passing an object that undergoes default argument promotion to 'va_start' has undefined behavior [-Werror,-Wvarargs] va_start( args, id ); ^ note: parameter of type 'kmp_i18n_id_t' (aka 'kmp_i18n_id') is declared here kmp_i18n_id_t id, ``` My understanding is that the va_start macro only gets the promoted type so it won't know what was the exact type of the argument, which can potentially not work for some targets given that the implementation of the calling convention could not be done properly. This patch fixes that by using a built-in type in the function signature. Reviewers: tlwilmar, jlpeyton, AndreyChurbanov Subscribers: arpith-jacob, carlo.bertolli, caomhin, openmp-commits Differential Revision: https://reviews.llvm.org/D22427 llvm-svn: 276428	2016-07-22 16:05:35 +00:00
Andrey Churbanov	429dbc2ad2	http://reviews.llvm.org/D22134 : Implementation of OpenMP 4.5 nonmonotonic schedule modifier llvm-svn: 275052	2016-07-11 10:44:57 +00:00
Jonathan Peyton	4d3c21307c	Improving EPCC performance when linking with hwloc When linking with libhwloc, the ORDERED EPCC test slows down on big machines (> 48 cores). Performance analysis showed that a cache thrash was occurring and this padding helps alleviate the problem. Also, inside the main spin-wait loop in kmp_wait_release.h, we can eliminate the references to the global shared variables by instead creating a local variable, oversubscribed and instead checking that. Differential Revision: http://reviews.llvm.org/D22093 llvm-svn: 274894	2016-07-08 17:43:21 +00:00
Andrey Churbanov	50ecf5de01	D22138: Added more Intel compiler versions as allowed build compilers llvm-svn: 274854	2016-07-08 15:23:35 +00:00
Andrey Churbanov	2eca95c9a9	D22137: Memory leak fixed by adding missed cleanup of single level array of hot teams info llvm-svn: 274851	2016-07-08 14:53:24 +00:00
Andrey Churbanov	cb28d6e3a0	D22136: Memory leaks fixed by adding missed __kmp_free() calls llvm-svn: 274850	2016-07-08 14:40:20 +00:00
Andrey Churbanov	42211eb125	D22135: formatting change llvm-svn: 274849	2016-07-08 14:35:41 +00:00
Jonathan Peyton	741b70926f	Fix the nowait tests for omp for and omp single These tests are now modeled after the sections nowait test where threads wait to be released in the first construct (either for or single) and the last thread skips the last for/single construct and releases those threads. If the test fails, then it hangs because an unnecessary barrier is executed in between the constructs. llvm-svn: 274641	2016-07-06 17:26:12 +00:00
Jonas Hahnfeld	170fcc8772	__kmp_partition_places: Update assertion for new parameter update_master_only If update_master_only is set the place list is not completely traversed and therefore this assertion failed. Make it only trigger if update_master_only is false. (was introduced by D20539) Differential Revision: http://reviews.llvm.org/D21925 llvm-svn: 274482	2016-07-04 05:58:10 +00:00
Jonathan Peyton	6b560f0dd9	Fix checks on schedule struct This change fixes an error in comparing the existing schedule on the team to the new schedule, in the chunk field. Also added additional checks and used KMP_CHECK_UPDATE where appropriate. Patch by Terry Wilmarth. Differential Revision: http://reviews.llvm.org/D21897 llvm-svn: 274371	2016-07-01 17:54:32 +00:00
Jonathan Peyton	c1666960f9	Improve performance of #pragma omp single EPCC Performance of single is considerably worse than plain barrier. Adding a read-only check to the code before the atomic compare-and-store helps considerably. Patch by Terry Wilmarth. Differential Revision: http://reviews.llvm.org/D21893 llvm-svn: 274369	2016-07-01 17:37:49 +00:00
Jonathan Peyton	fdcca8cd55	Fix omp_sections_nowait.c test to address Bugzilla Bug 28336 This rewrite of the omp_sections_nowait.c test file causes it to hang if the nowait is not respected. If the nowait isn't respected, the lone thread which can escape the first sections construct will just sleep at a barrier which shouldn't exist. All reliance on timers is taken out. For good measure, the test makes sure that all eight sections are executed as well. The test should take no longer than a few seconds on any modern machine. Differential Revision: http://reviews.llvm.org/D21842 llvm-svn: 274151	2016-06-29 19:46:52 +00:00
Jonathan Peyton	ac7ba406ed	Fix bugs in TAS and futex lock * Incorrect lock value written in __kmp_test_futex_lock * Incorrect lock value check in tas/futex lock with USE_LOCK_PROFILE on Patch by Hansang Bae llvm-svn: 274053	2016-06-28 19:37:24 +00:00
Jonathan Peyton	cceebeef17	Revert r273898's UNICODE quick fix in favor of CMake's remove_definitions() UNICODE and _UNICODE definitions were added in the LLVM CMake build system. While on Unices, the UNICODE/_UNICODE macros don't cause problems, on Windows only ittnotify_static.c should be compiled using -DUNICODE. We are still looking at a proper fix, but this change sets the build back to exactly what it was doing before. Also, a comment and TODO were added in the src/CMakeLists.txt file to help explain. llvm-svn: 274052	2016-06-28 19:25:13 +00:00
Hans Wennborg	8065c51875	Fix the Windows build after r273599 That patch made all LLVM projects build with -DUNICODE. However, this doesn't work for the OpenMP runtime. But just overriding the flag with -UUNICODE breaks compiling ittnotify_static.c, which for some reason needs to be compiled with -DUNICODE. Note that compiling ittnotify.h with -DUNICODE does not work though. This seems like a mess. This commit fixes it for now, but it would be great if someone who works on the OpenMP runtime could fix it properly. llvm-svn: 273898	2016-06-27 18:03:45 +00:00
Jonathan Peyton	e119e8e5b5	Remove redundant %libomp-compile step from test/lock/omp_lock.c llvm-svn: 273576	2016-06-23 16:18:59 +00:00
Jonathan Peyton	eeec4c8364	Fix bug in futex fast path inside kmp_csupport.c llvm-svn: 273439	2016-06-22 16:36:07 +00:00
Jonathan Peyton	9d2412c9e5	Apply the KMP_USE_FUTEX feature macro everywhere llvm-svn: 273438	2016-06-22 16:35:12 +00:00
Jonathan Peyton	d4f397741b	Add debug trace messages for taskloop llvm-svn: 273299	2016-06-21 19:18:13 +00:00
Jonathan Peyton	c76f9f0df8	Bug fix for hang when tasks used in nested parallel Bug fix for hang when omp task and nested parallelism used together. Still some problem remains with task state saving/restoring, but user's case works fine now. All tasking unit tests passed as well. Patch by Andrey Churbanov Differential Revision: http://reviews.llvm.org/D21558 llvm-svn: 273297	2016-06-21 19:12:07 +00:00
Jonathan Peyton	ff5ca8b4cf	Performance improvement: accessing thread struct as opposed to team struct Replaced readings of nproc from team structure with ones from thread structure to improve performance. Patch by Andrey Churbanov. Differential Revision: http://reviews.llvm.org/D21559 llvm-svn: 273293	2016-06-21 18:30:15 +00:00
Jonathan Peyton	8c61c597be	Addition of debugger comments and whitespace The removal of legacy code to support long-deprecated debugger support library resulted in some whitespace changes. Comments from that legacy code were made public as they may be useful for other debuggers. Patch by Olga Malysheva. Differential Revision: http://reviews.llvm.org/D21391 llvm-svn: 273282	2016-06-21 15:59:34 +00:00
Jonathan Peyton	fd7cc42fed	Improvements to process affinity mask setting A couple improvements: 1) Add ability to limit fullMask size when KMP_HW_SUBSET limits resources. 2) Make KMP_HW_SUBSET work for affinity_none, and only limit fullMask in this case. Patch by Andrey Churbanov. Differential Revision: http://reviews.llvm.org/D21528 llvm-svn: 273278	2016-06-21 15:54:38 +00:00
Jonathan Peyton	5a276c45c2	Bug fix for segfault in stubs library There was a segfault in the stubs library in posix_memalign because of a bad parameter. The fix is to send address of the pointer as a parameter. Also added check of result of posix_memalign. Patch by Andrey Churbanov. Differential Revision: http://reviews.llvm.org/D21529 llvm-svn: 273276	2016-06-21 15:39:08 +00:00
Jonathan Peyton	98b76f6f87	[STATS] Adding process id to output filename This change appends the process id to the KMP_STATS_FILE (if specified) which enables MPI processes to output their stats to separate files. Differential Revision: http://reviews.llvm.org/D21386 llvm-svn: 273273	2016-06-21 15:20:33 +00:00
Jonathan Peyton	ea26f3f82a	Fix typos in Fortran headers Fix typos in Fortran headers to match spec. Patch by Andrey Churbanov. Differential Revision: http://reviews.llvm.org/D21531 llvm-svn: 273272	2016-06-21 15:16:51 +00:00
Jonathan Peyton	bf35771bcc	Change hwloc discovery algorithm to print topology only for accessible resources Change hwloc discovery algorithm to print topology for only accessible resources, and report uniformity correspondingly, similar to what other topology discovery algorithms do. Fixes minor inconsistency in total topology reported and resources used for threads binding in case hwloc used. Patch by Andrey Churbanov. Differential Revision: http://reviews.llvm.org/D21389 llvm-svn: 272952	2016-06-16 20:31:19 +00:00
Jonathan Peyton	0f3c2b921d	Teach OpenMP Library to use Hwloc on Windows This patch allows a user to enable Hwloc on windows. There are three main changes in here: 1.kmp.h - Move definitions/declarations out of KMP_OS_WINDOWS guard (our windows implementation of affinity) because they need to be defined when KMP_USE_HWLOC is on as well. 2.teach __kmp_set_system_affinity, __kmp_get_system_affinity, __kmp_get_proc_group, and __kmp_affinity_bind_thread how to use hwloc. 3.teach CMake how to include hwloc when building Windows Another minor change in here is to make sure that anything under KMP_USE_HWLOC is also guarded by KMP_AFFINITY_SUPPORTED as well. This is to prevent Mac builds from requiring anything from Hwloc. Differential Revision: http://reviews.llvm.org/D21441 llvm-svn: 272951	2016-06-16 20:23:11 +00:00
Jonathan Peyton	c505ab6733	Fix for crash in task dependencies With single thread using __kmpc_omp_wait_deps segfaults in OpenMP runtime. Offloading with depend also encounters this problem when we generate kmpc_omp_wait_deps instead of kmpc_omp_task_with_deps. Patch by Alex Duran Differential Revision: http://reviews.llvm.org/D21384 llvm-svn: 272949	2016-06-16 20:18:31 +00:00
Jonathan Peyton	72a8498e08	Fixed missing memory cleanup in __kmp_affinity_create_hwloc_map() Cleanup: fixed missing memory cleanup in couple of corner cases. Fixes possible memory leak in some corner cases Patch by Andrey Churbanov Differential Revision: http://reviews.llvm.org/D21355 llvm-svn: 272946	2016-06-16 20:14:54 +00:00
Jonathan Peyton	4ba3b0cda9	Reduce perf impact of redundant ittnotify calls Improved performance of ittnotify calls by request from ittnotify owner: calls to __itt_string_handle_create made unique (it was called multiple times). Patch by Andrey Churbanov Differential Revision: http://reviews.llvm.org/D21353 llvm-svn: 272945	2016-06-16 20:11:51 +00:00
Jonathan Peyton	b9d28fbeb3	Deprecate KMP_PLACE_THREADS and rename as KMP_HW_SUBSET Deprecate KMP_PLACE_THREADS and rename it to KMP_HW_SUBSET due to confusion about its purpose and function among users. KMP_HW_SUBSET is an environment variable which allows users to easily pick a subset of the hardware topology to use. e.g., KMP_HW_SUBSET=30c,2t means use 30 cores, 2 threads per core. Patch by Andrey Churbanov Differential Revision: http://reviews.llvm.org/D21340 llvm-svn: 272937	2016-06-16 18:53:48 +00:00
Jonathan Peyton	7cf08d4299	Bug fix: crash if teams executed on host Added argv array check/allocation for parallel directly nested inside the teams construct, as new coming Fortran codegen passes parameters directly into kmpc_fork_call missing same parameters in kmpc_fork_teams (earlier codegen passed to parallel the subset of parameter passed to teams, and thus no check/allocation needed). Patch by Andrey Churbanov Differential Revision: http://reviews.llvm.org/D21336 llvm-svn: 272935	2016-06-16 18:47:38 +00:00
Jonathan Peyton	614bb6618e	Fix large overhead with itt notifications on region/barrier name composing Currently, there is a big overhead in reporting of loop metadata through ittnotify. The pair of functions: __kmp_str_loc_init/__kmp_str_loc_free are replaced with strchr/atoi calls. Thus, a lot of time consuming actions are skipped - many memory allocations/deallocations, heavy string duplication, etc. The loop metadata only needs line and column info from the source string, so no allocations and string splitting actually needed. Patch by Andrey Churbanov Differential Revision: http://reviews.llvm.org/D21309 llvm-svn: 272698	2016-06-14 19:27:22 +00:00
Jonathan Peyton	e85ba3f58f	Remove unused wait/release code. Cleanup - unused code removal. TODO: consider to remove (replace with flag class methods) also kmp_wait_64 and kmp_release_64 routines. Patch by Andrey Churbanov Differential Revision: http://reviews.llvm.org/D21332 llvm-svn: 272697	2016-06-14 19:15:40 +00:00
Jonathan Peyton	957a151fd1	Whitespace cleanup of dllexports Differential Revision: http://reviews.llvm.org/D21331 llvm-svn: 272691	2016-06-14 18:47:47 +00:00
Jonathan Peyton	df6818bea4	Renaming change: 41 -> 45 and 4.1 -> 4.5 OpenMP 4.1 is now OpenMP 4.5. Any mention of 41 or 4.1 is replaced with 45 or 4.5. Also, if the CMake option LIBOMP_OMP_VERSION is 41, CMake warns that 41 is deprecated and to use 45 instead. llvm-svn: 272687	2016-06-14 17:57:47 +00:00
Jonathan Peyton	e1890e12f0	Bug fix for Bugzilla bug 26602: Remove function bodies with KMP_ASSERT(0) Fix for bugzilla https://llvm.org/bugs/show_bug.cgi?id=26602. Removed functions body consisted of the only KMP_ASSERT(0) statement. Thus possible runtime crash converted to compile-time error, which looks preferable (faster possible error detection). TODO: consider C++11 static assert as an alternative, that could make the diagnostics better. Patch by Andrey Churbanov Differential Revision: http://reviews.llvm.org/D21304 llvm-svn: 272590	2016-06-13 21:33:30 +00:00
Jonathan Peyton	c5304aa3c4	Affinity mask processing improvements Remove static specifier from var fullMask and remove kmp_get_fullMask() routine. When iterating through procs in a mask, always check if proc is in fullMask (this check was missing in a few places). Patch by Brian Bliss. Differential Revision: http://reviews.llvm.org/D21300 llvm-svn: 272589	2016-06-13 21:28:03 +00:00
Jonathan Peyton	8cb45c838f	Exclude untied tasks from task stealing constraint If either current_task or new_task is untied then skip task scheduling constraint checks, because untied tasks are not affected by the task scheduling constraints. Differential Revision: http://reviews.llvm.org/D21196 llvm-svn: 272570	2016-06-13 17:51:59 +00:00
Jonathan Peyton	93495de265	Fix crash when libomp loaded/unloaded multiple times The problem scenario is the following: A dynamic library, libfoo.so, depends on libomp.so (it creates parallel region and calls some omp functions). An application has a loop where it dynamically loads libfoo.so, calls the function from it, unloads libfoo.so. After several loop iterations application crashes with the message about lack of resources OMP: Error #34: System unable to allocate necessary resources for OMP thread: The problem is that pthread_kill() was not followed by pthread_join() in case of terminated thread. This patch fixes this problem for both worker and monitor threads. Differential Revision: http://reviews.llvm.org/D21200 llvm-svn: 272567	2016-06-13 17:36:40 +00:00
Jonathan Peyton	202a24dd9b	Hwloc refactoring patch These changes remove the hwloc_topology_ignore_type function which doesn't exist in the hwloc 2.0 API. In the existing code, the topology extracted from hwloc has the cache levels stripped out and then assumes the final stripped topology follows the typical three-level topology: packages -> cores -> HW threads. But the code is doing unclean manipulations to determine at what level those resources are located and also assumes too much about what hwloc is detecting (there could be intermediate levels in between socket and core for instance). This new way of extracting the topology doesn't strip out any hardware objects that hwloc detects. It does not assume the three level topology, and instead searches for the relevant three levels within the topology for each bit of information using hwloc interface functions. i.e., the three level topology subset that our affinity code is interested in is extracted from the hwloc topology tree directly. For example, the new __kmp_hwloc_get_nobjs_under_obj function gives the user the number of cores under a socket reliably without worrying if there are unexpected objects between the socket object and core object in the hwloc topology structure. Also, now that all topology information is kept, there are also possibilities of using the caches/numa nodes to determine more sophisticated affinity settings in the future. There is also some cleanup code added for the destruction of the __kmp_hwloc_topology object. Differential Revision: http://reviews.llvm.org/D21195 llvm-svn: 272565	2016-06-13 17:30:08 +00:00
Jonathan Peyton	34c72c4773	Fix bitmask complement operation The bitmask complement operation doesn't consider the max proc id which means something like !{0} will be translated to {1,2,3,4,...,600,601,...,1023} on a Linux system even though there aren't 600 processors on said system. This change has the complement bitmask and-ed with the fullmask so that it will only contain valid processors. Differential Revision: http://reviews.llvm.org/D21245 llvm-svn: 272561	2016-06-13 17:01:26 +00:00
Jonathan Peyton	5a299da55d	[STATS] Add stats gathering for taskloop construct llvm-svn: 272560	2016-06-13 16:56:41 +00:00
Jonathan Peyton	b6f0f521f5	Fix spelling in comment llvm-svn: 272291	2016-06-09 18:51:17 +00:00
Jonathan Peyton	61fdddfd64	Revert accidental commit to lit.cfg llvm-svn: 272287	2016-06-09 18:29:36 +00:00
Jonathan Peyton	c4c722ac0d	Refactor __kmp_execute_tasks_template function Refactored __kmp_execute_tasks_template to shorten and remove code redundancy. The original code for __kmp_execute_tasks_template was very redundant with large sections of repeated code that needed to be kept consistent, and goto statements that made the control flow difficult to discern. This refactoring removes all gotos and redundancy. Patch by Terry Wilmarth Differential Revision: http://reviews.llvm.org/D20879 llvm-svn: 272286	2016-06-09 18:27:03 +00:00
Hans Wennborg	5b89fbc822	kmp_lock.h: Fix VS2013 build after r271324 MSVC doesn't allow std::atomic<>s in a union since they don't have trivial copy constructor. Replacing them with e.g. std::atomic_int works, but that breaks the GCC build on Linux, because then calls to e.g. std::atomic_load_explicit fail, as they expect a real std::atomic<> pointer. Fixing this with an #ifdef to unbreak the build for now. llvm-svn: 272271	2016-06-09 15:54:43 +00:00
Paul Osmialowski	9cc353e2b3	Fine tuning of TC* macros - small followup As I replaced no-op TCR_4 with actual code, compiler complained while building debug build. This patch moves 'cast to int' to the correct place. Extension to Differential Revision: http://reviews.llvm.org/D19880 llvm-svn: 271377	2016-06-01 09:59:26 +00:00
Paul Osmialowski	f7cc6affdb	Use C++11 atomics for ticket locks implementation This patch replaces use of compiler builtin atomics with C++11 atomics for ticket locks implementation. Ticket locks are used in critical places of the runtime, e.g. in the tasking mechanism. The main reason this change was introduced is the problem with work stealing function on ARM architecture which suffered from nasty race condition. It turned out that the root cause of the problem lies in the way ticket locks are implemented. Changing compiler builtins into C++11 atomics solves the problem. Two assertions were added into kmp_tasking.c which are useful for detecting early symptoms of something wrong going on with work stealing, which were among the possible outcomes of the race condition. Differential Revision: http://reviews.llvm.org/D19878 llvm-svn: 271324	2016-05-31 20:20:32 +00:00
Jonathan Peyton	ef7347994e	Addition of OpenMP 4.5 feature: schedule(simd:static) This patch implements the new kmp_sch_static_balanced_chunked schedule kind that the compiler will generate when it encounters schedule(simd: static). It just adds the new constant and the new switch case __kmp_for_static_init. Patch by Alex Duran. Differential Revision: http://reviews.llvm.org/D20699 llvm-svn: 271320	2016-05-31 19:12:18 +00:00
Jonathan Peyton	f4f969569d	Avoid deadlock with COI When an asynchronous offload task is completed, COI calls the runtime to queue a "destructor task". When the task deques are full, a dead-lock situation arises where the OpenMP threads are inside but cannot progress because the COI thread is stuck inside the runtime trying to find a slot in a deque. This patch implements the solution where the task deques doubled in size when a task is being queued from a COI thread. Differential Revision: http://reviews.llvm.org/D20733 llvm-svn: 271319	2016-05-31 19:07:00 +00:00
Jonathan Peyton	067325f935	Offer API for setting number of loop dispatch buffers The problem is the lack of dispatch buffers when thousands of loops with nowait, about 10 iterations each, are executed by hundreds of threads. We only have built-in 7 dispatch buffers, but there is a need in dozens or hundreds of buffers. The problem can be fixed by setting KMP_MAX_DISP_BUF to bigger value. In order to give users same possibility I changed build-time control into run-time one, adding API just in case. This change adds an environment variable KMP_DISP_NUM_BUFFERS and a new API function kmp_set_disp_num_buffers(int num_buffers). The KMP_DISP_NUM_BUFFERS envirable works only before serial initialization, because during the serial initialization we already allocate buffers for the hot team, so it is too late to change the number of buffers later (or we need to reallocate buffers for all teams which sounds too complicated). The kmp_set_defaults() routine does not work for this envirable, because it calls serial initialization before reading the parameter string. So a new routine, kmp_set_disp_num_buffers(), is created so that it can set our internal global variable before the library initialization. If both the envirable and API used the envirable wins. Differential Revision: http://reviews.llvm.org/D20697 llvm-svn: 271318	2016-05-31 19:01:15 +00:00
Hal Finkel	49bee007d0	Fix storing the frame pointer for OMP-T during ppc64 microtask dispatch Thanks to John Mellor-Crummey for reporting the omission. llvm-svn: 271035	2016-05-27 19:04:05 +00:00
Jonathan Peyton	50eae7f8b2	Add missing OpenMP 4.5 device entries to stubs library. llvm-svn: 271006	2016-05-27 15:51:14 +00:00
Jonathan Peyton	7ba9baef6d	Fix for OMP_PROC_BIND=spread strategy The OMP_PROC_BIND=spread strategy fails to assign the master thread the correct place partition after the first parallel region. Other threads in the hot team will remember their place_partition, but the master's place partition is restored to what it was before entering the parallel region. So when the hot team is used for subsequent parallel regions, the master has lost this info. This fix calls __kmp_partition_places to update only the master thread's place partition in the spread case when there are no other changes to the hot team. Patch by Terry Wilmarth Differential Revision: http://reviews.llvm.org/D20539 llvm-svn: 270890	2016-05-26 19:09:46 +00:00
Jonathan Peyton	7abf9d5927	Make LIBOMP_USE_ITT_NOTIFY a setting that can be enabled or disabled On Blue Gene/Q, having LIBOMP_USE_ITT_NOTIFY support compiled into a statically-linked binary causes a failure at runtime because dlopen fails. This patch changes LIBOMP_USE_ITT_NOTIFY to a cacheable configuration setting that can be disabled. Patch by John Mellor-Crummey Differential Revision: http://reviews.llvm.org/D20517 llvm-svn: 270884	2016-05-26 18:19:10 +00:00
Hal Finkel	0a665a83da	Add a test case for microtask dispatch with many arguments This is a cleaned-up version of the test case posted in the D19879 review. llvm-svn: 270867	2016-05-26 16:34:05 +00:00
Hal Finkel	91e19a3de4	Add an assembly __kmp_invoke_microtask for ppc64[le] Clang no longer restricts itself to generating microtasks with a small number of arguments, and so an assembly implementation is required to prevent hitting the parameter limit present in the C implementation. This adds an implementation for ppc64[le]. llvm-svn: 270821	2016-05-26 04:48:14 +00:00
Andrey Churbanov	2fd1654278	D20525: Use more general function for getting gtid which may be faster than specific one. llvm-svn: 270694	2016-05-25 12:53:17 +00:00
Jonathan Peyton	b044e4fa31	Fork performance improvements Most of this is modifications to check for differences before updating data fields in team struct. There is also some rearrangement of the team struct. Patch by Diego Caballero Differential Revision: http://reviews.llvm.org/D20487 llvm-svn: 270468	2016-05-23 18:01:19 +00:00
Jonathan Peyton	1ab887d403	Allow unit testing on Windows These changes allow testing on Windows using clang.exe. There are two main changes: 1. Only link to -lm when it actually exists on the system 2. Create basic versions of pthread_create() and pthread_join() for windows. They are not POSIX compliant by any stretch but will allow any existing and future tests to use pthread_create() and pthread_join() for testing interactions of libomp with os threads. Differential Revision: http://reviews.llvm.org/D20391 llvm-svn: 270464	2016-05-23 17:50:32 +00:00
Jonathan Peyton	b2b6d4e2e1	Changed parameter names in Fortran modules to correspond with OpenMP 4.5 specification llvm-svn: 270447	2016-05-23 16:24:39 +00:00
Jonathan Peyton	611184919f	Remove trailing whitespace in src/ directory This patch doesn't affect D19878's context. So D19878 still cleanly applies. llvm-svn: 270252	2016-05-20 19:03:38 +00:00
Jonathan Peyton	aa7d2d781b	Remove unnecessary unistd.h header from tests. llvm-svn: 269987	2016-05-18 21:36:34 +00:00
Jonathan Peyton	096ccdd389	Remove trailing whitespace in files in doc/ directory llvm-svn: 269842	2016-05-17 21:12:48 +00:00
Jonathan Peyton	3731076997	Remove trailing whitespace from tests llvm-svn: 269841	2016-05-17 21:08:52 +00:00
Jonathan Peyton	0c3a85a327	Remove trailing whitespace in files in tools/ directory llvm-svn: 269837	2016-05-17 20:54:10 +00:00
Jonathan Peyton	975dabc96e	Remove trailing whitespace in CMake files llvm-svn: 269836	2016-05-17 20:51:24 +00:00
Jonathan Peyton	924a6627ea	Remove trailing whitespace in READMEs, CREDITS.txt and index.html llvm-svn: 269835	2016-05-17 20:48:42 +00:00
Jonathan Peyton	18b61707e8	Update copyright year in LICENSE.txt llvm-svn: 269826	2016-05-17 20:11:26 +00:00
Jonathan Peyton	0e8f053023	[OpenMP Testing] Have lit.py be a valid lit executable Users can use either llvm-lit (generated during llvm build) or lit.py which exists in llvm/utils/lit. llvm-svn: 269774	2016-05-17 15:12:11 +00:00
Paul Osmialowski	fb043fdfff	Clean all the mess around KMP_USE_FUTEX and kmp_lock.h KMP_USE_FUTEX preprocessor definition defined in kmp_lock.h is used inconsistently throughout LLVM libomp code. * some .c files that use this define do not include kmp_lock.h file, in effect guarded parts of code are never compiled * some places in code use architecture-dependent preprocessor logic expressions which effectively disable use of Futex for AArch64 architecture, all these places should use '#if KMP_USE_FUTEX' instead to avoid any further confusion * some places use KMP_HAS_FUTEX which is nowhere defined, KMP_USE_FUTEX should be used instead Differential Revision: http://reviews.llvm.org/D19629 llvm-svn: 269642	2016-05-16 09:44:11 +00:00
Paul Osmialowski	97ae10c67c	NFC fix indent (relates to my previous commit) llvm-svn: 269443	2016-05-13 17:45:49 +00:00
Paul Osmialowski	7e5e8684fb	Solve 'Too many args to microtask' problem This patch solves 'Too many args to microtask' problem which occurs while executing lulesh2.0.3 benchmark on AArch64. To solve this I had to write AArch64 assembly version of __kmp_invoke_microtask() function, similar to x86 and x86_64 implementations. Differential Revision: http://reviews.llvm.org/D19879 llvm-svn: 269399	2016-05-13 08:26:42 +00:00
Jonathan Peyton	f83ae31caf	Adding new kmp_aligned_malloc() entry point This change adds a new entry point, kmp_aligned_malloc(size_t size, size_t alignment), an entry point corresponding to kmp_malloc() but with the capability to return aligned memory as well. Other allocator routines have been adjusted so that kmp_free() can be used for freeing memory blocks allocated by any kmp_*alloc() routine, including the new kmp_aligned_malloc() routine. Differential Revision: http://reviews.llvm.org/D19814 llvm-svn: 269365	2016-05-12 22:00:37 +00:00
Jonathan Peyton	2b749b33cc	Fix team reuse with foreign threads After hot teams were enabled by default, the library started using levels kept in the team structure. The levels are broken in case foreign thread exits and puts its team into the pool which is then re-used by another foreign thread. The broken behavior observed is when printing the levels for each new team, one gets 1, 2, 1, 2, 1, 2, etc. This makes the library believe that every other team is nested which is incorrect. What is wanted is for the levels to be 1, 1, 1, etc. Differential Revision: http://reviews.llvm.org/D19980 llvm-svn: 269363	2016-05-12 21:54:30 +00:00
Paul Osmialowski	562a3c2b66	New hwloc API compatibility Differential Revision: http://reviews.llvm.org/D19628 llvm-svn: 269284	2016-05-12 11:46:40 +00:00
Hal Finkel	55acbf8877	Restore NULL flag check in __kmp_null_resume_wrapper This reverts a presumably-unintentional change in: r268640 - [STATS] Use partitioned timer scheme and fixes segfaults in an x86_64 debug build of the runtime library. llvm-svn: 269259	2016-05-12 00:54:08 +00:00
Paul Osmialowski	52bef53f86	Fine tuning of TC* macros This patch introduces following: * TCI_* and TCD_* macros for incrementation and decrementation * Fix for invalid use of TCR_8 in one expression Differential Revision: http://reviews.llvm.org/D19880 llvm-svn: 268826	2016-05-07 00:00:00 +00:00

593 Commits