llvm-project

Commit Graph

Author	SHA1	Message	Date
Jonathan Peyton	b044e4fa31	Fork performance improvements Most of this is modifications to check for differences before updating data fields in team struct. There is also some rearrangement of the team struct. Patch by Diego Caballero Differential Revision: http://reviews.llvm.org/D20487 llvm-svn: 270468	2016-05-23 18:01:19 +00:00
Jonathan Peyton	1ab887d403	Allow unit testing on Windows These changes allow testing on Windows using clang.exe. There are two main changes: 1. Only link to -lm when it actually exists on the system 2. Create basic versions of pthread_create() and pthread_join() for windows. They are not POSIX compliant by any stretch but will allow any existing and future tests to use pthread_create() and pthread_join() for testing interactions of libomp with os threads. Differential Revision: http://reviews.llvm.org/D20391 llvm-svn: 270464	2016-05-23 17:50:32 +00:00
Jonathan Peyton	b2b6d4e2e1	Changed parameter names in Fortran modules to correspond with OpenMP 4.5 specification llvm-svn: 270447	2016-05-23 16:24:39 +00:00
Jonathan Peyton	611184919f	Remove trailing whitespace in src/ directory This patch doesn't affect D19878's context. So D19878 still cleanly applies. llvm-svn: 270252	2016-05-20 19:03:38 +00:00
Jonathan Peyton	aa7d2d781b	Remove unnecessary unistd.h header from tests. llvm-svn: 269987	2016-05-18 21:36:34 +00:00
Jonathan Peyton	096ccdd389	Remove trailing whitespace in files in doc/ directory llvm-svn: 269842	2016-05-17 21:12:48 +00:00
Jonathan Peyton	3731076997	Remove trailing whitespace from tests llvm-svn: 269841	2016-05-17 21:08:52 +00:00
Jonathan Peyton	0c3a85a327	Remove trailing whitespace in files in tools/ directory llvm-svn: 269837	2016-05-17 20:54:10 +00:00
Jonathan Peyton	975dabc96e	Remove trailing whitespace in CMake files llvm-svn: 269836	2016-05-17 20:51:24 +00:00
Jonathan Peyton	924a6627ea	Remove trailing whitespace in READMEs, CREDITS.txt and index.html llvm-svn: 269835	2016-05-17 20:48:42 +00:00
Jonathan Peyton	18b61707e8	Update copyright year in LICENSE.txt llvm-svn: 269826	2016-05-17 20:11:26 +00:00
Jonathan Peyton	0e8f053023	[OpenMP Testing] Have lit.py be a valid lit executable Users can use either llvm-lit (generated during llvm build) or lit.py which exists in llvm/utils/lit. llvm-svn: 269774	2016-05-17 15:12:11 +00:00
Paul Osmialowski	fb043fdfff	Clean all the mess around KMP_USE_FUTEX and kmp_lock.h KMP_USE_FUTEX preprocessor definition defined in kmp_lock.h is used inconsistently throughout LLVM libomp code. * some .c files that use this define do not include the kmp_lock.h file, so the guarded parts of the code are never compiled * some places in the code use architecture-dependent preprocessor logic expressions which effectively disable use of Futex for the AArch64 architecture, all these places should use '#if KMP_USE_FUTEX' instead to avoid any further confusion * some places use KMP_HAS_FUTEX which is not defined anywhere, KMP_USE_FUTEX should be used instead Differential Revision: http://reviews.llvm.org/D19629 llvm-svn: 269642	2016-05-16 09:44:11 +00:00
Paul Osmialowski	97ae10c67c	NFC fix indent (relates to my previous commit) llvm-svn: 269443	2016-05-13 17:45:49 +00:00
Paul Osmialowski	7e5e8684fb	Solve 'Too many args to microtask' problem This patch solves the 'Too many args to microtask' problem which occurs while executing the lulesh2.0.3 benchmark on AArch64. To solve this I had to write an AArch64 assembly version of the __kmp_invoke_microtask() function, similar to the x86 and x86_64 implementations. Differential Revision: http://reviews.llvm.org/D19879 llvm-svn: 269399	2016-05-13 08:26:42 +00:00
Jonathan Peyton	f83ae31caf	Adding new kmp_aligned_malloc() entry point This change adds a new entry point, kmp_aligned_malloc(size_t size, size_t alignment), an entry point corresponding to kmp_malloc() but with the capability to return aligned memory as well. Other allocator routines have been adjusted so that kmp_free() can be used for freeing memory blocks allocated by any kmp_*alloc() routine, including the new kmp_aligned_malloc() routine. Differential Revision: http://reviews.llvm.org/D19814 llvm-svn: 269365	2016-05-12 22:00:37 +00:00
Jonathan Peyton	2b749b33cc	Fix team reuse with foreign threads After hot teams were enabled by default, the library started using levels kept in the team structure. The levels are broken in case foreign thread exits and puts its team into the pool which is then re-used by another foreign thread. The broken behavior observed is when printing the levels for each new team, one gets 1, 2, 1, 2, 1, 2, etc. This makes the library believe that every other team is nested which is incorrect. What is wanted is for the levels to be 1, 1, 1, etc. Differential Revision: http://reviews.llvm.org/D19980 llvm-svn: 269363	2016-05-12 21:54:30 +00:00
Paul Osmialowski	562a3c2b66	New hwloc API compatibility Differential Revision: http://reviews.llvm.org/D19628 llvm-svn: 269284	2016-05-12 11:46:40 +00:00
Hal Finkel	55acbf8877	Restore NULL flag check in __kmp_null_resume_wrapper This reverts a presumably unintentional change in: r268640 - [STATS] Use partitioned timer scheme and fixes segfaults in an x86_64 debug build of the runtime library. llvm-svn: 269259	2016-05-12 00:54:08 +00:00
Paul Osmialowski	52bef53f86	Fine tuning of TC* macros This patch introduces the following: * TCI_* and TCD_* macros for incrementation and decrementation * Fix for invalid use of TCR_8 in one expression Differential Revision: http://reviews.llvm.org/D19880 llvm-svn: 268826	2016-05-07 00:00:00 +00:00
Jonathan Peyton	11dc82fa83	[STATS] Use partitioned timer scheme This change replaces the current timers with ones that partition time properly. The current timers are nested, so that if a new timer, B, starts when the current timer, A, is already timing, A's time will include B's. To eliminate this problem, the partitioned timers are designed to stop the current timer (A), let the new timer run (B), and when the new timer is finished, restart the previously running timer (A). With this partitioning of time, a thread's timers all sum up to the OMP_worker_thread_life time and can now easily show the percentage of time a thread is spending in different parts of the runtime or user code. There is also a new state variable associated with each thread which tells where it is executing a task. This corresponds with the timers: OMP_task_, e.g., if time is spent in OMP_task_taskwait, then that thread executed tasks inside a #pragma omp taskwait construct. The changes are mostly changing the MACROs to use the new PARTITIONED_ macros, the new partitionedTimers class and its methods, and new state logic. Differential Revision: http://reviews.llvm.org/D19229 llvm-svn: 268640	2016-05-05 16:15:57 +00:00
Paul Osmialowski	fedce46bbd	NFC remove unneeded spaces (test commit) llvm-svn: 268462	2016-05-03 23:10:20 +00:00
Jonathan Peyton	8407f5b3bd	Remove architecture dependent Hwloc DEBUG section This debug section's functionality can be replicated using the environment variable KMP_TOPOLOGY_METHOD with different values and KMP_AFFINITY=verbose llvm-svn: 267472	2016-04-25 21:11:26 +00:00
Jonathan Peyton	1d5487c5d0	Fix buffer problem with printing long Hwloc affinity mask This change has the hwloc_bitmap_list_snprintf() function use the entire buffer to print the mask. There is no need to shorten the buffer length by 7. It only needs to be shortened by one byte. llvm-svn: 267470	2016-04-25 21:08:31 +00:00
Jonathan Peyton	b1467d1ef0	ARM Limited license agreement from the copyright/patent holder I have prepared some patches for LLVM OpenMP runtime, mostly addressing ARMv8 support. Before I upstream them, I must address legal issues that arose around my planned contribution. I was advised that before I send any substantial commit, I need to make sure that LICENSE.txt file in the projects repository contains a statement submitted by ARM, similar to the one provided by Intel (see "a license agreement from the copyright/patent holders"). This is the same situation as with top-level LLVM project: ARM has provided the same statement in http://llvm.org/svn/llvm-project/llvm/trunk/lib/Target/ARM/LICENSE.TXT file. Patch by Paul Osmialowski Differential Revision: http://reviews.llvm.org/D19319 llvm-svn: 267446	2016-04-25 19:12:20 +00:00
Jonathan Peyton	a1202bf594	[ITTNOTIFY] Remove serialized parallel regions from frame notification llvm-svn: 266760	2016-04-19 16:55:17 +00:00
Jonathan Peyton	5235a1b603	Fix trip count calculation for parallel loops in runtime The trip count calculation was incorrect for loops with large bounds. For example, for(int i=-2,000,000,000; i < 2,000,000,000; i+=50000000), the trip count calculation had overflow (trying to calculate 2,000,000,000 + 2,000,000,000 with signed integers) and wasn't giving the right value. This patch fixes this error in the runtime by using unsigned integers instead. There is still a bug in the clang compiler component because it warns that there is overflow in the test case file when there isn't. This error isn't there for the Intel Compiler. So for now, the test case is designated as XFAIL. Differential Revision: http://reviews.llvm.org/D19078 llvm-svn: 266677	2016-04-18 21:38:29 +00:00
Jonathan Peyton	e6643daa18	Runtime support for untied tasks Introduced a counter of parts of an untied task submitted for execution. The counter controls whether all parts of the task are already finished. The compiler should generate re-submission of partially executed untied task by itself before exiting of each task part except for the lexical last part. Differential Revision: http://reviews.llvm.org/D19026 llvm-svn: 266675	2016-04-18 21:35:14 +00:00
Jonathan Peyton	f252010f69	Fix for pthread_setspecific (TLS and shutdown) problem Some codes that use TLS fail intermittently because one thread tries to write TLS values after the TLS key has been destroyed by another thread. This happens when one thread executes library shutdown (and destroys TLS keys), while another thread starts to execute the TLS key destructor routine. Before this change, the kmp_init_runtime flag was checked before calling pthread_* TLS functions, but this flag is set to FALSE later than the destruction of the TLS keys, which leads to failure. The fix is to check kmp_init_gtid instead, as this flag is unset before the destruction of TLS keys. Differential Revision: http://reviews.llvm.org/D19022 llvm-svn: 266674	2016-04-18 21:33:01 +00:00
Jonathan Peyton	e2289a427d	[STATS] Remove timePair class and unused functions llvm-svn: 266634	2016-04-18 17:27:30 +00:00
Jonathan Peyton	53eca5216e	[STATS] print Total_* stats on their own line llvm-svn: 266633	2016-04-18 17:24:20 +00:00
Jonathan Peyton	99ef4d0433	[ITTNOTIFY] Correct barrier imbalance time in case of tasks ittnotify fix for barrier imbalance time in case tasks exist. In the current implementation, task execution time is included into aggregated time on a barrier. This fix calculates task execution time and corrects the arrive time by subtracting the task execution time. Since __kmp_invoke_task() can not only be called on a barrier, the field th.th_bar_arrive_time is used to check if the function was called at the barrier (th.th_bar_arrive_time != 0). So for this check, th_bar_arrive_time is set to zero right after the value is used on the barrier. Differential Revision: http://reviews.llvm.org/D19030 llvm-svn: 266332	2016-04-14 16:06:49 +00:00
Jonathan Peyton	377aa40d84	Exponential back off logic for test-and-set lock This change adds back off logic in the test and set lock for better contended lock performance. It uses a simple truncated binary exponential back off function. The default back off parameters are tuned for x86. The main back off logic has a two loop structure where each is controlled by a user-level parameter: max_backoff - limits the outer loop number of iterations. This parameter should be a power of 2. min_ticks - the inner spin wait loop number of "ticks" which is system dependent and should be tuned for your system if you so choose. The "ticks" on x86 correspond to the time stamp counter, but on other architectures ticks is a timestamp derived from gettimeofday(). The user can modify these via the environment variable: KMP_SPIN_BACKOFF_PARAMS=max_backoff[,min_ticks] Currently, since the default user lock is a queuing lock, one would have to also specify KMP_LOCK_KIND=tas to use the test-and-set locks. Differential Revision: http://reviews.llvm.org/D19020 llvm-svn: 266329	2016-04-14 16:00:37 +00:00
Jonathan Peyton	2e379fc767	Add declarations of OpenMP 4.5 target/offload routines to headers All these routines are implemented in the offload library. llvm-svn: 266120	2016-04-12 20:37:18 +00:00
Jonathan Peyton	072772bf05	[STATS] Remove trailing whitespace in stats source files llvm-svn: 265437	2016-04-05 18:48:48 +00:00
Jonathan Peyton	50e8f18b52	OMP_WAIT_POLICY changes This change has OMP_WAIT_POLICY=active to mean that threads will busy-wait in spin loops and virtually never go to sleep. OMP_WAIT_POLICY=passive now means that threads will immediately go to sleep inside a spin loop. KMP_BLOCKTIME was the previous mechanism to specify this behavior via KMP_BLOCKTIME=0 or KMP_BLOCKTIME=infinite, but the standard OpenMP environment variable should also be able to specify this behavior. Differential Revision: http://reviews.llvm.org/D18577 llvm-svn: 265339	2016-04-04 19:38:32 +00:00
Jonathan Peyton	1d46d979a9	Fix bug when KMP_USE_ADAPTIVE_LOCKS is 0 #endif was one line too low. If KMP_USE_ADAPTIVE_LOCKS is 0, then queuing locks would incorrectly use drdpa lock mechanism. This is a fix for https://llvm.org/bugs/show_bug.cgi?id=26649 llvm-svn: 264934	2016-03-30 21:50:59 +00:00
Jonathan Peyton	4cfe93c599	Fix comment in kmp_wait_release.h Removed reference to "ref ct" in a comment, as ref_ct no longer exists. Also moved the comment to where the task_team is about to be tested if NULL. llvm-svn: 264786	2016-03-29 21:08:29 +00:00
Jonathan Peyton	ee2f96c79b	Fix incorrect indentation in kmp_alloc.c llvm-svn: 264777	2016-03-29 20:10:00 +00:00
Jonathan Peyton	a58563d8c9	Remove dead KMP_USE_POOLED_ALLOC code llvm-svn: 264776	2016-03-29 20:05:27 +00:00
Jonathan Peyton	316af8de48	[STATS] Missing check for MIC in config-ix.cmake llvm-svn: 264616	2016-03-28 18:53:10 +00:00
Hal Finkel	01bb2406a3	Fixing the non-x86 build by removing dependence on kmp_cpuid_t The problem is that the definition of kmp_cpuinfo_t contains: char name [3*sizeof (kmp_cpuid_t)]; // CPUID(0x80000002,0x80000003,0x80000004) and kmp_cpuid_t is only defined when compiling for x86. Differential Revision: http://reviews.llvm.org/D18245 llvm-svn: 264535	2016-03-27 13:24:09 +00:00
Jonas Hahnfeld	e46a494a50	[OMPT] Fix parallel_id and task_id in loop_end with schedule static For serialized parallel regions, wrong ids were reported. Now the same code is used as in kmp_dispatch.cpp which emits the correct ids. Differential Revision: http://reviews.llvm.org/D18348 llvm-svn: 264266	2016-03-24 12:52:20 +00:00
Jonas Hahnfeld	801fe9bbe2	[OMPT] Test ids reported by ompt_get_{parallel,task}_id llvm-svn: 264265	2016-03-24 12:52:11 +00:00
Jonas Hahnfeld	1c1c71776a	[OMPT] Fix duplicate implicit_task_end events for master thread with GCC For non-serialized parallel regions the master thread issued two callbacks: The first one in kmp_gsupport.c and the second in __kmp_join_call. Therefore only trigger the callback in kmp_gsupport.c for serialized parallel regions. Differential Revision: http://reviews.llvm.org/D16716 llvm-svn: 264264	2016-03-24 12:52:04 +00:00
Jonathan Peyton	b7d30cbc7e	Fix Visual Studio builds Have Visual Studio use MemoryBarrier() instead of _mm_mfence() and remove __declspec align attribute from function parameters in kmp_atomic.h llvm-svn: 264166	2016-03-23 16:27:25 +00:00
Jonas Hahnfeld	b1cad2954b	[OMPT] Make tests require OMPT_BLAME ompt_event_barrier_{begin,end} are optional blame events. In total it doesn't make any sense to test partially built OMPT support. llvm-svn: 264031	2016-03-22 08:23:24 +00:00
Jonas Hahnfeld	c804301113	[OMPT] Create infrastructure and add first tests for OMPT Some basic checks next to the implementation should further lower the possibility of introducing regressions. (Note that this would have caught the ordering issue fixed in rL258866 and pointed to rL263940.) The tests are implementation dependent in one point because they assume that thread ids are assigned in ascending order. This is not defined by the standard but currently ensured in libomp. We have to think about another way of ordering the threads should this ever be subject to change... Note that this isn't aiming at replacing the implementation independent test-suite at https://github.com/OpenMPToolsInterface/ompt-test-suite! Differential Revision: http://reviews.llvm.org/D16715 llvm-svn: 264027	2016-03-22 07:22:49 +00:00
Jonathan Peyton	93a879ce78	[STATS] Add OMP_critical and OMP_critical_wait timers OMP_critical - time spent in critical section OMP_critical_wait - time spent waiting to enter a critical section llvm-svn: 263967	2016-03-21 18:32:26 +00:00
Jonathan Peyton	97cbb42d90	[STATS] separate noTotal bit flag from onlyInMaster and noUnits This change logically separates the stats_flags_e::noTotal bit flag from the stats_flags_e::onlyInMaster and stats_flags_e::noUnits bit flags. If no TOTAL_foo output is wanted for a particular statistic, the flag must be explicitly included in that statistic's flags. Differential Revision: http://reviews.llvm.org/D18198 llvm-svn: 263954	2016-03-21 17:26:23 +00:00
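
Note on commit 5235a1b603 (trip count calculation): the overflow described in that message can be illustrated with a small sketch. This is not the runtime's actual code; the helper name trip_count and the restriction to 32-bit bounds with a positive increment are assumptions made only for this illustration. The idea is the one the message names: do the subtraction in unsigned arithmetic so that the distance between the bounds does not overflow a signed integer.

```c
#include <stdint.h>

/* Hypothetical helper (not part of libomp): number of iterations of
 *     for (i = lower; i < upper; i += incr)
 * for a positive increment. */
static uint32_t trip_count(int32_t lower, int32_t upper, int32_t incr)
{
    if (incr <= 0 || lower >= upper)
        return 0; /* empty loop, or a case this sketch does not handle */

    /* Convert to unsigned before subtracting. Unsigned arithmetic wraps
     * modulo 2^32, so the result is the true distance between the bounds
     * even when upper - lower would overflow a signed 32-bit integer. */
    uint32_t span = (uint32_t)upper - (uint32_t)lower;

    /* Ceiling division of span by incr, written so it cannot overflow. */
    return (span - 1u) / (uint32_t)incr + 1u;
}
```

For the loop quoted in the commit message, trip_count(-2000000000, 2000000000, 50000000) evaluates to 80, whereas computing upper - lower in signed 32-bit arithmetic would overflow.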

363 Commits