llvm-project

Commit Graph

Author	SHA1	Message	Date
Tue Ly	5c6db1dc9b	[libc] Fix nested namespace issues with multiply_add.h. The FMA header was included inside namespaces in multiply_add.h. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D123539	2022-04-11 17:30:02 -04:00
Siva Chandra Reddy	0258f56646	[libc] Add a definition of pthread_attr_t and its getters and setters. Not all attributes have been added to phtread_attr_t in this patch. They will be added gradually in future patches. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D123423	2022-04-11 16:08:49 +00:00
Michael Jones	4f4752ee6f	[libc][NFC] implement printf parser This patch adds the sequential mode implementation of the printf parser, as well as unit tests for it. In addition it adjusts the surrounding files to accomodate changes in the design found in the implementation process. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D123339	2022-04-08 14:21:13 -07:00
Tue Ly	c5f8a0a1e9	[libc] Add support for x86-64 targets that do not have FMA instructions. Make FMA flag checks more accurate for x86-64 targets, and refactor polyeval to use multiply and add instead when FMA instructions are not available. Reviewed By: michaelrj, sivachandra Differential Revision: https://reviews.llvm.org/D123335	2022-04-08 14:12:24 -04:00
Siva Chandra Reddy	2ce09e680a	[libc] Add a linux Thread class in __support/threads. This change is essentially a mechanical change which moves the thread creation and join implementations from src/threads/linux to src/__support/threads/linux/thread.h. The idea being that, in future, a pthread implementation can reuse the common thread implementations in src/__support/threads. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D123287	2022-04-07 16:13:21 +00:00
Michael Jones	5561ab3495	[libc] Add holder class for va_lists This class is intended to be used in cases where a class is being used on a va_list. It provides destruction and copy semantics with small overhead. This is intended to be used in printf. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D123061	2022-04-05 11:39:57 -07:00
Siva Chandra Reddy	83f153ce34	[libc] Add pthread_mutexattr_t type and its setters and getters. A simple implementation of the getters and setters has been added. More logic can be added to them in future as required. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D122969	2022-04-04 18:11:12 +00:00
Siva Chandra Reddy	6a7cd4a1df	[libc][NFC] Do not call mmap and munmap from thread functions. Instead, memory is allocated and deallocated using mmap and munmap syscalls directly. Reviewed By: lntue, michaelrj Differential Revision: https://reviews.llvm.org/D122876	2022-04-02 05:12:08 +00:00
Michael Jones	c4a1b07d09	[libc][NFC] add outline of printf This patch adds the headers for printf. It contains minimal actual code, and is more intended to be used for design review. The code is not built yet, and may have minor errors. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D122773	2022-04-01 14:36:17 -07:00
Siva Chandra	97417e0300	[libc] Enable threads.h functions on aarch64. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D122788	2022-03-31 08:42:07 -07:00
Tue Ly	a5466f0436	[libc] Improve the performance of expm1f. Improve the performance of expm1f: - Rearrange the selection logic for different cases to improve the overall throughput. - Use the same degree-4 polynomial for large inputs as `expf` (https://reviews.llvm.org/D122418), reduced from a degree-7 polynomial. Performance benchmark using perf tool from CORE-MATH project (https://gitlab.inria.fr/core-math/core-math/-/tree/master): Before this patch: ``` $ ./perf.sh expm1f CORE-MATH reciprocal throughput : 15.362 System LIBC reciprocal throughput : 53.288 LIBC reciprocal throughput : 54.572 $ ./perf.sh expm1f --latency CORE-MATH latency : 57.759 System LIBC latency : 147.146 LIBC latency : 118.057 ``` After this patch: ``` $ ./perf.sh expm1f CORE-MATH reciprocal throughput : 15.359 System LIBC reciprocal throughput : 53.188 LIBC reciprocal throughput : 14.600 $ ./perf.sh expm1f --latency CORE-MATH latency : 57.774 System LIBC latency : 147.119 LIBC latency : 60.280 ``` Reviewed By: michaelrj, santoshn, zimmermann6 Differential Revision: https://reviews.llvm.org/D122538	2022-03-30 19:23:25 -04:00
Michael Jones	9276074271	[libc][obvious] Add mfma to log2f In the previous patch adding -mfma to functions that need it for windows builds I missed log2f. Differential Revision: https://reviews.llvm.org/D122693	2022-03-29 16:34:52 -07:00
Michael Jones	2f8829aba3	[libc] Add mfma option to functions that use fma On Windows the functions that use fma don't properly include the fma intrinsics unless -mfma is added to the compile options. This patch adds the compile option to all of the functions that need it. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D122689	2022-03-29 16:23:36 -07:00
Michael Jones	4ac3f7e41a	[libc][obvious] fix sqrt when long double is double Previously, the "fsqrt" instruction was used on all x86_64 platforms for finding the square root of long doubles. On long double is double platforms (e.g. windows) this created errors. This patch changes square root function for long doubles to be the same as the one for doubles if long doubles are doubles. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D122688	2022-03-29 16:05:40 -07:00
Guillaume Chatelet	df838dbcfa	[NFC][libc] Disable benchmarks when the LLVM benchmark target is not available Fixes https://github.com/llvm/llvm-project/issues/53686 Differential Revision: https://reviews.llvm.org/D122481	2022-03-29 08:45:53 +00:00
Siva Chandra Reddy	2c20c9003b	[libc] Set rtlib to compiler-rt in integration tests. The clang driver to picks up compiler runtime files using full paths. Without this, at least for aarch64, the driver tries to pick up the compiler runtime files from the working directory. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D122617	2022-03-28 23:46:21 +00:00
Tue Ly	6168b42225	[libc] Improve the performance of expf. Reduce the polynomial's degree from 7 down to 4. Currently we use a degree-7 minimax polynomial on an interval of length 2^-7 around 0 to compute `expf`. Based on the suggestion of @santoshn and the RLIBM project (https://github.com/rutgers-apl/rlibm-all/blob/main/source/float/exp.c) and the improvement we made with `exp2f` in https://reviews.llvm.org/D122346, it is possible to have a good polynomial of degree-4 on a subinterval of length 2^(-7) to approximate e^x. We did try to either reduce the degree of the polynomial down to 3 or increase the interval size to 2^(-6), but in both cases the number of exceptional values exploded. So we settle with using a degree-4 polynomial of the interval of size 2^(-7) around 0. Reviewed By: sivachandra, zimmermann6, santoshn Differential Revision: https://reviews.llvm.org/D122418	2022-03-25 12:20:20 -04:00
Tue Ly	b9d87d7466	[libc] Improve the performance of exp2f. Reduce the range-reduction table size from 128 entries down to 64 entries, and reduce the polynomial's degree from 6 down to 4. Currently we use a degree-6 minimax polynomial on an interval of length 2^-7 around 0 to compute exp2f. Based on the suggestion of @santoshn and the RLIBM project (https://github.com/rutgers-apl/rlibm-prog/blob/main/libm/float/exp2.c) it is possible to have a good polynomial of degree-4 on a subinterval of length 2^(-6) to approximate 2^x. We did try to either reduce the degree of the polynomial down to 3 or increase the interval size to 2^(-5), but in both cases the number of exceptional values exploded. So we settle with using a degree-4 polynomial of the interval of size 2^(-6) around 0. Reviewed By: michaelrj, sivachandra, zimmermann6, santoshn Differential Revision: https://reviews.llvm.org/D122346	2022-03-24 18:06:37 -04:00
Michael Jones	6d8ce42825	[libc][obvious] only test FILE on working platforms The main code for the FILE struct is only enabled on platforms that it works on, but before this patch the tests were included unconditionally. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D122363	2022-03-24 10:19:34 -07:00
Siva Chandra Reddy	107ce71849	[libc] Use real objects and archives in integration tests. Previously, we used empty, non-ELF crti.o, crtn.o, libm.a and libc++.a files. Instead, we now still use dummies but they are real ELF object files and archives.	2022-03-24 07:02:33 +00:00
Siva Chandra Reddy	441606f5ff	[libc] Add implementations of fopen, flose, fread, fwrite and fseek. A follow up patch will add feof and ferror. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D122327	2022-03-24 04:20:12 +00:00
Michael Jones	6d0f5d95ad	[libc][obvious] add aligned_alloc as entrypoint This patch adds aligned_alloc as an entrypoint. Previously it was being included implicitly. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D122362	2022-03-23 16:44:15 -07:00
Siva Chandra Reddy	316f9fd638	[libc] Link the SCUDO integration tests to a special entrypoint collection. We were previously linking to libllvmlibc.a. But, with libllvmlibc.a now including functions which depend on the loader, we will have to use the LLVM libc loader as well. To avoid this, we will link to a special library which is just a collection of SCUDO allocator entrypoints. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D122360	2022-03-23 23:27:23 +00:00
Michael Jones	805899e68a	[libc] Change FEnv to use MXCSR as source of truth This patch primarily fixes the fenv implementation on Windows, since Windows uses the MXCSR in place of the x87 status registers for storing information about the floating point environment. This allows FEnv to work correctly on Windows, and successfully build. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D121839	2022-03-23 16:08:00 -07:00
Siva Chandra Reddy	7fdb50c813	[libc] Add a new rule add_integration_test. All existing loader tests are switched to an integration test added with the new rule. Also, the getenv test is now enabled as an integration test. All loader tests have been moved to test/integration. Also, the simple checker library for the previous loader tests has been moved to a separate directory of its own. A follow up change will perform more cleanup of the loader CMake rules to eliminate now redundent options. Reviewed By: lntue, michaelrj Differential Revision: https://reviews.llvm.org/D122266	2022-03-23 20:57:29 +00:00
Siva Chandra Reddy	3bfbb68e1e	[libc] Rename libc-integration-test to libc-api-test. Reviewed By: jeffbailey, michaelrj Differential Revision: https://reviews.llvm.org/D122272	2022-03-23 20:25:34 +00:00
Siva Chandra Reddy	a0f6d12cd4	[libc][File] Fix a bug under fseek(..., SEEK_CUR). Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D122284	2022-03-23 16:24:15 +00:00
Siva Chandra Reddy	b950a0d44d	[libc][Obvious] Remove an unnecessary dep and use inline_memcpy. An unnecessary dep of the getenv function is removed. From the x86_64 loader, a call to __llvm_libc::memcpy is replaced with call to __llvm_libc::inline_memcpy.	2022-03-22 07:08:57 +00:00
Siva Chandra Reddy	df4814d45d	[libc] Add a linux file implementation. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D121976	2022-03-21 07:07:22 +00:00
Siva Chandra Reddy	c236b41e45	[libc][NFC] Add the platform independent file target only if mutex is available. The platform independent file implementation is not an entrypoint so it cannot be excluded via the entrypoints.txt file. Hence, we need a special treatment to exclude it from the build. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D121947	2022-03-18 03:34:38 +00:00
Siva Chandra Reddy	29a631a273	[libc][NFC] Add a separate flag for capturing the '+' in fopen mode string. Having a separate flag helps in setting up proper flags when implementing, say the Linux specialization of File. Along the way, a signature for a function which is to be used to open files has been added. The implementation of the function is to be included in platform specializations. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D121889	2022-03-17 15:44:04 +00:00
Siva Chandra Reddy	2e7cb8c786	[libc] Remove references to the std threads library from __support/threads.	2022-03-16 19:35:40 +00:00
Siva Chandra Reddy	9527a2f58f	[libc][NFC] Keep the mutex with the base File data structure. This is now possible because we have a platform independent abstraction for mutexes. Reviewed By: lntue, michaelrj Differential Revision: https://reviews.llvm.org/D121773	2022-03-16 19:05:23 +00:00
Siva Chandra Reddy	2ebe971103	[libc][Obvious] Add missing licence headers and fix an include pattern.	2022-03-16 18:51:37 +00:00
Tue Ly	4c9bfec67c	[libc] Let exhaustive tests indicate each interval PASSED/FAILED. Let exhaustive tests indicate each interval PASSED/FAILED. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D121564	2022-03-16 09:56:03 -04:00
Alex Brachet	89aa4bd3fb	[libc] Unlock after pop_back	2022-03-16 04:00:26 +00:00
Alex Brachet	968c1aa54f	[libc][NFC] Use more common variable naming convention We were in between two styles when this file was initially checked in.	2022-03-16 03:34:54 +00:00
Siva Chandra Reddy	e9c9ee9fe6	[libc][NFC] Fix typos and reduntent code triggering compiler warinings.	2022-03-15 21:51:12 +00:00
Siva Chandra Reddy	827575a7f8	[libc] Add implementation of POSIX lseek function. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D121676	2022-03-15 16:24:48 +00:00
Alex Brachet	ae4b59f179	[libc] Fix exit not calling new handlers registered from a call to atexit in atexit handler	2022-03-15 15:18:41 +00:00
Alex Brachet	d7c920e1a0	[libc][BlockStore] Add back, pop_back and empty methods Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D121656	2022-03-15 15:11:57 +00:00
Tue Ly	64af346b18	[libc] Implement expm1f function that is correctly rounded for all rounding modes. Implement expm1f function that is correctly rounded for all rounding modes. This is based on expf implementation. From exhaustive testings, using expf implementation, and subtract 1.0 before rounding the final result to single precision gives correctly rounded results for all \|x\| > 2^-4 with 1 exception. When \|x\| < 2^-25, we use x + x^2 (implemented with a single fma). And for 2^-25 <= \|x\| <= 2^-4, we use a single degree-8 minimax polynomial generated by Sollya. Reviewed By: sivachandra, zimmermann6 Differential Revision: https://reviews.llvm.org/D121574	2022-03-15 10:24:56 -04:00
Siva Chandra Reddy	1ceb007939	[libc][Obvious] Fix typo in CMake file.	2022-03-15 08:32:05 +00:00
Tue Ly	58edd26255	[libc] Include -150 to the special cases at the beginning of exp2f function.	2022-03-14 10:06:27 -04:00
Tue Ly	64721a3312	[libc] Implement exp2f function that is correctly rounded for all rounding modes. Implement exp2f function that is correctly rounded for all rounding modes. Reviewed By: sivachandra, zimmermann6 Differential Revision: https://reviews.llvm.org/D121463	2022-03-14 09:42:37 -04:00
Tue Ly	38cadd90b7	[libc] Implement expf function that is correctly rounded for all rounding modes. Implement expf function that is correctly rounded for all rounding modes. Reviewed By: sivachandra, zimmermann6 Differential Revision: https://reviews.llvm.org/D121440	2022-03-11 07:16:47 -05:00
Siva Chandra Reddy	1f45a1071d	[libc][Obvious] Destroy the block store var in block store test.	2022-03-10 21:36:46 +00:00
Siva Chandra Reddy	19c6098097	[libc] Add a resizable container with constexpr constructor and destructor. The new container is used to store atexit callbacks. This way, we avoid the possibility of the destructor of the container itself getting added as an at exit callback. Reviewed By: abrachet Differential Revision: https://reviews.llvm.org/D121350	2022-03-10 21:28:23 +00:00
Tue Ly	0f031daea8	[libc] Initial support for darwin-aarch64. Add initial support for darwin-aarch64 (macOS M1). Some differences compared to linux-aarch64: - `math.h` defined `math_errhandling` by the compiler builtin `__math_errhandling()` but Apple Clang 13.0.0 on M1 does not support `__math_errhandling()` builtin as a macro function or a constexpr function. - `math.h` defines `UNDERFLOW` and `OVERFLOW` macros. - Besides 5 usual floating point exceptions: `FE_INEXACT`, `FE_UNDERFLOW`, `FE_OVERFLOW`, `FE_DIVBYZERO`, and `FE_INVALID`, `fenv.h` also has another floating point exception: `FE_FLUSHTOZERO`. The corresponding trap for `FE_FLUSHTOZERO` in the control register is at the different location compared to the status register. - `FE_FLUSHTOZERO` exception flag cannot be raised with the default CPU floating point operation mode. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D120914	2022-03-10 09:26:09 -05:00
Siva Chandra Reddy	83b8878fbb	[libc] Use the constexpr constructor to initialize exit handlers mutex. Reviewed By: abrachet Differential Revision: https://reviews.llvm.org/D121334	2022-03-10 02:52:35 +00:00

1 2 3 4 5 ...

772 Commits