llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	c4b852a909	[X86] Flesh out more of the avx512erintrin.h file. llvm-svn: 227719	2015-02-01 08:52:55 +00:00
Craig Topper	b01fc317c1	[X86] Use macros in AVX512ER header to allow ICE to be checked for immediate argument. llvm-svn: 227716	2015-02-01 08:05:12 +00:00
Craig Topper	67826a5883	[X86] Rename _mm512_valign_epi64/32 intrinsics to _mm512_alignr_epi64/32 to match Intel docs. Make immediate argument to them an ICE. Fix mask size for the alignd version. llvm-svn: 227713	2015-02-01 07:35:40 +00:00
Craig Topper	72c7d51251	[X86] Change rounding parameter of all the AVX512 builtins to an ICE. llvm-svn: 227712	2015-02-01 07:35:35 +00:00
Craig Topper	9fee8ab4f9	[x86] Remove tab characters from avxintrin.h. NFC. llvm-svn: 227676	2015-01-31 06:33:59 +00:00
Craig Topper	459554f164	[X86] Make order consistent between 'const' and 'int' in one of the intrinsic header files. NFC llvm-svn: 227675	2015-01-31 06:31:30 +00:00
Richard Smith	99335be950	Don't use BCPL comments here, in case someone wants to use <stdatomic.h> from C89 mode. llvm-svn: 227417	2015-01-29 03:34:39 +00:00
Hans Wennborg	2e56d950ff	Intrin.h: define _XCR_XFEATURE_ENABLED_MASK Users expect to be able to use this with _xgetbv. llvm-svn: 227270	2015-01-27 23:34:35 +00:00
Craig Topper	335e218760	[X86] Add intrinsics for AVX512 128 and 256 bit integer comparison of word and byte vectors. llvm-svn: 227186	2015-01-27 09:16:29 +00:00
Craig Topper	b4789096c0	[X86] Add AVX512 integer comparison intrinsics for word and byte vectors. llvm-svn: 227079	2015-01-26 09:24:10 +00:00
Craig Topper	2f25a5a875	[X86] Add more of the AVX512 integer comparison intrinsics. This adds 128 and 256 bit vectors of dwords and qwords. llvm-svn: 227075	2015-01-26 08:11:49 +00:00
Craig Topper	4cac1c2318	[X86] Add AVX512F integer comparison intrinsics to header file. llvm-svn: 227067	2015-01-25 23:30:07 +00:00
Adam Nemet	f893edeaea	[AVX512] Add sub-vector FP extracts Analogous to AVX2, these need to be implemented as macros to properly propagate the immediate index operand. Part of <rdar://problem/17688758> llvm-svn: 226496	2015-01-19 20:12:05 +00:00
Craig Topper	f557b09f14	[x86] Mark that the AVX-512 cmpps/cmppd builtins need an ICE for the comparison immediate. This requires converting to a macro in the header file. llvm-svn: 226421	2015-01-19 01:18:19 +00:00
Adam Nemet	c0cff244fc	[AVX512] Add intrinsics for masked aligned FP loads and stores Part of <rdar://problem/17688758> llvm-svn: 226298	2015-01-16 18:51:50 +00:00
Adam Nemet	63a951eb1c	[AVX512] Add FP unpack intrinsics These are implemented with __builtin_shufflevector just like AVX. We have some tests on the LLVM side to assert that these shufflevectors do indeed generate the corresponding unpck instruction. Part of <rdar://problem/17688758> llvm-svn: 225922	2015-01-14 01:31:17 +00:00
Ben Langmuir	c67a774e17	Add [extern_c] attribute to _Builtin_intrinsics module This allows users to import this module inside an extern "C" {} block. llvm-svn: 225835	2015-01-13 21:54:32 +00:00
Chandler Carruth	032d422d2e	Effectively revert r151058 which caused Clang's unwind.h to defer to libunwind in all cases when installed. At the time, Clang's unwind.h didn't provide huge chunks of the LSB-specified unwind interface, and was generally too anemic to use for real software. However, it has since then become a strict superset of the APIs provided by libunwind on Linux. Notably, you cannot compile llgo's libgo library against libunwind, but you can against Clang's unwind.h. So let's just use our header. =] I've checked pretty thoroughly for any incompatibilities, and I am not aware of any. An open question is whether or not we should continue to munge GNU_SOURCE here. I didn't touch that as it potentially has compatibility implications on systems I cannot easily test -- Darwin. If a Darwin maintainer can verify that this is in fact unnecessary and remove it, cool. Until then, leaving it in makes this change a no-op there, and only really relevant on Linux systems where it is pretty clearly the right way to go. llvm-svn: 224934	2014-12-29 13:29:38 +00:00
Chandler Carruth	f3cabbd424	Add a missing declaration to our unwind.h implementation. This is necessary to be fully compatible with existing software that calls into the linux unwind code. You can find documentation of this API and why it exists in the discussion about NPTL here: https://gcc.gnu.org/ml/gcc-patches/2003-09/msg00154.html llvm-svn: 224933	2014-12-29 13:29:36 +00:00
Chandler Carruth	28daca211c	[x86] Also add the missing type casts on the returns in the sha intrinsic header file. Along with r224822, this should restore the build bots to passing. llvm-svn: 224883	2014-12-27 11:50:51 +00:00
Craig Topper	ab70789199	[x86] Add missing typecast to __v4si to sha intrinsic header file. llvm-svn: 224882	2014-12-27 07:19:25 +00:00
Craig Topper	2094d8fe88	[x86] Add the (v)cmpps/pd/ss/sd builtins to match gcc. Use them in the sse intrinsic files. This still lower to the same intrinsics as before. This is preparation for bounds checking the immediate on the avx version of the builtin so we don't pass illegal immediates into the backend. Since SSE uses a smaller size immediate its not possible to bounds check when using a shared builtin. Rather than creating a clang specific builtin for the different immediate, I decided (after consulting with Chandler) that it was better to match gcc. llvm-svn: 224879	2014-12-27 06:59:57 +00:00
Eric Christopher	c67e1b6a2a	Make sure that vec_perm is listed as a static function in altivec.h. llvm-svn: 223871	2014-12-10 00:57:43 +00:00
Reid Kleckner	baf7709055	Implement __umulh with __int128 arithmetic Use the same approach as _umul128, but just return the high half. llvm-svn: 223316	2014-12-03 23:36:14 +00:00
David Majnemer	00973ce683	FullProduct should be _FullProduct llvm-svn: 223179	2014-12-02 23:44:40 +00:00
David Majnemer	5450763dd8	Intrin: shrx_u64 should be _shrx_u64 llvm-svn: 223176	2014-12-02 23:30:26 +00:00
David Majnemer	5f9afc59f8	Intrin: Add _umul128 Implement _umul128; it provides the high and low halves of a 128-bit multiply. We can simply use our __int128 arithmetic to implement this, we generate great code for it: movq %rdx, %rax mulq %rcx movq %rdx, (%r8) retq Differential Revision: http://reviews.llvm.org/D6486 llvm-svn: 223175	2014-12-02 23:30:24 +00:00
Reid Kleckner	e35b07ad49	Intercept __crt_va_* used by MSVC "14" Moving further into the implementor's namespace is good, but now we have one more name to intercept. llvm-svn: 222473	2014-11-20 22:44:03 +00:00
Bill Schmidt	8ff672d397	[PowerPC] Enable vec_perm for long long and double vector types for VSX VSX makes the "vector long long" and "vector double" types available. This patch enables the vec_perm interface for these types. The same builtin is generated regardless of the specified type, so no additional work or testing is needed in the back end. Tests are added to ensure this builtin is generated by the front end. llvm-svn: 221988	2014-11-14 13:10:13 +00:00
Bill Schmidt	cee13a2712	[PowerPC] Add VSX builtins for vec_div This patch adds builtin support for xvdivdp and xvdivsp, along with a new test case. The builtins are accessed using vec_div in altivec.h. Builtins are listed (mostly) alphabetically there, so inserting these changed the line numbers for deprecation warnings tested in test/Headers/altivec-intrin.c. There is a companion patch for LLVM. llvm-svn: 221984	2014-11-14 12:10:51 +00:00
Bill Schmidt	9ec8cea02b	[PowerPC] Add vec_vsx_ld and vec_vsx_st intrinsics This patch enables the vec_vsx_ld and vec_vsx_st intrinsics for PowerPC, which provide programmer access to the lxvd2x, lxvw4x, stxvd2x, and stxvw4x instructions. New code in altivec.h defines these in terms of new builtins, which are themselves defined in BuiltinsPPC.def. The builtins are converted to LLVM intrinsics in CGBuiltin.cpp. Additional code is added to builtins-ppc-vsx.c to verify the correct generation of the intrinsics. Note that I moved the other VSX builtins so all VSX builtins will be alphabetical in their own section in BuiltinsPPC.def. There is a companion patch for LLVM. llvm-svn: 221768	2014-11-12 04:19:56 +00:00
Craig Topper	8c7f251e98	Add FSGSBASE intrinsics to x86 intrinsic headers. llvm-svn: 221130	2014-11-03 06:51:41 +00:00
Craig Topper	554797f255	Remove definitions from Intrin.h that already exist in one of the other x86 intrinsic headers. Add a run line with Broadwell as the cpu type to ms-intrin.cpp test to catch some of these in the future. llvm-svn: 221127	2014-11-03 04:19:58 +00:00
Craig Topper	e1c664b136	Add _lzcnt_u32 and _lzcnt_u64 to lzcntintrin.h to match Intel documentation names for these intrinsics. llvm-svn: 221066	2014-11-01 22:50:57 +00:00
Craig Topper	a52e0d7cc0	Avoid undefined behavior in the x86 bmi header file by explicitly checking for 0 before calling __builtin_ctz. Without this the optimizers may take advantage of the undefined behavior and produce incorrect results. LLVM itself still needs to be taught to merge the zero check into the llvm.cttz with defined zero behavior. llvm-svn: 221065	2014-11-01 22:50:54 +00:00
Craig Topper	3ca55d9c41	Avoid undefined behavior in the x86 lzcnt header file by explicitly checking for 0 before calling __builtin_clz. Without this the optimizers may take advantage of the undefined behavior and produce incorrect results. LLVM itself still needs to be taught to merge the zero check into the llvm.ctlz with defined zero behavior. llvm-svn: 221064	2014-11-01 22:25:23 +00:00
Bill Schmidt	691e01d94e	[PowerPC] Initial VSX intrinsic support, with min/max for vector double Now that we have initial support for VSX, we can begin adding intrinsics for programmer access to VSX instructions. This patch performs the necessary enablement in the front end, and tests it by implementing intrinsics for minimum and maximum using the vector double data type. The main change in the front end is to no longer disallow "vector" and "double" in the same declaration (lib/Sema/DeclSpec.cpp), but "vector" and "long double" must still be disallowed. The new intrinsics are accessed via vec_max and vec_min with changes in lib/Headers/altivec.h. Note that for v4f32, we already access corresponding VMX builtins, but with VSX enabled we should use the forms that allow all 64 vector registers. The new built-ins are defined in include/clang/Basic/BuiltinsPPC.def. I've added a new test in test/CodeGen/builtins-ppc-vsx.c that is similar to, but much smaller than, builtins-ppc-altivec.c. This allows us to test VSX IR generation without duplicating CHECK lines for the existing bazillion Altivec tests. Since vector double is now legal when VSX is available, I've modified the error message, and changed where we test for it and for vector long double, since the target machine isn't visible in the old place. This serendipitously removed a not-pertinent warning about 'long' being deprecated when used with 'vector', when "vector long double" is encountered and we just want to issue an error. The existing tests test/Parser/altivec.c and test/Parser/cxx-altivec.cpp have been updated accordingly, and I've added test/Parser/vsx.c to verify that "vector double" is now legitimate with VSX enabled. There is a companion patch for LLVM. llvm-svn: 220989	2014-10-31 19:19:24 +00:00
Saleem Abdulrasool	a25fbef088	CodeGen: add __readfsdword builtin The Windows NT SDK uses __readfsdword and declares it as a compiler provided builtin (#pragma intrinsic(__readfsword). Because intrin.h is not referenced by winnt.h, it is not possible to provide an out-of-line definition for the intrinsic. Provide a proper compiler builtin definition. llvm-svn: 220859	2014-10-29 16:35:41 +00:00
NAKAMURA Takumi	a267847538	<float.h>: Don't seek #include_next if -ffreestanding for targeting mingw. llvm-svn: 220356	2014-10-22 01:25:49 +00:00
Hans Wennborg	818514b718	vadefs.h: be even more conservative and only define the macros if already defined llvm-svn: 219745	2014-10-14 23:20:25 +00:00
Hans Wennborg	752b789e7b	Sort files list in lib/Headers/CMakeLists.txt majnemer pointed out that vadefs.h was added in the wrong place. Might as well sort the rest too. llvm-svn: 219743	2014-10-14 23:15:43 +00:00
Hans Wennborg	adfd7f6ef4	MS Compat: interpose vadefs.h to fix definitions of _crt_va_{start,end,arg} (PR21247) Differential revision: http://reviews.llvm.org/D5784 llvm-svn: 219740	2014-10-14 22:35:42 +00:00
Robert Khasanov	33e7685b2a	Added new headers to CMakeLists.txt. Fix for rev219319 llvm-svn: 219325	2014-10-08 17:37:51 +00:00
Robert Khasanov	b9f3a911c9	[AVX512] Added VPCMPEQ intrinsics to headers. Added tests. Patch by Maxim Blumenthal <maxim.blumenthal@intel.com> llvm-svn: 219319	2014-10-08 17:18:13 +00:00
Bill Schmidt	cad3a5f7d4	[PATCH][Power] Fix (and deprecate) vec_lvsl and vec_lvsr for little endian The use of the vec_lvsl and vec_lvsr interfaces are discouraged for little endian targets since Power8 hardware is a minimum requirement, and Power8 provides reasonable performance for unaligned vector loads and stores. Up till now we have not provided "correct" (i.e., big- endian-compatible) code generation for these interfaces, as to do so produces poorly performing code. However, this has become the source of too many questions. With this patch, LLVM will now produce compatible code for these interfaces, but will also produce a deprecation warning message for PPC64LE when one of them is used. This should make the porting direction clearer to programmers. A similar patch has recently been committed to GCC. This patch includes a test for the warning message. There is a companion patch that adds two unit tests to projects/test-suite. llvm-svn: 219137	2014-10-06 19:02:20 +00:00
Hal Finkel	6970ac8b0a	Add an implementation of C11's stdatomic.h Adds a Clang-specific implementation of C11's stdatomic.h header. On systems, such as FreeBSD, where a stdatomic.h header is already provided, we defer to that header instead (using our __has_include_next technology). Otherwise, we provide an implementation in terms of our __c11_atomic_* intrinsics (that were created for this purpose). C11 7.1.4p1 requires function declarations for atomic_thread_fence, atomic_signal_fence, atomic_flag_test_and_set, atomic_flag_test_and_set_explicit, and atomic_flag_clear, and requires that they have external linkage. Accordingly, we provide these declarations, but if a user elides the shadowing macros and uses them, then they must have a libc (or similar) that actually provides definitions. atomic_flag is implemented using _Bool as the underlying type. This is consistent with the implementation provided by FreeBSD and also GCC 4.9 (at least when __GCC_ATOMIC_TEST_AND_SET_TRUEVAL == 1). Patch by Richard Smith (rebased and slightly edited by me -- Richard said I should drive at this point). llvm-svn: 218957	2014-10-03 04:29:40 +00:00
Richard Smith	ef99e4d88a	Fix interaction of max_align_t and modules. When building with modules enabled, we were defining max_align_t as a typedef for a different anonymous struct type each time it was included, resulting in an error if <stddef.h> is not covered by a module map and is included more than once in the same modules-enabled compilation of C11 or C++11 code. llvm-svn: 218931	2014-10-03 00:31:35 +00:00
Joerg Sonnenberger	2960178a77	Fix trailing commas in AMD define. llvm-svn: 218825	2014-10-01 21:22:17 +00:00
Joerg Sonnenberger	e028e05a7e	Add the various signature macros. llvm-svn: 218824	2014-10-01 21:21:42 +00:00
Joerg Sonnenberger	cf0740454d	Rename bit_RDRAND to bit_RDRND to match GCC's version of this header. llvm-svn: 218823	2014-10-01 21:21:16 +00:00

612 Commits