llvm-project

Commit Graph

Author	SHA1	Message	Date
Elena Demikhovsky	fcc6df310d	AVX-512: Added intrinsics to clang. The set is small, that what I have right now. Everybody is welcome to add more. llvm-svn: 213641	2014-07-22 11:31:39 +00:00
Viktor Kutuzov	99400a5a34	Revert D3908 due to issues on Mac platforms llvm-svn: 213450	2014-07-19 05:58:38 +00:00
Yi Kong	28d7b02687	ARM: Add ACLE memory barrier intrinsic mapping llvm-svn: 213261	2014-07-17 12:45:17 +00:00
Yi Kong	472e521cec	ARM: Add NOP intrinsic mapping in arm_acle.h llvm-svn: 212950	2014-07-14 15:32:29 +00:00
Saleem Abdulrasool	07257fe14e	Headers: add hint intrinsics to arm_acle.h This adds the ARM ACLE hint intrinsic wrappers to arm_acle.h. These need to be protected with a !defined(_MSC_VER) since MSVC (and thus clang in compatibility mode) provide these wrappers as proper builtin intrinsics. llvm-svn: 212891	2014-07-12 23:27:26 +00:00
Yi Kong	4e00ce7d0c	Improve comments of ARM ACLE header file and tests Include section number in ARM ACLE specification for easier navigation. llvm-svn: 212887	2014-07-12 22:48:13 +00:00
Viktor Kutuzov	63537656c6	Add clang headers that fix machine-dependent definitions on FreeBSD 9.2 Differential Revision: http://reviews.llvm.org/D3908 llvm-svn: 212689	2014-07-10 08:43:39 +00:00
Nico Weber	a62cffae52	Don't pull in setjmp.h in -ffreestanding compiles. Also provide _setjmpex(). r200243 put in _setjmp() and _setjmpex() behind a comment since jmp_buf wasn't available. r200344 added jmp_buf and put in _setjmp(), but missed _setjmpex(). llvm-svn: 212557	2014-07-08 18:34:46 +00:00
Nico Weber	1287091373	Replace a few // comments with /**/ comments in headers, for consistency. llvm-svn: 212556	2014-07-08 18:29:27 +00:00
Saleem Abdulrasool	c4ebb129b7	Headers: conditionalise more declarations Protect MMX specific declarations under a __MMX__ guard. This header can be included on non-x86 architectures (e.g. ARM) which do not support the MMX ISA. Use the preprocessor to prevent these declarations from being processed. llvm-svn: 212512	2014-07-08 05:46:04 +00:00
Saleem Abdulrasool	60df0615b6	Headers: mark arm_acle.h with extern "C" Although the functions are marked as always_inline, the compiler with which they are used may not honour the extended attributes and emit them as functions. In such a case, indicate that they should have extern "C" linkage and should not be mangled in C++ style if used within C++. llvm-svn: 212511	2014-07-08 05:46:00 +00:00
Renato Golin	47843efcf6	Add the __qdbl intrinsic to the arm_acle.h header Patch by: Moritz Roth llvm-svn: 212264	2014-07-03 10:14:52 +00:00
Yaron Keren	672efea2e9	Added standard macro guard. In case __GNUC_VA_LIST was not defined or defined identically before there will not be any change in functionality. MinGW-w64 defines __GNUC_VA_LIST as #define __GNUC_VA_LIST which is different than the definition here, causing a warning without the guard. llvm-svn: 212183	2014-07-02 15:25:03 +00:00
Andrea Di Biagio	eb606a3c27	[x86] Add Clang support for intrinsic __rdpmc. This patch adds intrinsic __rdpmc to header file 'ia32intrin.h'. Intrinsic __rdmpc can be used to read performance monitoring counters. It is implemented as a direct call to __builtin_ia32_rdpmc. It takes as input a value representing the index of the performance counter to read. The value of the performance counter is then returned as a unsigned 64-bit quantity. llvm-svn: 212053	2014-06-30 18:23:58 +00:00
Yi Kong	a44c4d7173	Introduce arm_acle.h supporting existing LLVM builtin intrinsics Summary: This patch introduces ACLE header file, implementing extensions that can be directly mapped to existing Clang intrinsics. It implements for both AArch32 and AArch64. Reviewers: t.p.northover, compnerd, rengolin Reviewed By: compnerd, rengolin Subscribers: rnk, echristo, compnerd, aemerson, mroth, cfe-commits Differential Revision: http://reviews.llvm.org/D4296 llvm-svn: 211962	2014-06-27 21:25:42 +00:00
Saleem Abdulrasool	702eefed9a	Headers: be a bit more careful about inline asm Conditionally include x86intrin.h if we are building for x86 or x86_64. Conditionalise definition of inline assembly routines which use x86 or x86_64 inline assembly. This is needed as clang can target Windows on ARM where these definitions may be included into user code. llvm-svn: 211716	2014-06-25 16:48:40 +00:00
Saleem Abdulrasool	114efe0dc8	CodeGen: improve ms instrincics support Add support for _InterlockedCompareExchangePointer, _InterlockExchangePointer, _InterlockExchange. These are available as a compiler intrinsic on ARM and x86. These are used directly by the Windows SDK headers without use of the intrin header. llvm-svn: 211216	2014-06-18 20:51:10 +00:00
Bill Schmidt	1cf7c64fa5	[PPC64LE] Run some existing Altivec tests on powerpc64le as well There are several Altivec tests that formerly ran only on big-endian targets (and in some cases only on 32-bit targets). It is useful to verify these on little-endian targets as well. While testing these, I discovered a typo in <altivec.h>. This is also fixed by this patch. llvm-svn: 210928	2014-06-13 18:30:06 +00:00
Bill Schmidt	56a6967000	[PPC64LE] Fix vec_sld and vec_vsldoi for little endian The vec_sld and vec_vsldoi interfaces perform a left-shift on vector arguments for both big and little endian. However, because they rely on the vec_perm interface which is endian-dependent, the permutation vector needs to be reversed for LE to get the proper shift direction. I've added some extra testing for these interfaces for LE in the builtins-ppc-altivec.c. llvm-svn: 210657	2014-06-11 15:48:46 +00:00
Bill Schmidt	7f6596bb13	[PPC64LE] Implement little-endian semantics for vec_sums The PowerPC vsumsws instruction, accessed via vec_sums, is defined architecturally with a big-endian bias, in that the second input vector and the result always reference big-endian element 3 (little-endian element 0). For ease of porting, the programmer wants elements 3 in both cases. To provide this semantics, for little endian we generate a permute for the second input vector prior to the vsumsws instruction, and generate a permute for the result vector following the vsumsws instruction. The correctness of this code is tested by the new sums.c test added in a previous patch, as well as the modifications to builtins-ppc-altivec.c in the present patch. llvm-svn: 210449	2014-06-09 03:31:47 +00:00
Bill Schmidt	d7c53a91df	[PPC64LE] Implement little-endian semantics for vec_unpack[hl] The PowerPC vector-unpack-high and vector-unpack-low instructions are defined architecturally with a big-endian bias, in that the vector element numbering is assumed to be "left to right" regardless of whether the processor is in big-endian or little-endian mode. This effectively reverses the meaning of "high" and "low." Such a definition is unnatural for little-endian code generation. To facilitate ease of porting, the vec_unpackh and vec_unpackl interfaces are designed to use natural element ordering, so that elements are numbered according to little-endian design principles when code is generated for a little-endian target. The desired semantics can be achieved by using the opposite instruction for little-endian mode. That is, when a call to vec_unpackh appears in the code, a vector-unpack-low is generated, and when a call to vec_unpackl appears in the code, a vector-unpack-high is generated. The correctness of this code is tested by the new unpack.c test added in a previous patch, as well as the modifications to builtins-ppc-altivec.c in the present patch. Note that these interfaces were originally incorrectly implemented when they take a vector pixel argument. This patch corrects this implementation for both big- and little-endian code generation. llvm-svn: 210391	2014-06-07 02:20:52 +00:00
Bill Schmidt	7f0a5c5141	[PPC64LE] Update builtins-ppc-altivec.c for PPC64 and PPC64LE The Altivec builtin test case test/CodeGen/builtins-ppc-altivec.c has always been executed only for 32-bit PowerPC. These tests are equally valid for 64-bit PowerPC. This patch updates the test to be run for three targets: powerpc-unknown-unknown, powerpc64-unknown-unknown, and powerpc64le-unknown-unknown. The expected code generation changes for some of the Altivec builtins for little endian, so this patch adds new CHECK-LE variants to the test for the powerpc64le target. These tests satisfy the testing requirements for some previous patches committed over the last couple of days for lib/Headers/altivec.h: r210279 for vec_perm, r210337 for vec_mul[eo], and r210340 for vec_pack. llvm-svn: 210384	2014-06-06 23:12:00 +00:00
Bill Schmidt	8a7b4f18bd	[PPC64LE] Implement little-endian semantics for vec_pack family The PowerPC vector-pack instructions are defined architecturally with a big-endian bias, in that the vector element numbering is assumed to be "left to right" regardless of whether the processor is in big-endian or little-endian mode. This definition is unnatural for little-endian code generation. To facilitate ease of porting, the vec_pack and related interfaces are designed to use natural element ordering, so that elements are numbered according to little-endian design principles when code is generated for a little-endian target. The vec_pack calls are implemented as calls to vec_perm, specifying selection of the odd-numbered vector elements. For little endian, this means the odd-numbered elements counting from the right end of the register. Since the underlying instructions count from the left end, we must instead select the even-numbered vector elements for little endian to achieve the desired semantics. The correctness of this code is tested by the new pack.c test added in a previous patch. I plan to later make the existing ppc32 Altivec compile-time tests work for ppc64 and ppc64le as well. llvm-svn: 210340	2014-06-06 15:10:47 +00:00
Bill Schmidt	7c0114f6e3	[PPC64LE] Implement little-endian semantics for vec_mul[eo] The PowerPC vector-multiply-even and vector-multiply-odd instructions are defined architecturally with a big-endian bias, in that the vector element numbering is assumed to be "left to right" regardless of whether the processor is in big-endian or little-endian mode. This definition is unnatural for little-endian code generation. To facilitate ease of porting, the vec_mule and vec_mulo interfacs are designed to use natural element ordering, so that elements are numbered according to little-endian design principles when code is generated for a little-endian target. The desired semantics can be achieved by using the opposite instruction for little-endian mode. That is, when a call to vec_mule appears in the code, a vector-multiply-odd is generated, and when a call to vec_mulo appears in the code, a vector-multiply-even is generated. The correctness of this code is tested by the new mult-even-odd.c test added in a previous patch. I plan to later make the existing ppc32 Altivec compile-time tests work for ppc64 and ppc64le as well. llvm-svn: 210337	2014-06-06 14:45:06 +00:00
Bill Schmidt	f7e289c0f2	[PPC64LE] Implement little-endian semantics for vec_perm The PowerPC vperm (vector permute) instruction is defined architecturally with a big-endian bias, in that the two input vectors are assumed to be concatenated "left to right" and the elements of the combined input vector are assumed to be numbered from "left to right" (i.e., with element 0 referencing the high-order element). This definition is unnatural for little-endian code generation. To facilitate ease of porting, the vec_perm interface is designed to use natural element ordering, so that elements are numbered according to little-endian design principles when code is generated for a little-endian target. The desired semantics can be achieved with the vperm instruction provided that the two input vector registers are reversed, and the permute control vector is complemented. The complementing is performed using an xor with a vector containing all one bits. Only the rightmost 5 bits of each element of the permute control vector are relevant, so it would be possible to complement the vector with respect to a <16xi8> vector containing all 31s. However, when the permute control vector is not a constant, using 255 instead has the advantage that the vec_xor can be recognized during code generation as a vnor instruction. (Power8 introduces a vnand instruction which could alternatively be generated.) The correctness of this code is tested by the new perm.c test added in a previous patch. I plan to later make the existing ppc32 Altivec compile-time tests work for ppc64 and ppc64le as well. llvm-svn: 210279	2014-06-05 19:07:40 +00:00
Adam Nemet	286ae08e7d	Implement AVX1 vbroadcast intrinsics with vector initializers These intrinsics are special because they directly take a memory operand (AVX2 adds the register counterparts). Typically, other non-memop intrinsics take registers and then it's left to isel to fold memory operands. In order to LICM intrinsics directly reading memory, we require that no stores are in the loop (LICM) or that the folded load accesses constant memory (MachineLICM). When neither is the case we fail to hoist a loop-invariant broadcast. We can work around this limitation if we expose the load as a regular load and then just implement the broadcast using the vector initializer syntax. This exposes the load to LICM and other optimizations. At the IR level this is translated into a series of insertelements. The sequence is already recognized as a broadcast so there is no impact on the quality of codegen. _mm256_broadcast_pd and _mm256_broadcast_ps are not updated by this patch because right now we lack the DAG-combiner smartness to recover the broadcast instructions. This will be tackled in a follow-on. There will be completing changes on the LLVM side to remove the LLVM intrinsics and to auto-upgrade bitcode files. Fixes <rdar://problem/16494520> llvm-svn: 209846	2014-05-29 20:47:29 +00:00
Sanjay Patel	1585fb94ab	added Intel's BMI intrinsic variants (fixes PR19431 - http://llvm.org/bugs/show_bug.cgi?id=19431) llvm-svn: 209769	2014-05-28 20:26:57 +00:00
Akira Hatanaka	5d28ea1451	Fix a bug in xmmintrin.h. The last step of _mm_cvtps_pi16 should use _mm_packs_pi32, which is a function that reads two __m64 values and packs four 32-bit values into four 16-bit values. <rdar://problem/16873717> llvm-svn: 209489	2014-05-23 00:38:07 +00:00
Timur Iskhodzhanov	a27b044166	Define the InterlockedCompareExchange64 intrinsic on 32-bits too llvm-svn: 208699	2014-05-13 13:59:05 +00:00
Filipe Cabecinhas	5d289b48b1	Patched clang to emit x86 blends as shufflevectors. Summary: Most of the clang header patch by Simon Pilgrim @ SCEE. Also fixed (or added) clang tests for these intrinsics. LLVM tests to make sure we get the blend instruction out of these shufflevectors are at http://reviews.llvm.org/D3600 Reviewers: eli.friedman, craig.topper, rafael Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D3601 llvm-svn: 208664	2014-05-13 02:37:02 +00:00
Nico Weber	272bcf6768	Let stddef.h respect __need_{wchar_t, size_t, NULL, ptrdiff_t, wint_t}. glibc expects that stddef.h only defines a single thing if either of these defines is set. For example, before this change, a C file containing #include <stdlib.h> int ptrdiff_t = 0; would compile with gcc but not with clang. Now it compiles with clang too. This also fixes PR12997, where older versions of the Linux headers would define NULL incorrectly, and glibc would define __need_NULL and expect stddef.h to redefine NULL with the correct definition. llvm-svn: 207606	2014-04-30 04:35:09 +00:00
Nico Weber	f077c51a70	Revert r207482; I fail at reading IRC. llvm-svn: 207483	2014-04-29 01:25:49 +00:00
Nico Weber	8af28c1e61	Let stddef.h redefine NULL if __need_NULL is set, as needed by glibc, PR12997. See the bug and the cfe-commits thread "[patch] Let stddef.h redefine NULL if __need_NULL is set" for discussion. Fixes PR12997 and is similar to the __need_wint_t bits already in this file. llvm-svn: 207482	2014-04-29 01:19:21 +00:00
Hans Wennborg	ac156e2225	Intrin.h: remove __rdtsc and __rdtscp declarations Since r207132, these are defined in ia32intrin.h. llvm-svn: 207134	2014-04-24 18:40:06 +00:00
Andrea Di Biagio	7ceec07cf6	[X86] Add Clang support for intrinsics __rdtsc and __rdtscp. This patch: 1. Adds a definition for two new GCCBuiltins in BuiltinsX86.def: __builtin_ia32_rdtsc; __builtin_ia32_rdtscp; 2. Replaces the already existing definition of intrinsic __rdtsc in ia32intrin.h with a simple call to the new GCC builtin __builtin_ia32_rdtsc. 3. Adds a definition for the new intrinsic __rdtscp in ia32intrin.h llvm-svn: 207132	2014-04-24 18:26:35 +00:00
Ben Langmuir	47d1ca4838	Rename lib/Headers/module.map to module.modulemap Don't install a file using the legacy spelling. llvm-svn: 206431	2014-04-17 00:52:48 +00:00
Reid Kleckner	6df5254d6f	intrin.h: Fix up bugs in the cr3 and msr intrinsics Don't include input and output regs in clobbers. Prefix some identifiers with __. Add a memory constraint to __readcr3 to prevent reordering. This constraint is heavy handed, but conservatively correct. Thanks to PaX Team for the suggestions. llvm-svn: 205778	2014-04-08 17:49:16 +00:00
Reid Kleckner	592dc61acf	intrin.h: Implement __readmsr, __readcr3, and __writecr3 Fixes PR19301. Based on a patch from Steven Graf! llvm-svn: 205751	2014-04-08 00:28:22 +00:00
Alexey Volkov	ae43aae96a	Added _rdtsc intrinsics by Robert Khasanov Differential Revision: http://llvm-reviews.chandlerc.com/D3212 llvm-svn: 205172	2014-03-31 08:08:46 +00:00
Tim Northover	fe7a445bf7	Install: add arm_neon.h header back I'd gone too far pruning aarch64_simd.h this time and took out one instance of arm_neon.h. This should restore us to the status quo. llvm-svn: 205111	2014-03-29 17:35:34 +00:00
Tim Northover	dca92dbc82	Remove stray references to aarch64_simd.h They were causing the autotools builds to fail. llvm-svn: 205103	2014-03-29 15:21:06 +00:00
Tim Northover	a2ee433c8d	ARM64: initial clang support commit. This adds Clang support for the ARM64 backend. There are definitely still some rough edges, so please bring up any issues you see with this patch. As with the LLVM commit though, we think it'll be more useful for merging with AArch64 from within the tree. llvm-svn: 205100	2014-03-29 15:09:45 +00:00
Reid Kleckner	7dd8bc0a84	Intrin.h: Implement _InterlockedExchangePointer llvm-svn: 204827	2014-03-26 16:09:48 +00:00
Hans Wennborg	a316933e09	MS intrinsics: __interlockedbittestandset(64) (PR19054) llvm-svn: 203816	2014-03-13 17:05:09 +00:00
Hans Wennborg	d9be72ec44	MS intrinsics: implement the __movs and __stos intrinsics (PR19054) llvm-svn: 203722	2014-03-12 22:00:32 +00:00
Hans Wennborg	a4421e03fa	MS intrinsics: implement __readgs{byte,word,dword,qword} (PR19054) llvm-svn: 203715	2014-03-12 21:09:05 +00:00
Hans Wennborg	dd0f5304f6	MS intrinsics: don't declare __readeflags and __writeeflags in Intrin.h They're already defined in ia32intrin.h, and this would cause including Intrin.h in 64-bit mode to fail because of conflicting types. Update ms-intrin.cpp to also run in 64-bit mode to catch things like this. llvm-svn: 203714	2014-03-12 21:09:03 +00:00
David Majnemer	1e57976ec0	Headers: Provide an ABI compatible max_align_t when _MSC_VER is defined Summary: Our usual definition of max_align_t wouldn't match up with MSVC if it was used in a template argument. Reviewers: chandlerc, rsmith, rnk Reviewed By: chandlerc CC: cfe-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2924 llvm-svn: 202911	2014-03-04 23:43:48 +00:00
Roman Divacky	b8322b13f8	The wmmintrin.h header includes two different sub-headers: one for AES support and one for PCLMUL support. The current immintrin.h header only includes wmmintrin.h if AES support is enabled. It should include it if either AES or PCLMUL is enabled (GCC's version of immintrin.h does this). Patch by John Baldwin! llvm-svn: 202871	2014-03-04 18:26:12 +00:00
Argyrios Kyrtzidis	7ffeea4ef3	[CMake] Add the newly introduced compiler header. llvm-svn: 202792	2014-03-04 06:28:23 +00:00
Alexey Bataev	af02c1c003	Fix for r202778 - Implement __readeflags and __writeeflags intrinsics (renamed res to __res) llvm-svn: 202784	2014-03-04 03:42:58 +00:00
Alexey Bataev	7cab007902	Implement __readeflags and __writeeflags intrinsics llvm-svn: 202778	2014-03-04 03:03:03 +00:00
Warren Hunt	0dc28ea301	[_mm_prefetch] Returning previously deleted comment. No functional change. It's unclear if the word FIXME is relevant given that the macro behaves as intended. llvm-svn: 201920	2014-02-22 00:47:24 +00:00
Warren Hunt	20e4a5d2af	Reapply 201734 but with appropriate gcc compatibility Because GCC incorrectly defines _mm_prefetch to take anything that casts to void, people have started using that behavior. The previous patch that made _mm_prefetch actually take a const char broke compatibility with existing code. This update to the patch leaves the macro that defines _mm_prefetch with the (void*) cast when _MSC_VER is not defined. llvm-svn: 201901	2014-02-21 23:08:53 +00:00
Daniel Jasper	2f0f297bdb	Revert r201734 and r201742. This breaks backwards compatibility with existing code. Previously, this was defined as #define _mm_prefetch(a, sel) (__builtin_prefetch((void )(a), 0, (sel))) Which basically accepts any pointer. Changing this to char simply breaks a lot of existing code. I have tried changing char* to "const void*", which seems to be the right thing as per Intel specification this should work on basically any pointer. However, apparently this breaks windows compatibility (because of a conflicting declaration in windows.h). So, we probably need to #ifdef this based on whether clang is compiling for windows. According to Chandler, this might be done by introducing an additional symbol to a fake type in BuiltinsX86.def and then condition the type expansion on the platform. llvm-svn: 201775	2014-02-20 11:10:48 +00:00
Chandler Carruth	7ce956ded4	Fix two pedantic issues with our builtin headers. The __STDC_VERSION__ for C99 is '199901L' and we shouldn't be comparing it with anything else. Neither of these should have had any impact in practice. llvm-svn: 201738	2014-02-19 23:38:18 +00:00
Warren Hunt	40d6f29ad8	Add _mm_prefetch and some others as MS builtins This patch adds several built-ins that are required for ms compatibility. _mm_prefetch must be a built-in because it takes a compile-time constant argument and our prior approach of using a #define to the current built-in doesn't work in the presence of re-declaration of _mm_prefetch. The others can be obtained by including the windows system headers. If a user includes the windows system headers but not intrin.h they still need to work and therefore must be built-in because we don't get a chance to implement them in intrin.h in this case. llvm-svn: 201734	2014-02-19 23:20:20 +00:00
Richard Smith	294e59a33b	Remove a broken attempt to cope with someone #undef'ing __has_include_next. This was broken because __has_include_next(...) would not be valid in a preprocessor condition if __has_include_next is not defined. llvm-svn: 201731	2014-02-19 22:53:42 +00:00
Chandler Carruth	e813984b43	Teach Clang to provide ::max_align_t in C11 and C++11 modes. This definition is not chosen idly. There is an unfortunate reality with max_align_t -- the specific nature of its definition leaks into the ABI almost immediately. Because it is part of C11 and C++11 it becomes essential for it to match with other systems on that ABI. There is an effort to discourage any further use of this construct as a consequence -- using max_align_t introduces an immediate ABI problem. We can never update it to have larger alignment even as the microarchitecture changes to necessitate higher alignment. =/ The particular definition here exactly matches the ABI of GCC's chosen ::max_align_t definition, for better or worse. This was written with the help of Richard Smith who was decoding the exact ABI implications of the selected definition in GCC. Notably, in-register arguments are impacted by the particular definition chosen. =/ No one is under the illusion that this is a "good" or "useful" definition of max_align_t, and we are working with the standards committee to specify a more useful interface to address this need. llvm-svn: 201729	2014-02-19 22:35:01 +00:00
Hans Wennborg	12fb89ec51	MS Intrin.h: implement __cpuidex and simplify __cpuid The two identical implementations of __cpuid for X86 / X86_64 were leftovers from my first iteration on the patch that implemented it. llvm-svn: 200568	2014-01-31 19:44:55 +00:00
Hans Wennborg	1fd6dd3616	Intrin.h: include setjmp.h to get a jmp_buf definition This makes sure that the ms-intrin.cpp test passes by providing a mock setjmp.h as a test input. llvm-svn: 200344	2014-01-28 23:01:59 +00:00
Hans Wennborg	740a4d6e46	Intrin.h: implement __rdtsc and __halt llvm-svn: 200343	2014-01-28 22:55:01 +00:00
Reid Kleckner	33630907d6	Revert "intrin.h: include setjmp.h to get a jmp_buf definition" This failed the ms-intrin.cpp test. This reverts commit r200237. This also comments out the _setjmpex declaration for now so that intrin.h will work on x64 targets. llvm-svn: 200243	2014-01-27 19:32:42 +00:00
Reid Kleckner	f08d658d48	Add implementations of some MSVC intrinsics Adds an implementation for _InterlockedCompareExchangePointer() and __faststorefence(). Patch by David Ziman! llvm-svn: 200239	2014-01-27 19:16:35 +00:00
Reid Kleckner	9b8dcebbca	intrin.h: include setjmp.h to get a jmp_buf definition This fixes an error on our _setjmpex declaration for 64-bit code and allows us to declare _setjmp for 32-bit code. llvm-svn: 200237	2014-01-27 19:14:09 +00:00
Reid Kleckner	924eb2afdc	Add 'static __inline__' to MSVC intrinsics with implementations This avoids warnings visible with -Wsystem-headers. llvm-svn: 200235	2014-01-27 18:48:02 +00:00
Eric Christopher	58b404398e	One more intrinsic. llvm-svn: 200061	2014-01-25 01:38:30 +00:00
Eric Christopher	439137ea32	Add missing intrinsics, fix a couple of typos in intrinsic names, and remove duplicate declarations. llvm-svn: 199992	2014-01-24 12:13:47 +00:00
Hans Wennborg	74ca0c4105	Add implementations of __readfs{byte,word,dword,qword} to Intrin.h Differential Revision: http://llvm-reviews.chandlerc.com/D2606 llvm-svn: 199958	2014-01-24 00:52:39 +00:00
Hans Wennborg	2ed8880346	Intrin.h: fix definitions of _Interlocked{In,De}crement16 The declarations seem correct, but the definitions were using chars instead of shorts. llvm-svn: 199923	2014-01-23 19:15:39 +00:00
NAKAMURA Takumi	c28a9a2c33	[CMake] Deprecate CLANG_RUNTIME_OUTPUT_INTDIR and CLANG_LIBRARY_OUTPUT_INTDIR. LLVM_*_OUTPUT_INTDIR should be available everywhere. It was my mistake when I introduced INTDIR stuff. llvm-svn: 199597	2014-01-19 13:00:01 +00:00
Hans Wennborg	854f7d34ec	Add implementations of _cpuid and _xgetbv to Intrin.h The _cpuid() implementation is the same as in lib/Headers/cpuid.h with the parameter names adjusted to match the interface. _xgetbv just does what the Intel manual says. Differential Revision: http://llvm-reviews.chandlerc.com/D2564 llvm-svn: 199439	2014-01-16 23:39:35 +00:00
NAKAMURA Takumi	baa9f533fe	[CMake][VS][XCode] Restruct the output directory layout more comfortable, ${BINARY_DIR}/${BUILD_MODE}/(bin\|lib) We have been seeing nasty directory layout with CMake multiconfig, such as, bin/Release/clang.exe lib/clang/3.x/... lib/Release/clang/3.x/.. (duplicated) Move the layout similar to autoconf's; Release/bin/clang.exe Release/lib/clang/3.x/... Checked on Visual Studio 10. Could you guys please confirm my change on XCode(and other multiconfig builders)? Note: Don't set variables CMAKE_*_OUTPUT_DIRECTORY any more, or a certain builder, for eaxample, msbuild.exe, would be confused. llvm-svn: 198205	2013-12-30 06:48:30 +00:00
NAKAMURA Takumi	38b8c938e8	[CMake] clang/lib/Headers: Install just-generated ${CMAKE_CURRENT_BINARY_DIR}/arm_neon.h, instead of copied arm_neon.h. llvm-svn: 197852	2013-12-21 01:56:00 +00:00
NAKAMURA Takumi	ea0c73b84e	clang/lib/Headers/CMakeLists.txt: Revert part of r197395. It should not be staged yet. llvm-svn: 197441	2013-12-17 00:02:38 +00:00
Nico Weber	ef9a766555	Add bit_FXSAVE as an alias for bit_FXSR, for gcc compat. llvm-svn: 197399	2013-12-16 17:54:57 +00:00
NAKAMURA Takumi	a8c958de47	[CMake] Introduce CLANG_RUNTIME_OUTPUT_INTDIR and CLANG_LIBRARY_OUTPUT_INTDIR. llvm-svn: 197395	2013-12-16 16:03:21 +00:00
Alp Toker	d480b1bf34	Fix a SSE2 intrinsics typo Full discourse at: http://lists.cs.uiuc.edu/pipermail/cfe-commits/Week-of-Mon-20131104/092514.html http://lists.cs.uiuc.edu/pipermail/llvmdev/2013-November/068124.html Patch by Dimitry Andric and Alexey Dokuchaev! llvm-svn: 195558	2013-11-23 22:11:57 +00:00
JF Bastien	1334d0aedf	Define [U]LLONG_{MIN,MAX} for C++11, add tests. Add tests for limits.h, not just [U]LLONG_{MIN,MAX}. llvm-svn: 193506	2013-10-27 19:00:49 +00:00
Manman Ren	c94122e05b	Intrinsics: fix extract & insert when index is out of bound. Now, all extract & insert intrinsics should have the correct and operation to ignore higher bits. rdar://15250497 llvm-svn: 193267	2013-10-23 20:33:14 +00:00
Manman Ren	be38b9e15f	_mm_extract_epi16: use "& 7" when index is out of bound. This is in line with implementation of _mm_extract_pi16. rdar://15250497 llvm-svn: 193187	2013-10-22 19:24:42 +00:00
Reid Kleckner	00d33a5cb1	Add implementations of the MSVC barrier intrinsics Summary: These are deprecated in VS 2012 according to MSDN. They don't actually compile down to any code. They prevent the compiler from reordering memory accesses across the barrier, which is what a memory-clobbering volatile asm does. Reviewers: echristo CC: cfe-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1954 llvm-svn: 192860	2013-10-17 01:29:48 +00:00
Ted Kremenek	854cc293a7	Suppress useless -Wshadow warning when using _mm* macros from emmintrin.h Fixes <rdar://problem/10679282>. I'm not completely satisfied with this patch. Sprinkling "diagnostic ignored" _Pragmas throughout this file is gross, but I couldn't suppress it for the entire file. llvm-svn: 192143	2013-10-07 23:51:11 +00:00
Craig Topper	d335c9da22	Use logical/arithmetic operations instead of builtins in tbmintrin.h. This way we can remove the intrinsic support from the backend. llvm-svn: 192036	2013-10-05 17:08:42 +00:00
Craig Topper	d867805739	Change __builtin_ia32_bextri_u64 to take an i64imm to match up with LLVM backend changes. An explicit cast is still needed in tbmintrin.h to convert any big integer down to i32imm. Patch from Yunzhong Gao. llvm-svn: 191872	2013-10-03 04:21:19 +00:00
Warren Hunt	2731e3e4ef	Fixing implementation of bittestandset in Intrin.h. llvm-svn: 191783	2013-10-01 17:12:40 +00:00
Warren Hunt	3f98794718	Changing __X86_64__ to __x86_64__ in Intrin.h. llvm-svn: 191700	2013-09-30 21:08:05 +00:00
Yunzhong Gao	f4e0b1047a	Adding intrinsics to the clang front end for the x86 TBM instruction set. Differential Revision: http://llvm-reviews.chandlerc.com/D1751 llvm-svn: 191681	2013-09-30 17:25:14 +00:00
Warren Hunt	41a993f6f8	Typo correction: _int64 -> __int64. llvm-svn: 191592	2013-09-28 00:15:41 +00:00
Warren Hunt	d6ffae91d5	Implements some of the more commonly used intrinsics in Intrin.h Differential Revision: http://llvm-reviews.chandlerc.com/D1766 llvm-svn: 191590	2013-09-27 23:57:26 +00:00
Craig Topper	61f71c3903	Remove some stray underscores from copyright block. Fix first line length to match length of the one after the copyright block. llvm-svn: 191483	2013-09-27 03:57:18 +00:00
Hans Wennborg	736a02931e	Provide inline definitions of _Unwind_GetIP etc. for ARM in unwind.h These symbols were showing up as undefined when trying to link programs on Android. We should match libgcc's behaviour and provide inline definitions of these on ARM. It seems unwind.h on ARM/Darwin doesn't provide inline definitions, so we just declare them for that platform. llvm-svn: 191406	2013-09-25 22:34:03 +00:00
Eli Friedman	3cd55f49ab	Fix argument types of some AVX2 intrinsics. This fix makes our headers consistent with gcc. PR17312. llvm-svn: 191248	2013-09-23 23:52:04 +00:00
Eli Friedman	f9d8c6cebb	Add _mm_stream_si64 intrinsic. While I'm here, also fix the alignment computation for the whole family of intrinsics. PR17298. llvm-svn: 191243	2013-09-23 23:38:39 +00:00
Eli Friedman	9b04f41899	Fix return type of _mm_extract_epi8 etc. PR17300. llvm-svn: 191120	2013-09-21 00:05:25 +00:00
Ben Langmuir	ed6e97d2c3	Fix ifdef macro missed in previous commit llvm-svn: 191003	2013-09-19 14:07:14 +00:00
Ben Langmuir	6efe3a886e	Move sha intrinsics to immintrin.h This is consistent with ICC and Intel's SHA-enabled GCC version. llvm-svn: 191002	2013-09-19 14:00:22 +00:00
Ben Langmuir	58078d0103	Add C intrinsics for Intel SHA Extensions Intrinsics added shaintrin.h, which is included from x86intrin.h if __SHA__ is enabled. SHA implies SSE2, which is needed for the __m128i type. Also add the -msha/-mno-sha option. llvm-svn: 190999	2013-09-19 13:22:04 +00:00
Reid Kleckner	f0e232287a	Fix ifdef ordering at the end of Intrin.h from r190965 Test that intrin.h at least parses in C++ TUs. llvm-svn: 190978	2013-09-19 00:19:53 +00:00
Eric Christopher	cc87253f90	Fix closing brace around ifdef. llvm-svn: 190965	2013-09-18 22:40:18 +00:00
Eric Christopher	0db88a7d7e	The intrinsics should all have C linkage. llvm-svn: 190963	2013-09-18 22:24:01 +00:00
Eric Christopher	e276f88947	Add Intrin.h to the cmake files. llvm-svn: 190199	2013-09-06 20:11:28 +00:00
Eric Christopher	fb4b433bbb	Typo. llvm-svn: 189710	2013-08-31 00:27:38 +00:00
Eric Christopher	d1428bf635	Add initial clang targeted compatible decls for Intrin.h. Step towards a windows compatible builtin header. Currently uses x86intrin.h for implementing intel intrinsics in a clang specific manner. llvm-svn: 189709	2013-08-31 00:22:48 +00:00
Peter Collingbourne	6c77e72659	Two more definitions required by libsupc++ (_sleb128_t and _uleb128_t) Differential Revision: http://llvm-reviews.chandlerc.com/D1542 llvm-svn: 189558	2013-08-29 01:56:22 +00:00
Peter Collingbourne	7ac84bd808	80 cols. llvm-svn: 189538	2013-08-28 23:32:22 +00:00
Peter Collingbourne	ec1cb850d1	Add missing definitions to unwind.h. Original patch by Charles Davis. llvm-svn: 189535	2013-08-28 23:16:49 +00:00
Ted Kremenek	80655be83f	[CMake] use combination of CMAKE_RUNTIME_OUTPUT_DIRECTORY and CMAKE_LIBRARY_OUTPUT_DIRECTORY to install clang headers for Xcode builds. llvm-svn: 189443	2013-08-28 05:38:43 +00:00
Ted Kremenek	2894000b48	Revert "Use CMAKE_RUNTIME_OUTPUT_DIRECTORY instead of LLVM_BINARY_DIR for installing Clang headers." This appears to be breaking the buildbots. llvm-svn: 189426	2013-08-28 00:07:08 +00:00
Ted Kremenek	ae2c8776d0	Use CMAKE_RUNTIME_OUTPUT_DIRECTORY instead of LLVM_BINARY_DIR for installing Clang headers. llvm-svn: 189414	2013-08-27 23:20:26 +00:00
Ted Kremenek	e4a0cac4a8	Revert "[CMake] Use CLANG_BINARY_DIR instead of LLVM_BINARY_DIR as installation path for Clang headers." This was breaking some tests. Will investigate. llvm-svn: 189403	2013-08-27 20:46:01 +00:00
Ted Kremenek	8ff42222ba	[CMake] Use CLANG_BINARY_DIR instead of LLVM_BINARY_DIR as installation path for Clang headers. llvm-svn: 189402	2013-08-27 20:41:18 +00:00
Juergen Ributzka	2c2dbf4542	Fix the name and the type of the argument for intrinisc _mm256_broadcastsi128_si256 to align with the Intel documentation. This fixes bug PR 16581 and rdar:14747994. llvm-svn: 188609	2013-08-17 16:40:09 +00:00
Craig Topper	c5244512c8	Use a shuffle with undef elements instead of inserting 0s in the 128-bit to 256-bit casting intrinsics to improve performance. Thanks to Katya Romanova for identifying this issue. llvm-svn: 187716	2013-08-05 06:17:21 +00:00
Roman Divacky	4dcb5dbb53	This patch implements __get_cpuid_max() as an inline and __cpuid() and __cpuid_count() as macros to be compatible with GCC's cpuid.h. It also adds bit_<foo> constants for the various feature bits as described in version 039 (May 2011) of Intel's SDM Volume 2 in the description of the CPUID instruction. The list of bit_<foo> constants is a bit exhaustive (GCC doesn't do near this many). More bits could be added from a newer version of SDM if desired. Patch by John Baldwin! llvm-svn: 186696	2013-07-19 17:28:36 +00:00
Richard Smith	49e56440f9	Add missing include guards into headers in lib/Headers. While it may appear that these headers should not be included more than once, they are in fact included twice when building our builtins module (in order for it to generate submodules for them), and without this, any modular build enabling AVX and including any builtin header fails. Testing this is tricky because including any of these headers in a modular build is liable to fail, due to unrelated builtin headers in the same module including headers which might not be available on the system running the tests. Suggestion on that front are welcome (but we're getting close to being able to run a buildbot that has modules enabled for all tests, which would nicely solve the testing problem). llvm-svn: 186275	2013-07-14 05:41:45 +00:00
Manman Ren	9bb34d66b3	X86 intrinsics: cmpge\|gt\|nge\|ngt_ss\|_sd These intrinsics should return the comparision result in the low bits and keep the high bits of the first source operand. When calling to builtin functions, the source operands are swapped and the high bits of the second source operand are kept. To fix the issue, an extra shufflevector is used. rdar://14153896 llvm-svn: 184110	2013-06-17 19:42:49 +00:00
Douglas Gregor	ae3a4dfac0	Even in a modules world, people will depend on the weird xmmintrin.h -> emmintrin.h forwarding. llvm-svn: 183585	2013-06-07 22:49:44 +00:00
Douglas Gregor	5cad45bc89	Add arm_neon.h to the builtin intrinsics module map. Fixes <rdar://problem/13933913>. llvm-svn: 182268	2013-05-20 14:07:18 +00:00
Richard Smith	0646c86dcb	Fix the return type of the complex creal functions. Patch by YunZhong Gao, modified to use _Static_assert and to check __STDC_HOSTED__ by me. llvm-svn: 181527	2013-05-09 17:41:19 +00:00
Benjamin Kramer	4baf67a61b	xopintrin.h: Add wrappers for all flavors of _mm_com. GCC defines only the wrappers, MSVC defines both, we define both now too. PR15844. llvm-svn: 181514	2013-05-09 15:07:46 +00:00
Benjamin Kramer	fd57b195a3	Add include guards to prfchwintrin.h. llvm-svn: 181513	2013-05-09 15:07:39 +00:00
Hans Wennborg	4c02be3b83	Make sure we define wchar_t related macros correctly in -fms-extensions mode. This adds a test to make sure we define _WCHAR_T_DEFINED and _NATIVE_WCHAR_T_DEFINED correctly in the preprocessor, and updates stddef.h to set it when typedeffing wchar_t. llvm-svn: 180918	2013-05-02 13:12:32 +00:00
Hans Wennborg	b2175b25a7	Fix typo in a stddef.h comment: s/risze_t/rsize_t/ llvm-svn: 180916	2013-05-02 10:36:31 +00:00
Benjamin Kramer	beea351287	Fix header comment. llvm-svn: 180268	2013-04-25 16:14:14 +00:00
Reid Kleckner	7ab75b3f68	Avoid names like __in that conflict with SAL in builtin headers Microsoft's Source Annotation Language (SAL) defines a bunch of keywords for annotating the inputs and outputs of functions. Empty definitions for the keywords are provided by <stdlib.h> -> <crtdefs.h> -> <sal.h>. This makes it basically impossible to include MSVC's stdlib.h and Clang's *mmintrin.h headers at the same time if they have variables named __in. As a workaround, I've renamed those variables. This fixes the Modules/compiler_builtins.m test which was XFAILed, presumably due to this conflict. llvm-svn: 179860	2013-04-19 17:00:14 +00:00
Argyrios Kyrtzidis	08dff958e9	[CMake] Create the directory before creating the link to the clang headers. llvm-svn: 179782	2013-04-18 18:54:03 +00:00
Daniel Dunbar	95f1de3de5	Headers: Add support for ISO9899:2011 rsize_t. llvm-svn: 179427	2013-04-12 23:24:56 +00:00
Richard Smith	2362829734	tl;dr: Teach Clang to work around g++ changing its workaround to glibc's implementation of C99's attempt to control the C++ standard. sigh The C99 standard says that certain macros in <stdint.h>, such as SIZE_MAX, should not be defined when the header is included in C++ mode, unless __STDC_LIMIT_MACROS and __STDC_CONSTANT_MACROS are defined. The C++11 standard says "Thanks, but no thanks" and C11 removed this rule, but various C library implementations (such as glibc) follow C99 anyway. g++ prior to 4.8 worked around the C99 / glibc behavior by defining __STDC__MACROS in <cstdint>, which was incorrect, because <stdint.h> is supposed to provide these macros too. g++ 4.8 works around it by defining __STDC__MACROS in its builtin <stdint.h> header. This change makes Clang act like g++ 4.8 in this regard: our <stdint.h> now countermands any attempt by the C library to implement the undesired C99 rules, by defining the __STDC_*_MACROS first. Unlike g++, we do this even in C++98 mode, since that was the intent of the C++ committee, matches the behavior required in C11, and matches our built-in implementation of <stdint.h>. llvm-svn: 179419	2013-04-12 22:11:07 +00:00
Richard Smith	584f7dcc0e	Add tests that build modules for our builtin headers, and fix two buglets exposed by doing so. llvm-svn: 178736	2013-04-04 02:55:24 +00:00
Argyrios Kyrtzidis	41686481f4	[cmake] Add clang-headers as a dependency of libclang and if we have to copy them for the IDE case, also create a symlink inside the libclang.dylib directory. llvm-svn: 178372	2013-03-29 21:51:40 +00:00
Michael Liao	ffaae3511a	Add RDSEED intrinsic support defined in AVX2 extension llvm-svn: 178331	2013-03-29 05:17:55 +00:00
Michael Liao	4442f796a4	Add XTEST intrinsic defined in TSX extension llvm-svn: 178330	2013-03-29 05:14:06 +00:00
Argyrios Kyrtzidis	95aa0b77f2	Revert "[lib/Headers] Define NULL as __DARWIN_NULL when on __APPLE__." Per feedback by Doug, we should avoid platform-specific implementations in lib/Headers as much as possible. This reverts commit r178110. llvm-svn: 178181	2013-03-27 21:22:45 +00:00
Argyrios Kyrtzidis	fff55a028b	[lib/Headers] Break the module import cycle between _Builtin_intrinsics.sse and _Builtin_intrinsics.sse2 Module "sse" implicitly exports module "sse2". This is bad because we also have module "sse2" export module "sse" (as intended) so we end up with a cycle in the module import graph: 1. sse2 -> (also imports) sse 2. sse -> (also imports) sse2 To eliminate the cycle remove 2.; importing module "sse2" will also import module "sse", but just importing module "sse" will not also import module "sse2". rdar://13240552 llvm-svn: 178117	2013-03-27 05:12:34 +00:00
Argyrios Kyrtzidis	0909d3c5ed	[lib/Headers] Define NULL as __DARWIN_NULL when on __APPLE__. This makes it identical with the system definition. llvm-svn: 178110	2013-03-27 01:25:37 +00:00
Michael Liao	74f4eaf4dc	Add PRFCHW intrinsic support - Add head 'prfchwintrin.h' to define '_m_prefetchw' which is mapped to LLVM/clang prefetch builtin - Add option '-mprfchw' to enable PRFCHW feature and pre-define '__PRFCHW__' macro llvm-svn: 178041	2013-03-26 17:52:08 +00:00
Douglas Gregor	96efb4a442	<rdar://problem/13479214> Make Clang's <stddef.h> robust against system headers defining size_t/ptrdiff_t/wchar_t. Clang's <stddef.h> provides definitions for the C standard library types size_t, ptrdiff_t, and wchar_t. However, the system's C standard library headers tend to provide the same typedefs, and the two generally avoid each other using the macros _SIZE_T/_PTRDIFF_T/_WCHAR_T. With modules, however, we need to see all of the places where these types are defined, so provide the typedefs (ignoring the macros) when modules are enabled. llvm-svn: 177686	2013-03-22 00:10:49 +00:00
Anton Yartsev	a3c9ba364e	PR15480: fixed second parameter types of vec_lde, vec_lvebx, vec_lvehx, and vec_lvewx according to AltiVec Programming Interface Manual llvm-svn: 176789	2013-03-10 16:25:43 +00:00
Richard Smith	8acb4044d8	libstdc++'s <cstdalign> #includes <stdalign.h> and expects it to guard against being included in C++. Don't define alignof or alignas in this case. Note that the C++11 standard is broken in various ways here (it refers to the contents of <stdalign.h> in C99, where that header did not exist, and doesn't mention the alignas macro at all), but we do our best to do what it intended. llvm-svn: 175708	2013-02-21 02:17:58 +00:00
Daniel Dunbar	230cc79394	[Headers] Use standard builtin defines instead of typeof trickery. - The trickery can confuse more basic source processors, in particular the Unix conformance tool that wants to scan headers. llvm-svn: 174475	2013-02-06 00:38:13 +00:00
Richard Smith	4dab709484	C11: Provide the missing half of <stdalign.h> llvm-svn: 173900	2013-01-30 06:33:54 +00:00
Richard Smith	0015f09877	Parsing support for C11's _Noreturn keyword. No semantics yet. llvm-svn: 172761	2013-01-17 22:16:11 +00:00
David Blaikie	5bb700360c	Readd an open paren that was lost while reformatting code. llvm-svn: 172669	2013-01-16 23:13:42 +00:00
David Blaikie	3302f2bd46	PR14964: intrinsic headers using non-reserved identifiers Several of the intrinsic headers were using plain non-reserved identifiers. C++11 17.6.4.3.2 [global.names] p1 reservers names containing a double begining with an underscore followed by an uppercase letter for any use. I think I got them all, but open to being corrected. For the most part I didn't bother updating function-like macro parameter names because I don't believe they're subject to any such collission - though some function-like macros already follow this convention (I didn't update them in part because the churn was more significant as several function-like macros use the double underscore prefixed version of the same name as a parameter in their implementation) llvm-svn: 172666	2013-01-16 23:08:36 +00:00
Benjamin Kramer	696651429d	unwind.h: Add include guards and don't mess with visibility if HIDE_EXPORTS is specified. For GCC compatibility. llvm-svn: 171991	2013-01-09 19:54:57 +00:00
Logan Chien	4d401b47d1	Code cleanup: Remove trailing whitespace in unwind.h. llvm-svn: 167915	2012-11-14 06:33:58 +00:00
Michael Liao	625a875f05	Add clang support of RTM from TSX - New options '-mrtm'/'-mno-rtm' are added to enable/disable RTM feature - Builtin macro '__RTM__' is defined if RTM feature is enabled - RTM intrinsic header is added and introduces 3 new intrinsics, namely '_xbegin', '_xend', and '_xabort'. - 3 new builtins are added to keep compatible with gcc, namely '__builtin_ia32_xbegin', '__builtin_ia32_xend', and '__builtin_ia32_xabort'. - Test cases for pre-defined macro and new intrinsic codegen are added. llvm-svn: 167665	2012-11-10 05:17:46 +00:00
Douglas Gregor	dc779abb8b	Split the instrinsic header wmmintrin.h into AES and PCLMUL parts, so that we can model them as separate submodules. llvm-svn: 167420	2012-11-05 23:30:26 +00:00
Douglas Gregor	10b4f2a20c	Fix module map for SSE4a builtins llvm-svn: 167399	2012-11-05 20:41:30 +00:00

1 2 3 4 5 ...

637 Commits